BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017049
(378 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 405
Score = 446 bits (1148), Expect = e-123, Method: Compositional matrix adjust.
Identities = 210/363 (57%), Positives = 264/363 (72%), Gaps = 6/363 (1%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCS 70
FP+ Y++V + +G PPK F FD DTGSDLTWVQCDAPC+GCT PP QYKP NI+PCS
Sbjct: 44 FPL-GYYSVLMQIGSPPKAFQFDIDTGSDLTWVQCDAPCSGCTLPPNLQYKPKGNIIPCS 102
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
NP C ALHWPN P C +P +QCDYE++Y D GSS+GALVTD FPL+ NGS P+ FG
Sbjct: 103 NPICTALHWPNKPHCPNPQEQCDYEVKYADQGSSMGALVTDQFPLKLVNGSFMQPPVAFG 162
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CGY+Q P PP TAGVLGLGRG+I +++QL GL RNV+GHC+ G G LF GD
Sbjct: 163 CGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSKGGGFLFFGDN 222
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
VPS GVAWTP+L HY GPA+LL++GK GLK L LIFD+G+SY YF S+ YQ
Sbjct: 223 LVPSIGVAWTPLLSQD---NHYTTGPADLLFNGKPTGLKGLKLIFDTGSSYTYFNSKAYQ 279
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRL 308
I++LI DL +PLK+A +DKTLPICW+G PFK++ +V +FK + ++FTN R + +L
Sbjct: 280 TIINLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLEVKNFFKTITINFTNGRRNTQL 339
Query: 309 VVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+ PE YL++S NVCLG+LNGSE + +N+IG+I MQ M+IYDNEKQ++GW DC
Sbjct: 340 YLAPELYLIVSKTGNVCLGLLNGSEVGLQNSNVIGDISMQGLMMIYDNEKQQLGWVSSDC 399
Query: 369 NTL 371
N L
Sbjct: 400 NKL 402
>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
Length = 410
Score = 433 bits (1114), Expect = e-119, Method: Compositional matrix adjust.
Identities = 206/363 (56%), Positives = 262/363 (72%), Gaps = 6/363 (1%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCS 70
FP+ Y++V L +G PPK F+FD DTGSD+TWVQCDAPCTGC PP+ QYKP N VPCS
Sbjct: 49 FPL-GYYSVLLQIGNPPKAFEFDIDTGSDITWVQCDAPCTGCNLPPKLQYKPKGNTVPCS 107
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+P C ALH+PN P+C +P +QCDYE+ Y D GSS+GALV D FP + NGS L FG
Sbjct: 108 DPICLALHFPNNPQCPNPKEQCDYEVNYADQGSSMGALVIDQFPFKLLNGSAMQPRLAFG 167
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CGY+Q P PP TAGVLGLGRG+I +++QL GL RNV+GHC+ G G LF GD
Sbjct: 168 CGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSKGGGYLFFGDT 227
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
+PS GVAWTP+L HY GPAELL++GK GLK L LIFD+G+SY YF S+ YQ
Sbjct: 228 LIPSLGVAWTPLLPPD---NHYTTGPAELLFNGKPTGLKGLKLIFDTGSSYTYFNSKTYQ 284
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRL 308
IV+LI DL +PLK+A +DKTLPICW+G PFK++ +V +FK + ++FTN R + +L
Sbjct: 285 TIVNLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLEVKNFFKTITINFTNARRNTQL 344
Query: 309 VVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+PPE+YL+IS N CLG+LNGSE + +N+IG+I MQ ++IYDNEKQ++GW +C
Sbjct: 345 QIPPESYLIISKTGNACLGLLNGSEVGLQNSNVIGDISMQGLLIIYDNEKQQLGWVSSNC 404
Query: 369 NTL 371
N L
Sbjct: 405 NKL 407
>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
Length = 410
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 207/358 (57%), Positives = 263/358 (73%), Gaps = 5/358 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y++V L +G PPK FDFD DTGSDLTWVQCDAPC GCTKP +K YKP N+VPCSN C
Sbjct: 53 YYSVILNIGNPPKAFDFDIDTGSDLTWVQCDAPCKGCTKPRDKLYKPKNNLVPCSNSLCQ 112
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
A+ C P+DQCDYEIEY D GSSIG L++D FPLR SNG++ + FGCGY+Q
Sbjct: 113 AVSTGENYHCDAPDDQCDYEIEYADLGSSIGVLLSDSFPLRLSNGTLLQPKMAFGCGYDQ 172
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
+ GP PPDTAG+LGLGRG++SI+SQLR G+ +NV+GHC + G LF GD PSS
Sbjct: 173 KHLGPHPPPDTAGILGLGRGKVSILSQLRTLGITQNVVGHCFSRARGGFLFFGDHLFPSS 232
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
+ WTPML++S+D Y GPAELL+ GK G+K L LIFDSG+SY YF ++VYQ I++L
Sbjct: 233 RITWTPMLRSSSD-TLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQSILNL 291
Query: 256 IMRDLIGTPLKLAPDDKTLPICWR--GPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ +DL G PLK AP +K L +CW+ P K++ + YFKPL +SF N +N V+L + PE
Sbjct: 292 VRKDLAGKPLKDAP-EKELAVCWKTAKPIKSILDIKSYFKPLTISFMNAKN-VQLQLAPE 349
Query: 314 AYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
YL+I+ NVCLGILNGSE ++G N+IG+IFMQD++VIYDNEKQ+IGW P +C+ L
Sbjct: 350 DYLIITKDGNVCLGILNGSEQQLGNFNVIGDIFMQDRVVIYDNEKQQIGWFPANCDRL 407
>gi|255558640|ref|XP_002520345.1| nucellin, putative [Ricinus communis]
gi|223540564|gb|EEF42131.1| nucellin, putative [Ricinus communis]
Length = 424
Score = 420 bits (1080), Expect = e-115, Method: Compositional matrix adjust.
Identities = 199/358 (55%), Positives = 257/358 (71%), Gaps = 4/358 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y++V+L +G PPKLF+ D DTGSDLTWVQCDAPCTGCTKP YKP N++ C +P C+
Sbjct: 66 YYSVSLYIGNPPKLFELDIDTGSDLTWVQCDAPCTGCTKPLHHLYKPRNNLLSCIDPLCS 125
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
A+ +C+ DQCDYEI+Y D GSS+G LVTD FPLR NGS +TFGCGY+Q
Sbjct: 126 AVQNSGTYQCQSATDQCDYEIQYADEGSSLGVLVTDYFPLRLMNGSFLRPKMTFGCGYDQ 185
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
+PGP++PP T GVLGLG G+ SI+SQL+ G++ NVIGHC+ + G G LF G VPS
Sbjct: 186 KSPGPVAPPPTTGVLGLGNGKTSIISQLQALGVMGNVIGHCLSRKGGGFLFFGQDPVPSF 245
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
G++W PM Q S D K+Y GPAELLY GK G K IFDSG+SY YF ++VYQ ++L
Sbjct: 246 GISWAPMSQKSLD-KYYASGPAELLYGGKPTGTKAEEFIFDSGSSYTYFNAQVYQSTLNL 304
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
I ++L G PL+ AP++K L ICW+G FK++ +V YFKP ALSFT + SV+L +PPE
Sbjct: 305 IRKELSGKPLRDAPEEKALAICWKGTKRFKSVNEVKSYFKPFALSFT-KAKSVQLQIPPE 363
Query: 314 AYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
YL+++ NVCLGILNGSE +G N+IG+ QDK+VIYD++K +IGW P +C+ L
Sbjct: 364 DYLIVTNDGNVCLGILNGSEVGLGNFNVIGDNLFQDKLVIYDSDKHQIGWIPANCDRL 421
>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 466
Score = 416 bits (1070), Expect = e-114, Method: Compositional matrix adjust.
Identities = 203/365 (55%), Positives = 258/365 (70%), Gaps = 3/365 (0%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V L +G PPKLFD D DTGSDLTWVQCDAPC GCTKP KQYKP+ N +PCS+
Sbjct: 64 LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQYKPNHNTLPCSHIL 123
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C+ L P C P DQCDYEI Y D SSIGALVTD PL+ +NGS+ N+ LTFGCGY
Sbjct: 124 CSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGY 183
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q NPGP PP TAG+LGLGRG++ + +QL+ G+ +NVI HC+ G+G L +GD VP
Sbjct: 184 DQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVP 243
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SSGV WT + NS K+Y+ GPAELL++ K+ G+K + ++FDSG+SY YF + YQ I+
Sbjct: 244 SSGVTWTSLATNSPS-KNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAIL 302
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
LI +DL G PL DDK+LP+CW+G P K+L +V +YFK + L F N++N VP
Sbjct: 303 DLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVP 362
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
PE+YL+I+ + VCLGILNG+E + NIIG+I Q MVIYDNEKQRIGW DC+ L
Sbjct: 363 PESYLIITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISSDCDKL 422
Query: 372 LSLNH 376
++NH
Sbjct: 423 PNVNH 427
>gi|356509401|ref|XP_003523438.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 407
Score = 416 bits (1069), Expect = e-114, Method: Compositional matrix adjust.
Identities = 205/360 (56%), Positives = 262/360 (72%), Gaps = 6/360 (1%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y++VNL +G PPK ++ D DTGSDLTWVQCDAPC GCT P ++QYKPH N+V C +P
Sbjct: 45 LGYYSVNLAIGNPPKAYELDIDTGSDLTWVQCDAPCKGCTLPRDRQYKPHGNLVKCVDPL 104
Query: 74 CAALH-WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
CAA+ PNPP C +PN+QCDYE+EY D GSS+G LV D+ PL+ +NG++ + L FGCG
Sbjct: 105 CAAIQSAPNPP-CVNPNEQCDYEVEYADQGSSLGVLVRDIIPLKLTNGTLTHSMLAFGCG 163
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
Y+Q + G PP AGVLGLG GR SI+SQL GLIRNV+GHC+ G G LF GD +
Sbjct: 164 YDQTHVGHNPPPSAAGVLGLGNGRASILSQLNSKGLIRNVVGHCLSGTGGGFLFFGDQLI 223
Query: 193 PSSGVAWTPMLQNSAD-LKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQE 251
P SGV WTP+LQ+S+ LKHY GPA++ ++GK+ +K L L FDSG+SY YF S ++
Sbjct: 224 PQSGVVWTPILQSSSSLLKHYKTGPADMFFNGKATSVKGLELTFDSGSSYTYFNSLAHKA 283
Query: 252 IVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLV 309
+V LI D+ G PL A +D +LPICW+G PFK+L VT FKPL LSFT +NS+
Sbjct: 284 LVDLITNDIKGKPLSRATEDPSLPICWKGPKPFKSLHDVTSNFKPLVLSFTKSKNSL-FQ 342
Query: 310 VPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
VPPEAYL+++ NVCLGIL+G+E +G NIIG+I +QDK+VIYDNEKQRIGW +C+
Sbjct: 343 VPPEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDISLQDKLVIYDNEKQRIGWASANCD 402
>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 452
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 199/358 (55%), Positives = 257/358 (71%), Gaps = 4/358 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
++ V+L +G PPKL+D D D+GSDLTWVQCDAPC GCTKP ++ YKP+ N+V C + C+
Sbjct: 63 HYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKGCTKPRDQLYKPNHNLVQCVDQLCS 122
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+ C P+DQCDYE+EY D GSS+G LV D P +F+NGSV + FGCGY+Q
Sbjct: 123 EVQLSMEYTCASPDDQCDYEVEYADHGSSLGVLVRDYIPFQFTNGSVVRPRVAFGCGYDQ 182
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
G SPP T+GVLGLG GR SI+SQL GLI NV+GHC+ G G LF GD +PSS
Sbjct: 183 KYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIHNVVGHCLSARGGGFLFFGDDFIPSS 242
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
G+ WT ML +S++ KHY GPAEL+++GK+ +K L LIFDSG+SY YF S+ YQ +V L
Sbjct: 243 GIVWTSMLPSSSE-KHYSSGPAELVFNGKATVVKGLELIFDSGSSYTYFNSQAYQAVVDL 301
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ +DL G LK A DD +LPICW+G FK+L V +YFKPLALSFT + +++ +PPE
Sbjct: 302 VTQDLKGKQLKRATDDPSLPICWKGAKSFKSLSDVKKYFKPLALSFT-KTKILQMHLPPE 360
Query: 314 AYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
AYL+I+ NVCLGIL+G+E + NIIG+I +QDKMVIYDNEKQ+IGW +C+ L
Sbjct: 361 AYLIITKHGNVCLGILDGTEVGLENLNIIGDISLQDKMVIYDNEKQQIGWVSSNCDRL 418
>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
Length = 467
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 202/366 (55%), Positives = 258/366 (70%), Gaps = 3/366 (0%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V L +G PPKLFD D DTGSDLTWVQCDAPC GCTKP KQYKP+ N +PCS+
Sbjct: 65 LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQYKPNHNTLPCSHLL 124
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C+ L C P DQCDYEI Y D SSIGALVTD FPL+ +NGS+ N LTFGCGY
Sbjct: 125 CSGLDLTQNRPCDDPEDQCDYEIGYSDHASSIGALVTDEFPLKLANGSIMNPHLTFGCGY 184
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q NPGP PP TAG+LGLGRG++ I +QL+ G+ +NVI HC+ G+G L +GD VP
Sbjct: 185 DQQNPGPHPPPPTAGILGLGRGKVGISTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVP 244
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SSGV WT + NSA K+Y+ GPAELL++ K+ G+K + ++FDSG+SY YF + YQ I+
Sbjct: 245 SSGVTWTSLATNSAS-KNYMTGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAIL 303
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
LI +DL G PL DDK+LP+CW+G P K+L +V +YFK + L F ++N VP
Sbjct: 304 DLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGYQKNGQLFQVP 363
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
PE+YL+I+ + NVCLGILNG+E + NI+G+I Q MVIYDNEKQRIGW DC+ +
Sbjct: 364 PESYLIITEKGNVCLGILNGTEVGLDSYNIVGDISFQGIMVIYDNEKQRIGWISSDCDKI 423
Query: 372 LSLNHF 377
++N +
Sbjct: 424 PNVNDY 429
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 197/358 (55%), Positives = 259/358 (72%), Gaps = 7/358 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
+++V L +G PPK FD D DTGSDLTWVQCDAPC GCTKP +K YKP N VPC++ C
Sbjct: 67 HYSVILNIGNPPKAFDLDIDTGSDLTWVQCDAPCKGCTKPLDKLYKPKNNRVPCASSLCQ 126
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
A+ N C P +QCDYE+EY D GSS+G L++D FPLR +NGS+ + FGCGY+Q
Sbjct: 127 AIQNNN---CDIPTEQCDYEVEYADLGSSLGVLLSDYFPLRLNNGSLLQPRIAFGCGYDQ 183
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
GP SPPDTAG+LGLGRG+ SI+SQLR G+ +NV+GHC + G LF GD +P S
Sbjct: 184 KYLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFSRVTGGFLFFGDHLLPPS 243
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
G+ WTPML++S+D Y GPAELL+ GK G+K L LIFDSG+SY YF ++VYQ I++L
Sbjct: 244 GITWTPMLRSSSD-TLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQSILNL 302
Query: 256 IMRDLIGTPLKLAPDDKTLPICWR--GPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ +DL G PLK AP++K L +CW+ P K++ + +FKPL ++F +N V+L + PE
Sbjct: 303 VRKDLSGMPLKDAPEEKALAVCWKTAKPIKSILDIKSFFKPLTINFIKAKN-VQLQLAPE 361
Query: 314 AYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
YL+I+ NVCLGILNG E +G N+IG+IFMQD++V+YDNE+Q+IGW P +CN L
Sbjct: 362 DYLIITKDGNVCLGILNGGEQGLGNLNVIGDIFMQDRVVVYDNERQQIGWFPTNCNRL 419
>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 432
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 201/360 (55%), Positives = 254/360 (70%), Gaps = 3/360 (0%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V L +G PPKLFD D DTGSDLTWVQCDAPC GCTKP KQYKP+ N +PCS+
Sbjct: 64 LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQYKPNHNTLPCSHIL 123
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C+ L P C P DQCDYEI Y D SSIGALVTD PL+ +NGS+ N+ LTFGCGY
Sbjct: 124 CSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGY 183
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q NPGP PP TAG+LGLGRG++ + +QL+ G+ +NVI HC+ G+G L +GD VP
Sbjct: 184 DQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVP 243
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SSGV WT + NS K+Y+ GPAELL++ K+ G+K + ++FDSG+SY YF + YQ I+
Sbjct: 244 SSGVTWTSLATNSPS-KNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAIL 302
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
LI +DL G PL DDK+LP+CW+G P K+L +V +YFK + L F N++N VP
Sbjct: 303 DLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVP 362
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
PE+YL+I+ + VCLGILNG+E + NIIG+I Q MVIYDNEKQRIGW DC+ L
Sbjct: 363 PESYLIITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISSDCDKL 422
>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 410 bits (1054), Expect = e-112, Method: Compositional matrix adjust.
Identities = 197/358 (55%), Positives = 253/358 (70%), Gaps = 4/358 (1%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V+L +G PPK++D D DTGSDLTWVQCDAPC GCT P + YKP+ N+V C +P
Sbjct: 61 LGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCQGCTIPRNRLYKPNGNLVKCGDPL 120
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C A+ C PN+QCDYE+EY D GSS+G L+ D PL+F+NGS+ L FGCGY
Sbjct: 121 CKAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNGSLARPILAFGCGY 180
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q + G TAGVLGLG G+ SI+SQL GLIRNV+GHC+ + G G LF GD VP
Sbjct: 181 DQKHVGHNPSASTAGVLGLGNGKTSILSQLHSLGLIRNVVGHCLSERGGGFLFFGDQLVP 240
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SGV WTP+LQ+S+ +HY GPA+L + K +K L LIFDSG+SY YF S+ ++ +V
Sbjct: 241 QSGVVWTPLLQSSS-TQHYKTGPADLFFDRKPTSVKGLQLIFDSGSSYTYFNSKAHKALV 299
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
+L+ DL G PL A +D +LPICWRG PFK+L VT FKPL LSFT +NS+ L +P
Sbjct: 300 NLVTNDLRGKPLSRATEDSSLPICWRGPKPFKSLHDVTSNFKPLLLSFTKSKNSL-LQLP 358
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
PEAYL+++ NVCLGIL+G+E +G NIIG+I +QDK+VIYDNEKQ+IGW +C+
Sbjct: 359 PEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDISLQDKLVIYDNEKQQIGWASANCD 416
>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 454
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 198/360 (55%), Positives = 256/360 (71%), Gaps = 4/360 (1%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
++ V+L +G PPKL+D D D+GSDLTWVQCDAPC GCTKP ++ YKP+ N+V C +
Sbjct: 61 LGHYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKGCTKPRDQLYKPNHNLVQCVDQL 120
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C+ +H C P+D CDYE+EY D GSS+G LV D P +F+NGSV + FGCGY
Sbjct: 121 CSEVHLSMAYNCPSPDDPCDYEVEYADHGSSLGVLVRDYIPFQFTNGSVVRPRVAFGCGY 180
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q G SPP T+GVLGLG GR SI+SQL GLIRNV+GHC+ G G LF GD +P
Sbjct: 181 DQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIRNVVGHCLSAQGGGFLFFGDDFIP 240
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SSG+ WT ML +S+ KHY GPAEL+++GK+ +K L LIFDSG+SY YF S+ YQ +V
Sbjct: 241 SSGIVWTSMLSSSS-EKHYSSGPAELVFNGKATAVKGLELIFDSGSSYTYFNSQAYQAVV 299
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
L+ +DL G LK A DD +LPICW+G F++L V +YFKPLALSF N +++ +P
Sbjct: 300 DLVTKDLKGKQLKRATDDPSLPICWKGAKSFESLSDVKKYFKPLALSFKKSXN-LQMHLP 358
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
PE+YL+I+ NVCLGIL+G+E + NIIG+I +QDKMVIYDNEKQ+IGW +C+ L
Sbjct: 359 PESYLIITKHGNVCLGILDGTEVGLENLNIIGDITLQDKMVIYDNEKQQIGWVSSNCDRL 418
>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 400 bits (1028), Expect = e-109, Method: Compositional matrix adjust.
Identities = 200/358 (55%), Positives = 254/358 (70%), Gaps = 4/358 (1%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V+L +G PPK++D D DTGSDLTWVQCDAPC GCT P + YKPH ++V C +P
Sbjct: 61 LGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCKGCTLPRNRLYKPHGDLVKCVDPL 120
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
CAA+ C PN+QCDYE+EY D GSS+G L+ D PL+F+NGS+ L FGCGY
Sbjct: 121 CAAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNGSLARPMLAFGCGY 180
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q + G PP TAGVLGLG GR SI+SQL GLIRNV+GHC+ G G LF GD +P
Sbjct: 181 DQTHHGQNPPPSTAGVLGLGNGRTSILSQLHSLGLIRNVVGHCLSGRGGGFLFFGDQLIP 240
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SGV WTP+LQ+S+ +HY GPA+L + K+ +K L LIFDSG+SY YF S+ ++ +V
Sbjct: 241 PSGVVWTPLLQSSS-AQHYKTGPADLFFDRKTTSVKGLELIFDSGSSYTYFNSQAHKALV 299
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
+LI DL G PL A D +LPICW+G PFK+L VT FKPL LSFT +NS L +P
Sbjct: 300 NLIANDLRGKPLSRATGDPSLPICWKGPKPFKSLHDVTSNFKPLLLSFTKSKNS-PLQLP 358
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
PEAYL+++ NVCLGIL+G+E +G NIIG+I +QDK+VIYDNEKQ+IGW +C+
Sbjct: 359 PEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDISLQDKLVIYDNEKQQIGWASANCD 416
>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 400 bits (1028), Expect = e-109, Method: Compositional matrix adjust.
Identities = 193/356 (54%), Positives = 253/356 (71%), Gaps = 5/356 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y++V++ +GK + F+FD D+GSDLTWVQCDAPCT CTKP E+ YKP+ N + C P C
Sbjct: 54 YYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLYKPNNNALNCFEPLCT 113
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+LH CK +DQC YEIEY D GSS+G LV D PL+ +NGS+ + FGCGY+
Sbjct: 114 SLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTNGSLAAPRIAFGCGYDH 173
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
P S P TAGVLGLG G +S +SQL G++RNV+GHC+ G G LF GD VPSS
Sbjct: 174 KYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSDEG-GFLFFGDEFVPSS 232
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
GV WT M S +Y GPAE+ +SGK+ G+KDLTL+FDSG+SY YF S+ Y I++L
Sbjct: 233 GVTWTSMSHESIG-SYYSSGPAEVYFSGKATGIKDLTLVFDSGSSYTYFNSQAYNSILAL 291
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ +L G PL+ AP+DK+LP+CW+G PFK+L V +YF PLAL FT +N+ ++ +PPE
Sbjct: 292 VKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNPLALRFTKTKNA-QIQLPPE 350
Query: 314 AYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
YL+I+ NVC GILNG+E +G+ NIIG+I ++DKMVIYDNE++RIGW P +CN
Sbjct: 351 NYLIITKYGNVCFGILNGTEVGLGDLNIIGDISLKDKMVIYDNERRRIGWFPTNCN 406
>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
Length = 427
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 198/360 (55%), Positives = 251/360 (69%), Gaps = 8/360 (2%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V L +G PPKLFD D DTGSDLTWVQCDAPC GCTK YKP+ N +PCS+
Sbjct: 64 LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTK-----YKPNHNTLPCSHIL 118
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C+ L P C P DQCDYEI Y D SSIGALVTD PL+ +NGS+ N+ LTFGCGY
Sbjct: 119 CSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGY 178
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q NPGP PP TAG+LGLGRG++ + +QL+ G+ +NVI HC+ G+G L +GD VP
Sbjct: 179 DQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVP 238
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SSGV WT + NS K+Y+ GPAELL++ K+ G+K + ++FDSG+SY YF + YQ I+
Sbjct: 239 SSGVTWTSLATNSPS-KNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAIL 297
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
LI +DL G PL DDK+LP+CW+G P K+L +V +YFK + L F N++N VP
Sbjct: 298 DLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVP 357
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
PE+YL+I+ + VCLGILNG+E + NIIG+I Q MVIYDNEKQRIGW DC+ L
Sbjct: 358 PESYLIITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISSDCDKL 417
>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 191/356 (53%), Positives = 251/356 (70%), Gaps = 5/356 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y++V++ +GK + F+FD D+GSDLTWVQCDAPCT CTKP E+ YKP+ N + C P C
Sbjct: 54 YYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLYKPNNNALNCFEPLCT 113
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+LH CK +DQC YEIEY D GSS+G LV D PL+ +NGS+ + FGCGY+
Sbjct: 114 SLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTNGSLAAPRIAFGCGYDH 173
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
P S P TAGVLGLG G +S +SQL G++RNV+GHC+ G G LF GD VPSS
Sbjct: 174 KYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSDEG-GFLFFGDEFVPSS 232
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
GV WT M S +Y GPAE+ + GK+ G+KDLTL+FDSG+SY YF S+ Y I++L
Sbjct: 233 GVTWTSMSHESIG-SYYSSGPAEVYFGGKATGIKDLTLVFDSGSSYTYFNSQAYNSILAL 291
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ +L G PL+ AP+DK+LP+CW+G PFK+L V +YF LAL FT +N+ ++ +PPE
Sbjct: 292 VKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNLLALRFTKTKNA-QIQLPPE 350
Query: 314 AYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
YL+I+ NVC GILNG+E +G+ NIIG+I ++DKMVIYDNE++RIGW P +CN
Sbjct: 351 NYLIITKYGNVCFGILNGTEVGLGDLNIIGDISLKDKMVIYDNERRRIGWFPTNCN 406
>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 453
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 192/358 (53%), Positives = 261/358 (72%), Gaps = 6/358 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
+++V+L +G PPK + D D+GSDLTW+QCDAPC CTK P YKP+K + C++P C+
Sbjct: 67 FYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPPYKPNKGPITCNDPMCS 126
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
ALHWP+ P CK ++QCDYE+ Y D GSS+G LV D+F L+ +NG++ L FGCGY+Q
Sbjct: 127 ALHWPSKPPCKASHEQCDYEVSYADHGSSLGVLVHDIFSLQLTNGTLAAPRLAFGCGYDQ 186
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
PGP +PP GVLGLG G+ SIV+QLR GLIR+++GHC+ G G LFLGDG +
Sbjct: 187 SYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGGGFLFLGDGLSTTP 246
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
G+ WTPM + S + Y LGPA+LL++G++ G+K L L+FDSG+SY YF ++ Y+ +SL
Sbjct: 247 GIIWTPMSRKSGE-SAYALGPADLLFNGQNSGVKGLRLVFDSGSSYTYFNAQAYKTTLSL 305
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ + L G + A D++LP+CWRG PFK++ +V YFKP ALSFT + S +L +PPE
Sbjct: 306 VRKYLNGKLKETA--DESLPVCWRGAKPFKSIFEVKNYFKPFALSFT-KAKSAQLQLPPE 362
Query: 314 AYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
+YL+IS N CLGILNGSE +G++N+IG+I QDKMVIYDNE+Q+IGW P+DCN L
Sbjct: 363 SYLIISKHGNACLGILNGSEVGLGDSNVIGDIAFQDKMVIYDNERQQIGWVPKDCNKL 420
>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 410
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 193/358 (53%), Positives = 250/358 (69%), Gaps = 6/358 (1%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
+F V++T+G PPK+F+ D DTGSDLTWVQCDAPCTGCT P ++ YKPH N+V C P
Sbjct: 52 LGHFTVSVTIGNPPKVFELDIDTGSDLTWVQCDAPCTGCTLPHDRLYKPHNNVVRCGEPL 111
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C+AL + CK+PNDQCDYE+EY D GSSIG LV D PLR +NG++ L FGCGY
Sbjct: 112 CSALFSASKSPCKNPNDQCDYEVEYADHGSSIGVLVKDPVPLRLTNGTILAPNLGFGCGY 171
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+QHN G PP TAGVLGLG + ++ +QL +RNV+GHC G G LF G VP
Sbjct: 172 DQHNGGSQLPPLTAGVLGLGNSKATMATQLSALSHVRNVLGHCFSGQGGGFLFFGGDLVP 231
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SSG++W P+L+ Y GPAE+ + G G++ L L FDSG+SY YF S+VY ++
Sbjct: 232 SSGMSWMPILRTPGG--KYSAGPAEVYFGGNPVGIRGLILTFDSGSSYTYFNSQVYGAVL 289
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
+L+ L G PL+ AP+DKTLPICW+G FK++ V +FKPLALSF N + V+ +P
Sbjct: 290 NLLRNGLKGQPLRDAPEDKTLPICWKGSKAFKSVADVRNFFKPLALSFGNSK--VQFQIP 347
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
PEAYL+IS NVCLGILNGS+ +G N+IG+I M DKM++YDNE+Q+IGW P +C+
Sbjct: 348 PEAYLIISNLGNVCLGILNGSQVGLGNVNLIGDISMLDKMMVYDNERQQIGWAPANCS 405
>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 192/358 (53%), Positives = 261/358 (72%), Gaps = 6/358 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
+++V+L +G PPK + D D+GSDLTW+QCDAPC CTK P YKP+K + C++P C+
Sbjct: 34 FYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPPYKPNKGPITCNDPMCS 93
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
ALHWP+ P CK ++QCDYE+ Y D GSS+G LV D+F L+ +NG++ L FGCGY+Q
Sbjct: 94 ALHWPSKPPCKASHEQCDYEVSYADHGSSLGVLVHDIFSLQLTNGTLAAPRLAFGCGYDQ 153
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
PGP +PP GVLGLG G+ SIV+QLR GLIR+++GHC+ G G LFLGDG +
Sbjct: 154 SYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGGGFLFLGDGLSTTP 213
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
G+ WTPM + S + Y LGPA+LL++G++ G+K L L+FDSG+SY YF ++ Y+ +SL
Sbjct: 214 GIIWTPMSRKSGE-SAYALGPADLLFNGQNSGVKGLRLVFDSGSSYTYFNAQAYKTTLSL 272
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ + L G + A D++LP+CWRG PFK++ +V YFKP ALSFT + S +L +PPE
Sbjct: 273 VRKYLNGKLKETA--DESLPVCWRGAKPFKSIFEVKNYFKPFALSFT-KAKSAQLQLPPE 329
Query: 314 AYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
+YL+IS N CLGILNGSE +G++N+IG+I QDKMVIYDNE+Q+IGW P+DCN L
Sbjct: 330 SYLIISKHGNACLGILNGSEVGLGDSNVIGDIAFQDKMVIYDNERQQIGWVPKDCNKL 387
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 190/368 (51%), Positives = 251/368 (68%), Gaps = 12/368 (3%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V + +G+PP+ + D DTGSDLTW+QCDAPC C + P Y+P +++PC++P
Sbjct: 57 LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPL 116
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C ALH + RC+ P +QCDYE+EY DGGSS+G LV D+F + ++ G L GCGY
Sbjct: 117 CKALHLNSNQRCETP-EQCDYEVEYADGGSSLGVLVRDVFSMNYTKGLRLTPRLALGCGY 175
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q PG S GVLGLGRG++SI+SQL G ++NVIGHC+ G G+LF GD
Sbjct: 176 DQ-IPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYD 234
Query: 194 SSGVAWTPMLQNSADLKHYILGPA---ELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
SS V+WTPM + + KHY PA ELL+ G++ GLK+L +FDSG+SY YF S+ YQ
Sbjct: 235 SSRVSWTPMSREYS--KHY--SPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQ 290
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVR 307
+ L+ R+L G PLK A DD TLP+CW+G PF ++ +V +YFKPLALSF T R+
Sbjct: 291 AVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTL 350
Query: 308 LVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPED 367
+PPEAYL+IS + NVCLGILNG+E + N+IG+I MQD+M+IYDNEKQ IGW P D
Sbjct: 351 FEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPAD 410
Query: 368 CNTLLSLN 375
C+ L SL
Sbjct: 411 CDELASLK 418
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 190/368 (51%), Positives = 251/368 (68%), Gaps = 12/368 (3%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V + +G+PP+ + D DTGSDLTW+QCDAPC C + P Y+P +++PC++P
Sbjct: 57 LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPL 116
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C ALH + RC+ P +QCDYE+EY DGGSS+G LV D+F + ++ G L GCGY
Sbjct: 117 CKALHLNSNQRCETP-EQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGY 175
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q PG S GVLGLGRG++SI+SQL G ++NVIGHC+ G G+LF GD
Sbjct: 176 DQ-IPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYD 234
Query: 194 SSGVAWTPMLQNSADLKHYILGPA---ELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
SS V+WTPM + + KHY PA ELL+ G++ GLK+L +FDSG+SY YF S+ YQ
Sbjct: 235 SSRVSWTPMSREYS--KHY--SPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQ 290
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVR 307
+ L+ R+L G PLK A DD TLP+CW+G PF ++ +V +YFKPLALSF T R+
Sbjct: 291 AVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTL 350
Query: 308 LVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPED 367
+PPEAYL+IS + NVCLGILNG+E + N+IG+I MQD+M+IYDNEKQ IGW P D
Sbjct: 351 FEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPVD 410
Query: 368 CNTLLSLN 375
C+ L SL
Sbjct: 411 CDELASLK 418
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 190/368 (51%), Positives = 251/368 (68%), Gaps = 12/368 (3%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V + +G+PP+ + D DTGSDLTW+QCDAPC C + P Y+P +++PC++P
Sbjct: 45 LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPL 104
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C ALH + RC+ P +QCDYE+EY DGGSS+G LV D+F + ++ G L GCGY
Sbjct: 105 CKALHLNSNQRCETP-EQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGY 163
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q PG S GVLGLGRG++SI+SQL G ++NVIGHC+ G G+LF GD
Sbjct: 164 DQ-IPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYD 222
Query: 194 SSGVAWTPMLQNSADLKHYILGPA---ELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
SS V+WTPM + + KHY PA ELL+ G++ GLK+L +FDSG+SY YF S+ YQ
Sbjct: 223 SSRVSWTPMSREYS--KHY--SPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQ 278
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVR 307
+ L+ R+L G PLK A DD TLP+CW+G PF ++ +V +YFKPLALSF T R+
Sbjct: 279 AVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTL 338
Query: 308 LVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPED 367
+PPEAYL+IS + NVCLGILNG+E + N+IG+I MQD+M+IYDNEKQ IGW P D
Sbjct: 339 FEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPVD 398
Query: 368 CNTLLSLN 375
C+ L SL
Sbjct: 399 CDELASLK 406
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 187/368 (50%), Positives = 251/368 (68%), Gaps = 12/368 (3%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V + +G+PP+ + D DTGSDLTW+QCDAPC C + P Y+P +++PC++P
Sbjct: 54 LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVHCLEAPHPLYQPSNDLIPCNDPL 113
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C ALH+ RC+ P +QCDYE+EY DGGSS+G LV D+F L ++ G L GCGY
Sbjct: 114 CKALHFNGNHRCETP-EQCDYEVEYADGGSSLGVLVRDVFSLNYTKGLRLTPRLALGCGY 172
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q PG GVLGLGRG++SI+SQL G ++NV+GHC+ G G+LF G+
Sbjct: 173 DQ-IPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVVGHCLSSLGGGILFFGNDLYD 231
Query: 194 SSGVAWTPMLQNSADLKHYILGPA---ELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
SS V+WTPM + ++ KHY PA ELL+ G++ GLK+L +FDSG+SY YF S+ YQ
Sbjct: 232 SSRVSWTPMARENS--KHY--SPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQ 287
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVR 307
+ L+ R+L G PLK A DD TLP+CW+G PF ++ +V +YFKPLALSF T R+
Sbjct: 288 AVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTL 347
Query: 308 LVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPED 367
+PPEAYL+IS + NVCLGILNG+E + N+IG+I MQD+M+IYDNEKQ IGW P D
Sbjct: 348 FEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWIPAD 407
Query: 368 CNTLLSLN 375
C+ + SL
Sbjct: 408 CDEIASLK 415
>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
Length = 393
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 199/366 (54%), Positives = 248/366 (67%), Gaps = 9/366 (2%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y+ V L +G+P K + D DTGSDLTW+QCDAPC CT+ P Y+P N+VPC +P C
Sbjct: 33 YYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYYRPRNNLVPCMDPICQ 92
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+LH RC++P QCDYE+EY DGGSS G LVTD F L F++ + L GCGY+Q
Sbjct: 93 SLHSNGDHRCENPG-QCDYEVEYADGGSSFGVLVTDTFNLNFTSEKRHSPLLALGCGYDQ 151
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
G P D GVLGLG+G+ SIVSQL GL+RNVIGHC+ +G G LF GD SS
Sbjct: 152 FPGGSHHPID--GVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGHGGGFLFFGDDLYDSS 209
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
VAWTPM S D KHY G AEL + GK+ G K+L FDSGASY Y S+ YQ ++SL
Sbjct: 210 RVAWTPM---SPDAKHYSPGLAELTFDGKTTGFKNLLTTFDSGASYTYLNSQAYQGLISL 266
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNS-VRLVVPP 312
+ ++L G PL+ A DD+TLP+CW+G PFK++ V +YFK ALSFTN R S L PP
Sbjct: 267 LKKELSGKPLREALDDQTLPLCWKGRKPFKSIRDVKKYFKTFALSFTNERKSKTELEFPP 326
Query: 313 EAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTLL 372
EAYL+IS + N CLGILNG+E + + N+IG+I MQD++VIYDNEK+RIGW P +CN L
Sbjct: 327 EAYLIISSKGNACLGILNGTEVGLNDLNVIGDISMQDRVVIYDNEKERIGWAPGNCNRLP 386
Query: 373 SLNHFI 378
FI
Sbjct: 387 KSKSFI 392
>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
Length = 426
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 189/361 (52%), Positives = 248/361 (68%), Gaps = 9/361 (2%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V+L++G+PPK + D DTGSDL+W+QCDAPC CTK P Y+P+ N+V C +P
Sbjct: 64 LGYYYVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPCVRCTKAPHPLYRPNNNLVICKDPM 123
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
CA+LH P +C+HP +QCDYE+EY DGGSS+G LV D+FPL F+NG L GCGY
Sbjct: 124 CASLHPPG-YKCEHP-EQCDYEVEYADGGSSLGVLVKDVFPLNFTNGLRLAPRLALGCGY 181
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q P D GVLGLG+G+ SIVSQL G+IRNV+GHC+ G G LF GD
Sbjct: 182 DQIPGQSYHPLD--GVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSRGGGFLFFGDDLYD 239
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SS V WTPML++ HY G AEL+ GK+ K+L + FDSG+SY Y S YQ +V
Sbjct: 240 SSRVVWTPMLRDQH--THYSSGYAELILGGKTTVFKNLLVTFDSGSSYTYLNSLAYQALV 297
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVRLVV 310
L+ ++L P++ A DD+TLP+CWRG PFK++ V ++FKPLALSF R + +
Sbjct: 298 HLVRKELSEKPVREALDDQTLPLCWRGKRPFKSVRDVKKFFKPLALSFPGGGRTKTQYDI 357
Query: 311 PPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
P E+YL+IS + NVCLGILNG+EA + + N+IG+I MQDKMV+YDNEK +IGW P +C+
Sbjct: 358 PLESYLIISLKGNVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNEKNQIGWAPTNCDR 417
Query: 371 L 371
L
Sbjct: 418 L 418
>gi|449449755|ref|XP_004142630.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500674|ref|XP_004161165.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 413
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 189/356 (53%), Positives = 244/356 (68%), Gaps = 5/356 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
+F V L +G P K+F+ D DTGSDLTWVQCD C GCT P + Y+PH N V +P CA
Sbjct: 52 HFTVLLNIGNPSKVFELDIDTGSDLTWVQCDVECIGCTLPRDMLYRPHNNAVSREDPLCA 111
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
AL K+PNDQC YE+EY D GSS+G LV DL P+R +NG + L FGCGY+Q
Sbjct: 112 ALSSLGKFIFKNPNDQCAYEVEYADHGSSVGVLVKDLVPMRLTNGKRISPNLGFGCGYDQ 171
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
N PP AGVLGL + +IVSQL + G + NV+GHC+ G G LF G VPSS
Sbjct: 172 ENGDLQQPPSIAGVLGLSSSKATIVSQLSDLGHVSNVVGHCLTGRGGGFLFFGGDVVPSS 231
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
G++WTP+L+NS Y GPAE+ ++G++ G+ LTL FDSG+SY YF S+VY+ I L
Sbjct: 232 GMSWTPILRNSE--GKYSSGPAEVYFNGRAVGIGGLTLTFDSGSSYTYFNSQVYRAIEKL 289
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ DL G PLKLA DDKTL +CW+G PF+++ V +FKPLA+SF N +N V+ +PPE
Sbjct: 290 LKNDLKGNPLKLASDDKTLELCWKGPKPFESVVDVRNFFKPLAMSFKNSKN-VQFQIPPE 348
Query: 314 AYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
AYL+IS NVCLGIL+GS+ +G NIIG+I M +K+V+YDNE++RIGW +CN
Sbjct: 349 AYLIISEFGNVCLGILDGSKEGMGNVNIIGDISMLNKIVVYDNERERIGWASSNCN 404
>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
Length = 420
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 192/385 (49%), Positives = 252/385 (65%), Gaps = 29/385 (7%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V + +G+PP+ + D DTGSDLTW+QCDAPC C + P Y+P +++PC++P
Sbjct: 35 LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPL 94
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C ALH + RC+ P +QCDYE+EY DGGSS+G LV D+F + ++ G L GCGY
Sbjct: 95 CKALHLNSNQRCETP-EQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGY 153
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q PG S GVLGLGRG++SI+SQL G ++NVIGHC+ G G+LF GD
Sbjct: 154 DQ-IPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYD 212
Query: 194 SSGVAWTPMLQNSADLKHYILGPA---ELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
SS V+WTPM + + KHY PA ELL+ G++ GLK+L +FDSG+SY YF S+ YQ
Sbjct: 213 SSRVSWTPMSREYS--KHY--SPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQ 268
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVR 307
+ L+ R+L G PLK A DD TLP+CW+G PF ++ +V +YFKPLALSF T R+
Sbjct: 269 AVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTL 328
Query: 308 LVVPPEAYLVIS---------GR--------KNVCLGILNGSEAEVGENNIIGEIFMQDK 350
+PPEAYL+IS GR NVCLGILNG+E + N+IG+I MQD+
Sbjct: 329 FEIPPEAYLIISVWFSHTMLKGRFIKMLQMKGNVCLGILNGTEIGLQNLNLIGDISMQDQ 388
Query: 351 MVIYDNEKQRIGWKPEDCNTLLSLN 375
M+IYDNEKQ IGW P DC+ L SL
Sbjct: 389 MIIYDNEKQSIGWMPVDCDELASLK 413
>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
Length = 426
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 190/357 (53%), Positives = 245/357 (68%), Gaps = 9/357 (2%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y+ V +G+PPK + D DTGSDLTW+QCDAPC CT P Y+P ++V C +P CA
Sbjct: 66 YYHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHPLYQPTNDLVVCKDPICA 125
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+LH P+ RC P DQCDYE+EY DGGSSIG LV DLFP+ ++G LT GCGY+Q
Sbjct: 126 SLH-PDNYRCDDP-DQCDYEVEYADGGSSIGVLVNDLFPVNLTSGMRARPRLTIGCGYDQ 183
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
P D GVLGLGRG SIV+QL GL+RNV+GHC + G G LF GD SS
Sbjct: 184 LPGIAYHPLD--GVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSRRGGGYLFFGDDIYDSS 241
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
V WTPM ++ LKHY G AEL+ +G+S GLK+L ++FDSG+SY YF ++ YQ ++S
Sbjct: 242 KVIWTPMSRDY--LKHYTPGFAELILNGRSSGLKNLLVVFDSGSSYTYFNTQTYQTLLSF 299
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVRLVVPP 312
I +DL G PLK A +D TLP+CWRG PFK++ +YFKPLALSF + + + +
Sbjct: 300 IKKDLHGKPLKEAVEDDTLPVCWRGKKPFKSIRDAKKYFKPLALSFGSGWKTKSQFEIQQ 359
Query: 313 EAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
E+YL+IS + +VCLGILNG+E + NIIG+I MQ+K+VIYDNEKQ IGW+P +C+
Sbjct: 360 ESYLIISSKGSVCLGILNGTEVGLQNYNIIGDISMQEKLVIYDNEKQVIGWQPSNCD 416
>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 418
Score = 368 bits (944), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 184/359 (51%), Positives = 245/359 (68%), Gaps = 7/359 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
++ V L VG+PPK + D DTGSDLTW+QCDAPC CT+ Y+P ++VPC +P C
Sbjct: 56 FYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCM 115
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+LH RC++P DQCDYE+EY DGGSS+G LV D+FPL +NG L GCGY+Q
Sbjct: 116 SLHSSMDHRCENP-DQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQ 174
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
+PG S G+LGLGRG +SIVSQL G++RNV+GHC G G LF GDG
Sbjct: 175 -DPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGIYDPY 233
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
+ WTPM ++ KHY G EL+++G+S GL++L ++FDSG+SY YF ++ YQ + SL
Sbjct: 234 RLVWTPMSRDYP--KHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSL 291
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN-RRNSVRLVVPP 312
+ R+L G PL+ A DD TLP+CWRG P K+L V +YFKPLALSF++ R+ +P
Sbjct: 292 LNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEIPT 351
Query: 313 EAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
E Y++IS NVCLGILNG++ + +NIIG+I MQDKMV+Y+NEKQ IGW +C+ +
Sbjct: 352 EGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRV 410
>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
Length = 376
Score = 367 bits (941), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 196/360 (54%), Positives = 245/360 (68%), Gaps = 10/360 (2%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y+ V L +G+P K + D DTGSDLTW+QCDAPC CT+ P Y+P N+VPC +P C
Sbjct: 19 YYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYYRPRNNLVPCMDPICQ 78
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG-CGYN 134
+LH RC++P QCDYE+EY DGGSS G LV D F L F++ + L G CGY+
Sbjct: 79 SLHSNGDHRCENPG-QCDYEVEYADGGSSFGVLVRDTFNLNFTSEKRHSPLLALGLCGYD 137
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPS 194
Q G P D GVLGLG+G+ SIVSQL GL+RNVIGHC+ +G G LF GD S
Sbjct: 138 QFPGGSHHPID--GVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGHGGGFLFFGDDLYDS 195
Query: 195 SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVS 254
S VAWTPM S D KHY G AEL + GK+ G K+L FDSGASY Y S+ YQ ++S
Sbjct: 196 SRVAWTPM---SPDAKHYSPGLAELTFDGKTTGFKNLLTTFDSGASYTYLNSQAYQGLIS 252
Query: 255 LIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNS-VRLVVP 311
L+ ++L G PL+ A DD+TLP+CW+G PFK++ V +YFK ALSFTN R S L P
Sbjct: 253 LLKKELSGKPLREALDDQTLPLCWKGRKPFKSIRDVKKYFKTFALSFTNERKSKTELEFP 312
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
PEAYL+IS + N CLGILNG+E + + N+IG+I MQD++VIYDNEK+RIGW P +CN L
Sbjct: 313 PEAYLIISSKGNACLGILNGTEVGLNDLNVIGDISMQDRVVIYDNEKERIGWAPGNCNRL 372
>gi|147802609|emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
Length = 424
Score = 363 bits (933), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 187/361 (51%), Positives = 245/361 (67%), Gaps = 11/361 (3%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V+L++G+PP + D TGSDL+W+QCDAPC CTK Y+P+ N+V C +P
Sbjct: 64 LGYYYVSLSIGQPPXPYFLDPXTGSDLSWLQCDAPCVRCTKAXHXLYRPNNNLVICKDPM 123
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
CA LH P +C+HP +QCDYE+EY DGGSS+G LV D+FPL F+NG L GCGY
Sbjct: 124 CAXLHPPG-YKCEHP-EQCDYEVEYADGGSSLGVLVKDVFPLNFTNGLRLAPRLALGCGY 181
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q P D GVLGLG+G+ SIVSQL G+IRNV+GHC+ +G G LF GD
Sbjct: 182 DQIPGXSYHPLD--GVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSHGGGFLFFGDDLYD 239
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SS V WTPML++ HY G AEL+ GK+ K+L + FDSG+SY Y S YQ +V
Sbjct: 240 SSRVVWTPMLRDQH--THYSSGYAELILGGKTTVFKNLLVTFDSGSSYTYLNSLAYQALV 297
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT-NRRNSVRLVV 310
L+ ++L P++ A DD+TLP+CWRG PFK++ V ++FKPLALSF R + +
Sbjct: 298 HLVRKELSEKPVREALDDQTLPLCWRGKRPFKSVRDVRKFFKPLALSFAGGGRTKTQYDI 357
Query: 311 PPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
P E+YL+ISG NVCLGILNG+EA + + N+IG+I MQDKMV+YDNEK +IGW P +C+
Sbjct: 358 PLESYLIISG--NVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNEKNQIGWAPTNCDR 415
Query: 371 L 371
L
Sbjct: 416 L 416
>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 435
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 179/371 (48%), Positives = 253/371 (68%), Gaps = 15/371 (4%)
Query: 11 FPIFS------YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK 64
FPI+ ++ V L +G+PP+ + D DTGS+LTW+QCDAPC+ C++ P YKP
Sbjct: 62 FPIYGNVYPVGFYNVTLNIGQPPRPYFLDVDTGSELTWLQCDAPCSQCSETPHPLYKPSN 121
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+ +PC +P CA+L + C+ PN QCDYEI+Y D S++G L+ D++ L F+NG
Sbjct: 122 DFIPCKDPLCASLQPTDDYTCEDPN-QCDYEIKYADQYSTLGVLLNDVYLLNFTNGVQLK 180
Query: 125 VPLTFGCGYNQ-HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
V + GCGY+Q +P P D G+LGLGRG+ S++SQL GL+RNV+GHC+ G G
Sbjct: 181 VRMALGCGYDQIFSPSTYHPLD--GILGLGRGKASLISQLNSQGLVRNVMGHCLSSRGGG 238
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
+F G+ SS ++WTP+ + KHY GPAEL++ G+ G+ L +IFD+G+SY Y
Sbjct: 239 YIFFGN-VYDSSRMSWTPISSIDSG-KHYSAGPAELVFGGRKTGVGSLNIIFDTGSSYTY 296
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN 301
F S+ YQ ++SL+ ++L P+K APDD+TLP+CW G PF+++ +V +YFKPL LSFTN
Sbjct: 297 FNSQAYQAMISLLNKELHRKPIKAAPDDQTLPMCWHGKRPFRSINEVKKYFKPLTLSFTN 356
Query: 302 -RRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
R + +PPEAYL+IS NVCLGILNG E +GE N+IG+I M DK++++DNEKQ
Sbjct: 357 GGRVKPQFEIPPEAYLIISNMGNVCLGILNGPEVGLGELNLIGDISMLDKVMVFDNEKQL 416
Query: 361 IGWKPEDCNTL 371
IGW P DCN++
Sbjct: 417 IGWGPADCNSV 427
>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 438
Score = 355 bits (912), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 176/360 (48%), Positives = 244/360 (67%), Gaps = 9/360 (2%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
++ V L +G+PP+ + D DTGSDLTW+QCDAPC+ C++ P Y+P + VPC + CA
Sbjct: 76 FYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHPLYRPSNDFVPCRHSLCA 135
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+LH + C+ P+ QCDYE++Y D SS+G L+ D++ L F+NG V + GCGY+Q
Sbjct: 136 SLHHSDNYDCEVPH-QCDYEVQYADHYSSLGVLLHDVYTLNFTNGVQLKVRMALGCGYDQ 194
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
P P P G+LGLGRG+ S+ SQL GL+RNVIGHC+ G G +F GD SS
Sbjct: 195 IFPDPSHHP-LDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQGGGYIFFGD-VYDSS 252
Query: 196 GVAWTPMLQNSADLKHY-ILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVS 254
+ WTPM +S D KHY G AELL+ GK G+ L +FD+G+SY YF YQ ++S
Sbjct: 253 RLTWTPM--SSRDYKHYSAAGAAELLFGGKKSGIGSLHAVFDTGSSYTYFNPYAYQALIS 310
Query: 255 LIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT-NRRNSVRLVVP 311
+ ++ G PLK A DD+TLP+CWRG PF+++ +V +YFKP+ LSFT N R+ + +P
Sbjct: 311 WLGKESGGKPLKEAHDDQTLPLCWRGRRPFRSIYEVRKYFKPIVLSFTSNGRSKAQFEMP 370
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
PEAYL+IS NVCLGILNGSE +G+ N+IG+I M +K++++DN+KQ IGW P DC+ +
Sbjct: 371 PEAYLIISNMGNVCLGILNGSEVGMGDLNLIGDISMLNKVMVFDNDKQLIGWTPADCDQV 430
>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 440
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 174/360 (48%), Positives = 246/360 (68%), Gaps = 9/360 (2%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
++ V L +G+PP+ + D DTGSDLTW+QCDAPC+ C++ P Y+P ++VPC + CA
Sbjct: 78 FYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHPLYRPSNDLVPCRHALCA 137
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+LH + C+ P+ QCDYE++Y D SS+G L+ D++ L F+NG V + GCGY+Q
Sbjct: 138 SLHLSDNYDCEVPH-QCDYEVQYADHYSSLGVLLHDVYTLNFTNGVQLKVRMALGCGYDQ 196
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
P P P G+LGLGRG+ S+ SQL GL+RNVIGHC+ G G +F GD S
Sbjct: 197 IFPDPSHHP-LDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQGGGYIFFGD-VYDSF 254
Query: 196 GVAWTPMLQNSADLKHY-ILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVS 254
+ WTPM +S D KHY + G AELL+ GK G+ +L +FD+G+SY YF S YQ ++S
Sbjct: 255 RLTWTPM--SSRDYKHYSVAGAAELLFGGKKSGVGNLHAVFDTGSSYTYFNSYAYQVLIS 312
Query: 255 LIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT-NRRNSVRLVVP 311
+ ++ G PLK A DD+TLP+CWRG PF+++ +V +YFKP+ LSFT N R+ + +
Sbjct: 313 WLKKESGGKPLKEAHDDQTLPLCWRGRRPFRSIYEVRKYFKPIVLSFTSNGRSKAQFEML 372
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
PEAYL++S NVCLGILNGSE +G+ N+IG+I M +K++++DN+KQ IGW P DC+ +
Sbjct: 373 PEAYLIVSNMGNVCLGILNGSEVGMGDLNLIGDISMLNKVMVFDNDKQLIGWAPADCDQV 432
>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Cucumis sativus]
Length = 418
Score = 350 bits (898), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 183/359 (50%), Positives = 244/359 (67%), Gaps = 7/359 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
++ V L VG+PPK + D DTGSDLTW+QCDAPC CT+ Y+P ++VPC +P C
Sbjct: 56 FYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCM 115
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+LH RC++P DQCDYE+EY DGGSS+G LV D+FPL +NG L GCGY+Q
Sbjct: 116 SLHSSMDHRCENP-DQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQ 174
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
+PG S G+LGLGRG +SIVSQL G++RNV+GHC G G F GDG
Sbjct: 175 -DPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYXFFGDGIYDPY 233
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
+ WTPM ++ KHY G EL+++G+S GL++L ++FDSG+SY YF ++ YQ + SL
Sbjct: 234 RLVWTPMSRDYP--KHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSL 291
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN-RRNSVRLVVPP 312
+ R+L G PL+ A DD TLP+CWRG P K+L V +YFKPLALSF++ R+ +P
Sbjct: 292 LNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEIPT 351
Query: 313 EAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
E Y++IS NVCLGILNG++ + +NIIG+I MQDKMV+Y+NEKQ IGW +C+ +
Sbjct: 352 EGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRV 410
>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 440
Score = 350 bits (898), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 175/364 (48%), Positives = 245/364 (67%), Gaps = 14/364 (3%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCS 70
+P+ ++ V + +G PP+ + D DTGSDLTW+QCDAPC+ C++ P Y+P ++VPC
Sbjct: 80 YPV-GFYNVTINIGYPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHPLYRPSNDLVPCR 138
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+P CA++H + C+ + QCDYE+EY D SS+G LV D++ L F+NG V + G
Sbjct: 139 HPLCASVHQTDNYECEVEH-QCDYEVEYADHYSSLGVLVNDVYVLNFTNGVQLKVRMALG 197
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CGY+Q P P G+LGLGRG+ S++SQL GL+RNV+GHC+ G G +F GD
Sbjct: 198 CGYDQIFPDSSYHP-VDGMLGLGRGKSSLISQLNGQGLVRNVVGHCLSAQGGGYIFFGD- 255
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
SS +AWTPM +S D KHY G AEL+ GK G +L +FD+G+SY YF S YQ
Sbjct: 256 VYDSSRLAWTPM--SSRDYKHYSAGAAELVLGGKRTGFGNLLAVFDAGSSYTYFNSNAYQ 313
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVR 307
+ ++L G P+K AP+D+TLP+CW G PF+++ +V +YFKP+ALSF +RR+ +
Sbjct: 314 -----LTKELAGKPIKEAPEDQTLPLCWYGKRPFRSVYEVKKYFKPIALSFPGSRRSKAQ 368
Query: 308 LVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPED 367
+PPEAYL+IS NVCLGIL+GSE V + N+IG+I M DK++++DNEKQ IGW D
Sbjct: 369 FEIPPEAYLIISNMGNVCLGILDGSEVGVEDLNLIGDISMLDKVMVFDNEKQLIGWTAAD 428
Query: 368 CNTL 371
CN +
Sbjct: 429 CNRV 432
>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
Length = 379
Score = 349 bits (895), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 189/360 (52%), Positives = 242/360 (67%), Gaps = 10/360 (2%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
++ V L +G+P K + D DTGSDLTW+QCD P CT+ P YKP N+V C +P C
Sbjct: 19 FYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDVPRAQCTEAPHPYYKPSNNLVACKDPICQ 78
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG-CGYN 134
+LH RC++P QCDYE+EY DGGSS+G LV D F L F++ + L G CGY+
Sbjct: 79 SLHTGGDQRCENPG-QCDYEVEYADGGSSLGVLVKDAFNLNFTSEKRQSPLLALGLCGYD 137
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPS 194
Q G P D GVLGLGRG+ SIVSQL GL+RNVIGHC+ G G LF GD S
Sbjct: 138 QLPGGTYHPID--GVLGLGRGKPSIVSQLSGLGLVRNVIGHCLSGRGGGFLFFGDDLYDS 195
Query: 195 SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVS 254
S VAWTPM N+ KHY G AEL + GK+ G K+L + FDSGASY Y S+VYQ ++S
Sbjct: 196 SRVAWTPMSPNA---KHYSPGFAELTFDGKTTGFKNLIVAFDSGASYTYLNSQVYQGLIS 252
Query: 255 LIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNR-RNSVRLVVP 311
LI R+L PL+ A DD+TLPICW+G PFK++ V +YFK ALSF N ++ +L P
Sbjct: 253 LIKRELSTKPLREALDDQTLPICWKGRKPFKSVRDVKKYFKTFALSFANDGKSKTQLEFP 312
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
PEAYL++S + N CLG+LNG+E + + N+IG+I MQD++VIYDNEKQ IGW P +C+ +
Sbjct: 313 PEAYLIVSSKGNACLGVLNGTEVGLNDLNVIGDISMQDRVVIYDNEKQLIGWAPRNCDRI 372
>gi|356527532|ref|XP_003532363.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 429
Score = 347 bits (891), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 176/360 (48%), Positives = 240/360 (66%), Gaps = 10/360 (2%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
++ V L +G+P + + D DTGSDLTW+QCDAPCT C++ P Y+P + VPC +P CA
Sbjct: 68 FYNVTLNIGQPARPYFLDVDTGSDLTWLQCDAPCTHCSETPHPLYRPSNDFVPCRDPLCA 127
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+L C+HP DQCDYEI Y D S+ G L+ D++ L F+NG V + GCGY+Q
Sbjct: 128 SLQPTEDYNCEHP-DQCDYEINYADQYSTFGVLLNDVYLLNFTNGVQLKVRMALGCGYDQ 186
Query: 136 -HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPS 194
+P P D LG G+ S++SQL GL+RNVIGHC+ G G +F G+ S
Sbjct: 187 VFSPSSYHPLDGLLGLGRGKA--SLISQLNSQGLVRNVIGHCLSAQGGGYIFFGNA-YDS 243
Query: 195 SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVS 254
+ V WTP+ +S D KHY GPAEL++ G+ G+ LT +FD+G+SY YF S YQ ++S
Sbjct: 244 ARVTWTPI--SSVDSKHYSAGPAELVFGGRKTGVGSLTAVFDTGSSYTYFNSHAYQALLS 301
Query: 255 LIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN-RRNSVRLVVP 311
+ ++L G PLK+APDD+TLP+CW G PF +L +V +YFKP+AL FTN R + +
Sbjct: 302 WLKKELSGKPLKVAPDDQTLPLCWHGKRPFTSLREVRKYFKPVALGFTNGGRTKAQFEIL 361
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
PEAYL+IS NVCLGILNGSE + E N+IG+I MQDK+++++NEKQ IGW P DC+ +
Sbjct: 362 PEAYLIISNLGNVCLGILNGSEVGLEELNLIGDISMQDKVMVFENEKQLIGWGPADCSRI 421
>gi|255563835|ref|XP_002522918.1| nucellin, putative [Ricinus communis]
gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis]
Length = 433
Score = 343 bits (879), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 180/361 (49%), Positives = 239/361 (66%), Gaps = 10/361 (2%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y+ V L++G+P K + D DTGSDLTW+QCDAPC C + P Y+P N+V C +P CA
Sbjct: 70 YYNVTLSIGQPAKPYFLDVDTGSDLTWLQCDAPCRQCIEAPHPLYRPSNNLVICEDPLCA 129
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+L P C+ P DQCDYE+EY DGGSS+G LV D+F L F+NG N L GCGY+Q
Sbjct: 130 SLQPPGVHNCQDP-DQCDYEVEYADGGSSLGVLVKDVFVLNFTNGKRLNPLLALGCGYDQ 188
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
PG + P G+LGLGRG SI SQL GL+ NVIGHC+ G G LF G+ SS
Sbjct: 189 L-PGRSNHP-LDGILGLGRGISSIPSQLSSQGLVSNVIGHCLSGRGGGFLFFGEDIYDSS 246
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
GV WTPM ++ LKHY G AEL++ GKS G+++L ++FDSG+SY Y ++ YQ +V
Sbjct: 247 GVTWTPMSRDH--LKHYSPGFAELIFDGKSTGIRNLLVVFDSGSSYTYLNAQAYQHLVFS 304
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF---TNRRNSVRLVV 310
+ R+L P+ A DD+TLP+CW+G PFK++ V +YFKP AL F + R + +
Sbjct: 305 LKRELSRKPISEALDDQTLPLCWKGKRPFKSIRDVKKYFKPFALVFKTSSGRSSKTQFEF 364
Query: 311 PPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
PEAYL+IS + N CLGILNG+E + + N+IG++ M D++VIY+NEKQ IGW C+
Sbjct: 365 SPEAYLIISSKGNACLGILNGTEVGLRDLNVIGDVSMLDRLVIYNNEKQMIGWAAASCDR 424
Query: 371 L 371
L
Sbjct: 425 L 425
>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 425
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 177/365 (48%), Positives = 240/365 (65%), Gaps = 14/365 (3%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCD---APCTGCTKPPEKQYKPH-KNIVPCSNP 72
+ V++ +G PPK ++ D DTGSDLTWVQCD APC GCT P +K YKP+ K +V CS+P
Sbjct: 62 YTVSINIGNPPKPYELDIDTGSDLTWVQCDGPDAPCKGCTMPKDKLYKPNGKQVVKCSDP 121
Query: 73 RCAALHWPNP--PRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C A + C + C Y ++Y D S++G LV D + + S + + FG
Sbjct: 122 ICVATQSTHVLGQICSKQSPPCVYNVQYADHASTLGVLVRDYMHIGSPSSSTKDPLVAFG 181
Query: 131 CGYNQHNPGPLSPPDT--AGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG 188
CGY Q GP +PP + AG+LGLG G+ SI+SQL G I NV+GHC+ G G LFLG
Sbjct: 182 CGYEQKFSGP-TPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVLGHCLSAEGGGYLFLG 240
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRV 248
D VPSSG+ WTP++Q+S + KHY GP +L ++GK K L +IFDSG+SY YF+S V
Sbjct: 241 DKFVPSSGIVWTPIIQSSLE-KHYNTGPVDLFFNGKPTPAKGLQIIFDSGSSYTYFSSPV 299
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSV 306
Y + +++ DL G PL D +LPICW+G PFK+L +V YFKPL LSFT +N +
Sbjct: 300 YTIVANMVNNDLKGKPLSRV-KDPSLPICWKGVKPFKSLNEVNNYFKPLTLSFTKSKN-L 357
Query: 307 RLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPE 366
+ +PP AYL+I+ NVCLGILNG+EA +G N++G+I +QDK+V+YDNEKQ+IGW
Sbjct: 358 QFQLPPVAYLIITKYGNVCLGILNGNEAGLGNRNVVGDISLQDKVVVYDNEKQQIGWASA 417
Query: 367 DCNTL 371
+C +
Sbjct: 418 NCKQI 422
>gi|356511197|ref|XP_003524315.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 431
Score = 341 bits (874), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 174/360 (48%), Positives = 239/360 (66%), Gaps = 10/360 (2%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
++ V L +G+P + + D DTGSDLTW+QCDAPCT C++ P ++P + VPC +P CA
Sbjct: 70 FYNVTLNIGQPARPYFLDVDTGSDLTWLQCDAPCTHCSETPHPLHRPSNDFVPCRDPLCA 129
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+L C+HP DQCDYEI Y D S+ G L+ D++ L SNG V + GCGY+Q
Sbjct: 130 SLQPTEDYNCEHP-DQCDYEINYADQYSTYGVLLNDVYLLNSSNGVQLKVRMALGCGYDQ 188
Query: 136 -HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPS 194
+P P D LG G+ S++SQL GL+RNVIGHC+ G G +F G+ S
Sbjct: 189 VFSPSSYHPLDGLLGLGRGKA--SLISQLNSQGLVRNVIGHCLSSQGGGYIFFGNA-YDS 245
Query: 195 SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVS 254
+ V WTP+ +S D KHY GPAEL++ G+ G+ LT +FD+G+SY YF S YQ ++S
Sbjct: 246 ARVTWTPI--SSVDSKHYSAGPAELVFGGRKTGVGSLTAVFDTGSSYTYFNSHAYQALLS 303
Query: 255 LIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN-RRNSVRLVVP 311
+ ++L G PLK+APDD+TL +CW G PF +L +V +YFKP+ALSFTN R + +P
Sbjct: 304 WLNKELSGKPLKVAPDDQTLSLCWHGKRPFTSLREVRKYFKPVALSFTNGGRVKAQFEIP 363
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
PEAYL+IS NVCLGILNG E + E N++G+I MQDK+++++NEKQ IGW P DC+ +
Sbjct: 364 PEAYLIISNLGNVCLGILNGFEVGLEELNLVGDISMQDKVMVFENEKQLIGWGPADCSRV 423
>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 401
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 171/345 (49%), Positives = 231/345 (66%), Gaps = 13/345 (3%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y+ V + +G+PP+ + D DTGSDLTW+QCDAPC C + P Y+P +++PC++P C
Sbjct: 56 YYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPLCK 115
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
ALH + RC+ P +QCDYE+EY DGGSS+G LV D+F + ++ G L GCGY+Q
Sbjct: 116 ALHLNSNQRCETP-EQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGYDQ 174
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
PG S GVLGLGRG++SI+SQL G ++NVIGHC+ G G+LF GD SS
Sbjct: 175 -IPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYDSS 233
Query: 196 GVAWTPMLQNSADLKHYILGPA---ELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEI 252
V+WTPM + + KHY PA ELL+ G++ GLK+L +FDSG+SY YF S+ YQ +
Sbjct: 234 RVSWTPMSREYS--KHY--SPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAV 289
Query: 253 VSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVRLV 309
L+ R+L G PLK A DD TLP+CW+G PF ++ +V +YFKPLALSF T R+
Sbjct: 290 TYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFE 349
Query: 310 VPPEAYLVISGRKNVCLGILNGSEAEVGENNII-GEIFMQDKMVI 353
+PPEAYL+IS + NVCLGILNG+E + N+I G +F+ + I
Sbjct: 350 IPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGGTVFILHTLAI 394
>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 418
Score = 333 bits (855), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 182/365 (49%), Positives = 239/365 (65%), Gaps = 20/365 (5%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCD---APCTGCTKPPEKQYKPHKN-IVPCSNP 72
+ V++ +G PP ++ D DTGSDLTWVQCD APC GCT P +K YKP+ N +V CS+P
Sbjct: 62 YTVSINIGNPPNPYELDIDTGSDLTWVQCDGPDAPCKGCTLPKDKLYKPNGNQLVKCSDP 121
Query: 73 RCAALHWPNPP---RCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT- 128
CAA+ P +C P C Y++EY D S GAL D + +GS NVPL
Sbjct: 122 ICAAVQPPFSTFGQKCAKPIPPCVYKVEYADNAESTGALARDYMHIGSPSGS--NVPLVV 179
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG 188
FGCGY Q GP PP T GVLGLG G+ISI+SQL G I NV+GHC+ G G LFLG
Sbjct: 180 FGCGYEQKFSGPTPPPSTPGVLGLGNGKISILSQLHSMGFIHNVLGHCLSAEGGGYLFLG 239
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRV 248
D +PSSG+ WTP++Q+S + KHY GP +L ++GK K L +IFDSG+SY YF+ RV
Sbjct: 240 DKFIPSSGIFWTPIIQSSLE-KHYSTGPVDLFFNGKPTPAKGLQIIFDSGSSYTYFSPRV 298
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSV 306
Y + +++ DL G PL+ D +LPICW+G PFK+L +V YFKPL LSFT +N +
Sbjct: 299 YTIVANMVNNDLKGKPLRRETKDPSLPICWKGVKPFKSLNEVNNYFKPLTLSFTKSKN-L 357
Query: 307 RLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPE 366
+ +PP + NVCLGILNG+EA +G N++G+I +QDK+V+YDNEKQ+IGW
Sbjct: 358 QFQLPPVKF------GNVCLGILNGNEAGLGNRNVVGDISLQDKVVVYDNEKQQIGWASA 411
Query: 367 DCNTL 371
+C +
Sbjct: 412 NCKQI 416
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 330 bits (845), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 165/360 (45%), Positives = 235/360 (65%), Gaps = 9/360 (2%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V +++G PP+ + D DTGSDLTW+QCDAPC C+K P Y+P KN +VPC + CA
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKLVPCVDQMCA 117
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
ALH +C P QCDYEI+Y D GSS+G LVTD F LR +N S+ L FGCGY
Sbjct: 118 ALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLAFGCGY 177
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q T GVLGLG G +S++SQL+++G+ +NV+GHC+ G G LF GD VP
Sbjct: 178 DQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGFLFFGDDIVP 237
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
S W PM ++++ +Y G A L + G+ G++ + ++FDSG+S+ YF+++ YQ +V
Sbjct: 238 YSRATWAPMARSTSR-NYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYFSAQPYQALV 296
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
I DL LK P D +LP+CW+G PFK++ V + FK + LSF+N + ++ + +P
Sbjct: 297 DAIKGDL-SKNLKEVP-DHSLPLCWKGKKPFKSVLDVKKEFKTVVLSFSNGKKAL-MEIP 353
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
PE YL+++ N CLGILNGSE + + NI+G+I MQD+MVIYDNE+ +IGW C+ +
Sbjct: 354 PENYLIVTKYGNACLGILNGSEVGLKDLNIVGDITMQDQMVIYDNERGQIGWIRAPCDRI 413
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 329 bits (843), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 164/360 (45%), Positives = 235/360 (65%), Gaps = 9/360 (2%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V +++G PP+ + D DTGSDLTW+QCDAPC C+K P Y+P KN +VPC + CA
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKLVPCVDQMCA 117
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
ALH +C P QCDYEI+Y D GSS+G LVTD F LR +N S+ L FGCGY
Sbjct: 118 ALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLAFGCGY 177
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q T GVLGLG G +S++SQL+++G+ +NV+GHC+ G G LF GD VP
Sbjct: 178 DQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGFLFFGDDIVP 237
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
S W PM ++++ +Y G A L + G+ G++ + ++FDSG+S+ YF+++ YQ +V
Sbjct: 238 YSRATWAPMARSTSR-NYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYFSAQPYQALV 296
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
I DL LK P D +LP+CW+G PFK++ V + F+ + LSF+N + ++ + +P
Sbjct: 297 DAIKGDL-SKNLKEVP-DHSLPLCWKGKKPFKSVLDVKKEFRTVVLSFSNGKKAL-MEIP 353
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
PE YL+++ N CLGILNGSE + + NI+G+I MQD+MVIYDNE+ +IGW C+ +
Sbjct: 354 PENYLIVTKYGNACLGILNGSEVGLKDLNIVGDITMQDQMVIYDNERGQIGWIRAPCDRI 413
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 165/360 (45%), Positives = 233/360 (64%), Gaps = 9/360 (2%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V +++G PP+ + D DTGSDLTW+QCDAPC C+K P Y+P KN +VPC + CA
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKLVPCVDQMCA 117
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
ALH +C P QCDYEI+Y D GSS+G LVTD F LR +N S+ L FGCGY
Sbjct: 118 ALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLAFGCGY 177
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q T GVLGLG G +S++SQL+++G+ +NV+GHC+ G G LF GD VP
Sbjct: 178 DQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGFLFFGDDIVP 237
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
S W PM + S +Y G A L + G+ G++ + ++FDSG+S+ YF+++ YQ +V
Sbjct: 238 YSRATWAPMAR-STSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYFSAQPYQALV 296
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
I DL LK P D +LP+CW+G PFK++ V + F+ + LSF+N + ++ + +P
Sbjct: 297 DAIKGDL-SKNLKEVP-DHSLPLCWKGKKPFKSVLDVKKEFRTVVLSFSNGKKAL-MEIP 353
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
PE YL+++ N CLGILNGSE + + NI+G+I MQD+MVIYDNE+ +IGW C+ +
Sbjct: 354 PENYLIVTKYGNACLGILNGSEVGLKDLNIVGDITMQDQMVIYDNERGQIGWIRAPCDRI 413
>gi|356507650|ref|XP_003522577.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 326
Score = 326 bits (836), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 171/346 (49%), Positives = 226/346 (65%), Gaps = 29/346 (8%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAALH 78
+++T+ +L++ D DTGSDLTW Q DAPC GCT P +K KPH +V C + CAA+H
Sbjct: 1 MSITITSSSELYELDIDTGSDLTWFQWDAPCQGCTLPRDKLNKPHCKLVKCGDRLCAAIH 60
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNP 138
C P++QCDYE+EY D GSS+G LV D L+F++GS+ P+
Sbjct: 61 ---SEPCADPDEQCDYEVEYADQGSSLGVLVLDNIALKFTSGSLAR-PI----------- 105
Query: 139 GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVA 198
L+ PD +GL G+ SI+SQL GLIRNV+GHC+ + G G LF GD +P SGV
Sbjct: 106 --LAAPD----MGLATGKTSILSQLHSLGLIRNVVGHCLSRRGGGFLFFGDQLIPQSGVV 159
Query: 199 WTPMLQNSA---DLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
WTP+LQNS+ HY GPA++ ++GK+ +K L L FDSG+SY F S ++ +V L
Sbjct: 160 WTPLLQNSSVTYTRPHYKTGPADMFFNGKATSVKGLELTFDSGSSYTXFNSHAHKALVGL 219
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
I D+ G A +D +LPICW+ P FK+L VT YFKP+ALSFT +NS+ L +PPE
Sbjct: 220 ITNDIKGKSFSRATEDPSLPICWKNPKTFKSLHDVTNYFKPIALSFTKSKNSL-LQLPPE 278
Query: 314 AYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
AYL+ G NVCLGIL+G+E +G NIIG+I +QDKMVIYDNEKQ
Sbjct: 279 AYLIKYG--NVCLGILDGTEIGLGNTNIIGDISLQDKMVIYDNEKQ 322
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 163/360 (45%), Positives = 234/360 (65%), Gaps = 9/360 (2%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V +++G PP+ + D DTGSDLTW+QCDAPC C K P Y+P KN IVPC + C+
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCNKVPHPLYRPTKNKIVPCVDQLCS 117
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
+LH +C P QCDYEI+Y D GSS+G L+TD F +R +N S+ L FGCGY
Sbjct: 118 SLHGGLSGKHKCDSPKQQCDYEIKYADQGSSLGVLLTDSFAVRLANSSIVRPSLAFGCGY 177
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q T GVLGLG G IS++SQL+++G+ +NV+GHC+ G G LF GD VP
Sbjct: 178 DQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGHCLSIRGGGFLFFGDNLVP 237
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
S W PM++ SA +Y G A L + G+S G++ + ++ DSG+S+ YF ++ YQ +V
Sbjct: 238 YSRATWVPMVR-SAFKNYYSPGTASLYFGGRSLGVRPMEVVLDSGSSFTYFGAQPYQALV 296
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
+ + DL T ++ D +LP+CW+G PFK++ V + FK L LSF+N + ++ + +P
Sbjct: 297 TALKSDLSKTLKEVF--DPSLPLCWKGKKPFKSVLDVKKEFKSLVLSFSNGKKAL-MEIP 353
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
PE YL+++ N CLGILNGSE + + NI+G+I MQD+MVIYDNE+ +IGW C+ +
Sbjct: 354 PENYLIVTKFGNACLGILNGSEIGLKDLNIVGDITMQDQMVIYDNERGQIGWIRAPCDRI 413
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 166/361 (45%), Positives = 232/361 (64%), Gaps = 12/361 (3%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V + +G PPK + D DTGSDLTW+QCDAPC C K P Y+P KN +VPC + CA
Sbjct: 66 YYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTKNKLVPCVDQLCA 125
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
+LH +C P +QCDY I+Y D GSS G LV D F LR +NGSV L FGCGY
Sbjct: 126 SLHNGLNRKHKCDSPYEQCDYVIKYADQGSSTGVLVNDSFALRLANGSVVRPSLAFGCGY 185
Query: 134 NQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
+Q + G +SP D GVLGLG G +S++SQ +++G+ +NV+GHC+ G G LF GD V
Sbjct: 186 DQQVSSGEMSPTD--GVLGLGTGSVSLLSQFKQHGVTKNVVGHCLSLRGGGFLFFGDDLV 243
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEI 252
P V WTPM++ S +Y G A L + +S +K ++FDSG+S+ YF ++ YQ +
Sbjct: 244 PYQRVTWTPMVR-SPLRNYYSPGSASLYFGDQSLRVKLTEVVFDSGSSFTYFAAQPYQAL 302
Query: 253 VSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVV 310
V+ + DL T +++ D +LP+CW+G PFK++ V + FK L L+F N N + +
Sbjct: 303 VTALKGDLSRTLKEVS--DPSLPLCWKGKKPFKSVLDVKKEFKSLVLNFGNG-NKAFMEI 359
Query: 311 PPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
PP+ YL+++ N CLGILNGSE + + +I+G+I MQD+MVIYDNEK +IGW C+
Sbjct: 360 PPQNYLIVTKYGNACLGILNGSEVGLKDLSILGDITMQDQMVIYDNEKGQIGWIRAPCDR 419
Query: 371 L 371
+
Sbjct: 420 I 420
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 314 bits (804), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 165/360 (45%), Positives = 233/360 (64%), Gaps = 12/360 (3%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V + +G PPK + D D+GSDLTW+QCDAPC C + P Y+P K+ +VPC + CA
Sbjct: 64 YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKSKLVPCVHRLCA 123
Query: 76 ALHWP---NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+LH RC+ P++QCDY I+Y D GSS G LV D F LR +NGSV + FGCG
Sbjct: 124 SLHNALTGGKHRCESPHEQCDYVIKYADQGSSTGVLVNDSFALRLTNGSVARPSVAFGCG 183
Query: 133 YNQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
Y+Q G LS P T GVLGLG G +S++SQL++ G+ +NV+GHC+ G G LF GD
Sbjct: 184 YDQQVRSGDLSSP-TDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLSLRGGGFLFFGDDL 242
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQE 251
VP WTPM + SA +Y G A L + +S G++ ++FDSG+S+ YF ++ YQ
Sbjct: 243 VPYQRATWTPMAR-SAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGSSFTYFAAKPYQA 301
Query: 252 IVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLV 309
+V+ ++D + L+ PD +LP+CW+G PFK++ V + FK L L+F + + ++ +
Sbjct: 302 LVT-ALKDGLSRTLEEEPD-TSLPLCWKGQEPFKSVLDVRKEFKSLVLNFASGKKTL-ME 358
Query: 310 VPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+PPE YL+++ N CLGILNGSE + + +IIG+I MQD MVIYDNEK +IGW C+
Sbjct: 359 IPPENYLIVTENGNACLGILNGSEIGLKDLSIIGDITMQDHMVIYDNEKGKIGWIRAPCD 418
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 313 bits (803), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 164/359 (45%), Positives = 232/359 (64%), Gaps = 11/359 (3%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V + +G PPK + D D+GSDLTW+QCDAPC C + P Y+P K+ +VPC + CA
Sbjct: 57 YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKSKLVPCVHRLCA 116
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
+LH RC P++QCDY I+Y D GSS G L+ D F LR +NGSV + FGCGY
Sbjct: 117 SLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTNGSVARPSVAFGCGY 176
Query: 134 NQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
+Q G LS P T GVLGLG G +S++SQL++ G+ +NV+GHC+ G G LF GD V
Sbjct: 177 DQQVRSGDLSSP-TDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLSLRGGGFLFFGDDLV 235
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEI 252
P WTPM + SA +Y G A L + +S G++ ++FDSG+S+ YF ++ YQ +
Sbjct: 236 PYQRATWTPMAR-SAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGSSFTYFAAKPYQAL 294
Query: 253 VSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVV 310
V+ ++D + L+ PD +LP+CW+G PFK++ V + FK L L+F + + ++ + +
Sbjct: 295 VT-ALKDGLSRTLEEEPD-TSLPLCWKGQEPFKSVLDVRKEFKSLVLNFASGKKTL-MEI 351
Query: 311 PPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
PPE YL+++ N CLGILNGSE + + +IIG+I MQD MVIYDNEK +IGW C+
Sbjct: 352 PPENYLIVTENGNACLGILNGSEIGLKDLSIIGDITMQDHMVIYDNEKGKIGWIRAPCD 410
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 313 bits (802), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 164/359 (45%), Positives = 232/359 (64%), Gaps = 11/359 (3%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V + +G PPK + D D+GSDLTW+QCDAPC C + P Y+P K+ +VPC + CA
Sbjct: 66 YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKSKLVPCVHRLCA 125
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
+LH RC P++QCDY I+Y D GSS G L+ D F LR +NGSV + FGCGY
Sbjct: 126 SLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTNGSVARPSVAFGCGY 185
Query: 134 NQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
+Q G LS P T GVLGLG G +S++SQL++ G+ +NV+GHC+ G G LF GD V
Sbjct: 186 DQQVRSGDLSSP-TDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLSLRGGGFLFFGDDLV 244
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEI 252
P WTPM + SA +Y G A L + +S G++ ++FDSG+S+ YF ++ YQ +
Sbjct: 245 PYQRATWTPMAR-SAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGSSFTYFAAKPYQAL 303
Query: 253 VSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVV 310
V+ ++D + L+ PD +LP+CW+G PFK++ V + FK L L+F + + ++ + +
Sbjct: 304 VT-ALKDGLSRTLEEEPD-TSLPLCWKGQEPFKSVLDVRKEFKSLVLNFASGKKTL-MEI 360
Query: 311 PPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
PPE YL+++ N CLGILNGSE + + +IIG+I MQD MVIYDNEK +IGW C+
Sbjct: 361 PPENYLIVTENGNACLGILNGSEIGLKDLSIIGDITMQDHMVIYDNEKGKIGWIRAPCD 419
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 165/367 (44%), Positives = 232/367 (63%), Gaps = 20/367 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
+P Y+ V + +G P K + D DTGSDLTW+QCDAPC C K P Y+P N +VPC
Sbjct: 48 YPTGHYY-VTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPC 106
Query: 70 SNPRCAALHWPNPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLF--PLRFSNGSVFNVP 126
+N C ALH K P+ QCDY+I+Y D SS G L+ D F P+R SN
Sbjct: 107 ANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN---IRPG 163
Query: 127 LTFGCGYNQH---NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
LTFGCGY+Q N + D G+LGLGRG +S+VSQL++ G+ +NV+GHC+ NG G
Sbjct: 164 LTFGCGYDQQVGKNGAVQAAID--GMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGG 221
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
LF GD VPSS V W PM Q ++ +Y G L + +S G+K + ++FDSG++Y Y
Sbjct: 222 FLFFGDDVVPSSRVTWVPMAQRTSG-NYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTY 280
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN 301
FT++ YQ +VS + L + +++ D TLP+CW+G FK++ V FK + LSF++
Sbjct: 281 FTAQPYQAVVSALKGGLSKSLKQVS--DPTLPLCWKGQKAFKSVFDVKNEFKSMFLSFSS 338
Query: 302 RRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRI 361
+N+ + +PPE YL+++ NVCLGIL+G+ A++ N+IG+I MQD+MVIYDNEK ++
Sbjct: 339 AKNAA-MEIPPENYLIVTKNGNVCLGILDGTAAKL-SFNVIGDITMQDQMVIYDNEKSQL 396
Query: 362 GWKPEDC 368
GW C
Sbjct: 397 GWARGAC 403
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 304 bits (778), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 165/367 (44%), Positives = 231/367 (62%), Gaps = 20/367 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
+P Y+ V + +G P K + D DTGSDLTW+QCDAPC C K P Y+P N +VPC
Sbjct: 48 YPTGHYY-VTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPC 106
Query: 70 SNPRCAALHWPNPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLF--PLRFSNGSVFNVP 126
+N C ALH K P+ QCDY+I+Y D SS G L+ D F P+R SN
Sbjct: 107 ANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN---IRPG 163
Query: 127 LTFGCGYNQH---NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
LTFGCGY+Q N + D G+LGLGRG +S+VSQL++ G+ +NV+GHC+ NG G
Sbjct: 164 LTFGCGYDQQVGKNGAVQAAID--GMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGG 221
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
LF GD VPSS V W PM Q ++ +Y G L + +S G+K + ++FDSG++Y Y
Sbjct: 222 FLFFGDDVVPSSRVTWVPMAQRTSG-NYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTY 280
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN 301
FT++ YQ +VS + L + +++ D TLP+CW+G FK++ V FK + LSF +
Sbjct: 281 FTAQPYQAVVSALKGGLSKSLKQVS--DPTLPLCWKGQKAFKSVFDVKNEFKSMFLSFAS 338
Query: 302 RRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRI 361
+N+ + +PPE YL+++ NVCLGIL+G+ A++ N+IG+I MQD+MVIYDNEK ++
Sbjct: 339 AKNAA-MEIPPENYLIVTKNGNVCLGILDGTAAKL-SFNVIGDITMQDQMVIYDNEKSQL 396
Query: 362 GWKPEDC 368
GW C
Sbjct: 397 GWARGAC 403
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 303 bits (777), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 159/359 (44%), Positives = 231/359 (64%), Gaps = 12/359 (3%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRC 74
++ V + +G P K + D DTGSDLTW+QCDAPC C K P Y+P KN +VPC+N C
Sbjct: 56 HYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKNKLVPCANSIC 115
Query: 75 AALHWPNPPRCK-HPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
ALH + P K QCDY+I+Y D SS+G LVTD F L N S L+FGCGY
Sbjct: 116 TALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSLPLRNKSNVRPSLSFGCGY 175
Query: 134 NQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
+Q +P T G+LGLGRG +S++SQL++ G+ +NV+GHC+ +G G LF GD V
Sbjct: 176 DQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTSGGGFLFFGDDMV 235
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEI 252
P+S V W PM+++++ +Y G A L + +S K + ++FDSG++Y YF+++ YQ
Sbjct: 236 PTSRVTWVPMVRSTSG-NYYSPGSATLYFDRRSLSTKPMEVVFDSGSTYTYFSAQPYQAT 294
Query: 253 VSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVV 310
+S I L + +++ D +LP+CW+G FK++ V + FK +L F +N+V + +
Sbjct: 295 ISAIKGSLSKSLKQVS--DPSLPLCWKGQKAFKSVSDVKKDFK--SLQFIFGKNAV-MEI 349
Query: 311 PPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
PPE YL+++ NVCLGIL+GS A++ +IIG+I MQD+MVIYDNEK ++GW C+
Sbjct: 350 PPENYLIVTKNGNVCLGILDGSAAKL-SFSIIGDITMQDQMVIYDNEKAQLGWIRGSCS 407
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 300 bits (768), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 164/362 (45%), Positives = 229/362 (63%), Gaps = 15/362 (4%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
+PI Y+ V + +G P K + D DTGSDLTW+QCDAPC C K P YKP KN IVPC
Sbjct: 68 YPIGHYY-VTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPWYKPTKNKIVPC 126
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C +L PN +C P QCDY+I+Y D SS+G L+ D F L N S LTF
Sbjct: 127 AASLCTSLT-PNK-KCAVPQ-QCDYQIKYTDKASSLGVLIADNFTLSLRNSSTVRANLTF 183
Query: 130 GCGYNQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG 188
GCGY+Q T G+LGLG+G +S++SQL++ G+ +NV+GHC NG G LF G
Sbjct: 184 GCGYDQQVGKNGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCFSTNGGGFLFFG 243
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRV 248
D VP+S V W PM + ++ +Y G L + +S G+K + ++FDSG++YAYF +
Sbjct: 244 DDIVPTSRVTWVPMARTTSG-NYYSPGSGTLYFDRRSLGMKPMEVVFDSGSTYAYFAAEP 302
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFKPLALSFTNRRNSV 306
YQ VS + L + +++ D +LP+CW+G FK++ +V FK L LSF +NSV
Sbjct: 303 YQATVSALKAGLSKSLKEVS--DVSLPLCWKGQKVFKSVSEVKNDFKSLFLSF--GKNSV 358
Query: 307 RLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPE 366
+ +PPE YL+++ NVCLGIL+G+ A++ + NIIG+I MQD+M+IYDNEK ++GW
Sbjct: 359 -MEIPPENYLIVTKYGNVCLGILDGTTAKL-KFNIIGDITMQDQMIIYDNEKGQLGWIRG 416
Query: 367 DC 368
C
Sbjct: 417 SC 418
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 159/360 (44%), Positives = 225/360 (62%), Gaps = 14/360 (3%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRC 74
++ V + +G P K + D DTGSDLTW+QCDAPC C K P YKP KN +VPC+ C
Sbjct: 51 HYYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSCNKVPHPLYKPTKNKLVPCAASIC 110
Query: 75 AALHWPNPP--RCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
LH P +C P QCDY+I+Y D SS+G LVTD F L N S TFGCG
Sbjct: 111 TTLHSAQSPNKKCAVPQ-QCDYQIKYTDSASSLGVLVTDNFTLPLRNSSSVRPSFTFGCG 169
Query: 133 YNQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
Y+Q + T G+LGLG+G +S+VSQL+ G+ +NV+GHC+ NG G LF GD
Sbjct: 170 YDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCLSTNGGGFLFFGDNV 229
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQE 251
VP+S W PM+++++ +Y G L + +S G+K + ++FDSG++Y YF ++ YQ
Sbjct: 230 VPTSRATWVPMVRSTSG-NYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTYFAAQPYQA 288
Query: 252 IVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFKPLALSFTNRRNSVRLV 309
VS + L + +++ D +LP+CW+G FK++ V FK L LSF +NSV L
Sbjct: 289 TVSALKAGLSKSLQQVS--DPSLPLCWKGQKVFKSVSDVKNDFKSLFLSFV--KNSV-LE 343
Query: 310 VPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+PPE YL+++ N CLGIL+GS A++ NIIG+I MQD+++IYDNE+ ++GW C+
Sbjct: 344 IPPENYLIVTKNGNACLGILDGSAAKL-TFNIIGDITMQDQLIIYDNERGQLGWIRGSCS 402
>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
Length = 357
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 162/355 (45%), Positives = 225/355 (63%), Gaps = 19/355 (5%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCAALHWPN 81
+G P K + D DTGSDLTW+QCDAPC C K P Y+P N +VPC+N C ALH
Sbjct: 1 IGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPCANALCTALHSGQ 60
Query: 82 PPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLF--PLRFSNGSVFNVPLTFGCGYNQH-- 136
K P+ QCDY+I+Y D SS G L+ D F P+R SN LTFGCGY+Q
Sbjct: 61 GSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN---IRPGLTFGCGYDQQVG 117
Query: 137 -NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
N + D G+LGLGRG +S+VSQL++ G+ +NV+GHC+ NG G LF GD VPSS
Sbjct: 118 KNGAVQAAID--GMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGGFLFFGDDVVPSS 175
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
V W PM Q ++ +Y G L + +S G+K + ++FDSG++Y YFT++ YQ +VS
Sbjct: 176 RVTWVPMAQRTSG-NYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTYFTAQPYQAVVSA 234
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ L + +++ D TLP+CW+G FK++ V FK + LSF + +N+ + +PPE
Sbjct: 235 LKGGLSKSLKQVS--DPTLPLCWKGQKAFKSVFDVKNEFKSMFLSFASAKNAA-MEIPPE 291
Query: 314 AYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
YL+++ NVCLGIL+G+ A++ N+IG+I MQD+MVIYDNEK ++GW C
Sbjct: 292 NYLIVTKNGNVCLGILDGTAAKL-SFNVIGDITMQDQMVIYDNEKSQLGWARGAC 345
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 158/359 (44%), Positives = 229/359 (63%), Gaps = 12/359 (3%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRC 74
++ V + +G P K + D DTGSDLTW+QCDAPC C K P Y+P KN +VPC+N C
Sbjct: 56 HYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKNKLVPCANSIC 115
Query: 75 AALHWPNPPRCK-HPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
ALH + P K QCDY+I+Y D SS+G LV D F L N S L+FGCGY
Sbjct: 116 TALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFSLPLRNKSNVRPSLSFGCGY 175
Query: 134 NQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
+Q +P T G+LGLGRG +S++SQL++ G+ +NV+GHC+ +G G LF GD V
Sbjct: 176 DQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTSGGGFLFFGDDMV 235
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEI 252
P+S V W M+++++ +Y G A L + +S K + ++FDSG++Y YF+++ YQ
Sbjct: 236 PTSRVTWVSMVRSTSG-NYYSPGSATLYFDRRSLSTKPMEVVFDSGSTYTYFSAQPYQAT 294
Query: 253 VSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVV 310
+S I L + +++ D +LP+CW+G FK++ V + FK +L F +N+V + +
Sbjct: 295 ISAIKGSLSKSLKQVS--DPSLPLCWKGQKAFKSVSDVKKDFK--SLQFIFGKNAV-MDI 349
Query: 311 PPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
PPE YL+I+ NVCLGIL+GS A++ +IIG+I MQD+MVIYDNEK ++GW C+
Sbjct: 350 PPENYLIITKNGNVCLGILDGSAAKL-SFSIIGDITMQDQMVIYDNEKAQLGWIRGSCS 407
>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
Length = 395
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 148/329 (44%), Positives = 208/329 (63%), Gaps = 9/329 (2%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V +++G PP+ + D DTGSDLTW+QCDAPC C+K P Y+P KN +VPC + CA
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKLVPCVDQMCA 117
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
ALH +C P QCDYEI+Y D GSS+G LVTD F LR +N S+ L FGCGY
Sbjct: 118 ALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLAFGCGY 177
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q T GVLGLG G +S++SQL+++G+ +NV+GHC+ G G LF GD VP
Sbjct: 178 DQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGFLFFGDDIVP 237
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
S W PM + S +Y G A L + G+ G++ + ++FDSG+S+ YF+++ YQ +V
Sbjct: 238 YSRATWAPMAR-STSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYFSAQPYQALV 296
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
I DL LK P D +LP+CW+G PFK++ V + F+ + LSF+N + ++ + +P
Sbjct: 297 DAIKGDL-SKNLKEVP-DHSLPLCWKGKKPFKSVLDVKKEFRTVVLSFSNGKKAL-MEIP 353
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENN 340
PE YL+++ N CLGILNGSE G +
Sbjct: 354 PENYLIVTKYGNACLGILNGSELPQGSEH 382
>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
Length = 446
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 152/376 (40%), Positives = 218/376 (57%), Gaps = 20/376 (5%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNP 72
+ + V + VG P K + D D+GS+LTW+QCDAPC C K P YK K ++VP +P
Sbjct: 76 YGLYYVTMLVGNPSKPYFLDVDSGSELTWIQCDAPCISCAKGPHPLYKLKKGSLVPSKDP 135
Query: 73 RCAAL-----HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
CAA+ H+ N K + +CDY++ Y D G S G LV D +N +V
Sbjct: 136 LCAAVQAGSGHYHNH---KEASQRCDYDVAYADHGYSEGFLVRDSVRALLTNKTVLTANS 192
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR--GVL 185
FGCGYNQ P+S T G+LGLG G S+ SQ + GLI+NVIGHCI GR G +
Sbjct: 193 VFGCGYNQRESLPVSDARTDGILGLGSGMASLPSQWAKQGLIKNVIGHCIFGAGRDGGYM 252
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-----GLKDLTLIFDSGAS 240
F GD V +S + W PML + +KHY +G A++ + K G K +IFDSG++
Sbjct: 253 FFGDDLVSTSAMTWVPMLGRPS-IKHYYVGAAQMNFGNKPLDKDGDGKKLGGIIFDSGST 311
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFKPLALS 298
Y YFT++ Y +S++ +L G L+ D L +CWR F+++ + YFKPL L
Sbjct: 312 YTYFTNQAYGAFLSVVKENLSGKQLEQDSSDSFLSLCWRRKEGFRSVAEAAAYFKPLTLK 371
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
F + + ++ + PE YLV++ + NVCLGILNG+ + + N++G+I Q ++V+YDNEK
Sbjct: 372 FRSTKTK-QMEIFPEGYLVVNKKGNVCLGILNGTAIGIVDTNVLGDISFQGQLVVYDNEK 430
Query: 359 QRIGWKPEDCNTLLSL 374
+IGW DC + L
Sbjct: 431 NQIGWARSDCQEISKL 446
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 284 bits (726), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 150/370 (40%), Positives = 209/370 (56%), Gaps = 20/370 (5%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPRCA 75
+ + L +G PPKL+ D DTGSDLTW QCDAPC C P Y P K +V C P CA
Sbjct: 40 YYMALLLGSPPKLYFLDMDTGSDLTWAQCDAPCRNCAIGPHGLYNPKKAKVVDCHLPVCA 99
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+ C QCDYE+EY DG S++G LV D +R +NG++ GCGY+Q
Sbjct: 100 QIQQGGSYECNSDVKQCDYEVEYADGSSTMGVLVEDTLTVRLTNGTLIQTKAIIGCGYDQ 159
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDGKVP 193
SP T GV+GL ++++ +QL E G+I+NV+GHC+ G NG G LF GD VP
Sbjct: 160 QGTLAKSPASTDGVIGLSSSKVALPAQLAEKGIIKNVLGHCLADGSNGGGYLFFGDELVP 219
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL---KDLT-----LIFDSGASYAYFT 245
S G+ WTPM+ ++ Y + Y G S L +DLT ++FDSG S+ Y
Sbjct: 220 SWGMTWTPMM-GKPEMLGYQARLQSIRYGGDSLVLNNDEDLTRSTSSVMFDSGTSFTYLV 278
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRR 303
+ Y ++S + + + L D TLP CWRG PF+++ V +YFK L L F R
Sbjct: 279 PQAYASVLSAVTKQ---SGLLRVKSDTTLPYCWRGPSPFQSITDVHQYFKTLTLDFGGRN 335
Query: 304 ---NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
L + P+ YL++S + NVCLGIL+ S A + NIIG++ M+ +V+YDN + R
Sbjct: 336 WFATDSTLDLSPQGYLIVSTQGNVCLGILDASGASLEVTNIIGDVSMRGYLVVYDNVRDR 395
Query: 361 IGWKPEDCNT 370
IGW +C++
Sbjct: 396 IGWIRRNCHS 405
>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 381
Score = 273 bits (699), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 150/373 (40%), Positives = 204/373 (54%), Gaps = 24/373 (6%)
Query: 10 FFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVP 68
+P Y+ L +G P KL+ D DTGSDLTW+QCDAPC C P Y P K +V
Sbjct: 17 IYPDGLYYMAML-IGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGPHGLYDPKKARLVD 75
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
C P CA + C P QCDY++EY DG S++G L+ D L +NG+
Sbjct: 76 CRVPLCALVQQGGSYACGGPVRQCDYDVEYADGSSTMGVLMEDTITLLLTNGTRSKTTAI 135
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLF 186
GCGY+Q +P T GV+GL +IS+ SQL + G++RNVIGHC+ G NG G LF
Sbjct: 136 IGCGYDQQGTLAQTPASTDGVMGLSSAKISLPSQLAKKGIVRNVIGHCLAGGSNGGGYLF 195
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-----LIFDSGASY 241
GD VP+ G+ WTP++ S I G GKS D T ++FDSG S+
Sbjct: 196 FGDSLVPALGMTWTPIMGKS------ITGN----IGGKSGDADDKTGDIGGVMFDSGTSF 245
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF 299
Y Y ++S + + + L D TLP CWRG PF+++ V YFK + L F
Sbjct: 246 TYLVPEAYNAVLSAMEMQVEKSGLVRIKTDNTLPFCWRGPSPFESVADVQRYFKTVTLDF 305
Query: 300 TNRR---NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
R S L + PE YL++S + NVCLGIL+ S A + NIIG++ M+ +V+YDN
Sbjct: 306 GKRNWYSASRVLELSPEGYLIVSTQGNVCLGILDASGASLEVTNIIGDVSMRGYLVVYDN 365
Query: 357 EKQRIGWKPEDCN 369
+ +IGW +C+
Sbjct: 366 ARNQIGWVRRNCH 378
>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
Length = 410
Score = 273 bits (699), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 148/376 (39%), Positives = 212/376 (56%), Gaps = 18/376 (4%)
Query: 17 FAVNLTVGKPP--KLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPR 73
+ + VGKP + + D DTGS+LTW+QCDAPCT C K + YKP K N+V S
Sbjct: 30 YYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVRSSEAF 89
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C + QCDYEIEY D S+G L D F L+ NGS+ + FGCGY
Sbjct: 90 CVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVFGCGY 149
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFLGDGK 191
+Q + T G+LGL R +IS+ SQL G+I NV+GHC+ NG G +F+G
Sbjct: 150 DQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDL 209
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-----LIFDSGASYAYFTS 246
VPS G+ W PML +S L Y + ++ Y L ++FD+G+SY YF +
Sbjct: 210 VPSHGMTWVPMLHDSR-LDAYQMQVTKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYFPN 268
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG----PFKALGQVTEYFKPLALSFTNR 302
+ Y ++V+ ++++ G L D+TLPICWR PF +L V ++F+P+ L ++
Sbjct: 269 QAYSQLVT-SLQEVSGLELTRDDSDETLPICWRAKTNFPFSSLSDVKKFFRPITLQIGSK 327
Query: 303 --RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
S +L++ PE YL+IS + NVCLGIL+GS G I+G+I M+ +++YDN K+R
Sbjct: 328 WLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGDISMRGHLIVYDNVKRR 387
Query: 361 IGWKPEDCNTLLSLNH 376
IGW DC ++H
Sbjct: 388 IGWMKSDCVRPREIDH 403
>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
Length = 583
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 148/378 (39%), Positives = 213/378 (56%), Gaps = 18/378 (4%)
Query: 17 FAVNLTVGKPP--KLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPR 73
+ + VGKP + + D DTGS+LTW+QCDAPCT C K + YKP K N+V S
Sbjct: 203 YYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVRSSEAF 262
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C + QCDYEIEY D S+G L D F L+ NGS+ + FGCGY
Sbjct: 263 CVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVFGCGY 322
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFLGDGK 191
+Q + T G+LGL R +IS+ SQL G+I NV+GHC+ NG G +F+G
Sbjct: 323 DQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDL 382
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-----LIFDSGASYAYFTS 246
VPS G+ W PML +S L Y + ++ Y L ++FD+G+SY YF +
Sbjct: 383 VPSHGMTWVPMLHDSR-LDAYQMQVTKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYFPN 441
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG----PFKALGQVTEYFKPLALSFTNR 302
+ Y ++V+ ++++ G L D+TLPICWR PF +L V ++F+P+ L ++
Sbjct: 442 QAYSQLVT-SLQEVSGLELTRDDSDETLPICWRAKTNFPFSSLSDVKKFFRPITLQIGSK 500
Query: 303 --RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
S +L++ PE YL+IS + NVCLGIL+GS G I+G+I M+ +++YDN K+R
Sbjct: 501 WLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGDISMRGHLIVYDNVKRR 560
Query: 361 IGWKPEDCNTLLSLNHFI 378
IGW DC ++H +
Sbjct: 561 IGWMKSDCVRPREIDHNV 578
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 147/368 (39%), Positives = 208/368 (56%), Gaps = 18/368 (4%)
Query: 17 FAVNLTVGKPP--KLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPR 73
+ + VGKP + + D DTGSDLTW+QCDAPCT C K + YKP K N+V S P
Sbjct: 198 YYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKGANQLYKPRKDNLVRSSEPF 257
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C + QCDYEIEY D S+G L D F L+ NGS+ + FGCGY
Sbjct: 258 CVEVQRNQLTEHCESCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVFGCGY 317
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFLGDGK 191
+Q + T G+LGL R +IS+ SQL G+I NV+GHC+ NG G +F+G
Sbjct: 318 DQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDL 377
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-----LIFDSGASYAYFTS 246
VPS G+ W PML + L+ Y + ++ Y L ++FD+G+SY YF +
Sbjct: 378 VPSHGMTWVPMLHH-PHLEVYQMQVTKMSYGNAMLSLDGENGRVGKVLFDTGSSYTYFPN 436
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG----PFKALGQVTEYFKPLALSFTNR 302
+ Y ++V+ ++++ L D+ LPICWR P +L V ++F+P+ L ++
Sbjct: 437 QAYSQLVT-SLQEVSDLELTRDDSDEALPICWRAKTNSPISSLSDVKKFFRPITLQIGSK 495
Query: 303 --RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
S +L++ PE YL+IS + NVCLGIL+GS G IIG+I M+ ++++YDN KQR
Sbjct: 496 WLIISKKLLIQPEDYLIISNKGNVCLGILDGSNVHDGSTIIIGDISMRGRLIVYDNVKQR 555
Query: 361 IGWKPEDC 368
IGW DC
Sbjct: 556 IGWMKSDC 563
>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
Length = 583
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 151/370 (40%), Positives = 207/370 (55%), Gaps = 15/370 (4%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
+P YF L VG PP+ + D DT SDLTW+QCDAPCT C K YKP + NIV
Sbjct: 203 YPDGLYFTYIL-VGNPPRPYYLDIDTASDLTWIQCDAPCTSCAKGANALYKPRRDNIVTP 261
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C LH QCDYEIEY D SS+G L D L +NGS N+ F
Sbjct: 262 KDSLCVELHRNQKAGYCETCQQCDYEIEYADHSSSMGVLARDELHLTMANGSSTNLKFNF 321
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFL 187
GC Y+Q + T G+LGL + ++S+ SQL G+I NV+GHC+ + G G +FL
Sbjct: 322 GCAYDQQGLLLNTLVKTDGILGLSKAKVSLPSQLANRGIINNVVGHCLANDVVGGGYMFL 381
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYA 242
GD VP G++W PML +S + Y +L Y L + ++FDSG+SY
Sbjct: 382 GDDFVPRWGMSWVPML-DSPSIDSYQTQIMKLNYGSGPLSLGGQERRVRRIVFDSGSSYT 440
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT 300
YFT Y E+V+ ++ + G L D TLP CWR P +++ V +YFK L L F
Sbjct: 441 YFTKEAYSELVA-SLKQVSGEALIQDTSDPTLPFCWRAKFPIRSVIDVKQYFKTLTLQFG 499
Query: 301 NR--RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
++ S + +PPE YL+IS + NVCLGIL+GS+ G + I+G+I ++ +++IYDN
Sbjct: 500 SKWWIISTKFRIPPEGYLIISNKGNVCLGILDGSDVHDGSSIILGDISLRGQLIIYDNVN 559
Query: 359 QRIGWKPEDC 368
+IGW DC
Sbjct: 560 NKIGWTQSDC 569
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 143/365 (39%), Positives = 203/365 (55%), Gaps = 14/365 (3%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPRCA 75
+ + + +G P KL+ D DTGSDLTW+QCDAPC C P Y P + +V C P CA
Sbjct: 31 YYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPHGLYDPKRARVVDCRRPTCA 90
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+ C QCDYE++Y DG S++G LV D L +NG+ F GCGY+Q
Sbjct: 91 QVQRGGQFTCSGDVRQCDYEVDYVDGSSTMGILVEDTITLVLTNGTRFQTRAVIGCGYDQ 150
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDGKVP 193
+P T GV+GL +IS+ SQL G+ NVIGHC+ G NG G LF GD VP
Sbjct: 151 QGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLAGGSNGGGYLFFGDTLVP 210
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-----LIFDSGASYAYFTSRV 248
+ G+ WTPM+ ++ Y + Y G+ L+ T +FDSG S+ Y
Sbjct: 211 ALGMTWTPMIGRPL-VEGYQARLRSIKYGGEVLELEGTTDDVGGAMFDSGTSFTYLVPNA 269
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF---TNRR 303
Y ++S ++R + L+ D TLP CWRG PF+++ V+ YFK + L F T
Sbjct: 270 YTAVLSAVVRQAQRSGLERIKTDTTLPFCWRGPSPFESVADVSAYFKTVTLDFGGSTWWS 329
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
+ L + PE YL++S + NVCLG+L+ S A + NI+G+I M+ +V+YDN +++IGW
Sbjct: 330 SGKLLELSPEGYLIVSTQGNVCLGVLDASVASLEVTNILGDISMRGYLVVYDNMREQIGW 389
Query: 364 KPEDC 368
+C
Sbjct: 390 VRRNC 394
>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
Length = 410
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 152/374 (40%), Positives = 216/374 (57%), Gaps = 20/374 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH-KNIVPC 69
+PI +F V + +G P K + D DTGS LTW+QCD PC C K P YKP K V C
Sbjct: 33 YPIGHFF-VTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLYKPELKYAVKC 91
Query: 70 SNPRCAALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
+ RCA L+ P +C P +QC Y I+Y GGSSIG L+ D F L SNG+ +
Sbjct: 92 TEQRCADLYADLRKPMKCG-PKNQCHYGIQY-VGGSSIGVLIVDSFSLPASNGT-NPTSI 148
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLF 186
FGCGYNQ P G+LGLGRG+++++SQL+ G+I ++V+GHCI G+G LF
Sbjct: 149 AFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGFLF 208
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYI--LGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
GD KVP+SGV W+PM + + KHY G + + K + +IFDSGA+Y YF
Sbjct: 209 FGDAKVPTSGVTWSPM---NREHKHYSPRQGTLQFNSNSKPISAAPMEVIFDSGATYTYF 265
Query: 245 TSRVYQEIVSLIMRDLIGT---PLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF 299
+ Y +S++ L ++ D+ L +CW+G + + +V + F+ L+L F
Sbjct: 266 ALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVKKCFRSLSLKF 325
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAE--VGENNIIGEIFMQDKMVIYDNE 357
+ L +PPE YL+IS +VCLGIL+GS+ + N+IG I M D+MVIYD+E
Sbjct: 326 ADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNLIGGITMLDQMVIYDSE 385
Query: 358 KQRIGWKPEDCNTL 371
+ +GW C+ +
Sbjct: 386 RSLLGWVNYQCDRI 399
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 148/369 (40%), Positives = 211/369 (57%), Gaps = 14/369 (3%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
+P YF ++ VG PP+ + D DTGSDLTW+QCDAPCT C K P YKP K N+VP
Sbjct: 96 YPNGLYFT-HIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVPL 154
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C + +QCDYEIEY D SS+G L +D L +NGS+ + + F
Sbjct: 155 KDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKLGIMF 214
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFL 187
GC Y+Q S T G+LGL + ++S+ SQL +I NV+GHC+ + G G +FL
Sbjct: 215 GCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFL 274
Query: 188 GDGKVPSSGVAWTPMLQNSADLKH----YILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
GD VP G+AW PML + + H I + L G+ G + ++FD+G+SY Y
Sbjct: 275 GDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSLGRQDGRTE-RVVFDTGSSYTY 333
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN 301
F Y +V+ ++D+ L D TLP+CWR P +++ V ++F+PL L F +
Sbjct: 334 FPKEAYYALVA-SLKDVSDEGLIQDGSDPTLPVCWRAKFPIRSVIDVKQFFQPLTLQFRS 392
Query: 302 RR--NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
+ S + +PPE YL+IS + NVCLGIL+GS G I+G+I ++ K+V+YDN Q
Sbjct: 393 KWWIVSTKFRIPPEGYLIISNKGNVCLGILDGSNVHDGSTIILGDISLRGKLVVYDNVNQ 452
Query: 360 RIGWKPEDC 368
+IGW C
Sbjct: 453 KIGWAQSTC 461
>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 686
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 148/369 (40%), Positives = 211/369 (57%), Gaps = 14/369 (3%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
+P YF ++ VG PP+ + D DTGSDLTW+QCDAPCT C K P YKP K N+VP
Sbjct: 309 YPNGLYFT-HIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVPL 367
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C + +QCDYEIEY D SS+G L +D L +NGS+ + + F
Sbjct: 368 KDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKLGIMF 427
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFL 187
GC Y+Q S T G+LGL + ++S+ SQL +I NV+GHC+ + G G +FL
Sbjct: 428 GCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFL 487
Query: 188 GDGKVPSSGVAWTPMLQNSADLKH----YILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
GD VP G+AW PML + + H I + L G+ G + ++FD+G+SY Y
Sbjct: 488 GDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSLGRQDGRTE-RVVFDTGSSYTY 546
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN 301
F Y +V+ ++D+ L D TLP+CWR P +++ V ++F+PL L F +
Sbjct: 547 FPKEAYYALVA-SLKDVSDEGLIQDGSDPTLPVCWRAKFPIRSVIDVKQFFQPLTLQFRS 605
Query: 302 R--RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
+ S + +PPE YL+IS + NVCLGIL+GS G I+G+I ++ K+V+YDN Q
Sbjct: 606 KWWIVSTKFRIPPEGYLIISNKGNVCLGILDGSNVHDGSTIILGDISLRGKLVVYDNVNQ 665
Query: 360 RIGWKPEDC 368
+IGW C
Sbjct: 666 KIGWAQSTC 674
>gi|37542275|gb|AAK81698.1| aspartyl proteinase [Oryza sativa]
Length = 410
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 151/374 (40%), Positives = 215/374 (57%), Gaps = 20/374 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH-KNIVPC 69
+PI +F V + + P K + D DTGS LTW+QCD PC C K P YKP K V C
Sbjct: 33 YPIGHFF-VTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLYKPELKYAVKC 91
Query: 70 SNPRCAALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
+ RCA L+ P +C P +QC Y I+Y GGSSIG L+ D F L SNG+ +
Sbjct: 92 TEQRCADLYADLRKPMKCG-PKNQCHYGIQY-VGGSSIGVLIVDSFSLPASNGT-NPTSI 148
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLF 186
FGCGYNQ P G+LGLGRG+++++SQL+ G+I ++V+GHCI G+G LF
Sbjct: 149 AFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGFLF 208
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKS--CGLKDLTLIFDSGASYAYF 244
GD KVP+SGV W+PM + + KHY L ++ S + +IFDSGA+Y YF
Sbjct: 209 FGDAKVPTSGVTWSPM---NREHKHYSPRQGTLHFNSNSKPISAAPMEVIFDSGATYTYF 265
Query: 245 TSRVYQEIVSLIMRDLIGT---PLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF 299
+ Y +S++ L ++ D+ L +CW+G + + +V + F+ L+L F
Sbjct: 266 ALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVKKCFRSLSLKF 325
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAE--VGENNIIGEIFMQDKMVIYDNE 357
+ L +PPE YL+IS +VCLGIL+GS+ + N+IG I M D+MVIYD+E
Sbjct: 326 ADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNLIGGITMLDQMVIYDSE 385
Query: 358 KQRIGWKPEDCNTL 371
+ +GW C+ +
Sbjct: 386 RSLLGWVNYQCDRI 399
>gi|37542277|gb|AAK81699.1| aspartyl proteinase [Oryza sativa]
Length = 411
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 150/375 (40%), Positives = 214/375 (57%), Gaps = 21/375 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH-KNIVPC 69
+PI +F V + + P K + D DTGS LTW+QCD PC C K P YKP K V C
Sbjct: 33 YPIGHFF-VTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLYKPELKYAVKC 91
Query: 70 SNPRCAALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
+ RCA L+ P +C P +QC Y I+Y GGSSIG L+ D F L SNG+ +
Sbjct: 92 TEQRCADLYADLRKPMKCG-PKNQCHYGIQY-VGGSSIGVLIVDSFSLPASNGT-NPTSI 148
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLF 186
FGCGYNQ P G+LGLGRG+++++SQL+ G+I ++V+GHCI G+G LF
Sbjct: 149 AFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGFLF 208
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKS---CGLKDLTLIFDSGASYAY 243
GD KVP+SGV W+PM + + KHY L ++ + +IFDSGA+Y Y
Sbjct: 209 FGDAKVPTSGVTWSPM---NREHKHYSPRQGTLHFNSNKQSPISAAPMEVIFDSGATYTY 265
Query: 244 FTSRVYQEIVSLIMRDLIGT---PLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALS 298
F + Y +S++ L ++ D+ L +CW+G + + +V + F+ L+L
Sbjct: 266 FALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVKKCFRSLSLK 325
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAE--VGENNIIGEIFMQDKMVIYDN 356
F + L +PPE YL+IS +VCLGIL+GS+ + N+IG I M D+MVIYD+
Sbjct: 326 FADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNLIGGITMLDQMVIYDS 385
Query: 357 EKQRIGWKPEDCNTL 371
E+ +GW C+ +
Sbjct: 386 ERSLLGWVNYQCDRI 400
>gi|115484513|ref|NP_001065918.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|122221757|sp|Q0IU52.1|ASP1_ORYSJ RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
gi|33340111|gb|AAQ14543.1|AF308691_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|33340113|gb|AAQ14544.1|AF308692_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|62954898|gb|AAY23267.1| nucellin-like protein [Oryza sativa Japonica Group]
gi|77548967|gb|ABA91764.1| Aspartic proteinase Asp1 precursor, putative, expressed [Oryza
sativa Japonica Group]
gi|113644622|dbj|BAF27763.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|215766817|dbj|BAG99045.1| unnamed protein product [Oryza sativa Japonica Group]
gi|385717694|gb|AFI71282.1| aspartic proteinase [Oryza sativa Japonica Group]
Length = 410
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 149/374 (39%), Positives = 215/374 (57%), Gaps = 20/374 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH-KNIVPC 69
+PI +F + + +G P K + D DTGS LTW+QCDAPCT C P YKP K +V C
Sbjct: 33 YPIGHFF-ITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLYKPTPKKLVTC 91
Query: 70 SNPRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
++ C L+ P RC QCDY I+Y D SS+G LV D F L SNG+ +
Sbjct: 92 ADSLCTDLYTDLGKPKRCG-SQKQCDYVIQYVD-SSSMGVLVIDRFSLSASNGT-NPTTI 148
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLF 186
FGCGY+Q P +LGL RG+++++SQL+ G+I ++V+GHCI G G LF
Sbjct: 149 AFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCISSKGGGFLF 208
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD--LTLIFDSGASYAYF 244
GD +VP+SGV WTPM + + K+Y G L + S + + +IFDSGA+Y YF
Sbjct: 209 FGDAQVPTSGVTWTPM---NREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDSGATYTYF 265
Query: 245 TSRVYQEIVSLIMRDLIGT---PLKLAPDDKTLPICWRGPFK--ALGQVTEYFKPLALSF 299
++ YQ +S++ L ++ D+ L +CW+G K + +V + F+ L+L F
Sbjct: 266 AAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGKDKIVTIDEVKKCFRSLSLEF 325
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAE--VGENNIIGEIFMQDKMVIYDNE 357
+ L +PPE YL+IS +VCLGIL+GS+ + N+IG I M D+MVIYD+E
Sbjct: 326 ADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHLSLAGTNLIGGITMLDQMVIYDSE 385
Query: 358 KQRIGWKPEDCNTL 371
+ +GW C+ +
Sbjct: 386 RSLLGWVNYQCDRI 399
>gi|356522749|ref|XP_003530008.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1336
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 157/387 (40%), Positives = 218/387 (56%), Gaps = 25/387 (6%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
+P YF + L VG PPK + D DTGSDLTW+QCDAPC C K QYKP + N+V
Sbjct: 189 YPDGLYFTI-LRVGNPPKSYFLDVDTGSDLTWMQCDAPCRSCGKGAHVQYKPTRSNVVSS 247
Query: 70 SNPRCAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
+ C + N H QCDYEI+Y D SS+G LV D L +NGS + +
Sbjct: 248 VDSLCLDVQ-KNQKNGHHDESLLQCDYEIQYADHSSSLGVLVRDELHLVTTNGSKTKLNV 306
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG--VL 185
FGCGY+Q + T G++GL R ++S+ QL GLI+NV+GHC+ +G G +
Sbjct: 307 VFGCGYDQEGLILNTLAKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHCLSNDGAGGGYM 366
Query: 186 FLGDGKVPSSGVAWTPMLQN-SADLKHYIL-----GPAELLYSGKSCGLKDLTLIFDSGA 239
FLGD VP G+ W PM + DL + G +L + G+S K + FDSG+
Sbjct: 367 FLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLKFDGQS---KVGKVFFDSGS 423
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPF--KALGQVTEYFKPLAL 297
SY YF Y ++V+ + ++ G L D TLPICW+ F +++ V +YFK L L
Sbjct: 424 SYTYFPKEAYLDLVA-SLNEVSGLGLVQDDSDTTLPICWQANFQIRSIKDVKDYFKTLTL 482
Query: 298 SFTNR--RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
F ++ S +PPE YL+IS + +VCLGIL+GS+ G + I+G+I ++ V+YD
Sbjct: 483 RFGSKWWILSTLFQIPPEGYLIISNKGHVCLGILDGSKVNDGSSIILGDISLRGYSVVYD 542
Query: 356 NEKQRIGWKPEDC----NTLLSLNHFI 378
N KQ+IGWK DC + L N+FI
Sbjct: 543 NVKQKIGWKRADCGMPSSRLRKKNNFI 569
>gi|222615640|gb|EEE51772.1| hypothetical protein OsJ_33215 [Oryza sativa Japonica Group]
Length = 775
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 147/369 (39%), Positives = 212/369 (57%), Gaps = 19/369 (5%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH-KNIVPCSNPRC 74
+F + + +G P K + D DTGS LTW+QCDAPCT C P YKP K +V C++ C
Sbjct: 402 HFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLYKPTPKKLVTCADSLC 461
Query: 75 AALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
L+ P RC QCDY I+Y D SS+G LV D F L SNG+ + FGCG
Sbjct: 462 TDLYTDLGKPKRCG-SQKQCDYVIQYVD-SSSMGVLVIDRFSLSASNGT-NPTTIAFGCG 518
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLFLGDGK 191
Y+Q P +LGL RG+++++SQL+ G+I ++V+GHCI G G LF GD +
Sbjct: 519 YDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCISSKGGGFLFFGDAQ 578
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD--LTLIFDSGASYAYFTSRVY 249
VP+SGV WTPM + + K+Y G L + S + + +IFDSGA+Y YF ++ Y
Sbjct: 579 VPTSGVTWTPM---NREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDSGATYTYFAAQPY 635
Query: 250 QEIVSLIMRDLIGT---PLKLAPDDKTLPICWRGPFK--ALGQVTEYFKPLALSFTNRRN 304
Q +S++ L ++ D+ L +CW+G K + +V + F+ L+L F +
Sbjct: 636 QATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGKDKIVTIDEVKKCFRSLSLEFADGDK 695
Query: 305 SVRLVVPPEAYLVISGRKNVCLGILNGSEAE--VGENNIIGEIFMQDKMVIYDNEKQRIG 362
L +PPE YL+IS +VCLGIL+GS+ + N+IG I M D+MVIYD+E+ +G
Sbjct: 696 KATLEIPPEHYLIISQEGHVCLGILDGSKEHLSLAGTNLIGGITMLDQMVIYDSERSLLG 755
Query: 363 WKPEDCNTL 371
W C+ +
Sbjct: 756 WVNYQCDRI 764
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 123/286 (43%), Positives = 176/286 (61%), Gaps = 30/286 (10%)
Query: 91 QCDYEIEYGDGGSSIGALVTDLFPL-RFSNGSVFNVPLTFGCGYNQ---HNPGPLSPPDT 146
QCDYEI+Y DG S+IGAL+ D F L R + N+P FGCGYNQ N SP +
Sbjct: 28 QCDYEIKYADGASTIGALIVDQFSLPRIATRP--NLP--FGCGYNQGIGENFQQTSPVN- 82
Query: 147 AGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQN 205
G+LGL RG++S VSQL+ G+I ++V+GHC+ G G+LF+GDG +L +
Sbjct: 83 -GILGLDRGKVSFVSQLKMLGIITKHVVGHCLSSGGGGLLFVGDGD-------GNLVLLH 134
Query: 206 SADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPL 265
+ +Y G A L + S G+ + ++FDSG++Y YFT++ YQ V I L T L
Sbjct: 135 A---NYYSPGSATLYFDRHSLGMNPMDVVFDSGSTYTYFTAQPYQATVYAIKGGLSSTSL 191
Query: 266 KLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN 323
+ D +LP+CW+G F+++ V + FK L L+F N N+V + +PPE YL+++ N
Sbjct: 192 EQV-SDPSLPLCWKGQKAFESVFDVKKEFKSLQLNFGN--NAV-MEIPPENYLIVTEYGN 247
Query: 324 VCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
VCLGIL+G NIIG+I MQD+MVIYDNE++++GW C+
Sbjct: 248 VCLGILHGCRLNF---NIIGDITMQDQMVIYDNEREQLGWIRGSCD 290
>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
Length = 557
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 151/379 (39%), Positives = 207/379 (54%), Gaps = 19/379 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
FP Y+ ++ VG PP+ + D DTGSDLTW+QCDAPCT C K P YKP K IVP
Sbjct: 182 FPDGQYY-TSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPTKEKIVPP 240
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C L N C+ QCDYEIEY D SS+G L D L +NG + F
Sbjct: 241 RDLLCQELQ-GNQNYCETCK-QCDYEIEYADQSSSMGVLARDDMHLIATNGGREKLDFVF 298
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFL 187
GC Y+Q SP T G+LGL IS+ SQL +G+I N+ GHCI Q G G +FL
Sbjct: 299 GCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCITREQGGGGYMFL 358
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD-----LTLIFDSGASYA 242
GD VP G+ WT + +L H + Y + +++ + +IFDSG+SY
Sbjct: 359 GDDYVPRWGITWTSIRSGPDNLYH--TEAHHVKYGDQQLRMREQAGNTVQVIFDSGSSYT 416
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT 300
Y +Y+ +V+ I G D+TLP+CW+ P + L V ++FKPL L F
Sbjct: 417 YLPDEIYENLVAAIKYASPG--FVQDSSDRTLPLCWKADFPVRYLEDVKQFFKPLNLHFG 474
Query: 301 NRR--NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
+ S + PE YL+IS + NVCLG+LNG+E G I+G++ ++ K+V+YDN++
Sbjct: 475 KKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNQR 534
Query: 359 QRIGWKPEDCNTLLSLNHF 377
++IGW DC S F
Sbjct: 535 RQIGWTNSDCTKPQSQKGF 553
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 150/373 (40%), Positives = 205/373 (54%), Gaps = 23/373 (6%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
FP Y+ ++ VG PP+ + D DTGSDLTW+QCDAPCT C K P YKP K IVP
Sbjct: 189 FPDGQYY-TSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPP 247
Query: 70 SNPRCAALHWPNP--PRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
+ C L CK QCDYEIEY D SS+G L D + +NG +
Sbjct: 248 RDLLCQELQGDQNYCATCK----QCDYEIEYADRSSSMGVLAKDDMHMIATNGGREKLDF 303
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVL 185
FGC Y+Q SP T G+LGL IS+ SQL G+I NV GHCI + NG G +
Sbjct: 304 VFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCITKEPNGGGYM 363
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYI-----LGPAELLYSGKSCGLKDLTLIFDSGAS 240
FLGD VP G+ W P+ +L H G +L G++ + +IFDSG+S
Sbjct: 364 FLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQQLRMHGQAG--SSIQVIFDSGSS 421
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPF--KALGQVTEYFKPLALS 298
Y Y +Y+++V+ I D D TLP+CW+ F + L V ++FKPL L
Sbjct: 422 YTYLPDEIYKKLVTAIKYDY--PSFVQDTSDTTLPLCWKADFDVRYLEDVKQFFKPLNLH 479
Query: 299 FTNRRNSV--RLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
F NR + + P+ YL+IS + NVCLG+LNG+E + I+G++ ++ K+V+YDN
Sbjct: 480 FGNRWFVIPRTFTILPDDYLIISDKGNVCLGLLNGAEIDHASTLIVGDVSLRGKLVVYDN 539
Query: 357 EKQRIGWKPEDCN 369
E+++IGW +C
Sbjct: 540 ERRQIGWADSECT 552
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 151/378 (39%), Positives = 207/378 (54%), Gaps = 17/378 (4%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
FP Y+ ++ +G PP+ + D DTGSDLTW+QCDAPCT C K P YKP K IVP
Sbjct: 182 FPDGQYY-TSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPP 240
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C L N C+ QCDYEIEY D SS+G L D + +NG + F
Sbjct: 241 RDLLCQELQG-NQNYCETCK-QCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLDFVF 298
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFL 187
GC Y+Q SP T G+LGL IS SQL +G+I NV GHCI Q G G +FL
Sbjct: 299 GCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGGYMFL 358
Query: 188 GDGKVPSSGVAWTPMLQNSADL----KHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
GD VP GV WT + +L H++ + L + G + +IFDSG+SY Y
Sbjct: 359 GDDYVPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAG-STVQVIFDSGSSYTY 417
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN 301
+ +Y+ +V+ I G D+TLP+CW+ P + L V ++F+PL L F
Sbjct: 418 LPNEIYENLVAAIKYASPG--FVQDTSDRTLPLCWKADFPVRYLEDVKQFFEPLNLHFGK 475
Query: 302 R--RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
+ S + PE YL+IS + NVCLG+LNG+E G I+G++ ++ K+V+YDN+++
Sbjct: 476 KWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNQRK 535
Query: 360 RIGWKPEDCNTLLSLNHF 377
+IGW DC S F
Sbjct: 536 QIGWADSDCTKPQSQKGF 553
>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
Length = 603
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 150/393 (38%), Positives = 204/393 (51%), Gaps = 46/393 (11%)
Query: 20 NLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPRCAALH 78
NL PP+ + DFDTGSDLTW+QCDAPCT C K YKP + NIVP + C +
Sbjct: 193 NLYPDGPPQPYYLDFDTGSDLTWIQCDAPCTSCAKGANAWYKPRRGNIVPPKDLLCMEVQ 252
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNP 138
DQCDYEIEY D SS+G L TD L +NGS+ + FGC Y+Q
Sbjct: 253 RNQKAGYCETCDQCDYEIEYADHSSSMGVLATDKLLLMVANGSLTKLNFIFGCAYDQQGL 312
Query: 139 GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFLGDGKVPSSG 196
+ T G+LGL R ++S+ SQL G+I NVIGHC+ + G G +FLGD VP G
Sbjct: 313 LLKTLVKTDGILGLSRAKVSLPSQLASQGIINNVIGHCLTTDLGGGGYMFLGDDFVPRWG 372
Query: 197 VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-----LIFDSGASYAYFTSRVYQE 251
+AW PML +S ++ Y +L Y L + ++FDSG+SY YF Y E
Sbjct: 373 MAWVPML-DSPSMEFYHTEVVKLNYGSSPLSLGGMESRVKHILFDSGSSYTYFPKEAYSE 431
Query: 252 IVSLIMRDLIGTPLKLAPDDKTLPICWRGPF----------------------------- 282
+V+ + ++ G L + D TLP+CWR F
Sbjct: 432 LVA-SLNEVSGAGLVQSTSDTTLPLCWRANFPIRKFIYRTELTRPIRRRRRRRRRRRRRR 490
Query: 283 -----KALGQVTEYFKPLALSFTNR--RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAE 335
G V ++FK L F + S + +PPE YL++S + NVCLGIL GS+
Sbjct: 491 RRRRQHIKGDVKKFFKTLTFQFGTKWLVISTKFRIPPEGYLMMSDKGNVCLGILEGSKVH 550
Query: 336 VGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
G I+G+I ++ ++V+YDN ++IGW P DC
Sbjct: 551 DGSTIILGDISLRGQLVVYDNVNKKIGWTPSDC 583
>gi|218185383|gb|EEC67810.1| hypothetical protein OsI_35379 [Oryza sativa Indica Group]
Length = 423
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 152/387 (39%), Positives = 216/387 (55%), Gaps = 33/387 (8%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-------------PE 57
+PI +F V + +G P K + D DTGS LTW+QCD PC C K P
Sbjct: 33 YPIGHFF-VTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKAHSLFYPRLIGSFVPH 91
Query: 58 KQYKPH-KNIVPCSNPRCAALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFP 114
YKP K V C+ RCA L+ P +C P +QC Y I+Y GGSSIG L+ D F
Sbjct: 92 GLYKPELKYAVKCTEQRCADLYADLRKPMKCG-PKNQCHYGIQY-VGGSSIGVLIVDSFS 149
Query: 115 LRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVI 173
L SNG+ + FGCGYNQ P G+LGLGRG+++++SQL+ G+I ++V+
Sbjct: 150 LPASNGT-NPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVL 208
Query: 174 GHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYI--LGPAELLYSGKSCGLKDL 231
GHCI G+G LF GD KVP+SGV W+PM + + KHY G + + K +
Sbjct: 209 GHCISSKGKGFLFFGDAKVPTSGVTWSPM---NREHKHYSPRQGTLQFNSNSKPISAAPM 265
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGT---PLKLAPDDKTLPICWRG--PFKALG 286
+IFDSGA+Y YF + Y +S++ L ++ D+ L +CW+G + +
Sbjct: 266 EVIFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTID 325
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAE--VGENNIIGE 344
+V + F+ L+L F + L +PPE YL+IS +VCLGIL+GS+ + N+IG
Sbjct: 326 EVKKCFRSLSLKFADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNLIGG 385
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCNTL 371
I M D+MVIYD+E+ +GW C+ +
Sbjct: 386 ITMLDQMVIYDSERSLLGWVNYQCDRI 412
>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1388
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 152/373 (40%), Positives = 210/373 (56%), Gaps = 21/373 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
+P YF + L VG PPK + D DTGSDLTW+QCDAPC C K YKP + N+V
Sbjct: 187 YPDGLYFTI-LRVGNPPKSYFLDVDTGSDLTWMQCDAPCISCGKGAHVLYKPTRSNVVSS 245
Query: 70 SNPRCAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
+ C + N H QCDYEI+Y D SS+G LV D L +NGS + +
Sbjct: 246 VDALCLDVQ-KNQKNGHHDESLLQCDYEIQYADHSSSLGVLVRDELHLVTTNGSKTKLNV 304
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG--VL 185
FGCGY+Q + T G++GL R ++S+ QL GLI+NV+GHC+ +G G +
Sbjct: 305 VFGCGYDQAGLLLNTLGKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHCLSNDGAGGGYM 364
Query: 186 FLGDGKVPSSGVAWTPMLQN-SADLKHYIL-----GPAELLYSGKSCGLKDLTLIFDSGA 239
FLGD VP G+ W PM + DL + G +L + G+S K ++FDSG+
Sbjct: 365 FLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLRFDGQS---KVGKMVFDSGS 421
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLAL 297
SY YF Y ++V+ + ++ G L D TLPICW+ P K++ V +YFK L L
Sbjct: 422 SYTYFPKEAYLDLVA-SLNEVSGLGLVQDDSDTTLPICWQANFPIKSVKDVKDYFKTLTL 480
Query: 298 SFTNR--RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
F ++ S + PE YL+IS + +VCLGIL+GS G + I+G+I ++ V+YD
Sbjct: 481 RFGSKWWILSTLFQISPEGYLIISNKGHVCLGILDGSNVNDGSSIILGDISLRGYSVVYD 540
Query: 356 NEKQRIGWKPEDC 368
N KQ+IGWK DC
Sbjct: 541 NVKQKIGWKRADC 553
>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 570
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 146/369 (39%), Positives = 205/369 (55%), Gaps = 16/369 (4%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPRCAALHW 79
+ VG+PP+ + D DTGSDLTWVQCDAPC+ C K YKP + N+V + C +
Sbjct: 203 IMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGKGRSPLYKPRRENVVSFKDSLCMEVQR 262
Query: 80 PNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPG 139
QC+YE++Y D SS+G LV D F LRFSNGS+ + FGC Y+Q
Sbjct: 263 NYDGDQCAACQQCNYEVQYADQSSSLGVLVKDEFTLRFSNGSLTKLNAIFGCAYDQQGLL 322
Query: 140 PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFLGDGKVPSSGV 197
+ T G+LGL R ++S+ SQL G+I NV+GHC+ + G G LFLGD VP G+
Sbjct: 323 LNTLSKTDGILGLSRAKVSLPSQLASRGIINNVVGHCLTGDPAGGGYLFLGDDFVPQWGM 382
Query: 198 AWTPMLQNSADLKHYILGPAELLY-----SGKSCGLKDLTLIFDSGASYAYFTSRVYQEI 252
AW ML +S + Y + Y S + G ++FDSG+SY YFT Y ++
Sbjct: 383 AWVAML-DSPSIDFYQTKVVRIDYGSIPLSLDTWGSSREQVVFDSGSSYTYFTKEAYYQL 441
Query: 253 VSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNR--RNSVRL 308
V+ + + L D + ICW+ +++ V +FKPL L F +R S +L
Sbjct: 442 VANLEE---VSAFGLILQDSSDTICWKTEQSIRSVKDVKHFFKPLTLQFGSRFWLVSTKL 498
Query: 309 VVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
V+ PE YL+I+ NVCLGIL+GS+ G I+G+ ++ K+V+YDN QRIGW DC
Sbjct: 499 VILPENYLLINKEGNVCLGILDGSQVHDGSTIILGDNALRGKLVVYDNVNQRIGWTSSDC 558
Query: 369 NTLLSLNHF 377
+ + H
Sbjct: 559 HNPRKIKHL 567
>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
Length = 354
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 156/361 (43%), Positives = 197/361 (54%), Gaps = 58/361 (16%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCS 70
FP+ Y++V L +G PPK F+FD DTGSDLTWVQCDAPCTGCT PP +QYKP N VPC
Sbjct: 49 FPL-GYYSVLLQIGTPPKAFEFDIDTGSDLTWVQCDAPCTGCTLPPIRQYKPKGNTVPCL 107
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+P C ALH+PN P+C +P +QCDYE+ Y D GSS+GALV D FPL+ NGS L FG
Sbjct: 108 DPICLALHFPNKPQCPNPKEQCDYEVNYADQGSSMGALVIDQFPLKLLNGSAMQPRLAFG 167
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CGY+Q P PP TAG VL LG G
Sbjct: 168 CGYDQILPKAHPPPATAG-----------------------------------VLGLGRG 192
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
K+ GV P L +A L ++G G D TLI G ++ S Y
Sbjct: 193 KI---GVL--PQLV-AAGLTRNVVGHCLSSKGGGYLFFGD-TLIPTLGVAWTPLLSPEYT 245
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVV 310
I RD + D T FK++ + +FK + ++FTN R +L +
Sbjct: 246 FFFH-ICRDRLQ-------RDYTF-------FKSVLEFKNFFKTITINFTNARRITQLQI 290
Query: 311 PPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
PPE+YL+IS N CLG+LNGSE + +N+IG+I MQ MVIYDNEKQ++GW +CN
Sbjct: 291 PPESYLIISKTGNACLGLLNGSEVGLQNSNVIGDISMQGLMVIYDNEKQQLGWVSSNCNK 350
Query: 371 L 371
L
Sbjct: 351 L 351
>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
Length = 383
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 135/309 (43%), Positives = 197/309 (63%), Gaps = 11/309 (3%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V + +G PPK + D D+GSDLTW+QCDAPC C + P Y+P K+ +VPC + CA
Sbjct: 66 YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKSKLVPCVHRLCA 125
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
+LH RC P++QCDY I+Y D GSS G L+ D F LR +NGSV + FGCGY
Sbjct: 126 SLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTNGSVARPSVAFGCGY 185
Query: 134 NQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
+Q G LS P T GVLGLG G +S++SQL++ G+ +NV+GHC+ G G LF GD V
Sbjct: 186 DQQVRSGDLSSP-TDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLSLRGGGFLFFGDDLV 244
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEI 252
P WTPM + SA +Y G A L + +S G++ ++FDSG+S+ YF ++ YQ +
Sbjct: 245 PYQRATWTPMAR-SAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGSSFTYFAAKPYQAL 303
Query: 253 VSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVV 310
V+ ++D + L+ P D +LP+CW+G PFK++ V + FK L L+F + + ++ + +
Sbjct: 304 VT-ALKDGLSRTLEEEP-DTSLPLCWKGQEPFKSVLDVRKEFKSLVLNFASGKKTL-MEI 360
Query: 311 PPEAYLVIS 319
PPE YL+++
Sbjct: 361 PPENYLIVT 369
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 150/378 (39%), Positives = 206/378 (54%), Gaps = 17/378 (4%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
FP Y+ ++ +G PP+ + D DTGSDLTW+QCDAPCT K P YKP K IVP
Sbjct: 182 FPDGQYY-TSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNFAKGPHPLYKPAKEKIVPP 240
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C L N C+ QCDYEIEY D SS+G L D + +NG + F
Sbjct: 241 RDLLCQELQG-NQNYCETCK-QCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLDFVF 298
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFL 187
GC Y+Q SP T G+LGL IS SQL +G+I NV GHCI Q G G +FL
Sbjct: 299 GCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGGYMFL 358
Query: 188 GDGKVPSSGVAWTPMLQNSADL----KHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
GD VP GV WT + +L H++ + L + G + +IFDSG+SY Y
Sbjct: 359 GDDYVPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAG-STVQVIFDSGSSYTY 417
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN 301
+ +Y+ +V+ I G D+TLP+CW+ P + L V ++F+PL L F
Sbjct: 418 LPNEIYENLVAAIKYASPG--FVQDTSDRTLPLCWKADFPVRYLEDVKQFFEPLNLHFGK 475
Query: 302 R--RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
+ S + PE YL+IS + NVCLG+LNG+E G I+G++ ++ K+V+YDN+++
Sbjct: 476 KWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNQRK 535
Query: 360 RIGWKPEDCNTLLSLNHF 377
+IGW DC S F
Sbjct: 536 QIGWADSDCTKPQSQKGF 553
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 152/378 (40%), Positives = 201/378 (53%), Gaps = 27/378 (7%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
FP Y+ ++ VG PP+ + D DTGSDLTW+QCDAPCT C K P YKP K IVP
Sbjct: 186 FPDGQYY-TSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPP 244
Query: 70 SNPRCAALHWPNP--PRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
+ C L CK QCDYEIEY D SS+G L D L +NG +
Sbjct: 245 RDSLCQELQGDQNYCETCK----QCDYEIEYADRSSSMGVLAKDDMHLIATNGGREKLDF 300
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVL 185
FGC Y+Q SP T G+LGL IS+ SQL G+I NV GHCI + NG G +
Sbjct: 301 VFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRETNGGGYM 360
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPA----ELLYSGKSCGLKDLTLIFDSGASY 241
FLGD VP G+ W P+ +L H + L++G S + +IFDSG+SY
Sbjct: 361 FLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGNS-----VQVIFDSGSSY 415
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
Y +Y+ ++ I D D TLP+CW+ F V +FKPL L F
Sbjct: 416 TYLPEEMYKNLIDAIKED--SPSFVQDSSDTTLPLCWKADF----SVRSFFKPLNLHFGR 469
Query: 302 RRNSV--RLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
R V + P+ YL+IS + NVCLG+LNG+E G I+G++ ++ K+V+YDNE++
Sbjct: 470 RWFVVPKTFTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNERR 529
Query: 360 RIGWKPEDCNTLLSLNHF 377
+IGW +C S F
Sbjct: 530 QIGWANSECTKPQSQKGF 547
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 136/311 (43%), Positives = 195/311 (62%), Gaps = 15/311 (4%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRC 74
++ V + +G P K + D DTGSDLTW+QCDAPC C K P Y+P N +VPC+N C
Sbjct: 53 HYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANSLVPCANALC 112
Query: 75 AALHWPNPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLF--PLRFSNGSVFNVPLTFGC 131
ALH + K P+ QCDY+I+Y D SS G L+ D F P+R SN LTFGC
Sbjct: 113 TALHSGHGSNNKCPSPKQCDYQIKYTDSASSQGVLINDNFSLPMRSSN---IRPGLTFGC 169
Query: 132 GYNQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
GY+Q T G+LGLGRG +S+VSQL++ G+ +NV+GHC+ NG G LF GD
Sbjct: 170 GYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCLSTNGGGFLFFGDD 229
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
VP+S V W PM + S + +Y G L + +S G+K + ++FDSG++Y YFT++ YQ
Sbjct: 230 IVPTSRVTWVPMAKISGN--YYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTYFTAQPYQ 287
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFKPLALSFTNRRNSVRL 308
+VS + L + +++ D +LP+CW+GP FK++ V + FK L LSF + +N+V +
Sbjct: 288 AVVSALKSGLSKSLKQVS--DPSLPLCWKGPKAFKSVFDVKKEFKSLFLSFASAKNAV-M 344
Query: 309 VVPPEAYLVIS 319
+PPE YL+++
Sbjct: 345 EIPPENYLIVT 355
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 148/369 (40%), Positives = 207/369 (56%), Gaps = 24/369 (6%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA---PCTGCTKPPEKQYKPHKNIVPCSNP 72
+F V + +G+P K + D DTGS+LTW++C A PC C K P Y+P K +VPC++P
Sbjct: 39 HFYVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTCNKVPHPLYRP-KKLVPCADP 97
Query: 73 RCAALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C ALH C+ DQC Y+I Y DG +S+G L+ D F L GS N+ FG
Sbjct: 98 LCDALHKDLGTTKDCREEPDQCHYQINYADGTTSLGVLLLDKFSL--PTGSARNI--AFG 153
Query: 131 CGYNQHNPGPLSPPDTA---GVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLF 186
CGY+Q P+ G+LGLGRG + +VSQL+ G + +NVIGHC+ G G LF
Sbjct: 154 CGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVSKNVIGHCLSSKGGGYLF 213
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTS 246
+G+ VPSS + + S + HY G A L G K IFDSG++Y Y
Sbjct: 214 IGEENVPSSHLHIIYIYCISREPNHYSPGQATLHLGRNPIGTKPFKAIFDSGSTYTYLPE 273
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPD-DKTLPICWRG--PFKALGQVTEYFKPL-ALSFTNR 302
++ ++VS + LI + LKL D D L +CW+G PFK + + + FK L L F
Sbjct: 274 NLHAQLVSALKASLIKSSLKLVSDTDTRLHLCWKGPKPFKTVHDLPKEFKSLVTLKFD-- 331
Query: 303 RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIG 362
+ V + +PPE YL+I+G N C GIL E + +IG I MQ+++VI+DNEK R+
Sbjct: 332 -HGVTMTIPPENYLIITGHGNACFGIL---ELPGYDLFVIGGISMQEQLVIHDNEKGRLA 387
Query: 363 WKPEDCNTL 371
W P C+ +
Sbjct: 388 WMPSPCDKM 396
>gi|388518245|gb|AFK47184.1| unknown [Lotus japonicus]
Length = 245
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 120/232 (51%), Positives = 167/232 (71%), Gaps = 6/232 (2%)
Query: 148 GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSA 207
G+LGLGRG+ S+VSQL GL+RNV+GHC+ G G +F GD SS + WTPM +S
Sbjct: 14 GMLGLGRGKSSLVSQLNSQGLVRNVVGHCLSAQGGGYIFFGD-VYDSSRLTWTPM--SSR 70
Query: 208 DLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKL 267
DLKHY+ G AEL++ GK G+ L +FD+G+SY YF S YQ ++S + ++L G PLK
Sbjct: 71 DLKHYVAGAAELIFGGKKTGIGGLLPVFDTGSSYTYFNSNAYQAVISWLKKELAGKPLKE 130
Query: 268 APDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNR-RNSVRLVVPPEAYLVISGRKNV 324
APDD+TLP+CW G PF+++ +V +YFK +ALSFT+ R + + +PPEAYL++S NV
Sbjct: 131 APDDQTLPLCWHGKRPFRSVYEVRKYFKSMALSFTSSGRTNTQFEIPPEAYLIVSNMGNV 190
Query: 325 CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTLLSLNH 376
CLGIL+GSE +G+ N+IG+I M DK++++DNEK+ IGW P DCN + + H
Sbjct: 191 CLGILDGSEVGMGDLNLIGDISMLDKVMVFDNEKRLIGWAPADCNRVPNSRH 242
>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 395
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 141/364 (38%), Positives = 194/364 (53%), Gaps = 18/364 (4%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPRCA 75
+ ++ +G PP+ + D DTGSD TW+ CDAPCT CTK P YKP + IV +P C
Sbjct: 16 YYTSINIGNPPRPYFLDIDTGSDFTWIHCDAPCTNCTKGPHPVYKPTEGKIVHPRDPLCE 75
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
L N C+ QCDYEI Y D SS G L D L ++G + NV FGC +NQ
Sbjct: 76 ELQG-NQNYCETCK-QCDYEITYADRSSSKGVLARDNMQLTTADGEMKNVDFVFGCAHNQ 133
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFLGDGKVP 193
SP T G+LGL G IS+ +QL G+I NV GHC+ + G +FLGD VP
Sbjct: 134 QGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMATDPSSGGYMFLGDDYVP 193
Query: 194 SSGVAWTPMLQN-----SADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRV 248
G+ W P+ S ++ G EL G++ L +IFDSG+SY YF +
Sbjct: 194 RWGMTWVPIRNGPGNVYSTEVPKVNYGAQELNLRGQAGKLTQ--VIFDSGSSYTYFPHEI 251
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSV 306
Y +++L+ G D+TLP C + P +++G V + F PL L R +
Sbjct: 252 YTNLIALLEDASPG--FVRDESDQTLPFCMKPNVPVRSVGDVEQLFNPLILQLRKRWFVI 309
Query: 307 --RLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
+ PE YL+IS + NVCLG+L+G+E IIG+ ++ K V+YDN++ RIGW
Sbjct: 310 PTTFAISPENYLIISDKGNVCLGVLDGTEIGHSSTIIIGDASLRGKFVVYDNDENRIGWV 369
Query: 365 PEDC 368
DC
Sbjct: 370 QSDC 373
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 147/370 (39%), Positives = 198/370 (53%), Gaps = 19/370 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
FP Y+ ++ VG PP+ + D DTGSDLTW+QCDAPCT C K P YKP K IVP
Sbjct: 198 FPDGQYY-TSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPP 256
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C L N C+ QCDYEIEY D SS+G L D + +NG + F
Sbjct: 257 KDLLCQELQG-NQNYCETCK-QCDYEIEYADRSSSMGVLARDDMHIITTNGGREKLDFVF 314
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFL 187
GC Y+Q SP T G+LGL IS+ SQL G+I NV GHCI + NG G +FL
Sbjct: 315 GCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDPNGGGYMFL 374
Query: 188 GDGKVPSSGVAWTPMLQNSADLKH-----YILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
GD VP G+ TP+ +L H G +L G S + +IFDSG+SY
Sbjct: 375 GDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASG--NSVQVIFDSGSSYT 432
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT 300
Y +Y+ +++ I D+TLP+C P + L V + FKPL L F
Sbjct: 433 YLPDEIYKNLIAAIKYAYPN--FVQDSSDRTLPLCLATDFPVRYLEDVKQLFKPLNLHFG 490
Query: 301 NRRNSV--RLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
R + + P+ YL+IS + NVCLG LNG + + G I+G+ ++ K+V+YDN++
Sbjct: 491 KRWFVMPRTFTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVGDNALRGKLVVYDNQQ 550
Query: 359 QRIGWKPEDC 368
++IGW DC
Sbjct: 551 RQIGWTNSDC 560
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 147/370 (39%), Positives = 198/370 (53%), Gaps = 19/370 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
FP Y+ ++ VG PP+ + D DTGSDLTW+QCDAPCT C K P YKP K IVP
Sbjct: 199 FPDGQYY-TSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPP 257
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C L N C+ QCDYEIEY D SS+G L D + +NG + F
Sbjct: 258 KDLLCQELQG-NQNYCETCK-QCDYEIEYADRSSSMGVLARDDMHIITTNGGREKLDFVF 315
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFL 187
GC Y+Q SP T G+LGL IS+ SQL G+I NV GHCI + NG G +FL
Sbjct: 316 GCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDPNGGGYMFL 375
Query: 188 GDGKVPSSGVAWTPMLQNSADLKH-----YILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
GD VP G+ TP+ +L H G +L G S + +IFDSG+SY
Sbjct: 376 GDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASG--NSVQVIFDSGSSYT 433
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT 300
Y +Y+ +++ I D+TLP+C P + L V + FKPL L F
Sbjct: 434 YLPDEIYKNLIAAIKYAYPN--FVQDSSDRTLPLCLATDFPVRYLEDVKQLFKPLNLHFG 491
Query: 301 NRRNSV--RLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
R + + P+ YL+IS + NVCLG LNG + + G I+G+ ++ K+V+YDN++
Sbjct: 492 KRWFVMPRTFTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVGDNALRGKLVVYDNQQ 551
Query: 359 QRIGWKPEDC 368
++IGW DC
Sbjct: 552 RQIGWTNSDC 561
>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
Length = 538
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 139/370 (37%), Positives = 201/370 (54%), Gaps = 19/370 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
FP Y+ ++ +G PP+ + D DTGSDLTW+QCDAPCT C K P YKP K N+VP
Sbjct: 154 FPDGQYY-TSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNVVPP 212
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C L + QCDYEI Y D SS+G L D L ++G N+ F
Sbjct: 213 RDSYCQELQGNQ--NYGDTSKQCDYEITYADRSSSMGILARDNMQLITADGERENLDFVF 270
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFL 187
GCGY+Q SP +T G+LGL IS+ +QL G+I NV GHCI + G +FL
Sbjct: 271 GCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPSNGGYMFL 330
Query: 188 GDGKVPSSGVAWTPMLQN-----SADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
GD VP G+ W P+ S +++ G +L K+ L +IFDSG+SY
Sbjct: 331 GDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLT--QVIFDSGSSYT 388
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT 300
Y Y +++ + + D+TLP C + P +++ V FKPL+L F
Sbjct: 389 YLPHDDYTNLIASLKSLSPSLLQDES--DRTLPFCMKPNFPVRSMDDVKHLFKPLSLVFK 446
Query: 301 NRRNSV--RLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
R + V+PPE YL+IS + N+CLG+L+G+E +IG++ ++ K+V+Y+N++
Sbjct: 447 KRLFILPRTFVIPPEDYLIISDKNNICLGVLDGTEIGHDSAIVIGDVSLRGKLVVYNNDE 506
Query: 359 QRIGWKPEDC 368
++IGW DC
Sbjct: 507 KQIGWVQSDC 516
>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 538
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 139/370 (37%), Positives = 201/370 (54%), Gaps = 19/370 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
FP Y+ ++ +G PP+ + D DTGSDLTW+QCDAPCT C K P YKP K N+VP
Sbjct: 154 FPDGQYY-TSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNVVPP 212
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C L + QCDYEI Y D SS+G L D L ++G N+ F
Sbjct: 213 RDSYCQELQGNQ--NYGDTSKQCDYEITYADRSSSMGILARDNMQLITADGERENLDFVF 270
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFL 187
GCGY+Q SP +T G+LGL IS+ +QL G+I NV GHCI + G +FL
Sbjct: 271 GCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPSNGGYMFL 330
Query: 188 GDGKVPSSGVAWTPMLQN-----SADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
GD VP G+ W P+ S +++ G +L K+ L +IFDSG+SY
Sbjct: 331 GDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLT--QVIFDSGSSYT 388
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT 300
Y Y +++ + + D+TLP C + P +++ V FKPL+L F
Sbjct: 389 YLPHDDYTNLIASLKSLSPSLLQDES--DRTLPFCMKPNFPVRSMDDVKHLFKPLSLVFK 446
Query: 301 NRRNSV--RLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
R + V+PPE YL+IS + N+CLG+L+G+E +IG++ ++ K+V+Y+N++
Sbjct: 447 KRLFILPRTFVIPPEDYLIISDKNNICLGVLDGTEIGHDSAIVIGDVSLRGKLVVYNNDE 506
Query: 359 QRIGWKPEDC 368
++IGW DC
Sbjct: 507 KQIGWVQSDC 516
>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
Length = 408
Score = 237 bits (604), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 146/371 (39%), Positives = 205/371 (55%), Gaps = 26/371 (7%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQC---DAPCTGCTKPPEKQYK-PHKNIVPCSN 71
+F V + +G+P + + D DTGS TW++C D PC C K P Y+ K +VPC++
Sbjct: 38 HFYVTMNIGEPAEPYFLDIDTGSSFTWLECHAKDGPCKTCNKVPHPLYRLTRKKLVPCAD 97
Query: 72 PRCAALH--WPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
P C ALH +C +QCDY+++Y DG SS+G L+ D F L G N+
Sbjct: 98 PLCDALHKDLGTTKKCTDVRKNQCDYKVKYQDGLSSLGVLLLDKFSL--PTGGARNI--A 153
Query: 129 FGCGYNQHNPGPLSPPDTA---GVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGV 184
FGCGY+Q P+ G+LGLGRG + + SQL+ G + +NVIGHC+ G G
Sbjct: 154 FGCGYDQMKGSKKKAPEKVPVDGILGLGRGSVDLASQLKHSGAVSKNVIGHCLSSKGGGY 213
Query: 185 LFLGDGKVPSSGVAWTPMLQNS-ADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
LF+G+ VPSS V W PM + + HY G A L G K L IFDSG++Y Y
Sbjct: 214 LFIGEENVPSSHVTWVPMAPTTPGEPNHYSPGQATLHLDSNPIGTKPLKAIFDSGSTYTY 273
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPL-ALSFT 300
++ ++VS + L + LK D LP+CW+G PFK + + FK L L F
Sbjct: 274 LPENLHAQLVSALKASLSKSSLKQV-SDPALPLCWKGPKPFKTVHDTPKEFKSLVTLKFD 332
Query: 301 NRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
V +++PPE YL+I+G N C GIL+ + IIG+I MQ+++VIYDNEK R
Sbjct: 333 ---LGVTMIIPPENYLIITGHGNACFGILDMPGL---DQYIIGDITMQEQLVIYDNEKGR 386
Query: 361 IGWKPEDCNTL 371
+ W P C+ +
Sbjct: 387 LAWMPSPCDKI 397
>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 139/368 (37%), Positives = 198/368 (53%), Gaps = 26/368 (7%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPRCA 75
+ ++ +G P + + D DTGS LTW+QCDAPCT CTK P YKP K NIVP + C
Sbjct: 129 YYTSINIGNPARPYFLDVDTGSALTWIQCDAPCTNCTKGPHPLYKPAKENIVPPRDSHCQ 188
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
L N C QCDYEI Y D SS G L D L ++G N+ L FGC ++Q
Sbjct: 189 ELQG-NQNYCDTCK-QCDYEIAYADRSSSAGVLARDNMELITADGERENMDLVFGCAHDQ 246
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFLGDGKVP 193
SP + G+LGL G +S+ +QL + G+I NV GHCI + G +FLGD VP
Sbjct: 247 QGKLLGSPASSDGILGLSNGAMSLPTQLAKQGIISNVFGHCIATDPSGSAYMFLGDDYVP 306
Query: 194 SSGVAWTPMLQNSADLKHYIL-----GPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRV 248
G+ W P+ D+ ++ G EL ++ L +IFDSG+SY YF +
Sbjct: 307 RWGMTWVPVRNGPEDVYSTVVQKVNYGCQELNVREQAGKLTQ--VIFDSGSSYTYFPHEI 364
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSV 306
Y +++ + + + D+TLP C + P +++ V + KPL L F+
Sbjct: 365 YTSLITSL--EAVSPGFVRDESDQTLPFCMKPNFPVRSVDDVKQLHKPLLLHFSK----T 418
Query: 307 RLVVP------PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
LV+P PE YL+ISG+ NVCLG+L+G+E +IG++ ++ K+V YDN+ +
Sbjct: 419 WLVIPRTFEISPENYLIISGKGNVCLGVLDGTEIGHSSTIVIGDVSLRGKLVAYDNDANQ 478
Query: 361 IGWKPEDC 368
IGW DC
Sbjct: 479 IGWAQSDC 486
>gi|357152725|ref|XP_003576216.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like,
partial [Brachypodium distachyon]
Length = 354
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 136/359 (37%), Positives = 196/359 (54%), Gaps = 48/359 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
+ V +++G+ K + D DTGS LTW++ + ++K
Sbjct: 35 HIYVTMSIGEQEKPYFLDIDTGSTLTWLE------------DVRFKHD------------ 70
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
CK +QCDY++ Y G SS+G L+ D F L G LTFGCGY+Q
Sbjct: 71 ---------CKENPNQCDYDVRYAGGESSLGVLIADKFSL---PGRDARPTLTFGCGYDQ 118
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLFLGDGKVPS 194
P D GVLG+GRG + SQL++ G I NVIGHC+ G G LF G KVPS
Sbjct: 119 EGGKAEMPVD--GVLGIGRGTRDLASQLKQQGAIAENVIGHCLRIQGGGYLFFGHEKVPS 176
Query: 195 SGVAWTPMLQNSADLKHYILGPAELLYSGK---SCGLKDLTLIFDSGASYAYFTSRVYQE 251
S V W PM+ N+ +Y G A L ++G + + ++ DSG++Y Y + Y+
Sbjct: 177 SVVTWVPMVPNN---HYYSPGLAALHFNGNLGNPISVAPMEVVIDSGSTYTYMPTETYRR 233
Query: 252 IVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLV 309
+V +++ L + L L D LP+CW G PFK +G V + FKPL L+F + +
Sbjct: 234 LVFVVIASLSKSSLTLV-RDPALPVCWAGKEPFKXIGDVKDKFKPLELAFIQGTSQAIME 292
Query: 310 VPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+PPE YL+ISG NVC+GIL+G++A + + N+IG+I MQ+++VIYDNE+ RIGW C
Sbjct: 293 IPPENYLIISGEGNVCMGILDGTQAGLRKLNVIGDISMQNQLVIYDNERARIGWVRAPC 351
>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 140/390 (35%), Positives = 209/390 (53%), Gaps = 28/390 (7%)
Query: 6 IEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT----KPPEKQYK 61
+E +P+ ++A L +G+P K + D DTGS+LTW++C P GC +PP Y
Sbjct: 28 LEGNVYPVGHFYAT-LNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYYT 86
Query: 62 PHKN--IVPCSNPRCAALHW--PNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPL 115
P V C +P C A+ P P C ND +C YEI+Y G S G L TD+ +
Sbjct: 87 PADGNLKVVCGSPLCVAVRRDVPGIPECSR-NDPHRCHYEIQYVTGKSE-GDLATDIISV 144
Query: 116 RFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIG 174
+ + FGCGY Q P P G+LGLG G+ + +QL+ + +I+ NVIG
Sbjct: 145 NGRDKKR----IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIG 200
Query: 175 HCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTL 233
HC+ G+GVL++GD P+ GV W PM ++ L +Y G AE+ + G
Sbjct: 201 HCLSSKGKGVLYVGDFNPPTRGVTWAPMRES---LFYYSPGLAEVFIDKQPIRGNPTFEA 257
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEY 291
+FDSG++Y + +++Y EIVS + L + L+ + LP+CW+G PF ++ V
Sbjct: 258 VFDSGSTYTHVPAQIYNEIVSKVRVTLSESSLEEV-KGRALPLCWKGKKPFGSVNDVKNQ 316
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENN--IIGEIFMQ 348
FK L+L T+ R + L +PP+ YL + CL IL+ S + + E N +IG + MQ
Sbjct: 317 FKALSLKITHARGTSNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFILIGAVTMQ 376
Query: 349 DKMVIYDNEKQRIGWKPEDCNTLLSLNHFI 378
D VIYDNEK+++GW C+ + L I
Sbjct: 377 DLFVIYDNEKKQLGWVRAQCDRVQELESVI 406
>gi|2290202|gb|AAB96882.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|2290204|gb|AAB96883.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|45357050|gb|AAS58479.1| nucellin [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 140/390 (35%), Positives = 208/390 (53%), Gaps = 28/390 (7%)
Query: 6 IEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT----KPPEKQYK 61
+E +P+ ++A L +G+P K + D DTGS+LTW++C P GC +PP Y
Sbjct: 28 LEGNVYPVGHFYAT-LNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYYT 86
Query: 62 PHKN--IVPCSNPRCAALHW--PNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPL 115
P V C +P C A+ P P C ND +C YEI+Y G S G L TD+ +
Sbjct: 87 PADGNLKVVCGSPLCVAVRRDVPGIPECSR-NDPHRCHYEIQYVTGKSE-GDLATDIISV 144
Query: 116 RFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIG 174
+ + FGCGY Q P P G+LGLG G+ +QL+ + +I+ NVIG
Sbjct: 145 NGRDKKR----IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVIG 200
Query: 175 HCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTL 233
HC+ G+GVL++GD P+ GV W PM ++ L +Y G AE+ + G
Sbjct: 201 HCLSSKGKGVLYVGDFNPPTRGVTWAPMRES---LFYYSPGLAEVFIDKQPIRGNPTFEA 257
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEY 291
+FDSG++Y + +++Y EIVS + L + L+ + LP+CW+G PF ++ V
Sbjct: 258 VFDSGSTYTHVPAQIYNEIVSKVRGTLSESSLEEV-KGRALPLCWKGKKPFGSVNDVKNQ 316
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENN--IIGEIFMQ 348
FK L+L T+ R + L +PP+ YL + CL IL+ S + + E N +IG + MQ
Sbjct: 317 FKALSLKITHARGTNNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFILIGAVTMQ 376
Query: 349 DKMVIYDNEKQRIGWKPEDCNTLLSLNHFI 378
D VIYDNEK+++GW C+ + L I
Sbjct: 377 DLFVIYDNEKKQLGWVRAQCDRVQELESVI 406
>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 535
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 136/375 (36%), Positives = 192/375 (51%), Gaps = 31/375 (8%)
Query: 10 FFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP-CTGCTKPPEKQYKPHK--NI 66
FP Y+ +++G PP+ + D DTGS TWVQCDAP C C K Y+P + +
Sbjct: 154 LFPEGLYYTA-ISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAHPLYRPARTADA 212
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
+P S+P C NP +QCDYEI Y DG SS+G V D +G N
Sbjct: 213 LPASDPLCEGAQHENP-------NQCDYEISYADGSSSMGVYVRDSMQFVGEDGERENAD 265
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV-- 184
+ FGCGY+Q + T GVLGL +S+ +QL G+I N GHC+ + G
Sbjct: 266 IVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPSGAGG 325
Query: 185 -LFLGDGKVPSSGVAWTPMLQNSAD------LKHYILGPAELLYSGKSCGLKDLTLIFDS 237
LFLGD +P G+ W P+ AD +K G +L GK ++FD+
Sbjct: 326 YLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLT-----QVVFDT 380
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTLPICWRG--PFKALGQVTEYFKP 294
G++Y YF ++S + +P + D DKTLP C + P +++ V +FKP
Sbjct: 381 GSTYTYFPDEALTRLISSLKE--AASPRFVQDDSDKTLPFCMKSDFPVRSVEDVKHFFKP 438
Query: 295 LALSFTNRRNSVRLV-VPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVI 353
L+L F R R + PE YLVIS + NVCLG+LNG+ I+G++ ++ K+V
Sbjct: 439 LSLQFEKRFFFSRTFNIRPEHYLVISDKGNVCLGVLNGTTIGYDSVVIVGDVSLRGKLVA 498
Query: 354 YDNEKQRIGWKPEDC 368
YDN+K +GW DC
Sbjct: 499 YDNDKNEVGWVDFDC 513
>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 140/390 (35%), Positives = 207/390 (53%), Gaps = 28/390 (7%)
Query: 6 IEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT----KPPEKQYK 61
+E +P+ ++A L +G+P K + D DTGS+LTW++C P GC +PP Y
Sbjct: 28 LEGNVYPVGHFYAT-LNIGEPAKPYFLDVDTGSNLTWLECHPPVHGCKGCHPRPPHPYYT 86
Query: 62 PH--KNIVPCSNPRCAALHW--PNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPL 115
P K V C +P C A+ P P C ND +C YEI+Y G S G L TD+ +
Sbjct: 87 PADGKLKVVCGSPLCVAVRRDVPGIPECSR-NDPHRCHYEIQYVTGKSE-GDLATDIISV 144
Query: 116 RFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIG 174
+ + FGCGY Q P P G+LGLG G+ +QL+ +I+ NVIG
Sbjct: 145 NGRDKKR----IAFGCGYKQEEPPDSPPSPVNGILGLGMGKAGFAAQLKGLKMIKENVIG 200
Query: 175 HCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTL 233
HC+ G+GVL++GD P+ GV W PM ++ L +Y G AE+ + G
Sbjct: 201 HCLSSKGKGVLYVGDFNPPTRGVTWAPMRES---LFYYSPGLAEVFIDKQPIRGNPTFEA 257
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEY 291
+FDSG++Y + +++Y EIVS + + L+ + LP+CW+G PF ++ V
Sbjct: 258 VFDSGSTYTHVPAQIYNEIVSKVRGTFSESSLEEV-KGRALPLCWKGKKPFGSVNDVKNQ 316
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENN--IIGEIFMQ 348
FK L+L T+ R + L +PP+ YL + CL IL+ S + + E N +IG + MQ
Sbjct: 317 FKALSLKITHARGTNNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFILIGAVTMQ 376
Query: 349 DKMVIYDNEKQRIGWKPEDCNTLLSLNHFI 378
D VIYDNEK+++GW C+ + L I
Sbjct: 377 DLFVIYDNEKKQLGWVRAQCDRVQELESVI 406
>gi|62954897|gb|AAY23266.1| Similar to nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|77548966|gb|ABA91763.1| Aspartic proteinase Asp1 precursor, putative [Oryza sativa Japonica
Group]
Length = 307
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 119/301 (39%), Positives = 165/301 (54%), Gaps = 57/301 (18%)
Query: 91 QCDYEIEYGDGGSSIGALVTDLFPL-RFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGV 149
QCDYEI+Y DG S+IGAL+ D F L R + N+P FGCGYNQ
Sbjct: 28 QCDYEIKYADGASTIGALIVDQFSLPRIATRP--NLP--FGCGYNQ-------------- 69
Query: 150 LGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLFLGDG------------------ 190
G+G S L+ G+I ++V+GHC+ G G+LF+GDG
Sbjct: 70 -GIGE-NFQQTSPLKMLGIITKHVVGHCLSSGGGGLLFVGDGDGNLVLLHASLGSLCPIA 127
Query: 191 -KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVY 249
PSS PML N +Y G A L + S G+ + ++FDSG++Y YFT++ Y
Sbjct: 128 ISTPSS--YNEPMLMN-----YYSPGSATLYFDRHSLGMNPMDVVFDSGSTYTYFTAQPY 180
Query: 250 QEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVR 307
Q V I L T L+ D +LP+CW+G F+++ V + FK L L+F N N+V
Sbjct: 181 QATVYAIKGGLSSTSLEQV-SDPSLPLCWKGQKAFESVFDVKKEFKSLQLNFGN--NAV- 236
Query: 308 LVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPED 367
+ +PPE YL+++ NVCLGIL+G NIIG+I MQD+MVIYDNE++++GW
Sbjct: 237 MEIPPENYLIVTEYGNVCLGILHGCRLNF---NIIGDITMQDQMVIYDNEREQLGWIRGS 293
Query: 368 C 368
C
Sbjct: 294 C 294
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 112/385 (29%), Positives = 170/385 (44%), Gaps = 48/385 (12%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKP 62
P + + +G P + F+ DTGSD+ WV C +PC GC +
Sbjct: 79 PFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTC-SPCDGCPDSSGLGIELNLFDTTKSS 137
Query: 63 HKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNG 120
++PC++P CAA+ +C D C Y Y D + G VTD F +
Sbjct: 138 SARVLPCTDPICAAVS-TTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGES 196
Query: 121 SVFN--VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI- 177
++ N + FGC Q+ + G+ G G+G S++SQL G+ V HC+
Sbjct: 197 TIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLK 256
Query: 178 -GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL----- 231
G+NG G+L LG+ PS + ++P++ + HY L + SG+ +
Sbjct: 257 GGENGGGILVLGEILEPS--IVYSPLIPSQ---PHYTLKLQSIALSGQLFPNPTMFPISN 311
Query: 232 --TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQ 287
I DSG + AY VY IVS+I A P RG F+
Sbjct: 312 AGETIIDSGTTLAYLVEEVYDWIVSVITS---------AVSQSATPTISRGSQCFRVSMS 362
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYL----VISGRKNVCLGILNGSEAEVGENNIIG 343
V + F L +F +VV PE YL ++S K L + +AE G NI+G
Sbjct: 363 VADIFPVLRFNF---EGIASMVVTPEEYLQFDSIVSCYKFASLWCIGFQKAEDGL-NILG 418
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDC 368
++ ++DK+++YD +QRIGW DC
Sbjct: 419 DLVLKDKIIVYDLAQQRIGWANYDC 443
>gi|226530663|ref|NP_001146528.1| uncharacterized protein LOC100280120 [Zea mays]
gi|219887685|gb|ACL54217.1| unknown [Zea mays]
Length = 292
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 95/277 (34%), Positives = 139/277 (50%), Gaps = 20/277 (7%)
Query: 105 IGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR 164
+G V D +G N + FGCGY+Q + T GVLGL +S+ +QL
Sbjct: 1 MGVYVRDSMQFVGEDGERENADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLA 60
Query: 165 EYGLIRNVIGHCIGQN---GRGVLFLGDGKVPSSGVAWTPMLQNSAD------LKHYILG 215
G+I N GHC+ + G LFLGD +P G+ W P+ AD +K G
Sbjct: 61 SRGIISNAFGHCMSTDPSGAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHG 120
Query: 216 PAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTL 274
+L GK ++FD+G++Y YF ++S + +P + D DKTL
Sbjct: 121 DQQLNAQGKLT-----QVVFDTGSTYTYFPDEALTRLISSLKE--AASPRFVQDDSDKTL 173
Query: 275 PICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLV-VPPEAYLVISGRKNVCLGILNG 331
P C + P +++ V +FKPL+L F R R + PE YLVIS + NVCLG+LNG
Sbjct: 174 PFCMKSDFPVRSVEDVKHFFKPLSLQFEKRFFFSRTFNIRPEHYLVISDKGNVCLGVLNG 233
Query: 332 SEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+ I+G++ ++ K+V YDN+K +GW DC
Sbjct: 234 TTIGYDSVVIVGDVSLRGKLVAYDNDKNEVGWVDFDC 270
>gi|224097210|ref|XP_002334633.1| predicted protein [Populus trichocarpa]
gi|222873871|gb|EEF11002.1| predicted protein [Populus trichocarpa]
Length = 143
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 72/135 (53%), Positives = 98/135 (72%), Gaps = 3/135 (2%)
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLAL 297
SY Y S+ YQ ++SLI R+L PL+ A DD+TLPICW+G PFK++ V +YFK AL
Sbjct: 1 SYTYLNSQAYQGLISLIKRELSTKPLREALDDQTLPICWKGRKPFKSVHDVKKYFKTFAL 60
Query: 298 SFTNR-RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
SF N ++ +L PPEAYL++S + N CLG+LNG+E + + N+IG+I MQD++VIYDN
Sbjct: 61 SFANDGKSKTQLEFPPEAYLIVSSKGNACLGVLNGTEVGLNDLNVIGDISMQDRVVIYDN 120
Query: 357 EKQRIGWKPEDCNTL 371
EKQ IGW P +C+ L
Sbjct: 121 EKQLIGWAPGNCDRL 135
>gi|172034220|gb|ACB69715.1| putative nucellin-like aspartic protease [Hordeum vulgare]
Length = 310
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 90/255 (35%), Positives = 128/255 (50%), Gaps = 11/255 (4%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGV 184
G ++Q SP T+G+LGL IS+ SQL G+I NV GHCI + NG G
Sbjct: 14 FVLGVTFDQQGQLLSSPAKTSGILGLSSAAISLPSQLASKGIISNVFGHCITRETNGGGY 73
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
+FLGD VP G+ W P+ +L H G+ + +I G SY Y
Sbjct: 74 MFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGIP-VQVISRCGTSYTYL 132
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN 304
+Y+ ++ I D D TLP+CW+ F V +FKPL L F R
Sbjct: 133 PEEMYKNLIDAIKED--SPSFVQDSSDTTLPLCWKADFS----VRSFFKPLNLHFGRRWF 186
Query: 305 SV--RLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIG 362
V + P+ YL+IS + NVCLG+LNG+E G I+G++ ++ K+V+YDNE+++IG
Sbjct: 187 VVPKTFTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNERRQIG 246
Query: 363 WKPEDCNTLLSLNHF 377
W +C S F
Sbjct: 247 WANSECTKPQSQKGF 261
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 111/382 (29%), Positives = 168/382 (43%), Gaps = 45/382 (11%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKP 62
P + + +G P + F+ DTGSD+ WV C +PC GC +
Sbjct: 79 PFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTC-SPCDGCPDSSGLGIELNLFDTTKSS 137
Query: 63 HKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNG 120
++PC++P CAA+ +C D C Y Y D + G VTD F +
Sbjct: 138 SARVLPCTDPICAAVS-TTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGES 196
Query: 121 SVFN--VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI- 177
++ N + FGC Q+ + G+ G G+G S++SQL G+ V HC+
Sbjct: 197 TIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLK 256
Query: 178 -GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL----- 231
G+NG G+L LG+ PS + ++P++ + HY L + SG+ +
Sbjct: 257 GGENGGGILVLGEILEPS--IVYSPLIPSQ---PHYTLKLQSIALSGQLFPNPTMFPISN 311
Query: 232 --TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQ 287
I DSG + AY VY IVS+I A P RG F+
Sbjct: 312 AGETIIDSGTTLAYLVEEVYDWIVSVITS---------AVSQSATPTISRGSQCFRVSMS 362
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVI-SGRKNVCLGILNGSEAEVGENNIIGEIF 346
V + F L +F +VV PE YL S + L + +AE G NI+G++
Sbjct: 363 VADIFPVLRFNF---EGIASMVVTPEEYLQFDSIVREPALWCIGFQKAEDGL-NILGDLV 418
Query: 347 MQDKMVIYDNEKQRIGWKPEDC 368
++DK+++YD +QRIGW DC
Sbjct: 419 LKDKIIVYDLARQRIGWANYDC 440
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 147 bits (371), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 109/382 (28%), Positives = 172/382 (45%), Gaps = 53/382 (13%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK---------PPEKQYKPH 63
I + + +G PP+ ++ DTGSDL WV C PC GC P + +
Sbjct: 32 IAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCH-PCIGCPAFSDLKIPIVPYDVKASAS 90
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ VPCS+P C + + C N QC Y +YGDG ++G LV D+ + +
Sbjct: 91 SSKVPCSDPSCTLITQISESGCNDQN-QCGYSFQYGDGSGTLGYLVEDVLHYMVNATAT- 148
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNG 181
+ FGCG+ Q S G++G G +S SQL + G NV HC+ G+ G
Sbjct: 149 ---VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERG 205
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-------- 233
G+L LG+ P + +TP++ + HY ++ S +LT+
Sbjct: 206 GGILVLGNVIEPD--IQYTPLVPY---MSHY-----NVVLQSISVNNANLTIDPKLFSND 255
Query: 234 -----IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
IFDSG + AY YQ + L+ P L D L R +K V
Sbjct: 256 VMQGTIFDSGTTLAYLPDEAYQAFTQAV--SLVVAPFLLC--DTRLS---RFIYKLFPNV 308
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
YF+ +++ T +R A + G ++ + +E+E+ + I G++ ++
Sbjct: 309 VLYFEGASMTLTPAEYLIRQASAANAPIWCMGWQS-----MGSAESEL-QYTIFGDLVLK 362
Query: 349 DKMVIYDNEKQRIGWKPEDCNT 370
+K+V+YD E+ RIGW+P DC T
Sbjct: 363 NKLVVYDLERGRIGWRPFDCKT 384
>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 406
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 90/247 (36%), Positives = 123/247 (49%), Gaps = 25/247 (10%)
Query: 10 FFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP-CTGCTKPPEKQYKPHK--NI 66
FP Y+ +++G PP+ + D DTGS TWVQCDAP C C K Y+P + +
Sbjct: 154 LFPEGLYYTA-ISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAHPLYRPARTADA 212
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
+P S+P C NP +QCDYEI Y DG SS+G V D +G N
Sbjct: 213 LPASDPLCEGAQHENP-------NQCDYEISYADGSSSMGVYVRDSMQFVGEDGERENAD 265
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV-- 184
+ FGCGY+Q + T GVLGL +S+ +QL G+I N GHC+ + G
Sbjct: 266 IVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPSGAGG 325
Query: 185 -LFLGDGKVPSSGVAWTPMLQNSAD------LKHYILGPAELLYSGKSCGLKDLTLIFDS 237
LFLGD +P G+ W P+ AD +K G +L GK ++FD+
Sbjct: 326 YLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLT-----QVVFDT 380
Query: 238 GASYAYF 244
G++Y YF
Sbjct: 381 GSTYTYF 387
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 108/378 (28%), Positives = 172/378 (45%), Gaps = 43/378 (11%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK---------PPEKQYKPH 63
I + + +G PP+ ++ DTGSDL WV C PC GC P + +
Sbjct: 32 IAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCH-PCIGCPAFSDLKIPIVPYDVKASAS 90
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ VPCS+P C + + C N QC Y +YGDG ++G LV D+ + +
Sbjct: 91 SSKVPCSDPSCTLITQISESGCNDQN-QCGYSFQYGDGSGTLGYLVEDVLHYMVNATAT- 148
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNG 181
+ FGCG+ Q S G++G G +S SQL + G NV HC+ G+ G
Sbjct: 149 ---VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERG 205
Query: 182 RGVLFLGDGKVPSSGVAWTP----MLQNSADLKHYILGPAELLYSGKSCGLKDLT-LIFD 236
G+L LG+ P + +TP M + L+ + A L K + IFD
Sbjct: 206 GGILVLGNVIEPD--IQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLFSNDVMQGTIFD 263
Query: 237 SGASYAYFTSRVYQ---EIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
SG + AY YQ + VSL++ + +L+ R +K V YF+
Sbjct: 264 SGTTLAYLPDEAYQAFTQAVSLVVAPFLLCDTRLS----------RFIYKLFPNVVLYFE 313
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVI 353
+++ T +R A + G ++ + +E+E+ + I G++ +++K+V+
Sbjct: 314 GASMTLTPAEYLIRQASAANAPIWCMGWQS-----MGSAESEL-QYTIFGDLVLKNKLVV 367
Query: 354 YDNEKQRIGWKPEDCNTL 371
YD E+ RIGW+P DC L
Sbjct: 368 YDLERGRIGWRPFDCKFL 385
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 111/397 (27%), Positives = 181/397 (45%), Gaps = 58/397 (14%)
Query: 6 IEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP--------- 56
+ F+F + L +G PP+ F DTGSD+ WV C + C GC
Sbjct: 79 VGFYFGSFCRLYYTRLQLGSPPRDFYVQIDTGSDVLWVSCSS-CNGCPVSSGLHIPLNFF 137
Query: 57 EKQYKPHKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR 116
+ P +++ CS+ RC+ + C N+QC Y +YGDG + G V+DL L
Sbjct: 138 DPGSSPTASLISCSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDL--LH 195
Query: 117 FSN---GSVF---NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGL 168
F GSV + P+ FGC Q G L+ PD A G+ G G+ +S++SQL G+
Sbjct: 196 FDTILGGSVMKNSSAPIVFGCSTLQ--TGDLTKPDRAVDGIFGFGQQDMSVISQLASQGI 253
Query: 169 IRNVIGHCI--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC 226
V HC+ +G G+L LG+ P+ + +TP++ + HY L + +G++
Sbjct: 254 TPRVFSHCLKGDDSGGGILVLGEIVEPN--IVYTPLVPSQ---PHYNLNLQSIYVNGQTL 308
Query: 227 GL--------KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
+ + I DSG + AY T Y +S I ++P P
Sbjct: 309 AIDPSVFATSSNQGTIIDSGTTLAYLTEAAYDPFISAITS-------TVSP--SVSPYLS 359
Query: 279 RGP--FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGS 332
+G + + + F ++L+F +++ P+ YL+ I+G C+G
Sbjct: 360 KGNQCYLTSSSINDVFPQVSLNFA---GGTSMILIPQDYLIQQSSINGAALWCVGF---Q 413
Query: 333 EAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+ + E I+G++ ++DK+ +YD QRIGW DC
Sbjct: 414 KIQGQEITILGDLVLKDKIFVYDIAGQRIGWANYDCK 450
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 109/390 (27%), Positives = 174/390 (44%), Gaps = 56/390 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYK---------PHKNIV 67
+ + +G PP F+ DTGSD+ WV C++ C+GC + Q + +++
Sbjct: 75 YYTKVQLGTPPVEFNVQIDTGSDVLWVSCNS-CSGCPQTSGLQIQLNFFDPGSSSTSSMI 133
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR-FSNGSVFN-- 124
CS+ RC + C N+QC Y +YGDG + G V+D+ L GSV
Sbjct: 134 ACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNS 193
Query: 125 -VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQ 179
P+ FGC Q G L+ D A G+ G G+ +S++SQL G+ V HC+
Sbjct: 194 TAPVVFGCSNQQ--TGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDS 251
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------ 233
+G G+L LG+ P+ + +T ++ HY L + +G++ +
Sbjct: 252 SGGGILVLGEIVEPN--IVYTSLVPAQ---PHYNLNLQSIAVNGQTLQIDSSVFATSNSR 306
Query: 234 --IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVT 289
I DSG + AY Y VS I + + RG + VT
Sbjct: 307 GTIVDSGTTLAYLAEEAYDPFVSAITASI---------PQSVHTVVSRGNQCYLITSSVT 357
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNIIGEI 345
E F ++L+F +++ P+ YL+ I G C+G + I+G++
Sbjct: 358 EVFPQVSLNFAG---GASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGI---TILGDL 411
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
++DK+V+YD QRIGW DC+ LS+N
Sbjct: 412 VLKDKIVVYDLAGQRIGWANYDCS--LSVN 439
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 170/384 (44%), Gaps = 56/384 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
YF + +G PP+ F+ DTGSD+ WV C + C+ C + + +
Sbjct: 81 YF-TRVKLGTPPREFNVQIDTGSDVLWVTCSS-CSNCPQTSGLGIQLNYFDTTSSSTARL 138
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF--- 123
VPCS+P C + +C ++QC Y +YGDG + G V+D F G
Sbjct: 139 VPCSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIAN 198
Query: 124 -NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIG-- 178
+ + FGC + + G L+ D A G+ G G+G +S++SQL +G+ V HC+
Sbjct: 199 SSAAIVFGC--STYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLKGE 256
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----- 233
+G G+L LG+ P G+ ++P++ + HY L + SG+ +
Sbjct: 257 DSGGGILVLGEILEP--GIVYSPLVPSQ---PHYNLDLQSIAVSGQLLPIDPAAFATSSN 311
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQV 288
I D+G + AY Y VS I A P +G + V
Sbjct: 312 RGTIIDTGTTLAYLVEEAYDPFVSAITA---------AVSQLATPTINKGNQCYLVSNSV 362
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNIIGE 344
+E F P++ +F +++ PE YL+ +G C+G + G I+G+
Sbjct: 363 SEVFPPVSFNFA---GGATMLLKPEEYLMYLTNYAGAALWCIGF----QKIQGGITILGD 415
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDC 368
+ ++DK+ +YD QRIGW DC
Sbjct: 416 LVLKDKIFVYDLAHQRIGWANYDC 439
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 107/390 (27%), Positives = 172/390 (44%), Gaps = 56/390 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYK---------PHKNIV 67
+ + +G PP F+ DTGSD+ WV C++ C GC + Q + +++
Sbjct: 78 YYTKVQLGTPPVEFNVQIDTGSDVLWVSCNS-CNGCPQTSGLQIQLNFFDPGSSSTSSMI 136
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR--FSNGSVFN- 124
CS+ RC + C N+QC Y +YGDG + G V+D+ L F N
Sbjct: 137 ACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTNS 196
Query: 125 -VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQ 179
P+ FGC Q G L+ D A G+ G G+ +S++SQL G+ + HC+
Sbjct: 197 TAPVVFGCSNQQ--TGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGDS 254
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------ 233
+G G+L LG+ P+ + +T ++ HY L + +G++ +
Sbjct: 255 SGGGILVLGEIVEPN--IVYTSLVPAQ---PHYNLNLQSISVNGQTLQIDSSVFATSNSR 309
Query: 234 --IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVT 289
I DSG + AY Y VS I A + RG + VT
Sbjct: 310 GTIVDSGTTLAYLAEEAYDPFVSAI---------TAAIPQSVRTVVSRGNQCYLITSSVT 360
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNIIGEI 345
+ F ++L+F +++ P+ YL+ I G C+G + I+G++
Sbjct: 361 DVFPQVSLNFAG---GASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGI---TILGDL 414
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
++DK+V+YD QRIGW DC+ LS+N
Sbjct: 415 VLKDKIVVYDLAGQRIGWANYDCS--LSVN 442
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 105/385 (27%), Positives = 168/385 (43%), Gaps = 57/385 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH---------KNI 66
YF + +G PP F+ DTGSD+ WV C + C+ C H
Sbjct: 100 YFT-KVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPHSSGLGIDLHFFDAPGSLTAGS 157
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF--- 123
V CS+P C+++ +C N+QC Y YGDG + G +TD F G
Sbjct: 158 VTCSDPICSSVFQTTAAQCSE-NNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVAN 216
Query: 124 -NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ P+ FGC + + G L+ D A G+ G G+G++S+VSQL G+ V HC+ +
Sbjct: 217 SSAPIVFGC--STYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 274
Query: 181 GR--GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----- 233
G GV LG+ VP G+ ++P++ + HY L + +G+ L
Sbjct: 275 GSGGGVFVLGEILVP--GMVYSPLVPSQ---PHYNLNLLSIGVNGQMLPLDAAVFEASNT 329
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQV 288
I D+G + Y Y DL + + PI G + +
Sbjct: 330 RGTIVDTGTTLTYLVKEAY---------DLFLNAISNSVSQLVTPIISNGEQCYLVSTSI 380
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYL----VISGRKNVCLGILNGSEAEVGENNIIGE 344
++ F ++L+F +++ P+ YL + G C+G E E I+G+
Sbjct: 381 SDMFPSVSLNFA---GGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPE----EQTILGD 433
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCN 369
+ ++DK+ +YD +QRIGW DC+
Sbjct: 434 LVLKDKVFVYDLARQRIGWASYDCS 458
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 105/385 (27%), Positives = 168/385 (43%), Gaps = 57/385 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH---------KNI 66
YF + +G PP F+ DTGSD+ WV C + C+ C H
Sbjct: 105 YFT-KVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPHSSGLGIDLHFFDAPGSLTAGS 162
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF--- 123
V CS+P C+++ +C N+QC Y YGDG + G +TD F G
Sbjct: 163 VTCSDPICSSVFQTTAAQCSE-NNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVAN 221
Query: 124 -NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ P+ FGC + + G L+ D A G+ G G+G++S+VSQL G+ V HC+ +
Sbjct: 222 SSAPIVFGC--STYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 279
Query: 181 GR--GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----- 233
G GV LG+ VP G+ ++P++ + HY L + +G+ L
Sbjct: 280 GSGGGVFVLGEILVP--GMVYSPLVPSQ---PHYNLNLLSIGVNGQMLPLDAAVFEASNT 334
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQV 288
I D+G + Y Y DL + + PI G + +
Sbjct: 335 RGTIVDTGTTLTYLVKEAY---------DLFLNAISNSVSQLVTPIISNGEQCYLVSTSI 385
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYL----VISGRKNVCLGILNGSEAEVGENNIIGE 344
++ F ++L+F +++ P+ YL + G C+G E E I+G+
Sbjct: 386 SDMFPSVSLNFA---GGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPE----EQTILGD 438
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCN 369
+ ++DK+ +YD +QRIGW DC+
Sbjct: 439 LVLKDKVFVYDLARQRIGWASYDCS 463
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 118/389 (30%), Positives = 176/389 (45%), Gaps = 55/389 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK------------QYKPH 63
YFA + +G P K + DTGSD+ WV C GC + P K +
Sbjct: 155 YFA-KIGIGTPSKDYYVQVDTGSDILWVNC----AGCDRCPTKSDLGVDLTLYDMKASTT 209
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ V C + C+ P P CK P QC Y + YGDG S+ G V D +G+
Sbjct: 210 SDAVGCDDNFCSLYDGP-LPGCK-PGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQ 267
Query: 124 NVP----LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
P + FGCG Q S G+LG G+ S++SQL G ++ V HC+
Sbjct: 268 TTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDN 327
Query: 180 -NGRGVLFLGDGKVPSSGVAWTPMLQNSAD----LKHYILG------PAELLYSGKSCGL 228
+G G+ +G+ P V TP++QN A +K +G P++ SG G
Sbjct: 328 VDGGGIFAIGEVVEPK--VNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKG- 384
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTP-LKLAPDDKTLPICWRGPFKALGQ 287
I DSG + AYF VY V LI + L P L+L ++ F G
Sbjct: 385 ----TIIDSGTTLAYFPQEVY---VPLIEKILSQQPDLRLHTVEQAFTC-----FDYTGN 432
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILN-GSEAEVGEN-NIIGEI 345
V + F + L F S+ L V P YL C+G N G++ + G++ ++G++
Sbjct: 433 VDDGFPTVTLHFD---KSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGDL 489
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTLLSL 374
+ +K+V+YD EKQ IGW +C++ + +
Sbjct: 490 VLSNKLVVYDLEKQGIGWVEYNCSSSIKV 518
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 110/389 (28%), Positives = 174/389 (44%), Gaps = 55/389 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
YF + +G PP+ F+ DTGSD+ WV C++ C C + +
Sbjct: 66 YFT-KVKLGSPPREFNVQIDTGSDVLWVCCNS-CNNCPRTSGLGIQLNFFDSSSSSTAGQ 123
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF--- 123
V CS+P C + +C DQC Y +YGDG + G V+D G
Sbjct: 124 VRCSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDN 183
Query: 124 -NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ + FGC + + G L+ D A G+ G G+G +S++SQL G+ V HC+ +
Sbjct: 184 SSALIVFGC--SAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCLKGD 241
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------- 233
G G L G++ G+ ++P++ + HY L + +G+ +
Sbjct: 242 GSGGGILVLGEILEPGIVYSPLVPSQ---PHYNLNLLSIAVNGQLLPIDPAAFATSNSQG 298
Query: 234 -IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTE 290
I DSG + AY + Y VS + + I +P PI +G + V++
Sbjct: 299 TIVDSGTTLAYLVAEAYDPFVSAV--NAIVSP-------SVTPITSKGNQCYLVSTSVSQ 349
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNIIGEIF 346
F PLA SF N +V+ PE YL+ G C+G +V I+G++
Sbjct: 350 MF-PLA-SF-NFAGGASMVLKPEDYLIPFGSSGGSAMWCIGF-----QKVQGVTILGDLV 401
Query: 347 MQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
++DK+ +YD +QRIGW DC+ LS+N
Sbjct: 402 LKDKIFVYDLVRQRIGWANYDCS--LSVN 428
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 118/389 (30%), Positives = 176/389 (45%), Gaps = 55/389 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK------------QYKPH 63
YFA + +G P K + DTGSD+ WV C GC + P K +
Sbjct: 74 YFA-KIGIGTPSKDYYVQVDTGSDILWVNC----AGCDRCPTKSDLGVDLTLYDMKASTT 128
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ V C + C+ P P CK P QC Y + YGDG S+ G V D +G+
Sbjct: 129 SDAVGCDDNFCSLYDGP-LPGCK-PGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQ 186
Query: 124 NVP----LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
P + FGCG Q S G+LG G+ S++SQL G ++ V HC+
Sbjct: 187 TTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDN 246
Query: 180 -NGRGVLFLGDGKVPSSGVAWTPMLQNSAD----LKHYILG------PAELLYSGKSCGL 228
+G G+ +G+ P V TP++QN A +K +G P++ SG G
Sbjct: 247 VDGGGIFAIGEVVEPK--VNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKG- 303
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTP-LKLAPDDKTLPICWRGPFKALGQ 287
I DSG + AYF VY V LI + L P L+L ++ F G
Sbjct: 304 ----TIIDSGTTLAYFPQEVY---VPLIEKILSQQPDLRLHTVEQAFTC-----FDYTGN 351
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILN-GSEAEVGEN-NIIGEI 345
V + F + L F S+ L V P YL C+G N G++ + G++ ++G++
Sbjct: 352 VDDGFPTVTLHF---DKSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGDL 408
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTLLSL 374
+ +K+V+YD EKQ IGW +C++ + +
Sbjct: 409 VLSNKLVVYDLEKQGIGWVEYNCSSSIKV 437
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 176/391 (45%), Gaps = 57/391 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN----I 66
YF + +G P K + DTGSD+ WV C C GC + Y P + +
Sbjct: 90 YF-TRIGIGTPAKRYYVQVDTGSDILWVNC-VSCDGCPRKSNLGIELTMYDPRGSQSGEL 147
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
V C C A + P C + C+Y I YGDG S+ G VTD +G +
Sbjct: 148 VTCDQQFCVANYGGVLPSCTSTS-PCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTP 206
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N ++FGCG G L + A G+LG G+ S++SQL G +R + HC+
Sbjct: 207 ANASVSFGCGAKL--GGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTV 264
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY------------ILG-PAELLYSGKSC 226
NG G+ +G+ P V TP++ +D+ HY LG P + SG S
Sbjct: 265 NGGGIFAIGNVVQPK--VKTTPLV---SDMPHYNVILKGIDVGGTALGLPTNIFDSGNSK 319
Query: 227 GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
G I DSG + AY VY+ + +++ ++ D F+ G
Sbjct: 320 GT-----IIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFSC--------FQYSG 366
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNII-GE 344
V + F + F V L+V P YL +G+ C+G NG + + G++ ++ G+
Sbjct: 367 SVDDGFPEVTFHF---EGDVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGD 423
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
+ + +K+V+YD E Q IGW +C++ + ++
Sbjct: 424 LVLSNKLVLYDLENQAIGWADYNCSSSIKIS 454
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 105/385 (27%), Positives = 167/385 (43%), Gaps = 57/385 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH---------KNI 66
YF + +G PP F+ DTGSD+ WV C + C+ C H
Sbjct: 100 YF-TKVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPHSSGLGIDLHFFDAPGSLTAGS 157
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF--- 123
V CS+P C+++ +C N+QC Y YGDG + G +TD F G
Sbjct: 158 VTCSDPICSSVFQTTAAQCSE-NNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVAN 216
Query: 124 -NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ P+ FGC + + G L+ D A G+ G G+G++S+VSQL G+ V HC+ +
Sbjct: 217 SSAPIVFGC--STYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 274
Query: 181 GR--GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----- 233
G GV LG+ VP G+ ++P++ + HY L + +G+ L
Sbjct: 275 GSGGGVFVLGEILVP--GMVYSPLVPSQ---PHYNLNLLSIGVNGQMLPLDAAVFEASNT 329
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQV 288
I D+G + Y Y DL + + PI G + +
Sbjct: 330 RGTIVDTGTTLTYLVKEAY---------DLFLNAISNSVSQLVTPIISNGEQCYLVSTSI 380
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYL----VISGRKNVCLGILNGSEAEVGENNIIGE 344
++ F ++L+F +++ P+ YL + G C+G E E I+G+
Sbjct: 381 SDMFPSVSLNFA---GGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPE----EQTILGD 433
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCN 369
+ ++DK+ +YD +QRIGW DC
Sbjct: 434 LVLKDKVFVYDLARQRIGWASYDCK 458
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 117/397 (29%), Positives = 178/397 (44%), Gaps = 58/397 (14%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----------PEKQ 59
F + YF + +G PPK + DTGSD+ WV C +PCTGC P+
Sbjct: 86 FMVGLYF-TRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQLEFFNPDTS 143
Query: 60 YKPHKNIVPCSNPRCAALHWPNPPRCK-HPNDQCDYEIEYGDGGSSIGALVTDL--FPLR 116
K +PCS+ RC A + C+ N C Y YGDG + G V+D F
Sbjct: 144 STSSK--IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTV 201
Query: 117 FSNGSVFN--VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNV 172
N N + FGC +Q G L+ D A G+ G G+ ++S+VSQL G+ V
Sbjct: 202 MGNEQTANSSASIVFGCSNSQS--GDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKV 259
Query: 173 IGHCI--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD 230
HC+ NG G+L LG+ P G+ +TP++ + HY L ++ +G+ + D
Sbjct: 260 FSHCLKGSDNGGGILVLGEIVEP--GLVYTPLVPSQ---PHYNLNLESIVVNGQKLPI-D 313
Query: 231 LTL---------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP 281
+L I DSG + AY Y V+ I ++P ++L
Sbjct: 314 SSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITA-------AVSPSVRSLVSKGNQC 366
Query: 282 FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV--ISGRKNV--CLGILNGSEAEVG 337
F V F ++L F V + V PE YL+ S NV C+G ++
Sbjct: 367 FVTSSSVDSSFPTVSLYF---MGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQI- 422
Query: 338 ENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTLLSL 374
I+G++ ++DK+ +YD R+GW DC+T +++
Sbjct: 423 --TILGDLVLKDKIFVYDLANMRMGWTDYDCSTSVNV 457
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 107/388 (27%), Positives = 167/388 (43%), Gaps = 52/388 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKN--- 65
+ + +G PPK + DTGSD+ WV C C + P K Y P +
Sbjct: 86 YYTEIKLGTPPKHYYVQVDTGSDILWVNC----ITCEQCPHKSGLGLDLTLYDPKASSTG 141
Query: 66 -IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL----RFSNG 120
+V C CAA P+C N C+Y + YGDG S+IG+ VTD R
Sbjct: 142 SMVMCDQAFCAATFGGKLPKCG-ANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQT 200
Query: 121 SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N + FGCG Q S G+LG G S++SQL G ++ + HC+
Sbjct: 201 QPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDTI 260
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG------PAELLYSGKSCGLK 229
G G+ +GD P V TP++ + + +LK +G PA + G+ G
Sbjct: 261 KGGGIFSIGDVVQPK--VKTTPLVADKPHYNVNLKTIDVGGTTLQLPAHIFEPGEKKG-- 316
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
I DSG + Y V++E +M + + D +C++ P G V
Sbjct: 317 ---TIIDSGTTLTYLPELVFKE----VMLAVFNKHQDITFHDVQGFLCFQYP----GSVD 365
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNII--GEIFM 347
+ F + F + + L V P Y +G C+G NG+ +I+ G++ +
Sbjct: 366 DGFPTITFHF---EDDLALHVYPHEYFFANGNDVYCVGFQNGASQSKDGKDIVLMGDLVL 422
Query: 348 QDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
+K+VIYD E + IGW +C++ + +
Sbjct: 423 SNKLVIYDLENRVIGWTDYNCSSSIKIK 450
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 117/397 (29%), Positives = 178/397 (44%), Gaps = 58/397 (14%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----------PEKQ 59
F + YF + +G PPK + DTGSD+ WV C +PCTGC P+
Sbjct: 86 FMVGLYF-TRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQLEFFNPDTS 143
Query: 60 YKPHKNIVPCSNPRCAALHWPNPPRCK-HPNDQCDYEIEYGDGGSSIGALVTDL--FPLR 116
K +PCS+ RC A + C+ N C Y YGDG + G V+D F
Sbjct: 144 STSSK--IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSV 201
Query: 117 FSNGSVFN--VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNV 172
N N + FGC +Q G L+ D A G+ G G+ ++S+VSQL G+ V
Sbjct: 202 MGNEQTANSSASIVFGCSNSQS--GDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKV 259
Query: 173 IGHCI--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD 230
HC+ NG G+L LG+ P G+ +TP++ + HY L ++ +G+ + D
Sbjct: 260 FSHCLKGSDNGGGILVLGEIVEP--GLVYTPLVPSQ---PHYNLNLESIVVNGQKLPI-D 313
Query: 231 LTL---------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP 281
+L I DSG + AY Y V+ I ++P ++L
Sbjct: 314 SSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAIT-------AAVSPSVRSLVSKGNQC 366
Query: 282 FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV--ISGRKNV--CLGILNGSEAEVG 337
F V F ++L F V + V PE YL+ S NV C+G ++
Sbjct: 367 FVTSSSVDSSFPTVSLYF---MGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQI- 422
Query: 338 ENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTLLSL 374
I+G++ ++DK+ +YD R+GW DC+T +++
Sbjct: 423 --TILGDLVLKDKIFVYDLANMRMGWTDYDCSTSVNV 457
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 102/383 (26%), Positives = 168/383 (43%), Gaps = 53/383 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH---------KNI 66
YF + +G PP F+ DTGSD+ WV C + C+ C H
Sbjct: 100 YFT-KVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPHSSGLGIDLHFFDAPGSFTAGS 157
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF--- 123
V CS+P C+++ +C N+QC Y YGDG + G +TD F G
Sbjct: 158 VTCSDPICSSVFQTTAAQCSE-NNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVAN 216
Query: 124 -NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ P+ FGC + + G L+ D A G+ G G+G++S+VSQL G+ V HC+ +
Sbjct: 217 SSAPIVFGC--STYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 274
Query: 181 GR--GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----- 233
G GV LG+ VP G+ ++P+L + HY L + +G+ +
Sbjct: 275 GSGGGVFVLGEILVP--GMVYSPLLPSQ---PHYNLNLLSIGVNGQILPIDAAVFEASNT 329
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I D+G + Y Y ++ I + + + + + +++
Sbjct: 330 RGTIVDTGTTLTYLVKEAYDPFLNAISNSVSQLVTLIISNGEQC-------YLVSTSISD 382
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYL----VISGRKNVCLGILNGSEAEVGENNIIGEIF 346
F P++L+F +++ P+ YL G C+G E E I+G++
Sbjct: 383 MFPPVSLNFA---GGASMMLRPQDYLFHYGFYDGASMWCIGFQKAPE----EQTILGDLV 435
Query: 347 MQDKMVIYDNEKQRIGWKPEDCN 369
++DK+ +YD +QRIGW DC+
Sbjct: 436 LKDKVFVYDLARQRIGWANYDCS 458
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 111/391 (28%), Positives = 172/391 (43%), Gaps = 59/391 (15%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKN-- 65
YF + +G PPK + DTGSD+ WV C C K P K Y P +
Sbjct: 84 YF-TEIKLGTPPKRYYVQVDTGSDILWVNC----ISCEKCPRKSGLGLDLTFYDPKASSS 138
Query: 66 --IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
V C CAA + P C N C+Y + YGDG S+ G VTD G
Sbjct: 139 GSTVSCDQGFCAATYGGKLPGCT-ANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQ 197
Query: 124 ----NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
N +TFGCG Q S G+LG G+ S++SQL G ++ + HC+
Sbjct: 198 TQPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLDT 257
Query: 180 -NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILG-------------PAELLYSGKS 225
G G+ +G+ P V TP++ AD+ HY + PA + +G+
Sbjct: 258 IKGGGIFAIGNVVQPK--VKTTPLV---ADMPHYNVNLKSIDVGGTTLQLPAHVFETGER 312
Query: 226 CGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
G I DSG + Y V++E+++ I D +C++ P
Sbjct: 313 KG-----TIIDSGTTLTYLPELVFKEVMAAIFNKHQDIVFHNVQD----FMCFQYP---- 359
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNII-G 343
G V + F + F + + L V P Y +G C+G NG+ +++ G++ ++ G
Sbjct: 360 GSVDDGFPTITFHF---EDDLALHVYPHEYFFPNGNDMYCVGFQNGALQSKDGKDIVLMG 416
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDCNTLLSL 374
++ + +K+VIYD E Q IGW +C++ + +
Sbjct: 417 DLVLSNKLVIYDLENQVIGWTDYNCSSSIKI 447
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 175/391 (44%), Gaps = 57/391 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN----I 66
YF + +G P K + DTGSD+ WV C C GC + Y P + +
Sbjct: 90 YF-TRIGIGTPAKRYYVQVDTGSDILWVNC-VSCDGCPRKSNLGIELTMYDPRGSQSGEL 147
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
V C C A + P C + C+Y I YGDG S+ G VTD +G +
Sbjct: 148 VTCDQQFCVANYGGVLPSCTSTS-PCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTP 206
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N ++FGCG G L + A G+LG G+ S++SQL G +R + HC+
Sbjct: 207 ANASVSFGCGAKL--GGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTV 264
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY------------ILG-PAELLYSGKSC 226
NG G+ +G+ P V TP++ D+ HY LG P + SG S
Sbjct: 265 NGGGIFAIGNVVQPK--VKTTPLV---PDMPHYNVILKGIDVGGTALGLPTNIFDSGNSK 319
Query: 227 GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
G I DSG + AY VY+ + +++ ++ D F+ G
Sbjct: 320 GT-----IIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFSC--------FQYSG 366
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNII-GE 344
V + F + F V L+V P YL +G+ C+G NG + + G++ ++ G+
Sbjct: 367 SVDDGFPEVTFHF---EGDVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGD 423
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
+ + +K+V+YD E Q IGW +C++ + ++
Sbjct: 424 LVLSNKLVLYDLENQAIGWADYNCSSSIKIS 454
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 134 bits (336), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 116/392 (29%), Positives = 176/392 (44%), Gaps = 58/392 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----------PEKQYKPHK 64
YF + +G PPK + DTGSD+ WV C +PCTGC P+ K
Sbjct: 117 YF-TRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQLEFFNPDTSSTSSK 174
Query: 65 NIVPCSNPRCAALHWPNPPRCK-HPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGS 121
+PCS+ RC A + C+ N C Y YGDG + G V+D F N
Sbjct: 175 --IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQ 232
Query: 122 VFN--VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
N + FGC +Q G L+ D A G+ G G+ ++S+VSQL G+ V HC+
Sbjct: 233 TANSSASIVFGCSNSQS--GDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL 290
Query: 178 --GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-- 233
NG G+L LG+ P G+ +TP++ + HY L ++ +G+ + D +L
Sbjct: 291 KGSDNGGGILVLGEIVEP--GLVYTPLVPSQ---PHYNLNLESIVVNGQKLPI-DSSLFT 344
Query: 234 -------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
I DSG + AY Y V+ I ++P ++L F
Sbjct: 345 TSNTQGTIVDSGTTLAYLADGAYDPFVNAITA-------AVSPSVRSLVSKGNQCFVTSS 397
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLV--ISGRKNV--CLGILNGSEAEVGENNII 342
V F ++L F V + V PE YL+ S NV C+G ++ I+
Sbjct: 398 SVDSSFPTVSLYF---MGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQI---TIL 451
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDCNTLLSL 374
G++ ++DK+ +YD R+GW DC+T +++
Sbjct: 452 GDLVLKDKIFVYDLANMRMGWTDYDCSTSVNV 483
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 118/389 (30%), Positives = 176/389 (45%), Gaps = 56/389 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK------------QYKPH 63
YFA + +G P K + DTGSD+ WV C GC + P K +
Sbjct: 155 YFA-KIGIGTPSKDYYVQVDTGSDILWVNC----AGCDRCPTKSDLGVDLTLYDMKASTT 209
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ V C + C+ P P CK P QC Y + YGDG S+ G V D +G+
Sbjct: 210 SDAVGCDDNFCSLYDGP-LPGCK-PGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQ 267
Query: 124 NVP----LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
P + FGCG Q S G+LG G+ S++SQL G ++ V HC+
Sbjct: 268 TTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDN 327
Query: 180 -NGRGVLFLGDGKVPSSGVAWTPMLQNSAD----LKHYILG------PAELLYSGKSCGL 228
+G G+ +G+ P V TP++QN A +K +G P++ SG G
Sbjct: 328 VDGGGIFAIGEVVEPK--VNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKG- 384
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTP-LKLAPDDKTLPICWRGPFKALGQ 287
I DSG + AYF VY V LI + L P L+L ++ F G
Sbjct: 385 ----TIIDSGTTLAYFPQEVY---VPLIEKILSQQPDLRLHTVEQAFTC-----FDYTGN 432
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILN-GSEAEVGEN-NIIGEI 345
V + F + L F S+ L V P YL C+G N G++ + G++ ++G++
Sbjct: 433 VDDGFPTVTLHFD---KSISLTVYPHEYL-FQHEFEWCIGWQNSGAQTKDGKDLTLLGDL 488
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTLLSL 374
+ +K+V+YD EKQ IGW +C++ + +
Sbjct: 489 VLSNKLVVYDLEKQGIGWVEYNCSSSIKV 517
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 168/382 (43%), Gaps = 44/382 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYK---------PHKNI 66
YF + +G PPK + DTGSD+ W+ C PC C ++
Sbjct: 74 YFT-KIKLGSPPKEYHVQVDTGSDILWINC-KPCPKCPTKTNLNFRLSLFDMNASSTSKK 131
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
V C + C+ + + C+ P C Y I Y D +S G + D+ L G + P
Sbjct: 132 VGCDDDFCSFISQSDS--CQ-PALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGP 188
Query: 127 L----TFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
L FGCG +Q G L D+A GV+G G+ S++SQL G + V HC+ N
Sbjct: 189 LGQEVVFGCGSDQ--SGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL-DN 245
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIF 235
+G G V S V TPM+ N HY + + G S L ++ I
Sbjct: 246 VKGGGIFAVGVVDSPKVKTTPMVPNQM---HYNVMLMGMDVDGTSLDLPRSIVRNGGTIV 302
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + AYF +Y ++ I L P+KL ++T F V E F P+
Sbjct: 303 DSGTTLAYFPKVLYDSLIETI---LARQPVKLHIVEETFQC-----FSFSTNVDEAFPPV 354
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNG--SEAEVGENNIIGEIFMQDKMVI 353
+ F +SV+L V P YL + C G G + E E ++G++ + +K+V+
Sbjct: 355 SFEF---EDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVV 411
Query: 354 YDNEKQRIGWKPEDCNTLLSLN 375
YD + + IGW +C++ + +
Sbjct: 412 YDLDNEVIGWADHNCSSSIKIK 433
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 114/390 (29%), Positives = 170/390 (43%), Gaps = 52/390 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKNI---- 66
YF + +G P K F DTGSD+ WV C +PCTGC + + P +
Sbjct: 5 YF-TRVKLGNPAKEFFVQIDTGSDILWVTC-SPCTGCPTSSGLNIQLESFNPDSSSTASR 62
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQ---CDYEIEYGDGGSSIGALVTD--LFPLRFSNGS 121
+ CS+ RC A C+ N Q C Y YGDG + G V+D F N
Sbjct: 63 ITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ 122
Query: 122 VFN--VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
N + FGC +Q G L+ D A G+ G G+ ++S++SQL G+ V HC+
Sbjct: 123 TANSSASIVFGCSNSQS--GDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL 180
Query: 178 --GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-- 233
NG G+L LG+ P G+ +TP++ + HY L + +G+ + D +L
Sbjct: 181 KGSDNGGGILVLGEIVEP--GLVYTPLVPSQ---PHYNLNLESIAVNGQKLPI-DSSLFT 234
Query: 234 -------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
I DSG + AY Y VS I ++P ++L F
Sbjct: 235 TSNTQGTIVDSGTTLAYLADGAYDPFVSAI-------AAAVSPSVRSLVSKGSQCFITSS 287
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGR-KNVCLGILNGSEAEVGENNIIGEI 345
V F + L F V + V PE YL+ N L + + E I+G++
Sbjct: 288 SVDSSFPTVTLYF---MGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDL 344
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
++DK+ +YD R+GW DC+ +S+N
Sbjct: 345 VLKDKIFVYDLANMRMGWADYDCS--MSVN 372
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 107/392 (27%), Positives = 177/392 (45%), Gaps = 55/392 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNIV 67
+ + +G PP+ F DTGSD+ WV C PC C + + + +
Sbjct: 41 YYTRIELGTPPRPFYVQIDTGSDILWVNC-KPCNACPLTSGLGVALNFFDPRGSSTASPL 99
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL-RFSNGSVFN-- 124
C + +C + + + C + C Y EYGDG ++G V+D F ++ N V N
Sbjct: 100 SCIDSKCVSSNQISESVCT-TDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNNA 158
Query: 125 -VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQ 179
+TFGC YNQ G L+ PD A G+ G G+ +S+VSQL GL + HC+
Sbjct: 159 SAKITFGCSYNQS--GDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGAD 216
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------ 233
G G+L LG+ P G+ +TP++ + HY L + +G+ +
Sbjct: 217 PGGGILVLGEITEP--GMVYTPIVPSQ---PHYNLNLQGIAVNGQQLSIDPQVFATTNTR 271
Query: 234 --IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVT 289
I D G + AY Y+ V+ I+ A T P +G F + +
Sbjct: 272 GTIIDCGTTLAYLAEEAYEPFVNTIIA---------AVSQSTQPFMLKGNPCFLTVHSID 322
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV----CLG-ILNGSEA-EVGENNIIG 343
E F + L F + + P+ YL+ + C+G +G +A + + I+G
Sbjct: 323 EIFPSVTLYF----EGAPMDLKPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILG 378
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
++ ++DK+ +YD E QRIGW DC++ ++++
Sbjct: 379 DLVLKDKVFVYDLENQRIGWTSFDCSSTVNVS 410
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 111/373 (29%), Positives = 159/373 (42%), Gaps = 36/373 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK--------PPEKQYKPHKNIV 67
YFA + +G P + F DTGSD+ WV C A C C + P + V
Sbjct: 85 YFA-KIGLGTPSRDFHVQVDTGSDILWVNC-AGCIRCPRKSDLVELTPYDADASSTAKSV 142
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS----VF 123
CS+ C+ + N H C Y I YGDG S+ G LV D+ L G+
Sbjct: 143 SCSDNFCS---YVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGST 199
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
N + FGCG Q S G++G G+ S +SQL G ++ HC+ N G
Sbjct: 200 NGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGG 259
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSA----DLKHYILGPAELLYSGKSCGL-KDLTLIFDSG 238
+F G+V S V TPML SA +L +G + L S + D +I DSG
Sbjct: 260 GIF-AIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLQLSSDAFDSGDDKGVIIDSG 318
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+ Y VY +++ I+ L D T F + ++ + F +
Sbjct: 319 TTLVYLPDAVYNPLMNQILASHQELNLHTVQDSFTC-------FHYIDRL-DRFPTVTFQ 370
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENN--IIGEIFMQDKMVIYDN 356
F SV L V P+ YL C G NG G + I+G++ + +K+V+YD
Sbjct: 371 FD---KSVSLAVYPQEYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDI 427
Query: 357 EKQRIGWKPEDCN 369
E Q IGW +C+
Sbjct: 428 ENQVIGWTNHNCS 440
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 103/395 (26%), Positives = 172/395 (43%), Gaps = 59/395 (14%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHK 64
+ + + +G PP+ F DTGSD+ W+ C+ C+ C K +
Sbjct: 81 YGLYTTKVKMGTPPREFTVQIDTGSDILWINCNT-CSNCPKSSGLGIELNFFDTVGSSTA 139
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSV 122
+VPCS+P CA+ +C +QC Y +Y DG + G V+D F + +
Sbjct: 140 ALVPCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTP 199
Query: 123 FNVP----LTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHC 176
NV + FGC + + G L+ D A G+LG G G +S+VSQL G+ V HC
Sbjct: 200 ANVASSATIVFGC--STYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHC 257
Query: 177 I--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL- 233
+ NG G+L LG+ PS + ++P++ + HY L + +G+ +
Sbjct: 258 LKGDGNGGGILVLGEILEPS--IVYSPLVPSQ---PHYNLNLQSIAVNGQVLSINPAVFA 312
Query: 234 -------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKA 284
I DSG + +Y Y +V+ + A +G +
Sbjct: 313 TSDKRGTIIDSGTTLSYLVQEAYDPLVNAV---------DTAVSQFATSFISKGSQCYLV 363
Query: 285 LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENN 340
L + + F ++ +F + + P YL+ G K C+G E
Sbjct: 364 LTSIDDSFPTVSFNF---EGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKVQEGV----T 416
Query: 341 IIGEIFMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
I+G++ ++DK+V+YD +Q+IGW DC+ +S+N
Sbjct: 417 ILGDLVLKDKIVVYDLARQQIGWTNYDCS--MSVN 449
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 114/390 (29%), Positives = 170/390 (43%), Gaps = 52/390 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKNI---- 66
YF + +G P K F DTGSD+ WV C +PCTGC + + P +
Sbjct: 91 YF-TRVKLGNPAKEFFVQIDTGSDILWVTC-SPCTGCPTSSGLNIQLESFNPDSSSTASR 148
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQ---CDYEIEYGDGGSSIGALVTD--LFPLRFSNGS 121
+ CS+ RC A C+ N Q C Y YGDG + G V+D F N
Sbjct: 149 ITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ 208
Query: 122 VFN--VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
N + FGC +Q G L+ D A G+ G G+ ++S++SQL G+ V HC+
Sbjct: 209 TANSSASIVFGCSNSQS--GDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL 266
Query: 178 --GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-- 233
NG G+L LG+ P G+ +TP++ + HY L + +G+ + D +L
Sbjct: 267 KGSDNGGGILVLGEIVEP--GLVYTPLVPSQ---PHYNLNLESIAVNGQKLPI-DSSLFT 320
Query: 234 -------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
I DSG + AY Y VS I ++P ++L F
Sbjct: 321 TSNTQGTIVDSGTTLAYLADGAYDPFVSAI-------AAAVSPSVRSLVSKGSQCFITSS 373
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGR-KNVCLGILNGSEAEVGENNIIGEI 345
V F + L F V + V PE YL+ N L + + E I+G++
Sbjct: 374 SVDSSFPTVTLYFM---GGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDL 430
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
++DK+ +YD R+GW DC+ +S+N
Sbjct: 431 VLKDKIFVYDLANMRMGWADYDCS--MSVN 458
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 114/390 (29%), Positives = 170/390 (43%), Gaps = 52/390 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKNI---- 66
YF + +G P K F DTGSD+ WV C +PCTGC + + P +
Sbjct: 89 YF-TRVKLGNPAKEFFVQIDTGSDILWVTC-SPCTGCPTSSGLNIQLESFNPDSSSTASR 146
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQ---CDYEIEYGDGGSSIGALVTD--LFPLRFSNGS 121
+ CS+ RC A C+ N Q C Y YGDG + G V+D F N
Sbjct: 147 ITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ 206
Query: 122 VFN--VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
N + FGC +Q G L+ D A G+ G G+ ++S++SQL G+ V HC+
Sbjct: 207 TANSSASIVFGCSNSQS--GDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL 264
Query: 178 --GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-- 233
NG G+L LG+ P G+ +TP++ + HY L + +G+ + D +L
Sbjct: 265 KGSDNGGGILVLGEIVEP--GLVYTPLVPSQ---PHYNLNLESIAVNGQKLPI-DSSLFT 318
Query: 234 -------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
I DSG + AY Y VS I ++P ++L F
Sbjct: 319 TSNTQGTIVDSGTTLAYLADGAYDPFVSAI-------AAAVSPSVRSLVSKGSQCFITSS 371
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGR-KNVCLGILNGSEAEVGENNIIGEI 345
V F + L F V + V PE YL+ N L + + E I+G++
Sbjct: 372 SVDSSFPTVTLYFM---GGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDL 428
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
++DK+ +YD R+GW DC+ +S+N
Sbjct: 429 VLKDKIFVYDLANMRMGWADYDCS--MSVN 456
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 109/380 (28%), Positives = 171/380 (45%), Gaps = 36/380 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE---------KQYKPHKNI 66
YFA + +G P + + DTGSD+ WV C A CT C K + N
Sbjct: 74 YFA-KIGLGTPVQDYYVQVDTGSDILWVNC-AGCTNCPKKSDLGIELSLYSPSSSSTSNR 131
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
V C+ C + + P C P C+Y + YGDG S+ G V D L G +
Sbjct: 132 VTCNQDFCTSTYDGPIPGCT-PELLCEYRVAYGDGSSTAGYFVRDHVVLDRVTGNFQTTS 190
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NG 181
N + FGCG Q + G+LG G+ S++SQL G ++ V HC+ NG
Sbjct: 191 TNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLDNING 250
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILG---PAELLYSGKSCGLKDLT--LIFD 236
G+ +G+ P V TP++ A ++ E+L DL I D
Sbjct: 251 GGIFAIGEVVQPK--VRTTPLVPQQAHYNVFMKAIEVDNEVLNLPTDVFDTDLRKGTIID 308
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
SG + AYF +Y+ ++S I + LKL ++ F+ G V + F +
Sbjct: 309 SGTTLAYFPDVIYEPLISKIFARQ--STLKLHTVEEQFTC-----FEYDGNVDDGFPTVT 361
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILN-GSEAEVGENNI-IGEIFMQDKMVIY 354
F +S+ L V P YL C+G N G+++ G++ I +G++ +Q+++V+Y
Sbjct: 362 FHF---EDSLSLTVYPHEYLFDIDSNKWCVGWQNSGAQSRDGKDMILLGDLVLQNRLVMY 418
Query: 355 DNEKQRIGWKPEDCNTLLSL 374
D E Q IGW +C++ + +
Sbjct: 419 DLENQTIGWTEYNCSSSIKV 438
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 105/394 (26%), Positives = 176/394 (44%), Gaps = 54/394 (13%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-----YKPHKNI- 66
+ + L +G PP+ F DTGSD+ WV C A C GC + Q + P ++
Sbjct: 77 VVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSC-ASCNGCPQTSGLQIQLNFFDPGSSVT 135
Query: 67 ---VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ CS+ RC+ + C N+ C Y +YGDG + G V+D+ GS
Sbjct: 136 ASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 124 ----NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
P+ FGC +Q G L D A G+ G G+ +S++SQL G+ V HC+
Sbjct: 196 VPNSTAPVVFGCSTSQ--TGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL 253
Query: 178 -GQN-GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-- 233
G+N G G+L LG+ P+ + +TP++ + HY + + +G++ +
Sbjct: 254 KGENGGGGILVLGEIVEPN--MVFTPLVPSQ---PHYNVNLLSISVNGQALPINPSVFST 308
Query: 234 ------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKAL 285
I D+G + AY + Y V I A P+ +G +
Sbjct: 309 SNGQGTIIDTGTTLAYLSEAAYVPFVEAITN---------AVSQSVRPVVSKGNQCYVIT 359
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNI 341
V + F P++L+F + + P+ YL+ + G C+G + I
Sbjct: 360 TSVGDIFPPVSLNFA---GGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGI---TI 413
Query: 342 IGEIFMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
+G++ ++DK+ +YD QRIGW DC+T ++++
Sbjct: 414 LGDLVLKDKIFVYDLVGQRIGWANYDCSTSVNVS 447
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 105/394 (26%), Positives = 176/394 (44%), Gaps = 54/394 (13%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-----YKPHKNI- 66
+ + L +G PP+ F DTGSD+ WV C A C GC + Q + P ++
Sbjct: 77 VVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSC-ASCNGCPQTSGLQIQLNFFDPGSSVT 135
Query: 67 ---VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ CS+ RC+ + C N+ C Y +YGDG + G V+D+ GS
Sbjct: 136 ASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 124 ----NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
P+ FGC +Q G L D A G+ G G+ +S++SQL G+ V HC+
Sbjct: 196 VPNSTAPVVFGCSTSQ--TGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL 253
Query: 178 -GQN-GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-- 233
G+N G G+L LG+ P+ + +TP++ + HY + + +G++ +
Sbjct: 254 KGENGGGGILVLGEIVEPN--MVFTPLVPSQ---PHYNVNLLSISVNGQALPINPSVFST 308
Query: 234 ------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKAL 285
I D+G + AY + Y V I A P+ +G +
Sbjct: 309 SNGQGTIIDTGTTLAYLSEAAYVPFVEAITN---------AVSQSVRPVVSKGNQCYVIT 359
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNI 341
V + F P++L+F + + P+ YL+ + G C+G + I
Sbjct: 360 TSVGDIFPPVSLNFA---GGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGI---TI 413
Query: 342 IGEIFMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
+G++ ++DK+ +YD QRIGW DC+T ++++
Sbjct: 414 LGDLVLKDKIFVYDLVGQRIGWANYDCSTSVNVS 447
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 106/374 (28%), Positives = 170/374 (45%), Gaps = 31/374 (8%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
+ Y+ + +G PP+ F DTGS LT+V C + C C K + ++P + P
Sbjct: 89 YGYYTTRIWIGTPPQTFALIVDTGSTLTYVPC-STCEQCGKHQDPNFQP--DWSSTYQPL 145
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGCG 132
++ C C Y+ +Y + SS G L D+ + F S T FGC
Sbjct: 146 KCSMEC----TCDSEMMHCVYDRQYAEMSSSSGVLGEDI--VSFGKQSELKPQRTVFGC- 198
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLGDG 190
G + G++GLGRG +SIV QL E G+I N C G G G + LG G
Sbjct: 199 -ENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVLG-G 256
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYAYF 244
P +G+ +T + A +Y + E+ +GK + + I DSG +YAY
Sbjct: 257 ISPPAGMVFTH--SDPARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTTYAYL 314
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN 304
++ IM++L L PD IC+ G + Q+++ F + L F+N
Sbjct: 315 PEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGN- 373
Query: 305 SVRLVVPPEAYLVISGRKN--VCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIG 362
RL + PE YL + + CLGI + E + ++G I +++ +V+YD E +IG
Sbjct: 374 --RLSLSPENYLFQHSKAHGAYCLGIF---QNENDQTTLLGGIIVRNTLVMYDREHLKIG 428
Query: 363 WKPEDCNTLLSLNH 376
+ +C+ + + H
Sbjct: 429 FWKTNCSEIWEILH 442
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 107/383 (27%), Positives = 171/383 (44%), Gaps = 42/383 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN--IVPC 69
+ + +G P K + DTGSD+ WV C C GC QY P + V C
Sbjct: 85 YYTQIEIGSPSKGYYVQVDTGSDILWVNC-IRCDGCPTTSGLGIELTQYDPAGSGTTVGC 143
Query: 70 SNPRCAALHWPN--PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP- 126
C A + PN PP C + C + I YGDG S+ G V+D +G+ P
Sbjct: 144 DQEFCVA-NSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQTTPS 202
Query: 127 ---LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGR 182
+TFGCG S G+LG G+ S++SQL +R + HC+ +G
Sbjct: 203 NASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCLDTVHGG 262
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--------I 234
G+ +G+ P V TP++QN + HY + + G + L T I
Sbjct: 263 GIFAIGNVVQPK--VKTTPLVQN---VTHYNVNLQGISVGGATLQLPSSTFDSGDSKGTI 317
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG + AY VY+ +++ + LA + +C F+ G + + F
Sbjct: 318 IDSGTTLAYLPREVYRTLLTAVFDKY----QDLALHNYQDFVC----FQFSGSIDDGFPV 369
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNII-GEIFMQDKMV 352
+ SF + L V P YL + C+G L+G + + G++ ++ G++ + +K+V
Sbjct: 370 VTFSF---EGEITLNVYPHDYLFQNENDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLV 426
Query: 353 IYDNEKQRIGWKPEDCNTLLSLN 375
+YD EKQ IGW +C++ + +
Sbjct: 427 VYDLEKQVIGWADYNCSSSIKIQ 449
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 170/382 (44%), Gaps = 47/382 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE--------KQYKPHKN 65
+ Y+ + +G PP+ F DTGS LT+V C + C C K + Y+P K
Sbjct: 89 YGYYTTRIWIGTPPQTFALIVDTGSTLTYVPC-STCEQCGKHQDPNFQPDWSSTYQPLKC 147
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV 125
+ C+ C C Y+ +Y + SS G L D+ + F S
Sbjct: 148 SMECT--------------CDSEMMHCVYDRQYAEMSSSSGVLGEDI--VSFGKQSELKP 191
Query: 126 PLT-FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGR 182
T FGC G + G++GLGRG +SIV QL E G+I N C G G
Sbjct: 192 QRTVFGC--ENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGG 249
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFD 236
G + LG G P +G+ +T + A +Y + E+ +GK + + I D
Sbjct: 250 GAMVLG-GISPPAGMVFTH--SDPARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTILD 306
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
SG +YAY ++ IM++L L PD IC+ G + Q+++ F +
Sbjct: 307 SGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVD 366
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKN--VCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
L F+N RL + PE YL + + CLGI + E + ++G I +++ +V+Y
Sbjct: 367 LVFSNGN---RLSLSPENYLFQHSKAHGAYCLGIF---QNENDQTTLLGGIIVRNTLVMY 420
Query: 355 DNEKQRIGWKPEDCNTLLSLNH 376
D E +IG+ +C+ + + H
Sbjct: 421 DREHLKIGFWKTNCSEIWEILH 442
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 112/387 (28%), Positives = 169/387 (43%), Gaps = 50/387 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKNI- 66
YFA + +G PPK + DTGSD+ WV C C K P K Y P +
Sbjct: 82 YFA-KIGLGNPPKDYYVQVDTGSDILWVNC----ANCDKCPTKSDLGVKLTLYDPQSSTS 136
Query: 67 ---VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG--- 120
+ C + CAA + C + C Y + YGDG S+ G V D G
Sbjct: 137 ATRIYCDDDFCAATYNGVLQGCT-KDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQ 195
Query: 121 -SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
S N + FGCG Q S G+LG G+ S++SQL G ++ V HC+
Sbjct: 196 TSSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCL-D 254
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG------PAELLYSGKSCGLK 229
N +G G+V S V TPM+ N + +K +G P ++ +G G
Sbjct: 255 NVKGGGIFAIGEVVSPKVNTTPMVPNQPHYNVVMKEIEVGGNVLELPTDIFDTGDRRG-- 312
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
I DSG + AY VY+ +++ I+ + G L + T F+ G V
Sbjct: 313 ---TIIDSGTTLAYLPEVVYESMMTKIVSEQPGLKLHTVEEQFTC-------FQYTGNVN 362
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILN-GSEAEVGEN-NIIGEIFM 347
E F + F S+ L V P YL + C G N G +++ G + ++G++ +
Sbjct: 363 EGFPVVKFHF---NGSLSLTVNPHDYLFQIHEEVWCFGWQNSGMQSKDGRDMTLLGDLVL 419
Query: 348 QDKMVIYDNEKQRIGWKPEDCNTLLSL 374
+K+V+YD E Q IGW +C++ + +
Sbjct: 420 SNKLVLYDLENQAIGWTDYNCSSSIKV 446
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 111/384 (28%), Positives = 173/384 (45%), Gaps = 57/384 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYK---------PHKNIV 67
+ + +G PP+ F+ DTGSD+ WV C + C GC K E Q + ++V
Sbjct: 84 YYTKVKLGTPPREFNVQIDTGSDVLWVSCTS-CNGCPKTSELQIQLSFFDPGVSSSASLV 142
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV-- 125
CS+ RC + ++ C PN+ C Y +YGDG + G ++D S +
Sbjct: 143 SCSDRRCYS-NFQTESGCS-PNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINS 200
Query: 126 --PLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQ 179
P FGC Q G L P A G+ GLG+G +S++SQL GL V HC+ +
Sbjct: 201 SAPFVFGCSNLQ--TGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDK 258
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK---------SCGLKD 230
+G G++ LG K P + +TP++ + HY + + +G+ + D
Sbjct: 259 SGGGIMVLGQIKRPDT--VYTPLVPSQ---PHYNVNLQSIAVNGQILPIDPSVFTIATGD 313
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQV 288
T+I D+G + AY Y + I A PI + F+
Sbjct: 314 GTII-DTGTTLAYLPDEAYSPFIQAIAN---------AVSQYGRPITYESYQCFEITAGD 363
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVI---SGRKNVCLGILNGSEAEVGENNIIGEI 345
+ F ++LSF +V+ P AYL I SG C+G S + I+G++
Sbjct: 364 VDVFPEVSLSFA---GGASMVLRPHAYLQIFSSSGSSIWCIGFQRMSHRRI---TILGDL 417
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCN 369
++DK+V+YD +QRIGW DC+
Sbjct: 418 VLKDKVVVYDLVRQRIGWAEYDCS 441
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 107/393 (27%), Positives = 176/393 (44%), Gaps = 56/393 (14%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC-----TKPPEKQYKPHKN 65
F + YF + +G PPK F DTGSD+ WV C + C GC + P + P +
Sbjct: 79 FLVGLYFT-RVQLGSPPKDFYVQIDTGSDVLWVSCSS-CNGCPVTSGLQIPLTFFDPGSS 136
Query: 66 ----IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLF---PLRFS 118
+V CS+ RC A + C +QC Y +YGDG + G V DL L S
Sbjct: 137 TTAALVSCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLS 196
Query: 119 NGSV------FNVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIR 170
+G + ++ ++F C Q G L+ D A G+ G G+ +S++SQL G+
Sbjct: 197 SGELSQICQTYDSSVSFMCSTLQ--TGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITP 254
Query: 171 NVIGHCI--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL 228
V HC+ +G GVL LG+ P+ + +TP++ + HY L + +G++ +
Sbjct: 255 RVFSHCLKGDDSGGGVLVLGEIVEPN--IVYTPLVPSQ---PHYNLYLQSISVAGQTLAI 309
Query: 229 --------KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG 280
+ I DSG + AY Y VS I ++ + +T
Sbjct: 310 DPSVFGASSNQGTIVDSGTTLAYLAEGAYDPFVSAITS-------VVSLNARTYLSKGNQ 362
Query: 281 PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEV 336
+ V + F ++L+F L++ P+ YL+ + G C+G ++
Sbjct: 363 CYLVTSSVNDVFPQVSLNFA---GGASLILNPQDYLLQQNSVGGAAVWCVGFQKTPGQQI 419
Query: 337 GENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
I+G++ ++DK+ +YD QR+GW DC+
Sbjct: 420 ---TILGDLVLKDKIFVYDIANQRVGWTNYDCS 449
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 105/381 (27%), Positives = 169/381 (44%), Gaps = 40/381 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN--IVPC 69
+ + +G PPK + DTGSD+ WV C C GC QY P + V C
Sbjct: 84 YYTRIEIGSPPKGYYVQVDTGSDILWVNC-IRCDGCPTRSGLGIELTQYDPAGSGTTVGC 142
Query: 70 SNPRCAALHWPN-PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SVFN 124
C A PP C + C + I YGDG ++ G VTD +G + N
Sbjct: 143 EQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQTTTSN 202
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGRG 183
+TFGCG S G+LG G+ S++SQL +R + HC+ G G
Sbjct: 203 ASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGG 262
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--------IF 235
+ +G+ P V TP++ N + HY + + G + L T I
Sbjct: 263 IFAIGNVVQPK--VKTTPLVPN---VTHYNVNLQGISVGGATLQLPTSTFDSGDSKGTII 317
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + AY VY+ +++ + PL D +C F+ G + + F +
Sbjct: 318 DSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQD----FVC----FQFSGSIDDGFPVI 369
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNII-GEIFMQDKMVI 353
SF + + L V P+ YL + C+G L+G + + G++ ++ G++ + +K+V+
Sbjct: 370 TFSF---KGDLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLSNKLVV 426
Query: 354 YDNEKQRIGWKPEDCNTLLSL 374
YD EK+ IGW +C++ + +
Sbjct: 427 YDLEKEVIGWTDYNCSSSIKI 447
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 109/395 (27%), Positives = 176/395 (44%), Gaps = 58/395 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNIV 67
+ L +G PP+ F DTGSD+ WV C + C GC + P +++
Sbjct: 52 YYTRLQLGTPPRDFYVQIDTGSDVLWVSCGS-CNGCPVNSGLHIPLNFFDPGSSPTASLI 110
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN---GSVFN 124
CS+ RC+ + C N+ C Y +YGDG + G V+DL L F GSV N
Sbjct: 111 SCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDL--LHFDTVLGGSVMN 168
Query: 125 ---VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI-- 177
P+ FGC Q G L+ D A G+ G G+ +S+VSQL G+ HC+
Sbjct: 169 NSSAPIVFGCSALQ--TGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKG 226
Query: 178 GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL--------K 229
+G G+L LG+ P+ + +TP++ + HY L + +G++ +
Sbjct: 227 DDSGGGILVLGEIVEPN--IVYTPLVPSQ---PHYNLNMQSISVNGQTLAIDPSVFGTSS 281
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL--GQ 287
I DSG + AY Y +S I I +P P +G L
Sbjct: 282 SQGTIIDSGTTLAYLAEAAYDPFISAITS--IVSP-------SVRPYLSKGNHCYLISSS 332
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNIIG 343
+ + F ++L+F +++ P+ YL+ I G C+G + + I+G
Sbjct: 333 INDIFPQVSLNFA---GGASMILIPQDYLIQQSSIGGAALWCIGF---QKIQGQGITILG 386
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDCNTLLSLNHFI 378
++ ++DK+ +YD QRIGW DC+ ++++ I
Sbjct: 387 DLVLKDKIFVYDIANQRIGWANYDCSMSVNVSTAI 421
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis sativus]
Length = 478
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 108/387 (27%), Positives = 173/387 (44%), Gaps = 50/387 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN----I 66
Y+A + +G PP F DTGSD+ WV C C+ C K + + Y P + +
Sbjct: 73 YYA-RIGIGSPPNDFHVQVDTGSDILWVNC-VGCSNCPKKSDIGVDLQLYNPKSSSTSTL 130
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
+ C P C+A + P CK P+ C Y++ YGDG ++ G V D L+ + G S
Sbjct: 131 ITCDQPFCSATYDAPIPGCK-PDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSE 189
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
N + FGCG Q S G+LG G+ S++SQL G ++ + HC+
Sbjct: 190 TNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISG 249
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--------- 233
G +F G+V + TP++ N A HY ++ +G G L L
Sbjct: 250 GGIF-AIGEVVEPKLKTTPVVPNQA---HY-----NVVLNGVKVGDTALDLPLGLFETSY 300
Query: 234 ----IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
I DSG + AY +Y ++ I+ L+ D T + + V
Sbjct: 301 KRGAIIDSGTTLAYLPDSIYLPLMEKILGAQPDLKLRTVDDQFTCFVFDK-------NVD 353
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILN-GSEAEVG-ENNIIGEIFM 347
+ F + F S+ L + P YL C+G N G++++ G E ++G++ +
Sbjct: 354 DGFPTVTFKF---EESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVL 410
Query: 348 QDKMVIYDNEKQRIGWKPEDCNTLLSL 374
Q+K+V Y+ E Q IGW +C++ + L
Sbjct: 411 QNKLVYYNLENQTIGWTEYNCSSGIKL 437
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 110/384 (28%), Positives = 173/384 (45%), Gaps = 57/384 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYK---------PHKNIV 67
+ + +G PP+ F+ DTGSD+ WV C + C GC K E Q + ++V
Sbjct: 84 YYTKVKLGTPPREFNVQIDTGSDVLWVSCTS-CNGCPKTSELQIQLSFFDPGVSSSASLV 142
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV-- 125
CS+ RC + ++ C PN+ C Y +YGDG + G ++D S +
Sbjct: 143 SCSDRRCYS-NFQTESGCS-PNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINS 200
Query: 126 --PLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQ 179
P FGC Q G L P A G+ GLG+G +S++SQL GL V HC+ +
Sbjct: 201 SAPFVFGCSNLQS--GDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDK 258
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK---------SCGLKD 230
+G G++ LG K P + +TP++ + HY + + +G+ + D
Sbjct: 259 SGGGIMVLGQIKRPDT--VYTPLVPSQ---PHYNVNLQSIAVNGQILPIDPSVFTIATGD 313
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQV 288
T+I D+G + AY Y + + A PI + F+
Sbjct: 314 GTII-DTGTTLAYLPDEAYSPFIQAVAN---------AVSQYGRPITYESYQCFEITAGD 363
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVI---SGRKNVCLGILNGSEAEVGENNIIGEI 345
+ F ++LSF +V+ P AYL I SG C+G S + I+G++
Sbjct: 364 VDVFPQVSLSFA---GGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRI---TILGDL 417
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCN 369
++DK+V+YD +QRIGW DC+
Sbjct: 418 VLKDKVVVYDLVRQRIGWAEYDCS 441
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 105/388 (27%), Positives = 171/388 (44%), Gaps = 54/388 (13%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-----YKPHKNI- 66
+ + + +G PP+ F DTGSD+ WV C A C GC + Q + P ++
Sbjct: 77 VVGLYYTKIRLGSPPRDFYVQVDTGSDVLWVSC-ASCNGCPQTSGLQIQLNFFDPGSSVT 135
Query: 67 ---VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
V CS+ RC+ + C N+ C Y +YGDG + G V+D+ GS
Sbjct: 136 ATPVSCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 124 ----NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
P+ FGC +Q G L D A G+ G G+ +S++SQL GL V HC+
Sbjct: 196 VPNSTAPVVFGCSTSQ--TGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCL 253
Query: 178 -GQN-GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-- 233
G+N G G+L LG+ P+ + +TP++ + HY + + +G++ +
Sbjct: 254 KGENGGGGILVLGEIVEPN--MVFTPLVPSQ---PHYNVNLLSISVNGQALPINPSVFST 308
Query: 234 ------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKAL 285
I D+G + AY + Y V I A P+ +G +
Sbjct: 309 SNGQGTIIDTGTTLAYLSEAAYVPFVEAITN---------AVSQSVRPVVSKGNQCYVIA 359
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNI 341
V + F P++L+F + + P+ YL+ + G C+G + I
Sbjct: 360 TSVADIFPPVSLNFA---GGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGI---TI 413
Query: 342 IGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+G++ ++DK+ +YD QRIGW DC+
Sbjct: 414 LGDLVLKDKIFVYDLVGQRIGWANYDCS 441
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 112/384 (29%), Positives = 169/384 (44%), Gaps = 48/384 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI--------- 66
YF + +G PPK + DTGSD+ WV C PC C P + H ++
Sbjct: 74 YFT-KIKLGSPPKEYHVQVDTGSDILWVNC-KPCPEC--PSKTNLNFHLSLFDVNASSTS 129
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
V C + C+ + + C+ P C Y I Y D +S G + D L G +
Sbjct: 130 KKVGCDDDFCSFISQSD--SCQ-PAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDLQT 186
Query: 125 VPL----TFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
PL FGCG +Q G L D+A GV+G G+ S++SQL G + V HC+
Sbjct: 187 GPLGQEVVFGCGSDQ--SGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL- 243
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTL 233
N +G G V S V TPM+ N HY + + G + L ++
Sbjct: 244 DNVKGGGIFAVGVVDSPKVKTTPMVPNQM---HYNVMLMGMDVDGTALDLPPSIMRNGGT 300
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
I DSG + AYF +Y ++ I L P+KL + T F V F
Sbjct: 301 IVDSGTTLAYFPKVLYDSLIETI---LARQPVKLHIVEDTFQC-----FSFSENVDVAFP 352
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNG--SEAEVGENNIIGEIFMQDKM 351
P++ F +SV+L V P YL ++ C G G + E E ++G++ + +K+
Sbjct: 353 PVSFEF---EDSVKLTVYPHDYLFTLEKELYCFGWQAGGLTTGERTEVILLGDLVLSNKL 409
Query: 352 VIYDNEKQRIGWKPEDCNTLLSLN 375
V+YD E + IGW +C++ + +
Sbjct: 410 VVYDLENEVIGWADHNCSSSIKIK 433
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein 2-like [Cucumis sativus]
Length = 478
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 110/390 (28%), Positives = 172/390 (44%), Gaps = 56/390 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKN-- 65
Y+A + +G PP F DTGSD+ WV C GC+ P+K Y P +
Sbjct: 73 YYA-RIGIGSPPNDFHVQVDTGSDILWVNC----VGCSNCPKKSDIGVDLQLYNPKSSST 127
Query: 66 --IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG--- 120
++ C P C+A + P CK P+ C Y++ YGDG ++ G V D L+ + G
Sbjct: 128 STLITCDQPFCSATYDAPIPGCK-PDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHK 186
Query: 121 -SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
S N + FGCG Q S G+LG G+ S++SQL G ++ + HC+
Sbjct: 187 TSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDS 246
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------ 233
G +F G+V + TP++ N A HY ++ +G G L L
Sbjct: 247 ISGGGIF-AIGEVVEPKLXNTPVVPNQA---HY-----NVVLNGVKVGDTALDLPLGLFE 297
Query: 234 -------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
I DSG + AY +Y ++ I+ L+ D T F
Sbjct: 298 TSYKRGAIIDSGTTLAYLPESIYLPLMEKILGAQPDLKLRTVDDQFTC-------FVFDK 350
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILN-GSEAEVG-ENNIIGE 344
V + F + F S+ L + P YL C+G N G++++ G E ++G+
Sbjct: 351 NVDDGFPTVTFKF---EESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGD 407
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCNTLLSL 374
+ +Q+K+V Y+ E Q IGW +C++ + L
Sbjct: 408 LVLQNKLVYYNLENQTIGWTEYNCSSGIKL 437
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 105/381 (27%), Positives = 168/381 (44%), Gaps = 40/381 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN--IVPC 69
+ + +G PPK + DTGSD+ WV C C GC QY P + V C
Sbjct: 84 YYTRIEIGSPPKGYYVQVDTGSDILWVNC-IRCDGCPTRSGLGIELTQYDPAGSGTTVGC 142
Query: 70 SNPRCAALHWPN-PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SVFN 124
C A PP C + C + I YGDG ++ G VTD +G + N
Sbjct: 143 EQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQTTTSN 202
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGRG 183
+TFGCG S G+LG G+ S++SQL +R + HC+ G G
Sbjct: 203 ASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGG 262
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--------IF 235
+ +G+ P V TP++ N + HY + + G + L T I
Sbjct: 263 IFAIGNVVQPK--VKTTPLVPN---VTHYNVNLQGISVGGATLQLPTSTFDSGDSKGTII 317
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + AY VY+ +++ + PL D +C F+ G + + F +
Sbjct: 318 DSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQD----FVC----FQFSGSIDDGFPVI 369
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNII-GEIFMQDKMVI 353
SF + L V P+ YL + C+G L+G + + G++ ++ G++ + +K+V+
Sbjct: 370 TFSF---EGDLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLSNKLVV 426
Query: 354 YDNEKQRIGWKPEDCNTLLSL 374
YD EK+ IGW +C++ + +
Sbjct: 427 YDLEKEVIGWTDYNCSSSIKI 447
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 113/376 (30%), Positives = 159/376 (42%), Gaps = 42/376 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK----QYKPH-------K 64
YFA + +G P + F DTGSD+ WV C GC + P K + P+
Sbjct: 85 YFA-KIGLGTPSRDFHVQVDTGSDILWVNC----AGCIRCPRKSDLVELTPYDVDASSTA 139
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS--- 121
V CS+ C+ + N H C Y I YGDG S+ G LV D+ L G+
Sbjct: 140 KSVSCSDNFCS---YVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQT 196
Query: 122 -VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
N + FGCG Q S G++G G+ S +SQL G ++ HC+ N
Sbjct: 197 GSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNN 256
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSA----DLKHYILGPAEL-LYSGKSCGLKDLTLIF 235
G +F G+V S V TPML SA +L +G + L L S D +I
Sbjct: 257 NGGGIF-AIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVII 315
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + Y VY +++ I+ L + T C+ K + F +
Sbjct: 316 DSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESFT---CFHYTDK-----LDRFPTV 367
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENN--IIGEIFMQDKMVI 353
F SV L V P YL C G NG G + I+G++ + +K+V+
Sbjct: 368 TFQF---DKSVSLAVYPREYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVV 424
Query: 354 YDNEKQRIGWKPEDCN 369
YD E Q IGW +C+
Sbjct: 425 YDIENQVIGWTNHNCS 440
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis sativus]
Length = 492
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 109/391 (27%), Positives = 176/391 (45%), Gaps = 56/391 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
YF + +G PP F DTGSD+ WV C++ C GC + + ++
Sbjct: 79 YFT-KVKLGTPPMEFTVQIDTGSDILWVNCNS-CNGCPRSSGLGIQLNFFDASSSSSSSL 136
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTD--LFPLRFSNGSVFN 124
V CS+P C + +C ++QC Y +YGDG + G V++ F + + N
Sbjct: 137 VSCSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVMGQSMIAN 196
Query: 125 --VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQ 179
+ FGC + + G L+ D A G+ G G G +S++SQL G+ V HC+ G+
Sbjct: 197 SSASVVFGC--STYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKGE 254
Query: 180 -NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK--------D 230
NG G+L LG+ P G+ ++P++ + HY L + +G++ + +
Sbjct: 255 GNGGGILVLGEVLEP--GIVYSPLVPSQ---PHYNLYLQSISVNGQTLPIDPSVFATSIN 309
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQV 288
I DSG + AY Y VS I A P +G + V
Sbjct: 310 RGTIIDSGTTLAYLVEEAYTPFVSAITA---------AVSQSVTPTISKGNQCYLVSTSV 360
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNIIGE 344
E F ++L+F S +V+ PE YL+ G C+G E I+G+
Sbjct: 361 GEIFPLVSLNFA---GSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGV----TILGD 413
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
+ M+DK+ +YD +QRIGW DC+ ++++
Sbjct: 414 LVMKDKIFVYDLARQRIGWASYDCSQAVNVS 444
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 108/390 (27%), Positives = 166/390 (42%), Gaps = 55/390 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
YF + +G P K F DTGSD+ W+ C C+ C + +
Sbjct: 83 YF-TKVKLGSPAKEFYVQIDTGSDILWINC-ITCSNCPHSSGLGIELDFFDTAGSSTAAL 140
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLF---PLRFSNGSVF 123
V C +P C+ C +QC Y +YGDG + G V+D + V
Sbjct: 141 VSCGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVA 200
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQ 179
N T G + + G L+ D A G+ G G G +S++SQL G+ V HC+ G+
Sbjct: 201 NSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGE 260
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKS--------CGLKDL 231
NG GVL LG+ PS + ++P++ + HY L + +G+ +
Sbjct: 261 NGGGVLVLGEILEPS--IVYSPLVPSQ---PHYNLNLQSIAVNGQLLPIDSNVFATTNNQ 315
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVT 289
I DSG + AY Y V I A + PI +G + V
Sbjct: 316 GTIVDSGTTLAYLVQEAYNPFVKAITA---------AVSQFSKPIISKGNQCYLVSNSVG 366
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNIIGEI 345
+ F ++L+F +V+ PE YL+ + G C+G + E G I+G++
Sbjct: 367 DIFPQVSLNF---MGGASMVLNPEHYLMHYGFLDGAAMWCIGF---QKVEQGFT-ILGDL 419
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
++DK+ +YD QRIGW DC+ LS+N
Sbjct: 420 VLKDKIFVYDLANQRIGWADYDCS--LSVN 447
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 106/387 (27%), Positives = 173/387 (44%), Gaps = 52/387 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKNI-- 66
+ + +G P K + DTGSD+ WV C C + P K Y P +
Sbjct: 4 YYTEIGIGTPTKRYYVQVDTGSDILWVNC----ISCDRCPRKSGLGLELTLYDPKDSSTG 59
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV-- 122
V C CAA + P C + C+Y + YGDG S+ G V+DL +G
Sbjct: 60 SKVSCDQGFCAATYGGLLPGCT-TSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQT 118
Query: 123 --FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N +TFGCG Q S G++G G+ S++SQL G ++ + HC+
Sbjct: 119 RPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTI 178
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG------PAELLYSGKSCGLK 229
NG G+ +G+ P V TP++ N + +LK +G P+ + +G+ G
Sbjct: 179 NGGGIFAIGNVVQPK--VKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKG-- 234
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
I DSG + Y VY+E IM + + + +C F+ +G+V
Sbjct: 235 ---TIIDSGTTLTYLPEIVYKE----IMLAVFAKHKDITFHNVQEFLC----FQYVGRVD 283
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNI-IGEIFM 347
+ F + F N + L V P Y +G C+G NG +++ G+ + +G++ +
Sbjct: 284 DDFPKITFHF---ENDLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVL 340
Query: 348 QDKMVIYDNEKQRIGWKPEDCNTLLSL 374
+K+V+YD E Q IGW +C++ + +
Sbjct: 341 SNKLVVYDLENQVIGWTEYNCSSSIKI 367
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 110/385 (28%), Positives = 167/385 (43%), Gaps = 44/385 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN--IVPC 69
+ + +G PPK + DTGSD+ WV C GC QY P + V C
Sbjct: 85 YYTRIEIGSPPKGYYVQVDTGSDILWVN-GISCDGCPTRSGLGIELTQYDPAGSGTTVGC 143
Query: 70 SNPRCAALHWPN--PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SVF 123
C A + PP C C + I YGDG S+ G VTD +G +
Sbjct: 144 EQEFCVANSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQYNQVSGNGQTTPS 203
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGR 182
NV +TFGCG S G+LG G+ S++SQL +R + HC+ G
Sbjct: 204 NVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCLDTVRGG 263
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILG----------PAELLYSGKSCGLKDLT 232
G+ +G+ P V TP++ N+ + G P SG S G
Sbjct: 264 GIFAIGNVVQPPI-VKTTPLVPNATHYNVNLQGISVGGATLQLPTSTFDSGDSKGT---- 318
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
I DSG + AY VY+ +++ + LA + IC F+ G + E F
Sbjct: 319 -IIDSGTTLAYLPREVYRTLLTAVFDK----HPDLAVRNYEDFIC----FQFSGSLDEEF 369
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNII-GEIFMQDK 350
+ SF + L V P YL +G C+G L+G + + G++ ++ G++ + +K
Sbjct: 370 PVITFSF---EGDLTLNVYPHDYLFQNGNDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNK 426
Query: 351 MVIYDNEKQRIGWKPEDCNTLLSLN 375
+V+YD EKQ IGW +C++ + +
Sbjct: 427 LVVYDLEKQVIGWTDYNCSSSIKIE 451
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family [Arabidopsis thaliana]
Length = 449
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 109/370 (29%), Positives = 162/370 (43%), Gaps = 44/370 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYK---------PHKNI 66
YF + +G PPK + DTGSD+ W+ C PC C ++
Sbjct: 74 YFT-KIKLGSPPKEYHVQVDTGSDILWINC-KPCPKCPTKTNLNFRLSLFDMNASSTSKK 131
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
V C + C+ + + C+ P C Y I Y D +S G + D+ L G + P
Sbjct: 132 VGCDDDFCSFISQSDS--CQ-PALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGP 188
Query: 127 L----TFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
L FGCG +Q G L D+A GV+G G+ S++SQL G + V HC+ N
Sbjct: 189 LGQEVVFGCGSDQ--SGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL-DN 245
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIF 235
+G G V S V TPM+ N HY + + G S L ++ I
Sbjct: 246 VKGGGIFAVGVVDSPKVKTTPMVPNQM---HYNVMLMGMDVDGTSLDLPRSIVRNGGTIV 302
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + AYF +Y ++ I L P+KL ++T F V E F P+
Sbjct: 303 DSGTTLAYFPKVLYDSLIETI---LARQPVKLHIVEETFQC-----FSFSTNVDEAFPPV 354
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNG--SEAEVGENNIIGEIFMQDKMVI 353
+ F +SV+L V P YL + C G G + E E ++G++ + +K+V+
Sbjct: 355 SFEF---EDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVV 411
Query: 354 YDNEKQRIGW 363
YD + + IGW
Sbjct: 412 YDLDNEVIGW 421
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 106/387 (27%), Positives = 173/387 (44%), Gaps = 52/387 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKNI-- 66
+ + +G P K + DTGSD+ WV C C + P K Y P +
Sbjct: 89 YYTEIGIGTPTKRYYVQVDTGSDILWVNC----ISCDRCPRKSGLGLELTLYDPKDSSTG 144
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV-- 122
V C CAA + P C + C+Y + YGDG S+ G V+DL +G
Sbjct: 145 SKVSCDQGFCAATYGGLLPGCT-TSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQT 203
Query: 123 --FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N +TFGCG Q S G++G G+ S++SQL G ++ + HC+
Sbjct: 204 RPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTI 263
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG------PAELLYSGKSCGLK 229
NG G+ +G+ P V TP++ N + +LK +G P+ + +G+ G
Sbjct: 264 NGGGIFAIGNVVQPK--VKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKG-- 319
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
I DSG + Y VY+E IM + + + +C F+ +G+V
Sbjct: 320 ---TIIDSGTTLTYLPEIVYKE----IMLAVFAKHKDITFHNVQEFLC----FQYVGRVD 368
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNI-IGEIFM 347
+ F + F N + L V P Y +G C+G NG +++ G+ + +G++ +
Sbjct: 369 DDFPKITFHF---ENDLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVL 425
Query: 348 QDKMVIYDNEKQRIGWKPEDCNTLLSL 374
+K+V+YD E Q IGW +C++ + +
Sbjct: 426 SNKLVVYDLENQVIGWTEYNCSSSIKI 452
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 101/373 (27%), Positives = 168/373 (45%), Gaps = 44/373 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C K + +++P + + C N
Sbjct: 75 YYTTRLWIGTPPQEFALIVDTGSTVTYVPC-STCKQCGKHQDPKFQPELSTSYQALKC-N 132
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
P C C C YE Y + SS G L DL + F N S + FG
Sbjct: 133 PDC---------NCDDEGKLCVYERRYAEMSSSSGVLSEDL--ISFGNESQLSPQRAVFG 181
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C G L G++GLGRG++S+V QL + G+I +V C G + G G + LG
Sbjct: 182 C--ENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLG 239
Query: 189 DGKVPSSGVAWTPMLQNSADLK--HYILGPAELLYSGKSCGLKDLTL------IFDSGAS 240
P V +S + +Y + ++ +GKS L + DSG +
Sbjct: 240 KISPPPGMV-----FSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTT 294
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
YAYF + I +++++ PD +C+ G + + ++ +F +A+ F
Sbjct: 295 YAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFG 354
Query: 301 NRRNSVRLVVPPEAYLV--ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
N + +L++ PE YL R CLGI ++ ++G I +++ +V YD E
Sbjct: 355 NGQ---KLILSPENYLFRHTKVRGAYCLGIFPDRDS----TTLLGGIVVRNTLVTYDREN 407
Query: 359 QRIGWKPEDCNTL 371
++G+ +C+ +
Sbjct: 408 DKLGFLKTNCSDI 420
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 127 bits (319), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 101/373 (27%), Positives = 168/373 (45%), Gaps = 44/373 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C K + +++P + + C N
Sbjct: 75 YYTTRLWIGTPPQEFALIVDTGSTVTYVPC-STCKQCGKHQDPKFQPELSTSYQALKC-N 132
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
P C C C YE Y + SS G L DL + F N S + FG
Sbjct: 133 PDC---------NCDDEGKLCVYERRYAEMSSSSGVLSEDL--ISFGNESQLSPQRAVFG 181
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C G L G++GLGRG++S+V QL + G+I +V C G + G G + LG
Sbjct: 182 C--ENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLG 239
Query: 189 DGKVPSSGVAWTPMLQNSADLK--HYILGPAELLYSGKSCGLKDLTL------IFDSGAS 240
P V +S + +Y + ++ +GKS L + DSG +
Sbjct: 240 KISPPPGMV-----FSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTT 294
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
YAYF + I +++++ PD +C+ G + + ++ +F +A+ F
Sbjct: 295 YAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFG 354
Query: 301 NRRNSVRLVVPPEAYLV--ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
N + +L++ PE YL R CLGI ++ ++G I +++ +V YD E
Sbjct: 355 NGQ---KLILSPENYLFRHTKVRGAYCLGIFPDRDS----TTLLGGIVVRNTLVTYDREN 407
Query: 359 QRIGWKPEDCNTL 371
++G+ +C+ +
Sbjct: 408 DKLGFLKTNCSDI 420
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 94/367 (25%), Positives = 159/367 (43%), Gaps = 31/367 (8%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
SYF L +G P + F DTGS +T++ C C+ C K + + P K+ + C
Sbjct: 11 SYFYTTLKLGTPERTFSVIIDTGSTITYIPC-KDCSHCGKHTAEWFDPDKSTTAKKLACG 69
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+P C P C ND+C Y Y + SS G ++ D F S+ V L FG
Sbjct: 70 DPLCNC----GTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPV---RLVFG 122
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
C G + G++G+G + SQL + +I +V C G G+L LGD
Sbjct: 123 C--ENGETGEIYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYPKDGILLLGDV 180
Query: 191 KVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL------KDLTLIFDSGASYAY 243
+P + +TP+L + L +Y + + +G++ + + DSG ++ Y
Sbjct: 181 TLPEGANTVYTPLLTH-LHLHYYNVKMDGITVNGQTLAFDASVFDRGYGTVLDSGTTFTY 239
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAP--DDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
+ ++ + + + L+ P D + ICW+G + +YF P F
Sbjct: 240 LPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAPDQFKDLDKYFPPAEFVFG- 298
Query: 302 RRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRI 361
+L +PP YL +S CLGI + + ++G + ++D +V YD ++
Sbjct: 299 --GGAKLTLPPLRYLFLSKPAEYCLGIFDNGNS----GALVGGVSVRDVVVTYDRRNSKV 352
Query: 362 GWKPEDC 368
G+ C
Sbjct: 353 GFTTMAC 359
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 104/387 (26%), Positives = 169/387 (43%), Gaps = 52/387 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKN--- 65
+ + +G PPK F DTGSD+ WV C C + P K Y P +
Sbjct: 88 YYTEVRLGTPPKRFYVQVDTGSDILWVNC----ITCDQCPHKSGLGLDLTLYDPKASSTG 143
Query: 66 -IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV-- 122
V C CA P+C N C+Y + YGDG S++G+ V D G
Sbjct: 144 STVMCDQGFCADTFGGRLPKCS-ANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGDGQT 202
Query: 123 --FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N + FGCG Q S G+LG G S++SQL G ++ + HC+
Sbjct: 203 QPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLDTI 262
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG------PAELLYSGKSCGLK 229
G G+ +GD P V TP++ + + +LK +G PA++ G+ G
Sbjct: 263 KGGGIFAIGDVVQPK--VKTTPLVADKPHYNVNLKTIDVGGTTLELPADIFKPGEKRG-- 318
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
I DSG + Y V+++ +M + + D +C F+ G V
Sbjct: 319 ---TIIDSGTTLTYLPELVFKK----VMLAVFNKHQDITFHDVQDFLC----FEYSGSVD 367
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNII-GEIFM 347
+ F L F + + L V P Y +G C+G NG+ +++ G++ ++ G++ +
Sbjct: 368 DGFPTLTFHF---EDDLALHVYPHEYFFPNGNDVYCVGFQNGALQSKDGKDIVLMGDLVL 424
Query: 348 QDKMVIYDNEKQRIGWKPEDCNTLLSL 374
+K+V+YD E + IGW +C++ + +
Sbjct: 425 SNKLVVYDLENRVIGWTDYNCSSSIKI 451
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 112/388 (28%), Positives = 167/388 (43%), Gaps = 54/388 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKNI- 66
YF + +G P K + DTGSD+ WV C C P K Y P +
Sbjct: 81 YF-TQIGIGTPAKSYYVQVDTGSDILWVNC----VFCDTCPRKSGLGIELTLYDPSGSSS 135
Query: 67 ---VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG--- 120
V C C A H P C P C Y I YGDG S+ G VTD +G
Sbjct: 136 GTGVTCGQDFCVATHGGVIPSCV-PAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQ 194
Query: 121 -SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
++ N +TFGCG S G+LG G+ S++SQL G +R V HC+
Sbjct: 195 TTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCLDT 254
Query: 180 -NGRGVLFLGDGKVPSSGVAWTPML----QNSADLKHYILG------PAELLYSGKSCGL 228
NG G+ +GD P V+ TP++ + +L+ +G P + G+S G
Sbjct: 255 INGGGIFAIGDVVQPK--VSTTPLVPGMPHYNVNLEAIDVGGVKLQLPTNIFDIGESKG- 311
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
I DSG + AY VY I+S + PLK D + F+ G V
Sbjct: 312 ----TIIDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKNDQDFQC--------FRYSGSV 359
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNI-IGEIF 346
+ F + F + L + P YL +G C+G G + + G++ + +G++
Sbjct: 360 DDGFPIITFHF---EGGLPLNIHPHDYLFQNGEL-YCMGFQTGGLQTKDGKDMVLLGDLA 415
Query: 347 MQDKMVIYDNEKQRIGWKPEDCNTLLSL 374
+++V+YD E Q IGW +C++ + +
Sbjct: 416 FSNRLVLYDLENQVIGWTDYNCSSSIKI 443
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 167/373 (44%), Gaps = 44/373 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C K + +++P + + C N
Sbjct: 79 YYTTRLWIGTPPQEFALIVDTGSTVTYVPC-STCKQCGKHQDPKFQPELSSSYKALKC-N 136
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
P C C C YE Y + SS G L DL + F N S FG
Sbjct: 137 PDC---------NCDDEGKLCVYERRYAEMSSSSGVLSEDL--ISFGNESQLTPQRAVFG 185
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C G L G++GLGRG++S+V QL + G+I +V C G + G G + LG
Sbjct: 186 C--ENVETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLG 243
Query: 189 DGKVPSSGVAWTPMLQNSADLK--HYILGPAELLYSGKSCGLKDLTL------IFDSGAS 240
P+ V +S + +Y + ++ +GKS L + DSG +
Sbjct: 244 KISPPAGMV-----FSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTT 298
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
YAYF + I I++++ PD +C+ G + + ++ +F + + F
Sbjct: 299 YAYFPKEAFIAIKDAIIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIDMEFG 358
Query: 301 NRRNSVRLVVPPEAYLV--ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
N + +L++ PE YL R CLGI ++ ++G I +++ +V YD E
Sbjct: 359 NGQ---KLILSPENYLFRHTKVRGAYCLGIFPDRDS----TTLLGGIVVRNTLVTYDREN 411
Query: 359 QRIGWKPEDCNTL 371
++G+ +C+ L
Sbjct: 412 DKLGFLKTNCSDL 424
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 101/385 (26%), Positives = 169/385 (43%), Gaps = 54/385 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----PEKQYKPHKNI----V 67
+ + +G PP+ F DTGSD+ WV C PCT C + P + P K+ +
Sbjct: 48 YYTRIYLGTPPQQFYVHVDTGSDVAWVNC-VPCTNCKRASNVALPISIFDPEKSTSKTSI 106
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLF-----PLRFSNGSV 122
C++ C + + +C + C Y YGDG S+ G L+ D+ P S +
Sbjct: 107 SCTDEEC---YLASNSKCSFNSMSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATS 163
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
LTFGCG NQ T G++G G+ +S+ SQL + + N+ HC+ + +
Sbjct: 164 GTARLTFGCGSNQTGTWL-----TDGLVGFGQAEVSLPSQLSKQNVSVNIFAHCLQGDNK 218
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK---DLT----LIF 235
G L G + G+ +TP++ + HY + + SG + DL+ +I
Sbjct: 219 GSGTLVIGHIREPGLVYTPIVPKQS---HYNVELLNIGVSGTNVTTPTAFDLSNSGGVIM 275
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + Y Y + + + RD + + + LP+ F+ + YF +
Sbjct: 276 DSGTTLTYLVQPAYDQFQAKV-RDCMRSGV--------LPVA----FQFFCTIEGYFPNV 322
Query: 296 ALSFTNRRNSVRLVVPPEAYL----VISGRKNVCLGILNGSEAE-VGENNIIGEIFMQDK 350
L F +++ P +YL + +G C L + I G+ ++D+
Sbjct: 323 TLYFA---GGAAMLLSPSSYLYKEMLTTGLSAYCFSWLESTSVYGYLSYTIFGDNVLKDQ 379
Query: 351 MVIYDNEKQRIGWKPEDCNTLLSLN 375
+V+YDN RIGWK DC +S++
Sbjct: 380 LVVYDNVNNRIGWKNFDCTKEISVS 404
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 113/384 (29%), Positives = 166/384 (43%), Gaps = 47/384 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE----------KQYKPHKN 65
YF + +G PPK + DTGSD+ WV C APC C + K KN
Sbjct: 78 YFT-KIKLGSPPKEYYVQVDTGSDILWVNC-APCPKCPVKTDLGIPLSLYDSKTSSTSKN 135
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV 125
V C + C+ + K P C Y + YGDG +S G + D L G++
Sbjct: 136 -VGCEDDFCSFIMQSETCGAKKP---CSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTA 191
Query: 126 PLT----FGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI-G 178
PL FGCG NQ G L D+A G++G G+ SI+SQL G + + HC+
Sbjct: 192 PLAQEVVFGCGKNQ--SGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDN 249
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILG------PAELLYSGKSCGLKDLT 232
NG G+ +G+ V S V TP++ N + G P +L S S D
Sbjct: 250 MNGGGIFAVGE--VESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTN-GDGG 306
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
I DSG + AY +Y SLI + +KL +T F + F
Sbjct: 307 TIIDSGTTLAYLPQNLYN---SLIEKITAKQQVKLHMVQETFAC-----FSFTSNTDKAF 358
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNII--GEIFMQDK 350
+ L F +S++L V P YL C G +G ++I G++ + +K
Sbjct: 359 PVVNLHF---EDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNK 415
Query: 351 MVIYDNEKQRIGWKPEDCNTLLSL 374
+V+YD E + IGW +C++ + +
Sbjct: 416 LVVYDLENEVIGWADHNCSSSIKV 439
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 113/384 (29%), Positives = 166/384 (43%), Gaps = 47/384 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE----------KQYKPHKN 65
YF + +G PPK + DTGSD+ WV C APC C + K KN
Sbjct: 74 YF-TKIKLGSPPKEYYVQVDTGSDILWVNC-APCPKCPVKTDLGIPLSLYDSKTSSTSKN 131
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV 125
V C + C+ + K P C Y + YGDG +S G + D L G++
Sbjct: 132 -VGCEDDFCSFIMQSETCGAKKP---CSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTA 187
Query: 126 PLT----FGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI-G 178
PL FGCG NQ G L D+A G++G G+ SI+SQL G + + HC+
Sbjct: 188 PLAQEVVFGCGKNQ--SGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDN 245
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILG------PAELLYSGKSCGLKDLT 232
NG G+ +G+ V S V TP++ N + G P +L S S D
Sbjct: 246 MNGGGIFAVGE--VESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTN-GDGG 302
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
I DSG + AY +Y SLI + +KL +T F + F
Sbjct: 303 TIIDSGTTLAYLPQNLYN---SLIEKITAKQQVKLHMVQETFAC-----FSFTSNTDKAF 354
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNII--GEIFMQDK 350
+ L F +S++L V P YL C G +G ++I G++ + +K
Sbjct: 355 PVVNLHF---EDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNK 411
Query: 351 MVIYDNEKQRIGWKPEDCNTLLSL 374
+V+YD E + IGW +C++ + +
Sbjct: 412 LVVYDLENEVIGWADHNCSSSIKV 435
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 108/388 (27%), Positives = 172/388 (44%), Gaps = 52/388 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
YF + +G PP+ F+ DTGSD+ WV C++ C C + + +
Sbjct: 66 YFT-KVKLGSPPREFNVQIDTGSDVLWVCCNS-CNNCPRTSGLGIQLNFFDSSSSSTAGL 123
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSVFN 124
V CS+P C + +C +QC Y +Y DG + G V+D F V N
Sbjct: 124 VHCSDPICTSAVQTTVTQCSPQTNQCSYTFQYEDGSGTSGYYVSDTLYFDAILGESLVVN 183
Query: 125 VP--LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
+ FGC Q ++ G+ G G+G +S++SQL +G+ V HC+ G
Sbjct: 184 SSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCLKGEGI 243
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--------I 234
G L G++ G+ ++P++ + HY L + +GK + I
Sbjct: 244 GGGILVLGEILEPGMVYSPLVPSQ---PHYNLNLQSIAVNGKLLPIDPSVFATSNSQGTI 300
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYF 292
DSG + AY + Y VS + ++I +P PI +G + V++ F
Sbjct: 301 VDSGTTLAYLVAEAYDPFVSAV--NVIVSP-------SVTPIISKGNQCYLVSTSVSQMF 351
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLV-----ISGRKNVCLGILNGSEAEVGENNIIGEIFM 347
PLA SF N +V+ PE YL+ G C+G +V I+G++ +
Sbjct: 352 -PLA-SF-NFAGGASMVLKPEDYLIPFGPSQGGSVMWCIGF-----QKVQGVTILGDLVL 403
Query: 348 QDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
+DK+ +YD +QRIGW DC+ LS+N
Sbjct: 404 KDKIFVYDLVRQRIGWANYDCS--LSVN 429
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 106/372 (28%), Positives = 166/372 (44%), Gaps = 41/372 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C K + +++P + V C N
Sbjct: 76 YYTTRLFIGTPPQEFALIVDTGSTVTYVPCSS-CEQCGKHQDPRFQPDLSSTYRPVKC-N 133
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
P C C QC YE Y + SS G + D+ + F N S FG
Sbjct: 134 PSC---------NCDDEGKQCTYERRYAEMSSSSGVIAEDV--VSFGNESELKPQRAVFG 182
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C G L G++GLGRGR+S+V QL + G+I + C G G G + LG
Sbjct: 183 C--ENVETGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMVLG 240
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYA 242
P + V N +Y + EL +GK LK + DSG +YA
Sbjct: 241 QISPPPNMVF---SHSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKHGTVLDSGTTYA 297
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNR 302
YF + + IM+++ PD IC+ G + + +++ F + + F +
Sbjct: 298 YFPEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVFGSG 357
Query: 303 RNSVRLVVPPEAYLVISGRKN--VCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
+ +L + PE YL + + CLGI NG++ ++G I +++ +V YD E
Sbjct: 358 Q---KLSLSPENYLFRHTKVSGAYCLGIFQNGNDL----TTLLGGIVVRNTLVTYDREND 410
Query: 360 RIGWKPEDCNTL 371
+IG+ +C+ L
Sbjct: 411 KIGFWKTNCSEL 422
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 105/390 (26%), Positives = 168/390 (43%), Gaps = 53/390 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
YF + +G P K F DTGSD+ W+ C C+ C + +
Sbjct: 83 YFT-KVKLGSPAKDFYVQIDTGSDILWINC-ITCSNCPHSSGLGIELDFFDTAGSSTAAL 140
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLF---PLRFSNGSVF 123
V C++P C+ C +QC Y +YGDG + G V+D + V
Sbjct: 141 VSCADPICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVA 200
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQ 179
N T G + + G L+ D A G+ G G G +S++SQL G+ V HC+ G+
Sbjct: 201 NSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGE 260
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKS--------CGLKDL 231
NG GVL LG+ PS + ++P++ + L HY L + +G+ +
Sbjct: 261 NGGGVLVLGEILEPS--IVYSPLVPS---LPHYNLNLQSIAVNGQLLPIDSNVFATTNNQ 315
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVT 289
I DSG + AY Y V I A + PI +G + V
Sbjct: 316 GTIVDSGTTLAYLVQEAYNPFVDAITA---------AVSQFSKPIISKGNQCYLVSNSVG 366
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV----CLGILNGSEAEVGENNIIGEI 345
+ F ++L+F +V+ PE YL+ G + C+G + E G I+G++
Sbjct: 367 DIFPQVSLNF---MGGASMVLNPEHYLMHYGFLDSAAMWCIGF---QKVERGF-TILGDL 419
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
++DK+ +YD QRIGW +C+ ++++
Sbjct: 420 VLKDKIFVYDLANQRIGWADYNCSLAVNVS 449
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 112/384 (29%), Positives = 167/384 (43%), Gaps = 47/384 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE----------KQYKPHKN 65
YF + +G PPK + DTGSD+ WV C APC C + K KN
Sbjct: 77 YFT-KIKLGSPPKEYYVQVDTGSDILWVNC-APCPKCPVKTDLGIPLSLYDSKASSTSKN 134
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV 125
V C + C+ + K P C Y + YGDG +S G V D L G++
Sbjct: 135 -VGCEDAFCSFIMQSETCGAKKP---CSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTA 190
Query: 126 PLT----FGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI-G 178
PL FGCG NQ G L ++A G++G G+ S++SQL G ++ + HC+
Sbjct: 191 PLAQEVVFGCGKNQ--SGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDN 248
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILG------PAELLYSGKSCGLKDLT 232
NG G+ +G+ V S V TP++ N + G P +L S S D
Sbjct: 249 MNGGGIFAIGE--VESPVVKTTPLVPNQVHYNVILKGMDVDGEPIDLPPSLASTN-GDGG 305
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
I DSG + AY +Y SLI + +KL +T F + F
Sbjct: 306 TIIDSGTTLAYLPQNLYN---SLIEKITAKQQVKLHMVQETFAC-----FSFTSNTDKAF 357
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNII--GEIFMQDK 350
+ L F +S++L V P YL C G +G ++I G++ + +K
Sbjct: 358 PVVNLHF---EDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNK 414
Query: 351 MVIYDNEKQRIGWKPEDCNTLLSL 374
+V+YD E + IGW +C++ + +
Sbjct: 415 LVVYDLENEVIGWADHNCSSSIKV 438
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 105/389 (26%), Positives = 171/389 (43%), Gaps = 53/389 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
YF + +G PP+ F+ DTGSD+ WV C++ C C + + ++
Sbjct: 86 YFT-KVKLGSPPREFNVQIDTGSDILWVTCNS-CNDCPRTSGLGIELSFFDPSSSSTTSL 143
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSVFN 124
V CS+P C +L C ++QC Y YGDG + G V+D+ F + + N
Sbjct: 144 VSCSHPICTSLVQTTAAECSPQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTVLGDSLIAN 203
Query: 125 --VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ FGC + + G L+ D A G+ G G+ +S+VSQL G+ V HC+
Sbjct: 204 SSASIVFGC--STYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCLKGE 261
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------- 233
G G L G++ + ++P++ + + HY L + +G+ +
Sbjct: 262 GDGGGKLVLGEILEPNIIYSPLVPSQS---HYNLNLQSISVNGQLLPIDPAVFATSNNQG 318
Query: 234 -IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTE 290
I DSG + Y Y VS I + T P+ +G + V E
Sbjct: 319 TIVDSGTTLTYLVETAYDPFVSAITATV---------SSSTTPVLSKGNQCYLVSTSVDE 369
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNIIGEIF 346
F P++L+F +V+ P YL+ G C+G +E + I+G++
Sbjct: 370 IFPPVSLNFAG---GASMVLKPGEYLMHLGFSDGAAMWCIGFQKVAEPGI---TILGDLV 423
Query: 347 MQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
++DK+ +YD QRIGW DC+ LS+N
Sbjct: 424 LKDKIFVYDLAHQRIGWANYDCS--LSVN 450
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 107/394 (27%), Positives = 168/394 (42%), Gaps = 64/394 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKN--- 65
+ + +G PPK F DTGSD+ WV C C K P K Y P +
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVNC----VSCDKCPTKSGLGIDLALYDPKGSSSG 142
Query: 66 -IVPCSNPRCAALHWPNP--PRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV 122
V C N CAA + P C C+Y EYGDG S+ G+ V+D +G+
Sbjct: 143 SAVSCDNKFCAATYGSGEKLPGCT-AGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNA 201
Query: 123 ----FNVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+ FGCG Q G L + A G++G G+ S +SQL G ++ + HC
Sbjct: 202 QTRHAKANVIFGCGAQQ--GGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHC 259
Query: 177 IGQ-NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL------- 228
+ G G+ +G+ P V TP+L N + HY + + +G + L
Sbjct: 260 LDTIKGGGIFAIGEVVQPK--VKSTPLLPN---MSHYNVNLQSIDVAGNALQLPPHIFET 314
Query: 229 -KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP-----F 282
+ I DSG + Y VY++I++ + + K I +R F
Sbjct: 315 SEKRGTIIDSGTTLTYLPELVYKDILAAVFQ-------------KHQDITFRTIQGFLCF 361
Query: 283 KALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNG--SEAEVGENN 340
+ V + F + F + + L V P Y +G CLG NG + +
Sbjct: 362 EYSESVDDGFPKITFHF---EDDLGLNVYPHDYFFQNGDNLYCLGFQNGGFQPKDAKDMV 418
Query: 341 IIGEIFMQDKMVIYDNEKQRIGWKPEDCNTLLSL 374
++G++ + +K+V+YD EKQ IGW +C++ + +
Sbjct: 419 LLGDLVLSNKVVVYDLEKQVIGWTDYNCSSSIKI 452
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 164/384 (42%), Gaps = 40/384 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----PEKQYKPHKN----I 66
YF + +G P K + DTGSD+ WV C PC+GC + P Y P ++ +
Sbjct: 2 YF-TQVGLGNPVKHYIVQVDTGSDVLWVNC-RPCSGCPRKSALNIPLTMYDPRESSTTSL 59
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRF--SNGSVFN 124
V CS+P C +C + C+Y YGDG +S G V D SNG
Sbjct: 60 VSCSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANT 119
Query: 125 VP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+ FGC Q S G++G G+ +S+ +QL I V HC+ RG
Sbjct: 120 TSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRG 179
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSAD----LKHYILGPAELLYSGKS-CGLKDLTLIFDSG 238
L G + G+ +TP++ +S L+ + L + D +I DSG
Sbjct: 180 GGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSG 239
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+ AYF S Y V I TP+++ D F G++++ F + L+
Sbjct: 240 TTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQC-------FLVSGRLSDLFPNVTLN 292
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNV------CLGILNGSEA----EVGENNIIGEIFMQ 348
F + + P+ YL+ G C+G + S + + + I+G+I ++
Sbjct: 293 FEGG----AMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLK 348
Query: 349 DKMVIYDNEKQRIGWKPEDCNTLL 372
DK+V+YD + RIGW +C L
Sbjct: 349 DKLVVYDLDNSRIGWMSYNCKFLF 372
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 97/375 (25%), Positives = 160/375 (42%), Gaps = 38/375 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE------KQYKPHKN----I 66
+ + +G PP + DTGSD+TW+ C APCT C + Y P ++
Sbjct: 37 YYTKIYLGTPPVGYYVQVDTGSDVTWLNC-APCTSCVTETQLPSIKLTTYDPSRSSTDGA 95
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR-FSNGSVFN- 124
+ C + C A N C C Y YGDG S+ G + D+ + N + N
Sbjct: 96 LSCRDSNCGAALGSNEVSCTSAG-YCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVNG 154
Query: 125 -VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+ FGCG Q +S G++G G+ +SI SQL G + N HC+ + +G
Sbjct: 155 TASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQGDNQG 214
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK---DLT------LI 234
+ G V +++TP++ HY +G + +G++ D T +I
Sbjct: 215 GGTIVIGSVSEPNISYTPIVSR----NHYAVGMQNIAVNGRNVTTPASFDTTSTSAGGVI 270
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG + AY Y + V+ + + + L + W V +F
Sbjct: 271 MDSGTTLAYLVDPAYTQFVNAVS---TFESSMFSSHSQCLQLAWCSLQADFPTVKLFFDA 327
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNG-SEAEVGENNIIGEIFMQDKMVI 353
A+ RN L P + +G+ C+G ++A +I+G+I ++D +V+
Sbjct: 328 GAVMNLTPRN--YLYSQP----LQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKDHLVV 381
Query: 354 YDNEKQRIGWKPEDC 368
YDN+ + +GWK DC
Sbjct: 382 YDNDNRVVGWKSFDC 396
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 105/379 (27%), Positives = 170/379 (44%), Gaps = 39/379 (10%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT-KPPEKQYKPHKNI----VP 68
+ YF L +G P K F DTGS +T+V C + +GC + + P + +
Sbjct: 75 YGYFYATLYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGPNHQDAAFDPEASSTASRIS 134
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
C++P+C+ PRC QC Y Y + SS G L+ D+ L + + P+
Sbjct: 135 CTSPKCSC----GSPRCGCSTQQCTYTRSYAEQSSSSGILLEDVLAL---HDGLPGAPII 187
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGRGVLFL 187
FGC G + G+ GLG S+V+QL + G+I +V C G G G L L
Sbjct: 188 FGC--ETRETGEIFRQRADGLFGLGNSDASVVNQLVKAGVIDDVFSLCFGMVEGDGALLL 245
Query: 188 GDGKVPSS-GVAWTPMLQNSADLKHY------ILGPAELLYSGKSCGLKDLTLIFDSGAS 240
GD +VP S + +TP+L ++ +Y + +LL +S + + DSG +
Sbjct: 246 GDAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSLFDQGYGTVLDSGTT 305
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLPICW-RGP-FKALGQVTEYFKPLA 296
+ Y S V++ + + + LK PD + IC+ + P L ++ F +
Sbjct: 306 FTYMPSPVFKAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQAPSHDDLEALSSVFPSME 365
Query: 297 LSFTNRRNSVRLVVPPEAYLVI----SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
+ F LV+ P YL + SG+ CLG+ + A ++G I ++ +V
Sbjct: 366 VQFD---QGTSLVLGPLNYLFVHTFNSGK--YCLGVFDNGRA----GTLLGGITFRNVLV 416
Query: 353 IYDNEKQRIGWKPEDCNTL 371
YD QR+G+ P C L
Sbjct: 417 RYDRANQRVGFGPALCKEL 435
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 105/389 (26%), Positives = 169/389 (43%), Gaps = 54/389 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----------- 65
+ + +G PPK F+ DTGSD+ WV C+ C+ C P Q N
Sbjct: 78 YYTKVKMGTPPKEFNVQIDTGSDILWVNCNT-CSNC--PQSSQLGIELNFFDTVGSSTAA 134
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSVF 123
++PCS+P C + C +QC Y +YGDG + G V+D F L
Sbjct: 135 LIPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAV 194
Query: 124 NVPLT--FGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
N T FGC +Q G L+ D A G+ G G G +S+VSQL G+ V HC+
Sbjct: 195 NSSATIVFGCSISQS--GDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKG 252
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------ 233
+G G L G++ + ++P++ + HY L + +G+ +
Sbjct: 253 DGDGGGVLVLGEILEPSIVYSPLVPSQ---PHYNLNLQSIAVNGQLLPINPAVFSISNNR 309
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I D G + AY Y +V+ I + + + + + +
Sbjct: 310 GGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQC-------YLVSTSIGD 362
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNIIGEIF 346
F ++L+F +V+ PE YL+ + G + C+G E +I+G++
Sbjct: 363 IFPSVSLNF---EGGASMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGA----SILGDLV 415
Query: 347 MQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
++DK+V+YD +QRIGW DC+ LS+N
Sbjct: 416 LKDKIVVYDIAQQRIGWANYDCS--LSVN 442
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 101/389 (25%), Positives = 170/389 (43%), Gaps = 56/389 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPH----KNIV 67
+ + +G PPK + DTGSD+ WV C C C + + + Y P + V
Sbjct: 83 YYTEIEIGTPPKQYHVQVDTGSDILWVNC-ISCNKCPRKSDLGIDLRLYDPKGSSSGSTV 141
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS----VF 123
C CAA + P C N C+Y + YGDG S+ G V+D +G
Sbjct: 142 SCDQKFCAATYGGKLPGCA-KNIPCEYSVMYGDGSSTTGYFVSDSLQYNQVSGDGQTRHA 200
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-N 180
N + FGCG Q G L + A G++G G+ S++SQL G ++ + HC+
Sbjct: 201 NASVIFGCGAQQ--GGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCLDTIK 258
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILG-------------PAELLYSGKSCG 227
G G+ +GD P V TP++ D+ HY + P+ + +G+ G
Sbjct: 259 GGGIFAIGDVVQPK--VKSTPLV---PDMPHYNVNLESINVGGTTLQLPSHMFETGEKKG 313
Query: 228 LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQ 287
I DSG + Y VY+++++ + T D +C +
Sbjct: 314 -----TIIDSGTTLTYLPELVYKDVLAAVFAKHPDTTFHSVQD----FLC----IQYFQS 360
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNI-IGEI 345
V + F + F + + L V P Y +G C G NG +++ G++ + +G++
Sbjct: 361 VDDGFPKITFHF---EDDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQSKDGKDMVLLGDL 417
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTLLSL 374
+ +K+V+YD E Q +GW +C++ + +
Sbjct: 418 VLSNKVVVYDLENQVVGWTDYNCSSSIKI 446
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 111/391 (28%), Positives = 172/391 (43%), Gaps = 57/391 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN----I 66
YF + +G P K + DTGSD+ WV C C GC + Y P + +
Sbjct: 90 YF-TRIGIGTPAKRYYVQVDTGSDILWVNC-VSCDGCPRKSNLGIELTMYDPRGSQSGEL 147
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
V C C A + P C + C+Y I YGDG S+ G VTD +G +
Sbjct: 148 VTCDQQFCVANYGGVLPSCTSTS-PCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTP 206
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N ++FGCG G L + A G+LG G+ S++SQL G +R + HC+
Sbjct: 207 ANASVSFGCGAKL--GGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTV 264
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY------------ILG-PAELLYSGKSC 226
NG G+ +G+ P V TP++ D+ HY LG P + SG S
Sbjct: 265 NGGGIFAIGNVVQPK--VKTTPLV---PDMPHYNVILKGIDVGGTALGLPTNIFDSGNSK 319
Query: 227 GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
G I DSG + AY VY+ + +++ ++ D F+ G
Sbjct: 320 GT-----IIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFSC--------FQYSG 366
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS--EAEVGENNIIGE 344
V + F + F V L+V P YL +G+ C+G NG + + ++G+
Sbjct: 367 SVDDGFPEVTFHF---EGDVSLIVSPHDYLFQNGKNLYCMGFQNGGGKTKDGKDLGLLGD 423
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
+ + +K+V+YD E Q IGW +C++ + ++
Sbjct: 424 LVLSNKLVLYDLENQAIGWADYNCSSSIKIS 454
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 104/380 (27%), Positives = 163/380 (42%), Gaps = 40/380 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----PEKQYKPHKN----I 66
YF + +G P K + DTGSD+ WV C PC+GC + P Y P ++ +
Sbjct: 29 YF-TQVGLGNPVKHYIVQVDTGSDVLWVNC-RPCSGCPRKSALNIPLTMYDPRESSTTSL 86
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRF--SNGSVFN 124
V CS+P C +C + C+Y YGDG +S G V D SNG
Sbjct: 87 VSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANT 146
Query: 125 VP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+ FGC Q S G++G G+ +S+ +QL I V HC+ RG
Sbjct: 147 TSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRG 206
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSAD----LKHYILGPAELLYSGKS-CGLKDLTLIFDSG 238
L G + G+ +TP++ +S L+ + L + D +I DSG
Sbjct: 207 GGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSG 266
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+ AYF S Y V I TP+++ D F G++++ F + L+
Sbjct: 267 TTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQC-------FLVSGRLSDLFPNVTLN 319
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNV------CLGILNGSEA----EVGENNIIGEIFMQ 348
F + + P+ YL+ G C+G + S + + + I+G+I ++
Sbjct: 320 FEGG----AMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLK 375
Query: 349 DKMVIYDNEKQRIGWKPEDC 368
DK+V+YD + RIGW +C
Sbjct: 376 DKLVVYDLDNSRIGWMSYNC 395
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 105/388 (27%), Positives = 170/388 (43%), Gaps = 50/388 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----PEKQYKP----HKNI 66
YF + +G PPK F DTGSD+ WV C + C GC + P + P ++
Sbjct: 68 YFT-RVLLGSPPKEFYVQIDTGSDVLWVSCGS-CNGCPQSSGLHIPLNFFDPGSSSTASL 125
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF--- 123
+ CS+ RC+ + C +QC Y +YGDG + G V+DL GS
Sbjct: 126 ISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNS 185
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
+ + FGC +Q G L+ D A G+ G G+ +S++SQ+ G+ V HC+ +G
Sbjct: 186 SASIVFGCSISQ--TGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDG 243
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL--------KDLTL 233
G L G++ + ++P++ + HY L + +GKS + +
Sbjct: 244 GGGGILVLGEIVEEDIVYSPLVPSQ---PHYNLNLQSISVNGKSLAIDPEVFATSTNRGT 300
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEY 291
I DSG + AY Y VS I A P+ +G + V
Sbjct: 301 IVDSGTTLAYLAEEAYDPFVSAITE---------AVSQSVRPLLSKGTQCYLITSSVKGI 351
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNIIGEIFM 347
F ++L+F V + + PE YL+ I C+G + I+G++ +
Sbjct: 352 FPTVSLNFA---GGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGI---TILGDLVL 405
Query: 348 QDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
+DK+ +YD QRIGW DC+ ++++
Sbjct: 406 KDKIFVYDLAGQRIGWANYDCSMSVNVS 433
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 105/382 (27%), Positives = 166/382 (43%), Gaps = 50/382 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----PEKQYKP----HKNI 66
YF + +G PPK F DTGSD+ WV C + C GC + P + P ++
Sbjct: 83 YFT-RVLLGSPPKEFYVQIDTGSDVLWVSCGS-CNGCPQSSGLHIPLNFFDPGSSSTASL 140
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF--- 123
+ CS+ RC+ + C +QC Y +YGDG + G V+DL GS
Sbjct: 141 ISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNS 200
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
+ + FGC +Q G L+ D A G+ G G+ +S++SQ+ G+ V HC+ +G
Sbjct: 201 SASIVFGCSISQ--TGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDG 258
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL--------KDLTL 233
G L G++ + ++P++ + HY L + +GKS + +
Sbjct: 259 GGGGILVLGEIVEEDIVYSPLVPSQ---PHYNLNLQSISVNGKSLAIDPEVFATSTNRGT 315
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEY 291
I DSG + AY Y VS I A P+ +G + V
Sbjct: 316 IVDSGTTLAYLAEEAYDPFVSAITE---------AVSQSVRPLLSKGTQCYLITSSVKGI 366
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNIIGEIFM 347
F ++L+F V + + PE YL+ I C+G + I+G++ +
Sbjct: 367 FPTVSLNFA---GGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGI---TILGDLVL 420
Query: 348 QDKMVIYDNEKQRIGWKPEDCN 369
+DK+ +YD QRIGW DC+
Sbjct: 421 KDKIFVYDLAGQRIGWANYDCS 442
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 114/385 (29%), Positives = 169/385 (43%), Gaps = 58/385 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ ++L +G PP + DTGSDL W QC APC C P ++P ++ +VPC +P
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQC-APCVLCADQPTPYFRPARSATYRLVPCRSP 150
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFGC 131
CAAL + P C C Y+ YGD S+ G L ++ F +N S V + FGC
Sbjct: 151 LCAALPY---PACFQ-RSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGC 206
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR---GVLFLG 188
G N G L+ +++G++GLGRG +S+VSQL + + R GV
Sbjct: 207 G--NINSGQLA--NSSGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATL 262
Query: 189 DGKVPSSG---VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL---TLIF------- 235
+G SS V TP++ N+A Y + G S G K L L+F
Sbjct: 263 NGTNASSSGSPVQSTPLVVNAALPSLYFMS-----LKGISLGQKRLPIDPLVFAINDDGT 317
Query: 236 -----DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQ 287
DSG S + Y + R+L+ L P + T L C+ P+
Sbjct: 318 GGVFIDSGTSLTWLQQDAYDA----VRRELVSVLRPLPPTNDTEIGLETCF--PWPPPPS 371
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN-VCLGILNGSEAEVGENNIIGEIF 346
V + L F N + VPPE Y++I G +CL ++ G+ IIG
Sbjct: 372 VAVTVPDMELHFDGGAN---MTVPPENYMLIDGATGFLCLAMIRS-----GDATIIGNYQ 423
Query: 347 MQDKMVIYDNEKQRIGWKPEDCNTL 371
Q+ ++YD + + P CN +
Sbjct: 424 QQNMHILYDIANSLLSFVPAPCNIV 448
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 107/406 (26%), Positives = 175/406 (43%), Gaps = 71/406 (17%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKN-- 65
YF + +G PPK + DTGSD+ WV C C+K P K Y P +
Sbjct: 87 YF-TEIKLGTPPKRYYVQVDTGSDILWVNC----ISCSKCPRKSGLGLDLTFYDPKASSS 141
Query: 66 --IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
V C CAA + P C N C+Y + YGDG S+ G +TD G
Sbjct: 142 GSTVSCDQGFCAATYGGKLPGCT-ANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQ 200
Query: 124 ----NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
N +TFGCG Q S G+LG G+ S++SQL G + + HC+
Sbjct: 201 TQPGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDT 260
Query: 180 -NGRGVLFLGDGKVP--------SSGVAWTPML----------QNSADLKHYILG----- 215
G G+ +G+ P + G+ P+ + +LK +G
Sbjct: 261 IKGGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQ 320
Query: 216 -PAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIM---RDLIGTPLKLAPDD 271
PA + +G+ G I DSG + Y V+++++ ++ RD+ L+
Sbjct: 321 LPAHVFETGEKKGT-----IIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQDF--- 372
Query: 272 KTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNG 331
+C F+ G V + F + F + + L V P Y +G C+G NG
Sbjct: 373 ----LC----FQYSGSVDDGFPTITFHF---EDDLALHVYPHEYFFPNGNDIYCVGFQNG 421
Query: 332 S-EAEVGENNII-GEIFMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
+ +++ G++ ++ G++ + +K+V+YD E Q IGW +C++ + +
Sbjct: 422 ALQSKDGKDIVLMGDLVLSNKLVVYDLENQVIGWTDYNCSSSIKIK 467
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 168/384 (43%), Gaps = 42/384 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN----I 66
YF + +G P K + DTGSD+ WV C CT C + + Y P ++
Sbjct: 69 YFT-KIGLGSPSKDYYVQVDTGSDILWVNC-VECTRCPRKSDIGIGLTLYDPKRSKTSEF 126
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
V C + C++ + CK N C Y I YGDG ++ G V D NG +
Sbjct: 127 VSCEHNFCSSTYEGRILGCKAEN-PCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTAT 185
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
N + FGCG Q S + G++G G+ S++SQL G ++ + HC+ N
Sbjct: 186 QNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNV 245
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-------- 233
G +F G+V V TP++ N A HY + + G L T
Sbjct: 246 GGGIF-SIGEVVEPKVKTTPLVPNMA---HYNVILKNIEVDGDILQLPSDTFDSENGKGT 301
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
+ DSG + AY VY +++S ++ + L + + F+ G V F
Sbjct: 302 VIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQYSC-------FQYTGNVDSGFP 354
Query: 294 PLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLG-ILNGSEAEVGEN-NIIGEIFMQDK 350
+ L F +S+ L V P YL G C+G + SE + G++ ++G+ + +K
Sbjct: 355 IVKLHF---EDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNK 411
Query: 351 MVIYDNEKQRIGWKPEDCNTLLSL 374
+V+YD E IGW +C++ + +
Sbjct: 412 LVVYDLENMTIGWTDYNCSSSIKV 435
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 107/391 (27%), Positives = 174/391 (44%), Gaps = 58/391 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC-----TKPPEKQYKP----HKNIV 67
+ + +G PPK F DTGSD+ WV C++ C GC + P + P ++V
Sbjct: 83 YYTRVQLGNPPKDFYVQIDTGSDVLWVSCNS-CNGCPATSGLQIPLNFFDPGSSTTASLV 141
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRF----SNGSVF 123
CS+ CA + C ++QC Y +YGDG + G V D+ L S S
Sbjct: 142 SCSDQICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSNS 201
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQ 179
+ + FGC +Q G L+ D A G+ G G+ +S++SQL G+ V HC+
Sbjct: 202 SASVVFGCSTSQ--TGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDD 259
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------ 233
+G G+L LG+ P+ V +TP++ + HY L + +G+ +
Sbjct: 260 SGGGILVLGEIVEPN--VVYTPLVPSQ---PHYNLNLQSISVNGQVLPISPAVFATSSSQ 314
Query: 234 --IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVT 289
I DSG + AY Y V + + T + +G + V+
Sbjct: 315 GTIIDSGTTLAYLAEEAYNAFVVAVTNIV---------SQSTQSVVLKGNRCYVTSSSVS 365
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGEN-NIIGE 344
+ F ++L+F LV+ + YL+ + G C+G + G+ I+G+
Sbjct: 366 DIFPQVSLNFA---GGASLVLGAQDYLIQQNSVGGTTVWCIGF----QKIPGQGITILGD 418
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
+ ++DK+ IYD QRIGW DC+ +S+N
Sbjct: 419 LVLKDKIFIYDLANQRIGWTNYDCS--MSVN 447
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 111/382 (29%), Positives = 174/382 (45%), Gaps = 46/382 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC----------TKPPEKQYKPHKN 65
Y+A + +G P K + DTG+D+ WV C C C T K+ K
Sbjct: 73 YYA-KIGIGTPSKDYYLQVDTGTDMMWVNC-IQCKECPTRSNLGMDLTLYNIKESSSGK- 129
Query: 66 IVPCSNPRCAALHWPNPPRC-KHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV-- 122
+VPC C ++ C ND C Y YGDG S+ G V D+ +G +
Sbjct: 130 LVPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKT 189
Query: 123 --FNVPLTFGCGYNQHNPGPLSPPDTA---GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
N + FGCG Q G LS + G+LG G+ S++SQL G ++ + HC+
Sbjct: 190 ASANGSVIFGCGARQ--SGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCL 247
Query: 178 -GQNGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILGPAELLYSGKSCGLKDLT 232
G NG G+ +G P+ V TP+L + S ++ +G L S + +D
Sbjct: 248 NGVNGGGIFAIGHVVQPT--VNTTPLLPDQPHYSVNMTAIQVGHTFLNLSTDASEQRDSK 305
Query: 233 -LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
I DSG + AY +YQ +V I+ ++ D+ T F+ G V +
Sbjct: 306 GTIIDSGTTLAYLPDGIYQPLVYKILSQQPNLKVQTLHDEYTC-------FQYSGSVDDG 358
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILN-GSEAEVGEN-NIIGEIFMQ 348
F + F N + L V P YL +S +N+ C+G N G+++ +N ++G++ +
Sbjct: 359 FPNVTFYF---ENGLSLKVYPHDYLFLS--ENLWCIGWQNSGAQSRDSKNMTLLGDLVLS 413
Query: 349 DKMVIYDNEKQRIGWKPEDCNT 370
+K+V YD E Q IGW +C++
Sbjct: 414 NKLVFYDLENQVIGWTEYNCSS 435
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 105/365 (28%), Positives = 158/365 (43%), Gaps = 36/365 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+AV + +G P K F FDTGSDLTW QC+ C K E + P K+ + CS+
Sbjct: 133 YAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTKSTSYKNISCSSA 192
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C L C P C Y+++YGDG SIG T+ L SN VF L FGCG
Sbjct: 193 FCKLLDTEGGESCSSPT--CLYQVQYGDGSYSIGFFATETLTLSSSN--VFKNFL-FGCG 247
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
Q N G AG+LGLGR ++S+ SQ + + + +C+ + +L G
Sbjct: 248 --QQNSGLFR--GAAGLLGLGRTKLSLPSQTAQK--YKKLFSYCLPASSSSKGYLSFGGQ 301
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYAYFTS 246
S V +TP+ ++ Y L EL G + D ++ + DSG S
Sbjct: 302 VSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSI-DASIFSTSGTVIDSGTVITRLPS 360
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSV 306
Y + S + + P + D ++ + + T + +SF + V
Sbjct: 361 TAYSALSSAFQKLMTDYP---STDGYSI---FDTCYDFSKNETIKIPKVGVSF---KGGV 411
Query: 307 RLVVPPEAYLV-ISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
+ + L ++G K VCL NG + + I G + V+YD+ K R+G+
Sbjct: 412 EMDIDVSGILYPVNGLKKVCLAFAGNGDDVKAA---IFGNTQQKTYQVVYDDAKGRVGFA 468
Query: 365 PEDCN 369
P CN
Sbjct: 469 PSGCN 473
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 166/388 (42%), Gaps = 61/388 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPH----K 64
+ + +G P K F DTGSD+ WV C GCT P+K Y P+
Sbjct: 72 YYTKVGLGSPAKEFYVQVDTGSDILWVNC----AGCTACPKKSGLGMDLTLYDPNGSKTS 127
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
N VPC + C + CK + C Y I YGDG ++ G+ V D +G++
Sbjct: 128 NAVPCGDGFCTDTYSGPISGCKQ-DMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGNLHT 186
Query: 125 VP----LTFGCGYNQHNPGPLSP-PDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
P + FGCG Q G LS D A G++G G+ S++SQL G ++ + HC+
Sbjct: 187 KPDNSSVIFGCGAKQ--SGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCL 244
Query: 178 GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY-------------ILGPAELLYSGK 224
+ G +F G+V TP++ A HY IL P L SG
Sbjct: 245 DSHHGGGIF-SIGQVMEPKFNTTPLVPRMA---HYNVILKDMDVDGEPILLPLYLFDSGS 300
Query: 225 SCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKA 284
G I DSG + AY +Y +++ ++ G L + D T F
Sbjct: 301 GRG-----TIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQFTC-------FHY 348
Query: 285 LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNI-I 342
++ E F + F + L V P YL + C+G S + + G + I I
Sbjct: 349 SDKLDEGFPVVKFHF----EGLSLTVHPHDYLFLYKEDIYCIGWQKSSTQTKEGRDLILI 404
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
G++ + +K+V+YD E IGW +C++
Sbjct: 405 GDLVLSNKLVVYDLENMVIGWTNFNCSS 432
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 113/385 (29%), Positives = 168/385 (43%), Gaps = 58/385 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ ++L +G PP + DTGSDL W QC APC C P ++P ++ +VPC +P
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQC-APCVLCADQPTPYFRPARSATYRLVPCRSP 150
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFGC 131
CAAL + P C C Y+ YGD S+ G L ++ F +N S V + FGC
Sbjct: 151 LCAALPY---PACFQ-RSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGC 206
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR---GVLFLG 188
G N G L+ +++G++GLGRG +S+VSQL + + R GV
Sbjct: 207 G--NINSGQLA--NSSGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATL 262
Query: 189 DGKVPSSG---VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL---TLIF------- 235
+G SS V TP++ N+A Y + G S G K L L+F
Sbjct: 263 NGTNASSSGSPVQSTPLVVNAALPSLYFMS-----LKGISLGQKRLPIDPLVFAINDDGT 317
Query: 236 -----DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQ 287
DSG S + Y + +L+ L P + T L C+ P+
Sbjct: 318 GGVFIDSGTSLTWLQQDAYDA----VRHELVSVLRPLPPTNDTEIGLETCF--PWPPPPS 371
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN-VCLGILNGSEAEVGENNIIGEIF 346
V + L F N + VPPE Y++I G +CL ++ G+ IIG
Sbjct: 372 VAVTVPDMELHFDGGAN---MTVPPENYMLIDGATGFLCLAMIRS-----GDATIIGNYQ 423
Query: 347 MQDKMVIYDNEKQRIGWKPEDCNTL 371
Q+ ++YD + + P CN +
Sbjct: 424 QQNMHILYDIANSLLSFVPAPCNIV 448
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 106/386 (27%), Positives = 165/386 (42%), Gaps = 62/386 (16%)
Query: 24 GKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-----------IVPCSNP 72
G F+ DTGSD+ WV C+ C+ C P Q N ++PCS+
Sbjct: 75 GXXXXXFNVQIDTGSDILWVNCNT-CSNC--PQSSQLGIELNFFDTVGSSTAALIPCSDL 131
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSVFN--VPLT 128
C + C +QC Y +YGDG + G V+D F L N +
Sbjct: 132 ICTSGVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFNLIMGQPPAVNSTATIV 191
Query: 129 FGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGV 184
FGC +Q G L+ D A G+ G G G +S+VSQL G+ V HC+ NG G+
Sbjct: 192 FGCSISQS--GDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKGDGNGGGI 249
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------IF 235
L LG+ PS + ++P++ + HY L + +G+ + I
Sbjct: 250 LVLGEILEPS--IVYSPLVPSQ---PHYNLNLQSIAVNGQPLPINPAVFSISNNRGGTIV 304
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFK 293
D G + AY Y +V T + A +G + + + F
Sbjct: 305 DCGTTLAYLIQEAYDPLV---------TAINTAVSQSARQTNSKGNQCYLVSTSIGDIFP 355
Query: 294 PLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
++L+F +V+ PE YL+ + G + C+G E +I+G++ ++D
Sbjct: 356 LVSLNF---EGGASMVLKPEQYLMHNGYLDGAEMWCVGFQKLQEGA----SILGDLVLKD 408
Query: 350 KMVIYDNEKQRIGWKPEDCNTLLSLN 375
K+V+YD +QRIGW DC+ LS+N
Sbjct: 409 KIVVYDIAQQRIGWANYDCS--LSVN 432
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 111/387 (28%), Positives = 168/387 (43%), Gaps = 49/387 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKP----H 63
YF + +G P K + DTGSD+ WV C C P K Y P
Sbjct: 89 YF-TQIGIGTPSKGYYVQVDTGSDILWVNC----ISCDSCPRKSGLGIDLTLYDPTASAS 143
Query: 64 KNIVPCSNPRCA-ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG-- 120
V C CA A + PP C N C Y I YGDG S+ G V D +G
Sbjct: 144 SKTVTCGQEFCATATNGGVPPSCA-ANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGDG 202
Query: 121 --SVFNVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHC 176
++ N +TFGCG G L + A G+LG G+ S++SQL G + + HC
Sbjct: 203 QTNLANASVTFGCG--AKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHC 260
Query: 177 IGQ-NGRGVLFLGDGKVPSSGVAWTPML----QNSADLKHYILGPAELLYSGK--SCGLK 229
+ NG G+ +G+ P V TP++ + LK +G + L G
Sbjct: 261 LDTVNGGGIFAIGNVVQPK--VKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGGG 318
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
I DSG + AY VY+ ++S + + LK D +C F+ G V
Sbjct: 319 SRGTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQD----FLC----FQYSGSVD 370
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNII-GEIFM 347
F + F + LVV P YL + C+G +G +++ G++ ++ G++ +
Sbjct: 371 NGFPEVTFHF---DGDLPLVVYPHDYLFQNTEDVYCVGFQSGGVQSKDGKDMVLLGDLAL 427
Query: 348 QDKMVIYDNEKQRIGWKPEDCNTLLSL 374
+K+V+YD E Q IGW +C++ + +
Sbjct: 428 SNKLVVYDLENQVIGWTNYNCSSSIKI 454
>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 533
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 103/371 (27%), Positives = 159/371 (42%), Gaps = 46/371 (12%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK----PPEKQ-----YKPHKN---- 65
N+++G P + DTGSDL W+ CD +GC + P +Q Y+P+ +
Sbjct: 115 ANVSIGTPSLSYLVALDTGSDLFWLPCDCTNSGCVQGLQFPSGEQIDFNIYRPNASSTSQ 174
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGS--V 122
+PC+N C+ RC C Y+++Y +G SS G LV DL L +
Sbjct: 175 TIPCNNTLCS-----RQSRCPSAQSTCPYQVQYLSNGTSSTGVLVEDLLHLTTDDAQSRA 229
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
+ + FGCG Q L G+ GLG IS+ S L G N C G++G
Sbjct: 230 LDAKIIFGCGRVQTG-SFLDGAAPNGLFGLGMTNISVPSTLAREGYTSNSFSMCFGRDGI 288
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLK-HYILGPAELLYSGKSCGLKDLTLIFDSGASY 241
G + GD SSG TP N L Y + ++ G+ L + + IFDSG S+
Sbjct: 289 GRISFGD--TGSSGQGETPF--NLRQLHPTYNVSITKINVGGRDADL-EFSAIFDSGTSF 343
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
Y Y LI + +K PF+ +++ L + N
Sbjct: 344 TYLNDPAYT---------LISESFNIGAKEKRYSSISDIPFEYCYEMSSNQTNLEIPTVN 394
Query: 302 ---RRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNE 357
+ S V P +++ G ++ CL I+ + G+ NIIG+ FM ++++ E
Sbjct: 395 LVMQGGSQFNVTDPIVIVILQGGASIYCLAIV-----KSGDVNIIGQNFMTGYRIVFNRE 449
Query: 358 KQRIGWKPEDC 368
+ +GWK DC
Sbjct: 450 RNVLGWKASDC 460
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 106/381 (27%), Positives = 166/381 (43%), Gaps = 45/381 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPH----KNI 66
YF L +G PPK + DTGSD+ WV C C+ C + + Y P +
Sbjct: 70 YFT-KLGLGSPPKDYYVQVDTGSDILWVNC-VKCSRCPRKSDLGIDLTLYDPKGSETSEL 127
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
+ C C+A + P CK C Y I YGDG ++ G V D N ++ P
Sbjct: 128 ISCDQEFCSATYDGPIPGCK-SEIPCPYSITYGDGSATTGYYVQDYLTYNHVNDNLRTAP 186
Query: 127 ----LTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
+ FGCG Q S + G++G G+ S++SQL G ++ + HC+ N
Sbjct: 187 QNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCL-DNI 245
Query: 182 RGVLFLGDGKVPSSGVAWTPM---------LQNSADLKHYILG-PAELLYSGKSCGLKDL 231
RG G+V V+ TP+ + S ++ IL P+++ SG G
Sbjct: 246 RGGGIFAIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSGNGKG---- 301
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
I DSG + AY + VY E++ +M L L + F+ G V
Sbjct: 302 -TIIDSGTTLAYLPAIVYDELIPKVMARQPRLKLYLVEQQFSC-------FQYTGNVDRG 353
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNG-SEAEVGEN-NIIGEIFMQD 349
F + L F +S+ L V P YL C+G ++ + G++ ++G++ + +
Sbjct: 354 FPVVKLHF---EDSLSLTVYPHDYLFQFKDGIWCIGWQKSVAQTKNGKDMTLLGDLVLSN 410
Query: 350 KMVIYDNEKQRIGWKPEDCNT 370
K+VIYD E IGW +C++
Sbjct: 411 KLVIYDLENMAIGWTDYNCSS 431
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 106/383 (27%), Positives = 163/383 (42%), Gaps = 60/383 (15%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPH----KNIVPCS 70
+G PK + DTGSD WV C GCT P+K Y P+ VPC
Sbjct: 80 IGLGPKDYYVQVDTGSDTLWVNC----VGCTACPKKSGLGMDLTLYDPNLSKTSKAVPCD 135
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP---- 126
+ C + + C C Y I YGDG ++ G+ + D G + VP
Sbjct: 136 DEFCTSTYDGQISGCTKGM-SCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTS 194
Query: 127 LTFGCGYNQHNPGPLSPP-DTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+ FGCG Q G LS DT+ G++G G+ S++SQL G ++ + HC+ G
Sbjct: 195 VIFGCGSKQ--SGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLDSISGG 252
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHY-------------ILGPAELLYSGKSCGLKD 230
+F G+V V TP+LQ A HY I P+++L S G
Sbjct: 253 GIF-AIGEVVQPKVKTTPLLQGMA---HYNVVLKDIEVAGDPIQLPSDILDSSSGRG--- 305
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG + AY +Y +++ I+ G L L D T C+ + V +
Sbjct: 306 --TIIDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYLVEDQFT---CFH--YSDEESVDD 358
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENN---IIGEIFM 347
F + +F + L P YL + C+G S A+ + ++G++ +
Sbjct: 359 LFPTVKFTF---EEGLTLTTYPRDYLFLFKEDMWCVG-WQKSMAQTKDGKELILLGDLVL 414
Query: 348 QDKMVIYDNEKQRIGWKPEDCNT 370
+K+V+YD + IGW +C++
Sbjct: 415 ANKLVVYDLDNMAIGWADYNCSS 437
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 107/391 (27%), Positives = 166/391 (42%), Gaps = 49/391 (12%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN-- 65
I + + +G P K + DTGSD+ WV C C C K Y +++
Sbjct: 74 ILGLYYAKIGIGTPTKDYYVQVDTGSDIMWVNC-IQCRECPKTSSLGIDLTLYNINESDT 132
Query: 66 --IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG--- 120
+VPC C ++ P C N C Y YGDG S+ G V D+ +G
Sbjct: 133 GKLVPCDQEFCYEINGGQLPGCT-ANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLK 191
Query: 121 -SVFNVPLTFGCGYNQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI- 177
+ N + FGCG Q + G + G+LG G+ S++SQL G ++ + HC+
Sbjct: 192 TTAANGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCLD 251
Query: 178 GQNGRGVLFLGDGKVPSSGVAWTPMLQN---------SADLKHYILG-PAELLYSGKSCG 227
G NG G+ +G P V TP++ N + + H L P ++ +G G
Sbjct: 252 GTNGGGIFVIGHVVQPK--VNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVFEAGDRKG 309
Query: 228 LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQ 287
I DSG + AY VY+ +VS I+ + D+ T F+
Sbjct: 310 -----AIIDSGTTLAYLPEMVYKPLVSKIISQQPDLKVHTVRDEYTC-------FQYSDS 357
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENN--IIGE 344
+ + F + F NSV L V P YL G C+G N N ++G+
Sbjct: 358 LDDGFPNVTFHF---ENSVILKVYPHEYLFPFEGLW--CIGWQNSGVQSRDRRNMTLLGD 412
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
+ + +K+V+YD E Q IGW +C++ + +
Sbjct: 413 LVLSNKLVLYDLENQAIGWTEYNCSSSIQVQ 443
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 115/395 (29%), Positives = 176/395 (44%), Gaps = 58/395 (14%)
Query: 12 PIFS--------YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH 63
P+FS YFA+ + VG P DTGSDL W+QC +PC C + + P
Sbjct: 74 PVFSGIPFESGEYFAL-VGVGTPSTKAMLVIDTGSDLVWLQC-SPCRRCYAQRGQVFDPR 131
Query: 64 KNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN 119
++ VPCS+P+C AL +P C Y + YGDG SS G L TD L F+N
Sbjct: 132 RSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATD--KLAFAN 189
Query: 120 GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCIG 178
+ N +T GCG + N G AG+LG+GRG+ISI +Q+ YG +V +C+G
Sbjct: 190 DTYVN-NVTLGCG--RDNEGLFD--SAAGLLGVGRGKISISTQVAPAYG---SVFEYCLG 241
Query: 179 -QNGRGVL--FLGDGKVPSS-GVAWTPMLQNS-------ADLKHYILGPAELL-YSGKSC 226
+ R +L G+ P A+T +L N D+ + +G + +S S
Sbjct: 242 DRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASL 301
Query: 227 GLKDLT----LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPF 282
L T ++ DSG + + F Y + ++ + ++ + +
Sbjct: 302 ALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSV---FDACY 358
Query: 283 KALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYL--VISGRKNV-----CLGILNGSEAE 335
G+ + L F + +PPE Y V GR+ CLG EA
Sbjct: 359 DLRGRPAASAPLIVLHFAG---GADMALPPENYFLPVDGGRRRAASYRRCLGF----EAA 411
Query: 336 VGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
++IG + Q V++D EK+RIG+ P+ C +
Sbjct: 412 DDGLSVIGNVQQQGFRVVFDVEKERIGFAPKGCTS 446
>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 513
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 112/379 (29%), Positives = 165/379 (43%), Gaps = 51/379 (13%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ---------YKPHK 64
F ++A N+TVG P F DTGSDL W+ CD CT C + + Y P+
Sbjct: 102 FLHYA-NVTVGTPSDWFMVALDTGSDLFWLPCD--CTNCVRELKAPGGSSLDLNIYSPNA 158
Query: 65 NI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSN 119
+ VPC++ C RC P C Y+I Y +G SS G LV D+ L ++
Sbjct: 159 SSTSTKVPCNSTLCT-----RGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSND 213
Query: 120 GSVFNVP--LTFGCGYNQ----HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVI 173
S +P +TFGCG Q H+ + P+ G+ GLG IS+ S L + G+ N
Sbjct: 214 KSSKAIPARVTFGCGQVQTGVFHDG---AAPN--GLFGLGLEDISVPSVLAKEGIAANSF 268
Query: 174 GHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL 233
C G +G G + GD S TP+ + I + G + G +
Sbjct: 269 SMCFGNDGAGRISFGDKG--SVDQRETPLNIRQPHPTYNIT--VTKISVGGNTGDLEFDA 324
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI--CWRGPFKALGQVTEY 291
+FDSG S+ Y T Y I + + + D LP C+ AL +
Sbjct: 325 VFDSGTSFTYLTDAAYTLISESF--NSLALDKRYQTTDSELPFEYCY-----ALSPNKDS 377
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
F+ A++ T + S V P + + CL I+ ++ + +IIG+ FM
Sbjct: 378 FQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAIM-----KIEDISIIGQNFMTGYR 432
Query: 352 VIYDNEKQRIGWKPEDCNT 370
V++D EK +GWK DC T
Sbjct: 433 VVFDREKLILGWKESDCYT 451
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 119/387 (30%), Positives = 169/387 (43%), Gaps = 59/387 (15%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCS 70
YFAV + VG PP DTGSDL W+QC PC C + Y P H+ I PC+
Sbjct: 88 YFAV-INVGDPPTRALVVIDTGSDLIWLQC-VPCRHCYRQVTPLYDPRSSSTHRRI-PCA 144
Query: 71 NPRCA-ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTD--LFPLRFSNGSVFNVPL 127
+PRC L +P C C Y + YGDG +S G L TD +FP + V NV
Sbjct: 145 SPRCRDVLRYPG---CDARTGGCVYMVVYGDGSASSGDLATDRLVFP---DDTHVHNV-- 196
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCIG------QN 180
T GCG++ N G L AG+LG+GRG++S +QL YG +V +C+G QN
Sbjct: 197 TLGCGHD--NVGLLE--SAAGLLGVGRGQLSFPTQLAPAYG---HVFSYCLGDRLSRAQN 249
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNS-------ADLKHYILGPAELL-YSGKSCGLKDLT 232
G L G P S A+TP+ N D+ + +G + +S S L T
Sbjct: 250 GSSYLVFGRTPEPPS-TAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPAT 308
Query: 233 ----LIFDSGASYAYFTSRVYQEIVSLI--MRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
++ DSG + + F Y + GT KLA C+
Sbjct: 309 GRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAP 368
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISG---RKNVCLGILNGSEAEVGENNII 342
+ L F + +P YL+ + G R CLG+ +A N++
Sbjct: 369 AAAVRVPSIVLHFA---GGADMALPQANYLIPVQGGDRRTYFCLGL----QAADDGLNVL 421
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDCN 369
G + Q +++D E+ RIG+ P C+
Sbjct: 422 GNVQQQGFGLVFDVERGRIGFTPNGCS 448
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 103/379 (27%), Positives = 168/379 (44%), Gaps = 44/379 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C K + +++P ++ V C N
Sbjct: 87 YYTTRLWIGTPPQEFALIVDTGSTVTYVPC-SDCEHCGKHQDPRFQPDESSTYHPVKC-N 144
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFG 130
C C H C YE Y + SS G L D+ + F N S V FG
Sbjct: 145 MDC---------NCDHDGVNCVYERRYAEMSSSSGVLGEDI--ISFGNQSEVVPQRAVFG 193
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
C G L G++GLGRG++SIV QL + +I + C G + +G G
Sbjct: 194 C--ENVETGDLYSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGG-----MHVGGG 246
Query: 191 KVPSSGVAWTP-MLQNSAD---LKHYILGPAELLYSGKSCGLKDLTL------IFDSGAS 240
+ G+ P M+ + +D +Y + E+ +GK L T + DSG +
Sbjct: 247 AMVLGGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDSGTT 306
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
YAY + I++ PD IC+ G + + Q+++ F + + F+
Sbjct: 307 YAYLPEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEVDMVFS 366
Query: 301 NRRNSVRLVVPPEAYLVISGRKN--VCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
N + +L + PE YL + + CLGI ++ ++G I +++ +V YD E
Sbjct: 367 NGQ---KLSLTPENYLFQHTKVHGAYCLGIFRNGDS----TTLLGGIIVRNTLVTYDREN 419
Query: 359 QRIGWKPEDCNTLLSLNHF 377
++IG+ +C+ L H
Sbjct: 420 EKIGFWKTNCSELWKRLHI 438
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 107/390 (27%), Positives = 162/390 (41%), Gaps = 54/390 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----------PEKQYKPHK 64
YF + +G P K + DTGSD+ WV C +PCTGC P+ +
Sbjct: 89 YF-TRVKLGNPAKEYFVQIDTGSDILWVAC-SPCTGCPTSSGLNIQLEFFNPDSSSTSSR 146
Query: 65 NIVPCSNPRCAALHWPNPPRCKH---PNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSN 119
+PCS+ RC A C+ P+ C Y YGDG + G V+D F N
Sbjct: 147 --IPCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGN 204
Query: 120 GSVFN--VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGH 175
N + FGC +Q G L D A G+ G G+ ++S+VSQL G+ H
Sbjct: 205 EQTANSSASVVFGCSNSQS--GDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSH 262
Query: 176 CI--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL 233
C+ NG G+L LG+ P G+ +TP++ + HY L + SG+ +
Sbjct: 263 CLKGSDNGGGILVLGEIVEP--GLVFTPLVPSQ---PHYNLNLESIAVSGQKLPIDSSLF 317
Query: 234 --------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
I DSG + Y Y ++ I + + + + +
Sbjct: 318 ATSNTQGTIVDSGTTLVYLVDGAYDPFINAIAAAVSPSVRSVVSKGIQCFVTTSSVDSSF 377
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEI 345
T YFK V + V PE YL+ G + + G + G I+G++
Sbjct: 378 PTATLYFK----------GGVSMTVKPENYLLQQGSVDNNVLWCIGWQRSQGI-TILGDL 426
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
++DK+ +YD R+GW DC+ LS+N
Sbjct: 427 VLKDKIFVYDLANMRMGWADYDCS--LSVN 454
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 108/379 (28%), Positives = 166/379 (43%), Gaps = 41/379 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPH----KNI 66
YF L +G PP+ + DTGSD+ WV C C+ C + + Y P ++
Sbjct: 70 YFT-KLGLGSPPRDYYVQVDTGSDILWVNC-VECSRCPRKSDLGIDLTLYDPKGSETSDV 127
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
V C C+A P CK C Y I YGDG ++ G V D NG++ P
Sbjct: 128 VSCDQDFCSATFDGPIPGCK-SEIPCPYSITYGDGSATTGYYVQDYLTYNRINGNLRTSP 186
Query: 127 ----LTFGCGYNQHNP-GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
+ FGCG Q G S G++G G+ S++SQL G ++ + HC+ N
Sbjct: 187 QNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL-DNV 245
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHY--ILGPAEL------LYSGKSCGLKDLTL 233
RG G+V V+ TP++ A HY +L E+ L S +
Sbjct: 246 RGGGIFAIGEVVEPKVSTTPLVPRMA---HYNVVLKSIEVDTDILQLPSDIFDSVNGKGT 302
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
+ DSG + AY VY E++ ++ G L L +R F G V F
Sbjct: 303 VIDSGTTLAYLPDIVYDELIQKVLARQPGLKLYLVEQQ------FR-CFLYTGNVDRGFP 355
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNG-SEAEVGEN-NIIGEIFMQDKM 351
+ L F ++S+ L V P YL C+G ++ + G++ ++G++ + +K+
Sbjct: 356 VVKLHF---KDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKL 412
Query: 352 VIYDNEKQRIGWKPEDCNT 370
VIYD E IGW +C++
Sbjct: 413 VIYDLENMVIGWTDYNCSS 431
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 104/366 (28%), Positives = 159/366 (43%), Gaps = 43/366 (11%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH----KNIVPCSNPRCAALH 78
+G PP+ F DTGS +T+V C++ C C + +++P + V C NP C
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNS-CDQCGNHQDPKFQPDLSDTYHPVKC-NPDCT--- 56
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFGCGYNQHN 137
C NDQC YE +Y + SS G L DL + F N S FGC
Sbjct: 57 ------CDTENDQCTYERQYAEMSSSSGILGEDL--VSFGNMSELKPQRAVFGC--ENAE 106
Query: 138 PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLGDGKVPSS 195
G L G++GLGRG +SIV QL E G+I + C G + G G + LG PS
Sbjct: 107 TGDLFSQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISPPSD 166
Query: 196 GVAWTPMLQNSADLK-HYILGPAELLYSGKSCGLKDLTL------IFDSGASYAYFTSRV 248
V + D +Y + L +GK + I DSG +YAY
Sbjct: 167 MV----FSHSDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEAA 222
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRL 308
+ + I +L G PD +C+ G + ++ + F + + F N +
Sbjct: 223 FLPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGE---KY 279
Query: 309 VVPPEAYLVISGRKN--VCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKP 365
+ PE YL + + CLG+ NG + ++G I +++ +V YD E ++G+
Sbjct: 280 SLSPENYLFKHSKVHGAYCLGVFQNGKDP----TTLLGGIVVRNTLVTYDREHSKVGFWK 335
Query: 366 EDCNTL 371
+C+ L
Sbjct: 336 TNCSVL 341
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 102/374 (27%), Positives = 156/374 (41%), Gaps = 47/374 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +G P ++F DTGSDLTWVQC +PC C + + P+ + + C
Sbjct: 3 YLATVRLGTPERVFSVIVDTGSDLTWVQC-SPCGTCYSQNDSLFIPNTSTSFTKLACGTE 61
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C L + P C C Y YGDG S G V D + NG VP FGC
Sbjct: 62 LCNGLPY---PMCNQTT--CVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGC 116
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-----NGRGVLF 186
G++ N G + D G+LGLG+G +S SQL+ + +C+ L
Sbjct: 117 GHD--NEGSFAGAD--GILGLGQGPLSFPSQLKT--VFNGKFSYCLVDWLAPPTQTSPLL 170
Query: 187 LGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----------IF 235
GD VP+ GV + +L N +Y + + GK + IF
Sbjct: 171 FGDAAVPTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIF 230
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + V+QE+++ + + P K + D L +C LG E P
Sbjct: 231 DSGTTVTQLAGEVHQEVLAAMNASTMDYPRK-SDDSSGLDLC-------LGGFAEGQLPT 282
Query: 296 ALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
S T + +PP Y + + ++ C +++ + IIG I Q+ V Y
Sbjct: 283 VPSMTFHFEGGDMELPPSNYFIFLESSQSYCFSMVSSPDV-----TIIGSIQQQNFQVYY 337
Query: 355 DNEKQRIGWKPEDC 368
D ++IG+ P+ C
Sbjct: 338 DTVGRKIGFVPKSC 351
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 114/395 (28%), Positives = 175/395 (44%), Gaps = 58/395 (14%)
Query: 12 PIFS--------YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH 63
P+FS YFA+ + VG P DTGSDL W+QC +PC C + + P
Sbjct: 74 PVFSGIPFESGEYFAL-VGVGTPSTKAMLVIDTGSDLVWLQC-SPCRRCYAQRGQVFDPR 131
Query: 64 KNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN 119
++ VPCS+P+C AL +P C Y + YGDG SS G L TD L F+N
Sbjct: 132 RSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATD--KLAFAN 189
Query: 120 GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCIG 178
+ N +T GCG + N G AG+LG+ RG+ISI +Q+ YG +V +C+G
Sbjct: 190 DTYVN-NVTLGCG--RDNEGLFD--SAAGLLGVARGKISISTQVAPAYG---SVFEYCLG 241
Query: 179 -QNGRGVL--FLGDGKVPSS-GVAWTPMLQNS-------ADLKHYILGPAELL-YSGKSC 226
+ R +L G+ P A+T +L N D+ + +G + +S S
Sbjct: 242 DRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASL 301
Query: 227 GLKDLT----LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPF 282
L T ++ DSG + + F Y + ++ + ++ + +
Sbjct: 302 ALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSV---FDACY 358
Query: 283 KALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYL--VISGRKNV-----CLGILNGSEAE 335
G+ + L F + +PPE Y V GR+ CLG EA
Sbjct: 359 DLRGRPAASAPLIVLHFAG---GADMALPPENYFLPVDGGRRRAASYRRCLGF----EAA 411
Query: 336 VGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
++IG + Q V++D EK+RIG+ P+ C +
Sbjct: 412 DDGLSVIGNVQQQGFRVVFDVEKERIGFAPKGCTS 446
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 108/379 (28%), Positives = 166/379 (43%), Gaps = 42/379 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
Y+A + +G PPK + DTGSD+ WV C C C + +
Sbjct: 85 YYA-KIGIGTPPKNYYLQVDTGSDIMWVNC-IQCKECPTRSNLGMDLTLYDIKESSSGKF 142
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV---- 122
VPC C ++ C N C Y YGDG S+ G V D+ +G +
Sbjct: 143 VPCDQEFCKEINGGLLTGCT-ANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDS 201
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDT---AGVLGLGRGRISIVSQLREYGLIRNVIGHCI-G 178
N + FGCG Q G LS + G+LG G+ S++SQL G ++ + HC+ G
Sbjct: 202 ANGSIVFGCGARQ--SGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCLNG 259
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILGPAELLYSGKSCGLKDLT-L 233
NG G+ +G P V TP+L + S ++ +G A L S + D
Sbjct: 260 VNGGGIFAIGHVVQPK--VNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGDRKGT 317
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
I DSG + AY +Y+ +V I+ ++ D+ T F+ V + F
Sbjct: 318 IIDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRTLHDEYTC-------FQYSESVDDGFP 370
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILN-GSEAEVGEN-NIIGEIFMQDKM 351
+ F N + L V P YL SG C+G N G+++ +N ++G++ + +K+
Sbjct: 371 AVTFYF---ENGLSLKVYPHDYLFPSG-DFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKL 426
Query: 352 VIYDNEKQRIGWKPEDCNT 370
V YD E Q IGW +C++
Sbjct: 427 VFYDLENQVIGWTEYNCSS 445
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 104/366 (28%), Positives = 159/366 (43%), Gaps = 43/366 (11%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH----KNIVPCSNPRCAALH 78
+G PP+ F DTGS +T+V C++ C C + +++P + V C NP C
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNS-CDQCGNHQDPKFQPDLSDTYHPVKC-NPDCT--- 56
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCGYNQHN 137
C NDQC YE +Y + SS G L DL + F N S FGC
Sbjct: 57 ------CDTENDQCTYERQYAEMSSSSGILGEDL--VSFGNMSELKPQRAVFGC--ENAE 106
Query: 138 PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLGDGKVPSS 195
G L G++GLGRG +SIV QL E G+I + C G + G G + LG PS
Sbjct: 107 TGDLFSQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISPPSD 166
Query: 196 GVAWTPMLQNSADLK-HYILGPAELLYSGKSCGLKDLTL------IFDSGASYAYFTSRV 248
V + D +Y + L +GK + I DSG +YAY
Sbjct: 167 MV----FSHSDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEAA 222
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRL 308
+ + I +L G PD +C+ G + ++ + F + + F N +
Sbjct: 223 FLPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGE---KY 279
Query: 309 VVPPEAYLVISGRKN--VCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKP 365
+ PE YL + + CLG+ NG + ++G I +++ +V YD E ++G+
Sbjct: 280 SLSPENYLFKHSKVHGAYCLGVFQNGKDP----TTLLGGIVVRNTLVTYDREHSKVGFWK 335
Query: 366 EDCNTL 371
+C+ L
Sbjct: 336 TNCSVL 341
>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 529
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 113/372 (30%), Positives = 161/372 (43%), Gaps = 42/372 (11%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA-PCTGCTKPPEKQ-----YKPHKNI- 66
F ++AV + +G P F DTGSDL WV CD C + P Y P K+
Sbjct: 106 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCDCLKCAPLSSPDYGNLKFDVYSPRKSST 164
Query: 67 ---VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNG-- 120
VPCS+ C C ++ C Y+IEY D SS G LV D+ L +G
Sbjct: 165 SRKVPCSSNMCDL-----QTECSAASNSCPYKIEYLSDNTSSKGVLVEDVMYLATESGHS 219
Query: 121 SVFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
+ P+TFGCG Q G +P G+LGLG S+ S L G+ N C G
Sbjct: 220 KITQAPITFGCGQVQTGSFLGSAAP---NGLLGLGMDSKSVPSLLASQGVAANSFSMCFG 276
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSG 238
++G G + GD S+ TP L +Y + + GK+ K + + DSG
Sbjct: 277 EDGHGRINFGD--TGSADQLETP-LNIYKHNPYYNISIVGAMAGGKTFSTK-FSAVVDSG 332
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
S+ + +Y EI S + + K P D +LP + + G V+ P +S
Sbjct: 333 TSFTALSDPMYTEITSAFDKQV---KEKRNPADSSLPFEYCYTISSKGAVS----PPNIS 385
Query: 299 FTNRRNSVRLVVPPEAYL--VISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
T + SV V P + + S CL I+ N+IGE FM V++D
Sbjct: 386 LTAKGGSVFPVKDPIITITDISSSPVGYCLAIMKSEGV-----NLIGENFMSGLKVVFDR 440
Query: 357 EKQRIGWKPEDC 368
E+ +GWK +C
Sbjct: 441 ERLVLGWKSFNC 452
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 107/386 (27%), Positives = 163/386 (42%), Gaps = 46/386 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQC----DAPCTGCTKPPEKQYKPHKNI----V 67
Y+A + +G P K + DTGSD+ WV C + P T Y ++ V
Sbjct: 86 YYA-KVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLV 144
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV----F 123
PC C ++ C N C Y YGDG S+ G V D+ +G +
Sbjct: 145 PCDEEFCYEVNGGPLSGCT-ANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTSS 203
Query: 124 NVPLTFGCGYNQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNG 181
N + FGCG Q + GP S G+LG G+ S++SQL ++ + HC+ G NG
Sbjct: 204 NGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCLDGING 263
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADL----------KHYILGPAELLYSGKSCGLKDL 231
G+ +G P V TP++ N + ++ P E +G G
Sbjct: 264 GGIFAIGHVVQPK--VNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEFEAGDRKGA--- 318
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
I DSG + AY VY+ +VS I+ + + D+ T F+ G V +
Sbjct: 319 --IIDSGTTLAYLPEIVYEPLVSKIISQQPDLKVHIVRDEYTC-------FQYSGSVDDG 369
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENN--IIGEIFMQD 349
F + F NSV L V P YL C+G N N ++G++ + +
Sbjct: 370 FPNVTFHF---ENSVFLKVHPHEYL-FPFEGLWCIGWQNSGMQSRDRRNMTLLGDLVLSN 425
Query: 350 KMVIYDNEKQRIGWKPEDCNTLLSLN 375
K+V+YD E Q IGW +C++ + +
Sbjct: 426 KLVLYDLENQAIGWTEYNCSSSIKVQ 451
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 116/399 (29%), Positives = 171/399 (42%), Gaps = 71/399 (17%)
Query: 9 FFFPIFS--------YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQY 60
F PIFS YFAV + VG P + DTGSD+TW+QC APCT C K + +
Sbjct: 1 FEAPIFSGLAFGTGEYFAV-VGVGTPRRDMYLVVDTGSDITWLQC-APCTNCYKQKDALF 58
Query: 61 KPHKN----IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL- 115
P + ++ CS+ C L C +++C Y+ +YGDG ++G LVTD L
Sbjct: 59 NPSSSSSFKVLDCSSSLCLNLDVMG---CL--SNKCLYQADYGDGSFTMGELVTDNVVLD 113
Query: 116 -RFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIG 174
F G V + GCG++ N G AG+LGLGRG +S + L RN+
Sbjct: 114 DAFGPGQVVLTNIPLGCGHD--NEGTFGT--AAGILGLGRGPLSFPNNLDAS--TRNIFS 167
Query: 175 HCIGQ-----NGRGVLFLGDGKVPSSG---VAWTPMLQNSADLKHYILGPAELLYSGKSC 226
+C+ N + L GD +P + V + P L+N +Y + +G S
Sbjct: 168 YCLPDRESDPNHKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYY-----VQITGISV 222
Query: 227 GLKDLT----------------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD 270
G LT IFDSG + +R Y + + L A D
Sbjct: 223 GGNLLTNIPASVFQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAFRAATM--HLTSAAD 280
Query: 271 DKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGIL 329
K C+ F + ++ + F + V + +PP Y+V N+ C
Sbjct: 281 FKIFDTCYD--FTGMNSIS--VPTVTFHF---QGDVDMRLPPSNYIVPVSNNNIFCFAF- 332
Query: 330 NGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
A +G ++IG + Q VIYDN ++IG P+ C
Sbjct: 333 ---AASMGP-SVIGNVQQQSFRVIYDNVHKQIGLLPDQC 367
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 109/379 (28%), Positives = 161/379 (42%), Gaps = 53/379 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V++ +G PP DTGSDL W QCDAPC C P Y P ++ V C +P
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL P RC P+ C Y YGDG S+ G L T+ F L S+ +V V FGCG
Sbjct: 152 MCQALQSPW-SRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLG-SDTAVRGV--AFGCG 207
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLGD 189
N G S +++G++G+GRG +S+VSQL G+ R +C LFLG
Sbjct: 208 --TENLG--STDNSSGLVGMGRGPLSLVSQL---GVTR--FSYCFTPFNATAASPLFLGS 258
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCG---------------LKDLTLI 234
SS TP + + + L G + G + D +I
Sbjct: 259 SARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVI 318
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT-LPICWRGPFKALGQVTEYFK 293
DSG ++ R + + + + L LA L +C F A
Sbjct: 319 IDSGTTFTALEERAFVALARALASRV---RLPLASGAHLGLSLC----FAAASPEAVEVP 371
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMV 352
L L F +R E+Y+V V CLG+++ +++G + Q+ +
Sbjct: 372 RLVLHFDGADMELRR----ESYVVEDRSAGVACLGMVSARGM-----SVLGSMQQQNTHI 422
Query: 353 IYDNEKQRIGWKPEDCNTL 371
+YD E+ + ++P C L
Sbjct: 423 LYDLERGILSFEPAKCGEL 441
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 97/356 (27%), Positives = 152/356 (42%), Gaps = 52/356 (14%)
Query: 6 IEFFFFPIFSYFAVNL-----TVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQY 60
++F F F V L +G PP F+ DTGSD+ WV C++ C+GC + Q
Sbjct: 9 VDFSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNS-CSGCPQTSGLQI 67
Query: 61 K---------PHKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTD 111
+ +++ CS+ RC + C N+QC Y +YGDG + G V+D
Sbjct: 68 QLNFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSD 127
Query: 112 LFPLR-FSNGSVFN---VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLRE 165
+ L GSV P+ FGC Q G L+ D A G+ G G+ +S++SQL
Sbjct: 128 MMHLNTIFEGSVTTNSTAPVVFGCSNQQ--TGDLTKSDRAVDGIFGFGQQEMSVISQLSS 185
Query: 166 YGLIRNVIGHCI--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSG 223
G+ V HC+ +G G+L LG+ P+ + +T ++ HY L + +G
Sbjct: 186 QGIAPRVFSHCLKGDSSGGGILVLGEIVEPN--IVYTSLV---PAQPHYNLNLQSIAVNG 240
Query: 224 KSCGLKDLTL--------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP 275
++ + I DSG + AY Y VS I + P +
Sbjct: 241 QTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASI---PQSVHTAVSRGN 297
Query: 276 ICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLG 327
C+ VTE F ++L+F +++ P+ YL+ I G C+G
Sbjct: 298 QCYL----ITSSVTEVFPQVSLNFA---GGASMILRPQDYLIQQNSIGGAAVWCIG 346
>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 114/380 (30%), Positives = 166/380 (43%), Gaps = 53/380 (13%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ---------YKPHK 64
F ++A N+TVG P F DTGSDL W+ CD CT C + + Y P+
Sbjct: 102 FLHYA-NVTVGTPSDWFLVALDTGSDLFWLPCD--CTNCVRELKAPGGSSLDLNIYSPNA 158
Query: 65 NI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSN 119
+ VPC++ C RC P C Y+I Y +G SS G LV D+ L ++
Sbjct: 159 SSTSTKVPCNSTLCT-----RGDRCASPESNCPYQIRYLSNGTSSTGVLVEDVLHLVSND 213
Query: 120 GSVFNVP--LTFGCGYNQ----HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVI 173
S +P +T GCG Q H+ + P+ G+ GLG IS+ S L + G+ N
Sbjct: 214 KSSKAIPARVTLGCGQVQTGVFHDG---AAPN--GLFGLGLEDISVPSVLAKEGIAANSF 268
Query: 174 GHCIGQNGRGVLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT 232
C G +G G + GD G V TP L Y + ++ G + L +
Sbjct: 269 SMCFGNDGAGRISFGDKGSVDQRE---TP-LNIRQPHPTYNITVTKISVEGNTGDL-EFD 323
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI--CWRGPFKALGQVTE 290
+FDSG S+ Y T Y I + + + D LP C+ AL +
Sbjct: 324 AVFDSGTSFTYLTDAAYTLISESF--NSLALDKRYQTTDSELPFEYCY-----ALSPNKD 376
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDK 350
F+ A++ T + S V P + + CL IL ++ + +IIG+ FM
Sbjct: 377 SFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAIL-----KIEDISIIGQNFMTGY 431
Query: 351 MVIYDNEKQRIGWKPEDCNT 370
V++D EK +GWK DC T
Sbjct: 432 RVVFDREKLILGWKESDCYT 451
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 162/391 (41%), Gaps = 58/391 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
Y+A + +G P K + DTGSD+ WV C C C + +
Sbjct: 80 YYA-KIGIGTPAKSYYVQVDTGSDIMWVNC-IQCKQCPRRSTLGIELTLYNIDESDSGKL 137
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL---------FPLRF 117
V C + C + CK N C Y YGDG S+ G V D+ +
Sbjct: 138 VSCDDDFCYQISGGPLSGCK-ANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQT 196
Query: 118 SNGSVFNVPLTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+NGSV FGCG Q S + G+LG G+ S++SQL G ++ + HC
Sbjct: 197 ANGSVI-----FGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHC 251
Query: 177 I-GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADL----------KHYILGPAELLYSGKS 225
+ G+NG G+ + G+V V TP++ N + ++ PA+L G
Sbjct: 252 LDGRNGGGIFAI--GRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDR 309
Query: 226 CGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
G I DSG + AY +Y+ +V I + + D F+
Sbjct: 310 KG-----AIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYKC-------FQYS 357
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENN--IIG 343
G+V E F + F NSV L V P YL C+G N + N ++G
Sbjct: 358 GRVDEGFPNVTFHF---ENSVFLRVYPHDYL-FPHEGMWCIGWQNSAMQSRDRRNMTLLG 413
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDCNTLLSL 374
++ + +K+V+YD E Q IGW +C++ + +
Sbjct: 414 DLVLSNKLVLYDLENQLIGWTEYNCSSSIKV 444
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 105/383 (27%), Positives = 170/383 (44%), Gaps = 51/383 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C + +++P + V C N
Sbjct: 88 YYTTRLWIGSPPQEFALIVDTGSTVTYVPC-SNCVQCGNHQDPRFQPELSSTYQPVKC-N 145
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP--LTF 129
C C QC YE Y + +S G L D+ + F S VP F
Sbjct: 146 ADC---------NCDENGVQCTYERRYAEMSTSSGVLAEDV--MSFGKESEL-VPQRAVF 193
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD 189
GC G L G++GLGRG +S++ QL G++ N C G + +G
Sbjct: 194 GC--ETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGG-----MDVGG 246
Query: 190 GKVPSSGVAWTP-MLQNSADLK---HYILGPAELLYSGKSCGLKDLTL------IFDSGA 239
G + G++ P M+ + +D +Y + E+ +GK L T I DSG
Sbjct: 247 GAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGT 306
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
+YAYF + Y IM+ + PD IC+ G + + ++ + F + + F
Sbjct: 307 TYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVF 366
Query: 300 TNRRNSVRLVVPPEAYLV----ISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIY 354
N + ++ + PE YL +SG CLGI NG++ + ++G I +++ +V Y
Sbjct: 367 ANGQ---KISLSPENYLFRHTKVSGA--YCLGIFKNGND----QTTLLGGIIVRNTLVTY 417
Query: 355 DNEKQRIGWKPEDCNTLLSLNHF 377
+ E IG+ +C+ L H+
Sbjct: 418 NRENSTIGFWKTNCSELWKNLHY 440
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 113/376 (30%), Positives = 163/376 (43%), Gaps = 55/376 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P + F FDTGSDLTW QC+ PC G C + E + P ++ V C +
Sbjct: 147 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCE-PCVGYCYQQREHIFDPSTSLSYSNVSCDS 205
Query: 72 PRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
P C L N P C + C Y I YGDG SIG + L ++ VFN F
Sbjct: 206 PSCEKLESATGNSPGCS--SSTCLYGIRYGDGSYSIGFFARE--KLSLTSTDVFN-NFQF 260
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI--GQNGRGVLF 186
GCG Q+N G TAG+LGL R +S+VSQ ++YG V +C+ + G L
Sbjct: 261 GCG--QNNRGLFG--GTAGLLGLARNPLSLVSQTAQKYG---KVFSYCLPSSSSSTGYLS 313
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----------IFD 236
G G S V +TP NS Y L G S G + L + I D
Sbjct: 314 FGSGDGDSKAVKFTPSEVNSDYPSFYFLDMV-----GISVGERKLPIPKSVFSTAGTIID 368
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR-GPFKALG--QVTEYFK 293
SG + VY V + R+L+ ++ L C+ +K + ++ YF
Sbjct: 369 SGTVISRLPPTVYSS-VQKVFRELMSDYPRVK-GVSILDTCYDLSKYKTVKVPKIILYFS 426
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVI 353
+ + PE + + VCL S+ + E IIG + + V+
Sbjct: 427 ----------GGAEMDLAPEGIIYVLKVSQVCLAFAGNSDDD--EVAIIGNVQQKTIHVV 474
Query: 354 YDNEKQRIGWKPEDCN 369
YD+ + R+G+ P CN
Sbjct: 475 YDDAEGRVGFAPSGCN 490
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 105/381 (27%), Positives = 168/381 (44%), Gaps = 47/381 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C + +++P + V C N
Sbjct: 88 YYTTRLWIGSPPQEFALIVDTGSTVTYVPC-SNCVQCGNHQDPRFQPELSSTYQPVKC-N 145
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP--LTF 129
C C QC YE Y + +S G L D+ + F S VP F
Sbjct: 146 ADC---------NCDENGVQCTYERRYAEMSTSSGVLAEDV--MSFGKESEL-VPQRAVF 193
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFL 187
GC G L G++GLGRG +S++ QL G++ N C G G G + L
Sbjct: 194 GC--ETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVL 251
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASY 241
G P G+ ++ + + +Y + E+ +GK L T I DSG +Y
Sbjct: 252 GGISSPP-GMVFSH--SDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTY 308
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
AYF + Y IM+ + PD IC+ G + + ++ + F + + F N
Sbjct: 309 AYFPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFAN 368
Query: 302 RRNSVRLVVPPEAYLV----ISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDN 356
+ ++ + PE YL +SG CLGI NG++ + ++G I +++ +V Y+
Sbjct: 369 GQ---KISLSPENYLFRHTKVSGA--YCLGIFKNGND----QTTLLGGIIVRNTLVTYNR 419
Query: 357 EKQRIGWKPEDCNTLLSLNHF 377
E IG+ +C+ L H+
Sbjct: 420 ENSTIGFWKTNCSELWKNLHY 440
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 171/382 (44%), Gaps = 53/382 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ +++G P K+F DTGSDL W+QC PC C + + P + + C +
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIWIQC-KPCQACFNQKDPIFDPEGSSSYTTMSCGDT 98
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C +L P + PN CDY YGDG + G L ++ L + G + FGC
Sbjct: 99 LCDSL----PRKSCSPN--CDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGC 152
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNGRGVLF 186
G+ N G + D +G++GLGRG +S VSQL + L + +C+ + +F
Sbjct: 153 GH--LNRGSFN--DASGLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDAPSKTSPMF 206
Query: 187 LGD-GKVPSSG----VAWTPMLQNSADLKHYILGPAELLYSGKS----CGLKDLT----- 232
GD SSG A+TPM+ N A Y + ++ +G++ G D+
Sbjct: 207 FGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSG 266
Query: 233 -LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
+IFDSG + YQ IV +R + P ++ L +C + G Y
Sbjct: 267 GMIFDSGTTLTLLPDAPYQ-IVLRALRSKVSFP-EIDGSSAGLDLC----YDVSGSKASY 320
Query: 292 FKPL-ALSFTNRRNSVRLVVPPEAYLVISGRKN--VCLGILNGSEAEVGENNIIGEIFMQ 348
K + A+ F +L P E Y + + VCL +++ S ++G I G + Q
Sbjct: 321 KKKIPAMVFHFEGADHQL--PVENYFIAANDAGTIVCLAMVS-SNMDIG---IYGNMMQQ 374
Query: 349 DKMVIYDNEKQRIGWKPEDCNT 370
+ V+YD +IGW P C++
Sbjct: 375 NFRVMYDIGSSKIGWAPSQCDS 396
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 104/380 (27%), Positives = 167/380 (43%), Gaps = 41/380 (10%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VP 68
I Y+ L +G PP+ F DTGS +T+V C + C C + + +++P + V
Sbjct: 85 INGYYTTRLWIGTPPQRFALIVDTGSTVTYVPC-STCEHCGRHQDPKFQPDLSETYQPVK 143
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPL 127
C+ P C C +QC Y+ +Y + SS G L D+ + F N S
Sbjct: 144 CT-PDC---------NCDGDTNQCMYDRQYAEMSSSSGVLGEDV--VSFGNLSELAPQRA 191
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVL 185
FGC ++ G L G++GLGRG +SI+ QL + +I + C G G G +
Sbjct: 192 VFGCENDE--TGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAM 249
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGA 239
LG G P + +T + + +Y + E+ +GK L + DSG
Sbjct: 250 ILG-GISPPEDMVFTHSDPDRS--PYYNINLKEMHVAGKKLQLNPKVFDGKHGTVLDSGT 306
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
+YAY + IM++ PD IC+ G + Q+ + F + + F
Sbjct: 307 TYAYLPETAFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVF 366
Query: 300 TNRRNSVRLVVPPEAYLVISG--RKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDN 356
N +L + PE YL R CLG+ NG + ++G IF+++ +V+YD
Sbjct: 367 ENGH---KLSLSPENYLFRHSKVRGAYCLGVFSNGRDP----TTLLGGIFVRNTLVMYDR 419
Query: 357 EKQRIGWKPEDCNTLLSLNH 376
E +IG+ +C+ L H
Sbjct: 420 ENSKIGFWKTNCSELWETLH 439
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 170/382 (44%), Gaps = 53/382 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ +++G P K+F DTGSDL W+QC PC C + + P + + C +
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIWIQC-KPCQACFNQKDPIFDPEGSSSYTTMSCGDT 98
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C +L PR K + CDY YGDG + G L ++ L + G + FGC
Sbjct: 99 LCDSL-----PR-KSCSPDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGC 152
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNGRGVLF 186
G+ N G + D +G++GLGRG +S VSQL + L + +C+ + +F
Sbjct: 153 GH--LNRGSFN--DASGLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDAPSKTSPMF 206
Query: 187 LGD-GKVPSSG----VAWTPMLQNSADLKHYILGPAELLYSGKS----CGLKDLT----- 232
GD SSG A+TPM+ N A Y + ++ +G++ G D+
Sbjct: 207 FGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSG 266
Query: 233 -LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
+IFDSG + YQ IV +R I P K+ L +C + G Y
Sbjct: 267 GMIFDSGTTLTLLPDAPYQ-IVLRALRSKISFP-KIDGSSAGLDLC----YDVSGSKASY 320
Query: 292 -FKPLALSFTNRRNSVRLVVPPEAYLVISGRKN--VCLGILNGSEAEVGENNIIGEIFMQ 348
K A+ F +L P E Y + + VCL +++ S ++G I G + Q
Sbjct: 321 KMKIPAMVFHFEGADYQL--PVENYFIAANDAGTIVCLAMVS-SNMDIG---IYGNMMQQ 374
Query: 349 DKMVIYDNEKQRIGWKPEDCNT 370
+ V+YD +IGW P C++
Sbjct: 375 NFRVMYDIGSSKIGWAPSQCDS 396
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 162/391 (41%), Gaps = 58/391 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
Y+A + +G P K + DTGSD+ WV C C C + +
Sbjct: 80 YYA-KIGIGTPAKSYYVQVDTGSDIMWVNC-IQCKQCPRRSTLGIELTLYNIDESDSGKL 137
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL---------FPLRF 117
V C + C + CK N C Y YGDG S+ G V D+ +
Sbjct: 138 VSCDDDFCYQISGGPLSGCK-ANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQT 196
Query: 118 SNGSVFNVPLTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+NGSV FGCG Q S + G+LG G+ S++SQL G ++ + HC
Sbjct: 197 ANGSVI-----FGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHC 251
Query: 177 I-GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADL----------KHYILGPAELLYSGKS 225
+ G+NG G+ + G+V V TP++ N + ++ PA+L G
Sbjct: 252 LDGRNGGGIFAI--GRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLFQPGDR 309
Query: 226 CGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
G I DSG + AY +Y+ +V I + + D F+
Sbjct: 310 KG-----AIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYKC-------FQYS 357
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENN--IIG 343
G+V E F + F NSV L V P YL C+G N + N ++G
Sbjct: 358 GRVDEGFPNVTFHF---ENSVFLRVYPHDYL-FPYEGMWCIGWQNSAMQSRDRRNMTLLG 413
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDCNTLLSL 374
++ + +K+V+YD E Q IGW +C++ + +
Sbjct: 414 DLVLSNKLVLYDLENQLIGWTEYNCSSSIKV 444
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 103/377 (27%), Positives = 169/377 (44%), Gaps = 51/377 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F D+GS +T+V C A C C + +++P + V C N
Sbjct: 87 YYTTRLHIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQPDLSSTYSPVKC-N 144
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
C C +QC YE +Y + SS G L D+ + F S FG
Sbjct: 145 VDCT---------CDSDKNQCTYERQYAEMSSSSGVLGEDI--VSFGTESELKPQRAVFG 193
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C G L G++GLGRG++SI+ QL + G+I + C G G G + LG
Sbjct: 194 C--ENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLG 251
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYA 242
P G+ +T N+ +Y + E+ +GK+ + + DSG +YA
Sbjct: 252 AMPAP-PGMIYT--HSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYA 308
Query: 243 YFTSRVYQEIVSLIMRDLIGT---PLK--LAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
Y + + + +D + + PLK PD IC+ G + + Q++E F + +
Sbjct: 309 YLPEQAF-----VAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKVDM 363
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKN--VCLGIL-NGSEAEVGENNIIGEIFMQDKMVIY 354
F N + +L + PE YL + CLG+ NG + ++G I +++ +V Y
Sbjct: 364 VFGNGQ---KLSLSPENYLFRHSKVEGAYCLGVFQNGKDP----TTLLGGIVVRNTLVTY 416
Query: 355 DNEKQRIGWKPEDCNTL 371
D ++IG+ +C+ L
Sbjct: 417 DRHNEKIGFWKTNCSEL 433
>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
Length = 473
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 110/378 (29%), Positives = 162/378 (42%), Gaps = 49/378 (12%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ---------YKPHKNI--- 66
N+TVG P F DTGSDL W+ CD CT C + + Y P+ +
Sbjct: 57 ANVTVGTPSDWFMVALDTGSDLFWLPCD--CTNCVRELKAPGGSSLDLNIYSPNASSTST 114
Query: 67 -VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSVFN 124
VPC++ C RC P C Y+I Y +G SS G LV D+ L ++ S
Sbjct: 115 KVPCNSTLCT-----RGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKA 169
Query: 125 VP--LTFGCGYNQ----HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
+P +TFGCG Q H+ + P+ G+ GLG IS+ S L + G+ N C G
Sbjct: 170 IPARVTFGCGQVQTGVFHDG---AAPN--GLFGLGLEDISVPSVLAKEGIAANSFSMCFG 224
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSG 238
+G G + GD S TP+ + I + G + G + +FDSG
Sbjct: 225 NDGAGRISFGDKG--SVDQRETPLNIRQPHPTYNI--TVTKISVGGNTGDLEFDAVFDSG 280
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI--CW--RGPFKALGQV--TEYF 292
S+ Y T Y I + + + D LP C+ R P + + F
Sbjct: 281 TSFTYLTDAAYTLISESF--NSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPNKDSF 338
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
+ A++ T + S V P + + CL I+ ++ + +IIG+ FM V
Sbjct: 339 QYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAIM-----KIEDISIIGQNFMTGYRV 393
Query: 353 IYDNEKQRIGWKPEDCNT 370
++D EK +GWK DC T
Sbjct: 394 VFDREKLILGWKESDCYT 411
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 112/379 (29%), Positives = 161/379 (42%), Gaps = 53/379 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V++ +G PP DTGSDL W QCDAPC C P Y P ++ V C +P
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL P RC P+ C Y YGDG S+ G L T+ F L S+ +V V FGCG
Sbjct: 152 MCQALQSPW-SRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLG-SDTAVRGV--AFGCG 207
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLGD 189
N G S +++G++G+GRG +S+VSQL G+ R +C LFLG
Sbjct: 208 --TENLG--STDNSSGLVGMGRGPLSLVSQL---GVTR--FSYCFTPFNATAASPLFLGS 258
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCG---------------LKDLTLI 234
SS TP + + + L G + G + D +I
Sbjct: 259 SARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVI 318
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT-LPICWRGPFKALGQVTEYFK 293
DSG + FT+ V+L L LA L +C F A
Sbjct: 319 IDSGTT---FTALEESAFVALARALASRVRLPLASGAHLGLSLC----FAAASPEAVEVP 371
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMV 352
L L F +R E+Y+V V CLG+++ +++G + Q+ +
Sbjct: 372 RLVLHFDGADMELRR----ESYVVEDRSAGVACLGMVSARGM-----SVLGSMQQQNTHI 422
Query: 353 IYDNEKQRIGWKPEDCNTL 371
+YD E+ + ++P C L
Sbjct: 423 LYDLERGILSFEPAKCGEL 441
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 164/376 (43%), Gaps = 49/376 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK------PPEKQ--YKPHKNIV 67
Y+ L +G PP++F DTGS +T+V C + C C + PE Y+P K +
Sbjct: 83 YYTTRLWIGTPPQMFALIVDTGSTVTYVPC-STCEQCGRHQDPKFQPESSSTYQPVKCTI 141
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VP 126
C+ C QC YE +Y + +S G L DL + F N S
Sbjct: 142 DCN--------------CDSDRMQCVYERQYAEMSTSSGVLGEDL--ISFGNQSELAPQR 185
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGV 184
FGC G L G++GLGRG +SI+ QL + +I + C G G G
Sbjct: 186 AVFGC--ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVGGGA 243
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSG 238
+ LG G P S +A+ + +Y + E+ +GK L + DSG
Sbjct: 244 MVLG-GISPPSDMAFA--YSDPVRSPYYNIDLKEIHVAGKRLPLNANVFDGKHGTVLDSG 300
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+YAY + I+++L PD IC+ G + Q+++ F + +
Sbjct: 301 TTYAYLPEAAFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMV 360
Query: 299 FTNRRNSVRLVVPPEAYLVISG--RKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYD 355
F N + + + PE Y+ R CLG+ NG++ + ++G I +++ +V+YD
Sbjct: 361 FENGQ---KYTLSPENYMFRHSKVRGAYCLGVFQNGND----QTTLLGGIIVRNTLVVYD 413
Query: 356 NEKQRIGWKPEDCNTL 371
E+ +IG+ +C L
Sbjct: 414 REQTKIGFWKTNCAEL 429
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 102/384 (26%), Positives = 164/384 (42%), Gaps = 53/384 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH---------KNIV 67
+ + +G PP+ DTGSD+ WV C + C GC + Q + + +++
Sbjct: 77 YYTKVKLGTPPRELYVQIDTGSDVLWVSCGS-CNGCPQTSGLQIQLNYFDPGSSSTSSLI 135
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
C + RC + + C N+QC Y +YGDG + G V+DL S+F L
Sbjct: 136 SCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHF----ASIFEGTL 191
Query: 128 T--------FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-G 178
T FGC Q S G+ G G+ +S++SQL G+ V HC+ G
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKG 251
Query: 179 QN-GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL--------K 229
N G GVL LG+ P+ + ++P++ + HY L + +G+ +
Sbjct: 252 DNSGGGVLVLGEIVEPN--IVYSPLVPSQ---PHYNLNLQSISVNGQIVRIAPSVFATSN 306
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
+ I DSG + AY Y V I + P + C+
Sbjct: 307 NRGTIVDSGTTLAYLAEEAYNPFVIAIAAVI---PQSVRSVLSRGNQCY---LITTSSNV 360
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVIS---GRKNV-CLGILNGSEAEVGENNIIGEI 345
+ F ++L+F LV+ P+ YL+ G +V C+G S + I+G++
Sbjct: 361 DIFPQVSLNFA---GGASLVLRPQDYLMQQNFIGEGSVWCIGFQKISGQSI---TILGDL 414
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCN 369
++DK+ +YD QRIGW DC+
Sbjct: 415 VLKDKIFVYDLAGQRIGWANYDCS 438
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 105/383 (27%), Positives = 166/383 (43%), Gaps = 51/383 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH---------KNIV 67
+ + +G PP+ F DTGSD+ WV C + C GC + Q + + +++
Sbjct: 77 YYTKVKLGTPPREFYVQIDTGSDVLWVSCGS-CNGCPQTSGLQIQLNYFDPRSSSTSSLI 135
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSVFN- 124
CS+ RC + + C N+QC Y +YGDG + G V+DL F F N
Sbjct: 136 SCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGTLTTNS 195
Query: 125 -VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN-G 181
+ FGC Q S G+ G G+ +S++SQL G+ V HC+ G N G
Sbjct: 196 SASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCLKGDNSG 255
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL--------KDLTL 233
GVL LG+ P+ + ++P++Q+ HY L + +G+ + +
Sbjct: 256 GGVLVLGEIVEPN--IVYSPLVQSQ---PHYNLNLQSISVNGQIVPIAPAVFATSNNRGT 310
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP---FKALGQVTE 290
I DSG + AY Y V+ I L P + RG +
Sbjct: 311 IVDSGTTLAYLAEEAYNPFVNAIT--------ALVP-QSVRSVLSRGNQCYLITTSSNVD 361
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVIS---GRKNV-CLGILNGSEAEVGENNIIGEIF 346
F ++L+F LV+ P+ YL+ G +V C+G + I+G++
Sbjct: 362 IFPQVSLNFA---GGASLVLRPQDYLMQQNYIGEGSVWCIGFQRIPGQSI---TILGDLV 415
Query: 347 MQDKMVIYDNEKQRIGWKPEDCN 369
++DK+ +YD QRIGW DC+
Sbjct: 416 LKDKIFVYDLAGQRIGWANYDCS 438
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 104/380 (27%), Positives = 158/380 (41%), Gaps = 45/380 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ------YKPHKN---- 65
Y+ + +G P + F DTGS +T+V PC+ CT Q +KP +
Sbjct: 98 YYTSRVFIGTPAQEFALIVDTGSTVTYV----PCSSCTHCGHHQACFDPRFKPDNSSSYQ 153
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN- 124
V C++P C C QC YE Y + SS G L DL L F NGS
Sbjct: 154 TVSCNSPDCIT------KMCDARVHQCKYERVYAEMSSSKGVLGKDL--LGFGNGSRLQP 205
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGR 182
PL FGC G L G++GLGRG +SIV QL G + + C G G
Sbjct: 206 HPLLFGC--ETAETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGG 263
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD------LTLIFD 236
G + LG P + + N ++ +Y L +E+ G S + L + D
Sbjct: 264 GSMVLG-AIPPPPAMVFAKSDPNRSN--YYNLELSEIQVQGVSLNVPSEVFNGRLGTVLD 320
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
SG +YAY + + I + L PD +C+ G + ++F P+
Sbjct: 321 SGTTYAYLPDKAFDAFKDAITQQLGSLQAVPGPDPSYPDVCFAGAGSDSKALGKHFPPVD 380
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGR--KNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
F+ + ++ + PE YL + CLG +A ++G I +++ +V Y
Sbjct: 381 FVFSGNQ---KVFLAPENYLFKHTKVPGAYCLGFFKNQDA----TTLLGGIVVRNTLVTY 433
Query: 355 DNEKQRIGWKPEDCNTLLSL 374
D +IG+ +C L S+
Sbjct: 434 DRANHQIGFFKTNCTNLWSI 453
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 108/390 (27%), Positives = 162/390 (41%), Gaps = 50/390 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
Y+A + +G P + + DTGSD+ WV C C C K + + +
Sbjct: 98 YYA-KIGIGTPARDYYVQVDTGSDIMWVNC-IQCNECPKKSSLGMELTLYDIKESLTGKL 155
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV---- 122
V C C A++ P C N C Y Y DG SS G V D+ +G +
Sbjct: 156 VSCDQDFCYAINGGPPSYCI-ANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTS 214
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
N + FGC Q G LS + G+LG G+ S++SQL G +R + HC+ G N
Sbjct: 215 ANGSVIFGCSATQ--SGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLN 272
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQN---------SADLKHYILG-PAELLYSGKSCGLKD 230
G G+ +G P V TP++ N + ++ Y L P ++ G G
Sbjct: 273 GGGIFAIGHIVQPK--VNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKG--- 327
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG + AY VY +++S I + D T F+ + +
Sbjct: 328 --TIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFTC-------FQYSESLDD 378
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNI--IGEIFMQ 348
F + F NS+ L V P YL S C+G N NI +G++ +
Sbjct: 379 GFPAVTFHF---ENSLYLKVHPHEYL-FSYDGLWCIGWQNSGMQSRDRRNITLLGDLALS 434
Query: 349 DKMVIYDNEKQRIGWKPEDCNTLLSLNHFI 378
+K+V+YD E Q IGW +C + + F+
Sbjct: 435 NKLVLYDLENQVIGWTEYNCKYHVIFSSFL 464
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 106/383 (27%), Positives = 169/383 (44%), Gaps = 42/383 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC-TKPP--------EKQYKPHKNI 66
Y+A + +G PPK + DTGSD+ WV C C C T+ + + +
Sbjct: 83 YYA-KIGIGTPPKNYYLQVDTGSDIMWVNC-IQCKECPTRSSLGMDLTLYDIKESSSGKL 140
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV---- 122
VPC C ++ C N C Y YGDG S+ G V D+ +G +
Sbjct: 141 VPCDQEFCKEINGGLLTGCT-ANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDS 199
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTA---GVLGLGRGRISIVSQLREYGLIRNVIGHCI-G 178
N + FGCG Q G LS + G+LG G+ S++SQL G ++ + HC+ G
Sbjct: 200 ANGSIVFGCGARQ--SGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCLNG 257
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILGPAELLYSGKSCGLKDLT-L 233
NG G+ +G P V TP+L + S ++ +G L S + D
Sbjct: 258 VNGGGIFAIGHVVQPK--VNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGDRKGT 315
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
I DSG + AY +Y+ +V ++ ++ D+ T F+ V + F
Sbjct: 316 IIDSGTTLAYLPEGIYEPLVYKMISQHPDLKVQTLHDEYTC-------FQYSESVDDGFP 368
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILN-GSEAEVGEN-NIIGEIFMQDKM 351
+ F N + L V P YL S C+G N G+++ +N ++G++ + +K+
Sbjct: 369 AVTFFF---ENGLSLKVYPHDYLFPS-VNFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKL 424
Query: 352 VIYDNEKQRIGWKPEDCNTLLSL 374
V YD E Q IGW +C++ + +
Sbjct: 425 VFYDLENQAIGWAEYNCSSSIKV 447
>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
Length = 632
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 100/370 (27%), Positives = 152/370 (41%), Gaps = 46/370 (12%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP---------------HKN 65
+ +G P F D+GSDL W+ C+ C C Y
Sbjct: 101 IDIGTPSVSFLVALDSGSDLLWIPCN--CVQCAPLSSAYYSSLATKDLNEFDPSASTTSK 158
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYG-DGGSSIGALVTDLFPLRFSNGSVFN 124
+ PCS+ C + P C+ P +QC Y + Y + SS G LV D+ L +S + +
Sbjct: 159 VFPCSHKLCE-----SAPACESPKEQCPYTVTYASENTSSSGLLVEDVLHLAYSANASSS 213
Query: 125 VP--LTFGCGYNQHNPGPLS-PPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
V + GCG Q PD GV+GLG G IS+ S L + GL+RN C +
Sbjct: 214 VKARVVVGCGEKQSGEFLKGIAPD--GVMGLGPGEISVPSFLAKAGLMRNSFSMCFDEED 271
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGAS 240
G ++ GD V S T L + Y +G E+ G SC T + DSG S
Sbjct: 272 SGRIYFGD--VGPSTQQSTRFLPYKNEFVAYFVG-VEVCCVGNSCLKQSSFTTLIDSGQS 328
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
+ + +Y+E+ I + T K+ GP++ + + K A+
Sbjct: 329 FTFLPEEIYREVALEIDSHINATVKKIE----------GGPWEYCYETSFEPKVPAIKLK 378
Query: 301 NRRNSVRLVVPPEAYLVIS-GRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
N+ ++ P L S G CL I S +E G +IG+ +M +++D E
Sbjct: 379 FSSNNTFVIHKPLFVLQRSEGLVQFCLPI---SASEEGTGGVIGQNYMAGYRIVFDRENM 435
Query: 360 RIGWKPEDCN 369
++GW C
Sbjct: 436 KLGWSASKCQ 445
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 165/380 (43%), Gaps = 45/380 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F D+GS +T+V C A C C + +++P + V C N
Sbjct: 88 YYTTRLYIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQPDLSSSYSPVKC-N 145
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
C C QC YE +Y + SS G L D+ + F S FG
Sbjct: 146 VDCT---------CDSDKKQCTYERQYAEMSSSSGVLGEDI--VSFGRESELKPQRAVFG 194
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C ++ G L G++GLGRG++SI+ QL E G+I + C G G G + LG
Sbjct: 195 CENSE--TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLG 252
Query: 189 DGKVPSSGVAWTPMLQNSADLK--HYILGPAELLYSGKSCGLKDLTL------IFDSGAS 240
PS V +S L+ +Y + E+ +GK+ + + DSG +
Sbjct: 253 GVPAPSDMV-----FSHSDPLRSPYYNIELKEIHVAGKALRVDSRVFNSKHGTVLDSGTT 307
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
YAY + + + + PD IC+ G + + ++ E F + + F
Sbjct: 308 YAYLPEQAFVAFKDAVTSKVHSLKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDVDMVFG 367
Query: 301 NRRNSVRLVVPPEAYLVISGRKN--VCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNE 357
N + +L + PE YL + + CLG+ NG + ++G I +++ +V YD
Sbjct: 368 NGQ---KLSLTPENYLFRHSKVDGAYCLGVFQNGKDP----TTLLGGIIVRNTLVTYDRH 420
Query: 358 KQRIGWKPEDCNTLLSLNHF 377
++IG+ +C+ L H
Sbjct: 421 NEKIGFWKTNCSELWERLHI 440
>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 530
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 112/378 (29%), Positives = 164/378 (43%), Gaps = 54/378 (14%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPE------KQYKPHKN 65
F ++AV + +G P F DTGSDL WV CD C C P+ Y P K+
Sbjct: 97 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CIKCAPLASPDYGDLKFDMYSPRKS 153
Query: 66 I----VPCSNPRCAALHWPNP-PRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSN 119
VPCS+ C +P C ++ C Y I+Y + SS G LV D+ L +
Sbjct: 154 STSRKVPCSSSLC------DPQADCSAASNSCPYSIQYLSENTSSKGVLVEDVLYLTTES 207
Query: 120 GS--VFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGH 175
G + P+TFGCG Q G +P G+LGLG S+ S L G+ N
Sbjct: 208 GQSKITQAPITFGCGQVQSGSFLGSAAP---NGLLGLGMDSKSVPSLLASKGIAANSFSM 264
Query: 176 CIGQNGRGVLFLGDGKVPSSGVAWTPM---LQNSADLKHYILGPAELLYSGKSCGLKDLT 232
C G++G G + GD SS TP+ QN +Y + + GKS K +
Sbjct: 265 CFGEDGHGRINFGD--TGSSDQLETPLNIYKQN----PYYNISITGAMVGGKSFDTK-FS 317
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
+ DSG S+ + +Y EI S + + L D ++P + A G V
Sbjct: 318 AVVDSGTSFTALSDPMYTEITSTFNAQVKESRKHL---DASMPFEYCYSISAQGAV---- 370
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNV--CLGILNGSEAEVGENNIIGEIFMQDK 350
P +S T + S+ V P + + + + CL I+ N+IGE FM
Sbjct: 371 NPPNISLTAKGGSIFPVNGPIITITDTSSRPIAYCLAIMKSEGV-----NLIGENFMSGL 425
Query: 351 MVIYDNEKQRIGWKPEDC 368
+++D E+ +GWK +C
Sbjct: 426 KIVFDRERLVLGWKTFNC 443
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 109/396 (27%), Positives = 158/396 (39%), Gaps = 79/396 (19%)
Query: 2 YVSWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-- 59
+V W+ +F I +G P K + DTGSD+ WV C GC K P K
Sbjct: 20 FVHWLSLYFAKI--------GLGNPSKDYYVQVDTGSDILWVNC----IGCDKCPTKSDL 67
Query: 60 ------YKPHKNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALV 109
Y P ++ V C + C + + P CK C Y + YGDG S+ G V
Sbjct: 68 GIKLTLYDPASSVSATRVSCDDDFCTSTYNGLLPDCKKEL-PCQYNVVYGDGSSTAGYFV 126
Query: 110 TDLFPLRFSNGSV----FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE 165
+D G++ N +TFGCG Q S G+LG
Sbjct: 127 SDAVQFERVTGNLQTGLSNGTVTFGCGAQQSGGLGTSGEALDGILG-------------- 172
Query: 166 YGLIRNVIGHCIGQ-NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILG--------- 215
HC+ NG G+ +G+ P V TPM+ N A Y+
Sbjct: 173 ------AFAHCLDNVNGGGIFAIGELVSPK--VNTTPMVPNQAHYNVYMKEIEVGGTVLE 224
Query: 216 -PAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL 274
P ++ SG G I DSG + AY VY +++ I G L +
Sbjct: 225 LPTDVFDSGDRRGT-----IIDSGTTLAYLPEVVYDSMMNEIRSQQPGLSLHTVEEQF-- 277
Query: 275 PICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-E 333
IC FK G V + F + F ++S+ L V P YL C G NG +
Sbjct: 278 -IC----FKYSGNVDDGFPDIKFHF---KDSLTLTVYPHDYLFQISEDIWCFGWQNGGMQ 329
Query: 334 AEVGEN-NIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
++ G + ++G++ + +K+V+YD E Q IGW +C
Sbjct: 330 SKDGRDMTLLGDLVLSNKLVLYDIENQAIGWTEYNC 365
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 103/375 (27%), Positives = 163/375 (43%), Gaps = 41/375 (10%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH----KNIVP 68
I Y+ L +G PP+ F DTGS +T+V C + C C + + +++P V
Sbjct: 9 INGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSS-CEQCGRHQDPKFQPDLSSTYQSVK 67
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPL 127
C N C C QC YE +Y + +S G L D+ + F N S
Sbjct: 68 C-NIDC---------NCDDEKQQCVYERQYAEMSTSSGVLGEDI--ISFGNLSALAPQRA 115
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL 185
FGC G L G++G+GRG +SIV L + G+I + C G G +
Sbjct: 116 VFGC--ENMETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAM 173
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGA 239
LG G P S + ++ + +Y + E+ +GK L I DSG
Sbjct: 174 VLG-GISPPSNMVFSQ--SDPVRSPYYNIDLKEIHVAGKPLPLNPTVFDGKHGTILDSGT 230
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
+YAY + IM++L PD IC+ G + Q++ F + + F
Sbjct: 231 TYAYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSSSFPAVEMVF 290
Query: 300 TNRRNSVRLVVPPEAYLVISGRKN--VCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDN 356
N + +L++ PE YL + + CLGI NG + ++G I +++ +V+YD
Sbjct: 291 GNGQ---KLLLSPENYLFRHSKVHGAYCLGIFQNGKDP----TTLLGGIVVRNTLVLYDR 343
Query: 357 EKQRIGWKPEDCNTL 371
E +IG+ +C+ L
Sbjct: 344 ENSKIGFWKTNCSEL 358
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 107/386 (27%), Positives = 162/386 (41%), Gaps = 50/386 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
Y+A + +G P + + DTGSD+ WV C C C K + + +
Sbjct: 98 YYA-KIGIGTPARDYYVQVDTGSDIMWVNC-IQCNECPKKSSLGMELTLYDIKESLTGKL 155
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV---- 122
V C C A++ P C N C Y Y DG SS G V D+ +G +
Sbjct: 156 VSCDQDFCYAINGGPPSYCI-ANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTS 214
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
N + FGC Q G LS + G+LG G+ S++SQL G +R + HC+ G N
Sbjct: 215 ANGSVIFGCSATQ--SGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLN 272
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQN---------SADLKHYILG-PAELLYSGKSCGLKD 230
G G+ +G P V TP++ N + ++ Y L P ++ G G
Sbjct: 273 GGGIFAIGHIVQPK--VNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKG--- 327
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG + AY VY +++S I + D T F+ + +
Sbjct: 328 --TIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFTC-------FQYSESLDD 378
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNI--IGEIFMQ 348
F + F NS+ L V P YL S C+G N NI +G++ +
Sbjct: 379 GFPAVTFHF---ENSLYLKVHPHEYL-FSYDGLWCIGWQNSGMQSRDRRNITLLGDLALS 434
Query: 349 DKMVIYDNEKQRIGWKPEDCNTLLSL 374
+K+V+YD E Q IGW +C++ + +
Sbjct: 435 NKLVLYDLENQVIGWTEYNCSSSIKV 460
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 107/379 (28%), Positives = 167/379 (44%), Gaps = 48/379 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ +++ +G PP+ + DTGSDL W QC APC C P + P ++ +PC++P
Sbjct: 89 YLMSMGIGTPPRYYSAILDTGSDLIWTQC-APCMLCVDQPTPFFDPAQSPSYAKLPCNSP 147
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL++P R + C Y+ YGD ++ G L + F ++ V + FGCG
Sbjct: 148 MCNALYYPLCYR-----NVCVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRIAFGCG 202
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQL---REYGLIRNVIGHCIGQNGRGVLFLGD 189
N G L + +G++G GRG +S+VSQL R + + + + G +
Sbjct: 203 --NLNAGSLF--NGSGMVGFGRGPLSLVSQLGSPRFSYCLTSFMSPVPSRLYFGAYATLN 258
Query: 190 GKVPSSG--VAWTPMLQNSADLKHYILGPAELLYSGK---------SCGLKDLT--LIFD 236
S+G V TP + N Y L + G+ + D T +I D
Sbjct: 259 STSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGTGGVIID 318
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTLPIC--WRGPFKALGQVTEYFK 293
SG++ Y Y ++V D +G PL A L C W P + + + E
Sbjct: 319 SGSTITYLARAAY-DMVHQAFADQVGLPLTNATSLADVLDTCFVWPPPPRKIVTMPE--- 374
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRK-NVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
LA F + +P E Y++I G N+CL I A + +IIG Q+ V
Sbjct: 375 -LAFHF----EGANMELPLENYMLIDGDTGNLCLAI-----AASDDGSIIGSFQHQNFHV 424
Query: 353 IYDNEKQRIGWKPEDCNTL 371
+YDNE + + P CN +
Sbjct: 425 LYDNENSLLSFTPATCNVM 443
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 103/377 (27%), Positives = 169/377 (44%), Gaps = 51/377 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F D+GS +T+V C A C C + +++P + V C N
Sbjct: 87 YYTTRLHIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQPDLSSTYSPVKC-N 144
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
C C +QC YE +Y + SS G L D+ + F S FG
Sbjct: 145 VDCT---------CDSDKNQCTYERQYAEMSSSSGVLGEDI--VSFGTESELKPQRAVFG 193
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C G L G++GLGRG++SI+ QL + G+I + C G G G + LG
Sbjct: 194 C--ENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLG 251
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYA 242
P G+ +T N+ +Y + E+ +GK+ + + DSG +YA
Sbjct: 252 AMPAP-PGMIYT--HSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYA 308
Query: 243 YFTSRVYQEIVSLIMRDLIGT---PLK--LAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
Y + + + +D + + PLK PD IC+ G + + Q++E F + +
Sbjct: 309 YLPEQAF-----VAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKVDM 363
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKN--VCLGIL-NGSEAEVGENNIIGEIFMQDKMVIY 354
F N + +L + PE YL + CLG+ NG + ++G I +++ +V Y
Sbjct: 364 VFGNGQ---KLSLSPENYLFRHSKVEGAYCLGVFQNGKDP----TTLLGGIVVRNTLVTY 416
Query: 355 DNEKQRIGWKPEDCNTL 371
D ++IG+ +C+ L
Sbjct: 417 DRHNEKIGFWKTNCSEL 433
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 103/372 (27%), Positives = 160/372 (43%), Gaps = 41/372 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C C C K + +++P + + C N
Sbjct: 87 YYTTRLFIGTPPQEFALIVDTGSTVTYVPCST-CEQCGKHQDPRFQPESSSTYKPMQC-N 144
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
P C C QC YE Y + SS G L D+ L F N S FG
Sbjct: 145 PSC---------NCDDEGKQCTYERRYAEMSSSSGLLAEDV--LSFGNESELTPQRAIFG 193
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR--GVLFLG 188
C + G L G++GLGRG +S+V QL ++ N C G G + LG
Sbjct: 194 CETVE--TGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGAMVLG 251
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYA 242
+ P V SA +Y + EL +GK L + DSG +YA
Sbjct: 252 NIPPPPDMVFAHSDPYRSA---YYNIELKELHVAGKRLKLNPRVFDGKHGTVLDSGTTYA 308
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNR 302
Y + I++++ PD IC+ G + + Q+++ F + + F N
Sbjct: 309 YLPEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVNMVFGNG 368
Query: 303 RNSVRLVVPPEAYLVISGRKN--VCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
+ +L + PE YL + + CLGI NG + ++G I +++ +V YD +
Sbjct: 369 Q---KLSLSPENYLFRHTKVSGAYCLGIFQNGKDP----TTLLGGIVVRNTLVTYDRDND 421
Query: 360 RIGWKPEDCNTL 371
+IG+ +C+ L
Sbjct: 422 KIGFWKTNCSEL 433
>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 525
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 104/387 (26%), Positives = 160/387 (41%), Gaps = 66/387 (17%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPE-KQYKPH-------KNIVP 68
+ +G P F DTGSDL W+ C+ AP + +K P Q P+ V
Sbjct: 115 IDIGTPNVQFLVVLDTGSDLLWIPCECESCAPLSAESKDPRTSQLNPYTPSLSSTAKPVL 174
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTD-LFPLRFSNGSVFNVP 126
CS+P C C P DQC YEI Y +S GAL D ++ +R S G+ +P
Sbjct: 175 CSDPLCEM-----SSTCMAPTDQCPYEINYVSANTSTSGALYEDYMYFMRESGGNPVKLP 229
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
+ GCG Q L G++GLG IS+ ++L G + + CI G G L
Sbjct: 230 VYLGCGKVQTG-SLLKGAAPNGLMGLGTTDISVPNKLASTGQLADSFSLCISPGGSGTLT 288
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTS 246
GD + TP++ S + + + + G + L +FD+G S+ Y +
Sbjct: 289 FGDEGPAAQRT--TPIIPKSVSMLDTYIVEIDSITVGNTNLLMASHALFDTGTSFTYLSK 346
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSV 306
VY + V A D + W P F L + +
Sbjct: 347 TVYPQFVQ-------------AYDAQMSLPKWNDP---------RFSKWDLCYQTSNTNF 384
Query: 307 RLVVPPEAYLVISGRKNVCLGILNGSEAEVGENN-----------------IIGEIFMQD 349
++ P L +SG + L +++G ++ V +NN IIG+ FM +
Sbjct: 385 QV---PVVSLALSGGNS--LDVVSGLKSIVDDNNAMIAVCVTVMDSGAGLSIIGQNFMTN 439
Query: 350 KMVIYDNEKQRIGWKPEDCNTLLSLNH 376
+ Y+ K IGW P DC+T L+L++
Sbjct: 440 YSITYNRAKMTIGWTPSDCSTDLTLSN 466
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 104/384 (27%), Positives = 166/384 (43%), Gaps = 52/384 (13%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKN----IVPCS 70
+G P + DTGSD WV C GCT P+K Y P+ + +VPC
Sbjct: 81 IGLGPNDYYVQVDTGSDTLWVNC----VGCTTCPKKSGLGMELTLYDPNSSKTSKVVPCD 136
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP---- 126
+ C + + CK + C Y I YGDG ++ G+ + D G + VP
Sbjct: 137 DEFCTSTYDGPISGCKK-DMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTS 195
Query: 127 LTFGCGYNQHNPGPLSPP-DTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGR 182
+ FGCG Q G LS DT+ G++G G+ S++SQL G ++ V HC+ NG
Sbjct: 196 VIFGCGSKQS--GTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCLDTVNGG 253
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLT----LI 234
G+ +G+ P V TP++ A HY + ++ +G L D T I
Sbjct: 254 GIFAIGEVVQPK--VKTTPLVPRMA---HYNVVLKDIEVAGDPIQLPTDIFDSTSGRGTI 308
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG + AY +Y +++ + G L L D T C+ + + + F
Sbjct: 309 IDSGTTLAYLPVSIYDQLLEKTLAQRSGMELYLVEDQFT---CFH--YSDEKSLDDAFPT 363
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENN---IIGEIFMQDKM 351
+ +F + L P YL C+G S A+ + ++G++ + +K+
Sbjct: 364 VKFTF---EEGLTLTAYPHDYLFPFKEDMWCIG-WQKSTAQTKDGKDLILLGDLVLTNKL 419
Query: 352 VIYDNEKQRIGWKPEDCNTLLSLN 375
IYD + IGW +C++ + L
Sbjct: 420 FIYDLDNMSIGWTDYNCSSSIKLK 443
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 107/375 (28%), Positives = 161/375 (42%), Gaps = 45/375 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +++ +G PP+ F DTGSDL W QC APC C + P ++P K+ +PCS+
Sbjct: 88 YLMDVGIGSPPRYFSAMIDTGSDLIWTQC-APCLLCVEQPTPYFEPAKSTSYASLPCSSA 146
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C AL+ P C + C Y+ YGD SS G L + F +N + VP ++FGC
Sbjct: 147 MCNALY---SPLCFQ--NACVYQAFYGDSASSAGVLANETFTFG-TNSTRVAVPRVSFGC 200
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR----GVLFL 187
G N G L + +G++G GRG +S+VSQL + R L
Sbjct: 201 G--NMNAGTLF--NGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPATSRLYFGAYATL 256
Query: 188 GDGKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGK---------SCGLKDLT--LIF 235
SSG V TP + N A Y L + +G + D T +I
Sbjct: 257 NSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVII 316
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + + Y + + +G P A T C++ P VT +
Sbjct: 317 DSGTTVTFLAQPAYAMVQGAFVA-WVGLPRANATPSDTFDTCFKWPPPPRRMVT--LPEM 373
Query: 296 ALSFTNRRNSVRLVVPPEAYLVIS-GRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
L F + + +P E Y+V+ G N+CL +L + +IIG Q+ ++Y
Sbjct: 374 VLHF----DGADMELPLENYMVMDGGTGNLCLAMLPSDDG-----SIIGSFQHQNFHMLY 424
Query: 355 DNEKQRIGWKPEDCN 369
D E + + P CN
Sbjct: 425 DLENSLLSFVPAPCN 439
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 99/380 (26%), Positives = 165/380 (43%), Gaps = 45/380 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F D+GS +T+V C + C C + +++P + V C N
Sbjct: 87 YYTTRLYIGTPPQEFALIVDSGSTVTYVPCSS-CEQCGNHQDPRFQPDLSSSYSPVKC-N 144
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
C C QC YE +Y + SS G L D+ + F S FG
Sbjct: 145 VDCT---------CDSDKKQCTYERQYAEMSSSSGVLGEDI--VSFGRESELKPQHAIFG 193
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C ++ G L G++GLGRG++SI+ QL E G+I + C G G G + LG
Sbjct: 194 CENSE--TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLG 251
Query: 189 DGKVPSSGVAWTPMLQNSADLK--HYILGPAELLYSGKSCGLKDLTL------IFDSGAS 240
P + NS L+ +Y + E+ +GK+ ++ + DSG +
Sbjct: 252 GMLAPPDMI-----FSNSDPLRSPYYNIELKEIHVAGKALRVESRIFNSKHGTVLDSGTT 306
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
YAY + + + + PD IC+ G + + ++ E F + + F
Sbjct: 307 YAYLPEQAFVAFKEAVTSKVHSLKKIRGPDPSYKDICFAGAGRNVSKLHEVFPDVDMVFG 366
Query: 301 NRRNSVRLVVPPEAYLVISGRKN--VCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNE 357
N + +L + PE YL + + CLG+ NG + ++G I +++ +V YD
Sbjct: 367 NGQ---KLSLTPENYLFRHSKVDGAYCLGVFQNGKDP----TTLLGGIIVRNTLVTYDRH 419
Query: 358 KQRIGWKPEDCNTLLSLNHF 377
++IG+ +C+ L H
Sbjct: 420 NEKIGFWKTNCSELWERLHI 439
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 107/375 (28%), Positives = 161/375 (42%), Gaps = 45/375 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +++ +G PP+ F DTGSDL W QC APC C + P ++P K+ +PCS+
Sbjct: 85 YLMDVGIGSPPRYFSAMIDTGSDLIWTQC-APCLLCVEQPTPYFEPAKSTSYASLPCSSA 143
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C AL+ P C + C Y+ YGD SS G L + F +N + VP ++FGC
Sbjct: 144 MCNALY---SPLCFQ--NACVYQAFYGDSASSAGVLANETFTFG-TNSTRVAVPRVSFGC 197
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR----GVLFL 187
G N G L + +G++G GRG +S+VSQL + R L
Sbjct: 198 G--NMNAGTLF--NGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPATSRLYFGAYATL 253
Query: 188 GDGKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGK---------SCGLKDLT--LIF 235
SSG V TP + N A Y L + +G + D T +I
Sbjct: 254 NSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVII 313
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + + Y + + +G P A T C++ P VT +
Sbjct: 314 DSGTTVTFLAQPAYAMVQGAFVA-WVGLPRANATPSDTFDTCFKWPPPPRRMVT--LPEM 370
Query: 296 ALSFTNRRNSVRLVVPPEAYLVIS-GRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
L F + + +P E Y+V+ G N+CL +L + +IIG Q+ ++Y
Sbjct: 371 VLHF----DGADMELPLENYMVMDGGTGNLCLAMLPSDDG-----SIIGSFQHQNFHMLY 421
Query: 355 DNEKQRIGWKPEDCN 369
D E + + P CN
Sbjct: 422 DLENSLLSFVPAPCN 436
>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 154/388 (39%), Gaps = 35/388 (9%)
Query: 4 SWIEFFFFPIFSYFA--VNLTVGKPPKLFDFDFDTGSDLTWVQCD-APCTGCTKPPEKQ- 59
S + + +F Y N++VG P F DTGS+L W+ CD + C + P
Sbjct: 47 SCVSLYSNGLFGYILHYANVSVGTPSVSFLVALDTGSNLLWLPCDCSSCVHSLRSPSGTV 106
Query: 60 ----YKPHKNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVT 110
Y P+ + VPC++ C+ RC C Y++ Y +G S+ G +V
Sbjct: 107 DLNIYSPNTSSTSEKVPCNSTLCSQTQRD---RCPSDQSNCPYQVVYLSNGTSTTGYIVQ 163
Query: 111 DLFPL--RFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGL 168
DL L S + +TFGCG Q L+ G+ GLG IS+ S L G
Sbjct: 164 DLLHLISDDSQSKAVDAKITFGCGKVQTG-SFLTGGAPNGLFGLGMSNISVPSTLAHNGY 222
Query: 169 IRNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL 228
C NG G + GD S+G T Q Y + + G++ L
Sbjct: 223 TSGSFSMCFSPNGIGRISFGDKG--STGQGETSFNQGQPRSSLYNISITQTSIGGQASDL 280
Query: 229 KDLTLIFDSGASYAYFTSRVY-------QEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP 281
+ IFDSG S+ Y Y ++V R P D ++ P
Sbjct: 281 V-YSAIFDSGTSFTYLNDPAYTLIAESFNKLVKETRRSSTQVPFDYCYDIRSFISAQILP 339
Query: 282 FK-ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENN 340
F A TE P + + + P + G CLG++ + G+ N
Sbjct: 340 FSCAYANQTEPTIPAVTLVMSGGDYFNVTDPIVLVQLADGSAVYCLGMI-----KSGDVN 394
Query: 341 IIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
IIG+ FM +++D E+ +GWKP +C
Sbjct: 395 IIGQNFMTGHRIVFDRERMILGWKPSNC 422
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 109/364 (29%), Positives = 151/364 (41%), Gaps = 37/364 (10%)
Query: 24 GKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAA--- 76
G P DTGSDLTWVQC PC+ C + + P + V C+ CAA
Sbjct: 197 GSPAANLTVIVDTGSDLTWVQCK-PCSACYAQRDPLFDPAGSATYAAVRCNASACAASLK 255
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
P C N++C Y + YGDG S G L TD L ++ F FGCG +
Sbjct: 256 AATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGASLDGF----VFGCGLS-- 309
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQ--LREYGLIRNVIGHCIGQNGRGVLFLGDGKVP- 193
N G TAG++GLGR +S+VSQ LR G+ + + G L LG
Sbjct: 310 NRGLFG--GTAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSLGGDASSY 367
Query: 194 --SSGVAWTPMLQNSADLKHYILGPAELLYSGKSC---GLKDLTLIFDSGASYAYFTSRV 248
++ VA+T M+ + A Y L G + GL ++ DSG V
Sbjct: 368 RNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSV 427
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRL 308
Y+ + + R AP L C+ L E PL T R
Sbjct: 428 YRGVRAEFTRQFAAAGYPTAPGFSILDTCYD-----LTGHDEVKVPL---LTLRLEGGAE 479
Query: 309 VVPPEAYLVISGRKN---VCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKP 365
V A ++ RK+ VCL + + S + + IIG ++K V+YD R+G+
Sbjct: 480 VTVDAAGMLFVVRKDGSQVCLAMASLSYED--QTPIIGNYQQKNKRVVYDTVGSRLGFAD 537
Query: 366 EDCN 369
EDCN
Sbjct: 538 EDCN 541
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 159/382 (41%), Gaps = 53/382 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V+ ++G P + F DTGSDL +VQC APC C + Y+P + VPC +
Sbjct: 34 YFVDFSLGTPEQKFHLIVDTGSDLAFVQC-APCDLCYEQDGPLYQPSNSSTFTPVPCDSA 92
Query: 73 RCAALHWPNPPRCKH------PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
C + P C P C YE YGD S++G + + G +
Sbjct: 93 ECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETATV----GGIRVNH 148
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-----NG 181
+ FGCG N G GVLGLG+G +S SQ N +C+ +
Sbjct: 149 VAFGCG--NRNQGSFV--SAGGVLGLGQGALSFTSQAGY--AFENKFAYCLTSYLSPTSV 202
Query: 182 RGVLFLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-------- 232
L GD + + + +TP++ N + Y + + + G++ + D
Sbjct: 203 FSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKIDSVGN 262
Query: 233 --LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD-KTLPICWRGPFKALGQVT 289
IFDSG + Y++ + Y I++ + + P AP + LP+C V+
Sbjct: 263 GGTIFDSGTTVTYWSPQAYARIIAAFEKSV---PYPRAPPSPQGLPLCVN--------VS 311
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQ 348
P+ SFT + P + I N+ CL +L S N+IG I Q
Sbjct: 312 GIDHPIYPSFTIEFDQGATYRPNQGNYFIEVSPNIDCLAMLESSSDGF---NVIGNIIQQ 368
Query: 349 DKMVIYDNEKQRIGWKPEDCNT 370
+ +V YD E+ RIG+ +C+
Sbjct: 369 NYLVQYDREEHRIGFAHANCDA 390
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 100/378 (26%), Positives = 160/378 (42%), Gaps = 41/378 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G P + F D+GS +T+V C A C C + +++P + V C N
Sbjct: 90 YYTTRLYIGTPSQEFALIVDSGSTVTYVPC-ATCEQCGNHQDPRFQPDLSSTYSPVKC-N 147
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
C C + QC YE +Y + SS G L D+ + F S FG
Sbjct: 148 VDCT---------CDNERSQCTYERQYAEMSSSSGVLGEDI--MSFGKESELKPQRAVFG 196
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C + G L G++GLGRG++SI+ QL E G+I + C G G G + LG
Sbjct: 197 CENTE--TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLG 254
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYA 242
P V N +Y + E+ +GK+ L + DSG +YA
Sbjct: 255 GMPAPPDMVFSH---SNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYA 311
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNR 302
Y + + + + PD IC+ G + + Q++E F + + F N
Sbjct: 312 YLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNG 371
Query: 303 RNSVRLVVPPEAYLVISGRKN--VCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
+ +L + PE YL + CLG+ NG + ++G I +++ +V YD +
Sbjct: 372 Q---KLSLSPENYLFRHSKVEGAYCLGVFQNGKDP----TTLLGGIVVRNTLVTYDRHNE 424
Query: 360 RIGWKPEDCNTLLSLNHF 377
+IG+ +C+ L H
Sbjct: 425 KIGFWKTNCSELWERLHI 442
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 102/432 (23%), Positives = 162/432 (37%), Gaps = 89/432 (20%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
YF + +G P K F DTGSD+ W+ C+ C C K + +
Sbjct: 71 YFT-KVKMGSPAKEFYVQIDTGSDILWLNCNT-CNNCPKSSGLGIDLNYFDTASSSTAAL 128
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG-SVFN- 124
V CS+P C+ +C +QC Y +YGDG + G V D G SVF+
Sbjct: 129 VSCSDPVCSYAVQTATSQCSSQANQCSYTFQYGDGSGTSGYYVYDAMYFDVIMGQSVFSN 188
Query: 125 --VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
+ FGC Q + G+ G G G +S+VSQ+ G+ V HC+ G
Sbjct: 189 SSSTVVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHCLKGQGS 248
Query: 183 GVLFLGDGKVPSSGVAWTPM--LQNSADLKHYILGPAELLYSGKSCGLKDLTL------- 233
G L G++ + +TP+ LQ HY L + +G+ +
Sbjct: 249 GGGILVLGEILEPNIVYTPLVPLQ-----PHYNLNLQSIAVNGQILPIDQDVFATGNNRG 303
Query: 234 -IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAP----------------------- 269
I DSG + AY Y ++ G+P
Sbjct: 304 TIVDSGTTLAYLVQEAYDPFLN------AGSPCHFFTHFNEPTNNIKYEDGNNNHQSRVK 357
Query: 270 ----DDKTLPICWRGPFKALGQVTEYFKPLA------------------LSFTNRRNSVR 307
D+ TL + + V+++ KP+ L N
Sbjct: 358 RHYYDEVTLRLVLKHSAIITTTVSQFSKPIISKGNQCYLVPTSLGDIFPLVSLNFMGGAS 417
Query: 308 LVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
+V+ PE YL+ + G C+G + I+G++ ++DK+ +YD QRIGW
Sbjct: 418 MVLKPEQYLIHYGFLDGAAMWCIGFQKVQKGY----TILGDLVLKDKIFVYDLANQRIGW 473
Query: 364 KPEDCNTLLSLN 375
DC+ ++++
Sbjct: 474 TDYDCSLAVNVS 485
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 165/373 (44%), Gaps = 43/373 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C + + ++ P + + C N
Sbjct: 82 YYTTRLWIGTPPQQFALIVDTGSTVTYVPC-STCEQCGRHQDPKFDPESSSTYKPIKC-N 139
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP--LTF 129
C C QC YE +Y + +S G L D+ + F N S +P F
Sbjct: 140 IDCI---------CDSDGVQCVYERQYAEMSTSSGVLGEDV--ISFGNQSEL-IPQRAVF 187
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFL 187
GC G L G++GLG G +S+V QL E G I + C G G G + L
Sbjct: 188 GC--ENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVL 245
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK----SCGLKD--LTLIFDSGASY 241
G G P S + +T + +Y + E+ +GK S G+ D + DSG +Y
Sbjct: 246 G-GISPPSDMIFT--YSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTY 302
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
AY + + IM ++ PD IC+ G +++ F + + F N
Sbjct: 303 AYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFEN 362
Query: 302 RRNSVRLVVPPEAYLVISGRKN--VCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
+ +L + PE Y + + CLGI NG++ + ++G I +++ +V+YD
Sbjct: 363 GQ---KLSLTPENYFFRHSKVHGAYCLGIFENGND----QTTLLGGIVVRNTLVMYDRAN 415
Query: 359 QRIGWKPEDCNTL 371
+IG+ +C+ L
Sbjct: 416 SKIGFWKTNCSEL 428
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 165/373 (44%), Gaps = 43/373 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C + + ++ P + + C N
Sbjct: 82 YYTTRLWIGTPPQQFALIVDTGSTVTYVPC-STCEQCGRHQDPKFDPESSSTYKPIKC-N 139
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP--LTF 129
C C QC YE +Y + +S G L D+ + F N S +P F
Sbjct: 140 IDCI---------CDSDGVQCVYERQYAEMSTSSGVLGEDV--ISFGNQSEL-IPQRAVF 187
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFL 187
GC G L G++GLG G +S+V QL E G I + C G G G + L
Sbjct: 188 GC--ENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVL 245
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK----SCGLKD--LTLIFDSGASY 241
G G P S + +T + +Y + E+ +GK S G+ D + DSG +Y
Sbjct: 246 G-GISPPSDMIFT--YSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTY 302
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
AY + + IM ++ PD IC+ G +++ F + + F N
Sbjct: 303 AYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFEN 362
Query: 302 RRNSVRLVVPPEAYLVISGRKN--VCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
+ +L + PE Y + + CLGI NG++ + ++G I +++ +V+YD
Sbjct: 363 GQ---KLSLTPENYFFRHSKVHGAYCLGIFENGND----QTTLLGGIVVRNTLVMYDRAN 415
Query: 359 QRIGWKPEDCNTL 371
+IG+ +C+ L
Sbjct: 416 SKIGFWKTNCSEL 428
>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 102/385 (26%), Positives = 154/385 (40%), Gaps = 49/385 (12%)
Query: 9 FFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI-- 66
FF Y N+T+G P + F DTGSDL W+ C+ T Q + H N
Sbjct: 105 LFFNYLHY--ANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGETHMNAQR 162
Query: 67 ----------------VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGS-SIGALV 109
V C++ CA + RC P C Y I Y GS S G LV
Sbjct: 163 IRLNIYNPSISTSSSKVTCNSTLCALRN-----RCISPLSDCPYRIRYLSPGSKSTGVLV 217
Query: 110 TDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI 169
D+ + G + +TFGC Q G G++GL I++ + L + G+
Sbjct: 218 EDVIHMSTEEGEARDARITFGCSETQ--LGLFQEVAVNGIMGLAMADIAVPNMLVKAGVA 275
Query: 170 RNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK 229
+ C G NG+G + GD SS TP+ + L + + GK
Sbjct: 276 SDSFSMCFGPNGKGTISFGDKG--SSDQHETPLGGTISPLFYDV--SITKFKVGKVTVET 331
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK---ALG 286
+ IFDSG + + Y + T L+ D+ LP F+ +
Sbjct: 332 KFSAIFDSGTAVTWLLDPYYTALT---------TNFHLSVPDRRLPANVDSTFEFCYIIT 382
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVIS-GRKNV-CLGILNGSEAEVGENNIIGE 344
++ K ++SF + + V P S G V CL +L +A+ NIIG+
Sbjct: 383 STSDEEKLPSISFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQDKADF---NIIGQ 439
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCN 369
FM + +++D E+ +GWK +CN
Sbjct: 440 NFMTNYRIVHDRERMILGWKKSNCN 464
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 164/382 (42%), Gaps = 42/382 (10%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VP 68
I Y+ L +G PP++F D+GS +T+V C + C C K + +++P + V
Sbjct: 89 INGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQDPKFQPEMSSTYQPVK 147
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPL 127
C N C C +QC YE EY + SS G L DL + F N S
Sbjct: 148 C-NMDC---------NCDDDREQCVYEREYAEHSSSKGVLGEDL--ISFGNESQLTPQRA 195
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVL 185
FGC G L G++GLG+G +S+V QL + GLI N G C G G G +
Sbjct: 196 VFGC--ETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSM 253
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGA 239
LG PS V S +Y + + +GK L + DSG
Sbjct: 254 ILGGFDYPSDMVFTDSDPDRSP---YYNIDLTGIRVAGKQLSLHSRVFDGEHGAVLDSGT 310
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR-GPFKALGQVTEYFKPLALS 298
+YAY + +MR++ PD C++ + ++++ F + +
Sbjct: 311 TYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKIFPSVEMV 370
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKN--VCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYD 355
F ++ ++ PE Y+ + + CLG+ NG + ++G I +++ +V+YD
Sbjct: 371 F---KSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKD----HTTLLGGIVVRNTLVVYD 423
Query: 356 NEKQRIGWKPEDCNTLLSLNHF 377
E ++G+ +C+ L H
Sbjct: 424 RENSKVGFWRTNCSELSDRLHI 445
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 163/388 (42%), Gaps = 50/388 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK---------PPEKQYKPHKNI 66
Y+A + +G P K + DTGSD+ WV C C C + P + + +
Sbjct: 87 YYA-KIGIGTPSKDYYVQVDTGSDIVWVNC-IQCRECPRTSSLGMELTPYDLEESTTGKL 144
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
V C C ++ C N C Y YGDG S+ G V D +G +
Sbjct: 145 VSCDEQFCLEVNGGPLSGCT-TNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTA 203
Query: 123 FNVPLTFGCGYNQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
N + FGCG Q + G G+LG G+ SI+SQL ++ + HC+ G N
Sbjct: 204 ANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTN 263
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNS---------ADLKHYILG-PAELLYSGKSCGLKD 230
G G+ +G P V TP++ N + H IL A++ +G G
Sbjct: 264 GGGIFAMGHVVQPK--VNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRKGT-- 319
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG + AY +Y+ +V+ I+ ++ + F+ +V +
Sbjct: 320 ---IIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYKC-------FQYSERVDD 369
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNI--IGEIFM 347
F P+ F NS+ L V P YL +N+ C+G N N+ G++ +
Sbjct: 370 GFPPVIFHF---ENSLLLKVYPHEYLF--QYENLWCIGWQNSGMQSRDRKNVTLFGDLVL 424
Query: 348 QDKMVIYDNEKQRIGWKPEDCNTLLSLN 375
+K+V+YD E Q IGW +C++ + +
Sbjct: 425 SNKLVLYDLENQTIGWTEYNCSSSIKVQ 452
>gi|356546446|ref|XP_003541637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 160
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 52/99 (52%), Positives = 71/99 (71%), Gaps = 3/99 (3%)
Query: 273 TLPICWRGP--FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILN 330
+LPICW+ FK+L VT FKP+AL FT +NS+ L + PE+YL+++ VCLGIL+
Sbjct: 58 SLPICWKDTKTFKSLHDVTSNFKPIALRFTKSKNSL-LQLQPESYLIVTKHGKVCLGILD 116
Query: 331 GSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
G+E +G NIIG+I QDK+VIYDNEK +IGW +C+
Sbjct: 117 GTEIGLGNTNIIGDISFQDKLVIYDNEKHQIGWASANCD 155
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 100/377 (26%), Positives = 158/377 (41%), Gaps = 53/377 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +G P ++F DTGSDLTWVQC +PC C + + P+ + + C +
Sbjct: 13 YLATVRLGTPERVFSVIVDTGSDLTWVQC-SPCGKCYSQNDALFLPNTSTSFTKLACGSA 71
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C L + P C C Y YGDG + G V D + NG VP FGC
Sbjct: 72 LCNGLPF---PMCNQTT--CVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPNFAFGC 126
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-----NGRGVLF 186
G++ N G + D G+LGLG+G +S SQL+ + +C+ L
Sbjct: 127 GHD--NEGSFAGAD--GILGLGQGPLSFHSQLKS--VYNGKFSYCLVDWLAPPTQTSPLL 180
Query: 187 LGDGKVPS-SGVAWTPMLQNSADLKHY------------ILGPAELLYSGKSCGLKDLTL 233
GD VP V + P+L N +Y +L + ++ S G
Sbjct: 181 FGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVG--GAGT 238
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG-PFKALGQVTEYF 292
IFDSG + Y+E+++ + + K+ D L +C G P L
Sbjct: 239 IFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKI-DDISRLDLCLSGFPKDQL------- 290
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
P + T +V+PP Y + + ++ C + + + NIIG + Q+
Sbjct: 291 -PTVPAMTFHFEGGDMVLPPSNYFIYLESSQSYCFAMTSSPDV-----NIIGSVQQQNFQ 344
Query: 352 VIYDNEKQRIGWKPEDC 368
V YD +++G+ P+DC
Sbjct: 345 VYYDTAGRKLGFVPKDC 361
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 108/376 (28%), Positives = 159/376 (42%), Gaps = 51/376 (13%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC------TKPPE--KQYKPHKN 65
F Y+A +TVG P + DTGSDL W+ CD C C T+ P Y P+ +
Sbjct: 128 FLYYA-EVTVGTPGVPYLVALDTGSDLFWLPCD--CVNCITGLNTTQGPVNFNIYSPNNS 184
Query: 66 I----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSN- 119
V CS+ C+ L +C P+D C Y++ Y D SS G LV D+ L ++
Sbjct: 185 STSKEVQCSSSLCSHLD-----QCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDV 239
Query: 120 -GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
N +T GCG +Q LS G+ GLG +S+ S L GLI N C G
Sbjct: 240 QSKPVNARITLGCGKDQSG-AFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFG 298
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKH--YILGPAELLYSGKSCGLKDLTLIFD 236
G + GD P G TP + +H Y + ++ G L D+ +IFD
Sbjct: 299 PARMGRIEFGDKGSP--GQNETPF---NLGRRHPTYNVSITQIGVGGHISDL-DVAVIFD 352
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV----TEYF 292
SG S+ Y Y L ++K + PF+ ++ T +
Sbjct: 353 SGTSFTYLNDPAYS---------LFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFT 403
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
PL ++ T + ++ P + ++ CL I A NIIG+ FM +
Sbjct: 404 YPL-MNLTMKGGGHFVINHPIVLISTESKRLFCLAI-----ARSDSINIIGQNFMTGYHI 457
Query: 353 IYDNEKQRIGWKPEDC 368
++D EK +GWK +C
Sbjct: 458 VFDREKMVLGWKESNC 473
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 100/382 (26%), Positives = 162/382 (42%), Gaps = 39/382 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR-- 73
Y+ L +G P + F D+GS +T+V PC C + Q + NI+ +PR
Sbjct: 91 YYTTRLYIGTPSQEFALIVDSGSTVTYV----PCATCEQCGNHQSE-SPNIIEAHDPRFQ 145
Query: 74 --CAALHWPNPPR----CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VP 126
++ + P C + QC YE +Y + SS G L D+ + F S
Sbjct: 146 PDLSSTYSPVKCNVDCTCDNERSQCTYERQYAEMSSSSGVLGEDI--MSFGKESELKPQR 203
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGV 184
FGC + G L G++GLGRG++SI+ QL E G+I + C G G G
Sbjct: 204 AVFGCENTE--TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT 261
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSG 238
+ LG P V N +Y + E+ +GK+ L + DSG
Sbjct: 262 MVLGGMPAPPDMVFSH---SNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSG 318
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+YAY + + + + PD IC+ G + + Q++E F + +
Sbjct: 319 TTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMV 378
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKN--VCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYD 355
F N + +L + PE YL + CLG+ NG + ++G I +++ +V YD
Sbjct: 379 FGNGQ---KLSLSPENYLFRHSKVEGAYCLGVFQNGKDP----TTLLGGIVVRNTLVTYD 431
Query: 356 NEKQRIGWKPEDCNTLLSLNHF 377
++IG+ +C+ L H
Sbjct: 432 RHNEKIGFWKTNCSELWERLHI 453
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 166/386 (43%), Gaps = 64/386 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ ++L VG PP+ DTGSDL W QCD CT C + P+ + P + + C+
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDT-CTACLRQPDPLFSPRMSSSYEPMRCAGQ 156
Query: 73 RCA-ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C LH C P D C Y YGDG +++G T+ F S+G +VPL FGC
Sbjct: 157 LCGDILHHS----CVRP-DTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGC 211
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G N G L+ + +G++G GR +S+VSQL IR +C+ + + L G
Sbjct: 212 G--TMNVGSLN--NASGIVGFGRDPLSLVSQLS----IRR-FSYCLTPYASSRKSTLQFG 262
Query: 189 ---------DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------ 233
D P V TP+LQ++ + Y + ++G + G + L +
Sbjct: 263 SLADVGLYDDATGP---VQTTPILQSAQNPTFYYVA-----FTGVTVGARRLRIPASAFA 314
Query: 234 ---------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLK--LAPDDKTLPICWRGPF 282
I DSG + F + V E+V R + P +PDD +C+ P
Sbjct: 315 LRPDGSGGVIIDSGTALTLFPAAVLAEVVR-AFRSQLRLPFANGSSPDDG---VCFAAPA 370
Query: 283 KALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNII 342
A G + L +P E Y++ R+ L +L G + G I
Sbjct: 371 VAAGGGRMARQVAVPRMVFHFQGADLDLPRENYVLEDHRRGH-LCVLLGDSGDDGAT--I 427
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDC 368
G QD V+YD E++ + + P +C
Sbjct: 428 GNFVQQDMRVVYDLERETLSFAPVEC 453
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 100/382 (26%), Positives = 162/382 (42%), Gaps = 39/382 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR-- 73
Y+ L +G P + F D+GS +T+V PC C + Q + NI+ +PR
Sbjct: 90 YYTTRLYIGTPSQEFALIVDSGSTVTYV----PCATCEQCGNHQSE-SPNIIEAHDPRFQ 144
Query: 74 --CAALHWPNPPR----CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VP 126
++ + P C + QC YE +Y + SS G L D+ + F S
Sbjct: 145 PDLSSTYSPVKCNVDCTCDNERSQCTYERQYAEMSSSSGVLGEDI--MSFGKESELKPQR 202
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGV 184
FGC + G L G++GLGRG++SI+ QL E G+I + C G G G
Sbjct: 203 AVFGCENTE--TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT 260
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSG 238
+ LG P V N +Y + E+ +GK+ L + DSG
Sbjct: 261 MVLGGMPAPPDMVFSH---SNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSG 317
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+YAY + + + + PD IC+ G + + Q++E F + +
Sbjct: 318 TTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMV 377
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKN--VCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYD 355
F N + +L + PE YL + CLG+ NG + ++G I +++ +V YD
Sbjct: 378 FGNGQ---KLSLSPENYLFRHSKVEGAYCLGVFQNGKDP----TTLLGGIVVRNTLVTYD 430
Query: 356 NEKQRIGWKPEDCNTLLSLNHF 377
++IG+ +C+ L H
Sbjct: 431 RHNEKIGFWKTNCSELWERLHI 452
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 108/376 (28%), Positives = 159/376 (42%), Gaps = 51/376 (13%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC------TKPPE--KQYKPHKN 65
F Y+A +TVG P + DTGSDL W+ CD C C T+ P Y P+ +
Sbjct: 105 FLYYA-EVTVGTPGVPYLVALDTGSDLFWLPCD--CVNCITGLNTTQGPVNFNIYSPNNS 161
Query: 66 I----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSN- 119
V CS+ C+ L +C P+D C Y++ Y D SS G LV D+ L ++
Sbjct: 162 STSKEVQCSSSLCSHLD-----QCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDV 216
Query: 120 -GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
N +T GCG +Q LS G+ GLG +S+ S L GLI N C G
Sbjct: 217 QSKPVNARITLGCGKDQSG-AFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFG 275
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKH--YILGPAELLYSGKSCGLKDLTLIFD 236
G + GD P G TP + +H Y + ++ G L D+ +IFD
Sbjct: 276 PARMGRIEFGDKGSP--GQNETPF---NLGRRHPTYNVSITQIGVGGHISDL-DVAVIFD 329
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV----TEYF 292
SG S+ Y Y L ++K + PF+ ++ T +
Sbjct: 330 SGTSFTYLNDPAYS---------LFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFT 380
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
PL ++ T + ++ P + ++ CL I A NIIG+ FM +
Sbjct: 381 YPL-MNLTMKGGGHFVINHPIVLISTESKRLFCLAI-----ARSDSINIIGQNFMTGYHI 434
Query: 353 IYDNEKQRIGWKPEDC 368
++D EK +GWK +C
Sbjct: 435 VFDREKMVLGWKESNC 450
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 106/381 (27%), Positives = 159/381 (41%), Gaps = 36/381 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT-KPPEKQYKPHKNI----VPCS 70
YF V++ +G PP+ DTGSDLTWV+C A T C+ PP + + C
Sbjct: 83 YF-VSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTHCF 141
Query: 71 NPRCAALHWPNPPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV-PL 127
+ C + PNP C H + C YE Y DG + G + L S+G + +
Sbjct: 142 SSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKLKSI 201
Query: 128 TFGCGYNQHNPGPLSPP--DTAGVLGLGRGRISIVSQL-REYGLIRN--VIGHCIGQNGR 182
FGCG++ P + +GV+GLGRG IS SQL R +G + ++ + +
Sbjct: 202 AFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSYCLLDYTLSPPPT 261
Query: 183 GVLFLGD----GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-------GLKDL 231
L +GD K S +++TP+L N Y + + G L +L
Sbjct: 262 SYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDPSVWSLDEL 321
Query: 232 ---TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
+ DSG + + T Y+EI+S R+ +KL P R F V
Sbjct: 322 GNGGTVIDSGTTLTFLTEPAYREILSAFKRE-----VKL-PSPTPGGASTRSGFDLCVNV 375
Query: 289 TEYFKPLALSFTNRRNSVRLVV-PPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFM 347
T +P + L PP Y + CL I EAE G ++IG +
Sbjct: 376 TGVSRPRFPRLSLELGGESLYSPPPRNYFIDISEGIKCLAI-QPVEAESGRFSVIGNLMQ 434
Query: 348 QDKMVIYDNEKQRIGWKPEDC 368
Q ++ +D K R+G+ C
Sbjct: 435 QGFLLEFDRGKSRLGFSRRGC 455
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 107/374 (28%), Positives = 158/374 (42%), Gaps = 46/374 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ VN+ +G P K FDTGSDLTW QC C + + P + + C++
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKTYSNISCTSA 213
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C++L N P C N C Y I+YGD +IG D L + VF+ FG
Sbjct: 214 ACSSLKSATGNSPGCSSSN--CVYGIQYGDSSFTIGFFAKD--KLTLTQNDVFD-GFMFG 268
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYG--------LIRNVIGHCIGQNG 181
CG Q+N G TAG++GLGR +SIV Q +++G R GH NG
Sbjct: 269 CG--QNNKGLFGK--TAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSNGHLTFGNG 324
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFD 236
GV K +G+ +TP +S +Y + + GK+ + ++ I D
Sbjct: 325 NGV---KASKAVKNGITFTP-FASSQGTAYYFIDVLGISVGGKALSISPMLFQNAGTIID 380
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
SG S Y + S + + P AP L C+ L T P
Sbjct: 381 SGTVITRLPSTAYGSLKSAFKQFMSKYP--TAPALSLLDTCYD-----LSNYTSISIP-K 432
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYD 355
+SF N + + + P L+ +G VCL NG + +G I G I Q V+YD
Sbjct: 433 ISF-NFNGNANVELDPNGILITNGASQVCLAFAGNGDDDSIG---IFGNIQQQTLEVVYD 488
Query: 356 NEKQRIGWKPEDCN 369
++G+ + C+
Sbjct: 489 VAGGQLGFGYKGCS 502
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 107/366 (29%), Positives = 153/366 (41%), Gaps = 37/366 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + F FDTGSDLTW QC+ C E + P K+ + CS+P
Sbjct: 138 YVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSP 197
Query: 73 RCAALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L N P C C Y I+YGD S+G D L ++ VFN L FG
Sbjct: 198 TCDELKSGTGNSPSCSAST--CVYGIQYGDQSYSVGFFAQD--KLALTSTDVFNNFL-FG 252
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI--GQNGRGVLFL 187
CG Q+N G AG++GLGR +S+VSQ ++YG + +C+ + G L
Sbjct: 253 CG--QNNRGLF--VGVAGLIGLGRNALSLVSQTAQKYG---KLFSYCLPSTSSSTGYLTF 305
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCG-----LKDLTLIFDSGASYA 242
G G S V +TP L NS Y L + G+ I DSG +
Sbjct: 306 GSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFSTAGTIIDSGTVIS 365
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNR 302
Y ++ + + + P K AP L C+ F V + L F+
Sbjct: 366 RLPPTAYSDLRASFQQQMSKYP-KAAP-ASILDTCYD--FSQYDTVD--VPKINLYFS-- 417
Query: 303 RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIG 362
+ + + P I VCL S+A + I+G + + V+YD RIG
Sbjct: 418 -DGAEMDLDPSGIFYILNISQVCLAFAGNSDAT--DIAILGNVQQKTFDVVYDVAGGRIG 474
Query: 363 WKPEDC 368
+ P C
Sbjct: 475 FAPGGC 480
>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 488
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 161/381 (42%), Gaps = 56/381 (14%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ---------YKPHK 64
F ++A N+T+G P + F DTGSDL W+ C+ T C + E Y P K
Sbjct: 87 FLHYA-NVTIGTPAQWFLVALDTGSDLFWLPCNCNST-CVRSMETDQGERIKLNIYNPSK 144
Query: 65 NI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGS-SIGALVTDLFPLRFSN 119
+ V C++ CA + RC P C Y I Y GS S G LV D+ +
Sbjct: 145 SKSSSKVTCNSTLCALRN-----RCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEE 199
Query: 120 GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
G + +TFGC +Q G G++GL I++ + L + G+ + C G
Sbjct: 200 GEARDARITFGCSESQL--GLFKEVAVNGIMGLAIADIAVPNMLVKAGVASDSFSMCFGP 257
Query: 180 NGRGVLFLGDG------KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL 233
NG+G + GD + P SG +PM + + K + GK + T
Sbjct: 258 NGKGTISFGDKGSSDQLETPLSGTI-SPMFYDVSITKFKV---------GKVTVDTEFTA 307
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK---ALGQVTE 290
FDSG + + Y + T L+ D+ L PF+ + ++
Sbjct: 308 TFDSGTAVTWLIEPYYTALT---------TNFHLSVPDRRLSKSVDSPFEFCYIITSTSD 358
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVIS-GRKNV-CLGILNGSEAEVGENNIIGEIFMQ 348
K ++SF + + V P S G V CL +L A+ +IIG+ FM
Sbjct: 359 EDKLPSVSFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQVNADF---SIIGQNFMT 415
Query: 349 DKMVIYDNEKQRIGWKPEDCN 369
+ +++D E++ +GWK +CN
Sbjct: 416 NYRIVHDRERRILGWKKSNCN 436
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 101/376 (26%), Positives = 160/376 (42%), Gaps = 49/376 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK------PPEKQ--YKPHKNIV 67
Y+ L +G PP++F DTGS +T+V C + C C + PE Y+P K +
Sbjct: 111 YYTTRLWIGTPPQMFALIVDTGSTVTYVPC-STCEQCGRHQDPKFQPESSSTYQPVKCTI 169
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VP 126
C+ C QC YE +Y + +S G L D+ + F N S
Sbjct: 170 DCN--------------CDGDRMQCVYERQYAEMSTSSGVLGEDV--ISFGNQSELAPQR 213
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGV 184
FGC G L G++GLGRG +SI+ QL + +I + C G G G
Sbjct: 214 AVFGC--ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGA 271
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSG 238
+ LG PS T + +Y + E+ +GK L + DSG
Sbjct: 272 MVLGGISPPSD---MTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVFDGKHGTVLDSG 328
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+YAY + I+++L PD IC+ G + Q+++ F + +
Sbjct: 329 TTYAYLPEAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPVVDMV 388
Query: 299 FTNRRNSVRLVVPPEAYLVISG--RKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYD 355
F N + + PE Y+ R CLGI NG++ + ++G I +++ +V+YD
Sbjct: 389 FGNGH---KYSLSPENYMFRHSKVRGAYCLGIFQNGND----QTTLLGGIIVRNTLVMYD 441
Query: 356 NEKQRIGWKPEDCNTL 371
E+ +IG+ +C L
Sbjct: 442 REQTKIGFWKTNCAEL 457
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 103/384 (26%), Positives = 168/384 (43%), Gaps = 46/384 (11%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VP 68
I Y+ L +G PP++F D+GS +T+V C + C C K + +++P + V
Sbjct: 90 INGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQDPKFQPELSSTYQPVK 148
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPL 127
C N C C +QC YE EY + SS G L DL + F N S
Sbjct: 149 C-NMDC---------NCDDDKEQCVYEREYAEHSSSKGVLGEDL--ISFGNESQLTPQRA 196
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVL 185
FGC + G L G++GLG+G +S+V QL + GLI N G C G G G +
Sbjct: 197 VFGCETVE--TGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSM 254
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGA 239
LG PS + S +Y + + +GK L + DSG
Sbjct: 255 ILGGFDYPSDMIFTDSDPDRSP---YYNIDLTGIRVAGKKLSLNSRVFDGEHGAVLDSGT 311
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLPICWR-GPFKALGQVTEYFKPLA 296
+YAY + +MR++ +PLK PD C+ + ++++ F +
Sbjct: 312 TYAYLPDAAFAAFEEAVMREV--SPLKQIDGPDPNFKDTCFLVAASNDVSELSKIFPSVE 369
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKN--VCLGIL-NGSEAEVGENNIIGEIFMQDKMVI 353
+ F ++ ++ PE Y+ + + CLG+ NG + ++G I +++ +V+
Sbjct: 370 MIF---KSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKD----HTTLLGGIVVRNTLVV 422
Query: 354 YDNEKQRIGWKPEDCNTLLSLNHF 377
YD E ++G+ +C+ L H
Sbjct: 423 YDRENSKVGFWRTNCSELSDRLHI 446
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 116/389 (29%), Positives = 173/389 (44%), Gaps = 65/389 (16%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCS 70
YFAV + VG PP DTGSDL W+QC PC C + Y P H+ I PC+
Sbjct: 92 YFAV-IGVGDPPTHALVVIDTGSDLIWLQC-LPCRRCYRQVTPLYDPRNSKTHRRI-PCA 148
Query: 71 NPRC-AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+P+C L +P C C Y + YGDG +S G L TD L + V NV T
Sbjct: 149 SPQCRGVLRYPG---CDARTGGCVYMVVYGDGSASSGDLATDTLVLP-DDTRVHNV--TL 202
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCIG------QNGR 182
GCG++ N G L+ AG+LG GRG++S +QL YG +V +C+G +N
Sbjct: 203 GCGHD--NEGLLA--SAAGLLGAGRGQLSFPTQLAPAYG---HVFSYCLGDRMSRARNSS 255
Query: 183 GVLFLGDG-KVPSSGVAWTPMLQNS-------ADLKHYILGPAELL-YSGKSCGLKDLT- 232
L G ++PS+ A+TP+ N D+ + +G + +S S L T
Sbjct: 256 SYLVFGRTPELPST--AFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATG 313
Query: 233 ---LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPL-----KLAPDDKTLPICWRGPFKA 284
++ DSG + + FT Y + + + K + D + GP
Sbjct: 314 RGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNGPGTG 373
Query: 285 LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYL--VISG--RKNVCLGILNGSEAEVGENN 340
+ + L F + + +P YL V+ G R CLG+ +A N
Sbjct: 374 V-----RVPSIVLHFA---AAADMALPQANYLIPVVGGDRRTYFCLGL----QAADDGLN 421
Query: 341 IIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
++G + Q V++D E+ RIG+ P C+
Sbjct: 422 VLGNVQQQGFGVVFDVERGRIGFTPNGCS 450
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 101/379 (26%), Positives = 151/379 (39%), Gaps = 41/379 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y+ + +G PP F DTGS +T+V PC+ CT Q + + C +PR
Sbjct: 39 YYTSRVFIGTPPNEFALIVDTGSTVTYV----PCSSCTHCGHHQASFSTHRLFCRDPRFK 94
Query: 76 ALHWPNPPR------------CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ + + C + QC YE Y + +S G L DL L F S
Sbjct: 95 PENSSSYQKIGCRSSDCITGLCDSNSHQCKYERMYAEMSTSKGVLGKDL--LDFGPASRL 152
Query: 124 NVPL-TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QN 180
L +FGC G L G++GLGRG +SIV QL G I + C G
Sbjct: 153 QSQLLSFGC--ETAESGDLYLQVADGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDE 210
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD------LTLI 234
G G + LG PS V + S +Y L E+ G S L I
Sbjct: 211 GGGSMVLGAIPAPSGMVFAKSDPRRS---NYYNLELTEIQVQGASLKLDSNVFNGKFGTI 267
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG +YAY R ++ ++ L PD IC+ G ++ ++F
Sbjct: 268 LDSGTTYAYLPDRAFEAFTDAVVAQLGSLQAVDGPDPNYPDICYAGAGTDTKELGKHFPL 327
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISGR--KNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
+ F + ++ + PE YL + CLG +A ++G I +++ +V
Sbjct: 328 VDFVFAENQ---KVSLAPENYLFKHTKVPGAYCLGFFKNQDA----TTLLGGIIVRNMLV 380
Query: 353 IYDNEKQRIGWKPEDCNTL 371
YD +IG+ +C L
Sbjct: 381 TYDRYNHQIGFLKTNCTEL 399
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 106/390 (27%), Positives = 153/390 (39%), Gaps = 47/390 (12%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVP 68
+ + + V+L VG PP+ DTGSDL W QC APC C P + +P
Sbjct: 88 VTNEYLVHLAVGTPPRPVALTLDTGSDLVWTQC-APCRDCFHQGLPLLDPAASSTYAALP 146
Query: 69 CSNPRCAALHWPN-----PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-- 121
C PRC AL + + + N C Y YGD ++G + TD F NG
Sbjct: 147 CGAPRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGD 206
Query: 122 --VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--- 176
+ LTFGCG+ N G +T G+ G GRGR S+ SQL +C
Sbjct: 207 SRLPTRRLTFGCGH--FNKGVFQSNET-GIAGFGRGRWSLPSQLNV-----TTFSYCFTS 258
Query: 177 ----------IGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC 226
+G L S V TP+L+N + Y L +
Sbjct: 259 MFESKSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRL 318
Query: 227 GLKDLTL---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK 283
+ + L I DSGAS VY E V +G P + L +C+ P
Sbjct: 319 AVPEAKLRSTIIDSGASITTLPEAVY-EAVKAEFAAQVGLPPTGVVEGSALDLCFALPVT 377
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIG 343
AL + +P S T + +P Y+ V +L +A G+ +IG
Sbjct: 378 AL-----WRRPPVPSLTLHLDGADWELPRGNYVFEDLAARVMCVVL---DAAPGDQTVIG 429
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDCNTLLS 373
Q+ V+YD E + + P C++L++
Sbjct: 430 NFQQQNTHVVYDLENDWLSFAPARCDSLVA 459
>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
Length = 523
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 113/379 (29%), Positives = 154/379 (40%), Gaps = 56/379 (14%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC--------------TKPPEKQ 59
F ++AV + +G P F DTGSDL WV CD C C T P+K
Sbjct: 102 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CINCAPLVSPNYRDLKFDTYSPQKS 158
Query: 60 YKPHKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPL--R 116
K VPCS+ C P Y IEY D SS G LV D+ L
Sbjct: 159 STSRK--VPCSSNLCDLQSACRSASSSCP-----YSIEYLSDNTSSTGVLVEDVLYLITE 211
Query: 117 FSNGSVFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIG 174
+ + P+TFGCG Q G +P G+LGLG IS+ S L G+ N
Sbjct: 212 YGQPKIVTAPITFGCGRIQTGSFLGSAAP---NGLLGLGMDSISVPSLLASEGVAANSFS 268
Query: 175 HCIGQNGRGVLFLGDGKVPSSGVAWTPM---LQNSADLKHYILGPAELLYSGKSCGLKDL 231
C G +GRG + GD SS TP+ QN +Y + + KS +
Sbjct: 269 MCFGDDGRGRINFGD--TGSSDQQETPLNIYKQN----PYYNISITGAMVGSKSFN-TNF 321
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
I DSG S+ + +Y EI S + P +L D +LP + G V
Sbjct: 322 NAIVDSGTSFTALSDPMYSEITSSFNSQVQDKPTQL---DSSLPFEFCYSISPKGSV--- 375
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLV--ISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
P +S + S+ V P + S CL ++ N+IGE FM
Sbjct: 376 -NPPNISLMAKGGSIFPVNDPIITITDDASNPMAYCLAVMKSEGV-----NLIGENFMSG 429
Query: 350 KMVIYDNEKQRIGWKPEDC 368
V++D E++ +GWK +C
Sbjct: 430 LKVVFDRERKVLGWKKFNC 448
>gi|357461293|ref|XP_003600928.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355489976|gb|AES71179.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 295
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 108/369 (29%), Positives = 152/369 (41%), Gaps = 104/369 (28%)
Query: 8 FFFFP----IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH 63
FF+ P I + V+L +G P + FD DTGSDLTW K YK H
Sbjct: 5 FFYDPLKISIVGGYTVSLKIGYPGQSFDVFIDTGSDLTW------------DKYKLYKLH 52
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
N V Y DG + G LV D PL S+ ++
Sbjct: 53 NNFVYVRIKLAI----------------------YVDGLQTKGFLVQDNIPLESSDRTLQ 90
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGR 182
T P P+S G+LGLG G SI+SQL+ GLI+NV+GHC G+ G+
Sbjct: 91 RPKCTNILKVTDKKPKPIS----KGILGLGHGETSILSQLKSKGLIKNVVGHCFSGKEGQ 146
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
G G+ K+ G Y PA L++ K +KDL LIFDSG + +
Sbjct: 147 G----GNTKIDLEG--------------RYFSEPANLIFDEKLTFIKDLQLIFDSGTTLS 188
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNR 302
F S+ ++ +V P+++ +Y KP+ + F+N
Sbjct: 189 AFNSKDHKVLVD--------------PENEV--------------SKDYLKPIIMRFSNN 220
Query: 303 RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIF-MQDKMVIYDNEKQRI 361
LV E Y++IS C S E+ F M +K+ I+DNE++RI
Sbjct: 221 VQCQLLV---EDYIIIS-----C-----SSFRELWHKVWNWLAFSMTNKLKIFDNEEKRI 267
Query: 362 GWKPE-DCN 369
GW DC+
Sbjct: 268 GWVDHVDCD 276
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 165/386 (42%), Gaps = 64/386 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ ++L VG PP+ DTGSDL W QCD CT C + P+ + P + + C+
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDT-CTACLRQPDPLFSPRMSSSYEPMRCAGQ 156
Query: 73 RCA-ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C LH C P D C Y YGDG +++G T+ F S+G +VPL FGC
Sbjct: 157 LCGDILHHS----CVRP-DTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGC 211
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G N G L+ + +G++G GR +S+VSQL IR +C+ + + L G
Sbjct: 212 G--TMNVGSLN--NASGIVGFGRDPLSLVSQLS----IRR-FSYCLTPYASSRKSTLQFG 262
Query: 189 ---------DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------ 233
D P V TP+LQ++ + Y + ++G + G + L +
Sbjct: 263 SLADVGLYDDATGP---VQTTPILQSAQNPTFYYVA-----FTGVTVGARRLRIPASAFA 314
Query: 234 ---------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLK--LAPDDKTLPICWRGPF 282
I DSG + F V E+V R + P +PDD +C+ P
Sbjct: 315 LRPDGSGGVIIDSGTALTLFPVAVLAEVVR-AFRSQLRLPFANGSSPDDG---VCFAAPA 370
Query: 283 KALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNII 342
A G + L +P E Y++ R+ L +L G + G I
Sbjct: 371 VAAGGGRMARQVAVPRMVFHFQGADLDLPRENYVLEDHRRGH-LCVLLGDSGDDGAT--I 427
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDC 368
G QD V+YD E++ + + P +C
Sbjct: 428 GNFVQQDMRVVYDLERETLSFAPVEC 453
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 106/383 (27%), Positives = 157/383 (40%), Gaps = 54/383 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT--GCTKPPEKQYKPHK----NIVPCS 70
+ V++ +G P + FDTGSDL+WVQC PC+ GC + + P + V C
Sbjct: 85 YVVSVGLGTPARDLTVVFDTGSDLSWVQC-GPCSSGGCYHQQDPLFAPSSSSTFSAVRCG 143
Query: 71 NPRCAALHWPNPPRCKHP------NDQCDYEIEYGDGGSSIGALVTDLFPLRF---SNGS 121
P C PR + +D+C YE+ YGD ++G L D L +N S
Sbjct: 144 EPEC--------PRARQSCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNAS 195
Query: 122 VFN---VP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHC 176
N +P FGCG N N G D G+ GLGRG++S+ SQ +YG +C
Sbjct: 196 ENNSNKLPGFVFGCGEN--NTGLFGKAD--GLFGLGRGKVSLSSQAAGKYG---EGFSYC 248
Query: 177 I---GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD--- 230
+ N G L LG + +TPML S Y + + +G++ +
Sbjct: 249 LPSSSSNAHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPA 308
Query: 231 ---LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQ 287
LI DSG R Y + + + + K AP L C+ F A
Sbjct: 309 LWPAGLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTCYD--FTAHAN 366
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIF 346
T +AL F + V L ++ CL NG+ G I+G
Sbjct: 367 ATVSIPAVALVFA---GGATISVDFSGVLYVAKVAQACLAFAPNGNGRSAG---ILGNTQ 420
Query: 347 MQDKMVIYDNEKQRIGWKPEDCN 369
+ V+YD +Q+IG+ + C+
Sbjct: 421 QRTVAVVYDVGRQKIGFAAKGCS 443
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 107/374 (28%), Positives = 157/374 (41%), Gaps = 46/374 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ VN+ +G P K FDTGSDLTW QC C + + P + + C++
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKTYSNISCTST 213
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C+ L N P C N C Y I+YGD ++G D L + VF+ FG
Sbjct: 214 ACSGLKSATGNSPGCSSSN--CVYGIQYGDSSFTVGFFAKD--TLTLTQNDVFD-GFMFG 268
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYG--------LIRNVIGHCIGQNG 181
CG Q+N G TAG++GLGR +SIV Q +++G R GH NG
Sbjct: 269 CG--QNNRGLFG--KTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSNGHLTFGNG 324
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFD 236
GV K +G+ +TP +S Y + + GK+ + ++ I D
Sbjct: 325 NGV---KTSKAVKNGITFTP-FASSQGATFYFIDVLGISVGGKALSISPMLFQNAGTIID 380
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
SG S VY + S + + P AP L C+ L T P
Sbjct: 381 SGTVITRLPSTVYGSLKSTFKQFMSKYP--TAPALSLLDTCYD-----LSNYTSISIP-K 432
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYD 355
+SF N + + + P L+ +G VCL NG + +G I G I Q V+YD
Sbjct: 433 ISF-NFNGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIG---IFGNIQQQTLEVVYD 488
Query: 356 NEKQRIGWKPEDCN 369
++G+ + C+
Sbjct: 489 VAGGQLGFGYKGCS 502
>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
Length = 520
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 104/367 (28%), Positives = 152/367 (41%), Gaps = 41/367 (11%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCD----APCT----------GCTKPPEKQYKPHKNIVP 68
VG P F DTGSDL WV CD AP + G KP E H +P
Sbjct: 108 VGTPNTSFLVALDTGSDLFWVPCDCIQCAPLSSYHGSLDRDLGIYKPSESTTSRH---LP 164
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSV-FNVP 126
CS+ C+ C +P C Y I+Y + +S G L+ D+ L G N
Sbjct: 165 CSHELCSPASG-----CTNPKQPCPYNIDYFSENTTSSGLLIEDMLHLDSREGHAPVNAS 219
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
+ GCG Q L G+LGLG IS+ S L GL+RN C ++ G +F
Sbjct: 220 VIIGCGKKQSG-SYLEGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIF 278
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTS 246
GD VP+ TP + + L+ Y + + K + D+G S+
Sbjct: 279 FGDQGVPTQ--QSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGAGFQALVDTGTSFTSLPL 336
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR-GPFKALGQVTEYFKPLALSFTNRRNS 305
Y+ I + + + + + DD + C+ GP + T + L+F + S
Sbjct: 337 DAYKSITMEFDKQINAS--RASSDDYSFEYCYSTGPLEMPDVPT-----ITLTFAENK-S 388
Query: 306 VRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
+ V P + G V CL +L E VG IIG+ FM V++D E ++GW
Sbjct: 389 FQAVNPILPFNDRQGEFAVFCLAVLPSPEP-VG---IIGQNFMVGYHVVFDRENMKLGWY 444
Query: 365 PEDCNTL 371
+C+ L
Sbjct: 445 RSECHDL 451
>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
Length = 520
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 104/369 (28%), Positives = 153/369 (41%), Gaps = 41/369 (11%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCT----------GCTKPPEKQYKPHKNI 66
+ VG P F DTGSDL WV CD AP + G KP E H
Sbjct: 106 VDVGTPNTSFLVALDTGSDLFWVPCDCIQCAPLSSYHGSLDRDLGIYKPSESTTSRH--- 162
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSV-FN 124
+PCS+ C+ C +P C Y I+Y + +S G L+ D+ L G N
Sbjct: 163 LPCSHELCSPASG-----CTNPKQPCPYNIDYFSENTTSSGLLIEDMLHLDSREGHAPVN 217
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV 184
+ GCG Q L G+LGLG IS+ S L GL+RN C ++ G
Sbjct: 218 ASVIIGCGKKQSG-SYLEGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGR 276
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
+F GD VP+ TP + + L+ Y + + K + D+G S+
Sbjct: 277 IFFGDQGVPTQ--QSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGAGFQALVDTGTSFTSL 334
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR-GPFKALGQVTEYFKPLALSFTNRR 303
Y+ I + + + + + DD + C+ GP + T + L+F +
Sbjct: 335 PLDAYKSITMEFDKQINAS--RASSDDYSFEYCYSTGPLEMPDVPT-----ITLTFAENK 387
Query: 304 NSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIG 362
S + V P + G V CL +L E VG IIG+ FM V++D E ++G
Sbjct: 388 -SFQAVNPILPFNDRQGEFAVFCLAVLPSPEP-VG---IIGQNFMVGYHVVFDRENMKLG 442
Query: 363 WKPEDCNTL 371
W +C+ L
Sbjct: 443 WYRSECHDL 451
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 161/368 (43%), Gaps = 33/368 (8%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y+ L +G PP++F DTGS +T+V C + C C + + +++P ++ P
Sbjct: 80 YYTTRLWIGTPPQMFALIVDTGSTVTYVPC-STCEQCGRHQDPKFQP--DLSSTYQPVKC 136
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFGCGYN 134
L C + QC YE +Y + +S G L D+ + F N S FGC
Sbjct: 137 TLDC----NCDNDRMQCVYERQYAEMSTSSGVLGEDV--VSFGNQSELAPQRAVFGC--E 188
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLGDGKV 192
G L G++GLGRG +SI+ QL + ++ + C G G G + LG G
Sbjct: 189 NVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLG-GIS 247
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYAYFTS 246
P S + + + +Y + E+ +GK L + DSG +YAY
Sbjct: 248 PPSDMVFAQ--SDPVRSPYYNIDLKEIHVAGKRLPLNPSVFDGKHGSVLDSGTTYAYLPE 305
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSV 306
+ I+++L PD +C+ G + Q+++ F + + F N
Sbjct: 306 EAFLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFPVVDMIFGNGH--- 362
Query: 307 RLVVPPEAYLVISG--RKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
+ + PE Y+ R CLGI NG + ++G I +++ +V+YD E+ +IG+
Sbjct: 363 KYSLSPENYMFRHSKVRGAYCLGIFQNGKDP----TTLLGGIVVRNTLVLYDREQTKIGF 418
Query: 364 KPEDCNTL 371
+C L
Sbjct: 419 WKTNCAEL 426
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 165/376 (43%), Gaps = 39/376 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNP-RC 74
Y+ L +G PP+ F D+GS +T+V C A C C + +++P ++ +P +C
Sbjct: 84 YYTTRLYIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQP--DLSSTYSPVKC 140
Query: 75 AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFGCGY 133
+A C QC YE +Y + SS G L D+ + F S FGC
Sbjct: 141 SA-----DCTCDSDKSQCTYERQYAEMSSSSGVLGEDI--VSFGTESELKPQRAVFGC-- 191
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLGDGK 191
G L G++GLGRG++SI+ QL + G+I + C G G G + LG
Sbjct: 192 ENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMP 251
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYAYFT 245
P V + +Y + E+ +GK+ L + DSG +YAY
Sbjct: 252 APPDMVFSR---SDPVRSPYYNIELKEIHVAGKALRLDPRIFDSKHGTVLDSGTTYAYLP 308
Query: 246 SRVYQEIVSLIMRDLIGTPLK--LAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
+ + + + PLK PD IC+ G + + Q+++ F + + F + +
Sbjct: 309 EQAFVAFKDAVTSKV--RPLKKIRGPDPNYKDICFAGAGRNVSQLSQAFPDVDMVFGDGQ 366
Query: 304 NSVRLVVPPEAYLVISGRKN--VCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
+L + PE YL + CLG+ NG + ++G I +++ +V YD ++
Sbjct: 367 ---KLSLSPENYLFRHSKVEGAYCLGVFQNGKDP----TTLLGGIVVRNTLVTYDRHNEK 419
Query: 361 IGWKPEDCNTLLSLNH 376
IG+ +C+ L H
Sbjct: 420 IGFWKTNCSELWERLH 435
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 109/377 (28%), Positives = 168/377 (44%), Gaps = 59/377 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F + L +G P + + DTGSDL W QC PC C P + P K+ +PCS+
Sbjct: 97 FLMKLAIGTPAETYSAIMDTGSDLIWTQCK-PCKDCFDQPTPIFDPKKSSSFSKLPCSSD 155
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
CAAL + C +D C+Y YGD S+ G L T+ F F + SV + FGCG
Sbjct: 156 LCAALPISS---C---SDGCEYLYSYGDYSSTQGVLATETFA--FGDASVSKI--GFGCG 205
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGV--LFLG 188
+ G AG++GLGRG +S++SQL E +C+ + +G+ L +G
Sbjct: 206 EDNDGSG---FSQGAGLVGLGRGPLSLISQLGE-----PKFSYCLTSMDDSKGISSLLVG 257
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDSG 238
+ + TP++QN + Y L + ++ T LI DSG
Sbjct: 258 SEATMKNAIT-TPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSG 316
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK---TLPICWRGPFKALGQVTEYFKPL 295
+ Y + + + ++ I + LKL D+ L +C+ P A T L
Sbjct: 317 TTITYLEDSAF----AALKKEFI-SQLKLDVDESGSTGLDLCFTLPPDA---STVDVPQL 368
Query: 296 ALSFTNRRNSVRLVVPPEAYLVI-SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
F L +P E Y++ SG +CL + GS + + +I G Q+ +V++
Sbjct: 369 VFHF----EGADLKLPAENYIIADSGLGVICLTM--GSSSGM---SIFGNFQQQNIVVLH 419
Query: 355 DNEKQRIGWKPEDCNTL 371
D EK+ I + P CN L
Sbjct: 420 DLEKETISFAPAQCNQL 436
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 157/376 (41%), Gaps = 60/376 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
+ +NL++G P + F DTGSDL W QC PCT C + P + +PCS+
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPCSSQ 153
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL P C N+ C Y YGDG + G++ T+ L F + S+ N+ TFGCG
Sbjct: 154 LCQALQ---SPTCS--NNSCQYTYGYGDGSETQGSMGTE--TLTFGSVSIPNI--TFGCG 204
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLGD 189
N G + AG++G+GRG +S+ SQL +C IG + L LG
Sbjct: 205 ENNQGFG---QGNGAGLVGMGRGPLSLPSQLD-----VTKFSYCMTPIGSSNSSTLLLGS 256
Query: 190 -GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL----------------T 232
++G T ++Q+S Y + +G S G L
Sbjct: 257 LANSVTAGSPNTTLIQSSQIPTFYY-----ITLNGLSVGSTPLPIDPSVFKLNSNNGTGG 311
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
+I DSG + YF YQ + + + + + +C++ P ++
Sbjct: 312 IIIDSGTTLTYFVDNAYQAVRQAFISQMNLSVVN--GSSSGFDLCFQMP-------SDQS 362
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
+F + LV+P E Y + +CL + + S+ +I G I Q+ +V
Sbjct: 363 NLQIPTFVMHFDGGDLVLPSENYFISPSNGLICLAMGSSSQGM----SIFGNIQQQNLLV 418
Query: 353 IYDNEKQRIGWKPEDC 368
+YD + + C
Sbjct: 419 VYDTGNSVVSFLSAQC 434
>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
Length = 492
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 99/363 (27%), Positives = 142/363 (39%), Gaps = 32/363 (8%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR------- 73
+ +G P F D+GSDL WV CD C C Y + +P
Sbjct: 102 IDIGTPHVSFMVALDSGSDLFWVPCD--CVQCAPLSASHYSSLDRDLSEYSPSQSSTSKQ 159
Query: 74 --CAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSVFNV----P 126
C+ P CK+P C Y I Y + SS G LV D+ L N P
Sbjct: 160 LSCSHRLCDMGPNCKNPKQSCPYSINYYTESTSSSGLLVEDIIHLASGGDDTLNTSVKAP 219
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
+ GCG Q G L G+LGLG IS+ S L + GLI+N C ++ G +F
Sbjct: 220 VIIGCGMKQSG-GYLDGVAPDGLLGLGLQEISVPSFLAKAGLIQNSFSMCFNEDDSGRIF 278
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYFT 245
GD + A P L+ + + YI+G E+ G SC + + DSG S+ +
Sbjct: 279 FGDQGPATQQSA--PFLKLNGNYTTYIVG-VEVCCVGTSCLKQSSFSALVDSGTSFTFLP 335
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
V++ I + + W+ +K Q L L F + NS
Sbjct: 336 DDVFEMIAEEFDTQVNASRSSFE------GYSWKYCYKTSSQDLPKIPSLRLIFP-QNNS 388
Query: 306 VRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKP 365
+ P I G CL I + G+ IG+ FM V++D E ++GW
Sbjct: 389 FMVQNPVFMIYGIQGVIGFCLAI----QPADGDIGTIGQNFMMGYRVVFDRENLKLGWSR 444
Query: 366 EDC 368
+C
Sbjct: 445 SNC 447
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 114/394 (28%), Positives = 157/394 (39%), Gaps = 49/394 (12%)
Query: 3 VSWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP 62
S I + ++ + G P DTGSDLTWVQC PC+ C + + P
Sbjct: 134 TSGIRLQTLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCK-PCSACYAQRDPLFDP 192
Query: 63 HKNI----VPCSNPRCA---ALHWPNPPRCKHP---NDQCDYEIEYGDGGSSIGALVTDL 112
+ V C+ CA P C +++C Y + YGDG S G L TD
Sbjct: 193 AGSATYAAVRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDT 252
Query: 113 FPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRN 171
L G FGCG + N G TAG++GLGR +S+VSQ YG
Sbjct: 253 VAL----GGASLGGFVFGCGLS--NRGLFG--GTAGLMGLGRTELSLVSQTASRYG---G 301
Query: 172 VIGHCI----GQNGRGVLFLGDGKVPSSG------VAWTPMLQNSADLKHYILGPAELLY 221
V +C+ + G L LG G +S VA+T M+ + A Y L
Sbjct: 302 VFSYCLPAATSGDASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAV 361
Query: 222 SGKSC---GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
G + GL ++ DSG VY+ + + MR AP L C+
Sbjct: 362 GGTALAAQGLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTCY 421
Query: 279 RGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN---VCLGILNGSEAE 335
L E PL T R V A ++ RK+ VCL + + S +
Sbjct: 422 D-----LTGHDEVKVPL---LTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYED 473
Query: 336 VGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
E IIG ++K V+YD R+G+ EDCN
Sbjct: 474 --ETPIIGNYQQKNKRVVYDTLGSRLGFADEDCN 505
>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 518
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/369 (28%), Positives = 147/369 (39%), Gaps = 45/369 (12%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----V 67
+ +G P F DTGSDL WV CD AP G + + Y P ++ V
Sbjct: 103 TTVELGTPGMKFMVALDTGSDLFWVPCDCSKCAPTQGVAYASDFELSIYDPKQSSTSKKV 162
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRF--SNGSVFN 124
C+N CA + RC C Y + Y +S G LV D+ L SN
Sbjct: 163 TCNNNLCAHRN-----RCLGTFSSCPYMVSYVSAQTSTSGILVEDVLHLTSEDSNQESIK 217
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV 184
+TFGCG Q L+ G+ GLG +IS+ S L GL + C G +G G
Sbjct: 218 AYVTFGCGQVQSG-SFLNTAAPNGLFGLGMDQISVPSILSREGLTADSFSMCFGHDGVGR 276
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
+ GD P TP N + + I + G + D T +FDSG S+ Y
Sbjct: 277 ISFGDKGSPDQ--EETPFNSNPSHPSYNI--SVTQVRVGTTLVDVDFTALFDSGTSFTYL 332
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK-----ALGQVTEYFKPLALSF 299
+ +Y ++ DK P R PF+ + G + ++L+
Sbjct: 333 INPIYA---------MVSENFHAQAQDKRRPPDPRIPFEYCYDMSPGANSSLIPSMSLTM 383
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
R + V P + CL I+ +E NIIG+ FM V++D EK
Sbjct: 384 KGRGHFT--VFDPIIVITTQNELVYCLAIVKSTEL-----NIIGQNFMTGYRVVFDREKL 436
Query: 360 RIGWKPEDC 368
+GWK DC
Sbjct: 437 VLGWKETDC 445
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 152/365 (41%), Gaps = 38/365 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P ++ + C+ P
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANISCAAP 239
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L + C N C Y ++YGDG SIG D L S ++ F G
Sbjct: 240 ACSDL---DTRGCSGGN--CLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 289
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
+ N G + AG+LGLGRG+ S+ V +YG V HC+ +G G L G
Sbjct: 290 CGERNEGLFG--EAAGLLGLGRGKTSLPVQTYDKYG---GVFAHCLPARSSGTGYLDFGP 344
Query: 190 GKVPSSGVAW-TPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAY 243
G ++G TPML ++ +Y+ G + G+ + I DSG
Sbjct: 345 GSPAAAGARLTTPMLTDNGPTFYYV-GMTGIRVGGQLLSIPQSVFTTAGTIVDSGTVITR 403
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
Y + S + K AP L C+ F + QV ++L F +
Sbjct: 404 LPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCY--DFTGMSQVA--IPTVSLLF---Q 456
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
RL V + + VCLG + + G+ I+G ++ V YD K+ +G+
Sbjct: 457 GGARLDVDASGIMYAASVSQVCLGF--AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 514
Query: 364 KPEDC 368
P C
Sbjct: 515 SPGAC 519
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 102/369 (27%), Positives = 156/369 (42%), Gaps = 40/369 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
F +NL +G PP+ + DTGSDL W QC PCT C P + P K+ +
Sbjct: 100 FLMNLAIGTPPETYSAIMDTGSDLIWTQCK-PCTQCFDQPSPIFDPKKSSSFSKLSCSSQ 158
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
L P +D C+Y YGD S+ G + T+ F F S+ NV FGCG +
Sbjct: 159 LCKALPQ--SSCSDSCEYLYTYGDYSSTQGTMATETF--TFGKVSIPNVG--FGCGEDNE 212
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV---P 193
G +G++GLGRG +S+VSQL+E + I L +G
Sbjct: 213 GDG---FTQGSGLVGLGRGPLSLVSQLKEAKFSYCLTS--IDDTKTSTLLMGSLASVNGT 267
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDSGASYAY 243
S+ + TP++QN Y L + G +K+ T LI DSG + Y
Sbjct: 268 SAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITY 327
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP-LALSFTNR 302
+ ++V +G P+ L +C+ P +E P L L FT
Sbjct: 328 LEESAF-DLVKKEFTSQMGLPVD-NSGATGLELCYNLP----SDTSELEVPKLVLHFTG- 380
Query: 303 RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIG 362
L +P E Y++ + +G++ + G +I G + Q+ V +D EK+ +
Sbjct: 381 ---ADLELPGENYMI----ADSSMGVICLAMGSSGGMSIFGNVQQQNMFVSHDLEKETLS 433
Query: 363 WKPEDCNTL 371
+ P +C L
Sbjct: 434 FLPTNCGQL 442
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 110/383 (28%), Positives = 155/383 (40%), Gaps = 51/383 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V L VG P + DTGSDL W QC APC C P + +PC
Sbjct: 84 YLVRLAVGTPRRPVALTLDTGSDLVWTQC-APCRDCFDQDLPVLDPAASSTYAALPCGAA 142
Query: 73 RCAALHWPN-PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG---SVFNVPLT 128
RC AL + + R + C Y YGD ++G + TD F S G S+ LT
Sbjct: 143 RCRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRRLT 202
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG---QNGRGVL 185
FGCG+ N G +T G+ G GRGR S+ SQL +C ++ ++
Sbjct: 203 FGCGH--LNKGVFQSNET-GIAGFGRGRWSLPSQLNV-----TSFSYCFTSMFESKSSLV 254
Query: 186 FLGD------GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL-------- 231
LG S V TP+L+N + Y L G S G L
Sbjct: 255 TLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLS-----LKGISVGKTRLPVPETKFR 309
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
+ I DSGAS VY E V +G P + L +C+ P AL +
Sbjct: 310 STIIDSGASITTLPEEVY-EAVKAEFAAQVGLPPS-GVEGSALDLCFALPVTAL-----W 362
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVIS-GRKNVCLGILNGSEAEVGENNIIGEIFMQDK 350
+P S T +P Y+ G + +C+ + +A GE +IG Q+
Sbjct: 363 RRPAVPSLTLHLEGADWELPRSNYVFEDLGARVMCIVL----DAAPGEQTVIGNFQQQNT 418
Query: 351 MVIYDNEKQRIGWKPEDCNTLLS 373
V+YD E R+ + P C+ L++
Sbjct: 419 HVVYDLENDRLSFAPARCDRLVA 441
>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
Length = 829
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/376 (26%), Positives = 155/376 (41%), Gaps = 51/376 (13%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK------------QYK 61
F +FA N++VG PP F DTGSDL W+ C+ CT C + E +
Sbjct: 100 FLHFA-NVSVGTPPLSFLVALDTGSDLFWLPCN--CTKCVRGVESNGEKIAFNIYDLKGS 156
Query: 62 PHKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNG 120
V C++ C +C + C YE+ Y +G S+ G LV D+ L +
Sbjct: 157 STSQTVLCNSNLCELQR-----QCPSSDSICPYEVNYLSNGTSTTGFLVEDVLHLITDDD 211
Query: 121 SV--FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
+ +TFGCG Q L G+ GLG G S+ S L + GL N C G
Sbjct: 212 ETKDADTRITFGCGQVQ-TGAFLDGAAPNGLFGLGMGNESVPSILAKEGLTSNSFSMCFG 270
Query: 179 QNGRGVLFLGD------GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT 232
+G G + GD GK P + A P Y + +++ G + L +
Sbjct: 271 SDGLGRITFGDNSSLVQGKTPFNLRALHPT---------YNITVTQIIVGGNAADL-EFH 320
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
IFDSG S+ + Y++I + + + D+ LP + + V
Sbjct: 321 AIFDSGTSFTHLNDPAYKQITNSFNSAIKLQRYSSSSSDE-LPFEYCYDLSSNKTV---- 375
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
L ++ T + LV P + G +CLG+L + NIIG+ FM +
Sbjct: 376 -ELPINLTMKGGDNYLVTDPIVTISGEGVNLLCLGVLKSNNV-----NIIGQNFMTGYRI 429
Query: 353 IYDNEKQRIGWKPEDC 368
++D E +GW+ +C
Sbjct: 430 VFDRENMILGWRESNC 445
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/363 (28%), Positives = 155/363 (42%), Gaps = 46/363 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
+ + V++ +G PP DTGSDL W QCDAPC C P Y P ++ V C
Sbjct: 90 ATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCR 149
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+P C AL P RC P+ C Y YGDG S+ G L T+ F L S+ +V V FG
Sbjct: 150 SPMCQALQSPW-SRCSPPDTGCAYYFSYGDGTSTDGVLATETFTL-GSDTAVRGV--AFG 205
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG N G S +++G++G+GRG +S+VSQL G+ R C +
Sbjct: 206 CG--TENLG--STDNSSGLVGMGRGPLSLVSQL---GVTRPRR-SCRARAAARGGGAPTT 257
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
P G+ L D + L P + D +I DSG ++ R +
Sbjct: 258 TSPLEGITVGDTLL-PIDPAVFRLTP-----------MGDGGVIIDSGTTFTALEERAFV 305
Query: 251 EIVSLIMRDLIGTPLKLAPDDKT-LPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLV 309
+ + + L LA L +C F A L L F +R
Sbjct: 306 ALARALASRV---RLPLASGAHLGLSLC----FAAASPEAVEVPRLVLHFDGADMELRR- 357
Query: 310 VPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
E+Y+V V CLG+++ +++G + Q+ ++YD E+ + ++P C
Sbjct: 358 ---ESYVVEDRSAGVACLGMVSARGM-----SVLGSMQQQNTHILYDLERGILSFEPAKC 409
Query: 369 NTL 371
L
Sbjct: 410 GEL 412
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/371 (27%), Positives = 161/371 (43%), Gaps = 32/371 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT-GCTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P + F FDTGSDLTWVQC PCT C + E + P K+ VPC
Sbjct: 126 YVVTIGIGTPARNFTVLFDTGSDLTWVQCK-PCTDSCYQQQEPLFDPSKSSTYVDVPCGT 184
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P+C + C C+Y ++YGD + G L + F L S V FGC
Sbjct: 185 PQC-KIGGGQDLTCG--GTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAAGV--VFGC 239
Query: 132 G--YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR--GVLFL 187
Y+ G AG+LGLGRG SI+SQ R G +V +C+ G G L +
Sbjct: 240 SHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRR-GNSGDVFSYCLPPRGSSAGYLTI 298
Query: 188 GDGKVPSSGVAWTPMLQNSADLKH-YILGPAELLYSGKSCGLKD----LTLIFDSGASYA 242
G P S +++TP++ +++ L Y++ + SG + + + + DSG
Sbjct: 299 GAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYIGTVIDSGTVIT 358
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNR 302
+ + Y + R + G + ++L C+ G P+AL F
Sbjct: 359 HMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCY----DVTGHDVVTAPPVALEFG-- 412
Query: 303 RNSVRLVVPPEAYLVI----SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
R+ V L++ + +++ L L + IIG + + V++D E
Sbjct: 413 -GGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAYNVVFDVEG 471
Query: 359 QRIGWKPEDCN 369
+RIG+ C+
Sbjct: 472 RRIGFGANGCS 482
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/390 (26%), Positives = 166/390 (42%), Gaps = 62/390 (15%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT-------------KPPEKQYK 61
+Y+A + VG P + + DTGSD+ W +C C GC+ + P Y
Sbjct: 87 TYYA-QIGVGHPVQFLNAIVDTGSDILWFKCKL-CQGCSSKKNVIVCSSIIMQGPITLYD 144
Query: 62 PHKNIVP----CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRF 117
P +I CS+P C+ C+ N+ C Y+I Y D SS G D+ L
Sbjct: 145 PELSITASPATCSDPLCS-----EGGSCRGNNNSCAYDISYEDTSSSTGIYFRDVVHL-- 197
Query: 118 SNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
+ + N + GC + P+ G++G GR ++S+ +QL N+ HC+
Sbjct: 198 GHKASLNTTMFLGCATSISGLWPVD-----GIMGFGRSKVSVPNQLAAQAGSYNIFYHCL 252
Query: 178 G--QNGRGVLFLG-DGKVPSSGVAWTPMLQN-----------SADLKHYILGPAELLYSG 223
+ G G+L LG + + P + +TPML N S + K + +E Y+
Sbjct: 253 SGEKEGGGILVLGKNDEFPE--MVYTPMLANDIVYNVKLVSLSVNSKALPIEASEFEYNA 310
Query: 224 KSCGLKDLTLIFDSGASYAYFTSR---VYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG 280
+ + I DSG S A F S+ ++ + VS + PL+ + + I R
Sbjct: 311 T---VGNGGTIIDSGTSSATFPSKALALFVKAVSKFTTAIPTAPLESSGSPCFISISDRN 367
Query: 281 PFKA-LGQVTEYFKPLALSFTNRRNSVRLVVPPE--AYLVISGRKNVCLGILNGSEAEVG 337
+ VT F A N + VV + G + VC+ VG
Sbjct: 368 SVEVDFPNVTLKFDGGATMELTAHNYLEAVVSRKLSESTHFQGVRLVCI------SWSVG 421
Query: 338 ENNIIGEIFMQDKMVIYDNEKQRIGWKPED 367
+ I+G+ ++DK+V+YD EK RIGW +D
Sbjct: 422 NSTILGDAILKDKVVVYDMEKSRIGWVKQD 451
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 108/377 (28%), Positives = 159/377 (42%), Gaps = 53/377 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ + L +G PP + DTGSDL W QC PCT C K P + P + V C +
Sbjct: 108 YLIELAIGTPPVSYPAVLDTGSDLIWTQC-KPCTRCYKQPTPIFDPKKSSSFSKVSCGSS 166
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+AL C +D C+Y YGD + G L T+ F S V + FGCG
Sbjct: 167 LCSALPSST---C---SDGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCG 220
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLGD 189
+ G +G++GLGRG +S+VSQL+E +C I VL LG
Sbjct: 221 EDNEGDG---FEQASGLVGLGRGPLSLVSQLKE-----QRFSYCLTPIDDTKESVLLLGS 272
Query: 190 -GKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDS 237
GKV + V TP+L+N Y L + ++ T +I DS
Sbjct: 273 LGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDS 332
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQVTEYFKP 294
G + Y + Y+ + ++ I + KLA D + L +C+ P G
Sbjct: 333 GTTITYVQQKAYEA----LKKEFI-SQTKLALDKTSSTGLDLCFSLPS---GSTQVEIPK 384
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
L F L +P E Y++ G N LG+ + +I G + Q+ +V +
Sbjct: 385 LVFHFKGG----DLELPAENYMI--GDSN--LGVACLAMGASSGMSIFGNVQQQNILVNH 436
Query: 355 DNEKQRIGWKPEDCNTL 371
D EK+ I + P C+ L
Sbjct: 437 DLEKETISFVPTSCDQL 453
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 90/314 (28%), Positives = 137/314 (43%), Gaps = 34/314 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH--KNIVPCS-NP 72
Y+ + +G PP+ F DTGS +T+V C + C C + + +++P P S N
Sbjct: 89 YYTTRIWIGTPPQTFALIVDTGSTVTYVPC-STCEQCGRHQDPKFEPELSSTYQPVSCNI 147
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP--LTFG 130
C C + QC YE +Y + SS G L D+ + F N S VP FG
Sbjct: 148 DCT---------CDNERKQCVYERQYAEMSSSSGVLGEDI--ISFGNQSEL-VPQRAIFG 195
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C G L G++GLGRG +SIV QL E G+I + C G G G + LG
Sbjct: 196 C--ENQETGDLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGGGAMILG 253
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYA 242
G P SG+ + + ++Y + + +GK L + DSG +YA
Sbjct: 254 -GISPPSGMVFAE--SDPVRSQYYNIDLKAIHVAGKQLHLDPSIFDGKHGTVLDSGTTYA 310
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNR 302
Y + +M++L PD IC+ G + Q++ F + + F+N
Sbjct: 311 YLPEAAFTAFKDAMMKELTSLKQIHGPDPNYNDICFSGAESDVSQLSNTFPAVEMVFSNG 370
Query: 303 RNSVRLVVPPEAYL 316
+ +L + PE YL
Sbjct: 371 Q---KLSLSPENYL 381
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 109/385 (28%), Positives = 165/385 (42%), Gaps = 63/385 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V+L +G PP+ DTGSDL W QC APC C P+ + P ++ + C+
Sbjct: 102 YVVDLAIGTPPQPVSALLDTGSDLIWTQC-APCASCLAQPDPLFAPGESASYEPMRCAGQ 160
Query: 73 RCA-ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFG 130
C+ LH C+ P D C Y YGDG ++G T+ F S G + VPL FG
Sbjct: 161 LCSDILHHG----CEMP-DTCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMTVPLGFG 215
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG----VLF 186
CG N G L+ + +G++G GR +S+VSQL IR +C+ G G +LF
Sbjct: 216 CG--SMNVGSLN--NGSGIVGFGRNPLSLVSQLS----IRR-FSYCLTSYGSGRKSTLLF 266
Query: 187 -------LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT------- 232
GD P V TP+LQ+ + Y + A L + + +
Sbjct: 267 GSLSGGVYGDATGP---VQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDG 323
Query: 233 ---LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA--PDDKT---LPICWRGPFKA 284
+I DSG + V E+V R + P P+D +P WR +
Sbjct: 324 SGGVIVDSGTALTLLPGAVLAEVVR-AFRQQLRLPFANGGNPEDGVCFLVPAAWRRS-SS 381
Query: 285 LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRK-NVCLGILNGSEAEVGENNIIG 343
QV + F + L +P Y++ RK +CL + + + + + IG
Sbjct: 382 TSQVP--VPRMVFHFQD----ADLDLPRRNYVLDDHRKGRLCLLLADSGD----DGSTIG 431
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDC 368
+ QD V+YD E + + + P C
Sbjct: 432 NLVQQDMRVLYDLEAETLSFAPAQC 456
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 167/376 (44%), Gaps = 39/376 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C + +++P + V C+
Sbjct: 92 YYTARLWIGTPPQRFALIVDTGSTVTYVPC-STCRHCGSHQDPKFRPEDSETYQPVKCTW 150
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
+C C + QC YE Y + +S GAL D+ + F N + + FG
Sbjct: 151 -QC---------NCDNDRKQCTYERRYAEMSTSSGALGEDV--VSFGNQTELSPQRAIFG 198
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
C ++ G + G++GLGRG +SI+ QL E +I + C G G G + G
Sbjct: 199 CENDE--TGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVLG 256
Query: 191 KV-PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYAY 243
+ P + + +T + +Y + E+ +GK L + DSG +YAY
Sbjct: 257 GISPPADMVFT--RSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAY 314
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
+ IM++ PD + IC+ G + Q+++ F + + F N
Sbjct: 315 LPESAFLAFKHAIMKETHSLKRISGPDPRYNDICFSGAEIDVSQISKSFPVVEMVFGNGH 374
Query: 304 NSVRLVVPPEAYLVISG--RKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
+L + PE YL R CLG+ NG++ ++G I +++ +V+YD E +
Sbjct: 375 ---KLSLSPENYLFRHSKVRGAYCLGVFSNGNDP----TTLLGGIVVRNTLVMYDREHTK 427
Query: 361 IGWKPEDCNTLLSLNH 376
IG+ +C+ L H
Sbjct: 428 IGFWKTNCSELWERLH 443
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 97/365 (26%), Positives = 156/365 (42%), Gaps = 38/365 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L +G PPK + DTGS L+W+QC C + ++P + + CS+
Sbjct: 120 YYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSS 179
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTF 129
C+ L N P C + C Y YGD S+G L DL L S +P T+
Sbjct: 180 ECSLLKAATLNDPLCT-ASGVCVYTASYGDASYSMGYLSRDLLTLTPSQ----TLPSFTY 234
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCI-GQNGRGVLFL 187
GCG Q N G AG++GL R ++S+++QL +YG +C+ G FL
Sbjct: 235 GCG--QDNEGLFG--KAAGIVGLARDKLSMLAQLSPKYGY---AFSYCLPTSTSSGGGFL 287
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIFDSGASYAY 243
GK+ S +TPM++NS + Y L A + +G+ G+ + I DSG
Sbjct: 288 SIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPTIIDSGTVVTR 347
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
+Y + ++ ++ + AP L C++G K++ E + + F +
Sbjct: 348 LPISIYAALREAFVK-IMSRRYEQAPAYSILDTCFKGSLKSMSGAPE----IRMIF---Q 399
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
L + L+ + + CL A + IIG Q + YD +IG+
Sbjct: 400 GGADLSLRAPNILIEADKGIACLAF-----ASSNQIAIIGNHQQQTYNIAYDVSASKIGF 454
Query: 364 KPEDC 368
P C
Sbjct: 455 APGGC 459
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 87/275 (31%), Positives = 125/275 (45%), Gaps = 41/275 (14%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----------PEKQ 59
F + YF + +G PPK + DTGSD+ WV C +PCTGC P+
Sbjct: 86 FMVGLYF-TRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQLEFFNPDTS 143
Query: 60 YKPHKNIVPCSNPRCAALHWPNPPRCK-HPNDQCDYEIEYGDGGSSIGALVTDL--FPLR 116
K +PCS+ RC A + C+ N C Y YGDG + G V+D F
Sbjct: 144 STSSK--IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTV 201
Query: 117 FSNGSVFN--VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNV 172
N N + FGC +Q G L+ D A G+ G G+ ++S+VSQL G+ V
Sbjct: 202 MGNEQTANSSASIVFGCSNSQS--GDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKV 259
Query: 173 IGHCI--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD 230
HC+ NG G+L LG+ P G+ +TP++ + HY L ++ +G+ + D
Sbjct: 260 FSHCLKGSDNGGGILVLGEIVEP--GLVYTPLVPSQ---PHYNLNLESIVVNGQKLPI-D 313
Query: 231 LTL---------IFDSGASYAYFTSRVYQEIVSLI 256
+L I DSG + AY Y V+ I
Sbjct: 314 SSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAI 348
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 99/385 (25%), Positives = 159/385 (41%), Gaps = 54/385 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-------YKPHKNI--- 66
+ ++ +G P + DTGS WV C C P E Y P ++
Sbjct: 83 YYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQC--PHESDILRKLTFYDPRSSVSSK 139
Query: 67 -VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR--FSNGSV- 122
V C + C + PP C + +C Y Y DGG ++G L TDL + NG
Sbjct: 140 EVKCDDTICTS----RPP-C-NMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQ 193
Query: 123 -FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
+ +TFGCG Q S G++G G + +SQL G + + HC+ N
Sbjct: 194 PTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTN 253
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNS-----ADLKHYILG------PAELLYSGKSCGLK 229
G G+ +G+ P V TP+++N+ +LK + PA + + K+ G
Sbjct: 254 GGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGT- 310
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
DSG++ Y +Y E++ + PD + F LG V
Sbjct: 311 ----FIDSGSTLVYLPEIIYSELILAVFAK--------HPDITMGAMYNFQCFHFLGSVD 358
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
+ F + F N + L V P YL+ C G + + I+G++ + +
Sbjct: 359 DKFPKITFHF---ENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISN 415
Query: 350 KMVIYDNEKQRIGWKPEDCNTLLSL 374
K+V+YD EKQ IGW +C++ + +
Sbjct: 416 KVVVYDMEKQAIGWTEHNCSSSVKI 440
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 108/388 (27%), Positives = 162/388 (41%), Gaps = 56/388 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
V+L VG PP+ DTGS+L+W+ C AP K ++P + VPC++
Sbjct: 85 LTVSLAVGTPPQNVTMVLDTGSELSWLLC-APAGARNKFSAMSFRPRASSTFAAVPCASA 143
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+C + P+PP C + +C + Y DG SS GAL TD+F + GS + FGC
Sbjct: 144 QCRSRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAV----GSGPPLRAAFGCM 199
Query: 133 YNQHNPGPLSPPD---TAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-QNGRGVLFLG 188
+ + S PD +AG+LG+ RG +S VSQ +CI ++ GVL LG
Sbjct: 200 SSAFD----SSPDGVASAGLLGMNRGALSFVSQAST-----RRFSYCISDRDDAGVLLLG 250
Query: 189 DGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-------------- 233
+P+ + +TPM Q + L ++ + G G K L +
Sbjct: 251 HSDLPTFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQ 310
Query: 234 -IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI------CWRGPFKALG 286
+ DSG + + Y + + R PL A DD + C+R P +
Sbjct: 311 TMVDSGTQFTFLLGDAYSALKAEFTRQ--ARPLLPALDDPSFAFQEAFDTCFRVP-QGRS 367
Query: 287 QVTEYFKPLALSFTNRRNSV---RLV--VPPEAYLVISGRKNVCLGILNGSEAEVGENNI 341
T + L F +V RL+ VP E G CL N + +
Sbjct: 368 PPTARLPGVTLLFNGAEMAVAGDRLLYKVPGERR---GGDGVWCLTFGNADMVPI-MAYV 423
Query: 342 IGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
IG + V YD E+ R+G P C+
Sbjct: 424 IGHHHQMNVWVEYDLERGRVGLAPVRCD 451
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 169/382 (44%), Gaps = 69/382 (18%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F +NL +G P + + DTGSDL W QC PC C P + P K+ +PCS+
Sbjct: 97 FLMNLAIGTPAETYSAIMDTGSDLIWTQCK-PCKVCFDQPTPIFDPEKSSSFSKLPCSSD 155
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL + C +D C+Y YGD S+ G L T+ F F + SV + FGCG
Sbjct: 156 LCVALPISS---C---SDGCEYRYSYGDHSSTQGVLATETF--TFGDASVSKI--GFGCG 205
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFLG 188
+ + AG++GLGRG +S++SQL G+ + +C+ G L +G
Sbjct: 206 EDNRG---RAYSQGAGLVGLGRGPLSLISQL---GVPK--FSYCLTSIDDSKGISTLLVG 257
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDL---TLIFDSG 238
S + TP++QN + Y L G L + ++D LI DSG
Sbjct: 258 SEATVKSAIP-TPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSG 316
Query: 239 ASYAYFTSRVY----QEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKA----LGQVTE 290
+ Y + +E +S + D+ A L +C+ P + Q+
Sbjct: 317 TTITYLKDNAFAALKKEFISQMKLDVD------ASGSTELELCFTLPPDGSPVEVPQLVF 370
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVI-SGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
+F+ V L +P E Y++ S + +CL + GS + + +I G Q+
Sbjct: 371 HFE-----------GVDLKLPKENYIIEDSALRVICLTM--GSSSGM---SIFGNFQQQN 414
Query: 350 KMVIYDNEKQRIGWKPEDCNTL 371
+V++D EK+ I + P CN L
Sbjct: 415 IVVLHDLEKETISFAPAQCNQL 436
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 103/373 (27%), Positives = 160/373 (42%), Gaps = 46/373 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNI----V 67
+ V L +G PPK + DTGS L+W+QC C + Y P +K + V
Sbjct: 125 YYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASV 184
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP- 126
CS + A L N P C+ ++ C Y YGD SIG L DL L S +P
Sbjct: 185 ECSRLKAATL---NDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQ----TLPQ 237
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCI---GQNGR 182
T+GCG Q N G AG++GL R ++S+++QL +YG + +C+
Sbjct: 238 FTYGCG--QDNQGLFG--RAAGIIGLARDKLSMLAQLSTKYG---HAFSYCLPTANSGSS 290
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK----SCGLKDLTLIFDSG 238
G FL G + + +TPML +S + Y L + SG+ + + + + DSG
Sbjct: 291 GGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSG 350
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+Y + ++ ++ T AP L C++G K++ V E + +
Sbjct: 351 TVITRLPMSMYAALRQAFVK-IMSTKYAKAPAYSILDTCFKGSLKSISAVPE----IKMI 405
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENN--IIGEIFMQDKMVIYDN 356
F + L + + L+ + + CL S G N IIG Q + YD
Sbjct: 406 F---QGGADLTLRAPSILIEADKGITCLAFAGSS----GTNQIAIIGNRQQQTYNIAYDV 458
Query: 357 EKQRIGWKPEDCN 369
RIG+ P C+
Sbjct: 459 STSRIGFAPGSCH 471
>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 525
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 101/394 (25%), Positives = 157/394 (39%), Gaps = 59/394 (14%)
Query: 8 FFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE---------- 57
FF ++ + +G P F D GSD+ WV CD C C
Sbjct: 96 FFGNALYWLHYTWIDIGTPNVSFLVALDAGSDMLWVPCD--CIECASLSAGNYNVLDRDL 153
Query: 58 KQYKPH----KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDL 112
QY+P +PC + C + CK D C YE++Y SS G + D
Sbjct: 154 NQYRPSLSNTSRHLPCGHKLCDVHSF-----CKGSKDPCPYEVQYASANTSSSGYVFEDK 208
Query: 113 FPL----RFSNGSVFNVPLTFGCGYNQ-----HNPGPLSPPDTAGVLGLGRGRISIVSQL 163
L + + + + GCG Q H GP GVLGLG G IS+ S L
Sbjct: 209 LHLTSDGKHAEQNSVQASIILGCGRKQTGDYLHGAGP------DGVLGLGPGNISVPSLL 262
Query: 164 REYGLIRNVIGHCIGQNGRGVLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYS 222
+ GLI+N C+ +N G + GD G V + P++ ++ + +G
Sbjct: 263 AKAGLIQNSFSICLDENESGRIIFGDQGHVTQHSTPFLPIIAYMVGVESFCVG------- 315
Query: 223 GKSCGLKD--LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG 280
S LK+ + DSG+S+ + + VYQ++V+ + + + + L W
Sbjct: 316 --SLCLKETRFQALIDSGSSFTFLPNEVYQKVVTEFDKQVNASRIVLQSS-------WEY 366
Query: 281 PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENN 340
+ A Q PL L+F+ RN L+ P Y S + + L S + +
Sbjct: 367 CYNASSQELVNIPPLKLAFS--RNQTFLIQNPIFYDPASQEQEYTIFCLPVSPS-ADDYA 423
Query: 341 IIGEIFMQDKMVIYDNEKQRIGWKPEDCNTLLSL 374
IG+ F+ +++D E R GW +C S
Sbjct: 424 AIGQNFLMGYRLVFDRENLRFGWSRWNCQDRASF 457
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 169/382 (44%), Gaps = 69/382 (18%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F +NL +G P + + DTGSDL W QC PC C P + P K+ +PCS+
Sbjct: 97 FLMNLAIGTPAETYSAIMDTGSDLIWTQCK-PCKVCFDQPTPIFDPEKSSSFSKLPCSSD 155
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL + C +D C+Y YGD S+ G L T+ F F + SV + FGCG
Sbjct: 156 LCVALPISS---C---SDGCEYRYSYGDHSSTQGVLATETF--TFGDASVSKI--GFGCG 205
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFLG 188
+ + AG++GLGRG +S++SQL G+ + +C+ G L +G
Sbjct: 206 EDNRG---RAYSQGAGLVGLGRGPLSLISQL---GVPK--FSYCLTSIDDSKGISTLLVG 257
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDL---TLIFDSG 238
S + TP++QN + Y L G L + ++D LI DSG
Sbjct: 258 SEATVKSAIP-TPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSG 316
Query: 239 ASYAYFTSRVY----QEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKA----LGQVTE 290
+ Y + +E +S + D+ A L +C+ P + Q+
Sbjct: 317 TTITYLKDSAFAALKKEFISQMKLDVD------ASGSTELELCFTLPPDGSPVDVPQLVF 370
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVI-SGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
+F+ V L +P E Y++ S + +CL + GS + + +I G Q+
Sbjct: 371 HFE-----------GVDLKLPKENYIIEDSALRVICLTM--GSSSGM---SIFGNFQQQN 414
Query: 350 KMVIYDNEKQRIGWKPEDCNTL 371
+V++D EK+ I + P CN L
Sbjct: 415 IVVLHDLEKETISFAPAQCNQL 436
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 97/365 (26%), Positives = 154/365 (42%), Gaps = 38/365 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
+ V++ +G P + FDTGSDL+WVQC PC+ C + + + P + + VPC++P
Sbjct: 146 YVVSMGLGTPARDMTVVFDTGSDLSWVQC-TPCSDCYEQKDPLFDPARSSTYSAVPCASP 204
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C L + R K +C YE+ YGD + GAL D L S+ +P FGC
Sbjct: 205 ECQGLDSRSCSRDK----KCRYEVVYGDQSQTDGALARDTLTLTQSD----VLPGFVFGC 256
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCIGQNGRGVLFLGDG 190
G + + G D G++GLGR ++S+ SQ +YG +C+ + +L G
Sbjct: 257 G--EQDTGLFGRAD--GLVGLGREKVSLSSQAASKYGA---GFSYCLPSSPSAAGYLSLG 309
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAYFT 245
+ +T M Y + + +G++ + + + DSG
Sbjct: 310 GPAPANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAAGTVIDSGTVITRLP 369
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
RVY + S R + K AP L C + G T +AL F
Sbjct: 370 PRVYAALRSAFARSMGRYGYKRAPALSILDTC----YDFTGHTTVRIPSVALVFA---GG 422
Query: 306 VRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
+ + L ++ CL NG A+ G IIG + V+YD +Q+IG+
Sbjct: 423 AAVGLDFSGVLYVAKVSQACLAFAPNGDGADAG---IIGNTQQKTLAVVYDVARQKIGFG 479
Query: 365 PEDCN 369
C+
Sbjct: 480 ANGCS 484
>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 511
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 103/373 (27%), Positives = 154/373 (41%), Gaps = 46/373 (12%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCT----GCTKPPEKQYKPH----KNIVP 68
+ +G P F D GSDL W+ CD AP + G QY P +
Sbjct: 85 IDIGTPNISFLVALDAGSDLLWIPCDCIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLS 144
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLR-----FSNGSV 122
CS+ C + P C P C Y I Y + SS G L+ D+ L SN SV
Sbjct: 145 CSHQLCES-----SPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASNSSV 199
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
P+ GCG Q G L G++GLG G IS+ S L + GL++N C +
Sbjct: 200 -RAPVIIGCGMRQTG-GYLDGVAPDGLMGLGLGEISVPSFLSKAGLVKNSFSLCFNDDDS 257
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASY 241
G +F GD + + T L + + YI+G E G SC + DSGAS+
Sbjct: 258 GRIFFGDQGLATQQT--TLFLPSDGKYETYIVG-VEACCIGSSCIKQTSFRALVDSGASF 314
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
+ Y+ +V + + T + + + C++ K L + AL
Sbjct: 315 TFLPDESYRNVVDEFDKQVNAT--RFSFEGYPWEYCYKSSSKELLKNPSVILKFAL---- 368
Query: 302 RRNSVRLVVPPEAYLVISGRKNV---CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
N+ +V P V+ G + V CL I + G+ I+G+ FM +++D E
Sbjct: 369 --NNSFVVHNP--VFVVHGYQGVVGFCLAI----QPADGDIGILGQNFMTGYRMVFDREN 420
Query: 359 QRIGWKPEDCNTL 371
++GW +C L
Sbjct: 421 LKLGWSRSNCQDL 433
>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
Length = 530
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 103/373 (27%), Positives = 154/373 (41%), Gaps = 46/373 (12%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCT----GCTKPPEKQYKPH----KNIVP 68
+ +G P F D GSDL W+ CD AP + G QY P +
Sbjct: 104 IDIGTPNISFLVALDAGSDLLWIPCDCIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLS 163
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLR-----FSNGSV 122
CS+ C + P C P C Y I Y + SS G L+ D+ L SN SV
Sbjct: 164 CSHQLCES-----SPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASNSSV 218
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
P+ GCG Q G L G++GLG G IS+ S L + GL++N C +
Sbjct: 219 -RAPVIIGCGMRQTG-GYLDGVAPDGLMGLGLGEISVPSFLSKAGLVKNSFSLCFNDDDS 276
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASY 241
G +F GD + + T L + + YI+G E G SC + DSGAS+
Sbjct: 277 GRIFFGDQGLATQQT--TLFLPSDGKYETYIVG-VEACCIGSSCIKQTSFRALVDSGASF 333
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
+ Y+ +V + + T + + + C++ K L + AL
Sbjct: 334 TFLPDESYRNVVDEFDKQVNAT--RFSFEGYPWEYCYKSSSKELLKNPSVILKFAL---- 387
Query: 302 RRNSVRLVVPPEAYLVISGRKNV---CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
N+ +V P V+ G + V CL I + G+ I+G+ FM +++D E
Sbjct: 388 --NNSFVVHNP--VFVVHGYQGVVGFCLAI----QPADGDIGILGQNFMTGYRMVFDREN 439
Query: 359 QRIGWKPEDCNTL 371
++GW +C L
Sbjct: 440 LKLGWSRSNCQDL 452
>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 417
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 108/374 (28%), Positives = 153/374 (40%), Gaps = 45/374 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKN- 65
+S + +G P F DTGSDL WV CD AP G + + Y P K+
Sbjct: 1 YSLHYTTVQLGTPGTKFMVALDTGSDLFWVPCDCSRCAPTEGSPYASDFELSVYSPKKSS 60
Query: 66 ---IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDG-GSSIGALVTDLFPLRFSN-- 119
VPC+N CA +C C Y + Y S+ G L+ DL L+ N
Sbjct: 61 TSKTVPCNNSLCAQRD-----QCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTENKH 115
Query: 120 GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
+TFGCG Q L G+ GLG +IS+ S L GL+ N C
Sbjct: 116 SEPIQAYITFGCGQVQSG-SFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSD 174
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
+G G + GD S TP N + I + G + D+T +FDSG
Sbjct: 175 DGVGRINFGDKG--SLEQEETPFNLNQLHPNYNIT--VTSIRVGTTLIDADITALFDSGT 230
Query: 240 SYAYFTSRVYQEIVSLI---MRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
S++YFT +Y ++ + RD P P C+ A +T
Sbjct: 231 SFSYFTDPIYSKLSASFHAQTRDGRHPPNPRIP----FEYCYNMSPDANASLTP-----G 281
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNV--CLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
+S T + V P +VIS + + CL ++ +E NIIG+ FM +++
Sbjct: 282 ISLTMKGGGPFPVYDP--IIVISTQNELIYCLAVVKSAEL-----NIIGQNFMTGYRIVF 334
Query: 355 DNEKQRIGWKPEDC 368
D EK +GWK DC
Sbjct: 335 DREKLVLGWKKFDC 348
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 166/386 (43%), Gaps = 61/386 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP--HKNIVP--CSNP 72
+ ++L +G PP+ DTGSDL W QC APC C P+ + P + VP CS
Sbjct: 103 YLIDLAIGTPPQPVSALLDTGSDLIWTQC-APCASCLAQPDPLFAPAASSSYVPMRCSGQ 161
Query: 73 RC-AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C LH C+ P D C Y YGDG +++G T+ F S+G +VPL FGC
Sbjct: 162 LCNDILHH----SCQRP-DTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVPLGFGC 216
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G N G L+ + +G++G GR +S+VSQL IR +C+ + L G
Sbjct: 217 G--TMNVGSLN--NGSGIVGFGRDPLSLVSQLS----IRR-FSYCLTPYTSTRKSTLMFG 267
Query: 189 -------DGKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------- 233
+G ++G V T +LQ+ + Y + ++G + G + L +
Sbjct: 268 SLSDGVFEGDDAATGQVQTTRLLQSRQNPTFYYVP-----FTGVTVGTRRLRIPLSAFAL 322
Query: 234 --------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPL--KLAPDDKTLPICWRGPFK 283
I DSG + F + V E++ R + P +PDD +C+ P
Sbjct: 323 RPDGSGGVIVDSGTALTLFPAAVLTEVLR-AFRAQLRLPFTSSSSPDDG---VCFATPMA 378
Query: 284 ALGQVTEYFKPLAL-SFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNII 342
A G+ +++ L +P Y++ R+ L IL + G I
Sbjct: 379 AGGRRASAATVVSVPRMAFHFQGADLELPRRNYVLDDPRRG-SLCILLADSGDSGAT--I 435
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDC 368
G QD V+YD E + + + P C
Sbjct: 436 GNFVQQDMRVLYDLEAETLSFAPAQC 461
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 151/366 (41%), Gaps = 40/366 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C K EK + P ++ V C+ P
Sbjct: 182 YVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSSTYANVSCAAP 241
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L+ C C Y ++YGDG SIG D L S ++ F G
Sbjct: 242 ACSDLYTRG---CS--GGHCLYSVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 291
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
+ N G + AG+LGLGRG+ S+ V +YG V HC+ +G G L G
Sbjct: 292 CGERNEGLFG--EAAGLLGLGRGKTSLPVQTYDKYG---GVFAHCLPARSSGTGYLDFGP 346
Query: 190 GKVPSSGV-AWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAY 243
G + G TPML ++ +Y+ G + G+ + I DSG
Sbjct: 347 GSPAAVGARQTTPMLTDNGPTFYYV-GMTGIRVGGQLLSIPQSVFSTAGTIVDSGTVITR 405
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
Y + S + K AP L C+ F + +V ++L F +
Sbjct: 406 LPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCY--DFTGMSEVA--IPKVSLLF---Q 458
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGI-LNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIG 362
L V + + VCLG N + +VG I+G ++ V+YD K+ +G
Sbjct: 459 GGAYLDVNASGIMYAASLSQVCLGFAANEDDDDVG---IVGNTQLKTFGVVYDIGKKTVG 515
Query: 363 WKPEDC 368
+ P C
Sbjct: 516 FSPGAC 521
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 149/364 (40%), Gaps = 38/364 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P ++ V C+ P
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAP 238
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L + C C Y ++YGDG SIG D L S ++ F G
Sbjct: 239 ACSDL---DTRGCS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 288
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
+ N G + AG+LGLGRG+ S+ V +YG V HC+ G G L G
Sbjct: 289 CGERNEGLFG--EAAGLLGLGRGKTSLPVQTYDKYG---GVFAHCLPARSTGTGYLDFGA 343
Query: 190 GKVPSSGVAWTPMLQNSADLKHY-----ILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
G P++ + TPML ++ +Y I LLY +S I DSG
Sbjct: 344 GS-PAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSV-FATAGTIVDSGTVITRL 401
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN 304
Y + S + K AP L C+ F + QV ++L F +
Sbjct: 402 PPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCY--DFAGMSQVA--IPTVSLLF---QG 454
Query: 305 SVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
RL V + + VCL + + G+ I+G ++ V YD K+ + +
Sbjct: 455 GARLDVDASGIMYAASASQVCLAF--AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFS 512
Query: 365 PEDC 368
P C
Sbjct: 513 PGAC 516
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 149/365 (40%), Gaps = 38/365 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC+ C + EK + P ++ + C+ P
Sbjct: 186 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANISCAAP 245
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L+ C C Y ++YGDG SIG D L S ++ F G
Sbjct: 246 ACSDLYTKG---CS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAIKGFRFG 295
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
+ N G + AG+LGLGRG+ S+ V +YG V HC +G G L G
Sbjct: 296 CGERNEGLFG--EAAGLLGLGRGKTSLPVQAYDKYG---GVFAHCFPARSSGTGYLDFGP 350
Query: 190 GKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYAY 243
G P+ S TPML ++ L Y +G + GK + I DSG
Sbjct: 351 GSSPAVSTKLTTPMLVDNG-LTFYYVGLTGIRVGGKLLSIPPSVFTTAGTIVDSGTVITR 409
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
Y + S + K AP L C+ F + QV ++L F +
Sbjct: 410 LPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYD--FTGMSQVA--IPTVSLLF---Q 462
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
L V + + CLG E + + I+G ++ V+YD K+ +G+
Sbjct: 463 GGASLDVDASGIIYAASVSQACLGFAANEEDD--DVGIVGNTQLKTFGVVYDIGKKVVGF 520
Query: 364 KPEDC 368
P C
Sbjct: 521 SPGAC 525
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 104/390 (26%), Positives = 161/390 (41%), Gaps = 62/390 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
V+L VG PP+ DTGS+L+W+ C TG ++P + VPC +
Sbjct: 61 LTVSLAVGTPPQNVTMVLDTGSELSWLLC---ATGRAAAAAADSFRPRASATFAAVPCGS 117
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
RC++ P PP C + +C + Y DG +S GAL TD+F + G + FGC
Sbjct: 118 ARCSSRDLPAPPSCDAASRRCRVSLSYADGSASDGALATDVFAV----GDAPPLRSAFGC 173
Query: 132 GYNQHNPGPLSPPD---TAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-QNGRGVLFL 187
++ S PD TAG+LG+ RG +S V+Q +CI ++ GVL L
Sbjct: 174 MSAAYD----SSPDAVATAGLLGMNRGALSFVTQAST-----RRFSYCISDRDDAGVLLL 224
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-------------- 233
G +P + +TP+ Q + L ++ + G G K L +
Sbjct: 225 GHSDLPFLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQ 284
Query: 234 -IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI------CWRGPFKALG 286
+ DSG + + Y + + ++ PL A +D + C+R P K
Sbjct: 285 TMVDSGTQFTFLLGDAYSAVKAEFLKQT--KPLLPALEDPSFAFQEAFDTCFRVP-KGRP 341
Query: 287 QVTEYFKPLALSFTNRRNSV---RLVVPPEAYLVISGRKNV----CLGILNGSEAEVGEN 339
+ P+ L F + SV RL+ Y V R+ CL N +
Sbjct: 342 PPSARLPPVTLLFNGAQMSVAGDRLL-----YKVPGERRGADGVWCLTFGNADMVPL-TA 395
Query: 340 NIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+IG + V YD E+ R+G P C+
Sbjct: 396 YVIGHHHQMNLWVEYDLERGRVGLAPVKCD 425
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 92/388 (23%), Positives = 156/388 (40%), Gaps = 45/388 (11%)
Query: 1 MYVSWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQY 60
M I+ P + +NL +G PP DTGSDLTW QC PCT C K +
Sbjct: 76 MTSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQC-RPCTHCYKQVVPLF 134
Query: 61 KPHKNIV----PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR 116
P + C C AL R +C + Y DG + G L ++ +
Sbjct: 135 DPKNSSTYRDSSCGTSFCLAL---GKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVD 191
Query: 117 FSNGSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGH 175
+ G + P FGCG H+ G + ++G++GLG G +S++SQL+ I + +
Sbjct: 192 STAGKPVSFPGFAFGCG---HSSGGIFDKSSSGIVGLGGGELSLISQLKS--TINGLFSY 246
Query: 176 CI------GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSG 223
C+ + F G+V G TP++Q S D +Y+ +G L Y G
Sbjct: 247 CLLPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKG 306
Query: 224 --KSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP 281
K +++ +I DSG +Y + Y ++ + + G ++ + +C+
Sbjct: 307 YSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGK--RVRDPNGIFSLCYN-- 362
Query: 282 FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNI 341
E P+ T + + P + VC + A + +
Sbjct: 363 -----TTAEINAPI---ITAHFKDANVELQPLNTFMRMQEDLVCFTV-----APTSDIGV 409
Query: 342 IGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+G + + +V +D K+R+ +K DC
Sbjct: 410 LGNLAQVNFLVGFDLRKKRVSFKAADCT 437
>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
Length = 515
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 158/379 (41%), Gaps = 51/379 (13%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKPPEKQ------YKPH-- 63
F ++A N+TVG P F DTGSDL W+ CD C K P Y P+
Sbjct: 102 FLHYA-NVTVGTPSDWFLVALDTGSDLFWLPCDCSTNCVRELKAPGGSSLDLNIYSPNAS 160
Query: 64 --KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPL--RFS 118
+ VPC++ C + RC P C Y+I Y +G SS G LV D+ L
Sbjct: 161 STSSKVPCNSTLCTRVD-----RCASPLSDCPYQIRYLSNGTSSTGVLVEDVLHLVSMEK 215
Query: 119 NGSVFNVPLTFGCGYNQ----HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIG 174
N +T GCG Q H+ + P+ G+ GLG IS+ S L + G+ N
Sbjct: 216 NSKPIRARITLGCGLVQTGVFHDG---AAPN--GLFGLGLEDISVPSVLAKEGIAANSFS 270
Query: 175 HCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLI 234
C G +G G + GD S TP+ + + + G + G + +
Sbjct: 271 MCFGDDGAGRISFGDKG--SVDQRETPLNIRQPHPTYNV--TVTQISVGGNTGDLEFDAV 326
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPL-KLAPDDKTLPI--CWRGPFKALGQVTEY 291
FD+G S+ Y T Y +LI L K D LP C+ A+ +
Sbjct: 327 FDTGTSFTYLTDAPY----TLISESFNSLALDKRYQTDSELPFEYCY-----AVSPNKKS 377
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
F+ ++ T + S V P + I CL I+ + +IIG+ FM
Sbjct: 378 FEYPDVNLTMKGGSSYPVYHPLIVVPIEDTVVYCLAIMKSEDI-----SIIGQNFMTGYR 432
Query: 352 VIYDNEKQRIGWKPEDCNT 370
V++D EK +GWK DC+T
Sbjct: 433 VVFDREKLILGWKESDCST 451
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 163/376 (43%), Gaps = 39/376 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C C C + +++P + V C+
Sbjct: 92 YYTTRLWIGTPPQRFALIVDTGSTVTYVPCST-CKHCGSHQDPKFRPEASETYQPVKCTW 150
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
+C C QC YE Y + +S G L D+ + F N S + FG
Sbjct: 151 -QC---------NCDDDRKQCTYERRYAEMSTSSGVLGEDV--VSFGNQSELSPQRAIFG 198
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
C ++ G + G++GLGRG +SI+ QL E +I + C G G G + G
Sbjct: 199 CENDE--TGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLG 256
Query: 191 KV-PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYAY 243
+ P + + +T + +Y + E+ +GK L + DSG +YAY
Sbjct: 257 GISPPADMVFTH--SDPVRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAY 314
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
+ IM++ PD IC+ G + Q+++ F + + F N
Sbjct: 315 LPESAFLAFKHAIMKETHSLKRISGPDPHYNDICFSGAEINVSQLSKSFPVVEMVFGNGH 374
Query: 304 NSVRLVVPPEAYLVISG--RKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
+L + PE YL R CLG+ NG++ ++G I +++ +V+YD E +
Sbjct: 375 ---KLSLSPENYLFRHSKVRGAYCLGVFSNGNDP----TTLLGGIVVRNTLVMYDREHSK 427
Query: 361 IGWKPEDCNTLLSLNH 376
IG+ +C+ L H
Sbjct: 428 IGFWKTNCSELWERLH 443
>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
Length = 469
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 153/363 (42%), Gaps = 33/363 (9%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----VPC 69
+ VG P F DTGSDL WV CD AP +G ++ Y+P ++ +PC
Sbjct: 100 VDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSRHLPC 159
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSV-FNVPL 127
S+ C ++ P C +P C Y I+Y + +S G L+ D L + V N +
Sbjct: 160 SHELCQSV-----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASV 214
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
GCG Q L G+LGLG IS+ S L GL++N C ++ G +F
Sbjct: 215 IIGCGQKQSG-DYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFF 273
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSR 247
GD VPS TP + L+ Y + + K + DSG S+
Sbjct: 274 GDQGVPSQQS--TPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPFD 331
Query: 248 VYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVR 307
VY+ + + T ++ +D T C+ + V + L+F + S++
Sbjct: 332 VYKAFTMEFDKQMNAT--RVPYEDTTWKYCYSASPLEMPDVPT----ITLTFAADK-SLQ 384
Query: 308 LVVPPEAYLVISGR-KNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPE 366
V P + G CL +L +E +G II + F+ V++D E ++GW
Sbjct: 385 AVNPILPFNDKQGALAGFCLAVLPSTEP-IG---IIAQNFLVGYHVVFDRESMKLGWYRS 440
Query: 367 DCN 369
+C
Sbjct: 441 ECK 443
>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
Length = 485
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 153/361 (42%), Gaps = 33/361 (9%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----VPCSN 71
VG P F DTGSDL WV CD AP +G ++ Y+P ++ +PCS+
Sbjct: 72 VGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSRHLPCSH 131
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSV-FNVPLTF 129
C ++ P C +P C Y I+Y + +S G L+ D L + V N +
Sbjct: 132 ELCQSV-----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASVII 186
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD 189
GCG Q L G+LGLG IS+ S L GL++N C ++ G +F GD
Sbjct: 187 GCGQKQSG-DYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFFGD 245
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVY 249
VPS TP + L+ Y + + K + DSG S+ VY
Sbjct: 246 QGVPSQQS--TPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPLDVY 303
Query: 250 QEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLV 309
+ + + T ++ +D T C+ + V + L+F + S++ V
Sbjct: 304 KAFTMEFDKQMNAT--RVPYEDTTWKYCYSASPLEMPDVPT----ITLTFAADK-SLQAV 356
Query: 310 VPPEAYLVISGR-KNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
P + G CL +L +E +G II + F+ V++D E ++GW +C
Sbjct: 357 NPILPFNDKQGALAGFCLAVLPSTEP-IG---IIAQNFLVGYHVVFDRESMKLGWYRSEC 412
Query: 369 N 369
+
Sbjct: 413 H 413
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/376 (25%), Positives = 164/376 (43%), Gaps = 52/376 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ ++ +VG PP DTGSD+ W+QC PC C + + P K+ I+P S+
Sbjct: 86 YLISYSVGIPPFQLYGIIDTGSDMIWLQCK-PCEKCYNQTTRIFDPSKSNTYKILPFSST 144
Query: 73 RCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FG 130
C ++ C N + C+Y I YGDG S G L + L +NGS T G
Sbjct: 145 TCQSVE---DTSCSSDNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTVIG 201
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREY-GLIRNVIGHCIGQN--------- 180
CG N ++G++GLG G +S+++QLR I +C+
Sbjct: 202 CGRNNTVS---FEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSNISSKLNF 258
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL-TLIFDSGA 239
G + GDG V + V P + L+ + +G + ++ S + +I DSG
Sbjct: 259 GDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGEKGNIIIDSGT 318
Query: 240 SYAYFTSRVYQEIVS----LIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQ--VTEYFK 293
+ + +Y ++ S L+ D + PL K L +C+R F L + +F
Sbjct: 319 TLTLLPNDIYSKLESAVADLVELDRVKDPL------KQLSLCYRSTFDELNAPVIMAHFS 372
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVI 353
+ + N+V + E + CL ++ +++G I G + Q+ +V
Sbjct: 373 GADV----KLNAVNTFIEVEQGV-------TCLAFIS---SKIGP--IFGNMAQQNFLVG 416
Query: 354 YDNEKQRIGWKPEDCN 369
YD +K+ + +KP DC+
Sbjct: 417 YDLQKKIVSFKPTDCS 432
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 101/379 (26%), Positives = 155/379 (40%), Gaps = 52/379 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE--------KQYKPHKNIV 67
Y+ + +G PP F DTGS +T+V C + CT C + YKP +
Sbjct: 34 YYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSS-CTHCGNHQDPRFSPALSSSYKPLECGS 92
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVP 126
CS C + Y+ +Y + +S G L D+ + FSN S +
Sbjct: 93 ECSTGFC--------------DGSRKYQRQYAEKSTSSGVLGKDV--IGFSNSSDLGGQR 136
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGV 184
L FGC G L G++GLGRG +SI+ QL E + +V C G G G
Sbjct: 137 LVFGC--ETAETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGA 194
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK------DLTLIFDSG 238
+ LG + P V S +Y L + G LK + DSG
Sbjct: 195 MILGGFQPPKDMVFTASDPHRSP---YYNLMLKGIRVGGSPLRLKPEVFDGKYGTVLDSG 251
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+YAYF +Q S + + PD+K IC+ G + ++++F +
Sbjct: 252 TTYAYFPGAAFQAFKSAVKEQVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFV 311
Query: 299 FTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
F + ++ + + PE YL ISG CLG+ + ++G I +++ +V Y
Sbjct: 312 FGDGQS---VTLSPENYLFRHTKISGA--YCLGVFENGDP----TTLLGGIIVRNMLVTY 362
Query: 355 DNEKQRIGWKPEDCNTLLS 373
+ K IG+ CN L S
Sbjct: 363 NRGKASIGFLKTKCNDLWS 381
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 160/373 (42%), Gaps = 49/373 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI--VPCSNPRC 74
+ + + +G P DTGSDL W +C+ PCT C+ V C + C
Sbjct: 42 YLIQMAIGTPALSLSAIMDTGSDLVWTKCN-PCTDCSTSSIYDPSSSSTYSKVLCQSSLC 100
Query: 75 AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYN 134
P+ C + D C+Y YGD S+ G L + F + S+ S+ N+ TFGCG++
Sbjct: 101 ---QPPSIFSCNNDGD-CEYVYPYGDRSSTSGILSDETFSI--SSQSLPNI--TFGCGHD 152
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFLGD- 189
+ G++G GRG +S+VSQL + N +C+ + LF+G+
Sbjct: 153 NQGFDKV-----GGLVGFGRGSLSLVSQLGPS--MGNKFSYCLVSRTDSSKTSPLFIGNT 205
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDSGA 239
+ ++ V TP++Q+S+ HY L + G+S + T LI DSG
Sbjct: 206 ASLEATTVGSTPLVQSSS-TNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGT 264
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
+ + Y + ++ + + + L D L +C F G F + F
Sbjct: 265 TLTFLQQTAYDAV-----KEAMVSSINLPQADGQLDLC----FNQQGSSNPGFPSMTFHF 315
Query: 300 TNRRNSVRLVVPPEAYLVISGRKN-VCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
VP E YL + VCL ++ + + +G I G + Q+ ++YDNE
Sbjct: 316 ----KGADYDVPKENYLFPDSTSDIVCLAMMP-TNSNLGNMAIFGNVQQQNYQILYDNEN 370
Query: 359 QRIGWKPEDCNTL 371
+ + P C+TL
Sbjct: 371 NVLSFAPTACDTL 383
>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 529
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 104/372 (27%), Positives = 143/372 (38%), Gaps = 43/372 (11%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK----------QYKPHKNI---- 66
+ +G P F D GSDL WV CD C C +Y P +++
Sbjct: 104 IDIGTPSTSFLVALDAGSDLLWVPCD--CIHCAPLSASFYSNLDRDLNEYSPSRSLSSKH 161
Query: 67 VPCSNPRCAALHWPNPPRCK-HPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSVFN 124
+ CS+ C CK QC Y I Y D SS G LV D+F L+ +GS N
Sbjct: 162 LSCSHRLCDM-----GSNCKTSKQQQCPYTINYLSDNTSSSGLLVEDIFHLQSGDGSTSN 216
Query: 125 ----VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
P+ GCG Q + G L G++GLG G S+ S L + GLIR+ C ++
Sbjct: 217 SSVQAPVVVGCGMKQ-SGGYLDGTAPDGLIGLGPGESSVPSFLAKSGLIRDSFSLCFNED 275
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGA 239
G LF GD S+ TP L YI+G E G SC + FDSG
Sbjct: 276 DSGRLFFGDQG--STVQQSTPFLLVDGMFSTYIVG-VETCCIGNSCPKVTSFNAQFDSGT 332
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
S+ + Y I + + T W + Q L L F
Sbjct: 333 SFTFLPGHAYGAIAEEFDKQVNATRSTFQGSP------WEYCYVPSSQQLPKIPTLTLMF 386
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
+ NS + P G CL I + G IG+ FM +++D E +
Sbjct: 387 -QQNNSFVVYNPVFVSYNEQGVDGFCLAI----QPTEGGMGTIGQNFMTGYRLVFDRENK 441
Query: 360 RIGWKPEDCNTL 371
++ W +C L
Sbjct: 442 KLAWSHSNCQDL 453
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 101/373 (27%), Positives = 158/373 (42%), Gaps = 54/373 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
+ +NL++G P + F DTGSDL W QC PCT C + P + +PCS+
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPCSSQ 153
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL + P C N+ C Y YGDG + G++ T+ L F + S+ N+ TFGCG
Sbjct: 154 LCQAL---SSPTCS--NNFCQYTYGYGDGSETQGSMGTE--TLTFGSVSIPNI--TFGCG 204
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQL--REYGLIRNVIGHCIGQNGRGVLFLGD- 189
N G + AG++G+GRG +S+ SQL ++ IG N L LG
Sbjct: 205 ENNQGFG---QGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTPSN----LLLGSL 257
Query: 190 GKVPSSGVAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDLT----LIFDSG 238
++G T ++Q+S L +G L + L +I DSG
Sbjct: 258 ANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSG 317
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL- 297
+ YF + YQ + + I P+ + +C++ P P L
Sbjct: 318 TTLTYFVNNAYQSVRQEFISQ-INLPV-VNGSSSGFDLCFQTP----------SDPSNLQ 365
Query: 298 --SFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
+F + L +P E Y + +CL + + S+ +I G I Q+ +V+YD
Sbjct: 366 IPTFVMHFDGGDLELPSENYFISPSNGLICLAMGSSSQGM----SIFGNIQQQNMLVVYD 421
Query: 356 NEKQRIGWKPEDC 368
+ + C
Sbjct: 422 TGNSVVSFASAQC 434
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 157/376 (41%), Gaps = 60/376 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
+ +NL++G P + F DTGSDL W QC PCT C + P + +PCS+
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPCSSQ 153
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL P C N+ C Y YGDG + G++ T+ L F + S+ N+ TFGCG
Sbjct: 154 LCQALQ---SPTCS--NNSCQYTYGYGDGSETQGSMGTE--TLTFGSVSIPNI--TFGCG 204
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLGD 189
N G + AG++G+GRG +S+ SQL +C IG + L LG
Sbjct: 205 ENNQGFG---QGNGAGLVGMGRGPLSLPSQLD-----VTKFSYCMTPIGSSTSSTLLLGS 256
Query: 190 -GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL----------------T 232
++G T ++++S Y + +G S G L
Sbjct: 257 LANSVTAGSPNTTLIESSQIPTFYY-----ITLNGLSVGSTPLPIDPSVFKLNSNNGTGG 311
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
+I DSG + YF YQ + + + + + +C++ P ++
Sbjct: 312 IIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVN--GSSSGFDLCFQMP-------SDQS 362
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
+F + LV+P E Y + +CL + + S+ +I G I Q+ +V
Sbjct: 363 NLQIPTFVMHFDGGDLVLPSENYFISPSNGLICLAMGSSSQGM----SIFGNIQQQNLLV 418
Query: 353 IYDNEKQRIGWKPEDC 368
+YD + + C
Sbjct: 419 VYDTGNSVVSFLFAQC 434
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 101/387 (26%), Positives = 164/387 (42%), Gaps = 47/387 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKN----IV 67
+ YF L +G P + F DTGS +T+V C A C P K + P + ++
Sbjct: 59 YGYFYATLHLGTPARQFAVIVDTGSTITYVPC-ASCGRNCGPHHKDAAFDPASSSSSAVI 117
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
C + +C PP +C Y+ Y + SS G LV+D LR +G+V +
Sbjct: 118 GCDSDKCIC---GRPPCGCSEKRECTYQRTYAEQSSSAGLLVSDQLQLR--DGAV---EV 169
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGRGVLF 186
FGC G + + G+LGLG +S+V+QL G+I +V C G G G L
Sbjct: 170 VFGC--ETKETGEIYNQEADGILGLGNSEVSLVNQLAGSGVIDDVFALCFGSVEGDGALM 227
Query: 187 LGDGKVPSSGVA--WTPMLQNSADLKHYILGPAELLYSGKSCGLK------DLTLIFDSG 238
LGD VA +T +L + A +Y + L G+ +K + DSG
Sbjct: 228 LGDVDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKPERYEEGYGTVLDSG 287
Query: 239 ASYAYFTSRVYQ----EIVSLIMRDLIGTPLKLAPDDKTLP----ICWRGPFKA----LG 286
++ Y S +Q + + + + + P +K+ IC+ G A
Sbjct: 288 TTFTYLPSEAFQLFKEAVSAYALEHGLNSVKGPDPKEKSFAQFHDICFGGAPHAGHADQS 347
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVI-SGRKNV-CLGILNGSEAEVGENNIIGE 344
++ + F L F + VRL P YL + +G CLG+ + + ++G
Sbjct: 348 KLEKVFPVFELQFA---DGVRLRTGPLNYLFMHTGEMGAYCLGVFDNGAS----GTLLGG 400
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCNTL 371
I ++ +V YD +R+G+ C +
Sbjct: 401 ISFRNILVQYDRRNRRVGFGAASCQEI 427
>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
gi|194704920|gb|ACF86544.1| unknown [Zea mays]
gi|223949445|gb|ACN28806.1| unknown [Zea mays]
gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
Length = 515
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 152/361 (42%), Gaps = 33/361 (9%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----VPCSN 71
VG P F DTGSDL WV CD AP +G ++ Y+P ++ +PCS+
Sbjct: 102 VGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSRHLPCSH 161
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSV-FNVPLTF 129
C ++ P C +P C Y I+Y + +S G L+ D L + V N +
Sbjct: 162 ELCQSV-----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASVII 216
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD 189
GCG Q L G+LGLG IS+ S L GL++N C ++ G +F GD
Sbjct: 217 GCGQKQSG-DYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFFGD 275
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVY 249
VPS TP + L+ Y + + K + DSG S+ VY
Sbjct: 276 QGVPSQQS--TPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPFDVY 333
Query: 250 QEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLV 309
+ + + T ++ +D T C+ + V + L+F + S++ V
Sbjct: 334 KAFTMEFDKQMNAT--RVPYEDTTWKYCYSASPLEMPDVPT----ITLTFAADK-SLQAV 386
Query: 310 VPPEAYLVISGR-KNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
P + G CL +L +E +G II + F+ V++D E ++GW +C
Sbjct: 387 NPILPFNDKQGALAGFCLAVLPSTEP-IG---IIAQNFLVGYHVVFDRESMKLGWYRSEC 442
Query: 369 N 369
Sbjct: 443 R 443
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 104/372 (27%), Positives = 154/372 (41%), Gaps = 35/372 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT--GCTKPPEKQYKPHKNIVPCSNPRC 74
+ V++ +G P + FDTGSDL+WVQC PC+ GC K + + P + S RC
Sbjct: 154 YVVSVGLGTPARDLTVVFDTGSDLSWVQC-GPCSSGGCYKQQDPLFAPSDSST-FSAVRC 211
Query: 75 AALHWPNPPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRF---SNGSVFN---VP 126
A C +D+C YE+ YGD + G L D L +N S N +P
Sbjct: 212 GARECRARQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAENDNKLP 271
Query: 127 -LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGR 182
FGCG N N G D G+ GLGRG++S+ SQ G +C+ +
Sbjct: 272 GFVFGCGEN--NTGLFGQAD--GLFGLGRGKVSLSSQ--AAGKFGEGFSYCLPSSSSSAP 325
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD----LTLIFDSG 238
G L LG + +TPML + Y + + +G++ + L LI DSG
Sbjct: 326 GYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPLIVDSG 385
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
R Y+ + + + + K AP L C+ F A T +AL
Sbjct: 386 TVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYD--FTAHANATVSIPAVALV 443
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNE 357
F + V L ++ CL NG G I+G + V+YD
Sbjct: 444 FA---GGATISVDFSGVLYVAKVAQACLAFAPNGDGRSAG---ILGNTQQRTLAVVYDVA 497
Query: 358 KQRIGWKPEDCN 369
+Q+IG+ + C+
Sbjct: 498 RQKIGFAAKGCS 509
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 106/382 (27%), Positives = 164/382 (42%), Gaps = 54/382 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC-TGCTKPPEKQYKPHKN----IVPC-- 69
+ + L +G PP + DTGSDL W QC APC + C K + Y P + ++PC
Sbjct: 88 YIMTLAIGTPPLSYPAIADTGSDLIWTQC-APCGSQCFKQAGQPYNPSSSTTFGVLPCNS 146
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LT 128
S CAAL P+PP P C Y YG G ++ G + F + VP +
Sbjct: 147 SVSMCAALAGPSPP----PGCSCMYNQTYGTGWTA-GIQSVETFTFGSTPADQTRVPGIA 201
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG 188
FGC N +AG++GLGRG +S+VSQL G+ + N L LG
Sbjct: 202 FGC----SNASSDDWNGSAGLVGLGRGSMSLVSQLGA-GMFSYCLTPFQDANSTSTLLLG 256
Query: 189 -DGKVPSSGVAWTPMLQ--NSADLKHYILGPAELLYSGKSCGLKDLT------------- 232
+ +GV TP + + A + Y L +G S G L+
Sbjct: 257 PSAALNGTGVLTTPFVASPSKAPMSTYYY----LNLTGISIGTTALSIPPNAFALRTDGT 312
Query: 233 --LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
LI DSG + YQ++ + I L+ P+ D L +C+ +E
Sbjct: 313 GGLIIDSGTTITSLVDAAYQQVRAAI-ESLVTLPVADGSDSTGLDLCF-------ALTSE 364
Query: 291 YFKPLAL-SFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
P ++ S T + +V+P + Y+++ G CL + N + VG + G Q+
Sbjct: 365 TSTPPSMPSMTFHFDGADMVLPVDNYMIL-GSGVWCLAMRNQT---VGAMSTFGNYQQQN 420
Query: 350 KMVIYDNEKQRIGWKPEDCNTL 371
++YD ++ + + P C+TL
Sbjct: 421 VHLLYDIHEETLSFAPAKCSTL 442
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 106/393 (26%), Positives = 161/393 (40%), Gaps = 61/393 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKNI----V 67
V+L VG PP+ DTGS+L+W+ C G + ++P + V
Sbjct: 63 LTVSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAV 122
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
PC + +C++ P PP C + QC + Y DG +S GAL TD+F + G +
Sbjct: 123 PCGSTQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGALATDVFAV----GEAPPLRS 178
Query: 128 TFGCGYNQHNPGPLSPPD---TAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-QNGRG 183
FGC ++ S PD TAG+LG+ RG +S V+Q +CI ++ G
Sbjct: 179 AFGCMSTAYD----SSPDGVATAGLLGMNRGTLSFVTQAST-----RRFSYCISDRDDAG 229
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------- 233
VL LG +P + +TP+ Q + L ++ + G G K L +
Sbjct: 230 VLLLGHSDLPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHT 289
Query: 234 -----IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD------KTLPICWRGPF 282
+ DSG + + Y + + ++ PL A DD + L C+R P
Sbjct: 290 GAGQTMVDSGTQFTFLLGDAYSALKAEFLKQT--KPLLRALDDPSFAFQEALDTCFRVP- 346
Query: 283 KALGQVTEYFKPLALSFTNRRNSV---RLV--VPPEAYLVISGRKNV-CLGILNGSEAEV 336
+ P+ L F SV RL+ VP E G V CL N +
Sbjct: 347 AGRPPPSARLPPVTLLFNGAEMSVAGDRLLYKVPGEH----RGADGVWCLTFGNADMVPL 402
Query: 337 GENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+IG + V YD E+ R+G P C+
Sbjct: 403 -TAYVIGHHHQMNLWVEYDLERGRVGLAPVKCD 434
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 162/383 (42%), Gaps = 60/383 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F + L +G PP+ F DTGSDL W QC PC C + P ++ + CS+
Sbjct: 366 FLMKLAIGSPPRSFSAIMDTGSDLIWTQC-KPCQQCFDQSTPIFDPKQSSSFYKISCSSE 424
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C AL P +D C+Y YGD S+ G L + F S ++P L FGC
Sbjct: 425 LCGAL-----PTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGC 479
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD-- 189
G + + G AG++GLGRG +S+VSQL+E + I + L LG
Sbjct: 480 GNDNNGDG---FSQGAGLVGLGRGPLSLVSQLKEQKFAYCLTA--IDDSKPSSLLLGSLA 534
Query: 190 ---GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFD 236
K + TP+++N + Y L + G + T +I D
Sbjct: 535 NITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIID 594
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK---TLPICWRGPFKA----LGQVT 289
SG + Y + + +++ + L DD L +C+ P + ++T
Sbjct: 595 SGTTITYVENSAFTS-----LKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLT 649
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN-VCLGILNGSEAEVGENNIIGEIFMQ 348
+FK L +P E Y++ + +CL I GS + +I G + Q
Sbjct: 650 FHFK-----------GADLELPGENYMIGDSKAGLLCLAI--GSSRGM---SIFGNLQQQ 693
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
+ MV++D +++ + + P C+++
Sbjct: 694 NFMVVHDLQEETLSFLPTQCDSI 716
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 106/373 (28%), Positives = 156/373 (41%), Gaps = 48/373 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
F + L +G PP+ + DTGSDL W QC PCT C P + P K+ +
Sbjct: 97 FLMKLAIGTPPETYSAIMDTGSDLIWTQCK-PCTQCFDQPTPIFDPKKSSSFSKLSCSSK 155
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
L P +D C+Y YGD S+ G L ++ L F SV V FGCG +
Sbjct: 156 LCEALPQST--CSDGCEYLYGYGDYSSTQGMLASE--TLTFGKVSVPEV--AFGCGEDNE 209
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQLRE----YGLIRNVIGHCIGQNGRGVLFLG---D 189
G +G++GLGRG +S+VSQL+E Y L + L +G
Sbjct: 210 GSG---FSQGSGLVGLGRGPLSLVSQLKEPKFSYCLTS------VDDTKASTLLMGSLAS 260
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDSGA 239
K S + TP++QNSA Y L + S +K T LI DSG
Sbjct: 261 VKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGT 320
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
+ Y + ++V+ I P+ L +C+ P G L F
Sbjct: 321 TITYLEQSAF-DLVAKEFTSQINLPVD-NSGSTGLEVCFTLPS---GSTDIEVPKLVFHF 375
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
+ L +P E Y++ V CL + GS + + +I G I Q+ +V++D EK
Sbjct: 376 ----DGADLELPAENYMIADASMGVACLAM--GSSSGM---SIFGNIQQQNMLVLHDLEK 426
Query: 359 QRIGWKPEDCNTL 371
+ + + P C+ L
Sbjct: 427 ETLSFLPTQCDEL 439
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 110/378 (29%), Positives = 154/378 (40%), Gaps = 48/378 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ V+L +G PP+ DTGSDL W QC PC C + P ++ C +
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLSLTSCDST 140
Query: 73 RCAALHWPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L + K PN C Y YGD + G L D F + SV V FGC
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGV--AFGC 198
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G N G +T G+ G GRG +S+ SQL+ G + G VL
Sbjct: 199 GL--FNNGVFKSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTAVNGLKPSTVLLDLPAD 254
Query: 192 VPSSG---VAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDLT--LIFDSGA 239
+ SG V TP++QN A+ LK +G L LK+ T I DSG
Sbjct: 255 LYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGT 314
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGPFKALGQVTEYFKPLA 296
+ +RVY+ ++RD +KL + T P C P +A Y L
Sbjct: 315 AMTSLPTRVYR-----LVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRA----KPYVPKLV 365
Query: 297 LSFTNRRNSVRLVVPPEAYLVI---SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVI 353
L F + +P E Y+ +G +CL I+ G GE IG Q+ V+
Sbjct: 366 LHF----EGATMDLPRENYVFEVEDAGSSILCLAIIEG-----GEVTTIGNFQQQNMHVL 416
Query: 354 YDNEKQRIGWKPEDCNTL 371
YD + ++ + P C+ L
Sbjct: 417 YDLQNSKLSFVPAQCDKL 434
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 109/380 (28%), Positives = 166/380 (43%), Gaps = 48/380 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG--CTKPPEKQYKPHKN----IVPCS 70
+ + L++G PP + DTGSDL W QC APC+G C P Y P + ++PC+
Sbjct: 92 YLMTLSIGTPPLSYPAIADTGSDLIWTQC-APCSGDQCFAQPAPLYNPASSTTFGVLPCN 150
Query: 71 N--PRCAA-LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP- 126
+ CA L PP P C Y YG G ++ G ++ F + VP
Sbjct: 151 SSLSMCAGVLAGKAPP----PGCACMYNQTYGTGWTA-GVQGSETFTFGSAAADQARVPG 205
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
+ FGC N +AG++GLGRG +S+VSQL G + N L
Sbjct: 206 IAFGC----SNASSSDWNGSAGLVGLGRGSLSLVSQLGA-GRFSYCLTPFQDTNSTSTLL 260
Query: 187 LG-DGKVPSSGVAWTPMLQNSA----------DLKHYILGPAELLYSGKSCGLK-DLT-- 232
LG + +GV TP + + A +L LG L S + LK D T
Sbjct: 261 LGPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGG 320
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
LI DSG + + YQ++ + + + L+ P D L +C+ P T
Sbjct: 321 LIIDSGTTITSLVNAAYQQVRAAV-QSLVTLPAIDGSDSTGLDLCYALP-------TPTS 372
Query: 293 KPLAL-SFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
P A+ S T + +V+P ++Y+ ISG CL + N ++ G + G Q+
Sbjct: 373 APPAMPSMTLHFDGADMVLPADSYM-ISGSGVWCLAMRNQTD---GAMSTFGNYQQQNMH 428
Query: 352 VIYDNEKQRIGWKPEDCNTL 371
++YD + + + P C+TL
Sbjct: 429 ILYDVRNEMLSFAPAKCSTL 448
>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 542
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 105/374 (28%), Positives = 155/374 (41%), Gaps = 54/374 (14%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK----------QYKPHKNI---- 66
+ +G P F D GSDL WV CD C C +Y P +
Sbjct: 117 IDIGTPHVSFLVALDAGSDLLWVPCD--CLQCAPLSASYYSSLDRDLNEYSPSHSSTSKH 174
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGS---- 121
+ CS+ C P C P C Y ++Y + SS G LV D+ L SNG
Sbjct: 175 LSCSHQLCEL-----GPNCNSPKQPCPYSMDYYTENTSSSGLLVEDILHLA-SNGDNALS 228
Query: 122 -VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
P+ GCG Q G L G++GLG IS+ S L + GLIRN C ++
Sbjct: 229 YSVRAPVVIGCGMKQSG-GYLDGVAPDGLMGLGLAEISVPSFLAKAGLIRNSFSMCFDED 287
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--IFDSG 238
G +F GD + P++ + TP L + Y++G E G SC LK + + D+G
Sbjct: 288 DSGRIFFGD-QGPTTQQS-TPFLTLDGNYTTYVVG-VEGFCVGSSC-LKQTSFRALVDTG 343
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV--TEYFKPLA 296
S+ + + VY+ I R + T + C++ L +V + PL
Sbjct: 344 TSFTFLPNGVYERITEEFDRQVNATISSF--NGYPWKYCYKSSSNHLTKVPSVKLIFPLN 401
Query: 297 LSFTNRRNSVRLVVPPEAYLV--ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
SF V+ +++ I G CL I +E ++G IG+ FM V++
Sbjct: 402 NSF---------VIHNPVFMIYGIQGITGFCLAI-QPTEGDIG---TIGQNFMAGYRVVF 448
Query: 355 DNEKQRIGWKPEDC 368
D E ++GW C
Sbjct: 449 DRENMKLGWSHSSC 462
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 88/332 (26%), Positives = 144/332 (43%), Gaps = 47/332 (14%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-----YKPHKNI- 66
+ + L +G PP+ F DTGSD+ WV C A C GC + Q + P ++
Sbjct: 77 VVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSC-ASCNGCPQTSGLQIQLNFFDPGSSVT 135
Query: 67 ---VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ CS+ RC+ + C N+ C Y +YGDG + G V+D+ GS
Sbjct: 136 ASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 124 ----NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
P+ FGC +Q G L D A G+ G G+ +S++SQL G+ V HC+
Sbjct: 196 VPNSTAPVVFGCSTSQT--GDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL 253
Query: 178 -GQN-GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-- 233
G+N G G+L LG+ P+ + +TP++ + HY + + +G++ +
Sbjct: 254 KGENGGGGILVLGEIVEPN--MVFTPLVPSQ---PHYNVNLLSISVNGQALPINPSVFST 308
Query: 234 ------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKAL 285
I D+G + AY + Y V I A P+ +G +
Sbjct: 309 SNGQGTIIDTGTTLAYLSEAAYVPFVEAITN---------AVSQSVRPVVSKGNQCYVIT 359
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
V + F P++L+F + + P+ YL+
Sbjct: 360 TSVGDIFPPVSLNFA---GGASMFLNPQDYLI 388
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 102/367 (27%), Positives = 152/367 (41%), Gaps = 42/367 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC+ C K EK + P ++ + C+ P
Sbjct: 161 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPARSSTYANISCAAP 220
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV--PLTFG 130
C+ L+ C C Y ++YGDG SIG D L S ++ FG
Sbjct: 221 ACSDLYIKG---CS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAIKGFRFG 270
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFL 187
CG + N G + AG+LGLGRG+ S+ V +YG V HC +G G L
Sbjct: 271 CG--ERNEGLYG--EAAGLLGLGRGKTSLPVQAYDKYG---GVFAHCFPARSSGTGYLDF 323
Query: 188 GDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASY 241
G G +P+ S TPML ++ +Y+ G + GK + I DSG
Sbjct: 324 GPGSLPAVSAKLTTPMLVDNGPTFYYV-GLTGIRVGGKLLSIPQSVFTTSGTIVDSGTVI 382
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
Y + S + K AP L C+ F + +V ++L F
Sbjct: 383 TRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCY--DFTGMSEVA--IPTVSLLF-- 436
Query: 302 RRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRI 361
+ L V + + CLG E + + I+G ++ V+YD K+ +
Sbjct: 437 -QGGASLDVHASGIIYAASVSQACLGFAGNKEDD--DVGIVGNTQLKTFGVVYDIGKKVV 493
Query: 362 GWKPEDC 368
G+ P C
Sbjct: 494 GFCPGAC 500
>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
Length = 575
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/371 (26%), Positives = 150/371 (40%), Gaps = 38/371 (10%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH----KNIVPCSNPRC 74
+ VG P F DTGSDL W+ C+ C C K Y P VPC +P C
Sbjct: 123 AEVEVGTPSSKFLVALDTGSDLFWLPCE--CKLCAKNGSTMYSPSLSSTSKTVPCGHPLC 180
Query: 75 AALHWPNPPRCK---HPNDQCDYEIEY--GDGGSSIGALVTDLFPL----RFSNGSVFNV 125
P C + C YE++Y + GSS G LV D+ L G
Sbjct: 181 E-----RPDACATAGKSSSSCPYEVKYVSANTGSS-GVLVEDVLHLVDGGGGGGGKAVQA 234
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGV 184
P+ FGCG Q L G++GLG ++S+ S L GL+ + C ++G G
Sbjct: 235 PIVFGCGQVQTG-AFLRGAAAGGLMGLGLDKVSVPSALASSGLVASDSFSMCFSRDGVGR 293
Query: 185 LFLGDGKVPSSGVAWTPMLQ-NSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
+ GD P A TP++ S +Y + + K+ + + T + DSG S+ Y
Sbjct: 294 INFGDAGSPDQ--AETPLIAAGSLQPSYYNISVGAITVDSKAMAV-EFTAVVDSGTSFTY 350
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
Y + + + + C+R + GQ + P A+S T +
Sbjct: 351 LDDPAYTFLTTNFNSRVSEASETYGSGYEKFEFCYR---LSPGQTSMKRLP-AMSLTTKG 406
Query: 304 NSVRLVVPPEAYLVISGRKN------VCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNE 357
+V + P ++ S CLGI+ S E+ IG+ FM V++D
Sbjct: 407 GAVFPITWPIIPVLASTNGGPYHPIGYCLGIIKTSILST-EDATIGQNFMTGLKVVFDRR 465
Query: 358 KQRIGWKPEDC 368
K +GW+ DC
Sbjct: 466 KSVLGWEKFDC 476
>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 508
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 162/375 (43%), Gaps = 51/375 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ---------YKPHKNI 66
Y+A N+++G P F DTGSDL W+ C+ CT C K+ Y + +
Sbjct: 104 YYA-NVSIGTPGLYFLVALDTGSDLFWLPCE--CTKCPTYLTKRDNGKFWLNHYSSNASS 160
Query: 67 ----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGS 121
VPCS+ C + +C C Y+ Y + SS G LV D+ + +
Sbjct: 161 TSIRVPCSSSLCELAN-----QCSSNKSSCPYQTHYLSENSSSAGYLVQDILHMATDDSQ 215
Query: 122 V--FNVPLTFGCGYNQHNP-GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
+ +V +T GCG Q ++ P+ G++GLG G++S+ S L GL + C G
Sbjct: 216 LKPVDVKVTLGCGKVQTGKFSNVTAPN--GLIGLGMGKVSVPSFLASQGLTTDSFSMCFG 273
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSG 238
G G + GD + G TP N A L Y + +++ + + + LT I DSG
Sbjct: 274 YYGYGRIDFGD--IGPVGQRETPF--NPASLS-YNVTILQIIVTNRPTNVH-LTAIIDSG 327
Query: 239 ASYAYFTSRVYQEIVSLIMRDL-IGTPLKLAPDDKTLPI--CWRGPFKALGQVTEYFKPL 295
AS+ Y T Y S+I ++ L+ D P C+R + F+
Sbjct: 328 ASFTYLTDPFY----SIITENMDAAMELERIKSDSDFPFEYCYRLSLATI------FQQP 377
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
L+FT V+ + +CL I+ ++ N+IG F V+++
Sbjct: 378 NLNFTMEGGRKFDVITSYVSVDTDDGPALCLAIVKSTDI-----NVIGHNFFGGYRVVFN 432
Query: 356 NEKQRIGWKPEDCNT 370
EK +GWK DC++
Sbjct: 433 REKMTLGWKEVDCDS 447
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 162/383 (42%), Gaps = 60/383 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F + L +G PP+ F DTGSDL W QC PC C + P ++ + CS+
Sbjct: 111 FLMKLAIGSPPRSFSAIMDTGSDLIWTQC-KPCQQCFDQSTPIFDPKQSSSFYKISCSSE 169
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C AL P +D C+Y YGD S+ G L + F S ++P L FGC
Sbjct: 170 LCGAL-----PTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGC 224
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD-- 189
G + + G AG++GLGRG +S+VSQL+E + I + L LG
Sbjct: 225 GNDNNGDG---FSQGAGLVGLGRGPLSLVSQLKEQKFAYCLTA--IDDSKPSSLLLGSLA 279
Query: 190 ---GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFD 236
K + TP+++N + Y L + G + T +I D
Sbjct: 280 NITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIID 339
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK---TLPICWRGPFKA----LGQVT 289
SG + Y + + +++ + L DD L +C+ P + ++T
Sbjct: 340 SGTTITYVENSAFTS-----LKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLT 394
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN-VCLGILNGSEAEVGENNIIGEIFMQ 348
+FK L +P E Y++ + +CL I GS + +I G + Q
Sbjct: 395 FHFK-----------GADLELPGENYMIGDSKAGLLCLAI--GSSRGM---SIFGNLQQQ 438
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
+ MV++D +++ + + P C+++
Sbjct: 439 NFMVVHDLQEETLSFLPTQCDSI 461
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 102/395 (25%), Positives = 156/395 (39%), Gaps = 64/395 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V L VG P DTGSD++W+QC PC C + P + +PC++
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQC-VPCKDCVPALRPPFNPRHSSSFFKLPCASS 197
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLF-----------PLRFSNGS 121
C ++ P C C + I+YGDG S G L + P++ SN
Sbjct: 198 TCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSN-- 255
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-- 179
+T GC P +G+LG+ R IS SQL HC
Sbjct: 256 -----ITLGCADIDREGLPTG---ASGLLGMDRRPISFPSQLSSR--YARKFSHCFPDKI 305
Query: 180 ---NGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG-------PAELLYSGKS 225
N G++F G+ + S + +TP++QN SA L +Y +G + L S K+
Sbjct: 306 AHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKN 365
Query: 226 CGLKDLT----LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAP--DDKTLPICWR 279
+ +T I DSG ++ Y +Q + R+ + LA D+ C+
Sbjct: 366 FDIDKVTGSGGTIIDSGTAFTYLKKPAFQA----MRREFLARTSHLAKVDDNSGFTPCYN 421
Query: 280 GPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAE 335
+ + L F R + +V+P + L+ + +CL L +
Sbjct: 422 ITSGTAALESTILPSITLHF---RGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIP 478
Query: 336 VGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
NIIG Q+ V YD EK R+G P C T
Sbjct: 479 F---NIIGNYQQQNLWVEYDLEKLRLGIAPAQCAT 510
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 110/378 (29%), Positives = 154/378 (40%), Gaps = 48/378 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ V+L +G PP+ DTGSDL W QC PC C + P ++ C +
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLSLTSCDST 140
Query: 73 RCAALHWPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L + K PN C Y YGD + G L D F + SV V FGC
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGV--AFGC 198
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G N G +T G+ G GRG +S+ SQL+ G + G VL
Sbjct: 199 GL--FNNGVFKSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTAVNGLKPSTVLLDLPAD 254
Query: 192 VPSSG---VAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDLT--LIFDSGA 239
+ SG V TP++QN A+ LK +G L LK+ T I DSG
Sbjct: 255 LYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLKNGTGGTIIDSGT 314
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGPFKALGQVTEYFKPLA 296
+ +RVY+ ++RD +KL + T P C P +A Y L
Sbjct: 315 AMTSLPTRVYR-----LVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRA----KPYVPKLV 365
Query: 297 LSFTNRRNSVRLVVPPEAYLVI---SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVI 353
L F + +P E Y+ +G +CL I+ G GE IG Q+ V+
Sbjct: 366 LHF----EGATMDLPRENYVFEVEDAGSSILCLAIIEG-----GEVTTIGNFQQQNMHVL 416
Query: 354 YDNEKQRIGWKPEDCNTL 371
YD + ++ + P C+ L
Sbjct: 417 YDLQNSKLSFVPAQCDKL 434
>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 525
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 106/369 (28%), Positives = 151/369 (40%), Gaps = 45/369 (12%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKN----IV 67
+ +G P F DTGSDL WV CD AP G + + Y P K+ V
Sbjct: 114 TTVQLGTPGTKFMVALDTGSDLFWVPCDCSRCAPTEGSPYASDFELSVYSPKKSSTSKTV 173
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDG-GSSIGALVTDLFPLR--FSNGSVFN 124
PC+N CA +C C Y + Y S+ G L+ DL L+ +
Sbjct: 174 PCNNNLCAQRD-----QCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTEHKHSEPIQ 228
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV 184
+TFGCG Q L G+ GLG +IS+ S L GL+ N C +G G
Sbjct: 229 AYITFGCGQVQSG-SFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSDDGVGR 287
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
+ GD S TP N + I + G + D+T +FDSG S++YF
Sbjct: 288 INFGDKG--SLEQEETPFNLNQLHPNYNIT--VTSIRVGTTLIDADITALFDSGTSFSYF 343
Query: 245 TSRVYQEIVSLI---MRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
T +Y ++ + RD P P C+ A +T +S T
Sbjct: 344 TDPIYSKLSASFHAQTRDGRHPPNPRIP----FEYCYNMSPDANASLTP-----GISLTM 394
Query: 302 RRNSVRLVVPPEAYLVISGRKNV--CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
+ V P +VIS + + CL ++ +E NIIG+ FM +++D EK
Sbjct: 395 KGGGPFPVYDP--IIVISTQNELIYCLAVVKSAEL-----NIIGQNFMTGYRIVFDREKL 447
Query: 360 RIGWKPEDC 368
+GWK DC
Sbjct: 448 VLGWKKFDC 456
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 94/375 (25%), Positives = 160/375 (42%), Gaps = 55/375 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
+ ++++ G PP+ DTGSDL W QC PC C + P K + V C++
Sbjct: 80 YLIDISFGSPPQKASVIVDTGSDLIWTQC-LPCETCNAAASVIFDPVKSSTYDTVSCASN 138
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C++L + + C Y+ YGDG S+ GAL +P + FGC
Sbjct: 139 FCSSLPF------QSCTTSCKYDYMYGDGSSTSGAL-----STETVTVGTGTIPNVAFGC 187
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLG 188
G+ N G + AG++GLG+G +S++SQ + +C +G + +G
Sbjct: 188 GHT--NLGSFA--GAAGIVGLGQGPLSLISQASS--ITSKKFSYCLVPLGSTKTSPMLIG 241
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDSG 238
D + GVA+T +L N+A+ Y + SGK+ T I DSG
Sbjct: 242 D-SAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSG 300
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD-KTLPICWRGPFKALGQVTEYFKPLAL 297
+ Y + + +V+ + ++ P A L C F G + +
Sbjct: 301 TTLTYLETGAFNALVAALKAEV---PFPEADGSLYGLDYC----FSTAGVANPTYPTMTF 353
Query: 298 SFTNRRNSVRLVVPPE-AYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
F +PPE ++ + ++CL + A G +I+G I Q+ ++++D
Sbjct: 354 HF----KGADYELPPENVFVALDTGGSICLAM----AASTGF-SIMGNIQQQNHLIVHDL 404
Query: 357 EKQRIGWKPEDCNTL 371
QR+G+K +C T+
Sbjct: 405 VNQRVGFKEANCETI 419
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 158/383 (41%), Gaps = 58/383 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V+L +G PP + DTGSDL W QC APC C P + ++ +PC +
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCAAQPTPYFDVKRSATYRALPCRSS 147
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL-RFSNGSVFNVPLTFGC 131
RCAAL P+ C C Y+ YGD S+ G L + F S+ V ++FGC
Sbjct: 148 RCAALSSPS---CFK--KMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAANISFGC 202
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR---GVLFLG 188
G N G L+ +++G++G GRG +S+VSQL + + R GV
Sbjct: 203 G--SLNAGELA--NSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSPTPSRLYFGVFANL 258
Query: 189 DGKVPSSG--VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------------- 233
+ SSG V TP + N A Y L G S G K L +
Sbjct: 259 NSTNTSSGSPVQSTPFVINPALPNMYFLS-----VKGISLGTKRLPIDPLVFAINDDGTG 313
Query: 234 --IFDSGASYAYFTSRVYQEIVSLIMRDLIGT-PLKLAPD-DKTLPICWRGPFKALGQVT 289
I DSG S + Y+ + R L T PL D D L C++ P VT
Sbjct: 314 GVIIDSGTSITWLQQDAYEA----VRRGLASTIPLPAMNDTDIGLDTCFQWPPPPNVTVT 369
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN-VCLGILNGSEAEVGENNIIGEIFMQ 348
F + + +PPE Y++I+ +CL + A IIG Q
Sbjct: 370 ------VPDFVFHFDGANMTLPPENYMLIASTTGYLCLAM-----APTSVGTIIGNYQQQ 418
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
+ ++YD + + P C+ +
Sbjct: 419 NLHLLYDIANSFLSFVPAPCDII 441
>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
Length = 515
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 151/361 (41%), Gaps = 33/361 (9%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----VPCSN 71
VG P F DTGSDL WV CD AP +G ++ Y+P ++ +PCS+
Sbjct: 102 VGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSRHLPCSH 161
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSV-FNVPLTF 129
C ++ P C +P C Y I+Y + +S G L+ D L + V N +
Sbjct: 162 ELCQSV-----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASVII 216
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD 189
GCG Q L G+L LG IS+ S L GL++N C ++ G +F GD
Sbjct: 217 GCGQKQSG-DYLDGIAPDGLLALGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFFGD 275
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVY 249
VPS TP + L+ Y + + K + DSG S+ VY
Sbjct: 276 QGVPSQQS--TPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPFDVY 333
Query: 250 QEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLV 309
+ + + T ++ +D T C+ + V + L+F + S++ V
Sbjct: 334 KAFTMEFDKQMNAT--RVPYEDTTWKYCYSASPLEMPDVPT----ITLTFAADK-SLQAV 386
Query: 310 VPPEAYLVISGR-KNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
P + G CL +L +E +G II + F+ V++D E ++GW +C
Sbjct: 387 NPILPFNDKQGALAGFCLAVLPSTEP-IG---IIAQNFLVGYHVVFDRESMKLGWYRSEC 442
Query: 369 N 369
Sbjct: 443 R 443
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 105/380 (27%), Positives = 160/380 (42%), Gaps = 52/380 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSN 71
YF V+ +G PP+ F D+GSDL WVQC APC C Y P N VPC +
Sbjct: 65 YF-VDFFLGTPPQKFSLIVDSGSDLLWVQC-APCLQCYAQDTPLYAPSNSSTFNPVPCLS 122
Query: 72 PRCAALHWPNPPRCK-HPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV---PL 127
P C + C H C YE Y D S G + + +V +V +
Sbjct: 123 PECLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFA-------YESATVDDVRIDKV 175
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQ-----NG 181
FGCG + N G + GVLGLG+G +S SQ+ YG N +C+ +
Sbjct: 176 AFGCG--RDNQGSFAA--AGGVLGLGQGPLSFGSQVGYAYG---NKFAYCLVNYLDPTSV 228
Query: 182 RGVLFLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------- 233
L GD + + + +TP++ NS + Y + +++ G+S +
Sbjct: 229 SSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGN 288
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
IFDSG + Y+ Y+ I++ +++ A + L +C VT
Sbjct: 289 GGSIFDSGTTVTYWLPPAYRNILAAFDKNV---RYPRAASVQGLDLCV--------DVTG 337
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDK 350
+P SFT + P + + NV + G + VG N IG + Q+
Sbjct: 338 VDQPSFPSFTIVLGGGAVFQPQQGNYFVDVAPNVQCLAMAGLPSSVGGFNTIGNLLQQNF 397
Query: 351 MVIYDNEKQRIGWKPEDCNT 370
+V YD E+ RIG+ P C++
Sbjct: 398 LVQYDREENRIGFAPAKCSS 417
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 107/383 (27%), Positives = 161/383 (42%), Gaps = 61/383 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
F +++++G P + DTGSDL W QC PC C K + P + VPCS+
Sbjct: 105 FLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPCSSA 163
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C+ L P +C Y YGD S+ G L T+ F L S +P + FGC
Sbjct: 164 SCSDL----PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKS-----KLPGVVFGC 214
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ---NGRGVLFLG 188
G G AG++GLGRG +S+VSQL GL + +C+ L LG
Sbjct: 215 GDTNEGDG---FSQGAGLVGLGRGPLSLVSQL---GLDK--FSYCLTSLDDTNNSPLLLG 266
Query: 189 D------GKVPSSGVAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDL---T 232
+S V TP+++N + LK +G + + ++D
Sbjct: 267 SLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGG 326
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQVT 289
+I DSG S Y + Y+ ++ + L D + L +C+R P K + QV
Sbjct: 327 VIVDSGTSITYLEVQGYRA-----LKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVE 381
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN-VCLGILNGSEAEVGENNIIGEIFMQ 348
L F + L +P E Y+V+ G +CL ++ GS +IIG Q
Sbjct: 382 --VPRLVFHFDGGAD---LDLPAENYMVLDGGSGALCLTVM-GSRGL----SIIGNFQQQ 431
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
+ +YD + + P CN L
Sbjct: 432 NFQFVYDVGHDTLSFAPVQCNKL 454
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 100/395 (25%), Positives = 164/395 (41%), Gaps = 64/395 (16%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKN----I 66
P + + L +G PP + DTGSDL W QC APCT C + P Y P + +
Sbjct: 87 PTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFRQPTPLYNPSSSTTFAV 145
Query: 67 VPCSNPRCAALHWPN------PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG 120
+PC++ PP C C Y + YG G +S+ ++ F +
Sbjct: 146 LPCNSSLSVCAAALAGTGTAPPPGCA-----CTYNVTYGSGWTSVFQ-GSETFTFGSTPA 199
Query: 121 SVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG- 178
VP + FGC + +G++GLGRGR+S+VSQL G+ + +C+
Sbjct: 200 GHARVPGIAFGCSTASSG---FNASSASGLVGLGRGRLSLVSQL---GVPK--FSYCLTP 251
Query: 179 ---QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLY----SGKSCGLKDL 231
N L LG PS+ + T + ++ + P Y +G S G L
Sbjct: 252 YQDTNSTSTLLLG----PSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTAL 307
Query: 232 T---------------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI 276
+ LI DSG + + YQ++ + ++ L+ P D L +
Sbjct: 308 SIPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVV-SLVTLPTTDGSADTGLDL 366
Query: 277 CWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEV 336
C+ P + P S T N +V+P ++Y++ CL + N ++ EV
Sbjct: 367 CFMLP------SSTSAPPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEV 420
Query: 337 GENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
NI+G Q+ ++YD ++ + + P C+ L
Sbjct: 421 ---NILGNYQQQNMHILYDIGQETLSFAPAKCSAL 452
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 107/383 (27%), Positives = 161/383 (42%), Gaps = 61/383 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
F +++++G P + DTGSDL W QC PC C K + P + VPCS+
Sbjct: 74 FLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPCSSA 132
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C+ L P +C Y YGD S+ G L T+ F L S +P + FGC
Sbjct: 133 SCSDL----PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKS-----KLPGVVFGC 183
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ---NGRGVLFLG 188
G G AG++GLGRG +S+VSQL GL + +C+ L LG
Sbjct: 184 GDTNEGDG---FSQGAGLVGLGRGPLSLVSQL---GLDK--FSYCLTSLDDTNNSPLLLG 235
Query: 189 D------GKVPSSGVAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDL---T 232
+S V TP+++N + LK +G + + ++D
Sbjct: 236 SLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGG 295
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQVT 289
+I DSG S Y + Y+ ++ + L D + L +C+R P K + QV
Sbjct: 296 VIVDSGTSITYLEVQGYRA-----LKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVE 350
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN-VCLGILNGSEAEVGENNIIGEIFMQ 348
L F + L +P E Y+V+ G +CL ++ GS +IIG Q
Sbjct: 351 --VPRLVFHFDGGAD---LDLPAENYMVLDGGSGALCLTVM-GSRGL----SIIGNFQQQ 400
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
+ +YD + + P CN L
Sbjct: 401 NFQFVYDVGHDTLSFAPVQCNKL 423
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 150/365 (41%), Gaps = 38/365 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC-TGCTKPPEKQYKPHKN----IVPCSN 71
+ + +G P K + DTGS LTW+QC +PC C + + P + V CS+
Sbjct: 117 YVTRMGLGTPAKPYIMVVDTGSSLTWLQC-SPCRVSCHRQSGPVFDPKTSSSYAAVSCSS 175
Query: 72 PRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
P+C L NP C P++ C Y+ YGD S+G L D + F SV N +
Sbjct: 176 PQCDGLSTATLNPAVCS-PSNVCIYQASYGDSSFSVGYLSKDT--VSFGANSVPN--FYY 230
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD 189
GCG Q N G +AG++GL R ++S++ QL + +C+ +L
Sbjct: 231 GCG--QDNEGLFG--RSAGLMGLARNKLSLLYQLAP--TLGYSFSYCLPSTSSSG-YLSI 283
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASYAYF 244
G G ++TPM+ N+ D Y + + + +GK S L I DSG
Sbjct: 284 GSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPTIIDSGTVITRL 343
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN 304
+ VY + + + G+ K A L C+ G L V + T + +
Sbjct: 344 PTSVYTALSKAVAAAMKGS-TKRAAAYSILDTCFEGQASKLRAVPAVSMAFSGGATLKLS 402
Query: 305 SVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
+ L+V + CL A IIG Q V+YD + RIG+
Sbjct: 403 AGNLLVDVDG-------ATTCLAFAPARSAA-----IIGNTQQQTFSVVYDVKSNRIGFA 450
Query: 365 PEDCN 369
C+
Sbjct: 451 AAGCS 455
>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 524
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 101/366 (27%), Positives = 150/366 (40%), Gaps = 39/366 (10%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----V 67
+ +G P F DTGSDL WV CD AP G T E + Y P + V
Sbjct: 109 TTVKLGTPGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNKKV 168
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNGSVFNVP 126
C+N CA + +C C Y + Y +S G L+ D+ L + + V
Sbjct: 169 TCNNSLCAQRN-----QCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVE 223
Query: 127 --LTFGCGYNQHNPG-PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+TFGCG Q ++ P+ G+ GLG +IS+ S L GL+ + C G +G G
Sbjct: 224 AYVTFGCGQVQSGSFLDIAAPN--GLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVG 281
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
+ GD SS TP N + + I + G + + T +FD+G S+ Y
Sbjct: 282 RISFGDKG--SSDQEETPFNLNPSHPNYNI--TVTRVRVGTTLIDDEFTALFDTGTSFTY 337
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKT-LPICWRGPFKALGQVTEYFKPLALSFTNR 302
+Y + + +PD + C+ A + +LS T +
Sbjct: 338 LVDPMYTTVSESFHSQ--AQDKRHSPDSRIPFEYCYDMSNDANASLIP-----SLSLTMK 390
Query: 303 RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIG 362
NS + P + G CL I+ SE NIIG+ +M V++D EK +
Sbjct: 391 GNSHFTINDPIIVISTEGELVYCLAIVKSSEL-----NIIGQNYMTGYRVVFDREKLVLA 445
Query: 363 WKPEDC 368
WK DC
Sbjct: 446 WKKFDC 451
>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 110/373 (29%), Positives = 157/373 (42%), Gaps = 53/373 (14%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----V 67
+ +G P F DTGSDL WV CD AP G + + + Y P ++ V
Sbjct: 99 TTVELGTPGVKFMVALDTGSDLFWVPCDCSRCAPTHGASYASDFELSIYNPRESSTSKKV 158
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNG--SVFN 124
C+N CA + RC C Y + Y +S G LV D+ L +G
Sbjct: 159 TCNNDMCAQRN-----RCLGTFSSCPYIVSYVSAQTSTSGILVKDVLHLTTEDGGREFVE 213
Query: 125 VPLTFGCGYNQHNPG-PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+TFGCG Q ++ P+ G+ GLG +IS+ S L GLI + C G +G G
Sbjct: 214 AYVTFGCGQVQSGSFLDIAAPN--GLFGLGMEKISVPSVLSREGLIADSFSMCFGHDGIG 271
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
+ GD P TP N A + + + G + T +FDSG S+ Y
Sbjct: 272 RISFGDKGSPDQ--EETPFNVNPAHPTYNVTVTQARV--GTMLIDVEFTALFDSGTSFTY 327
Query: 244 FT----SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI--CWRGPFKALGQVTEYFKPLAL 297
SRV ++ SL RD K P D +P C+ A + ++
Sbjct: 328 MVDPAYSRVSEKFHSL-ARD------KRRPPDPRIPFEYCYDMSPDANASLVP-----SM 375
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNV--CLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
S T + V P +VIS + + CL ++ +E NIIG+ FM V++D
Sbjct: 376 SLTMKGGRHFTVYDP--IIVISTQNEIVYCLAVVKSTEL-----NIIGQNFMTGYRVVFD 428
Query: 356 NEKQRIGWKPEDC 368
EK +GWK DC
Sbjct: 429 REKLVLGWKKFDC 441
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 100/395 (25%), Positives = 164/395 (41%), Gaps = 64/395 (16%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKN----I 66
P + + L +G PP + DTGSDL W QC APCT C + P Y P + +
Sbjct: 27 PTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFRQPTPLYNPSSSTTFAV 85
Query: 67 VPCSNPRCAALHWPN------PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG 120
+PC++ PP C C Y + YG G +S+ ++ F +
Sbjct: 86 LPCNSSLSVCAAALAGTGTAPPPGCA-----CTYNVTYGSGWTSV-FQGSETFTFGSTPA 139
Query: 121 SVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG- 178
VP + FGC + +G++GLGRGR+S+VSQL G+ + +C+
Sbjct: 140 GHARVPGIAFGCSTASSG---FNASSASGLVGLGRGRLSLVSQL---GVPK--FSYCLTP 191
Query: 179 ---QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLY----SGKSCGLKDL 231
N L LG PS+ + T + ++ + P Y +G S G L
Sbjct: 192 YQDTNSTSTLLLG----PSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTAL 247
Query: 232 T---------------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI 276
+ LI DSG + + YQ++ + ++ L+ P D L +
Sbjct: 248 SIPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVS-LVTLPTTDGSADTGLDL 306
Query: 277 CWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEV 336
C+ P + P S T N +V+P ++Y++ CL + N ++ EV
Sbjct: 307 CFMLP------SSTSAPPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEV 360
Query: 337 GENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
NI+G Q+ ++YD ++ + + P C+ L
Sbjct: 361 ---NILGNYQQQNMHILYDIGQETLSFAPAKCSAL 392
>gi|213998812|gb|ACJ60773.1| nucellin [Hordeum euclaston]
Length = 154
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 57/147 (38%), Positives = 81/147 (55%), Gaps = 5/147 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMKES---LFYYSAGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDD 271
+++Y EIVS + L + L+ D
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLEEVKGD 152
>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 96/367 (26%), Positives = 158/367 (43%), Gaps = 40/367 (10%)
Query: 28 KLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ---YKPHKNI----VPCSNPRCAALHWP 80
+ +D DTGS T+V PC GC + E Y +++ + C A L
Sbjct: 49 QTYDLIVDTGSARTYV----PCKGCARCGEHAHGYYDYDRSMEFERLDCGEASDATLCEE 104
Query: 81 NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGP 140
+ +C Y + Y +G SS G +V D +R G++ + L FGC + N
Sbjct: 105 TMKGTCQSDGRCSYVVSYAEGSSSRGYVVRD--RVRLGEGTL-SAMLAFGCEEAETNAIY 161
Query: 141 LSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG--DGKVPSS 195
D G+ G GRG ++ +QL GLI NV C+ G NG GVL LG D +
Sbjct: 162 EQKAD--GLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGANG-GVLTLGRFDFGADAP 218
Query: 196 GVAWTPMLQNSADLK-HYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVS 254
+A TP++ + A+ H + + L L T DSG ++ + V+ +
Sbjct: 219 ALARTPLVADPANPAFHNVRTSSWKLGDSLIEHLNSYTTTLDSGTTFTFVPRSVWVSFKT 278
Query: 255 LIMRDLIGTPLKL--APDDKTLPICWRGPFKAL------GQVTEYFKPLALSFTNRRNSV 306
+ L++ PD + +C+ A+ V+E+F PL +++ V
Sbjct: 279 RLDTQATQAGLEIVAGPDPQYDDVCYGVSAAAMNMTLSQSTVSEWFPPLTIAY---EGGV 335
Query: 307 RLVVPPEAYLVI--SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
L + PE YL + C+GI ++ ++G+I M+D ++ +D R+G
Sbjct: 336 SLTLGPENYLFAHETNSAAFCVGIFANPNNQI----LLGQITMRDTLMEFDVANSRVGMA 391
Query: 365 PEDCNTL 371
P +C L
Sbjct: 392 PANCRRL 398
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 107/383 (27%), Positives = 161/383 (42%), Gaps = 61/383 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
F +++++G P + DTGSDL W QC PC C K + P + VPCS+
Sbjct: 95 FLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPCSSA 153
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C+ L P +C Y YGD S+ G L T+ F L S +P + FGC
Sbjct: 154 SCSDL----PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKS-----KLPGVVFGC 204
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ---NGRGVLFLG 188
G G AG++GLGRG +S+VSQL GL + +C+ L LG
Sbjct: 205 GDTNEGDG---FSQGAGLVGLGRGPLSLVSQL---GLDK--FSYCLTSLDDTNNSPLLLG 256
Query: 189 D------GKVPSSGVAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDL---T 232
+S V TP+++N + LK +G + + ++D
Sbjct: 257 SLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGG 316
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQVT 289
+I DSG S Y + Y+ ++ + L D + L +C+R P K + QV
Sbjct: 317 VIVDSGTSITYLEVQGYRA-----LKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVE 371
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN-VCLGILNGSEAEVGENNIIGEIFMQ 348
L F + L +P E Y+V+ G +CL ++ GS +IIG Q
Sbjct: 372 --VPRLVFHFDGGAD---LDLPAENYMVLDGGSGALCLTVM-GSRGL----SIIGNFQQQ 421
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
+ +YD + + P CN L
Sbjct: 422 NFQFVYDVGHDTLSFAPVQCNKL 444
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 104/391 (26%), Positives = 163/391 (41%), Gaps = 72/391 (18%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F + L++G P + DTGSDL W QC PCT C P + P K+ V CS+
Sbjct: 107 FLMELSIGNPAVKYSAIVDTGSDLIWTQC-KPCTECFDQPTPIFDPEKSSSYSKVGCSSG 165
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL N C D C+Y YGD S+ G L T+ F N S+ + FGCG
Sbjct: 166 LCNALPRSN---CNEDKDACEYLYTYGDYSSTRGLLATETFTFEDEN-SISGIG--FGCG 219
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFLG 188
G +G++GLGRG +S++SQL+E +C+ LF+G
Sbjct: 220 VENEGDG---FSQGSGLVGLGRGPLSLISQLKE-----TKFSYCLTSIEDSEASSSLFIG 271
Query: 189 ---DGKVPSSGVAW-------TPMLQNSADLKHYILGPAELLYSGKSCGLKDLT------ 232
G V +G + +L+N Y L + K ++ T
Sbjct: 272 SLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAED 331
Query: 233 ----LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK---TLPICWRGPFK-- 283
+I DSG + Y ++ ++++ + + L DD L +C++ P
Sbjct: 332 GTGGMIIDSGTTITYLEETAFK-----VLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAK 386
Query: 284 --ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENN 340
A+ ++ +FK L +P E Y+V V CL + GS + +
Sbjct: 387 NIAVPKMIFHFK-----------GADLELPGENYMVADSSTGVLCLAM--GSSNGM---S 430
Query: 341 IIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
I G + Q+ V++D EK+ + + P +C L
Sbjct: 431 IFGNVQQQNFNVLHDLEKETVSFVPTECGKL 461
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 99/379 (26%), Positives = 161/379 (42%), Gaps = 55/379 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +VG PP DTGSD+ W+QC+ PC C + P K+ +PCS+
Sbjct: 87 YLMTYSVGTPPTKIYGIADTGSDIVWLQCE-PCEQCYNQTTPIFNPSKSSSYKNIPCSSK 145
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C H C N C Y+I YGD S G L D L ++GS + P + GC
Sbjct: 146 LC---HSVRDTSCSDQN-SCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKIVIGC 201
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRGVL 185
G + N G ++G++GLG G +S+++QL I +C+ N +L
Sbjct: 202 GTD--NAGTFGGA-SSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSIL 256
Query: 186 FLGDGKVPS-SGVAWTPMLQNS-----ADLKHYILGPAELLYSGKSCGLKDL-TLIFDSG 238
GD V S GV TP+++ L+ + +G + + G S G D +I DSG
Sbjct: 257 SFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSG 316
Query: 239 ASYAYFTSRVY----QEIVSLIMRDLIGTPLKLAPDDKTLPICW--RGPFKALGQVTEYF 292
+ S VY +V L+ D + P ++ +C+ + +T +F
Sbjct: 317 TTLTLIPSDVYTNLESAVVDLVKLDRVDDP------NQQFSLCYSLKSNEYDFPIITVHF 370
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
K + +S+ VP + VC + +I G + Q+ +V
Sbjct: 371 KGADVEL----HSISTFVPITDGI-------VCFAFQPSPQL----GSIFGNLAQQNLLV 415
Query: 353 IYDNEKQRIGWKPEDCNTL 371
YD +++ + +KP DC +
Sbjct: 416 GYDLQQKTVSFKPTDCTKV 434
>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 522
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 152/373 (40%), Gaps = 39/373 (10%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----V 67
+ +G P F DTGSDL WV CD AP G T E + Y P + V
Sbjct: 107 TTVKLGTPGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKISTTNKKV 166
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNGSVFNVP 126
C+N CA + +C C Y + Y +S G L+ D+ L + + V
Sbjct: 167 TCNNSLCAQRN-----QCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVE 221
Query: 127 --LTFGCGYNQHNPG-PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+TFGCG Q ++ P+ G+ GLG +IS+ S L GL+ + C G +G G
Sbjct: 222 AYVTFGCGQVQSGSFLDIAAPN--GLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVG 279
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
+ GD SS TP N + + I + G + + T +FD+G S+ Y
Sbjct: 280 RISFGDKG--SSDQEETPFNLNPSHPNYNI--TVTRVRVGTTLIDDEFTALFDTGTSFTY 335
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKT-LPICWRGPFKALGQVTEYFKPLALSFTNR 302
+Y + + +PD + C+ A + +LS T +
Sbjct: 336 LVDPMYTTVSESFHSQ--AQDKRHSPDSRIPFEYCYDMSNDANASLIP-----SLSLTMK 388
Query: 303 RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIG 362
NS + P + G CL I+ SE NIIG+ +M V++D EK +
Sbjct: 389 GNSHFTINDPIIVISTEGELVYCLAIVKSSEL-----NIIGQNYMTGYRVVFDREKLVLA 443
Query: 363 WKPEDCNTLLSLN 375
WK DC + N
Sbjct: 444 WKKFDCYDIEETN 456
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 153/378 (40%), Gaps = 54/378 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-------YKPHKNI--- 66
+ ++ +G P + DTGS WV C C P E Y P ++
Sbjct: 83 YYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQC--PHESDILRKLTFYDPRSSVSSK 139
Query: 67 -VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR--FSNGSV- 122
V C + C + P C + +C Y Y DGG ++G L TDL + NG
Sbjct: 140 EVKCDDTICTS-----RPPC-NMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQ 193
Query: 123 -FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
+ +TFGCG Q S G++G G + +SQL G + + HC+ N
Sbjct: 194 PTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTN 253
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNS-----ADLKHYILG------PAELLYSGKSCGLK 229
G G+ +G+ P V TP+++N+ +LK + PA + + K+ G
Sbjct: 254 GGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGT- 310
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
DSG++ Y +Y E++ + PD + F LG V
Sbjct: 311 ----FIDSGSTLVYLPEIIYSELILAVFAK--------HPDITMGAMYNFQCFHFLGSVD 358
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
+ F + F N + L V P YL+ C G + + I+G++ + +
Sbjct: 359 DKFPKITFHF---ENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISN 415
Query: 350 KMVIYDNEKQRIGWKPED 367
K+V+YD EKQ IGW +
Sbjct: 416 KVVVYDMEKQAIGWTEHN 433
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 150/365 (41%), Gaps = 38/365 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P ++ V C+ P
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSSTYANVSCAAP 238
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C L + C C Y ++YGDG SIG D L S ++ F G
Sbjct: 239 ACFDL---DTRGCS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 288
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
+ N G + AG+LGLGRG+ S+ V +YG V HC+ +G G L G
Sbjct: 289 CGERNEGLFG--EAAGLLGLGRGKTSLPVQTYDKYG---GVFAHCLPARSSGTGYLDFGP 343
Query: 190 GKVPSSGVAW-TPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAY 243
G ++G TPML ++ +Y+ G + G+ + I DSG
Sbjct: 344 GSPAAAGARLTTPMLTDNGPTFYYV-GMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITR 402
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
Y + S + + K AP L C+ F + QV ++L F +
Sbjct: 403 LPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYD--FTGMSQVA--IPTVSLLF---Q 455
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
L V + + VCLG + + G+ I+G ++ V YD K+ +G+
Sbjct: 456 GGAILDVDASGIMYAASVSQVCLGF--AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 513
Query: 364 KPEDC 368
P C
Sbjct: 514 SPGAC 518
>gi|213998798|gb|ACJ60766.1| nucellin [Hordeum brevisubulatum subsp. violaceum]
Length = 141
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 55/136 (40%), Positives = 78/136 (57%), Gaps = 5/136 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I+ NVIGHC+ G+GVL
Sbjct: 1 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMIKENVIGHCLSSKGKGVL 60
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 61 YVGDFNPPSRGVTWVPMRES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 117
Query: 245 TSRVYQEIVSLIMRDL 260
+++Y EIVS + L
Sbjct: 118 PAQIYNEIVSKVRGTL 133
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 154/378 (40%), Gaps = 54/378 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-------YKPHKNI--- 66
+ ++ +G P + DTGS WV C C P E Y P ++
Sbjct: 59 YYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQC--PHESDILRKLTFYDPRSSVSSK 115
Query: 67 -VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR--FSNGSV- 122
V C + C + PP C + +C Y Y DGG ++G L TDL + NG
Sbjct: 116 EVKCDDTICTS----RPP-C-NMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQ 169
Query: 123 -FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
+ +TFGCG Q S G++G G + +SQL G + + HC+ N
Sbjct: 170 PTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTN 229
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNS-----ADLKHYILG------PAELLYSGKSCGLK 229
G G+ +G+ P V TP+++N+ +LK + PA + + K+ G
Sbjct: 230 GGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGT- 286
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
DSG++ Y +Y E++ + PD + F LG V
Sbjct: 287 ----FIDSGSTLVYLPEIIYSELILAVFAK--------HPDITMGAMYNFQCFHFLGSVD 334
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
+ F + F N + L V P YL+ C G + + I+G++ + +
Sbjct: 335 DKFPKITFHF---ENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISN 391
Query: 350 KMVIYDNEKQRIGWKPED 367
K+V+YD EKQ IGW +
Sbjct: 392 KVVVYDMEKQAIGWTEHN 409
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 107/377 (28%), Positives = 156/377 (41%), Gaps = 46/377 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V+L +G PP + DTGSDL W QC APC C P + K+ +PC +
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCADQPTPYFDVKKSATYRALPCRSS 147
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFGC 131
RCA+L + P C C Y+ YGD S+ G L + F +N + V + FGC
Sbjct: 148 RCASL---SSPSCFK--KMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGC 202
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR---GVLFLG 188
G N G L+ +++G++G GRG +S+VSQL + + R GV
Sbjct: 203 G--SLNAGDLA--NSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLYFGVYANL 258
Query: 189 DGKVPSSG--VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFD 236
SSG V TP + N A Y L + K + L +I D
Sbjct: 259 SSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIID 318
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGT-PLKLAPD-DKTLPICWRGPFKALGQVTEYFKP 294
SG S + Y+ + R L+ PL D D L C++ P VT
Sbjct: 319 SGTSITWLQQDAYEA----VRRGLVSAIPLPAMNDTDIGLDTCFQWPPPP--NVTVTVPD 372
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
L F +S + + PE Y++I+ G L A G IIG Q+ ++Y
Sbjct: 373 LVFHF----DSANMTLLPENYMLIASTT----GYLCLVMAPTGVGTIIGNYQQQNLHLLY 424
Query: 355 DNEKQRIGWKPEDCNTL 371
D + + P C+ +
Sbjct: 425 DIGNSFLSFVPAPCDII 441
>gi|213998826|gb|ACJ60780.1| nucellin [Hordeum intercedens]
Length = 148
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 80/142 (56%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 VAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMKES---LFYYSAGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS + L + L+
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLE 147
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 111/380 (29%), Positives = 160/380 (42%), Gaps = 66/380 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHK----NIVPCSN 71
F V + G P + + FDTGSD++W+QC PC+G C K + + P K ++VPC +
Sbjct: 135 FVVTVGFGTPAQTYTVIFDTGSDVSWIQC-LPCSGHCYKQHDPIFDPTKSATYSVVPCGH 193
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
P+CAA + +C N C Y++EYGDG SS G L + L S +P FG
Sbjct: 194 PQCAAA---DGSKCS--NGTCLYKVEYGDGSSSAGVLSHETLSLT----STRALPGFAFG 244
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG Q N G D G++GLGRG++S+ SQ +C+ + +L G
Sbjct: 245 CG--QTNLGDFG--DVDGLIGLGRGQLSLSSQAA--ASFGGTFSYCLPSDNTTHGYLTIG 298
Query: 191 -KVPSSG--VAWTPMLQN------------SADLKHYILGPAELLYSGKSCGLKDLTLIF 235
P+S V +T M+Q S D+ YIL L++ D
Sbjct: 299 PTTPASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFT-------DDGTFL 351
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG Y Y + + T K AP C+ GQ + F P
Sbjct: 352 DSGTILTYLPPEAYTALRDRFKFTM--TQYKPAPAYDPFDTCY----DFTGQ-SAIFIP- 403
Query: 296 ALSFTNRRNSVR-------LVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
A+SF SV L+ P + I CLG + A I+G + +
Sbjct: 404 AVSFKFSDGSVFDLSFFGILIFPDDTAPAIG-----CLGFVARPSAM--PFTIVGNMQQR 456
Query: 349 DKMVIYDNEKQRIGWKPEDC 368
+ VIYD ++IG+ C
Sbjct: 457 NTEVIYDVAAEKIGFASASC 476
>gi|213998842|gb|ACJ60788.1| nucellin [Hordeum cordobense]
Length = 154
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 81/142 (57%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G ++FDSG++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEVVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS + L + L+
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLE 147
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 103/384 (26%), Positives = 163/384 (42%), Gaps = 49/384 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPEKQYKPHKNIVP---CS 70
YF V+L +G+PP+ DTGSDL WV+C A C C+ P + H + C
Sbjct: 83 YF-VDLRIGQPPQSLLLIADTGSDLVWVKCSA-CRNCSHHSPATVFFPRHSSTFSPAHCY 140
Query: 71 NPRCAALHWP-NPPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV-P 126
+P C + P PRC H + C YE Y DG + G + L+ S+G +
Sbjct: 141 DPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKS 200
Query: 127 LTFGCGY--NQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQNG-- 181
+ FGCG+ + + S GV+GLGRG IS SQL R +G N +C+
Sbjct: 201 VAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFG---NKFSYCLMDYTLS 257
Query: 182 ---RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK--------- 229
L +GDG S + +TP+L N Y + + +G +
Sbjct: 258 PPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDS 317
Query: 230 -DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
+ + DSG + A+ Y+ +++ + + +KL D+ P F V
Sbjct: 318 GNGGTVMDSGTTLAFLADPAYRLVIAAVKQR-----IKLPNADELTP-----GFDLCVNV 367
Query: 289 TEYFKPLA----LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGE 344
+ KP L F +V V PP Y + + + CL I + +VG ++IG
Sbjct: 368 SGVTKPEKILPRLKFEFSGGAV-FVPPPRNYFIETEEQIQCLAI-QSVDPKVG-FSVIGN 424
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDC 368
+ Q + +D ++ R+G+ C
Sbjct: 425 LMQQGFLFEFDRDRSRLGFSRRGC 448
>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 523
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 95/364 (26%), Positives = 142/364 (39%), Gaps = 33/364 (9%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR------- 73
+ +G P F D GSDL WV CD C C Y + NP
Sbjct: 107 IDLGTPSVPFLVALDVGSDLLWVPCD--CIQCAPLSANYYSVLDRDLSEYNPALSSTSKH 164
Query: 74 --CAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPL----RFSNGSVFNVP 126
C CK ND C Y+ +Y D S+ G ++ D L + S+
Sbjct: 165 LFCGHQLCAWSTTCKSANDPCTYKRDYYSDNTSTSGFMIEDKLQLTSFSKHGTHSLLQAS 224
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG-VL 185
+ FGCG Q L GV+GLG G IS+ + L + GL+RN C NG G +L
Sbjct: 225 VVFGCGRKQSG-SYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSGRIL 283
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD-LTLIFDSGASYAYF 244
F DG + P+ + Y +G E G SC + + DSG+S+ Y
Sbjct: 284 FGDDGPATQQTTQFLPLF---GEFAAYFIG-VESFCVGSSCLQRSGFQALVDSGSSFTYL 339
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN 304
+ VY++IV + + ++ + LP W + V+ + L F N
Sbjct: 340 PAEVYKKIVFEFDKQVKVNATRIVL--RELP--WNYCYNISTLVSFNIPSMQLVFP--LN 393
Query: 305 SVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
+ + P G K CL + E + +IG+ M +++D E ++GW
Sbjct: 394 QIFIHDPVYVLPANQGYKVFCLTLEETDE----DYGVIGQNLMVGYRMVFDRENLKLGWS 449
Query: 365 PEDC 368
C
Sbjct: 450 KSKC 453
>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 498
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 105/369 (28%), Positives = 153/369 (41%), Gaps = 41/369 (11%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
F ++A+ +TVG P + F DTGSDL W+ C C GCT P +P +
Sbjct: 107 FLHYAL-VTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPATAASGSATFYIPGMSST 163
Query: 74 CAALHWPNPPRCKHPND-----QCDYEIEYGDGG-SSIGALVTDLFPLRFSNG--SVFNV 125
A+ N C + QC Y++ Y G SS G LV D+ L N +
Sbjct: 164 SKAVPC-NSNFCDLQKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKA 222
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
+ GCG Q L G+ GLG +S+ S L + GL N C G++G G +
Sbjct: 223 QIMLGCGQTQTG-SFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRI 281
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIFDSGASY 241
GD + SS TP+ N + I SG + G K D IFD+G S+
Sbjct: 282 SFGDQE--SSDQEETPLDINRQHPTYAI------TISGITVGNKPTDMDFITIFDTGTSF 333
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA-LSFT 300
Y Y I + + A D R PF+ ++E P+ +
Sbjct: 334 TYLADPAYTYITQSFHAQVQAN--RHAADS-------RIPFEYCYDLSEARFPIPDIILR 384
Query: 301 NRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
S+ V+ P + I + V CL I+ + NIIG+ FM V++D E++
Sbjct: 385 TVTGSMFPVIDPGQVISIQEHEYVYCLAIVKSMKL-----NIIGQNFMTGLRVVFDRERK 439
Query: 360 RIGWKPEDC 368
+GWK +C
Sbjct: 440 ILGWKKFNC 448
>gi|449451627|ref|XP_004143563.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 532
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/380 (24%), Positives = 151/380 (39%), Gaps = 59/380 (15%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCT----GCTKPPEKQYKPHKNI----VP 68
+ +G P F D GSDL WV C+ AP + G +Y+P + +
Sbjct: 107 IDIGTPSVSFLVALDAGSDLLWVPCNCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHIS 166
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRF----SNGSVF 123
CS+ C + C+ P C Y I+Y + SS G L+ D+ L S+
Sbjct: 167 CSHNLCDSGQ-----SCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCENSSNCTI 221
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
P+ GCG Q G LS G+ GLG G IS++S L + L++N C ++G G
Sbjct: 222 QAPVILGCGMKQSG-GYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSG 280
Query: 184 VLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
+F GD G ++ P+ + YI+G + DSG S+
Sbjct: 281 RIFFGDEGPASQQTTSFVPL---DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFT 337
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG-PFKALGQVTEYFKPLALSFTN 301
Y Y+ IV + L + T + ++G P+K +++ P
Sbjct: 338 YLPEEAYENIVIEFDKRL----------NTTSAVSFKGYPWKYCYKISADAMP------- 380
Query: 302 RRNSVRLVVPPEAYLVI----------SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
+ SV L+ P V+ G C IL G+ I+G+ +M
Sbjct: 381 KVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPAD----GDIGILGQNYMTGYR 436
Query: 352 VIYDNEKQRIGWKPEDCNTL 371
+++D + ++GW +C L
Sbjct: 437 MVFDRDNLKLGWSHANCQDL 456
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 93/369 (25%), Positives = 155/369 (42%), Gaps = 38/369 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSN 71
Y N T+G PP+ D +L W QC + C C K + P+ + PC
Sbjct: 53 YNVANFTIGTPPQAASAFIDLTGELVWTQC-SQCIHCFKQDLPVFVPNASSTFKPEPCGT 111
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C ++ P K +D C Y+ G GG ++G + TD F + G+ L FGC
Sbjct: 112 DVCKSIPTP-----KCASDVCAYDGVTGLGGHTVGIVATDTFAI----GTAAPASLGFGC 162
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
P +G +GLGR S+V+Q++ + H G+N R LFLG
Sbjct: 163 VVASDIDTMGGP---SGFIGLGRTPWSLVAQMKLTRFSYCLAPHDTGKNSR--LFLGASA 217
Query: 192 VPSSGVAWTPMLQNSAD--LKHYILGPAELLYSGKSCGL----KDLTLIFDSGASYAYFT 245
+ G AWTP ++ S + + Y E + +G + ++ L+ + +
Sbjct: 218 KLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRNTVLVQTAVVRVSLLV 277
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
VYQE +M + P P +C+ P + + L FT + +
Sbjct: 278 DSVYQEFKKAVMASVGAAPTA-TPVGAPFEVCF--PKAGVSGAPD------LVFTFQAGA 328
Query: 306 VRLVVPPEAYLVISGRKNVCLGILNGSEAEVGE---NNIIGEIFMQDKMVIYDNEKQRIG 362
L VPP YL G VCL +++ + + NI+G ++ +++D +K +
Sbjct: 329 A-LTVPPANYLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLS 387
Query: 363 WKPEDCNTL 371
++P DC++L
Sbjct: 388 FEPADCSSL 396
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 153/378 (40%), Gaps = 54/378 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-------YKPHKNI--- 66
+ ++ +G P + DTGS WV C C P E Y P ++
Sbjct: 59 YYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQC--PHESDILRKLTFYDPRSSVSSK 115
Query: 67 -VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR--FSNGSV- 122
V C + C + P C + +C Y Y DGG ++G L TDL + NG
Sbjct: 116 EVKCDDTICTS-----RPPC-NMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQ 169
Query: 123 -FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
+ +TFGCG Q S G++G G + +SQL G + + HC+ N
Sbjct: 170 PTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTN 229
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNS-----ADLKHYILG------PAELLYSGKSCGLK 229
G G+ +G+ P V TP+++N+ +LK + PA + + K+ G
Sbjct: 230 GGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGT- 286
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
DSG++ Y +Y E++ + PD + F LG V
Sbjct: 287 ----FIDSGSTLVYLPEIIYSELILAVFAK--------HPDITMGAMYNFQCFHFLGSVD 334
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
+ F + F N + L V P YL+ C G + + I+G++ + +
Sbjct: 335 DKFPKITFHF---ENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISN 391
Query: 350 KMVIYDNEKQRIGWKPED 367
K+V+YD EKQ IGW +
Sbjct: 392 KVVVYDMEKQAIGWTEHN 409
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 113/409 (27%), Positives = 176/409 (43%), Gaps = 94/409 (22%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKN--- 65
+ + L++G PP + DTGSDL W QC APC + Q Y P +
Sbjct: 87 YIMTLSIGTPPLSYRAIADTGSDLIWTQC-APCGDTVTDTDNQCFKQSGCLYNPSSSTTF 145
Query: 66 -IVPCSNP--RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS- 121
++PC++P CAA+ P+PP P C Y YG G + A V + F + S
Sbjct: 146 GVLPCNSPLSMCAAMAGPSPP----PGCACMYNQTYGTGWT---AGVQSVETFTFGSSST 198
Query: 122 --VFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
VP + FGC N +AG++GLGRG +S+VSQL +C+
Sbjct: 199 PPAVRVPNIAFGCSNASSNDW----NGSAGLVGLGRGSMSLVSQLGA-----GAFSYCLT 249
Query: 179 ----QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKH--YILGPAE--------LLYSGK 224
N L LG PS+ A L+ + ++ ++ GP++ L +G
Sbjct: 250 PFQDANSTSTLLLG----PSAAAA----LKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGI 301
Query: 225 SCGLKDLT---------------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA- 268
S G L LI DSG + YQ++ + + R L+ T L LA
Sbjct: 302 SVGETALAIPPDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAV-RSLLVTRLPLAH 360
Query: 269 -PDDKT-LPICW----RGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRK 322
PD T L +C+ P A+ +T +F+ +V+P E Y+++ G
Sbjct: 361 GPDHSTGLDLCFALKASTPPPAMPSMTLHFE----------GGADMVLPVENYMIL-GSG 409
Query: 323 NVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
CL + N + VG +++G Q+ V+YD K+ + + P C++L
Sbjct: 410 VWCLAMRNQT---VGAMSMVGNYQQQNIHVLYDVRKETLSFAPAVCSSL 455
>gi|213998830|gb|ACJ60782.1| nucellin [Hordeum pusillum]
Length = 147
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 81/142 (57%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 2 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 61
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 62 YVGDFNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 118
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS ++ L + L+
Sbjct: 119 PAQIYNEIVSKVIGTLSESSLE 140
>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
Group]
Length = 476
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 158/379 (41%), Gaps = 57/379 (15%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPE------KQYKPHKN 65
F ++AV + +G P F DTGSDL WV CD C C + P Y P ++
Sbjct: 60 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQS 116
Query: 66 I----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRF--S 118
VPCS+ C + C+ ++ C Y I+Y D SS G LV D+ L +
Sbjct: 117 TTSRKVPCSSNLCDLQN-----ACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSA 171
Query: 119 NGSVFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+ P+ FGCG Q G +P G+LGLG S+ S L GL N C
Sbjct: 172 QSKIVTAPIMFGCGQVQTGSFLGSAAP---NGLLGLGMDSKSVPSLLASKGLAANSFSMC 228
Query: 177 IGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPA-ELLYSGKSCGLK----DL 231
G +G G + GD SS TP L Y P + +G + G K +
Sbjct: 229 FGDDGHGRINFGD--TGSSDQKETP-------LNVYKQNPYYNITITGITVGSKSISTEF 279
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
+ I DSG S+ + +Y +I S + + L D ++P + A G V
Sbjct: 280 SAIVDSGTSFTALSDPMYTQITSSFDAQIRSSRNML---DSSMPFEFCYSVSANGIVHP- 335
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV--CLGILNGSEAEVGENNIIGEIFMQD 349
+S T + S+ V P + + V CL I+ N+IGE FM
Sbjct: 336 ----NVSLTAKGGSIFPVNDPIITITDNAFNPVGYCLAIMKSEGV-----NLIGENFMSG 386
Query: 350 KMVIYDNEKQRIGWKPEDC 368
V++D E+ +GWK +C
Sbjct: 387 LKVVFDRERMVLGWKNFNC 405
>gi|213998836|gb|ACJ60785.1| nucellin [Hordeum bogdanii]
Length = 154
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 80/142 (56%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK-DLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMRES---LFYYSPGLAELLIDNQPIGGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS + L + L+
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLE 147
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 103/386 (26%), Positives = 160/386 (41%), Gaps = 66/386 (17%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRC 74
+ L++G P + DTGSDL W QC PCT C P + P K+ V CS+ C
Sbjct: 1 MELSIGNPAVKYSAIVDTGSDLIWTQC-KPCTECFDQPTPIFDPEKSSSYSKVGCSSGLC 59
Query: 75 AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYN 134
AL N C D C+Y YGD S+ G L T+ F N S+ + FGCG
Sbjct: 60 NALPRSN---CNEDKDACEYLYTYGDYSSTRGLLATETFTFEDEN-SISGIG--FGCGVE 113
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFLG-- 188
G +G++GLGRG +S++SQL+E +C+ LF+G
Sbjct: 114 NEGDG---FSQGSGLVGLGRGPLSLISQLKE-----TKFSYCLTSIEDSEASSSLFIGSL 165
Query: 189 -DGKVPSSGVAW-------TPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-------- 232
G V +G + +L+N Y L + K ++ T
Sbjct: 166 ASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGT 225
Query: 233 --LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK---TLPICWRGPFKALGQ 287
+I DSG + Y ++ ++++ + + L DD L +C++ P A
Sbjct: 226 GGMIIDSGTTITYLEETAFK-----VLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAA--- 277
Query: 288 VTEYFKPLAL-SFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEI 345
K +A+ L +P E Y+V V CL + GS + +I G +
Sbjct: 278 -----KNIAVPKMIFHFKGADLELPGENYMVADSSTGVLCLAM--GSSNGM---SIFGNV 327
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTL 371
Q+ V++D EK+ + + P +C L
Sbjct: 328 QQQNFNVLHDLEKETVSFVPTECGKL 353
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 108/373 (28%), Positives = 158/373 (42%), Gaps = 49/373 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + +G P KL DTGSD+ W+QC +PC C K + + P + + CS
Sbjct: 14 YF-VRVGIGSPTKLQYLVMDTGSDVPWIQC-SPCKSCYKQNDAVFDPRASSSFRRLSCST 71
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P+C L + C +++C Y++ YGDG ++G L +D F + S P+ FGC
Sbjct: 72 PQCKLL---DVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRGRTS----PVVFGC 124
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G++ N G AG+LGLG G++S SQL ++ G L GD
Sbjct: 125 GHD--NEGLF--VGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLFGDSA 180
Query: 192 VPSSG-VAWTPMLQN-------SADLKHYILGPAELLYSGKSCGLKDLT----LIFDSGA 239
+P+S A+T +L+N A L +G L + L T +I DSG
Sbjct: 181 LPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGT 240
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTP---LKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
S + Y +MRD + L A D C+ F AL VT ++
Sbjct: 241 SVTRLPTYAYT-----VMRDAFRSATQKLPRAADFSLFDTCY--DFSALTSVT--IPTVS 291
Query: 297 LSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
F + +PP YLV + C S + +IIG I Q V D
Sbjct: 292 FHF---EGGASVQLPPSNYLVPVDTSGTFCFAFSKTSL----DLSIIGNIQQQTMRVAID 344
Query: 356 NEKQRIGWKPEDC 368
+ R+G+ P C
Sbjct: 345 LDSSRVGFAPRQC 357
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 99/379 (26%), Positives = 160/379 (42%), Gaps = 55/379 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +VG PP DTGSD+ W+QC+ PC C + P K+ +PC +
Sbjct: 87 YLMTYSVGTPPTKIYGIADTGSDIVWLQCE-PCEQCYNQTTPIFNPSKSSSYKNIPCLSK 145
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
C H C N C Y+I YGD S G L D L ++GS + P T GC
Sbjct: 146 LC---HSVRDTSCSDQN-SCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKTVIGC 201
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRGVL 185
G + N G ++G++GLG G +S+++QL I +C+ N +L
Sbjct: 202 GTD--NAGTFGGA-SSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSIL 256
Query: 186 FLGDGKVPS-SGVAWTPMLQNS-----ADLKHYILGPAELLYSGKSCGLKDL-TLIFDSG 238
GD V S GV TP+++ L+ + +G + + G S G D +I DSG
Sbjct: 257 SFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSG 316
Query: 239 ASYAYFTSRVY----QEIVSLIMRDLIGTPLKLAPDDKTLPICW--RGPFKALGQVTEYF 292
+ S VY +V L+ D + P ++ +C+ + +T +F
Sbjct: 317 TTLTLIPSDVYTNLESAVVDLVKLDRVDDP------NQQFSLCYSLKSNEYDFPIITAHF 370
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
K + +S+ VP + VC + +I G + Q+ +V
Sbjct: 371 KGADIEL----HSISTFVPITDGI-------VCFAFQPSPQL----GSIFGNLAQQNLLV 415
Query: 353 IYDNEKQRIGWKPEDCNTL 371
YD +++ + +KP DC +
Sbjct: 416 GYDLQQKTVSFKPTDCTKV 434
>gi|213998834|gb|ACJ60784.1| nucellin [Hordeum bulbosum]
Length = 154
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/136 (38%), Positives = 78/136 (57%), Gaps = 5/136 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QLR + +I+ NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLRGHKMIKENVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD P+ GV W PM ++ L +Y G AE+ + G +FDSG++Y +
Sbjct: 69 YVGDFNPPTRGVTWVPMRES---LFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDL 260
+++Y EIVS + L
Sbjct: 126 PAQIYSEIVSKVRGTL 141
>gi|297807039|ref|XP_002871403.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317240|gb|EFH47662.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 529
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 98/371 (26%), Positives = 149/371 (40%), Gaps = 44/371 (11%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGC-----TKPPEKQYKPHK----NIV 67
+ +G P F DTGSDL W+ C+ AP T +Y P +
Sbjct: 104 IDIGTPSVSFLVALDTGSDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSSSKVF 163
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPL------RFSNG 120
CS+ C + C P +QC Y ++Y G SS G LV D+ L R NG
Sbjct: 164 LCSHKLCGS-----ASDCDSPKEQCTYTVKYLSGNTSSSGLLVEDILHLTYNTNNRLMNG 218
Query: 121 SV-FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
S + GCG Q L G++GLG IS+ S L + GL+RN C +
Sbjct: 219 SSSVKARVVVGCGKKQSG-DYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDE 277
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSG 238
G ++ GD A L+N++ YI+G E G SC T DSG
Sbjct: 278 EDSGRIYFGDMGPSIQQSAPFLQLENNSG---YIVG-VEACCIGNSCLKQTSFTTFIDSG 333
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
S+ Y +Y+++ I R + T + W +++ V + L
Sbjct: 334 QSFTYLPEEIYRKVALEIDRHINATSKSFE------GVSWEYCYES--SVEPKVPAIKLK 385
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
F++ N+ + P + G CL I + +G IG+ +M+ +++D E
Sbjct: 386 FSH-NNTFVIHKPLFVFQQSQGLVQFCLPISPSEQEGIGS---IGQNYMRGYRMVFDREN 441
Query: 359 QRIGWKPEDCN 369
++GW P C
Sbjct: 442 MKLGWSPSKCQ 452
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/394 (26%), Positives = 165/394 (41%), Gaps = 58/394 (14%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
++ F++ L +G K DTGS+ VQC + P Q VPC +
Sbjct: 97 YALFSMQLGIGSLQKNLSAIIDTGSEAVLVQCGSRSRPVFDPAASQSYRQ---VPCISQL 153
Query: 74 CAALHWP----NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP--- 126
C A+ + C + + C Y + YGD +S G D+ L +N S V
Sbjct: 154 CLAVQQQTSNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRD 213
Query: 127 LTFGCGYNQHNP-GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN----- 180
+ FGC H+P G L + G++G RG +S+ SQL++ L + +C
Sbjct: 214 VAFGCA---HSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDR-LGGSKFSYCFPSQPWQPR 269
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQN---SADLKHYILGPAELLYSGKSCGL--------- 228
GV+FLGD + S V +TP+L N A + Y +G + GK+ +
Sbjct: 270 ATGVIFLGDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDP 329
Query: 229 --KDLTLIFDSGASYAYFTSRVYQEIVSLI-------MRDLIGTPLKLAPDDKTLPICWR 279
D + DSG ++ Y + +R +G DD C+
Sbjct: 330 STGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGF--DD-----CYN 382
Query: 280 GPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKN---VCLGILNGSEAE 335
+ G + LS +N+VRL + E V +S N VCL IL+ ++
Sbjct: 383 ---ISAGSSLPGVPEVRLSL---QNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSG 436
Query: 336 VGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
G+ N++G + +V YDNE+ R+G++ DC+
Sbjct: 437 FGKINVLGNYQQSNYLVEYDNERSRVGFERADCS 470
>gi|213998804|gb|ACJ60769.1| nucellin [Hordeum muticum]
gi|213998808|gb|ACJ60771.1| nucellin [Hordeum erectifolium]
gi|213998820|gb|ACJ60777.1| nucellin [Hordeum patagonicum subsp. mustersii]
gi|213998822|gb|ACJ60778.1| nucellin [Hordeum patagonicum subsp. santacrucense]
gi|333069937|gb|AEF13570.1| nucellin, partial [Hordeum pubiflorum]
Length = 154
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 80/142 (56%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS + L + L+
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLE 147
>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
Length = 490
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 158/379 (41%), Gaps = 57/379 (15%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPE------KQYKPHKN 65
F ++AV + +G P F DTGSDL WV CD C C + P Y P ++
Sbjct: 74 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQS 130
Query: 66 I----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRF--S 118
VPCS+ C + C+ ++ C Y I+Y D SS G LV D+ L +
Sbjct: 131 TTSRKVPCSSNLCDLQN-----ACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSA 185
Query: 119 NGSVFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+ P+ FGCG Q G +P G+LGLG S+ S L GL N C
Sbjct: 186 QSKIVTAPIMFGCGQVQTGSFLGSAAP---NGLLGLGMDSKSVPSLLASKGLAANSFSMC 242
Query: 177 IGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPA-ELLYSGKSCGLK----DL 231
G +G G + GD SS TP L Y P + +G + G K +
Sbjct: 243 FGDDGHGRINFGD--TGSSDQKETP-------LNVYKQNPYYNITITGITVGSKSISTEF 293
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
+ I DSG S+ + +Y +I S + + L D ++P + A G V
Sbjct: 294 SAIVDSGTSFTALSDPMYTQITSSFDAQIRSSRNML---DSSMPFEFCYSVSANGIVHP- 349
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV--CLGILNGSEAEVGENNIIGEIFMQD 349
+S T + S+ V P + + V CL I+ N+IGE FM
Sbjct: 350 ----NVSLTAKGGSIFPVNDPIITITDNAFNPVGYCLAIMKSEGV-----NLIGENFMSG 400
Query: 350 KMVIYDNEKQRIGWKPEDC 368
V++D E+ +GWK +C
Sbjct: 401 LKVVFDRERMVLGWKNFNC 419
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 101/383 (26%), Positives = 161/383 (42%), Gaps = 60/383 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
F +++++G P + DTGSDL W QC PC C + P + +PCS+
Sbjct: 118 FLMDMSIGTPALAYAAIVDTGSDLVWTQCK-PCVECFNQSTPVFDPSSSSTYSTLPCSSS 176
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C+ L C C Y YGD S+ G L + F L + +P + FGC
Sbjct: 177 LCSDLPTST---CTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKT-----KLPGVAFGC 228
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ---NGRGVLFLG 188
G G AG++GLGRG +S+VSQL GL + +C+ + L LG
Sbjct: 229 GDTNEGDG---FTQGAGLVGLGRGPLSLVSQL---GLGK--FSYCLTSLDDTSKSPLLLG 280
Query: 189 D------GKVPSSGVAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDL---T 232
++ + TP+++N + LK +G + G + ++D
Sbjct: 281 SLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGG 340
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQVT 289
+I DSG S Y + Y+ ++ +KL D + L +C++ P + V
Sbjct: 341 VIVDSGTSITYLELQGYRP-----LKKAFAAQMKLPVADGSAVGLDLCFKAPASGVDDVE 395
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVI-SGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
L L F + L +P E Y+V+ S +CL ++ GS +IIG Q
Sbjct: 396 --VPKLVLHFDGGAD---LDLPAENYMVLDSASGALCLTVM-GSRGL----SIIGNFQQQ 445
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
+ +YD +K + + P C L
Sbjct: 446 NIQFVYDVDKDTLSFAPVQCAKL 468
>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 406
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 98/354 (27%), Positives = 150/354 (42%), Gaps = 57/354 (16%)
Query: 51 GCTKPPEKQ--------YKPH----KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY 98
GCT P+K Y P+ N VPC + C + CK + C Y I Y
Sbjct: 32 GCTACPKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQ-DMSCPYSITY 90
Query: 99 GDGGSSIGALVTDLFPLRFSNGSVFNVP----LTFGCGYNQHNPGPLSP-PDTA--GVLG 151
GDG ++ G+ V D +G++ P + FGCG Q G LS D A G++G
Sbjct: 91 GDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQ--SGSLSSNSDEALDGIIG 148
Query: 152 LGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKH 211
G+ S++SQL G ++ + HC+ + G +F G+V TP++ A H
Sbjct: 149 FGQANSSVLSQLAASGKVKRIFSHCLDSHHGGGIF-SIGQVMEPKFNTTPLVPRMA---H 204
Query: 212 Y-------------ILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMR 258
Y IL P L SG G I DSG + AY +Y +++ ++
Sbjct: 205 YNVILKDMDVDGEPILLPLYLFDSGSGRG-----TIIDSGTTLAYLPLSIYNQLLPKVLG 259
Query: 259 DLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVI 318
G L + D T F ++ E F + F + L V P YL +
Sbjct: 260 RQPGLKLMIVEDQFTC-------FHYSDKLDEGFPVVKFHF----EGLSLTVHPHDYLFL 308
Query: 319 SGRKNVCLGILNGS-EAEVGENNI-IGEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
C+G S + + G + I IG++ + +K+V+YD E IGW +C++
Sbjct: 309 YKEDIYCIGWQKSSTQTKEGRDLILIGDLVLSNKLVVYDLENMVIGWTNFNCSS 362
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 159/372 (42%), Gaps = 35/372 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK-----PPEKQYKPHKNIVPCSN 71
+ + +G P + DTGSD+ WV+C +PC C PP Y + +
Sbjct: 83 YYTEIGLGNPVQKLKVIVDTGSDILWVKC-SPCRSCLSKQDIIPPLSIYNLSASSTSSVS 141
Query: 72 PRCAALHWPNPPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
L C N C Y I Y D +SIGA V D G+ + F
Sbjct: 142 SCSDPLCTGEQAVCSRSGSNSACAYGISYQDKSTSIGAYVKDDMHYVLQGGNATTSHIFF 201
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFL 187
GC N P G++G G+ ++ +Q+ + V HC+G ++G G+L
Sbjct: 202 GCAINITGSWP-----ADGIMGFGQISKTVPNQIATQRNMSRVFSHCLGGEKHGGGILEF 256
Query: 188 GDGKVPSSGVAWTPMLQN---------SADLKHYILGPAELLYSGKSCGLKDLTLIFDSG 238
G+ + ++ + +TP+L S + +L +S S + +I DSG
Sbjct: 257 GE-EPNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPIDSKEFSYVSNSTNETGVIIDSG 315
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
S+A ++ + + S I ++L T KL P + L + K+ V F + L+
Sbjct: 316 TSFALLATKANRILFSEI-KNL--TTAKLGPKLEGLQCFY---LKSGLTVETSFPNVTLT 369
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
F+ + + P+ YLV+ K G + G I GEI ++DK+V YD E
Sbjct: 370 FSGGST---MKLKPDNYLVMVELKKKRNGYCYAWSSADGLT-IFGEIVLKDKLVFYDVEN 425
Query: 359 QRIGWKPEDCNT 370
+RIGWK ++C++
Sbjct: 426 RRIGWKGQNCSS 437
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/385 (26%), Positives = 158/385 (41%), Gaps = 52/385 (13%)
Query: 4 SWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH 63
S IE + + +N+ +G P F DTGSDL W QC+ PCT C P + P
Sbjct: 83 SGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCE-PCTQCFSQPTPIFNPQ 141
Query: 64 K----NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN 119
+ +PC + C L P N++C Y YGDG ++ G + T+ F F
Sbjct: 142 DSSSFSTLPCESQYCQDL-----PSETCNNNECQYTYGYGDGSTTQGYMATETF--TFET 194
Query: 120 GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-- 177
SV N+ FGCG + G + AG++G+G G +S+ SQL +C+
Sbjct: 195 SSVPNI--AFGCGEDNQGFG---QGNGAGLIGMGWGPLSLPSQLG-----VGQFSYCMTS 244
Query: 178 -GQNGRGVLFLGDGK--VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-- 232
G + L LG VP G T ++ +S + +Y + + G + G+ T
Sbjct: 245 YGSSSPSTLALGSAASGVP-EGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQ 303
Query: 233 --------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK- 283
+I DSG + Y Y V+ D I P + L C++ P
Sbjct: 304 LQDDGTGGMIIDSGTTLTYLPQDAY-NAVAQAFTDQINLP-TVDESSSGLSTCFQQPSDG 361
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIG 343
+ QV E N L+ P E +CL + GS +++G +I G
Sbjct: 362 STVQVPEISMQFDGGVLNLGEQNILISPAEGV--------ICLAM--GSSSQLGI-SIFG 410
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDC 368
I Q+ V+YD + + + P C
Sbjct: 411 NIQQQETQVLYDLQNLAVSFVPTQC 435
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 104/388 (26%), Positives = 160/388 (41%), Gaps = 66/388 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F + L++G P + DTGSDL W QC PCT C P + P K+ V CS+
Sbjct: 108 FLMELSIGNPAVKYAAIVDTGSDLIWTQC-KPCTECFDQPTPIFDPEKSSSYSKVGCSSG 166
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL N C D C+Y YGD S+ G L T+ F N S+ + FGCG
Sbjct: 167 LCNALPRSN---CNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDEN-SISGIG--FGCG 220
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFLG 188
G +G++GLGRG +S++SQL+E +C+ LF+G
Sbjct: 221 VENEGDG---FSQGSGLVGLGRGPLSLISQLKE-----TKFSYCLTSIEDSEASSSLFIG 272
Query: 189 ---DGKVPSSG-------VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT------ 232
G V +G +L+N Y L + K ++ T
Sbjct: 273 SLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSED 332
Query: 233 ----LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK---TLPICWRGPFKAL 285
+I DSG + Y ++ ++++ + + L DD L +C++ P A
Sbjct: 333 GTGGMIIDSGTTITYLEETAFK-----VLKEEFTSRMSLPVDDSGSTGLDLCFKLPNAA- 386
Query: 286 GQVTEYFKPLAL-SFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIG 343
K +A+ L +P E Y+V V CL + GS + +I G
Sbjct: 387 -------KNIAVPKLIFHFKGADLELPGENYMVADSSTGVLCLAM--GSSNGM---SIFG 434
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
+ Q+ V++D EK+ + + P +C L
Sbjct: 435 NVQQQNFNVLHDLEKETVTFVPTECGKL 462
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 151/369 (40%), Gaps = 38/369 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCS 70
YF V + +G P + FDTGSDLTW QC+ PC G C K + + P K+ + C+
Sbjct: 136 YFVV-VGLGTPKRDLSLVFDTGSDLTWTQCE-PCAGSCYKQQDAIFDPSKSSSYINITCT 193
Query: 71 NPRCAALHWPN-PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C L RC C Y I+YGD +S+G L + + ++ F
Sbjct: 194 SSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLTITATD---IVDDFLF 250
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFL 187
GCG Q N G S +AG++GLGR IS V Q + + +C+ + G L
Sbjct: 251 GCG--QDNEGLFS--GSAGLIGLGRHPISFVQQTSS--IYNKIFSYCLPSTSSSLGHLTF 304
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSG------KSCGLKDLTLIFDSGASY 241
G ++ + +TP+ S D Y L + G S I DSG
Sbjct: 305 GASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVI 364
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
Y + S + + P +A +D C+ F +++ + F
Sbjct: 365 TRLAPTAYAALRSAFRQGMEKYP--VANEDGLFDTCY--DFSGYKEIS--VPKIDFEFA- 417
Query: 302 RRNSVRLVVPPEAYLVISGRKNVCLGI-LNGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
V + +P L+ + VCL NG++ ++ I G + + V+YD E R
Sbjct: 418 --GGVTVELPLVGILIGRSAQQVCLAFAANGNDNDI---TIFGNVQQKTLEVVYDVEGGR 472
Query: 361 IGWKPEDCN 369
IG+ CN
Sbjct: 473 IGFGAAGCN 481
>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
Length = 649
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 106/398 (26%), Positives = 172/398 (43%), Gaps = 57/398 (14%)
Query: 16 YFAVNLTVGKP-PKLFDFDFDTGSDLTWVQCDAPCTGC-TKPPEKQYKPHKNIVPCSNPR 73
Y+ N+ +G P P+ F DTGS LT+V C A C C T ++ P + C +
Sbjct: 111 YYYANIALGDPSPRTFQVIVDTGSTLTYVPC-ATCAKCGTHTGGTRFDPTGKWLTCQEKQ 169
Query: 74 CAALHWPN---PPRCKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSVFNVPLT 128
C A P R N +C Y Y +G G LV D F + + + +
Sbjct: 170 CKAAGGPGICAGGRGAAAN-RCTYSRTYAEGSGVSGDLVRDKMHFGGDIAPATNGTLDVV 228
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRI-SIVSQLREYGLIRNVIGHCIGQ-NGRGVLF 186
FGC G + + G++GLG + SI +QL + + V C G G G L
Sbjct: 229 FGC--TNAESGTIHDQEADGLIGLGNNQFASIPNQLADTHGLPRVFSLCFGSFEGGGALS 286
Query: 187 LGDGKVPSS----GVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-KDLTL----IFDS 237
G++P++ + +T M N A +Y++ A + + DL + + DS
Sbjct: 287 F--GRLPATPHTPPLVYTDMRVNEAHPAYYVVSTAAMKIGDVAVATPSDLAVGYGTVMDS 344
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTP---LKLA---------PDDKTLPICWR------ 279
G ++ Y ++V+ + + + KLA PDD +C++
Sbjct: 345 GTTFTYVPTKVFHATAAALDAAVTTNAKPEKKLAKVPGPDPSYPDD----VCFQREGATE 400
Query: 280 -GPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRK--NVCLGILNGSEAEV 336
P + + EY+ PL ++F S LV+PP YL + G+K CLG+++ +
Sbjct: 401 IEPIVTMANLGEYYPPLTIAFDGEGAS--LVLPPSNYLFVHGKKPGAFCLGVMDNKQ--- 455
Query: 337 GENNIIGEIFMQDKMVIYDNE--KQRIGWKPEDCNTLL 372
+ +IG I ++D +V YD RIG+ DC+ LL
Sbjct: 456 -QGTLIGGISVRDVLVEYDKTVGGGRIGFAATDCDALL 492
>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
Length = 513
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 158/379 (41%), Gaps = 57/379 (15%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPE------KQYKPHKN 65
F ++AV + +G P F DTGSDL WV CD C C + P Y P ++
Sbjct: 97 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CLKCAPLQSPNYGSLKFDVYSPAQS 153
Query: 66 I----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRF--S 118
VPCS+ C + C+ ++ C Y I+Y D SS G LV D+ L +
Sbjct: 154 TTSRKVPCSSNLCDLQN-----ACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSA 208
Query: 119 NGSVFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+ P+ FGCG Q G +P G+LGLG S+ S L GL N C
Sbjct: 209 QSKIVTAPIMFGCGQVQTGSFLGSAAP---NGLLGLGMDSKSVPSLLASKGLAANSFSMC 265
Query: 177 IGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPA-ELLYSGKSCGLK----DL 231
G +G G + GD SS TP L Y P + +G + G K +
Sbjct: 266 FGDDGHGRINFGD--TGSSDQKETP-------LNVYKQNPYYNITITGITVGSKSISTEF 316
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
+ I DSG S+ + +Y +I S + + L D ++P + A G V
Sbjct: 317 SAIVDSGTSFTALSDPMYTQITSSFDAQIRSSRNML---DSSMPFEFCYSVSANGIVHP- 372
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV--CLGILNGSEAEVGENNIIGEIFMQD 349
+S T + S+ V P + + V CL I+ N+IGE FM
Sbjct: 373 ----NVSLTAKGGSIFPVNDPIITITDNAFNPVGYCLAIMKSEGV-----NLIGENFMSG 423
Query: 350 KMVIYDNEKQRIGWKPEDC 368
V++D E+ +GWK +C
Sbjct: 424 LKVVFDRERMVLGWKNFNC 442
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 100/364 (27%), Positives = 148/364 (40%), Gaps = 42/364 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ + + G P K FDTGS++ W+QC C E + P ++NI C++
Sbjct: 16 YVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTYRNI-SCTS 74
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L C C Y + YGDG S++G L T+ F L + G+VFN FGC
Sbjct: 75 AACTGLSSRG---CS--GSTCVYGVTYGDGSSTVGFLATETFTL--AAGNVFN-NFIFGC 126
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G Q+N G + AG++GLGR S+ SQL + N+ +C+ +L G
Sbjct: 127 G--QNNQGLFT--GAAGLIGLGRSPYSLNSQLATS--LGNIFSYCLPSTSSATGYLNIGN 180
Query: 192 VPSSGVAWTPMLQNS-------ADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
P +T ML NS DL +G L S S + + I DSG
Sbjct: 181 -PLRTPGYTAMLTNSRAPTLYFIDLIGISVGGTRLALS--STVFQSVGTIIDSGTVITRL 237
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN 304
Y + + + T A L C+ F VT F + L +T
Sbjct: 238 PPTAYGALRTAFRAAM--TQYTRAAAASILDTCY--DFSRTTTVT--FPTIKLHYTG--- 288
Query: 305 SVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
+ + +P + VCL S++ + IIG + + V YDN +RIG+
Sbjct: 289 -LDVTIPGAGVFYVISSSQVCLAFAGNSDST--QIGIIGNVQQRTMEVTYDNALKRIGFA 345
Query: 365 PEDC 368
C
Sbjct: 346 AGAC 349
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/369 (24%), Positives = 156/369 (42%), Gaps = 38/369 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSN 71
Y N T+G PP+ D +L W QC + C C K + P+ + PC
Sbjct: 23 YNVANFTIGTPPQAASAFIDLTGELVWTQC-SQCIHCFKQDLPVFVPNASSTFKPEPCGT 81
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C ++ P K +D C ++ G GG ++G + TD F + G+ L FGC
Sbjct: 82 DVCKSIPTP-----KCASDVCAFDGVTGLGGHTVGIVATDTFAI----GTAAPASLGFGC 132
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
P +G +GLGR S+V+Q++ + H G+N R LFLG
Sbjct: 133 VVASDIDTMGGP---SGFIGLGRTPWSLVAQMKLTRFSYCLAPHDTGKNSR--LFLGASA 187
Query: 192 VPSSGVAWTPMLQNSAD--LKHYILGPAELLYSGKSCGL----KDLTLIFDSGASYAYFT 245
+ G AWTP ++ S + + Y E + +G + ++ L+ + +
Sbjct: 188 KLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRNTVLVQTAVVRVSLLV 247
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
VYQE +M + P P + +C+ P + + L FT + +
Sbjct: 248 DSVYQEFKKAVMASVGAAPTA-TPVGEPFEVCF--PKAGVSGAPD------LVFTFQAGA 298
Query: 306 VRLVVPPEAYLVISGRKNVCLGILNGSEAEVGE---NNIIGEIFMQDKMVIYDNEKQRIG 362
L VPP YL G VCL +++ + + NI+G ++ +++D +K +
Sbjct: 299 A-LTVPPANYLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLS 357
Query: 363 WKPEDCNTL 371
++P DC++L
Sbjct: 358 FEPADCSSL 366
>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
Length = 478
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/360 (25%), Positives = 158/360 (43%), Gaps = 34/360 (9%)
Query: 27 PKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAALHWPNPPRCK 86
+ F+ DTGS T++ C C C +Y + S C+A +C
Sbjct: 44 AQTFELIVDTGSSRTYLPCKG-CASCGAHEAGRYYDYDASADFSRVECSACAGIGG-KCG 101
Query: 87 HPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDT 146
+ C Y++ Y +G S G LV D+ L GSV N + FGC + G +
Sbjct: 102 -TSGVCRYDVHYLEGSGSEGYLVRDVVSL---GGSVGNATVVFGC--EERELGSIKQQSA 155
Query: 147 AGVLGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGVLFLG--DGKVPSSGV 197
G+ G GR ++ +QL +I ++ C+ G++ G+L LG D + +
Sbjct: 156 DGLFGFGRQAYALRAQLASASVIDDLFSMCVEGYEKLSGEHVGGLLTLGNFDFGADAPAL 215
Query: 198 AWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIM 257
+TPM+ S+ + + + + L + G + + I DSG SY Y ++ + L
Sbjct: 216 VYTPMV--SSAMYYQVTTTSWTLGNSVVEGSRGVLTIIDSGTSYTYVPGNMHARFLQLAE 273
Query: 258 RDLIGTPL-KLAPDDKTLPICWRGPFKALG--QVTEYFKPLALSFTNRRNSVRLVVPPEA 314
+ L K+AP + +C+ G LG V+EYF L + + S RL + PE
Sbjct: 274 DAARESGLEKVAPPEDYPDLCF-GNSGGLGWSTVSEYFPALKIEY---HGSARLTLSPET 329
Query: 315 YLVISGRKNV---CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
YL +KN C+GIL + + ++G+I M++ +D + ++G +C L
Sbjct: 330 YLYWH-QKNASAFCVGILEHDDNRI----LLGQITMRNTFTEFDVARSQVGMASANCEML 384
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 99/395 (25%), Positives = 163/395 (41%), Gaps = 64/395 (16%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKN----I 66
P + + L +G PP + DTGSDL W QC APCT C + P Y P + +
Sbjct: 85 PTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFRQPTPLYNPSSSTTFAV 143
Query: 67 VPCSNPRCAALHWPN------PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG 120
+PC++ PP C C Y + YG G +S+ ++ F +
Sbjct: 144 LPCNSSLSVCAAALAGTGTAPPPGCA-----CTYNVTYGSGWTSVFQ-GSETFTFGSTPA 197
Query: 121 SVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG- 178
VP + FGC + +G++GLGRGR+S+VSQL G+ + +C+
Sbjct: 198 GQSRVPGIAFGCSTASSG---FNASSASGLVGLGRGRLSLVSQL---GVPK--FSYCLTP 249
Query: 179 ---QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLY----SGKSCGLKDL 231
N L LG PS+ + T + ++ + P Y +G S G L
Sbjct: 250 YQDTNSTSTLLLG----PSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTAL 305
Query: 232 T---------------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI 276
+ LI DSG + + YQ++ + ++ L+ P L +
Sbjct: 306 SIPPDAFLLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVV-SLVTLPTTDGSAATGLDL 364
Query: 277 CWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEV 336
C+ P + P S T N +V+P ++Y++ CL + N ++ EV
Sbjct: 365 CFMLP------SSTSAPPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEV 418
Query: 337 GENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
NI+G Q+ ++YD ++ + + P C+ L
Sbjct: 419 ---NILGNYQQQNMHILYDIGQETLSFAPAKCSAL 450
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 108/373 (28%), Positives = 158/373 (42%), Gaps = 49/373 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + +G P KL DTGSD+ W+QC +PC C K + + P + + CS
Sbjct: 14 YF-VRVGIGSPTKLQYLVMDTGSDVPWIQC-SPCKSCYKQNDAVFDPRASSSFRRLSCST 71
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P+C L + C +++C Y++ YGDG ++G L +D F + S P+ FGC
Sbjct: 72 PQCKLL---DVKACASTDNRCLYQVSYGDGSFTVGDLASDSFLVSRGRTS----PVVFGC 124
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G++ N G AG+LGLG G++S SQL ++ G L GD
Sbjct: 125 GHD--NEGLF--VGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLFGDSA 180
Query: 192 VPSSG-VAWTPMLQN-------SADLKHYILGPAELLYSGKSCGLKDLT----LIFDSGA 239
+P+S A+T +L+N A L +G L + L T +I DSG
Sbjct: 181 LPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGT 240
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTP---LKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
S + Y +MRD + L A D C+ F AL VT ++
Sbjct: 241 SVTRLPTYAYT-----VMRDAFRSATQKLPRAADFSLFDTCY--DFSALTSVT--IPTVS 291
Query: 297 LSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
F + +PP YLV + C S + +IIG I Q V D
Sbjct: 292 FHF---EGGASVQLPPSNYLVPVDTSGTFCFAFSKTSL----DLSIIGNIQQQTMRVAID 344
Query: 356 NEKQRIGWKPEDC 368
+ R+G+ P C
Sbjct: 345 LDSSRVGFAPRQC 357
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 104/365 (28%), Positives = 152/365 (41%), Gaps = 59/365 (16%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK------------QYKPH 63
YFA + +G P K + DTGSD+ WV C GC + P K +
Sbjct: 78 YFA-KIGIGTPSKDYYVQVDTGSDILWVNC----AGCDRCPTKSDLGVDLTLYDMKASTT 132
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ V C + C+ P P CK P QC Y + YGDG S+ G V D +G+
Sbjct: 133 SDAVGCDDNFCSLYDGP-LPGCK-PGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQ 190
Query: 124 NVP----LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
P + FGCG Q S G+LG G+ S++SQL G ++ V HC+
Sbjct: 191 TTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDN 250
Query: 180 -NGRGVLFLGDGKVPS------SGVAWTPMLQNSAD----LKHYILG------PAELLYS 222
+G G+ +G+ P + V + + A +K +G P++ S
Sbjct: 251 VDGGGIFAIGEVVEPKVRFLLMNSVMIVVLFLSRAHYNVVMKEIEVGGDPLDVPSDAFES 310
Query: 223 GKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTP-LKLAPDDKTLPICWRGP 281
G G I DSG + AYF VY V LI + L P L+L ++
Sbjct: 311 GDRKG-----TIIDSGTTLAYFPQEVY---VPLIEKILSQQPDLRLHTVEQAFTC----- 357
Query: 282 FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILN-GSEAEVGEN- 339
F G V + F + L F S+ L V P YL C+G N G++ + G++
Sbjct: 358 FDYTGNVDDGFPTVTLHFD---KSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDL 414
Query: 340 NIIGE 344
++GE
Sbjct: 415 TLLGE 419
>gi|213998838|gb|ACJ60786.1| nucellin [Hordeum vulgare subsp. vulgare]
Length = 154
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 81/142 (57%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ + +I+ NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD P+ GV W PM ++ L +Y G AE+ + G +FDSG++Y +
Sbjct: 69 YVGDFNPPTRGVTWAPMRES---LFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS + L + L+
Sbjct: 126 PAQIYNEIVSKVRVTLSESSLE 147
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 97/377 (25%), Positives = 165/377 (43%), Gaps = 57/377 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +++G PP+ F DTGSDL WVQC APC C + P+ + P + C++
Sbjct: 8 YVLQISLGTPPQQFSAIVDTGSDLCWVQC-APCARCFEQPDPLFIPLASSSYSNASCTDS 66
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL P C N C Y YGDG ++ G + L NGS + FGCG
Sbjct: 67 LCDALPRPT---CSMRN-TCTYSYSYGDGSNTRGDFAFETVTL---NGSTL-ARIGFGCG 118
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC-IGQNGRGV---LFLG 188
+NQ G + D G++GLG+G +S+ SQL ++ +C + Q+ G + G
Sbjct: 119 HNQE--GTFAGAD--GLIGLGQGPLSLPSQLNSS--FTHIFSYCLVDQSTTGTFSPITFG 172
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILG--------------PAELLYSGKSCGLKDLTLI 234
+ +S ++TP+LQN + +Y +G P+ G +I
Sbjct: 173 NAA-ENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVG----GVI 227
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG + Y+ + I++ + R I P + P L +C+ ++ +
Sbjct: 228 LDSGTTITYWRLAAFIPILAELRRQ-ISYP-EADPTPYGLNLCYD--ISSVSASSLTLPS 283
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISGR--KNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
+ + TN V +P V+ + VC + + + +IIG + Q+ ++
Sbjct: 284 MTVHLTN----VDFEIPVSNLWVLVDNFGETVCTAM-----STSDQFSIIGNVQQQNNLI 334
Query: 353 IYDNEKQRIGWKPEDCN 369
+ D R+G+ DC+
Sbjct: 335 VTDVANSRVGFLATDCS 351
>gi|213998816|gb|ACJ60775.1| nucellin [Hordeum patagonicum subsp. patagonicum]
Length = 152
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 80/142 (56%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 7 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKVITGNVIGHCLSSKGKGVL 66
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 67 YVGDFNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 123
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS + L + L+
Sbjct: 124 PAQIYNEIVSKVRGTLSESSLE 145
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 110/386 (28%), Positives = 153/386 (39%), Gaps = 57/386 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK----PPEKQYKPHKNIVPCSNP 72
+ V+L +G PP+ DTGSDL W QC PC C P + +++PCS+P
Sbjct: 415 YLVHLAIGTPPQPVQLILDTGSDLVWTQCR-PCPVCFSRALGPLDPSNSSTFDVLPCSSP 473
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVP-LTFG 130
C L W + + N C Y Y DG + G L + F ++G+ VP L FG
Sbjct: 474 VCDNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFG 533
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLF 186
CG N G + +T G+ G GRG +S+ SQL+ + HC G VL
Sbjct: 534 CGL--FNNGIFTSNET-GIAGFGRGALSLPSQLKV-----DNFSHCFTAITGSEPSSVLL 585
Query: 187 --------LGDGKVPSSGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLK-D 230
DG V S TP++QN + L+ Y L G L + LK D
Sbjct: 586 GLPANLYSDADGAVQS-----TPLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQD 640
Query: 231 LT--LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
T I DSG Y+ ++ D ++L D+ T R F V
Sbjct: 641 GTGGTIIDSGTGMTTLPQDAYK-----LVHDAFTAQVRLPVDNATSSSLSRLCFSF--SV 693
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVI---SGRKNVCLGILNGSEAEVGENNIIGEI 345
KP L +P E Y+ +G CL I G + IIG
Sbjct: 694 PRRAKPDVPKLVLHFEGATLDLPRENYMFEFEDAGGSVTCLAINAGDDL-----TIIGNY 748
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTL 371
Q+ V+YD + + + P CN L
Sbjct: 749 QQQNLHVLYDLVRNMLSFVPAQCNRL 774
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 169/386 (43%), Gaps = 53/386 (13%)
Query: 17 FAVNLTVGKPP-KLFDFDFDTGSDLTWVQCDAPCTGCTKPP----EKQYKPHKNIVPCSN 71
F +++T+G PP K+F DTGSDLTWVQC PC C K +K+ PC +
Sbjct: 85 FFMSITIGTPPIKVFAIA-DTGSDLTWVQC-KPCQQCYKENGPIFDKKKSSTYKSEPCDS 142
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FG 130
C AL C N+ C Y YGD S G + T+ + ++GS + P T FG
Sbjct: 143 RNCQALS-STERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFG 201
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-----NGRGVL 185
CGYN G +G++GLG G +S++SQL I +C+ NG V+
Sbjct: 202 CGYNN---GGTFDETGSGIIGLGGGHLSLISQLGSS--ISKKFSYCLSHKSATTNGTSVI 256
Query: 186 FLGDGKVPS-----SGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDL--- 231
LG +PS SGV TP++ +Y+ +G ++ Y+G S D
Sbjct: 257 NLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGIL 316
Query: 232 -----TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
+I DSG + + + + S + + G +++ L C++ +G
Sbjct: 317 SETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAK-RVSDPQGLLSHCFKSGSAEIG 375
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIF 346
+ + FT VRL P A++ +S VCL ++ +E I G
Sbjct: 376 -----LPEITVHFTGA--DVRL-SPINAFVKLS-EDMVCLSMVPTTEVA-----IYGNFA 421
Query: 347 MQDKMVIYDNEKQRIGWKPEDCNTLL 372
D +V YD E + + ++ DC+ L
Sbjct: 422 QMDFLVGYDLETRTVSFQHMDCSANL 447
>gi|242094226|ref|XP_002437603.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
gi|241915826|gb|EER88970.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
Length = 541
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 114/381 (29%), Positives = 154/381 (40%), Gaps = 51/381 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQYKPH-------K 64
Y+AV + VG P F DTGSDL WV CD A T P +P+
Sbjct: 111 YYAV-VEVGTPNATFLVALDTGSDLFWVPCDCKQCASIANVTGQPATALRPYSPRESSTS 169
Query: 65 NIVPCSNPRCAALHWPNPPRCKHP-NDQCDYEIEYGDGGSSI-GALVTDLFPLR------ 116
V C N C P C N C YE++Y +S G LV D+ L
Sbjct: 170 KQVTCDNALC-----DRPNGCSAATNGSCPYEVQYLSANTSTSGVLVQDVLHLTRERPGA 224
Query: 117 -FSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIG 174
G P+ FGCG Q L G++GLGR +S+ S L GL+ +
Sbjct: 225 AAEAGEALQAPVVFGCGQVQTGTF-LDGAAFDGLMGLGRENVSVPSVLASSGLVASDSFS 283
Query: 175 HCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLI 234
C G +G G + GD SSG TP + Y + + KS + +
Sbjct: 284 MCFGDDGVGRINFGDSG--SSGQGETPF---TGRRTLYNVSFTAVNVETKSVA-AEFAAV 337
Query: 235 FDSGASYAYFTSRVYQEIVS---LIMRDLIGTPLKLAPDDKTLPICWRGPFKALG-QVTE 290
DSG S+ Y Y E+ + ++R+ + D C+ ALG TE
Sbjct: 338 IDSGTSFTYLADPEYTELATNFNSLVRERRTNFSSGSADPFPFEYCY-----ALGPNQTE 392
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV--CLGILNGSEAEVGEN-NIIGEIFM 347
P +S T + R V V SGR V CL I+ ++G N NIIG+ FM
Sbjct: 393 ALIP-DVSLTT-KGGARFPVTQPVIGVASGRTVVGYCLAIMKN---DLGVNFNIIGQNFM 447
Query: 348 QDKMVIYDNEKQRIGWKPEDC 368
V++D EK +GW+ DC
Sbjct: 448 TGLKVVFDREKSVLGWEKFDC 468
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 98/364 (26%), Positives = 151/364 (41%), Gaps = 35/364 (9%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNPRCAA 76
+T+G DTGSDLTWVQC+ PC C +KP V C++ C +
Sbjct: 67 VTMGLGSTNMTVIIDTGSDLTWVQCE-PCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQS 125
Query: 77 LHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYN 134
L + N C C+Y + YGDG + G L + + S G V FGCG N
Sbjct: 126 LQFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVE----QLSFGGVSVSDFVFGCGRN 181
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDGK 191
N G +G++GLGR +S+VSQ V +C+ G L +G+
Sbjct: 182 --NKGLFG--GVSGLMGLGRSYLSLVSQTN--ATFGGVFSYCLPTTESGASGSLVMGNES 235
Query: 192 VPSSGV---AWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL---TLIFDSGASYAYFT 245
V +T ML N YIL + G + + ++ DSG
Sbjct: 236 SVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQVPSFGNGGVLIDSGTVITRLP 295
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
S VY+ + +L ++ G P AP L C F G +++ F
Sbjct: 296 SSVYKALKALFLKQFTGFP--SAPGFSILDTC----FNLTGYDEVSIPTISMHFEGNA-E 348
Query: 306 VRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKP 365
+++ Y+V VCL + + S+A + IIG +++ VIYD ++ ++G+
Sbjct: 349 LKVDATGTFYVVKEDASQVCLALASLSDAY--DTAIIGNYQQRNQRVIYDTKQSKVGFAE 406
Query: 366 EDCN 369
E C+
Sbjct: 407 ESCS 410
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 102/379 (26%), Positives = 155/379 (40%), Gaps = 50/379 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ + + +G P + + DTGSDL W QC APC C P + P + + CS P
Sbjct: 92 YLMEMGIGTPARFYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDPANSSTYRSLGCSAP 150
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL++ P C C Y+ YGD S+ G L + F ++ V ++FGCG
Sbjct: 151 ACNALYY---PLCYQ--KTCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISFGCG 205
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG---QNGRGVLFLGD 189
N G L+ + +G++G GRG +S+VSQL G R +C+ R L+ G
Sbjct: 206 --NLNAGSLA--NGSGMVGFGRGSLSLVSQL---GSPR--FSYCLTSFLSPVRSRLYFGA 256
Query: 190 ----GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----------I 234
+S V TP + N A Y L + G + L I
Sbjct: 257 YATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTI 316
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGT-PLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
DSG + Y Y + + L T PL + L C++ P VT
Sbjct: 317 IDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVT--LP 374
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVIS-GRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
L L F + +P + Y+++ +CL + S+ +IIG Q+ V
Sbjct: 375 QLVLHF----DGADWELPLQNYMLVDPSTGGLCLAMATSSDG-----SIIGSYQHQNFNV 425
Query: 353 IYDNEKQRIGWKPEDCNTL 371
+YD E + + P CN +
Sbjct: 426 LYDLENSLLSFVPAPCNLM 444
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 104/392 (26%), Positives = 169/392 (43%), Gaps = 62/392 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ +++ VG PPK DTGSDL+W+QCD PC C + Y P ++NI C +
Sbjct: 171 YFLDMFVGTPPKHVWLILDTGSDLSWIQCD-PCYDCFEQNGSHYYPKDSSTYRNI-SCYD 228
Query: 72 PRCAALHWPNP-PRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS--NG-SVFN--V 125
PRC + +P CK N C Y +Y DG ++ G ++ F + + NG F V
Sbjct: 229 PRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQVV 288
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCI-----GQ 179
+ FGCG+ N G +G+LGLGRG IS SQ++ YG + +C+
Sbjct: 289 DVMFGCGH--WNKGFFYG--ASGLLGLGRGPISFPSQIQSIYG---HSFSYCLTDLFSNT 341
Query: 180 NGRGVLFLGDGK--VPSSGVAWTPML--QNSADLKHYILGPAELLYSGKSCGLKDLT--- 232
+ L G+ K + + + +T +L + + D Y L ++ G+ + + T
Sbjct: 342 SVSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHW 401
Query: 233 ------------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKL---APDDKTLPIC 277
I DSG++ +F Y I+++ +KL A DD + C
Sbjct: 402 SSEGAAADAGGGTIIDSGSTLTFFPDSAYD-----IIKEAFEKKIKLQQIAADDFVMSPC 456
Query: 278 WRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEV 336
+ A+ QV + F + P E Y + +CL I+
Sbjct: 457 YNVS-GAMMQVE--LPDFGIHFA---DGGVWNFPAENYFYQYEPDEVICLAIMKTPNH-- 508
Query: 337 GENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
IIG + Q+ ++YD ++ R+G+ P C
Sbjct: 509 SHLTIIGNLLQQNFHILYDVKRSRLGYSPRRC 540
>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 101/423 (23%), Positives = 169/423 (39%), Gaps = 84/423 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTW----------VQCD------------------AP 48
+ ++L++G PP++ DTGSDLTW ++CD +
Sbjct: 80 YLISLSIGTPPQVIQVYMDTGSDLTWAPCGNISFDCIECDNYRNNRMMASFSPSHSSSSH 139
Query: 49 CTGCTKPPEKQYKPHKN-IVPCSNPRC-------AALHWPNPPRCKHPNDQCDYEIEYGD 100
CT P N + PC+ C A WP PP + YG
Sbjct: 140 RDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPP----------FAYTYGA 189
Query: 101 GGSSIGALVTDLFPLRFSN-GSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRIS 158
GG G L D + N G +P FGC + + + G+ G GRG +S
Sbjct: 190 GGVVTGTLTRDTLRVHGRNLGVTQEIPRFCFGCVASSYR-------EPIGIAGFGRGALS 242
Query: 159 IVSQLREYGLIRNVIGHCI-------GQNGRGVLFLGDGKVPSSG-VAWTPMLQNSADLK 210
+ SQL G +R HC N L +GD + S + +TPML++
Sbjct: 243 LPSQL---GFLRKGFSHCFLAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPMYPN 299
Query: 211 HYILGPAELLYSGKSC-----------GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRD 259
+Y +G + S L + ++ DSG +Y + Y +++S +++
Sbjct: 300 YYYVGLEAITVGNVSATEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLS-VLQS 358
Query: 260 LIGTPLKLAPDDKT-LPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVI 318
+I P + +T +C++ P + +T P +++F N+ ++ + +
Sbjct: 359 IINYPRATDMEMRTGFDLCYKVPCQNNSILTGDLLP-SITFHFLNNASLVLSRGSHFYAM 417
Query: 319 SGRKNV----CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTLLSL 374
S N CL + + + G ++G QD V+YD EK+RIG++P DC + S
Sbjct: 418 SAPSNSTVVKCLLFQSMDDGDYGPAGVLGSFQQQDVEVVYDMEKERIGFRPMDCASAASF 477
Query: 375 NHF 377
F
Sbjct: 478 QGF 480
>gi|213998818|gb|ACJ60776.1| nucellin [Hordeum patagonicum subsp. setifolium]
Length = 149
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 55/142 (38%), Positives = 80/142 (56%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EI+S + L + L+
Sbjct: 126 PAQIYNEILSKVRGTLSESSLE 147
>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
sativa Japonica Group]
Length = 732
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 158/379 (41%), Gaps = 57/379 (15%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPE------KQYKPHKN 65
F ++AV + +G P F DTGSDL WV CD C C + P Y P ++
Sbjct: 97 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQS 153
Query: 66 I----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRF--S 118
VPCS+ C + C+ ++ C Y I+Y D SS G LV D+ L +
Sbjct: 154 TTSRKVPCSSNLCDLQN-----ACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSA 208
Query: 119 NGSVFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+ P+ FGCG Q G +P G+LGLG S+ S L GL N C
Sbjct: 209 QSKIVTAPIMFGCGQVQTGSFLGSAAP---NGLLGLGMDSKSVPSLLASKGLAANSFSMC 265
Query: 177 IGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPA-ELLYSGKSCGLK----DL 231
G +G G + GD SS TP L Y P + +G + G K +
Sbjct: 266 FGDDGHGRINFGD--TGSSDQKETP-------LNVYKQNPYYNITITGITVGSKSISTEF 316
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
+ I DSG S+ + +Y +I S + + L D ++P + A G V
Sbjct: 317 SAIVDSGTSFTALSDPMYTQITSSFDAQIRSSRNML---DSSMPFEFCYSVSANGIVHP- 372
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV--CLGILNGSEAEVGENNIIGEIFMQD 349
+S T + S+ V P + + V CL I+ N+IGE FM
Sbjct: 373 ----NVSLTAKGGSIFPVNDPIITITDNAFNPVGYCLAIMKSEGV-----NLIGENFMSG 423
Query: 350 KMVIYDNEKQRIGWKPEDC 368
V++D E+ +GWK +C
Sbjct: 424 LKVVFDRERMVLGWKNFNC 442
>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
gi|194693730|gb|ACF80949.1| unknown [Zea mays]
gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
Length = 519
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 98/362 (27%), Positives = 143/362 (39%), Gaps = 37/362 (10%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCD----APCT----------GCTKPPEKQYKPHKNIVP 68
VG P F DTGSDL WV CD AP + G KP E H +P
Sbjct: 106 VGTPTTSFLVALDTGSDLFWVPCDCIQCAPLSSYRGNLDRDLGIYKPAESTTSRH---LP 162
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSV-FNVP 126
CS+ C C +P C Y I+Y + +S G L+ D L G N
Sbjct: 163 CSHELCQPGSG-----CTNPKQPCTYNIDYFSENTTSSGLLIEDSLHLNSREGHAPVNAS 217
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
+ GCG Q L G+LGLG IS+ S L GL+RN C ++ G +F
Sbjct: 218 VIIGCGRKQSG-DYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKEDSSGRIF 276
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTS 246
GD V S TP + L+ Y + + K + DSG S+
Sbjct: 277 FGDQGVSSQQS--TPFVPLYGKLQTYAVNVDKSCIGHKCLEGSSFQALVDSGTSFTSLPP 334
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSV 306
VY+ + + + + ++ +D T C+ + V LA + +V
Sbjct: 335 DVYKAFTTEFDKQINAS--RVPYEDSTWKYCYSASPLEMPDVPTII--LAFAANKSFQAV 390
Query: 307 RLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPE 366
++P R CL +L +E +G IIG+ F+ V++D E ++GW
Sbjct: 391 NPILPFNDEQGALAR--FCLAVLPSTEP-IG---IIGQNFLVGYHVVFDRESMKLGWYRS 444
Query: 367 DC 368
+C
Sbjct: 445 EC 446
>gi|213998824|gb|ACJ60779.1| nucellin [Hordeum chilense]
Length = 140
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 55/142 (38%), Positives = 78/142 (54%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 1 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 60
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
+ GD PS GV W PM ++ +Y G AELL + G +FDSG++Y +
Sbjct: 61 YFGDFNPPSRGVTWVPMKESXX---YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 117
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS + L + L+
Sbjct: 118 PAQIYNEIVSKVRGTLSESSLE 139
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 113/391 (28%), Positives = 164/391 (41%), Gaps = 69/391 (17%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV--- 67
F YFA ++ VG PP DTGSD+ W+QC PC C + Y P +
Sbjct: 94 FASGEYFA-SVGVGTPPTPALLVIDTGSDVVWLQCK-PCVHCYRQLSPLYDPRGSSTYAQ 151
Query: 68 -PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG-SVFNV 125
PCS P+C NP C C Y I YGD S+ G L TD L FSN SV NV
Sbjct: 152 TPCSPPQCR-----NPQTCDGTTGGCGYRIVYGDASSTSGNLATDR--LVFSNDTSVGNV 204
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRG- 183
T GCG++ N G AG+LG+ RG S +Q+ + YG +C+G R
Sbjct: 205 --TLGCGHD--NEGLFG--SAAGLLGVARGNNSFATQVADSYG---RYFAYCLGDRTRSG 255
Query: 184 -----VLFLGDGKVPSSGVAWTPMLQNS-------ADLKHYILGPAELL-YSGKSCGLKD 230
++F P S V +TP+ N D+ + +G + +S S L
Sbjct: 256 SSSSYLVFGRTAPEPPSSV-FTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDP 314
Query: 231 LT----LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
T ++ DSG S F RD G L+ A D + + R + +
Sbjct: 315 ATGRGGVVVDSGTSITRFA------------RDAYGA-LRDAFDARAAKVGMRKVGRGIS 361
Query: 287 QVTEYFKPLALSFTNRRNSV-------RLVVPPEAYLV--ISGRKNVCLGILNGSEAEVG 337
+ ++ + V + +PPE YLV SGR + C + +
Sbjct: 362 VFDACYDLRGVAVADAPGVVLHFAGGADVALPPENYLVPEESGRYH-CFALEAAGHDGL- 419
Query: 338 ENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
++IG + Q V++D E +R+G++P C
Sbjct: 420 --SVIGNVLQQRFRVVFDVENERVGFEPNGC 448
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 100/395 (25%), Positives = 155/395 (39%), Gaps = 64/395 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V L +G P DTGSD++W+QC PC C + P + +PC++
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQC-VPCKDCVPALRPPFNPRHSSSFFKLPCASS 196
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLF-----------PLRFSNGS 121
C ++ P C C + I+YGDG S G L + P++ SN
Sbjct: 197 TCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSN-- 254
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-- 179
+T GC P +G+LG+ R IS SQL HC
Sbjct: 255 -----ITLGCADIDREGLPTG---ASGLLGMDRRPISFPSQLSSR--YARKFSHCFPDKI 304
Query: 180 ---NGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG-------PAELLYSGKS 225
N G++F G+ + S + +TP++QN SA L +Y +G + L S K+
Sbjct: 305 AHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKN 364
Query: 226 CGLKDLT----LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAP--DDKTLPICWR 279
+ +T I DSG ++ Y +Q + R+ + LA D+ C+
Sbjct: 365 FDIDKVTGSGGTIIDSGTAFTYLKKPAFQA----MRREFLARTSHLAKVDDNSGFTPCYN 420
Query: 280 GPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAE 335
+ + L F R + +V+P + L+ + +CL +
Sbjct: 421 ITSGTAALESTILPSITLHF---RGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDIP 477
Query: 336 VGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
NIIG Q+ V YD EK R+G P C T
Sbjct: 478 F---NIIGNYQQQNLWVEYDLEKLRLGIAPAQCAT 509
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 106/383 (27%), Positives = 164/383 (42%), Gaps = 63/383 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
F +++++G P + DTGSDL W QC PC C + P + +PCS+
Sbjct: 102 FLMDMSIGTPAVAYAAIIDTGSDLVWTQCK-PCVECFNQSTPVFDPSSSSTYAALPCSST 160
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C+ L P K + +C Y YGD S+ G L + F L + +P + FGC
Sbjct: 161 LCSDL-----PSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTLAKT-----KLPDVAFGC 210
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ---NGRGVLFLG 188
G G AG++GLGRG +S+VSQL GL N +C+ + L LG
Sbjct: 211 GDTNEGDG---FTQGAGLVGLGRGPLSLVSQL---GL--NKFSYCLTSLDDTSKSPLLLG 262
Query: 189 D------GKVPSSGVAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDL---T 232
+S V TP+++N + +LK +G + + ++D
Sbjct: 263 SLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGG 322
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQVT 289
+I DSG S Y + Y+ ++ +KL D + L C+ P + QV
Sbjct: 323 VIVDSGTSITYLELQGYRA-----LKKAFAAQMKLPAADGSGIGLDTCFEAPASGVDQV- 376
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVI-SGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
E K L F + L +P E Y+V+ SG +CL ++ GS +IIG Q
Sbjct: 377 EVPK---LVF--HLDGADLDLPAENYMVLDSGSGALCLTVM-GSRGL----SIIGNFQQQ 426
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
+ +YD + + + P C L
Sbjct: 427 NIQFVYDVGENTLSFAPVQCAKL 449
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 107/399 (26%), Positives = 162/399 (40%), Gaps = 52/399 (13%)
Query: 3 VSWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC--TGCTKPPEKQY 60
V W E S + +G PP+ + DTGS+L W QC + C GC Y
Sbjct: 64 VHWAE-------SQYIAEYLIGDPPQQAEAIIDTGSNLIWTQC-STCQPAGCFSQNLSFY 115
Query: 61 KPHKNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR 116
P ++ V C++ CA + RC N C YG G G L T+ F +
Sbjct: 116 DPSRSRTARPVACNDTACA---LGSETRCARDNKACAVLTAYG-AGVIGGVLGTEAFTFQ 171
Query: 117 FSNGSVFNVPLTFGC-GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGH 175
+ NV L FGC + PG L +G++GLGRG +S+VSQL + + +
Sbjct: 172 PQSE---NVSLAFGCIAATRLTPGSLD--GASGIIGLGRGNLSLVSQLGDNKFSYCLTPY 226
Query: 176 CIGQNGRGVLFLGDGKVPSSGVA---WTPMLQN-SAD---------LKHYILGPAELLYS 222
LF+G SSG A P L+N D L +G A+L
Sbjct: 227 FSQSTNTSRLFVGASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVP 286
Query: 223 GKSCGLKDLTL------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI 276
+ L+ + + DSG+ + YQ + +++ L + + + L +
Sbjct: 287 EAAFDLRQVATGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDL 346
Query: 277 CWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNG----S 332
C A G V + PL L F + V VPPE Y C+ + + S
Sbjct: 347 C---AAVAHGDVGKLVPPLVLHFGSGGGDV--AVPPENYWGPVDDSTACMVVFSSGGPNS 401
Query: 333 EAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
+ E IIG QD ++YD EK + ++P DC+++
Sbjct: 402 TLPMNETTIIGNYMQQDMHLLYDLEKGMLSFQPADCSSM 440
>gi|213998840|gb|ACJ60787.1| nucellin [Hordeum patagonicum subsp. magellanicum]
Length = 154
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 55/142 (38%), Positives = 80/142 (56%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EI+S + L + L+
Sbjct: 126 PAQIYNEILSKVRGTLSESSLE 147
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 102/389 (26%), Positives = 156/389 (40%), Gaps = 55/389 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT--GCTKPP----EKQYKPHKNIVPC 69
YF V L VG P K F DTGSDLTW+QC+ P T + PP +K +PC
Sbjct: 27 YF-VELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPC 85
Query: 70 SNPRCAALHWPNPPRC--KHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS------ 121
++ C L P C K P+ CDY Y D + G L + ++ S
Sbjct: 86 TDDECLFLPAPIGSSCSIKSPS-PCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAGN 144
Query: 122 -------VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIG 174
+ NV L GC L +GVLGLG+G IS+ +Q R L +
Sbjct: 145 HKTRTIRIKNVAL--GCSRESVGASFLG---ASGVLGLGQGPISLATQTRHTAL-GGIFS 198
Query: 175 HCIGQNGRG---VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC----- 226
+C+ RG FL G+ +A TP+++N A Y + + GK
Sbjct: 199 YCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIAS 258
Query: 227 ------GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG 280
G + IFDSG + +Y Y +++ + + + P+ +C+
Sbjct: 259 SDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEG--FELCY-- 314
Query: 281 PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENN 340
VT K + + + +P Y+V+ C+ + + +N
Sbjct: 315 ------NVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTN--GSN 366
Query: 341 IIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
I+G + QD + YD K RIG+K C+
Sbjct: 367 ILGNLLQQDHHIEYDLAKARIGFKWSPCH 395
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 159/386 (41%), Gaps = 48/386 (12%)
Query: 4 SWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH 63
S IE P F + L +G PP+ + DTGSDL W QC PCT C + P
Sbjct: 84 SEIEAPVLPGNGEFLMKLAIGTPPETYSAILDTGSDLIWTQCK-PCTQCFHQSTPIFDPK 142
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
K+ + L P N+ C+Y YGD S+ G L ++ L F SV
Sbjct: 143 KSSSFSKLSCSSQLCEALPQ--SSCNNGCEYLYSYGDYSSTQGILASE--TLTFGKASVP 198
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE----YGLIRNVIGHCIGQ 179
NV FGCG + G AG++GLGRG +S+VSQL+E Y L +
Sbjct: 199 NV--AFGCGADNEGSG---FSQGAGLVGLGRGPLSLVSQLKEPKFSYCLTT------VDD 247
Query: 180 NGRGVLFLG---DGKVPSSGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLK 229
L +G SS + TP++ + A Y L G L + L+
Sbjct: 248 TKTSTLLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQ 307
Query: 230 DL---TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
D LI DSG + Y + +V+ I P+ + L +C+ P G
Sbjct: 308 DDGSGGLIIDSGTTITYLEESAFN-LVAKEFTAKINLPVD-SSGSTGLDVCFTLPS---G 362
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEI 345
L F + L +P E Y++ V CL + GS + + +I G +
Sbjct: 363 STNIEVPKLVFHF----DGADLELPAENYMIGDSSMGVACLAM--GSSSGM---SIFGNV 413
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTL 371
Q+ +V++D EK+ + + P C+ L
Sbjct: 414 QQQNMLVLHDLEKETLSFLPTQCDLL 439
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 92/377 (24%), Positives = 150/377 (39%), Gaps = 45/377 (11%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPC 69
+ F V + +G PP+ DTGSDLTW+Q + PC C + + + P K N + C
Sbjct: 22 YGEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSE-PCRACFEQADPIFDPSKSSTYNKIAC 80
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
S+ CA L C + C Y YGDG + G + + G + F
Sbjct: 81 SSSACADLLGTQ--TCSAAAN-CIYAYGYGDGSVTRGYFSKETITATDTAGE----EVKF 133
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNGRGV 184
G + +N G G+LGLG+G +S+ SQL ++ N +C+ +
Sbjct: 134 GA--SVYNTGTFGDTGGEGILGLGQGPVSMPSQLGS--VLGNKFSYCLVDWLSAGSETST 189
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LI 234
++ GD VPS V +TP++ N+ +Y + + G + I
Sbjct: 190 MYFGDAAVPSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTI 249
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG + Y V+ +V+ + P + L RG P
Sbjct: 250 IDSGTTITYLQQEVFNALVAAYTSQ-VRYPTTTSATGLDLCFNTRGT----------GSP 298
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
+ + T + V L +P + +CL + + + I G I Q+ ++Y
Sbjct: 299 VFPAMTIHLDGVHLELPTANTFISLETNIICLAFASALDFPIA---IFGNIQQQNFDIVY 355
Query: 355 DNEKQRIGWKPEDCNTL 371
D + RIG+ P DC +L
Sbjct: 356 DLDNMRIGFAPADCASL 372
>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
Length = 528
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 98/372 (26%), Positives = 152/372 (40%), Gaps = 44/372 (11%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGC-----TKPPEKQYKPHKN----IV 67
+ +G P F DTGS+L W+ C+ AP T +Y P + +
Sbjct: 104 IDIGTPSVSFLVALDTGSNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVF 163
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPL------RFSNG 120
CS+ C + C+ P +QC Y + Y G SS G LV D+ L R NG
Sbjct: 164 LCSHKLCDS-----ASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNG 218
Query: 121 SV-FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
S + GCG Q L G++GLG IS+ S L + GL+RN C +
Sbjct: 219 SSSVKARVVIGCGKKQSG-DYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDE 277
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQ-NSADLKHYILGPAELLYSGKSC-GLKDLTLIFDS 237
G ++ GD + S TP LQ ++ YI+G E G SC T DS
Sbjct: 278 EDSGRIYFGD--MGPSIQQSTPFLQLDNNKYSGYIVG-VEACCIGNSCLKQTSFTTFIDS 334
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
G S+ Y +Y+++ I R + T + W +++ + + L
Sbjct: 335 GQSFTYLPEEIYRKVALEIDRHINATSKNFE------GVSWEYCYESSAEPK--VPAIKL 386
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNE 357
F++ N+ + P + G CL I + +G IG+ +M+ +++D E
Sbjct: 387 KFSH-NNTFVIHKPLFVFQQSQGLVQFCLPISPSGQEGIGS---IGQNYMRGYRMVFDRE 442
Query: 358 KQRIGWKPEDCN 369
++GW P C
Sbjct: 443 NMKLGWSPSKCQ 454
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 97/377 (25%), Positives = 156/377 (41%), Gaps = 45/377 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK-----PPEKQYKPHKNIVPCSN 71
+ + +G P + DTGSD+ WV+C +PC C PP Y + +
Sbjct: 83 YYTEIGLGNPVQKLKVIVDTGSDILWVKC-SPCRSCLSKQDIIPPLSIYNLSASSTSSVS 141
Query: 72 PRCAALHWPNPPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
L C N C Y Y D +S+GA V D G+ + F
Sbjct: 142 SCSDPLCTGEEVVCSRSGNNSACAYVSSYQDKSASVGAYVRDDMHYVLHGGNATTSRIFF 201
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD 189
GC N P+ G++G G ++ +Q+ + V HC+G G L
Sbjct: 202 GCATNITGSWPVD-----GIMGFGLISKTVPNQIATQRNMSRVFSHCLGGEKHGGGILEF 256
Query: 190 GKVP-SSGVAWTPMLQN-----------SADLKHYILGPAELLYSGKSCGLKDLTLIFDS 237
G+ P ++ + +TP+L S + K + P E Y S + +I DS
Sbjct: 257 GEAPNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPIDPKEFSYVRNST--NNTGVIIDS 314
Query: 238 GASYAYFTSR----VYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
G ++ T++ ++QEI SL T KL P + L + K+ + F
Sbjct: 315 GTTFVLLTTKANRMLFQEIKSL-------TTAKLGPKLEGLECFY---LKSGLTMETSFP 364
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVI 353
+ L+F+ + + P+ YLV++ K G + G I GEI ++DK+V
Sbjct: 365 NVTLTFSG---GSTMKLKPDNYLVMAEYKKKRNGYCYAWSSADGLT-IFGEIVLKDKLVF 420
Query: 354 YDNEKQRIGWKPEDCNT 370
YD E +RIGWK ++C++
Sbjct: 421 YDVENRRIGWKGQNCSS 437
>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
Length = 530
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 106/372 (28%), Positives = 153/372 (41%), Gaps = 45/372 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ------YKPH---- 63
F ++A+ +TVG P + F DTGSDL W+ C C GCT P Y P
Sbjct: 114 FLHYAL-VTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPASAASGSASFYIPSMSST 170
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSNG-- 120
VPC++ C C QC Y++ Y SS G LV D+ L +
Sbjct: 171 SQAVPCNSQFCELRK-----ECS-TTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIP 224
Query: 121 SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ + FGCG Q L G+ GLG ISI S L + GL N C ++
Sbjct: 225 QILKAQILFGCGQVQTG-SFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRD 283
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGAS 240
G G + GD SS TP+ N Y + +E+ G S + + IFD+G S
Sbjct: 284 GIGRISFGDQG--SSDQEETPLDVNPQH-PTYTISISEMTV-GNSLTDLEFSTIFDTGTS 339
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK---ALGQVTEYFKPLAL 297
+ Y Y I + + A D R PF+ L + + ++
Sbjct: 340 FTYLADPAYTYITQSFHAQVHAN--RHAADS-------RIPFEYCYDLSSSEDRIQTPSI 390
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
S SV V+ + I + V CL I+ ++ NIIG+ FM V++D
Sbjct: 391 SLRTVGGSVFPVIDEGQVISIQQHEYVYCLAIVKSAKL-----NIIGQNFMTGLRVVFDR 445
Query: 357 EKQRIGWKPEDC 368
E++ +GWK +C
Sbjct: 446 ERKILGWKKFNC 457
>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
Length = 518
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 103/375 (27%), Positives = 154/375 (41%), Gaps = 51/375 (13%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPH----KNIV 67
+++G P K F DTGSDL WV CD AP G T + + Y P V
Sbjct: 105 TTVSLGTPGKKFLVALDTGSDLFWVPCDCSRCAPTEGTTYASDFELSIYNPKGSSTSRKV 164
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNG--SVFN 124
C N CA + RC C Y + Y +S G LV D+ L +
Sbjct: 165 TCDNSLCAHRN-----RCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTTEDNRQEFVE 219
Query: 125 VPLTFGCGYNQHNPG-PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+TFGCG Q ++ P+ G+ GLG +IS+ S L + G + C G +G G
Sbjct: 220 AYVTFGCGQVQTGSFLDIAAPN--GLFGLGLEKISVPSILSKEGFTADSFSMCFGPDGIG 277
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
+ GD P TP N+ + I + G + D T +FDSG S+ Y
Sbjct: 278 RISFGDKGSPDQ--EETPFNLNALHPTYNIT--VTQVRVGTTLIDLDFTALFDSGTSFTY 333
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK-----ALGQVTEYFKPLALS 298
+Y ++ D P R PF+ + G+ T ++S
Sbjct: 334 LVDPIYTNVLK---------SFHSQAQDSRRPPDSRIPFEFCYDMSPGENTSLIP--SMS 382
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNV--CLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
T + S V P ++IS + + C+ ++ +E NIIG+ FM +I+D
Sbjct: 383 LTMKGGSQFPVYDP--IIIISSQSELIYCMAVVRSAEL-----NIIGQNFMTGYRIIFDR 435
Query: 357 EKQRIGWKPEDCNTL 371
EK +GWK +C+ +
Sbjct: 436 EKLVLGWKEFECDDI 450
>gi|213998806|gb|ACJ60770.1| nucellin [Hordeum flexuosum]
Length = 136
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 54/130 (41%), Positives = 75/130 (57%), Gaps = 5/130 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSLVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVS 254
+++Y EIVS
Sbjct: 126 PAQIYNEIVS 135
>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
Length = 530
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 106/372 (28%), Positives = 153/372 (41%), Gaps = 45/372 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ------YKPH---- 63
F ++A+ +TVG P + F DTGSDL W+ C C GCT P Y P
Sbjct: 114 FLHYAL-VTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPASAASGSASFYIPSMSST 170
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSNG-- 120
VPC++ C C QC Y++ Y SS G LV D+ L +
Sbjct: 171 SQAVPCNSQFCELRK-----ECS-TTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIP 224
Query: 121 SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ + FGCG Q L G+ GLG ISI S L + GL N C ++
Sbjct: 225 QILKAQILFGCGQVQTG-SFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRD 283
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGAS 240
G G + GD SS TP+ N Y + +E+ G S + + IFD+G S
Sbjct: 284 GIGRISFGDQG--SSDQEETPLDVNPQH-PTYTISISEITV-GNSLTDLEFSTIFDTGTS 339
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK---ALGQVTEYFKPLAL 297
+ Y Y I + + A D R PF+ L + + ++
Sbjct: 340 FTYLADPAYTYITQSFHAQVHAN--RHAADS-------RIPFEYCYDLSSSEDRIQTPSI 390
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
S SV V+ + I + V CL I+ ++ NIIG+ FM V++D
Sbjct: 391 SLRTVGGSVFPVIDEGQVISIQQHEYVYCLAIVKSAKL-----NIIGQNFMTGLRVVFDR 445
Query: 357 EKQRIGWKPEDC 368
E++ +GWK +C
Sbjct: 446 ERKILGWKKFNC 457
>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
Length = 530
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 106/372 (28%), Positives = 153/372 (41%), Gaps = 45/372 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ------YKPH---- 63
F ++A+ +TVG P + F DTGSDL W+ C C GCT P Y P
Sbjct: 114 FLHYAL-VTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPASAASGSASFYIPSMSST 170
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSNG-- 120
VPC++ C C QC Y++ Y SS G LV D+ L +
Sbjct: 171 SQAVPCNSQFCELRK-----ECS-TTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIP 224
Query: 121 SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ + FGCG Q L G+ GLG ISI S L + GL N C ++
Sbjct: 225 QILKAQILFGCGQVQTG-SFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRD 283
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGAS 240
G G + GD SS TP+ N Y + +E+ G S + + IFD+G S
Sbjct: 284 GIGRISFGDQG--SSDQEETPLDVNPQH-PTYTISISEITV-GNSLTDLEFSTIFDTGTS 339
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK---ALGQVTEYFKPLAL 297
+ Y Y I + + A D R PF+ L + + ++
Sbjct: 340 FTYLADPAYTYITQSFHAQVHAN--RHAADS-------RIPFEYCYDLSSSEDRIQTPSI 390
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
S SV V+ + I + V CL I+ ++ NIIG+ FM V++D
Sbjct: 391 SLRTVGGSVFPVIDEGQVISIQQHEYVYCLAIVKSAKL-----NIIGQNFMTGLRVVFDR 445
Query: 357 EKQRIGWKPEDC 368
E++ +GWK +C
Sbjct: 446 ERKILGWKKFNC 457
>gi|308080924|ref|NP_001183009.1| uncharacterized protein LOC100501329 [Zea mays]
gi|238008766|gb|ACR35418.1| unknown [Zea mays]
Length = 205
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 55/123 (44%), Positives = 68/123 (55%), Gaps = 4/123 (3%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
FP Y+ ++ +G PP+ + D DTGSDLTW+QCDAPCT C K P YKP K IVP
Sbjct: 85 FPDGQYY-TSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPP 143
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C L N C+ QCDYEIEY D SS+G L D + +NG + F
Sbjct: 144 RDLLCQELQG-NQNYCETCK-QCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLDFVF 201
Query: 130 GCG 132
GC
Sbjct: 202 GCA 204
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 147/365 (40%), Gaps = 40/365 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ L +G P + DTGS LTW+QC C + Y P + VPCS
Sbjct: 134 YVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPCSAS 193
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+C L NP C N C Y+ YGD S+G L D + F +GS N +G
Sbjct: 194 QCDELQAATLNPSACSVRN-VCIYQASYGDSSFSVGYLSRDT--VSFGSGSYPN--FYYG 248
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG Q N G +AG++GL R ++S++ QL + +C+ +L G
Sbjct: 249 CG--QDNEGLFG--RSAGLIGLARNKLSLLYQLAPS--LGYSFSYCL-PTPASTGYLSIG 301
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYAYFT 245
S ++TPM +S D Y + + + G + L I DSG
Sbjct: 302 PYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEYSSLPTIIDSGTVITRLP 361
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP-LALSFTNRRN 304
+ VY + + ++G ++ AP L C++ GQ ++ P +A++F
Sbjct: 362 TAVYTALSKAVAAAMVG--VQSAPAFSILDTCFQ------GQASQLRVPAVAMAFA---G 410
Query: 305 SVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
L + + L+ CL A IIG Q V+YD + RIG+
Sbjct: 411 GATLKLATQNVLIDVDDSTTCLAF-----APTDSTTIIGNTQQQTFSVVYDVAQSRIGFA 465
Query: 365 PEDCN 369
C+
Sbjct: 466 AGGCS 470
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 108/378 (28%), Positives = 155/378 (41%), Gaps = 54/378 (14%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT-------KPPEK---QYKPHKNI---- 66
+ +G P F D GSDL+WV CD C C KP ++ +Y+P +
Sbjct: 106 IDIGTPNVSFLVALDAGSDLSWVPCD--CIQCAPLSASLYKPLDRDLSEYRPSLSTTSRH 163
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGD-GGSSIGALVTDLFPLRF------SN 119
+ C++ C CK+ D C Y +Y D SS G LV D+ L S
Sbjct: 164 LSCNHQLCEL-----GSHCKNLKDPCPYIADYADPNTSSSGFLVEDILHLASVSDDSNST 218
Query: 120 GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
+ GCG Q G L GV+GLG G IS+ S L + GLIR C
Sbjct: 219 QKRVQASVILGCGRKQ-TGGYLDGAAPDGVMGLGPGSISVPSLLAKAGLIRKSFSLCFDV 277
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC----GLKDLTLIF 235
NG G + GD S TP+L + Y++ E G SC G K L
Sbjct: 278 NGSGTILFGDQGHTSQKS--TPLLPTQGNYDAYLI-EVESYCVGNSCLKQSGFKALV--- 331
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSGAS+ Y VY +IV + D +++ C+ K L V +
Sbjct: 332 DSGASFTYLPIDVYNKIV--LEFDKQVNAQRISSQGGPWNYCYNTSSKQLDNV----PAM 385
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKNV--CLGILNGSEAEVGENNIIGEIFMQDKMVI 353
LSF ++ L++ Y V ++ CL L ++ G IIG+ +M V+
Sbjct: 386 RLSFLMNQS---LLIHNSTYYVPQNQEFAVFCL-TLQPTDLNYG---IIGQNYMTGYRVV 438
Query: 354 YDNEKQRIGWKPEDCNTL 371
+D E ++GW +C +
Sbjct: 439 FDMENLKLGWSSSNCKDI 456
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 99/381 (25%), Positives = 163/381 (42%), Gaps = 46/381 (12%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT-GCTKPPEKQ--YKPHKNIVPCSNPRCA 75
++L++G PP+ +F S +WV C + C CT Q +PC +P C+
Sbjct: 1 MDLSLGTPPQPLNFTLAVDSGFSWVACSSSCAINCTTASLFQPGLSTSHTKLPCGSPSCS 60
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
A + C P+ C Y YG SS G LV+D+ + L+ GCG +
Sbjct: 61 AFSAVST-SCG-PSSSCSYNTSYGTNFSSAGDLVSDIATMDSVRNRKVAANLSLGCG--R 116
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG-RGVLFLGDGKVP- 193
+ G L DT+G +G +G +S + QL G R+ +C+ + RG L +G+ K+
Sbjct: 117 DSGGLLELLDTSGFVGFDKGNVSFMGQLSALGY-RSKFIYCLPSDTFRGKLVIGNYKLRN 175
Query: 194 ---SSGVAWTPMLQNSADLKHYILG-------------PAELLYSGKSCGLKDLTLIFDS 237
SS +A+TPM+ N + Y + P + S + G + D+
Sbjct: 176 ASISSSMAYTPMITNPQAAELYFINLSTISIDKNKFQVPIQGFLSNGTGG-----TVIDT 230
Query: 238 GASYAYFTSRVYQEIVSLI---MRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
+Y TS Y ++V I +L+ +A D + +C+ + +++ P
Sbjct: 231 TTFLSYLTSDFYTQLVQAIKNYTTNLVEVSSSVA-DALGVELCYN-----ISANSDFPPP 284
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISGRKN--VCLGILNGSEAEVGEN-NIIGEIFMQDKM 351
L++ + + V L S N +C+ I G VG N N+IG D
Sbjct: 285 ATLTY-HFLGGAGVEVSTWFLLDDSDSVNNTICMAI--GRSESVGPNLNVIGTYQQLDLT 341
Query: 352 VIYDNEKQRIGWKPEDCNTLL 372
V YD E+ R G+ + CNT +
Sbjct: 342 VEYDLEQMRYGFGAQGCNTTM 362
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 97/387 (25%), Positives = 156/387 (40%), Gaps = 54/387 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP------EKQYKPHKNIVPCS 70
+++TVG PP+ DTGS+L+W+ C+ T P Y P + CS
Sbjct: 66 LTISITVGTPPQNMSMVIDTGSELSWLHCNTNTTATIPYPFFNPNISSSYTP----ISCS 121
Query: 71 NPRCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
+P C +P P C N+ C + Y D SS G L +D F GS FN +
Sbjct: 122 SPTCTTRTRDFPIPASCDS-NNLCHATLSYADASSSEGNLASDTFGF----GSSFNPGIV 176
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFL 187
FGC + ++ S +T G++G+ G +S+VSQL+ +CI G + G+L L
Sbjct: 177 FGCMNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIPKF-----SYCISGSDFSGILLL 231
Query: 188 GDGKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------------- 233
G+ G + +TP++Q S L ++ + G K L +
Sbjct: 232 GESNFSWGGSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHTGAG 291
Query: 234 --IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK------TLPICWRGPFKAL 285
+FD G ++Y VY + + GT L DD + +C+R P
Sbjct: 292 QTMFDLGTQFSYLLGPVYNALRDEFLNQTNGTLRAL--DDPNFVFQIAMDLCYRVPVNQ- 348
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV---ISGRKNVCLGILNGSEAEVGENNII 342
+E + ++S +R+ Y V + G +V S+ E II
Sbjct: 349 ---SELPELPSVSLVFEGAEMRVFGDQLLYRVPGFVWGNDSVYCFTFGNSDLLGVEAFII 405
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDCN 369
G Q + +D + R+G C+
Sbjct: 406 GHHHQQSMWMEFDLVEHRVGLAHARCD 432
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 105/378 (27%), Positives = 160/378 (42%), Gaps = 55/378 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVP----CSNP 72
+ + L +G PP + DTGSDL W QC PCT C K P + P K+ C +
Sbjct: 108 YLMELAIGTPPVSYPAVLDTGSDLIWTQC-KPCTQCYKQPTPIFDPKKSSSFSKVSCGSS 166
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+A+ C +D C+Y YGD + G L T+ F S V + FGCG
Sbjct: 167 LCSAVPSST---C---SDGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCG 220
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGD 189
+ G +G++GLGRG +S+VSQL+E +C+ +L LG
Sbjct: 221 EDNEGDG---FEQASGLVGLGRGPLSLVSQLKE-----PRFSYCLTPMDDTKESILLLGS 272
Query: 190 -GKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDS 237
GKV + V TP+L+N Y L + ++ T +I DS
Sbjct: 273 LGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDS 332
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT----LPICWRGPFKALGQVTEYFK 293
G + Y + ++ + ++ I + KL P DKT L +C+ P G
Sbjct: 333 GTTITYIEQKAFEA----LKKEFI-SQTKL-PLDKTSSTGLDLCFSLPS---GSTQVEIP 383
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVI 353
+ F L +P E Y++ G N LG+ + +I G + Q+ +V
Sbjct: 384 KIVFHFKGG----DLELPAENYMI--GDSN--LGVACLAMGASSGMSIFGNVQQQNILVN 435
Query: 354 YDNEKQRIGWKPEDCNTL 371
+D EK+ I + P C+ L
Sbjct: 436 HDLEKETISFVPTSCDQL 453
>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
Length = 499
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 105/371 (28%), Positives = 149/371 (40%), Gaps = 43/371 (11%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
F ++A+ +TVG P + F DTGSDL W+ C C GCT P +P +
Sbjct: 106 FLHYAL-VTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPATAASGSATFYIPGMSST 162
Query: 74 CAALHWPNPPRCKHPND-----QCDYEIEYGDGG-SSIGALVTDLFPLRFSNG--SVFNV 125
A+ N C + QC Y++ Y G SS G LV D+ L N +
Sbjct: 163 SKAVPC-NSNFCDLQKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKA 221
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
+ GCG Q L G+ GLG +S+ S L + GL N C G++G G +
Sbjct: 222 QIMLGCGQTQTG-SFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRI 280
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIFDSGASY 241
GD SS TP+ N + I SG + G K D IFD+G S+
Sbjct: 281 SFGDQG--SSDQEETPLNINQQHPTYAI------TISGITIGNKPTDLDFITIFDTGTSF 332
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK---ALGQVTEYFKPLALS 298
Y Y I + + A D R PF+ L F +
Sbjct: 333 TYLADPAYTYITQSFHAQVQAN--RHAADS-------RIPFEYCYDLSSSEARFPIPDII 383
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNE 357
S+ V+ P + I + V CL I+ + NIIG+ FM V++D E
Sbjct: 384 LRTVSGSLFPVIDPGQVISIQEHEYVYCLAIVKSRKL-----NIIGQNFMTGLRVVFDRE 438
Query: 358 KQRIGWKPEDC 368
++ +GWK +C
Sbjct: 439 RKILGWKKFNC 449
>gi|356496606|ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 508
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 105/379 (27%), Positives = 153/379 (40%), Gaps = 56/379 (14%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP--EKQYKPHKNI----- 66
F +FA N++VG PP F DTGSDL W+ C+ CT C K NI
Sbjct: 99 FLHFA-NVSVGTPPLSFLVALDTGSDLFWLPCN--CTKCVHGIGLSNGEKIAFNIYDLKG 155
Query: 67 ------VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPL--RF 117
V C++ C +C + C YE+ Y +G S+ G LV D+ L
Sbjct: 156 SSTSQPVLCNSSLCELQR-----QCPSSDTICPYEVNYLSNGTSTTGFLVEDVLHLITDD 210
Query: 118 SNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
+ +TFGCG Q L G+ GLG S+ S L + GL N C
Sbjct: 211 DKTKDADTRITFGCGQVQ-TGAFLDGAAPNGLFGLGMSNESVPSILAKEGLTSNSFSMCF 269
Query: 178 GQNGRGVLFLGD------GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL 231
G +G G + GD GK P + A P Y + +++ K L +
Sbjct: 270 GSDGLGRITFGDNSSLVQGKTPFNLRALHPT---------YNITVTQIIVGEKVDDL-EF 319
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI--CWRGPFKALGQVT 289
IFDSG S+ Y Y++I + + I LP C+ + Q
Sbjct: 320 HAIFDSGTSFTYLNDPAYKQITNSFNSE-IKLQRHSTSSSNELPFEYCYE---LSPNQTV 375
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
E L+++ T + LV P + G +CLG+L + NIIG+ FM
Sbjct: 376 E----LSINLTMKGGDNYLVTDPIVTVSGEGINLLCLGVLKSNNV-----NIIGQNFMTG 426
Query: 350 KMVIYDNEKQRIGWKPEDC 368
+++D E +GW+ +C
Sbjct: 427 YRIVFDRENMILGWRESNC 445
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 167/383 (43%), Gaps = 54/383 (14%)
Query: 12 PIFSY---FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI-- 66
PI +Y + + L +G PP DTGSDL WVQC PC GC + P K+
Sbjct: 56 PINAYIGQYLMELYIGTPPIKISGTVDTGSDLIWVQC-VPCLGCYNQINPMFDPLKSSTY 114
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+ C +P C + P C P +CDY Y D + G L + L + G +
Sbjct: 115 TNISCDSPLC---YKPYIGECS-PEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPIS 170
Query: 125 VP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL------REYG-----LIRNV 172
+ + FGCG+N N G + + G++GLG G S+VSQ+ +++ + ++
Sbjct: 171 LQGILFGCGHN--NTGNFNDHE-MGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDI 227
Query: 173 IGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY---ILG-PAELLYSGKSCGL 228
G+G LG+ GV TP++Q D+ Y +LG E Y + +
Sbjct: 228 TISSQMSFGKGSEVLGE------GVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTI 281
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL--PICWRGPFKALG 286
+ ++ DSG ++Y + + + PL+ DD +L +C+R G
Sbjct: 282 EKGNMLVDSGTPPNILPQQLYDRVYVEVKNKV---PLEPITDDPSLGPQLCYRTQTNLKG 338
Query: 287 -QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEI 345
+T +F+ L T ++ +PP + CL I N + ++ G I G
Sbjct: 339 PTLTYHFEGANLLLT----PIQTFIPPTP----ETKGVFCLAITNCANSDPG---IYGNF 387
Query: 346 FMQDKMVIYDNEKQRIGWKPEDC 368
+ ++ +D ++Q + +KP DC
Sbjct: 388 AQTNYLIGFDLDRQIVSFKPTDC 410
>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 500
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 103/369 (27%), Positives = 149/369 (40%), Gaps = 39/369 (10%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
F ++A+ +TVG P + F DTGSDL W+ C C GCT P +P +
Sbjct: 107 FLHYAL-VTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPATAASGSATFYIPGMSST 163
Query: 74 CAALHWPNPPRCKHPND-----QCDYEIEYGDGG-SSIGALVTDLFPLRFSNG--SVFNV 125
A+ N C + QC Y++ Y G SS G LV D+ L N +
Sbjct: 164 SKAVPC-NSNFCDLQKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKA 222
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
+ GCG Q L G+ GLG +S+ S L + GL N C G++G G +
Sbjct: 223 QIMLGCGQTQTG-SFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRI 281
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIFDSGASY 241
GD + SS TP+ N + I SG + G K D IFD+G S+
Sbjct: 282 SFGDQE--SSDQEETPLDINRQHPTYAI------TISGITVGNKPTDMDFITIFDTGTSF 333
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT-LPICWRGPFKALGQVTEYFKPLALSFT 300
Y Y I + + A D + C+ L F +
Sbjct: 334 TYLADPAYTYITQSFHAQVQAN--RHAADSRIPFEYCYD-----LSSSEARFPIPDIILR 386
Query: 301 NRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
S+ V+ P + I + V CL I+ + NIIG+ FM V++D E++
Sbjct: 387 TVTGSMFPVIDPGQVISIQEHEYVYCLAIVKSMKL-----NIIGQNFMTGLRVVFDRERK 441
Query: 360 RIGWKPEDC 368
+GWK +C
Sbjct: 442 ILGWKKFNC 450
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 115/381 (30%), Positives = 163/381 (42%), Gaps = 53/381 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V L VG P + DTGSDL W+QC PC C K + + P + +PC +
Sbjct: 129 YF-VRLGVGTPARSLFMVVDTGSDLPWLQCQ-PCKSCYKQADPIFDPRNSSSFQRIPCLS 186
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C AL + + +C Y++ YGDG S+G +DLF L + + + + FGC
Sbjct: 187 PLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKA---MSVAFGC 243
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL---REYGLIRNVIGHCI------GQNGR 182
G++ AG+LGLG G++S SQ+ N +C+
Sbjct: 244 GFDNEG----LFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSS 299
Query: 183 GVLFLGDGKVPSSGVAWTPMLQN-SADLKHYI------LGPAELLYSGKSCGLKDL---T 232
L G +PS+ A +P+L+N D +Y +G A+L S KS L
Sbjct: 300 SSLIFGAAAIPSTA-ALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGG 358
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLI---GTPLKLAPDDKTLPICWRGPFKALGQVT 289
+I DSG S F + VY I RD T L AP C+ KA V
Sbjct: 359 VIIDSGTSVTRFPTSVYATI-----RDAFRNATTNLPSAPRYSLFDTCYNFSGKASVDV- 412
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
L L F N L +PP YL+ I+ + CL S E+G IIG I Q
Sbjct: 413 ---PALVLHF---ENGADLQLPPTNYLIPINTAGSFCLAFAPTS-MELG---IIGNIQQQ 462
Query: 349 DKMVIYDNEKQRIGWKPEDCN 369
+ +D +K + + P+ C
Sbjct: 463 SFRIGFDLQKSHLAFAPQQCK 483
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 167/381 (43%), Gaps = 50/381 (13%)
Query: 12 PIFSY---FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI-- 66
PI++Y + + L++G PP DTGSDLTW C PC C K + P K+
Sbjct: 64 PIYAYLGHYLMELSIGTPPFKIYGIADTGSDLTWTSC-VPCNNCYKQRNPMFDPQKSTTY 122
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+ C + C H + C P +C+Y Y + G L + L + G +
Sbjct: 123 RNISCDSKLC---HKLDTGVCS-PQKRCNYTYAYASAAITRGVLAQETITLSSTKGK--S 176
Query: 125 VPL---TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRN----VIGHC 176
VPL FGCG+N N G + + G++GLG G +S++SQ+ +G R V H
Sbjct: 177 VPLKGIVFGCGHN--NTGGFNDHE-MGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHT 233
Query: 177 IGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYI--LGPA----ELLYSGKSCGLKD 230
+ F KV GV TP++ +++ LG + L ++G S ++
Sbjct: 234 DVSVSSKMSFGKGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNVEK 293
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV-T 289
+ DSG +++Y ++V+ + ++ P+ PD +C+R G V T
Sbjct: 294 GNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGP-QLCYRTKNNLRGPVLT 352
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQ 348
+F+ + + P + + IS + V CLG N S + + G
Sbjct: 353 AHFEGADVKLS----------PTQTF--ISPKDGVFCLGFTNTSS----DGGVYGNFAQS 396
Query: 349 DKMVIYDNEKQRIGWKPEDCN 369
+ ++ +D ++Q + +KP+DC
Sbjct: 397 NYLIGFDLDRQVVSFKPKDCT 417
>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 520
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 99/375 (26%), Positives = 154/375 (41%), Gaps = 51/375 (13%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK----------QYKPHKNI---- 66
+ +G P F D GSDL W+ CD C C +Y P +++
Sbjct: 100 IDIGTPSTSFLVALDAGSDLLWIPCD--CVQCAPLSSSYYSNLDRDLNEYSPSRSLSSKH 157
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLR----FSNGS 121
+ CS+ C CK QC Y + Y + SS G LV D+ L+ SN S
Sbjct: 158 LSCSHQLCD-----KGSNCKSSQQQCPYMVSYLSENTSSSGLLVEDILHLQSGGSLSNSS 212
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
V P+ GCG Q G L G+LGLG G S+ S L + GLI + C ++
Sbjct: 213 V-QAPVVLGCGMKQSG-GYLDGVAPDGLLGLGPGESSVPSFLAKSGLIHDSFSLCFNEDD 270
Query: 182 RGVLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGA 239
G +F GD G ++ P+ YI+G E G SC + + DSG
Sbjct: 271 SGRIFFGDQGPTIQQSTSFLPL---DGLYSTYIIG-VESCCVGNSCLKMTSFKVQVDSGT 326
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
S+ + VY I + + G+ + + + C+ + L +V +L+
Sbjct: 327 SFTFLPGHVYGAIAEEFDQQVNGS--RSSFEGSPWEYCYVPSSQELPKVP------SLTL 378
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNV---CLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
T ++N+ +V P V G + V CL I + G+ IG+ FM +++D
Sbjct: 379 TFQQNNSFVVYDP--VFVFYGNEGVIGFCLAI----QPTEGDMGTIGQNFMTGYRLVFDR 432
Query: 357 EKQRIGWKPEDCNTL 371
+++ W +C L
Sbjct: 433 GNKKLAWSRSNCQDL 447
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 160/370 (43%), Gaps = 41/370 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +G PP DTGSDL WVQC +PC C ++P K+ C +
Sbjct: 90 YLMRFYIGTPPVERLATADTGSDLIWVQC-SPCASCFPQSTPLFQPLKSSTFMPTTCRSQ 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGS-SIGALVTDLFPLRF-SNGSVFNVPLT-- 128
C L P C + +C Y +YGD S S G L T+ LRF S G V V
Sbjct: 149 PCTLL-LPEQKGCGK-SGECIYTYKYGDQYSFSEGLLSTET--LRFDSQGGVQTVAFPNS 204
Query: 129 -FGCG-YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRG 183
FGCG YN P G++GLG G +S+VSQ+ + I + +C +G
Sbjct: 205 FFGCGLYNNITVFP--SYKLTGIMGLGAGPLSLVSQIGDQ--IGHKFSYCLLPLGSTSTS 260
Query: 184 VLFLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSC--GLKDLTLIFDSGAS 240
L G+ + + GV TPM+ +Y L + + K+ G D +I DSG
Sbjct: 261 KLKFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVPTGSTDGNVIIDSGTL 320
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT-LPICWRGPFKALGQVTEYFKPLALSF 299
Y Y + + L ++L D + LP C+ P++ F +A F
Sbjct: 321 LTYLGESFYYNFAASLQESL---AVELVQDVLSPLPFCF--PYRD----NFVFPEIAFQF 371
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
T R S++ P +++ R VCL I + + V +I G D V YD E +
Sbjct: 372 TGARVSLK---PANLFVMTEDRNTVCLMI---APSSVSGISIFGSFSQIDFQVEYDLEGK 425
Query: 360 RIGWKPEDCN 369
++ ++P DC+
Sbjct: 426 KVSFQPTDCS 435
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 105/389 (26%), Positives = 165/389 (42%), Gaps = 69/389 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V+L +G PP+ DTGSDL W QC APC C P+ + P ++ + C+
Sbjct: 96 YVVDLAIGTPPQPVSALLDTGSDLIWTQC-APCASCLSQPDPLFAPGQSASYEPMRCAGT 154
Query: 73 RCA-ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS---NGSVFNVPLT 128
C+ LH C+ P D C Y YGDG ++G T+ F S + VPL
Sbjct: 155 LCSDILHHS----CERP-DTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLG 209
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGV 184
FGCG N G L+ + +G++G GR +S+VSQL IR +C+ + +
Sbjct: 210 FGCG--SVNVGSLN--NGSGIVGFGRNPLSLVSQLS----IRR-FSYCLTSYASRRQSTL 260
Query: 185 LF--LGDGKV--PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------- 233
LF L DG + V TP+LQ+ + Y + ++G + G + L +
Sbjct: 261 LFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVH-----FTGLTVGARRLRIPESAFAL 315
Query: 234 --------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA--PDDKT---LPICWRG 280
I DSG + + V E+V R + P P+D +P WR
Sbjct: 316 RPDGSGGVIVDSGTALTLLPAAVLAEVVR-AFRQQLRLPFANGGNPEDGVCFLVPAAWR- 373
Query: 281 PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRK-NVCLGILNGSEAEVGEN 339
++ + L F L +P Y++ R+ +CL + + + +
Sbjct: 374 --RSSSTSQMPVPRMVLHF----QGADLDLPRRNYVLDDHRRGRLCLLLADSGD----DG 423
Query: 340 NIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+ IG + QD V+YD E + + P C
Sbjct: 424 STIGNLVQQDMRVLYDLEAETLSIAPARC 452
>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 71/214 (33%), Positives = 108/214 (50%), Gaps = 27/214 (12%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----V 67
PI + L +G PP+ F+ DTGSD+ WV C + C GC + P + +
Sbjct: 77 PISRIYYTTLQIGTPPREFNVVIDTGSDVLWVSCIS-CVGCPLQNVTFFDPGASSSAVKL 135
Query: 68 PCSNPRC-AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV- 125
CS+ RC + LH K +Y++EY DG + G ++DL S V
Sbjct: 136 ACSDKRCFSDLHK------KSGCSPLEYKVEYSDGSFTSGYYISDLISFETVMSSNLTVK 189
Query: 126 ---PLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--G 178
P FGC N H G +S P+T+ G++GLG+GR+ +VSQL L V C+ G
Sbjct: 190 SSAPFVFGCS-NLH-AGLISLPETSIHGIVGLGKGRLLVVSQLSSQRLAPEVFSLCLSGG 247
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY 212
Q G GV+ LG+ ++P++ +TP++++ HY
Sbjct: 248 QEGGGVIILGENRLPNT--VYTPLVRSQT---HY 276
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 99/365 (27%), Positives = 147/365 (40%), Gaps = 38/365 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P ++ V C+ P
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAP 239
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L N C C Y ++YGDG SIG D L S ++ F G
Sbjct: 240 ACSDL---NIHGCS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 289
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
+ N G + AG+LGLGRG+ S+ V +YG V HC+ G G L G
Sbjct: 290 CGERNEGLFG--EAAGLLGLGRGKTSLPVQTYDKYG---GVFAHCLPARSTGTGYLDFGA 344
Query: 190 GKVPSSGVAW-TPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAY 243
G + ++ TPML + +Y+ G + G+ + I DSG
Sbjct: 345 GSLAAARARLTTPMLTENGPTFYYV-GMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITR 403
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
Y + + K AP L C+ F + QV ++L F +
Sbjct: 404 LPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCY--DFTGMSQVA--IPTVSLLF---Q 456
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
RL V + + VCL + + G+ I+G ++ V YD K+ +G+
Sbjct: 457 GGARLDVDASGIMYAASASQVCLAF--AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 514
Query: 364 KPEDC 368
P C
Sbjct: 515 YPGAC 519
>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
gi|219887047|gb|ACL53898.1| unknown [Zea mays]
gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 416
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 145/364 (39%), Gaps = 42/364 (11%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAALHWP 80
+TVG P + F DTGSDL W+ C C GCT P +P + A+
Sbjct: 11 VTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPATAASGSATFYIPGMSSTSKAVPC- 67
Query: 81 NPPRCKHPND-----QCDYEIEYGDGG-SSIGALVTDLFPLRFSNG--SVFNVPLTFGCG 132
N C + QC Y++ Y G SS G LV D+ L N + + GCG
Sbjct: 68 NSNFCDLQKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKAQIMLGCG 127
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
Q L G+ GLG +S+ S L + GL N C G++G G + GD +
Sbjct: 128 QTQTG-SFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISFGDQE- 185
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIFDSGASYAYFTSRV 248
SS TP+ N + I SG + G K D IFD+G S+ Y
Sbjct: 186 -SSDQEETPLDINRQHPTYAI------TISGITVGNKPTDMDFITIFDTGTSFTYLADPA 238
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK---ALGQVTEYFKPLALSFTNRRNS 305
Y I + + A D R PF+ L F + S
Sbjct: 239 YTYITQSFHAQVQAN--RHAADS-------RIPFEYCYDLSSSEARFPIPDIILRTVTGS 289
Query: 306 VRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
+ V+ P + I + V CL I+ + NIIG+ FM V++D E++ +GWK
Sbjct: 290 MFPVIDPGQVISIQEHEYVYCLAIVKSMKL-----NIIGQNFMTGLRVVFDRERKILGWK 344
Query: 365 PEDC 368
+C
Sbjct: 345 KFNC 348
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 110/384 (28%), Positives = 170/384 (44%), Gaps = 54/384 (14%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IV 67
P+ Y ++L +G PP+ DTGSDL W QC PC C Y ++ +
Sbjct: 87 PMTEYL-LHLAIGTPPQPVQLTLDTGSDLVWTQCQ-PCAVCFNQSLPYYDASRSSTFALP 144
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
C + +C P+ C + Q C + YGD ++IG L D+ + F G+ +VP
Sbjct: 145 SCDSTQCKL--DPSVTMCVNQTVQTCAFSYSYGDKSATIGFL--DVETVSFVAGA--SVP 198
Query: 127 -LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
+ FGCG N N G +T G+ G GRG +S+ SQL+ G + G+ VL
Sbjct: 199 GVVFGCGLN--NTGIFRSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTAVSGRKPSTVL 254
Query: 186 FLGDGKVPSSG---VAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDLT--L 233
F + +G V TP+++N A LK +G L + LK+ T
Sbjct: 255 FDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGT 314
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGPFKALGQVTE 290
I DSG ++ RVY+ ++ D +KL P ++T P +C+ P LG+
Sbjct: 315 IIDSGTAFTSLPPRVYR-----LVHDEFAAHVKLPVVPSNETGPLLCFSAP--PLGKAPH 367
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVIS---GRKNVCLGILNGSEAEVGENNIIGEIFM 347
K L L F + +P E Y+ + G ++CL I+ GE IIG
Sbjct: 368 VPK-LVLHF----EGATMHLPRENYVFEAKDGGNCSICLAIIE------GEMTIIGNFQQ 416
Query: 348 QDKMVIYDNEKQRIGWKPEDCNTL 371
Q+ V+YD + ++ + C+ L
Sbjct: 417 QNMHVLYDLKNSKLSFVRAKCDKL 440
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 96/377 (25%), Positives = 163/377 (43%), Gaps = 47/377 (12%)
Query: 12 PIFSYFAVNLTVGKPP-KLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---- 66
P + ++ +VG PP K++ F DTGS++ W+QC PC C + P K+
Sbjct: 84 PELGEYLISYSVGTPPFKVYGF-MDTGSNIVWLQCQ-PCNTCFNQTSPIFNPSKSSSYKN 141
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
+PC++ C + + C + D C+Y I YG S G L D L ++GS P
Sbjct: 142 IPCTSSTCKDTNDTH-ISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFP 200
Query: 127 -LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQN 180
+ GCG H ++GV+G+GRG +S++ Q+ + + +C+ N
Sbjct: 201 NIVIGCG---HINVLQDNSQSSGVVGMGRGPMSLIKQVGSSS-VGSKFSYCLIPYNSDSN 256
Query: 181 GRGVLFLGDGKVPSSG-VAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDLT 232
L G+ V S V TPM++ + +Y L G + Y G+
Sbjct: 257 SSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEY-GERSNASTQN 315
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG--QVTE 290
++ DSG + ++VS + ++ + P ++ P D L +C+ K L +T
Sbjct: 316 ILIDSGTPLTMLPNLFLSKLVSYVAQE-VKLP-RIEPPDHHLSLCYNTTGKQLNVPDITA 373
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDK 350
+F + + NS P E + +C G ++ + E I G I +
Sbjct: 374 HFNGADV----KLNSNGTFFPFEDGI-------MCFGFISSNGLE-----IFGNIAQNNL 417
Query: 351 MVIYDNEKQRIGWKPED 367
++ YD EK+ I +KP D
Sbjct: 418 LIDYDLEKEIISFKPTD 434
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 100/388 (25%), Positives = 154/388 (39%), Gaps = 53/388 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT--GCTKPP----EKQYKPHKNIVPC 69
YF V L VG P K F DTGSDLTW+QC+ P T + PP +K +PC
Sbjct: 59 YF-VELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPC 117
Query: 70 SNPRCAALHWPNPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS------- 121
++ C L P C + CDY Y D + G L + ++ S
Sbjct: 118 TDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAGNH 177
Query: 122 ------VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGH 175
+ NV L GC L +GVLGLG+G IS+ +Q R L + +
Sbjct: 178 KTRRIRIKNVAL--GCSRESVGASFLG---ASGVLGLGQGPISLATQTRHTAL-GGIFSY 231
Query: 176 CIGQNGRG---VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC------ 226
C+ RG FL G+ +A TP+++N A Y + + GK
Sbjct: 232 CLVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASS 291
Query: 227 -----GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP 281
G + IFDSG + +Y Y +++ + + + P+ +C+
Sbjct: 292 DWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEG--FELCY--- 346
Query: 282 FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNI 341
VT K + + + +P Y+V+ C+ + + +NI
Sbjct: 347 -----NVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTN--GSNI 399
Query: 342 IGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+G + QD + YD K RIG+K C+
Sbjct: 400 LGNLLQQDHHIEYDLAKARIGFKWSPCH 427
>gi|213998848|gb|ACJ60790.1| nucellin [Psathyrostachys stoloniformis]
Length = 154
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 54/142 (38%), Positives = 78/142 (54%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD P+ GV W PM ++ L +Y G A L + G +FDSG++Y Y
Sbjct: 69 YVGDFNPPTRGVTWVPMRES---LFYYSPGLAALFIDKQPIRGNPTFEAVFDSGSTYTYM 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y E+VS I L + L+
Sbjct: 126 PAQIYNELVSKIRGTLSESSLE 147
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 102/370 (27%), Positives = 155/370 (41%), Gaps = 42/370 (11%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAA 76
+TVG + DTGSDLTWVQC PC C E + P + +PC++P C A
Sbjct: 68 VTVGIGGQNSTLIVDTGSDLTWVQC-LPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVA 126
Query: 77 LH--WPNPPRCKHPND-QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
L + C + N CDY+I+YGDG S G L + L G FGCG
Sbjct: 127 LQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTL----GKTEIDNFIFGCGR 182
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDG 190
N N G +G++GL R +S+VSQ L +V +C+ G G L LG
Sbjct: 183 N--NKGLFG--GASGLMGLARSELSLVSQTSS--LFGSVFSYCLPTTGVGSSGSLTLGGA 236
Query: 191 KVPS----SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT------LIFDSGAS 240
+ S +++T M+QN Y L + G + + L+ + DSG
Sbjct: 237 DFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTV 296
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
+ +Y+ + + G + P L C+ L E P + F
Sbjct: 297 ITRLSPSIYKAFKAEFEKQFSG--YRTTPGFSILNTCFN-----LTGYEEVNIP-TVKFI 348
Query: 301 NRRNSVRLV-VPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
N+ +V V Y V S +CL S + IIG +++ VIY++++
Sbjct: 349 FEGNAEMIVDVEGVFYFVKSDASQICLAF--ASLGYEDQTMIIGNYQQKNQRVIYNSKES 406
Query: 360 RIGWKPEDCN 369
++G+ E C+
Sbjct: 407 KVGFAGEPCS 416
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 102/370 (27%), Positives = 155/370 (41%), Gaps = 42/370 (11%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAA 76
+TVG + DTGSDLTWVQC PC C E + P + +PC++P C A
Sbjct: 147 VTVGIGGQNSTLIVDTGSDLTWVQC-LPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVA 205
Query: 77 LH--WPNPPRCKHPND-QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
L + C + N CDY+I+YGDG S G L + L G FGCG
Sbjct: 206 LQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTL----GKTEIDNFIFGCGR 261
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDG 190
N N G +G++GL R +S+VSQ L +V +C+ G G L LG
Sbjct: 262 N--NKGLFG--GASGLMGLARSELSLVSQTSS--LFGSVFSYCLPTTGVGSSGSLTLGGA 315
Query: 191 KVPS----SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT------LIFDSGAS 240
+ S +++T M+QN Y L + G + + L+ + DSG
Sbjct: 316 DFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTV 375
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
+ +Y+ + + G + P L C+ L E P + F
Sbjct: 376 ITRLSPSIYKAFKAEFEKQFSG--YRTTPGFSILNTCFN-----LTGYEEVNIP-TVKFI 427
Query: 301 NRRNSVRLV-VPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
N+ +V V Y V S +CL S + IIG +++ VIY++++
Sbjct: 428 FEGNAEMIVDVEGVFYFVKSDASQICLAF--ASLGYEDQTMIIGNYQQKNQRVIYNSKES 485
Query: 360 RIGWKPEDCN 369
++G+ E C+
Sbjct: 486 KVGFAGEPCS 495
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 102/384 (26%), Positives = 162/384 (42%), Gaps = 49/384 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPEKQYKPHKNIVP---CS 70
YF V+L +G+PP+ DTGSDL WV+C A C C+ P + H + C
Sbjct: 84 YF-VDLRIGQPPQSLLLIADTGSDLVWVKCSA-CRNCSHHSPATVFFPRHSSTFSPAHCY 141
Query: 71 NPRCAALHWPN-PPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV-P 126
+P C + P+ P C H + C YE Y DG + G + L+ S+G +
Sbjct: 142 DPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKS 201
Query: 127 LTFGCGY--NQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQNG-- 181
+ FGCG+ + + S GV+GLGRG IS SQL R +G N +C+
Sbjct: 202 VAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFG---NKFSYCLMDYTLS 258
Query: 182 ---RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK--------- 229
L +G+G S + +TP+L N Y + + +G +
Sbjct: 259 PPPTSYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDS 318
Query: 230 -DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
+ + DSG + A+ Y+ +++ + R +KL D P F V
Sbjct: 319 GNGGTVVDSGTTLAFLAEPAYRSVIAAVRRR-----VKLPIADALTP-----GFDLCVNV 368
Query: 289 TEYFKPLA----LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGE 344
+ KP L F +V V PP Y + + + CL I + +VG ++IG
Sbjct: 369 SGVTKPEKILPRLKFEFSGGAV-FVPPPRNYFIETEEQIQCLAI-QSVDPKVG-FSVIGN 425
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDC 368
+ Q + +D ++ R+G+ C
Sbjct: 426 LMQQGFLFEFDRDRSRLGFSRRGC 449
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 113/386 (29%), Positives = 161/386 (41%), Gaps = 61/386 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V+L +G PP+ DTGSDL W QC PC C P + ++ ++PC +
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCK-PCVSCFDQPLPYFDTSRSSTNALLPCEST 93
Query: 73 RCAALHWPNPPRCKHPN---DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LT 128
+C P C N C Y YGD +IG L D F F G+ ++P +T
Sbjct: 94 QCKL--DPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKF--TFVAGT--SLPGVT 147
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG 188
FGCG N N G + +T G+ G GRG +S+ SQL+ G + G VL
Sbjct: 148 FGCGLN--NTGVFNSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTTITGAIPSTVLLDL 203
Query: 189 DGKVPSSG---VAWTPMLQ---NSAD-------LKHYILGPAELLYSGKSCGLKDLT--L 233
+ S+G V TP++Q N A+ LK +G L + L + T
Sbjct: 204 PADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGT 263
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGPFKALGQVTE 290
I DSG S +VYQ ++RD +KL P + T C+ P +A V +
Sbjct: 264 IIDSGTSITSLPPQVYQ-----VVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPK 318
Query: 291 YFKPLALSFTNR-----RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEI 345
L L F R + VP +A G +CL I G E IIG
Sbjct: 319 ----LVLHFEGATMDLPRENYVFEVPDDA-----GNSIICLAINKGD-----ETTIIGNF 364
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTL 371
Q+ V+YD + + + C+ L
Sbjct: 365 QQQNMHVLYDLQNNMLSFVAAQCDKL 390
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 111/377 (29%), Positives = 159/377 (42%), Gaps = 47/377 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V L +G P + DTGSDL W+QC PC C K + + P + +PC +
Sbjct: 54 YF-VRLGLGTPARSLFMVVDTGSDLPWLQCQ-PCKSCYKQADPIFDPRNSSSFQRIPCLS 111
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C AL + + +C Y++ YGDG S+G +DLF L + + + + FGC
Sbjct: 112 PLCKALEVHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKA---MSVAFGC 168
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL---REYGLIRNVIGHCI------GQNGR 182
G++ AG+LGLG G++S SQ+ N +C+
Sbjct: 169 GFDNEG----LFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSS 224
Query: 183 GVLFLGDGKVPSSGVAWTPMLQN-SADLKHYI------LGPAELLYSGKSCGLKDL---T 232
L G +PS+ A +P+L+N D +Y +G A+L S KS L
Sbjct: 225 SSLIFGVAAIPSTA-ALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGG 283
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
+I DSG S F + VY I I P AP C+ KA V
Sbjct: 284 VIIDSGTSVTRFPTSVYATIRDAFRNATINLP--SAPRYSLFDTCYNFSGKASVDV---- 337
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
L L F N L +PP YL+ I+ + CL S E IIG I Q
Sbjct: 338 PALVLHF---ENGADLQLPPTNYLIPINTAGSFCLAFAPTSM----ELGIIGNIQQQSFR 390
Query: 352 VIYDNEKQRIGWKPEDC 368
+ +D +K + + P+ C
Sbjct: 391 IGFDLQKSHLAFAPQQC 407
>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 545
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 102/384 (26%), Positives = 161/384 (41%), Gaps = 51/384 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCD---------APCTGCTKPPEKQYKPHKNI 66
Y+A + +G P F DTGSDL WV CD A TG PP + Y P ++
Sbjct: 110 YYA-EVELGTPNATFLVALDTGSDLFWVPCDCRQCATIPSANATGPDAPPLRPYSPRRSS 168
Query: 67 ----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRF---- 117
V C NP C + + N C YE++Y SS G LV D+ L
Sbjct: 169 TSEQVACDNPLCGRRNGCS----AATNGSCPYEVQYVSANTSSSGVLVQDVLHLTRERPG 224
Query: 118 --SNGSVFNVPLTFGCGYNQHNP------GPLSPPDTAGVLGLGRGRISIVSQLREYGLI 169
+ G P+ FGCG Q G + G++GLG G++S+ S L GL+
Sbjct: 225 PGAAGEALQAPVVFGCGQVQTGAFLDDGGGAVD-----GLMGLGMGKVSVPSALAASGLV 279
Query: 170 -RNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL 228
+ C G +G G + GD S G A TP S + + + + G
Sbjct: 280 ASDSFSMCFGDDGVGRVNFGDAG--SRGQAETPFTVRSLNPTYNV--SFTSIGIGSESVA 335
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
+ + DSG S+ Y + Y ++ + + + + P + ++
Sbjct: 336 AEFAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFS-SGSADPFPFEYCYRLSPNQ 394
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVI---SGRK-NVCLGILNGSEAEVGENNIIGE 344
TE P +S T + ++ V P ++ + +GR CL I+ ++ +G + IIG+
Sbjct: 395 TEVAMP-DVSLTAKGGALFPVTQP--FIPVGDTTGRAIGYCLAIMR-NDMAIGID-IIGQ 449
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDC 368
FM V++D E+ +GW+ DC
Sbjct: 450 NFMTGLKVVFDRERSVLGWEKFDC 473
>gi|213998845|gb|ACJ60789.1| nucellin [Psathyrostachys fragilis subsp. fragilis]
Length = 150
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 54/142 (38%), Positives = 78/142 (54%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 7 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSKGKGVL 66
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD P+ GV W PM ++ L +Y G A L + G +FDSG++Y Y
Sbjct: 67 YVGDFNPPTRGVTWVPMRES---LFYYSPGLAALFIDKQPIRGNPTFEAVFDSGSTYTYV 123
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y E+VS I L + L+
Sbjct: 124 PAQIYNELVSKIRGTLSESSLE 145
>gi|213998810|gb|ACJ60772.1| nucellin [Hordeum comosum]
Length = 154
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 55/142 (38%), Positives = 79/142 (55%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSLVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDS ++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSDSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS + L + L+
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLE 147
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 141/363 (38%), Gaps = 37/363 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P + V C+ P
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAP 239
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L C C Y ++YGDG SIG D L S ++ F G
Sbjct: 240 ACSDLDVSG---CS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 289
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDG 190
+ N G + AG+LGLGRG+ S+ ++ YG V HC+ G G L G G
Sbjct: 290 CGERNDGLFG--EAAGLLGLGRGKTSL--PVQTYGKYGGVFAHCLPPRSTGTGYLDFGAG 345
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASYAYFT 245
P++ TPML + +Y+ G + G+ I DSG
Sbjct: 346 SPPAT--TTTPMLTGNGPTFYYV-GMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLP 402
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
Y + S + + A L C+ F + QV ++L F +
Sbjct: 403 PAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCY--DFTGMSQVA--IPTVSLLF---QGG 455
Query: 306 VRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKP 365
L V + VCL + G+ I+G ++ V YD K+ +G+ P
Sbjct: 456 AALDVDASGIMYTVSASQVCLAFAGNEDG--GDVGIVGNTQLKTFGVAYDIGKKVVGFSP 513
Query: 366 EDC 368
C
Sbjct: 514 GAC 516
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 100/368 (27%), Positives = 153/368 (41%), Gaps = 41/368 (11%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNPRCAA 76
+T+G K DTGSDLTWVQC+ PC C +KP V C++ C +
Sbjct: 67 VTMGLGSKNMTVIIDTGSDLTWVQCE-PCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQS 125
Query: 77 LHWP--NPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
L + N C N C+Y + YGDG + G L + G V FGCG
Sbjct: 126 LQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSF----GGVSVSDFVFGCGR 181
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDG 190
N N G +G++GLGR +S+VSQ V +C+ G L +G+
Sbjct: 182 N--NKGLFG--GVSGLMGLGRSYLSLVSQTN--ATFGGVFSYCLPTTEAGSSGSLVMGNE 235
Query: 191 KV---PSSGVAWTPMLQNSADLKHYILGPAELLYSGKS----CGLKDLTLIFDSGASYAY 243
++ + +T ML N YIL + G + + ++ DSG
Sbjct: 236 SSVFKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLSFGNGGILIDSGTVITR 295
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
S VY+ + + ++ G P AP L C F G ++L F
Sbjct: 296 LPSSVYKALKAEFLKKFTGFP--SAPGFSILDTC----FNLTGYDEVSIPTISLRF---E 346
Query: 304 NSVRLVVPPEA--YLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRI 361
+ +L V Y+V VCL + + S+A + IIG +++ VIYD ++ ++
Sbjct: 347 GNAQLNVDATGTFYVVKEDASQVCLALASLSDAY--DTAIIGNYQQRNQRVIYDTKQSKV 404
Query: 362 GWKPEDCN 369
G+ E C+
Sbjct: 405 GFAEEPCS 412
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 91/336 (27%), Positives = 141/336 (41%), Gaps = 41/336 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F D+GS +T+V C A C C + +++P + V C N
Sbjct: 88 YYTTRLYIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQPDLSSSYSPVKC-N 145
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C C QC YE +Y + SS G L D+ + F S FG
Sbjct: 146 VDCT---------CDSDKKQCTYERQYAEMSSSSGVLGEDI--VSFGRESELKAQRAVFG 194
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C G L G++GLGRG++SI+ QL E G+I + C G G G + LG
Sbjct: 195 C--ENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGAMVLG 252
Query: 189 DGKVPSSGVAWTPMLQNSADLK--HYILGPAELLYSGKSCGLKDLTL------IFDSGAS 240
PS V S L+ +Y + E+ +GK+ + + DSG +
Sbjct: 253 GVPTPSDMV-----FSRSDPLRSPYYNIELKEIHVAGKALRVDSRIFDSKHGTVLDSGTT 307
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
YAY + + + + PD IC+ G + + ++ E F + + F
Sbjct: 308 YAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSYKDICFAGARRNVSKLHEVFPDVDMVFG 367
Query: 301 NRRNSVRLVVPPEAYLVISGRKN--VCLGIL-NGSE 333
N + +L + PE YL + + CLG+ NG +
Sbjct: 368 NGQ---KLSLTPENYLFRHSKVDGAYCLGVFQNGKD 400
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 141/363 (38%), Gaps = 37/363 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P + V C+ P
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAP 238
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L C C Y ++YGDG SIG D L S ++ F G
Sbjct: 239 ACSDLDVSG---CS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 288
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDG 190
+ N G + AG+LGLGRG+ S+ ++ YG V HC+ G G L G G
Sbjct: 289 CGERNDGLFG--EAAGLLGLGRGKTSL--PVQTYGKYGGVFAHCLPARSTGTGYLDFGAG 344
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASYAYFT 245
P++ TPML + +Y+ G + G+ I DSG
Sbjct: 345 SPPAT--TTTPMLTGNGPTFYYV-GMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLP 401
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
Y + S + + A L C+ F + QV ++L F +
Sbjct: 402 PAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCY--DFTGMSQVA--IPTVSLLF---QGG 454
Query: 306 VRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKP 365
L V + VCL + G+ I+G ++ V YD K+ +G+ P
Sbjct: 455 AALDVDASGIMYTVSASQVCLAFAGNEDG--GDVGIVGNTQLKTFGVAYDIGKKVVGFSP 512
Query: 366 EDC 368
C
Sbjct: 513 GAC 515
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 164/383 (42%), Gaps = 64/383 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT----------KPPEKQYKPHKNI 66
+ + L++G PP+L DTGSDL W++CD C C YK
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDN-CDHCDLDHHGETIFFSDASSSYKK---- 59
Query: 67 VPCSNPRCAALHWPN-PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS---- 121
+PC++ C+ + PRC+ + C Y+ EYGDG + G + +D R S+G+
Sbjct: 60 LPCNSTHCSGMSSAGIGPRCE---ETCKYKYEYGDGSRTSGDVGSDRISFR-SHGAGEDH 115
Query: 122 -VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE---YGLIRNVIGHCI 177
F FGCG T G++GLG+ S++ QL + Y ++ +
Sbjct: 116 RSFFDGFLFGCGRKLKGDWNF----TQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDS 171
Query: 178 GQNGRGVLFLG-DGKVPSSGVAWTPMLQNS--------ADLKHYILGPAELLYSGKSCG- 227
+ + LFLG + V TP+L DL+ +G ++ K G
Sbjct: 172 PPSAKSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGH 231
Query: 228 -------LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG 280
L + T+I DSG +Y T VY+ + I +I L + L +C
Sbjct: 232 NTSVGPFLANKTVI-DSGTTYTLLTPPVYEAMRKSIEEQVI---LPTLGNSAGLDLC--- 284
Query: 281 PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENN 340
F + G + F + F N+ V+LV+P E ++ R VCL + ++ G+ +
Sbjct: 285 -FNSSGDTSYGFPSVTFYFANQ---VQLVLPFENIFQVTSRDVVCLSM----DSSGGDLS 336
Query: 341 IIGEIFMQDKMVIYDNEKQRIGW 363
IIG + Q+ ++YD +I +
Sbjct: 337 IIGNMQQQNFHILYDLVASQISF 359
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 99/381 (25%), Positives = 156/381 (40%), Gaps = 44/381 (11%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAALH 78
+ L +G K DTGS+ VQC + P Q VPC + C A+
Sbjct: 1 MQLGIGSLQKNLSAIIDTGSEAVLVQCGSRSRPVFDPAASQSYRQ---VPCISQLCLAVQ 57
Query: 79 WP----NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP---LTFGC 131
+ C + + C Y + YGD +S G D+ L +N S V + FGC
Sbjct: 58 QQTSNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVAFGC 117
Query: 132 GYNQHNP-GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN-----GRGVL 185
H+P G L + G++G RG +S+ SQL++ L + +C GV+
Sbjct: 118 A---HSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDR-LGGSKFSYCFPSQPWQPRATGVI 173
Query: 186 FLGDGKVPSSGVAWTPMLQN---SADLKHYILGPAELLYSGKSCGL-----------KDL 231
FLGD + S V++TP+L N A + Y +G + GK+ + D
Sbjct: 174 FLGDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDG 233
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
+ DSG ++ Y + K C+ + G
Sbjct: 234 GTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYN---ISAGSSLPG 290
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKN---VCLGILNGSEAEVGENNIIGEIFM 347
+ LS +N+VRL + E V +S N VCL IL+ ++ G+ N++G
Sbjct: 291 VPEVRLSL---QNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQ 347
Query: 348 QDKMVIYDNEKQRIGWKPEDC 368
+ +V YDNE+ R+G++ DC
Sbjct: 348 SNYLVEYDNERSRVGFERADC 368
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 140/363 (38%), Gaps = 37/363 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P + V C+ P
Sbjct: 183 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAP 242
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L C C Y ++YGDG SIG D L S ++ F G
Sbjct: 243 ACSDLDVSG---CS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 292
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDG 190
+ N G + AG+LGLGRG+ S+ Q YG V HC+ G G L G G
Sbjct: 293 CGERNDGLFG--EAAGLLGLGRGKTSLPVQ--TYGKYGGVFAHCLPARSTGTGYLDFGAG 348
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASYAYFT 245
P++ TPML + +Y+ G + G+ I DSG
Sbjct: 349 SPPAT--TTTPMLTGNGPTFYYV-GMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLP 405
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
Y + S + + A L C+ F + QV ++L F +
Sbjct: 406 PAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCY--DFTGMSQVA--IPTVSLLF---QGG 458
Query: 306 VRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKP 365
L V + VCL + G+ I+G ++ V YD K+ +G+ P
Sbjct: 459 AALDVDASGIMYTVSASQVCLAFAGNEDG--GDVGIVGNTQLKTFGVAYDIGKKVVGFSP 516
Query: 366 EDC 368
C
Sbjct: 517 GAC 519
>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 520
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 106/376 (28%), Positives = 150/376 (39%), Gaps = 51/376 (13%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKP--- 62
F ++A+ +TVG P + F DTGSDL W+ C C GCT P Y P
Sbjct: 107 FLHYAL-VTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPATAASGSFQATFYIPGMS 163
Query: 63 -HKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSNG 120
VPC++ C C QC Y++ Y G SS G LV D+ L N
Sbjct: 164 STSKAVPCNSNFCDLQK-----ECSTAL-QCPYKMVYVSAGTSSSGFLVEDVLYLSTENA 217
Query: 121 --SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
+ + GCG Q L G+ GLG +S+ S L + GL N C G
Sbjct: 218 HPQILKAQIMLGCGQTQTG-SFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFG 276
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLI 234
++G G + GD + SS TP+ N + I SG + G K D I
Sbjct: 277 RDGIGRISFGDQE--SSDQEETPLDINRQHPTYAI------TISGITVGNKPTDMDFITI 328
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT-LPICWRGPFKALGQVTEYFK 293
FD+G S+ Y Y I + + A D + C+ L F
Sbjct: 329 FDTGTSFTYLADPAYTYITQSFHAQVQAN--RHAADSRIPFEYCYD-----LSSSEARFP 381
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMV 352
+ S+ V+ P + I + V CL I+ + NIIG+ FM V
Sbjct: 382 IPDIILRTVTGSMFPVIDPGQVISIQEHEYVYCLAIVKSMKL-----NIIGQNFMTGLRV 436
Query: 353 IYDNEKQRIGWKPEDC 368
++D E++ +GWK +C
Sbjct: 437 VFDRERKILGWKKFNC 452
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 163/383 (42%), Gaps = 46/383 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPEKQYKPHK---NIVPCS 70
YF V+L +G PP+ DTGSDL WV+C +PC C+ P + H + + C
Sbjct: 86 YF-VSLRIGTPPQTLLLVADTGSDLIWVKC-SPCRNCSHRSPGSAFFARHSTTYSAIHCY 143
Query: 71 NPRCAALHWPNPPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-L 127
+P+C + P+P C + C Y+ Y D ++ G + L S G V + L
Sbjct: 144 SPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGKVKKLNGL 203
Query: 128 TFGCGYNQHNPG--PLSPPDTAGVLGLGRGRISIVSQL-REYG--LIRNVIGHCIGQNGR 182
+FGCG+ P S GV+GLGR IS SQL R +G ++ + +
Sbjct: 204 SFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYCLMDYTLSPPPT 263
Query: 183 GVLFLGDGK---VPSSGV-AWTPMLQNSADLKHYILGPAELLYSGKSC-------GLKDL 231
L +G + V G+ ++TP+L N Y + + +G + DL
Sbjct: 264 SFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWSIDDL 323
Query: 232 ---TLIFDSGASYAYFTSRVYQEIVSLIMRDL-IGTPLKLAPDDKTLPICWRGPFKALGQ 287
I DSG + + T Y EI+ + + + +P + P F
Sbjct: 324 GNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPG-----------FDLCMN 372
Query: 288 VTEYFKPL--ALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEI 345
V+ +P +SF SV PP Y + +G + CL + S+ G +++G +
Sbjct: 373 VSGVTRPALPRMSFNLAGGSV-FSPPPRNYFIETGDQIKCLAVQPVSQD--GGFSVLGNL 429
Query: 346 FMQDKMVIYDNEKQRIGWKPEDC 368
Q ++ +D +K R+G+ C
Sbjct: 430 MQQGFLLEFDRDKSRLGFTRRGC 452
>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 521
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 152/371 (40%), Gaps = 43/371 (11%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK----------QYKPHKNI---- 66
+ +G P F D GSDL W+ CD C C +Y P +++
Sbjct: 101 IDIGTPSTSFLVALDAGSDLLWIPCD--CVQCAPLSSSYYSNLDRDLNEYSPSRSLSSKH 158
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLR----FSNGS 121
+ CS+ C CK QC Y + Y + SS G LV D+ L+ SN S
Sbjct: 159 LSCSHRLC-----DKGSNCKSSQQQCPYMVSYLSENTSSSGLLVEDILHLQSGGTLSNSS 213
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
V P+ GCG Q G L G+LGLG G S+ S L + GLI C ++
Sbjct: 214 V-QAPVVLGCGMKQSG-GYLDGVAPDGLLGLGPGESSVPSFLAKSGLIHYSFSLCFNEDD 271
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGAS 240
G +F GD + P+S + T L YI+G E G SC + DSG S
Sbjct: 272 SGRMFFGD-QGPTSQQS-TSFLPLDGLYSTYIIG-VESCCIGNSCLKMTSFKAQVDSGTS 328
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
+ + VY I + + G+ + + + C+ + L +V + L F
Sbjct: 329 FTFLPGHVYGAITEEFDQQVNGS--RSSFEGSPWEYCYVPSSQDLPKVPSF----TLMF- 381
Query: 301 NRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
R NS + P + G CL IL +E ++G IG+ FM +++D ++
Sbjct: 382 QRNNSFVVYDPVFVFYGNEGVIGFCLAILP-TEGDMG---TIGQNFMTGYRLVFDRGNKK 437
Query: 361 IGWKPEDCNTL 371
+ W +C L
Sbjct: 438 LAWSRSNCQDL 448
>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 530
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 88/368 (23%), Positives = 143/368 (38%), Gaps = 37/368 (10%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP--------------HKNI 66
+ +G P F DTGSD+ WV CD C C Y
Sbjct: 106 IDIGTPNVSFLVALDTGSDMFWVPCD--CIECAPLSAAFYNALDRDLNQYSPSLSSSSRH 163
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGS--VF 123
+PC + C CK D+C Y EY D SS G L+ D L +N +
Sbjct: 164 LPCGHQLCN-----QNSNCKGFKDRCPYIKEYTSDNTSSSGFLIEDKLHLASNNATKNSI 218
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+ GCG Q L G+LGLG G IS+ + L + GLIRN I C+ + G G
Sbjct: 219 QASVILGCGRKQSGYF-LEGAAPNGMLGLGPGSISVPALLAKAGLIRNSISICLNEKGSG 277
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
+ GD + + TP L + +L +Y +G + D+G S+ Y
Sbjct: 278 RILFGDQGHATQRRS-TPFLLDDGELLNYFVGVERFCVGSFCYKETEFKAFIDTGTSFTY 336
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
VY+ +V+ + + T + + C + A + + F P+ +F+ +
Sbjct: 337 LPKGVYETVVAEFEKQVHATRIT-SQIQSDFNCC----YNASSRESNNFPPMKFTFSKNQ 391
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSEA--EVGENNIIG-EIFMQDKMVIYDNEKQR 360
+ ++ + +CL ++ + +G I + F+ +++D E R
Sbjct: 392 S---FIIQNPFISMDQEDTTICLAVVQSDDELITIGRKYTIACQNFLMGYDMVFDRENLR 448
Query: 361 IGWKPEDC 368
GW +C
Sbjct: 449 FGWFRSNC 456
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 75/258 (29%), Positives = 115/258 (44%), Gaps = 36/258 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----PEKQYKPHKNI----V 67
+ +++G PP+ F D DTGS++ WV+C APCTGC P + P K+ +
Sbjct: 41 YYTRISLGTPPQQFYVDVDTGSNVAWVKC-APCTGCEHSGDVPVPMSTFDPRKSTTKISI 99
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLF-----PLRFSNGSV 122
C++ C L+ +C C Y + YGDG S+ G + D+F P S
Sbjct: 100 SCTDAECGVLN--KKLQCSPERLSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSDNSTAKS 157
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN-- 180
L FGCG Q + G+LG G +S+ +QL + + N+ HC+ +
Sbjct: 158 GTARLVFGCGGTQTGSWSVD-----GLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVS 212
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK---DLT----L 233
GRG L +G + P + +TPM+ HY + + SG++ DL +
Sbjct: 213 GRGSLVIGTIREPD--LVYTPMVFGE---DHYNVQLLNIGISGRNVTTPASFDLEYTGGV 267
Query: 234 IFDSGASYAYFTSRVYQE 251
I DSG + Y Y E
Sbjct: 268 IIDSGTTLTYLVQPAYDE 285
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 152/384 (39%), Gaps = 71/384 (18%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCD-APCTGCTKPPEKQYKPHKN---- 65
FP F+ + V+L G PP+ DTGSD+TW QC P + C + P +
Sbjct: 83 FP-FTEYLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFA 141
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQ----CDYEIEYGDGGSSIGALVTDLFPLR--FSN 119
+PCS+P C P C ND C+Y I YGDG S G + ++F
Sbjct: 142 SLPCSSPACETT-----PPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGE 196
Query: 120 GSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
GS VP L FGCG+ N G + +T G+ G GRG +S+ SQL+ G + G
Sbjct: 197 GSSAAVPGLVFGCGH--ANRGVFTSNET-GIAGFGRGSLSLPSQLK-VGNFSHCFTTITG 252
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSG 238
VL G P S +P+ + + + +SG
Sbjct: 253 SKTSAVLLGLPGVAPPSA---SPLGRRRGSYR-----------------CRSTPRSSNSG 292
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGPFKALGQVTEYFKPL 295
S R Y+ + R+ +KL P + T P C+ P + KP
Sbjct: 293 TSITSLPPRTYRAV-----REEFAAQVKLPVVPGNATDPFTCFSAPLRGP-------KPD 340
Query: 296 ALSFTNRRNSVRLVVPPEAYL--------VISGRKNVCLGILNGSEAEVGENNIIGEIFM 347
+ + +P E Y+ + + +CL ++ G E I+G I
Sbjct: 341 VPTMALHFEGATMRLPQENYVFEVVDDDDAGNSSRIICLAVIEGGEI------ILGNIQQ 394
Query: 348 QDKMVIYDNEKQRIGWKPEDCNTL 371
Q+ V+YD + ++ + P C+ L
Sbjct: 395 QNMHVLYDLQNSKLSFVPAQCDQL 418
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 103/380 (27%), Positives = 159/380 (41%), Gaps = 52/380 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
F +++ +G P + DTGSDL W QC PC C K + P + VPCS+
Sbjct: 100 FLMDVAIGTPALSYAAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPCSSA 158
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L P +C Y YGD S+ G L ++ F L + V FGCG
Sbjct: 159 LCSDL----PTSTCTSASKCGYTYTYGDASSTQGVLASETFTLGKEKKKLPGV--AFGCG 212
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ----NGRGVLFLG 188
G AG++GLGRG +S+VSQL GL + +C+ +G+ L LG
Sbjct: 213 DTNEGDG---FTQGAGLVGLGRGPLSLVSQL---GLDK--FSYCLTSLDDGDGKSPLLLG 264
Query: 189 DGKVPSSG------VAWTPMLQNSADLKHY-------ILGPAELLYSGKSCGLKDL---T 232
S V TP+++N + Y +G + + ++D
Sbjct: 265 GSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGG 324
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
+I DSG S Y + Y+ + + + P + + L +C++GP K + +V
Sbjct: 325 VIVDSGTSITYLELQGYRALKKAFVAQM-ALP-TVDGSEIGLDLCFQGPAKGVDEV--QV 380
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVI-SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
L L F + L +P E Y+V+ S +CL + A +IIG Q+
Sbjct: 381 PKLVLHFDGGAD---LDLPAENYMVLDSASGALCLTV-----APSRGLSIIGNFQQQNFQ 432
Query: 352 VIYDNEKQRIGWKPEDCNTL 371
+YD + + P CN L
Sbjct: 433 FVYDVAGDTLSFAPVQCNKL 452
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 77/264 (29%), Positives = 117/264 (44%), Gaps = 39/264 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKNI-- 66
+ + +G P K + DTGSD+ WV C + C + P K Y P +
Sbjct: 33 YYTEIGIGTPTKRYYVQVDTGSDILWVNCIS----CDRCPRKSGLGLELTLYDPKDSSTG 88
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV-- 122
V C CAA + P C + C+Y + YGDG S+ G V+DL +G
Sbjct: 89 SKVSCDQGFCAATYGGLLPGCT-TSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQT 147
Query: 123 --FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N +TFGCG Q S G++G G+ S++SQL G ++ + HC+
Sbjct: 148 RPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTI 207
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG------PAELLYSGKSCGLK 229
NG G+ +G+ P V TP++ N + +LK +G P+ + +G+ G
Sbjct: 208 NGGGIFAIGNVVQPK--VKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKG-- 263
Query: 230 DLTLIFDSGASYAYFTSRVYQEIV 253
I DSG + Y VY+EI+
Sbjct: 264 ---TIIDSGTTLTYLPEIVYKEIM 284
>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 568
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 101/383 (26%), Positives = 159/383 (41%), Gaps = 56/383 (14%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC---------TKPPEKQYKPH- 63
F Y+A N++VG P F DTGSDL W+ C+ C+ C K Y P+
Sbjct: 102 FLYYA-NVSVGTPSLDFLVALDTGSDLFWLPCE--CSSCFTYLNTSNGGKFMLNHYSPND 158
Query: 64 ---KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSN 119
+ VPC++ C RC + C YE+ Y SSIG LV D+ L +
Sbjct: 159 STTSSTVPCTSSLCN--------RCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLATDD 210
Query: 120 GSV--FNVPLTFGCGYNQHNP-GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+ +TFGCG Q + P+ G++GLG +IS+ S L + GL N C
Sbjct: 211 SLLKPVEAKITFGCGTVQTGIFATTAAPN--GLIGLGMEKISVPSFLADQGLTSNSFSMC 268
Query: 177 IGQNGRGVLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIF 235
G +G G + GD G + ML+ + + ++ G T IF
Sbjct: 269 FGADGYGRIDFGDTGPADQKQTPFNTMLEYQSYNVTF-----NVINVGGEPNDVPFTAIF 323
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG S+ Y T Y I + + L + C+ P A + F+ L
Sbjct: 324 DSGTSFTYLTEPAYSTITKQMDAGMKLKRYSLFGPNFPFEYCYEIPPGA-----KEFQYL 378
Query: 296 ALSFTNRR------NSVRLVVPPEAY---LVISGRKNV-CLGILNGSEAEVGENNIIGEI 345
L+FT + + + +P + ++ +V CL I A+ + ++IG+
Sbjct: 379 TLNFTMKGGDEFTPTDIFVFLPVDVSTMNIIFEETTHVACLAI-----AKSTDIDLIGQN 433
Query: 346 FMQDKMVIYDNEKQRIGWKPEDC 368
FM + ++ ++ +GW DC
Sbjct: 434 FMTGYRITFNRDQMVLGWSSSDC 456
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 170/386 (44%), Gaps = 53/386 (13%)
Query: 17 FAVNLTVGKPP-KLFDFDFDTGSDLTWVQCDAPCTGCTKPP----EKQYKPHKNIVPCSN 71
F +++T+G PP K+F DTGSDLTWVQC PC C K +K+ PC +
Sbjct: 85 FFMSITIGTPPMKVFAI-ADTGSDLTWVQC-KPCQQCYKENGPIFDKKKSSTYKSEPCDS 142
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FG 130
C AL + C + C Y YGD S G + T+ + ++GS + P T FG
Sbjct: 143 RNCHALS-SSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSPVSFPGTVFG 201
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-----NGRGVL 185
CGYN G +G++GLG G +S++SQL I +C+ NG V+
Sbjct: 202 CGYNN---GGTFDETGSGIIGLGGGHLSLISQLGSS--ISKKFSYCLSHKSATTNGTSVI 256
Query: 186 FLGDGKVPS-----SGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKD---- 230
LG +PS SGV TP++ +Y+ +G ++ Y+G S D
Sbjct: 257 NLGTNSIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIPYTGSSYNPNDGGIF 316
Query: 231 ----LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
+I DSG + S + + + + +L+ +++ L C++ +G
Sbjct: 317 SETSGNIIIDSGTTLTLLDSGFFDKFGAAV-EELVTGAKRVSDPQGLLSHCFKSGSAEIG 375
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIF 346
+ + FT VRL P A++ +S VCL ++ +E I G
Sbjct: 376 -----LPEITVHFTGA--DVRL-SPINAFVKVS-EDMVCLSMVPTTEVA-----IYGNFA 421
Query: 347 MQDKMVIYDNEKQRIGWKPEDCNTLL 372
D +V YD E + + ++ DC+ L
Sbjct: 422 QMDFLVGYDLETRTVSFQRMDCSANL 447
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 102/386 (26%), Positives = 154/386 (39%), Gaps = 52/386 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK--QYKPHKNI----VPCS 70
V+L VG PP+ DTGS+L+W+ C AP G ++P ++ VPC
Sbjct: 66 LTVSLAVGTPPQNVTMVLDTGSELSWLLC-APGGGGGGGGRSALSFRPRASLTFASVPCD 124
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ +C + P+PP C + QC + Y DG SS GAL T++F + G + FG
Sbjct: 125 SAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTV----GQGPPLRAAFG 180
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-QNGRGVLFLGD 189
C + P TAG+LG+ RG +S VSQ +CI ++ GVL LG
Sbjct: 181 CMATAFDTSP-DGVATAGLLGMNRGALSFVSQAST-----RRFSYCISDRDDAGVLLLGH 234
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------------I 234
+P + +TP+ Q + L ++ + G G K L + +
Sbjct: 235 SDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTM 294
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD------KTLPICW-----RGPFK 283
DSG + + Y + + R P A +D + C+ R P
Sbjct: 295 VDSGTQFTFLLGDAYSALKAEFSRQT--KPWLPALNDPNFAFQEAFDTCFRVPQGRAPPA 352
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIG 343
L VT F ++ R + VP E G CL N + +IG
Sbjct: 353 RLPAVTLLFNGAQMTVAGDR--LLYKVPGERR---GGDGVWCLTFGNADMVPI-TAYVIG 406
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDCN 369
+ V YD E+ R+G P C+
Sbjct: 407 HHHQMNVWVEYDLERGRVGLAPIRCD 432
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 102/386 (26%), Positives = 154/386 (39%), Gaps = 52/386 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK--QYKPHKNI----VPCS 70
V+L VG PP+ DTGS+L+W+ C AP G ++P ++ VPC
Sbjct: 65 LTVSLAVGTPPQNVTMVLDTGSELSWLLC-APGGGGGGGGRSALSFRPRASLTFASVPCG 123
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ +C + P+PP C + QC + Y DG SS GAL T++F + G + FG
Sbjct: 124 SAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTV----GQGPPLRAAFG 179
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-QNGRGVLFLGD 189
C + P TAG+LG+ RG +S VSQ +CI ++ GVL LG
Sbjct: 180 CMATAFDTSP-DGVATAGLLGMNRGALSFVSQAST-----RRFSYCISDRDDAGVLLLGH 233
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------------I 234
+P + +TP+ Q + L ++ + G G K L + +
Sbjct: 234 SDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTM 293
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD------KTLPICW-----RGPFK 283
DSG + + Y + + R P A +D + C+ R P
Sbjct: 294 VDSGTQFTFLLGDAYSALKAEFSRQT--KPWLPALNDPNFAFQEAFDTCFRVPQGRAPPA 351
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIG 343
L VT F ++ R + VP E G CL N + +IG
Sbjct: 352 RLPAVTLLFNGAQMTVAGDR--LLYKVPGERR---GGDGVWCLTFGNADMVPI-TAYVIG 405
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDCN 369
+ V YD E+ R+G P C+
Sbjct: 406 HHHQMNVWVEYDLERGRVGLAPIRCD 431
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 160/364 (43%), Gaps = 36/364 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT-GCTKPPEKQYKPHKNI----VPCSN 71
+AV + +G P K F FDTGSDLTW QC+ PC+ GC ++++ P K+ + CS+
Sbjct: 132 YAVTVGLGTPKKDFSLLFDTGSDLTWTQCE-PCSGGCFPQNDEKFDPTKSTSYKNLSCSS 190
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C ++ + C N C Y ++YG G ++G L T+ + S+ VF GC
Sbjct: 191 EPCKSIGKESAQGCSSSN-SCLYGVKYGT-GYTVGFLATETLTITPSD--VFE-NFVIGC 245
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G + N G S TAG+LGLGR +++ SQ +N+ +C+ + L G
Sbjct: 246 G--ERNGGRFS--GTAGLLGLGRSPVALPSQTSS--TYKNLFSYCLPASSSSTGHLSFGG 299
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYAYFTS 246
S +TP+ +L Y L + + G+ + + I DSG + Y S
Sbjct: 300 GVSQAAKFTPITSKIPEL--YGLDVSGISVGGRKLPIDPSVFRTAGTIIDSGTTLTYLPS 357
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSV 306
+ + S + T L L C+ A +T +++ F V
Sbjct: 358 TAHSALSSAFQEMM--TNYTLTKGTSGLQPCYDFSKHANDNIT--IPQISIFF---EGGV 410
Query: 307 RLVVPPEA-YLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
+ + ++ +G + VCL NG++ +V I G + + V+YD K +G+
Sbjct: 411 EVDIDDSGIFIAANGLEEVCLAFKDNGNDTDVA---IFGNVQQKTYEVVYDVAKGMVGFA 467
Query: 365 PEDC 368
P C
Sbjct: 468 PGGC 471
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 92/391 (23%), Positives = 156/391 (39%), Gaps = 49/391 (12%)
Query: 1 MYVSWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQY 60
M I+ P + +NL++G PP DTGSDLTW QC PCT C K +
Sbjct: 76 MTSDGIQSRLVPSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQC-RPCTHCYKQVVPFF 134
Query: 61 KPHKNIV----PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR 116
P + C C AL N C++ +C + Y DG + G L + +
Sbjct: 135 DPKNSSTYRDSSCGTSFCLAL--GNDRSCRN-GKKCTFMYSYADGSFTGGNLAVETLTVA 191
Query: 117 FSNGSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGH 175
+ G + P FGC H G + ++G++GLG +S++SQL+ I +
Sbjct: 192 STAGKPVSFPGFAFGC---VHRSGGIFDEHSSGIVGLGVAELSMISQLKS--TINGRFSY 246
Query: 176 CI------GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYIL-------GPAELLYS 222
C+ + F G V +G TP++ D +Y++ G L Y
Sbjct: 247 CLLPVFTDSSMSSRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYK 306
Query: 223 G--KSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG 280
G K +++ +I DSG +Y Y Y ++ + + G ++ + +C+
Sbjct: 307 GFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGK--RVRDPNGISSLCYNT 364
Query: 281 PFKALGQ--VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGE 338
+ +T +FK + +R+ VC +L S+
Sbjct: 365 TVDQIDAPIITAHFKDANVELQPWNTFLRM-----------QEDLVCFTVLPTSDI---- 409
Query: 339 NNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
I+G + + +V +D K+R+ +K DC
Sbjct: 410 -GILGNLAQVNFLVGFDLRKKRVSFKAADCT 439
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 168/382 (43%), Gaps = 53/382 (13%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPC 69
+ + ++L +G PP+ DTGS L W QC PC C Y ++ + C
Sbjct: 32 MTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQ-PCAVCFNQSLPYYDASRSSTFALPSC 90
Query: 70 SNPRCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-L 127
+ +C P+ C + Q C Y YGD ++IG L D+ + F G+ +VP +
Sbjct: 91 DSTQCKL--DPSVTMCVNQTVQTCAYSYSYGDKSATIGFL--DVETVSFVAGA--SVPGV 144
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
FGCG N N G +T G+ G GRG +S+ SQL+ G + G+ VLF
Sbjct: 145 VFGCGLN--NTGIFRSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTAVSGRKPSTVLFD 200
Query: 188 GDGKVPSSG---VAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDLT--LIF 235
+ +G V TP+++N A LK +G L + LK+ T I
Sbjct: 201 LPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTII 260
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGPFKALGQVTEYF 292
DSG ++ RVY+ ++ D +KL P ++T P +C+ P LG+
Sbjct: 261 DSGTAFTSLPPRVYR-----LVHDEFAAHVKLPVVPSNETGPLLCFSAP--PLGKAPHVP 313
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVIS---GRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
K L L F + +P E Y+ + G ++CL I+ GE IIG Q+
Sbjct: 314 K-LVLHF----EGATMHLPRENYVFEAKDGGNCSICLAIIE------GEMTIIGNFQQQN 362
Query: 350 KMVIYDNEKQRIGWKPEDCNTL 371
V+YD + ++ + C+ L
Sbjct: 363 MHVLYDLKNSKLSFVRAKCDKL 384
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 106/377 (28%), Positives = 158/377 (41%), Gaps = 61/377 (16%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALH 78
+G P + DTGSDL W QC PC C K + P + VPCS+ C+ L
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLP 231
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCGYNQHN 137
+C + +C Y YGD S+ G L T+ F L S +P + FGCG
Sbjct: 232 T---SKCTSAS-KCGYTYTYGDSSSTQGVLATETFTLAKS-----KLPGVVFGCGDTNEG 282
Query: 138 PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLGD----- 189
G AG++GLGRG +S+VSQL GL + +C + L LG
Sbjct: 283 DG---FSQGAGLVGLGRGPLSLVSQL---GL--DKFSYCLTSLDDTNNSPLLLGSLAGIS 334
Query: 190 -GKVPSSGVAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKD---LTLIFDSG 238
+S V TP+++N + LK +G + + ++D +I DSG
Sbjct: 335 EASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSG 394
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQVTEYFKPL 295
S Y + Y+ ++ + L D + L +C+R P K + QV L
Sbjct: 395 TSITYLEVQGYRA-----LKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVE--VPRL 447
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKN-VCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
F + L +P E Y+V+ G +CL ++ GS +IIG Q+ +Y
Sbjct: 448 VFHFDGGAD---LDLPAENYMVLDGGSGALCLTVM-GSRGL----SIIGNFQQQNFQFVY 499
Query: 355 DNEKQRIGWKPEDCNTL 371
D + + P CN L
Sbjct: 500 DVGHDTLSFAPVQCNKL 516
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 110/384 (28%), Positives = 169/384 (44%), Gaps = 54/384 (14%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IV 67
P+ Y ++L +G PP+ DTGS L W QC PC C Y ++ +
Sbjct: 87 PMTEYL-LHLAIGTPPQPVQLTLDTGSVLVWTQCQ-PCAVCFNQSLPYYDASRSSTFALP 144
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
C + +C P+ C + Q C Y YGD ++IG L D+ + F G+ +VP
Sbjct: 145 SCDSTQCKL--DPSVTMCVNQTVQTCAYSYSYGDKSATIGFL--DVETVSFVAGA--SVP 198
Query: 127 -LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
+ FGCG N N G +T G+ G GRG +S+ SQL+ G + G+ VL
Sbjct: 199 GVVFGCGLN--NTGIFRSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTAVSGRKPSTVL 254
Query: 186 FLGDGKVPSSG---VAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDLT--L 233
F + +G V TP+++N A LK +G L + LK+ T
Sbjct: 255 FDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGT 314
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGPFKALGQVTE 290
I DSG ++ RVY+ ++ D +KL P ++T P +C+ P LG+
Sbjct: 315 IIDSGTAFTSLPPRVYR-----LVHDEFAAHVKLPVVPSNETGPLLCFSAP--PLGKAPH 367
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVIS---GRKNVCLGILNGSEAEVGENNIIGEIFM 347
K L L F + +P E Y+ + G ++CL I+ GE IIG
Sbjct: 368 VPK-LVLHF----EGATMHLPRENYVFEAKDGGNCSICLAIIE------GEMTIIGNFQQ 416
Query: 348 QDKMVIYDNEKQRIGWKPEDCNTL 371
Q+ V+YD + ++ + C+ L
Sbjct: 417 QNMHVLYDLKNSKLSFVRAKCDKL 440
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 103/385 (26%), Positives = 166/385 (43%), Gaps = 57/385 (14%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA--A 76
V+LTVG PP+ DTGS+L+W+ C+ + T + ++ I PCS+P C
Sbjct: 33 VSLTVGTPPQNVSMVIDTGSELSWLHCNKTLSYPTTFDPTRSTSYQTI-PCSSPTCTNRT 91
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
+P P C N+ C + Y D SS G L +D+F + S+ S L FGC +
Sbjct: 92 QDFPIPASCDS-NNLCHATLSYADASSSDGNLASDVFHIGSSDIS----GLVFGCMDSVF 146
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDGKVP-S 194
+ + G++G+ RG +S VSQL G + +CI G + G+L LG+ + S
Sbjct: 147 SSNSDEDSKSTGLMGMNRGSLSFVSQL---GFPK--FSYCISGTDFSGLLLLGESNLTWS 201
Query: 195 SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL-------------------TLIF 235
+ +TP++Q S L ++ + Y+ + G+K L +
Sbjct: 202 VPLNYTPLIQISTPLPYF----DRVAYTVQLEGIKVLDKLLPIPKSTFEPDHTGAGQTMV 257
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-----DKTLPICWRGPFKA-----L 285
DSG + + VY + S + + L++ D + +C+ P L
Sbjct: 258 DSGTQFTFLLGPVYNALRSAFLNQ-TSSVLRVLEDPDFVFQGAMDLCYLVPLSQRVLPLL 316
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGE 344
VT F+ ++ + R R VP E + G +V CL N V E +IG
Sbjct: 317 PTVTLVFRGAEMTVSGDRVLYR--VPGE----LRGNDSVHCLSFGNSDLLGV-EAYVIGH 369
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCN 369
Q+ + +D EK RIG C+
Sbjct: 370 HHQQNVWMEFDLEKSRIGLAQVRCD 394
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 150/372 (40%), Gaps = 53/372 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P + F FDTGSD TWVQC PC C + E + P K+ + CS+
Sbjct: 96 YVVPVRLGTPAERFTVVFDTGSDTTWVQCQ-PCVAYCYRQKEPLFDPTKSATYANISCSS 154
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C+ L+ C C Y I+YGDG +IG D L + F FGC
Sbjct: 155 SYCSDLYVSG---CS--GGHCLYGIQYGDGSYTIGFYAQDTLTLAYDTIKNFR----FGC 205
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFLG 188
G + N G AG+LGLGRG+ S+ V +YG V +C+ G G L LG
Sbjct: 206 G--EKNRGLFG--RAAGLLGLGRGKTSLPVQAYDKYG---GVFAYCLPATSAGTGFLDLG 258
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIFDSGASYA 242
G P++ TPML + +Y+ +G L G + DSG
Sbjct: 259 PG-APAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSV--FSTAGTLVDSGTVIT 315
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW-----RGPFKALGQVTEYFKPLAL 297
Y + S + + G AP L C+ +G AL V+ F+ A
Sbjct: 316 RLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGAC 375
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDN 356
L V L ++ CL N + +V I+G + V+YD
Sbjct: 376 ----------LDVDASGILYVADVSQACLAFAPNADDTDVA---IVGNTQQKTHGVLYDI 422
Query: 357 EKQRIGWKPEDC 368
K+ +G+ P C
Sbjct: 423 GKKIVGFAPGAC 434
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 104/364 (28%), Positives = 145/364 (39%), Gaps = 55/364 (15%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWP--------NP 82
DT S+LTWVQC APC C + P + VPC +P C AL P
Sbjct: 159 DTASELTWVQC-APCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQQLATGAGAGAP 217
Query: 83 PRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLS 142
P C Y + Y DG S G L D L G V + FGCG + P P
Sbjct: 218 PCDAGRPAACSYALSYRDGSYSRGVLAHDRLSL---AGEVID-GFVFGCGTSNQGP-PFG 272
Query: 143 PPDTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCI----GQNGRGVLFLGDGKVP---S 194
T+G++GLGR ++S+VSQ + ++G V +C+ + G L LGD S
Sbjct: 273 --GTSGLMGLGRSQLSLVSQTVDQFG---GVFSYCLPLSRESDASGSLVLGDDPSAYRNS 327
Query: 195 SGVAWTPMLQNS----------ADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
+ V +T M+ NS +L +G E+ +G S I DSG
Sbjct: 328 TPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQEVESTGFSA-----RAIVDSGTVITSL 382
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN 304
VY + + M L P AP L C F G L L F +
Sbjct: 383 VPSVYNAVRAEFMSQLAEYP--QAPGFSILDTC----FNMTGLKEVQVPSLTLVF-DGGA 435
Query: 305 SVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
V + Y V S VCL + + + E +IIG ++ V++D ++G+
Sbjct: 436 EVEVDSGGVLYFVSSDSSQVCLAVASLKSED--ETSIIGNYQQKNLRVVFDTSASQVGFA 493
Query: 365 PEDC 368
E C
Sbjct: 494 QETC 497
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 105/391 (26%), Positives = 173/391 (44%), Gaps = 57/391 (14%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKN----I 66
P + + L +G PP + DTGSDL W QC APC+ C + P Y P + +
Sbjct: 81 PTAGEYLMTLAIGTPPVSYQAIADTGSDLIWTQC-APCSSQCFQQPTPLYNPSSSTTFAV 139
Query: 67 VPCSNPR---CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSV 122
+PC++ AAL PP P C Y + YG G +S+ ++ F S +
Sbjct: 140 LPCNSSLSMCAAALAGTTPP----PGCTCMYNMTYGSGWTSV-YQGSETFTFGSSTPANQ 194
Query: 123 FNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--- 178
VP + FGC + G + +G++GLGRG +S+VSQL G+ + +C+
Sbjct: 195 TGVPGIAFGC---SNASGGFNTSSASGLVGLGRGSLSLVSQL---GVPK--FSYCLTPYQ 246
Query: 179 -QNGRGVLFLGDGKV--PSSGVAWTPMLQNSAD----------LKHYILGPAELLYSGKS 225
N L LG + GV+ TP + + +D L LG L +
Sbjct: 247 DTNSTSTLLLGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTA 306
Query: 226 CGLK-DLT--LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT-LPICWRGP 281
LK D T I DSG + + YQ++ + ++ L+ P T L +C+ P
Sbjct: 307 LSLKADGTGGFIIDSGTTITLLGNTAYQQVRAAVV-SLVTLPTTDGGSAATGLDLCFELP 365
Query: 282 FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENN 340
+ P S T + +V+P ++Y+++ N+ CL + N ++ V +
Sbjct: 366 ------SSTSAPPTMPSMTLHFDGADMVLPADSYMML--DSNLWCLAMQNQTDGGV---S 414
Query: 341 IIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
I+G Q+ ++YD ++ + + P C+TL
Sbjct: 415 ILGNYQQQNMHILYDVGQETLTFAPAKCSTL 445
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 107/388 (27%), Positives = 167/388 (43%), Gaps = 57/388 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC--TGCTKPPEKQYKPHKNI----VPCS 70
+ V + +G PP+ F FDTGSDLTWVQC PC + C E + P K+ VPCS
Sbjct: 122 YVVTIGIGTPPRNFTVLFDTGSDLTWVQC-LPCPDSSCYPQQEPLFDPSKSSTYVDVPCS 180
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR-------FSNGSVF 123
P C H + + C+Y ++YGD + G+L + F L + G VF
Sbjct: 181 APEC---HIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAATGVVF 237
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGR 182
+ +N G AG+LGLGRG SI+SQ R V +C+ G
Sbjct: 238 GCSHEYISVFNDTGMG------VAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPRGS 291
Query: 183 --GVLFLGDGKVPS----SGVAWTPMLQNSADLKH-YILGPAELLYSGKSCGLK----DL 231
G L +G G S +++TP++ + L+ Y++ A + +G + + L
Sbjct: 292 STGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSL 351
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD--KTLPICWRGPFKALGQVT 289
+ DSG + + Y + R +G+ K+ P+ K L C+ GQ
Sbjct: 352 GAVIDSGTVVTHMPAAAYYPLRDE-FRLHMGS-YKMLPEGSMKLLDTCY----DVTGQDV 405
Query: 290 EYFKPLALSFTN------RRNSVRLVVPPEAYLVISGRK--NVCLGILNGSEAEVGENNI 341
+AL F + + LV+P E SG+ CL L + A + I
Sbjct: 406 VTAPRVALEFGGGARIDVDASGILLVLPAEDG---SGQSLTLACLAFLPTNSAGL---VI 459
Query: 342 IGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+G + + V++D + RIG+ P C+
Sbjct: 460 VGNMQQRAYNVVFDVDGGRIGFGPNGCS 487
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 99/380 (26%), Positives = 158/380 (41%), Gaps = 51/380 (13%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
SY+ ++ ++G PP DTGSD W QC PC C + P K+ + CS
Sbjct: 88 SYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCK-PCKPCLNQTSPIFNPSKSSTYKNIRCS 146
Query: 71 NPRCAALHWPNPPRC-KHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LT 128
+P C RC + +C+YEI Y D S G + D L ++GS + P +
Sbjct: 147 SPICKR---GEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPISFPKIV 203
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-----NGRG 183
GCG H + +G++G GRG SIVSQL I +C+ N
Sbjct: 204 IGCG---HKNSLTTEGLASGIIGFGRGNFSIVSQLGSS--IGGKFSYCLASLFSKANISS 258
Query: 184 VLFLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLI-------- 234
L+ GD V S GV TP++Q S + +Y LKD +LI
Sbjct: 259 KLYFGDMAVVSGHGVVSTPLIQ-SFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDNEGNAV 317
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTLPICWRGPFKA--LGQVTEY 291
DSG++ + VY ++ + ++ + LK D + L +C++ K + +T +
Sbjct: 318 IDSGSTITQLPNDVYSQLETAVISMV---KLKRVKDPTQQLSLCYKTTLKKYEVPIITAH 374
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
F+ + +++ + +C + + V + G I Q+ +
Sbjct: 375 FRGADVKLNAFNTFIQM-----------NHEVMCFAFNSSAFPWV----VYGNIAQQNFL 419
Query: 352 VIYDNEKQRIGWKPEDCNTL 371
V YD K I +KP +C L
Sbjct: 420 VGYDTLKNIISFKPTNCTKL 439
>gi|213998814|gb|ACJ60774.1| nucellin [Hordeum cf. pusillum GP-2003]
Length = 142
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 54/138 (39%), Positives = 78/138 (56%), Gaps = 5/138 (3%)
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVLFLGD 189
CGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL++GD
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGD 60
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYFTSRV 248
PS GV W PM ++ L +Y G AELL + G +FDSG++Y + +++
Sbjct: 61 FNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPAQI 117
Query: 249 YQEIVSLIMRDLIGTPLK 266
Y EIVS ++ L + L+
Sbjct: 118 YNEIVSKVIGTLSESSLE 135
>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 320
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 121/283 (42%), Gaps = 23/283 (8%)
Query: 98 YGDGGSSIGALVTDLFPLRFSNGS----VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLG 153
YGDG S+ G LV D+ L G+ N + FGCG Q S G++G G
Sbjct: 2 YGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFG 61
Query: 154 RGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSA----DL 209
+ S +SQL G ++ HC+ N G +F G+V S V TPML SA +L
Sbjct: 62 QSNSSFISQLASQGKVKRSFAHCLDNNNGGGIF-AIGEVVSPKVKTTPMLSKSAHYSVNL 120
Query: 210 KHYILGPAEL-LYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA 268
+G + L L S D +I DSG + Y VY +++ I+ L
Sbjct: 121 NAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTV 180
Query: 269 PDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGI 328
+ T C+ K + F + F SV L V P YL C G
Sbjct: 181 QESFT---CFHYTDKL-----DRFPTVTFQF---DKSVSLAVYPREYLFQVREDTWCFGW 229
Query: 329 LNGSEAEVGENN--IIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
NG G + I+G++ + +K+V+YD E Q IGW +C+
Sbjct: 230 QNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 272
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 101/392 (25%), Positives = 144/392 (36%), Gaps = 68/392 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
+ + + +G PPK F+ DTGSDL W+QC PC+ C + Y P + +
Sbjct: 4 YTMEIELGSPPKKFNAIVDTGSDLVWIQCK-PCSQCYSQSDPIYDPSASSTFAKTSCSTS 62
Query: 77 LHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCGYN 134
P C C Y +YGD S+ G + LR S GS P FGCG
Sbjct: 63 SCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCG-- 120
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNGRGVLFLGD 189
+ N G AG++GLG+G+IS+ +QL I N +C+ + L G
Sbjct: 121 RLNSGSFG--GAAGIVGLGQGKISLSTQLGS--AINNKFSYCLVDFDDDSSKTSPLIFGS 176
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------------- 233
SG TP++ NS +Y +G + GK L +
Sbjct: 177 SASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRAL 236
Query: 234 -------IFDSGASYAYFTSRVYQEI-------VSLIMRDLIGTPLKLAPDDKTLPICWR 279
IFDSG + VY ++ VSL D + L D +
Sbjct: 237 EVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYD-----VSKS 291
Query: 280 GPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEA-YLVI--SGRKNVCLGILNGSEAEV 336
FK F L L+F + S PP+ Y VI + CL + +
Sbjct: 292 KNFK--------FPALTLAFKGTKFS-----PPQKNYFVIVDTAETVACLAMGGSGSLGL 338
Query: 337 GENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
G + Q+ V+YD I P C
Sbjct: 339 GIIG---NLMQQNYHVVYDRGTSTISMSPAQC 367
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 101/395 (25%), Positives = 160/395 (40%), Gaps = 62/395 (15%)
Query: 9 FFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK---- 64
F F +NL +G PP+ DTGS L+W+QC +PP + P
Sbjct: 67 FSFKYSMALIINLPIGTPPQTQPMVLDTGSQLSWIQCHK-----KQPPTASFDPSLSSTF 121
Query: 65 NIVPCSNPRCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV 122
+I+PC++P C + P C N C Y Y DG + G LV + F + SV
Sbjct: 122 SILPCTHPLCKPRIPDFTLPTSCDQ-NRLCHYSYFYADGTYAEGNLVREKFTF---SRSV 177
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----- 177
PL GC +P G+LG+ GR+S Q + +C+
Sbjct: 178 STPPLILGCATESTDP--------RGILGMNLGRLSFAKQSKI-----TKFSYCVPPRQT 224
Query: 178 --GQNGRGVLFLGDGKVPSS-GVAWTPMLQNSA------DLKHYILGPAELLYSGKSCGL 228
G G +LG+ PSS G + M+ +S D Y + + +GK +
Sbjct: 225 RPGFTPTGSFYLGNN--PSSKGFKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNI 282
Query: 229 KDLTL----------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
+ DSG+ + Y S Y ++ + ++R +G LK +
Sbjct: 283 SPAVFRADAGGSGQTMIDSGSEFTYLVSEAYDKVRAQVVR-AVGPRLKKGYVYGGVADMC 341
Query: 279 RGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVG- 337
KA+ ++ + F V +V+P E L G C+GI GS ++G
Sbjct: 342 FDSVKAV-EIGRLIGEMVFEF---ERGVEVVIPKERVLADVGGGVHCVGI--GSSDKLGA 395
Query: 338 ENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTLL 372
+NIIG Q+ V +D ++R+G+ DC+ L+
Sbjct: 396 ASNIIGNFHQQNLWVEFDLVRRRVGFGKADCSRLV 430
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 106/376 (28%), Positives = 160/376 (42%), Gaps = 38/376 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC-TGCTKPPEKQYKPHK----NIVPCSN 71
+ + L +G PP + DTGSDL W QC APC T C + P Y P +++PC N
Sbjct: 114 YLMTLAIGTPPLPYAAVADTGSDLIWTQC-APCGTQCFEQPAPLYNPASSTTFSVLPC-N 171
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
+ P C Y YG G ++ G ++ F S VP + FG
Sbjct: 172 SSLSMCAGALAGAAPPPGCACMYYQTYGTGWTA-GVQGSETFTFGSSAADQARVPGVAFG 230
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG-D 189
C N +AG++GLGRG +S+VSQL G + N L LG
Sbjct: 231 C----SNASSSDWNGSAGLVGLGRGSLSLVSQLGA-GRFSYCLTPFQDTNSTSTLLLGPS 285
Query: 190 GKVPSSGVAWTPMLQNSA----------DLKHYILGPAELLYSGKSCGLK-DLT--LIFD 236
+ +GV TP + + A +L LG L S + LK D T LI D
Sbjct: 286 AALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIID 345
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGT-PLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
SG + + YQ++ + + L+ T P D L +C+ AL T +
Sbjct: 346 SGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCF-----ALPAPTSAPPAV 400
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
S T + +V+P ++Y+ ISG CL + N ++ G + G Q+ ++YD
Sbjct: 401 LPSMTLHFDGADMVLPADSYM-ISGSGVWCLAMRNQTD---GAMSTFGNYQQQNMHILYD 456
Query: 356 NEKQRIGWKPEDCNTL 371
++ + + P C+TL
Sbjct: 457 VREETLSFAPAKCSTL 472
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 163/383 (42%), Gaps = 64/383 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT----------KPPEKQYKPHKNI 66
+ + L++G PP+L DTGSDL W++CD C C YK
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDN-CDHCDLDHHGETIFFSDASSSYKK---- 59
Query: 67 VPCSNPRCAALHWPN-PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS---- 121
+PC++ C+ + PRC+ + C Y+ EYGDG + G + +D R S+G+
Sbjct: 60 LPCNSTHCSGMSSAGIGPRCE---ETCKYKYEYGDGSRTSGDVGSDRISFR-SHGAGEDH 115
Query: 122 -VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE---YGLIRNVIGHCI 177
F FGC T G++GLG+ S++ QL + Y ++ +
Sbjct: 116 RSFFDGFLFGCARKLKGDWNF----TQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDS 171
Query: 178 GQNGRGVLFLG-DGKVPSSGVAWTPMLQNS--------ADLKHYILGPAELLYSGKSCG- 227
+ + LFLG + V TP+L DL+ +G ++ K G
Sbjct: 172 PPSAKSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGH 231
Query: 228 -------LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG 280
L + T+I DSG +Y T VY+ + I +I L + L +C
Sbjct: 232 NTSVGPFLANKTVI-DSGTTYTLLTPPVYEAMRKSIEEQVI---LPTLGNSAGLDLC--- 284
Query: 281 PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENN 340
F + G + F + F N+ V+LV+P E ++ R VCL + ++ G+ +
Sbjct: 285 -FNSSGDTSYGFPSVTFYFANQ---VQLVLPFENIFQVTSRDVVCLSM----DSSGGDLS 336
Query: 341 IIGEIFMQDKMVIYDNEKQRIGW 363
IIG + Q+ ++YD +I +
Sbjct: 337 IIGNMQQQNFHILYDLVASQISF 359
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 150/372 (40%), Gaps = 53/372 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P + F FDTGSD TWVQC PC C + E + P K+ + CS+
Sbjct: 161 YVVPVRLGTPAERFTVVFDTGSDTTWVQCQ-PCVAYCYRQKEPLFDPTKSATYANISCSS 219
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C+ L+ C C Y I+YGDG +IG D L + F FGC
Sbjct: 220 SYCSDLYVSG---CS--GGHCLYGIQYGDGSYTIGFYAQDTLTLAYDTIKNFR----FGC 270
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFLG 188
G + N G AG+LGLGRG+ S+ V +YG V +C+ G G L LG
Sbjct: 271 G--EKNRGLFG--RAAGLLGLGRGKTSLPVQAYDKYG---GVFAYCLPATSAGTGFLDLG 323
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIFDSGASYA 242
G P++ TPML + +Y+ +G L G + DSG
Sbjct: 324 PG-APAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSV--FSTAGTLVDSGTVIT 380
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW-----RGPFKALGQVTEYFKPLAL 297
Y + S + + G AP L C+ +G AL V+ F+ A
Sbjct: 381 RLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGAC 440
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDN 356
L V L ++ CL N + +V I+G + V+YD
Sbjct: 441 ----------LDVDASGILYVADVSQACLAFAPNADDTDVA---IVGNTQQKTHGVLYDI 487
Query: 357 EKQRIGWKPEDC 368
K+ +G+ P C
Sbjct: 488 GKKIVGFAPGAC 499
>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
Length = 506
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 98/371 (26%), Positives = 149/371 (40%), Gaps = 44/371 (11%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGC-----TKPPEKQYKPHKN----IV 67
+ +G P F DTGSDL W+ C+ AP T +Y P + +
Sbjct: 104 IDIGTPSVSFLVALDTGSDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVF 163
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPL------RFSNG 120
CS+ C + C+ P +QC Y + Y G SS G LV D+ L R NG
Sbjct: 164 LCSHKLCDS-----ASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNG 218
Query: 121 SV-FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
S + GCG Q L G++GLG IS+ S L + GL+RN C +
Sbjct: 219 SSSVKARVVIGCGKKQSG-DYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDE 277
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSG 238
G ++ GD + S TP LQ + YI+G E G SC T DSG
Sbjct: 278 EDSGRIYFGD--MGPSIQQSTPFLQLENN-SGYIVG-VEACCIGNSCLKQTSFTTFIDSG 333
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
S+ Y +Y+++ I R + T + W +++ V + L
Sbjct: 334 QSFTYLPEEIYRKVALEIDRHINATSKSFE------GVSWEYCYES--SVEPKVPAIKLK 385
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
F++ N+ + P + G CL I + +G IG+ +M+ +++D E
Sbjct: 386 FSH-NNTFVIHKPLFVFQQSQGLVQFCLPISPSGQEGIGS---IGQNYMRGYRMVFDREN 441
Query: 359 QRIGWKPEDCN 369
++ W C
Sbjct: 442 MKLRWSASKCQ 452
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 106/381 (27%), Positives = 169/381 (44%), Gaps = 50/381 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSN 71
YF +++++G PP DTGSDLTWVQC PC C K + K+ C +
Sbjct: 85 YF-MSISIGTPPSKVFAIADTGSDLTWVQC-KPCQQCYKQNSPLFDKKKSSTYKTESCDS 142
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FG 130
C AL + C D C Y YGD + G + T+ + S+GS + P T FG
Sbjct: 143 KTCQALS-EHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSFPGTVFG 201
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-----NGRGVL 185
CGYN G +G++GLG G +S+VSQL I +C+ NG V+
Sbjct: 202 CGYNN---GGTFEETGSGIIGLGGGPLSLVSQLGSS--IGKKFSYCLSHTAATTNGTSVI 256
Query: 186 FLGDGKVPS-----SGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGL------ 228
LG +PS S TP++Q + +++ +G +L Y+G GL
Sbjct: 257 NLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSK 316
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
+ +I DSG + S Y + + + + G +++ L C++ K +G
Sbjct: 317 RTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAK-RVSDPQGLLTHCFKSGDKEIG-- 373
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
+ + FTN V+L P A++ ++ VCL ++ +E I G +
Sbjct: 374 ---LPAITMHFTNA--DVKL-SPINAFVKLN-EDTVCLSMIPTTEVA-----IYGNMVQM 421
Query: 349 DKMVIYDNEKQRIGWKPEDCN 369
D +V YD E + + ++ DC+
Sbjct: 422 DFLVGYDLETKTVSFQRMDCS 442
>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 543
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 160/381 (41%), Gaps = 45/381 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCD---------APCTGCTKPPEKQYKPHKNI 66
Y+A + +G P F DTGSDL WV CD A TG P + Y P ++
Sbjct: 108 YYA-EVELGTPNATFLVALDTGSDLFWVPCDCRQCATIPSANGTGQDAPSLRPYSPRRSS 166
Query: 67 ----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRF---- 117
V C NP C + + N C YE++Y SS G LV D+ L
Sbjct: 167 TSKQVACDNPLCGQRNGCS----AATNGSCPYEVQYVSANTSSSGVLVQDVLHLTRERPG 222
Query: 118 --SNGSVFNVPLTFGCGYNQHNP---GPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RN 171
+ G P+ FGCG Q G D G++GLG G++S+ S L GL+ +
Sbjct: 223 PGAAGEALQAPVVFGCGQVQTGAFLDGGGGAVD--GLMGLGMGKVSVPSALAASGLVASD 280
Query: 172 VIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL 231
C G +G G + GD S G A TP S + + + + G +
Sbjct: 281 SFSMCFGDDGVGRVNFGDAG--SRGQAETPFTVRSLNPTYNV--SFTSIGVGSESVAAEF 336
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
+ DSG S+ Y + Y ++ + + + + P + ++ TE
Sbjct: 337 AAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSAD-PFPFEYCYRLSPNQTEV 395
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVI---SGRK-NVCLGILNGSEAEVGENNIIGEIFM 347
P +S T + ++ V P ++ + +GR CL I+ ++ +G + IIG+ FM
Sbjct: 396 AMP-DVSLTAKGGALFPVTQP--FIPVGDTTGRAVGYCLAIMR-NDMAIGID-IIGQNFM 450
Query: 348 QDKMVIYDNEKQRIGWKPEDC 368
V++D E+ +GW+ DC
Sbjct: 451 TGLKVVFDRERSVLGWEKFDC 471
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 97/370 (26%), Positives = 154/370 (41%), Gaps = 45/370 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V + +G P + + DTGS L+W+QC C + + P + + C++
Sbjct: 13 YYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 72
Query: 73 RCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTF 129
+C++L N P C+ ++ C Y YGD S+G L DL L S +P +
Sbjct: 73 QCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ----TLPGFVY 128
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI-GQNGRGVLFL 187
GCG Q + G AG+LGLGR ++S++ Q+ ++G +C+ + G G L +
Sbjct: 129 GCG--QDSEGLFG--RAAGILGLGRNKLSMLGQVSSKFGY---AFSYCLPTRGGGGFLSI 181
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD----LTLIFDSGASYAY 243
G + S +TPM + + Y L + G++ G+ + I DSG
Sbjct: 182 GKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVITR 241
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
VY ++ ++ + AP L C++G K + V E
Sbjct: 242 LPMSVYTPFQQAFVK-IMSSKYARAPGFSILDTCFKGNLKDMQSVPE------------- 287
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSE--AEVGENN--IIGEIFMQDKMVIYDNEKQ 359
VRL+ A L + NV L + G A G N IIG Q V +D
Sbjct: 288 --VRLIFQGGADLNLR-PVNVLLQVDEGLTCLAFAGNNGVAIIGNHQQQTFKVAHDISTA 344
Query: 360 RIGWKPEDCN 369
RIG+ CN
Sbjct: 345 RIGFATGGCN 354
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 169/381 (44%), Gaps = 50/381 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP----EKQYKPHKNIVPCSN 71
YF +++++G PP F DTGSDLTWVQC PC C K +K+ C +
Sbjct: 85 YF-MSISIGTPPSKFLAIADTGSDLTWVQC-KPCQQCYKQNTPLFDKKKSSTYKTESCDS 142
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FG 130
C AL + C + C Y YGD + G + T+ + S+GS + P T FG
Sbjct: 143 ITCNALS-EHEEGCDESRNACKYRYSYGDESFTKGEVATETISIDSSSGSPVSFPGTAFG 201
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-----NGRGVL 185
CGYN G +G++GLG G +S+VSQL I +C+ NG V+
Sbjct: 202 CGYNN---GGTFEETGSGIIGLGGGPLSLVSQLGSS--IGKKFSYCLSHTSATTNGTSVI 256
Query: 186 FLGDGKVPS-----SGVAWTPMLQNSADLKHYI------LGPAELLYSG------KSCGL 228
LG + S S + TP++Q + +++ +G +L Y+G
Sbjct: 257 NLGTNSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKTKLPYTGGGGYSLNRKSK 316
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
K +I DSG + S Y + +++ + G +++ L C++ K +G
Sbjct: 317 KTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAK-RVSDPQGILTHCFKSGDKEIGLP 375
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
T + + FT V+L P +++ +S VCL ++ +E I G +
Sbjct: 376 T-----ITMHFTGA--DVKL-SPINSFVKLS-EDIVCLSMIPTTEVA-----IYGNMVQM 421
Query: 349 DKMVIYDNEKQRIGWKPEDCN 369
D +V YD E + + ++ DC+
Sbjct: 422 DFLVGYDLETKTVSFQRMDCS 442
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 94/353 (26%), Positives = 150/353 (42%), Gaps = 40/353 (11%)
Query: 34 FDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALHWP--NPPRCKH 87
DTGS L+W+QC C + Y P + + C++ C+ L N P C+
Sbjct: 3 LDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLCET 62
Query: 88 PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCGYNQHNPGPLSPPDT 146
++ C Y YGD SIG L DL L S +P T+GCG Q N G
Sbjct: 63 DSNACLYTASYGDTSFSIGYLSQDLLTLTSSQ----TLPQFTYGCG--QDNQGLFG--RA 114
Query: 147 AGVLGLGRGRISIVSQLR-EYGLIRNVIGHCI---GQNGRGVLFLGDGKVPSSGVAWTPM 202
AG++GL R ++S+++QL +YG + +C+ G FL G + + +TPM
Sbjct: 115 AGIIGLARDKLSMLAQLSTKYG---HAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPM 171
Query: 203 LQNSADLKHYILGPAELLYSGK----SCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMR 258
L +S + Y L + SG+ + + + + DSG +Y + ++
Sbjct: 172 LTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLPMSMYAALRQAFVK 231
Query: 259 DLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVI 318
++ T AP L C++G K++ V E + + F + L + + L+
Sbjct: 232 -IMSTKYAKAPAYSILDTCFKGSLKSISAVPE----IKMIF---QGGADLTLRAPSILIE 283
Query: 319 SGRKNVCLGILNGSEAEVGENN--IIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+ + CL S G N IIG Q + YD RIG+ P C+
Sbjct: 284 ADKGITCLAFAGSS----GTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSCH 332
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 103/377 (27%), Positives = 156/377 (41%), Gaps = 50/377 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC--------TKPPEKQYKPHKNIVP 68
+ V + +G P K F DTGS L+W+QC C T K YK +P
Sbjct: 113 YYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKA----LP 168
Query: 69 CSNPRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
CS+ +C++L N P C + C Y+ YGD SIG L D+ L S +
Sbjct: 169 CSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSEAP--SSG 226
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQNG---- 181
+GCG Q N G ++G++GL +IS++ QL ++YG N +C+ +
Sbjct: 227 FVYGCG--QDNQGLFG--RSSGIIGLANDKISMLGQLSKKYG---NAFSYCLPSSFSAPN 279
Query: 182 ----RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTL 233
G L +G + SS +TP+++N Y L + +GK G+ ++
Sbjct: 280 SSSLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPT 339
Query: 234 IFDSGASYAYFTSRVYQEI-VSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
I DSG VY + S ++ ++ AP L C++G K + V E
Sbjct: 340 IIDSGTVITRLPVAVYNALKKSFVL--IMSKKYAQAPGFSILDTCFKGSVKEMSTVPE-- 395
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
+ + F R L + LV + CL I A +IIG Q V
Sbjct: 396 --IQIIF---RGGAGLELKAHNSLVEIEKGTTCLAI----AASSNPISIIGNYQQQTFKV 446
Query: 353 IYDNEKQRIGWKPEDCN 369
YD +IG+ P C
Sbjct: 447 AYDVANFKIGFAPGGCQ 463
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 99/370 (26%), Positives = 147/370 (39%), Gaps = 53/370 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ + + G P + FDTGSD+ W+QC C E + P ++N V C+
Sbjct: 16 YVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYRN-VSCTE 74
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL----RFSNGSVFNVPL 127
P C L C + C Y + YGDG S+IG L D F L +F N
Sbjct: 75 PACVGLSTRG---CS--SSTCLYGVFYGDGSSTIGFLAMDTFMLTPAQKFKN-------F 122
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRI-SIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
FGCG Q+N G TAG++GLGR S+ SQ+ + NV +C+ +
Sbjct: 123 IFGCG--QNNTGLFQ--GTAGLVGLGRSSTYSLNSQVAPS--LGNVFSYCLPSTSSATGY 176
Query: 187 LGDGKVPSSGVAWTPMLQNS-------ADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
L G P + +T ML ++ DL +G L S S + + I DSG
Sbjct: 177 LNIGN-PQNTPGYTAMLTDTRVPTLYFIDLIGISVGGTRL--SLSSTVFQSVGTIIDSGT 233
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP-LALS 298
Y + + + + T LAP L C+ + T P + L
Sbjct: 234 VITRLPPTAYSALKTAVRAAM--TQYTLAPAVTILDTCYD-----FSRTTSVVYPVIVLH 286
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
F + + +P + VCL +++ + IIG + V YDNE
Sbjct: 287 FAG----LDVRIPATGVFFVFNSSQVCLAFAGNTDSTM--IGIIGNVQQLTMEVTYDNEL 340
Query: 359 QRIGWKPEDC 368
+RIG+ C
Sbjct: 341 KRIGFSAGAC 350
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 158/375 (42%), Gaps = 37/375 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC-TGCTKPPEKQYKPHK----NIVPCSN 71
+ + L +G PP + DTGSDL W QC APC T C + P Y P +++PC N
Sbjct: 112 YLMTLAIGTPPLPYAAVADTGSDLIWTQC-APCGTQCFEQPAPLYNPASSTTFSVLPC-N 169
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
+ P C Y YG G ++ G ++ F S VP + FG
Sbjct: 170 SSLSMCAGALAGAAPPPGCACMYNQTYGTGWTA-GVQGSETFTFGSSAADQARVPGVAFG 228
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG-D 189
C N +AG++GLGRG +S+VSQL G + N L LG
Sbjct: 229 C----SNASSSDWNGSAGLVGLGRGSLSLVSQLGA-GRFSYCLTPFQDTNSTSTLLLGPS 283
Query: 190 GKVPSSGVAWTPMLQNSA----------DLKHYILGPAELLYSGKSCGLK-DLT--LIFD 236
+ +GV TP + + A +L LG L S + LK D T LI D
Sbjct: 284 AALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIID 343
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
SG + + YQ++ + + + P D L +C+ AL T +
Sbjct: 344 SGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCF-----ALPAPTSAPPAVL 398
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
S T + +V+P ++Y+ ISG CL + N ++ G + G Q+ ++YD
Sbjct: 399 PSMTLHFDGADMVLPADSYM-ISGSGVWCLAMRNQTD---GAMSTFGNYQQQNMHILYDV 454
Query: 357 EKQRIGWKPEDCNTL 371
++ + + P C+TL
Sbjct: 455 REETLSFAPAKCSTL 469
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 100/369 (27%), Positives = 157/369 (42%), Gaps = 45/369 (12%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP----EKQYKPHKNIVPCSNPRCAA 76
+T+G + DTGSDLTWVQCD PC C N + C++ C
Sbjct: 135 VTIGLGNQNMTVIIDTGSDLTWVQCD-PCMSCYSQQGPVFNPSNSSSYNSLLCNSSTCQN 193
Query: 77 LHWP--NPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
L + N C+ N C++ + YGDG + G L + L F SV N FGCG
Sbjct: 194 LQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVE--HLSFGGISVSN--FVFGCGR 249
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDG 190
N N G +G++GLGR +S++SQ V +C+ G L +G+
Sbjct: 250 N--NKGLFG--GVSGIMGLGRSNLSMISQTNT--TFGGVFSYCLPTTDSGASGSLVIGNE 303
Query: 191 KVPSSG---VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-----LIFDSGASYA 242
+A+T M+ N Y+L + G ++D + ++ DSG
Sbjct: 304 SSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGG--VAIQDTSFGNGGILIDSGTVIT 361
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP-LALSFTN 301
+Y + + ++ G P +AP L C+ L + E P L++ F
Sbjct: 362 RLAPSLYNALKAEFLKQFSGYP--IAPALSILDTCFN-----LTGIEEVSIPTLSMHF-- 412
Query: 302 RRNSVRLVVPPEAYLVI-SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
N+V L V L + VCL + S ++ + IIG +++ VIYD ++ +
Sbjct: 413 -ENNVDLNVDAVGILYMPKDGSQVCLAL--ASLSDENDMAIIGNYQQRNQRVIYDAKQSK 469
Query: 361 IGWKPEDCN 369
IG+ EDC+
Sbjct: 470 IGFAREDCS 478
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 158/387 (40%), Gaps = 63/387 (16%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNPRC 74
++L +G PP+ DTGS L+W+QC P+ + P + +PCS+P C
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCHR--KKLPPKPKTSFDPSLSSSFSTLPCSHPLC 131
Query: 75 AAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+ P C N C Y Y DG + G LV + + FSN + PL GC
Sbjct: 132 KPRIPDFTLPTSCDS-NRLCHYSYFYADGTFAEGNLVKE--KITFSNTEI-TPPLILGCA 187
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGVL 185
D G+LG+ RGR+S VSQ + + +CI G G
Sbjct: 188 TESS--------DDRGILGMNRGRLSFVSQAKI-----SKFSYCIPPKSNRPGFTPTGSF 234
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYS----GKSCGLKDLTL-------- 233
+LGD S G + +L + L P L Y+ G GLK L +
Sbjct: 235 YLGDNPN-SHGFKYVSLLTFPESQRMPNLDP--LAYTVPMIGIRFGLKKLNISGSVFRPD 291
Query: 234 -------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
+ DSG+ + + Y ++ + IM + K T +C+ G +
Sbjct: 292 AGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDG---NVA 348
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVG-ENNIIGEI 345
+ L FT V ++VP E LV G C+GI G + +G +NIIG +
Sbjct: 349 MIPRLIGDLVFVFT---RGVEILVPKERVLVNVGGGIHCVGI--GRSSMLGAASNIIGNV 403
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTLL 372
Q+ V +D +R+G+ DC+ ++
Sbjct: 404 HQQNLWVEFDVTNRRVGFAKADCSRVV 430
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 102/374 (27%), Positives = 145/374 (38%), Gaps = 50/374 (13%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK-----------QYKPH----KN 65
+ +G P F D GSDL WV CD C C +Y P
Sbjct: 111 IDIGTPNVSFLVALDAGSDLLWVPCD--CIQCAPLSASYYNISLDRDLSEYSPSLSSTSR 168
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGD--GGSSIGALVTDLFPLR----FSN 119
+ C + C W + CK+P D C Y Y D +S G LV D L +
Sbjct: 169 HLSCDHQLC---EWGS--NCKNPKDPCPYIFNYDDFENTTSAGFLVEDKLHLASVGDHTA 223
Query: 120 GSVFNVPLTFGCGYNQHNPG-PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
+ + GCG Q + PD GV+GLG G IS+ S L + GLI+N C
Sbjct: 224 RKMLQASVVLGCGRKQGGSFFDGAAPD--GVMGLGPGDISVPSLLAKAGLIQNCFSLCFD 281
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD-LTLIFDS 237
+N G + GD S TP L Y +G E G SC + + DS
Sbjct: 282 ENDSGRILFGDRGHASQQS--TPFLPIQGTYVAYFVG-VESYCVGNSCLKRSGFKALVDS 338
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
G+S+ Y S VY E+VS + + +++ D C+ + L + + L
Sbjct: 339 GSSFTYLPSEVYNELVSEFDKQV--NAKRISFQDGLWDYCYNASSQELHDI----PAIQL 392
Query: 298 SFTNRRNSVRLVVPPEAYLV--ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
F +N VV Y + G CL + + G IIG+ FM +++D
Sbjct: 393 KFPRNQN---FVVHNPTYSIPHHQGFTMFCLSL----QPTDGSYGIIGQNFMIGYRMVFD 445
Query: 356 NEKQRIGWKPEDCN 369
E ++GW C
Sbjct: 446 IENLKLGWSNSSCQ 459
>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 880
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 100/377 (26%), Positives = 147/377 (38%), Gaps = 47/377 (12%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE----------KQYKPH----KNI 66
+ +G P F D GSD+ WV CD C C QY+P
Sbjct: 109 IDIGTPNVSFLVALDAGSDMLWVPCD--CIECASLSAGNYNVLDRDLNQYRPSLSNTSRH 166
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPL----RFSNGS 121
+PC + C CK D C Y ++Y SS G + D L + + +
Sbjct: 167 LPCGHKLCDV-----HSVCKGSKDPCPYAVQYSSANTSSSGYVFEDKLHLTSNGKHAEQN 221
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
+ GCG Q L GVLGLG G IS+ S L + GLI+N C +N
Sbjct: 222 SVQASIILGCGRKQTGE-YLRGAGPDGVLGLGPGNISVPSLLAKAGLIQNSFSICFEENE 280
Query: 182 RGVLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD--LTLIFDSG 238
G + GD G V TP L YI+G E G C LK+ + DSG
Sbjct: 281 SGRIIFGDQGHVTQHS---TPFLPIDGKFNAYIVG-VESFCVGSLC-LKETRFQALIDSG 335
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+S+ + + VYQ++V + + T + L W + A Q PL L+
Sbjct: 336 SSFTFLPNEVYQKVVIEFDKQVNATSIVLQNS-------WEYCYNASSQELISIPPLNLA 388
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
F+ RN L+ P + + + L S ++ + IG+ F+ +++D E
Sbjct: 389 FS--RNQTYLIQNP--IFIDPASQEYTIFCLPVSPSD-DDYAAIGQNFLMGYRMVFDREN 443
Query: 359 QRIGWKPEDCNTLLSLN 375
R W +C S +
Sbjct: 444 LRFSWSRWNCQDRASFS 460
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 99/377 (26%), Positives = 159/377 (42%), Gaps = 51/377 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT-GCTKPPEKQYKPHKN----IVPCSN 71
+ V + +G P K + DTGS +W+QC PCT C + + P + VPCS+
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQ-PCTIYCHIQEDPVFNPSASKTYKTVPCSS 161
Query: 72 PRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG-SVFNVPLT 128
+C++L N P C ++ C Y+ YGD S+G L D+ L S S F
Sbjct: 162 SQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSF----V 217
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCIGQN------- 180
+GCG Q N G D G++GL +S++SQL +YG N +C+ +
Sbjct: 218 YGCG--QDNQGLFGRTD--GIIGLANNELSMLSQLSGKYG---NAFSYCLPTSFSTPNSP 270
Query: 181 GRGVLFLGDGKV-PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIF 235
G L +G + PSS +TP+L+N + Y + + +G+ G+ + I
Sbjct: 271 KEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTII 330
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + VY + + + ++ + AP L C++G + +V
Sbjct: 331 DSGTVITRLPTPVYTTLKNAYVT-ILSKKYQQAPGISLLDTCFKGSLAGISEVAP----- 384
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKNVC---LGILNGSEAEVGENNIIGEIFMQDKMV 352
+R++ A L + G ++ GI + A IIG Q V
Sbjct: 385 ---------DIRIIFKGGADLQLKGHNSLVELETGITCLAMAGSSSIAIIGNYQQQTVKV 435
Query: 353 IYDNEKQRIGWKPEDCN 369
YD R+G+ P C
Sbjct: 436 AYDVGNSRVGFAPGGCQ 452
>gi|213998802|gb|ACJ60768.1| nucellin [Hordeum murinum subsp. glaucum]
Length = 142
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 53/138 (38%), Positives = 76/138 (55%), Gaps = 5/138 (3%)
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVLFLGD 189
CGY Q P P G+LGLG G+ QL+ +I+ N+IGHC+ G+GVL++GD
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAVQLKGQKMIKENIIGHCLSSKGKGVLYVGD 60
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYFTSRV 248
PS GV W PM ++ L +Y G AELL + G +FDSG++Y + + +
Sbjct: 61 FNPPSRGVTWVPMRES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPAHI 117
Query: 249 YQEIVSLIMRDLIGTPLK 266
Y EIVS + L + L+
Sbjct: 118 YSEIVSKVRGTLSESSLE 135
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 99/377 (26%), Positives = 159/377 (42%), Gaps = 51/377 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT-GCTKPPEKQYKPHKN----IVPCSN 71
+ V + +G P K + DTGS +W+QC PCT C + + P + VPCS+
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQ-PCTIYCHIQEDPVFNPSASKTYKTVPCSS 161
Query: 72 PRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG-SVFNVPLT 128
+C++L N P C ++ C Y+ YGD S+G L D+ L S S F
Sbjct: 162 SQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSF----V 217
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCIGQN------- 180
+GCG Q N G D G++GL +S++SQL +YG N +C+ +
Sbjct: 218 YGCG--QDNQGLFGRTD--GIIGLANNELSMLSQLSGKYG---NAFSYCLPTSFSTPNSP 270
Query: 181 GRGVLFLGDGKV-PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIF 235
G L +G + PSS +TP+L+N + Y + + +G+ G+ + I
Sbjct: 271 KEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTII 330
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + VY + + + ++ + AP L C++G + +V
Sbjct: 331 DSGTVITRLPTPVYTTLKNAYVT-ILSKKYQQAPGISLLDTCFKGSLAGISEVAP----- 384
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKNVC---LGILNGSEAEVGENNIIGEIFMQDKMV 352
+R++ A L + G ++ GI + A IIG Q V
Sbjct: 385 ---------DIRIIFKGGADLQLKGHNSLVELETGITCLAMAGSSSIAIIGNYQQQTVKV 435
Query: 353 IYDNEKQRIGWKPEDCN 369
YD R+G+ P C
Sbjct: 436 AYDVGNSRVGFAPGGCQ 452
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 159/383 (41%), Gaps = 63/383 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F VN +VG+PP DTGSDL WVQC PC C + + P K+ + +P
Sbjct: 91 FLVNFSVGRPPVPQLVGIDTGSDLLWVQC-RPCADCFRQSTPIFDPSKSSTYVDLSYDSP 149
Query: 73 RCAALHWPNPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVPLTFG 130
C PN P+ K+ + +QC Y Y DG +S G L T+ S+ G+V + FG
Sbjct: 150 IC-----PNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFG 204
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-----QNGRGVL 185
CG++ N G +G+LGL G SIVS+L +CIG L
Sbjct: 205 CGHS--NRGRFDGQQ-SGILGLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHNQL 255
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------------ 233
LGDG ++ S+ H G + G S G L +
Sbjct: 256 VLGDG----------VKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQ 305
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP--ICWRGPFKALGQV 288
+ DSG + + + + + I R + G ++ +T+P +C++G + +
Sbjct: 306 GGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIY--RTIPGWLCYKG---RVNED 360
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
F LA F LV+ + V + CL +L + +G ++IG + Q
Sbjct: 361 LRGFPELAFHFA---EGADLVLDANSLFVQKNQDVFCLAVLESNLKNIG--SVIGIMAQQ 415
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
V YD +R+ ++ DC L
Sbjct: 416 HYNVAYDLIGKRVYFQRTDCELL 438
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 101/385 (26%), Positives = 167/385 (43%), Gaps = 57/385 (14%)
Query: 12 PIFSYFAVNLT---VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK---- 64
PI +Y +L +G PP DTGSDL W+QC APC GC K + + P K
Sbjct: 60 PINAYIGQHLMEIYIGTPPIKITGLVDTGSDLIWIQC-APCLGCYKQIKPMFDPLKSSTY 118
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
N + C +P C H + C P +C+Y YGD + G L D + G +
Sbjct: 119 NNISCDSPLC---HKLDTGVCS-PEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVS 174
Query: 125 VP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGR 182
+ FGCG+N N G + + G++GLG G S++SQ+ +G + C+
Sbjct: 175 LSRFLFGCGHN--NTGGFNDHE-MGLIGLGGGPTSLISQIGPLFGGKK--FSQCL----- 224
Query: 183 GVLFLGDGKVPS------------SGVAWTPMLQNSADLKHYI--LG-PAELLYSGKSCG 227
V FL D K+ S +GV TP++ D +++ LG E Y +
Sbjct: 225 -VPFLTDIKISSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYFPMNST 283
Query: 228 LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL--PICWRGPFKAL 285
+ ++ DSG ++Y ++ + + + LK DD +L +C+R
Sbjct: 284 IGKANMLVDSGTPPILLPQQLYDKVFAEVRNKV---ALKPITDDPSLGTQLCYRTQTNLK 340
Query: 286 G-QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGE 344
G +T +F + T ++ +PP + CL I N + ++ G + G
Sbjct: 341 GPTLTFHFVGANVLLT----PIQTFIPPTP----QTKGIFCLAIYNRTNSDPG---VYGN 389
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCN 369
+ ++ +D ++Q + +KP DC
Sbjct: 390 FAQSNYLIGFDLDRQVVSFKPTDCT 414
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 95/365 (26%), Positives = 148/365 (40%), Gaps = 39/365 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P F FDTGSD TWVQC PC C + E + P K+ + C++
Sbjct: 165 YVVPIRLGTPAARFTVVFDTGSDTTWVQCQ-PCVAYCYQQKEPLFTPTKSATYANISCTS 223
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C+ L + C C Y ++YGDG ++G D L + F FGC
Sbjct: 224 SYCSDL---DTRGCS--GGHCLYAVQYGDGSYTVGFYAQDTLTLGYDTVKDFR----FGC 274
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
G + N G AG++GLGRG+ S+ ++ Y V +CI +G G L G
Sbjct: 275 G--EKNRGLFG--KAAGLMGLGRGKTSV--PVQAYDKYSGVFAYCIPATSSGTGFLDFGP 328
Query: 190 GKVPSSGVAWTPMLQNSADLKHYI----LGPAELLYSGKSCGLKDLTLIFDSGASYAYFT 245
G ++ TPML ++ +Y+ + L S + D + DSG
Sbjct: 329 GAPAAANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITRLP 388
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNR--R 303
Y+ + S + + G K AP L C+ +T Y +AL + +
Sbjct: 389 PSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCY--------DLTGYQGSIALPAVSLVFQ 440
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
L V L ++ CL + + I+G + V+YD K+ +G+
Sbjct: 441 GGACLDVDASGILYVADVSQACLAFAANDDDT--DMTIVGNTQQKTYSVLYDLGKKVVGF 498
Query: 364 KPEDC 368
P C
Sbjct: 499 APGAC 503
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 157/387 (40%), Gaps = 63/387 (16%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNPRC 74
++L +G PP+ DTGS L+W+QC P+ + P + +PCS+P C
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCHR--KKLPPKPKTSFDPSLSSSFSTLPCSHPLC 131
Query: 75 AAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+ P C N C Y Y DG + G LV + + FSN + PL GC
Sbjct: 132 KPRIPDFTLPTSCDS-NRLCHYSYFYADGTFAEGNLVKE--KITFSNTEI-TPPLILGCA 187
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGVL 185
D G+LG+ RGR+S VSQ + + +CI G G
Sbjct: 188 TESS--------DDRGILGMNRGRLSFVSQAKI-----SKFSYCIPPKSNRPGFTPTGSF 234
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYS----GKSCGLKDLTL-------- 233
+LGD S G + +L + L P L Y+ G GLK L +
Sbjct: 235 YLGDNPN-SHGFKYVSLLTFPESQRMPNLDP--LAYTVPMIGIRFGLKKLNISGSVFRPD 291
Query: 234 -------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
+ DSG+ + + Y ++ + IM + K T +C+ G +
Sbjct: 292 AGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDG---NVA 348
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVG-ENNIIGEI 345
+ L FT V + VP E LV G C+GI G + +G +NIIG +
Sbjct: 349 MIPRLIGDLVFVFT---RGVEIFVPKERVLVNVGGGIHCVGI--GRSSMLGAASNIIGNV 403
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTLL 372
Q+ V +D +R+G+ DC+ ++
Sbjct: 404 HQQNLWVEFDVTNRRVGFAKADCSRVV 430
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 159/386 (41%), Gaps = 55/386 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQC-DAPCTGCTKPP--EKQYKPHKNIVPCSNPR 73
V+LTVG PP+ DTGS+L+W+ C AP P Y P +PC++P
Sbjct: 63 LTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHSVFDPLRSSSYSP----IPCTSPT 118
Query: 74 C--AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FG 130
C + P C C I Y D S G L +D F + S +P T FG
Sbjct: 119 CRTRTRDFSIPVSCDK-KKLCHAIISYADASSIEGNLASDTFHIGNS-----AIPATIFG 172
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGD 189
C + + T G++G+ RG +S V+Q+ GL + +CI GQ+ G+L G+
Sbjct: 173 CMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQM---GLQK--FSYCISGQDSSGILLFGE 227
Query: 190 GKVP-SSGVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGLKDLT----L 233
+ +TP++Q S L ++ I +L KS D T
Sbjct: 228 SSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQT 287
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-----DKTLPICWRGPFKA---- 284
+ DSG + + VY + + +R + LK+ D + +C+R P
Sbjct: 288 MVDSGTQFTFLLGPVYTALKNEFVRQTKAS-LKVLEDPNFVFQGAMDLCYRVPLTRRTLP 346
Query: 285 -LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIG 343
L VT F+ +S + R R VP VI G +V SE E+ IIG
Sbjct: 347 PLPTVTLMFRGAEMSVSAERLMYR--VPG----VIRGSDSVYCFTFGNSELLGVESYIIG 400
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDCN 369
Q+ + +D K R+G+ C+
Sbjct: 401 HHHQQNVWMEFDLAKSRVGFAEVRCD 426
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 161/387 (41%), Gaps = 59/387 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
F ++L+VG P + DTGSDL W QC PC C + P + +PCS+
Sbjct: 116 FLMDLSVGTPALPYAAIVDTGSDLVWTQCK-PCVECFNQTTPVFDPAASSTYAALPCSSA 174
Query: 73 RCAALHWPNPPRCKHPNDQCD---YEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LT 128
CA L + Y YGD S+ G L T+ F L VP +
Sbjct: 175 LCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLARQ-----KVPGVA 229
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ----NGRGV 184
FGCG G AG++GLGRG +S+VSQL G+ R +C+ GR
Sbjct: 230 FGCGDTNEGDGFT---QGAGLVGLGRGPLSLVSQL---GIDR--FSYCLTSLDDAAGRSP 281
Query: 185 LFLGDGKVPSSG-----VAWTPMLQNSADLKHY-------ILGPAELLYSGKSCGLKDL- 231
L LG S+ TP+++N + Y +G L + ++D
Sbjct: 282 LLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDG 341
Query: 232 --TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALG 286
+I DSG S Y R Y+ +R + L D + L +C++GP A+
Sbjct: 342 TGGVIVDSGTSITYLELRAYRA-----LRKAFVAHMSLPTVDASEIGLDLCFQGPAGAVD 396
Query: 287 QVTEYFKP-LALSFTNRRNSVRLVVPPEAYLVI-SGRKNVCLGILNGSEAEVGENNIIGE 344
Q + P L L F + L +P E Y+V+ S +CL ++ A G +IIG
Sbjct: 397 QDVQVQVPKLVLHFDGGAD---LDLPAENYMVLDSASGALCLTVM----ASRGL-SIIGN 448
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCNTL 371
Q+ +YD + + P +CN L
Sbjct: 449 FQQQNFQFVYDVAGDTLSFAPAECNKL 475
>gi|213998832|gb|ACJ60783.1| nucellin [Hordeum vulgare subsp. spontaneum]
Length = 127
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 75/128 (58%), Gaps = 5/128 (3%)
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVLFLGD 189
CGY Q P P G+LGLG G+ + +QL+ + +I+ NVIGHC+ G+GVL++GD
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIGHCLSSKGKGVLYVGD 60
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYFTSRV 248
P+ GV W PM ++ L +Y G AE+ + G +FDSG++Y + +++
Sbjct: 61 FNPPTRGVTWVPMRES---LFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTHVPAQI 117
Query: 249 YQEIVSLI 256
Y EIVS +
Sbjct: 118 YNEIVSKV 125
>gi|218185382|gb|EEC67809.1| hypothetical protein OsI_35378 [Oryza sativa Indica Group]
Length = 344
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/106 (45%), Positives = 72/106 (67%), Gaps = 8/106 (7%)
Query: 271 DKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGI 328
D +LP+CW+G F+++ V + FK L L+F N N+V + +PPE +L+++ NVCLGI
Sbjct: 103 DPSLPLCWKGQKAFESVSDVKKEFKSLQLNFGN--NAV-MEIPPENFLIVTEYGNVCLGI 159
Query: 329 LNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTLLSL 374
L+GS NIIG+I MQD+MVIYDNE++++GW C L+ +
Sbjct: 160 LHGSRLNF---NIIGDITMQDQMVIYDNEREQLGWIRGSCAELIGV 202
Score = 42.4 bits (98), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 17/25 (68%), Positives = 20/25 (80%)
Query: 91 QCDYEIEYGDGGSSIGALVTDLFPL 115
QCDYEI+Y DG S+IGAL+ D F L
Sbjct: 28 QCDYEIKYADGASTIGALIVDQFSL 52
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 105/385 (27%), Positives = 158/385 (41%), Gaps = 55/385 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQC-DAPCTGCTKPP--EKQYKPHKNIVPCSNPR 73
V+LTVG PP+ DTGS+L+W+ C AP P Y P +PC++P
Sbjct: 56 LTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHSVFDPLRSSSYSP----IPCTSPT 111
Query: 74 C--AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FG 130
C + P C C I Y D S G L +D F + S +P T FG
Sbjct: 112 CRTRTRDFSIPVSCDK-KKLCHAIISYADASSIEGNLASDTFHIGNS-----AIPATIFG 165
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGD 189
C + + T G++G+ RG +S V+Q+ GL + +CI GQ+ G+L G+
Sbjct: 166 CMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQM---GLQK--FSYCISGQDSSGILLFGE 220
Query: 190 GKVP-SSGVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGLKDLT----L 233
+ +TP++Q S L ++ I +L KS D T
Sbjct: 221 SSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQT 280
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-----DKTLPICWRGPFKA---- 284
+ DSG + + VY + + +R + LK+ D + +C+R P
Sbjct: 281 MVDSGTQFTFLLGPVYTALKNEFVRQTKAS-LKVLEDPNFVFQGAMDLCYRVPLTRRTLP 339
Query: 285 -LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIG 343
L VT F+ +S + R R VP VI G +V SE E+ IIG
Sbjct: 340 PLPTVTLMFRGAEMSVSAERLMYR--VPG----VIRGSDSVYCFTFGNSELLGVESYIIG 393
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDC 368
Q+ + +D K R+G+ C
Sbjct: 394 HHHQQNVWMEFDLAKSRVGFAEVRC 418
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 74/197 (37%), Positives = 98/197 (49%), Gaps = 23/197 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P + F FDTGSDLTW QC+ PC G C + E + P ++ V C +
Sbjct: 89 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCE-PCVGYCYQQREHIFDPSTSLSYSNVSCDS 147
Query: 72 PRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
P C L N P C + C Y I YGDG SIG + L ++ VFN F
Sbjct: 148 PSCEKLESATGNSPGCS--SSTCLYGIRYGDGSYSIGFFARE--KLSLTSTDVFN-NFQF 202
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI--GQNGRGVLF 186
GCG Q+N G TAG+LGL R +S+VSQ ++YG V +C+ + G L
Sbjct: 203 GCG--QNNRGLFG--GTAGLLGLARNPLSLVSQTAQKYG---KVFSYCLPSSSSSTGYLS 255
Query: 187 LGDGKVPSSGVAWTPML 203
G G S V +TP L
Sbjct: 256 FGSGDGDSKAVKFTPRL 272
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 101/367 (27%), Positives = 161/367 (43%), Gaps = 34/367 (9%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNI 66
P + V + +G P K F FDTGSDLTW QC+ GC + ++ P +KN
Sbjct: 135 PTGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDPTTSTSYKN- 193
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
V CS+ C + N P ++ C Y I+YG G +IG L T+ L ++ VF
Sbjct: 194 VSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYGS-GYTIGFLATET--LAIASSDVFKNF 250
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
L FGC ++ + G + T G+LGLGR I++ SQ +N+ +C+ +
Sbjct: 251 L-FGC--SEESRGTFN--GTTGLLGLGRSPIALPSQTTNK--YKNLFSYCLPASPSSTGH 303
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKH-YILGPAELLYSGKSCGLKDLT--LIFDSGASYAY 243
L G S TP+ S LK Y L + G+ + I DSG ++ +
Sbjct: 304 LSFGVEVSQAAKSTPI---SPKLKQLYGLNTVGISVRGRELPINGSISRTIIDSGTTFTF 360
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
S Y + S R+++ L + C+ F +G T +++ F
Sbjct: 361 LPSPTYSALGS-AFREMMAN-YTLTNGTSSFQPCYD--FSNIGNGTLTIPGISIFF---E 413
Query: 304 NSVRLVVPPEAYLV-ISGRKNVCLGILN-GSEAEVGENNIIGEIFMQDKMVIYDNEKQRI 361
V + + ++ ++G K VCL + GS+++ I G + VIYD K +
Sbjct: 414 GGVEVEIDVSGIMIPVNGLKEVCLAFADTGSDSDFA---IFGNYQQKTYEVIYDVAKGMV 470
Query: 362 GWKPEDC 368
G+ P+ C
Sbjct: 471 GFAPKGC 477
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 168/386 (43%), Gaps = 56/386 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCS 70
YF +++ VG PPK F DTGSDL W+QC PC C + Y P +KNI C+
Sbjct: 155 YF-MDVLVGSPPKHFSLILDTGSDLNWIQC-LPCHDCFQQNGAFYDPKASASYKNIT-CN 211
Query: 71 NPRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS----NGSVFNV 125
+PRC + P+PP+ CK N C Y YGD ++ G + F + + + ++NV
Sbjct: 212 DPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNV 271
Query: 126 P-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQ 179
+ FGCG+ N G AG+LGLGRG +S SQL+ L + +C+
Sbjct: 272 ENMMFGCGH--WNRGLFHG--AAGLLGLGRGPLSFSSQLQ--SLYGHSFSYCLVDRNSDT 325
Query: 180 NGRGVLFLGDGK--VPSSGVAWTPMLQNSADL--KHYILGPAELLYSGKSCGLKDLT--- 232
N L G+ K + + +T + +L Y + ++ +G+ + + T
Sbjct: 326 NVSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNI 385
Query: 233 -------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI---CWRGPF 282
I DSG + +YF Y+ I + I G P + PI C F
Sbjct: 386 SSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGK----YPVYRDFPILDPC----F 437
Query: 283 KALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNII 342
G + L ++F + P E + VCL IL ++ +II
Sbjct: 438 NVSGIDSIQLPELGIAFA---DGAVWNFPTENSFIWLNEDLVCLAILGTPKSAF---SII 491
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDC 368
G Q+ ++YD ++ R+G+ P C
Sbjct: 492 GNYQQQNFHILYDTKRSRLGYAPTKC 517
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 95/352 (26%), Positives = 140/352 (39%), Gaps = 34/352 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + +G P + FDTGSDLTW QC+ C K + + P K+ + C++
Sbjct: 145 YFVV-VGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAIFDPSKSTSYSNITCTS 203
Query: 72 PRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
C L N P C C Y I+YGD S+G + + ++ V N F
Sbjct: 204 TLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERLSVTATD-IVDN--FLF 260
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD 189
GCG Q+N G +AG++GLGR IS V Q + R + +C+ L
Sbjct: 261 GCG--QNNQGLFG--GSAGLIGLGRHPISFVQQTA--AVYRKIFSYCLPATSSSTGRLSF 314
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAYF 244
G +S V +TP S Y L + G + T I DSG
Sbjct: 315 GTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFSTGGAIIDSGTVITRL 374
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN 304
Y + S + + P A + L C+ G + SF
Sbjct: 375 PPTAYTALRSAFRQGMSKYP--SAGELSILDTCY----DLSGYEVFSIPKIDFSFA---G 425
Query: 305 SVRLVVPPEAYLVISGRKNVCLGI-LNGSEAEVGENNIIGEIFMQDKMVIYD 355
V + +PP+ L ++ K VCL NG +++V I G + + V+YD
Sbjct: 426 GVTVQLPPQGILYVASAKQVCLAFAANGDDSDV---TIYGNVQQKTIEVVYD 474
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 159/383 (41%), Gaps = 63/383 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F VN +VG+PP DTGSDL WVQC PC C + + P K+ + +P
Sbjct: 59 FLVNFSVGRPPVPQLVGIDTGSDLLWVQC-RPCADCFRQSTPIFDPSKSSTYVDLSYDSP 117
Query: 73 RCAALHWPNPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVPLTFG 130
C PN P+ K+ + +QC Y Y DG +S G L T+ S+ G+V + FG
Sbjct: 118 IC-----PNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFG 172
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-----QNGRGVL 185
CG++ N G +G+LGL G SIVS+L +CIG L
Sbjct: 173 CGHS--NRGRFDGQQ-SGILGLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHNQL 223
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------------ 233
LGDG ++ S+ H G + G S G L +
Sbjct: 224 VLGDG----------VKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQ 273
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP--ICWRGPFKALGQV 288
+ DSG + + + + + I R + G ++ +T+P +C++G + +
Sbjct: 274 GGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIY--RTIPGWLCYKG---RVNED 328
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
F LA F LV+ + V + CL +L + +G ++IG + Q
Sbjct: 329 LRGFPELAFHFA---EGADLVLDANSLFVQKNQDVFCLAVLESNLKNIG--SVIGIMAQQ 383
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
V YD +R+ ++ DC L
Sbjct: 384 HYNVAYDLIGKRVYFQRTDCELL 406
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 159/383 (41%), Gaps = 63/383 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F VN +VG+PP DTGSDL WVQC PC C + + P K+ + +P
Sbjct: 59 FLVNFSVGRPPVPQLVGIDTGSDLLWVQC-RPCADCFRQSTPIFDPSKSSTYVDLSYDSP 117
Query: 73 RCAALHWPNPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVPLTFG 130
C PN P+ K+ + +QC Y Y DG +S G L T+ S+ G+V + FG
Sbjct: 118 IC-----PNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFG 172
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-----QNGRGVL 185
CG++ N G +G+LGL G SIVS+L +CIG L
Sbjct: 173 CGHS--NRGRFDGQQ-SGILGLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHNQL 223
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------------ 233
LGDG ++ S+ H G + G S G L +
Sbjct: 224 VLGDG----------VKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQ 273
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP--ICWRGPFKALGQV 288
+ DSG + + + + + I R + G ++ +T+P +C++G + +
Sbjct: 274 GGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIY--RTIPGWLCYKG---RVNED 328
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
F LA F LV+ + V + CL +L + +G ++IG + Q
Sbjct: 329 LRGFPELAFHFA---EGADLVLDANSLFVQKNQDVFCLAVLESNLKNIG--SVIGIMAQQ 383
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
V YD +R+ ++ DC L
Sbjct: 384 HYNVAYDLIGKRVYFQRTDCELL 406
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 96/364 (26%), Positives = 143/364 (39%), Gaps = 39/364 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C K E + P K+ V C++
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVSCTDS 222
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
CA L + C C Y ++YGDG ++G D + F FGCG
Sbjct: 223 ACADL---DTNGCT--GGHCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFR----FGCG 273
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFLGDG 190
+ N G TAG++GLGRG+ S+ Q Y +C+ G G L G G
Sbjct: 274 --EKNNGLFG--KTAGLMGLGRGKTSLTVQ--AYNKYGGAFAYCLPALTTGTGYLDFGPG 327
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAYFT 245
+ TPML + +Y+ G + G+ + + + DSG
Sbjct: 328 SA-GNNARLTPMLTDKGQTFYYV-GMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLP 385
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
+ Y + S + ++ K AP L C+ F L V ++L F +
Sbjct: 386 ATAYTALSSAFDKVMLARGYKKAPGYSILDTCYD--FTGLSDVE--LPTVSLVF---QGG 438
Query: 306 VRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
L V + VCL NG + V I+G + V+YD K+ +G+
Sbjct: 439 ACLDVDVSGIVYAISEAQVCLAFASNGDDESVA---IVGNTQQKTYGVLYDLGKKTVGFA 495
Query: 365 PEDC 368
P C
Sbjct: 496 PGSC 499
>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 417
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 106/414 (25%), Positives = 169/414 (40%), Gaps = 83/414 (20%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQC---DAPCTGCTKPPEKQYKPHKNIVP----- 68
+ ++L +G PPK+ DTGSDLTWV C C C Y+ +K +
Sbjct: 12 YLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDC-----NDYRNNKLMSTYSPSY 66
Query: 69 --------CSNPRCAALHWPNPP-----------------RCKHPNDQCDYEIEYGDGGS 103
C +P C+ +H + C P Y YG GG
Sbjct: 67 SSSSLRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYT--YGAGGV 124
Query: 104 SIGALVTDLFPLRFSNGS-VFNVP-LTFGC-GYNQHNPGPLSPPDTAGVLGLGRGRISIV 160
IG L D S+ S VP FGC G P G+ G GRG +S+
Sbjct: 125 VIGTLTRDTLTTHGSSPSFTREVPNFCFGCVGSTYREP--------IGIAGFGRGVLSLP 176
Query: 161 SQLREYGLIRNVIGHCI-------GQNGRGVLFLGDGKVPSSG-VAWTPMLQNSADLKHY 212
SQL G ++ HC N L +GD + S+ + +T +L+N +Y
Sbjct: 177 SQL---GFLQKGFSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYY 233
Query: 213 ILGPAELLYSGKSCGLK------------DLTLIFDSGASYAYFTSRVYQEIVSLIMRDL 260
+G E + G + ++ + +I DSG +Y + Y +++S+ ++ +
Sbjct: 234 YIG-LEAITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSM-LQSI 291
Query: 261 IGTPLKLAPDDKT-LPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVIS 319
I P + +T +C+R P VT++ L + N+V LV+P +
Sbjct: 292 ITYPRAQEQEARTGFDLCYRIPCPN-NVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAM 350
Query: 320 GRKN-----VCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
G + CL + N +++ G + G Q+ V+YD EK+RIG++P DC
Sbjct: 351 GAPSNSTVVKCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 404
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 100/387 (25%), Positives = 156/387 (40%), Gaps = 37/387 (9%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP-----EKQYKPHKNIV 67
+ + + V+L+VG PP+ DTGSDL W QC APC C + V
Sbjct: 90 VTNEYLVHLSVGTPPRPVALTLDTGSDLVWTQC-APCLNCFDQGAIPVLDPAASSTHAAV 148
Query: 68 PCSNPRCAALHWPNPPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPL----RFSNGS 121
C P C AL + + R C Y YGD ++G L +D F G
Sbjct: 149 RCDAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGG 208
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
V LTFGCG+ N G +T G+ G GRGR S+ SQL +
Sbjct: 209 VSERRLTFGCGH--FNKGIFQANET-GIAGFGRGRWSLPSQLGVTSFSYCFTSMFESTSS 265
Query: 182 RGVLFLGDGKVPSSG-VAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDLTL 233
L + ++ +G V TP+L++ + LK +G + + L++ +
Sbjct: 266 LVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLREASA 325
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
I DSGAS VY+ + + + +G P+ A + L +C+ P A + ++
Sbjct: 326 IIDSGASITTLPEDVYEAVKAEFVAQ-VGLPVS-AVEGSALDLCFALPSAAAPKSAFGWR 383
Query: 294 PLALSFTNRRNSVRLV----------VPPEAYLVIS-GRKNVCLGILNGSEAEVGENNII 342
RLV +P E Y+ G + +CL +L+ + + +I
Sbjct: 384 WRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCL-VLDAATGGGDQTVVI 442
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDCN 369
G Q+ V+YD E + + P C
Sbjct: 443 GNYQQQNTHVVYDLENDVLSFAPARCE 469
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 95/383 (24%), Positives = 153/383 (39%), Gaps = 56/383 (14%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNPRC 74
V+L +G PP++ DTGS L+W+QC PP + P + +PC++P C
Sbjct: 99 VDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPA-KPPPTASFDPSLSSTFSTLPCTHPVC 157
Query: 75 AAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+ P C N C Y Y DG + G LV + F + S+F PL GC
Sbjct: 158 KPRIPDFTLPTSCDQ-NRLCHYSYFYADGTYAEGNLVREKFTF---SRSLFTPPLILGCA 213
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGVL 185
+P G+LG+ RGR+S SQ + +C+ G G
Sbjct: 214 TESTDP--------RGILGMNRGRLSFASQSKI-----TKFSYCVPTRVTRPGYTPTGSF 260
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAE--LLYSGKSCGLKDLTL---------- 233
+LG S+ + ML + + L P + G G + L +
Sbjct: 261 YLGHNP-NSNTFRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAG 319
Query: 234 -----IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
+ DSG+ + Y + Y ++ + ++R + K +C+ G +G++
Sbjct: 320 GSGQTMLDSGSEFTYLVNEAYDKVRAEVVRAVGPRMKKGYVYGGVADMCFDGNAIEIGRL 379
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
+ F V++VVP E L C+GI N S+ +NIIG Q
Sbjct: 380 ---IGDMVFEF---EKGVQIVVPKERVLATVEGGVHCIGIAN-SDKLGAASNIIGNFHQQ 432
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
+ V +D +R+G+ DC+ L
Sbjct: 433 NLWVEFDLVNRRMGFGTADCSRL 455
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 150/365 (41%), Gaps = 41/365 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
+ V++ +G P + FDTGSDL+WVQC PC C K + + P ++ + P C A
Sbjct: 188 YIVSVGLGTPRRDLLVVFDTGSDLSWVQCK-PCNNCYKQHDPLFDPSQSTTYSAVP-CGA 245
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
+ C + +C YE+ YGD + G L D L S+ + FGCG
Sbjct: 246 QECLDSGTCS--SGKCRYEVVYGDMSQTDGNLARDTLTLGPSSDQLQG--FVFGCG--DD 299
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCIGQNGR--GVLFLGDGKVP 193
+ G D G+ GLGR R+S+ SQ YG +C+ + R G L LG P
Sbjct: 300 DTGLFGRAD--GLFGLGRDRVSLASQAAARYGA---GFSYCLPSSWRAEGYLSLGSAAAP 354
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYAYFTSRV 248
+T M+ S Y L + +G++ + K + DSG SR
Sbjct: 355 PH-AQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAPGTVIDSGTVITRLPSRA 413
Query: 249 YQEIVSL---IMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
Y + S MR K AP L C + G+ +AL F
Sbjct: 414 YSALRSSFAGFMRR-----YKRAPALSILDTC----YDFTGRTKVQIPSVALLFD---GG 461
Query: 306 VRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
L + L ++ R CL NG + VG I+G + + V+YD Q+IG+
Sbjct: 462 ATLNLGFGGVLYVANRSQACLAFASNGDDTSVG---ILGNMQQKTFAVVYDLANQKIGFG 518
Query: 365 PEDCN 369
+ C+
Sbjct: 519 AKGCS 523
>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
Length = 434
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 106/414 (25%), Positives = 169/414 (40%), Gaps = 83/414 (20%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQC---DAPCTGCTKPPEKQYKPHKNIVP----- 68
+ ++L +G PPK+ DTGSDLTWV C C C Y+ +K +
Sbjct: 29 YLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDC-----NDYRNNKLMSTYSPSY 83
Query: 69 --------CSNPRCAALHWPNPP-----------------RCKHPNDQCDYEIEYGDGGS 103
C +P C+ +H + C P Y YG GG
Sbjct: 84 SSSSLRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYT--YGAGGV 141
Query: 104 SIGALVTDLFPLRFSNGS-VFNVP-LTFGC-GYNQHNPGPLSPPDTAGVLGLGRGRISIV 160
IG L D S+ S VP FGC G P G+ G GRG +S+
Sbjct: 142 VIGTLTRDTLTTHGSSPSFTREVPNFCFGCVGSTYREP--------IGIAGFGRGVLSLP 193
Query: 161 SQLREYGLIRNVIGHCI-------GQNGRGVLFLGDGKVPSSG-VAWTPMLQNSADLKHY 212
SQL G ++ HC N L +GD + S+ + +T +L+N +Y
Sbjct: 194 SQL---GFLQKGFSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYY 250
Query: 213 ILGPAELLYSGKSCGLK------------DLTLIFDSGASYAYFTSRVYQEIVSLIMRDL 260
+G E + G + ++ + +I DSG +Y + Y +++S+ ++ +
Sbjct: 251 YIG-LEAITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSM-LQSI 308
Query: 261 IGTPLKLAPDDKT-LPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVIS 319
I P + +T +C+R P VT++ L + N+V LV+P +
Sbjct: 309 ITYPRAQEQEARTGFDLCYRIPCPN-NVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAM 367
Query: 320 GRKN-----VCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
G + CL + N +++ G + G Q+ V+YD EK+RIG++P DC
Sbjct: 368 GAPSNSTVVKCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 421
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 96/369 (26%), Positives = 149/369 (40%), Gaps = 40/369 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P + FDTGSDLTW QC+ PC G C K + + P K+ + C++
Sbjct: 46 YVVVVGLGTPKRDLSLVFDTGSDLTWTQCE-PCAGSCYKQQDAIFDPSKSSSYTNITCTS 104
Query: 72 PRCAALHWPN-PPRCKHPND-QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
C L C D C Y+ +YGD +S+G L + + ++ F
Sbjct: 105 SLCTQLTSDGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQERLTITATD---IVDDFLF 161
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFL 187
GCG Q N G + +AG++GLGR ISIV Q + +C+ + G L
Sbjct: 162 GCG--QDNEGLFNG--SAGLMGLGRHPISIVQQTSSN--YNKIFSYCLPATSSSLGHLTF 215
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSG------KSCGLKDLTLIFDSGASY 241
G ++ + +TP+ S D Y L + G S I DSG
Sbjct: 216 GASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTVI 275
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP-LALSFT 300
VY + S R + P +A + L C+ L E P + F+
Sbjct: 276 TRLAPTVYAALRSAFRRXMEKYP--VANEAGLLDTCYD-----LSGYKEISVPRIDFEFS 328
Query: 301 NRRNSVRLVVPPEAYLVISGRKNVCLGI-LNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
V + + L + + VCL NGS+ ++ + G + + V+YD +
Sbjct: 329 ---GGVTVELXHRGILXVESEQQVCLAFAANGSDNDI---TVFGNVQQKTLEVVYDVKGG 382
Query: 360 RIGWKPEDC 368
RIG+ C
Sbjct: 383 RIGFGAAGC 391
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 110/380 (28%), Positives = 157/380 (41%), Gaps = 60/380 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKN----IVPCS 70
+ V L +G P DTGSDL+WVQC PC P+K + P K+ +PC+
Sbjct: 125 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCNASDCYPQKDPLFDPSKSSTFATIPCA 183
Query: 71 NPRCAAL---HWPNPPRCKHPND----QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ C L + N C + QC Y IEYG+G + G T+ L S
Sbjct: 184 SDACKQLPVDGYDN--GCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLAL---GSSAV 238
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIG--QN 180
FGCG +QH GP D G+LGLG S+VSQ YG +C+ +
Sbjct: 239 VKSFRFGCGSDQH--GPYDKFD--GLLGLGGAPESLVSQTASVYG---GAFSYCLPPLNS 291
Query: 181 GRGVLFLG---DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---- 233
G G L LG +SG +TPM S + + + + +G S G K L +
Sbjct: 292 GAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYV----VTLTGISVGGKALDIPPAV 347
Query: 234 -----IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
I DSG + Y+ + + + PL L P D L C+ F G V
Sbjct: 348 FAKGNIVDSGTVITGIPTTAYKALRTAFRSAMAEYPL-LPPADSALDTCYN--FTGHGTV 404
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
T +AL+F +V L VP + CL + + G IIG + +
Sbjct: 405 T--VPKVALTFVGGA-TVDLDVPSGVLV------EDCLAFADAGDGSFG---IIGNVNTR 452
Query: 349 DKMVIYDNEKQRIGWKPEDC 368
V+YD+ K +G++ C
Sbjct: 453 TIEVLYDSGKGHLGFRAGAC 472
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 108/397 (27%), Positives = 169/397 (42%), Gaps = 58/397 (14%)
Query: 1 MYVSWIEFFFFPIFS---YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE 57
M V ++ P+++ F + + +G P F DTGSDLTW QC PCT C P
Sbjct: 96 MSVDEVKAVEAPVYAGNGEFLMKMAIGTPSLSFSAILDTGSDLTWTQCK-PCTDCYPQPT 154
Query: 58 KQYKPHKNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLF 113
Y P ++ VPCS+ C AL P C+Y YGD S+ G L + F
Sbjct: 155 PIYDPSQSSTYSKVPCSSSMCQAL-----PMYSCSGANCEYLYSYGDQSSTQGILSYESF 209
Query: 114 PLRFSNGSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNV 172
L ++P + FGCG Q N G G++G GRG +S++SQL + + N
Sbjct: 210 TLTSQ-----SLPHIAFGCG--QENEG-GGFSQGGGLVGFGRGPLSLISQLGQS--LGNK 259
Query: 173 IGHCI-----GQNGRGVLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC 226
+C+ + LF+G + + V+ TP++Q+ + Y L + G+
Sbjct: 260 FSYCLVSITDSPSKTSPLFIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLL 319
Query: 227 GLKDLT----------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI 276
+ D T +I DSG + Y Y ++V + I P ++ + L +
Sbjct: 320 DIADGTFDLQLDGTGGVIIDSGTTVTYLEQSGY-DVVKKAVISSINLP-QVDGSNIGLDL 377
Query: 277 CWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGIL--NGSEA 334
C+ G T +F + F +P E Y+ CL +L NG
Sbjct: 378 CFE---PQSGSSTSHFPTITFHF----EGADFNLPKENYIYTDSSGIACLAMLPSNG--- 427
Query: 335 EVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
+I G I Q+ ++YDNE+ + + P C+TL
Sbjct: 428 ----MSIFGNIQQQNYQILYDNERNVLSFAPTVCDTL 460
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 92/356 (25%), Positives = 150/356 (42%), Gaps = 57/356 (16%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALHWPNPPRCKHP-- 88
DTGSD+TW+QCD PC C K + ++P + +PC++ C L H
Sbjct: 6 DTGSDITWIQCD-PCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQ-----SFSHSCL 59
Query: 89 NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCGYNQHNPGPLSPPDTA 147
N C+Y + YGD ++ G + LR + + +VP FGCG+ N G + A
Sbjct: 60 NSSCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGH--ANKGLFN--GAA 115
Query: 148 GVLGLGRGRISIVSQLR-EYGLIRNVIGHCIGQNG----RGVLFLGDGKVPSSGVAWTPM 202
G++GLG+ I +Q +G V +C+ G+L G+ + V +TP+
Sbjct: 116 GLMGLGKSSIGFPAQTSVAFG---KVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPL 172
Query: 203 LQNSADLKHYILGPAELLYSGKSCGLKD------LTLIFDSGASYAYFTSRVYQEIVSLI 256
+ +S+ GP++ S + D T++ DSG + F Y+ +
Sbjct: 173 VDSSS-------GPSQYFVSMTGINVGDELLPISATVMVDSGTVISRFEQSAYERLRDAF 225
Query: 257 MRDLIG--TPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL-ALSFTNRRNSVRLVVPPE 313
+ L G T + +AP D C+R + V + PL L F R+ L + P
Sbjct: 226 TQILPGLQTAVSVAPFDT----CFR-----VSTVDDINIPLITLHF---RDDAELRLSPV 273
Query: 314 AYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
L +C S +++G Q+ +YD K R+G +CN
Sbjct: 274 HILYPVDDGVMCFAFAPSSSGR----SVLGNFQQQNLRFVYDIPKSRLGISAFECN 325
>gi|213998800|gb|ACJ60767.1| nucellin [Hordeum marinum subsp. marinum]
Length = 142
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 54/138 (39%), Positives = 76/138 (55%), Gaps = 5/138 (3%)
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVLFLGD 189
CGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL++G+
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGN 60
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYFTSRV 248
PS GV W PM ++S +Y G AELL + G +FDSG++Y S++
Sbjct: 61 FNPPSRGVTWVPMRESSF---YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTLVPSQI 117
Query: 249 YQEIVSLIMRDLIGTPLK 266
Y EIVS + L + L+
Sbjct: 118 YNEIVSKVRGTLSESSLE 135
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 155/391 (39%), Gaps = 55/391 (14%)
Query: 5 WIEFFFFPIFSYFAV-------NLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT-GCTKPP 56
W+ P+ S +V L +G P + D+GS LTW+QC APC C
Sbjct: 89 WVAASSVPLASGASVGVGNYITRLGLGTPTTTYVMVVDSGSSLTWLQC-APCAVSCHPQA 147
Query: 57 EKQYKPHKN----IVPCSNPRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVT 110
Y P + VPCS P+CA L NP C + C Y+ YGDG S G L
Sbjct: 148 GPLYDPRASSTYAAVPCSAPQCAELQAATLNPSSCSG-SGVCQYQASYGDGSFSFGYLSK 206
Query: 111 DLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR 170
D L S+GS +GCG Q N G AG++GL R ++S++SQL +
Sbjct: 207 DTVSLS-SSGSFPG--FYYGCG--QDNVGLFG--RAAGLIGLARNKLSLLSQLAPS--VG 257
Query: 171 NVIGHCI---GQNGRGVLFLG---DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK 224
N +C+ G L G D K P ++T M+ +S D Y + A + +G
Sbjct: 258 NSFAYCLPTSAAASAGYLSFGSNSDNKNPGK-YSYTSMVSSSLDASLYFVSLAGMSVAGS 316
Query: 225 -----SCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR 279
S L I DSG + VY + + L AP L C++
Sbjct: 317 PLAVPSSEYGSLPTIIDSGTVITRLPTPVYTALSKAVGAALA---APSAPAYSILQTCFK 373
Query: 280 GPFKALGQVTEYFKP-LALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGE 338
GQV + P + ++F L + P LV CL A
Sbjct: 374 ------GQVAKLPVPAVNMAFA---GGATLRLTPGNVLVDVNETTTCLAF-----APTDS 419
Query: 339 NNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
IIG Q V+YD + RIG+ C+
Sbjct: 420 TAIIGNTQQQTFSVVYDVKGSRIGFAAGGCS 450
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 92/366 (25%), Positives = 151/366 (41%), Gaps = 34/366 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +G PP DT SDL WVQC +PC C ++PHK+ + C +
Sbjct: 90 YLMRFYIGTPPVERLAIADTASDLIWVQC-SPCETCFPQDTPLFEPHKSSTFANLSCDSQ 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C + N C + C Y YGDG S+ G L T+ + F + +V FGCG
Sbjct: 149 PCTS---SNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTE--SIHFGSQTVTFPKTIFGCG 203
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFLG 188
N +S T G++GLG G +S+VSQL + I + +C+ + + F
Sbjct: 204 SNNDFMHQISNKVT-GIVGLGAGPLSLVSQLGDQ--IGHKFSYCLLPFTSTSTIKLKFGN 260
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL-----TLIFDSGASYAY 243
D + +GV TP++ + +Y L + K ++ +I D G Y
Sbjct: 261 DTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTNGNIIIDLGTVLTY 320
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
Y V+L +R+ +G + DD P + P Q F + FT +
Sbjct: 321 LEVNFYHNFVTL-LREALG--ISETKDDIPYPFDFCFP----NQANITFPKIVFQFTGAK 373
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
+ P + +CL +L A+ ++ G + D V YD + +++ +
Sbjct: 374 ---VFLSPKNLFFRFDDLNMICLAVLPDFYAK--GFSVFGNLAQVDFQVEYDRKGKKVSF 428
Query: 364 KPEDCN 369
P DC+
Sbjct: 429 APADCS 434
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 103/386 (26%), Positives = 160/386 (41%), Gaps = 57/386 (14%)
Query: 12 PIFSYFAVNLTVGKPP-KLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---- 66
P + + +VG PP KL+ DTGSD+ W+QC+ PC C + P K+
Sbjct: 82 PDIGEYLMTYSVGTPPFKLYGI-VDTGSDIVWLQCE-PCQECYNQTTPMFNPSKSSSYKN 139
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
+PC + C ++ C N C+Y YGD S G L D L +NG + P
Sbjct: 140 IPCPSKLCQSME---DTSCNDKN-YCEYSTYYGDNSHSGGDLSVDTLTLESTNGLTVSFP 195
Query: 127 -LTFGCGYNQHNPGPLS-PPDTAGVLGLGRGRISIVSQLR-------EYGLIRNVIGHCI 177
+ GCG N LS ++G++G G G S ++QL Y L I
Sbjct: 196 NIVIGCGTNN----ILSYEGASSGIVGFGSGPASFITQLGSSTGGKFSYCLTPLFSVTNI 251
Query: 178 GQNGRGVLFLGD-GKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKD 230
N L GD V GV TP+L+ + +Y+ +G + G G +
Sbjct: 252 QSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGVPNGDNE 311
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTLPICWRGPFKALGQ-- 287
+I DSG + T Y + S ++ DL+ L+ D +TL +C+ KA G
Sbjct: 312 GNIIIDSGTTLTSLTKDDYSFLESAVV-DLV--KLERVDDPTQTLNLCYS--VKAEGYDF 366
Query: 288 --VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEI 345
+T +FK + + P + V CL + ++ I G +
Sbjct: 367 PIITMHFK-----------GADVDLHPISTFVSVADGVFCLAFESSQ-----DHAIFGNL 410
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTL 371
Q+ MV YD +++ + +KP DC +
Sbjct: 411 AQQNLMVGYDLQQKIVSFKPSDCTKV 436
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 103/390 (26%), Positives = 159/390 (40%), Gaps = 72/390 (18%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V+L VG PP+ DTGSDL W QC APC C P+ + P + + C+
Sbjct: 104 YLVDLAVGTPPQPVSALLDTGSDLIWTQC-APCASCLPQPDPIFSPGASSSYEPMRCAGE 162
Query: 73 RC-AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL----RFSNGSVFNVPL 127
C LH C+ P D C Y YGDG ++ G T+ F + + PL
Sbjct: 163 LCNDILHH----SCQRP-DTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPL 217
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGR--- 182
FGCG N G L+ + +G++G GR +S+VSQL IR +C+ +GR
Sbjct: 218 GFGCG--TMNKGSLN--NGSGIVGFGRAPLSLVSQL----AIRR-FSYCLTPYASGRKST 268
Query: 183 ---GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------ 233
G L G ++ V T +L++ + Y + ++G + G + L +
Sbjct: 269 LLFGSLRGGVYDAATATVQTTRLLRSRQNPTFYYVP-----FTGVTVGARRLRIPISAFA 323
Query: 234 ---------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKL----APDDKTLPICWRG 280
I DSG + F + V E+V R + P PDD +C+
Sbjct: 324 LRPDGSGGAIVDSGTALTLFPAPVLAEVVR-AFRSQLRLPFAANGSSGPDDG---VCF-- 377
Query: 281 PFKALGQVTEYFKPLAL-SFTNRRNSVRLVVPPEAYLVISGRK-NVCLGILNGSEAEVGE 338
+ +P + L +P Y++ RK N+CL + + ++
Sbjct: 378 ----AAAASRVPRPAVVPRMVFHLQGADLDLPRRNYVLDDQRKGNLCLLLADSGDS---- 429
Query: 339 NNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
IG QD V+YD E + + P C
Sbjct: 430 GTTIGNFVQQDMRVLYDLEADTLSFAPAQC 459
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 155/370 (41%), Gaps = 42/370 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ + LT+G PP+ FD DTGSDL WVQC PC C + P ++ P K+ C++
Sbjct: 39 YLMTLTLGSPPQSFDVIVDTGSDLNWVQC-LPCRVCYQQPGPKFDPSKSRSFRKAACTDN 97
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C P C + C Y+ YGD ++ G L + L G+ FGCG
Sbjct: 98 LCNVSALP-LKACAA--NVCQYQYTYGDQSNTNGDLAFETISLNNGAGTQSVPNFAFGCG 154
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC-IGQNGRGVLFLGDGK 191
N G + AG++GLG+G +S+ SQL N +C + N L G
Sbjct: 155 --TQNLGTFA--GAAGLVGLGQGPLSLNSQLSH--TFANKFSYCLVSLNSLSASPLTFGS 208
Query: 192 VPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----------IFDSGA 239
+ ++ + +T ++ N+ +Y + + G+ L I DSG
Sbjct: 209 IAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQSTGRGGTIIDSGT 268
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY-FKPLALS 298
+ T Y ++ + P +L L +C+ + V + FK
Sbjct: 269 TITMLTLPAYSAVLR-AYESFVNYP-RLDGSAYGLDLCFNIAGVSNPSVPDMVFKFQGAD 326
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
F R ++ ++V A +CL + GS+ +IIG I Q+ +V+YD E
Sbjct: 327 FQMRGENLFVLVDTSA-------TTLCLA-MGGSQGF----SIIGNIQQQNHLVVYDLEA 374
Query: 359 QRIGWKPEDC 368
++IG+ DC
Sbjct: 375 KKIGFATADC 384
>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 527
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 99/385 (25%), Positives = 154/385 (40%), Gaps = 60/385 (15%)
Query: 13 IFSYFA-VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-------------- 57
+F Y N++VG P + DTGSDL W+ C+ CT C +
Sbjct: 108 LFGYLHFANVSVGTPASSYLVALDTGSDLFWLPCN--CTKCVHGIQLSTGQKIAFNIYDN 165
Query: 58 KQYKPHKNIVPCSNPRCAALHWPNPPRCKHPND-QCDYEIEY-GDGGSSIGALVTDLFPL 115
K+ KN V C++ C +C + C Y++EY + S+ G LV D+ L
Sbjct: 166 KESSTSKN-VACNSSLC-----EQKTQCSSSSGGTCPYQVEYLSENTSTTGFLVEDVLHL 219
Query: 116 RFSNGSVF---NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNV 172
N N +TFGCG Q L G+ GLG +S+ S L + GL N
Sbjct: 220 ITDNDDQTQHANPLITFGCGQVQ-TGAFLDGAAPNGLFGLGMSDVSVPSILAKQGLTSNS 278
Query: 173 IGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT 232
C +G G + GD S TP Y + +++ G S L +
Sbjct: 279 FSMCFAADGLGRITFGDNN-SSLDQGKTP-FNIRPSHSTYNITVTQIIVGGNSADL-EFN 335
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA------PDDKTLPICWRGPFKALG 286
IFD+G S+ Y + Y++I + +KL DD C+
Sbjct: 336 AIFDTGTSFTYLNNPAYKQIT-----QSFDSKIKLQRHSFSNSDDLPFEYCYD------L 384
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN---VCLGILNGSEAEVGENNIIG 343
+ + + ++ T + V+ P ++ SG N +CL +L + NIIG
Sbjct: 385 RTNQTIEVPNINLTMKGGDNYFVMDP---IITSGGGNNGVLCLAVLKSNNV-----NIIG 436
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDC 368
+ FM +++D E +GWK +C
Sbjct: 437 QNFMTGYRIVFDRENMTLGWKESNC 461
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 94/372 (25%), Positives = 155/372 (41%), Gaps = 40/372 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V L VG P + F DTGSDLTWV+C PP + ++P + +PCS+
Sbjct: 116 YF-VKLRVGTPVQEFTLVADTGSDLTWVKCAG-----ASPPGRVFRPKTSRSWAPIPCSS 169
Query: 72 PRCAALHWP-NPPRCKHPNDQCDYEIEYGDGGSSIGALV-TDLFPLRFSNGSVFNVP-LT 128
C L P C P C Y+ Y +G + +V T+ + G V + +
Sbjct: 170 DTC-KLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKDVV 228
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREY---GLIRNVIGHCIGQNGRGVL 185
GC + H+ D GVL LG +IS +Q ++ H +N G L
Sbjct: 229 LGCS-SSHDGQSFRSAD--GVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNATGYL 285
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-------KDLTLIFDSG 238
G G+VP + T + + ++ Y + + +GK+ + K +I DSG
Sbjct: 286 AFGPGQVPRTPATQTKLFLDP-EMPFYGVKVDAIHVAGKALDIPAEVWDAKSGGVILDSG 344
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTP-LKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
+ + Y+ +V+ + + L G P + P + R P E LA+
Sbjct: 345 NTLTVLAAPAYKAVVAALSKHLDGVPKVSFPPFEHCYNWTARRP-----GAPEIIPKLAV 399
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNE 357
F S RL P ++Y++ C+G+ E E ++IG I Q+ + +D +
Sbjct: 400 QFA---GSARLEPPAKSYVIDVKPGVKCIGV---QEGEWPGLSVIGNIMQQEHLWEFDLK 453
Query: 358 KQRIGWKPEDCN 369
++ +K +C
Sbjct: 454 NMQVRFKQSNCT 465
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 101/388 (26%), Positives = 156/388 (40%), Gaps = 59/388 (15%)
Query: 4 SWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH 63
S IE + + +N+ +G P DTGSDL W QC+ PCT C P + P
Sbjct: 83 SGIETPVYAGSGEYLMNVAIGTPASSLSAIMDTGSDLIWTQCE-PCTQCFSQPTPIFNPQ 141
Query: 64 K----NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN 119
+ +PC + C L P ND C Y YGDG S+ G + T+ F F
Sbjct: 142 DSSSFSTLPCESQYCQDL-----PSESCYND-CQYTYGYGDGSSTQGYMATETF--TFET 193
Query: 120 GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-- 177
SV N+ FGCG + G + AG++G+G G +S+ SQL +C+
Sbjct: 194 SSVPNI--AFGCGEDNQGFG---QGNGAGLIGMGWGPLSLPSQLG-----VGQFSYCMTS 243
Query: 178 -GQNGRGVLFLGDGK--VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-- 232
G + L LG VP G T ++ +S + +Y + + G + G+ T
Sbjct: 244 SGSSSPSTLALGSAASGVP-EGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQ 302
Query: 233 --------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGP 281
+I DSG + Y Y + + L+P D++ L C++ P
Sbjct: 303 LQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQ-----INLSPVDESSSGLSTCFQLP 357
Query: 282 FK-ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENN 340
+ QV E N L+ P E +CL + + S+ + +
Sbjct: 358 SDGSTVQVPEISMQFDGGVLNLGEENVLISPAEGV--------ICLAMGSSSQQGI---S 406
Query: 341 IIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
I G I Q+ V+YD + + + P C
Sbjct: 407 IFGNIQQQETQVLYDLQNLAVSFVPTQC 434
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 95/366 (25%), Positives = 146/366 (39%), Gaps = 42/366 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V++ +G P K + FDTGSDL+WVQC PC C + + + P + V C P
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCK-PCADCYEQQDPLFDPSLSSTYAAVACGAP 207
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C L C + +C YE++YGD + G LV D L S+ +P FGC
Sbjct: 208 ECQELDASG---CSS-DSRCRYEVQYGDQSQTDGNLVRDTLTLSASD----TLPGFVFGC 259
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCIGQNGRGVLFLGDG 190
G N G D G+ GLGR ++S+ SQ YG +C+ + G +L G
Sbjct: 260 G--DQNAGLFGQVD--GLFGLGREKVSLPSQGAPSYG---PGFTYCLPSSSSGRGYLSLG 312
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL------KDLTLIFDSGASYAYF 244
P + +T L + A Y + + G++ + + DSG
Sbjct: 313 GAPPANAQFT-ALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRL 371
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN 304
R Y + + R + K AP L C+ G T + L+F
Sbjct: 372 PPRAYAPLRAAFARSM--AQYKKAPALSILDTCY----DFTGHRTAQIPTVELAFA---G 422
Query: 305 SVRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
+ + L +S CL N ++ + I+G + V YD QRIG+
Sbjct: 423 GATVSLDFTGVLYVSKVSQACLAFAPNADDSSIA---ILGNTQQKTFAVTYDVANQRIGF 479
Query: 364 KPEDCN 369
+ C+
Sbjct: 480 GAKGCS 485
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 95/366 (25%), Positives = 146/366 (39%), Gaps = 42/366 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V++ +G P K + FDTGSDL+WVQC PC C + + + P + V C P
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCK-PCADCYEQQDPLFDPSLSSTYAAVACGAP 207
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C L C + +C YE++YGD + G LV D L S+ +P FGC
Sbjct: 208 ECQELDASG---CSS-DSRCRYEVQYGDQSQTDGNLVRDTLTLSASD----TLPGFVFGC 259
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCIGQNGRGVLFLGDG 190
G N G D G+ GLGR ++S+ SQ YG +C+ + G +L G
Sbjct: 260 G--DQNAGLFGQVD--GLFGLGREKVSLPSQGAPSYG---PGFTYCLPSSSSGRGYLSLG 312
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL------KDLTLIFDSGASYAYF 244
P + +T L + A Y + + G++ + + DSG
Sbjct: 313 GAPPANAQFT-ALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRL 371
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN 304
R Y + + R + K AP L C+ G T + L+F
Sbjct: 372 PPRAYAPLRAAFARSM--AQYKKAPALSILDTCY----DFTGHRTAQIPTVELAFA---G 422
Query: 305 SVRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
+ + L +S CL N ++ + I+G + V YD QRIG+
Sbjct: 423 GATVSLDFTGVLYVSKVSQACLAFAPNADDSSIA---ILGNTQQKTFAVAYDVANQRIGF 479
Query: 364 KPEDCN 369
+ C+
Sbjct: 480 GAKGCS 485
>gi|168021169|ref|XP_001763114.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685597|gb|EDQ71991.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 641
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 122/309 (39%), Gaps = 64/309 (20%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC--TKPPEKQYKPHKNI-VPCSNPR 73
+ V + VGK KLF F DTGS +W+ C P P Y P K + V C +P
Sbjct: 126 YYVKMRVGKSKKLFHFLIDTGSQPSWLHCKWPAIEKHPVAGPNGMYVPEKEVQVDCRSPE 185
Query: 74 CAALHW--------PNPPRCKHPND-QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
C +L N C PND +C Y+I Y D G V D+ L G +
Sbjct: 186 CLSLQRIPSNFNNIRNLFPCNEPNDWRCTYDITYLDRSHLRGFYVQDVVSLATLEGEQLD 245
Query: 125 VPLTFGCGYNQHNPGPL-------------------SPPDTAGVLGLGRGRISIVSQLRE 165
+T G H P SP T G+LGL +G S VSQL+
Sbjct: 246 AKITLGYATPNHRAAPFGFCSWHASSDRYGEEELERSPLTTDGLLGLNKGTESFVSQLKR 305
Query: 166 YGLI-RNVIGHCIG-------QNGRGVLFLGDGKVPSS-GVAWTPMLQNSAD-----LKH 211
G I +V+GHC + G +F G K+ S + W+PM ++D +K
Sbjct: 306 QGAISSHVVGHCFRSLDTTDFETNSGFMFFGKSKLLDSLPITWSPMASPTSDGFILVVKL 365
Query: 212 YILGP---------AELLYS--GKSCGLKDLTL--------IFDSGASYAYFTSRVYQEI 252
+ P AE LY K L +L+L I DSG++ + +Y I
Sbjct: 366 KVPLPLKRDGQSSIAEYLYKVYVKKIKLGELSLEMTDKSNIIIDSGSTTTHILDSIYNPI 425
Query: 253 VSLIMRDLI 261
+ + +
Sbjct: 426 RDEVAKQAL 434
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 102/387 (26%), Positives = 164/387 (42%), Gaps = 63/387 (16%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSN 71
Y N T+G PP+ D +L W QC C+ C K + P+ + PC
Sbjct: 66 YNVANFTIGTPPQPASAIIDVAGELVWTQCSM-CSRCFKQDLPLFVPNASSTFRPEPCGT 124
Query: 72 PRCAALHWPNPPRCKHPNDQCDYE--IEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
C ++ P ++ C YE I GG ++G + TD F + + S L F
Sbjct: 125 DACKSI-----PTSNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIGTATAS-----LGF 174
Query: 130 GC----GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
GC G + GP +G++GLGR S+VSQ+ + H G+N R L
Sbjct: 175 GCVVASGIDTMG-GP------SGLIGLGRAPSSLVSQMNITKFSYCLTPHDSGKNSR--L 225
Query: 186 FLGDGKVPSSG--VAWTPMLQNSA--DLKHYILGPAELLYSGKSCGLKDL-------TLI 234
LG + G TP ++ S D+ Y P +L G G + T++
Sbjct: 226 LLGSSAKLAGGGNSTTTPFVKTSPGDDMSQYY--PIQL--DGIKAGDAAIALPPSGNTVL 281
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLK--LAPDDKTLPICWRGPFKALGQVTEYF 292
+ A ++ YQ + + + + P L P D +C+ P L +
Sbjct: 282 VQTLAPMSFLVDSAYQALKKEVTKAVGAAPTATPLQPFD----LCF--PKAGLSNASAP- 334
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRK--NVCLGILNGS---EAEVGEN-NIIGEIF 346
L FT ++ + L VPP YL+ G + VC+ IL+ S + EN NI+G +
Sbjct: 335 ---DLVFTFQQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQ 391
Query: 347 MQDKMVIYDNEKQRIGWKPEDCNTLLS 373
++ + D EK+ + ++P DC++L+S
Sbjct: 392 QENTHFLLDLEKKTLSFEPADCSSLIS 418
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 104/384 (27%), Positives = 163/384 (42%), Gaps = 50/384 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH-----KNIVPCS 70
YF +++ VG PPK F DTGSDL W+QC PC C E Y P KNI C+
Sbjct: 162 YF-MDVLVGTPPKHFSLILDTGSDLNWLQC-LPCYDCFHQNEAFYDPKTSASFKNIT-CN 218
Query: 71 NPRCAALHWPNPP-RCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN----GSVFNV 125
+PRC+ + P PP +CK N C Y YGD ++ G + F + + S + V
Sbjct: 219 DPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKV 278
Query: 126 P-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQ 179
+ FGCG+ N G S LG G S SQL+ L + +C+
Sbjct: 279 ENMMFGCGH--WNRGLFSGASGLLGLGRGPLSFS--SQLQ--SLYGHSFSYCLVDRNSDT 332
Query: 180 NGRGVLFLGDGK--VPSSGVAWTPML---QNSADLKHYILGPAELLYSGKSCGLKDLT-- 232
N L G+ K + + + +T + +NS + +YI + +L G++ + + T
Sbjct: 333 NVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKS-ILVGGEALDIPEETWN 391
Query: 233 --------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKA 284
I DSG + +YF Y EI+ + + + D L C+
Sbjct: 392 ISPDGAGGTIIDSGTTLSYFAEPAY-EIIKNKFAEKMKENYLVFRDFPVLDPCFN--VSG 448
Query: 285 LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGE 344
+ + + L ++F + P E + VCL IL ++ +IIG
Sbjct: 449 IEENNIHLPELGIAFA---DGAVWNFPAENSFIWLSEDLVCLAILGTPKSTF---SIIGN 502
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDC 368
Q+ ++YD + R+G+ P C
Sbjct: 503 YQQQNFHILYDTKMSRLGFTPTKC 526
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 99/365 (27%), Positives = 146/365 (40%), Gaps = 38/365 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P ++ V C+ P
Sbjct: 180 YVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAP 239
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L N C C Y ++YGDG SIG D L S ++ F G
Sbjct: 240 ACSDL---NIHGCS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 289
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVL-FLG 188
+ N G + AG+LGLGRG+ S+ V +YG V HC+ G G L F
Sbjct: 290 CGERNEGLFG--EAAGLLGLGRGKTSLPVQTYDKYG---GVFAHCLPARSTGTGYLDFGA 344
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAY 243
+S TPML ++ +Y+ G + G+ + I DSG
Sbjct: 345 GSLAAASARLTTPMLTDNGPTFYYV-GMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITR 403
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
Y + + K AP L C+ F + QV ++L F +
Sbjct: 404 LPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCY--DFTGMSQVA--IPTVSLLF---Q 456
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
RL V + + VCL + + G+ I+G ++ V YD K+ +G+
Sbjct: 457 GGARLDVDASGIMYAASASQVCLAF--AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 514
Query: 364 KPEDC 368
P C
Sbjct: 515 YPGAC 519
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 155/381 (40%), Gaps = 54/381 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V+ +G PP+ F D+GSDL WVQC +PC C Y P + VPC +
Sbjct: 64 YF-VDFFLGTPPQKFSLIVDSGSDLLWVQC-SPCRQCYAQDSPLYVPSNSSTFSPVPCLS 121
Query: 72 PRCAALHWPN--PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV---P 126
C + P ++P C YE Y D SS G + + +V V
Sbjct: 122 SDCLLIPATEGFPCDFRYPG-ACAYEYLYADTSSSKGVFA-------YESATVDGVRIDK 173
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQ-----N 180
+ FGCG + N G + GVLGLG+G +S SQ+ YG N +C+ +
Sbjct: 174 VAFGCGSD--NQGSFAA--AGGVLGLGQGPLSFGSQVGYAYG---NKFAYCLVNYLDPTS 226
Query: 181 GRGVLFLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------ 233
L GD + + + +TP++ N Y + ++ GKS + D
Sbjct: 227 VSSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLG 286
Query: 234 ----IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
IFDSG + Y+ Y I++ G A + L +C ++T
Sbjct: 287 NGGSIFDSGTTLTYWFPSAYSHILAAFDS---GVHYPRAESVQGLDLCV--------ELT 335
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
+P SFT + + P + NV + G + +G N IG + Q+
Sbjct: 336 GVDQPSFPSFTIEFDDGAVFQPEAENYFVDVAPNVRCLAMAGLASPLGGFNTIGNLLQQN 395
Query: 350 KMVIYDNEKQRIGWKPEDCNT 370
V YD E+ IG+ P C++
Sbjct: 396 FFVQYDREENLIGFAPAKCSS 416
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 97/366 (26%), Positives = 150/366 (40%), Gaps = 42/366 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC-TGCTKPPEKQYKPHKN----IVPCSN 71
+ + +G P K + DTGS LTW+QC +PC C + + P + V CS
Sbjct: 137 YVTRMGLGTPAKPYIMVVDTGSSLTWLQC-SPCRVSCHRQSGPVFDPKTSSSYAAVSCST 195
Query: 72 PRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
P+C L NP C +D C Y+ YGD S+G L D + F + SV N +
Sbjct: 196 PQCNDLSTATLNPAACSS-SDVCIYQASYGDSSFSVGYLSKDT--VSFGSNSVPN--FYY 250
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD 189
GCG Q N G +AG++GL R ++S++ QL + +C+ +
Sbjct: 251 GCG--QDNEGLFG--RSAGLMGLARNKLSLLYQLAP--TLGYSFSYCLPSSSSSGYLSIG 304
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASYAYF 244
P ++TPM+ ++ D Y + + + +GK S L I DSG
Sbjct: 305 SYNPGQ-YSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEYSSLPTIIDSGTVITRL 363
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP-LALSFTNRR 303
+ VY + + + GT K A L C+ +GQ + P ++++F+
Sbjct: 364 PTTVYDALSKAVAGAMKGT--KRADAYSILDTCF------VGQASSLRVPAVSMAFS--- 412
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
L + + LV CL A IIG Q V+YD + RIG+
Sbjct: 413 GGAALKLSAQNLLVDVDSSTTCLAFAPARSAA-----IIGNTQQQTFSVVYDVKSNRIGF 467
Query: 364 KPEDCN 369
C
Sbjct: 468 AAGGCT 473
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 102/370 (27%), Positives = 152/370 (41%), Gaps = 57/370 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V++ +G P K FDTGSDLTW +C A T + P K+ V CS P
Sbjct: 134 YIVSIGLGSPKKDLMLIFDTGSDLTWARCSAAET---------FDPTKSTSYANVSCSTP 184
Query: 73 RCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C+++ NP RC C Y I+YGDG SIG L + L + +FN FG
Sbjct: 185 LCSSVISATGNPSRCAAST--CVYGIQYGDGSYSIGFLGKE--RLTIGSTDIFN-NFYFG 239
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCIGQNGRGVLFLGD 189
CG Q G AG+LGLGR ++S+VSQ +Y + +C+ + FL
Sbjct: 240 CG--QDVDGLFGKA--AGLLGLGRDKLSVVSQTAPKY---NQLFSYCL-PSSSSTGFLSF 291
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYAYF 244
G S +TP+ +S Y L + G+ + I DSG
Sbjct: 292 GSSQSKSAKFTPL--SSGPSSFYNLDLTGITVGGQKLAIPLSVFSTAGTIIDSGTVVTRL 349
Query: 245 TSRVYQEIVSLIMRDL----IGTPLKLAPDDKTLPICWR-GPFKALGQVTEYFKPLALSF 299
Y + S + + +G PL + L C+ +K T + +SF
Sbjct: 350 PPAAYSALRSAFRKAMASYPMGKPLSI------LDTCYDFSKYK-----TIKVPKIVISF 398
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
+ V + V V +G K VCL + A + I G ++ V+YD
Sbjct: 399 S---GGVDVDVDQAGIFVANGLKQVCLAFAGNTGAR--DTAIFGNTQQRNFEVVYDVSGG 453
Query: 360 RIGWKPEDCN 369
++G+ P C+
Sbjct: 454 KVGFAPASCS 463
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 107/372 (28%), Positives = 152/372 (40%), Gaps = 50/372 (13%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAA 76
+TV K DTGSDLTWVQC PC C Y P + V C++ C
Sbjct: 140 VTVELGGKNMSLIVDTGSDLTWVQCQ-PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQD 198
Query: 77 LHWP--NPPRCKHPN----DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
L N C N C+Y + YGDG + G L ++ L G L FG
Sbjct: 199 LVAATGNSGPCGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVL----GDTKLENLVFG 254
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFL 187
CG N N G +G++GLGR +S+VSQ + V +C + G L
Sbjct: 255 CGRN--NKGLFG--GASGLMGLGRSSVSLVSQTLK--TFNGVFSYCLPSLEDGASGTLSF 308
Query: 188 GDG---KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCG---LKDLT----LIFDS 237
G+ S+ V +TP++QN YIL +G S G LK L+ ++ DS
Sbjct: 309 GNDFSVYKNSTSVFYTPLVQNPQLRSFYILN-----LTGASIGGVELKTLSFGRGILIDS 363
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
G +Y+ + + ++ G P AP L C+ L + P
Sbjct: 364 GTVITRLPPSIYKAVKTEFLKQFSGFP--SAPGYSILDTCFN-----LTSYEDISIPTIK 416
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNIIGEIFMQDKMVIYDN 356
+ + V Y V VCL + + S E EVG IIG +++ VIYD
Sbjct: 417 MIFEGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---IIGNYQQKNQRVIYDT 473
Query: 357 EKQRIGWKPEDC 368
++R+G E+C
Sbjct: 474 TQERLGIAGENC 485
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 146/365 (40%), Gaps = 38/365 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P ++ V C+ P
Sbjct: 178 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSSTYANVSCAAP 237
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L N C C Y ++YGDG SIG D L S ++ F G
Sbjct: 238 ACSDL---NIHGCS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 287
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVL-FLG 188
+ N G + AG+LGLGRG+ S+ V +YG V HC+ G G L F
Sbjct: 288 CGERNEGLFG--EAAGLLGLGRGKTSLPVQTYDKYG---GVFAHCLPARSTGTGYLDFGA 342
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAY 243
+S TPML ++ +YI G + G+ + I DSG
Sbjct: 343 GSPAAASARLTTPMLTDNGPTFYYI-GMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITR 401
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
Y + + K AP L C+ F + QV ++L F +
Sbjct: 402 LPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCY--DFTGMSQVA--IPTVSLLF---Q 454
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
RL V + + VCL + + G+ I+G ++ V YD K+ +G+
Sbjct: 455 GGARLDVDASGIMYAASASQVCLAF--AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 512
Query: 364 KPEDC 368
P C
Sbjct: 513 YPGVC 517
>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
Length = 367
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 81/170 (47%), Gaps = 17/170 (10%)
Query: 10 FFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN---- 65
P + V L +G PP F DT SDL W QC PCTGC + + P +
Sbjct: 82 IMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYA 140
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+PCS+ C L + RC H +D+ C Y Y ++ G L D + G
Sbjct: 141 ALPCSSDTCDEL---DVHRCGHDDDESCQYTYTYSGNATTEGTLAVD----KLVIGEDAF 193
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL--REYGLIRNV 172
+ FGC + P PP +GV+GLGRG +S+VSQL R YG+I ++
Sbjct: 194 RGVAFGCSTSSTGGAP--PPQASGVVGLGRGPLSLVSQLSVRRYGMIIDI 241
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 102/360 (28%), Positives = 147/360 (40%), Gaps = 46/360 (12%)
Query: 34 FDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALHWPNPPRCKHPN 89
DTGSDL W QC APC C P + K+ +PC + RCA+L + P C
Sbjct: 1 MDTGSDLIWTQC-APCLLCADQPTPYFDVKKSATYRALPCRSSRCASL---SSPSCFK-- 54
Query: 90 DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFGCGYNQHNPGPLSPPDTAG 148
C Y+ YGD S+ G L + F +N + V + FGCG N G L+ +++G
Sbjct: 55 KMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCG--SLNAGDLA--NSSG 110
Query: 149 VLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR---GVLFLGDGKVPSSG--VAWTPML 203
++G GRG +S+VSQL + + R GV SSG V TP +
Sbjct: 111 MVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFV 170
Query: 204 QNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDSGASYAYFTSRVYQEIV 253
N A Y L + K + L +I DSG S + Y+
Sbjct: 171 INPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEA-- 228
Query: 254 SLIMRDLIGT-PLKLAPD-DKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
+ R L+ PL D D L C++ P VT L F +S + +
Sbjct: 229 --VRRGLVSAIPLPAMNDTDIGLDTCFQWPPPP--NVTVTVPDLVFHF----DSANMTLL 280
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
PE Y++I+ G L A G IIG Q+ ++YD + + P C+ +
Sbjct: 281 PENYMLIASTT----GYLCLVMAPTGVGTIIGNYQQQNLHLLYDIGNSFLSFVPAPCDII 336
>gi|357168101|ref|XP_003581483.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 510
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 104/378 (27%), Positives = 152/378 (40%), Gaps = 43/378 (11%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
F ++A+ +TVG P F DTGSDL W+ C C GC P +P +
Sbjct: 100 FLHYAL-VTVGTPGHTFMVALDTGSDLFWLPCQ--CDGCPPPASGASGSASFYIPSMSST 156
Query: 74 CAALHWPNPPRCKHPND-----QCDYEIEYGDGG-SSIGALVTDLFPLRFSNG--SVFNV 125
A+ N C H D C Y++ Y SS G LV D+ L + +
Sbjct: 157 SQAVPC-NSDFCDHRKDCSTTSSCPYKMVYVSADTSSSGFLVEDVLYLSTEDNHPQILKA 215
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
+ FGCG Q L G+ GLG IS+ S L GL + C G++G G +
Sbjct: 216 QIMFGCGQVQ-TGSFLDAAAPNGLFGLGIDMISVPSILAHKGLTSDSFSMCFGRDGIGRI 274
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----IFDSGASY 241
GD SS TP+ N KH + +G + G + + L IFD+G ++
Sbjct: 275 SFGDQG--SSDQEETPLDINQ---KHPTYA---ITITGITVGTEPMDLEFSTIFDTGTTF 326
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK---ALGQVTEYFKPLALS 298
Y Y I + + A D R PF+ L + +S
Sbjct: 327 TYLADPAYTYITQSFHTQVRAN--RHAADT-------RIPFEYCYDLSSSEARIQTPGVS 377
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNE 357
F S+ V+ + I + V CL I+ ++ NIIG+ FM V++D E
Sbjct: 378 FRTVGGSLFPVIDLGQVISIQQHEYVYCLAIVKSTKL-----NIIGQNFMTGVRVVFDRE 432
Query: 358 KQRIGWKPEDCNTLLSLN 375
++ +GWK +C S N
Sbjct: 433 RKILGWKKFNCYDTDSTN 450
>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 544
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 109/391 (27%), Positives = 161/391 (41%), Gaps = 56/391 (14%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ---------YKPHK 64
F +FA N++VG PP F DTGSDL W+ C+ CT C + + Q Y+ K
Sbjct: 111 FLHFA-NVSVGTPPLWFLVALDTGSDLFWLPCN--CTSCVRGLKTQNGKVIDLNIYELDK 167
Query: 65 NI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSN 119
+ VPC++ C +C C YE+EY + SS G LV D+ L N
Sbjct: 168 SSTRKNVPCNSNMCKQT------QCHSSGSSCRYEVEYLSNDTSSSGFLVEDVLHLITDN 221
Query: 120 GSV--FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
+ +T GCG Q L+ G+ GLG +S+ S L + GLI + C
Sbjct: 222 DQTKDIDTQITIGCGQVQTGVF-LNGAAPNGLFGLGMENVSVPSILAQKGLISDSFSMCF 280
Query: 178 GQNGRGVLFLGDGKVPSSGVAWTPM-LQNSADLKHYILGPAELLYSGKSCGLKDLTLIFD 236
G +G G + GD SS TP L+ S Y + +++ G + + IFD
Sbjct: 281 GSDGSGRITFGD--TGSSDQGKTPFNLRESHPT--YNVTITQIIVGGYAAD-HEFHAIFD 335
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLK--LAPDDKTLPICWRGPFKALGQVTEYFKP 294
SG S+ Y Y ++S L+ L+PD LP + + F
Sbjct: 336 SGTSFTYLNDPAYT-LISEKFNSLVKANRHSPLSPDSD-LPFEYCYDMSPDQTIEVPFLN 393
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGI-----LN--GSEAEVGENNI------ 341
L + + +VP + + G +CLGI LN G E E +
Sbjct: 394 LTMKGGDDYYVTDPIVPVSSE--VEGNL-LCLGIQKSDNLNIIGREYTTEEEFLHLKHMI 450
Query: 342 ----IGEIFMQDKMVIYDNEKQRIGWKPEDC 368
I + FM +++D E +GWK +C
Sbjct: 451 IKFFIQKNFMTGYRIVFDRENMNLGWKESNC 481
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 107/390 (27%), Positives = 167/390 (42%), Gaps = 56/390 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +++ +G PPK + DTGSDL W+QC PC C + Y P ++ + C +P
Sbjct: 192 YFMDVFIGTPPKHYSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKESSSFENITCHDP 250
Query: 73 RCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS--NGS-----VFN 124
RC + P+PP+ CK N C Y YGD ++ G + F + + NG V N
Sbjct: 251 RCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKHVEN 310
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRG 183
V FGCG+ N G AG+LGLGRG +S SQL+ YG + +C+
Sbjct: 311 V--MFGCGH--WNRGLFH--GAAGLLGLGRGPLSFASQLQSIYG---HSFSYCLVDRNSD 361
Query: 184 V-----LFLGDGK--VPSSGVAWTPML---QNSADLKHYILGPAELLYSGKSCGLKDLT- 232
L G+ K + + +T + +NS D +Y+ G ++ G+ + + T
Sbjct: 362 TSVSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYV-GIKSIMVDGEVLKIPEETW 420
Query: 233 ---------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK 283
I DSG + YF Y+ I M+ + G +L L C+
Sbjct: 421 HLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKG--YELVEGFPPLKPCYN---- 474
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIG 343
G + F+ + P E Y + VCL IL ++ + +IIG
Sbjct: 475 VSGIEKMELPDFGILFS---DGAMWDFPVENYFIQIEPDLVCLAILGTPKSAL---SIIG 528
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDCNTLLS 373
Q+ ++YD +K R+G+ P C S
Sbjct: 529 NYQQQNFHILYDMKKSRLGYAPMKCTATTS 558
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 98/375 (26%), Positives = 155/375 (41%), Gaps = 52/375 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ ++ +G PP DT +D W QC+ PC C + P K+ +PCS+P
Sbjct: 89 YIISFLIGTPPFQLYGVMDTANDNIWFQCN-PCKPCFNTTSPMFDPSKSSTYKTIPCSSP 147
Query: 73 RCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF-- 129
+C + C + + C+Y YG S G L D L +N + P++F
Sbjct: 148 KCKNVE---NTHCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNN----DTPISFKN 200
Query: 130 ---GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNG 181
GCG+ N GPL +G +GLGRG +S +SQL I +C+ +
Sbjct: 201 IVIGCGH--RNKGPLEGY-VSGNIGLGRGPLSFISQLNSS--IGGKFSYCLVPLFSNEGI 255
Query: 182 RGVLFLGDGKVPSS-GVAWTPMLQN----SADLKHYILGPAELLYSGKSCGLKDL-TLIF 235
G L GD V S G TP+ S L +G + + + +L I
Sbjct: 256 SGKLHFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFENSTSKNDNLGNTII 315
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQ--VTEYFK 293
DSG + VY + S I+ ++ +P+ + +C++ K L +T +F
Sbjct: 316 DSGTTLTILPENVYSRLES-IVTSMVKLERAKSPNQQ-FKLCYKATLKNLDVPIITAHFN 373
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVI 353
+ NS+ P + +V V +G G+ IIG I Q+ +V
Sbjct: 374 GADVHL----NSLNTFYPIDHEVVCFAF--VSVGNFPGT--------IIGNIAQQNFLVG 419
Query: 354 YDNEKQRIGWKPEDC 368
+D +K I +KP DC
Sbjct: 420 FDLQKNIISFKPTDC 434
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 101/375 (26%), Positives = 147/375 (39%), Gaps = 46/375 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC--------TKPPEKQYKPHKNIVP 68
+ V + VG P K F DTGS L+W+QC C T K YK
Sbjct: 107 YYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKALSCSS- 165
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
S N P C + C Y+ YGD SIG L D+ L S +
Sbjct: 166 -SQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSAAP--SSGFV 222
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI--------GQ 179
+GCG Q N G +AG++GL ++S++ QL +YG N +C+
Sbjct: 223 YGCG--QDNQGLFG--RSAGIIGLANDKLSMLGQLSNKYG---NAFSYCLPSSFSAQPNS 275
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIF 235
+ G L +G + SS +TP+++N Y LG + +GK G+ ++ I
Sbjct: 276 SVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVPTII 335
Query: 236 DSGASYAYFTSRVYQEI-VSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG +Y + S +M ++ AP L C++G K + V E
Sbjct: 336 DSGTVITRLPVAIYNALKKSFVM--IMSKKYAQAPGFSILDTCFKGSVKEMSTVPE---- 389
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
+ + F R L + LV + CL I A +IIG Q V Y
Sbjct: 390 IRIIF---RGGAGLELKVHNSLVEIEKGTTCLAI----AASSNPISIIGNYQQQTFTVAY 442
Query: 355 DNEKQRIGWKPEDCN 369
D +IG+ P C
Sbjct: 443 DVANSKIGFAPGGCQ 457
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 106/376 (28%), Positives = 156/376 (41%), Gaps = 54/376 (14%)
Query: 22 TVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAAL 77
TVG DT S+LTWVQC PC C + + P + VPC++ C AL
Sbjct: 123 TVGLGAAEATVVVDTASELTWVQCQ-PCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDAL 181
Query: 78 H---WPNPPRCKHPNDQ---CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C N+Q C Y + Y DG S G L D LR + + FGC
Sbjct: 182 RVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARD--KLRLAGQDIEG--FVFGC 237
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCI---GQNGRGVLFL 187
G + P T+G++GLGR +S+VSQ + ++G V +C+ G L L
Sbjct: 238 GTSNQG-APFG--GTSGLMGLGRSHVSLVSQTMDQFG---GVFSYCLPMRESGSSGSLVL 291
Query: 188 GDGKVP---SSGVAWTPMLQNSADLKHYILGPAELL-YSGKSCGLKDLT--------LIF 235
GD S+ + +T M+ +S L+ GP L +G + G +++ +I
Sbjct: 292 GDDSSAYRNSTPIVYTAMVSDSGPLQ----GPFYFLNLTGITVGGQEVESPWFSAGRVII 347
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG VY + + + L P AP L C+ L + E P
Sbjct: 348 DSGTIITTLVPSVYNAVRAEFLSQLAEYP--QAPAFSILDTCFN-----LTGLKEVQVP- 399
Query: 296 ALSFTNRRNSVRLVVPPEA--YLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVI 353
+L F SV + V + Y V S VCL + S + +IIG ++ VI
Sbjct: 400 SLKFV-FEGSVEVEVDSKGVLYFVSSDASQVCLAL--ASLKSEYDTSIIGNYQQKNLRVI 456
Query: 354 YDNEKQRIGWKPEDCN 369
+D +IG+ E C+
Sbjct: 457 FDTLGSQIGFAQETCD 472
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 101/383 (26%), Positives = 163/383 (42%), Gaps = 49/383 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH-----KNIVPCSN 71
+ +++ VG PPK F DTGSDL W+QC PC C Y P KNI C++
Sbjct: 160 YFMDVLVGTPPKHFSLILDTGSDLNWLQC-LPCYDCFHQNGMFYDPKTSASFKNIT-CND 217
Query: 72 PRCAALHWPNPP-RCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN----GSVFNV- 125
PRC+ + P+PP +C+ N C Y YGD ++ G + F + + S + V
Sbjct: 218 PRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVG 277
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQN 180
+ FGCG+ N G S LG G S SQL+ L + +C+ N
Sbjct: 278 NMMFGCGH--WNRGLFSGASGLLGLGRGPLSFS--SQLQ--SLYGHSFSYCLVDRNSNTN 331
Query: 181 GRGVLFLGDGK--VPSSGVAWTPML---QNSADLKHYILGPAELLYSGKSCGLKDLT--- 232
L G+ K + + + +T + +NS + +YI + +L GK+ + + T
Sbjct: 332 VSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKS-ILVGGKALDIPEETWNI 390
Query: 233 -------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
I DSG + +YF Y EI+ + + + D L C+ +
Sbjct: 391 SSDGDGGTIIDSGTTLSYFAEPAY-EIIKNKFAEKMKENYPIFRDFPVLDPCFN--VSGI 447
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEI 345
+ + L ++F + P E + VCL IL ++ +IIG
Sbjct: 448 EENNIHLPELGIAFV---DGTVWNFPAENSFIWLSEDLVCLAILGTPKSTF---SIIGNY 501
Query: 346 FMQDKMVIYDNEKQRIGWKPEDC 368
Q+ ++YD ++ R+G+ P C
Sbjct: 502 QQQNFHILYDTKRSRLGFTPTKC 524
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 106/371 (28%), Positives = 156/371 (42%), Gaps = 50/371 (13%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAA 76
+T+G + DTGSDLTWVQC+ PC C +KP + + C++ C +
Sbjct: 124 VTMGLGSQNMSVIVDTGSDLTWVQCE-PCRSCYNQNGPLFKPSTSPSYQPILCNSTTCQS 182
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
L + CDY + YGDG + G L + L F SV N FGCG N
Sbjct: 183 LELGACGSDPSTSATCDYVVNYGDGSYTSGEL--GIEKLGFGGISVSN--FVFGCGRN-- 236
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNG-RGVLFLGDGKV 192
N G +G++GLGR +S++SQ V +C+ Q G G L +G+
Sbjct: 237 NKGLFG--GASGLMGLGRSELSMISQTN--ATFGGVFSYCLPSTDQAGASGSLVMGN--- 289
Query: 193 PSSGV-------AWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-----LIFDSGAS 240
SGV A+T ML N YIL + G S ++ + +I DSG
Sbjct: 290 -QSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQASSFGNGGVILDSGTV 348
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
+ VY+ + + + G P AP L C F G +++ F
Sbjct: 349 ISRLAPSVYKALKAKFLEQFSGFP--SAPGFSILDTC----FNLTGYDQVNIPTISMYF- 401
Query: 301 NRRNSVRLVVPPEA--YLVISGRKNVCLGILNGS-EAEVGENNIIGEIFMQDKMVIYDNE 357
+ L V YLV VCL + + S E E+G IIG +++ V+YD +
Sbjct: 402 --EGNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMG---IIGNYQQRNQRVLYDAK 456
Query: 358 KQRIGWKPEDC 368
++G+ E C
Sbjct: 457 LSQVGFAKEPC 467
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 105/381 (27%), Positives = 158/381 (41%), Gaps = 62/381 (16%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI--------VPCSNPR- 73
+G PP+ + DTGSDL W QC C K KQ P+ N+ VPC++
Sbjct: 92 IGSPPQRTEALIDTGSDLIWTQCATTCL--PKSCAKQGLPYYNLSQSSTFVPVPCADKAG 149
Query: 74 -CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC- 131
CAA N + C + YG G IG+L T+ F F +G+ L FGC
Sbjct: 150 FCAA----NGVHLCGLDGSCTFIASYG-AGRVIGSLGTESFA--FESGT---TSLAFGCV 199
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
+ G L+ D +G++GLGRGR+S+VSQ+ + + LF+G
Sbjct: 200 SLTRITSGALN--DASGLIGLGRGRLSLVSQIGATRFSYCLTPYFHSSGASSHLFVGASA 257
Query: 192 VPSSGVAWTPMLQNSADLKH---YILGPAELLYSGK---------SCGLKDL-------T 232
G A P +++ D + Y L P E + GK + L+ L
Sbjct: 258 SLGGGGASMPFVKSPKDYPYSTFYYL-PLEGITVGKTRLPAVNSTTFQLRQLFKGYWAGG 316
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
+I D+G+ S Y+ + + L L AP+D L +C E F
Sbjct: 317 VIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLELCV---------AREGF 367
Query: 293 KPL--ALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDK 350
+ + AL F + + VP +Y + C+ IL G G ++IIG QD
Sbjct: 368 QKVVPALVF-HFGGGADMAVPAASYWAPVDKAAACMMILEG-----GYDSIIGNFQQQDM 421
Query: 351 MVIYDNEKQRIGWKPEDCNTL 371
++YD + R ++ DC L
Sbjct: 422 HLLYDLRRGRFSFQTADCTML 442
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 102/384 (26%), Positives = 157/384 (40%), Gaps = 52/384 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKP-PEKQYKPHKNIVPCSNPR 73
V+LTVG PP+ DTGS+L+W+ C T P Y P +PCS+P
Sbjct: 40 LTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPLSSSSYSP----IPCSSPV 95
Query: 74 C--AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C PNP C P C + Y D S G L +D F GS FGC
Sbjct: 96 CRTRTRDLPNPVTCD-PKKLCHAIVSYADASSLEGNLASD----NFRIGSSALPGTLFGC 150
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDG 190
+ + T G++G+ RG +S V+QL GL + +CI G++ GVL GD
Sbjct: 151 MDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQL---GLPK--FSYCISGRDSSGVLLFGDS 205
Query: 191 KVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------------I 234
+ G + +TP++Q S L ++ + G G K L L +
Sbjct: 206 HLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTM 265
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD----KTLPICWRGPFKALGQVTE 290
DSG + + VY + + + G L + + +C+R P A G++ E
Sbjct: 266 VDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVP--AGGKLPE 323
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYL-----VISGRKNVCLGILNGSEAEVGENNIIGEI 345
++L F +VV E L ++ G++ V S+ E +IG
Sbjct: 324 -LPAVSLMF----RGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNSDLLGIEAFVIGHH 378
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCN 369
Q+ + +D K R+G+ C+
Sbjct: 379 HQQNVWMEFDLVKSRVGFVETRCD 402
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 160/387 (41%), Gaps = 52/387 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT-KPPEKQYKPHKNI----VPCS 70
YF V++ +G PP+ DTGSDL WV+C A C C+ PP + P + C
Sbjct: 88 YF-VDIRLGTPPQSLLLVADTGSDLVWVKCSA-CRNCSHHPPSSAFLPRHSSSFSPFHCF 145
Query: 71 NPRCAALHWPNPPR--CKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
+P C L P+ P C H + C + Y DG S G + L+ +GS ++
Sbjct: 146 DPHCRLL--PHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHLK 203
Query: 127 -LTFGCGYNQHNPGPLSPP--DTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQNG- 181
L+FGCG+ P GV+GLGRG IS SQL R +G N +C+
Sbjct: 204 GLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFG---NKFSYCLMDYTL 260
Query: 182 ----RGVLFLGDG--KVP---SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT 232
L +G G +P ++ +++TP+ N Y + + G +
Sbjct: 261 SPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAV 320
Query: 233 ----------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPF 282
+ DSG + Y T Y+E++ + R +KL P+ L +
Sbjct: 321 WEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRR-----VKL-PNAAELTPGFDLCV 374
Query: 283 KALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGEN-NI 341
A G+ P L F +V PP Y + + +CL I E G ++
Sbjct: 375 NASGESRRPSLP-RLRFRLGGGAV-FAPPPRNYFLETEEGVMCLAI---RAVESGNGFSV 429
Query: 342 IGEIFMQDKMVIYDNEKQRIGWKPEDC 368
IG + Q ++ +D E+ R+G+ C
Sbjct: 430 IGNLMQQGFLLEFDKEESRLGFTRRGC 456
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 106/407 (26%), Positives = 168/407 (41%), Gaps = 75/407 (18%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA--PCTGCTKP-----PEKQYKPHKN- 65
+ ++V+L G PP+ F FDTGS L W C A C+ C+ P ++ P +
Sbjct: 129 YGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSS 188
Query: 66 ---IVPCSNPRCAALHWPN-PPRCKHPN-------DQC-DYEIEYGDGGSSIGALVTDLF 113
+V C NP+CA + PN RC++ N D C Y ++YG G ++ G L+++
Sbjct: 189 SVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGATA-GILLSETL 247
Query: 114 PLRFSNGSVFNVPLTFGCG-YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNV 172
L F V GC + H P AG+ G GRG S+ SQ+R L R
Sbjct: 248 DLENKRVPDFLV----GCSVMSVHQP--------AGIAGFGRGPESLPSQMR---LKR-- 290
Query: 173 IGHCIGQNG------RGVLFLGDGKVPSSGVAWT---------PMLQNSADLKHYILGPA 217
HC+ G L L G + P + N+A ++Y L
Sbjct: 291 FSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLR 350
Query: 218 ELLYSGKSCGLKDLTL----------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTP-LK 266
+L GK L I DSG+++ + +++ I + + L+ P K
Sbjct: 351 RILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAK 410
Query: 267 LAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYL-VISGRKNVC 325
L C+ P + + + F + L F + +L + E YL +++ VC
Sbjct: 411 DVEAQSGLRPCFNIPKE---EESAEFPDVVLKF---KGGGKLSLAAENYLAMVTDEGVVC 464
Query: 326 LGILNGSEAEVGENN---IIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
L ++ G I+G Q+ +V YD KQRIG++ + C
Sbjct: 465 LTMMTDEAVVGGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKCT 511
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 165/386 (42%), Gaps = 56/386 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +++ +G PPK F DTGSDL W+QC PC C + Y P +I + C++P
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQC-VPCFDCFEQNGPYYDPKDSISFRNITCNDP 254
Query: 73 RCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS--------VF 123
RC + P+PPR CK C Y YGD ++ G + F + ++ + V
Sbjct: 255 RCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVE 314
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
NV FGCG+ N G AG+LGLGRG +S SQL+ L + +C+
Sbjct: 315 NV--MFGCGH--WNRGLFH--GAAGLLGLGRGPLSFSSQLQ--SLYGHSFSYCLVDRDSD 366
Query: 184 V-----LFLGDGK--VPSSGVAWTPML---QNSADLKHYILGPAELLYSGKSCGLKDLT- 232
L G+ K + + +T ++ +N D +Y L + G+ + +
Sbjct: 367 TSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYY-LQIKSIFVGGEKLQIPEENW 425
Query: 233 ---------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK 283
I DSG + +YF+ Y+ I +R + G KL D L C +
Sbjct: 426 NLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKG--YKLVEDFPILHPC----YN 479
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNII 342
G F + F + P E Y + I VCL +L ++ + +II
Sbjct: 480 VSGTDELNFPEFLIQFA---DGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSAL---SII 533
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDC 368
G Q+ ++YD + R+G+ P C
Sbjct: 534 GNYQQQNFHILYDTKNSRLGYAPMRC 559
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 94/361 (26%), Positives = 159/361 (44%), Gaps = 32/361 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +++++G PP + DTGSDLTW QC PC C + + P K+ VPC+
Sbjct: 92 YLMSVSIGTPPVDYLGIADTGSDLTWAQC-LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQ 150
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C H + C CDY YGD S G DL + + GS +V GCG
Sbjct: 151 TC---HAVDDGHCG-VQGVCDYSYTYGDRTYSKG----DLGFEKITIGSS-SVKSVIGCG 201
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG---QNGRGVLFLGD 189
+ + +GV+GLG G++S+VSQ+ + I +C+ + G + G+
Sbjct: 202 HASSGGFGFA----SGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGE 257
Query: 190 GKVPSS-GVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-KDLTLIFDSGASYAYFTSR 247
V S GV TP++ + +YI A + + + K +I DSG +
Sbjct: 258 NAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQGNVIIDSGTTLTILPKE 317
Query: 248 VYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVR 307
+Y +VS +++ + +K +L +C+ A + P+ + + +V
Sbjct: 318 LYDGVVSSLLKVVKAKRVK--DPHGSLDLCFDDGINAAASLG---IPVITAHFSGGANVN 372
Query: 308 LVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPED 367
L +P + ++ N CL + S E IIG + + ++ YD E +R+ +KP
Sbjct: 373 L-LPINTFRKVADNVN-CLTLKAASPTT--EFGIIGNLAQANFLIGYDLEAKRLSFKPTV 428
Query: 368 C 368
C
Sbjct: 429 C 429
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 157/384 (40%), Gaps = 44/384 (11%)
Query: 1 MYVSWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQY 60
+ V W ++ +YF +L +G P + DTGSD +W+QC PC C + E +
Sbjct: 121 LQVGWGKYL--DTTNYF-TSLRLGTPATDLLVELDTGSDQSWIQCK-PCPDCYEQHEALF 176
Query: 61 KPHKNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR 116
P K+ + CS+ C L + C + +C YEI Y D ++G L D L
Sbjct: 177 DPSKSSTYSDITCSSRECQELGSSHKHNCSS-DKKCPYEITYADDSYTVGNLARDTLTLS 235
Query: 117 FSNGSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIG 174
++ VP FGCG+N N G D G+LGLGRG+ S+ SQ+ YG
Sbjct: 236 PTDA----VPGFVFGCGHN--NAGSFGEID--GLLGLGRGKASLSSQVAARYGA---GFS 284
Query: 175 HCI--GQNGRGVL-FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL--- 228
+C+ + G L F G + +T M+ Y L + +G++ +
Sbjct: 285 YCLPSSPSATGYLSFSGAAAAAPTNAQFTEMVAGQ-HPSFYYLNLTGITVAGRAIKVPPS 343
Query: 229 ---KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
I DSG +++ Y + S + R +G K AP C +
Sbjct: 344 VFATAAGTIIDSGTAFSCLPPSAYAALRSSV-RSAMGR-YKRAPSSTIFDTC----YDLT 397
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGE 344
G T +AL F + + + P S CL L N + +G ++G
Sbjct: 398 GHETVRIPSVALVFAD--GATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLG---VLGN 452
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDC 368
+ VIYD + Q++G+ C
Sbjct: 453 TQQRTLAVIYDVDNQKVGFGANGC 476
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 95/364 (26%), Positives = 142/364 (39%), Gaps = 39/364 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C K + P K+ V C++
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANVSCTDS 222
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
CA L + C C Y ++YGDG ++G D + F FGCG
Sbjct: 223 ACADL---DTNGCT--GGHCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFR----FGCG 273
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFLGDG 190
+ N G TAG++GLGRG+ S+ Q Y +C+ G G L G G
Sbjct: 274 --EKNNGLFG--KTAGLMGLGRGKTSLTVQ--AYNKYGGAFAYCLPALTTGTGYLDFGPG 327
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAYFT 245
+ TPML + +Y+ G + G+ + + + DSG
Sbjct: 328 SA-GNNARLTPMLTDKGQTFYYV-GMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLP 385
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
+ Y + S + ++ K AP L C+ F L V ++L F +
Sbjct: 386 ATAYTALSSAFDKVMLARGYKKAPGYSILDTCYD--FTGLSDVE--LPTVSLVF---QGG 438
Query: 306 VRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
L V + VCL NG + V I+G + V+YD K+ +G+
Sbjct: 439 ACLDVDVSGIVYAISEAQVCLAFASNGDDESVA---IVGNTQQKTYGVLYDLGKKTVGFA 495
Query: 365 PEDC 368
P C
Sbjct: 496 PGSC 499
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 103/368 (27%), Positives = 147/368 (39%), Gaps = 43/368 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G PP F FDTGSD TWVQC C K ++ + P K+ V C++P
Sbjct: 163 YVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTYANVSCADP 222
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
CA L + C C Y I+YGDG ++G D + F FGCG
Sbjct: 223 ACADL---DASGCN--AGHCLYGIQYGDGSYTVGFFAKDTLAVAQDAIKGFK----FGCG 273
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRGVLFL---- 187
+ N G TAG+LGLGRG SI Q E YG +C+ + +L
Sbjct: 274 --EKNRGLFG--QTAGLLGLGRGPTSITVQAYEKYG---GSFSYCLPASSAATGYLEFGP 326
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCG------LKDLTLIFDSGASY 241
S TPML + +Y+ G + GK G + + DSG
Sbjct: 327 LSPSSSGSNAKTTPMLTDKGPTFYYV-GLTGIRVGGKQLGAIPESVFSNSGTLVDSGTVI 385
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
Y + S + + K A L C+ F L QV+ ++L F
Sbjct: 386 TRLPDTAYAALSSAFAAAMAASGYKKAAAYSILDTCYD--FTGLSQVS--LPTVSLVF-- 439
Query: 302 RRNSVRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
+ L + + + VCLG NG + VG I+G + V+YD K+
Sbjct: 440 -QGGACLDLDASGIVYAISQSQVCLGFASNGDDESVG---IVGNTQQRTYGVLYDVSKKV 495
Query: 361 IGWKPEDC 368
+G+ P C
Sbjct: 496 VGFAPGAC 503
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 165/386 (42%), Gaps = 56/386 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +++ +G PPK F DTGSDL W+QC PC C + Y P +I + C++P
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQC-VPCFDCFEQNGPYYDPKDSISFRNITCNDP 254
Query: 73 RCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS--------VF 123
RC + P+PPR CK C Y YGD ++ G + F + ++ + V
Sbjct: 255 RCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVE 314
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
NV FGCG+ N G AG+LGLGRG +S SQL+ L + +C+
Sbjct: 315 NV--MFGCGH--WNRGLFH--GAAGLLGLGRGPLSFSSQLQ--SLYGHSFSYCLVDRDSD 366
Query: 184 V-----LFLGDGK--VPSSGVAWTPML---QNSADLKHYILGPAELLYSGKSCGLKDLT- 232
L G+ K + + +T ++ +N D +Y L + G+ + +
Sbjct: 367 TSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYY-LQIKSIFVGGEKLQIPEENW 425
Query: 233 ---------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK 283
I DSG + +YF+ Y+ I +R + G KL D L C +
Sbjct: 426 NLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKG--YKLVEDFPILHPC----YN 479
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNII 342
G F + F + P E Y + I VCL +L ++ + +II
Sbjct: 480 VSGTDELNFPEFLIQFA---DGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSAL---SII 533
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDC 368
G Q+ ++YD + R+G+ P C
Sbjct: 534 GNYQQQNFHILYDTKNSRLGYAPMRC 559
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 112/390 (28%), Positives = 167/390 (42%), Gaps = 65/390 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH-----KNIVPCSN 71
+ +++ VG PPK F DTGSDL W+QC PC C + Y P KNI C +
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQC-VPCYACFEQNGPYYDPKDSSSFKNIT-CHD 252
Query: 72 PRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-------VF 123
PRC + P+PP+ CK C Y YGD ++ G + F + + V
Sbjct: 253 PRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKIVE 312
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----G 178
NV FGCG+ N G AG+LGLGRG +S +QL+ L + +C+
Sbjct: 313 NV--MFGCGH--WNRGLFH--GAAGLLGLGRGPLSFATQLQ--SLYGHSFSYCLVDRNSN 364
Query: 179 QNGRGVLFLGDGK--VPSSGVAWTPML---QNSADLKHYILGPAELLYSGKSCGLKDLT- 232
+ L G+ K + + +T + +N D +Y+L + ++ G+ + + T
Sbjct: 365 SSVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKS-IMVGGEVLKIPEETW 423
Query: 233 ---------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK 283
I DSG + YF Y+ I MR + G PL +T P P K
Sbjct: 424 HLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLV-----ETFP-----PLK 473
Query: 284 ALGQVTEYFK----PLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGE 338
V+ K A+ F + P E Y + I VCL IL + +
Sbjct: 474 PCYNVSGVEKMELPEFAILFA---DGAMWDFPVENYFIQIEPEDVVCLAILGTPRSAL-- 528
Query: 339 NNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+IIG Q+ ++YD +K R+G+ P C
Sbjct: 529 -SIIGNYQQQNFHILYDLKKSRLGYAPMKC 557
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 97/382 (25%), Positives = 155/382 (40%), Gaps = 44/382 (11%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDA-PCTGCTKPPEKQYKPHKNI----VPCSNPRCAAL 77
+G PP+ DTGS+L W QC GC Y P ++ V C++ C
Sbjct: 90 IGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVACNDTACL-- 147
Query: 78 HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC-GYNQH 136
+ RC C YG G+ G L T++F S NV L FGC ++
Sbjct: 148 -LGSETRCARDGKACAVLTAYG-AGAIGGFLGTEVFTFGHGQSSENNVSLAFGCITASRL 205
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV---- 192
PG L +G++GLGRG++S+ SQL + + + LF+G
Sbjct: 206 TPGSLD--GASGIIGLGRGKLSLPSQLGDNKFSYCLTPYFSDAANTSTLFVGASAGLSGG 263
Query: 193 --PSSGVAWTPMLQNSAD----------LKHYILGPAELLYSGKSCGLKDLT------LI 234
P++ V P L+N D L +G A+L + L+++ +
Sbjct: 264 GAPATSV---PFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVAPAKWGGTL 320
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG+ + YQ + ++R L + + + L +C G A G + P
Sbjct: 321 IDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCVGG--VAPGDAGKLVPP 378
Query: 295 LALSFTNRRNSVR-LVVPPEAYLVISGRKNVCLGILNG----SEAEVGENNIIGEIFMQD 349
L L F + +VVPPE Y C+ + + S + E IIG QD
Sbjct: 379 LVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNETTIIGNYMQQD 438
Query: 350 KMVIYDNEKQRIGWKPEDCNTL 371
++YD + + ++P DC+++
Sbjct: 439 MHLLYDLGQGVLSFQPADCSSV 460
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 99/382 (25%), Positives = 156/382 (40%), Gaps = 49/382 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ +++ VG PPK F DTGSDL W+QC PC C + Y P +KNI C++
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQC-LPCYDCFQQNGAFYDPKASASYKNIT-CND 227
Query: 72 PRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS----NGSVFNVP 126
RC + P+PP CK N C Y YGD ++ G + F + + + ++NV
Sbjct: 228 QRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVE 287
Query: 127 -LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQN 180
+ FGCG+ N G LG G S SQL+ L + +C+ N
Sbjct: 288 NMMFGCGH--WNRGLFHGAAGLLGLGRGPLSFS--SQLQ--SLYGHSFSYCLVDRNSDTN 341
Query: 181 GRGVLFLGDGK--VPSSGVAWTPMLQNSADL--KHYILGPAELLYSGKSCGLKDLT---- 232
L G+ K + + +T + +L Y + +L +G+ + + T
Sbjct: 342 VSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNIS 401
Query: 233 ------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
I DSG + +YF Y+ I + I G P + PI F G
Sbjct: 402 SDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGK----YPVYRDFPIL-DPCFNVSG 456
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIF 346
L ++F + P E + VCL +L ++ +IIG
Sbjct: 457 IHNVQLPELGIAFA---DGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAF---SIIGNYQ 510
Query: 347 MQDKMVIYDNEKQRIGWKPEDC 368
Q+ ++YD ++ R+G+ P C
Sbjct: 511 QQNFHILYDTKRSRLGYAPTKC 532
>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 106/380 (27%), Positives = 153/380 (40%), Gaps = 46/380 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-------YKPH--- 63
F ++A+ +TVG P F DTGSDL W+ C C GCT PP Y P
Sbjct: 96 FLHYAL-VTVGTPGHTFMVALDTGSDLFWLPCQ--CDGCTPPPSSAASAPASFYIPSLSS 152
Query: 64 -KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSNG- 120
VPC++ C C C Y++ Y SS G LV D+ L +
Sbjct: 153 TSQAVPCNSDFCGLRK-----ECSK-TSSCPYKMVYVSADTSSSGFLVEDVLYLSTEDTH 206
Query: 121 -SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
+ FGCG Q L G+ GLG IS+ S L + GL N C G+
Sbjct: 207 PQFLKAQIMFGCGEVQ-TGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCFGR 265
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
+G G + GD SS TP+ N + I + G + +++ IFD+G
Sbjct: 266 DGIGRISFGDQG--SSDQEETPLDINQKHPTYAITITG--IAVGNNLMDLEVSTIFDTGT 321
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK---ALGQVTEYFKPLA 296
S+ Y Y I + + A D R PF+ L + +
Sbjct: 322 SFTYLADPAYTYITDGFHSQVQAN--RHAADS-------RIPFEYCYDLSSSEARIQTPS 372
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
+S S+ + P + I + V CL I+ ++ NIIG+ FM V++D
Sbjct: 373 ISLRTVGGSLFPAIDPGQVISIQQHEYVYCLAIVKSTKL-----NIIGQNFMTGVRVVFD 427
Query: 356 NEKQRIGWKPEDCNTLLSLN 375
E++ +GWK +C SLN
Sbjct: 428 RERKILGWKKFNCYDTDSLN 447
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 106/373 (28%), Positives = 161/373 (43%), Gaps = 39/373 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V++ +G PP+ F DTGSDL W+QC APC C + + P +I V C +
Sbjct: 149 YLVDVYLGTPPRRFRMIMDTGSDLNWLQC-APCLDCFEQSGPIFDPAASISYRNVTCGDD 207
Query: 73 RCAALHWP---NPPRCKHP-NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-L 127
RC + P P C+ P +D C Y YGD ++ G L + F + + V +
Sbjct: 208 RCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGV 267
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRGV-- 184
FGCG+ N G AG+LGLGRG +S SQLR YG + +C+ ++G
Sbjct: 268 AFGCGHR--NRGLFH--GAAGLLGLGRGPLSFASQLRGVYG--GHAFSYCLVEHGSAAGS 321
Query: 185 -LFLG--DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFD 236
+ G D + + +T + Y L +L G++ + TL I D
Sbjct: 322 KIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLSAGGTIID 381
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
SG + +YF YQ I + D + L L C+ +V E L+
Sbjct: 382 SGTTLSYFPEPAYQAIRQAFI-DRMSPSYPLILGFPVLSPCYNVSGAEKVEVPE----LS 436
Query: 297 LSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
L F + P E Y + + +CL +L + + +IIG Q+ V+YD
Sbjct: 437 LVFA---DGAAWEFPAENYFIRLEPEGIMCLAVLGTPRSGM---SIIGNYQQQNFHVLYD 490
Query: 356 NEKQRIGWKPEDC 368
E R+G+ P C
Sbjct: 491 LEHNRLGFAPRRC 503
>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 106/380 (27%), Positives = 153/380 (40%), Gaps = 46/380 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-------YKPH--- 63
F ++A+ +TVG P F DTGSDL W+ C C GCT PP Y P
Sbjct: 96 FLHYAL-VTVGTPGHTFMVALDTGSDLFWLPCQ--CDGCTPPPSSAASAPASFYIPSLSS 152
Query: 64 -KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSNG- 120
VPC++ C C C Y++ Y SS G LV D+ L +
Sbjct: 153 TSQAVPCNSDFCGLRK-----ECSK-TSSCPYKMVYVSADTSSSGFLVEDVLYLSTEDTH 206
Query: 121 -SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
+ FGCG Q L G+ GLG IS+ S L + GL N C G+
Sbjct: 207 PQFLKAQIMFGCGEVQ-TGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCFGR 265
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
+G G + GD SS TP+ N + I + G + +++ IFD+G
Sbjct: 266 DGIGRISFGDQG--SSDQEETPLDINQKHPTYAITITG--IAVGNNLMDLEVSTIFDTGT 321
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK---ALGQVTEYFKPLA 296
S+ Y Y I + + A D R PF+ L + +
Sbjct: 322 SFTYLADPAYTYITDGFHSQVQAN--RHAADS-------RIPFEYCYDLSSSEARIQTPS 372
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
+S S+ + P + I + V CL I+ ++ NIIG+ FM V++D
Sbjct: 373 ISLRTVGGSLFPAIDPGQVISIQQHEYVYCLAIVKSTKL-----NIIGQNFMTGVRVVFD 427
Query: 356 NEKQRIGWKPEDCNTLLSLN 375
E++ +GWK +C SLN
Sbjct: 428 RERKILGWKKFNCYDTDSLN 447
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 100/387 (25%), Positives = 162/387 (41%), Gaps = 57/387 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
V+LTVG PP+ DTGS+L+W++C+ T+ + + P+++ VPCS+
Sbjct: 85 LTVSLTVGTPPQNVSMVLDTGSELSWLRCNK-----TQTFQTTFDPNRSSSYSPVPCSSL 139
Query: 73 RCA--ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-F 129
C +P P C N C + Y D SS G L +D F + S ++P T F
Sbjct: 140 TCTDRTRDFPIPASCDS-NQLCHAILSYADASSSEGNLASDTFYIGNS-----DMPGTIF 193
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG-RGVLFLG 188
GC + + G++G+ RG +S VSQ+ +CI + GVL LG
Sbjct: 194 GCMDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMD-----FPKFSYCISDSDFSGVLLLG 248
Query: 189 DGKVP-SSGVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGLKDLT---- 232
D + +TP++Q S L ++ I ++LL KS + D T
Sbjct: 249 DANFSWLMPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQ 308
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-----DKTLPICWRGPFKA--- 284
+ DSG + + VY + + + L++ D + +C+R P
Sbjct: 309 TMVDSGTQFTFLLGPVYSALRNEFLNQ-TSQILRVLEDPNYVFQGGMDLCYRVPLSQTSL 367
Query: 285 --LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNII 342
L V+ F+ + + R R VP E + G +V S+ E +I
Sbjct: 368 PWLPTVSLMFRGAEMKVSGDRLLYR--VPGE----VRGSDSVYCFTFGNSDLLAVEAYVI 421
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDCN 369
G Q+ + +D EK RIG+ C+
Sbjct: 422 GHHHQQNVWMEFDLEKSRIGFAQVQCD 448
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/380 (24%), Positives = 159/380 (41%), Gaps = 48/380 (12%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSNPRC---A 75
+G PP+ DT S+LTWVQ CT C+ + P + PC++ C +
Sbjct: 5 IGTPPREVLLLVDTASELTWVQ-GTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCLGRS 63
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV-PLTFGCGYN 134
L + + C C +++ Y DG + G + ++F L+ +G+ + + FGC
Sbjct: 64 KLGFQSA--CNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCASK 121
Query: 135 QHNPGPLSPPD-TAGVLGLGRGRISIVSQL--REYGLIRNVIGHCIGQ-----NGRGVLF 186
P D ++G LGL RG S +Q+ R + + +C N GV+
Sbjct: 122 DLQ----RPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVII 177
Query: 187 LGDGKVPSSGVAWTPMLQN---SADLKHYILG------PAELLYSGKSC----GLKDLTL 233
GD +P+ + + Q ++ + Y +G ELL+ +S L +
Sbjct: 178 FGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGT 237
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
FDSG + ++ + +V R ++ + + D T +C+ A G
Sbjct: 238 YFDSGTTVSFLVEPAHTALVEAFGRRVLHLN-RTSGSDFTKELCYD---VAAGDARLPTA 293
Query: 294 PL-ALSFTNRRNSVRLVVPPEAYLVISGRK----NVCLGILNGSEAEVGENNIIGEIFMQ 348
PL L F +N+V + + + V R +CL +N G N+IG Q
Sbjct: 294 PLVTLHF---KNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQ 350
Query: 349 DKMVIYDNEKQRIGWKPEDC 368
D ++ +D E+ RIG+ P +C
Sbjct: 351 DYLIEHDLERSRIGFAPANC 370
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 97/380 (25%), Positives = 152/380 (40%), Gaps = 66/380 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +VG PP DTGSD+ W+QC PC C K + P K+ +PCS+
Sbjct: 87 YLMTYSVGTPPFNVYGVVDTGSDIVWLQC-KPCEQCYKQTTPIFNPSKSSSYKNIPCSSN 145
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
C ++ + + C N C+Y I + D S G L + L + G + P T GC
Sbjct: 146 LCQSVRYTS---CNKQN-SCEYTINFSDQSYSQGELSVETLTLDSTTGHSVSFPKTVIGC 201
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC-----IGQNGRGVLF 186
G HN + +T+G++GLG G +S+ +QL+ I +C + N L
Sbjct: 202 G---HNNRGMFQGETSGIVGLGIGPVSLTTQLKSS--IGGKFSYCLLPLLVDSNKTSKLN 256
Query: 187 LGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL------TLIFDSGA 239
GD V S GV TP ++ +Y+ A K + L +I DSG
Sbjct: 257 FGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEA-FSVGNKRIEFEVLDDSEEGNIILDSGT 315
Query: 240 SYAYFTSRVYQEIVS----LIMRDLIGTPLKL-------APDDKTLPICWRGPFKALGQV 288
+ S VY + S L+ D + P +L D PI +
Sbjct: 316 TLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLNLCYSITSDQYDFPI-----------I 364
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
T +FK + + P + VCL + ++ G I G +
Sbjct: 365 TAHFK-----------GADIKLNPISTFAHVADGVVCLAF---TSSQTGP--IFGNLAQL 408
Query: 349 DKMVIYDNEKQRIGWKPEDC 368
+ +V YD ++ + +KP DC
Sbjct: 409 NLLVGYDLQQNIVSFKPSDC 428
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 105/385 (27%), Positives = 170/385 (44%), Gaps = 55/385 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +++ +G PPK + DTGSDL W+QC PC C + Y P ++ + C +P
Sbjct: 90 YFMDVFIGTPPKHYSLILDTGSDLNWIQC-VPCHDCFEQNGPYYDPKESSSFRNIGCHDP 148
Query: 73 RCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-------VFN 124
RC + P+PP CK N C Y YGD ++ G T+ F + ++ + V N
Sbjct: 149 RCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRVEN 208
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQ 179
V FGCG+ N G +G+LGLGRG +S SQL+ L + +C+
Sbjct: 209 V--MFGCGH--WNRGLFH--GASGLLGLGRGPLSFSSQLQ--SLYGHSFSYCLVDRNSDT 260
Query: 180 NGRGVLFLGDGK--VPSSGVAWTPML---QNSADLKHYILGPAELLYSGKSCGLKDLT-- 232
N L G+ K + + +T ++ +N D +Y+ + ++ G+ + + T
Sbjct: 261 NVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKS-IMVGGEVLNIPESTWN 319
Query: 233 --------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKA 284
I DSG + +YFT YQ I ++ + G P+ D L C +
Sbjct: 320 MTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPI--VQDFPILDPC----YNV 373
Query: 285 LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIG 343
G + F + P E Y + + + VCL IL + + +IIG
Sbjct: 374 SGVEKIDLPDFGILFAD---GAVWNFPVENYFIRLDPEEVVCLAILGTPRSAL---SIIG 427
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDC 368
Q+ V+YD +K R+G+ P +C
Sbjct: 428 NYQQQNFHVLYDTKKSRLGYAPMNC 452
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 141/351 (40%), Gaps = 38/351 (10%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALH----WPNPPRCK 86
DT S+LTWVQC APC C + P + ++PC++ C AL
Sbjct: 143 DTASELTWVQC-APCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACGG 201
Query: 87 HPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDT 146
C Y + Y DG S G L D L G V + FGCG + N GP T
Sbjct: 202 GEQPSCSYTLSYRDGSYSQGVLAHDKLSL---AGEVID-GFVFGCGTS--NQGPFG--GT 253
Query: 147 AGVLGLGRGRISIVSQ-LREYGLIRNVIGHCI---GQNGRGVLFLGDGKV---PSSGVAW 199
+G++GLGR ++S++SQ + ++G V +C+ G L LGD S+ + +
Sbjct: 254 SGLMGLGRSQLSLISQTMDQFG---GVFSYCLPLKESESSGSLVLGDDTSVYRNSTPIVY 310
Query: 200 TPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRD 259
T M+ + Y + + G+ +I DSG VY + + +
Sbjct: 311 TTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQ 370
Query: 260 LIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN-SVRLVVPPEAYLVI 318
P AP L C+ L E P +L F N V + Y V
Sbjct: 371 FAEYP--QAPGFSILDTCFN-----LTGFREVQIP-SLKFVFEGNVEVEVDSSGVLYFVS 422
Query: 319 SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
S VCL + S E +IIG ++ VI+D +IG+ E C+
Sbjct: 423 SDSSQVCLAL--ASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETCD 471
>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 95/374 (25%), Positives = 150/374 (40%), Gaps = 47/374 (12%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAALHWP 80
+ +G P F D GSDL WV CD C C Y + +P ++ P
Sbjct: 97 IDIGTPNVSFLVALDAGSDLLWVPCD--CMQCAPLSASYYDRLGRDLNEYSPSLSSTSKP 154
Query: 81 NP---------PRCKHPNDQCDYEIEY-GDGGSSIGALVTDL-----FPLRFSNGSVFNV 125
CK D C Y Y + SS G L+ D F S SV+
Sbjct: 155 LSCNDQLCELGSDCKSSKDPCPYLASYYSENTSSSGLLIEDRLHLAPFSEHASRSSVW-A 213
Query: 126 PLTFGCGYNQHNP-GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV 184
+ GCG Q + PD G++GLG G +S+ S L + GL+RN C N G
Sbjct: 214 SVIIGCGRKQSGAFSDGAAPD--GLMGLGPGDLSVPSLLAKAGLVRNTFSICFDDNHSGT 271
Query: 185 LFLGD-GKVPSSGVAWTPM----LQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
+ GD G V ++ P+ + +++ Y++G + L K+ G + L DSG
Sbjct: 272 ILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGSSSL----KTAGFQALV---DSGT 324
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL-- 297
S+ + +Y++IV + + T + + C+ + L + A+
Sbjct: 325 SFTFLPYEIYEKIVVEFDKQVNAT--RSSFKGSPWKYCYNSSSQELLNIPTVTLVFAMNQ 382
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNE 357
SF ++L+ E + V CL I E E IIG+ FM +++D E
Sbjct: 383 SFIVHNPVIKLISENEEFNVF------CLPIQPIHE----EFGIIGQNFMWGYRMVFDRE 432
Query: 358 KQRIGWKPEDCNTL 371
++GW +C +
Sbjct: 433 NLKLGWSTSNCQDI 446
>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 531
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 95/374 (25%), Positives = 150/374 (40%), Gaps = 47/374 (12%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAALHWP 80
+ +G P F D GSDL WV CD C C Y + +P ++ P
Sbjct: 107 IDIGTPNVSFLVALDAGSDLLWVPCD--CMQCAPLSASYYDRLGRDLNEYSPSLSSTSKP 164
Query: 81 NP---------PRCKHPNDQCDYEIEY-GDGGSSIGALVTDL-----FPLRFSNGSVFNV 125
CK D C Y Y + SS G L+ D F S SV+
Sbjct: 165 LSCNDQLCELGSDCKSSKDPCPYLASYYSENTSSSGLLIEDRLHLAPFSEHASRSSVW-A 223
Query: 126 PLTFGCGYNQHNP-GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV 184
+ GCG Q + PD G++GLG G +S+ S L + GL+RN C N G
Sbjct: 224 SVIIGCGRKQSGAFSDGAAPD--GLMGLGPGDLSVPSLLAKAGLVRNTFSICFDDNHSGT 281
Query: 185 LFLGD-GKVPSSGVAWTPM----LQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
+ GD G V ++ P+ + +++ Y++G + L K+ G + L DSG
Sbjct: 282 ILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGSSSL----KTAGFQALV---DSGT 334
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL-- 297
S+ + +Y++IV + + T + + C+ + L + A+
Sbjct: 335 SFTFLPYEIYEKIVVEFDKQVNAT--RSSFKGSPWKYCYNSSSQELLNIPTVTLVFAMNQ 392
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNE 357
SF ++L+ E + V CL I E E IIG+ FM +++D E
Sbjct: 393 SFIVHNPVIKLISENEEFNVF------CLPIQPIHE----EFGIIGQNFMWGYRMVFDRE 442
Query: 358 KQRIGWKPEDCNTL 371
++GW +C +
Sbjct: 443 NLKLGWSTSNCQDI 456
>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Brachypodium distachyon]
Length = 509
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 151/387 (39%), Gaps = 66/387 (17%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC-----TKPPEKQYKPHKNI----VPC 69
+ +G P F DTGSDL WV CD C C T K Y P ++ V C
Sbjct: 85 AKVALGTPNATFVVALDTGSDLFWVPCD--CKRCAPIANTSELLKPYSPRQSSTSKPVTC 142
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSN--------- 119
S+ C P C + N C Y ++Y SS G LV D+ + +
Sbjct: 143 SHSLC-----DRPNACGNGNGSCPYTVKYVSANTSSSGVLVEDVLYMTRQSSSSRSGNGG 197
Query: 120 --GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHC 176
G + FGCG Q L G+LGLG R+S+ S L GL+ + C
Sbjct: 198 NVGEAVGARVVFGCGQEQTG-AFLDGAAMEGLLGLGMDRVSVPSLLAAAGLVGSDSFSMC 256
Query: 177 IGQNGRGVLFLGDGKVPSSGVAW--TPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLI 234
+G G + G+ PS A TP + S Y + + GK + +
Sbjct: 257 FSPDGNGRINFGE---PSDAGAQNETPFIV-SKTRPTYNISVTAVNVKGKGAMAAEFAAV 312
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK-----ALGQVT 289
DSG S+ Y Y L+ T +K + PF+ + GQ T
Sbjct: 313 VDSGTSFTYLNDPAYS---------LLATSFNSQVREKRANLSASIPFEYCYALSRGQ-T 362
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN--------VCLGILNGSEAEVGENNI 341
E P +S T R +V V P +++++G CL + S+ + +I
Sbjct: 363 EVLMP-EVSLTTRGGAVFPVTRP--FVIVAGETTDGQVHAVGYCLAVFK-SDIPI---DI 415
Query: 342 IGEIFMQDKMVIYDNEKQRIGWKPEDC 368
IG+ FM V++D ++ +GW DC
Sbjct: 416 IGQNFMTGLKVVFDRQRSVLGWTKFDC 442
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 110/383 (28%), Positives = 152/383 (39%), Gaps = 54/383 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ V+L +G PP+ DTGSDL W QC PC C + P ++ C +
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLSLTSCDST 93
Query: 73 RCAALHWPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L + K PN C Y YGD + G L D F + SV V FGC
Sbjct: 94 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGV--AFGC 151
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G N G +T G+ G GRG +S+ SQL+ G + G VL
Sbjct: 152 GL--FNNGVFKSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTTITGAIPSTVLLDLPAD 207
Query: 192 VPSSG---VAWTPMLQ---NSAD-------LKHYILGPAELLYSGKSCGLKDLT--LIFD 236
+ S+G V TP++Q N A+ LK +G L + L + T I D
Sbjct: 208 LFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTIID 267
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGPFKALGQVTEYFK 293
SG S +VYQ ++RD +KL P + T C+ P +A V +
Sbjct: 268 SGTSITSLPPQVYQ-----VVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPK--- 319
Query: 294 PLALSFTNR-----RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
L L F R + VP +A G +CL I G E IIG Q
Sbjct: 320 -LVLHFEGATMDLPRENYVFEVPDDA-----GNSIICLAINKGD-----ETTIIGNFQQQ 368
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
+ V+YD + + + C+ L
Sbjct: 369 NMHVLYDLQNNMLSFVAAQCDKL 391
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 143/372 (38%), Gaps = 48/372 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V G P K DTGSD+TW+QC PC+ C + ++P ++ + C +
Sbjct: 138 YIVTAGFGTPAKNSLLIIDTGSDVTWIQCK-PCSDCYSQVDPIFEPQQSSSYKHLSCLSS 196
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C L N R C YEI YGDG S G + L GS FGCG
Sbjct: 197 ACTELTTMNHCRL----GGCVYEINYGDGSRSQGDFSQETLTL----GSDSFPSFAFGCG 248
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREY--GLIRNVIGHCIGQNGRGVLFLGDG 190
+ N G +AG+LGLGR +S SQ + G + + G +G G
Sbjct: 249 HT--NTGLFK--GSAGLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTSTGSFSVGQG 304
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAYFT 245
+P++ + P++ NS Y +G + G+ + L I DSG
Sbjct: 305 SIPATAT-FVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGGTIVDSGTVITRLV 363
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG---QVTEYFK----PLALS 298
+ Y LK + KT + PF L ++ Y + +
Sbjct: 364 PQAYDA-------------LKTSFRSKTRNLPSAKPFSILDTCYDLSSYSQVRIPTITFH 410
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
F N + V + + + S VCL + S++ NIIG Q V +D
Sbjct: 411 FQNNAD-VAVSAVGILFTIQSDGSQVCLAFASASQSI--STNIIGNFQQQRMRVAFDTGA 467
Query: 359 QRIGWKPEDCNT 370
RIG+ P C T
Sbjct: 468 GRIGFAPGSCAT 479
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 167/386 (43%), Gaps = 57/386 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ +++ VG PPK F DTGSDL W+QC PC C + Y P ++NI C +
Sbjct: 181 YFIDVFVGTPPKHFSLILDTGSDLNWIQC-VPCYECFEQNGPHYDPGQSSSYRNI-GCHD 238
Query: 72 PRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-------VF 123
RC + P+PP+ CK N C Y YGD ++ G + F + + S V
Sbjct: 239 SRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVE 298
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----G 178
NV FGCG+ N G AG+LGLGRG +S SQL+ L + +C+
Sbjct: 299 NV--MFGCGH--WNRGLFH--GAAGLLGLGRGPLSFSSQLQ--SLYGHSFSYCLVDRNSD 350
Query: 179 QNGRGVLFLGDGK--VPSSGVAWTPML---QNSADLKHYILGPAELLYSGKSCGLKDLT- 232
N L G+ K + + +T ++ +N D +Y+ + ++ G+ + +
Sbjct: 351 ANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKS-IVVGGEVVNIPEEKW 409
Query: 233 ---------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK 283
I DSG + +YF YQ I M + G P + D L C +
Sbjct: 410 QIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYP--VVKDFPVLEPC----YN 463
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNII 342
G + F+ + P E Y + I R+ VCL IL + + +II
Sbjct: 464 VTGVEQPDLPDFGIVFS---DGAVWNFPVENYFIEIEPREVVCLAILGTPPSAL---SII 517
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDC 368
G Q+ ++YD +K R+G+ P C
Sbjct: 518 GNYQQQNFHILYDTKKSRLGFAPTKC 543
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 141/351 (40%), Gaps = 38/351 (10%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALH----WPNPPRCK 86
DT S+LTWVQC APC C + P + ++PC++ C AL
Sbjct: 142 DTASELTWVQC-APCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACGG 200
Query: 87 HPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDT 146
C Y + Y DG S G L D L G V + FGCG + N GP T
Sbjct: 201 GEQPSCSYTLSYRDGSYSQGVLAHDKLSL---AGEVID-GFVFGCGTS--NQGPFG--GT 252
Query: 147 AGVLGLGRGRISIVSQ-LREYGLIRNVIGHCI---GQNGRGVLFLGDGKV---PSSGVAW 199
+G++GLGR ++S++SQ + ++G V +C+ G L LGD S+ + +
Sbjct: 253 SGLMGLGRSQLSLISQTMDQFG---GVFSYCLPLKESESSGSLVLGDDTSVYRNSTPIVY 309
Query: 200 TPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRD 259
T M+ + Y + + G+ +I DSG VY + + +
Sbjct: 310 TTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQ 369
Query: 260 LIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN-SVRLVVPPEAYLVI 318
P AP L C+ L E P +L F N V + Y V
Sbjct: 370 FAEYP--QAPGFSILDTCFN-----LTGFREVQIP-SLKFVFEGNVEVEVDSSGVLYFVS 421
Query: 319 SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
S VCL + S E +IIG ++ VI+D +IG+ E C+
Sbjct: 422 SDSSQVCLAL--ASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETCD 470
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 104/409 (25%), Positives = 163/409 (39%), Gaps = 74/409 (18%)
Query: 10 FFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN---- 65
P + V L +G PP F DT SDL W QC PCTGC + + P +
Sbjct: 82 IMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYA 140
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+PCS+ C L + RC H +D+ C Y Y ++ G L D + G
Sbjct: 141 ALPCSSDTCDEL---DVHRCGHDDDESCQYTYTYSGNATTEGTLAVD----KLVIGEDAF 193
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNG 181
+ FGC + P PP +GV+GLGRG +S+VSQL +R +C+
Sbjct: 194 RGVAFGCSTSSTGGAP--PPQASGVVGLGRGPLSLVSQLS----VRR-FAYCLPPPASRI 246
Query: 182 RGVLFLG---DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT------ 232
G L LG D ++ PM ++ +Y L LL ++ L T
Sbjct: 247 PGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATA 306
Query: 233 ---------------------------LIFDSGASYAYFTSRVYQEIVSLI---MRDLIG 262
+I D ++ + + +Y E+V+ + +R G
Sbjct: 307 TATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIRLPRG 366
Query: 263 TPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRK 322
T L D +C+ P + Y +AL+F R +RL +A L R+
Sbjct: 367 TGSSLGLD-----LCFILP-DGVAFDRVYVPAVALAFDGR--WLRL---DKARLFAEDRE 415
Query: 323 NVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
+ + ++ G AE G +I+G Q+ V+Y+ + R+ + C L
Sbjct: 416 SGMMCLMVG-RAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPCGAL 463
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 96/374 (25%), Positives = 157/374 (41%), Gaps = 32/374 (8%)
Query: 7 EFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN- 65
E P + + +G PP DTGS L W+QC +PC C ++P K+
Sbjct: 79 ESLLIPDKGEYLMRFYIGSPPVERLAMVDTGSSLIWLQC-SPCHNCFPQETPLFEPLKSS 137
Query: 66 ---IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS- 121
C + C L P+ C QC Y I YGD S+G L T+ + G+
Sbjct: 138 TYKYATCDSQPCTLLQ-PSQRDCGKLG-QCIYGIMYGDKSFSVGILGTETLSFGSTGGAQ 195
Query: 122 VFNVPLT-FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--- 177
+ P T FGCG + +N + G+ GLG G +S+VSQL I + +C+
Sbjct: 196 TVSFPNTIFGCGVD-NNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQ--IGHKFSYCLLPY 252
Query: 178 -GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK--SCGLKDLTLI 234
+ + F + + ++GV TP++ + +Y L + K S G D ++
Sbjct: 253 DSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTGQTDGNIV 312
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG Y + Y V+ + L L+ P L C+ P +A + +
Sbjct: 313 IDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLPSP--LKTCF--PNRANLAIPD---- 364
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
+A FT ++R P + ++ +CL ++ S + ++ G I D V Y
Sbjct: 365 IAFQFTGASVALR---PKNVLIPLTDSNILCLAVVPSSGIGI---SLFGSIAQYDFQVEY 418
Query: 355 DNEKQRIGWKPEDC 368
D E +++ + P DC
Sbjct: 419 DLEGKKVSFAPTDC 432
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 104/409 (25%), Positives = 163/409 (39%), Gaps = 74/409 (18%)
Query: 10 FFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN---- 65
P + V L +G PP F DT SDL W QC PCTGC + + P +
Sbjct: 82 IMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYA 140
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+PCS+ C L + RC H +D+ C Y Y ++ G L D + G
Sbjct: 141 ALPCSSDTCDEL---DVHRCGHDDDESCQYTYTYSGNATTEGTLAVD----KLVIGEDAF 193
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNG 181
+ FGC + P PP +GV+GLGRG +S+VSQL +R +C+
Sbjct: 194 RGVAFGCSTSSTGGAP--PPQASGVVGLGRGPLSLVSQLS----VRR-FAYCLPPPASRI 246
Query: 182 RGVLFLG---DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT------ 232
G L LG D ++ PM ++ +Y L LL ++ L T
Sbjct: 247 PGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRAMSLPPTTTTTATA 306
Query: 233 ---------------------------LIFDSGASYAYFTSRVYQEIVSLI---MRDLIG 262
+I D ++ + + +Y E+V+ + +R G
Sbjct: 307 TATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIRLPRG 366
Query: 263 TPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRK 322
T L D +C+ P + Y +AL+F R +RL +A L R+
Sbjct: 367 TGSSLGLD-----LCFILP-DGVAFDRVYVPAVALAFDGR--WLRL---DKARLFAEDRE 415
Query: 323 NVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
+ + ++ G AE G +I+G Q+ V+Y+ + R+ + C L
Sbjct: 416 SGMMCLMVG-RAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPCGAL 463
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 103/385 (26%), Positives = 157/385 (40%), Gaps = 50/385 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK--PPEKQYKPHKNIVP---CS 70
YF V+L +G PP+ DTGSDL WV+C A C CT+ P H C
Sbjct: 89 YF-VDLRLGTPPQKLLLVADTGSDLVWVKCSA-CRNCTRHTPGSAFLARHSTTFSPNHCY 146
Query: 71 NPRCAALHWPNPPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-L 127
+ C + P RC H + C YE YGDG + G + L S+G + +
Sbjct: 147 DSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGI 206
Query: 128 TFGCGYNQHNPGP--LSPPDTAGVLGLGRGRISIVSQL-REYG--LIRNVIGHCIGQNGR 182
FGC + P S GV+GLGRG IS+ SQL +G ++ H I +
Sbjct: 207 AFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDISPSPT 266
Query: 183 GVLFLG----DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-------GLKDL 231
L +G D + +TP+ N Y +G + G L +L
Sbjct: 267 SYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPINPSVWALDEL 326
Query: 232 ---TLIFDSGASYAYFTSRVYQEIVSLIMRDL-IGTPLKLAPDDKTLPICWRGPFKALGQ 287
I DSG + + Y +I+++I R + + +P + P F
Sbjct: 327 GNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPG-----------FDLCVN 375
Query: 288 VTEYFKPL--ALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGEN--NIIG 343
V+E P LSF +SV PP Y V + CL + +A + + ++IG
Sbjct: 376 VSEIEHPRLPKLSFKLGGDSV-FSPPPRNYFVDTDEDVKCLAL----QAVMTPSGFSVIG 430
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDC 368
+ Q ++ +D ++ R+G+ C
Sbjct: 431 NLMQQGFLLEFDKDRTRLGFSRHGC 455
>gi|449533544|ref|XP_004173734.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like, partial [Cucumis sativus]
Length = 408
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 70/251 (27%), Positives = 104/251 (41%), Gaps = 27/251 (10%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCT----GCTKPPEKQYKPHKNI----VP 68
+ +G P F D GSDL WV C+ AP + G +Y+P + +
Sbjct: 107 IDIGTPSVSFLVALDAGSDLLWVPCNCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHIS 166
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPL----RFSNGSVF 123
CS+ C + C+ P C Y I+Y + SS G L+ D+ L S+
Sbjct: 167 CSHNLCDSGQ-----SCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCENSSNCTI 221
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
P+ GCG Q G LS G+ GLG G IS++S L + L++N C ++G G
Sbjct: 222 QAPVILGCGMKQSG-GYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSG 280
Query: 184 VLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
+F GD G ++ P+ + YI+G + DSG S+
Sbjct: 281 RIFFGDEGPASQQTTSFVPL---DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFT 337
Query: 243 YFTSRVYQEIV 253
Y Y+ IV
Sbjct: 338 YLPEEAYENIV 348
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 98/350 (28%), Positives = 145/350 (41%), Gaps = 40/350 (11%)
Query: 35 DTGSDLTWVQCDAPC--TGCTKPPEKQYKPHKN----IVPCSNPRCAALHWPNPPRCKHP 88
DTGSDLTWVQC +PC T C Y P + ++PC + C L + + C
Sbjct: 114 DTGSDLTWVQC-SPCDNTKCFAQNTPLYDPLNSSTFTLLPCDSQPCTQLPY-SQYVCSDY 171
Query: 89 NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAG 148
D C Y YGD S G L +D L +N + FGCG+ S T G
Sbjct: 172 GD-CIYAYTYGDNSYSYGGLSSDSIRLMLLQLH-YNSKICFGCGFQNKFTADKS-GKTTG 228
Query: 149 VLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDGK-VPSSGVAWTPMLQ 204
++GLG G +S+VSQL + I + +C+ N L G+ V +GV TP++
Sbjct: 229 IVGLGAGPLSLVSQLGDE--IGHKFSYCLLPFSSNSNSKLKFGEAAIVQGNGVVSTPLII 286
Query: 205 NSADLKHYILGPAELLYSGKSC--GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIG 262
DL Y L + K+ G D +I DSG++ Y Y E VSL+ +
Sbjct: 287 K-PDLPFYYLNLEGITVGAKTVKTGQTDGNIIIDSGSTLTYLEESFYNEFVSLVKETVA- 344
Query: 263 TPLKLAPDDKTLPICWRGPFKALGQVTEYFKP---LALSFTNRRNSVRLVVPPEAYLVIS 319
+D+ +P PF E + FT +V+ P LV+
Sbjct: 345 -----VEEDQYIPY----PFDFCFTYKEGMSTPPDVVFHFTGG----DVVLKPMNTLVLI 391
Query: 320 GRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+C ++ + I G + D V YD + ++ + P DC+
Sbjct: 392 EDNLICSTVVPSHFDGIA---IFGNLGQIDFHVGYDIQGGKVSFAPTDCS 438
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 104/386 (26%), Positives = 156/386 (40%), Gaps = 52/386 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC--TKPPEKQYKPHKNI----VPCS 70
+ +N+++G PP F DTGS+L W QC APCT C P +P ++ +PC+
Sbjct: 91 YNMNISLGTPPLDFPVIVDTGSNLIWAQC-APCTRCFPRPTPAPVLQPARSSTFSRLPCN 149
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L + PR + C Y YG G ++ G L T+ L +G+ V FG
Sbjct: 150 GSFCQYLPTSSRPRTCNATAACAYNYTYGSGYTA-GYLATET--LTVGDGTFPKV--AFG 204
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
C +++G++GLGRG +S+VSQL G + + G + G
Sbjct: 205 CSTEN------GVDNSSGIVGLGRGPLSLVSQL-AVGRFSYCLRSDMADGGASPILFGSL 257
Query: 191 KVPSSG--VAWTPMLQNS---------ADLKHYILGPAELLYSGKSCGLKDLTL----IF 235
+ G V TP+L+N +L + EL +G + G L I
Sbjct: 258 AKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIV 317
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIG----TPLKLAPDDKTLPICWRGPFKALGQVTEY 291
DSG + Y Y + + TP AP D L +C++ P G
Sbjct: 318 DSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYD--LDLCYK-PSAGGGGKAVR 374
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLV-----ISGRKNV-CLGILNGSEAEVGENNIIGEI 345
LAL F + VP + Y GR V CL +L ++ +IIG +
Sbjct: 375 VPRLALRFA---GGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL--PISIIGNL 429
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTL 371
D ++YD + + P DC L
Sbjct: 430 MQMDMHLLYDIDGGMFSFAPADCAKL 455
>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
Length = 416
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 102/414 (24%), Positives = 160/414 (38%), Gaps = 83/414 (20%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV--------- 67
+ ++L +G PP++ DTGSDLTWV C C + Y+ K +
Sbjct: 12 YLISLNIGTPPQVIQVYMDTGSDLTWVPCGNLSFDCMDCDD--YRNSKLMSAFSPSHSSS 69
Query: 68 ----PCSNPRCAALHWPN-----------------PPRCKHPNDQCDYEIEYGDGGSSIG 106
C++P C +H + C P Y YG GG G
Sbjct: 70 SYRDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAY--TYGAGGVVTG 127
Query: 107 ALVTDLFPLRFSNGSVF---NVP-LTFGC-GYNQHNPGPLSPPDTAGVLGLGRGRISIVS 161
L D LR G ++P FGC G H P G+ G RG +S S
Sbjct: 128 TLTRDT--LRVHEGPARVTKDIPKFCFGCVGSTYHEP--------IGIAGFVRGTLSFPS 177
Query: 162 QLREYGLIRNVIGHCI-------GQNGRGVLFLGDGKVPSS-GVAWTPMLQNSADLKHYI 213
QL GL++ HC N L +GD + S + +TPML++ +Y
Sbjct: 178 QL---GLLKKGFSHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPNYYY 234
Query: 214 LGPAELLYSGKSCGLKDLTL-----------IFDSGASYAYFTSRVYQEIVSLIMRDLIG 262
+G + S L L + DSG +Y + Y +++S I + +I
Sbjct: 235 IGLEAITVGNVSATTVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLS-IFKAIIT 293
Query: 263 TPLKLAPDDKT-LPICWRGPF--KALGQVTEYFKPLALSFTNRRNSVRLVVPP-EAYLVI 318
P + + +C++ P L F + F N+V V+P + +
Sbjct: 294 YPRATEVEMRAGFDLCYKVPCPNNRLTDDDNLFPSITFHFL---NNVSFVLPQGNHFYAM 350
Query: 319 SGRKNV----CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
S N CL + ++++ G + G Q+ ++YD EK+RIG++P DC
Sbjct: 351 SAPSNSTVVKCLLFQSMADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQPMDC 404
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 93/372 (25%), Positives = 157/372 (42%), Gaps = 40/372 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + VG P + F DTGS+LTWV+ C G PP ++P + VPCS+
Sbjct: 91 YF-VKVLVGTPAQEFTLVADTGSELTWVK----CAGGASPPGLVFRPEASKSWAPVPCSS 145
Query: 72 PRCAALHWP-NPPRCKHPNDQCDYEIEYGDGGS-SIGALVTDLFPLRFSNGSVFNVP-LT 128
C L P + C C Y+ Y +G + ++G + TD + G V + +
Sbjct: 146 DTC-KLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQLQDVV 204
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREY---GLIRNVIGHCIGQNGRGVL 185
GC + H+ D GVL LG +IS S+ ++ H +N G L
Sbjct: 205 LGCS-STHDGQSFKSVD--GVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPRNATGYL 261
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-------KDLTLIFDSG 238
G G+VP + T + + A + Y + + +G++ + K +I DSG
Sbjct: 262 AFGPGQVPRTPATQTKLFLDPA-MPFYGVKVDAVHVAGQALDIPAEVWDPKSGGVILDSG 320
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTP-LKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
+ + Y+ +V+ + + L G P + P + W P ++ + LA+
Sbjct: 321 TTLTVLATPAYKAVVAALTKLLAGVPKVDFPPFEHCY--NWTAPRPGAPEIPK----LAV 374
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNE 357
FT RL P ++Y++ C+G+ G V ++IG I Q+ + +D +
Sbjct: 375 QFT---GCARLEPPAKSYVIDVKPGVKCIGLQEGEWPGV---SVIGNIMQQEHLWEFDLK 428
Query: 358 KQRIGWKPEDCN 369
+ + P C
Sbjct: 429 NMEVRFMPSTCT 440
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 99/369 (26%), Positives = 153/369 (41%), Gaps = 41/369 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V++ +G P + FDTGSDL+WVQC PC GC + + + P ++ VPC
Sbjct: 138 YIVSVGLGTPKRDLLVVFDTGSDLSWVQCK-PCDGCYQQHDPLFDPSQSTTYSAVPCGAQ 196
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL---TF 129
C L + C + +C YE+ YGD + G L D L S+ S + L F
Sbjct: 197 ECRRL---DSGSCS--SGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVF 251
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCI--GQNGRGVLF 186
GCG + G D G+ GLGR R+S+ SQ +YG +C+ G L
Sbjct: 252 GCG--DDDTGLFGKAD--GLFGLGRDRVSLASQAAAKYGA---GFSYCLPSSSTAEGYLS 304
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASY 241
LG P++ +T M+ S Y L + +G++ + + DSG
Sbjct: 305 LGSAAPPNA--RFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVIDSGTVI 362
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
SR Y + S + K AP L C+ F +V +AL F
Sbjct: 363 TRLPSRAYAALRSSFAGLMRRYSYKRAPALSILDTCY--DFTGRNKVQ--IPSVALLFD- 417
Query: 302 RRNSVRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
L + L ++ + CL NG + + I+G + + V+YD Q+
Sbjct: 418 --GGATLNLGFGEVLYVANKSQACLAFASNGDDTSIA---ILGNMQQKTFAVVYDVANQK 472
Query: 361 IGWKPEDCN 369
IG+ + C+
Sbjct: 473 IGFGAKGCS 481
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 158/377 (41%), Gaps = 49/377 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L +GKPP F DTGSDLTW QC PC C Y P + +PCS+
Sbjct: 71 YLMELAIGKPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPLPCSSA 129
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C + W R P+ C Y YGDG S G L T+ L S+ V + FGCG
Sbjct: 130 TCLPI-WS---RNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAFGCG 185
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL------- 185
+ ++ G +GLGRG +S+++QL G+ + +C+ L
Sbjct: 186 TDNGG----DSLNSTGTVGLGRGTLSLLAQL---GVGK--FSYCLTDFFNSALDSPFLLG 236
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYI-------LGPAELLYSGKSCGLK-DLT--LIF 235
L + S V TP+LQ+ + Y LG L + L+ D T +I
Sbjct: 237 TLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMIV 296
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG ++ ++E+V + R L P+ + D F A Y L
Sbjct: 297 DSGTTFTILAESGFREVVGRVARVLGQPPVNASSLDAPC-------FPAPAGEPPYMPDL 349
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGR-KNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
L F + +RL + Y+ + + CL I G+ E +++G Q+ +++
Sbjct: 350 VLHFAGGAD-MRLYR--DNYMSYNEEDSSFCLNI-AGTTPE--STSVLGNFQQQNIQMLF 403
Query: 355 DNEKQRIGWKPEDCNTL 371
D ++ + P DC+ L
Sbjct: 404 DTTVGQLSFLPTDCSKL 420
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 94/365 (25%), Positives = 150/365 (41%), Gaps = 35/365 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ V + +G P K F FDTGSD+TW QC+ C K E + P +KNI CS+
Sbjct: 71 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNI-SCSS 129
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C + + C Y+++YGDG SIG T+ L SN VF L FGC
Sbjct: 130 ALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN--VFKNFL-FGC 186
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
G Q+N G+ R ++++ SQ + + + +C+ + +G L LG
Sbjct: 187 G-QQNNGLFGGAAGLLGLG---RTKLALPSQTAK--TYKKLFSYCLPASSSSKGYLSLG- 239
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----IFDSGASYAYFT 245
G+V S V +TP+ + Y L L G+ + + + DSG +
Sbjct: 240 GQVSKS-VKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAFSAGTVIDSGTVITRLS 298
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
Y E+ S + P C+ F V + ++F +
Sbjct: 299 PTAYSELSSAFQNLMTDYP--STSGYSIFDTCY--DFSKYDTVR--IPKVGVTF---KGG 349
Query: 306 VRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
V + + L ++G K VCL + + +I G + + V+YD K R+G+
Sbjct: 350 VEMDIDVSGILYPVNGLKKVCLAFAGNDDDS--DTSIFGNVQQRTYQVVYDGAKGRVGFA 407
Query: 365 PEDCN 369
P C+
Sbjct: 408 PGGCS 412
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 90/385 (23%), Positives = 150/385 (38%), Gaps = 60/385 (15%)
Query: 12 PIFSY---FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI-- 66
P+ +Y + + L++G PP + DTGSDL W QC PCT C K + P +
Sbjct: 52 PVSAYDCEYLMELSIGTPPIKIYAEADTGSDLVWFQC-IPCTKCYKQQNPMFDPRSSSSY 110
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VF 123
+ C C L + C C+Y Y D + G L + L + G V
Sbjct: 111 TNITCGTESCNKL---DSSLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVA 167
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI----- 177
+ FGCG+N G++GLGRG +S++SQ+ G N+ C+
Sbjct: 168 FQGIIFGCGHNNSGFNDRE----MGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNT 223
Query: 178 -------GQNGRGVLFLGDGKVPSSGVAWTPMLQNS-----ADLKHYILGPAELLYS-GK 224
G+G LG+G V TP++ A L + L +S G
Sbjct: 224 DPSITSQMNFGKGSEVLGNGTVS------TPLISKDGTGYFATLLGISVEDINLPFSNGS 277
Query: 225 SCG-LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK 283
S G + ++ DSG + Y Y ++ + + P ++ +C++ P
Sbjct: 278 SLGTITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRI----DGYELCYQTPTN 333
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIG 343
G L + F L+ P + ++ + N C + + +E V G
Sbjct: 334 LNGPT------LTIHF---EGGDVLLTPAQMFIPVQ-DDNFCFAVFDTNEEYV----TYG 379
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDC 368
+ ++ +D E+Q + +K DC
Sbjct: 380 NYAQSNYLIGFDLERQVVSFKATDC 404
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 91/380 (23%), Positives = 156/380 (41%), Gaps = 42/380 (11%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPC 69
F + ++ +G P + DTGS+LTW++C PC C + Y +++ V C
Sbjct: 97 FGEYYTSIKLGSPGQEAILIVDTGSELTWLKC-LPCKVCAPSVDTIYDAARSVSYKPVTC 155
Query: 70 SNPR-CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS--VFNVP 126
+N + C+ C QC + YGDG S G+L TD + G V
Sbjct: 156 NNSQLCSNSSQGTYAYCAR-GSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQD 214
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQ-----N 180
FGC L P +G+LGL G++++ QL + +G HC N
Sbjct: 215 FAFGCAQGDLE---LVPTGASGILGLNAGKMALPMQLGQRFGW---KFSHCFPDRSSHLN 268
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------- 233
GV+F G+ ++P V +T + +++L+ + G S +L L
Sbjct: 269 STGVVFFGNAELPHEQVQYTSVALTNSELQRKFY---HVALKGVSINSHELVLLPRGSVV 325
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD-KTLPICWRGPFKALGQVTEYF 292
I DSG+S++ F + ++ ++ + L D L C++ + ++
Sbjct: 326 ILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTL 385
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKN----VCLGILNGSEAEVGENNIIGEIFMQ 348
L+L F + V + +P L+ R +C +G V N+IG Q
Sbjct: 386 PSLSLVF---EDGVTIGIPSIGVLLPVARYQNHVKMCFAFEDGGPNPV---NVIGNYQQQ 439
Query: 349 DKMVIYDNEKQRIGWKPEDC 368
+ V YD ++ R+G+ C
Sbjct: 440 NLWVEYDIQRSRVGFARASC 459
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 96/366 (26%), Positives = 140/366 (38%), Gaps = 36/366 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P FDTGSDLTW QC C E + P K+ V CS+
Sbjct: 104 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSA 163
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C +L N C N C Y I+YGD S+G L + F L +N VF+ + FG
Sbjct: 164 ACGSLSSATGNAGSCSASN--CIYGIQYGDQSFSVGFLAKEKFTL--TNSDVFD-GVYFG 218
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR--GVLFLG 188
CG N N G + AG+LGLGR ++S SQ + +C+ + G L G
Sbjct: 219 CGEN--NQGLFT--GVAGLLGLGRDKLSFPSQTAT--AYNKIFSYCLPSSASYTGHLTFG 272
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAY 243
+ S V +TP+ + Y L + G+ + + DSG
Sbjct: 273 SAGISRS-VKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITR 331
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
+ Y + S + P L C F G T +A SF+
Sbjct: 332 LPPKAYAALRSSFKAKMSKYPTTSGVS--ILDTC----FDLSGFKTVTIPKVAFSFS--- 382
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
+ + + + VCL S+ I G + Q V+YD R+G+
Sbjct: 383 GGAVVELGSKGIFYVFKISQVCLAFAGNSDDS--NAAIFGNVQQQTLEVVYDGAGGRVGF 440
Query: 364 KPEDCN 369
P C+
Sbjct: 441 APNGCS 446
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 93/334 (27%), Positives = 134/334 (40%), Gaps = 65/334 (19%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQY------------KPH 63
Y+A + +G P K + DTGSD+ WV C C + P +
Sbjct: 80 YYA-KIGIGTPAKSYYVQVDTGSDIMWVNC----IQCKQCPRRSTLGIELTLYNIDESDS 134
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL---------FP 114
+V C + C + CK N C Y YGDG S+ G V D+
Sbjct: 135 GKLVSCDDDFCYQISGGPLSGCK-ANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLK 193
Query: 115 LRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVI 173
+ +NGSV FGCG Q S + G+LG G+ S++SQL G ++ +
Sbjct: 194 TQTANGSVI-----FGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIF 248
Query: 174 GHCI-GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADL----------KHYILGPAELLYS 222
HC+ G+NG G+ + G+V V TP++ N + ++ PA+L
Sbjct: 249 AHCLDGRNGGGIFAI--GRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQP 306
Query: 223 GKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPF 282
G G I DSG + AY +Y+ +V LK+ DK F
Sbjct: 307 GDRKG-----AIIDSGTTLAYLPEIIYEPLVKK------EPALKVHIVDKDYKC-----F 350
Query: 283 KALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
+ G+V E F + F NSV L V P YL
Sbjct: 351 QYSGRVDEGFPNVTFHF---ENSVFLRVYPHDYL 381
>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 537
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 107/376 (28%), Positives = 153/376 (40%), Gaps = 45/376 (11%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTK----PPEKQYKPHKN----I 66
+ VG P F DTGSDL WV CD AP + P + Y P K+
Sbjct: 109 AEVAVGTPNATFLVALDTGSDLFWVPCDCKQCAPIANASDLRGGPDLRPYSPGKSSTSKA 168
Query: 67 VPCSNPRCAALHWPNP-PRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPL-RFSNG--- 120
V C + C PN + + C Y + Y SS G LV D+ L R + G
Sbjct: 169 VTCEHALC---ERPNACAAAGNSSTSCPYTVRYVSANTSSSGVLVEDVLHLSREAAGGAS 225
Query: 121 SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQ 179
+ P+ GCG Q L G+LGLG ++S+ S L GL+ + C
Sbjct: 226 TAVTAPVVLGCGQVQTG-AFLDGAAVDGLLGLGMDKVSVPSVLHAAGLVASDSFSMCFSP 284
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
+G G + GD G A TP + Y + + SGK + I DSG
Sbjct: 285 DGFGRINFGDSG--RRGQAETPFTVRNTH-PTYNISVTAMSVSGKEVA-AEFAAIVDSGT 340
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI--CWRGPFKALGQ-VTEYFKPLA 296
S+ Y Y E+ + ++ L+ ++P C+ LG+ TE F P
Sbjct: 341 SFTYLNDPAYTELATGFNSEVRERRANLS---ASIPFEYCYE-----LGRGQTELFVP-E 391
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGEN----NIIGEIFMQDKMV 352
+S T R +V V P +VI G + + G V +N +IIG+ FM V
Sbjct: 392 VSLTTRGGAVFPVTRP--IVVIYGETSDGRIVAAGYCLAVLKNDITIDIIGQNFMTGLKV 449
Query: 353 IYDNEKQRIGWKPEDC 368
++D E+ +GW DC
Sbjct: 450 VFDRERSVLGWHEFDC 465
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 94/365 (25%), Positives = 150/365 (41%), Gaps = 35/365 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ V + +G P K F FDTGSD+TW QC+ C K E + P +KNI CS+
Sbjct: 119 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNI-SCSS 177
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C + + C Y+++YGDG SIG T+ L SN VF L FGC
Sbjct: 178 ALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN--VFKNFL-FGC 234
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
G Q+N G+ R ++++ SQ + + + +C+ + +G L LG
Sbjct: 235 G-QQNNGLFGGAAGLLGLG---RTKLALPSQTAK--TYKKLFSYCLPASSSSKGYLSLG- 287
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----IFDSGASYAYFT 245
G+V S V +TP+ + Y L L G+ + + + DSG +
Sbjct: 288 GQVSKS-VKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLS 346
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
Y E+ S + P C+ F V + ++F +
Sbjct: 347 PTAYSELSSAFQNLMTDYP--STSGYSIFDTCY--DFSKYDTVR--IPKVGVTF---KGG 397
Query: 306 VRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
V + + L ++G K VCL + + +I G + + V+YD K R+G+
Sbjct: 398 VEMDIDVSGILYPVNGLKKVCLAFAGNDDDS--DTSIFGNVQQRTYQVVYDGAKGRVGFA 455
Query: 365 PEDCN 369
P C+
Sbjct: 456 PGGCS 460
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 151/373 (40%), Gaps = 46/373 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L+VG PP DTGSD+ W QC+ PCT C + + P K+ V CS+P
Sbjct: 85 YLMKLSVGTPPFPIIAVADTGSDIIWTQCE-PCTNCYQQDLPMFNPSKSTTYRKVSCSSP 143
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
C+ N C D C Y I YGD S G D + ++G V P T GC
Sbjct: 144 VCSFTGEDN--SCSFKPD-CTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGC 200
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRG---VL 185
G++ N G + +G++GLG G S++ Q+ + +C IG + G +
Sbjct: 201 GHD--NAGSFD-ANVSGIVGLGLGPASLIKQMGS--AVGGKFSYCLTPIGNDDGGSNKLN 255
Query: 186 FLGDGKVPSSGVAWTPMLQN-------SADLKHYILGPAELLYSGKSCGL-KDLTLIFDS 237
F + V SG TP+ + S LK +G YS + L +I DS
Sbjct: 256 FGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIIIDS 315
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTLPICWRGPFKALGQVTEYFKP-L 295
G + +Y I + L+ D ++ L C+ +Y P +
Sbjct: 316 GTTLTLLPVDLYHNFAKAISNSI---NLQRTDDPNQFLEYCFE------TTTDDYKVPFI 366
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
A+ F L + E L+ +CL + ++ +I G I + +V YD
Sbjct: 367 AMHF----EGANLRLQRENVLIRVSDNVICLAFAGAQDNDI---SIYGNIAQINFLVGYD 419
Query: 356 NEKQRIGWKPEDC 368
+ +KP +C
Sbjct: 420 VTNMSLSFKPMNC 432
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 106/382 (27%), Positives = 165/382 (43%), Gaps = 47/382 (12%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSN 71
P + F +N ++G+PP DTGS LTWV C PC+ C++ + P K+ SN
Sbjct: 88 PRYVVFLMNFSIGEPPIPQLAVMDTGSSLTWVMCH-PCSSCSQQSVPIFDPSKS-STYSN 145
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C+ + +C N +C Y +EY GSS G + L + S+ VP L FG
Sbjct: 146 LSCSECN-----KCDVVNGECPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFG 200
Query: 131 CGYN---QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV--- 184
CG N P + GV GLG GR S+ L +G +CIG N R
Sbjct: 201 CGRKFSISSNGYPYQGIN--GVFGLGSGRFSL---LPSFG---KKFSYCIG-NLRNTNYK 251
Query: 185 ---LFLGDGKVPSSGVAWTPMLQNS---ADLKHYILGPAEL-----LYSGKSCGLKDLTL 233
L LGD K G + T + N +L+ +G +L L+ +S + +
Sbjct: 252 FNRLVLGD-KANMQGDSTTLNVINGLYYVNLEAISIGGRKLDIDPTLFE-RSITDNNSGV 309
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP--ICWRGPFKALGQVTEY 291
I DSGA + + T + E++S + +L+ L LA DK P +C+ G + Q
Sbjct: 310 IIDSGADHTWLTKYGF-EVLSFEVENLLEGVLVLAQQDKHNPYTLCYSG---VVSQDLSG 365
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSE--AEVGENNIIGEIFMQD 349
F + F L + + + + C+ +L G+ + + IG + Q+
Sbjct: 366 FPLVTFHFA---EGAVLDLDVTSMFIQTTENEFCMAMLPGNYFGDDYESFSSIGMLAQQN 422
Query: 350 KMVIYDNEKQRIGWKPEDCNTL 371
V YD + R+ ++ DC L
Sbjct: 423 YNVGYDLNRMRVYFQRIDCELL 444
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 95/365 (26%), Positives = 142/365 (38%), Gaps = 39/365 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + +G PP D+GSD+ WVQC PC C + + P + VPC +
Sbjct: 127 YF-VRVGIGSPPTEQYLVVDSGSDVIWVQCK-PCLECYAQADPLFDPATSATFSAVPCGS 184
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L C + CDYE+ YGDG + GAL + L G + GC
Sbjct: 185 AVCRTLRTSG---CGD-SGGCDYEVSYGDGSYTKGALALETLTL----GGTAVEGVAIGC 236
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G+ N G AG+LGLG G +S+V QL +C+ G G L LG +
Sbjct: 237 GH--RNRGLFV--GAAGLLGLGWGPMSLVGQLGG--AAGGAFSYCLASRGAGSLVLGRSE 290
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK-DLTLIFDSGASYAYF-----T 245
G W P+++N Y +G + + + L+ DL + + GA
Sbjct: 291 AVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTGTAV 350
Query: 246 SRVYQEIVSLIMRDLIGT--PLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
+R+ QE + + + L AP L C+ L T P + +
Sbjct: 351 TRLPQEAYAALRDAFVAAVGALPRAPGVSLLDTCYD-----LSGYTSVRVPTVSFYFD-- 403
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
+ L +P L+ CL S +I+G I + + D+ IG+
Sbjct: 404 GAATLTLPARNLLLEVDGGIYCLAFAPSSSGP----SILGNIQQEGIQITVDSANGYIGF 459
Query: 364 KPEDC 368
P C
Sbjct: 460 GPTTC 464
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 92/370 (24%), Positives = 157/370 (42%), Gaps = 35/370 (9%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ---YKPHKNI---- 66
F Y + + VG PP DTGSDL WV C + G ++P ++
Sbjct: 101 FEYL-MYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSSTYSQ 159
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNV 125
+ C + C AL + C + +C Y+ YGDG +IG L T+ F G V
Sbjct: 160 LSCQSNACQALSQAS---CD-ADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQVRV 215
Query: 126 P-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQN 180
P + FGC + + G + G++GLG G S+VSQL I + +C+ N
Sbjct: 216 PRVNFGC--STASAGTFR---SDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDAN 270
Query: 181 GRGVLFLGDGKVPSS-GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
L G V S G A TP++ + D +Y + + G+ D +I DSG
Sbjct: 271 SSSTLNFGSRAVVSEPGAASTPLVPSDVD-SYYTVALESVAVGGQEVATHDSRIIVDSGT 329
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP-LALS 298
+ + + +V+ + R + ++ P ++ L +C+ + + + P + L
Sbjct: 330 TLTFLDPALLGPLVTELERRI--KLQRVQPPEQLLQLCY--DVQGKSETDNFGIPDVTLR 385
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
F + + PE + +CL ++ SE++ +I+G I Q+ V YD +
Sbjct: 386 FG---GGAAVTLRPENTFSLLQEGTLCLVLVPVSESQ--PVSILGNIAQQNFHVGYDLDA 440
Query: 359 QRIGWKPEDC 368
+ + + DC
Sbjct: 441 RTVTFAAADC 450
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 106/386 (27%), Positives = 158/386 (40%), Gaps = 52/386 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC--TKPPEKQYKPHKNI----VPCS 70
+ +N+++G PP F DTGS+L W QC APCT C P +P ++ +PC+
Sbjct: 91 YNMNISLGTPPLDFPVIVDTGSNLIWAQC-APCTRCFPRPTPAPVLQPARSSTFSRLPCN 149
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L + PR + C Y YG G ++ G L T+ L +G+ V FG
Sbjct: 150 GSFCQYLPTSSRPRTCNATAACAYNYTYGSGYTA-GYLATET--LTVGDGTFPKV--AFG 204
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG-VLFLGD 189
C +++G++GLGRG +S+VSQL G + + G +LF
Sbjct: 205 CSTEN------GVDNSSGIVGLGRGPLSLVSQL-AVGRFSYCLRSDMADGGASPILFGSL 257
Query: 190 GKVPS-SGVAWTPMLQNS---------ADLKHYILGPAELLYSGKSCGLKDLTL----IF 235
K+ S V TP+L+N +L + EL +G + G L I
Sbjct: 258 AKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIV 317
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIG----TPLKLAPDDKTLPICWRGPFKALGQVTEY 291
DSG + Y Y + + TP AP D L +C++ P G
Sbjct: 318 DSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYD--LDLCYK-PSAGGGGKAVR 374
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLV-----ISGRKNV-CLGILNGSEAEVGENNIIGEI 345
LAL F + VP + Y GR V CL +L ++ +IIG +
Sbjct: 375 VPRLALRFA---GGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL--PISIIGNL 429
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTL 371
D ++YD + + P DC L
Sbjct: 430 MQMDMHLLYDIDGGMFSFAPADCAKL 455
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 98/387 (25%), Positives = 162/387 (41%), Gaps = 61/387 (15%)
Query: 12 PIFSY---FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI-- 66
PI++Y + + +++G PP DTGSDLTW C PC C K + P K+
Sbjct: 17 PIYAYLGHYLMEVSIGTPPFKIYGIADTGSDLTWTSC-VPCNKCYKQRNPIFDPQKSTSY 75
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+ C + C H + C P C+Y Y + G L + L + G +
Sbjct: 76 RNISCDSKLC---HKLDTGVCS-PQKHCNYTYAYASAAITQGVLAQETITLSSTKGE--S 129
Query: 125 VPL---TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQN 180
VPL FGCG+N N G + + G++GLG G +S +SQ+ +G R C+
Sbjct: 130 VPLKGIVFGCGHN--NTGGFNDRE-MGIIGLGGGPVSFISQIGSSFGGKR--FSQCLVPF 184
Query: 181 GRGV-----LFLGDG-KVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSC-G 227
V + LG G +V GV TP++ +++ +G L ++G S
Sbjct: 185 HTDVSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQS 244
Query: 228 LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTP----LKLAPDDKTLPICWRGPFK 283
++ + DSG +++Y +V+ + ++ P L L P +C+R
Sbjct: 245 VEKGNVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQ-----LCYRTKNN 299
Query: 284 ALGQV-TEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNII 342
G V T +F+ V+L+ P V CLG N S + +
Sbjct: 300 LRGPVLTAHFE---------GGDVKLL--PTQTFVSPKDGVFCLGFTNTSS----DGGVY 344
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDCN 369
G + ++ +D ++Q + +KP DC
Sbjct: 345 GNFAQSNYLIGFDLDRQVVSFKPMDCT 371
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 94/365 (25%), Positives = 150/365 (41%), Gaps = 35/365 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ V + +G P K F FDTGSD+TW QC+ C K E + P +KNI CS+
Sbjct: 131 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNI-SCSS 189
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C + + C Y+++YGDG SIG T+ L SN VF L FGC
Sbjct: 190 ALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN--VFKNFL-FGC 246
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
G Q+N G+ R ++++ SQ + + + +C+ + +G L LG
Sbjct: 247 G-QQNNGLFGGAAGLLGLG---RTKLALPSQTAK--TYKKLFSYCLPASSSSKGYLSLG- 299
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----IFDSGASYAYFT 245
G+V S V +TP+ + Y L L G+ + + + DSG +
Sbjct: 300 GQVSKS-VKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLS 358
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
Y E+ S + P C+ F V + ++F +
Sbjct: 359 PTAYSELSSAFQNLMTDYP--STSGYSIFDTCY--DFSKYDTVR--IPKVGVTF---KGG 409
Query: 306 VRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
V + + L ++G K VCL + + +I G + + V+YD K R+G+
Sbjct: 410 VEMDIDVSGILYPVNGLKKVCLAFAGNDDDS--DTSIFGNVQQRTYQVVYDGAKGRVGFA 467
Query: 365 PEDCN 369
P C+
Sbjct: 468 PGGCS 472
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 156/357 (43%), Gaps = 34/357 (9%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALH 78
+G PP + DTGSDLTW QC PC C + + P K+ VPC+ C H
Sbjct: 86 IGTPPVDYLGIADTGSDLTWAQC-LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTC---H 141
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNP 138
+ C CDY YGD S G DL + + GS +V GCG+
Sbjct: 142 AVDDGHCG-VQGVCDYSYTYGDRTYSKG----DLGFEKITIGSS-SVKSVIGCGHASSGG 195
Query: 139 GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNGRGVLFLGDGKVP 193
+ +GV+GLG G++S+VSQ+ + I +C+ NG+ + F + V
Sbjct: 196 FGFA----SGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGK-INFGQNAVVS 250
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-KDLTLIFDSGASYAYFTSRVYQEI 252
GV TP++ + +YI A + + + K +I DSG + ++ +Y +
Sbjct: 251 GPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQGNVIIDSGTTLSFLPKELYDGV 310
Query: 253 VSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPP 312
VS +++ + +K +C+ + T P+ + + +V L +P
Sbjct: 311 VSSLLKVVKAKRVK--DPGNFWDLCFD---DGINVATSSGIPIITAQFSGGANVNL-LPV 364
Query: 313 EAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+ ++ N CL + S + E IIG + + + ++ YD E +R+ +KP C
Sbjct: 365 NTFQKVANNVN-CLTLTPASPTD--EFGIIGNLALANFLIGYDLEAKRLSFKPTVCT 418
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 92/377 (24%), Positives = 155/377 (41%), Gaps = 36/377 (9%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPC 69
F + ++ +G P + DTGS+LTW+QC PC C + Y ++ V C
Sbjct: 97 FGEYYTSIKLGSPGQEAILIVDTGSELTWLQC-LPCKVCAPSVDTIYDAARSASYRPVTC 155
Query: 70 SNPR-CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS--VFNVP 126
+N + C+ C QC + YGDG S G+L TD + G V
Sbjct: 156 NNSQLCSNSSQGTYAYCAR-GSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQD 214
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQ-----N 180
FGC L P +G+LGL G++++ QL + +G HC N
Sbjct: 215 FAFGCAQGDLE---LVPTGASGILGLNAGKMALPMQLGQRFGW---KFSHCFPDRSSHLN 268
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL----KDLTLIFD 236
GV+F G+ ++P V +T + +++L+ A S S L + +I D
Sbjct: 269 STGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVFLPRGSVVILD 328
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD-KTLPICWRGPFKALGQVTEYFKPL 295
SG+S++ F + ++ ++ + L D L C++ + ++ L
Sbjct: 329 SGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSL 388
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGR----KNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
+L F + V + +P L+ R +C +G V N+IG Q+
Sbjct: 389 SLVF---EDGVTIGIPSIGVLLPVARFQNHVKMCFAFEDGGPNPV---NVIGNYQQQNLW 442
Query: 352 VIYDNEKQRIGWKPEDC 368
V YD ++ R+G+ C
Sbjct: 443 VEYDIQRSRVGFARASC 459
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 99/351 (28%), Positives = 144/351 (41%), Gaps = 39/351 (11%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWP--NPPRCKHP 88
DTGSDL+WVQC PC C + + P + V CS+P C +L N C
Sbjct: 151 DTGSDLSWVQCQ-PCKRCYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATGNLGVCGSN 209
Query: 89 NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAG 148
C+Y + YGDG + G L T+ L N + N FGCG N N G +G
Sbjct: 210 PPSCNYVVNYGDGSYTRGELGTE--HLDLGNSTAVN-NFIFGCGRN--NQGLFG--GASG 262
Query: 149 VLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDGKV---PSSGVAWTPM 202
++GLGR +S++SQ + V +C+ G L +G ++ +++T M
Sbjct: 263 LVGLGRSSLSLISQTS--AMFGGVFSYCLPITETEASGSLVMGGNSSVYKNTTPISYTRM 320
Query: 203 LQNSADLKHYILGPAELLYSGKSCGL----KDLTLIFDSGASYAYFTSRVYQEIVSLIMR 258
+ N L Y L + + KD +I DSG +YQ + ++
Sbjct: 321 IPN-PQLPFYFLNLTGITVGSVAVQAPSFGKDGMMI-DSGTVITRLPPSIYQALKDEFVK 378
Query: 259 DLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVI 318
G P AP L C+ L E P + + V Y V
Sbjct: 379 QFSGFP--SAPAFMILDTCFN-----LSGYQEVEIPNIKMHFEGNAELNVDVTGVFYFVK 431
Query: 319 SGRKNVCLGILNGS-EAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+ VCL I + S E EVG IIG +++ VIYD + +G+ E C
Sbjct: 432 TDASQVCLAIASLSYENEVG---IIGNYQQKNQRVIYDTKGSMLGFAAEAC 479
>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 153/365 (41%), Gaps = 42/365 (11%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----VPCSN 71
VG P F DTGSDL WV CD AP G + ++ YKP ++ +PCS+
Sbjct: 149 VGTPNTSFMVALDTGSDLFWVPCDCIECAPLAGYRETLDRDLGIYKPAESTTSRHLPCSH 208
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPL--RFSNGSVFNVPLT 128
C P C P C Y +Y + +S G L+ D+ L R S+ V +
Sbjct: 209 ELC-----PPGSGCSSPKQPCPYSTDYLQENTTSSGLLIEDILHLDSRESHAPV-KASVV 262
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG 188
GCG Q L G+LGLG IS+ S L GL+RN C ++ G +F G
Sbjct: 263 IGCGRKQSG-SYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKEDS-GRIFFG 320
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRV 248
D V S TP + + Y + + K + DSG S+ V
Sbjct: 321 DQGV--SIQQSTPFVPLYGKYQTYAVNVDKSCVGHKCFEATSFEALVDSGTSFTALPLNV 378
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG-PFKALGQVTEYFKPLALSFTNRRNSVR 307
Y+ V++ + P ++ +D + C+ P K T + L+F + S +
Sbjct: 379 YKA-VAVEFDKQVHAP-RITQEDASFEYCYSASPLKMPDVPT-----VTLTFAANK-SFQ 430
Query: 308 LVVPPEAYLVISGRKNV---CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
V P ++ G +V CL L S +G IIG+ F+ +++D E ++GW
Sbjct: 431 AVNP--TIVLKDGEGSVAGFCLA-LQKSPEPIG---IIGQNFLTGYHIVFDKENMKLGWY 484
Query: 365 PEDCN 369
+C+
Sbjct: 485 RSECH 489
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 108/379 (28%), Positives = 163/379 (43%), Gaps = 46/379 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V+L VG PP+ F DTGSDL W+QC APC C + + P ++ V C +P
Sbjct: 152 YLVDLYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASLSYRNVTCGDP 210
Query: 73 RCAALHWPNPPR-CKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNV-PLT 128
RC + P PR C+ P+ D C Y YGD ++ G L + F + + G+ V +
Sbjct: 211 RCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVV 270
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRGV--- 184
FGCG++ N G AG+LGLGRG +S SQLR YG + +C+ +G V
Sbjct: 271 FGCGHS--NRGLFH--GAAGLLGLGRGALSFASQLRAVYG---HAFSYCLVDHGSSVGSK 323
Query: 185 LFLGDGKV----PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-------- 232
+ GD P +A Y + +L G+ + T
Sbjct: 324 IVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGS 383
Query: 233 --LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG + +YF Y E++ + + L D L C+ +V E
Sbjct: 384 GGTIIDSGTTLSYFAEPAY-EVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPE 442
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
+ +L F + P E Y V + +CL +L + + +IIG Q+
Sbjct: 443 F----SLLFA---DGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAM---SIIGNFQQQN 492
Query: 350 KMVIYDNEKQRIGWKPEDC 368
V+YD + R+G+ P C
Sbjct: 493 FHVLYDLQNNRLGFAPRRC 511
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 108/379 (28%), Positives = 163/379 (43%), Gaps = 46/379 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V+L VG PP+ F DTGSDL W+QC APC C + + P ++ V C +P
Sbjct: 152 YLVDLYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPATSLSYRNVTCGDP 210
Query: 73 RCAALHWPNPPR-CKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNV-PLT 128
RC + P PR C+ P+ D C Y YGD ++ G L + F + + G+ V +
Sbjct: 211 RCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVV 270
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRGV--- 184
FGCG++ N G AG+LGLGRG +S SQLR YG + +C+ +G V
Sbjct: 271 FGCGHS--NRGLFH--GAAGLLGLGRGALSFASQLRAVYG---HAFSYCLVDHGSSVGSK 323
Query: 185 LFLGDGKV----PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-------- 232
+ GD P +A Y + +L G+ + T
Sbjct: 324 IVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGS 383
Query: 233 --LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG + +YF Y E++ + + L D L C+ +V E
Sbjct: 384 GGTIIDSGTTLSYFAEPAY-EVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPE 442
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
+ +L F + P E Y V + +CL +L + + +IIG Q+
Sbjct: 443 F----SLLFA---DGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAM---SIIGNFQQQN 492
Query: 350 KMVIYDNEKQRIGWKPEDC 368
V+YD + R+G+ P C
Sbjct: 493 FHVLYDLQNNRLGFAPRRC 511
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 96/366 (26%), Positives = 140/366 (38%), Gaps = 36/366 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P FDTGSDLTW QC C E + P K+ V CS+
Sbjct: 132 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSA 191
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C +L N C N C Y I+YGD S+G L + F L +N VF+ + FG
Sbjct: 192 ACGSLSSATGNAGSCSASN--CIYGIQYGDQSFSVGFLAKEKFTL--TNSDVFD-GVYFG 246
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR--GVLFLG 188
CG N N G + AG+LGLGR ++S SQ + +C+ + G L G
Sbjct: 247 CGEN--NQGLFT--GVAGLLGLGRDKLSFPSQTAT--AYNKIFSYCLPSSASYTGHLTFG 300
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAY 243
+ S V +TP+ + Y L + G+ + + DSG
Sbjct: 301 SAGISRS-VKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITR 359
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
+ Y + S + P L C F G T +A SF+
Sbjct: 360 LPPKAYAALRSSFKAKMSKYPTTSGV--SILDTC----FDLSGFKTVTIPKVAFSFS--- 410
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
+ + + + VCL S+ I G + Q V+YD R+G+
Sbjct: 411 GGAVVELGSKGIFYVFKISQVCLAFAGNSDDS--NAAIFGNVQQQTLEVVYDGAGGRVGF 468
Query: 364 KPEDCN 369
P C+
Sbjct: 469 APNGCS 474
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 102/215 (47%), Gaps = 17/215 (7%)
Query: 34 FDTGSDLTWVQCDAPCTG--CTKPPEKQYKPHKN----IVPCSNPRCAALHWPNPPRCKH 87
DT SD+ WVQC APC C + Y P K+ PCS+P C L P C
Sbjct: 160 IDTASDVPWVQC-APCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNL-GPYANGCTP 217
Query: 88 PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTA 147
DQC Y ++Y DG +S G ++D+ L + + FGC + PG S T+
Sbjct: 218 AGDQCQYRVQYPDGSASAGTYISDVLTLNPAKPASAISEFRFGCSHALLQPGSFS-NKTS 276
Query: 148 GVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQN--GRGVLFLGDGKVPSSGVAWTPMLQ 204
G++ LGRG S+ +Q + YG +V +C+ G LG +V +S A TPML+
Sbjct: 277 GIMALGRGAQSLPTQTKATYG---DVFSYCLPPTPVHSGFFILGVPRVAASRYAVTPMLR 333
Query: 205 NSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
+ A Y++ + +GK L +F +GA
Sbjct: 334 SKAAPMLYLVRLIAIEVAGKR--LPVPPAVFAAGA 366
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 150/373 (40%), Gaps = 46/373 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L+VG PP DTGSD+ W QC PCT C + + P K+ V CS+P
Sbjct: 85 YLMKLSVGTPPFPIIAVADTGSDIIWTQC-VPCTNCYQQDLPMFNPSKSTTYRKVSCSSP 143
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
C+ N C D C Y I YGD S G D + ++G V P T GC
Sbjct: 144 VCSFTGEDN--SCSFKPD-CTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGC 200
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRG---VL 185
G++ N G + +G++GLG G S++ Q+ + +C IG + G +
Sbjct: 201 GHD--NAGSFD-ANVSGIVGLGLGPASLIKQMGS--AVGGKFSYCLTPIGNDDGGSNKLN 255
Query: 186 FLGDGKVPSSGVAWTPMLQN-------SADLKHYILGPAELLYSGKSCGL-KDLTLIFDS 237
F + V SG TP+ + S LK +G YS + L +I DS
Sbjct: 256 FGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIIIDS 315
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTLPICWRGPFKALGQVTEYFKP-L 295
G + +Y I + L+ D ++ L C+ +Y P +
Sbjct: 316 GTTLTLLPVDLYHNFAKAISNSI---NLQRTDDPNQFLEYCFE------TTTDDYKVPFI 366
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
A+ F L + E L+ +CL + ++ +I G I + +V YD
Sbjct: 367 AMHF----EGANLRLQRENVLIRVSDNVICLAFAGAQDNDI---SIYGNIAQINFLVGYD 419
Query: 356 NEKQRIGWKPEDC 368
+ +KP +C
Sbjct: 420 VTNMSLSFKPMNC 432
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 100/371 (26%), Positives = 144/371 (38%), Gaps = 42/371 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF V + VG PP D+GSD+ W+QC PC C + + + P + VPC +
Sbjct: 133 YF-VRVGVGSPPTEQYLVVDSGSDVIWIQCR-PCAECYQQADPLFDPAASASFTAVPCDS 190
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L P + C Y++ YGDG + G L + L F + + + GC
Sbjct: 191 GVCRTL--PGGSSGCADSGACRYQVSYGDGSYTQGVLAMET--LTFGDSTPVQ-GVAIGC 245
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN----GRGVLFL 187
G+ N G AG+LGLG G +S+V QL +C+ G G L
Sbjct: 246 GH--RNRGLFV--GAAGLLGLGWGPMSLVGQLGG--AAGGAFSYCLASRGADAGAGSLVF 299
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC----GLKDLT------LIFDS 237
G G W P+L+N+ Y +G L G+ GL DLT ++ D+
Sbjct: 300 GRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMDT 359
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
G + Y + IG L AP L C + G + +AL
Sbjct: 360 GTAVTRLPPDAYAALRDAFA-STIGGDLPRAPGVSLLDTC----YDLSGYASVRVPTVAL 414
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNE 357
F R+ L +P LV G CL A +I+G I Q + D+
Sbjct: 415 YFG--RDGAALTLPARNLLVEMGGGVYCLAF----AASASGLSILGNIQQQGIQITVDSA 468
Query: 358 KQRIGWKPEDC 368
+G+ P C
Sbjct: 469 NGYVGFGPSTC 479
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 87/326 (26%), Positives = 132/326 (40%), Gaps = 45/326 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK---------PPEKQYKPHKNI 66
Y+A + +G P K + DTGSD+ WV C C C + P + + +
Sbjct: 87 YYA-KIGIGTPSKDYYVQVDTGSDIVWVNC-IQCRECPRTSSLGMELTPYDLEESTTGKL 144
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
V C C ++ C N C Y YGDG S+ G V D +G +
Sbjct: 145 VSCDEQFCLEVNGGPLSGCT-TNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTA 203
Query: 123 FNVPLTFGCGYNQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
N + FGCG Q + G G+LG G+ SI+SQL ++ + HC+ G N
Sbjct: 204 ANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTN 263
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNS---------ADLKHYILG-PAELLYSGKSCGLKD 230
G G+ +G P V TP++ N + H IL A++ +G G
Sbjct: 264 GGGIFAMGHVVQPK--VNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRKG--- 318
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG + AY +Y+ +V+ I+ ++ + F+ +V +
Sbjct: 319 --TIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYKC-------FQYSERVDD 369
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYL 316
F P+ F NS+ L V P YL
Sbjct: 370 GFPPVIFHF---ENSLLLKVYPHEYL 392
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 151/382 (39%), Gaps = 36/382 (9%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP-----EKQYKPHKNIV 67
+ + + ++++VG PP+ DTGSDL W QC APC C + + +
Sbjct: 86 VTNEYLMHVSVGTPPRPVALTLDTGSDLVWTQC-APCLDCFEQGAAPVLDPAASSTHAAL 144
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN--GSVFNV 125
PC P C AL + + + C Y YGD ++G L TD F + G +
Sbjct: 145 PCDAPLCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAAR 204
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
+TFGCG+ N G +T G+ G GRGR S+ SQL V+
Sbjct: 205 RVTFGCGHI--NKGIFQANET-GIAGFGRGRWSLPSQLNVTSF-SYCFTSMFDTKSSSVV 260
Query: 186 FLGDGKVP---------SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--- 233
LG + V T +++N + Y + + G + + L
Sbjct: 261 TLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRLRSS 320
Query: 234 -IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
I DSGAS VY+ + + + +G P A L +C+ P AL +
Sbjct: 321 TIIDSGASITTLPEDVYEAVKAEFVSQ-VGLPAAAA-GSAALDLCFALPVAAL-----WR 373
Query: 293 KPLALSFT-NRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
+P + T + +P Y+ V +L +A GE +IG Q+
Sbjct: 374 RPAVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVL---DAAAGEQVVIGNYQQQNTH 430
Query: 352 VIYDNEKQRIGWKPEDCNTLLS 373
V+YD E + + P C+ L +
Sbjct: 431 VVYDLENDVLSFAPARCDKLAA 452
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 102/389 (26%), Positives = 165/389 (42%), Gaps = 68/389 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT-GCTKPPEKQYKPHKNI----VPCSN 71
F + L +G PP F DTGSDL W QC APC+ C + P Y P + +PC++
Sbjct: 85 FLMTLAIGTPPLPFLAIADTGSDLIWTQC-APCSRQCFQQPTPLYNPSSSTTFSALPCNS 143
Query: 72 P--RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVP-L 127
CA P C C Y + YG G + + T+ F S VP +
Sbjct: 144 SLGLCA-------PACA-----CMYNMTYGSGWTYVFQ-GTETFTFGSSTPADQVRVPGI 190
Query: 128 TFGC-----GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG---- 178
FGC G+N + +G++GLGRG +S+VSQL +C+
Sbjct: 191 AFGCSNASSGFNASS--------ASGLVGLGRGSLSLVSQLGAPKF-----SYCLTPYQD 237
Query: 179 QNGRGVLFLG-DGKVPSSG-VAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLK- 229
N L LG + +G V+ TP + + + + +Y+ LG L + LK
Sbjct: 238 TNSTSTLLLGPSASLNDTGVVSSTPFVASPSSIYYYLNLTGISLGTTALPIPPNAFSLKA 297
Query: 230 DLT--LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQ 287
D T LI DSG + + YQ++ + ++ L+ P L +C+ P
Sbjct: 298 DGTGGLIIDSGTTITMLGNTAYQQVRAAVL-SLVTLPTTDGSAATGLDLCFELP------ 350
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-----CLGILNGSEAEVGENNII 342
+ P S T + +V+P + Y++ + CL + N ++ + +I+
Sbjct: 351 SSTSAPPSMPSMTLHFDGADMVLPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSIL 410
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
G Q+ ++YD K+ + + P C+TL
Sbjct: 411 GNYQQQNMHILYDVGKETLSFAPAKCSTL 439
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/152 (36%), Positives = 80/152 (52%), Gaps = 15/152 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V+L +G PP + DTGSDL W QC APC C P + K+ +PC +
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCADQPTPYFDVKKSATYRALPCRSS 147
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFGC 131
RCA+L + P C C Y+ YGD S+ G L + F +N + V + FGC
Sbjct: 148 RCASL---SSPSCFK--KMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGC 202
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
G N G L+ +++G++G GRG +S+VSQL
Sbjct: 203 G--SLNAGDLA--NSSGMVGFGRGPLSLVSQL 230
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 95/366 (25%), Positives = 144/366 (39%), Gaps = 41/366 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ L +G P + DTGS LTW+QC C + + P + V CS
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYASVRCSAS 193
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+C L NP C N C Y+ YGD S+G+L TD S GS +G
Sbjct: 194 QCDELQAATLNPSACSASN-VCIYQASYGDSSFSVGSLSTD----TVSFGSTRYPSFYYG 248
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGD 189
CG Q N G +AG++GL R ++S++ QL + +C+ G L +G
Sbjct: 249 CG--QDNEGLFG--RSAGLIGLARNKLSLLYQLAPS--LGYSFSYCLPTAASTGYLSIGP 302
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYAYF 244
++TPM +S D Y + + + G + L I DSG
Sbjct: 303 YNT-GHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRL 361
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP-LALSFTNRR 303
+ V+ + + + + G + AP L C+ GQ ++ P +A++F
Sbjct: 362 PTAVHTALSKAVAQAMAGA--QRAPAFSILDTCFE------GQASQLRVPTVAMAFAGGA 413
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
S++L L+ CL A IIG Q VIYD + RIG+
Sbjct: 414 -SMKLTT--RNVLIDVDDSTTCLAF-----APTDSTAIIGNTQQQTFSVIYDVAQSRIGF 465
Query: 364 KPEDCN 369
C+
Sbjct: 466 SAGGCS 471
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 105/383 (27%), Positives = 153/383 (39%), Gaps = 51/383 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF +++ VG PPK F DTGSDL W+QC PC C + Y P + + C +
Sbjct: 197 YF-MDVFVGTPPKHFSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKDSSSFRNISCHD 254
Query: 72 PRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS--NGS-----VF 123
PRC + P+PP+ CK N C Y YGDG ++ G + F + + NG+ V
Sbjct: 255 PRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKHVE 314
Query: 124 NVPLTFGCGYNQH---NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RN----VIGH 175
NV FGCG+ + G L S+ Q Y L+ RN V
Sbjct: 315 NV--MFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSK 372
Query: 176 CIGQNGRGVL--------FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLY-SGKSC 226
I + +L G GK S + +++ + P E + S +
Sbjct: 373 LIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWHLSSEGA 432
Query: 227 GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
G I DSG + YF Y+ I +R + G L + LP P K
Sbjct: 433 G----GTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLV-----EGLP-----PLKPCY 478
Query: 287 QVTEYFKPLALSF-TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEI 345
V+ K F + P E Y + + VCL IL + + +IIG
Sbjct: 479 NVSGIEKMELPDFGILFADEAVWNFPVENYFIWIDPEVVCLAILGNPRSAL---SIIGNY 535
Query: 346 FMQDKMVIYDNEKQRIGWKPEDC 368
Q+ ++YD +K R+G+ P C
Sbjct: 536 QQQNFHILYDMKKSRLGYAPMKC 558
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 99/389 (25%), Positives = 152/389 (39%), Gaps = 52/389 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA--------PCTGCTKPPE--KQYKPHKNI 66
+ V++ G PP+ DTGSDL W+QC P C++ P ++
Sbjct: 54 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSV 113
Query: 67 VPCSNPRCAALHWP--NPPRCKHPND-QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
VPCS +C + P + P C C Y +Y DG S+ G L D + SNG+
Sbjct: 114 VPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTATI--SNGTSG 171
Query: 124 NVP---LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--- 177
+ FGCG ++ G S T GV+GLG+G++S +Q L +C+
Sbjct: 172 GAAVRGVAFGCG-TRNQGGSFS--GTGGVIGLGQGQLSFPAQ--SGSLFAQTFSYCLLDL 226
Query: 178 --GQNGRGVLFLGDGKVP-SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCG------- 227
G+ GR FL G+ + A+TP++ N Y +G + +
Sbjct: 227 EGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWA 286
Query: 228 ---LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT----LPICWR- 279
L + + DSG++ Y Y +VS + L P T L +C+
Sbjct: 287 IDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASV---HLPRIPSSATFFQGLELCYNV 343
Query: 280 GPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGEN 339
+L F L + F + L +P YLV CL I
Sbjct: 344 SSSSSLAPANGGFPRLTIDFA---QGLSLELPTGNYLVDVADDVKCLAIR--PTLSPFAF 398
Query: 340 NIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
N++G + Q V +D RIG+ +C
Sbjct: 399 NVLGNLMQQGYHVEFDRASARIGFARTEC 427
>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 531
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/375 (24%), Positives = 156/375 (41%), Gaps = 39/375 (10%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP----------PEKQYKPHK 64
S + N++VG PP F DTGSDL W+ C+ T C + P Y P+
Sbjct: 100 SLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTT-CIRDLEDIGVPQSVPLNLYTPNA 158
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+ S+ RC+ +C P C Y+I Y + + G L+ D+ L + ++
Sbjct: 159 STT-SSSIRCSDKRCFGSKKCSSPKSICPYQISYSNSTGTTGTLLQDVLHLATEDENLTP 217
Query: 125 VP--LTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
V +T GCG Q G ++ GVLGLG S+ S L + + + C G+
Sbjct: 218 VKTNVTLGCG--QKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITADSFSMCFGRVI 275
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASY 241
V + G + TP + + A Y L + G G + L FD+G+S+
Sbjct: 276 GNVGRISFGDKGYTDQEETPFI-SVAPSTAYGLNVTGVSVGGDPVGTR-LFAKFDTGSSF 333
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
+ Y +++ DL+ +DK P+ PF+ ++ + F
Sbjct: 334 THLMEPAYG-VLTKSFDDLV--------EDKRRPVDPELPFEFCYDLSPNATSIEFPFVE 384
Query: 302 RR--NSVRLVVPPEAYLVIS----GRKNV--CLGILNGSEAEVGENNIIGEIFMQDKMVI 353
++++ + + G NV CLG+L ++ N+IG+ F+ ++
Sbjct: 385 MTFVGGSKIILNNPFFTARTQARHGEGNVMYCLGVLKSVGLKI---NVIGQNFVAGYRIV 441
Query: 354 YDNEKQRIGWKPEDC 368
+D E+ +GWKP C
Sbjct: 442 FDRERMILGWKPSLC 456
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 164/380 (43%), Gaps = 44/380 (11%)
Query: 6 IEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---EKQYKP 62
+E P + ++++VG P K F DTGSDL WVQ + PCTGC+ +Q
Sbjct: 44 VESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDLVWVQSE-PCTGCSGGTIFDPRQSST 102
Query: 63 HKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL-RFSNGS 121
+ + CS+ CA L P C+ + C Y EYG G + G D L S+GS
Sbjct: 103 FREM-DCSSQLCAEL----PGSCEPGSSTCSYSYEYGS-GETEGEFARDTISLGTTSDGS 156
Query: 122 VFNVPLTFGCGY-NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--- 177
GCG N G G++GLG+G +S+ SQL I + +C+
Sbjct: 157 QKFPSFAVGCGMVNSGFDG------VDGLVGLGQGPVSLTSQLS--AAIDSKFSYCLVDI 208
Query: 178 -GQNGRGVLFLG-DGKVPSSGVAWTPMLQNSADL-KHYILGPAELLYSGKSCGLKDLTLI 234
Q+ L G + +G+ T + S +Y+L + +G++ G T+I
Sbjct: 209 NSQSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPGTTII 268
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG + Y S VY ++S M ++ P ++ L +C+ +K
Sbjct: 269 -DSGTTLTYVPSGVYGRVLSR-MESMVTLP-RVDGSSMGLDLCYD------RSSNRNYKF 319
Query: 295 LALSFTNRRNSVRLVVPPEA--YLVISGRKN-VCLGILNGSEAEVGENNIIGEIFMQDKM 351
AL+ R + + PP + +LV+ + VCL + + S V +IIG + Q
Sbjct: 320 PALTI---RLAGATMTPPSSNYFLVVDDSGDTVCLAMGSASGLPV---SIIGNVMQQGYH 373
Query: 352 VIYDNEKQRIGWKPEDCNTL 371
++YD + + C +L
Sbjct: 374 ILYDRGSSELSFVQAKCESL 393
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 96/380 (25%), Positives = 150/380 (39%), Gaps = 52/380 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQC-DAPCTGCTKPP--EKQYKPHKNIVPCSNPR 73
V L VG PP+ DTGS+L+W+ C +P G P Y P VPCS+P
Sbjct: 65 LTVTLAVGDPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSP----VPCSSPI 120
Query: 74 C--AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C P P C C I Y D S G L + F + GSV FGC
Sbjct: 121 CRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVI----GSVTRPGTLFGC 176
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDG 190
+ + + G++G+ RG +S V+QL G + +CI G + G L LGD
Sbjct: 177 MDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQL---GFSK--FSYCISGSDSSGFLLLGDA 231
Query: 191 KVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------------I 234
G + +TP++ S L ++ + G G K L+L +
Sbjct: 232 SYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTM 291
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-----DKTLPICW------RGPFK 283
DSG + + VY + + + + L+L D T+ +C+ R F
Sbjct: 292 VDSGTQFTFLMGPVYTALKNEFITQ-TKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFS 350
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIG 343
L V+ F+ +S + ++ R+ G++ V S+ E +IG
Sbjct: 351 GLPMVSLMFRGAEMSVSGQKLLYRVNGAGS-----EGKEEVYCFTFGNSDLLGIEAFVIG 405
Query: 344 EIFMQDKMVIYDNEKQRIGW 363
Q+ + +D K R+G+
Sbjct: 406 HHHQQNVWMEFDLAKSRVGF 425
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 97/366 (26%), Positives = 139/366 (37%), Gaps = 36/366 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P FDTGSDLTW QC C E + P K+ V CS+
Sbjct: 133 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSA 192
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C +L N C N C Y I+YGD S+G L D F L S+ VF+ + FG
Sbjct: 193 ACGSLSSATGNAGSCSASN--CIYGIQYGDQSFSVGFLAKDKFTLTSSD--VFD-GVYFG 247
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR--GVLFLG 188
CG N N G + AG+LGLGR ++S SQ + +C+ + G L G
Sbjct: 248 CGEN--NQGLFT--GVAGLLGLGRDKLSFPSQTAT--AYNKIFSYCLPSSASYTGHLTFG 301
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAY 243
+ S V +TP+ + Y L + G+ + + DSG
Sbjct: 302 SAGISRS-VKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITR 360
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
+ Y + S + P L C F G T +A SF+
Sbjct: 361 LPPKAYAALRSSFKAKMSKYPTTSGV--SILDTC----FDLSGFKTVTIPKVAFSFS--- 411
Query: 304 NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
+ + + VCL S+ I G + Q V+YD R+G+
Sbjct: 412 GGAVVELGSKGIFYAFKISQVCLAFAGNSDDS--NAAIFGNVQQQTLEVVYDGAGGRVGF 469
Query: 364 KPEDCN 369
P C+
Sbjct: 470 APNGCS 475
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/381 (24%), Positives = 150/381 (39%), Gaps = 50/381 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA-PCTGCTKPP--EKQYKPHKNIVPCSNPR 73
V+LTVG PP+ DTGS+L+W+ C P T P Y P PC++
Sbjct: 60 LTVSLTVGSPPQNVTMVLDTGSELSWLHCKKLPNLNSTFNPLLSSSYTP----TPCNSSI 115
Query: 74 CA--ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C P C N C + Y D S+ G L + F L FGC
Sbjct: 116 CTTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSL----AGAAQPGTLFGC 171
Query: 132 GYNQHNPGPLSP-PDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGD 189
+ ++ T G++G+ RG +S+V+Q+ +CI G++ GVL LGD
Sbjct: 172 MDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQMS-----LPKFSYCISGEDALGVLLLGD 226
Query: 190 GKVPSSGVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGLKDLT----LI 234
G S + +TP++ + ++ I +LL KS + D T +
Sbjct: 227 GTDAPSPLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTM 286
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA-PD---DKTLPICWRGP--FKALGQV 288
DSG + + VY + + G ++ P+ + + +C+ P F A+ V
Sbjct: 287 VDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASFAAVPAV 346
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
T F + + R R+ + + + LGI E +IG Q
Sbjct: 347 TLVFSGAEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLGI---------EAYVIGHHHQQ 397
Query: 349 DKMVIYDNEKQRIGWKPEDCN 369
+ + +D K R+G+ C+
Sbjct: 398 NVWMEFDLLKSRVGFTQTTCD 418
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 97/380 (25%), Positives = 151/380 (39%), Gaps = 56/380 (14%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPC-TGCTKPPEKQYKPHKNI----VPCSNPRCA 75
L+VG PP F DTGSDLTW QC APC T C P Y P ++ +PC++P C
Sbjct: 100 LSVGTPPLAFPAIIDTGSDLTWTQC-APCTTACFAQPTPLYDPARSSTFSKLPCASPLCQ 158
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL----RFSNGSVFNVPLTFGC 131
AL P+ R + C Y+ Y G ++ G L D + + S + FGC
Sbjct: 159 AL--PSAFRACNATG-CVYDYRYAVGFTA-GYLAADTLAIGDGDGDGDASSSFAGVAFGC 214
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG----VLFL 187
+ N G + +G++GLGR +S++SQ+ G+ R +C+ + +LF
Sbjct: 215 --STANGGDMD--GASGIVGLGRSALSLLSQI---GVGR--FSYCLRSDADAGASPILFG 265
Query: 188 GDGKVPSSGVAWTPMLQNS-----------ADLKHYILGPAELLYSGKSCGLKDL---TL 233
V V T +L+N +L +G +L + + G +
Sbjct: 266 ALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGV 325
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
I DSG ++ Y Y + + G +++ +C+ G
Sbjct: 326 IVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFEA-----GAADTPVP 380
Query: 294 PLALSFTNRRNSVRLVVPPEAYL--VISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
L F VP ++Y V G + CL +L V IG + D
Sbjct: 381 RLVFRFA---GGAEYAVPRQSYFDAVDEGGRVACLLVLPTRGVSV-----IGNVMQMDLH 432
Query: 352 VIYDNEKQRIGWKPEDCNTL 371
V+YD + + P DC +L
Sbjct: 433 VLYDLDGATFSFAPADCASL 452
>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 516
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 99/379 (26%), Positives = 148/379 (39%), Gaps = 49/379 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--------------KPPEKQ 59
F +FA N++VG PP F DTGSDL W+ CD C C +
Sbjct: 103 FLHFA-NVSVGTPPLWFLVALDTGSDLFWLPCD--CISCVHGGLRTRTGKILKFNTYDLD 159
Query: 60 YKPHKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFS 118
N V C+N + +C C Y+++Y + SS G +V D+ L
Sbjct: 160 KSSTSNEVSCNN----STFCRQRQQCPSAGSTCRYQVDYLSNDTSSRGFVVEDVLHLITD 215
Query: 119 NGSV--FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+ + + FGCG Q L+ G+ GLG IS+ S L GLI N C
Sbjct: 216 DDQTKDADTRIAFGCGQVQTGVF-LNGAAPNGLFGLGMDNISVPSILAREGLISNSFSMC 274
Query: 177 IGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLK-HYILGPAELLYSGKSCGLKDLTLIF 235
G + G + GD P TP N L Y + +++ L+ IF
Sbjct: 275 FGSDSAGRITFGDTGSPDQ--RKTPF--NVRKLHPTYNITITKIIVEDSVADLE-FHAIF 329
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI--CWRGPFKALGQVTEYFK 293
DSG S+ Y Y I + + D +P C+ ++ Q E
Sbjct: 330 DSGTSFTYINDPAYTRIGEMYNSKVKAKRHSSQSPDSNIPFDYCYD---ISISQTIEV-- 384
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKN---VCLGILNGSEAEVGENNIIGEIFMQDK 350
P L+ T + V+ P + +S + +CLGI NIIG+ FM
Sbjct: 385 PF-LNLTMKGGDDYYVMDP--IIQVSSEEEGDLLCLGIQKSDSV-----NIIGQNFMTGY 436
Query: 351 MVIYDNEKQRIGWKPEDCN 369
+++D + +GWK +C+
Sbjct: 437 KIVFDRDNMNLGWKETNCS 455
>gi|66817422|ref|XP_642564.1| hypothetical protein DDB_G0277581 [Dictyostelium discoideum AX4]
gi|60470632|gb|EAL68608.1| hypothetical protein DDB_G0277581 [Dictyostelium discoideum AX4]
Length = 492
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/393 (23%), Positives = 157/393 (39%), Gaps = 57/393 (14%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKP----HKNIVP 68
+++ +N+ V + F DTGS LT + P GC + + Y P ++P
Sbjct: 94 NFYQINVNVLIGQQKFILQVDTGSTLTAI----PLKGCNSCKDNRPVYDPALSSSSQLIP 149
Query: 69 CSNPRCAALHWPNPPRCKHPNDQ--CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
CS+ +C +P H N + CD+ I YGDG G + +D +V V
Sbjct: 150 CSSDKCLGSGSASPSCKLHQNAKSTCDFIILYGDGSKIKGKVFSDEI-------TVSGVS 202
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRIS-------IVSQLREYGLIRNVIGHCIGQ 179
T G N G P G++GLGR + S +R I+N+ G +
Sbjct: 203 STIYFGANVEEVGAFEYPRADGIMGLGRTSNNKNLVPTIFDSMVRSNSSIKNIFGIYLDY 262
Query: 180 NGRGVLFLG--DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL-TLIFD 236
+G+G L LG + + +TP +Q + Y + P S + +I D
Sbjct: 263 HGQGYLSLGKINHHYYIGSIQYTP-IQPAGPF--YAIKPTSFRVDNTSFPANSMGQVIVD 319
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA-PDDKTLPICWRGPFKALGQVTEYFKPL 295
SG S TSRVY ++ + + + P + +C+ + E F
Sbjct: 320 SGTSDLILTSRVYDHLIQYFRKHYCHIDMVCSYPSIFSSRVCF--------EKEEDFATF 371
Query: 296 ALSFTNRRNSVRLVVPPEAYLVIS-----GRKNVCLGILNGSEAEVGENNIIGEIFMQDK 350
VR+ +PP+ Y++ + G C GI G + I+G++FM+
Sbjct: 372 PWLHFGFEGGVRIAIPPKNYMIKTESNQQGVYGYCWGIDRGDDMT-----ILGDVFMRGY 426
Query: 351 MVIYDNEKQRIGW------KPEDCNTLLSLNHF 377
I+DN + R+G+ K + + +N F
Sbjct: 427 YTIFDNIENRVGFAIGKNSKNSNVGDITDINQF 459
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 99/356 (27%), Positives = 148/356 (41%), Gaps = 43/356 (12%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWPNPPRCKHPND 90
DT S+LTWVQC+ PC C E + P + VPC++ C AL + +D
Sbjct: 129 DTASELTWVQCE-PCDACHDQQEPLFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACDD 187
Query: 91 Q---CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTA 147
Q C Y + Y DG S G L D L + F FGCG + N GP T+
Sbjct: 188 QPAACSYTLSYRDGSYSRGVLAHDRLSLAGEDIQGF----VFGCGTS--NQGPFG--GTS 239
Query: 148 GVLGLGRGRISIVSQ-LREYGLIRNVIGHCI---GQNGRGVLFLGDGKV---PSSGVAWT 200
G++GLGR ++S++SQ + ++G V +C+ G L LGD S+ + +T
Sbjct: 240 GLMGLGRSQLSLISQTMDQFG---GVFSYCLPPKESGSSGSLVLGDDASVYRNSTPIVYT 296
Query: 201 PMLQNS-------ADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
M+ + A+L +G ++ G S G ++ DSG VY +
Sbjct: 297 AMVSDPLQGPFYLANLTGITVGGEDVQSPGFSAGGGGKAIV-DSGTIITSLVPSVYAAVR 355
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ + L P + AP L C F G L L F + V +
Sbjct: 356 AEFVSQLAEYP-QAAP-FSILDTC----FDLTGLREVQVPSLKLVF-DGGAEVEVDSKGV 408
Query: 314 AYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
Y+V VCL + S + IIG ++ VI+D +IG+ E C+
Sbjct: 409 LYVVTGDASQVCLAL--ASLKSEYDTPIIGNYQQKNLRVIFDTVGSQIGFAQETCD 462
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 149/361 (41%), Gaps = 45/361 (12%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRC-AALHWPN--PPRCK- 86
DTGSDLTWVQC PC+ C + + P + VPC+ C A+L P C
Sbjct: 182 DTGSDLTWVQCK-PCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCAT 240
Query: 87 -------HPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPG 139
+++C Y + YGDG S G L TD L G FGCG + N G
Sbjct: 241 VGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL----GGASVDGFVFGCGLS--NRG 294
Query: 140 PLSPPDTAGVLGLGRGRISIVSQL--REYGLIRNVIGHCIGQNGRGVLFLGDGKVP---S 194
TAG++GLGR +S+VSQ R G+ + + G L LG +
Sbjct: 295 LFG--GTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSYRNA 352
Query: 195 SGVAWTPMLQNSADLKHYILG---PAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQE 251
+ V++T M+ + A Y + + + + GL ++ DSG VY+
Sbjct: 353 TPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRLAPSVYRA 412
Query: 252 IVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
+ + R AP L C+ L E PL T R +
Sbjct: 413 VRAEFARQFGAERYPAAPPFSLLDACYN-----LTGHDEVKVPL---LTLRLEGGADMTV 464
Query: 312 PEAYLVISGRKN---VCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
A ++ RK+ VCL + + S + + IIG ++K V+YD R+G+ EDC
Sbjct: 465 DAAGMLFMARKDGSQVCLAMASLSFED--QTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 522
Query: 369 N 369
+
Sbjct: 523 S 523
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 94/380 (24%), Positives = 151/380 (39%), Gaps = 52/380 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQC-DAPCTGCTKPP--EKQYKPHKNIVPCSNPR 73
V L VG PP+ DTGS+L+W+ C +P G P Y P VPCS+P
Sbjct: 61 LTVTLAVGSPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSP----VPCSSPI 116
Query: 74 C--AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C P P C C I Y D S G L D F + GSV FGC
Sbjct: 117 CRTRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVI----GSVTRPGTLFGC 172
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDG 190
+ + + G++G+ RG +S V+QL G + +CI G + G+L LGD
Sbjct: 173 MDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQL---GFSK--FSYCISGSDSSGILLLGDA 227
Query: 191 KVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------------I 234
G + +TP++ + L ++ + G G K L+L +
Sbjct: 228 SYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTM 287
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-----DKTLPICWRG------PFK 283
DSG + + VY + + + + L++ D T+ +C+R F
Sbjct: 288 VDSGTQFTFLMGPVYTALKNEFIAQ-TKSVLRIVDDPNFVFQGTMDLCYRVGSSTRPNFT 346
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIG 343
L ++ F+ +S + ++ R+ G++ V S+ E +IG
Sbjct: 347 GLPVISLMFRGAEMSVSGQKLLYRVNGAGS-----EGKEEVYCFTFGNSDLLGIEAFVIG 401
Query: 344 EIFMQDKMVIYDNEKQRIGW 363
Q+ + +D K R+G+
Sbjct: 402 HHHQQNVWMEFDLAKSRVGF 421
>gi|223946655|gb|ACN27411.1| unknown [Zea mays]
Length = 378
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 82/318 (25%), Positives = 131/318 (41%), Gaps = 25/318 (7%)
Query: 54 KPPEKQYKPHKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDL 112
+P E H +PCS+ C ++ P C +P C Y I+Y + +S G L+ D
Sbjct: 10 RPAESTTSRH---LPCSHELCQSV-----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDT 61
Query: 113 FPLRFSNGSV-FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRN 171
L + V N + GCG Q L G+LGLG IS+ S L GL++N
Sbjct: 62 LHLNYREDHVPVNASVIIGCGQKQSG-DYLDGIAPDGLLGLGMADISVPSFLARAGLVQN 120
Query: 172 VIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL 231
C ++ G +F GD VPS TP + L+ Y + + K
Sbjct: 121 SFSMCFKEDSSGRIFFGDQGVPSQ--QSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSF 178
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
+ DSG S+ VY+ + + T ++ +D T C+ + V
Sbjct: 179 KALVDSGTSFTSLPFDVYKAFTMEFDKQMNAT--RVPYEDTTWKYCYSASPLEMPDVPT- 235
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGR-KNVCLGILNGSEAEVGENNIIGEIFMQDK 350
+ L+F + S++ V P + G CL +L +E +G II + F+
Sbjct: 236 ---ITLTFAADK-SLQAVNPILPFNDKQGALAGFCLAVLPSTEP-IG---IIAQNFLVGY 287
Query: 351 MVIYDNEKQRIGWKPEDC 368
V++D E ++GW +C
Sbjct: 288 HVVFDRESMKLGWYRSEC 305
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 149/361 (41%), Gaps = 45/361 (12%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRC-AALHWPN--PPRCK- 86
DTGSDLTWVQC PC+ C + + P + VPC+ C A+L P C
Sbjct: 181 DTGSDLTWVQCK-PCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCAT 239
Query: 87 -------HPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPG 139
+++C Y + YGDG S G L TD L G FGCG + N G
Sbjct: 240 VGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL----GGASVDGFVFGCGLS--NRG 293
Query: 140 PLSPPDTAGVLGLGRGRISIVSQL--REYGLIRNVIGHCIGQNGRGVLFLGDGKVP---S 194
TAG++GLGR +S+VSQ R G+ + + G L LG +
Sbjct: 294 LFG--GTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSYRNA 351
Query: 195 SGVAWTPMLQNSADLKHYILG---PAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQE 251
+ V++T M+ + A Y + + + + GL ++ DSG VY+
Sbjct: 352 TPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRLAPSVYRA 411
Query: 252 IVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
+ + R AP L C+ L E PL T R +
Sbjct: 412 VRAEFARQFGAERYPAAPPFSLLDACYN-----LTGHDEVKVPL---LTLRLEGGADMTV 463
Query: 312 PEAYLVISGRKN---VCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
A ++ RK+ VCL + + S + + IIG ++K V+YD R+G+ EDC
Sbjct: 464 DAAGMLFMARKDGSQVCLAMASLSFED--QTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 521
Query: 369 N 369
+
Sbjct: 522 S 522
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 101/392 (25%), Positives = 154/392 (39%), Gaps = 68/392 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V+ +G PP DTGSDL W QCDAPC C P Y P +++ V C +
Sbjct: 100 YLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVSCGSR 159
Query: 73 RCAAL--------HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
C AL + C Y YGDG S+ G L T+ F F G+ +
Sbjct: 160 LCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETF--TFGAGTTVH 217
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQN 180
L FGCG + +++G++G+GRG +S+VSQL G+ + +C
Sbjct: 218 -DLAFGCGTDNLG----GTDNSSGLVGMGRGPLSLVSQL---GVTK--FSYCFTPFNDTT 267
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSA----------DLKHYILG-------PA--ELLY 221
LFLG S TP + + + L+ +G PA L
Sbjct: 268 TSSPLFLGSSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFRLTA 327
Query: 222 SGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP 281
SG+ LI DSG ++ R + + + + PL + L +C+ P
Sbjct: 328 SGRG------GLIIDSGTTFTALEERAFVVLARAVAARVA-LPLA-SGAHLGLSVCFAAP 379
Query: 282 FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN--VCLGILNGSEAEVGEN 339
+ G L L F + P + V+ R CLGI++ V
Sbjct: 380 -QGRGPEAVDVPRLVLHFDGADMEL-----PRSSAVVEDRVAGVACLGIVSARGMSV--- 430
Query: 340 NIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
+G + Q+ V YD + + ++P +C L
Sbjct: 431 --LGSMQQQNMHVRYDVGRDVLSFEPANCGEL 460
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 98/370 (26%), Positives = 143/370 (38%), Gaps = 43/370 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + VG PP D+GSD+ WVQC PC C + + P + V C +
Sbjct: 130 YF-VRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSCGS 187
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L +CDY + YGDG + G L + L G + GC
Sbjct: 188 AICRTLSGTGCGG-GGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIGC 242
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G+ N G AG+LGLG G +S+V QL G V +C+ G G G L LG
Sbjct: 243 GH--RNSGLFV--GAAGLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAGGAGSLVLG 296
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD----LT------LIFDSG 238
+ G W P+++N+ Y +G + G+ L+D LT ++ D+G
Sbjct: 297 RTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTG 356
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+ Y + D L +P L C+ L P +S
Sbjct: 357 TAVTRLPREAYAALRGAF--DGAMGALPRSPAVSLLDTCYD-----LSGYASVRVP-TVS 408
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
F + +V L +P LV G CL S +I+G I + + D+
Sbjct: 409 FYFDQGAV-LTLPARNLLVEVGGAVFCLAFAPSSSGI----SILGNIQQEGIQITVDSAN 463
Query: 359 QRIGWKPEDC 368
+G+ P C
Sbjct: 464 GYVGFGPNTC 473
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 155/384 (40%), Gaps = 67/384 (17%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGC-TKPPEKQYKPHKNI--------VPCSNPR 73
VG PP+ + DTGS L W QC T C K +Q P+ N VPC +
Sbjct: 92 VGDPPQRAEALIDTGSSLIWTQC----TACLRKVCVRQDLPYFNASSSGSFAPVPCQDKA 147
Query: 74 CAA--LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
CA LH+ C + C + + YG GG IG L TD F + S G+ L FGC
Sbjct: 148 CAGNYLHF-----CAL-DGTCTFRVTYGAGGI-IGFLGTDAFTFQ-SGGAT----LAFGC 195
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
P +G++GLGRGR+S+ SQ + + LF+G
Sbjct: 196 VSFTRFAAPDVLHGASGLIGLGRGRLSLASQTGAKRFSYCLTPYFHNNGASSHLFVGAAA 255
Query: 192 VPSSG---VAWTPMLQNSAD----------LKHYILGPAELLYSGKSCGLKDLT------ 232
S G V +++ D L +G +L + L+++
Sbjct: 256 SLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGFWEG 315
Query: 233 -LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAP----DDKTLPICWRGPFKALGQ 287
+I DSG+ + Y+ ++ + R L G+ L P DD + +C A G
Sbjct: 316 GVIIDSGSPFTSLVEDAYEPLMGELARQLNGS---LVPPPGEDDGGMALC-----VARGD 367
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFM 347
+ L L F+ + + +PPE Y + C+ I+ G +IIG
Sbjct: 368 LDRVVPTLVLHFSGGAD---MALPPENYWAPLEKSTACMAIVRGYL-----QSIIGNFQQ 419
Query: 348 QDKMVIYDNEKQRIGWKPEDCNTL 371
Q+ +++D R+ ++ DC+T+
Sbjct: 420 QNMHILFDVGGGRLSFQNADCSTI 443
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 162/373 (43%), Gaps = 37/373 (9%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA-PCTGCTKPPEKQYKPHKN----IVP 68
F Y + + VG PP DTGSDL WV C + G + P ++ ++
Sbjct: 98 FEYL-MYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRSTTYSLLS 156
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV---FNV 125
C + C AL + C + +C Y+ YGDG +IG L T+ F + G V
Sbjct: 157 CQSAACQALSQAS---CDA-DSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVRV 212
Query: 126 P-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQ 179
P ++FGC + G + G++GLG G +S+VSQL I +C+
Sbjct: 213 PRVSFGC-----STGSAGSFRSDGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAA 267
Query: 180 NGRGVLFLGDGKVPSS-GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-LIFDS 237
N L G V S G A TP++ + D +Y + + +G+ + + +I DS
Sbjct: 268 NSSSTLSFGARAVVSDPGAASTPLVPSEVD-SYYTVALESVAVAGQDVASANSSRIIVDS 326
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP-LA 296
G + + + + +V+ + R I P + P ++ L +C+ + Q ++ P +
Sbjct: 327 GTTLTFLDPALLRPLVAELERR-IRLP-RAQPPEQLLQLCYD--VQGKSQAEDFGIPDVT 382
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
L F + + PE + +CL ++ SE++ +I+G I Q+ V YD
Sbjct: 383 LRFG---GGASVTLRPENTFSLLEEGTLCLVLVPVSESQ--PVSILGNIAQQNFHVGYDL 437
Query: 357 EKQRIGWKPEDCN 369
+ + + + DC
Sbjct: 438 DARTVTFAAVDCT 450
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 99/416 (23%), Positives = 157/416 (37%), Gaps = 84/416 (20%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQC---DAPCTGCTKPPEKQYKPHKNIVP----- 68
+ + L +G PP+ DTGSDLTWV C C C K P
Sbjct: 11 YLITLNIGTPPQAVQVYMDTGSDLTWVPCGNLSFDCIDCNDLKSNNLKSSSIFSPLHSSS 70
Query: 69 -----CSNPRCAALHWPNPP-----------------RCKHPNDQCDYEIEYGDGGSSIG 106
C++ CA +H + P C P Y YG+GG G
Sbjct: 71 SFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAY--TYGEGGLVSG 128
Query: 107 ALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREY 166
L D+ R + F +FGC + ++ + G+ G GRG +S+ SQL
Sbjct: 129 ILTRDILKARTRDVPRF----SFGCVTSTYH-------EPIGIAGFGRGLLSLPSQL--- 174
Query: 167 GLIRNVIGHCI-------GQNGRGVLFLGDGKVP---SSGVAWTPMLQNSADLKHYILGP 216
G + HC N L LG + + + +TPML Y +G
Sbjct: 175 GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYIG- 233
Query: 217 AELLYSGKSCGLKDLTL-------------IFDSGASYAYFTSRVYQEIVSLIMRDLIGT 263
E + G + + L + DSG +Y + + Y ++++ I++ I
Sbjct: 234 LESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLT-ILQSTITY 292
Query: 264 PLKLAPDDKT-LPICWRGP-----FKAL-GQVTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
P + +T +C++ P +L V F + +F N N+ L+ ++
Sbjct: 293 PRATETESRTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLN--NATLLLPQGNSFY 350
Query: 317 VIS----GRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+S G CL N + G + G Q+ V+YD EK+RIG++ DC
Sbjct: 351 AMSAPSDGSVVQCLLFQNMEDGNYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 406
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 162/378 (42%), Gaps = 40/378 (10%)
Query: 6 IEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---EKQYKP 62
+E P + ++++VG P K F DTGSDL WVQ + PCTGC+ +Q
Sbjct: 44 VESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDLVWVQSE-PCTGCSGGTIFDPRQSST 102
Query: 63 HKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV 122
+ + CS+ C L P C+ + C Y EYG G + G D L ++G
Sbjct: 103 FREM-DCSSQLCTEL----PGSCEPGSSACSYSYEYGS-GETEGEFARDTISLGTTSGGS 156
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----G 178
P +F G N G G++GLG+G +S+ SQL I + +C+
Sbjct: 157 QKFP-SFAVGCGMVNSG---FDGVDGLVGLGQGPVSLTSQLSA--AIDSKFSYCLVDINS 210
Query: 179 QNGRGVLFLG-DGKVPSSGVAWTPMLQNSADL-KHYILGPAELLYSGKSCGLKDLTLIFD 236
Q+ L G + +G+ T + S +Y+L + +G++ G T+I D
Sbjct: 211 QSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPGTTII-D 269
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
SG + Y S VY ++S M ++ P ++ L +C+ +K A
Sbjct: 270 SGTTLTYVPSGVYGRVLSR-MESMVTLP-RVDGSSMGLDLCYD------RSSNRNYKFPA 321
Query: 297 LSFTNRRNSVRLVVPPEA--YLVISGRKN-VCLGILNGSEAEVGENNIIGEIFMQDKMVI 353
L+ R + + PP + +LV+ + VCL + + V +IIG + Q ++
Sbjct: 322 LTI---RLAGATMTPPSSNYFLVVDDSGDTVCLAMGSAGGLPV---SIIGNVMQQGYHIL 375
Query: 354 YDNEKQRIGWKPEDCNTL 371
YD + + C +L
Sbjct: 376 YDRGSSELSFVQAKCESL 393
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 93/382 (24%), Positives = 155/382 (40%), Gaps = 52/382 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA-PCTGCTKPP--EKQYKPHKNIVPCSNPR 73
++LT+G PP+ DTGS+L+W+ C P T P Y P PC++
Sbjct: 59 LTISLTIGSPPQNVTMVLDTGSELSWLHCKKLPNLNSTFNPLLSSSYTP----TPCNSSV 114
Query: 74 CA--ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN--GSVFNVPLTF 129
C P C N C + Y D S+ G L + F L + G++F +
Sbjct: 115 CMTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTLFGCMDSA 174
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLG 188
G + + T G++G+ RG +S+V+Q ++ +CI G++ GVL LG
Sbjct: 175 GYTSDINEDA-----KTTGLMGMNRGSLSLVTQ-----MVLPKFSYCISGEDAFGVLLLG 224
Query: 189 DGKVPSSGVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGLKDLT----L 233
DG S + +TP++ + ++ I +LL KS + D T
Sbjct: 225 DGPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQT 284
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA-PD---DKTLPICWRGP--FKALGQ 287
+ DSG + + VY + + G ++ P+ + + +C+ P A+
Sbjct: 285 MVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASLAAVPA 344
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFM 347
VT F + R + RL+ Y V GR V S+ E +IG
Sbjct: 345 VTLVFSGAEM----RVSGERLL-----YRVSKGRDWVYCFTFGNSDLLGIEAYVIGHHHQ 395
Query: 348 QDKMVIYDNEKQRIGWKPEDCN 369
Q+ + +D K R+G+ C+
Sbjct: 396 QNVWMEFDLVKSRVGFTETTCD 417
>gi|302774304|ref|XP_002970569.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
gi|300162085|gb|EFJ28699.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
Length = 490
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 132/294 (44%), Gaps = 31/294 (10%)
Query: 94 YEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGL 152
Y+ +Y + +S G L D+ + FSN S + L FGC G L G++GL
Sbjct: 103 YQRQYAEKSTSSGVLGKDV--ISFSNSSDLGGQRLVFGC--ETAETGDLYDQTADGIIGL 158
Query: 153 GRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLK 210
GRG +SI+ QL E + +V C G G G + LG + P V + S
Sbjct: 159 GRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQPPKDMVFTSSDPHRSP--- 215
Query: 211 HYILGPAELLYSGKSCGLK------DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTP 264
+Y L + G LK + DSG +YAYF +Q S + ++ +G+
Sbjct: 216 YYNLMLKGIRVGGSPLRLKPEVFDGKYGTVLDSGTTYAYFPGAAFQAFKSAV-KEQVGSL 274
Query: 265 LKL-APDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV----IS 319
++ PD+K IC+ G + ++++F + F + ++ + + PE YL IS
Sbjct: 275 KEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFGDGQS---VTLSPENYLFRHTKIS 331
Query: 320 GRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTLLS 373
G CLG+ + ++G I +++ +V Y+ K IG+ CN L S
Sbjct: 332 GA--YCLGVFENGDP----TTLLGGIIVRNMLVTYNRGKASIGFLKTKCNDLWS 379
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 163/383 (42%), Gaps = 50/383 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +++ VG PPK DTGSDL+W+QCD PC C + Y P+++ + C +P
Sbjct: 170 YFIDMFVGTPPKHVWLILDTGSDLSWIQCD-PCYDCFEQNGPHYNPNESSSYRNISCYDP 228
Query: 73 RCAALHWPNP-PRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS--NGS---VFNVP 126
RC + P+P CK N C Y +Y DG ++ G + F + + NG V
Sbjct: 229 RCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVD 288
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCI-----GQN 180
+ FGCG+ N G G+LGLGRG +S SQL+ YG + +C+ +
Sbjct: 289 VMFGCGH--WNKGFFHG--AGGLLGLGRGPLSFPSQLQSIYG---HSFSYCLTDLFSNTS 341
Query: 181 GRGVLFLGDGK--VPSSGVAWTPML--QNSADLKHYILGPAELLYSGKSCGLKDLT---- 232
L G+ K + + +T +L + + D Y L ++ G+ + + T
Sbjct: 342 VSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWS 401
Query: 233 ------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
I DSG++ +F Y I + + ++A DD + C+
Sbjct: 402 SEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKI--KLQQIAADDFIMSPCYNVSGAMQV 459
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEI 345
++ +Y A + P E Y + +CL IL IIG +
Sbjct: 460 ELPDYGIHFA-------DGAVWNFPAENYFYQYEPDEVICLAILKTPNH--SHLTIIGNL 510
Query: 346 FMQDKMVIYDNEKQRIGWKPEDC 368
Q+ ++YD ++ R+G+ P C
Sbjct: 511 LQQNFHILYDVKRSRLGYSPRRC 533
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 153/376 (40%), Gaps = 43/376 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + + +G P + + DTGSDL W QC APC C P + P ++ + C++P
Sbjct: 90 YLMEMGIGTPTRYYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDPARSATYRSLGCASP 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL++P C C Y+ YGD S+ G L + F + V ++FGCG
Sbjct: 149 ACNALYYP---LCYQ--KVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCG 203
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQL---REYGLIRNVIGHCIGQNGRGVLF-LG 188
N G L+ + +G++G GRG +S+VSQL R + + + + GV L
Sbjct: 204 --NLNAGSLA--NGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPVPSRLYFGVYATLN 259
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----------IFDS 237
S V TP + N A Y L + G + I DS
Sbjct: 260 STNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDS 319
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
G + Y Y + + I PL D L C++ P VT L L
Sbjct: 320 GTTITYLAEPAYDAVRAAFASQ-ITLPLLNVTDASVLDTCFQWPPPPRQSVT--LPQLVL 376
Query: 298 SFTNRRNSVRLVVPPEAYLVI--SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
F + +P + Y+++ S +CL + A + +IIG Q+ V+YD
Sbjct: 377 HF----DGADWELPLQNYMLVDPSTGGGLCLAM-----ASSSDGSIIGSYQHQNFNVLYD 427
Query: 356 NEKQRIGWKPEDCNTL 371
E + + P C+ +
Sbjct: 428 LENSLMSFVPAPCHLM 443
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 93/365 (25%), Positives = 139/365 (38%), Gaps = 39/365 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ L +G P + DTGS LTW+QC C + + P + V CS
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYTSVRCSAS 193
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+C L NP C N C Y+ YGD S+G L TD S GS +G
Sbjct: 194 QCDELQAATLNPSACSASN-VCIYQASYGDSSFSVGYLSTD----TVSFGSTSYPSFYYG 248
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGD 189
CG Q N G +AG++GL R ++S++ QL + +C+ G L +G
Sbjct: 249 CG--QDNEGLFG--RSAGLIGLARNKLSLLYQLAPS--LGYSFSYCLPTAASTGYLSIGP 302
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYAYF 244
++TPM +S D Y + + + G + L I DSG
Sbjct: 303 YNT-GHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRL 361
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN 304
+ V+ + + + + G + AP L C+ GQ ++ P +
Sbjct: 362 PTAVHTALSKAVAQAMAGA--QRAPAFSILDTCFE------GQASQLRVPTVVMAFAGGA 413
Query: 305 SVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
S++L L+ CL A IIG Q VIYD + RIG+
Sbjct: 414 SMKLTT--RNVLIDVDDSTTCLAF-----APTDSTAIIGNTQQQTFSVIYDVAQSRIGFS 466
Query: 365 PEDCN 369
C+
Sbjct: 467 AGGCS 471
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 89/350 (25%), Positives = 148/350 (42%), Gaps = 35/350 (10%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWP--NPPRCKHP 88
DTGSDLTWVQC PC C + + P + + C++ C +L + N C
Sbjct: 83 DTGSDLTWVQCQ-PCRLCYNQQDPLFNPSGSPSYQTILCNSSTCQSLQYATGNLGVCGSN 141
Query: 89 NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAG 148
C+Y + YGDG + G L + L ++ S F FGCG N N G +G
Sbjct: 142 TPTCNYVVNYGDGSYTRGDLGMEQLNLGTTHVSNF----IFGCGRN--NKGLFG--GASG 193
Query: 149 VLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDGKV---PSSGVAWTPM 202
++GLG+ +S+VSQ + V +C+ + G L LG ++ +++T M
Sbjct: 194 LMGLGKSDLSLVSQTS--AIFEGVFSYCLPTTAADASGSLILGGNSSVYKNTTPISYTRM 251
Query: 203 LQNSADLKHYILGPAELLYSG---KSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRD 259
+ N Y L + G ++ + ++ DSG VY+++ + ++
Sbjct: 252 IANPQLPTFYFLNLTGISIGGVALQAPNYRQSGILIDSGTVITRLPPPVYRDLKAEFLKQ 311
Query: 260 LIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVIS 319
G P AP L C+ L E P + + V Y V +
Sbjct: 312 FSGFP--SAPPFSILDTCFN-----LNGYDEVDIPTIRMQFEGNAELTVDVTGIFYFVKT 364
Query: 320 GRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
VCL + + S + E IIG +++ VIY+ ++ ++G+ E C+
Sbjct: 365 DASQVCLALASLSFDD--EIPIIGNYQQRNQRVIYNTKESKLGFAAEACS 412
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 91/364 (25%), Positives = 139/364 (38%), Gaps = 36/364 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ L +G P + DTGS LTW+QC C + + P + V CS+
Sbjct: 131 YVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAGPVFDPRASGTYAAVQCSSS 190
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L NP C N C Y+ YGD S+G L D + F +GS +G
Sbjct: 191 ECGELQAATLNPSACSVSN-VCIYQASYGDSSYSVGYLSKDT--VSFGSGSFPG--FYYG 245
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG Q N G +AG++GL + ++S++ QL + +C+ + +L G
Sbjct: 246 CG--QDNEGLFG--RSAGLIGLAKNKLSLLYQLAPS--LGYAFSYCLPTSSAAAGYLSIG 299
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYAYFT 245
++TPM +S D Y + + + +G + + L I DSG
Sbjct: 300 SYNPGQYSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYRSLPTIIDSGTVITRLP 359
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
VY + + + AP L C+RG L + ++F
Sbjct: 360 PNVYTALSRAVAAAMASA-APRAPTYSILDTCFRGSAAGL-----RVPRVDMAFA---GG 410
Query: 306 VRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKP 365
L + P L+ CL A G IIG Q V+YD + RIG+
Sbjct: 411 ATLALSPGNVLIDVDDSTTCLAF-----APTGGTAIIGNTQQQTFSVVYDVAQSRIGFAA 465
Query: 366 EDCN 369
C+
Sbjct: 466 GGCS 469
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 78/236 (33%), Positives = 115/236 (48%), Gaps = 34/236 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG--CTKPPEKQYKPHK----NIVPCS 70
+ V +++G P + DTGSD++WVQC PC C + + P + + VPC+
Sbjct: 142 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCK-PCPSPPCYSQRDPLFDPTRSSSYSAVPCA 200
Query: 71 NPRCAALH-WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
C+ L + N C QC Y + YGDG ++ G +D L SN F
Sbjct: 201 AASCSQLALYSN--GCS--GGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNA---LKGFLF 253
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCI--GQNGRGVLF 186
GCG+ Q G + D G+LGLGR S+VSQ YG V +C+ QN G +
Sbjct: 254 GCGHAQQ--GLFAGVD--GLLGLGRQGQSLVSQASSTYG---GVFSYCLPPTQNSVGYIS 306
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---IFDSGA 239
LG G ++G + TP+L S D +YI ++ +G S G + L++ +F SGA
Sbjct: 307 LG-GPSSTAGFSTTPLLTASNDPTYYI-----VMLAGISVGGQPLSIDASVFASGA 356
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 97/370 (26%), Positives = 143/370 (38%), Gaps = 43/370 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + VG PP D+GSD+ WVQC PC C + + P + V C +
Sbjct: 130 YF-VRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSCGS 187
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L +CDY + YGDG + G L + L G + GC
Sbjct: 188 AICRTLSGTGCGG-GGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIGC 242
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G+ N G AG+LGLG G +S++ QL G V +C+ G G G L LG
Sbjct: 243 GH--RNSGLFV--GAAGLLGLGWGAMSLIGQLG--GAAGGVFSYCLASRGAGGAGSLVLG 296
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD----LT------LIFDSG 238
+ G W P+++N+ Y +G + G+ L+D LT ++ D+G
Sbjct: 297 RTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDTG 356
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+ Y + D L +P L C+ L P +S
Sbjct: 357 TAVTRLPREAYAALRGAF--DGAMGALPRSPAVSLLDTCYD-----LSGYASVRVP-TVS 408
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
F + +V L +P LV G CL S +I+G I + + D+
Sbjct: 409 FYFDQGAV-LTLPARNLLVEVGGAVFCLAFAPSSSGI----SILGNIQQEGIQITVDSAN 463
Query: 359 QRIGWKPEDC 368
+G+ P C
Sbjct: 464 GYVGFGPNTC 473
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 158/386 (40%), Gaps = 59/386 (15%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDF------DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN 65
P + +TVG P + D F D GSD+TW+QC PC C P Y K+
Sbjct: 120 PTSGEYIAKITVGTPYE-NDSSFEALLSPDMGSDVTWLQC-MPCFRCYHQPGPVYNRLKS 177
Query: 66 I----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS 121
V C P C AL + C ++C Y++EYGDG SS G + L F G
Sbjct: 178 SSASDVGCYAPACRALG--SSGGCVQFLNECQYKVEYGDGSSSAGDFGVET--LTFPPG- 232
Query: 122 VFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
VP + GCG + L P AG+LGLGRG +S SQ+ G +C+
Sbjct: 233 -VRVPGVAIGCGSDNQG---LFPAPAAGILGLGRGSLSFPSQIA--GRYGRSFSYCLAGQ 286
Query: 181 GRG----VLFLGDGKVP----SSGVAWTPMLQNSADLKHYILGPAELLYSG---KSCGLK 229
G G L G G ++ ++TPML NS Y +G + G +
Sbjct: 287 GTGGRSSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTES 346
Query: 230 DLTL---------IFDSGASYAYFTSRVY---QEIVSLIMRDLIGTPLKLAPDDKTLPIC 277
DL L I DSG + + Y ++ + +G P P C
Sbjct: 347 DLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPSPGGP-FAFFDTC 405
Query: 278 WRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYL--VISGRKNVCLGILNGSEAE 335
+ G+V + +++ F V + +PP+ YL V S + +C +
Sbjct: 406 YS---SVRGRVMKKVPAVSMHFA---GGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRG 459
Query: 336 VGENNIIGEIFMQDKMVIYDNEKQRI 361
V +IIG I +Q V+YD + QR+
Sbjct: 460 V---SIIGNIQLQGFRVVYDVDGQRV 482
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 106/377 (28%), Positives = 160/377 (42%), Gaps = 44/377 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + VG PP+ F DTGSDL W+QC APC C + P + V C +
Sbjct: 150 YLVEVYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFDQRGPVFDPMASTSYRNVTCGDT 208
Query: 73 RCAALHWPNPPR-CKHP-NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTF 129
RC + P PR C+ +D C Y YGD ++ G L + F + + S V +
Sbjct: 209 RCGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVDGVVL 268
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRGV---L 185
GCG+ N G AG+LGLGRG +S SQLR YG + +C+ +G V +
Sbjct: 269 GCGH--RNRGLFH--GAAGLLGLGRGPLSFASQLRAVYG---HAFSYCLVDHGSAVGSKI 321
Query: 186 FLGDGKVPSS--GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------- 232
GD V S + +T ++A+ Y + +L G+ + T
Sbjct: 322 VFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGG 381
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
I DSG + +YF Y+ I + D + L D L C+ +V E+
Sbjct: 382 TIIDSGTTLSYFPEPAYKAIRQAFV-DRMDKAYPLIADFPVLSPCYNVSGVERVEVPEF- 439
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
+L F + P E Y + + +CL +L + + +IIG Q+
Sbjct: 440 ---SLLFA---DGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSAM---SIIGNYQQQNFH 490
Query: 352 VIYDNEKQRIGWKPEDC 368
V+YD R+G+ P C
Sbjct: 491 VLYDLHHNRLGFAPRRC 507
>gi|330842955|ref|XP_003293432.1| hypothetical protein DICPUDRAFT_158270 [Dictyostelium purpureum]
gi|325076242|gb|EGC30045.1| hypothetical protein DICPUDRAFT_158270 [Dictyostelium purpureum]
Length = 484
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 95/375 (25%), Positives = 156/375 (41%), Gaps = 55/375 (14%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCS 70
+++ +N V + F DTGS LT + C C + Y P + ++PCS
Sbjct: 80 NFYQINANVYIGGQKFILQVDTGSTLTAIPL-KNCNNC-RGERPVYNPEISNSSILIPCS 137
Query: 71 NPRCAALHWPNPPRCKHPNDQ--CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
+ C P H + + CD+ I YGDG G + +D + NG V
Sbjct: 138 SDHCLGSGSAAPSCRLHQSSKSSCDFVILYGDGSKVRGKIYSDEITM---NG----VKSI 190
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGR--GRISIV-----SQLREYGLIRNVIGHCIGQNG 181
G N G P G++GLGR ++V S +R ++NV G + G
Sbjct: 191 GFFGANVEEVGTFEYPRADGIMGLGRTGNNKNLVPTIFESMVRANSSMKNVFGIYLDYQG 250
Query: 182 RGVLFLG--DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL-TLIFDSG 238
+G L LG + + +TP++QN Y + P S S L +I DSG
Sbjct: 251 QGHLSLGRINPNFYVGEIEYTPVVQNGP---FYSIKPTSFRISNTSFLASSLGQVIVDSG 307
Query: 239 ASYAYFTSRVYQEIVSLIMR-----DLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
S + ++Y +++ R D++ P+ + R F+ + E F
Sbjct: 308 TSDIILSGKIYDHLIAFFRRHYCHIDMVCDPISI--------FTGRACFER-EEDFESFP 358
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVIS-----GRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
L F+ VR+ +PP+ Y++ + G C GI G + I+G++FM+
Sbjct: 359 WLHFGFSG---GVRIAIPPKNYMIKTQSTQPGVYGYCWGIDRGEDM-----TILGDVFMR 410
Query: 349 DKMVIYDNEKQRIGW 363
I+DNE+ R+G+
Sbjct: 411 GYYTIFDNEENRVGF 425
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 71/279 (25%), Positives = 113/279 (40%), Gaps = 37/279 (13%)
Query: 1 MYVSWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQY 60
M I+ P + +NL +G PP DTGSDLTW QC PCT C K +
Sbjct: 76 MTSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQC-RPCTHCYKQVVPLF 134
Query: 61 KPHKNIV----PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR 116
P + C C AL R +C + Y DG + G L ++ +
Sbjct: 135 DPKNSSTYRDSSCGTSFCLAL---GKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVD 191
Query: 117 FSNGSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGH 175
+ G + P FGCG H+ G + ++G++GLG G +S++SQL+ I + +
Sbjct: 192 STAGKPVSFPGFAFGCG---HSSGGIFDKSSSGIVGLGGGELSLISQLKS--TINGLFSY 246
Query: 176 CI------GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSG--KSCG 227
C+ + F G+V G TP+ L Y G K
Sbjct: 247 CLLPVSTDSSISSRINFGASGRVSGYGTVSTPL---------------RLPYKGYSKKTE 291
Query: 228 LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLK 266
+++ +I DSG +Y + Y ++ + + G ++
Sbjct: 292 VEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVR 330
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 107/382 (28%), Positives = 150/382 (39%), Gaps = 49/382 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF +++ VG PPK F DTGSDL W+QC PC C + Y P + + C +
Sbjct: 195 YF-MDVFVGTPPKHFSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKDSSSFRNISCHD 252
Query: 72 PRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS--NGS-----VF 123
PRC + P+PP CK N C Y YGDG ++ G + F + + NG V
Sbjct: 253 PRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHVE 312
Query: 124 NVPLTFGCGYNQH---NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RN----VIGH 175
NV FGCG+ + G L S+ Q Y L+ RN V
Sbjct: 313 NV--MFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSK 370
Query: 176 CIGQNGRGVL--------FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCG 227
I + +L G GK S + + NS + +L E + S G
Sbjct: 371 LIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQI-NSVMVDDEVLKIPEETWHLSSEG 429
Query: 228 LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQ 287
I DSG + YF Y+ I +R + G L + LP P K
Sbjct: 430 AGG--TIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELV-----EGLP-----PLKPCYN 477
Query: 288 VTEYFKPLALSF-TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIF 346
V+ K F + P E Y + VCL IL + + +IIG
Sbjct: 478 VSGIEKMELPDFGILFADGAVWNFPVENYFIQIDPDVVCLAILGNPRSAL---SIIGNYQ 534
Query: 347 MQDKMVIYDNEKQRIGWKPEDC 368
Q+ ++YD +K R+G+ P C
Sbjct: 535 QQNFHILYDMKKSRLGYAPMKC 556
>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 553
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 103/400 (25%), Positives = 151/400 (37%), Gaps = 57/400 (14%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-------------YKPH-- 63
+ +G P F DTGSDL WV CD CT C+ Y P+
Sbjct: 103 TTIELGTPGVKFMVALDTGSDLFWVPCD--CTRCSATRSSAFASALASDFDLSVYNPNGS 160
Query: 64 --KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRF--S 118
V C+N C + +C C Y + Y +S G LV D+ L
Sbjct: 161 STSKKVTCNNSLCT-----HRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDD 215
Query: 119 NGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
N + + FGCG Q + L G+ GLG +IS+ S L G + C G
Sbjct: 216 NHDLVEANVIFGCGQVQ-SGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFG 274
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSG 238
++G G + GD S TP N + + I + G + + T +FDSG
Sbjct: 275 RDGIGRISFGDKG--SLDQDETPFNVNPSHPTYNI--TINQVRVGTTLIDVEFTALFDSG 330
Query: 239 ASYAYFT----SRVYQEIVSLIMRDLIGTPLKLAP-------------DDKTLPICWRGP 281
S+ Y SR+ + + I L LK+ +D+ P R P
Sbjct: 331 TSFTYLVDPTYSRLSESVSDKICFHLARCYLKIKVTIEVFMLQFHSQVEDRRRPPDSRIP 390
Query: 282 FKALGQVTEYFKPL---ALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGE 338
F ++ ++S T S +V P + CL ++ +E
Sbjct: 391 FDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELVYCLAVVKSAEL---- 446
Query: 339 NNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTLLSLNHFI 378
NIIG+ FM V++D EK +GWK DC + N+ I
Sbjct: 447 -NIIGQNFMTGYRVVFDREKLILGWKKSDCYDIEDHNNAI 485
>gi|6579210|gb|AAF18253.1|AC011438_15 T23G18.7 [Arabidopsis thaliana]
Length = 566
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 74/262 (28%), Positives = 118/262 (45%), Gaps = 50/262 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYK---------PHKNIV 67
+ + +G PP+ F+ DTGSD+ WV C + C GC K E Q + ++V
Sbjct: 132 YYTKVKLGTPPREFNVQIDTGSDVLWVSCTS-CNGCPKTSELQIQLSFFDPGVSSSASLV 190
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
CS+ RC + ++ C PN+ C Y +YGDG + G ++D
Sbjct: 191 SCSDRRCYS-NFQTESGCS-PNNLCSYSFKYGDGSGTSGYYISD---------------- 232
Query: 128 TFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRG 183
F C Q G L P A G+ GLG+G +S++SQL GL V HC+ ++G G
Sbjct: 233 -FMCSNLQS--GDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGG 289
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK---------SCGLKDLTLI 234
++ LG K P + +TP++ + HY + + +G+ + D T+I
Sbjct: 290 IMVLGQIKRPDT--VYTPLVPSQP---HYNVNLQSIAVNGQILPIDPSVFTIATGDGTII 344
Query: 235 FDSGASYAYFTSRVYQEIVSLI 256
D+G + AY Y + +
Sbjct: 345 -DTGTTLAYLPDEAYSPFIQAV 365
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 140/354 (39%), Gaps = 37/354 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + +G P + FDTGSDLTW QC+ C K + + P K+ + C++
Sbjct: 146 YFVV-VGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDVIFDPSKSTSYSNITCTS 204
Query: 72 PRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
C L N P C C Y I+YGD S+G + + ++ V N F
Sbjct: 205 ALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRERLTVTATD-VVDN--FLF 261
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD 189
GCG Q+N G +AG++GLGR IS V Q R + +C+ L
Sbjct: 262 GCG--QNNQGLFG--GSAGLIGLGRHPISFVQQTA--AKYRKIFSYCLPSTSSSTGHLSF 315
Query: 190 GKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAY 243
G + + +TP S Y L + G + T I DSG
Sbjct: 316 GPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFSTGGAIIDSGTVITR 375
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR-GPFKALGQVTEYFKPLALSFTNR 302
Y + S + + P A + L C+ +K T + SF
Sbjct: 376 LPPTAYGALRSAFRQGMSKYP--SAGELSILDTCYDLSGYKVFSIPT-----IEFSFA-- 426
Query: 303 RNSVRLVVPPEAYLVISGRKNVCLGI-LNGSEAEVGENNIIGEIFMQDKMVIYD 355
V + +PP+ L ++ K VCL NG +++V I G + + V+YD
Sbjct: 427 -GGVTVKLPPQGILFVASTKQVCLAFAANGDDSDV---TIYGNVQQRTIEVVYD 476
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 159/362 (43%), Gaps = 34/362 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +++++G PP + DTGSDL W QC PC C K + P K+ VPC++
Sbjct: 92 YLMSVSIGTPPVDYIGMADTGSDLMWAQC-LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQ 150
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C A+ + C CDY YGD + G DL + + GS +V GCG
Sbjct: 151 NCKAI---DDSHCG-AQGVCDYSYTYGDQTYTKG----DLGFEKITIGSS-SVKSVIGCG 201
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNGRGVLFL 187
+ +GV+GLG G++S+VSQ+ + I +C+ NG+ + F
Sbjct: 202 HESGG----GFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGK-INFG 256
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYI-LGPAELLYSGKSCGLKDLTLIFDSGASYAYFTS 246
+ V GV TP++ + +Y+ L + K +I DSG + ++
Sbjct: 257 QNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAKQGNVIIDSGTTLSFLPK 316
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSV 306
+Y +VS +++ + +K +C+ + T P+ + + +V
Sbjct: 317 ELYDGVVSSLLKVVKAKRVK--DPGNFWDLCFD---DGINVATSSGIPIITAQFSGGANV 371
Query: 307 RLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPE 366
L +P + ++ N CL + S + E IIG + + + ++ YD E +R+ +KP
Sbjct: 372 NL-LPVNTFQKVANNVN-CLTLTPASPTD--EFGIIGNLALANFLIGYDLEAKRLSFKPT 427
Query: 367 DC 368
C
Sbjct: 428 VC 429
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 157/377 (41%), Gaps = 44/377 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC----TKPPEKQYKPHKNIVPCSNP 72
+ + L +G PP F DTGSDLTW QC PC C T + + VPC++
Sbjct: 93 YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPIYDTAVSSSFSPVPCASA 151
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C + W + C + C Y YGDG S G L T+ + G V + FGCG
Sbjct: 152 TCLPI-W-SSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPG-VSVGGIAFGCG 208
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF--LGDG 190
+ G LS ++ G +GLGRG +S+V+QL + G VLF L +
Sbjct: 209 VDN---GGLS-YNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLGSPVLFGALAEL 264
Query: 191 KVPSSGVA--WTPMLQNS-------ADLKHYILGPAELLYSGKSCGLKDL---TLIFDSG 238
PS+G A TP++Q+ L+ LG A L + L+D +I DSG
Sbjct: 265 AAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVDSG 324
Query: 239 ASYAYFTS---RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW-RGPFKALGQVTEYFKP 294
++ + RV + V+ ++R + L D P A+ + +F
Sbjct: 325 TTFTFLVESAFRVVVDHVAGVLRQPVVNASSL--DSPCFPAATGEQQLPAMPDMVLHFAG 382
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
A +R N + ++ CL I A+V +I+G Q+ +++
Sbjct: 383 GADMRLHRDNYMSFNQEESSF---------CLNIAGSPSADV---SILGNFQQQNIQMLF 430
Query: 355 DNEKQRIGWKPEDCNTL 371
D ++ + P DC L
Sbjct: 431 DITVGQLSFMPTDCGKL 447
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 56/162 (34%), Positives = 75/162 (46%), Gaps = 15/162 (9%)
Query: 7 EFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN- 65
E P + V L +G PP F DT SDL W QC PCTGC + + P +
Sbjct: 79 ETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSS 137
Query: 66 ---IVPCSNPRCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGS 121
+PCS+ C L + RC H +D+ C Y Y ++ G L D + G
Sbjct: 138 TYAALPCSSDTCDEL---DVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVI----GE 190
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
+ FGC + P PP +GV+GLGRG +S+VSQL
Sbjct: 191 DAFRGVAFGCSTSSTGGAP--PPQASGVVGLGRGPLSLVSQL 230
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 101/388 (26%), Positives = 160/388 (41%), Gaps = 56/388 (14%)
Query: 12 PIFS---YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI-- 66
PIF+ + V ++VG PP DTGSD+ W QC PC+ C + + P K+
Sbjct: 75 PIFNNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCK-PCSNCYQQNAPMFDPSKSTTY 133
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
V CS+P C+ + + C + +C Y I YGD S G L D ++ ++G
Sbjct: 134 KNVACSSPVCS--YSGDGSSCSD-DSECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVA 190
Query: 125 VPLT-FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-------EYGLIRNVIGHC 176
P T GCG++ N G + + +G++GLGRG S+V+QL Y LI IG
Sbjct: 191 FPRTVIGCGHD--NAGTFN-ANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIP--IGTG 245
Query: 177 IGQNGRGVLFLGDGKVPSSGVAWTPMLQN-------SADLKHYILGPAELLY-SGKSCGL 228
+ + F + V SG TP+ + S L+ +G + + G S
Sbjct: 246 STNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLG 305
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTLPICWRGPFK--AL 285
+ +I DSG + Y S + S I + + L A D + L C+ +
Sbjct: 306 GESNIIIDSGTTLTYLPSALLNSFGSAISQSM---SLPHAQDPSEFLDYCFATTTDDYEM 362
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNII--G 343
VT +F+ + VRL +CL ++NI G
Sbjct: 363 PPVTMHFEGADVPLQRENLFVRL-----------SDDTICLAF-----GSFPDDNIFIYG 406
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
I + +V YD + + ++P C +
Sbjct: 407 NIAQSNFLVGYDIKNLAVSFQPAHCGAV 434
>gi|255637574|gb|ACU19113.1| unknown [Glycine max]
Length = 290
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 61/206 (29%), Positives = 95/206 (46%), Gaps = 26/206 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH---------KNIV 67
+ + +G PP+ DTGSD+ WV C + C GC + Q + + +++
Sbjct: 77 YYTKVKLGTPPRELYVQIDTGSDVLWVSCGS-CNGCPQTSGLQIQLNYFDPGSSSTSSLI 135
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
C + RC + + C N+QC Y +YGDG + G V+DL S+F L
Sbjct: 136 SCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHF----ASIFEGTL 191
Query: 128 T--------FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-G 178
T FGC Q S G+ G G+ +S++SQL G+ V HC+ G
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKG 251
Query: 179 QN-GRGVLFLGDGKVPSSGVAWTPML 203
N G GVL LG+ P+ + ++P++
Sbjct: 252 DNSGGGVLVLGEIVEPN--IVYSPLV 275
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 112/385 (29%), Positives = 166/385 (43%), Gaps = 54/385 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ +++ VG PP+ F DTGSDL W+QC APC C + P ++N+ C +
Sbjct: 151 YLMDVYVGTPPRRFRMIMDTGSDLNWLQC-APCLDCFDQVGPVFDPAASSSYRNVT-CGD 208
Query: 72 PRCAALHWPNPPR-CKHP-NDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNV-PL 127
RC + P PPR C+ P D C Y YGD ++ G L + F + + G+ V +
Sbjct: 209 QRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDDV 268
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRGV-- 184
FGCG+ N G AG+LGLGRG +S SQLR YG + +C+ +G V
Sbjct: 269 VFGCGH--WNRGLFH--GAAGLLGLGRGPLSFASQLRAVYG---HTFSYCLVDHGSDVAS 321
Query: 185 -LFLGDGKVPSSG--------VAWTPMLQNSADLKHY-----ILGPAELL------YSGK 224
+ G+ + A+ P + AD +Y +L ELL +
Sbjct: 322 KVVFGEDDALALAAAHPQLNYTAFAPA-SSPADTFYYVKLKGVLVGGELLNISSDTWGVG 380
Query: 225 SCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKA 284
I DSG + +YF YQ I + D +G L PD L C+
Sbjct: 381 EGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFI-DRMGRSYPLIPDFPVLSPCYNVSGVD 439
Query: 285 LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIG 343
+V E L+L F + P E Y + + +CL +L + +IIG
Sbjct: 440 RPEVPE----LSLLFA---DGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGM---SIIG 489
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDC 368
Q+ V+YD + R+G+ P C
Sbjct: 490 NFQQQNFHVVYDLKNNRLGFAPRRC 514
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 97/370 (26%), Positives = 152/370 (41%), Gaps = 45/370 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHK----NIVPCSN 71
F V + G P + + FDTGSD++W+QC PC+G C K + + P K + VPC +
Sbjct: 120 FVVTVGFGTPAQTYTLMFDTGSDVSWIQC-LPCSGHCYKQHDPIFDPTKSATYSAVPCGH 178
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
P+CAA +C N C Y+++YGDG S+ G L + L S +P FG
Sbjct: 179 PQCAAAGG----KCSS-NGTCLYKVQYGDGSSTAGVLSHETLSLT----SARALPGFAFG 229
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG + N G D G++GLGRG++S+ SQ G L +G
Sbjct: 230 CG--ETNLGDFG--DVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSHGYLTIGT- 284
Query: 191 KVPSS---GVAWTPMLQNSADLKHYILGPAELLYSGKSCGL------KDLTLIFDSGASY 241
P+S GV +T M+Q Y + ++ G + +D TL+ DSG
Sbjct: 285 TTPASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDGTLL-DSGTVL 343
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
Y Y + + T K AP C+ GQ + ++ F++
Sbjct: 344 TYLPPEAYTALRDRFKFTM--TQYKPAPAYDPFDTCY----DFAGQNAIFMPLVSFKFSD 397
Query: 302 RRNSVRLVVPPEAYLVI---SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
+ + P L+ + CL + I+G ++ +IYD
Sbjct: 398 GSS---FDLSPFGVLIFPDDTAPATGCLAFV--PRPSTMPFTIVGNTQQRNTEMIYDVAA 452
Query: 359 QRIGWKPEDC 368
++IG+ C
Sbjct: 453 EKIGFVSGSC 462
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 153/376 (40%), Gaps = 43/376 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + + +G P + + DTGSDL W QC APC C P + P ++ + C++P
Sbjct: 90 YLMEMGIGTPTRYYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDPARSATYRSLGCASP 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL++P C C Y+ YGD S+ G L + F + V ++FGCG
Sbjct: 149 ACNALYYP---LCYQ--KVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCG 203
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQL---REYGLIRNVIGHCIGQNGRGVLF-LG 188
N G L+ + +G++G GRG +S+VSQL R + + + + GV L
Sbjct: 204 --NLNAGLLA--NGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPVPSRLYFGVYATLN 259
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----------IFDS 237
S V TP + N A Y L + G + I DS
Sbjct: 260 STNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDS 319
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
G + Y Y + + I PL D L C++ P VT L L
Sbjct: 320 GTTITYLAEPAYDAVRAAFASQ-ITLPLLNVTDASVLDTCFQWPPPPRQSVT--LPQLVL 376
Query: 298 SFTNRRNSVRLVVPPEAYLVI--SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
F + +P + Y+++ S +CL + A + +IIG Q+ V+YD
Sbjct: 377 HF----DGADWELPLQNYMLVDPSTGGGLCLAM-----ASSSDGSIIGSYQHQNFNVLYD 427
Query: 356 NEKQRIGWKPEDCNTL 371
E + + P C+ +
Sbjct: 428 LENSLMSFVPAPCHLM 443
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 91/380 (23%), Positives = 157/380 (41%), Gaps = 54/380 (14%)
Query: 18 AVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP--------HKNIVPC 69
++ + VG PP+ D GSDL W QC P KQ +P +++PC
Sbjct: 108 SLTVGVGTPPQPSKVILDLGSDLLWTQC-----SLVGPTAKQLEPVFDAARSSSFSVLPC 162
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C A + N C + +C YE +YG ++ G L T+ F +G N LTF
Sbjct: 163 DSKLCEAGTFTN-KTCT--DRKCAYENDYGI-MTATGVLATETFTFGAHHGVSAN--LTF 216
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVL 185
GCG + + + +G+LGL G +S++ Q L +C+ + V+
Sbjct: 217 GCGKLANG----TIAEASGILGLSPGPLSMLKQ-----LAITKFSYCLTPFADRKTSPVM 267
Query: 186 F--LGD-GKVPSSGVAWT-PMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-------- 233
F + D GK ++G T P+L+N + +Y + + K + TL
Sbjct: 268 FGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTG 327
Query: 234 --IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
+ DS + AY + E+ +M + + DD P+C+ P + +
Sbjct: 328 GTVLDSATTLAYLVEPAFTELKKAVMEGIKLPVANRSVDD--YPVCFELP-RGMSMEGVQ 384
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
PL L F + +P + Y +CL ++ G N+IG + Q+
Sbjct: 385 VPPLVLHFD---GDAEMSLPRDNYFQEPSPGMMCLAVMQAPFE--GAPNVIGNVQQQNMH 439
Query: 352 VIYDNEKQRIGWKPEDCNTL 371
V+YD ++ + P C+++
Sbjct: 440 VLYDVGNRKFSYAPTKCDSI 459
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 87/374 (23%), Positives = 151/374 (40%), Gaps = 48/374 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK--------PPEKQYKPHKNIVP 68
+ NLT+G PP+ + W QC +PC C K Y+P P
Sbjct: 28 YMANLTIGTPPQPASAIIHLAGEFVWTQC-SPCRRCFKQDLPLFNRSASSTYRPE----P 82
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIE--YGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
C C ++ P + C YE+E +GD S IG TD F + + S
Sbjct: 83 CGTALCESV----PASTCSGDGVCSYEVETMFGDT-SGIGG--TDTFAIGTATAS----- 130
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG----R 182
L FGC + + L +GV+GLGR S+V Q+ +C+ +G +
Sbjct: 131 LAFGCAMDSNIKQLLG---ASGVVGLGRTPWSLVGQMNA-----TAFSYCLAPHGAAGKK 182
Query: 183 GVLFLGDGKVPSSG--VAWTPMLQNSADLKHYILGPAELLYSGKSCGL--KDLTLIFDSG 238
L LG + G A TP++ S D Y++ + + ++ D+
Sbjct: 183 SALLLGASAKLAGGKSAATTPLVNTSDDSSDYMIHLEGIKFGDVIIAPPPNGSVVLVDTI 242
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
++ +Q I + + P+ A K +C+ P A PL
Sbjct: 243 FGVSFLVDAAFQAIKKAVTVAVGAAPM--ATPTKPFDLCF--PKAAAAAGANSSLPLPDV 298
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVG-ENNIIGEIFMQDKMVIYDNE 357
+ + L VPP Y+ +G VCL +++ + + E +I+G + ++ ++D +
Sbjct: 299 VLTFQGAAALTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLD 358
Query: 358 KQRIGWKPEDCNTL 371
K+ + ++P DC++L
Sbjct: 359 KETLSFEPADCSSL 372
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 93/351 (26%), Positives = 148/351 (42%), Gaps = 38/351 (10%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWP--NPPRCKHP 88
DTGSDL+WVQC PC C + + P K+ V C++ C +L N C
Sbjct: 82 DTGSDLSWVQCQ-PCNRCYNQQDPVFNPSKSPSYRTVLCNSLTCRSLQLATGNSGVCGSN 140
Query: 89 NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAG 148
C+Y + YGDG + G + + L N +V N FGCG + N G +G
Sbjct: 141 PPTCNYVVNYGDGSYTSGEV--GMEHLNLGNTTVNN--FIFGCG--RKNQGLFG--GASG 192
Query: 149 VLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDGKV---PSSGVAWTPM 202
++GLGR +S++SQ+ + V +C+ G L +G ++ +++T M
Sbjct: 193 LVGLGRTDLSLISQISP--MFGGVFSYCLPTTEAEASGSLVMGGNSSVYKNTTPISYTRM 250
Query: 203 LQNSADLKHYILGPAELLYSG---KSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRD 259
+ N L Y L + G ++ +I DSG + +YQ + + ++
Sbjct: 251 IHNPL-LPFYFLNLTGITVGGVEVQAPSFGKDRMIIDSGTVISRLPPSIYQALKAEFVKQ 309
Query: 260 LIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVIS 319
G P AP L C+ L E P + + + V Y V +
Sbjct: 310 FSGYP--SAPSFMILDSCFN-----LSGYQEVKIPDIKMYFEGSAELNVDVTGVFYSVKT 362
Query: 320 GRKNVCLGILN-GSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
VCL I + E EVG IIG +++ +IYD + +G+ E C+
Sbjct: 363 DASQVCLAIASLPYEDEVG---IIGNYQQKNQRIIYDTKGSMLGFAEEACS 410
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 108/389 (27%), Positives = 172/389 (44%), Gaps = 62/389 (15%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF +++ +G PP+ F DTGSDL W+QC PC C Y P ++ + C +
Sbjct: 192 YF-MDVFIGTPPRHFSLILDTGSDLNWIQC-VPCYDCFVQNGPYYDPKESSSFKNIGCHD 249
Query: 72 PRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-------VF 123
PRC + P+PP+ CK N C Y YGD ++ G + F + ++ + V
Sbjct: 250 PRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRVE 309
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----G 178
NV FGCG+ N G AG+LGLGRG +S SQL+ L + +C+
Sbjct: 310 NV--MFGCGH--WNRGLFH--GAAGLLGLGRGPLSFSSQLQ--SLYGHSFSYCLVDRNSD 361
Query: 179 QNGRGVLFLGDGK--VPSSGVAWTPML---QNSADLKHYILGPAELLYSGKSCGLKDLT- 232
N L G+ K + V +T ++ +N D +Y+ + ++ G+ + + T
Sbjct: 362 TNVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKS-IMVGGEVLKIPEETW 420
Query: 233 ---------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI---CWRG 280
I DSG + +YF Y+ I++D +K P K PI C+
Sbjct: 421 HLSPEGAGGTIVDSGTTLSYFAEPSYE-----IIKDAFVKKVKGYPVIKDFPILDPCYNV 475
Query: 281 PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGEN 339
++ E F+ L + P E Y + + + VCL IL + +
Sbjct: 476 SGVEKMELPE-FRILF------EDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSAL--- 525
Query: 340 NIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+IIG Q+ ++YD +K R+G+ P C
Sbjct: 526 SIIGNYQQQNFHILYDTKKSRLGYAPMKC 554
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 107/380 (28%), Positives = 151/380 (39%), Gaps = 64/380 (16%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF L VG P + DTGSD+ W+QC APC C + + P K+ +PC +
Sbjct: 145 YF-TRLGVGTPARYVYMVLDTGSDIVWIQC-APCIKCYSQTDPVFDPTKSRSFANIPCGS 202
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L +P C C Y++ YGDG ++G T+ L F V V L GC
Sbjct: 203 PLCRRLDYPG---CSTKKQICLYQVSYGDGSFTVGEFSTE--TLTFRGTRVGRVVL--GC 255
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR----GVLFL 187
G++ N G LG GR+S SQ+ + +C+G +
Sbjct: 256 GHD--NEGLFVGAAGLLGLGR--GRLSFPSQIGRR--FNSKFSYCLGDRSASSRPSSIVF 309
Query: 188 GDGKVPSSGVAWTPMLQN-SADLKHYILGPAELL--------YSGKSCGLKDLT------ 232
GD + S +TP+L N D +Y+ ELL SG S L L
Sbjct: 310 GDSAI-SRTTRFTPLLSNPKLDTFYYV----ELLGISVGGTRVSGISASLFKLDSTGNGG 364
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRD--LIGTP-LKLAPDDKTLPICWRGPFKALGQVT 289
+I DSG S T Y + +RD L+G LK AP+ C F G+
Sbjct: 365 VIIDSGTSVTRLTRAAY-----VALRDAFLVGASNLKRAPEFSLFDTC----FDLSGKTE 415
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
+ L F + +P YL+ + + C + +IIG I Q
Sbjct: 416 VKVPTVVLHF----RGADVPLPASNYLIPVDNSGSFCFAFAGTASGL----SIIGNIQQQ 467
Query: 349 DKMVIYDNEKQRIGWKPEDC 368
V+YD R+G+ P C
Sbjct: 468 GFRVVYDLATSRVGFAPRGC 487
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 99/373 (26%), Positives = 156/373 (41%), Gaps = 51/373 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ ++++ G PP+ DTGSDL WVQC PC C + ++ P K+ + C +
Sbjct: 90 YLIDISYGNPPQKSTAIVDTGSDLNWVQC-LPCKSCYETLSAKFDPSKSASYKTLGCGSN 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C L + + C Y+ YGDG S+ GAL TD + G + NV FGCG
Sbjct: 149 FCQDLPFQSCAA------SCQYDYMYGDGSSTSGALSTD--DVTIGTGKIPNV--AFGCG 198
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLGD 189
N G + G++GLG+G +S+VSQL G +C +G L++GD
Sbjct: 199 --NSNLGTFA--GAGGLVGLGKGPLSLVSQLG--GTATKKFSYCLVPLGSTKTSPLYIGD 252
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDSGA 239
+ + GVA+TPML N+ Y + GK+ T LI DSG
Sbjct: 253 STL-AGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGT 311
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD-KTLPICWRGPFKALGQVTEYFKPLALS 298
+ Y + +V+ + L P A L C F G + +
Sbjct: 312 TLTYLDVDAFNPMVAALKAAL---PYPEADGSFYGLEYC----FSTAGVANPTYPTVVFH 364
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
F + + P ++ + CL + A +I G I + ++++D
Sbjct: 365 FNGADVA---LAPDNTFIALDFEGTTCLAM-----ASSTGFSIFGNIQQLNHVIVHDLVN 416
Query: 359 QRIGWKPEDCNTL 371
+RIG+K +C T+
Sbjct: 417 KRIGFKSANCETI 429
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 78/236 (33%), Positives = 115/236 (48%), Gaps = 34/236 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG--CTKPPEKQYKPHK----NIVPCS 70
+ V +++G P + DTGSD++WVQC PC C + + P + + VPC+
Sbjct: 131 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCK-PCPSPPCYSQRDPLFDPTRSSSYSAVPCA 189
Query: 71 NPRCAALH-WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
C+ L + N C QC Y + YGDG ++ G +D L SN F
Sbjct: 190 AASCSQLALYSN--GCS--GGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNA---LKGFLF 242
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCI--GQNGRGVLF 186
GCG+ Q G + D G+LGLGR S+VSQ YG V +C+ QN G +
Sbjct: 243 GCGHAQQ--GLFAGVD--GLLGLGRQGQSLVSQASSTYG---GVFSYCLPPTQNSVGYIS 295
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---IFDSGA 239
LG G ++G + TP+L S D +YI ++ +G S G + L++ +F SGA
Sbjct: 296 LG-GPSSTAGFSTTPLLTASNDPTYYI-----VMLAGISVGGQPLSIDASVFASGA 345
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 97/383 (25%), Positives = 152/383 (39%), Gaps = 63/383 (16%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNPRC 74
V+L +G PP+ DTGS L+W+QC P K P + P +++PC++ C
Sbjct: 80 VSLPIGTPPQTQQMVLDTGSQLSWIQCKVP----PKTPPTAFDPLLSSSFSVLPCNHSLC 135
Query: 75 AAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+ P C N C Y Y DG + G LV + F + S PL GC
Sbjct: 136 KPRVPDYTLPTSCDQ-NRLCHYSYFYADGTYAEGNLVREKFTF---SSSQTTPPLILGCA 191
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGVL 185
+ DT G+LG+ GR+S S + + +C+ G + G
Sbjct: 192 TDSS--------DTQGILGMNLGRLSFSSLAK-----ISKFSYCVPPRRSQSGSSPTGSF 238
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLK-------HYILGPAELLYSGKSCGLKDLTL----- 233
+LG S+G + ++ + Y L + +GK +
Sbjct: 239 YLGPNP-SSAGFKYVNLMTYRQSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPS 297
Query: 234 -----IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA-PDDKTLPICWRGPFKALGQ 287
+ DSG + + Y ++ I++ L G LK +L +C+ G +G+
Sbjct: 298 GAGQTLIDSGTWFTFLVDEAYSKVKEEIVK-LAGPKLKKGYVYGGSLDMCFDGDAMVIGR 356
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVG-ENNIIGEIF 346
+ +A F N V +VV E L G CLGI G +G +NIIG
Sbjct: 357 M---IGNMAFEF---ENGVEIVVEREKMLADVGGGVQCLGI--GRSDLLGVASNIIGNFH 408
Query: 347 MQDKMVIYDNEKQRIGWKPEDCN 369
QD V +D +R+G+ DC+
Sbjct: 409 QQDLWVEFDLVGRRVGFGRTDCS 431
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 99/401 (24%), Positives = 163/401 (40%), Gaps = 69/401 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCD-----APCTGCTKPPE--KQYKPHKNIVPC 69
++++L G P + F F DTGS L W+ C + C + P+ + V C
Sbjct: 86 YSIDLEFGTPSQTFPFVLDTGSTLVWLPCSSHYLCSKCNSFSNTPKFIPKNSSSSKFVGC 145
Query: 70 SNPRCAALHWPN-PPRC----KHPNDQCD-----YEIEYGDGGSSIGALVTDL-FPL-RF 117
+NP+CA + P+ C K + C Y ++YG G ++ L +L FP ++
Sbjct: 146 TNPKCAWVFGPDVKSHCCRQDKAAFNNCSQTCPAYTVQYGLGSTAGFLLSENLNFPTKKY 205
Query: 118 SNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR----EYGLIRNVI 173
S+ GC +S AG+ G GRG S+ SQ+ Y L+ +
Sbjct: 206 SD-------FLLGCSV-------VSVYQPAGIAGFGRGEESLPSQMNLTRFSYCLLSHQF 251
Query: 174 GHCIGQNGRGVLFLG---DGKVPSSGVAWTPMLQNSADLK------HYILGPAELLYSGK 224
VL DGK ++GV++TP L+N K +Y + ++ K
Sbjct: 252 DDSATITSNLVLETASSRDGK--TNGVSYTPFLKNPTTKKNPAFGAYYYITLKRIVVGEK 309
Query: 225 SCGL----------KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL 274
+ D I DSG+++ + ++ + + + T + A L
Sbjct: 310 RVRVPRRLLEPNVDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVSYTRAREAEKQFGL 369
Query: 275 PICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILN--- 330
C+ A G T F L F R ++ +P Y + G+ +V CL I++
Sbjct: 370 SPCF---VLAGGAETASFPELRFEF---RGGAKMRLPVANYFSLVGKGDVACLTIVSDDV 423
Query: 331 -GSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
GS VG I+G Q+ V YD E +R G++ + C T
Sbjct: 424 AGSGGTVGPAVILGNYQQQNFYVEYDLENERFGFRSQSCQT 464
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 92/368 (25%), Positives = 137/368 (37%), Gaps = 43/368 (11%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI------------ 66
+ +G P F DTGSDL WV CD CT C + ++
Sbjct: 102 TTVQIGTPGVKFMVALDTGSDLFWVPCD--CTRCAASDSTAFASDFDLNVYNPNGSSTSK 159
Query: 67 -VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNG--SV 122
V C+N C + +C C Y + Y +S G LV D+ L + +
Sbjct: 160 KVTCNNSLCT-----HRSQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDL 214
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
+ FGCG Q + L G+ GLG +IS+ S L G + C G++G
Sbjct: 215 VEANVIFGCGQIQ-SGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGI 273
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
G + GD S TP N + + I + G + + T +FDSG S+
Sbjct: 274 GRISFGDKG--SFDQDETPFNLNPSHPTYNI--TVTQVRVGTTVIDVEFTALFDSGTSFT 329
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI--CWRGPFKALGQVTEYFKPLALSFT 300
Y Y + + + D +P C+ A + ++S T
Sbjct: 330 YLVDPTYTRLTESFHSQVQD---RRHRSDSRIPFEYCYDMSPDANTSLIP-----SVSLT 381
Query: 301 NRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
S V P + CL ++ +E NIIG+ FM V++D EK
Sbjct: 382 MGGGSHFAVYDPIIIISTQSELVYCLAVVKSAEL-----NIIGQNFMTGYRVVFDREKLV 436
Query: 361 IGWKPEDC 368
+GWK DC
Sbjct: 437 LGWKKFDC 444
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 145/368 (39%), Gaps = 50/368 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ + + +G P K DTGSD++WVQC PC+ C + + P + CS+
Sbjct: 133 YLITVRLGSPGKSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCSSA 191
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC- 131
CA L C + QC Y + YGDG S+ G +D L GS FGC
Sbjct: 192 ACAQLGQEG-NGCS--SSQCQYTVTYGDGSSTTGTYSSDTLAL----GSNAVRKFQFGCS 244
Query: 132 ----GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVL 185
G+N T G++GLG G S+VSQ G +C+ + G L
Sbjct: 245 NVESGFNDQ---------TDGLMGLGGGAQSLVSQ--TAGTFGAAFSYCLPATSSSSGFL 293
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----IFDSGASY 241
LG G +SG TPML++S Y + + G+ + I DSG
Sbjct: 294 TLGAG---TSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFSAGTIMDSGTVL 350
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
Y + S + P AP L C F GQ + +AL F+
Sbjct: 351 TRLPPTAYSALSSAFKAGMKQYP--SAPPSGILDTC----FDFSGQSSVSIPTVALVFS- 403
Query: 302 RRNSVRLVVPPEAYLVISGRKNVCLGI-LNGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
+ + + ++ + +CL N ++ +G IIG + + V+YD
Sbjct: 404 --GGAVVDIASDGIMLQTSNSILCLAFAANSDDSSLG---IIGNVQQRTFEVLYDVGGGA 458
Query: 361 IGWKPEDC 368
+G+K C
Sbjct: 459 VGFKAGAC 466
>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
Length = 486
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 102/414 (24%), Positives = 168/414 (40%), Gaps = 73/414 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVP-------- 68
+ ++L +G PP++ DTGSDLTWV C C + + Y+ +K +
Sbjct: 82 YLISLNIGTPPQVIQVLMDTGSDLTWVPCGNLSFDCMECDD--YRNNKLMATFSPSYSSS 139
Query: 69 -----CSNPRCAALHWPNPP-----------------RCKHPNDQCDYEIEYGDGGSSIG 106
C++P C +H + P C P Y YG GG G
Sbjct: 140 SYRASCASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAY--TYGAGGVVTG 197
Query: 107 ALVTDLFPLRFSN-GSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR 164
L D + S+ G +P FGC + + + G+ G GRG +S+VSQL
Sbjct: 198 ILTRDTLRVNGSSPGVAKEIPKFCFGCVGSAYR-------EPIGIAGFGRGTLSMVSQL- 249
Query: 165 EYGLIRNVIGHCI-------GQNGRGVLFLGDGKVPSS-GVAWTPMLQNSADLKHYILGP 216
G ++ HC N L +GD + S + +TPML + Y +G
Sbjct: 250 --GFLQKGFSHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGL 307
Query: 217 AELLYSGKSC-----------GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPL 265
+ S L + + DSG +Y + Y +++S I++ I P
Sbjct: 308 EAITVGNVSATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLS-ILQSTINYPR 366
Query: 266 KLAPDDKT-LPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKN 323
+ +T +C++ P +T +++F + N+V LV+P + +S N
Sbjct: 367 DTGMEMQTGFDLCYKVPRPNNNTLTSDDLLPSITF-HFLNNVSLVLPQGNHFYPVSAPGN 425
Query: 324 ----VCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTLLS 373
CL + + + G + G Q+ V+YD EK+RIG++P DC + S
Sbjct: 426 PAVVKCLMFQSTDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDCASAAS 479
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 168/379 (44%), Gaps = 47/379 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ +++ VG PP+ F DTGSDL W+QC APC C + + P ++N+ C +
Sbjct: 149 YLIDVYVGTPPRRFRMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASSSYRNVT-CGD 206
Query: 72 PRCAALHWPNPPR-CKHP-NDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVP-L 127
RC + P PR C+ P D C Y YGD ++ G L + F + + G+ V +
Sbjct: 207 QRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGV 266
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRGV-- 184
FGCG+ N G AG+LGLGRG +S SQLR YG + +C+ ++G
Sbjct: 267 VFGCGH--RNRGLFH--GAAGLLGLGRGPLSFASQLRAVYG---HTFSYCLVEHGSDAGS 319
Query: 185 --------LFLGDGKVPSSGVAWTPMLQNS---ADLKHYILGPAELLYSGKSCGL-KDLT 232
L L ++ + A T ++ LK ++G L S + + KD +
Sbjct: 320 KVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVGKDGS 379
Query: 233 --LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG + +YF YQ ++ DL+ L PD L C+ +V E
Sbjct: 380 GGTIIDSGTTLSYFVEPAYQ-VIRQAFVDLMSRLYPLIPDFPVLNPCYNVSGVERPEVPE 438
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
L+L F + P E Y V + +CL + + +IIG Q+
Sbjct: 439 ----LSLLFAD---GAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGM---SIIGNFQQQN 488
Query: 350 KMVIYDNEKQRIGWKPEDC 368
V+YD + R+G+ P C
Sbjct: 489 FHVVYDLQNNRLGFAPRRC 507
>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
Length = 454
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 99/394 (25%), Positives = 152/394 (38%), Gaps = 54/394 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK------QYKPHKNIVPCS 70
V + VG PP+ DTGS+L+W++C+ T PP+ CS
Sbjct: 62 LTVPVAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCS 121
Query: 71 NPRCAALHW-----PNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+P C W P PP C P++ C + Y D S+ G L D F L G
Sbjct: 122 SPEC---QWRGRDLPVPPFCAGPPSNSCRVSLSYADASSADGILAADTFLL----GGAPP 174
Query: 125 VPLTFGCGYNQHNPGPLSPPDT---AGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-N 180
V FGC + + + D+ G+LG+ RG +S V+Q +R +CI +
Sbjct: 175 VRALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQT---ATLR--FAYCIAPGD 229
Query: 181 GRGVLFL-GDGKVPSSGVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGL 228
G G+L L GDG + + +TP++Q S L ++ I A LL KS
Sbjct: 230 GPGLLVLGGDGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLA 289
Query: 229 KDLT----LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD----KTLPICWRG 280
D T + DSG + + + Y + + L D C+R
Sbjct: 290 PDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAFDACFRA 349
Query: 281 PFKALGQVTEYFKPLALSFTNRRNSV---RLV--VPPEAYLVISGRKNVCLGILNGSEAE 335
+ ++ + L +V +L+ VP E CL N A
Sbjct: 350 SEARVAAASQMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDMAG 409
Query: 336 VGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+ +IG Q+ V YD + R+G+ P C+
Sbjct: 410 M-SAYVIGHHHQQNVWVEYDLQNGRVGFAPARCD 442
>gi|413936885|gb|AFW71436.1| hypothetical protein ZEAMMB73_738128, partial [Zea mays]
Length = 320
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 62/202 (30%), Positives = 87/202 (43%), Gaps = 16/202 (7%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN--IVPC 69
+ + +G PPK + DTGSD+ WV C C GC QY P + V C
Sbjct: 84 YYTRIEIGSPPKGYYVQVDTGSDILWVNC-IRCDGCPTRSGLGIELTQYDPAGSGTTVGC 142
Query: 70 SNPRCAALHWPN-PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SVFN 124
C A PP C + C + I YGDG ++ G VTD +G + N
Sbjct: 143 EQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQTTTSN 202
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGRG 183
+TFGCG S G+LG G+ S++SQL +R + HC+ G G
Sbjct: 203 ASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGG 262
Query: 184 VLFLGDGKVPSSGVAWTPMLQN 205
+ +G+ P V TP++ N
Sbjct: 263 IFAIGNVVQPK--VKTTPLVPN 282
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 96/373 (25%), Positives = 152/373 (40%), Gaps = 50/373 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +N+++G PP DTGSDL W QC PC C + + P + V CS+
Sbjct: 94 YLMNISLGTPPFPIMAIADTGSDLLWTQC-KPCDDCYTQVDPLFDPKASSTYKDVSCSSS 152
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF--- 129
+C AL N C ++ C Y YGD + G + D L GS P+
Sbjct: 153 QCTALE--NQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTL----GSTDTRPVQLKNI 206
Query: 130 --GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRG 183
GCG+N N G + + V G +S+++QL + I +C+ +N R
Sbjct: 207 IIGCGHN--NAGTFNKKGSGIVGLGGGA-VSLITQLGDS--IDGKFSYCLVPLTSENDRT 261
Query: 184 --VLFLGDGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIF 235
+ F + V +GV TP++ S + +Y+ +G E+ Y G G + +I
Sbjct: 262 SKINFGTNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGNIII 321
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + + Y E+ + I K P L +C+ T K
Sbjct: 322 DSGTTLTLLPTEFYSELEDAVASS-IDAEKKQDP-QTGLSLCYSA--------TGDLKVP 371
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
A++ V L P ++ IS VC GS + +I G + + +V YD
Sbjct: 372 AITMHFDGADVNL-KPSNCFVQIS-EDLVCFA-FRGSPSF----SIYGNVAQMNFLVGYD 424
Query: 356 NEKQRIGWKPEDC 368
+ + +KP DC
Sbjct: 425 TVSKTVSFKPTDC 437
>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
Length = 297
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 65/204 (31%), Positives = 93/204 (45%), Gaps = 23/204 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN----I 66
YF + +G P K + DTGSD+ WV C C GC + Y P + +
Sbjct: 90 YF-TRIGIGTPAKRYYVQVDTGSDILWVNC-VSCDGCPRKSNLGIELTMYDPRGSQSGEL 147
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
V C C A + P C + C+Y I YGDG S+ G VTD +G +
Sbjct: 148 VTCDQQFCVANYGGVLPSCTSTS-PCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTP 206
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N ++FGCG G L + A G+LG G+ S++SQL G +R + HC+
Sbjct: 207 ANASVSFGCGAKLG--GDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTV 264
Query: 180 NGRGVLFLGDGKVPSSGVAWTPML 203
NG G+ +G+ P V TP++
Sbjct: 265 NGGGIFAIGNVVQPK--VKTTPLV 286
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 90/296 (30%), Positives = 116/296 (39%), Gaps = 48/296 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V+L VG PP+ DTGSDL W QC APC C P + +PC P
Sbjct: 86 YLVHLAVGTPPRPVALTLDTGSDLVWTQC-APCRDCFDQGIPLLDPAASSTYAALPCGAP 144
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL-----RFSNGSV-FNVP 126
RC AL P C Y YGD ++G + TD F R +GS+
Sbjct: 145 RCRAL-----PFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRR 199
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ---NGRG 183
LTFGCG+ N G +T G+ G GRGR S+ SQL +C +
Sbjct: 200 LTFGCGH--FNKGVFQSNET-GIAGFGRGRWSLPSQLNA-----TSFSYCFTSMFDSKSS 251
Query: 184 VLFLGDGKVP------SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL------ 231
++ LG S V TP+ +N + Y L G S G L
Sbjct: 252 IVTLGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLS-----LKGISVGKTRLPVPETK 306
Query: 232 --TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
+ I DSGAS VY E V +G P + L +C+ P AL
Sbjct: 307 FRSTIIDSGASITTLPEEVY-EAVKAEFAAQVGLPPS-GVEGSALDVCFALPVSAL 360
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 93/350 (26%), Positives = 147/350 (42%), Gaps = 45/350 (12%)
Query: 35 DTGSDLTWVQC-DAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALHWPNPPRCKHPN 89
DT SD+ WVQC P C + Y P K+ +PC +P C L C
Sbjct: 174 DTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSPTT 233
Query: 90 DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGV 149
D+C Y + YGDG ++ G VTD + + ++ FGC + G S + AG+
Sbjct: 234 DECKYIVNYGDGKATTGTYVTDTLTM---SPTIVVKDFRFGCSHAVR--GSFSNQN-AGI 287
Query: 150 LGLGRGRISIVSQLRE-YGLIRNVIGHCIGQ-NGRGVLFLGDGKVPSSGVAWTPMLQNSA 207
L LG GR S++ Q + YG N +CI + + G L LG S ++TP+++N
Sbjct: 288 LALGGGRGSLLEQTADAYG---NAFSYCIPKPSSAGFLSLGGPVEASLKFSYTPLIKNKH 344
Query: 208 DLKHYILGPAELLYSGKSCGLKDLTL----IFDSGASYAYFTSRVYQEIVSLIMRDLIGT 263
YI+ ++ +GK + + DSGA +VY + + R +
Sbjct: 345 APTFYIVHLEAIIVAGKQLAVPPTAFATGAVMDSGAVVTQLPPQVYAALRA-AFRSAMAA 403
Query: 264 PLKLAPDDKTLPICW---RGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISG 320
LA + L C+ R P + +V+ F L + P A +++ G
Sbjct: 404 YGPLAAPVRNLDTCYDFTRFPDVKVPKVSLVFA----------GGATLDLEP-ASIILDG 452
Query: 321 RKNVCLGILNGSEAEVGENNI--IGEIFMQDKMVIYDNEKQRIGWKPEDC 368
CL A GE ++ IG + Q V+YD ++G++ C
Sbjct: 453 ----CLAF----AATPGEESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494
>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 101/390 (25%), Positives = 162/390 (41%), Gaps = 61/390 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP---------HKNIV 67
V+L +G PP+ D DTGS L+W+QC PP + K +++
Sbjct: 66 LVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKIKKRLPPLPKPKTTSFDPSLSSSFSLL 125
Query: 68 PCSNPRCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV 125
PC++P C + P C N C Y Y DG + G LV + F + S+
Sbjct: 126 PCNHPICKPRIPDFTLPTSCDQ-NRLCHYSYFYADGTLAEGNLVREKFTF---SKSLSTP 181
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNG 181
P+ GC + + G+LG+ RGR+S +SQ + + +C+ G N
Sbjct: 182 PVILGCAQ--------ASTENRGILGMNRGRLSFISQAK-----ISKFSYCVPSRTGSNP 228
Query: 182 RGVLFLGDGKVPSSGVAWTPML-----QNSADLK--HYILGPAELLYSGK---------- 224
G+ +LGD SS + ML Q+S +L Y L + +GK
Sbjct: 229 TGLFYLGDNP-NSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNVPPAAFK 287
Query: 225 -SCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA-PDDKTLPICWRGPF 282
G T+I DSG+ Y Y+++ ++R L+G +K +C+
Sbjct: 288 PDAGGSGQTMI-DSGSDLTYLVDEAYEKVKEEVVR-LVGAMMKKGYVYADVADMCFDAGV 345
Query: 283 KALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNI 341
A +V ++ F N V + V ++ K V C+GI +G +NI
Sbjct: 346 TA--EVGRRIGGISFEFD---NGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIG-SNI 399
Query: 342 IGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
IG + Q+ V YD +R+G+ +C+ L
Sbjct: 400 IGTVHQQNMWVEYDLANKRVGFGGAECSRL 429
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 153/374 (40%), Gaps = 45/374 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ ++L++G PP DTGSDL W QC PC C K + P + + C
Sbjct: 93 YLMSLSLGTPPFEILAIADTGSDLIWTQC-TPCDKCYKQIAPLFDPKSSKTYRDLSCDTR 151
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
+C L + + C Y YGD + G L D L +NG P T GC
Sbjct: 152 QCQNLGESSSCSSEQ---LCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIGC 208
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGV 184
G + N G D +G++GLG G +S++SQ+ + +C+ N +
Sbjct: 209 G--RRNNGTFDKKD-SGIIGLGGGPMSLISQMGSS--VGGKFSYCLVPFSSESAGNSSKL 263
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIFDSG 238
F + V SGV TP++ + D +Y+ +G ++ + G S G + +I DSG
Sbjct: 264 HFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIEFGGSSFGGSEGNIIIDSG 323
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR-GPFKALGQVTEYFKPLAL 297
S F + E + + +I + L C+R P + +T +F
Sbjct: 324 TSLTLFPVNFFTEFATAVENAVINGE-RTQDASGLLSHCYRPTPDLKVPVITAHF----- 377
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNE 357
N +V+ ++ +CL N +++ I G + + ++ YD +
Sbjct: 378 ------NGADVVLQTLNTFILISDDVLCLA-FNSTQSGA----IFGNVAQMNFLIGYDIQ 426
Query: 358 KQRIGWKPEDCNTL 371
+ + +KP DC L
Sbjct: 427 GKSVSFKPTDCTQL 440
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 98/389 (25%), Positives = 151/389 (38%), Gaps = 52/389 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA--------PCTGCTKPPE--KQYKPHKNI 66
+ V++ G PP+ DTGSDL W+QC P C++ P ++
Sbjct: 53 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSV 112
Query: 67 VPCSNPRCAALHWP--NPPRCKHPND-QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
VPCS +C + P + P C C Y +Y DG S+ G L D + SNG+
Sbjct: 113 VPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTATI--SNGTSG 170
Query: 124 NVP---LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--- 177
+ FGCG ++ G S T GV+GLG+G++S +Q L +C+
Sbjct: 171 GAAVRGVAFGCG-TRNQGGSFS--GTGGVIGLGQGQLSFPAQ--SGSLFAQTFSYCLLDL 225
Query: 178 --GQNGRGVLFLGDGKVP-SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCG------- 227
G+ GR FL G+ + A+TP++ N Y +G + +
Sbjct: 226 EGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWA 285
Query: 228 ---LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT----LPICWR- 279
L + + DSG++ Y Y +VS + L P T L +C+
Sbjct: 286 IDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASV---HLPRIPSSATFFQGLELCYNV 342
Query: 280 GPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGEN 339
+ F L + F + L +P YLV CL I
Sbjct: 343 SSSSSSAPANGGFPRLTIDFA---QGLSLELPTGNYLVDVADDVKCLAIR--PTLSPFAF 397
Query: 340 NIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
N++G + Q V +D RIG+ +C
Sbjct: 398 NVLGNLMQQGYHVEFDRASARIGFARTEC 426
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 103/379 (27%), Positives = 147/379 (38%), Gaps = 62/379 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V G P K DTGSDLTW+QC PC C + ++P ++ +PC +
Sbjct: 137 YIVTAGFGTPAKNSLLIIDTGSDLTWIQCK-PCADCYSQVDAIFEPKQSSSYKTLPCLSA 195
Query: 73 RCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L NP C C YEI YGDG SS G + L GS FG
Sbjct: 196 TCTELITSESNPTPCLLGG--CVYEINYGDGSSSQGDFSQETLTL----GSDSFQNFAFG 249
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCI----GQNGRGVL 185
CG+ N G ++G+LGLG+ +S SQ + +YG +C+ G
Sbjct: 250 CGHT--NTGLFK--GSSGLLGLGQNSLSFPSQSKSKYG---GQFAYCLPDFGSSTSTGSF 302
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGAS 240
+G G +P+S V +TP++ N Y +G + G + L I DSG
Sbjct: 303 SVGKGSIPASAV-FTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGSTIVDSGTV 361
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK------- 293
+ Y LK + KT + PF L + +
Sbjct: 362 ITRLLPQAYNA-------------LKTSFRSKTRDLPSAKPFSILDTCYDLSRHSQVRIP 408
Query: 294 PLALSFTNRRN----SVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
+ F N + V ++VP V +G VCL + S+ + NIIG Q
Sbjct: 409 TITFHFQNNADVAVSDVGILVP-----VQNGGSQVCLAFASASQMD--GFNIIGNFQQQR 461
Query: 350 KMVIYDNEKQRIGWKPEDC 368
V +D RIG+ C
Sbjct: 462 MRVAFDTGAGRIGFASGSC 480
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 99/386 (25%), Positives = 155/386 (40%), Gaps = 55/386 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
V+LTVG PP+ DTGS+L+W+ C T+ + P + VPC +P
Sbjct: 69 LTVSLTVGSPPQNVTMVLDTGSELSWLHCKK-----TQFLNSVFNPLSSKTYSKVPCLSP 123
Query: 73 RCA--ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C P C C + Y D S G L + F L GS+ FG
Sbjct: 124 TCKTRTRDLTIPVSCD-ATKLCHVIVSYADATSIEGNLAFETFRL----GSLTKPATIFG 178
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGD 189
C + + T G++G+ RG +S V+Q+ G + +CI G + GVL LG+
Sbjct: 179 CMDSGFSSNSEEDSKTTGLIGMNRGSLSFVNQM---GYPK--FSYCISGFDSAGVLLLGN 233
Query: 190 GKVP-SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--------------- 233
P +++TP++Q S L ++ + G K L+L
Sbjct: 234 ASFPWLKPLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQT 293
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK-----TLPICW-----RGPFK 283
+ DSG + + VY + + + G LK+ DD + +C+ R +
Sbjct: 294 MVDSGTQFTFLLGPVYTALKNEFLSQTRGI-LKVLNDDNFVFQGAMDLCYLLDSSRPNLQ 352
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIG 343
L V+ F+ +S + R R VP E + GR +V S+ E +IG
Sbjct: 353 NLPVVSLMFQGAEMSVSGERLLYR--VPGE----VRGRDSVWCFTFGNSDLLGVEAFVIG 406
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDCN 369
Q+ + +D EK RIG C+
Sbjct: 407 HHHQQNVWMEFDLEKSRIGLADVRCD 432
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 147/377 (38%), Gaps = 72/377 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ V+L +G PP+ DTGSDL W QC PC C + P ++ C +
Sbjct: 89 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLSLTSCDST 147
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C L + PR +D F + SV V FGCG
Sbjct: 148 LCQGLPVASLPR-------------------------SDKFTFVGAGASVPGV--AFGCG 180
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
N G +T G+ G GRG +S+ SQL+ G + G VL +
Sbjct: 181 L--FNNGVFKSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTTITGAIPSTVLLDLPADL 236
Query: 193 PSSG---VAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDLT--LIFDSGAS 240
S+G V TP++QN A+ LK +G L LK+ T I DSG +
Sbjct: 237 FSNGQGAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGTA 296
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGPFKALGQVTEYFKPLAL 297
+RVY+ ++RD +KL + T P C P +A Y L L
Sbjct: 297 MTSLPTRVYR-----LVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRA----KPYVPKLVL 347
Query: 298 SFTNRRNSVRLVVPPEAYLVI---SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
F + +P E Y+ +G +CL I+ G GE IG Q+ V+Y
Sbjct: 348 HF----EGATMDLPRENYVFEVEDAGSSILCLAIIEG-----GEVTTIGNFQQQNMHVLY 398
Query: 355 DNEKQRIGWKPEDCNTL 371
D + ++ + P C+ L
Sbjct: 399 DLQNSKLSFVPAQCDKL 415
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 89/367 (24%), Positives = 144/367 (39%), Gaps = 43/367 (11%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
++ + + L VG PP + + DTGSDL W QC PCT C QY P I SN
Sbjct: 58 YNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQC-MPCTNC----YSQYAP---IFDPSNSS 109
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCG 132
RC + C Y+I Y D S G L T+ + ++G F +P T GCG
Sbjct: 110 TF-----KEKRCN--GNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCG 162
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG-DGK 191
+N P +G++GL G S+++Q+ G ++ +C G + G +
Sbjct: 163 HNS----SWFKPTFSGMVGLSWGPSSLITQMG--GEYPGLMSYCFASQGTSKINFGTNAI 216
Query: 192 VPSSGVAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
V GV T M +A +L +G + G + + +I DSG + YF
Sbjct: 217 VAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYF 276
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN 304
Y +V + + P + + +T +F A ++ N
Sbjct: 277 PVS-YCNLVREAVDHYVTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYN 335
Query: 305 SVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
Y+ R CL I+ + ++ I G + +V YD+ + +
Sbjct: 336 ---------MYIETITRGTFCLAIICNNPP---QDAIFGNRAQNNFLVGYDSSSLLVSFS 383
Query: 365 PEDCNTL 371
P +C+ L
Sbjct: 384 PTNCSAL 390
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 96/386 (24%), Positives = 158/386 (40%), Gaps = 65/386 (16%)
Query: 34 FDTGSDLTWVQC--DAPCTGCTKPPEK------QYKPHKNIVPCSNPRCAALHWPNPP-- 83
DTGSDL WV C + C C + + ++V C++ C L+ N
Sbjct: 1 MDTGSDLVWVPCTRNYSCINCPEDSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNTELL 60
Query: 84 --RCKHPNDQCD-----YEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
C C Y I+YG G S+ G L+T+ L NG F G +
Sbjct: 61 CQSCAGSLKNCSETCPPYGIQYGRG-STAGLLLTETLNLPLENGEGARAITHFAVGCS-- 117
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG------QNGRGVLFLGDG 190
+S +G+ G GRG +S+ SQL E+ + ++ +C+ +N + ++ LGD
Sbjct: 118 ---IVSSQQPSGIAGFGRGALSMPSQLGEH-IGKDRFAYCLQSHRFDEENKKSLMVLGDK 173
Query: 191 KVPSS-GVAWTPMLQNSAD------LKHYILGPAELLYSGKSCGLKDL------------ 231
+P++ + +TP L NS +Y +G + GK LK L
Sbjct: 174 ALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKR--LKQLPSKLLRFDTKGN 231
Query: 232 -TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT-LPICWRGPFKALGQVT 289
I DSG ++ F+ +++ I + IG +DKT + +C+ G
Sbjct: 232 GGTIIDSGTTFTVFSDEIFKHIAAGFASQ-IGYRRAGEVEDKTGMGLCY----DVTGLEN 286
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYL-VISGRKNVCLGILNGS---EAEVGENNIIGEI 345
A F + +V+P Y S ++CL +++ E + G I+G
Sbjct: 287 IVLPEFAFHF---KGGSDMVLPVANYFSYFSSFDSICLTMISSRGLLEVDSGPAVILGND 343
Query: 346 FMQDKMVIYDNEKQRIGWKPEDCNTL 371
QD ++YD EK R+G+ + C T
Sbjct: 344 QQQDFYLLYDREKNRLGFTQQTCKTF 369
>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 528
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 89/374 (23%), Positives = 160/374 (42%), Gaps = 41/374 (10%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP----------PEKQYKPHK 64
S + N++VG PP F DTGSDL W+ C+ T C + P Y P+
Sbjct: 100 SLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTT-CIRDLEDIGVPQSVPLNLYTPNA 158
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+ S+ RC+ +C P+ C Y+I Y + + G L+ D+ L + ++
Sbjct: 159 STT-SSSIRCSDKRCFGSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENLTP 217
Query: 125 VP--LTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
V +T GCG Q G ++ GVLGLG S+ S L + + N C G+
Sbjct: 218 VKANVTLGCG--QKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGRVI 275
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASY 241
V + G + TP + + A Y + + + +G ++ L FD+G+S+
Sbjct: 276 GNVGRISFGDRGYTDQEETPFI-SVAPSTAYGVNISGVSVAGDPVDIR-LFAKFDTGSSF 333
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK-----ALGQVTEYFKPLA 296
+ Y +++ +L+ +D+ P+ PF+ + T F +
Sbjct: 334 THLREPAYG-VLTKSFDELV--------EDRRRPVDPELPFEFCYDLSPNATTIQFPLVE 384
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNV--CLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
++F ++++ + + NV CLG+L ++ N+IG+ F+ +++
Sbjct: 385 MTFI---GGSKIILNNPFFTARTQEGNVMYCLGVLKSVGLKI---NVIGQNFVAGYRIVF 438
Query: 355 DNEKQRIGWKPEDC 368
D E+ +GWK C
Sbjct: 439 DRERMILGWKQSLC 452
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 98/395 (24%), Positives = 159/395 (40%), Gaps = 67/395 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP-----CTGC-------TKPPEKQYKPHK 64
++V ++G PP+ DTGS L W C P C C TK P
Sbjct: 74 YSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSS 133
Query: 65 NI--VPCSNPRC-----AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRF 117
+ +PC +P+C + L+ RC + Y +EYG GS+ G LV+D+ L
Sbjct: 134 TVQSLPCRSPKCNWVFGSDLNCSTTKRCPY------YGLEYGL-GSTTGQLVSDVLGLSK 186
Query: 118 SNGSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
N +P FGC +S G+ G GRG SI +QL ++ H
Sbjct: 187 LN----RIPDFLFGCSL-------VSNRQPEGIAGFGRGLASIPAQLGLTKFSYCLVSHR 235
Query: 177 IG---QNGRGVLFLG--DGKVPSSGVAWTPMLQNSA---DLKHYILGPAELLYSGKSCGL 228
Q+G VL G ++GVA+ P ++ A ++Y + +++L GK +
Sbjct: 236 FDDTPQSGDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPI 295
Query: 229 K----------DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIG-TPLKLAPDDKTLPIC 277
D +I DSG+++ + ++ + + + + K D L C
Sbjct: 296 PPRYLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPC 355
Query: 278 WRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSE---A 334
+ GQ L SF N + +P Y + VC+ +L + +
Sbjct: 356 ----YNITGQSEVDVPKLTFSFKGGAN---MDLPLTDYFSLVTDGVVCMTVLTDPDEPGS 408
Query: 335 EVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
G I+G Q+ + YD +KQR G+KP+ C+
Sbjct: 409 TTGPAIILGNYQQQNFYIEYDLKKQRFGFKPQQCD 443
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 95/380 (25%), Positives = 149/380 (39%), Gaps = 52/380 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQC-DAPCTGCTKPP--EKQYKPHKNIVPCSNPR 73
V L VG PP+ DTGS+L+W+ C +P G P Y P VPCS+P
Sbjct: 65 LTVTLAVGDPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSP----VPCSSPI 120
Query: 74 C--AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C P P C C I Y D S G L + F + GSV FGC
Sbjct: 121 CRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVI----GSVTRPGTLFGC 176
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDG 190
+ + + G++G+ RG +S V+QL G + +CI G + L LGD
Sbjct: 177 MDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQL---GFSK--FSYCISGSDSSVFLLLGDA 231
Query: 191 KVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------------I 234
G + +TP++ S L ++ + G G K L+L +
Sbjct: 232 SYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTM 291
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-----DKTLPICW------RGPFK 283
DSG + + VY + + + + L+L D T+ +C+ R F
Sbjct: 292 VDSGTQFTFLMGPVYTALKNEFITQ-TKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFS 350
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIG 343
L V+ F+ +S + ++ R+ G++ V S+ E +IG
Sbjct: 351 GLPMVSLMFRGAEMSVSGQKLLYRVNGAGS-----EGKEEVYCFTFGNSDLLGIEAFVIG 405
Query: 344 EIFMQDKMVIYDNEKQRIGW 363
Q+ + +D K R+G+
Sbjct: 406 HHHQQNVWMEFDLAKSRVGF 425
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 115/275 (41%), Gaps = 39/275 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKP-PEKQYKPHKNIVPCSNPR 73
V+LTVG PP+ DTGS+L+W+ C T P Y P +PCS+P
Sbjct: 1000 LTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPLSSSSYSP----IPCSSPI 1055
Query: 74 C--AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C PNP C P C + Y D S G L +D F + GS FGC
Sbjct: 1056 CRTRTRDLPNPVTCD-PKKLCHAIVSYADASSLEGNLASDNFRI----GSSALPGTLFGC 1110
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDG 190
+ + T G++G+ RG +S V+QL GL + +CI G++ GVL GD
Sbjct: 1111 MDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQL---GLPK--FSYCISGRDSSGVLLFGDL 1165
Query: 191 KVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------------I 234
+ G + +TP++Q S L ++ + G G K L L +
Sbjct: 1166 HLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTM 1225
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAP 269
DSG + + VY + + + G LAP
Sbjct: 1226 VDSGTQFTFLLGPVYTALRNEFLEQTKGV---LAP 1257
>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
Length = 671
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/263 (31%), Positives = 113/263 (42%), Gaps = 42/263 (15%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPE------KQYKPHKN 65
F ++AV + +G P F DTGSDL WV CD C C + P Y P ++
Sbjct: 33 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQS 89
Query: 66 I----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRF--S 118
VPCS+ C + C+ ++ C Y I+Y D SS G LV D+ L +
Sbjct: 90 TTSRKVPCSSNLCDLQNA-----CRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSA 144
Query: 119 NGSVFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+ P+ FGCG Q G +P G+LGLG S+ S L GL N C
Sbjct: 145 QSKIVTAPIMFGCGQVQTGSFLGSAAP---NGLLGLGMDSKSVPSLLASKGLAANSFSMC 201
Query: 177 IGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGP-AELLYSGKSCGLK----DL 231
G +G G + GD SS TP L Y P + +G + G K +
Sbjct: 202 FGDDGHGRINFGD--TGSSDQKETP-------LNVYKQNPYYNITITGITVGSKSISTEF 252
Query: 232 TLIFDSGASYAYFTSRVYQEIVS 254
+ I DSG S+ + +Y +I S
Sbjct: 253 SAIVDSGTSFTALSDPMYTQITS 275
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 96/378 (25%), Positives = 153/378 (40%), Gaps = 50/378 (13%)
Query: 26 PPKLFDFDFDTGSDLTWVQCDA-----PCTGCTKPPEKQYKPHKNIVPCSNPRC--AALH 78
PP+ DTGS+L+W++C+ P Y P +PCS+P C
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSNPNPVNNFDPTRSSSYSP----IPCSSPTCRTRTRD 137
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNP 138
+ P C + C + Y D SS G L ++F F N S + L FGC +
Sbjct: 138 FLIPASCDS-DKLCHATLSYADASSSEGNLAAEIF--HFGN-STNDSNLIFGCMGSVSGS 193
Query: 139 GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR--GVLFLGDGKVP-SS 195
P T G+LG+ RG +S +SQ+ G + +CI G L LGD +
Sbjct: 194 DPEEDTKTTGLLGMNRGSLSFISQM---GFPK--FSYCISGTDDFPGFLLLGDSNFTWLT 248
Query: 196 GVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGLKDLT----LIFDSGAS 240
+ +TP+++ S L ++ I +LL KS L D T + DSG
Sbjct: 249 PLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTMVDSGTQ 308
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK-----TLPICWR-GPFKALGQVTEYFKP 294
+ + VY + S + G L + D + T+ +C+R PF+ +
Sbjct: 309 FTFLLGPVYTALRSDFLNQTNGI-LTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLPT 367
Query: 295 LALSFTNRRNSVRLVVPPEAYLV---ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
++L F + + P Y V +G +V S+ E +IG Q+
Sbjct: 368 VSLVFEGAE--IAVSGQPLLYRVPHLTAGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMW 425
Query: 352 VIYDNEKQRIGWKPEDCN 369
+ +D ++ RIG P C+
Sbjct: 426 IEFDLQRSRIGLAPVQCD 443
>gi|357168204|ref|XP_003581534.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Brachypodium distachyon]
Length = 436
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 95/385 (24%), Positives = 158/385 (41%), Gaps = 74/385 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKNIVPCS- 70
+ + + +G P + + F TGSD+ WV C + CT C P + Y P +
Sbjct: 76 YCITVKLGNPSRHYYLAFHTGSDVMWVPC-SSCTDCPTPDDIGFSLDLYDPKNSSTSSEI 134
Query: 71 ---NPRCAALHWPNPPRCKHPN---DQCDYEIEYGDGG-SSIGALVTD--LFPLRFSNGS 121
+ RCA C + DQC Y Y DG ++ G V+D F + N S
Sbjct: 135 SCSDDRCADALKTGHAICHTSHSSGDQCGYNQIYADGVLATTGYYVSDDIHFDIFMGNES 194
Query: 122 VFN--VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-- 177
+ + FGC ++ G L GV+G G+ S++SQL G + + C+
Sbjct: 195 FASSSASVIFGC--SKSRSGHLQAD---GVIGFGKDAPSLISQLNSQG-VSHAFSRCLDD 248
Query: 178 GQNGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG------PAELLYSGKSCG 227
+G GVL L + P G+ +T ++ + + ++K + + L + + G
Sbjct: 249 SDDGGGVLILDEVGEP--GLEFTSLVASRPCYNLNMKSIAVNNQNVPIDSSLFTTSSTQG 306
Query: 228 LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQ 287
DSG S AYF VY ++ I+ T F +
Sbjct: 307 T-----FLDSGTSLAYFPDGVYDPVIRAILFIYFSTR----------------SFSSFPT 345
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN----VCLGILNGSEAEVGENNIIG 343
VT YF+ A + V PE YL+ G + +C+ SE + + I+G
Sbjct: 346 VTXYFEGGA----------AMKVGPENYLLRRGSYDNDSYMCIA-FQRSEGDYKQTTILG 394
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDC 368
++ + DK+ +Y+ +K +IGW +C
Sbjct: 395 DLILHDKIFVYNLKKMQIGWVNYNC 419
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 99/391 (25%), Positives = 166/391 (42%), Gaps = 59/391 (15%)
Query: 18 AVNLTVGKPPKLFDFDFDTGSDLTWVQC--DAPCTGCT---KPPEK------QYKPHKNI 66
+++L+ G PP+ F DTGSD+ W C D CT C+ P+K + I
Sbjct: 79 SISLSFGTPPQKLSFLVDTGSDVVWAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKI 138
Query: 67 VPCSNPRCAALHWP----NPPRC----KHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS 118
+ C NP+C + ++P PRC KH + C Y +YG G SS L+ + L+F
Sbjct: 139 LDCRNPKCVSTYFPYVHLGCPRCNGNSKHCSYACPYSTQYGTGASSGYFLLEN---LKFP 195
Query: 119 NGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL--REYGLIRNVIGHC 176
++ N L GC + + G GR S+ Q+ +++ N +
Sbjct: 196 RKTIRNFLL--GC-----TTSAARELSSDALAGFGRSMFSLPIQMGVKKFAYCLNSHDYD 248
Query: 177 IGQN-GRGVLFLGDGKVPSSGVAWTPMLQN-SADLKHYILGPAELLYSGKSCGLKDLTL- 233
+N G+ +L DGK + G+++TP L++ A +Y LG ++ K + L
Sbjct: 249 DTRNSGKLILDYRDGK--TKGLSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSKYLA 306
Query: 234 ---------IFDSGASYA-YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT-LPICWRGPF 282
I DSG A Y T V++ + + + + + L + +T L C+
Sbjct: 307 PGSDGRSGVIIDSGYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEAETQTGLTPCYN--- 363
Query: 283 KALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGIL-----NGSEAEVG 337
G + PL F R +VVP + Y IS ++++ ++ N E
Sbjct: 364 -FTGHKSIKIPPLIYQF---RGGANMVVPGKNYFGISPQESLACFLMDTNGTNALEITPD 419
Query: 338 ENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+ I+G D V YD + R G++ + C
Sbjct: 420 PSIILGNSQHVDYYVEYDLKNDRFGFRRQTC 450
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 144/369 (39%), Gaps = 47/369 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSN----P 72
+ + +G P K + DTGS LTW+QC C + + P + S P
Sbjct: 121 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSCSAP 180
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+C AL NP C N C Y+ YGD S+G L D + F + SV N +G
Sbjct: 181 QCDALTTATLNPSTCSTSN-VCIYQASYGDSSFSVGYLSKDT--VSFGSTSVPN--FYYG 235
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG Q N G +AG++GL R ++S++ QL + +C+ + +L G
Sbjct: 236 CG--QDNEGLFG--QSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSGYLSIG 289
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASYAYFT 245
++TPM ++S D Y + + +GK + L I DSG
Sbjct: 290 SYNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYSSLPTIIDSGTVITRLP 349
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG--QVTEYF---KPLALSFT 300
+ VY + + + GTP A L C++G L QV+ F L L T
Sbjct: 350 TDVYSALSKAVAGAMKGTPRASA--FSILDTCFQGQASRLRVPQVSMAFAGGAALKLKAT 407
Query: 301 NRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
N LV CL A IIG Q V+YD + +
Sbjct: 408 N-------------LLVDVDSATTCLAFAPARSAA-----IIGNTQQQTFSVVYDVKNSK 449
Query: 361 IGWKPEDCN 369
IG+ C+
Sbjct: 450 IGFAAGGCS 458
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 96/388 (24%), Positives = 155/388 (39%), Gaps = 56/388 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP----CTGCTKPP--EKQYKPHKNIVPCS 70
V+LTVG PP+ DTGS+L+W+ C+ + T P Y P +PCS
Sbjct: 73 LTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSSSYSP----IPCS 128
Query: 71 NPRCA--ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
+ C +P P C N C + Y D SS G L TD F + GS +
Sbjct: 129 SSTCTDQTRDFPIRPSCDS-NQFCHATLSYADASSSEGNLATDTFYI----GSSGIPNVV 183
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGRGVLFL 187
FGC + + G++G+ RG +S VSQ+ G + +CI + + G+L L
Sbjct: 184 FGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQM---GFPK--FSYCISEYDFSGLLLL 238
Query: 188 GDGKVP-SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------------- 233
GD + + +TP+++ S L ++ + G K L +
Sbjct: 239 GDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAG 298
Query: 234 --IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK-----TLPICWRGPFKA-- 284
+ DSG + + Y + + G+ L++ D + +C+R P
Sbjct: 299 QTMVDSGTQFTFLLGPAYTALRDHFLNKTAGS-LRVYEDSNFVFQGAMDLCYRVPTNQTR 357
Query: 285 ---LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNI 341
L VT F+ ++ T R R VP E G ++ S+ E +
Sbjct: 358 LPPLPSVTLVFRGAEMTVTGDRILYR--VPGER----RGNDSIHCFTFGNSDLLGVEAFV 411
Query: 342 IGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
IG + Q+ + +D +K RIG C+
Sbjct: 412 IGHLHQQNVWMEFDLKKSRIGLAEIRCD 439
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 95/377 (25%), Positives = 146/377 (38%), Gaps = 48/377 (12%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNPRC 74
V+L +G PP+ DTGS L+W+QC PP + P +++PC++P C
Sbjct: 84 VSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPR-KPPPSSVFDPSLSSSFSVLPCNHPLC 142
Query: 75 AAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+ P C N C Y Y DG + G LV + S + PL GC
Sbjct: 143 KPRIPDFTLPTSCDQ-NRLCHYSYFYADGTLAEGNLVREKITFSRSQST---PPLILGCA 198
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDG 190
D G+LG+ GR+S SQ + V + G G +LG+
Sbjct: 199 EESS--------DAKGILGMNLGRLSFASQAKLTKFSYCVPTRQVRPGFTPTGSFYLGEN 250
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAE--LLYSGKSCGLKDLTL--------------- 233
S G + +L S + L P + G G + L +
Sbjct: 251 P-NSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAGQT 309
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA-PDDKTLPICWRGPFKALGQVTEYF 292
+ DSG+ + Y Y ++ ++R L+G LK +C+ G +G++
Sbjct: 310 MIDSGSEFTYLVDEAYNKVREEVVR-LVGARLKKGYVYGGVSDMCFNGNAIEIGRL---I 365
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
+ F V +VV E L G C+GI SE +NIIG Q+ V
Sbjct: 366 GNMVFEFD---KGVEIVVEKERVLADVGGGVHCVGI-GRSEMLGAASNIIGNFHQQNIWV 421
Query: 353 IYDNEKQRIGWKPEDCN 369
+D +R+G+ DC+
Sbjct: 422 EFDLANRRVGFGKADCS 438
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 151/377 (40%), Gaps = 58/377 (15%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF L VG PPK DTGSD+ W+QC PCT C ++ + P K+ +PC +
Sbjct: 130 YF-TRLGVGTPPKYLYMVLDTGSDVVWLQCK-PCTKCYSQTDQIFDPSKSKSFAGIPCYS 187
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L + P C N+ C Y++ YGDG + G T+ L F +V V + GC
Sbjct: 188 PLCRRL---DSPGCSLKNNLCQYQVSYGDGSFTFGDFSTET--LTFRRAAVPRVAI--GC 240
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV----LFL 187
G++ N G LG G + R N +C+ +
Sbjct: 241 GHD--NEGLFVGAAGLLGLGRGGLSFPTQTGTR----FNNKFSYCLTDRTASAKPSSIVF 294
Query: 188 GDGKVPSSGVAWTPMLQN-SADLKHYI------LGPAELLYSGKSCGLKDLT----LIFD 236
GD V S +TP+++N D +Y+ +G A + S D T +I D
Sbjct: 295 GDSAV-SRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIID 353
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLI---GTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
SG S T Y + +RD + LK AP+ C+ L ++E
Sbjct: 354 SGTSVTRLTRPAY-----VSLRDAFRVGASHLKRAPEFSLFDTCYD-----LSGLSEVKV 403
Query: 294 P-LALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
P + L F + +P YLV + + C + +IIG I Q
Sbjct: 404 PTVVLHF----RGADVSLPAANYLVPVDNSGSFCFAF----AGTMSGLSIIGNIQQQGFR 455
Query: 352 VIYDNEKQRIGWKPEDC 368
V++D R+G+ P C
Sbjct: 456 VVFDLAGSRVGFAPRGC 472
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 148/381 (38%), Gaps = 50/381 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + VG P DTGSD+TW+QC PC C + P + + P
Sbjct: 134 YMAKIAVGTPAVEALLAMDTGSDITWLQCQ-PCRRCYPQSGPVFDPRHSTSYREMGYDAP 192
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSS-IGALVTDLFPLRFSNGSVFNVP-LTFG 130
C AL K C Y + YGD GS+ +G + + L F+ G VP ++ G
Sbjct: 193 DCQALGRSGGGDAKRMT--CVYAVGYGDDGSTTVGDFIEET--LTFAGG--VQVPHMSIG 246
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--------GQNGR 182
CG++ N G + P AG+LGLGRG+IS SQ+ G +C+ G++
Sbjct: 247 CGHD--NKGLFAAP-AAGILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLSSPGRSVS 303
Query: 183 GVLFLGDGKVPSS-GVAWTPMLQNSADLKHYILGPAELLYSGKSCGL---KDLTL----- 233
L +GDG S ++TP +QN Y + + G DL L
Sbjct: 304 STLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLKLDPYTG 363
Query: 234 ----IFDSGASYAYFTSRVYQEIVSLIMRDLIGT-PLKLAPDDKTLPICWRGPFKALGQV 288
I DSG + R Y + + + C+ +G
Sbjct: 364 RGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDTCY-----TMGGR 418
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFM 347
+++ F V L +PP+ YL+ + VC + V +IIG I
Sbjct: 419 AMKVPTVSMHFA---GGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSV---SIIGNIQQ 472
Query: 348 QDKMVIYDNEKQRIGWKPEDC 368
Q V+Y+ R+G+ P C
Sbjct: 473 QGFRVVYNIGGGRVGFAPNSC 493
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 88/363 (24%), Positives = 134/363 (36%), Gaps = 30/363 (8%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + +G P K F FDTGSDLTW QC+ C E + P ++ + C +
Sbjct: 153 YF-VTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEAIFNPSQSTSYANISCGS 211
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C +L + C Y I+YGD SIG + L ++ VFN FGC
Sbjct: 212 TLCDSLASATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLSLTATD--VFN-DFYFGC 268
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G N + R ++S+VSQ + + +C+ + FL G
Sbjct: 269 GQNNKGLFGGAAGLLGLG----RDKLSLVSQTAQR--YNKIFSYCLPSSSSSTGFLTFGG 322
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYAYFTS 246
S ++TP+ S Y L + G+ + I DSG
Sbjct: 323 STSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVFSTAGTIIDSGTVITRLPP 382
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSV 306
Y + S + + P AP L C F T + L F+ V
Sbjct: 383 AAYSALSSTFRKLMSQYP--AAPALSILDTC----FDFSNHDTISVPKIGLFFS---GGV 433
Query: 307 RLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPE 366
+ + ++ VCL S+A + I G + + V+YD R+G+ P
Sbjct: 434 VVDIDKTGIFYVNDLTQVCLAFAGNSDAS--DVAIFGNVQQKTLEVVYDGAAGRVGFAPA 491
Query: 367 DCN 369
C+
Sbjct: 492 GCS 494
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/391 (23%), Positives = 153/391 (39%), Gaps = 53/391 (13%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCS 70
S + +G PP+ + DTGS+L W QC C + Y P ++ V C+
Sbjct: 69 SQYIAEYLIGDPPQRAEAIIDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVGCN 128
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ CA + +C N C YG G+ G L T+ + V L FG
Sbjct: 129 DAACA---LGSETQCLSDNKTCAVVTGYG-AGNIAGTLATENLTFQSE-----TVSLVFG 179
Query: 131 C-GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE----YGL---IRNVI--GHCIGQN 180
C + +PG L+ +G++GLGRG++S+ SQL + Y L + I H +
Sbjct: 180 CIVVTKLSPGSLN--GASGIIGLGRGKLSLPSQLGDTRFSYCLTPYFEDTIEPSHMVVGA 237
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSAD----------LKHYILGPAELLYSGKSCGLKD 230
G++ +G S+ V P +++ +D L G +L + L+
Sbjct: 238 SAGLI---NGSASSTPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQ 294
Query: 231 LT------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKA 284
+ DSGA YQ + + + R L ++ +C A
Sbjct: 295 VAPGMWTGTFIDSGAPLTSLVDVAYQALRAELARQLGAALVQPLAGTTGFDLC-----VA 349
Query: 285 LGQVTEYFKPLALSFTNRRNS-VRLVVPPEAYLVISGRKNVCLGILNGSEAE---VGENN 340
L PL L F + LVVPP Y C+ + + + + + E
Sbjct: 350 LKDAERLVPPLVLHFGGGSGTGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETT 409
Query: 341 IIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
+IG Q+ V+YD + ++P DC+++
Sbjct: 410 VIGNYMQQNMHVLYDLAGGVLSFQPADCSSI 440
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/379 (24%), Positives = 155/379 (40%), Gaps = 50/379 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L++G PP DTGSDL W+QC PCT C K + P + + +
Sbjct: 59 YLMELSIGTPPVKTYAQVDTGSDLIWLQC-IPCTNCYKQLNPMFDPQSSSTYSNIAYGSE 117
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C+ L+ + C + C+Y Y D + G L + L + G + + FGC
Sbjct: 118 SCSKLYSTS---CSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALKGVIFGC 174
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI-----GQNGRGVL 185
G+N N G + + G++GLGRG +S+VSQ+ +G + C+ + +
Sbjct: 175 GHN--NNGVFNDKE-MGIIGLGRGPLSLVSQIGSSFG--GKMFSQCLVPFHTNPSITSPM 229
Query: 186 FLGDG-KVPSSGVAWTPMLQNSADLKHY---ILGPA----ELLYSGKSCGLKDLT---LI 234
G G +V +GV TP++ + Y +LG + L ++ S L+ +T ++
Sbjct: 230 SFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVEDINLPFNDGS-SLEPITKGNMV 288
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL--PICWRGPFKALGQVTEYF 292
DSG Y +V + + P+ P D TL +C+R P G
Sbjct: 289 IDSGTPTTLLPEDFYHRLVEEVRNKVALDPI---PIDPTLGYQLCYRTPTNLKGT----- 340
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
+ T +++ P + C + E G I G + ++
Sbjct: 341 -----TLTAHFEGADVLLTPTQIFIPVQDGIFCFAFTSTFSNEYG---IYGNHAQSNYLI 392
Query: 353 IYDNEKQRIGWKPEDCNTL 371
+D EKQ + +K DC L
Sbjct: 393 GFDLEKQLVSFKATDCTNL 411
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 93/381 (24%), Positives = 155/381 (40%), Gaps = 61/381 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +N+++G PP DTGSDL W QC+ PC C + + P ++ V CS+
Sbjct: 86 YLMNISIGTPPVPILAIADTGSDLIWTQCN-PCEDCYQQTSPLFDPKESSTYRKVSCSSS 144
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF--- 129
+C AL C + C Y I YGD + G + D + GS P++
Sbjct: 145 QCRALE---DASCSTDENTCSYTITYGDNSYTKGDVAVDTVTM----GSSGRRPVSLRNM 197
Query: 130 --GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---------- 177
GCG+ N G P +G++GLG G S+VSQLR+ I +C+
Sbjct: 198 IIGCGH--ENTGTFDPA-GSGIIGLGGGSTSLVSQLRKS--INGKFSYCLVPFTSETGLT 252
Query: 178 -----GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT 232
G NG + GDG V +S V P +L+ +G ++ ++ G +
Sbjct: 253 SKINFGTNG---IVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGN 309
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR--GPFKALGQVTE 290
++ DSG + S Y E+ S++ + ++ D L +C+R FK + +T
Sbjct: 310 IVIDSGTTLTLLPSNFYYELESVVASTIKAE--RVQDPDGILSLCYRDSSSFK-VPDITV 366
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDK 350
+FK + N V + + + + I G + +
Sbjct: 367 HFKGGDVKLGNLNTFVAVSEDVSCFAFAANE----------------QLTIFGNLAQMNF 410
Query: 351 MVIYDNEKQRIGWKPEDCNTL 371
+V YD + +K DC+ +
Sbjct: 411 LVGYDTVSGTVSFKKTDCSQM 431
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/368 (25%), Positives = 137/368 (37%), Gaps = 43/368 (11%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI------------ 66
+ +G P F DTGSDL WV CD CT C + ++
Sbjct: 98 TTVQIGTPGVKFMVALDTGSDLFWVPCD--CTRCAATDSSAFASDFDLNVYNPNGSSTSK 155
Query: 67 -VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNG--SV 122
V C+N C + +C C Y + Y +S G LV D+ L + +
Sbjct: 156 KVTCNNSLCM-----HRSQCLGTLSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDL 210
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
+ FGCG Q + L G+ GLG +IS+ S L G + C G++G
Sbjct: 211 VEANVIFGCGQIQ-SGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGI 269
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
G + GD S TP N + + I + G + + T +FDSG S+
Sbjct: 270 GRISFGDKG--SFDQDETPFNLNPSHPTYNI--TVTQVRVGTTLIDVEFTALFDSGTSFT 325
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI--CWRGPFKALGQVTEYFKPLALSFT 300
Y Y + + + D +P C+ A + ++S T
Sbjct: 326 YLVDPTYTRLTESFHSQVQD---RRHRSDSRIPFEYCYDMSPDANTSLIP-----SVSLT 377
Query: 301 NRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
S V P + CL ++ + E NIIG+ FM V++D EK
Sbjct: 378 MGGGSHFAVYDPIIIISTQSELVYCLAVV-----KTAELNIIGQNFMTGYRVVFDREKLV 432
Query: 361 IGWKPEDC 368
+GWK DC
Sbjct: 433 LGWKKFDC 440
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 95/385 (24%), Positives = 155/385 (40%), Gaps = 53/385 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA-PCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
V+LT G P + DTGS+L+W+ C P P K +PCS+P C
Sbjct: 67 LTVSLTAGTPLQNITMVLDTGSELSWLHCKKEPNFNSIFNPLASKTYTK--IPCSSPTCE 124
Query: 76 --ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
P P C P C + I Y D S G L + F + GSV FGC
Sbjct: 125 TRTRDLPLPVSCD-PAKLCHFIISYADASSVEGNLAFETFRV----GSVTGPATVFGCMD 179
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQL--REYGLIRNVIGHCIG-QNGRGVLFLGDG 190
+ + T G++G+ RG +S V+Q+ R++ +CI ++ GVL LG+
Sbjct: 180 SGFSSNSEEDAKTTGLMGMNRGSLSFVNQMGFRKF-------SYCISDRDSSGVLLLGEA 232
Query: 191 KVP-SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------------I 234
+ +TP+++ S L ++ + G K L+L +
Sbjct: 233 SFSWLKPLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTM 292
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK-----TLPICW-----RGPFKA 284
DSG + + VY + + G L++ + + + +C+ R
Sbjct: 293 VDSGTQFTFLLGPVYSALKQEFLLQTKGV-LRVLNEPRYVFQGAMDLCYLIEPTRAALPN 351
Query: 285 LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGE 344
L V F+ +S + +R R VP E + G+ +V S++ E+ +IG
Sbjct: 352 LPVVNLMFRGAEMSVSGQRLLYR--VPGE----VRGKDSVWCFTFGNSDSLGIESFVIGH 405
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCN 369
Q+ + YD EK RIG+ C+
Sbjct: 406 HQQQNVWMEYDLEKSRIGFAEVRCD 430
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 92/318 (28%), Positives = 132/318 (41%), Gaps = 61/318 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ + ++G+PP L + DTGSDL WV+C +PC GC PP Y P ++ +PCS+
Sbjct: 87 YIMQFSIGEPPLLIWAEVDTGSDLMWVKC-SPCNGCNPPPSPLYDPARSRSSGKLPCSSQ 145
Query: 73 RCAALHWPN--PPRCKHPNDQCDYEIEYGDGG--SSIGALVTDLFPLRFSNGSVFNVPLT 128
C AL +C C Y YG G S+ G L T+ F T
Sbjct: 146 LCQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETF--------------T 191
Query: 129 FGCGYNQHNP--GPLSPPD------TAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
FG GY +N G D TAG++GLGRG +S+VSQL G R +C+ +
Sbjct: 192 FGDGYVANNVSFGRSDTIDGSQFGGTAGLVGLGRGHLSLVSQL---GAGR--FAYCLAAD 246
Query: 181 GR---GVLF--LGDGKVPSSGVAWTPMLQNSADLK--HYILGPAELLYSGKSCGLKDLT- 232
+LF L + V+ TP++ N + HY + + G +KD T
Sbjct: 247 PNVYSTILFGSLAALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTF 306
Query: 233 ---------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK 283
+ FDSGA YQ ++R I + ++ D C+
Sbjct: 307 AINSDGSGGVFFDSGAIDTSLKDAAYQ-----VVRQAITSEIQRLGYDAGDDTCF---VA 358
Query: 284 ALGQVTEYFKPLALSFTN 301
A Q PL L F +
Sbjct: 359 ANQQAVAQMPPLVLHFDD 376
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 155/372 (41%), Gaps = 37/372 (9%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IV 67
PI Y + +G PP DTGSDL WVQC APC C + P K+ V
Sbjct: 88 PITEYL-MRFYIGTPPVERFAIADTGSDLIWVQC-APCEKCVPQNAPLFDPRKSSTFKTV 145
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
PC + C L P+ C + QC Y+ YGD G L + N ++ L
Sbjct: 146 PCDSQPCTLLP-PSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKL 204
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGV 184
TFGC ++ ++ S + G++GLG G +S++SQL Y + R +C + N
Sbjct: 205 TFGCTFSNNDTVDESKRNM-GLVGLGVGPLSLISQL-GYQIGRK-FSYCFPPLSSNSTSK 261
Query: 185 LFLGDGKVPSS--GVAWTPMLQNSADLKHYILGPAELLYSGK----SCGLKDLTLIFDSG 238
+ G+ + GV TP++ S +Y L + K S D ++ DSG
Sbjct: 262 MRFGNDAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTSESQTDGNILIDSG 321
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
S+ Y + V+L+ +++ G P P+ + F+ G+ + F +
Sbjct: 322 TSFTILKQSFYNKFVALV-KEVYGVEAVKIP-----PLVYNFCFENKGK-RKRFPDVVFL 374
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKN--VCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
FT + V +A + N +C+ L S+ +++I G V YD
Sbjct: 375 FTGAKVRV------DASNLFEAEDNNLLCMVALPTSDE---DDSIFGNHAQIGYQVEYDL 425
Query: 357 EKQRIGWKPEDC 368
+ + + P DC
Sbjct: 426 QGGMVSFAPADC 437
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 89/367 (24%), Positives = 144/367 (39%), Gaps = 43/367 (11%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
++ + + L VG PP + + DTGSDL W QC PCT C QY P I SN
Sbjct: 58 YNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQC-MPCTNC----YSQYAP---IFDPSNSS 109
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCG 132
RC + C Y+I Y D S G L T+ + ++G F +P T GCG
Sbjct: 110 TF-----KEKRCN--GNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCG 162
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG-DGK 191
+N P +G++GL G S+++Q+ G ++ +C G + G +
Sbjct: 163 HNS----SWFKPTFSGMVGLSWGPSSLITQMG--GEYPGLMSYCFASQGTSKINFGTNAI 216
Query: 192 VPSSGVAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
V GV T M +A +L +G + G + + +I DSG + YF
Sbjct: 217 VAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYF 276
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN 304
Y +V + + P + + +T +F A ++ N
Sbjct: 277 PVS-YCNLVREAVDHYVTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYN 335
Query: 305 SVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
Y+ R CL I+ + ++ I G + +V YD+ + +
Sbjct: 336 ---------MYIETITRGTFCLAIICNNPP---QDAIFGNRAQNNFLVGYDSSSLLVFFS 383
Query: 365 PEDCNTL 371
P +C+ L
Sbjct: 384 PTNCSAL 390
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 95/370 (25%), Positives = 154/370 (41%), Gaps = 60/370 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + ++G PP+ DTGSDL W +C A CT C Y P+K+ +PCS
Sbjct: 82 YDMTFSIGTPPQELSALADTGSDLIWAKCGA-CTRCVPQGSPSYYPNKSSSFSKLPCSGS 140
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC- 131
C+ L P+ +C +CDY+ YG L +D P ++ G + + T G
Sbjct: 141 LCSDL--PS-SQCSAGGAECDYKYSYG--------LASD--PHHYTQGYLGSETFTLGSD 187
Query: 132 -----GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV-- 184
G+ +G++GLGRG +S+VSQL +C+ +
Sbjct: 188 AVPGIGFGCTTMSEGGYGSGSGLVGLGRGPLSLVSQLN-----VGAFSYCLTSDAAKTSP 242
Query: 185 LFLGDGKVPSSGVAWTPMLQNSA-----DLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
L G G + +GV TP+L+ S +L+ +G A +G S +IFDSG
Sbjct: 243 LLFGSGALTGAGVQSTPLLRTSTYYYTVNLESISIGAATTAGTGSS------GIIFDSGT 296
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
+ A+ Y ++ T L +A +C F+ G V F + L F
Sbjct: 297 TVAFLAEPAYTLAKEAVLSQT--TNLTMASGRDGYEVC----FQTSGAV---FPSMVLHF 347
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
+ + +P E Y C I+ S + +I+G I + + YD EK
Sbjct: 348 ----DGGDMDLPTENYFGAVDDSVSCW-IVQKSPSL----SIVGNIMQMNYHIRYDVEKS 398
Query: 360 RIGWKPEDCN 369
+ ++P +C+
Sbjct: 399 MLSFQPANCD 408
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 101/386 (26%), Positives = 154/386 (39%), Gaps = 61/386 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +NL++G PP F DTGS L W QC APCT C P ++P + +PC++
Sbjct: 90 YNMNLSIGTPPVTFSVLADTGSSLIWTQC-APCTECAARPAPPFQPASSSTFSKLPCASS 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C + P C Y YG G ++ G L T+ + G+ F + FGC
Sbjct: 149 LC---QFLTSPYLTCNATGCVYYYPYGMGFTA-GYLATETLHV---GGASFP-GVAFGCS 200
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG----VLFLG 188
++ G ++G++GLGR +S+VSQ+ G+ R +C+ + +LF
Sbjct: 201 -TENGVG----NSSSGIVGLGRSPLSLVSQV---GVGR--FSYCLRSDADAGDSPILFGS 250
Query: 189 DGKVPSSGVAWTPMLQNS---------ADLKHYILGPAEL--------LYSGKSCGLKDL 231
KV V TP+L+N +L +G +L G GL
Sbjct: 251 LAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLVGG 310
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT--LPICWRGPFKALGQVT 289
T++ DSG + Y Y + + + L + +C+ G
Sbjct: 311 TIV-DSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGV 369
Query: 290 EYFKPLALSFTN------RRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNII 342
L L F RR S VV ++ GR V CL +L SE +II
Sbjct: 370 P-VPTLVLRFAGGAEYAVRRRSYVGVVAVDS----QGRAAVECLLVLPASEKL--SISII 422
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDC 368
G + D V+YD + + P DC
Sbjct: 423 GNVMQMDLHVLYDLDGGMFSFAPADC 448
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 99/361 (27%), Positives = 145/361 (40%), Gaps = 47/361 (13%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWPN------PPR 84
DT S+LTWVQC APC C + + P + VPC++ C AL
Sbjct: 169 DTASELTWVQC-APCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALQLATGGTSGGAAA 227
Query: 85 CKHPNDQ---CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPL 141
C+ + C Y + Y DG S G L D L G V + FGCG + P P
Sbjct: 228 CQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSL---AGEVID-GFVFGCGTSNQGP-PF 282
Query: 142 SPPDTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCI---GQNGRGVLFLGDGKV---PS 194
T+G++GLGR ++S+VSQ + ++G V +C+ + G L +GD S
Sbjct: 283 G--GTSGLMGLGRSQLSLVSQTMDQFG---GVFSYCLPLKESDSSGSLVIGDDSSVYRNS 337
Query: 195 SGVAWTPMLQNS-------ADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSR 247
+ + + M+ + +L +G E+ SG S G I DSG
Sbjct: 338 TPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSGGGGGKAIIDSGTVITSLVPS 397
Query: 248 VYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVR 307
+Y + + + P AP L C F G L L F V
Sbjct: 398 IYNAVKAEFLSQFAEYP--QAPGFSILDTC----FNMTGLREVQVPSLKLVFDGGVE-VE 450
Query: 308 LVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPED 367
+ Y V S VCL + ++E E NIIG ++ VI+D ++G+ E
Sbjct: 451 VDSGGVLYFVSSDSSQVCLAMAP-LKSEY-ETNIIGNYQQKNLRVIFDTSGSQVGFAQET 508
Query: 368 C 368
C
Sbjct: 509 C 509
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 102/389 (26%), Positives = 153/389 (39%), Gaps = 59/389 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
+ V++ +G PP+ DTGSDLTW QC APC C + ++ P + +++PC
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQC-APCVSCFRQSLPRFNPSRSMTFSVLPCDLR 169
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV--FNVP-LTF 129
C L W + N C Y Y D + G L +D F ++ ++ +VP LTF
Sbjct: 170 ICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTF 229
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLF 186
GCG N G +T G+ G RG +S+ +QL+ + N +C I + +F
Sbjct: 230 GCGL--FNNGIFVSNET-GIAGFSRGALSMPAQLK----VDN-FSYCFTAITGSEPSPVF 281
Query: 187 LG-------DGKVPSSGVAWTPML--QNSADLKHY-------ILGPAELLYSGKSCGLKD 230
LG D GV + L +S+ LK Y +G L LK+
Sbjct: 282 LGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKE 341
Query: 231 L---TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP-ICWRGPFKALG 286
I DSG VY + + T L + +L +C+ P A
Sbjct: 342 DGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQ---TKLTVHNSTSSLSQLCFSVPPGA-- 396
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNII 342
KP + L +P E Y+ G + CL I G + V I
Sbjct: 397 ------KPDVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLSV-----I 445
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
G Q+ V+YD + + P CN +
Sbjct: 446 GNFQQQNMHVLYDLANDMLSFVPARCNKI 474
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 98/406 (24%), Positives = 156/406 (38%), Gaps = 81/406 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCT----------KPPEKQYKPHK 64
++V+L+ G PP+ F DTGSD+ W C + C C+ +P +
Sbjct: 67 YSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSS 126
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCD---------------YEIEYGDGGSSIGALV 109
++ C NP+C+ +H H N CD Y I YG G + AL
Sbjct: 127 KLLGCKNPKCSWIH--------HSNINCDQDCSIKSCLNQTCPPYMIFYGSGTTGGVALS 178
Query: 110 TDLFPLRFSNGSVFNVPLTFGCG-YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR---- 164
L S + GC ++ H P AG+ G GRG S+ SQL
Sbjct: 179 ETLHLHSLSKPNFL-----VGCSVFSSHQP--------AGIAGFGRGLSSLPSQLGLGKF 225
Query: 165 EYGLIRNVIGHCIGQNGRGVLFLG--DGKVPSSGVAWTPMLQN------SADLKHYILGP 216
Y L+ + ++ VL + D ++ + +TP ++N S+ +Y LG
Sbjct: 226 SYCLLSHRFDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGL 285
Query: 217 AELLYSGKSCGL--KDLT--------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLK 266
+ G + K L+ +I DSG ++ + ++ + +R +
Sbjct: 286 RRITVGGHHVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRV 345
Query: 267 LAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCL 326
+D I R F T F L L F + + +P E Y G + CL
Sbjct: 346 KEIEDA---IGLRPCFNVSDAKTVSFPELRLYF---KGGADVALPVENYFAFVGGEVACL 399
Query: 327 GILN----GSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
++ G E G I+G MQ+ V YD +R+G+K E C
Sbjct: 400 TVVTDGVAGPERVGGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 70/197 (35%), Positives = 94/197 (47%), Gaps = 25/197 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + VG PP+ D+GSD+ WVQC+ PCT C + + P + V C++
Sbjct: 134 YF-VRIGVGSPPRNQYVVIDSGSDIIWVQCE-PCTQCYHQSDPVFNPADSSSYAGVSCAS 191
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C+ H N C +C YE+ YGDG + G L L L F + NV + GC
Sbjct: 192 TVCS--HVDNAG-CH--EGRCRYEVSYGDGSYTKGTLA--LETLTFGRTLIRNVAI--GC 242
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G+ HN G AG+LGLG G +S V QL G +C+ G G+L G
Sbjct: 243 GH--HNQGMFV--GAAGLLGLGSGPMSFVGQLG--GQAGGTFSYCLVSRGIQSSGLLQFG 296
Query: 189 DGKVPSSGVAWTPMLQN 205
VP G AW P++ N
Sbjct: 297 REAVP-VGAAWVPLIHN 312
>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
Length = 452
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 99/394 (25%), Positives = 150/394 (38%), Gaps = 54/394 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK------QYKPHKNIVPCS 70
V + VG PP+ DTGS+L+W++C+ T PP+ CS
Sbjct: 60 LTVPVAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCS 119
Query: 71 NPRCAALHW-----PNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+P C W P PP C P+ C + Y D S+ G L D F L G
Sbjct: 120 SPEC---QWRGRDLPVPPFCAGPPSXSCRVSLSYADASSADGILAADTFLL----GGAPP 172
Query: 125 VPLTFGCGYNQHNPGPLSPPDT---AGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-N 180
V FGC + + + D+ G+LG+ RG +S V+Q +R +CI +
Sbjct: 173 VXALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQT---ATLR--FAYCIAPGD 227
Query: 181 GRGVLFL-GDGKVPSSGVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGL 228
G G+L L GDG + + +TP++Q S L ++ I A LL KS
Sbjct: 228 GPGLLVLGGDGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLA 287
Query: 229 KDLT----LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD----KTLPICWRG 280
D T + DSG + + + Y + + L D C+R
Sbjct: 288 PDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAFDACFRA 347
Query: 281 PFKALGQVTEYFKPLALSFTNRRNSV---RLV--VPPEAYLVISGRKNVCLGILNGSEAE 335
+ + + L +V +L+ VP E CL N A
Sbjct: 348 SEARVAAASXMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDMAG 407
Query: 336 VGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+ +IG Q+ V YD + R+G+ P C+
Sbjct: 408 M-SAYVIGHHHQQNVWVEYDLQNGRVGFAPARCD 440
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 89/367 (24%), Positives = 147/367 (40%), Gaps = 45/367 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRC 74
S + + L VG PP DTGS++TW QC PC C + + P K+ RC
Sbjct: 63 SVYLMKLQVGTPPFEIQAIIDTGSEITWTQC-LPCVHCYEQNAPIFDPSKSST-FKEKRC 120
Query: 75 AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGCGY 133
C YE++Y D ++G L T+ L ++G F +P T GCG+
Sbjct: 121 DG-------------HSCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCGH 167
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDGKV 192
N P +G++GL G S+++Q+ G ++ +C GQ + F + V
Sbjct: 168 NN----SWFKPSFSGMVGLNWGPSSLITQMG--GEYPGLMSYCFSGQGTSKINFGANAIV 221
Query: 193 PSSGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDLTLIFDSGASYAYFT 245
GV T M +A Y L G + G + + ++ DSG + YF
Sbjct: 222 AGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALEGNIVIDSGTTLTYFP 281
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
Y +V + ++ T ++ A +C+ + F + + F+
Sbjct: 282 VS-YCNLVRQAVEHVV-TAVRAADPTGNDMLCYN------SDTIDIFPVITMHFS---GG 330
Query: 306 VRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
V LV+ + S V CL I+ S + I G + +V YD+ + +
Sbjct: 331 VDLVLDKYNMYMESNNGGVFCLAIICNSPTQEA---IFGNRAQNNFLVGYDSSSLLVSFS 387
Query: 365 PEDCNTL 371
P +C+ L
Sbjct: 388 PTNCSAL 394
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 102/389 (26%), Positives = 157/389 (40%), Gaps = 58/389 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC------TKPPEKQYKPHKNIVPCS 70
V+LTVG PP+ DTGS+L+W+ C+ T + Y+P +PCS
Sbjct: 31 LTVSLTVGTPPQNVSMVIDTGSELSWLYCNKTTTTTSYPTTFNQTRSISYRP----IPCS 86
Query: 71 NPRCA--ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-L 127
+ C + P C N C + Y D SS G L +D F + S ++P +
Sbjct: 87 SSTCTNQTRDFSIPASCDS-NSLCHATLSYADASSSEGNLASDTFHMGAS-----DIPGM 140
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLF 186
FGC + + G++G+ RG +S VSQ+ G + +CI G + G+L
Sbjct: 141 VFGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQM---GFPK--FSYCISGTDFSGMLL 195
Query: 187 LGDGKVP-SSGVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGLKDLT-- 232
LG+ + + +TP++Q S L ++ I LL KS D T
Sbjct: 196 LGESNFTWAVPLNYTPLVQISTPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGA 255
Query: 233 --LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD----KTLPICWRGPFKA-- 284
+ DSG + + Y + S + G L D + +C+R P
Sbjct: 256 GQTMVDSGTQFTFLLGPAYTALRSEFLNQTTGFLRVLEDPDFVFQGAMDLCYRVPISQRV 315
Query: 285 ---LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENN 340
L V+ F ++ + R R VP E I G +V CL N V E
Sbjct: 316 LPRLPTVSLVFNGAEMTVADERVLYR--VPGE----IRGNDSVHCLSFGNSDLLGV-EAY 368
Query: 341 IIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+IG Q+ + +D E+ RIG C+
Sbjct: 369 VIGHHHQQNVWMEFDLERSRIGLAQVRCD 397
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 102/389 (26%), Positives = 153/389 (39%), Gaps = 59/389 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
+ V++ +G PP+ DTGSDLTW QC APC C + ++ P + +++PC
Sbjct: 85 YLVHMAIGTPPQPVQLILDTGSDLTWTQC-APCVSCFRQSLPRFNPSRSMTFSVLPCDLR 143
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV--FNVP-LTF 129
C L W + N C Y Y D + G L +D F ++ ++ +VP LTF
Sbjct: 144 ICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTF 203
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLF 186
GCG N G +T G+ G RG +S+ +QL+ + N +C I + +F
Sbjct: 204 GCGL--FNNGIFVSNET-GIAGFSRGALSMPAQLK----VDN-FSYCFTAITGSEPSPVF 255
Query: 187 LG-------DGKVPSSGVAWTPML--QNSADLKHY-------ILGPAELLYSGKSCGLKD 230
LG D GV + L +S+ LK Y +G L LK+
Sbjct: 256 LGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKE 315
Query: 231 L---TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP-ICWRGPFKALG 286
I DSG VY + + T L + +L +C+ P A
Sbjct: 316 DGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQ---TKLTVHNSTSSLSQLCFSVPPGA-- 370
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNII 342
KP + L +P E Y+ G + CL I G + V I
Sbjct: 371 ------KPDVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLSV-----I 419
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
G Q+ V+YD + + P CN +
Sbjct: 420 GNFQQQNMHVLYDLANDMLSFVPARCNKI 448
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 102/358 (28%), Positives = 141/358 (39%), Gaps = 46/358 (12%)
Query: 35 DTGSDLTWVQCDAPCTG--CTKPPEKQYKPHKN----IVPCSNPRCAAL---HWPNPPRC 85
DTGSDLTWVQC+ PC G C + + P + VPC +P CAA P C
Sbjct: 199 DTGSDLTWVQCE-PCPGSSCYAQRDPLFDPAASPTFAAVPCGSPACAASLKDATGAPGSC 257
Query: 86 K----HPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCGYNQHNPGP 140
+ +C Y + YGDG S G L D L G+ + FGCG + N G
Sbjct: 258 ARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGL----GTTTKLDGFVFGCGLS--NRGL 311
Query: 141 LSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDGKVPSSG-- 196
TAG++GLGR +S+VSQ V +C+ G L LG G PSS
Sbjct: 312 FG--GTAGLMGLGRTDLSLVSQ--TAARFGGVFSYCLPATTTSTGSLSLGPG--PSSSFP 365
Query: 197 -VAWTPMLQNSADLKHYILGPAELLYSGKSC----GLKDLTLIFDSGASYAYFTSRVYQE 251
+A+T M+ + Y + G + G ++ DSG VY+
Sbjct: 366 NMAYTRMIADPTQPPFYFINITGAAVGGGAALTAPGFGAGNVLVDSGTVITRLAPSVYKA 425
Query: 252 IVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
+ + R AP L C+ L E PL V +
Sbjct: 426 VRAEFARRF---EYPAAPGFSILDACYD-----LTGRDEVNVPLLTLTLEGGAQVTVDAA 477
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
++V VCL + S + IIG ++K V+YD R+G+ EDC
Sbjct: 478 GMLFVVRKDGSQVCLAM--ASLPYEDQTPIIGNYQQRNKRVVYDTVGSRLGFADEDCT 533
>gi|159463556|ref|XP_001690008.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158283996|gb|EDP09746.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 547
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 106/413 (25%), Positives = 156/413 (37%), Gaps = 74/413 (17%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK-PPEK--QYKPH----K 64
P Y+ LT+G P + DTGS L PC+GCT+ P K +KP
Sbjct: 76 PELGYYYTYLTIGTPGQTVSGILDTGSTLPAF----PCSGCTRCGPSKTGMFKPELSSTS 131
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+ CS+ RC + C N+QC Y I Y +G S+ G L D+ + G N
Sbjct: 132 STFGCSDARC----FCGANSCSCNNEQCGYSIRYLEGSSTSGFLAEDMLAVG-DGGPAAN 186
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV 184
FGC Q G L GV G+GR S+ QL + G+I + C G GV
Sbjct: 187 --FVFGCA--QSESGLLYSQIADGVFGMGRTPASLYGQLVQQGVIDDAFSMCFGAPREGV 242
Query: 185 LFLGDGKVPSSGVA--WTPMLQNSADLKHYILG---PAELLYSGKSCGLKDLTLIFDSGA 239
L LG+ +P+ A TP++ N+ I G + L SG+ L+ L A
Sbjct: 243 LLLGNVALPADAPAPVVTPVVGNTNKFNIQIEGLNFNDQQLVSGQRHNLQLLHTQCVQRA 302
Query: 240 SYAYFTSRVYQEI------------VSLIMRDLI----------------GTPLKLAPD- 270
+ +R Q + +D I PL D
Sbjct: 303 GGGHPETRRGQPRPCVRAGCLRECWLPYTHKDCIRRRRALCACDARARPRACPLHCCADC 362
Query: 271 -----------DKTLPICWRG-PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVI 318
++ ICW+G P ++ YF + L RL P YL
Sbjct: 363 CLWFCACVMSLAQSDDICWKGAPADDASKLGAYFPDMELLLA---GGGRLTRSPLHYLYP 419
Query: 319 SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
G CLG + + + + ++G M D +V YD ++ + +C+ L
Sbjct: 420 YGAA-WCLGFFDNAYS----STVLGANLMLDTVVTYDGRLNQMRFTTYECDKL 467
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 102/389 (26%), Positives = 153/389 (39%), Gaps = 59/389 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
+ V++ +G PP+ DTGSDLTW QC APC C + ++ P + +++PC
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQC-APCVSCFRQSLPRFNPSRSMTFSVLPCDLR 169
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV--FNVP-LTF 129
C L W + N C Y Y D + G L +D F ++ ++ +VP LTF
Sbjct: 170 ICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTF 229
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLF 186
GCG N G +T G+ G RG +S+ +QL+ + N +C I + +F
Sbjct: 230 GCGL--FNNGIFVSNET-GIAGFSRGALSMPAQLK----VDN-FSYCFTAITGSEPSPVF 281
Query: 187 LG-------DGKVPSSGVAWTPML--QNSADLKHY-------ILGPAELLYSGKSCGLKD 230
LG D GV + L +S+ LK Y +G L LK+
Sbjct: 282 LGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKE 341
Query: 231 L---TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP-ICWRGPFKALG 286
I DSG VY + + T L + +L +C+ P A
Sbjct: 342 DGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQ---TKLTVHNSTSSLSQLCFSVPPGA-- 396
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNII 342
KP + L +P E Y+ G + CL I G + V I
Sbjct: 397 ------KPDVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLSV-----I 445
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
G Q+ V+YD + + P CN +
Sbjct: 446 GNFQQQNMHVLYDLANDMLSFVPARCNKI 474
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 92/385 (23%), Positives = 165/385 (42%), Gaps = 53/385 (13%)
Query: 15 SYFAVNLTVGKP-PKLFDFDFDTGSDLTWVQCDAPCTGCTKP---PEKQYKPHKN----I 66
S + V++ +G P P+ F DTGSDLTW+ C+ C C KP P + ++ + +
Sbjct: 117 SQYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRANDSSSFRT 176
Query: 67 VPCSNPRCAA--LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS--- 121
+PCS+ C + + C +PN C ++ Y +G +IG + + ++
Sbjct: 177 IPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTVGLNDHKKIR 236
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---- 177
+F+V + +N+ N P GV+GLG + S+ +L E + N +C+
Sbjct: 237 LFDVLIGCTESFNETNGFP------DGVMGLGYRKHSLALRLAE--IFGNKFSYCLVDHL 288
Query: 178 -GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT---- 232
N + L GD +P + P +Q++ L YI + SG S G L+
Sbjct: 289 SSSNHKNFLSFGD--IPEMKL---PKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSD 343
Query: 233 ---------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK 283
+I DSG S Y ++V ++ + K+ P + LP F+
Sbjct: 344 IWNVTGVGGMIVDSGTSLTMLAGEAYDKVVD-ALKPIFDKHKKVVPIE--LPELNNFCFE 400
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIG 343
G L + F + P ++Y++ CLGI+ +A+ ++I+G
Sbjct: 401 DKGFDRAAVPRLLIHFA---DGAIFKPPVKSYIIDVAEGIKCLGII---KADFPGSSILG 454
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDC 368
+ Q+ + YD + ++G+ P C
Sbjct: 455 NVMQQNHLWEYDLGRGKLGFGPSSC 479
>gi|413952262|gb|AFW84911.1| hypothetical protein ZEAMMB73_904583 [Zea mays]
Length = 312
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 72/260 (27%), Positives = 115/260 (44%), Gaps = 32/260 (12%)
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQ 179
+ + FGC +Q G L+ D A G+ G G+ ++S++SQL G+ V HC+
Sbjct: 16 SASIVFGCSNSQ--SGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSD 73
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------ 233
NG G+L LG+ P G+ +TP++ + HY L + +G+ + D +L
Sbjct: 74 NGGGILVLGEIVEP--GLVYTPLVPSQ---PHYNLNLESIAVNGQKLPI-DSSLFTTSNT 127
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG + AY Y VS I ++P ++L F V
Sbjct: 128 QGTIVDSGTTLAYLADGAYDPFVSAI-------AAAVSPSVRSLVSKGSQCFITSSSVDS 180
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVISGR-KNVCLGILNGSEAEVGENNIIGEIFMQD 349
F + L F V + V PE YL+ N L + + E I+G++ ++D
Sbjct: 181 SFPTVTLYF---MGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKD 237
Query: 350 KMVIYDNEKQRIGWKPEDCN 369
K+ +YD R+GW DC+
Sbjct: 238 KIFVYDLANMRMGWADYDCS 257
>gi|6562285|emb|CAB62655.1| putative protein [Arabidopsis thaliana]
Length = 519
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 152/378 (40%), Gaps = 57/378 (15%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP----------PEKQYKPH 63
F ++A N++VG P F DTGSDL W+ C+ T C + P Y P+
Sbjct: 100 FLHYA-NVSVGTPATWFLVALDTGSDLFWLPCNCGST-CIRDLKEVGLSQSRPLNLYSPN 157
Query: 64 KNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGS-SIGALVTDLFPLRFS 118
+ + CS+ RC P C Y+I+Y + + G L D+ L
Sbjct: 158 TSSTSSSIRCSDDRCFGSSRC-----SSPASSCPYQIQYLSKDTFTTGTLFEDVLHLVTE 212
Query: 119 NGSV--FNVPLTFGCGYNQHNPGPL-SPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGH 175
+ + +T GCG NQ G L S G+LGLG S+ S L + + N
Sbjct: 213 DEGLEPVKANITLGCGKNQ--TGFLQSSAAVNGLLGLGLKDYSVPSILAKAKITANSFSM 270
Query: 176 CIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIF 235
C G V + G + TP+L + +G G + G++ L L F
Sbjct: 271 CFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSVTEVSVG-------GDAVGVQLLAL-F 322
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK-----ALGQVTE 290
D+G S+ + Y LI DK PI PF+ + + T
Sbjct: 323 DTGTSFTHLLEPEY---------GLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTI 373
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDK 350
F +A++F S + P L I CLGIL + ++ NIIG+ FM
Sbjct: 374 LFPRVAMTFEG--GSQMFLRNP---LFIDNSAMYCLGILKSVDFKI---NIIGQNFMSGY 425
Query: 351 MVIYDNEKQRIGWKPEDC 368
+++D E+ +GWK DC
Sbjct: 426 RIVFDRERMILGWKRSDC 443
>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 97/389 (24%), Positives = 159/389 (40%), Gaps = 59/389 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP---------HKNIV 67
V+L +G PP+ D DTGS L+W+QC PP + K +++
Sbjct: 66 LVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLSSSFSLL 125
Query: 68 PCSNPRCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV 125
PC++P C + P C N C Y Y DG + G LV + F + S+
Sbjct: 126 PCNHPICKPRIPDFTLPTSCDQ-NRLCHYSYFYADGTLAEGNLVREKFTF---SKSLSTP 181
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNG 181
P+ GC + + G+LG+ GR+S +SQ + + +C+ G N
Sbjct: 182 PVILGCAQ--------ASTENRGILGMNHGRLSFISQAK-----ISKFSYCVPSRTGSNP 228
Query: 182 RGVLFLGDGKVPSSGVAWTPML-----QNSADLK--HYILGPAELLYSGKSCGLKDLTL- 233
G+ +LGD SS + ML Q+S +L Y L + +GK +
Sbjct: 229 TGLFYLGDNP-NSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFK 287
Query: 234 ---------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA-PDDKTLPICWRGPFK 283
+ DSG+ Y Y+++ ++R L+G +K +C+
Sbjct: 288 PDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVR-LVGAMMKKGYVYADVADMCFDAGVT 346
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNII 342
A +V ++ F N V + V ++ K V C+GI +G +NII
Sbjct: 347 A--EVGRRIGGISFEFD---NGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIG-SNII 400
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
G + Q+ V YD +R+G+ +C+ L
Sbjct: 401 GTVHQQNMWVEYDLANKRVGFGGAECSRL 429
>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 442
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 96/392 (24%), Positives = 144/392 (36%), Gaps = 70/392 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----------HKN 65
V L +G PP+L DTGS L+W+QC K P+K+ P
Sbjct: 82 LVVTLPIGTPPQLQQMVLDTGSQLSWIQCHN-----KKTPQKKQPPTTSSFDPSLSSSFF 136
Query: 66 IVPCSNPRCAALHWPNPPRCKHPND-----QCDYEIEYGDGGSSIGALVTDLFPLRFSNG 120
++PC++P C P P P D C Y Y DG + G LV + S
Sbjct: 137 VLPCNHPLCK----PRVPDFSLPTDCDANSLCHYSYFYADGTYAEGNLVREKIAFSPSQT 192
Query: 121 SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--- 177
+ P+ GC D G+LG+ GR+ SQ + +C+
Sbjct: 193 T---PPIILGCATQSD--------DARGILGMNLGRLGFPSQAK-----ITKFSYCVPTK 236
Query: 178 -GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAE--LLYSGKSCGLKDLTL- 233
Q G +LG+ SS + +L + L P L G S G K L +
Sbjct: 237 QAQPASGSFYLGNNPA-SSSFRYVNLLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLNIP 295
Query: 234 --------------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR 279
+ DSG+ + Y Y I +++ + K IC+
Sbjct: 296 PSVFKPNAGGSGQTMIDSGSEFTYLVDEAYNVIREELVKKVGPKIKKGYMYGGVADICFD 355
Query: 280 GPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGEN 339
G +G++ + F V++V+P E L CLG + SE
Sbjct: 356 GDAIEIGRLV---GDMVFEF---EKGVQIVIPKERVLATVDGGVHCLG-MGRSERLGAGG 408
Query: 340 NIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
NIIG Q+ V +D +R+G+ DC+ L
Sbjct: 409 NIIGNFHQQNLWVEFDLANRRVGFGEADCSKL 440
>gi|213998828|gb|ACJ60781.1| nucellin [Hordeum brachyantherum subsp. californicum]
Length = 133
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 52/129 (40%), Positives = 73/129 (56%), Gaps = 7/129 (5%)
Query: 140 PLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVLFLGDGKVPSSGVA 198
P SP D G+LGLG G+ QL+ +I NVIGHC+ G+GVL++GD PS GV
Sbjct: 5 PPSPVD--GILGLGMGKAGFAVQLKGQKMITGNVIGHCLSSQGKGVLYVGDFNPPSRGVT 62
Query: 199 WTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYFTSRVYQEIVSLIM 257
W PM ++ L +Y G AE L + G +FDSG++Y + ++VY EIVS +
Sbjct: 63 WVPMKES---LFYYSPGLAEPLIDNQPIRGNPTFEAVFDSGSTYTHVPAQVYNEIVSKVR 119
Query: 258 RDLIGTPLK 266
L + L+
Sbjct: 120 GTLSESSLE 128
>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
Length = 284
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 65/192 (33%), Positives = 88/192 (45%), Gaps = 22/192 (11%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VP 68
I Y+ L +G PP++F D+GS +T+V C + C C K + +++P + V
Sbjct: 89 INGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQDPKFQPEMSSTYQPVK 147
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPL 127
C N C C +QC YE EY + SS G L DL + F N S
Sbjct: 148 C-NMDC---------NCDDDREQCVYEREYAEHSSSKGVLGEDL--ISFGNESQLTPQRA 195
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVL 185
FGC G L G++GLG+G +S+V QL + GLI N G C G G G +
Sbjct: 196 VFGC--ETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSM 253
Query: 186 FLGDGKVPSSGV 197
LG PS V
Sbjct: 254 ILGGFDYPSDMV 265
>gi|302783208|ref|XP_002973377.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
gi|300159130|gb|EFJ25751.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
Length = 472
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 94/376 (25%), Positives = 155/376 (41%), Gaps = 51/376 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC---TKPP--EKQYKPHKNIVPCSN 71
FA+NL +G PP +F S+ W C +PC C T P +PC++
Sbjct: 88 FAMNLNLGTPPVQHNFTMALNSEFFWAAC-SPCVDCNVSTNDPLFSSASSTSYTRIPCTS 146
Query: 72 PRCAALHWPNPPRCKHP---NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
P C+ + C + C Y Y SS G + +D+ ++ + N L
Sbjct: 147 PFCSTSPGFSTNACGSSAVGSTTCLYNFSYSTDYSSAGEMASDVVAMKTPRKTRGNKSLR 206
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFL 187
G + + L +T+G++G + S + QL E I +C+ G + L
Sbjct: 207 MSLGCGRESTTLLGILNTSGLVGFAKTDKSFIGQLAEMDYTSKFI-YCVPSDTFSGKIVL 265
Query: 188 GDGKVPS-SGVAWTPMLQNSADLKHYI----LGPAELLYSGKSCGLKDLT--LIFDSGAS 240
G+ K+ S S +++TPM+ NS L +YI + + L L D T I DS +
Sbjct: 266 GNYKISSHSSLSYTPMIVNSTAL-YYIGLRSISITDTLTFPVQGILADGTGGTIIDSTFA 324
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
++YFT Y +V I + + L ++T + LG Y ++++
Sbjct: 325 FSYFTPDSYTPLVQAIQN--LNSNLTKVSSNETAAL--------LGNDICY--NVSVNDD 372
Query: 301 NRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGEN-NIIGEIFMQDKMVIYDNEKQ 359
+ N+ VCL + G +VG + N+IG D V +D EKQ
Sbjct: 373 DAENAT-----------------VCLAV--GDSEKVGFSLNVIGTYQQLDVAVEFDLEKQ 413
Query: 360 RIGWKPEDCNTLLSLN 375
IG+ CN ++L+
Sbjct: 414 EIGFGTAGCNVSMNLD 429
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 99/364 (27%), Positives = 150/364 (41%), Gaps = 43/364 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKN----IVPCS 70
+ V ++ G P DTGSD++W+QC PC+ P+K Y P + VPC+
Sbjct: 79 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK-PCSSGQCFPQKDPLYDPSHSSTYSAVPCA 137
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ C L QC + I Y DG S++GA D L + G++ FG
Sbjct: 138 SDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQD--KLTLAPGAIVQ-NFYFG 194
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
CG+ +H L GVLGLGR R S+ ++ YG V +C+ + G L LG
Sbjct: 195 CGHGKHAVRGL----FDGVLGLGRLRESLGAR---YG---GVFSYCLPSVSSKPGFLALG 244
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----LIFDSGASYAYF 244
GK P SG +TPM + A + GK L+ +I DSG
Sbjct: 245 AGKNP-SGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITGL 303
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN 304
S Y+ + S + + +L P+ L C+ G +AL+FT
Sbjct: 304 QSTAYRALRSAFRKAM--EAYRLLPNGD-LDTCY----NLTGYKNVVVPKIALTFTG-GA 355
Query: 305 SVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
++ L V P LV N CL G ++G + + V++D + G++
Sbjct: 356 TINLDV-PNGILV-----NGCLAF--AESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFR 407
Query: 365 PEDC 368
+ C
Sbjct: 408 AKAC 411
>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 529
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 158/380 (41%), Gaps = 51/380 (13%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP----------PEKQYKPH 63
F ++A N++VG P F DTGSDL W+ C+ T C + P Y P+
Sbjct: 100 FLHYA-NVSVGTPATWFLVALDTGSDLFWLPCNCGST-CIRDLKEVGLSQSRPLNLYSPN 157
Query: 64 KNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGS-SIGALVTDLFPLRFS 118
+ + CS+ RC RC P C Y+I+Y + + G L D+ L
Sbjct: 158 TSSTSSSIRCSDDRCFGSS-----RCSSPASSCPYQIQYLSKDTFTTGTLFEDVLHLVTE 212
Query: 119 NGSV--FNVPLTFGCGYNQHNPGPL-SPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGH 175
+ + +T GCG NQ G L S G+LGLG S+ S L + + N
Sbjct: 213 DEGLEPVKANITLGCGKNQ--TGFLQSSAAVNGLLGLGLKDYSVPSILAKAKITANSFSM 270
Query: 176 CIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIF 235
C G V + G + TP+L Y + E+ G + G++ L L F
Sbjct: 271 CFGNIIDVVGRISFGDKGYTDQMETPLLPTEPS-PTYAVSVTEVSVGGDAVGVQLLAL-F 328
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK-----ALGQVTE 290
D+G S+ + Y LI DK PI PF+ + + T
Sbjct: 329 DTGTSFTHLLEPEY---------GLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTI 379
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV--CLGILNGSEAEVGENNIIGEIFMQ 348
F +A++F ++ + ++V + + CLGIL + ++ NIIG+ FM
Sbjct: 380 LFPRVAMTF---EGGSQMFLRNPLFIVWNEDNSAMYCLGILKSVDFKI---NIIGQNFMS 433
Query: 349 DKMVIYDNEKQRIGWKPEDC 368
+++D E+ +GWK DC
Sbjct: 434 GYRIVFDRERMILGWKRSDC 453
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 105/390 (26%), Positives = 151/390 (38%), Gaps = 57/390 (14%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----V 67
P + + VG P DT SDLTW+QC PC C + P + +
Sbjct: 129 PTSGEYMAKIAVGTPAVQALLALDTASDLTWLQCQ-PCRRCYPQSGPVFDPRHSTSYGEM 187
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLF--PLRFSNGSVFNV 125
P C AL K C Y ++YGDG S V DL L F+ G V
Sbjct: 188 NYDAPDCQALGRSGGGDAKR--GTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGG-VRQA 244
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRG 183
L+ GCG++ N G P AG+LGLGRG+ISI Q+ G +C+ +G G
Sbjct: 245 YLSIGCGHD--NKGLFGAP-AAGILGLGRGQISIPHQIAFLGY-NASFSYCLVDFISGPG 300
Query: 184 ----VLFLGDGKVPSS-GVAWTPMLQNSADLKHYILGPAELLYSG---KSCGLKDLTL-- 233
L G G V +S ++TP + N Y + + G +DL L
Sbjct: 301 SPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDP 360
Query: 234 -------IFDSGASYAYFTSRVY-------QEIVSLIMRDLIGTPLKLAPDDKTLPICWR 279
I DSG + Y + + + + G P L D + R
Sbjct: 361 YTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLF--DTCYTVGGR 418
Query: 280 GPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGE 338
K + V+ +F V + + P+ YL+ + R VC + V
Sbjct: 419 AGVK-VPAVSMHFA----------GGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSV-- 465
Query: 339 NNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
++IG I Q V+YD QR+G+ P +C
Sbjct: 466 -SVIGNILQQGFRVVYDLAGQRVGFAPNNC 494
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 100/388 (25%), Positives = 155/388 (39%), Gaps = 63/388 (16%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD-----APCTGCTKPPEKQYKPHKNIVPCSNPR 73
++L +G P + + DTGS L+W+QC P T + + +PCS+P
Sbjct: 82 LSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPL 141
Query: 74 CAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C + P C N C Y Y DG + G LV + F FSN PL GC
Sbjct: 142 CKPRIPDFTLPTSCDS-NRLCHYSYFYADGTFAEGNLVKEKF--TFSNSQT-TPPLILGC 197
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGV 184
D G+LG+ GR+S +SQ + + +CI G G
Sbjct: 198 AKES--------TDEKGILGMNLGRLSFISQAKI-----SKFSYCIPTRSNRPGLASTGS 244
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYS----GKSCGLKDLTL------- 233
+LGD S G + +L + L P L Y+ G G K L +
Sbjct: 245 FYLGDNP-NSRGFKYVSLLTFPQSQRMPNLDP--LAYTVPLQGIRIGQKRLNIPGSVFRP 301
Query: 234 --------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA-PDDKTLPICWRGPFKA 284
+ DSG+ + + Y ++ I+R L+G+ LK T +C+ G
Sbjct: 302 DAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVR-LVGSRLKKGYVYGSTADMCFDGNHSM 360
Query: 285 LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVG-ENNIIG 343
++ L F V ++V ++ LV G C+GI G + +G +NIIG
Sbjct: 361 --EIGRLIGDLVFEFG---RGVEILVEKQSLLVNVGGGIHCVGI--GRSSMLGAASNIIG 413
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
+ Q+ V +D +R+G+ +C L
Sbjct: 414 NVHQQNLWVEFDVTNRRVGFSKAECRLL 441
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 87/288 (30%), Positives = 118/288 (40%), Gaps = 32/288 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ V+L +G PP+ DTGSDL W QC PC C + P ++ C +
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLSLTSCDST 140
Query: 73 RCAALHWPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L + K PN C Y YGD + G L D F + SV V FGC
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGV--AFGC 198
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G N G +T G+ G GRG +S+ SQL+ G + G VL
Sbjct: 199 GL--FNNGVFKSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTAVNGLKPSTVLLDLPAD 254
Query: 192 VPSSG---VAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDLT--LIFDSGA 239
+ SG V TP++QN A+ LK +G L LK+ T I DSG
Sbjct: 255 LYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGT 314
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGPFKA 284
+ +RVY+ ++RD +KL + T P C P +A
Sbjct: 315 AMTSLPTRVYR-----LVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRA 357
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 99/364 (27%), Positives = 149/364 (40%), Gaps = 43/364 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKN----IVPCS 70
+ V ++ G P DTGSD++W+QC PC+ P+K Y P + VPC+
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK-PCSSGQCFPQKDPLYDPSHSSTYSAVPCA 171
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ C L QC + I Y DG S++GA D L + G++ FG
Sbjct: 172 SDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQD--KLTLAPGAIVQ-NFYFG 228
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR--GVLFLG 188
CG+ +H L GVLGLGR R S+ ++ YG V +C+ G L LG
Sbjct: 229 CGHGKHAVRGL----FDGVLGLGRLRESLGAR---YG---GVFSYCLPSVSSKPGFLALG 278
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----LIFDSGASYAYF 244
GK P SG +TPM + A + GK L+ +I DSG
Sbjct: 279 AGKNP-SGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITGL 337
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN 304
S Y+ + S + + +L P+ L C+ G +AL+FT
Sbjct: 338 QSTAYRALRSAFRKAM--EAYRLLPNGD-LDTCY----NLTGYKNVVVPKIALTFTGGA- 389
Query: 305 SVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
++ L V P LV N CL G ++G + + V++D + G++
Sbjct: 390 TINLDV-PNGILV-----NGCLAFAE--SGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFR 441
Query: 365 PEDC 368
+ C
Sbjct: 442 AKAC 445
>gi|302853254|ref|XP_002958143.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
gi|300256504|gb|EFJ40768.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
Length = 475
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 71/284 (25%), Positives = 121/284 (42%), Gaps = 36/284 (12%)
Query: 89 NDQCDYEIEYGDGGSSIGALVTDLF-------PLRFSNGSVFNVPLTFGCGYNQHNPGPL 141
N++C Y Y + SS G +V D F P+R + FGC + G +
Sbjct: 4 NEKCYYSRTYAERSSSEGWMVEDAFGFPDDQPPVR----------MVFGCENGET--GEI 51
Query: 142 SPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPS-SGVAWT 200
G++G+G + SQL G+I +V C G G+L LGD +P + +T
Sbjct: 52 YRQLADGIMGMGNNHNAFQSQLVARGVIEDVFSLCFGYPKDGILLLGDVPMPKGANTVYT 111
Query: 201 PMLQNSADLKHYILGPAELLYSGKSCGL------KDLTLIFDSGASYAYFTSRVYQEIVS 254
P+L N+ L +Y + + +G L + ++ DSG ++ Y + + + +
Sbjct: 112 PLL-NNLHLHYYNVRMDGIAVNGVELSLNARIFTRGYGVVLDSGTTFTYLPTEAFNAMAA 170
Query: 255 LIMRDLIGTPLKLAP--DDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPP 312
I + L+ P D + ICW+G + +F F ++ RL +PP
Sbjct: 171 AIGSYALSHGLQSTPGADPQYNDICWKGAPDNFQGLENHFPSAEFVFG---DNARLSLPP 227
Query: 313 EAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
YL +S CLG+ + G +IG + ++D +V N
Sbjct: 228 LRYLFVSRPGEYCLGVFDNG----GSGTLIGGVSVRDVVVTMFN 267
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 104/379 (27%), Positives = 157/379 (41%), Gaps = 51/379 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF + VG P DTGSD+ W+QC APC C + + P + V C+
Sbjct: 147 YF-TKIGVGTPVTPALMVLDTGSDVVWLQC-APCRRCYDQSGQMFDPRASHSYGAVDCAA 204
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
P C L C C Y++ YGDG + G T+ L F++G+ VP + G
Sbjct: 205 PLCRRLDSGG---CDLRRKACLYQVAYGDGSVTAGDFATET--LTFASGA--RVPRVALG 257
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYG------LIRNVIGHCIGQNGRG 183
CG++ N G AG+LGLGRG +S SQ+ R +G L+ +
Sbjct: 258 CGHD--NEGLFV--AAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSS 313
Query: 184 VLFLGDGKV-PSSGVAWTPMLQNSADLKHYILGPAELLYSGK---SCGLKDLTL------ 233
+ G G V PS+ ++TPM++N Y + + G + DL L
Sbjct: 314 TVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPSTGR 373
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG S Y + G L+L+P +L + + G
Sbjct: 374 GGVIVDSGTSVTRLARPAYAALRDAFRAAAAG--LRLSPGGFSL---FDTCYDLSGLKVV 428
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
+++ F +PPE YL+ + R C G++ V +IIG I Q
Sbjct: 429 KVPTVSMHFAG---GAEAALPPENYLIPVDSRGTFCFA-FAGTDGGV---SIIGNIQQQG 481
Query: 350 KMVIYDNEKQRIGWKPEDC 368
V++D + QR+G+ P+ C
Sbjct: 482 FRVVFDGDGQRLGFVPKGC 500
>gi|296084698|emb|CBI25840.3| unnamed protein product [Vitis vinifera]
Length = 306
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 72/244 (29%), Positives = 105/244 (43%), Gaps = 22/244 (9%)
Query: 129 FGCGYNQHNPGP-LSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
FGC + G L G+ GLG G IS+ S L + GL+ + C G +G G +
Sbjct: 9 FGCSCGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISF 68
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSR 247
GD SSG TP + + L Y + ++ G S L + IFDSG S+ Y
Sbjct: 69 GDEG--SSGQEETPFNPSKSQL-LYNISITQISVGGTSADL-NFDAIFDSGTSFTYLNDP 124
Query: 248 VYQEI---VSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRN 304
Y I +L +D K + D LP + EY P+ ++ T +
Sbjct: 125 AYTSISESFNLRAKD------KRSSSDSDLPFEYCYDISEQQTTVEY--PI-VNLTMKGG 175
Query: 305 SVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
V P + I G CLG++ + G+ NIIG+ FM +I+D EK +GW
Sbjct: 176 DNFFVTDPIVIVSIQGGYVYCLGVV-----KSGDINIIGQNFMTGYRIIFDREKMVLGWT 230
Query: 365 PEDC 368
+C
Sbjct: 231 KSNC 234
>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
Length = 551
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 96/375 (25%), Positives = 144/375 (38%), Gaps = 36/375 (9%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTK-----PPEKQYKPHKNIVPC 69
+ VG P F DTGSDL WV CD AP T PE +
Sbjct: 107 AEVAVGTPNTTFLVALDTGSDLFWVPCDCKQCAPLGNLTAVDGGGGPELRQYSPSKSSTS 166
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSN-------GS 121
CA+ P C C Y + Y SS G LV D+ L G+
Sbjct: 167 KTVTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGELVEDVLYLTREKGAAAAAAGA 226
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQN 180
P+ FGCG Q L G++GLG ++S+ S L G+++ N C ++
Sbjct: 227 AVRTPVVFGCGQVQTG-SFLDGAAADGLMGLGMEKVSVPSILASTGVVKSNSFSMCFSKD 285
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----IFD 236
G G + GD S+ + TP + S + I + S G K+L L I D
Sbjct: 286 GLGRINFGD--TGSADQSETPFIVKSTHSYYNI------SITSMSVGDKNLPLGFYAIAD 337
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
SG S+ Y Y + + + ++ P + + T P+
Sbjct: 338 SGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFEYCYSLSPDQTTVELPI- 396
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGEN---NIIGEIFMQDKMVI 353
+S T +V V P Y + + N + I+ A + + +IIG+ FM V+
Sbjct: 397 VSLTTNGGAVFPVTSP-VYPIAAQMTNGEIRIIGYCLAVIKSDLPIDIIGQNFMTGLKVV 455
Query: 354 YDNEKQRIGWKPEDC 368
++ EK +GW+ DC
Sbjct: 456 FNREKSVLGWQKFDC 470
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 89/372 (23%), Positives = 153/372 (41%), Gaps = 47/372 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSNP 72
+ ++L++G PP DTGSDL W QC PC C K + + P + C
Sbjct: 95 YLMSLSLGTPPFKIMGIADTGSDLIWTQCK-PCERCYKQVDPLFDPKSSKTYRDFSCDAR 153
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
+C+ L + C + C Y+ YGD ++G + +D L + GS + P T GC
Sbjct: 154 QCSLL---DQSTCS--GNICQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVIGC 208
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRGVL 185
G+ N G S + G++GLG G +S++SQ+ + +C+ N +
Sbjct: 209 GH--ENDGTFSDKGS-GIVGLGAGPLSLISQMGSS--VGGKFSYCLVPLSSRAGNSSKLN 263
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDLTLIFDSG 238
F + V GV TP+L + Y L G + + S G + +I DSG
Sbjct: 264 FGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGNIIIDSG 323
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTLPICWRGPFKALGQVTEYFKPLAL 297
+ + + + + + G + A D L +C+ T K A+
Sbjct: 324 TTLTIVPDDFFSNLSTAVGNQVEG---RRAEDPSGFLSVCY--------SATSDLKVPAI 372
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNE 357
+ V+L P ++ +S VCL + + +I G + + +V Y+ +
Sbjct: 373 TAHFTGADVKL-KPINTFVQVS-DDVVCLAFASTTSGI----SIYGNVAQMNFLVEYNIQ 426
Query: 358 KQRIGWKPEDCN 369
+ + +KP DC
Sbjct: 427 GKSLSFKPTDCT 438
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 96/379 (25%), Positives = 153/379 (40%), Gaps = 51/379 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ +N+++G PP DTGSDL W QC PC C + E + P K+ I+ C
Sbjct: 95 YLMNISLGTPPVSMHGIADTGSDLLWRQC-KPCDSCYEQIEPIFDPAKSKTYQILSCEGK 153
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C+ L C N C Y YGDG + G L D + + G +VP + FGC
Sbjct: 154 SCSNLGGQG--GCSDDN-TCIYSYSYGDGSHTSGDLAVDTLTIGSTTGRPVSVPKVVFGC 210
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG------VL 185
G HN G +G++GLG G +S++SQLR LI +C+ G +
Sbjct: 211 G---HNNGGTFELHGSGLVGLGGGPLSMISQLRP--LIGGRFSYCLVPLGNDPSVSSKMH 265
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKS------CGLKDLTL 233
F G V +G TP+ D +Y+ +G +L Y G S + +
Sbjct: 266 FGSRGIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPLADADEGNI 325
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG-PFKALGQVTEYF 292
I DSG + Y + S ++ + G P++ + +C+ + +T +F
Sbjct: 326 IIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVR--DPNNVFSLCYSNLSGLRIPTITAHF 383
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
L + P V C ++ V + I G + + +V
Sbjct: 384 V-----------GADLELKPLNTFVQVQEDLFCFAMI-----PVSDLAIFGNLAQMNFLV 427
Query: 353 IYDNEKQRIGWKPEDCNTL 371
YD + + + +KP DC +
Sbjct: 428 GYDLKSRTVSFKPTDCTKI 446
>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 260
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 57/167 (34%), Positives = 79/167 (47%), Gaps = 17/167 (10%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK--PPEKQYKPHKNIVPCS- 70
+ Y+A L +G PP+ F DTGS++T+V C C K P Q + P +
Sbjct: 47 YGYYATKLYIGTPPQEFTLVVDTGSNMTFVPCCGSEEYCGKHEDPAFQTESSSTYQPVNC 106
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTF 129
+P C C + QC Y++ YGDG S G L D+ + F N S F L F
Sbjct: 107 HPSC---------DCDYLRSQCSYKMHYGDGSYSRGVLAEDI--ISFGNESEFAPQRLVF 155
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
GC + G L G++GLGRGR +IV QL + G+I + C
Sbjct: 156 GCELDA--IGSLYSLRADGIIGLGRGRSTIVDQLVDKGVISDSFSLC 200
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 146/365 (40%), Gaps = 46/365 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
+ + L +G PP + DTGS+ W QC PC C + P K+
Sbjct: 59 YLMKLQIGTPPFEIEAVLDTGSEHIWTQC-LPCVHCYNQTAPIFDPSKS----------- 106
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGCGYNQ 135
RC + C YE+ YG + G LVT+ + ++G F +P T GCG N
Sbjct: 107 -STFKEIRCDTHDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRN- 164
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL-FLGDGKVPS 194
N G P AGV+GL RG S+++Q+ G ++ +C G + F + V
Sbjct: 165 -NSG--FKPGFAGVVGLDRGPKSLITQMG--GEYPGLMSYCFAGKGTSKINFGANAIVAG 219
Query: 195 SGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDLTLIFDSGASYAYFTSR 247
GV T + +A Y L G + G ++ DSG++ YF
Sbjct: 220 DGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGSTLTYFPES 279
Query: 248 VYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVR 307
Y +V + ++ T ++ D +C+ + + F + + F+ +
Sbjct: 280 -YCNLVRKAVEQVV-TAVRFPRSDI---LCY------YSKTIDIFPVITMHFSGGAD--- 325
Query: 308 LVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPE 366
LV+ V S V CL I+ S E I G + +V YD+ + +KP
Sbjct: 326 LVLDKYNMYVASNTGGVFCLAIICNSPI---EEAIFGNRAQNNFLVGYDSSSLLVSFKPT 382
Query: 367 DCNTL 371
+C+ L
Sbjct: 383 NCSAL 387
>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
Japonica Group]
gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
Length = 551
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 96/375 (25%), Positives = 144/375 (38%), Gaps = 36/375 (9%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTK-----PPEKQYKPHKNIVPC 69
+ VG P F DTGSDL WV CD AP T PE +
Sbjct: 107 AEVAVGTPNTTFLVALDTGSDLFWVPCDCKQCAPLGNLTAVDGGGGPELRQYSPSKSSTS 166
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSN-------GS 121
CA+ P C C Y + Y SS G LV D+ L G+
Sbjct: 167 KTVTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGELVEDVLYLTREKGAAAAAAGA 226
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQN 180
P+ FGCG Q L G++GLG ++S+ S L G+++ N C ++
Sbjct: 227 AVRTPVVFGCGQVQTG-SFLDGAAADGLMGLGMEKVSVPSILASTGVVKSNSFSMCFSKD 285
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----IFD 236
G G + GD S+ + TP + S + I + S G K+L L I D
Sbjct: 286 GLGRINFGD--TGSADQSETPFIVKSTHSYYNI------SITSMSVGDKNLPLGFYAIAD 337
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
SG S+ Y Y + + + ++ P + + T P+
Sbjct: 338 SGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFEYCYSLSPDQTTVELPV- 396
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGEN---NIIGEIFMQDKMVI 353
+S T +V V P Y + + N + I+ A + + +IIG+ FM V+
Sbjct: 397 VSLTTNGGAVFPVTSP-VYPIAAQMTNGEIRIIGYCLAVIKSDLPIDIIGQNFMTGLKVV 455
Query: 354 YDNEKQRIGWKPEDC 368
++ EK +GW+ DC
Sbjct: 456 FNREKSVLGWQKFDC 470
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 61/167 (36%), Positives = 81/167 (48%), Gaps = 22/167 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P FDTGSDLTW QC+ PC G C E ++ P + V CS+
Sbjct: 134 YIVTIGIGTPKHDISLMFDTGSDLTWTQCE-PCLGSCYSQKEPKFNPSSSSSYHNVSCSS 192
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C NP C N C Y I YGDG ++G L + F L +N V + + FGC
Sbjct: 193 PMCG-----NPESCSASN--CLYGIGYGDGSVTVGFLAKEKFTL--TNSDVLD-DIYFGC 242
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
G N N G +AG+LGLG G+ S L+ N+ +C G
Sbjct: 243 GEN--NKGVF--IGSAGILGLGPGKFSF--PLQTTTTYNNIFSYCCG 283
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 99/388 (25%), Positives = 156/388 (40%), Gaps = 63/388 (16%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD-----APCTGCTKPPEKQYKPHKNIVPCSNPR 73
++L +G P + + DTGS L+W+QC P T + + +PCS+P
Sbjct: 83 LSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPL 142
Query: 74 CAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C + P C N C Y Y DG + G LV + F FSN PL GC
Sbjct: 143 CKPRIPDFTLPTSCD-SNRLCHYSYFYADGTFAEGNLVKEKFT--FSNSQT-TPPLILGC 198
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGV 184
D G+LG+ GR+S +SQ + + +CI G G
Sbjct: 199 AKES--------TDVKGILGMNLGRLSFISQAKI-----SKFSYCIPTRSNRPGLASTGS 245
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYS----GKSCGLKDLTL------- 233
+LG+ S G + +L + L P L Y+ G G K L +
Sbjct: 246 FYLGENP-NSRGFKYVSLLTFPQSQRMPNLDP--LAYTVPLLGIRIGQKRLNIPSSVFRP 302
Query: 234 --------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA-PDDKTLPICWRGPFKA 284
+ DSG+ + + Y ++ I+R L+G+ LK T +C+ G +
Sbjct: 303 DAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVR-LVGSRLKKGYVYGSTADMCFDGNHQM 361
Query: 285 LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVG-ENNIIG 343
+ + L F V ++V + LV G C+GI G + +G +NIIG
Sbjct: 362 V--IGRLIGDLVFEFG---RGVEILVEKQRLLVNVGGGIHCVGI--GRSSMLGAASNIIG 414
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
+ Q+ V +D +R+G+ +C+ L
Sbjct: 415 NVHQQNLWVEFDVANRRVGFSKAECSRL 442
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 101/397 (25%), Positives = 146/397 (36%), Gaps = 58/397 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP-----EKQYKPHKNIVPCSN 71
V + VG PP+ DTGS+L+W+ C+ G PP VPC +
Sbjct: 55 LTVPVAVGTPPQNVTMVLDTGSELSWLLCN----GSYAPPLTPAFNASGSSSYGAVPCPS 110
Query: 72 PRCA--ALHWPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
C P PP C P++ C + Y D S+ G L TD F L V
Sbjct: 111 TACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTF-LLTGGAPPVAVGAY 169
Query: 129 FGC--------GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-Q 179
FGC N + G G+LG+ RG +S V+Q G R +CI
Sbjct: 170 FGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQT---GTRR--FAYCIAPG 224
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGL 228
G GVL LGD + + +TP+++ S L ++ I LL KS
Sbjct: 225 EGPGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLT 284
Query: 229 KDLT----LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK-------TLPIC 277
D T + DSG + + + Y + + L LAP + C
Sbjct: 285 PDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQ---ARLLLAPLGEPGFVFQGAFDAC 341
Query: 278 WRGPFKALGQVTEYFKPLALSFTNRRNSVR-----LVVPPEAYLVISGRKNVCLGILNGS 332
+RGP + + + L +V +VP E CL N
Sbjct: 342 FRGPEARVAAASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSD 401
Query: 333 EAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
A + +IG Q+ V YD + R+G+ P C+
Sbjct: 402 MAGM-SAYVIGHHHQQNVWVEYDLQNGRVGFAPARCD 437
>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 627
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 98/368 (26%), Positives = 150/368 (40%), Gaps = 40/368 (10%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----VPC 69
+ VG P F DTGSDL W+ CD AP +G ++ YKP ++ +PC
Sbjct: 212 VDVGTPNTSFMVALDTGSDLFWIPCDCIECAPLSGYHGSLDRDLGIYKPAESTTSRHLPC 271
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPL--RFSNGSVFNVP 126
S+ C C + C Y +Y + +S G LV D+ L R S+ V
Sbjct: 272 SHELCLLGS-----DCTNQKQPCPYNTKYLQENTTSSGLLVEDILHLDSRESHAPV-KAS 325
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
+ GCG Q L G+LGLG IS+ S L GL+RN C ++ G +F
Sbjct: 326 VIIGCGRKQSG-SYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFTKDS-GRIF 383
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTS 246
GD V S TP + L+ Y + + K I DSG S+
Sbjct: 384 FGDQGV--STQQSTPFVPLYGKLQTYTVNVDKSCVGHKCFESTSFQAIVDSGTSFTALPL 441
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSV 306
+Y+ + I D +L + + C+ + V + L+F + S
Sbjct: 442 DIYKAVA--IEFDKQVNASRLPQEATSFDYCYSASPLVMPDVPT----VTLTFAGNK-SF 494
Query: 307 RLVVPPEAYLVISGRKNV---CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
+ V P +L+ V CL ++ E +G II + F+ V++D E ++GW
Sbjct: 495 QPVNP--TFLLHDEEGAVAGFCLAVVQSPEP-IG---IIAQNFLLGYHVVFDRENMKLGW 548
Query: 364 KPEDCNTL 371
+C+ L
Sbjct: 549 YRSECHDL 556
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 101/377 (26%), Positives = 152/377 (40%), Gaps = 51/377 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSN 71
YF + ++VG PP+ DTGSD+ W+QC APC C ++ + P+K + + C++
Sbjct: 37 YF-IRVSVGTPPRGMYLVMDTGSDILWLQC-APCVSCYHQCDEVFDPYKSSTYSTLGCNS 94
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS---VFN-VPL 127
+C L C ++C Y+++YGDG S G TD L ++G V N +PL
Sbjct: 95 RQCLNLDVGG---CV--GNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIPL 149
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR---NVIGHCIGQNGRGV 184
GCG++ N G LG+G +S +Q+ R + G R
Sbjct: 150 --GCGHD--NEGYFVGAAGLLG--LGKGPLSFPNQINSENGGRFSYCLTGRDTDSTERSS 203
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC----------GLKDLTLI 234
L GD VP +GV +TP N Y L + G L + +I
Sbjct: 204 LIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVI 263
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG S + Y + + L L + C+ L ++ P
Sbjct: 264 IDSGTSVTRLQNAAYASLREAFRAGT--SDLVLTTEFSLFDTCYN-----LSDLSSVDVP 316
Query: 295 -LALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
+ L F + L +P YLV + CL A +IIG I Q V
Sbjct: 317 TVTLHF---QGGADLKLPASNYLVPVDNSSTFCLAF-----AGTTGPSIIGNIQQQGFRV 368
Query: 353 IYDNEKQRIGWKPEDCN 369
IYDN ++G+ P C+
Sbjct: 369 IYDNLHNQVGFVPSQCD 385
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 104/378 (27%), Positives = 155/378 (41%), Gaps = 50/378 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF + VG P DTGSD+ W+QC APC C + + P ++ V CS
Sbjct: 142 YF-TKIGVGTPATPALMVLDTGSDVVWLQC-APCRRCYDQSGQVFDPRRSRSYGAVGCSA 199
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L C C Y++ YGDG + G T+ L F+ G+ + GC
Sbjct: 200 PLCRRLDSGG---CDLRRKACLYQVAYGDGSVTAGDFATET--LTFAGGARV-ARIALGC 253
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYG------LIRNVIGHCIGQNGRGV 184
G++ N G AG+LGLGRG +S +Q+ R YG L+ + V
Sbjct: 254 GHD--NEGLFV--AAAGLLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTV 309
Query: 185 LFLGDGKVPSS-GVAWTPMLQNSADLKHYILGPAELLYSG-KSCGLKDLTL--------- 233
F G G V S+ ++TPM++N Y + + G + G+ D L
Sbjct: 310 TF-GSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRG 368
Query: 234 --IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL-PICWRGPFKALGQVTE 290
I DSG S Y + G L+L+P +L C+ G+
Sbjct: 369 GVIVDSGTSVTRLARPAYSALRDAFRAAAAG--LRLSPGGFSLFDTCY----DLSGRKVV 422
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDK 350
+++ F +PPE YL+ K G++ V +IIG I Q
Sbjct: 423 KVPTVSMHFAG---GAEAALPPENYLIPVDSKGTFCFAFAGTDGGV---SIIGNIQQQGF 476
Query: 351 MVIYDNEKQRIGWKPEDC 368
V++D + QR+G+ P+ C
Sbjct: 477 RVVFDGDGQRVGFVPKGC 494
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 92/365 (25%), Positives = 142/365 (38%), Gaps = 40/365 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ + +G P + DTGS LTW+QC C + + P + V CS
Sbjct: 122 YVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQ 181
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+C+ L NP C N C Y+ YGD S+G L D + F + S+ N +G
Sbjct: 182 QCSDLPSATLNPSACSSSN-VCIYQASYGDSSFSVGYLSKDT--VSFGSTSLPN--FYYG 236
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG Q N G +AG++GL R ++S++ QL + +C+ +
Sbjct: 237 CG--QDNEGLFG--RSAGLIGLARNKLSLLYQLAPS--LGYSFTYCLPSSSSSGYLSLGS 290
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASYAYFT 245
P ++TPM+ +S D Y + + + +G S L I DSG
Sbjct: 291 YNPGQ-YSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLP 349
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP-LALSFTNRRN 304
+ VY + + + GT A L C++ GQ + P + +SF
Sbjct: 350 TSVYSALSKAVAAAMKGT--SRASAYSILDTCFK------GQASRVSAPAVTMSFA---G 398
Query: 305 SVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
L + + LV CL A IIG Q V+YD + RIG+
Sbjct: 399 GAALKLSAQNLLVDVDDSTTCLAFAPARSAA-----IIGNTQQQTFSVVYDVKSSRIGFA 453
Query: 365 PEDCN 369
C+
Sbjct: 454 AGGCS 458
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 97/416 (23%), Positives = 153/416 (36%), Gaps = 84/416 (20%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQC---DAPCTGCTKPPEKQYKPHKNIVP----- 68
+ + L +G PP+ DTGSDLTWV C C C K P
Sbjct: 83 YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSST 142
Query: 69 -----CSNPRCAALHWPNPP-----------------RCKHPNDQCDYEIEYGDGGSSIG 106
C++ C +H + P C P Y YG+GG G
Sbjct: 143 SFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAY--TYGEGGLISG 200
Query: 107 ALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREY 166
L D+ R + F +FGC + + + G+ G GRG +S+ SQL
Sbjct: 201 ILTRDILKARTRDVPRF----SFGCVTSTYR-------EPIGIAGFGRGLLSLPSQL--- 246
Query: 167 GLIRNVIGHCI-------GQNGRGVLFLGDGKVP---SSGVAWTPMLQNSADLKHYILGP 216
G + HC N L LG + + + +TPML Y +G
Sbjct: 247 GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPNSYYIG- 305
Query: 217 AELLYSGKSCGLKDLTL-------------IFDSGASYAYFTSRVYQEIVSLIMRDLIGT 263
E + G + + L + DSG +Y + Y ++++ ++ I
Sbjct: 306 LESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLT-TLQSTITY 364
Query: 264 PLKLAPDDKT-LPICWRGP-----FKAL-GQVTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
P + +T +C++ P +L V F + F N N+ L+ ++
Sbjct: 365 PRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLN--NATLLLPQGNSFY 422
Query: 317 VIS----GRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+S G CL N + + G + G Q+ V+YD EK+RIG++ DC
Sbjct: 423 AMSAPSDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 478
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 97/379 (25%), Positives = 146/379 (38%), Gaps = 60/379 (15%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF L VG PP+ DTGSD+ W+QC +PC C + + P+K+ +PCS+
Sbjct: 110 YF-TRLGVGTPPRYLYMVLDTGSDVVWLQC-SPCRKCYSQSDPIFNPYKSKSFAGIPCSS 167
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L + C C Y++ YGDG + G T+ L F + V L GC
Sbjct: 168 PLCRRL---DSSGCSTRRHTCLYQVSYGDGSFTTGDFATE--TLTFRGNKIAKVAL--GC 220
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFL 187
G+ HN G LG GR + +R + +C+ + +
Sbjct: 221 GH--HNEGLFVGAAGLLGLGRGRLSFPSQTGIR----FNHKFSYCLVDRSASSKPSSMVF 274
Query: 188 GDGKVPSSGVAWTPMLQN-SADLKHY------------ILGPAELLYSGKSCGLKDLTLI 234
GD + S +TP+++N D +Y + G + L+ S G + +I
Sbjct: 275 GDAAI-SRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAG--NGGVI 331
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLI---GTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
DSG S T Y +RD LK P+ C+ GQ +
Sbjct: 332 IDSGTSVTRLTRPAYTA-----LRDAFRVGARHLKRGPEFSLFDTCY----DLSGQSSVK 382
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDK 350
+ L F + +P YL+ + + C + +IIG I Q
Sbjct: 383 VPTVVLHF----RGADMALPATNYLIPVDENGSFCFAF----AGTISGLSIIGNIQQQGF 434
Query: 351 MVIYDNEKQRIGWKPEDCN 369
V+YD RIG+ P C
Sbjct: 435 RVVYDLAGSRIGFAPRGCT 453
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 97/384 (25%), Positives = 161/384 (41%), Gaps = 65/384 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ N T+G PP+ D +L W QC PC C + + P K+ +PC +
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQC-TPCQPCFEQDLPLFDPTKSSTFRGLPCGSH 115
Query: 73 RCAALHWPNPPRCKHPNDQCDYE--IEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C ++ P R +D C YE + GD G G TD F + + + L FG
Sbjct: 116 LCESI--PESSR-NCTSDVCIYEAPTKAGDTGGKAG---TDTFAIGAAKET-----LGFG 164
Query: 131 CGYNQHN-----PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
C GP +G++GLGR S+V+Q+ +C+ G L
Sbjct: 165 CVVMTDKRLKTIGGP------SGIVGLGRTPWSLVTQMN-----VTAFSYCLAGKSSGAL 213
Query: 186 FLGDGKVPSSGV--AWTP-MLQNSADLK------HYILGPAELLYSG---KSCGLKDLTL 233
FLG +G + TP +++ SA +Y++ A + G ++ T+
Sbjct: 214 FLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAASSSGSTV 273
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
+ D+ + +Y Y+ + + + P+ P K +C+ P G E
Sbjct: 274 LLDTVSRASYLADGAYKALKKALTAAVGVQPVASPP--KPYDLCF--PKAVAGDAPE--- 326
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEA------EVGENNIIGEIFM 347
L +F L VPP YL+ SG VCL I GS A E+ +I+G +
Sbjct: 327 -LVFTF---DGGAALTVPPANYLLASGNGTVCLTI--GSSASLNLTGELEGASILGSLQQ 380
Query: 348 QDKMVIYDNEKQRIGWKPEDCNTL 371
++ V++D +++ + +KP DC++L
Sbjct: 381 ENVHVLFDLKEETLSFKPADCSSL 404
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 98/409 (23%), Positives = 150/409 (36%), Gaps = 71/409 (17%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK---------QYKPHKNI------- 66
+G PP+ + DTGSDL W QC C P Q P+ N
Sbjct: 84 IGDPPQPAEAVVDTGSDLVWTQCST----CRLPAAAAAGGGGCFPQNLPYYNFSLSRTAR 139
Query: 67 -VPCSNPRCAALH-WPNPPRCKH----PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG 120
VPC + A P C +D C YG G ++G L TD F S+
Sbjct: 140 AVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYG-AGVALGVLGTDAFTFPSSS- 197
Query: 121 SVFNVPLTFGC-GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
+V L FGC + +PG L+ +G++GLGRG +S+VSQL + +
Sbjct: 198 ---SVTLAFGCVSQTRISPGALN--GASGIIGLGRGALSLVSQLNATEFSYCLTPYFRDT 252
Query: 180 NGRGVLFLGDGKVPSSG------------VAWTPMLQNSAD----------LKHYILGPA 217
LF+GDG++ V P +N D L G A
Sbjct: 253 VSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGNA 312
Query: 218 ELLYSGKSCGLKDLT-------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD 270
+ + L++ + DSG+ + ++ + + R L G+ + P
Sbjct: 313 TVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPP 372
Query: 271 DK---TLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVR-LVVPPEAYLVISGRKNVCL 326
K L +C PL L F + R LV+P E Y C+
Sbjct: 373 AKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEASTWCM 432
Query: 327 GILNGSEAEV----GENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
+++ + E IIG QD V+YD + ++P +C+ +
Sbjct: 433 AVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCSAV 481
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 146/365 (40%), Gaps = 46/365 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
+ + L +G PP + DTGS+ W QC PC C + P K+
Sbjct: 65 YLMKLQIGTPPFEIEAVLDTGSEHIWTQC-LPCVHCYNQTAPIFDPSKS----------- 112
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGCGYNQ 135
RC + C YE+ YG + G LVT+ + ++G F +P T GCG N
Sbjct: 113 -STFKEIRCDTHDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRN- 170
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL-FLGDGKVPS 194
N G P AGV+GL RG S+++Q+ G ++ +C G + F + V
Sbjct: 171 -NSG--FKPGFAGVVGLDRGPKSLITQMG--GEYPGLMSYCFAGKGTSKINFGANAIVAG 225
Query: 195 SGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDLTLIFDSGASYAYFTSR 247
GV T + +A Y L G + G ++ DSG++ YF
Sbjct: 226 DGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGSTLTYFPES 285
Query: 248 VYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVR 307
Y +V + ++ T ++ D +C+ + + F + + F+ +
Sbjct: 286 -YCNLVRKAVEQVV-TAVRFPRSDI---LCY------YSKTIDIFPVITMHFSGGAD--- 331
Query: 308 LVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPE 366
LV+ V S V CL I+ S E I G + +V YD+ + +KP
Sbjct: 332 LVLDKYNMYVASNTGGVFCLAIICNSPI---EEAIFGNRAQNNFLVGYDSSSLLVSFKPT 388
Query: 367 DCNTL 371
+C+ L
Sbjct: 389 NCSAL 393
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 101/397 (25%), Positives = 146/397 (36%), Gaps = 58/397 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP-----EKQYKPHKNIVPCSN 71
V + VG PP+ DTGS+L+W+ C+ G PP VPC +
Sbjct: 55 LTVPVAVGTPPQNVTMVLDTGSELSWLLCN----GSYAPPLTPAFNASGSSSYGAVPCPS 110
Query: 72 PRCA--ALHWPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
C P PP C P++ C + Y D S+ G L TD F L V
Sbjct: 111 TACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTF-LLTGGAPPVAVGAY 169
Query: 129 FGC--------GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-Q 179
FGC N + G G+LG+ RG +S V+Q G R +CI
Sbjct: 170 FGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQT---GTRR--FAYCIAPG 224
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGL 228
G GVL LGD + + +TP+++ S L ++ I LL KS
Sbjct: 225 EGPGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLT 284
Query: 229 KDLT----LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK-------TLPIC 277
D T + DSG + + + Y + + L LAP + C
Sbjct: 285 PDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQ---ARLLLAPLGEPGFVFQGAFDAC 341
Query: 278 WRGPFKALGQVTEYFKPLALSFTNRRNSVR-----LVVPPEAYLVISGRKNVCLGILNGS 332
+RGP + + + L +V +VP E CL N
Sbjct: 342 FRGPEARVAAASGLLPVVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSD 401
Query: 333 EAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
A + +IG Q+ V YD + R+G+ P C+
Sbjct: 402 MAGM-SAYVIGHHHQQNVWVEYDLQNGRVGFAPARCD 437
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 144/380 (37%), Gaps = 64/380 (16%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF + VG PP+ DTGSD+ W+QC APC C + + P K+ + C +
Sbjct: 126 YF-TRIGVGTPPRYVYMVLDTGSDIVWIQC-APCKRCYAQSDPVFDPRKSRSFASIACRS 183
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C H + P C C Y++ YGDG + G T+ L F V V L GC
Sbjct: 184 PLC---HRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTE--TLTFRRTRVARVAL--GC 236
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFL 187
G++ N G LG GR + R + +C+ + +
Sbjct: 237 GHD--NEGLFVGAAGLLGLGRGRLSFPSQTGRR----FNHKFSYCLVDRSASSKPSSMVF 290
Query: 188 GDGKVPSSGVAWTPMLQN-SADLKHYILGPAELL--------YSGKSCGLKDLT------ 232
GD V S +TP++ N D +Y+ ELL G + L L
Sbjct: 291 GDSAV-SRTARFTPLVSNPKLDTFYYV----ELLGISVGGTRVPGITASLFKLDQTGNGG 345
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLI---GTPLKLAPDDKTLPICWRGPFKALGQVT 289
+I DSG S T Y + RD + LK AP C F G+
Sbjct: 346 VIIDSGTSVTRLTRPAY-----IAFRDAFRAGASNLKRAPQFSLFDTC----FDLSGKTE 396
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
+ L F + +P YL+ + N CL +G +IIG I Q
Sbjct: 397 VKVPTVVLHF----RGADVSLPASNYLIPVDTSGNFCLAF----AGTMGGLSIIGNIQQQ 448
Query: 349 DKMVIYDNEKQRIGWKPEDC 368
V+YD R+G+ P C
Sbjct: 449 GFRVVYDLAGSRVGFAPHGC 468
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 145/383 (37%), Gaps = 55/383 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
V L +G PP+ DTGS L+W+QC PP + P + ++PC++P
Sbjct: 88 LVVTLPIGTPPQPQQMVLDTGSQLSWIQCHN-----KTPPTASFDPSLSSSFYVLPCTHP 142
Query: 73 RCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C + P C N C Y Y DG + G LV + L FS S PL G
Sbjct: 143 LCKPRVPDFTLPTTCDQ-NRLCHYSYFYADGTYAEGNLVRE--KLAFSP-SQTTPPLILG 198
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR---GVLFL 187
C D G+LG+ GR+S Q + V N G +L
Sbjct: 199 CSSESR--------DARGILGMNLGRLSFPFQAKVTKFSYCVPTRQPANNNNFPTGSFYL 250
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYS----GKSCGLKDLTL---------- 233
G+ S+ + ML + L P L Y+ G G + L +
Sbjct: 251 GNNP-NSARFRYVSMLTFPQSQRMPNLDP--LAYTVPMQGIRIGGRKLNIPPSVFRPNAG 307
Query: 234 -----IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
+ DSG+ + + Y + I+R L K +C+ G +G++
Sbjct: 308 GSGQTMVDSGSEFTFLVDVAYDRVREEIIRVLGPRVKKGYVYGGVADMCFDGNAMEIGRL 367
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
+A F V +VVP E L G C+GI SE +NIIG Q
Sbjct: 368 ---LGDVAFEF---EKGVEIVVPKERVLADVGGGVHCVGI-GRSERLGAASNIIGNFHQQ 420
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
+ V +D +RIG+ DC+ L
Sbjct: 421 NLWVEFDLANRRIGFGVADCSRL 443
>gi|213998796|gb|ACJ60765.1| nucellin [Hordeum marinum subsp. gussoneanum]
Length = 133
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/122 (40%), Positives = 69/122 (56%), Gaps = 6/122 (4%)
Query: 142 SPP-DTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVLFLGDGKVPSSGVAW 199
SPP G+LGLG G+ +QL+ +I NVIGHC+ G+GVL++G+ PS GV W
Sbjct: 4 SPPLPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGNFNPPSRGVTW 63
Query: 200 TPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMR 258
PM ++S +Y G AELL + G +FDSG++Y S++Y EIV +
Sbjct: 64 VPMRESSF---YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTLVPSQIYNEIVPKVRG 120
Query: 259 DL 260
L
Sbjct: 121 TL 122
>gi|359496966|ref|XP_002269916.2| PREDICTED: aspartic proteinase-like protein 1-like, partial [Vitis
vinifera]
Length = 294
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/224 (29%), Positives = 99/224 (44%), Gaps = 21/224 (9%)
Query: 148 GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSA 207
G+ GLG G IS+ S L + GL+ + C G +G G + GD SSG TP + +
Sbjct: 17 GLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG--SSGQEETPFNPSKS 74
Query: 208 DLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEI---VSLIMRDLIGTP 264
L Y + ++ G S L + IFDSG S+ Y Y I +L +D
Sbjct: 75 QL-LYNISITQISVGGTSADL-NFDAIFDSGTSFTYLNDPAYTSISESFNLRAKD----- 127
Query: 265 LKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV 324
K + D LP + EY P+ ++ T + V P + I G
Sbjct: 128 -KRSSSDSDLPFEYCYDISEQQTTVEY--PI-VNLTMKGGDNFFVTDPIVIVSIQGGYVY 183
Query: 325 CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
CLG++ + G+ NIIG+ FM +I+D EK +GW +C
Sbjct: 184 CLGVV-----KSGDINIIGQNFMTGYRIIFDREKMVLGWTKSNC 222
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 95/377 (25%), Positives = 151/377 (40%), Gaps = 48/377 (12%)
Query: 26 PPKLFDFDFDTGSDLTWVQCDA-----PCTGCTKPPEKQYKPHKNIVPCSNPRC--AALH 78
PP+ DTGS+L+W++C+ P Y P +PCS+P C
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSNPNPVNNFDPTRSSSYSP----IPCSSPTCRTRTRD 137
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNP 138
+ P C + C + Y D SS G L ++F F N S + L FGC +
Sbjct: 138 FLIPASCD-SDKLCHATLSYADASSSEGNLAAEIF--HFGN-STNDSNLIFGCMGSVSGS 193
Query: 139 GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR--GVLFLGDGKVP-SS 195
P T G+LG+ RG +S +SQ+ G + +CI G L LGD +
Sbjct: 194 DPEEDTKTTGLLGMNRGSLSFISQM---GFPK--FSYCISGTDDFPGFLLLGDSNFTWLT 248
Query: 196 GVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGLKDLT----LIFDSGAS 240
+ +TP+++ S L ++ I +LL KS + D T + DSG
Sbjct: 249 PLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQ 308
Query: 241 YAYFTSRVYQEIVSLIMRDLIGT-PLKLAPD---DKTLPICWR-GPFKALGQVTEYFKPL 295
+ + VY + S + G + PD T+ +C+R P + + +
Sbjct: 309 FTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTV 368
Query: 296 ALSFTNRRNSVRLVVPPEAYLV---ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
+L F +V P Y V G +V S+ E +IG Q+ +
Sbjct: 369 SLVFEGAEIAVS--GQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWI 426
Query: 353 IYDNEKQRIGWKPEDCN 369
+D ++ RIG P +C+
Sbjct: 427 EFDLQRSRIGLAPVECD 443
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 92/361 (25%), Positives = 141/361 (39%), Gaps = 40/361 (11%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAA 76
+ +G P + DTGS LTW+QC C + + P + V CS +C+
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSD 60
Query: 77 LHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYN 134
L NP C N C Y+ YGD S+G L D + F + S+ N +GCG
Sbjct: 61 LPSATLNPSACSSSN-VCIYQASYGDSSFSVGYLSKD--TVSFGSTSLPN--FYYGCG-- 113
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPS 194
Q N G +AG++GL R ++S++ QL + +C+ + P
Sbjct: 114 QDNEGLFG--RSAGLIGLARNKLSLLYQLAPS--LGYSFTYCLPSSSSSGYLSLGSYNPG 169
Query: 195 SGVAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASYAYFTSRVY 249
++TPM+ +S D Y + + + +G S L I DSG + VY
Sbjct: 170 Q-YSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTSVY 228
Query: 250 QEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP-LALSFTNRRNSVRL 308
+ + + GT A L C++ GQ + P + +SF L
Sbjct: 229 SALSKAVAAAMKGT--SRASAYSILDTCFK------GQASRVSAPAVTMSFA---GGAAL 277
Query: 309 VVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+ + LV CL A IIG Q V+YD + RIG+ C
Sbjct: 278 KLSAQNLLVDVDDSTTCLAFAPARSAA-----IIGNTQQQTFSVVYDVKSSRIGFAAGGC 332
Query: 369 N 369
+
Sbjct: 333 S 333
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 66/246 (26%), Positives = 109/246 (44%), Gaps = 24/246 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V + G P + + DTGS L+W+QC C + + P + + C++
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 177
Query: 73 RCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTF 129
+C++L N P C+ ++ C Y YGD S+G L DL L S +P +
Sbjct: 178 QCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ----TLPGFVY 233
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI-GQNGRGVLFL 187
GCG Q + G AG+LGLGR ++S++ Q+ ++G +C+ + G G L +
Sbjct: 234 GCG--QDSDGLFG--RAAGILGLGRNKLSMLGQVSSKFGY---AFSYCLPTRGGGGFLSI 286
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIFDSGASYAY 243
G + S +TPM + + Y L + G++ G+ + I DSG
Sbjct: 287 GKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVITR 346
Query: 244 FTSRVY 249
VY
Sbjct: 347 LPMSVY 352
>gi|238012174|gb|ACR37122.1| unknown [Zea mays]
Length = 84
Score = 79.3 bits (194), Expect = 3e-12, Method: Composition-based stats.
Identities = 36/72 (50%), Positives = 53/72 (73%), Gaps = 2/72 (2%)
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
LSF + +N+ + +PPE YL+++ NVCLGIL+G+ A++ N+IG+I MQD+MVIYDN
Sbjct: 3 LSFASAKNAA-MEIPPENYLIVTKNGNVCLGILDGTAAKL-SFNVIGDITMQDQMVIYDN 60
Query: 357 EKQRIGWKPEDC 368
EK ++GW C
Sbjct: 61 EKSQLGWARGAC 72
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 88/374 (23%), Positives = 149/374 (39%), Gaps = 43/374 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
F VN ++G+P DTGS++ WV+C APC CT+ P K+ +PC+N
Sbjct: 99 FLVNFSMGQPATPQLAIMDTGSNILWVRC-APCKRCTQQNGPLLDPSKSSTYASLPCTNT 157
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C H+ C N QC Y + Y G SS G L T+ S+ V VP + FGC
Sbjct: 158 MC---HYAPSAYCNRLN-QCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVFGC 213
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-----QNGRGVLF 186
H G GV GLG+G S V+++ + +C+G G L
Sbjct: 214 ---SHENGDYKDRRFTGVFGLGKGITSFVTRM------GSKFSYCLGNIADPHYGYNQLV 264
Query: 187 LGDGKVPSSGVAWTPMLQNS---ADLKHYILGPAELLYSGKSCGLK--DLTLIFDSGASY 241
G+ K G + + N L+ +G L + +K + + + DSG +
Sbjct: 265 FGE-KANFEGYSTPLKVVNGHYYVTLEGISVGEKRLDIDSTAFSMKGNEKSALIDSGTAL 323
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL-GQVTEYFKPLALSFT 300
+ ++ + + + + L G + WRG F G V++ +
Sbjct: 324 TWLAESAFRALDNEVRQLLDGVLMPF----------WRGSFACYKGTVSQDLIGFPVVTF 373
Query: 301 NRRNSVRLVVPPEAYLVISGRKNVCLGILNGSE--AEVGENNIIGEIFMQDKMVIYDNEK 358
+ L + E+ + +C+ + S + ++IG + Q + YD
Sbjct: 374 HFSGGADLDLDTESMFYQATPDILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNS 433
Query: 359 QRIGWKPEDCNTLL 372
++ ++ DC L+
Sbjct: 434 NKLFFQRIDCQLLV 447
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 98/385 (25%), Positives = 152/385 (39%), Gaps = 63/385 (16%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNPRC 74
V+L +G PP+ DTGS L+W+QC PP + P +++PC++P C
Sbjct: 82 VSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHPLC 141
Query: 75 AAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+ P C N C Y Y DG + G+LV + S + PL GC
Sbjct: 142 KPRIPDFTLPTTCDQ-NRLCHYSYFYADGTYAEGSLVREKITFSSSQST---PPLILGCA 197
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGVL 185
+ D G+LG+ GR S SQ + + +C+ G + G
Sbjct: 198 E--------ASTDEKGILGMNLGRRSFASQAKI-----SKFSYCVPTRQARAGLSSTGSF 244
Query: 186 FLGDGKVPSSG-------VAWTPM--------LQNSADLKHYILGPAEL-----LYSGKS 225
+LG+ P+SG + +TP L + ++ +G A L L+
Sbjct: 245 YLGNN--PNSGRFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDP 302
Query: 226 CGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA-PDDKTLPICWRGPFKA 284
G I DSG+ + Y Y ++ ++R L+G LK +C+ G
Sbjct: 303 SGAGQ--TIIDSGSEFTYLVDEAYNKVREEVVR-LVGPKLKKGYVYGGVSDMCFDGNPME 359
Query: 285 LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGE 344
+G++ + F V +V+ L G C+GI SE +NIIG
Sbjct: 360 IGRL---IGNMVFEF---EKGVEIVIDKWRVLADVGGGVHCIGI-GRSEMLGAASNIIGN 412
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCN 369
Q+ V YD +RIG DC+
Sbjct: 413 FHQQNLWVEYDLANRRIGLGKADCS 437
>gi|186510920|ref|NP_190702.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645260|gb|AEE78781.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 530
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 86/370 (23%), Positives = 141/370 (38%), Gaps = 26/370 (7%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG---------CTKPPEKQYKPHK 64
F ++A N+++G P F DTGSDL W+ C+ T P Y P+
Sbjct: 101 FLHYA-NVSLGTPATWFLVALDTGSDLFWLPCNCGTTCIHDLKDARFSESVPLNLYTPNA 159
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV-- 122
+ S+ RC+ +C P C Y+I + G L+ D+ L + +
Sbjct: 160 STTS-SSIRCSDKRCFGSGKCSSPESICPYQIALSSNTVTTGTLLQDVLHLVTEDEDLKP 218
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
N +T GCG NQ + GVLGL S+ S L + + N C G+
Sbjct: 219 VNANVTLGCGQNQTGAFQ-TDIAVNGVLGLSMKEYSVPSLLAKANITANSFSMCFGRIIS 277
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
V + G + TP++ Y + + G + L +FD+G+S+
Sbjct: 278 VVGRISFGDKGYTDQEETPLVSLETSTA-YGVNVTGVSVGGVPVDVP-LFALFDTGSSFT 335
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNR 302
Y + + DL+ + D C+ + L + +
Sbjct: 336 LLLESAYG-VFTKAFDDLMEDKRRPVDPDFPFEFCYDLREEHLNSDARPRHMQSKCYNPC 394
Query: 303 RNSVRLVVPPEAYLVIS----GRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
R+ R + ++ +S G K CLGIL NIIG+ M +++D E+
Sbjct: 395 RDDFRWRIQNDSQESVSYSNEGTKMYCLGILKSINL-----NIIGQNLMSGHRIVFDRER 449
Query: 359 QRIGWKPEDC 368
+GWK +C
Sbjct: 450 MILGWKQSNC 459
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 103/377 (27%), Positives = 148/377 (39%), Gaps = 58/377 (15%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF + VG P + DTGSD+ W+QC APC C + + P K+ +PC
Sbjct: 118 YF-TRIGVGTPARYVYMVLDTGSDVVWLQC-APCRKCYTQTDHVFDPTKSRTYAGIPCGA 175
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L + P C + N C Y++ YGDG + G T+ L F V V L GC
Sbjct: 176 PLCRRL---DSPGCSNKNKVCQYQVSYGDGSFTFGDFSTE--TLTFRRNRVTRVAL--GC 228
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCIGQNGRGVLFLGDG 190
G++ N G + LG GR + + R + ++ V+F GD
Sbjct: 229 GHD--NEGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSYCLVDRSASAKPSSVIF-GDS 285
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELL--------YSGKSCGLKDLT------LIFD 236
V S +TP+++N Y L ELL G S L L +I D
Sbjct: 286 AV-SRTAHFTPLIKNPKLDTFYYL---ELLGISVGGAPVRGLSASLFRLDAAGNGGVIID 341
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLI---GTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
SG S T Y + +RD + LK AP+ C+ L +TE
Sbjct: 342 SGTSVTRLTRPAY-----IALRDAFRIGASHLKRAPEFSLFDTCF-----DLSGLTEVKV 391
Query: 294 P-LALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
P + L F + +P YL+ + + C + +IIG I Q
Sbjct: 392 PTVVLHF----RGADVSLPATNYLIPVDNSGSFCFAF----AGTMSGLSIIGNIQQQGFR 443
Query: 352 VIYDNEKQRIGWKPEDC 368
+ YD R+G+ P C
Sbjct: 444 ISYDLTGSRVGFAPRGC 460
>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 56/152 (36%), Positives = 76/152 (50%), Gaps = 15/152 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + F FDTGSDLTW QC+ C E + P K+ + CS+P
Sbjct: 138 YVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSP 197
Query: 73 RCAALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L N P C C Y I+YGD S+G D L ++ VFN L FG
Sbjct: 198 TCDELKSGTGNSPSCSAST--CVYGIQYGDQSYSVGFFAQD--KLALTSTDVFNNFL-FG 252
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ 162
CG Q+N G AG++GLGR +S++S+
Sbjct: 253 CG--QNNRGLFV--GVAGLIGLGRNALSLMSK 280
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 100/381 (26%), Positives = 144/381 (37%), Gaps = 69/381 (18%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKNI----VPCS 70
+ V L G P DTGSD++WVQC APC P+K + P K+ + C
Sbjct: 125 YMVTLGFGTPSVPQVLLMDTGSDVSWVQC-APCNSTECYPQKDPLFDPSKSSTYAPIACG 183
Query: 71 NPRCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
C L H+ N C QC Y +EYGDG S+ G + + F+ G
Sbjct: 184 ADACNKLGDHYRN--GCTSGGTQCGYRVEYGDGSSTRGVYSNET--ITFAPGITVK-DFH 238
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRGVLFL 187
FGCG++Q GP D G+LGLG S+V Q YG +C+ FL
Sbjct: 239 FGCGHDQR--GPSDKFD--GLLGLGGAPESLVVQTASVYG---GAFSYCLPALNSEAGFL 291
Query: 188 GDGKVPS-----SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----LIFDSG 238
G PS S +TPM D Y++ + GK + ++ DSG
Sbjct: 292 ALGVRPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAFRGGMLIDSG 351
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
Y + + + + P+ + D T C+ +
Sbjct: 352 TIVTELPETAYNALNAALRKAFAAYPMVASEDFDT---CY-------------------N 389
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNG-----------SEAEVGENNIIGEIFM 347
FT N V P L SG + L + NG S +VG IIG +
Sbjct: 390 FTGYSN----VTVPRVALTFSGGATIDLDVPNGILVKDCLAFRESGPDVGL-GIIGNVNQ 444
Query: 348 QDKMVIYDNEKQRIGWKPEDC 368
+ V+YD ++G++ C
Sbjct: 445 RTLEVLYDAGHGKVGFRAGAC 465
>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 457
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 83/344 (24%), Positives = 138/344 (40%), Gaps = 44/344 (12%)
Query: 35 DTGSDLTWVQCDAP-CTGCTKPPEKQYKPHKNIV----PCSNPRCAALHWPNPPRCKHPN 89
D+GS L W+QC P C C + + P K++ C+ C RCK PN
Sbjct: 119 DSGSSLVWLQCGTPYCRNCYRQKIPLFNPSKSVTYMKRLCNTAECRVALGDEYWRCKKPN 178
Query: 90 DQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTA 147
C Y +Y D + G + TD+ FP S + + + FGCGYN +P PP
Sbjct: 179 QICKYHEDYLDDSYTEGVISTDIFTFPEHISGFGNYTLRIIFGCGYNNSDPQHFYPP--- 235
Query: 148 GVLGLGRGRISIVSQLREYGLIRNVIGHCIG----QNGRGVLFLGDGKVPSSGVAWTPML 203
G++GL + S+V Q+ + +C+ QN +G + + G S T ++
Sbjct: 236 GLVGLTNNKASLVGQMD-----VDQFSYCVSIDTEQNLKGSMEIRFGLAASISGHSTQLV 290
Query: 204 QNSADLKHYILGPAELLY------SGKSCGLKDLT------LIFDSGASYAYFTSRVYQE 251
NS YI + +Y G + T L D+G +Y + V
Sbjct: 291 PNSDGW--YIFKNVDGIYVNEFEVEGYPAWVFKYTEGGQGGLTMDTGTTYTELHNSVMDP 348
Query: 252 IVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
++ L+ + P K + +C+ LG + L FT+ +++
Sbjct: 349 LIKLLEEHITIVPEK-DYSNSGFELCYFSD-DFLGAT---LPDIELRFTDNKDTYFSFNT 403
Query: 312 PEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
A+ +GR +CL + + +IIG ++D + YD
Sbjct: 404 RNAW-TPNGRSQMCLAMFRTNGM-----SIIGMHQLRDIKIGYD 441
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 151/382 (39%), Gaps = 54/382 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF + VG P DTGSD+ WVQC APC C + + P ++ V C
Sbjct: 129 YF-TKIGVGTPATQALMVLDTGSDVVWVQC-APCRRCYEQSGPVFDPRRSSSYGAVGCGA 186
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFG 130
C L + C C Y++ YGDG + G VT+ L F+ G+ V V L G
Sbjct: 187 ALCRRL---DSGGCDLRRGACMYQVAYGDGSVTAGDFVTET--LTFAGGARVARVAL--G 239
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYG---------LIRNVIGHCIGQN 180
CG++ N G LG G +S +Q+ R YG + G G +
Sbjct: 240 CGHD--NEGLFVAAAGLLGLGR--GGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSH 295
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK---SCGLKDLTL---- 233
+ G G V +S ++TPM++N Y + + G DL L
Sbjct: 296 RSSTVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPST 355
Query: 234 -----IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL-PICWRGPFKALGQ 287
I DSG S Y + R L+L+P +L C+ G+
Sbjct: 356 GRGGVIVDSGTSVTRLARASYSALRD-AFRAAAAGGLRLSPGGFSLFDTCY----DLGGR 410
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIF 346
+++ F +PPE YL+ + R C G++ V +IIG I
Sbjct: 411 RVVKVPTVSMHFA---GGAEAALPPENYLIPVDSRGTFCF-AFAGTDGGV---SIIGNIQ 463
Query: 347 MQDKMVIYDNEKQRIGWKPEDC 368
Q V++D + QR+G+ P+ C
Sbjct: 464 QQGFRVVFDGDGQRVGFAPKGC 485
>gi|6562286|emb|CAB62656.1| putative protein [Arabidopsis thaliana]
Length = 518
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 86/370 (23%), Positives = 141/370 (38%), Gaps = 26/370 (7%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG---------CTKPPEKQYKPHK 64
F ++A N+++G P F DTGSDL W+ C+ T P Y P+
Sbjct: 89 FLHYA-NVSLGTPATWFLVALDTGSDLFWLPCNCGTTCIHDLKDARFSESVPLNLYTPNA 147
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV-- 122
+ S+ RC+ +C P C Y+I + G L+ D+ L + +
Sbjct: 148 STTS-SSIRCSDKRCFGSGKCSSPESICPYQIALSSNTVTTGTLLQDVLHLVTEDEDLKP 206
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
N +T GCG NQ + GVLGL S+ S L + + N C G+
Sbjct: 207 VNANVTLGCGQNQTGAFQ-TDIAVNGVLGLSMKEYSVPSLLAKANITANSFSMCFGRIIS 265
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
V + G + TP++ Y + + G + L +FD+G+S+
Sbjct: 266 VVGRISFGDKGYTDQEETPLVSLETSTA-YGVNVTGVSVGGVPVDVP-LFALFDTGSSFT 323
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNR 302
Y + + DL+ + D C+ + L + +
Sbjct: 324 LLLESAYG-VFTKAFDDLMEDKRRPVDPDFPFEFCYDLREEHLNSDARPRHMQSKCYNPC 382
Query: 303 RNSVRLVVPPEAYLVIS----GRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
R+ R + ++ +S G K CLGIL NIIG+ M +++D E+
Sbjct: 383 RDDFRWRIQNDSQESVSYSNEGTKMYCLGILKSINL-----NIIGQNLMSGHRIVFDRER 437
Query: 359 QRIGWKPEDC 368
+GWK +C
Sbjct: 438 MILGWKQSNC 447
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 506
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 64/181 (35%), Positives = 88/181 (48%), Gaps = 14/181 (7%)
Query: 35 DTGSDLTWVQCDAPCTG--CTKPPEKQYKPHKNIV----PCSNPRCAAL-HWPNPPRCKH 87
DT SD+ WVQC APC C + Y P K+I+ PCS+P+C +L + N
Sbjct: 179 DTASDVPWVQC-APCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCTGAG 237
Query: 88 PNDQCDYEIEYGDGGSSIGALVTDLFPLRFS-NGSVFNVPLTFGCGYNQHNPGPLSPPDT 146
C Y + Y DG + G V+DL L G+V FGC + PG + T
Sbjct: 238 NTGTCQYRVLYPDGSGTSGTYVSDLLTLNADPKGAVSK--FQFGCSHALLRPGSFNN-KT 294
Query: 147 AGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG--RGVLFLGDGKVPSSGVAWTPMLQ 204
AG + LGRG S+ SQ + NV +C+ G +G L LG + +S A TPML+
Sbjct: 295 AGFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQHAASRYAVTPMLK 354
Query: 205 N 205
+
Sbjct: 355 S 355
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 98/373 (26%), Positives = 160/373 (42%), Gaps = 46/373 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ ++ +VG PP DTGSD+ W+QC+ PC C ++ P K+ + CS+
Sbjct: 87 YIMSYSVGTPPIKSYGIVDTGSDIVWLQCE-PCEQCYNQTTPKFNPSKSSSYKNISCSSK 145
Query: 73 RCAALHWPNPPRCKHPNDQ--CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-F 129
C ++ R ND+ C+Y I YG+ S G L + L + G + P T
Sbjct: 146 LCQSV------RDTSCNDKKNCEYSINYGNQSHSQGDLSLETLTLESTTGRPVSFPKTVI 199
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-------EYGLIRNVIGHCIGQNGR 182
GCG N N G ++GV+GLG G S+++QL Y L+R I G
Sbjct: 200 GCGTN--NIGSF-KRVSSGVVGLGGGPASLITQLGPSIGGKFSYCLVRMSITLKNMSMGS 256
Query: 183 GVLFLGDGKVPS-SGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIF 235
L GD + S V TP+++ +Y+ +G + ++G S G+++ +I
Sbjct: 257 SKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGDKRVEFAGSSKGVEEGNIII 316
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DS + S VY ++ S I+ DL+ T ++ ++ +C+ + EY P
Sbjct: 317 DSSTIVTFVPSDVYTKLNSAIV-DLV-TLERVDDPNQQFSLCYN-----VSSDEEYDFPY 369
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
T +++ V R +C A I G QD MV YD
Sbjct: 370 ---MTAHFKGADILLYATNTFVEVARDVLCFAF-----APSNGGAIFGSFSQQDFMVGYD 421
Query: 356 NEKQRIGWKPEDC 368
+++ + +K DC
Sbjct: 422 LQQKTVSFKSVDC 434
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 99/370 (26%), Positives = 156/370 (42%), Gaps = 54/370 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT--GCTKPPEKQYKPHK----NIVPCS 70
+ V +++G P + DTGSD++WVQC PC+ C ++ + P K + VPC
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCK-PCSAPACNSQRDQLFDPAKSSTYSAVPCG 201
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C+ L C QC Y + YGDG ++ G +D L + G+ L FG
Sbjct: 202 ADACSELRIYE-AGCS--GSQCGYVVSYGDGSNTTGVYGSDTLAL--APGNTVGTFL-FG 255
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLG 188
CG+ Q G + D G+L LGR +S+ SQ G V +C+ Q+ G L LG
Sbjct: 256 CGHAQA--GMFAGID--GLLALGRQSMSLKSQ--AAGAYGGVFSYCLPSKQSAAGYLTLG 309
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------IFDSGA 239
G +SG A T +L A Y+ ++ +G S G + + + + D+G
Sbjct: 310 -GPTSASGFATTGLLTAWAAPTFYM-----VMLTGISVGGQQVAVPASAFAGGTVVDTGT 363
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
Y + S + AP + L C+ F G VT +AL+F
Sbjct: 364 VITRLPPTAYAALRSAFRGAIAPYGYPSAPANGILDTCYD--FSRYGVVT--LPTVALTF 419
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
+ + EA ++S + CL NG + G+ I+G + + V +D
Sbjct: 420 SGGAT-----LALEAPGILS---SGCLAFAPNGGD---GDAAILGNVQQRSFAVRFDGST 468
Query: 359 QRIGWKPEDC 368
+G+ P C
Sbjct: 469 --VGFMPGAC 476
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 94/399 (23%), Positives = 162/399 (40%), Gaps = 62/399 (15%)
Query: 12 PIFSY----FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCT-----KPPEKQY 60
P+FS+ ++++L+ G PP+ F DTGS W C C C+ P ++
Sbjct: 68 PVFSHSYGGYSISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNCSFTSRISPFLPKH 127
Query: 61 KPHKNIVPCSNPRCAALHWPNP--PRCKHPNDQCD-----YEIEYGDGGSSIGALVTDLF 113
I+ C NP+C+ +H + C + + C Y I YG G + G +++
Sbjct: 128 SSSSKIIGCKNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSGTTG-GVALSETL 186
Query: 114 PLRFSNGSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNV 172
L +G + VP GC S AG+ G GRG S+ SQL +
Sbjct: 187 HL---HGLI--VPNFLVGCSV-------FSSRQPAGIAGFGRGPSSLPSQLGLTKFSYCL 234
Query: 173 IGHCIGQNGRGVLFLGDGKVPS----SGVAWTPMLQNS------ADLKHYILGPAELLYS 222
+ H + D + S + + +TP+++N A +Y + +
Sbjct: 235 LSHKFDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIG 294
Query: 223 GKSCGL--KDLT--------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK 272
G+S + K L+ I DSG ++ Y ++ ++ + + + + L +
Sbjct: 295 GRSVKIPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEAL 354
Query: 273 T-LPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGIL- 329
+ L C F G L L F + + +P E Y G + V C ++
Sbjct: 355 SGLKPC----FNVSGAKELELPQLRLHF---KGGADVELPLENYFAFLGSREVACFTVVT 407
Query: 330 NGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+G+E G I+G MQ+ V YD + +R+G+K E C
Sbjct: 408 DGAEKASGPGMILGNFQMQNFYVEYDLQNERLGFKKESC 446
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 98/370 (26%), Positives = 155/370 (41%), Gaps = 54/370 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT--GCTKPPEKQYKPHK----NIVPCS 70
+ V +++G P + DTGSD++WVQC PC+ C ++ + P K + VPC
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCK-PCSAPACNSQRDQLFDPAKSSTYSAVPCG 201
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C+ L QC Y + YGDG ++ G +D L + G+ L FG
Sbjct: 202 ADACSELRIYEA---GCSGSQCGYVVSYGDGSNTTGVYGSDTLAL--APGNTVGTFL-FG 255
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLG 188
CG+ Q G + D G+L LGR +S+ SQ G V +C+ Q+ G L LG
Sbjct: 256 CGHAQA--GMFAGID--GLLALGRQSMSLKSQ--AAGAYGGVFSYCLPSKQSAAGYLTLG 309
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------IFDSGA 239
G +SG A T +L A Y+ ++ +G S G + + + + D+G
Sbjct: 310 -GPSSASGFATTGLLTAWAAPTFYM-----VMLTGISVGGQQVAVPASAFAGGTVVDTGT 363
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
Y + S + AP + L C+ F G VT +AL+F
Sbjct: 364 VITRLPPTAYAALRSAFRGAIAPCGYPSAPANGILDTCYD--FSRYGVVT--LPTVALTF 419
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
+ + EA ++S + CL NG + G+ I+G + + V +D
Sbjct: 420 SGGAT-----LALEAPGILS---SGCLAFAPNGGD---GDAAILGNVQQRSFAVRFDGST 468
Query: 359 QRIGWKPEDC 368
+G+ P C
Sbjct: 469 --VGFMPGAC 476
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 97/376 (25%), Positives = 155/376 (41%), Gaps = 45/376 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L +G PP F DTGSDLTW QC PC C Y P + VPCS+
Sbjct: 66 YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPVPCSSA 124
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS-NGSVFNV-PLTFG 130
C W C +P+ C Y Y DG S+G L T+ + S G +V + FG
Sbjct: 125 TCLP-TW-RSRNCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFG 182
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG + ++ G +GLGRG +S+++QL G + FLG
Sbjct: 183 CGTDNGG----DSLNSTGTVGLGRGTLSLLAQL-GVGKFSYCLTDFFNSTMDSPFFLGTL 237
Query: 191 KVPSSG---VAWTPMLQNSADLKHYI-------LGPAELLYSGKSCGLK---DLTLIFDS 237
+ G V TP+LQ+ + Y LG L + L+ + ++ DS
Sbjct: 238 AELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDS 297
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP-LA 296
G ++ ++E+V + + L P+ + D C+ P E F P L
Sbjct: 298 GTTFTILAKSGFREVVDRVAQLLGQPPVNASSLDSP---CFPSPDG------EPFMPDLV 348
Query: 297 LSFTNRRNSVRLVVPPEAYLVIS-GRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
L F + + + + Y+ + + CL I+ GS + +G Q+ +++D
Sbjct: 349 LHFAGGAD---MRLHRDNYMSYNEDDSSFCLNIV-GSPSTWSR---LGNFQQQNIQMLFD 401
Query: 356 NEKQRIGWKPEDCNTL 371
++ + P DC+ L
Sbjct: 402 MTVGQLSFLPTDCSKL 417
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 140/370 (37%), Gaps = 48/370 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV--------- 67
+ + +G P K + DTGS LTW+QC C + + P +
Sbjct: 127 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQ 186
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
CS+ A L NP C N C Y+ YGD S+G L D + F + SV N
Sbjct: 187 QCSDLTTATL---NPASCSTSN-VCIYQASYGDSSFSVGYLSKDT--VSFGSTSVPN--F 238
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
+GCG Q N G +AG++GL R ++S++ QL + +C+ +
Sbjct: 239 YYGCG--QDNEGLFG--QSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGY 292
Query: 188 GDGKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASY 241
+ G ++TPM +S D Y + + +GK S L I DSG
Sbjct: 293 LSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVI 352
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG--QVTEYFKPLALSF 299
+ VY + + + GTP A L C++G L +VT F A
Sbjct: 353 TRLPTGVYSALSKAVAGAMKGTPRASA--FSILDTCFQGQAARLRVPEVTMAFAGGAALK 410
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
RN LV CL A IIG Q V+YD +
Sbjct: 411 LAARN----------LLVDVDSATTCLAFAPARSAA-----IIGNTQQQTFSVVYDVKNS 455
Query: 360 RIGWKPEDCN 369
+IG+ C+
Sbjct: 456 KIGFAAAGCS 465
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 154/376 (40%), Gaps = 42/376 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L +G PP F DTGSDLTW QC PC C Y P + VPCS+
Sbjct: 77 YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPVPCSSA 135
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS-NGSVFNVP-LTFG 130
C L C P+ C Y Y DG S G L T+ L S G +V + FG
Sbjct: 136 TC--LPVLRSRNCSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAFG 193
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG + ++ G +GLGRG +S+++QL G + LG
Sbjct: 194 CGTDNGG----DSLNSTGTVGLGRGTLSLLAQL-GVGKFSYCLTDFFNSTLDSPFLLGTL 248
Query: 191 KVPSSG---VAWTPMLQNSADLKHYI-------LGPAELLYSGKSCGLKDLT---LIFDS 237
+ G V TP+LQ+ + Y+ LG L K+ L + ++ DS
Sbjct: 249 AELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLHANSTGGMVVDS 308
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP-LA 296
G +++ ++ +V + + L P+ + D C+ P G+ F P L
Sbjct: 309 GTTFSILPESGFRVVVDHVAQVLGQPPVNASSLDSP---CFPAP---AGERQLPFMPDLV 362
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGR-KNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
L F + + + + Y+ + + CL I+ + +++G Q+ +++D
Sbjct: 363 LHFAGGAD---MRLHRDNYMSYNQEDSSFCLNIVGTTSTW----SMLGNFQQQNIQMLFD 415
Query: 356 NEKQRIGWKPEDCNTL 371
++ + P DC+ L
Sbjct: 416 MTVGQLSFLPTDCSKL 431
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 106/379 (27%), Positives = 154/379 (40%), Gaps = 52/379 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSN 71
YF + VG P DTGSD+ W+QC APC C + + + P + N V C+
Sbjct: 140 YF-TKIGVGTPATPALMVLDTGSDVVWLQC-APCRRCYEQSGQVFDPRRSRSYNAVGCAA 197
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFG 130
P C L C C Y++ YGDG + G T+ L F+ G+ V V L G
Sbjct: 198 PLCRRLDSGG---CDLRRSACLYQVAYGDGSVTAGDFATET--LTFAGGARVARVAL--G 250
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYG------LIRNVIGHCIGQNGRG 183
CG++ N G AG+LGLGRG +S +Q+ R YG L+
Sbjct: 251 CGHD--NEGLFVA--AAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRSST 306
Query: 184 VLFLGDGKVPSS-GVAWTPMLQNSADLKHYILGPAELLYSGK---SCGLKDLTL------ 233
V F G G V S+ ++TPM++N Y + + G DL L
Sbjct: 307 VTF-GSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSSGR 365
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL-PICWRGPFKALGQVT 289
I DSG S Y + G L+L+P +L C+ G+
Sbjct: 366 GGVIVDSGTSVTRLARPAYSALRDAFRGAAAG--LRLSPGGFSLFDTCY----DLSGRKV 419
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
+++ F +PPE YL+ K G++ V +IIG I Q
Sbjct: 420 VKVPTVSMHFAG---GAEAALPPENYLIPVDSKGTFCFAFAGTDGGV---SIIGNIQQQG 473
Query: 350 KMVIYDNEKQRIGWKPEDC 368
V++D + QR+ + P+ C
Sbjct: 474 FRVVFDGDGQRVAFTPKGC 492
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 146/372 (39%), Gaps = 50/372 (13%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAA 76
+TV K DTGSDLTWVQC PC C Y P + V C++ C
Sbjct: 137 VTVELGGKNMSLIVDTGSDLTWVQCQ-PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQD 195
Query: 77 L--HWPNPPRCKHPN----DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
L N C N C+Y + YGDG + G L ++ L G FG
Sbjct: 196 LVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILL----GDTKLENFVFG 251
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFL 187
CG N N G LGR +S+VSQ + V +C + G L
Sbjct: 252 CGRN--NKGLFGGSSGLMG--LGRSSVSLVSQTLK--TFNGVFSYCLPSLEDGASGSLSF 305
Query: 188 GDGK---VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-------LIFDS 237
G+ S+ V++TP++QN YIL +G S G +L ++ DS
Sbjct: 306 GNDSSVYTNSTSVSYTPLVQNPQLRSFYILN-----LTGASIGGVELKSSSFGRGILIDS 360
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
G +Y+ + ++ G P AP L C+ L + P+
Sbjct: 361 GTVITRLPPSIYKAVKIEFLKQFSGFP--TAPGYSILDTCFN-----LTSYEDISIPIIK 413
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNIIGEIFMQDKMVIYDN 356
+ + V Y V VCL + + S E EVG IIG +++ VIYD+
Sbjct: 414 MIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---IIGNYQQKNQRVIYDS 470
Query: 357 EKQRIGWKPEDC 368
++R+G E+C
Sbjct: 471 TQERLGIVGENC 482
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 98/385 (25%), Positives = 163/385 (42%), Gaps = 67/385 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ N T+G PP+ D +L W QC PC C + + P K+ +PC +
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCT-PCQPCFEQDLPLFDPTKSSTFRGLPCGSH 115
Query: 73 RCAALHWPNPPRCKHPNDQCDYE--IEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C ++ P R +D C YE + GD G G TD F + + + L FG
Sbjct: 116 LCESI--PESSR-NCTSDVCIYEAPTKAGDTGGMAG---TDTFAIGAAKET-----LGFG 164
Query: 131 CGYNQHN-----PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
C GP +G++GLGR S+V+Q+ +C+ G L
Sbjct: 165 CVVMTDKRLKTIGGP------SGIVGLGRTPWSLVTQMN-----VTAFSYCLAGKSSGAL 213
Query: 186 FLGDGKVPSSGV--AWTP-MLQNSADLK------HYILGPAELLYSG---KSCGLKDLTL 233
FLG +G + TP +++ SA +Y++ A + G ++ T+
Sbjct: 214 FLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGAPLQAASSSGSTV 273
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL-GQVTEYF 292
+ D+ + +Y Y+ + + + P+ P K +C+ KA+ G E
Sbjct: 274 LLDTVSRASYLADGAYKALKKALTAAVGVQPVASPP--KPYDLCFS---KAVAGDAPE-- 326
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEA------EVGENNIIGEIF 346
L +F L VPP YL+ SG VCL I GS A E+ +I+G +
Sbjct: 327 --LVFTF---DGGAALTVPPANYLLASGNGTVCLTI--GSSASLNLTGELEGASILGSLQ 379
Query: 347 MQDKMVIYDNEKQRIGWKPEDCNTL 371
++ V++D +++ + +KP DC++L
Sbjct: 380 QENVHVLFDLKEETLSFKPADCSSL 404
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 140/370 (37%), Gaps = 48/370 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV--------- 67
+ + +G P K + DTGS LTW+QC C + + P +
Sbjct: 129 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSAQ 188
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
CS+ A L NP C N C Y+ YGD S+G L D + F + SV N
Sbjct: 189 QCSDLTTATL---NPASCSTSN-VCIYQASYGDSSFSVGYLSKDT--VSFGSTSVPN--F 240
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
+GCG Q N G +AG++GL R ++S++ QL + +C+ +
Sbjct: 241 YYGCG--QDNEGLFG--QSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGY 294
Query: 188 GDGKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASY 241
+ G ++TPM +S D Y + + +GK S L I DSG
Sbjct: 295 LSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVI 354
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG--QVTEYFKPLALSF 299
+ VY + + + GTP A L C++G L +VT F A
Sbjct: 355 TRLPTGVYSALSKAVAGAMKGTP--RASAFSILDTCFQGQAARLRVPEVTMAFAGGAALK 412
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
RN LV CL A IIG Q V+YD +
Sbjct: 413 LAARN----------LLVDVDSATTCLAFAPARSAA-----IIGNTQQQTFSVVYDVKNS 457
Query: 360 RIGWKPEDCN 369
+IG+ C+
Sbjct: 458 KIGFAAGGCS 467
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 156/372 (41%), Gaps = 44/372 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF + +++G P DTGSDLTWVQC PC C + + P ++ + C +
Sbjct: 94 YF-MKMSIGTPLVEVIVIADTGSDLTWVQC-LPCDPCYRQKSPLFDPSRSSSYRHMLCGS 151
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL-RFSNGSVFNVPLTFG 130
C AL + C + C+Y YGD + G L T+ F + S+ V P+ FG
Sbjct: 152 RFCNALDV-SEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGSTSSRPVHLSPIVFG 210
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRGV 184
CG N G + V G +S+VSQL +I+ +C+ +
Sbjct: 211 CGTG--NGGTFDELGSGIVGLGGGA-LSLVSQLS--SIIKGKFSYCLVPLSEQSNVTSKI 265
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGK--SCGLKDLTLIFD 236
F D + V TP++ D +Y+ +G L Y+ + ++ +I D
Sbjct: 266 KFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGNVEKGNVIID 325
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
SG + + S + E+ ++ + +++ +C+R + G + +A
Sbjct: 326 SGTTLTFLDSEFFTELERVLEETVKAE--RVSDPRGLFSVCFR----SAGDID--LPVIA 377
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
+ F N + + P V + +C ++ S ++G I G + D +V YD
Sbjct: 378 VHF----NDADVKLQPLNTFVKADEDLLCFTMI--SSNQIG---IFGNLAQMDFLVGYDL 428
Query: 357 EKQRIGWKPEDC 368
EK+ + +KP DC
Sbjct: 429 EKRTVSFKPTDC 440
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 72/153 (47%), Gaps = 14/153 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V L +G P F DT SDL W QC PC C K + + P + +VPC++
Sbjct: 88 YLVKLGLGTPQHCFTAAIDTASDLIWTQCQ-PCVKCYKQLDPVFNPVASTSYAVVPCNSD 146
Query: 73 RCAALHWPNPPRCKHPNDQ--CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L R +D+ C Y YG ++ G L D R + G + FG
Sbjct: 147 TCDELDTHRCARDGDSDDEDACQYTYSYGGNATTRGILAVD----RLAIGDDVFRGVVFG 202
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
C + GP PP +GV+GLGRG +S+VSQL
Sbjct: 203 CSSSSVG-GP--PPQVSGVVGLGRGALSLVSQL 232
>gi|449517142|ref|XP_004165605.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 430
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 81/330 (24%), Positives = 133/330 (40%), Gaps = 44/330 (13%)
Query: 58 KQYKPH----KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDL 112
Y P+ + VPC++ C RC + C YE+ Y SSIG LV D+
Sbjct: 4 NHYSPNDSTTSSTVPCTSSLCN--------RCTSNQNVCPYEMRYLSANTSSIGYLVEDV 55
Query: 113 FPLRFSNGSV--FNVPLTFGCGYNQHNP-GPLSPPDTAGVLGLGRGRISIVSQLREYGLI 169
L + + +TFGCG Q + P+ G++GLG +IS+ S L + GL
Sbjct: 56 LHLATDDSLLKPVEAKITFGCGTVQTGIFATTAAPN--GLIGLGMEKISVPSFLADQGLT 113
Query: 170 RNVIGHCIGQNGRGVLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL 228
N C G +G G + GD G + ML+ + + ++ G
Sbjct: 114 SNSFSMCFGADGYGRIDFGDTGPADQKQTPFNTMLEYQSYNVTF-----NVINVGGEPND 168
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
T IFDSG S+ Y T Y I + + L + C+ P A
Sbjct: 169 VPFTAIFDSGTSFTYLTEPAYSTITKQMDAGMKLKRYSLFGPNFPFEYCYEIPPGA---- 224
Query: 289 TEYFKPLALSFTNRR------NSVRLVVPPEAY---LVISGRKNV-CLGILNGSEAEVGE 338
+ F+ L L+FT + + + +P + ++ +V CL I ++ +
Sbjct: 225 -KEFQYLTLNFTMKGGDEFTPTDIFVFLPVDVSTMNIIFEETTHVACLAIAKSTDID--- 280
Query: 339 NNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+IG+ FM + ++ ++ +GW DC
Sbjct: 281 --LIGQNFMTGYRITFNRDQMVLGWSSSDC 308
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 96/399 (24%), Positives = 158/399 (39%), Gaps = 67/399 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKP---PEK------QYKPHKN 65
++++L +G PP+ F DTGS L W C + C+ C P P K +
Sbjct: 88 YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSSTAK 147
Query: 66 IVPCSNPRCAALHWPNP----PRCKHPNDQ-C-----DYEIEYGDGGSSIGALVTDL-FP 114
++ C NP+C L P+ P+CK P Q C Y I+YG G ++ L+ +L FP
Sbjct: 148 LLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGATAGFLLLDNLNFP 207
Query: 115 LRFSNGSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR----EYGLI 169
+ VP GC LS +G+ G GRG+ S+ SQ+ Y L+
Sbjct: 208 GK-------TVPQFLVGCSI-------LSIRQPSGIAGFGRGQESLPSQMNLKRFSYCLV 253
Query: 170 RNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSAD----LKHYILGPAELLYSGKS 225
+ + + G ++G+++TP N ++ ++Y + +L+ G
Sbjct: 254 SHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVD 313
Query: 226 CGLKDLTL----------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP 275
+ L I DSG+++ + VY + +R L K + ++
Sbjct: 314 VKIPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGK---KYSREENVEA 370
Query: 276 ICWRGP-FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILN--- 330
P F G T F F + ++ P Y G V C +++
Sbjct: 371 QSGLSPCFNISGVKTISFPEFTFQF---KGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGG 427
Query: 331 -GSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
G G I+G Q+ V YD E +R G+ P +C
Sbjct: 428 AGQPKTAGPAIILGNYQQQNFYVEYDLENERFGFGPRNC 466
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 145/372 (38%), Gaps = 50/372 (13%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAA 76
+TV K DTGSDLTWVQC PC C Y P + V C++ C
Sbjct: 137 VTVELGGKNMSLIVDTGSDLTWVQCQ-PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQD 195
Query: 77 L--HWPNPPRCKHPN----DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
L N C N C+Y + YGDG + G L ++ L G FG
Sbjct: 196 LVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILL----GDTKLENFVFG 251
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFL 187
CG N N G LGR +S+VSQ + V +C + G L
Sbjct: 252 CGRN--NKGLFGGSSGLMG--LGRSSVSLVSQTLK--TFNGVFSYCLPSLEDGASGSLSF 305
Query: 188 GDGK---VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-------LIFDS 237
G+ S+ V++TP++QN YIL +G S G +L ++ DS
Sbjct: 306 GNDSSVYTNSTSVSYTPLVQNPQLRSFYILN-----LTGASIGGVELKSSSFGRGILIDS 360
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
G +Y+ + ++ G P AP L C+ L + P+
Sbjct: 361 GTVITRLPPSIYKAVKIEFLKQFSGFP--TAPGYSILDTCFN-----LTSYEDISIPIIK 413
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNIIGEIFMQDKMVIYDN 356
+ + V Y V VCL + + S E EVG IIG +++ VIYD
Sbjct: 414 MIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---IIGNYQQKNQRVIYDT 470
Query: 357 EKQRIGWKPEDC 368
++R+G E+C
Sbjct: 471 TQERLGIVGENC 482
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 97/399 (24%), Positives = 157/399 (39%), Gaps = 67/399 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKPPEKQYK---------PHKN 65
++++L +G PP+ F DTGS L W C + C+ C P K
Sbjct: 92 YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSSTAK 151
Query: 66 IVPCSNPRCAALHWPNP----PRCKHPNDQCD-----YEIEYGDGGSSIGALVTDLFPLR 116
++ C NP+C + + P+CK + C Y I+YG GS+ G L+ D L
Sbjct: 152 LLGCRNPKCGYIFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGL-GSTAGFLLLD--NLN 208
Query: 117 FSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR----EYGLIRNV 172
F +V GC LS +G+ G GRG+ S+ SQ+ Y L+ +
Sbjct: 209 FPGKTVPQ--FLVGCSI-------LSIRQPSGIAGFGRGQESLPSQMNLKRFSYCLVSHR 259
Query: 173 IGHCIGQNGRGVLFLGDGKVPSSGVAWTPM-----LQNSADLKHYILGPAELLYSGKSCG 227
+ + G ++G+++TP N A ++Y L +++ GK
Sbjct: 260 FDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVGGKDVK 319
Query: 228 LKDLTL----------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---L 274
+ L I DSG+++ + VY + ++ L A D +T L
Sbjct: 320 IPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKN-YSRAEDAETQSGL 378
Query: 275 PICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN-VCLGILN--- 330
C F G T F L F + ++ P + Y + G VCL +++
Sbjct: 379 SPC----FNISGVKTVTFPELTFKF---KGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGG 431
Query: 331 -GSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
G G I+G Q+ + YD E +R G+ P C
Sbjct: 432 AGPPKTTGPAIILGNYQQQNFYIEYDLENERFGFGPRSC 470
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 107/383 (27%), Positives = 169/383 (44%), Gaps = 49/383 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC-TGCTKPPEKQYKPHKN----IVPCSN 71
+ + L +G PP+ + DTGSDL W QC APC C K P Y P + ++PCS+
Sbjct: 92 YIMTLAIGTPPQSYPAIADTGSDLVWTQC-APCGERCFKQPSPLYNPSSSPTFRVLPCSS 150
Query: 72 P--RCAA---LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
CAA L PP P C Y YG G +S G ++ F S VP
Sbjct: 151 ALNLCAAEARLAGATPP----PGCACRYNQTYGTGWTS-GLQGSETFTFGSSPADQVRVP 205
Query: 127 -LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
+ FGC N +AG++GLGRG +S+VSQL G+ + + L
Sbjct: 206 GIAFGC----SNASSDDWNGSAGLVGLGRGGLSLVSQLAA-GMFSYCLTPFQDTKSKSTL 260
Query: 186 FLG----DGKVPSSGVAWTPMLQNSA----------DLKHYILGPAELLYSGKSCGLK-D 230
LG + +GV TP + + + +L +GPA L + L+ D
Sbjct: 261 LLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRAD 320
Query: 231 LT--LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
T LI DSG + Y+ + + + R L+ P+ + L +C+ P +
Sbjct: 321 GTGGLIIDSGTTITSLVDAAYKRVRAAV-RSLVKLPVTDGSNATGLDLCFALPSSSAPPA 379
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
T + L F + +V+P E Y+++ G CL + + ++ GE + +G Q
Sbjct: 380 T--LPSMTLHFGGGAD---MVLPVENYMILDG-GMWCLAMRSQTD---GELSTLGNYQQQ 430
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
+ ++YD +K+ + + P C+TL
Sbjct: 431 NLHILYDVQKETLSFAPAKCSTL 453
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 95/390 (24%), Positives = 163/390 (41%), Gaps = 58/390 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V L G P F DT SDL W+QC PC C + + + P + +VPC++
Sbjct: 92 YLVKLGTGTPQHFFSAAIDTASDLVWMQCQ-PCVSCYRQLDPVFNPKLSSSYAVVPCTSD 150
Query: 73 RCAALHWPNPPRCKHPND-QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
CA L + RC +D C Y +Y G + G L D + G VF+ + FGC
Sbjct: 151 TCAQL---DGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAI---GGDVFHA-VVFGC 203
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
+ GP + +G++GLGRG +S+VSQL + + + +G+ VL G
Sbjct: 204 S-DSSVGGPAA--QASGLVGLGRGPLSLVSQLSVHRFMYCLPPPMSRTSGKLVLGAGADA 260
Query: 192 VPSSGVAWTPMLQNSADL-KHYILGPAELLYSGKSCG-LKDLT----------------- 232
V + T + +S +Y L L ++ G ++ T
Sbjct: 261 VRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGGGGG 320
Query: 233 -----------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP 281
+I D ++ ++ + +Y E+ + + I P L +C+ P
Sbjct: 321 IVGAGGANAYGMIVDVASTISFLETSLYDELAD-DLEEEIRLPRATPSLRLGLDLCFILP 379
Query: 282 FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNI 341
+ +G Y ++LSF R L + + V GR +CL I G + V +I
Sbjct: 380 -EGVGMDRVYVPTVSLSFDGR----WLELDRDRLFVTDGRM-MCLMI--GRTSGV---SI 428
Query: 342 IGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
+G +Q+ V+++ + +I + C++L
Sbjct: 429 LGNFQLQNMRVLFNLRRGKITFAKASCDSL 458
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 94/374 (25%), Positives = 140/374 (37%), Gaps = 48/374 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSN 71
YF V + +G PP D+GSD+ WVQC PC C + + P + V C +
Sbjct: 125 YF-VRVGIGSPPTEQYLVVDSGSDVIWVQCK-PCLECYAQADPLFDPASSATFSAVSCGS 182
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L C + C+YE+ YGDG + G L + L G + GC
Sbjct: 183 AICRTLRTSG---CGD-SGGCEYEVSYGDGSYTKGTLALETLTL----GGTAVEGVAIGC 234
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG---------R 182
G+ N G AG+LGLG G +S+V QL +C+ G
Sbjct: 235 GH--RNRGLFV--GAAGLLGLGWGPMSLVGQLGG--AAGGAFSYCLASRGGSGSGAADAA 288
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD--LTLIFDSGAS 240
G L LG + G W P+++N Y +G + + + L+D L D G
Sbjct: 289 GSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGG 348
Query: 241 YAYFT----SRVYQEIVSLIMRDLIGT--PLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
T +R+ QE + + +G L AP L C+ L T P
Sbjct: 349 VVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCYD-----LSGYTSVRVP 403
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
+ + + L +P L+ CL S +I+G I + +
Sbjct: 404 TVSFYFD--GAATLTLPARNLLLEVDGGIYCLAFAPSSSGL----SILGNIQQEGIQITV 457
Query: 355 DNEKQRIGWKPEDC 368
D+ IG+ P C
Sbjct: 458 DSANGYIGFGPATC 471
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 140/370 (37%), Gaps = 48/370 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV--------- 67
+ + +G P K + DTGS LTW+QC C + + P +
Sbjct: 127 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQ 186
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
CS+ A L NP C N C Y+ YGD S+G L D + F + SV N
Sbjct: 187 QCSDLTTATL---NPASCSTSN-VCIYQASYGDSSFSVGYLSKDT--VSFGSTSVPN--F 238
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
+GCG Q N G +AG++GL R ++S++ QL + +C+ +
Sbjct: 239 YYGCG--QDNEGLFG--QSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGY 292
Query: 188 GDGKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASY 241
+ G ++TPM +S D Y + + +GK S L I DSG
Sbjct: 293 LSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVI 352
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG--QVTEYFKPLALSF 299
+ VY + + + GTP A L C++G L +VT F A
Sbjct: 353 TRLPTGVYSALSKAVAGAMKGTPRASA--FSILDTCFQGQAARLRVPEVTMAFAGGAALK 410
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
RN LV CL A IIG Q V+YD +
Sbjct: 411 LAARN----------LLVDVDSATTCLAFAPARSAA-----IIGNTQQQTFSVVYDVKNS 455
Query: 360 RIGWKPEDCN 369
+IG+ C+
Sbjct: 456 KIGFAAGGCS 465
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 107/383 (27%), Positives = 169/383 (44%), Gaps = 49/383 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC-TGCTKPPEKQYKPHKN----IVPCSN 71
+ + L +G PP+ + DTGSDL W QC APC C K P Y P + ++PCS+
Sbjct: 97 YIMTLAIGTPPQSYPAIADTGSDLVWTQC-APCGERCFKQPSPLYNPSSSPTFRVLPCSS 155
Query: 72 P--RCAA---LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
CAA L PP P C Y YG G +S G ++ F S VP
Sbjct: 156 ALNLCAAEARLAGATPP----PGCACRYNQTYGTGWTS-GLQGSETFTFGSSPADQVRVP 210
Query: 127 -LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
+ FGC N +AG++GLGRG +S+VSQL G+ + + L
Sbjct: 211 GIAFGC----SNASSDDWNGSAGLVGLGRGGLSLVSQLAA-GMFSYCLTPFQDTKSKSTL 265
Query: 186 FLG----DGKVPSSGVAWTPMLQNSA----------DLKHYILGPAELLYSGKSCGLK-D 230
LG + +GV TP + + + +L +GPA L + L+ D
Sbjct: 266 LLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRAD 325
Query: 231 LT--LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
T LI DSG + Y+ + + + R L+ P+ + L +C+ P +
Sbjct: 326 GTGGLIIDSGTTITSLVDAAYKRVRAAV-RSLVKLPVTDGSNATGLDLCFALPSSSAPPA 384
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
T + L F + +V+P E Y+++ G CL + + ++ GE + +G Q
Sbjct: 385 T--LPSMTLHFGGGAD---MVLPVENYMILDG-GMWCLAMRSQTD---GELSTLGNYQQQ 435
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
+ ++YD +K+ + + P C+TL
Sbjct: 436 NLHILYDVQKETLSFAPAKCSTL 458
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 145/372 (38%), Gaps = 50/372 (13%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAA 76
+TV K DTGSDLTWVQC PC C Y P + V C++ C
Sbjct: 89 VTVELGGKNMSLIVDTGSDLTWVQCQ-PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQD 147
Query: 77 L--HWPNPPRCKHPNDQ----CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
L N C N C+Y + YGDG + G L ++ L G FG
Sbjct: 148 LVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILL----GDTKLENFVFG 203
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFL 187
CG N N G LGR +S+VSQ + V +C + G L
Sbjct: 204 CGRN--NKGLFGGSSGLMG--LGRSSVSLVSQTLK--TFNGVFSYCLPSLEDGASGSLSF 257
Query: 188 GDGK---VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-------LIFDS 237
G+ S+ V++TP++QN YIL +G S G +L ++ DS
Sbjct: 258 GNDSSVYTNSTSVSYTPLVQNPQLRSFYILN-----LTGASIGGVELKSSSFGRGILIDS 312
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
G +Y+ + ++ G P AP L C+ L + P+
Sbjct: 313 GTVITRLPPSIYKAVKIEFLKQFSGFP--TAPGYSILDTCFN-----LTSYEDISIPIIK 365
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGS-EAEVGENNIIGEIFMQDKMVIYDN 356
+ + V Y V VCL + + S E EVG IIG +++ VIYD
Sbjct: 366 MIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---IIGNYQQKNQRVIYDT 422
Query: 357 EKQRIGWKPEDC 368
++R+G E+C
Sbjct: 423 TQERLGIVGENC 434
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 94/377 (24%), Positives = 146/377 (38%), Gaps = 48/377 (12%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNPRC 74
V+L +G PP+ DTGS L+W+QC PP + P +++PC++P C
Sbjct: 79 VSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPR-KPPPSTVFDPSLSSSFSVLPCNHPLC 137
Query: 75 AAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+ P C N C Y Y DG + G LV + S + PL GC
Sbjct: 138 KPRIPDFTLPTSCDL-NRLCHYSYFYADGTLAEGNLVREKITFSTSQST---PPLILGCA 193
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDG 190
+ D G+LG+ GR+S SQ + V + G G +LG+
Sbjct: 194 EDAS--------DDKGILGMNLGRLSFASQAKITKFSYCVPTRQVRPGFTPTGSFYLGEN 245
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAE--LLYSGKSCGLKDLTL--------------- 233
S+G + +L S + L P + G G K L +
Sbjct: 246 P-NSAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQS 304
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA-PDDKTLPICWRGPFKALGQVTEYF 292
+ DSG+ + Y Y ++ ++R L G LK +C+ G +G++
Sbjct: 305 MIDSGSEFTYLVDVAYNKVREEVVR-LAGPRLKKGYVYSGVSDMCFDGNAMEIGRL---I 360
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
+ F V +V+ L G C+GI SE +NIIG Q+ V
Sbjct: 361 GNMVFEFD---KGVEIVIEKGRVLADVGGGVHCVGI-GRSEMLGAASNIIGNFHQQNLWV 416
Query: 353 IYDNEKQRIGWKPEDCN 369
+D +R+G+ DC+
Sbjct: 417 EFDIANRRVGFGKADCS 433
>gi|147839328|emb|CAN63378.1| hypothetical protein VITISV_015700 [Vitis vinifera]
Length = 585
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 73/250 (29%), Positives = 104/250 (41%), Gaps = 26/250 (10%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPH----KNIV 67
+++G P K F DTGSDL WV CD AP G T + + Y P V
Sbjct: 105 TTVSLGTPGKKFLVALDTGSDLFWVPCDCSRCAPTEGTTYASDFELSIYNPKGSSTSRKV 164
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNG--SVFN 124
C+N CA + RC C Y + Y +S G LV D+ L +
Sbjct: 165 TCNNSLCA-----HRNRCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTTEDNRQEFVE 219
Query: 125 VPLTFGCGYNQHNPG-PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+TFGCG Q ++ P+ G+ GLG +IS+ S L + G + C G +G G
Sbjct: 220 AYVTFGCGQVQTGSFLDIAAPN--GLFGLGLEKISVPSILSKEGFTADSFSMCFGPDGIG 277
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
+ GD P TP N+ + I + G + D T +FDSG S+ Y
Sbjct: 278 RISFGDKGGPDQ--EETPFNLNALHPTYNI--TVTQVRVGTTLIDLDFTALFDSGTSFTY 333
Query: 244 FTSRVYQEIV 253
+Y ++
Sbjct: 334 LVDPIYTNVL 343
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 106/235 (45%), Gaps = 26/235 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L++G PP DTGSDL W+QC PCT C K + + + C +
Sbjct: 59 YLMELSIGTPPVKIYAQADTGSDLIWLQC-IPCTNCYKQLNPMFDSQSSSTFSNIACGSE 117
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFGC 131
C+ L+ C C Y Y DG + G L + L + G V + FGC
Sbjct: 118 SCSKLY---STSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAFKGVIFGC 174
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNGRGVLF 186
G+N N G + + G++GLGRG +S+VSQ+ L N+ C+ + +
Sbjct: 175 GHN--NNGAFNDKE-MGIIGLGRGPLSLVSQIGS-SLGGNMFSQCLVPFNTNPSISSPMS 230
Query: 187 LGDG-KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGAS 240
G G +V +GV TP++ + Y + LL ++D+ L F++G+S
Sbjct: 231 FGKGSEVLGNGVVSTPLVSKTTYQSFYFV---TLL----GISVEDINLPFNAGSS 278
>gi|357491945|ref|XP_003616260.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517595|gb|AES99218.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 441
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 99/388 (25%), Positives = 153/388 (39%), Gaps = 64/388 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----------HKN 65
V L +G PP+L DTGS ++W+ CD K P+K+ P
Sbjct: 69 LVVTLPIGTPPQLQQMVLDTGSQVSWIHCDN-----KKGPQKKQPPTTSSFDPSLSSSFF 123
Query: 66 IVPCSNPRCAALHWPNPPRCKHPND-----QCDYEIEYGDGGSSIGALVTDLFPLRFSNG 120
+PC++P C P P P D C Y Y DG G LV + L +
Sbjct: 124 ALPCNHPLCK----PQVPDISLPTDCDANRLCHYSFSYTDGTVVEGNLVRENIAL---SP 176
Query: 121 SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
S+ P+ GC NQ + D G+LG+ GR+S +Q + V Q
Sbjct: 177 SLTTPPIILGCA-NQSD-------DARGILGMNLGRLSFPNQAKITKFSYFVPVKQT-QP 227
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYS----GKSCGLKDLTL--- 233
G G L+LG+ SS + +L S + L ++ G S G K L +
Sbjct: 228 GSGSLYLGNNP-NSSCFRYVKLLTFSKSQSQRMPNLDPLAFTLPMQGISIGGKKLNIPPS 286
Query: 234 ------------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP 281
I DSG+ ++Y + Y I + +++ + K IC+ G
Sbjct: 287 VFKPDTTGFGQTIIDSGSEFSYMVDKAYNVIRNELVKKVGSKIKKDYIYGGVADICFDGD 346
Query: 282 FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNI 341
+G++ + F V +V+P E L+ C GI +E G NI
Sbjct: 347 ATEIGRLV---GDMVFEF---EKGVEIVIPKERVLIEVDGGVHCFGI-GRAEGLGGGGNI 399
Query: 342 IGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
IG + Q+ V +D K R+G++ +C+
Sbjct: 400 IGNFYQQNLWVEFDLAKHRVGFRGANCS 427
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 156/380 (41%), Gaps = 63/380 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHK----NIVPCSN 71
F V + G P + + DTGSD++W+QC PC+G C K + + P K + VPC +
Sbjct: 161 FVVTVGFGSPAQNYTLSIDTGSDVSWIQC-LPCSGHCYKQHDPVFDPTKSATYSAVPCGH 219
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
P+CAA +C + + C Y++ YGDG S+ G L + L S ++P FG
Sbjct: 220 PQCAAAGG----KCSN-SGTCLYKVTYGDGSSTAGVLSHETLSLS----STRDLPGFAFG 270
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFLG 188
CG Q N G G++GLGRG +S+ SQ +C+ G L +G
Sbjct: 271 CG--QTNLGEFG--GVDGLVGLGRGALSLPSQAA--ATFGATFSYCLPSYDTTHGYLTMG 324
Query: 189 DGKVPSSG----VAWTPMLQN------------SADLKHYILGPAELLYSGKSCGLKDLT 232
+S V +T M+Q S D+ YIL +++ +D T
Sbjct: 325 STTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFT------RDGT 378
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
L FDSG Y Y + + T K AP C+ G +
Sbjct: 379 L-FDSGTILTYLPPEAYASLRDRFKFTM--TQYKPAPAYDPFDTCY----DFTGHNAIFM 431
Query: 293 KPLALSFTNRR----NSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
+A F++ + V +++ P+ +G CL + NIIG +
Sbjct: 432 PAVAFKFSDGAVFDLSPVAILIYPDDTAPATG----CLAFV--PRPSTMPFNIIGNTQQR 485
Query: 349 DKMVIYDNEKQRIGWKPEDC 368
VIYD ++IG+ C
Sbjct: 486 GTEVIYDVAAEKIGFGQFTC 505
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 88/375 (23%), Positives = 156/375 (41%), Gaps = 55/375 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ ++ ++G PP DT SD+ WVQC C C + P +KN+ PCS+
Sbjct: 88 YLMSYSLGTPPFPVYGIVDTASDIIWVQCQL-CETCYNDTSPMFDPSYSKTYKNL-PCSS 145
Query: 72 PRCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-F 129
C ++ + C + C++ + Y DG S G L+ + L N + P T
Sbjct: 146 TTCKSVQGTS---CSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVI 202
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG---------QN 180
GC N + D+ G++GLG G +S+V QL I +C+ +
Sbjct: 203 GCIRNTN-----VSFDSIGIVGLGGGPVSLVPQLSSS--ISKKFSYCLAPISDRSSKLKF 255
Query: 181 GRGVLFLGDGKVPSSGV--AWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL-TLIFDS 237
G + GDG V + V W + L+ + +G + + S +I DS
Sbjct: 256 GDAAMVSGDGTVSTRIVFKDWKKFYYLT--LEAFSVGNNRIEFRSSSSRSSGKGNIIIDS 313
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD-KTLPICWRGPFKALGQ--VTEYFKP 294
G ++ VY ++ S + D++ L+ A D K +C++ + + +T +F
Sbjct: 314 GTTFTVLPDDVYSKLESAVA-DVV--KLERAEDPLKQFSLCYKSTYDKVDVPVITAHFSG 370
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
+ N N+ +++ + VCL L+ I G + Q+ +V Y
Sbjct: 371 ADVKL-NALNT----------FIVASHRVVCLAFLSSQSGA-----IFGNLAQQNFLVGY 414
Query: 355 DNEKQRIGWKPEDCN 369
D +++ + +KP DC
Sbjct: 415 DLQRKIVSFKPTDCT 429
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 154/376 (40%), Gaps = 52/376 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ + +VG PP DTGSD+ W+QC+ PC C K + P K+ +PCS+
Sbjct: 91 YLMRYSVGSPPFQVLGIVDTGSDILWLQCE-PCEDCYKQTTPIFDPSKSKTYKTLPCSSN 149
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
C +L C N C+Y I+YGDG S G L + L ++GS + P T GC
Sbjct: 150 TCESLR---NTACSSDN-VCEYSIDYGDGSHSDGDLSVETLTLGSTDGSSVHFPKTVIGC 205
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-----QNGRGVLF 186
G+N N G + V G I G +C+ N L
Sbjct: 206 GHN--NGGTFQEEGSGIVGLGGGPVSLISQLSSSIG---GKFSYCLAPIFSESNSSSKLN 260
Query: 187 LGDGKVPS-SGVAWTPM--LQNSA----DLKHYILGPAELLY---SGKSCGLKDLTLIFD 236
GD V S G TP+ L L+ + +G + + S G D +I D
Sbjct: 261 FGDAAVVSGRGTVSTPLDPLNGQVFYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIID 320
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTLPICWRGPFKALG--QVTEYFK 293
SG + Y + S + D+I L+ A D K L +C++ L +T +FK
Sbjct: 321 SGTTLTLLPQEDYLNLESAV-SDVI--KLERARDPSKLLSLCYKTTSDELDLPVITAHFK 377
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVI 353
+ N + VP E + VC ++ +++G I G + Q+ +V
Sbjct: 378 GADVEL----NPISTFVPVE-------KGVVCFAFIS---SKIGA--IFGNLAQQNLLVG 421
Query: 354 YDNEKQRIGWKPEDCN 369
YD K+ + +KP DC
Sbjct: 422 YDLVKKTVSFKPTDCT 437
>gi|414888271|tpg|DAA64285.1| TPA: hypothetical protein ZEAMMB73_923514, partial [Zea mays]
Length = 335
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 84/269 (31%), Positives = 109/269 (40%), Gaps = 48/269 (17%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC--------------TKPPEKQ 59
F ++AV + +G P F DTGSDL WV CD C C T P+K
Sbjct: 86 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CINCAPLVSPNYRDLKFDTYSPQKS 142
Query: 60 YKPHKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFS 118
K VPCS+ C P Y I+Y D SS G LV D+ L
Sbjct: 143 STSRK--VPCSSNLCDEQSACRSASSSCP-----YSIQYLSDNTSSTGVLVEDVLYLVTE 195
Query: 119 NG---SVFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGL-IRNV 172
G + P+TFGCG Q G +P G+LGLG IS+ S L G+ N
Sbjct: 196 YGRQPKIVTAPITFGCGRTQTGSFLGTAAP---NGLLGLGMDTISVPSLLASQGVAAANS 252
Query: 173 IGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGP-AELLYSGKSCGLKDL 231
C Q+G G + GD SS TP L Y P + +G + G K +
Sbjct: 253 FSMCFAQDGHGRINFGD--TGSSDQQETP-------LNMYKQNPYYNISITGATVGSKSI 303
Query: 232 ----TLIFDSGASYAYFTSRVYQEIVSLI 256
I DSG S+ + +Y +I S +
Sbjct: 304 HTKFNAIVDSGTSFTALSDPMYTQITSSV 332
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 85/379 (22%), Positives = 154/379 (40%), Gaps = 48/379 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSN 71
Y N T+G PP+ D +L W QC + C+ C K + P+ + PC
Sbjct: 42 YNVANFTIGTPPQPASAIIDVAGELVWTQC-SRCSRCFKQDLPLFIPNASSTFRPEPCGT 100
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYG---DGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
C + P D C YE D +++G + T+ F + + S L
Sbjct: 101 DAC-----KSTPTSNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGTATAS-----LA 150
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV---L 185
FGC + T+G +GLGR S+V+Q++ +C+ G G L
Sbjct: 151 FGCVVASDID---TMDGTSGFIGLGRTPRSLVAQMK-----LTKFSYCLSPRGTGKSSRL 202
Query: 186 FLGDGKVPSSG--VAWTPMLQNSA--DLKHYILGPAELLYSGKSCGLKDLT---LIFDSG 238
FLG + G + P ++ S D HY L + + +G + + L+ +
Sbjct: 203 FLGSSAKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATAQSGGILVMHTV 262
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLK-LAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
+ ++ Y+ + + G + +A + +C++ KA G L
Sbjct: 263 SPFSLLVDSAYRAFKKAVTEAVGGAAEQPMATPPQPFDLCFK---KAAGFSRATAPDLVF 319
Query: 298 SFTNRRNSVRLVVPPEAYLVISG--RKNVCLGILNGS---EAEVGENNIIGEIFMQDKMV 352
+F + + L VPP YL+ G + C IL+ + + +++G + +D
Sbjct: 320 TF---QGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHF 376
Query: 353 IYDNEKQRIGWKPEDCNTL 371
+YD +K+ + ++P DC++L
Sbjct: 377 LYDLKKETLSFEPADCSSL 395
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 93/372 (25%), Positives = 154/372 (41%), Gaps = 47/372 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +N+++G PP DTGSDL W QC APC C + + P + V CS+
Sbjct: 90 YLMNVSIGTPPFPIMAIADTGSDLLWTQC-APCDDCYTQVDPLFDPKTSSTYKDVSCSSS 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
+C AL N C ++ C Y + YGD + G + D L S+ + + GC
Sbjct: 149 QCTALE--NQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGC 206
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRGVL 185
G+N N G + +G++GLG G +S++ QL + I +C+ +
Sbjct: 207 GHN--NAGTFNKK-GSGIVGLGGGPVSLIKQLGDS--IDGKFSYCLVPLTSKKDQTSKIN 261
Query: 186 FLGDGKVPSSGVAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDLTLIFDSG 238
F + V SGV TP++ ++ LK +G ++ YSG + +I DSG
Sbjct: 262 FGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSG 321
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR--GPFKALGQVTEYFKPLA 296
+ + Y E+ + I K P L +C+ G K + +T +F
Sbjct: 322 TTLTLLPTEFYSELEDAVASS-IDAEKKQDPQSG-LSLCYSATGDLK-VPVITMHFDGAD 378
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
+ + V++ VC GS + +I G + + +V YD
Sbjct: 379 VKLDSSNAFVQV-----------SEDLVCFA-FRGSPSF----SIYGNVAQMNFLVGYDT 422
Query: 357 EKQRIGWKPEDC 368
+ + +KP DC
Sbjct: 423 VSKTVSFKPTDC 434
>gi|413952261|gb|AFW84910.1| hypothetical protein ZEAMMB73_904583 [Zea mays]
Length = 298
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 68/245 (27%), Positives = 108/245 (44%), Gaps = 30/245 (12%)
Query: 139 GPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDGKVPS 194
G L+ D A G+ G G+ ++S++SQL G+ V HC+ NG G+L LG+ P
Sbjct: 15 GDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEP- 73
Query: 195 SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------IFDSGASYAYFT 245
G+ +TP++ + HY L + +G+ + D +L I DSG + AY
Sbjct: 74 -GLVYTPLVPSQ---PHYNLNLESIAVNGQKLPI-DSSLFTTSNTQGTIVDSGTTLAYLA 128
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
Y VS I ++P ++L F V F + L F
Sbjct: 129 DGAYDPFVSAI-------AAAVSPSVRSLVSKGSQCFITSSSVDSSFPTVTLYF---MGG 178
Query: 306 VRLVVPPEAYLVISGR-KNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
V + V PE YL+ N L + + E I+G++ ++DK+ +YD R+GW
Sbjct: 179 VAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWA 238
Query: 365 PEDCN 369
DC+
Sbjct: 239 DYDCS 243
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/178 (32%), Positives = 84/178 (47%), Gaps = 18/178 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V++ +G P K FDTGSDLTW QC C + + P ++ + CS+P
Sbjct: 131 YIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQSTTYSNISCSSP 190
Query: 73 RCAALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C+ L N P C C Y I+YGD S+G + L S + N FG
Sbjct: 191 DCSQLESGTGNQPGCSAAR-ACIYGIQYGDQSFSVGYFAKETLTLT-STDVIEN--FLFG 246
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQNGRGVLFL 187
CG Q+N G AG++GLG+ +ISIV Q ++YG V +C+ + +L
Sbjct: 247 CG--QNNRGLFG--SAAGLIGLGQDKISIVKQTAQKYG---QVFSYCLPKTSSSTGYL 297
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 94/372 (25%), Positives = 156/372 (41%), Gaps = 47/372 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +N+++G PP DTGSDL W QC APC C + + P + V CS+
Sbjct: 90 YLMNVSIGTPPFPIMAIADTGSDLLWTQC-APCDDCYTQVDPLFDPKTSSTYKDVSCSSS 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
+C AL N C ++ C Y + YGD + G + D L S+ + + GC
Sbjct: 149 QCTALE--NQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGC 206
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRGVL 185
G+N N G + +G++GLG G +S++ QL + I +C+ +
Sbjct: 207 GHN--NAGTFNKK-GSGIVGLGGGPVSLIKQLGDS--IDGKFSYCLVPLTSKKDQTSKIN 261
Query: 186 FLGDGKVPSSGVAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDLTLIFDSG 238
F + V SGV TP++ ++ LK +G ++ YSG + +I DSG
Sbjct: 262 FGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSG 321
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR--GPFKALGQVTEYFKPLA 296
+ + Y E+ + I K P L +C+ G K + +T +F
Sbjct: 322 TTLTLLPTEFYSELEDAVASS-IDAEKKQDPQSG-LSLCYSATGDLK-VPVITMHFDGAD 378
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
+ + A++ +S VC GS + +I G + + +V YD
Sbjct: 379 VKLDSSN----------AFVQVS-EDLVCFA-FRGSPSF----SIYGNVAQMNFLVGYDT 422
Query: 357 EKQRIGWKPEDC 368
+ + +KP DC
Sbjct: 423 VSKTVSFKPTDC 434
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 94/372 (25%), Positives = 148/372 (39%), Gaps = 54/372 (14%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG--CTKPPEKQYKPHKN----IVP 68
S + V ++G P + DTGSDL+WVQC PC C + + + P ++ VP
Sbjct: 135 SNYVVTASLGTPGMAQTLEVDTGSDLSWVQCK-PCAAPSCYRQKDPLFDPAQSSSYAAVP 193
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
C CA L C QC Y + YGDG ++ G +D L +N +V
Sbjct: 194 CGRSACAGLGI-YASACSAA--QCGYVVSYGDGSNTTGVYSSDTLTLA-ANATVQG--FL 247
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLF 186
FGCG+ Q G + D G+LG GR + S+V Q G V +C+ + G L
Sbjct: 248 FGCGHAQSG-GLFTGID--GLLGFGREQPSLVQQ--TAGAYGGVFSYCLPTKSSTTGYLT 302
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------IFDS 237
LG + G + T +L + +Y+ ++ +G S G + L++ + D+
Sbjct: 303 LGGPSGVAPGFSTTQLLPSPNAPTYYV-----VMLTGISVGGQPLSVPASAFAAGTVVDT 357
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
G Y + S + P AP L C+ F G V +AL
Sbjct: 358 GTVITRLPPAAYAALRSAFRSGMASYP--SAPPIGILDTCYS--FAGYGTVN--LTSVAL 411
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNVCLGILN-GSEAEVGENNIIGEIFMQDKMVIYDN 356
+F++ A + + + G L S G I+G + + V D
Sbjct: 412 TFSS-----------GATMTLGADGIMSFGCLAFASSGSDGSMAILGNVQQRSFEVRIDG 460
Query: 357 EKQRIGWKPEDC 368
+G++P C
Sbjct: 461 SS--VGFRPSSC 470
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/162 (32%), Positives = 78/162 (48%), Gaps = 16/162 (9%)
Query: 7 EFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN- 65
E P + V L +G P F DT SDL W+QC PC C + + + P +
Sbjct: 78 EAPLVPRGGEYLVKLGIGTPQHYFSAAIDTASDLVWLQCQ-PCVSCYRQLDPIFNPRLSS 136
Query: 66 ---IVPCSNPRCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGS 121
+VPCS+ C+ L + RC +DQ C Y +Y + G L D + G+
Sbjct: 137 SYAVVPCSSDTCSQL---DGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLAV---GGN 190
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
VF+ + GC + GP PP +G++GL RG +S++SQL
Sbjct: 191 VFHA-VVLGCS-DSSVGGP--PPQASGLVGLARGPLSLLSQL 228
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 100/374 (26%), Positives = 159/374 (42%), Gaps = 56/374 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF+ + VG P K DTGSD+ W+QC+ PC C + + + P + + CS
Sbjct: 162 YFS-RIGVGTPAKEMYLVLDTGSDVNWIQCE-PCADCYQQSDPVFNPTSSSTYKSLTCSA 219
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVPLTFG 130
P+C+ L C+ +++C Y++ YGDG ++G L TD + F N G + NV L G
Sbjct: 220 PQCSLLE---TSACR--SNKCLYQVSYGDGSFTVGELATD--TVTFGNSGKINNVAL--G 270
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR----EYGLIRNVIGHCIGQNGRGVLF 186
CG++ N G + AG+LGLG G +SI +Q++ Y L+ G + V
Sbjct: 271 CGHD--NEGLFTG--AAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQL 326
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFD 236
G G A P+L+N Y +G + G+ L D +I D
Sbjct: 327 GG-------GDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILD 379
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL-PICWRGPFKALGQVTEYFKPL 295
G + ++ Y + ++ + LK +L C+ F +L V +
Sbjct: 380 CGTAVTRLQTQAYNSLRDAFLK--LTVNLKKGSSSISLFDTCY--DFSSLSTVK--VPTV 433
Query: 296 ALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
A FT ++ L +P + YL+ + C S + +IIG + Q + Y
Sbjct: 434 AFHFTGGKS---LDLPAKNYLIPVDDSGTFCFAFAPTSSSL----SIIGNVQQQGTRITY 486
Query: 355 DNEKQRIGWKPEDC 368
D K IG C
Sbjct: 487 DLSKNVIGLSGNKC 500
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 65/228 (28%), Positives = 96/228 (42%), Gaps = 23/228 (10%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCS 70
S F V + VG PP+ F FD +D TW+QC PC C P+ + P ++ ++ C
Sbjct: 185 SNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQ-PCIKCYDQPDSIFDPSQSSSYTLLSCE 243
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L P + C Y I Y DG ++ G L+ + S+G V V L G
Sbjct: 244 TKHCNLL----PNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFE-SSGWVDRVSL--G 296
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLG 188
C + N GP D G GLGRG +S S++ + +C+ ++G L
Sbjct: 297 C--SNKNQGPFVGSD--GTFGLGRGSLSFPSRINASSM-----SYCLVESKDGYSSSTLE 347
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFD 236
P SG +LQN Y +G + G+ + + T D
Sbjct: 348 FNSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTID 395
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 145/367 (39%), Gaps = 45/367 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRC 74
S + + L VG PP + DTGS++TW QC PC C K + P K+ RC
Sbjct: 378 SVYLMKLQVGTPPFEIEAVIDTGSEITWTQC-LPCVHCYKQNAPIFDPSKSST-FKEKRC 435
Query: 75 AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGCGY 133
+ C YE++Y D + G L TD + ++G F + T GCG
Sbjct: 436 H-------------DHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIGCGR 482
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG-DGKV 192
N P G +GL G +S+++Q+ G ++ +C NG + G + V
Sbjct: 483 NNS----WFRPSFEGFVGLNWGPLSLITQMG--GEYPGLMSYCFAGNGTSKINFGTNAIV 536
Query: 193 PSSGVAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFT 245
GV T M +A +L +G + G + ++ DSG + YF
Sbjct: 537 GGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSGTTLTYFP 596
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
Y +V + ++ P L +C+ TE F + + F+ +
Sbjct: 597 ES-YCNLVRQAVEHVVPAVPAADPTGNDL-LCY------YSNTTEIFPVITMHFSGGAD- 647
Query: 306 VRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
LV+ + S + CL I+ + + I G + +V YD+ + +K
Sbjct: 648 --LVLDKYNMFMESYSGGLFCLAIICNNPT---QEAIFGNRAQNNFLVGYDSSSLLVSFK 702
Query: 365 PEDCNTL 371
P +C+ L
Sbjct: 703 PTNCSAL 709
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 63/242 (26%), Positives = 92/242 (38%), Gaps = 49/242 (20%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCS 70
F + Y + L +G PP + DTGS+L W QC PC C + P K+
Sbjct: 60 FDTYEYL-MKLQIGTPPFEVEAVLDTGSELIWTQC-LPCLHCYDQKAPIFDPSKS----- 112
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-F 129
RC P+ C Y++ Y D + G L T+ + ++G F +P T
Sbjct: 113 -------STFKETRCNTPDHSCPYKLVYDDKSYTQGTLATETVTIHSTSGVPFVMPETII 165
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD 189
GC N N G P ++G++GL RG +S++SQ+
Sbjct: 166 GCSRN--NSGSGFRPSSSGIVGLSRGSLSLISQM-------------------------G 198
Query: 190 GKVPSSGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDLTLIFDSGASYA 242
G P GV T M +A Y L G + G + ++ DSG
Sbjct: 199 GAYPGDGVVSTTMFAKTAKRGQYYLNLDAVSVGDTRIETVGTPFHALNGNIVIDSGTPLT 258
Query: 243 YF 244
YF
Sbjct: 259 YF 260
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 63/184 (34%), Positives = 85/184 (46%), Gaps = 22/184 (11%)
Query: 34 FDTGSDLTWVQC-DAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWPNPPRCKHP 88
DT SD+ WVQC P + C + Y P K+ CS+P C L P C
Sbjct: 186 LDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQL-GPYANGCSSS 244
Query: 89 ND---QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCGYNQHNPGPLSPP 144
++ QC Y + Y DG ++ G LV D L ++ VP FGC + G S
Sbjct: 245 SNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTS----QVPKFEFGCSHAAR--GSFSRS 298
Query: 145 DTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCI--GQNGRGVLFLGDGKVPSSGVAWTP 201
TAG++ LGRG S+VSQ +YG V +C + +G LG + SS A TP
Sbjct: 299 KTAGIMALGRGVQSLVSQTSTKYG---QVFSYCFPPTASHKGFFVLGVPRRSSSRYAVTP 355
Query: 202 MLQN 205
ML+
Sbjct: 356 MLKT 359
>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
Length = 371
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 78/324 (24%), Positives = 138/324 (42%), Gaps = 36/324 (11%)
Query: 57 EKQYKPHKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR 116
+KP PC C ++ P K +D C Y+ G GG ++G + TD F +
Sbjct: 74 SSTFKPE----PCGTDVCKSIPTP-----KCASDVCAYDGVTGLGGHTVGIVATDTFAIG 124
Query: 117 FSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+ + P G + + P + P +G +GLGR S+V+Q++ + H
Sbjct: 125 TAAPAR---PPASGASWRATST-PWAGP--SGFIGLGRTPWSLVAQMKLTRFSYCLAPHD 178
Query: 177 IGQNGRGVLFLGDGKVPSSGVAWTPMLQNSAD--LKHYILGPAELLYSGKSCGL----KD 230
G+N R LFLG + G AWTP ++ S + + Y E + +G + ++
Sbjct: 179 TGKNSR--LFLGASAKLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRN 236
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
L+ + + VYQE +M + P P +C+ P + +
Sbjct: 237 TVLVQTAVVRVSLLVDSVYQEFKKAVMASVGAAPTA-TPVGAPFEVCF--PKAGVSGAPD 293
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGEN---NIIGEIFM 347
L FT + + L VPP YL G VCL +++ + + NI+G
Sbjct: 294 ------LVFTFQAGAA-LTVPPANYLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQ 346
Query: 348 QDKMVIYDNEKQRIGWKPEDCNTL 371
++ +++D +K + ++P DC++L
Sbjct: 347 ENVHLLFDLDKDMLSFEPADCSSL 370
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 155/372 (41%), Gaps = 55/372 (14%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
S + VN+ +G P K FDTGS L W QC PC C P + P K+ +PCS
Sbjct: 130 SDYIVNVGIGTPKKEMPLIFDTGSGLIWTQCK-PCKAC-YPKVPVFDPTKSASFKGLPCS 187
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ C ++ C P +C Y Y D SS G L T+ + FS+ + G
Sbjct: 188 SKLCQSIRQ----GCSSP--KCTYLTAYVDNSSSTGTLATET--ISFSHLKYDFKNILIG 239
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFLG 188
C +Q + L +G++GL R IS+ SQ + + +CI G L G
Sbjct: 240 CS-DQVSGESLGE---SGIMGLNRSPISLASQTAN--IYDKLFSYCIPSTPGSTGHLTFG 293
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIFDSGASYA 242
GKVP+ V ++P+ + + + I +G +LL + + DSGA
Sbjct: 294 -GKVPND-VRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIAS---TIDSGAVLT 348
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW---RGPFKALGQVTEYFKPLALSF 299
+ Y + S+ + G PL L DD L C+ A+ ++ +F+
Sbjct: 349 RLPPKAYSALRSVFREMMKGYPL-LDQDD-FLDTCYDFSNYSTVAIPSISVFFE------ 400
Query: 300 TNRRNSVRLVVPPEAYL-VISGRKNVCLGILNGSEAEV-GENNIIGEIFMQDKMVIYDNE 357
V + + + + G K CL AE+ E +I G + V++D
Sbjct: 401 ----GGVEMDIDVSGIMWQVPGSKVYCLAF-----AELDDEVSIFGNFQQKTYTVVFDGA 451
Query: 358 KQRIGWKPEDCN 369
K+RIG+ P C+
Sbjct: 452 KERIGFAPGGCD 463
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 95/370 (25%), Positives = 140/370 (37%), Gaps = 48/370 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV--------- 67
+ + +G P K + DTGS LTW+QC C + + P +
Sbjct: 129 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSAQ 188
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
CS+ A L +P C N C Y+ YGD S+G L D + F + SV N
Sbjct: 189 QCSDLTTATL---SPASCSTSN-VCIYQASYGDSSFSVGYLSKDT--VSFGSTSVPN--F 240
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
+GCG Q N G +AG++GL R ++S++ QL + +C+ +
Sbjct: 241 YYGCG--QDNEGLFG--QSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGY 294
Query: 188 GDGKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASY 241
+ G ++TPM +S D Y + + +GK S L I DSG
Sbjct: 295 LSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVI 354
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG--QVTEYFKPLALSF 299
+ VY + + + GTP A L C++G L +VT F A
Sbjct: 355 TRLPTGVYSALSKAVAGAMKGTP--RASAFSILDTCFQGQAARLRVPEVTMAFAGGAALK 412
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
RN LV CL A IIG Q V+YD +
Sbjct: 413 LAARN----------LLVDVDSATTCLAFAPARSAA-----IIGNTQQQTFSVVYDVKNS 457
Query: 360 RIGWKPEDCN 369
+IG+ C+
Sbjct: 458 KIGFAAGGCS 467
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 100/374 (26%), Positives = 159/374 (42%), Gaps = 56/374 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF+ + VG P K DTGSD+ W+QC+ PC C + + + P + + CS
Sbjct: 162 YFS-RIGVGTPAKDMYLVLDTGSDVNWIQCE-PCADCYQQSDPVFNPTSSSTYKSLTCSA 219
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVPLTFG 130
P+C+ L C+ +++C Y++ YGDG ++G L TD + F N G + NV L G
Sbjct: 220 PQCSLLE---TSACR--SNKCLYQVSYGDGSFTVGELATD--TVTFGNSGKINNVAL--G 270
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR----EYGLIRNVIGHCIGQNGRGVLF 186
CG++ N G + AG+LGLG G +SI +Q++ Y L+ G + V
Sbjct: 271 CGHD--NEGLFTG--AAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQL 326
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFD 236
G G A P+L+N Y +G + G+ L D +I D
Sbjct: 327 GG-------GDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILD 379
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL-PICWRGPFKALGQVTEYFKPL 295
G + ++ Y + ++ + LK +L C+ F +L V +
Sbjct: 380 CGTAVTRLQTQAYNSLRDAFLK--LTVNLKKGSSSISLFDTCY--DFSSLSTVK--VPTV 433
Query: 296 ALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
A FT ++ L +P + YL+ + C S + +IIG + Q + Y
Sbjct: 434 AFHFTGGKS---LDLPAKNYLIPVDDSGTFCFAFAPTSSSL----SIIGNVQQQGTRITY 486
Query: 355 DNEKQRIGWKPEDC 368
D K IG C
Sbjct: 487 DLSKNVIGLSGNKC 500
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 86/369 (23%), Positives = 152/369 (41%), Gaps = 43/369 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + LT+G PP DTGSDL W QC PC GC + ++P ++ +PC +
Sbjct: 82 YLMKLTLGSPPVDIYGLVDTGSDLVWAQC-TPCGGCYRQKSPMFEPLRSKTYSPIPCESE 140
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV-PLTFGC 131
+C+ + C P C Y Y D + G L + ++G V + FGC
Sbjct: 141 QCSFFGY----SCS-PQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVGDIIFGC 195
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRN----VIGHCIGQNGRGVLF 186
G++ N G + D ++G+G G +S+VSQ+ YG R V H + F
Sbjct: 196 GHS--NSGTFNENDMG-IIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTINF 252
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIFDSGAS 240
+ V GV TP+ + + +G + ++ S L ++ DSG
Sbjct: 253 GEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFN-SSETLSKGNIMIDSGTP 311
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV-TEYFKPLALSF 299
Y Y+ +V + P++ PD T +C+R G + T +F+ +
Sbjct: 312 ATYIPQEFYERLVEELKVQSSLLPIEDDPDLGT-QLCYRSETNLEGPILTAHFEGADVQL 370
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
++ +PP+ + C + ++ + I G + ++ +D +++
Sbjct: 371 L----PIQTFIPPKDGV-------FCFAMAGSTDGDY----IFGNFAQSNILMGFDLDRK 415
Query: 360 RIGWKPEDC 368
I +KP DC
Sbjct: 416 TISFKPTDC 424
>gi|66815065|ref|XP_641634.1| hypothetical protein DDB_G0279453 [Dictyostelium discoideum AX4]
gi|60469677|gb|EAL67665.1| hypothetical protein DDB_G0279453 [Dictyostelium discoideum AX4]
Length = 864
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 100/391 (25%), Positives = 159/391 (40%), Gaps = 58/391 (14%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWV----------QCDAPCTGCTKPPEKQYKPH 63
F YF + + VG PP++F DTGS V Q C+
Sbjct: 163 FEYF-IPILVGTPPQMFTVQVDTGSTSLAVPGLNCYLYKSQTIKTSCSCSDGNLDGLYNF 221
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV 122
+ V C+A N C++ N D C + ++YGDG G+LV D +
Sbjct: 222 DDSVSGIALNCSASVCNNS--CQNKNHDNCPFMLKYGDGSFIAGSLVIDNVTI-----GQ 274
Query: 123 FNVPLTFGCGYNQH-NPGPLSPPDTA-------GVLGLGRGRI------SIVSQLREYGL 168
F VP FG + + L+ P A G+LGL + I S++
Sbjct: 275 FTVPAKFGNIQKESLSFSQLTCPSNARSQAVRDGILGLSFQELDPYNGDDIFSKIVSSYG 334
Query: 169 IRNVIGHCIGQNGRGVLFLG--DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC 226
I NV C+G++G G+L +G + +V +TP++ D +Y + + +S
Sbjct: 335 IPNVFSMCLGKDG-GILTIGGINERVNIETPKYTPII----DFHYYSIHVLNIYVENESL 389
Query: 227 GLKD---LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK 283
++ I DSG + YF ++ I+ + + + L +DK W G
Sbjct: 390 KFTPNDFISSIVDSGTTLLYFNDEIFYSIIKNLEQSY--SKLPGIGEDK----FWEGNCH 443
Query: 284 ALGQVTEYFKP---LALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENN 340
L + + P L L + S +L +PP Y + + C GI + E V
Sbjct: 444 YLSEESVELYPTIYLELDGSGASGSFKLAIPPSLYFLKINNLH-CFGISHMKEISV---- 498
Query: 341 IIGEIFMQDKMVIYDNEKQRIGW-KPEDCNT 370
+IG++ +Q VIYD RIG+ K E+C T
Sbjct: 499 LIGDVVLQGYNVIYDRGNSRIGFAKIENCKT 529
>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
Length = 455
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 72/249 (28%), Positives = 106/249 (42%), Gaps = 26/249 (10%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----V 67
+ +G P F DTGSDL WV CD AP G T E + Y P + V
Sbjct: 109 TTVKLGTPGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNKKV 168
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNGSVFNVP 126
C+N CA + +C C Y + Y +S G L+ D+ L + + V
Sbjct: 169 TCNNSLCAQRN-----QCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVE 223
Query: 127 --LTFGCGYNQHNPG-PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+TFGCG Q ++ P+ G+ GLG +IS+ S L GL+ + C G +G G
Sbjct: 224 AYVTFGCGQVQSGSFLDIAAPN--GLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVG 281
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
+ GD SS TP N + + I + G + + T +FD+G S+ Y
Sbjct: 282 RISFGDKG--SSDQEETPFNLNPSHPNYNI--TVTRVRVGTTLIDDEFTALFDTGTSFTY 337
Query: 244 FTSRVYQEI 252
+Y +
Sbjct: 338 LVDPMYTTV 346
>gi|326515366|dbj|BAK03596.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 452
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 93/394 (23%), Positives = 153/394 (38%), Gaps = 74/394 (18%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQY---------KPHKNIVPC 69
V + G + D LTW+QC PC PEK+ PH + +
Sbjct: 83 VGIGSGGTQHFYKLALDLVRPLTWMQCK-PCV-----PEKRQDGSVFNTAASPHYHHIAS 136
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGS-SIGALVTDLF---------PLRFSN 119
++PRC A P + +C +++++ G S + G L +D F P+ N
Sbjct: 137 TDPRCMA------PYTRAGQGRCTFDVKFQYGDSRARGVLGSDDFVFDGSGPGSPISSVN 190
Query: 120 GSVFNVPLTFGCGYNQHNPGPLSPPDT-AGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
G L FGC +N H+ D AGV+ L R S + QL GL +C+
Sbjct: 191 G------LVFGCAHNTHD---FYNHDLWAGVMSLNRHPTSFIRQLSARGLAAPRFSYCLA 241
Query: 179 ----QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLK-----HYILGPAELLYSGKSCGLK 229
++ RG L G S TP+L DL +Y+ L + +
Sbjct: 242 SRQHRDRRGFLRFGADIPDQSHARSTPLLH--GDLAQGGGMYYVGVVGVSLGGRRLTAIT 299
Query: 230 DLTL-----------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
+ I D G S + Y +V+ ++ + ++ A C+
Sbjct: 300 PVMFELNRRSLRGGCIIDVGTSLTLMATAPYHVLVAELIAHMRSRGVQHAIFSPGQKHCF 359
Query: 279 RGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEA-YLVISGRKN--VCLGILNGSEAE 335
RG +++ + + + L F SV L + PE ++ ++G + VCL I+
Sbjct: 360 RGKWES---IHRHLPSVTLHFQFHPESVALFIRPELLFVAMTGERTDYVCLAIV-----P 411
Query: 336 VGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
E IIG M D +D ++ R+ + PE C+
Sbjct: 412 YAERTIIGAGQMLDTRFTFDLQQNRLFFAPEQCH 445
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 79/373 (21%), Positives = 143/373 (38%), Gaps = 46/373 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +++G PP +DTGSDL W QC PC C K + P K+ V C +
Sbjct: 91 YLMKISIGTPPFDVYGIYDTGSDLMWTQC-LPCLSCYKQKNPMFDPSKSTSFKEVSCESQ 149
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG---SVFNVPLTF 129
+C L + C P CD+ YGDG + G + T+ L ++G S+ N+ F
Sbjct: 150 QCRLLDTVS---CSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPTSILNI--VF 204
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRG 183
GCG+N N G + + G+ G G +S+ SQ+ C+
Sbjct: 205 GCGHN--NSGTFN-ENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSK 261
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIFDS 237
++F + +V S V TP++ +++ +G +S S + D+
Sbjct: 262 IIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDA 321
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP-ICWRGPFKALGQVTEYFKPLA 296
G Y +V + + P++ D P +C+R G +
Sbjct: 322 GTPPTLLPRDFYNRLVQGVKEAI---PMEPVQDPDLQPQLCYRSATLIDGPI-------- 370
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
T + + + P + C + + G+ I G + ++ +D
Sbjct: 371 --LTAHFDGADVQLKPLNTFISPKEGVYCFAM----QPIDGDTGIFGNFVQMNFLIGFDL 424
Query: 357 EKQRIGWKPEDCN 369
+ +++ +K DC
Sbjct: 425 DGKKVSFKAVDCT 437
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 152/381 (39%), Gaps = 55/381 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YFA + VG P DTGSD+ W+QC APC C + + P ++ V C
Sbjct: 128 YFA-QVGVGTPATTALMVLDTGSDVVWLQC-APCRHCYAQSGRVFDPRRSRSYAAVDCVA 185
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L C + C Y++ YGDG + G ++ L F+ G+ + GC
Sbjct: 186 PICRRLDSAG---CDRRRNSCLYQVAYGDGSVTAGDFASET--LTFARGARVQ-RVAIGC 239
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI----------GQN 180
G++ N G LG GR +S SQ+ R +G +C+
Sbjct: 240 GHD--NEGLFIAASGLLGLGRGR--LSFPSQIARSFG---RSFSYCLVDRTSSVRPSSTR 292
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY---ILGPAELLYSGKSCGLKDLTL---- 233
V F ++G ++TPM +N Y +LG + K DL L
Sbjct: 293 SSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTT 352
Query: 234 -----IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL-PICWRGPFKALGQ 287
I DSG S VY+ + +G L+++P +L C+ + + +
Sbjct: 353 GRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVG--LRVSPGGFSLFDTCYNLSGRRVVK 410
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFM 347
V LA + +PPE YL+ + G++ V +IIG I
Sbjct: 411 VPTVSMHLA-------GGASVALPPENYLIPVDTSGTFCFAMAGTDGGV---SIIGNIQQ 460
Query: 348 QDKMVIYDNEKQRIGWKPEDC 368
Q V++D + QR+G+ P+ C
Sbjct: 461 QGFRVVFDGDAQRVGFVPKSC 481
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 89/377 (23%), Positives = 156/377 (41%), Gaps = 58/377 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ ++ +VG P DTGSD+ W+QC PC C + + K+ +PC +
Sbjct: 89 YLISYSVGTPSLQVFGILDTGSDIIWLQCQ-PCKKCYEQTTPIFDSSKSQTYKTLPCPSN 147
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
C ++ KH C Y I Y DG S+G L + L +NGS P T GC
Sbjct: 148 TCQSVQGTFCSSRKH----CLYSIHYVDGSQSLGDLSVETLTLGSTNGSPVQFPGTVIGC 203
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-------EYGLIRNVIGHCIGQNGRGV 184
G ++N + + +G++GLGRG +S+++QL Y L+ +
Sbjct: 204 G--RYNAIGIEEKN-SGIVGLGRGPMSLITQLSPSTGGKFSYCLVPGL------STASSK 254
Query: 185 LFLGDGKVPSS-GVAWTPMLQNSA------DLKHYILGPAELLYSGKSCGLKDLTLIFDS 237
L G+ V S G TP+ + L+ + +G + + G K +I DS
Sbjct: 255 LNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPGSGGKG-NIIIDS 313
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTLPICWR-GPFK---ALGQVTEYF 292
G + + VY ++ + + + +I L+ D ++ L +C++ P K ++ +T +F
Sbjct: 314 GTTLTALPNGVYSKLEAAVAKTVI---LQRVRDPNQVLGLCYKVTPDKLDASVPVITAHF 370
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
++ V++ VC E G + G + Q+ +V
Sbjct: 371 SGADVTLNAINTFVQV-----------ADDVVCFAF---QPTETGA--VFGNLAQQNLLV 414
Query: 353 IYDNEKQRIGWKPEDCN 369
YD + + +K DC
Sbjct: 415 GYDLQMNTVSFKHTDCT 431
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 153/381 (40%), Gaps = 55/381 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YFA + VG P DTGSD+ W+QC APC C + + P ++ V C
Sbjct: 122 YFA-QVGVGTPATTALMVLDTGSDVVWLQC-APCRHCYAQSGRVFDPRRSRSYAAVDCVA 179
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L + C + C Y++ YGDG + G ++ L F+ G+ + GC
Sbjct: 180 PICRRL---DSAGCDRRRNSCLYQVAYGDGSVTAGDFASET--LTFARGARVQ-RVAIGC 233
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI----------GQN 180
G++ N G LG GR +S SQ+ R +G +C+
Sbjct: 234 GHD--NEGLFIAASGLLGLGRGR--LSFPSQIARSFG---RSFSYCLVDRTSSVRPSSTR 286
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY---ILGPAELLYSGKSCGLKDLTL---- 233
V F ++G ++TPM +N Y +LG + K DL L
Sbjct: 287 SSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTT 346
Query: 234 -----IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL-PICWRGPFKALGQ 287
I DSG S VY+ + +G L+++P +L C+ + + +
Sbjct: 347 GRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVG--LRVSPGGFSLFDTCYNLSGRRVVK 404
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFM 347
V LA + +PPE YL+ + G++ V +IIG I
Sbjct: 405 VPTVSMHLA-------GGASVALPPENYLIPVDTSGTFCFAMAGTDGGV---SIIGNIQQ 454
Query: 348 QDKMVIYDNEKQRIGWKPEDC 368
Q V++D + QR+G+ P+ C
Sbjct: 455 QGFRVVFDGDAQRVGFVPKSC 475
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 99/382 (25%), Positives = 152/382 (39%), Gaps = 54/382 (14%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
S F VNL++G PP DTGS L WVQC PC C + + P K++ + C
Sbjct: 102 SGFLVNLSIGSPPVTQLVVVDTGSSLLWVQC-LPCINCFQQSTSWFDPLKSVSFKTLGCG 160
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTD-LFPLRFSNGSVFNVPLTF 129
P ++ N +C N Q +Y++ Y G SS G L + L G + +TF
Sbjct: 161 FP---GYNYINGYKCNRFN-QAEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKSNITF 216
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRG-RISIVSQLREYGLIRNVIGHCIGQNG-----RG 183
GCG+ N + GV GLG I++ +QL N +CIG
Sbjct: 217 GCGH--MNIKTNNDDAYNGVFGLGAYPHITMATQL------GNKFSYCIGDINNPLYTHN 268
Query: 184 VLFLGDGKVPSS---------GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLI 234
L LG G G + + S K + P S G ++
Sbjct: 269 HLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSG----GVL 324
Query: 235 FDSGASYAYFTS----RVYQEIVSLIMRDLIGTPLKLAPDDKTLP-ICWRGPFKALGQVT 289
DSG +Y + +Y EIV DL+ L+ P + +C++G + +
Sbjct: 325 IDSGMTYTKLANGGFELLYDEIV-----DLMKGLLERIPTQRKFEGLCFKG---VVSRDL 376
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
F + F LV+ + G CL IL S +E+ ++IG + Q+
Sbjct: 377 VGFPAVTFHFA---GGADLVLESGSLFRQHGGDRFCLAILP-SNSELLNLSVIGILAQQN 432
Query: 350 KMVIYDNEKQRIGWKPEDCNTL 371
V +D E+ ++ ++ DC L
Sbjct: 433 YNVGFDLEQMKVFFRRIDCQLL 454
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 90/385 (23%), Positives = 152/385 (39%), Gaps = 51/385 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
V+LTVG PP+ DTGS+L+W+ C + + PH + +PC +P
Sbjct: 70 LTVSLTVGTPPQSVTMVLDTGSELSWLHCKK-----QQNINSVFNPHLSSSYTPIPCMSP 124
Query: 73 RCA--ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C + P C N+ C + Y D S G L +D F + S + FG
Sbjct: 125 ICKTRTRDFLIPVSCDS-NNLCHVTVSYADFTSLEGNLASDTFAISGSG----QPGIIFG 179
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGD 189
+ + T G++G+ RG +S V+Q+ G + +CI G++ GVL GD
Sbjct: 180 SMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQM---GFPK--FSYCISGKDASGVLLFGD 234
Query: 190 GKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--------------- 233
G + +TP+++ + L ++ + G G K L +
Sbjct: 235 ATFKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQT 294
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-----DKTLPICWR----GPFKA 284
+ DSG + + VY + + + G L L D + + +C+R G A
Sbjct: 295 MVDSGTRFTFLLGSVYTALRNEFVAQTRGV-LTLLEDPNFVFEGAMDLCFRVRRGGVVPA 353
Query: 285 LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGE 344
+ VT F+ +S + R R+ + V G +V S+ E +IG
Sbjct: 354 VPAVTMVFEGAEMSVSGERLLYRVGGDGD---VAKGNGDVYCLTFGNSDLLGIEAYVIGH 410
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCN 369
Q+ + +D R+G+ C
Sbjct: 411 HHQQNVWMEFDLVNSRVGFADTKCE 435
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 104/412 (25%), Positives = 163/412 (39%), Gaps = 80/412 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKPPEKQ---YKPHKN----IV 67
+A ++G PP+ DTGS LTWV C + C C+ P + P + +V
Sbjct: 67 YAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLV 126
Query: 68 PCSNPRCAALH--------------WPNPPRC-KHPNDQC-DYEIEYGDGGSSIGALVTD 111
C NP C +H P C ++ C Y + YG GS+ G L+ D
Sbjct: 127 GCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGS-GSTAGLLIAD 185
Query: 112 LF--PLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR----E 165
P R G V L + H P +G+ G GRG S+ +QL
Sbjct: 186 TLRAPGRAVPGFVLGCSLV-----SVHQP-------PSGLAGFGRGAPSVPAQLGLPKFS 233
Query: 166 YGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLK-----HYILGPAELL 220
Y L+ +G VL G+ + P+++++A K +Y L +
Sbjct: 234 YCLLSRRFDDNAAVSGSLVLGG---TGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVT 290
Query: 221 YSGKSCGLKDLT----------LIFDSGASYAYFTSRVYQEIVSLIMRDLIG--TPLKLA 268
GK+ L I DSG ++ Y V+Q + ++ + G K A
Sbjct: 291 VGGKAVRLPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDA 350
Query: 269 PDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGR---KNVC 325
D+ L C+ AL Q LSF +V + +P E Y V++GR + +C
Sbjct: 351 EDELGLHPCF-----ALPQGARSMALPELSFHFEGGAV-MQLPVENYFVVAGRGAVEAIC 404
Query: 326 LGILNGSEAEVGENN-------IIGEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
L ++ G N I+G Q+ +V YD EK+R+G++ + C +
Sbjct: 405 LAVVTDFSGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCTS 456
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 99/370 (26%), Positives = 149/370 (40%), Gaps = 47/370 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK--QYKPHKNI----VPCS 70
+ + +G P DTGS LTWVQC PC P++ + P+ + VPC
Sbjct: 129 YVATVGLGTPAVPQTLILDTGSSLTWVQCK-PCNSSQCYPQRLPLFDPNTSSSYSPVPCD 187
Query: 71 NPRCAALHWP-NPPRCKHPND-QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
+ C AL + C D C YEI YG G + G TD L G++
Sbjct: 188 SQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDA--LTLGPGAIVKR-FH 244
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ--LREYGLIRNVIGHCIGQNGRGVLF 186
FGCG++Q G D GVLGLGR S+ Q R G V HC+ G F
Sbjct: 245 FGCGHHQQR-GKFDMAD--GVLGLGRLPQSLAWQASARRGG---GVFSHCLPPTGVSTGF 298
Query: 187 LGDGK-VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL-------TLIFDSG 238
L G +S +TP+L Y L P + +G+ L D+ +I DSG
Sbjct: 299 LALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQ---LLDIPPAVFREGVITDSG 355
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+ Y + + + P LAP L C+ F VT ++L+
Sbjct: 356 TVLSALQETAYTALRTAFRSAMAEYP--LAPPVGHLDTCFN--FTGYDNVT--VPTVSLT 409
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
F R + + + +++ G CL + + G +IG + + V+YD
Sbjct: 410 F---RGGATVHLDASSGVLMDG----CLAFWSSGDEYTG---LIGSVSQRTIEVLYDMPG 459
Query: 359 QRIGWKPEDC 368
+++G++ C
Sbjct: 460 RKVGFRTGAC 469
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 94/382 (24%), Positives = 154/382 (40%), Gaps = 44/382 (11%)
Query: 17 FAVNLTVGKP-PKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
+ ++ +G P P+ DTGSDL W QC PC C P + P + V C +
Sbjct: 87 YLIHFNIGTPRPQRVALTMDTGSDLVWTQC-TPCPVCFDQPFPLFDPSVSSTFRAVACPD 145
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS----VFNVPL 127
P C + C +C Y YGD + G + D F NG V L
Sbjct: 146 PICRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMSPNGEGAPPVAVSGL 205
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGRGVLF 186
FGCG +N G + + +G+ G GRG +S+ SQLR + H + N +F
Sbjct: 206 AFGCG--DYNTGVFA-SNESGIAGFGRGPLSLPSQLRVGRFSYCLTSHDETESNKTSAVF 262
Query: 187 LGDG----KVPSSG-VAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLK---DL 231
LG + SSG TP++ + + L+ +G L LK
Sbjct: 263 LGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSSVFALKKDGSG 322
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP--ICWRGPFKALGQVT 289
+ DSG F + V++++ + + L PL + + +C++ P K QV
Sbjct: 323 GTVIDSGTGVTTFPAAVFEQLKNEFVAQL---PLPRYDNTSEVGNLLCFQRP-KGGKQVP 378
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
L+ S + +P E Y+ V ++NG+E ++ +IG Q+
Sbjct: 379 VPKLIFHLA------SADMDLPRENYIPEDTDSGVMCLMINGAEVDM---VLIGNFQQQN 429
Query: 350 KMVIYDNEKQRIGWKPEDCNTL 371
++YD E ++ + C+ +
Sbjct: 430 MHIVYDVENSKLLFASAQCDKM 451
>gi|328875414|gb|EGG23778.1| putative aspartyl protease [Dictyostelium fasciculatum]
Length = 507
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 93/376 (24%), Positives = 144/376 (38%), Gaps = 65/376 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKNI--VPCSNP 72
F +N + F DTGS L + P GC E + Y P V CS+
Sbjct: 120 FQINTQIIVGNTTFLVQVDTGSLLMAI----PLEGCNTCVESRPVYHPSSTSTKVACSSD 175
Query: 73 RCAALHWPNPPRCKHPN--DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+C PP C + + CD++I YGDG G + D+ L G
Sbjct: 176 QCKG-SGSTPPSCSRTSSGESCDFQIRYGDGSHVSGYIYEDVVNLAGLQGKA-------N 227
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIV-----SQLREYGLIRNVIGHCIGQNGRGVL 185
G N G P G++G GR S V S + + GL +N G + G G L
Sbjct: 228 FGANDEETGDFEYPRADGIIGFGRTCSSCVPTVWDSLVSDLGL-KNQFGMLLNYEGGGSL 286
Query: 186 FLGDGKVP--SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK--DLTL-------- 233
LG+ + + +TP++Q + YS KS G++ D T+
Sbjct: 287 SLGEINTSYYTGDIRYTPLVQKNTPF-----------YSVKSTGIRINDYTIPGSKLGQE 335
Query: 234 -IFDSGASYAYFTSRVYQEIVSLIMRDLIGTP-LKLAPDDKTLPICWRGPFKALGQVTEY 291
I DSG++ S Y ++ + + P+ IC+ V
Sbjct: 336 VIVDSGSTALSLASGAYDQLRNYFQTHYCSIQGVCENPNIFQGSICYSSD-----DVLSK 390
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLV----ISGRKNVCLGILNGSEAEVGENNIIGEIFM 347
F L +F V++ +PP+ YLV +G+ C I E I+G++FM
Sbjct: 391 FPTLYFTF---DGGVQVAIPPKNYLVKAPLTNGKYGYCFMI----ERADSTMTILGDVFM 443
Query: 348 QDKMVIYDNEKQRIGW 363
+ ++DN R+G+
Sbjct: 444 RGYYTVFDNVNDRVGF 459
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 87/353 (24%), Positives = 137/353 (38%), Gaps = 43/353 (12%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALHWPNPPRCK-HPN 89
DTGSDL W QC PC C + + P + + CS +C L C N
Sbjct: 110 DTGSDLIWTQC-KPCDQCYEQDAPLFDPKSSSTYRDISCSTKQCDLLK--EGASCSGEGN 166
Query: 90 DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCGYNQHNPGPLSPPDTAG 148
C Y YGD + G + D L ++G +P GCG HN G +G
Sbjct: 167 KTCHYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLPKAIIGCG---HNNGGSFTEKGSG 223
Query: 149 VLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRGVLFLGDGKVPSSGVAWTPM 202
++GLG G IS++SQL I +C+ N + F +G V GV TP+
Sbjct: 224 IVGLGGGPISLISQLGS--TIDGKFSYCLVPLSSNATNSSKLNFGSNGIVSGGGVQSTPL 281
Query: 203 LQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLI 256
+ D +++ +G + + G S G + +I DSG + F + E+ S +
Sbjct: 282 ISKDPDTFYFLTLEAVSVGSERIKFPGSSFGTSEGNIIIDSGTTLTLFPEDFFSELSSAV 341
Query: 257 MRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
+ GTP++ +L K +T +F + + + P
Sbjct: 342 QDAVAGTPVEDPSGILSLCYSIDADLK-FPSITAHF-----------DGADVKLNPLNTF 389
Query: 317 VISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
V +C + I G + + +V YD E + + +KP DC
Sbjct: 390 VQVSDTVLCFAF-----NPINSGAIFGNLAQMNFLVGYDLEGKTVSFKPTDCT 437
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 93/412 (22%), Positives = 155/412 (37%), Gaps = 75/412 (18%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCD-----------------APCTGCTKPPEK 58
YF V VG P + F DTGSDLTWV+C AP P +
Sbjct: 87 YF-VRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPAS---PRR 142
Query: 59 QYKPHKNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFP 114
++P K+ +PCS+ C + C P + C Y+ Y DG ++ G + D
Sbjct: 143 TFRPDKSRTWAPIPCSSATCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVGVDSAT 202
Query: 115 LRFSNGSVFNVPL---TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ-LREYG--L 168
+ S + L GC + + L+ + GVL LG IS S+ +G
Sbjct: 203 IALSGRAARKAKLRGVVLGCTTSYNGQSFLA---SDGVLSLGYSNISFASRAASRFGGRF 259
Query: 169 IRNVIGHCIGQNGRGVLFLG-----DGKVPSSGVA-------------------WTPMLQ 204
++ H +N L G + PS G+A TP++
Sbjct: 260 SYCLVDHLAPRNATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVL 319
Query: 205 NSADLKHYILGPAELLYSGKSCGLKDLT--------LIFDSGASYAYFTSRVYQEIVSLI 256
+ Y + + +G+ + I DSG S Y+ +V+ +
Sbjct: 320 DHRTRPFYAVTVKGVSVAGELLKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAVVAAL 379
Query: 257 MRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
+ L G P ++ D W P ++ PL + + S RL P ++Y+
Sbjct: 380 SKRLAGLP-RVTMDPFDYCYNWTSP-----SGSDVAAPLPMLAVHFAGSARLEPPAKSYV 433
Query: 317 VISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+ + C+G+ G + ++IG I Q+ + YD + +R+ +K C
Sbjct: 434 IDAAPGVKCIGLQEGPWPGL---SVIGNILQQEHLWEYDLKNRRLRFKRSRC 482
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 77/371 (20%), Positives = 142/371 (38%), Gaps = 42/371 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +++G PP +DTGSDL W QC PC C K + P K+ V C +
Sbjct: 91 YLMKISIGTPPFDVYGIYDTGSDLMWTQC-LPCLSCYKQKNPMFDPSKSTSFKEVSCESQ 149
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
+C L + C P CD+ YGDG + G + T+ L ++G ++ + FGC
Sbjct: 150 QCRLLDTVS---CSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPXSIXNIVFGC 206
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRGVL 185
G+N N G + + G+ G G +S+ SQ+ C+ ++
Sbjct: 207 GHN--NSGTFN-ENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKII 263
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIFDSGA 239
F + +V S V TP++ +++ +G +S S + D+G
Sbjct: 264 FGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGT 323
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP-ICWRGPFKALGQVTEYFKPLALS 298
Y +V + + P++ D P +C+R G +
Sbjct: 324 PPTLLPRDFYNRLVQGVKEAI---PMEPVQDPDLQPQLCYRSATLIDGPI---------- 370
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
T + + + P + C + + G+ I G + ++ +D +
Sbjct: 371 LTAHFDGADVQLKPLNTFISPKEGVYCFAM----QPIDGDTGIFGNFVQMNFLIGFDLDG 426
Query: 359 QRIGWKPEDCN 369
+++ +K DC
Sbjct: 427 KKVSFKAVDCT 437
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 92/362 (25%), Positives = 141/362 (38%), Gaps = 42/362 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSNP 72
+ + + +G P K D+GSD++WVQC PC C + + P + CS+
Sbjct: 131 YLITVRLGSPAKTQTVLIDSGSDVSWVQCK-PCLQCHSQVDPLFDPSLSSTYSPFSCSSA 189
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
CA L + C + QC Y + Y DG S+ G +D L + S F FGC
Sbjct: 190 ACAQLGQ-DGNGCSS-SSQCQYIVRYADGSSTTGTYSSDTLALGSNTISNFQ----FGCS 243
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDG 190
+ + L T G++GLG G S+ SQ G +C+ + G L LG G
Sbjct: 244 HVESGFNDL----TDGLMGLGGGAPSLASQ--TAGTFGTAFSYCLPPTPSSSGFLTLGAG 297
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIFDSGASYAYFTS 246
+SG TPML++S Y + + G + ++ DSG
Sbjct: 298 ---TSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFSAGMVMDSGTIITRLPR 354
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSV 306
Y + S + + AP + C F GQ + +AL F+
Sbjct: 355 TAYSALSSAFKAGM--KQYRPAPPRSIMDTC----FDFSGQSSVRLPSVALVFSG----- 403
Query: 307 RLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPE 366
VV +A +I G CL S+ I+G + + V+YD +G+K
Sbjct: 404 GAVVNLDANGIILGN---CLAFAANSDDS--SPGIVGNVQQRTFEVLYDVGGGAVGFKAG 458
Query: 367 DC 368
C
Sbjct: 459 AC 460
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 155/383 (40%), Gaps = 76/383 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + ++G PP+ DTGSDL W +CDA Y P+ + +PCS+
Sbjct: 100 YDMEFSIGTPPQKLTALADTGSDLIWTKCDAGGG-AAWGGSSSYHPNASSTFTRLPCSDR 158
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGS---SIGALVTDLFPLRFSNGSVFNVP-LT 128
CAAL + RC +CDY+ YG G + G L ++ F L G VP +
Sbjct: 159 LCAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTL---GGDA--VPGVG 213
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG 188
FGC + AG++GLGRG +S+VSQL +C+ + L
Sbjct: 214 FGCTTALEG----DYGEGAGLVGLGRGPLSLVSQLDA-----GTFMYCLTADASKASPLL 264
Query: 189 DGKVPS-----SGVAWTPMLQNSA----DLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
G + + +GV T +L ++ +L+ +G A + ++FDSG
Sbjct: 265 FGALATMTGAGAGVQSTGLLASTTFYAVNLRSITIGSAT-----TAGVGGPGGVVFDSGT 319
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
+ Y Y E + L+ P+ R F+A Y KP
Sbjct: 320 TLTYLAEPAYTEAKAAF----------LSQTTSLTPVEGRYGFEAC-----YEKP----- 359
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGEN------------NIIGEIFM 347
+S RL+ P L G ++ L + N EV + +IIG I
Sbjct: 360 ----DSARLI--PAMVLHFDGGADMALPVAN-YVVEVDDGVVCWVVQRSPSLSIIGNIMQ 412
Query: 348 QDKMVIYDNEKQRIGWKPEDCNT 370
+ +V++D K + ++P +C++
Sbjct: 413 MNYLVLHDVRKSVLSFQPANCDS 435
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 145/364 (39%), Gaps = 53/364 (14%)
Query: 34 FDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWPNPPRCKHPN 89
DTGSD+ WVQC APC C + + P ++ V C C L + C
Sbjct: 3 LDTGSDVVWVQC-APCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRL---DSGGCDLRR 58
Query: 90 DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFGCGYNQHNPGPLSPPDTAG 148
C Y++ YGDG + G VT+ L F+ G+ V V L GCG++ N G
Sbjct: 59 GACMYQVAYGDGSVTAGDFVTET--LTFAGGARVARVAL--GCGHD--NEGLFVAAAGLL 112
Query: 149 VLGLGRGRISIVSQL-REYG---------LIRNVIGHCIGQNGRGVLFLGDGKVPSSGVA 198
LG G +S +Q+ R YG + G G + + G G V +S +
Sbjct: 113 GLGR--GGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSAS 170
Query: 199 WTPMLQNSADLKHYILGPAELLYSGK---SCGLKDLTL---------IFDSGASYAYFTS 246
+TPM++N Y + + G DL L I DSG S
Sbjct: 171 FTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLAR 230
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTL-PICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
Y + R L+L+P +L C+ G+ +++ F
Sbjct: 231 ASYSALRD-AFRAAAAGGLRLSPGGFSLFDTCY----DLGGRRVVKVPTVSMHFA---GG 282
Query: 306 VRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
+PPE YL+ + R C G++ V +IIG I Q V++D + QR+G+
Sbjct: 283 AEAALPPENYLIPVDSRGTFCF-AFAGTDGGV---SIIGNIQQQGFRVVFDGDGQRVGFA 338
Query: 365 PEDC 368
P+ C
Sbjct: 339 PKGC 342
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 147/372 (39%), Gaps = 57/372 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT--GCTKPPEKQYKPHKN----IVPCS 70
+ V +++G P + DTGSDL+WVQC PC C + + P ++ VPC
Sbjct: 140 YVVTVSLGTPGVAQTLEVDTGSDLSWVQCT-PCAAPACYSQKDPLFDPAQSSSYAAVPCG 198
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
P C L C QC Y + YGDG + G +D L N +V FG
Sbjct: 199 GPVCGGLGI-YASSCSA--AQCGYVVSYGDGSKTTGVYSSDTLTLS-PNDAVRG--FFFG 252
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFLG 188
CG+ Q + D G+LGLGR S+V Q G V +C+ + G L LG
Sbjct: 253 CGHAQSG---FTGND--GLLGLGREEASLVEQ--TAGTYGGVFSYCLPTRPSTTGYLTLG 305
Query: 189 --DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------IFDS 237
G P G + T +L + +Y+ ++ +G S G + L++ + D+
Sbjct: 306 GPSGAAP-PGFSTTQLLSSPNAATYYV-----VMLTGISVGGQQLSVPSSVFAGGTVVDT 359
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
G Y + S + AP L C+ F G VT +AL
Sbjct: 360 GTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYN--FSGYGTVT--LPNVAL 415
Query: 298 SFTNRRNSVRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDN 356
+F+ + + + L CL +GS+ G I+G + + V D
Sbjct: 416 TFS---GGATVTLGADGILSFG-----CLAFAPSGSD---GGMAILGNVQQRSFEVRIDG 464
Query: 357 EKQRIGWKPEDC 368
+G+KP C
Sbjct: 465 TS--VGFKPSSC 474
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/138 (36%), Positives = 68/138 (49%), Gaps = 17/138 (12%)
Query: 12 PIFS--------YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH 63
P+FS YFA+ + VG P DTGSDL W+QC +PC C + + P
Sbjct: 74 PVFSGIPFESGEYFAL-VGVGTPSTKAMLVIDTGSDLVWLQC-SPCRRCYAQRGQVFDPR 131
Query: 64 KNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN 119
++ VPCS+P+C AL +P C Y + YGDG SS G L TD L F+N
Sbjct: 132 RSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATD--KLAFAN 189
Query: 120 GSVFNVPLTFGCGYNQHN 137
+ N +T GCG +
Sbjct: 190 DTYVNN-VTLGCGRDNEG 206
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 36/70 (51%), Gaps = 11/70 (15%)
Query: 308 LVVPPEAYL--VISGRKNV-----CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
+ +PPE Y V GR+ CLG EA ++IG + Q V++D EK+R
Sbjct: 382 MALPPENYFLPVDGGRRRAASYRRCLGF----EAADDGLSVIGNVQQQGFRVVFDVEKER 437
Query: 361 IGWKPEDCNT 370
IG+ P+ C +
Sbjct: 438 IGFAPKGCTS 447
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 95/372 (25%), Positives = 160/372 (43%), Gaps = 52/372 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF+ + VG P K DTGSD+ W+QC+ PC+ C + + + P + + CS
Sbjct: 162 YFS-RIGVGTPAKEMYLVLDTGSDVNWIQCE-PCSDCYQQSDPVFNPTSSSTYKSLTCSA 219
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P+C+ L C+ +++C Y++ YGDG ++G L TD + F N N + GC
Sbjct: 220 PQCSLLE---TSACR--SNKCLYQVSYGDGSFTVGELATD--TVTFGNSGKIN-DVALGC 271
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR---GVLFLG 188
G++ N G + AG+LGLG G +SI +Q++ ++ G++ + LG
Sbjct: 272 GHD--NEGLFTG--AAGLLGLGGGALSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLG 327
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDSG 238
SG A P+L+N Y +G + G+ + D +I D G
Sbjct: 328 ------SGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVILDCG 381
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL-PICWRGPFKALGQVTEYFKPLAL 297
+ ++ Y + ++ + T LK +L C+ F +L V +A
Sbjct: 382 TAVTRLQTQAYNSLRDAFLK--LTTNLKKGTSSISLFDTCY--DFSSLSSVK--VPTVAF 435
Query: 298 SFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
FT ++ L +P + YL+ + C S + +IIG + Q + YD
Sbjct: 436 HFTGGKS---LDLPAKNYLIPVDDNGTFCFAFAPTSSSL----SIIGNVQQQGTRITYDL 488
Query: 357 EKQRIGWKPEDC 368
+ IG C
Sbjct: 489 ANKIIGLSGNKC 500
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 70/230 (30%), Positives = 97/230 (42%), Gaps = 27/230 (11%)
Query: 3 VSWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP 62
S I + ++ + G P DTGSDLTWVQC PC+ C + + P
Sbjct: 82 TSGIRLQTLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCK-PCSACYAQRDPLFDP 140
Query: 63 HKN----IVPCSNPRCA---ALHWPNPPRCKHP---NDQCDYEIEYGDGGSSIGALVTDL 112
+ V C+ CA P C +++C Y + YGDG S G L TD
Sbjct: 141 AGSATYAAVRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDT 200
Query: 113 FPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL--REYGLIR 170
L ++ F FGCG + N G TAG++GLGR +S+VSQ R G+
Sbjct: 201 VALGGASLGGF----VFGCGLS--NRGLFG--GTAGLMGLGRTELSLVSQTASRYGGVFS 252
Query: 171 NVIGHCIGQNGRGVLFLGDGKVPSSG------VAWTPMLQNSADLKHYIL 214
+ + G L LG G +S VA+T M+ + A Y L
Sbjct: 253 YCLPAATSGDASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFL 302
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 101/388 (26%), Positives = 146/388 (37%), Gaps = 47/388 (12%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----V 67
P + + VG P DT SDLTW+QC PC C + P + +
Sbjct: 136 PTSGDYIAKIAVGTPAVEALLALDTASDLTWLQCQ-PCRRCYPQSGPVFDPRHSTSYGEM 194
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDG------GSSIGALVTDLFPLRFSNGS 121
P C AL K C Y + YGDG +S+G LV + L F+ G
Sbjct: 195 NYDAPDCQALGRSGGGDAK--RGTCIYTVLYGDGDGHGSTSTSVGDLVEET--LTFAGG- 249
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-- 179
V L+ GCG++ N G P AG+LGL RG+ISI Q+ G +C+
Sbjct: 250 VRQAYLSIGCGHD--NKGLFGAP-AAGILGLSRGQISIPHQIAFLGY-NASFSYCLVDFI 305
Query: 180 NGRG----VLFLGDGKVPSS-GVAWTPMLQNSADLKHYILGPAELLYSG---KSCGLKDL 231
+G G L G G V +S ++TP + N Y + + G +DL
Sbjct: 306 SGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDL 365
Query: 232 TL---------IFDSGASYAYFTSRVYQEIVSLIMRDLIGT-PLKLAPDDKTLPICWRGP 281
L I DSG + Y G + C+
Sbjct: 366 QLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCYTVG 425
Query: 282 FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENN 340
+A + +++ F V L + P+ YL+ + R VC + V +
Sbjct: 426 GRAGLRHCVKVPAVSMHFA---GGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSV---S 479
Query: 341 IIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+IG I Q V+YD QR+G+ P C
Sbjct: 480 VIGNILQQGFRVVYDIGGQRVGFAPNSC 507
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 94/374 (25%), Positives = 154/374 (41%), Gaps = 48/374 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +N++VG P F DTGSDL W QC APCT C + P ++P + +PC++
Sbjct: 86 YNMNISVGTPLLTFSVVADTGSDLIWTQC-APCTKCFQQPAPPFQPASSSTFSKLPCTSS 144
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C L PN R + C Y +YG G ++ G L T+ L+ + S +V FGC
Sbjct: 145 FCQFL--PNSIRTCNATG-CVYNYKYGSGYTA-GYLATE--TLKVGDASFPSV--AFGCS 196
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG----VLFLG 188
++ G T+G+ GLGRG +S++ QL G+ R +C+ +LF
Sbjct: 197 -TENGVG----NSTSGIAGLGRGALSLIPQL---GVGR--FSYCLRSGSAAGASPILFGS 246
Query: 189 DGKVPSSGVAWTPMLQNSA--------DLKHYILGPAELLYSGKSCGLKDLTL----IFD 236
+ V TP + N A +L +G +L + + G L I D
Sbjct: 247 LANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVD 306
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
SG + Y Y+ + + + + L +C++ G + L
Sbjct: 307 SGTTLTYLAKDGYEMVKQAFLSQT--ADVTTVNGTRGLDLCFKSTGGGGGGIA--VPSLV 362
Query: 297 LSFTNRRNSVRLVVPPE-AYLVISGRKNVCLGILNGSEAEVGE-NNIIGEIFMQDKMVIY 354
L F VP A + + +V + L A+ + ++IG + D ++Y
Sbjct: 363 LRF---DGGAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLY 419
Query: 355 DNEKQRIGWKPEDC 368
D + + P DC
Sbjct: 420 DLDGGIFSFAPADC 433
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 50/135 (37%), Positives = 67/135 (49%), Gaps = 15/135 (11%)
Query: 34 FDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWP--NPPRCKH 87
DTGSDLTWVQC+ PC C +KP + +PC++ C +L N C+
Sbjct: 160 IDTGSDLTWVQCE-PCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACES 218
Query: 88 PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTA 147
C Y + YGDG + G L + L F SV N FGCG N N G +
Sbjct: 219 NPSNCSYAVNYGDGSYTNGELGAE--HLSFGGISVSN--FVFGCGKN--NKGLFG--GVS 270
Query: 148 GVLGLGRGRISIVSQ 162
G++GLGR +S++SQ
Sbjct: 271 GLMGLGRSNLSLISQ 285
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 140/379 (36%), Gaps = 59/379 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK---NIVPCSNPR 73
+ V +G PP+L DT +D W+ C C+GC+ + V CS +
Sbjct: 30 YVVRAKLGTPPQLMFMVLDTSNDAVWLPCSG-CSGCSNASTSFNTNSSSTYSTVSCSTAQ 88
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C P C + YG S +LV D L + + N +FGC
Sbjct: 89 CTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDT--LTLAPDVIPN--FSFGC-I 143
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
N + L P G++GLGRG +S+VSQ L V +C+ + R F G K+
Sbjct: 144 NSASGNSLPP---QGLMGLGRGPMSLVSQTTS--LYSGVFSYCL-PSFRSFYFSGSLKLG 197
Query: 194 SSG----VAWTPMLQNSADLKHYILG--------------PAELLYSGKSCGLKDLTLIF 235
G + +TP+L+N Y + P L + S I
Sbjct: 198 LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANS----GAGTII 253
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP- 294
DSG F VY+ I RD + ++ F LG F
Sbjct: 254 DSGTVITRFAQPVYEAI-----RDEFRKQVNVSS------------FSTLGAFDTCFSAD 296
Query: 295 ---LALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDK 350
+A T S+ L +P E L+ S + CL + + N+I + Q+
Sbjct: 297 NENVAPKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNL 356
Query: 351 MVIYDNEKQRIGWKPEDCN 369
+++D RIG PE CN
Sbjct: 357 RILFDVPNSRIGIAPEPCN 375
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 90/368 (24%), Positives = 144/368 (39%), Gaps = 40/368 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P + FDTGS LTW QC+ PC G C K + + P K+ + C++
Sbjct: 140 YYVVVGLGTPKRDLSLIFDTGSYLTWTQCE-PCAGSCYKQQDPIFDPSKSSSYTNIKCTS 198
Query: 72 PRCAALHWPNPPRCKHPND-QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C C D C Y+++YGD S G L + + ++ FG
Sbjct: 199 SLCTQFRSAG---CSSSTDASCIYDVKYGDNSISRGFLSQERLTITATD---IVHDFLFG 252
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFLG 188
CG Q N G TAG++GL R IS V Q + + +C+ + G L G
Sbjct: 253 CG--QDNEGLFR--GTAGLMGLSRHPISFVQQTSS--IYNKIFSYCLPSTPSSLGHLTFG 306
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSG------KSCGLKDLTLIFDSGASYA 242
++ + +TP S + Y L + G S I DSG
Sbjct: 307 ASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVIT 366
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNR 302
Y + S + ++ P +A + L C+ F +++ + F
Sbjct: 367 RLPPTAYAALRSAFRQFMMKYP--VAYGTRLLDTCY--DFSGYKEIS--VPRIDFEFA-- 418
Query: 303 RNSVRLVVPPEAYLVISGRKNVCLGI-LNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRI 361
V++ +P L + +CL NG+ ++ I G + + V+YD E RI
Sbjct: 419 -GGVKVELPLVGILYGESAQQLCLAFAANGNGNDI---TIFGNVQQKTLEVVYDVEGGRI 474
Query: 362 GWKPEDCN 369
G+ CN
Sbjct: 475 GFGAAGCN 482
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 92/393 (23%), Positives = 152/393 (38%), Gaps = 56/393 (14%)
Query: 15 SYFAVNLTVGKP-PKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPC 69
S + ++L +G P P+ DTGSDL W QC CT C P ++ + VPC
Sbjct: 92 SEYLIHLGIGTPRPQRVVLHLDTGSDLVWTQC--ACTVCFDQPVPVFRASVSHTFSRVPC 149
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN--GSVFNVP- 126
S+P C + C + C Y Y D + G + D F + + + VP
Sbjct: 150 SDPLCGHAVYLPLSGCAARDRSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPN 209
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-----------EYGLIRNVIGH 175
+ FGCG + L P+ +G+ G G G +S+ SQL+ E + VI
Sbjct: 210 IRFGCGMMNYG---LFTPNQSGIAGFGTGPLSLPSQLKVRRFSYCFTAMEESRVSPVI-- 264
Query: 176 CIGQNGRGVLFLGDGKVPSS----GVAWTPMLQNS---ADLKHYILGPAELLYSGKSCGL 228
+G + G + S+ G A P+ L+ +G L ++ + L
Sbjct: 265 -LGGEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFAL 323
Query: 229 K---DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
K DSG + +F V++ + + + P+ D +C+ P K
Sbjct: 324 KGDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQ-VPLPVAKGYTDPDNLLCFSVPAKKK 382
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVI-------SGRKNVCLGILNGSEAEVGE 338
P +P E Y++ +GRK +C+ IL+ +
Sbjct: 383 A-------PAVPKLILHLEGADWELPRENYVLDNDDDGSGAGRK-LCVVILSAGNS---N 431
Query: 339 NNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
IIG Q+ ++YD E ++ + P C+ L
Sbjct: 432 GTIIGNFQQQNMHIVYDLESNKMVFAPARCDKL 464
>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
Length = 458
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 93/391 (23%), Positives = 165/391 (42%), Gaps = 60/391 (15%)
Query: 18 AVNLTVGKPPKLFDFDFDTGSDLTWVQCDA--PCTGCT-KPPEK------QYKPHKNIVP 68
+ L+ G PP+ F DTGS + W C CT C+ P+K + I+
Sbjct: 88 TIPLSFGTPPQKLSFLMDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILG 147
Query: 69 CSNPRCAALHWPN----PPRCKHPNDQC-----DYEIEYGDGGSSIGALVTDL-FPLRFS 118
C +P+CA PB PRC + +C Y ++YG G +S L+ +L FP
Sbjct: 148 CRDPKCADTSSPBVHLGXPRCNGNSKKCSHACPQYTLQYGTGAASGFFLLENLDFP---- 203
Query: 119 NGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL--REYGLIRNVIGHC 176
G + L GC + P + + G GR S+ Q+ +++ N +
Sbjct: 204 -GKTIHKFLV-GCTTSADR-----EPSSDALAGFGRTMFSLPMQMGVKKFAYCLNSHDYD 256
Query: 177 IGQN-GRGVLFLGDGKVPSSGVAWTPMLQNSADLK-HYILGPAELLYSGKSCGL--KDLT 232
+N G+ +L DG+ + G+++ P +N D +Y LG ++ K + K LT
Sbjct: 257 DTRNSGKLILDYSDGE--TQGLSYAPFXKNPPDYPIYYYLGVKDMKIGNKVLRIPGKYLT 314
Query: 233 --------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT-LPICWRGPFK 283
++ DSG +Y+Y T V++ + + + + + L + +T + C+
Sbjct: 315 PGSDSRGGVVIDSGFAYSYMTLPVFKIVTNELKKQMSKYRRSLELEAQTGVTPCYN---- 370
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGS-----EAEVG 337
G + L FT N +VVP Y ++ ++ C + S E G
Sbjct: 371 FTGHKSIKIPDLIYQFTGGAN---MVVPGMNYFLLFSEASLGCFPVTTDSPTSNLEFTPG 427
Query: 338 ENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+ I+G D V +D + +R+G++ + C
Sbjct: 428 PSIILGNYQQVDHYVEFDLKNERLGFRQQTC 458
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 97/381 (25%), Positives = 152/381 (39%), Gaps = 55/381 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YFA + VG P DTGSD+ W+QC APC C + + P ++ V C
Sbjct: 122 YFA-QVGVGTPATTALMVLDTGSDVVWLQC-APCRHCYAQSGRVFDPRRSRSYAAVDCVA 179
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L C + C Y++ YGDG + G ++ L F+ G+ + GC
Sbjct: 180 PICRRLDSAG---CDRRRNSCLYQVAYGDGSVTAGDFASET--LTFARGARVQ-RVAIGC 233
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI----------GQN 180
G++ N G LG GR +S +Q+ R +G +C+
Sbjct: 234 GHD--NEGLFIAASGLLGLGRGR--LSFPTQIARSFG---RSFSYCLVDRTSSVRPSSTR 286
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY---ILGPAELLYSGKSCGLKDLTL---- 233
V F ++G ++TPM +N Y +LG + K DL L
Sbjct: 287 SSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTT 346
Query: 234 -----IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL-PICWRGPFKALGQ 287
I DSG S VY+ + +G L+++P +L C+ + + +
Sbjct: 347 GRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVG--LRVSPGGFSLFDTCYNLSGRRVVK 404
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFM 347
V LA + +PPE YL+ + G++ V +IIG I
Sbjct: 405 VPTVSMHLA-------GGASVALPPENYLIPVDTSGTFCFAMAGTDGGV---SIIGNIQQ 454
Query: 348 QDKMVIYDNEKQRIGWKPEDC 368
Q V++D + QR+G+ P+ C
Sbjct: 455 QGFRVVFDGDAQRVGFVPKSC 475
>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
Length = 419
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 94/387 (24%), Positives = 161/387 (41%), Gaps = 62/387 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA-PCTGCTKPPEKQYKPHKN----IVPCSN 71
+ N T+G PP+ D +L W QC A +GC K + P + C +
Sbjct: 62 YVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGS 121
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIE--YGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
P C ++ P R + +C YE +GD + G TD + + G L F
Sbjct: 122 PLCKSI----PTRNCSGDGECGYEAPSMFGD---TFGIASTDAIAIGNAEGR-----LAF 169
Query: 130 GC--GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG---V 184
GC + G + P +G +GLGR S+V Q +C+ +G G
Sbjct: 170 GCVVASDGSIDGAMDGP--SGFVGLGRTPWSLVGQSN-----VTAFSYCLAPHGPGKKSA 222
Query: 185 LFLG-DGKVPSSGVAW--TPML----QNSAD--------LKHYILGPAELLYSGKSCGLK 229
LFLG K+ +G + TP+L N++D ++ + ++ + S G
Sbjct: 223 LFLGASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGG 282
Query: 230 DLTLI-FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
+T++ ++ +Y YQ + ++ L G+P P + PF Q
Sbjct: 283 AITILQLETFRPLSYLPDAAYQALEKVVTAAL-GSPSMANPPE---------PFDLCFQN 332
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN--VCLGILNGSEAEVGEN--NIIGE 344
L FT + L PP YL+ G N VCL IL+ + + ++ +I+G
Sbjct: 333 AAVSGVPDLVFT-FQGGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGS 391
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDCNTL 371
+ ++ ++D EK+ + ++P DC++L
Sbjct: 392 LLQENVHFLFDLEKETLSFEPADCSSL 418
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 89/371 (23%), Positives = 153/371 (41%), Gaps = 49/371 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP-EKQYKPHKNI----VPCSN 71
+ + ++G PP+ DTGSDL W +C CT +P Y P+ + +PCS+
Sbjct: 91 YDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPCSD 150
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYG----DGGSSIGALVTDLFPLRFSNGSVFNVPL 127
C+ L + C +CDY YG D + G L + F L G+ +
Sbjct: 151 RLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTL----GADAVPSV 206
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG---V 184
FGC +G++GLGRG +S+VSQL + +C+ + +
Sbjct: 207 RFGC----TTASEGGYGSGSGLVGLGRGPLSLVSQLNASTFM-----YCLTSDASKASPL 257
Query: 185 LFLGDGKVPSSGVAWTPMLQNSA----DLKHYILGPAELLYSGKSCGLKDLTLIFDSGAS 240
LF + + V T +L ++ +L+ +G A G+ G ++FDSG +
Sbjct: 258 LFGSLASLTGAQVQSTGLLASTTFYAVNLRSISIGSATTPGVGEPEG-----VVFDSGTT 312
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP-LALSF 299
Y Y E + + T L D C++ P A G+++ P + L F
Sbjct: 313 LTYLAEPAYSEAKAAFLSQ---TSLDQVEDTDGFEACFQKP--ANGRLSNAAVPTMVLHF 367
Query: 300 TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
+ + +P Y+V VC + +IIG I + +V++D +
Sbjct: 368 ----DGADMALPVANYVVEVEDGVVCWIVQRSPSL-----SIIGNIMQVNYLVLHDVHRS 418
Query: 360 RIGWKPEDCNT 370
+ ++P +C+T
Sbjct: 419 VLSFQPANCDT 429
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 148/385 (38%), Gaps = 78/385 (20%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKNI----VPCS 70
+ V + +G P DTGSDL+WVQC APC T P+K + P ++ +PC+
Sbjct: 120 YVVTVGLGTPAVSQVLLIDTGSDLSWVQC-APCNSTTCYPQKDPLFDPSRSSTYAPIPCN 178
Query: 71 NPRCAAL----HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
C L + + QC Y I YGDG + G +SN ++ P
Sbjct: 179 TDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGV---------YSNETLTMAP 229
Query: 127 ------LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCI-- 177
FGCG++Q P G+LGLG S+V Q YG +C+
Sbjct: 230 GVTVKDFHFGCGHDQDGPN----DKYDGLLGLGGAPESLVVQTSSVYG---GAFSYCLPA 282
Query: 178 GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----L 233
+ G L LG +SG +TPM++ Y++ + G+ + +
Sbjct: 283 ANDQAGFLALGAPVNDASGFVFTPMVREQQTF--YVVNMTGITVGGEPIDVPPSAFSGGM 340
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
I DSG Y + + + + P L P+ + L C+
Sbjct: 341 IIDSGTVVTELQHTAYAALQAAFRKAMAAYP--LLPNGE-LDTCY--------------- 382
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNG-------SEAEVGENN---IIG 343
+FT N V P L SG V L + +G + E G +N I+G
Sbjct: 383 ----NFTGHSN----VTVPRVALTFSGGATVDLDVPDGILLDNCLAFQEAGPDNQPGILG 434
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDC 368
+ + V+YD R+G+ + C
Sbjct: 435 NVNQRTLEVLYDVGHGRVGFGADAC 459
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 104/405 (25%), Positives = 152/405 (37%), Gaps = 76/405 (18%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQC-----DAPCTGCTKPPEKQYKPHKNIVPCSN 71
V + VG PP+ DTGS+L+W+ C DAP Y P VPCS+
Sbjct: 63 LTVPVAVGTPPQNVTMVLDTGSELSWLLCNGSRHDAPFDASAS---SSYAP----VPCSS 115
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L P R + C + Y D S+ G L D F L S +P FGC
Sbjct: 116 PACTWLGRDLPVRPFCDSSACRVSLSYADASSADGLLAADTFLLGSS-----PMPALFGC 170
Query: 132 --GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGRGVLFLG 188
Y+ +PP G+LG+ RG +S V+Q +CI G G+L LG
Sbjct: 171 ITSYSSSTDPSETPP--TGLLGMNRGGLSFVTQ-----TATRRFAYCIAAGQGPGILLLG 223
Query: 189 DGKV-------PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-------- 233
P + +TP+++ S L ++ + G G L +
Sbjct: 224 GNDTETPLTSPPQQQLNYTPLVEISQPLPYFDRAAYTVQLEGIRVGSALLAIPKHLLTPD 283
Query: 234 -------IFDSGASYAYFTSRVY----QEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPF 282
+ DSG + + Y E + + R L G LAP + ++G F
Sbjct: 284 HTGAGQTMVDSGTRFTFLLPDAYAALKAEFANQLTRSLDG---GLAPLGEP-GFVFQGAF 339
Query: 283 KALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV----------------CL 326
A + TE + A + V LV+ A +V++G + + CL
Sbjct: 340 DACFRGTEA-RVSAAAAGGLLPEVGLVL-RGAEVVVAGAEKLLYRVPGERRGEGEGVWCL 397
Query: 327 GILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
+ A V +IG QD V YD R+G+ C L
Sbjct: 398 TFGSSDMAGV-SAYVIGHHHQQDVWVEYDLRNARLGFAAARCADL 441
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 92/382 (24%), Positives = 146/382 (38%), Gaps = 50/382 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK-PPEKQYKPHKNI----VPCSN 71
+ +G PP+ D +D WV C A C GC + P ++ V C
Sbjct: 100 YVARARLGTPPQTLLVAIDPSNDAAWVPCSA-CLGCAPGASSPSFDPTQSSTYRPVRCGA 158
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALV-TDLFPLRFSNGSVFNVP---L 127
P+CA + P P C + + Y S++ A++ D L SNG+ VP
Sbjct: 159 PQCAQVPPATPSCPAGPGASCAFNLSYAS--STLHAVLGQDALSLSDSNGAA--VPDDHY 214
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCI----GQNGR 182
TFGC G PP G++G GRG +S +SQ + YG ++ +C+ N
Sbjct: 215 TFGCLRVVTGSGGSVPPQ--GLVGFGRGPLSFLSQTKATYG---SIFSYCLPSYKSSNFS 269
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--------- 233
G L LG P + TP+L N Y + + +GK+ + L
Sbjct: 270 GTLRLGPAGQPRR-IKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGRG 328
Query: 234 --IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
I D+G + + Y + + R G AP C+ T+
Sbjct: 329 GTIVDAGTMFTRLSPPAYAALRNAFRR---GVSAPAAPALGGFDTCY------YVNGTKS 379
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGEN-NIIGEIFMQD 349
+A F R+ +P E ++ S V CL + G V N++ + Q+
Sbjct: 380 VPAVAFVFA---GGARVTLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQN 436
Query: 350 KMVIYDNEKQRIGWKPEDCNTL 371
V++D R+G+ E C +
Sbjct: 437 HRVVFDVGNGRVGFSRELCTAV 458
>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
Length = 458
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 94/391 (24%), Positives = 165/391 (42%), Gaps = 60/391 (15%)
Query: 18 AVNLTVGKPPKLFDFDFDTGSDLTWVQCDA--PCTGCT-KPPEK------QYKPHKNIVP 68
+ L+ G PP+ F DTGS + W C CT C+ P+K + I+
Sbjct: 88 TIPLSFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILG 147
Query: 69 CSNPRCAALHWPNP----PRCKHPNDQC-----DYEIEYGDGGSSIGALVTDL-FPLRFS 118
C +P+CA P+ PRC + +C Y ++YG G +S L+ +L FP
Sbjct: 148 CRDPKCANTSSPDVHLGCPRCNGNSKKCSHACPQYTLQYGTGAASGFFLLENLDFP---- 203
Query: 119 NGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL--REYGLIRNVIGHC 176
G + L GC + P + + G GR S+ Q+ +++ N +
Sbjct: 204 -GKTIHKFLV-GCTTSADR-----EPSSDALAGFGRTMFSLPMQMGVKKFAYCLNSHDYD 256
Query: 177 IGQN-GRGVLFLGDGKVPSSGVAWTPMLQNSADLK-HYILGPAELLYSGKSCGL--KDLT 232
+N G+ +L DG+ + G+++ P L+N D +Y LG ++ K + K LT
Sbjct: 257 DTRNSGKLILDYSDGE--TQGLSYAPFLKNPPDYPFYYYLGVKDMKIGNKLLRIPGKYLT 314
Query: 233 --------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT-LPICWRGPFK 283
++ DSG +Y Y T V++ + + + + + L + ++ L C+
Sbjct: 315 PGSDSRGGVMIDSGFAYGYMTLPVFKIVTNELKKQMSKYRRSLEAETQSGLTPCYN---- 370
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGS-----EAEVG 337
G + L FT N +VVP Y ++ ++ C + S E G
Sbjct: 371 FTGHKSIKIPDLIYQFTGGAN---MVVPGMNYFLLFSEASLGCFPVTTDSPTNNLEFTPG 427
Query: 338 ENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+ I+G D V +D + +R+G++ + C
Sbjct: 428 PSIILGNYQQVDHYVEFDLKNERLGFRQQTC 458
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 88/378 (23%), Positives = 137/378 (36%), Gaps = 49/378 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V ++VG PP D+GSD+ WVQC PC C + + P + V C +
Sbjct: 171 YLVRVSVGSPPTEQYLVVDSGSDVMWVQCK-PCLECYVQADPLFDPATSATFSGVSCGSA 229
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C L P C+YE+ Y DG + GAL + L G + GCG
Sbjct: 230 ICRIL--PTSACGDGELGGCEYEVSYADGSYTKGALALETLTL----GGTAVEGVVIGCG 283
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG----------R 182
+ N G AG++GLG G +S+V QL G + +C+ G
Sbjct: 284 H--RNRGLFV--GAAGLMGLGWGPMSLVGQLG--GEVGGAFSYCLASRGGYGSGAADDDA 337
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK----SCGLKDLT------ 232
G L LG + G W P+++N Y +G + + + GL LT
Sbjct: 338 GWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGD 397
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGT-PLKLAPDDKTLPICWRGPFKALGQVTEY 291
++ D+G + Y + + L G P L C + G +
Sbjct: 398 VVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTC----YDLSGYASVR 453
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
++ F RL++ L+ CL S +I+G
Sbjct: 454 VPTVSFCFD---GDARLILAARNVLLEVDMGIYCLAFAPSSSGL----SIMGNTQQAGIQ 506
Query: 352 VIYDNEKQRIGWKPEDCN 369
+ D+ IG+ P +C
Sbjct: 507 ITVDSANGYIGFGPANCG 524
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 84/328 (25%), Positives = 130/328 (39%), Gaps = 54/328 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-------YKPHKNI--- 66
+ ++ +G P + DTGS WV C C P E Y P ++
Sbjct: 83 YYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQC--PHESDILRKLTFYDPRSSVSSK 139
Query: 67 -VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR--FSNGSV- 122
V C + C + PP C + +C Y Y DGG ++G L TDL + NG
Sbjct: 140 EVKCDDTICTS----RPP-C-NMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQ 193
Query: 123 -FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
+ +TFGCG Q S G++G G + +SQL G + + HC+ N
Sbjct: 194 PTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTN 253
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNS-----ADLKHYILG------PAELLYSGKSCGLK 229
G G+ +G+ P V TP+++N+ +LK + PA + + K+ G
Sbjct: 254 GGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKG-- 309
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
DSG++ Y +Y E++ + PD + F LG V
Sbjct: 310 ---TFIDSGSTLVYLPEIIYSELILAVFAK--------HPDITMGAMYNFQCFHFLGSVD 358
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+ F + F N + L V P YL+
Sbjct: 359 DKFPKITFHF---ENDLTLDVYPYDYLL 383
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 103/378 (27%), Positives = 150/378 (39%), Gaps = 54/378 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF + L VG P DTGSD+ W+QC +PC C + + P K+ VPC +
Sbjct: 136 YF-MRLGVGTPATNMYMVLDTGSDVVWLQC-SPCKVCYNQSDPVFNPAKSKTFATVPCGS 193
Query: 72 PRCAALHWPNPPRC-KHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L + C + C Y++ YGDG ++G T+ L F V +V L G
Sbjct: 194 RLCRRLD--DSSECVSRRSKACLYQVSYGDGSFTVGDFSTE--TLTFHGARVDHVAL--G 247
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-----EYGLIRNVIGHCIGQNGRGVL 185
CG++ N G LG G ++ R Y L+ + ++
Sbjct: 248 CGHD--NEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIV 305
Query: 186 FLGDGKVPSSGVAWTPMLQN-SADLKHYI------LGPAELLYSGKSCGLKDLT----LI 234
F G+G VP + V +TP+L N D +Y+ +G + + +S D T +I
Sbjct: 306 F-GNGAVPKTAV-FTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVI 363
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRD---LIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
DSG S T Y + +RD L T LK AP C F G T
Sbjct: 364 IDSGTSVTRLTQSAY-----VALRDAFRLGATRLKRAPSYSLFDTC----FDLSGMTTVK 414
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDK 350
+ FT S +P YL+ ++ + C +G +IIG I Q
Sbjct: 415 VPTVVFHFTGGEVS----LPASNYLIPVNNQGRFCFAF----AGTMGSLSIIGNIQQQGF 466
Query: 351 MVIYDNEKQRIGWKPEDC 368
V YD R+G+ C
Sbjct: 467 RVAYDLVGSRVGFLSRAC 484
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 98/373 (26%), Positives = 139/373 (37%), Gaps = 72/373 (19%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YFA ++ VG PP DTGSD+ W+QC APC C + + P ++ V C
Sbjct: 142 YFA-SVGVGTPPTPALLVLDTGSDVVWLQC-APCRQCYAQSGRVFDPRRSRSYAAVRCGA 199
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
P C L C C Y++ YGDG + G L T+ L F+ G+ VP + G
Sbjct: 200 PPCRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATET--LWFARGA--RVPRVAVG 255
Query: 131 CGYNQHN-------------PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
CG++ P TA G S L +IR V H
Sbjct: 256 CGHDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFSYCFQGSDLDHRTIIRTVHQHVG 315
Query: 178 GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDS 237
G RGV PS+G +I DS
Sbjct: 316 GARVRGVGERSLRLDPSTGRGG---------------------------------VILDS 342
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL-PICWRGPFKALGQVTEYFKPLA 296
G S VY + G L+LAP +L C+ + + +V LA
Sbjct: 343 GTSVTRLARPVYVAVREAFRAAAGG--LRLAPGGFSLFDTCYDLRGRRVVKVPTVSVHLA 400
Query: 297 LSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
+ +PPE YL+ + R CL L G++ V +I+G I Q V++D
Sbjct: 401 -------GGAEVALPPENYLIPVDTRGTFCLA-LAGTDGGV---SIVGNIQQQGFRVVFD 449
Query: 356 NEKQRIGWKPEDC 368
++QR+ P+ C
Sbjct: 450 GDRQRVALVPKSC 462
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 106/383 (27%), Positives = 168/383 (43%), Gaps = 49/383 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC-TGCTKPPEKQYKPHKN----IVPCSN 71
+ + L +G PP+ + DTGSDL W QC APC C K P Y P + ++PCS+
Sbjct: 92 YIMTLAIGTPPQSYPAIADTGSDLVWTQC-APCGERCFKQPSPLYNPSSSPTFRVLPCSS 150
Query: 72 P--RCAA---LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
CAA L PP P C Y YG G +S G ++ F S VP
Sbjct: 151 ALNLCAAEARLAGATPP----PGCACRYNQTYGTGWTS-GLQGSETFTFGSSPADQVRVP 205
Query: 127 -LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
+ FGC N +AG++GLGRG +S+VSQL G+ + + L
Sbjct: 206 GIAFGC----SNASSDDWNGSAGLVGLGRGGLSLVSQLAA-GMFSYCLTPFQDTKSKSTL 260
Query: 186 FLG----DGKVPSSGVAWTPMLQNSA----------DLKHYILGPAELLYSGKSCGLK-D 230
LG + +GV TP + + + +L +G A L + L+ D
Sbjct: 261 LLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRAD 320
Query: 231 LT--LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
T LI DSG + Y+ + + + R L+ P+ + L +C+ P +
Sbjct: 321 GTGGLIIDSGTTITSLVDAAYKRVRAAV-RSLVKLPVTDGSNATGLDLCFALPSSSAPPA 379
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
T + L F + +V+P E Y+++ G CL + + ++ GE + +G Q
Sbjct: 380 T--LPSMTLHFGGGAD---MVLPVENYMILDG-GMWCLAMRSQTD---GELSTLGNYQQQ 430
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
+ ++YD +K+ + + P C+TL
Sbjct: 431 NLHILYDVQKETLSFAPAKCSTL 453
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 89/373 (23%), Positives = 150/373 (40%), Gaps = 68/373 (18%)
Query: 17 FAVNLTVGKPP-KLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCS 70
+ ++ ++G PP K+F F DTGSDL W+QC+ PC C + P ++NI PC
Sbjct: 88 YLMSYSIGTPPFKVFGF-VDTGSDLVWLQCE-PCKQCYPQITPIFDPSLSSSYQNI-PCL 144
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF- 129
+ C ++ CD G L + L + G + P T
Sbjct: 145 SDTCHSMR----------TTSCDVR----------GYLSVETLTLDSTTGYSVSFPKTMI 184
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG---QNGRGVLF 186
GCGY N G P ++G++GLG G +S+ SQL I +C+G N L
Sbjct: 185 GCGY--RNTGTFHGP-SSGIVGLGSGPMSLPSQLGT--SIGGKFSYCLGPWLPNSTSKLN 239
Query: 187 LGDGK-VPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIFDSGA 239
GD V G TP+++ A +Y+ +G + + G + G + ++ DSG
Sbjct: 240 FGDAAIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYGGNEGNILIDSGT 299
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTLPICWRGPFKALGQ--VTEYFKPLA 296
++ + VY S + + L+ D + T +C+ + +T +FK
Sbjct: 300 TFTFLPYDVYYRFESAVAEYI---NLEHVEDPNGTFKLCYNVAYHGFEAPLITAHFKGAD 356
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
+ +++ CL + A I G + Q+ +V Y+
Sbjct: 357 IKLYYISTFIKV-----------SDGIACLAFIPSQTA------IFGNVAQQNLLVGYNL 399
Query: 357 EKQRIGWKPEDCN 369
+ + +KP DC
Sbjct: 400 VQNTVTFKPVDCT 412
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 95/380 (25%), Positives = 141/380 (37%), Gaps = 61/380 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK---NIVPCSNPR 73
+ V +G PP+L DT +D W+ C C+GC+ + V CS +
Sbjct: 104 YVVRAKLGTPPQLMFMVLDTSNDAVWLPCSG-CSGCSNASTSFNTNSSSTYSTVSCSTAQ 162
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C P C + YG S +LV D L + + N +FGC
Sbjct: 163 CTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDT--LTLAPDVIPN--FSFGC-I 217
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV- 192
N + L P G++GLGRG +S+VSQ L V +C+ + R F G K+
Sbjct: 218 NSASGNSLPP---QGLMGLGRGPMSLVSQTTS--LYSGVFSYCL-PSFRSFYFSGSLKLG 271
Query: 193 ----PSSGVAWTPMLQNSADLKHYILG--------------PAELLYSGKSCGLKDLTLI 234
P S + +TP+L+N Y + P L + S I
Sbjct: 272 LLGQPKS-IRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANS----GAGTI 326
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG F VY+ I RD + ++ F LG F
Sbjct: 327 IDSGTVITRFAQPVYEAI-----RDEFRKQVNVSS------------FSTLGAFDTCFSA 369
Query: 295 ----LALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQD 349
+A T S+ L +P E L+ S + CL + + N+I + Q+
Sbjct: 370 DNENVAPKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQN 429
Query: 350 KMVIYDNEKQRIGWKPEDCN 369
+++D RIG PE CN
Sbjct: 430 LRILFDVPNSRIGIAPEPCN 449
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 95/350 (27%), Positives = 143/350 (40%), Gaps = 43/350 (12%)
Query: 34 FDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKN----IVPCSNPRCAALHWPNPPRCKH 87
DT SD+TWVQC +PC P+K Y P K+ + C++P C L P C +
Sbjct: 148 LDTASDVTWVQC-SPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLG-PYANGCTN 205
Query: 88 PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTA 147
N+QC Y + Y DG S+ G ++DL L + + FGC + A
Sbjct: 206 -NNQCQYRVRYPDGTSTAGTYISDL--LTITPATAVRS-FQFGCSHGVQGSFSFG-SSAA 260
Query: 148 GVLGLGRGRISIVSQLRE-YGLIRNVIGHCI-GQNGRGVLFLGDGKVPSSGVAWTPMLQN 205
G++ LG G S+VSQ YG V HC RG LG +V + TPML+N
Sbjct: 261 GIMALGGGPESLVSQTAATYG---RVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKN 317
Query: 206 SA-DLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTS------RVYQEIVSLIMR 258
A Y++ + +G+ + +F +GA+ T+ YQ + R
Sbjct: 318 PAIPPTFYMVRLEAIAVAGQRIAVPP--TVFAAGAALDSRTAITRLPPTAYQAL-RQAFR 374
Query: 259 DLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVI 318
D + + AP L C+ + V + P ++ +V L P L
Sbjct: 375 DRMAM-YQPAPPKGPLDTCYD-----MAGVRSFALPRITLVFDKNAAVEL--DPSGVLF- 425
Query: 319 SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
CL G +V IIG I +Q V+Y+ +G++ C
Sbjct: 426 ----QGCLAFTAGPNDQV--PGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 94/350 (26%), Positives = 142/350 (40%), Gaps = 43/350 (12%)
Query: 34 FDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKN----IVPCSNPRCAALHWPNPPRCKH 87
DT SD+TWVQC +PC P+K Y P K+ + C++P C L P C +
Sbjct: 173 LDTASDVTWVQC-SPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLG-PYANGCTN 230
Query: 88 PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTA 147
N+QC Y + Y DG S+ G ++DL + + FGC + A
Sbjct: 231 -NNQCQYRVRYPDGTSTAGTYISDLLTITPATAVRS---FQFGCSHGVQGSFSFG-SSAA 285
Query: 148 GVLGLGRGRISIVSQLRE-YGLIRNVIGHCI-GQNGRGVLFLGDGKVPSSGVAWTPMLQN 205
G++ LG G S+VSQ YG V HC RG LG +V + TPML+N
Sbjct: 286 GIMALGGGPESLVSQTAATYG---RVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKN 342
Query: 206 SA-DLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTS------RVYQEIVSLIMR 258
A Y++ + +G+ + +F +GA+ T+ YQ + R
Sbjct: 343 PAIPPTFYMVRLEAIAVAGQRIAVPP--TVFAAGAALDSRTAITRLPPTAYQAL-RQAFR 399
Query: 259 DLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVI 318
D + + AP L C+ + V + P ++ +V L P L
Sbjct: 400 DRMAM-YQPAPPKGPLDTCYD-----MAGVRSFALPRITLVFDKNAAVEL--DPSGVLF- 450
Query: 319 SGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
CL G +V IIG I +Q V+Y+ +G++ C
Sbjct: 451 ----QGCLAFTAGPNDQV--PGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494
>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
Length = 416
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 99/388 (25%), Positives = 160/388 (41%), Gaps = 71/388 (18%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y N T+G PP+ S + V APC+ ++P PC C
Sbjct: 66 YNVANFTIGTPPQ-------PASAIIDVAGPAPCSFPNA--SSTFRPE----PCGTDACK 112
Query: 76 ALHWPNPPRCKHPNDQCDYE--IEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC-- 131
++ P ++ C YE I GG ++G + TD F + + S L FGC
Sbjct: 113 SI-----PTSNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIGTATAS-----LGFGCVV 162
Query: 132 --GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD 189
G + GP +G++GLGR S+VSQ+ + H G+N R L LG
Sbjct: 163 ASGIDTMG-GP------SGLIGLGRAPSSLVSQMNITKFSYCLTPHDSGKNSR--LLLGS 213
Query: 190 GKVPSSG--VAWTPMLQNSA--DLKHYILGPAELLYSGKSCGLKDL-------TLIFDSG 238
+ G TP ++ S D+ Y P +L G G + T++ +
Sbjct: 214 SAKLAGGGNSTTTPFVKTSPGDDMSQYY--PIQL--DGIKAGDAAIALPPSGNTVLVQTL 269
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLK--LAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
A ++ YQ + + + + P L P D +C+ P L +
Sbjct: 270 APMSFLVDSAYQALKKEVTKAVGAAPTATPLQPFD----LCF--PKAGLSNASAP----D 319
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRK--NVCLGILNGS---EAEVGEN-NIIGEIFMQDK 350
L FT ++ + L VPP YL+ G + VC+ IL+ S + EN NI+G + ++
Sbjct: 320 LVFTFQQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENT 379
Query: 351 MVIYDNEKQRIGWKPEDCNTLLSLNHFI 378
+ D EK+ + ++P DC L ++ F+
Sbjct: 380 HFLLDLEKKTLSFEPADCAHLSLIDGFL 407
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 95/383 (24%), Positives = 161/383 (42%), Gaps = 49/383 (12%)
Query: 14 FSYFAVNLTVGKP-PKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VP 68
++ + ++ +G P P+ + DTGSD+ W QC PC C P ++ + V
Sbjct: 89 YTEYLIHFGIGTPRPQQVALEVDTGSDVVWTQCR-PCFDCFTQPLPRFDTSASDTVHGVL 147
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-L 127
C++P C AL P C C Y++ YGD +IG L D F G VP L
Sbjct: 148 CTDPICRAL---RPHACFLGG--CTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDL 202
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGV 184
FGCG Q+N G +T G+ G GRG +S+ QL G+ + +C I ++
Sbjct: 203 VFGCG--QYNTGNFHSNET-GIAGFGRGPLSLPRQL---GV--SSFSYCFTTIFESKSTP 254
Query: 185 LFLG----DG-KVPSSG-VAWTPMLQNSAD-----LKHYILGPAELLYSGKSCGLK---D 230
+FLG DG + ++G + TP L N + LK +G L + +K
Sbjct: 255 VFLGGAPADGLRAHATGPILSTPFLPNHPEYYYLSLKGITVGKTRLAVPESAFVVKADGS 314
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPL-KLAPDDKTLPICWRGPFKALGQVT 289
I DSG + F V++ + + + PL + +D P +++ +
Sbjct: 315 GGTIIDSGTAITAFPRAVFRSLWEAFVAQV---PLPHTSYNDTGEPTLQCFSTESVPDAS 371
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
+ P T +P E Y+ +C+ +L G + + +IG Q
Sbjct: 372 KVPVP---KMTLHLEGADWELPRENYMAEYPDSDQLCVVVLAGDD----DRTMIGNFQQQ 424
Query: 349 DKMVIYDNEKQRIGWKPEDCNTL 371
+ +++D ++ +P C+ +
Sbjct: 425 NMHIVHDLAGNKLVIEPAQCDKM 447
>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
Length = 491
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 99/419 (23%), Positives = 171/419 (40%), Gaps = 83/419 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTK----PPEKQYKPHKN----I 66
+A +++G PP+ DTGS L+WV C + C C+ P + P + +
Sbjct: 89 YAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAASPLHVFHPKNSSSSRL 148
Query: 67 VPCSNPRCAALHWPN----------------PPRCKHPNDQC-DYEIEYGDGGSSIGALV 109
+ C NP C +H P+ PR + N+ C Y + YG GS+ G L+
Sbjct: 149 IGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGS-GSTAGLLI 207
Query: 110 TDLFPLRFSNGSVFNVPLTFGCGYNQ-HNPGPLSPPDTAGVLGLGRGRISIVSQLR---- 164
+D LR +V N GC H P +G+ G GRG S+ SQL
Sbjct: 208 SDT--LRTPGRAVRN--FVIGCSLASVHQP-------PSGLAGFGRGAPSVPSQLGLTKF 256
Query: 165 EYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLK----HYILGPAELL 220
Y L+ +G +L GK G+ + P+ ++++ +Y L +
Sbjct: 257 SYCLLSRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAIT 316
Query: 221 YSGKSCGLKDLTL---------IFDSGASYAYFTSRVYQEIVSLIMRDLIG--TPLKLAP 269
GKS L + I DSG +++YF V++ + + ++ + G + K+
Sbjct: 317 VGGKSVQLPERAFVAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVE 376
Query: 270 DDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISG--------- 320
+ L C+ P G T ++L F + + +P E Y V++G
Sbjct: 377 EGLGLSPCFAMP---PGTKTMELPEMSLHF---KGGSVMNLPVENYFVVAGPAPSGGAPA 430
Query: 321 -RKNVCLGILNGSEAEVGENN--------IIGEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
+ +CL +++ G I+G Q+ + YD EK+R+G++ + C +
Sbjct: 431 MAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQCAS 489
>gi|222624645|gb|EEE58777.1| hypothetical protein OsJ_10300 [Oryza sativa Japonica Group]
Length = 431
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 95/390 (24%), Positives = 142/390 (36%), Gaps = 60/390 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
V + VG PP+ DTGS+L+W+ C+ G PP + S R
Sbjct: 55 LTVPVAVGTPPQNVTMVLDTGSELSWLLCN----GSYAPP---------LTRRSTRRWRG 101
Query: 77 LHWPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC---- 131
P PP C P++ C + Y D S+ G L TD F L V FGC
Sbjct: 102 RDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTF-LLTGGAPPVAVGAYFGCITSY 160
Query: 132 ----GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-QNGRGVLF 186
N + G G+LG+ RG +S V+Q G R +CI G GVL
Sbjct: 161 SSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQT---GTRR--FAYCIAPGEGPGVLL 215
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------------- 233
LGD + + +TP+++ S L ++ + G G L +
Sbjct: 216 LGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTGAG 275
Query: 234 --IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK-------TLPICWRGPFKA 284
+ DSG + + + Y + + L LAP + C+RGP
Sbjct: 276 QTMVDSGTQFTFLLADAYAALKAEFTSQ---ARLLLAPLGEPGFVFQGAFDACFRGPEAR 332
Query: 285 LGQVTEYFKPLALSFTNRRNSVR-----LVVPPEAYLVISGRKNVCLGILNGSEAEVGEN 339
+ + + L +V +VP E CL N A +
Sbjct: 333 VAAASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGM-SA 391
Query: 340 NIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+IG Q+ V YD + R+G+ P C+
Sbjct: 392 YVIGHHHQQNVWVEYDLQNGRVGFAPARCD 421
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 102/384 (26%), Positives = 146/384 (38%), Gaps = 72/384 (18%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF L VG PP+ DTGSD+ W+QC PC C + + P + VPC+
Sbjct: 153 YF-TRLGVGTPPRYTYMVLDTGSDIMWIQC-LPCAKCYGQTDPLFNPAASSTYRKVPCAT 210
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L C++ C+Y++ YGDG ++G T+ R G V + GC
Sbjct: 211 PLCKKLDISG---CRNKR-YCEYQVSYGDGSFTVGDFSTETLTFR---GQVIR-RVALGC 262
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRI-----SIVSQLREYGLI-RNVIGHCIGQNGRGVL 185
G++ N G LG G + S+ Y L+ R+ G L
Sbjct: 263 GHD--NEGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFSYCLVDRSASGTA------SSL 314
Query: 186 FLGDGKVPSSGVAWTPMLQN-SADLKHYILGPAELLYSGKSCGLKDLT------------ 232
G +P S + +TP+L N D +Y+ EL+ G S G + LT
Sbjct: 315 IFGKAAIPKSAI-FTPLLSNPKLDTFYYV----ELV--GISVGGRRLTSIPASVFRMDAT 367
Query: 233 ----LIFDSGASYAYFTSRVYQEIVSLIMRDL--IGT-PLKLAPDDKTLPICWRGPFKAL 285
+I DSG S Y MRD +GT LK A C+
Sbjct: 368 GNGGVIIDSGTSVTRLVDSAYS-----TMRDAFRVGTGNLKSAGGFSLFDTCY----DLS 418
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGE 344
G T L F + + +P YL+ + C G +IIG
Sbjct: 419 GLKTVKVPTLVFHF---QGGAHISLPATNYLIPVDSSATFCFAF----AGNTGGLSIIGN 471
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDC 368
I Q V++D+ R+G+K C
Sbjct: 472 IQQQGYRVVFDSLANRVGFKAGSC 495
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 101/376 (26%), Positives = 150/376 (39%), Gaps = 57/376 (15%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSN 71
YF L VG PPK DTGSD+ W+QC APC C + + P K + + C +
Sbjct: 147 YF-TRLGVGTPPKYVYMVLDTGSDVVWIQC-APCRKCYSQTDPVFDPKKSGSFSSISCRS 204
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
P C L + P C + C Y++ YGDG + G T+ R + VP + G
Sbjct: 205 PLCLRL---DSPGC-NSRQSCLYQVAYGDGSFTFGEFSTETLTFRGT-----RVPKVALG 255
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRN--VIGHCIGQNGRGVLFLG 188
CG++ N G LG GR + LR +G + ++ V+F G
Sbjct: 256 CGHD--NEGLFVGAAGLLGLGRGRLSFPTQTGLR-FGRKFSYCLVDRSASSKPSSVVF-G 311
Query: 189 DGKVPSSGVAWTPMLQN-SADLKHYI------LGPAELLYSGKSCGLKDLT------LIF 235
V + V +TP++ N D +Y+ +G A + +G + L L +I
Sbjct: 312 QSAVSRTAV-FTPLITNPKLDTFYYLELTGISVGGARV--AGITASLFKLDTAGNGGVII 368
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLI---GTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
DSG S T R Y + +RD LK APD C F G+
Sbjct: 369 DSGTSVTRLTRRAY-----VSLRDAFRAGAADLKRAPDYSLFDTC----FDLSGKTEVKV 419
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
+ + F + +P YL+ V G+ + + +IIG I Q V
Sbjct: 420 PTVVMHF----RGADVSLPATNYLIPVDTNGVFCFAFAGTMSGL---SIIGNIQQQGFRV 472
Query: 353 IYDNEKQRIGWKPEDC 368
++D RIG+ C
Sbjct: 473 VFDVAASRIGFAARGC 488
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 89/372 (23%), Positives = 148/372 (39%), Gaps = 46/372 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ N T+G PP+ D +L W QC C+ C + + P + PC P
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCKQ-CSRCFEQDTPLFDPTASNTYRAEPCGTP 109
Query: 73 RCAALHWPNPPRCKHPNDQCDYE--IEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C ++ P+ R + C Y+ GD G +G TD F + + S L FG
Sbjct: 110 LCESI--PSDSR-NCSGNVCAYQASTNAGDTGGKVG---TDTFAVGTAKAS-----LAFG 158
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
C P +G++GLGR S+V+Q + H G+N LFLG
Sbjct: 159 CVVASDIDTMGGP---SGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGRN--SALFLGSS 213
Query: 191 KVPSSG--VAWTPMLQ---NSADLKHYILGPAELLYSGKSC---GLKDLTLIFDSGASYA 242
+ G A TP + N DL +Y E L +G + T++ D+ + +
Sbjct: 214 AKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVLLDTFSPIS 273
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLK--LAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
+ YQ + + + P+ + P D P A G + L +F
Sbjct: 274 FLVDGAYQAVKKAVTAAVGAPPMATPVEPFDLCFP-----KSGASGAAPD----LVFTF- 323
Query: 301 NRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEA-EVGENNIIGEIFMQDKMVIYDNEKQ 359
R + VP YL+ VCL +L+ + E +++G + ++ ++D +K+
Sbjct: 324 --RGGAAMTVPATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKE 381
Query: 360 RIGWKPEDCNTL 371
+ ++P DC L
Sbjct: 382 TLSFEPADCTKL 393
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 98/373 (26%), Positives = 151/373 (40%), Gaps = 46/373 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKNI----VPCS 70
+ V L G P DTGSDL+WVQC PC T P+K + P + VPC
Sbjct: 122 YVVTLGFGTPAVPQVLLIDTGSDLSWVQCQ-PCNSSTCYPQKDPVFDPSASSTYAPVPCG 180
Query: 71 NPRCAAL---HWPNPPRCKHPNDQ---CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+ C L + N C + + C Y I+YG+G +++G T+ L +V N
Sbjct: 181 SEACRDLDPDSYAN--GCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTLSPEAATVVN 238
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGR 182
+FGCG Q G+LGLG S+VSQ G +C+ G +
Sbjct: 239 -NFSFGCGLVQKG----VFDLFDGLLGLGGAPESLVSQTT--GTYGGAFSYCLPAGNSTA 291
Query: 183 GVLFLG---DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----LIF 235
G L LG G ++G +TP+ + Y++ + GK ++ +I
Sbjct: 292 GFLALGAPATGGNNTAGFQFTPL--QVVETTFYLVKLTGISVGGKQLDIEPTVFAGGMII 349
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG Y + + + PL DD+ L C+ G +
Sbjct: 350 DSGTIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCY----DFTGNTNVTVPTV 405
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYD 355
AL+F ++ L VP L + CL + G A G+ IIG + + V+YD
Sbjct: 406 ALTFEGGV-TIDLDVPSGVLL------DGCLAFVAG--ASDGDTGIIGNVNQRTFEVLYD 456
Query: 356 NEKQRIGWKPEDC 368
+ + +G++ C
Sbjct: 457 SARGHVGFRAGAC 469
>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
Length = 464
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 95/397 (23%), Positives = 144/397 (36%), Gaps = 71/397 (17%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEK---------QYKPHKNI--------VPCSNPRCAAL 77
DTGSDL W QC C P Q P+ N VPC + A
Sbjct: 79 DTGSDLVWTQCST----CRLPAVAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDDGALC 134
Query: 78 H-WPNPPRCKHP----NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC- 131
P C +D C YG G ++G L TD F S+ +V L FGC
Sbjct: 135 GVAPETAGCARGGGSGDDACVVAASYG-AGVALGVLGTDAFTFPSSS----SVTLAFGCV 189
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
+ +PG L+ +G++GLGRG +S+VSQL + + LF+GDG+
Sbjct: 190 SQTRISPGALN--GASGIIGLGRGALSLVSQLNATEFSYCLTPYFRDTVSPSHLFVGDGE 247
Query: 192 VPSSG------------VAWTPMLQNSAD----------LKHYILGPAELLYSGKSCGLK 229
+ V P +N D L G A + + L+
Sbjct: 248 LAGLRAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGNATVALPAGAFDLR 307
Query: 230 DLT-------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK---TLPICWR 279
+ + DSG+ + ++ + + R L G+ + P K L +C
Sbjct: 308 EAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALELCVE 367
Query: 280 GPFKALGQVTEYFKPLALSFTNRRNSVR-LVVPPEAYLVISGRKNVCLGILNGSEAEV-- 336
PL L F + R LV+P E Y C+ +++ +
Sbjct: 368 AGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEASTWCMAVVSSASGNATL 427
Query: 337 --GENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
E IIG QD V+YD + ++P +C+ +
Sbjct: 428 PTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCSAV 464
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/197 (32%), Positives = 95/197 (48%), Gaps = 27/197 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +NL++G PP F DTGS L W QC APCT C P ++P + +PC++
Sbjct: 90 YNMNLSIGTPPVTFSVLADTGSSLIWTQC-APCTECAARPAPPFQPASSSTFSKLPCASS 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C L P C C Y YG G ++ G L T+ + G+ F +TFGC
Sbjct: 149 LCQFLTSPY-RTCNATG--CVYYYPYGMGFTA-GYLATETLHV---GGASFP-GVTFGCS 200
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG----VLFLG 188
++ G ++G++GLGR +S+VSQ+ G+ R +C+ N +LF
Sbjct: 201 -TENGVG----NSSSGIVGLGRSPLSLVSQV---GVAR--FSYCLRSNADAGDSPILFGS 250
Query: 189 DGKVPSSGVAWTPMLQN 205
KV V TP+L+N
Sbjct: 251 LAKVTGGNVQSTPLLEN 267
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 104/390 (26%), Positives = 143/390 (36%), Gaps = 85/390 (21%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF + VG PPK DTGSD+ W+QC APC C + + P K+ V C
Sbjct: 129 YF-TRIGVGTPPKYVYMVLDTGSDIVWLQC-APCKNCYSQTDPVFNPVKSGSFAKVLCRT 186
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L P C C Y++ YGDG + G VT+ L F V V L GC
Sbjct: 187 PLCRRLESPG---CNQ-RQTCLYQVSYGDGSYTTGEFVTET--LTFRRTKVEQVAL--GC 238
Query: 132 GYNQHN------------PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
G++ G LS P AG +Q Y L +
Sbjct: 239 GHDNEGLFVGAAGLLGLGRGGLSFPSQAG---------RTFNQKFSYCL----VDRSASS 285
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQN-SADLKHYILGPAELLYSGKSCGLKDLT------ 232
V+F G+ V S +TP+L N D +Y+ ELL G S G ++
Sbjct: 286 KPSSVVF-GNSAV-SRTARFTPLLTNPRLDTFYYV----ELL--GISVGGTPVSGITASH 337
Query: 233 ----------LIFDSGASYAYFTSRVYQEIVSLIMRDLI---GTPLKLAPDDKTLPICWR 279
+I D G S Y + +RD + LK AP+ C+
Sbjct: 338 FKLDRTGNGGVIIDCGTSVTRLNKPAY-----IALRDAFRAGASSLKSAPEFSLFDTCY- 391
Query: 280 GPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGE 338
G+ T + L F + +P YL+ + G C +
Sbjct: 392 ---DLSGKTTVKVPTVVLHF----RGADVSLPASNYLIPVDGSGRFCFAFAGTTSGL--- 441
Query: 339 NNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+IIG I Q V+YD R+G+ P C
Sbjct: 442 -SIIGNIQQQGFRVVYDLASSRVGFSPRGC 470
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 102/395 (25%), Positives = 148/395 (37%), Gaps = 87/395 (22%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSN 71
YF + ++VG PP+ DTGSD+ W+QC APC C + + P+K + + CS
Sbjct: 58 YF-IRISVGTPPRRMYLVMDTGSDILWLQC-APCVNCYHQSDAIFDPYKSSTYSTLGCST 115
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG---SVFN-VPL 127
+C L C+ ++C Y+++YGDG + G TD L ++G V N +PL
Sbjct: 116 RQCLNLDIGT---CQA--NKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPL 170
Query: 128 TFGCGYNQHN------------------PGPLSPPDTAGVLGLGRGRISIVSQLREYGLI 169
GCG++ P + P + GR S RE
Sbjct: 171 --GCGHDNEGYFVGAAGLLGLGKGPLSFPNQVDPQNG--------GRFSYCLTDRETD-- 218
Query: 170 RNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC--- 226
G ++F G+ VP +G +TP N Y L + G
Sbjct: 219 --------STEGSSLVF-GEAAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIP 269
Query: 227 -------GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLI--GTPLKLAPDD--KTLP 275
L + +I DSG S + Y +RD GT LAP
Sbjct: 270 TSAFQLDSLGNGGVIIDSGTSVTRLQNAAYAS-----LRDAFRAGTS-DLAPTAGFSLFD 323
Query: 276 ICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEA 334
C+ L V + L F + L +P YL+ + CL A
Sbjct: 324 TCY--DLSGLASVD--VPTVTLHF---QGGTDLKLPASNYLIPVDNSNTFCLAF-----A 371
Query: 335 EVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+IIG I Q VIYDN ++G+ P CN
Sbjct: 372 GTTGPSIIGNIQQQGFRVIYDNLHNQVGFVPSQCN 406
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 112/393 (28%), Positives = 169/393 (43%), Gaps = 62/393 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ +++ VG PP+ F DTGSDL W+QC APC C + + P ++N+ C +
Sbjct: 151 YLMDVYVGTPPRRFRMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASSSYRNVT-CGD 208
Query: 72 PRCAALHWPNPP------RCKHP-NDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVF 123
RC + P P C+ P D C Y YGD ++ G L + F + + G+
Sbjct: 209 HRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTAPGASR 268
Query: 124 NVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNG 181
V + FGCG+ N G AG+LGLGRG +S SQLR YG + +C+ +G
Sbjct: 269 RVDGVVFGCGH--RNRGLFH--GAAGLLGLGRGPLSFASQLRAVYG---HTFSYCLVDHG 321
Query: 182 RGV---LFLGDGKVPSSGVAWTPMLQNSA-----------------DLKHYILGPAELLY 221
V + G+ + +A P L+ +A LK ++G L
Sbjct: 322 SDVGSKVVFGEDD-DALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGELLNI 380
Query: 222 SGKSCGL-KDLT--LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
S + + KD + I DSG + +YF YQ I M D + L P+ L C+
Sbjct: 381 SSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFM-DRMSRSYPLVPEFPVLSPCY 439
Query: 279 RGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVI---SGRKNVCLGILNGSEAE 335
+V E L+L F + P E Y + G +CL +L
Sbjct: 440 NVSGVERPEVPE----LSLLFAD---GAVWDFPAENYFIRLDPDGGSIMCLAVLGTPRTG 492
Query: 336 VGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+ +IIG Q+ V+YD + R+G+ P C
Sbjct: 493 M---SIIGNFQQQNFHVVYDLQNNRLGFAPRRC 522
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 96/403 (23%), Positives = 150/403 (37%), Gaps = 69/403 (17%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGC----TKPPEKQYKPHKN-- 65
+ +++ L+ G PP+ DTGSDL W C C C + P + P +
Sbjct: 87 YGAYSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSS 146
Query: 66 --IVPCSNPRCAALHW-----------PNPPRCKHPNDQC-DYEIEYGDGGSSIGALVTD 111
++ C NP+C +H P P C C Y + YG G + G ++++
Sbjct: 147 SKVLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQ---ICPPYLVFYGSGITG-GIMLSE 202
Query: 112 LFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRN 171
L F V GC LS AG+ G GRG S+ SQL
Sbjct: 203 TLDLPGKGVPNFIV----GCSV-------LSTSQPAGISGFGRGPPSLPSQLGLKKFSYC 251
Query: 172 VIGHCIGQNGRGVLFLGDGKVPS----SGVAWTPMLQN------SADLKHYILGPAELLY 221
++ + DG+ S +G+++TP +QN A +Y LG +
Sbjct: 252 LLSRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITV 311
Query: 222 SGKSCGLK----------DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD 271
GK + D I DSG ++ Y +++ + + + +
Sbjct: 312 GGKHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGI 371
Query: 272 KTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILN 330
L C F G T F L L F R + +P Y+ + G VCL I+
Sbjct: 372 TGLRPC----FNISGLNTPSFPELTLKF---RGGAEMELPLANYVAFLGGDDVVCLTIVT 424
Query: 331 ----GSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
G E G I+G Q+ V YD +R+G++ + C
Sbjct: 425 DGAAGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSCK 467
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 101/434 (23%), Positives = 160/434 (36%), Gaps = 84/434 (19%)
Query: 7 EFFFFPIFS--------YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCD------------ 46
E F P+ S YF V VG P + F DTGSDLTWV+C
Sbjct: 38 EAFAMPLSSGAYTGTGQYF-VRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPA 96
Query: 47 --------AP-------CTGCTKPPEKQYKPHKNI----VPCSNPRCAALHWPNPPRCKH 87
AP + P + ++P ++ +PCS+ C A + C
Sbjct: 97 PGYNYGYGAPASNDSSSVSAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPT 156
Query: 88 PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-------LTFGCGYNQHNPGP 140
P C YE Y DG ++ G + TD + S + GC +
Sbjct: 157 PGSPCAYEYRYKDGSAARGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESF 216
Query: 141 LSPPDTAGVLGLGRGRISIVSQ-LREYG--LIRNVIGHCIGQNGRGVLFLG--------- 188
L+ + GVL LG +S S+ +G ++ H +N L G
Sbjct: 217 LA---SDGVLSLGYSNVSFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSAS 273
Query: 189 ------DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT--------LI 234
G + G TP+L + Y + + G+ + L I
Sbjct: 274 ASRTACAGSAAAPGARQTPLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKGGGAI 333
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG S S Y+ +V+ + + L+G P ++A D W P +
Sbjct: 334 LDSGTSLTVLVSPAYRAVVAALGKKLVGLP-RVAMDPFDYCYNWTSPLTGE-DLAVAVPA 391
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
LA+ F S RL PP++Y++ + C+G+ G V ++IG I Q+ + +
Sbjct: 392 LAVHFA---GSARLQPPPKSYVIDAAPGVKCIGLQEGDWPGV---SVIGNILQQEHLWEF 445
Query: 355 DNEKQRIGWKPEDC 368
D + +R+ +K C
Sbjct: 446 DLKNRRLRFKRSRC 459
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 87/368 (23%), Positives = 139/368 (37%), Gaps = 51/368 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT-GCTKPPEKQYKPHKNI-------VP 68
+ ++ +VG PP++ D SD W+QC A T G P P V
Sbjct: 97 YVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIREVR 156
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG--SSIGALVTDLFPLRFSNGSVFNVP 126
C+N C L P C + C Y YG G ++ G L D F +V
Sbjct: 157 CANRGCQRLV---PQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAF----ATVRADG 209
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
+ FGC D GV+GLGRG +S+VSQL+ + G +LF
Sbjct: 210 VIFGCAVATEG-------DIGGVIGLGRGELSLVSQLQIGRFSYYLAPDDAVDVGSFILF 262
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGAS------ 240
L D K +S TP++ N A Y + A + G+ + T + S
Sbjct: 263 LDDAKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLS 322
Query: 241 ----YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQVTEYFK 293
+ + Y+ ++R + + + L D + L +C+ A +V
Sbjct: 323 ITIPVTFLDAGAYK-----VVRQAMASKIGLRAADGSELGLDLCYTSESLATAKVPS--- 374
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVI 353
+AL F +V + + + S CL IL + G+ +++G + +I
Sbjct: 375 -MALVFAG--GAVMELEMGNYFYMDSTTGLECLTIL---PSPAGDGSLLGSLIQVGTHMI 428
Query: 354 YDNEKQRI 361
YD R+
Sbjct: 429 YDISGSRL 436
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 143/386 (37%), Gaps = 76/386 (19%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF L VG P + DTGSD+ W+QC APC C + + P K+ +PCS+
Sbjct: 142 YF-TRLGVGTPARYVYMVLDTGSDIVWLQC-APCRRCYSQSDPIFDPRKSKTYATIPCSS 199
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L + C C Y++ YGDG ++G T+ L F V V L GC
Sbjct: 200 PHCRRL---DSAGCNTRRKTCLYQVSYGDGSFTVGDFSTET--LTFRRNRVKGVAL--GC 252
Query: 132 GYNQHN------------PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
G++ G LS P G +Q Y L +
Sbjct: 253 GHDNEGLFVGAAGLLGLGKGKLSFPGQTG---------HRFNQKFSYCL----VDRSASS 299
Query: 180 NGRGVLFLGDGKVPSSGVA-WTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLT- 232
V+F G S +A +TP+L N Y +G + G + L L
Sbjct: 300 KPSSVVF---GNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQ 356
Query: 233 -----LIFDSGASYAYFTSRVYQEIVSLIMRDLI---GTPLKLAPDDKTLPICWRGPFKA 284
+I DSG S Y + MRD LK AP+ C+
Sbjct: 357 IGNGGVIIDSGTSVTRLIRPAY-----IAMRDAFRVGAKTLKRAPNFSLFDTCF-----D 406
Query: 285 LGQVTEYFKP-LALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNII 342
L + E P + L F RR V L P YL+ + C +G +II
Sbjct: 407 LSNMNEVKVPTVVLHF--RRADVSL--PATNYLIPVDTNGKFCFAF----AGTMGGLSII 458
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDC 368
G I Q V+YD R+G+ P C
Sbjct: 459 GNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 87/369 (23%), Positives = 141/369 (38%), Gaps = 43/369 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
+ V + +G P K FDTGSD+TW QC C K E+ + P ++ + ++
Sbjct: 149 YIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSS 208
Query: 77 LHWP------NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ N P C + C Y I+YGD S+G T+ L ++ FN + FG
Sbjct: 209 ICNSLTSATGNTPGC--ASSACVYGIQYGDSSFSVGFFGTE--KLTLTSTDAFN-NIYFG 263
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG N S R ++S+VSQ + + +C+ + FL G
Sbjct: 264 CGQNNQGLFGGSAGLLGLG----RDKLSVVSQTAQK--YNKIFSYCLPSSSSSTGFLTFG 317
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----------IFDSGAS 240
S +TP+ SA Y L ++G S G K L + I DSG
Sbjct: 318 GSASKNAKFTPLSTISAGPSFY-----GLDFTGISVGGKKLAISASVFSTAGAIIDSGTV 372
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
Y + + + P+ A L C+ F + ++ + SF+
Sbjct: 373 ITRLPPAAYSALRASFRNLMSKYPMTKALS--ILDTCY--DFSSYTTIS--VPKIGFSFS 426
Query: 301 NRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQR 360
+ + + + L S VCL S+A + I G + + V YD +
Sbjct: 427 ---SGIEVDIDATGILYASSLSQVCLAFAGNSDAT--DVFIFGNVQQKTLEVFYDGSAGK 481
Query: 361 IGWKPEDCN 369
+G+ P C+
Sbjct: 482 VGFAPGGCS 490
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 95/388 (24%), Positives = 154/388 (39%), Gaps = 59/388 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKP-PEKQYKPHKNIVPCSNPR 73
+LT+G PP+ DTGS+L+W++C T P K Y +PCS+
Sbjct: 67 LTASLTIGTPPQNITMVLDTGSELSWLRCKKEPNFTSIFNPLASKTYTK----IPCSSQT 122
Query: 74 CAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C P C P C + I Y D S G L + F RF GS+ FGC
Sbjct: 123 CKTRTSDLTLPVTCD-PAKLCHFIISYADASSVEGHLAFETF--RF--GSLTRPATVFGC 177
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL--REYGLIRNVIGHCI-GQNGRGVLFLG 188
+ + T G++G+ RG +S V+Q+ R++ +CI G + G L LG
Sbjct: 178 MDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQMGFRKF-------SYCISGLDSTGFLLLG 230
Query: 189 DGKVP-SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-------------- 233
+ + + +TP++Q S L ++ + G K L L
Sbjct: 231 EARYSWLKPLNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQ 290
Query: 234 -IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK-----TLPICW-----RGPF 282
+ DSG + + VY + + G L++ + + + +C+
Sbjct: 291 TMVDSGTQFTFLLGPVYSALRKEFLLQTAGV-LRVLNEPQYVFQGAMDLCYLIDSTSSTL 349
Query: 283 KALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNI 341
L V F+ +S + +R R VP E + G+ +V C N E + + +
Sbjct: 350 PNLPVVKLMFRGAEMSVSGQRLLYR--VPGE----VRGKDSVWCFTFGNSDELGI-SSFL 402
Query: 342 IGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
IG Q+ + YD E RIG+ C+
Sbjct: 403 IGHHQQQNVWMEYDLENSRIGFAELRCD 430
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 98/391 (25%), Positives = 153/391 (39%), Gaps = 68/391 (17%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
+ + +++G PP+ DTGSDL W QC T + + Y P K+ PC
Sbjct: 88 HHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHR-EKPLYDPAKSSSFAAAPCDG 146
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C + N C ++C Y YG ++ G L ++ F F +V L FGC
Sbjct: 147 RLCETGSF-NTKNCSR--NKCIYTYNYGS-ATTKGELASETF--TFGEHRRVSVSLDFGC 200
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR----EYGLI----RNVIGHCIGQNGRG 183
G + G L P +G+LG+ R+S+VSQL+ Y L RN H
Sbjct: 201 G--KLTSGSL--PGASGILGISPDRLSLVSQLQIPRFSYCLTPFLDRNTTSH-------- 248
Query: 184 VLFLGD----GKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----- 233
+F G K ++G + T ++ N +Y P G S G K L +
Sbjct: 249 -IFFGAMADLSKYRTTGPIQTTSLVTNPDGSNYYYYVP----LIGISVGTKRLNVPVSSF 303
Query: 234 ----------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK-TLPICWRGPF 282
DSG + S V E + M + + P+ A D +C++ P
Sbjct: 304 AIGRDGSGGTFVDSGDTTGMLPS-VVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPR 362
Query: 283 KALGQVTEYFK--PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENN 340
G V + PL F +++ ++Y+V +CL I +G+
Sbjct: 363 NGGGAVETAVQVPPLVYHFD---GGAAMLLRRDSYMVEVSAGRMCLVISSGARGA----- 414
Query: 341 IIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
IIG Q+ V++D E + P CN +
Sbjct: 415 IIGNYQQQNMHVLFDVENHEFSFAPTQCNQI 445
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 87/376 (23%), Positives = 148/376 (39%), Gaps = 51/376 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ +N+++G PP DTGSDL W QC PC C K E + P K+ + C+N
Sbjct: 94 YLMNISLGTPPVSMLGIADTGSDLIWRQC-LPCDDCYKQVEPLFDPKKSKTYKTLGCNND 152
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C L C N C YGD + L ++ F + + G + P L FGC
Sbjct: 153 FCQDLGQQG--SCGDDN-TCTSSYSYGDQSYTRRDLSSETFTIGSTEGDPASFPGLAFGC 209
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRGVL 185
G++ N G + D+ + G ++ + G +C+ +
Sbjct: 210 GHS--NGGTFNEKDSGLIGLGGGPLSLVMQLSSKVG---GQFSYCLVPLSSDSTASSKIN 264
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKS------CGLKDLTL 233
F V SG TP+++ + D +Y+ LG ++ + G S ++ +
Sbjct: 265 FGKSAVVSGSGTVSTPLIKGTPDTFYYLTLEGMSLGSEKVAFKGFSKNKSSPAAAEESNI 324
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK-ALGQVTEYF 292
I DSG + Y ++ S + + +IG P T +C+ G K + +T +F
Sbjct: 325 IIDSGTTLTLLPRDFYTDMESALTK-VIGGQTTTDPRG-TFSLCYSGVKKLEIPTITAHF 382
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMV 352
+ +PP V + VC ++ S I G + + +V
Sbjct: 383 I-----------GADVQLPPLNTFVQAQEDLVCFSMIPSSNLA-----IFGNLSQMNFLV 426
Query: 353 IYDNEKQRIGWKPEDC 368
YD + ++ +KP DC
Sbjct: 427 GYDLKNNKVSFKPTDC 442
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 158/374 (42%), Gaps = 52/374 (13%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSN 71
S + +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 80 SLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGT 137
Query: 72 PRCAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LT 128
C L + P C+ + C + + Y DG +S G L D L FS+ V +P T
Sbjct: 138 SMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPSFT 191
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL- 185
FGC + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 192 FGCNLDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSERGFFS 246
Query: 186 ----FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIF 235
+ GKV + + V +T M+ + + + + A + G+ GL ++F
Sbjct: 247 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 306
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP- 294
DSG+ +Y R ++S +R+L+ + A ++++ C+ + V E P
Sbjct: 307 DSGSELSYIPDRAL-SVLSQRIRELL--LRRGAAEEESERNCY-----DMRSVDEGDMPA 358
Query: 295 LALSFTNRRNSVRLVVPPEAYLV---ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKM 351
++L F + R + V + + CL A +IIG + K
Sbjct: 359 ISLHFD---DGARFDLGSHGVFVERSVQEQDVWCLAF-----APTESVSIIGSLMQTSKE 410
Query: 352 VIYDNEKQRIGWKP 365
V+YD ++Q IG P
Sbjct: 411 VVYDLKRQLIGIGP 424
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 97/375 (25%), Positives = 155/375 (41%), Gaps = 51/375 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +N++VG P F DTGSDL W QC APCT C + P ++P + +PC++
Sbjct: 86 YNMNISVGTPLLTFPVVADTGSDLIWTQC-APCTKCFQQPAPPFQPASSSTFSKLPCTSS 144
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C L PN R + C Y +YG G ++ G L T+ L+ + S +V FGC
Sbjct: 145 FCQFL--PNSIRTCNATG-CVYNYKYGSGYTA-GYLATE--TLKVGDASFPSV--AFGCS 196
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG----VLFLG 188
++ G T+G+ GLGRG +S++ QL G+ R +C+ +LF
Sbjct: 197 -TENGVG----NSTSGIAGLGRGALSLIPQL---GVGR--FSYCLRSGSAAGASPILFGS 246
Query: 189 DGKVPSSGVAWTPMLQNSA--------DLKHYILGPAELLYSGKSCGLKDLTL----IFD 236
+ V TP + N A +L +G +L + + G L I D
Sbjct: 247 LANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVD 306
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP-L 295
SG + Y Y+ + + + + L +C FK+ G P L
Sbjct: 307 SGTTLTYLAKDGYEMVKQAFLSQTAN--VTTVNGTRGLDLC----FKSTGGGGGIAVPSL 360
Query: 296 ALSFTNRRNSVRLVVPPE-AYLVISGRKNVCLGILNGSEAEVGE-NNIIGEIFMQDKMVI 353
L F VP A + + +V + L A+ + ++IG + D ++
Sbjct: 361 VLRF---DGGAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLL 417
Query: 354 YDNEKQRIGWKPEDC 368
YD + + P DC
Sbjct: 418 YDLDGGIFSFSPADC 432
>gi|330794218|ref|XP_003285177.1| hypothetical protein DICPUDRAFT_96947 [Dictyostelium purpureum]
gi|325084898|gb|EGC38316.1| hypothetical protein DICPUDRAFT_96947 [Dictyostelium purpureum]
Length = 817
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 104/396 (26%), Positives = 160/396 (40%), Gaps = 67/396 (16%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWV----------QCDAPCTGCTKPPEKQYKPH 63
F YF + + VG PP++F DTGS V Q C+
Sbjct: 203 FEYF-IPILVGTPPQMFTVQVDTGSTSLAVPGSNCYLYKSQSIKTSCSCSDGNLDGLYSL 261
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ + + C+ N + N C + ++YGDG G+LV D + F
Sbjct: 262 EESISSNQLNCSDTSNCNTCKNNKSNKPCPFVLKYGDGSFIAGSLVIDHVTI-----GDF 316
Query: 124 NVPLTFGCGYNQH-NPGPLSPPDTA-------GVLGLGRGRI------SIVSQLREYGLI 169
VP FG + + L+ P T G+LGL ++ I S++ + I
Sbjct: 317 TVPAKFGNIQKESLSFSQLTCPSTQRSQAVRDGILGLSFQQLDPDNGDDIFSKIVAHYNI 376
Query: 170 RNVIGHCIGQNGRGVLFLG--DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCG 227
NV C+G++G G+L +G + + +TP+ D +Y + + S
Sbjct: 377 PNVFSMCLGKDG-GLLTIGGTNDHITQETPKYTPIF----DSHYYSITVTNIYVGNDSLN 431
Query: 228 LK--DL-TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP-IC----WR 279
L DL T I DSG + YF+ ++ IV L LP IC W
Sbjct: 432 LAPPDLSTSIVDSGTTLLYFSDEIFYSIVR-----------NLEEKHCELPGICNDPFWE 480
Query: 280 GPFKALGQ--VTEY-FKPLALSFTNRRNSVRLVVPPEAY-LVISGRKNVCLGILNGSEAE 335
G L + ++EY L + N S +L VPP+ Y L I+G C GI + E
Sbjct: 481 GNCHHLEEKLISEYPTIYLEMKGMNGEPSFKLEVPPDLYFLNINGL--YCFGISHMKEIS 538
Query: 336 VGENNIIGEIFMQDKMVIYDNEKQRIGW-KPEDCNT 370
V +IG++ +Q VIY+ E IG+ + C+T
Sbjct: 539 V----LIGDVVLQGYNVIYNRENSSIGFARTHGCST 570
>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
Length = 648
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 99/419 (23%), Positives = 171/419 (40%), Gaps = 83/419 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTK----PPEKQYKPHKN----I 66
+A +++G PP+ DTGS L+WV C + C C+ P + P + +
Sbjct: 89 YAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAASPLHVFHPKNSSSSRL 148
Query: 67 VPCSNPRCAALHWPN----------------PPRCKHPNDQC-DYEIEYGDGGSSIGALV 109
+ C NP C +H P+ PR + N+ C Y + YG GS+ G L+
Sbjct: 149 IGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGS-GSTAGLLI 207
Query: 110 TDLFPLRFSNGSVFNVPLTFGCGYNQ-HNPGPLSPPDTAGVLGLGRGRISIVSQLR---- 164
+D LR +V N GC H P +G+ G GRG S+ SQL
Sbjct: 208 SDT--LRTPGRAVRN--FVIGCSLASVHQP-------PSGLAGFGRGAPSVPSQLGLTKF 256
Query: 165 EYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLK----HYILGPAELL 220
Y L+ +G +L GK G+ + P+ ++++ +Y L +
Sbjct: 257 SYCLLSRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAIT 316
Query: 221 YSGKSCGLKDLTL---------IFDSGASYAYFTSRVYQEIVSLIMRDLIG--TPLKLAP 269
GKS L + I DSG +++YF V++ + + ++ + G + K+
Sbjct: 317 VGGKSVQLPERAFVAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVE 376
Query: 270 DDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISG--------- 320
+ L C+ P G T ++L F + + +P E Y V++G
Sbjct: 377 EGLGLSPCFAMP---PGTKTMELPEMSLHF---KGGSVMNLPVENYFVVAGPAPSGGAPA 430
Query: 321 -RKNVCLGILNGSEAEVGENN--------IIGEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
+ +CL +++ G I+G Q+ + YD EK+R+G++ + C +
Sbjct: 431 MAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQCAS 489
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 89/362 (24%), Positives = 152/362 (41%), Gaps = 46/362 (12%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALH 78
VG P + F DTGSD+ W+QC PCT C + + + P + V C + +C++L
Sbjct: 26 VGNPARQFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPTASSTYAPVTCQSQQCSSLE 84
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVPLTFGCGYNQHN 137
+ C+ + QC Y++ YGDG + G T+ + F N GSV NV L GCG++ N
Sbjct: 85 MSS---CR--SGQCLYQVNYGDGSYTFGDFATE--SVSFGNSGSVKNVAL--GCGHD--N 133
Query: 138 PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGV 197
G AG+LGLG G +S+ +QL+ ++ G L ++ V
Sbjct: 134 EGLF--VGAAGLLGLGGGPLSLTNQLKATSFSYCLVNR--DSAGSSTLDFNSAQLGVDSV 189
Query: 198 AWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDSGASYAYFTSR 247
P+++N Y +G + + G+ + + T +I D G + ++
Sbjct: 190 T-APLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQ 248
Query: 248 VYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVR 307
Y + +R + LKL C+ GQ + ++ F + ++
Sbjct: 249 AYNPLRDAFVR--MTQNLKLTSAVALFDTCY----DLSGQASVRVPTVSFHFADGKS--- 299
Query: 308 LVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPE 366
+P YL+ + C + + +IIG + Q V +D R+G+ P
Sbjct: 300 WNLPAANYLIPVDSAGTYCFAFAPTTSSL----SIIGNVQQQGTRVTFDLANNRMGFSPN 355
Query: 367 DC 368
C
Sbjct: 356 KC 357
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 91/370 (24%), Positives = 144/370 (38%), Gaps = 52/370 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP-CTGCTKPPEKQYKPHKN----IVPCSN 71
+ + +G PP DTGS++ W+QC +P CT C K + P K+ I C +
Sbjct: 108 YVMKFNIGSPPVETYAIPDTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLCGH 167
Query: 72 PRCAALHWPNPPR--CKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSVFNVPL 127
C W CK C Y I Y D S G + TD+ FP + +++ +
Sbjct: 168 RECKQALWGLGEYLGCKSSVQVCRYHISYEDHSFSEGTISTDIITFPEHIAEFGNYSLRM 227
Query: 128 TFGCGYNQ-----HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
FGCGYN +P + P GV+GLG S+V QL G I Q
Sbjct: 228 FFGCGYNNSETPGQDPNSFTAP---GVVGLGNEMASLVGQL-TLGQFSYCISTPDVQKPN 283
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHY------------ILGPAELLYSGKSCGLKD 230
G + + G S T + N + + G E ++ G+
Sbjct: 284 GTIEIRFGLAASISGHSTALANNLEGWYIFQNVDGIYVDDTKVKGYPEWVFQFAEGGIGG 343
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-----DKTLPICWRGPFKAL 285
LI DSG +Y + +Y + ++ +L ++LAPD + +C + A
Sbjct: 344 --LIMDSGTTY----TELYFSALDALIGEL-KEQIELAPDTQDHSNSNYSLC----YNAA 392
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEI 345
+ Y + L FT+ + + A+ + +G CL + S +IIG
Sbjct: 393 NFLLTYVPAIELKFTDNKEAYFPFTLRNAW-IDNGNDQYCLAMFGTSGI-----SIIGIY 446
Query: 346 FMQDKMVIYD 355
+D + YD
Sbjct: 447 QHRDIKIGYD 456
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 93/400 (23%), Positives = 161/400 (40%), Gaps = 70/400 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKPP---------EKQYKPHKN 65
++ L+ G P + FDTGS L W C + C+ C+ P +
Sbjct: 81 YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSK 140
Query: 66 IVPCSNPRCAALHWPN-PPRCKHPNDQCD--------YEIEYGDGGSSIGALVTDLFPLR 116
+V C NP+C+ + P+ +C+ N + + Y ++YG GS+ G L+++ L
Sbjct: 141 LVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGS-GSTAGLLLSET--LD 197
Query: 117 FSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
F + + N GC + LS +G+ G GRG S+ SQ+ GL + +C
Sbjct: 198 FPDKXIPN--FVVGCSF-------LSIHQPSGIAGFGRGSESLPSQM---GLKK--FAYC 243
Query: 177 IGQNG------RGVLFLGDGKVPSSGVAWTPMLQ-----NSADLKHYILGPAELLYSGKS 225
+ G L L V SSG+ +TP Q N+A ++Y L +++ ++
Sbjct: 244 LASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQA 303
Query: 226 CGLKDLTL----------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP 275
+ L I DSG+++ + V + + + L A D +TL
Sbjct: 304 VKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLAN--WTRATDVETL- 360
Query: 276 ICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEA 334
R F + + F L F + + +P Y + V CL ++
Sbjct: 361 TGLRPCFDISKEKSVKFPELIFQF---KGGAKWALPLNNYFALVSSSGVACLTVVTHQME 417
Query: 335 EVGENN-----IIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+ G I+G Q+ V YD QR+G++ + C+
Sbjct: 418 DGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|145351657|ref|XP_001420185.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580418|gb|ABO98478.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 498
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 145/378 (38%), Gaps = 62/378 (16%)
Query: 30 FDFDFDTGSDLTWVQCDAPCTGC-------TKPPEKQYKPHKNI--VPCSNPRCAALH-- 78
FD + DTGS LT+ PC GC + P Y K + C+ A +
Sbjct: 79 FDLEVDTGSPLTYF----PCKGCPLEVCGIHEHPYYDYDMSKTFRKLNCTTSTEDAAYCN 134
Query: 79 -WPNPPRCKHP---NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYN 134
PN C + C + I Y DG G + D F L + +TFGCG
Sbjct: 135 AQPNVLLCDTNISYTNTCLFGIGYVDGSVGRGYMAEDTFTL---GDELAPAKITFGCGGM 191
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIG--QNGRGVLFLGD-- 189
+ G D G+ G RG + +QL + G+I +V G C + +L LG
Sbjct: 192 YYPDGSNLRQD--GMAGFSRGNTAFHTQLAKAGVIDAHVFGFCSEGMETSTAMLTLGRYN 249
Query: 190 --GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSR 247
+VP +AWT ML G +L S L D T I S Y S
Sbjct: 250 FGRRVPE--LAWTRML-----------GEDDLAVRTMSWKLGDKT-IASSSNVYTVLDSG 295
Query: 248 VYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPF--------KALGQ--VTEYFKPLAL 297
++ M T L L + RG +L Q +T +F L +
Sbjct: 296 TTLTVLPSAMHHDFMTHLNETARSAGLSVVVRGTHCFYENQRQSSLTQYTLTRWFPSLTI 355
Query: 298 SFTNRRNSVRLVVPPEAYLVIS--GRKNVCLGILNGSEAEV--GENNIIGEIFMQDKMVI 353
++ V LV+ PE YL C GI++ S+A + GE I+G+ +++ V
Sbjct: 356 TY---DPDVTLVLRPENYLFADTVNLHAFCAGIMSASDAALANGEQIILGQQTLRNTFVE 412
Query: 354 YDNEKQRIGWKPEDCNTL 371
YD E R+G C L
Sbjct: 413 YDLENSRVGMATVQCEKL 430
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 93/376 (24%), Positives = 151/376 (40%), Gaps = 67/376 (17%)
Query: 17 FAVNLTVGKPP-KLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
+ + +VG PP KL+ DTGSD+ W+QC+ PC C ++KP K+ +PCS+
Sbjct: 87 YLMTYSVGTPPFKLYGIA-DTGSDIVWLQCE-PCKECYNQTTPKFKPSKSSTYKNIPCSS 144
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FG 130
C + G L D L S G + P T G
Sbjct: 145 DLCKSGQQ--------------------------GNLSVDTLTLESSTGHPISFPKTVIG 178
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC-----IGQNGRGVL 185
CG + + ++G++GLG G S+++QL I +C + N L
Sbjct: 179 CGTDNTVSFEGA---SSGIVGLGGGPASLITQLGSS--IDAKFSYCLLPNPVESNTTSKL 233
Query: 186 FLGDGKVPS-SGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIFDSG 238
GD V S GV TP+++ + +Y+ +G + + G S G + +I DSG
Sbjct: 234 NFGDTAVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGNKRIEFEGSSNGGHEGNIIIDSG 293
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE--YFKPLA 296
+ + VY + S ++ + LK D L F VT Y P+
Sbjct: 294 TTLTVIPTDVYNNLESAVLELV---KLKRVNDPTRL-------FNLCYSVTSDGYDFPI- 342
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGE-NNIIGEIFMQDKMVIYD 355
T + + P + V VCL S + +I G + Q+ +V YD
Sbjct: 343 --ITTHFKGADVKLHPISTFVDVADGIVCLAFATTSAFIPSDVVSIFGNLAQQNLLVGYD 400
Query: 356 NEKQRIGWKPEDCNTL 371
+++ + +KP DC+ +
Sbjct: 401 LQQKIVSFKPTDCSKV 416
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 136/379 (35%), Gaps = 60/379 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK---NIVPCSNPR 73
+ V +G PP+L DT +D W+ C C+GC+ + V CS +
Sbjct: 105 YVVRARLGTPPQLMFMVLDTSNDAVWLPCSG-CSGCSNASTSFNTNSSSTYSTVSCSTTQ 163
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C P C + YG S LV D L S + N +FGC
Sbjct: 164 CTQARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQDT--LTLSPDVIPN--FSFGC-I 218
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
N + L P G++GLGRG +S+VSQ L V +C+ + R F G K+
Sbjct: 219 NSASGNSLPP---QGLMGLGRGPMSLVSQTTS--LYSGVFSYCL-PSFRSFYFSGSLKLG 272
Query: 194 SSG----VAWTPMLQNSADLKHYILG--------------PAELLYSGKSCGLKDLTLIF 235
G + +TP+L+N Y + P L + S I
Sbjct: 273 LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNS----GAGTII 328
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP- 294
DSG F VY+ I + + G F LG F
Sbjct: 329 DSGTVITRFAQPVYEAIRDEFRKQV------------------NGSFSTLGAFDTCFSAD 370
Query: 295 ---LALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDK 350
+ T S+ L +P E L+ S + CL + + N+I + Q+
Sbjct: 371 NENVTPKITLHMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNL 430
Query: 351 MVIYDNEKQRIGWKPEDCN 369
+++D RIG PE CN
Sbjct: 431 RILFDVPNSRIGIAPEPCN 449
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 89/363 (24%), Positives = 152/363 (41%), Gaps = 46/363 (12%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALH 78
VG P + F DTGSD+ W+QC PCT C + + + P + V C + +C++L
Sbjct: 167 VGNPARQFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPTASSTYAPVTCQSQQCSSLE 225
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVPLTFGCGYNQHN 137
+ C+ + QC Y++ YGDG + G T+ + F N GSV NV L GCG++ N
Sbjct: 226 MSS---CR--SGQCLYQVNYGDGSYTFGDFATE--SVSFGNSGSVKNVAL--GCGHD--N 274
Query: 138 PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGV 197
G AG+LGLG G +S+ +QL+ ++ G L ++ V
Sbjct: 275 EGLFVG--AAGLLGLGGGPLSLTNQLKATSFSYCLVNR--DSAGSSTLDFNSAQLGVDSV 330
Query: 198 AWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDSGASYAYFTSR 247
P+++N Y +G + + G+ + + T +I D G + ++
Sbjct: 331 T-APLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQ 389
Query: 248 VYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVR 307
Y + +R + LKL C+ GQ + ++ F + ++
Sbjct: 390 AYNPLRDAFVR--MTQNLKLTSAVALFDTCY----DLSGQASVRVPTVSFHFADGKS--- 440
Query: 308 LVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPE 366
+P YL+ + C + + +IIG + Q V +D R+G+ P
Sbjct: 441 WNLPAANYLIPVDSAGTYCFAFAPTTSSL----SIIGNVQQQGTRVTFDLANNRMGFSPN 496
Query: 367 DCN 369
C
Sbjct: 497 KCQ 499
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 93/400 (23%), Positives = 161/400 (40%), Gaps = 70/400 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKPP---------EKQYKPHKN 65
++ L+ G P + FDTGS L W C + C+ C+ P +
Sbjct: 81 YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSK 140
Query: 66 IVPCSNPRCAALHWPN-PPRCKHPNDQCD--------YEIEYGDGGSSIGALVTDLFPLR 116
+V C NP+C+ + P+ +C+ N + + Y ++YG GS+ G L+++ L
Sbjct: 141 LVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGS-GSTAGLLLSET--LD 197
Query: 117 FSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
F + + N GC + LS +G+ G GRG S+ SQ+ GL + +C
Sbjct: 198 FPDKKIPN--FVVGCSF-------LSIHQPSGIAGFGRGSESLPSQM---GLKK--FAYC 243
Query: 177 IGQNG------RGVLFLGDGKVPSSGVAWTPMLQ-----NSADLKHYILGPAELLYSGKS 225
+ G L L V SSG+ +TP Q N+A ++Y L +++ ++
Sbjct: 244 LASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQA 303
Query: 226 CGLKDLTL----------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP 275
+ L I DSG+++ + V + + + L A D +TL
Sbjct: 304 VKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLAN--WTRATDVETL- 360
Query: 276 ICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEA 334
R F + + F L F + + +P Y + V CL ++
Sbjct: 361 TGLRPCFDISKEKSVKFPELIFQF---KGGAKWALPLNNYFALVSSSGVACLTVVTHQME 417
Query: 335 EVGENN-----IIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+ G I+G Q+ V YD QR+G++ + C+
Sbjct: 418 DGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 147/372 (39%), Gaps = 46/372 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSNP 72
+ N T+G PP+ D +L W QC C C + + P + PC P
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCKQ-CGRCFEQGTPLFDPTASNTYRAEPCGTP 109
Query: 73 RCAALHWPNPPRCKHPNDQCDYE--IEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C ++ P+ R + C YE GD G +G TD F + + S L FG
Sbjct: 110 LCESI--PSDVR-NCSGNVCAYEASTNAGDTGGKVG---TDTFAVGTAKAS-----LAFG 158
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
C P +G++GLGR S+V+Q + H G+N LFLG
Sbjct: 159 CVVASDIDTMGGP---SGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGKN--SALFLGSS 213
Query: 191 KVPSSG--VAWTPMLQ---NSADLKHYILGPAELLYSGKSC---GLKDLTLIFDSGASYA 242
+ G A TP + N DL +Y E L +G + T++ D+ + +
Sbjct: 214 AKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVLLDTFSPIS 273
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLK--LAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
+ YQ + + + P+ + P D P A G + L +F
Sbjct: 274 FLVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFP-----KSGASGAAPD----LVFTF- 323
Query: 301 NRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEA-EVGENNIIGEIFMQDKMVIYDNEKQ 359
R + VP YL+ VCL +L+ + E +++G + ++ ++D +K+
Sbjct: 324 --RGGAAMTVPATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKE 381
Query: 360 RIGWKPEDCNTL 371
+ ++P DC L
Sbjct: 382 TLSFEPADCTKL 393
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 101/386 (26%), Positives = 141/386 (36%), Gaps = 76/386 (19%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF L VG P + DTGSD+ W+QC APC C + + P K+ +PCS+
Sbjct: 142 YF-TRLGVGTPARYVYMVLDTGSDIVWLQC-APCRRCYSQSDPIFDPRKSKTYATIPCSS 199
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L + C C Y++ YGDG ++G T+ L F V V L GC
Sbjct: 200 PHCRRL---DSAGCNTRRKTCLYQVSYGDGSFTVGDFSTET--LTFRRNRVKGVAL--GC 252
Query: 132 GYNQHN------------PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
G++ G LS P G +Q Y L +
Sbjct: 253 GHDNEGLFVGAAGLLGLGKGKLSFPGQTG---------HRFNQKFSYCL----VDRSASS 299
Query: 180 NGRGVLFLGDGKVPSSGVA-WTPMLQNSADLKHYILGPAELLYSG-----------KSCG 227
V+F G S +A +TP+L N Y +G + G K
Sbjct: 300 KPSSVVF---GNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQ 356
Query: 228 LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLI---GTPLKLAPDDKTLPICWRGPFKA 284
+ + +I DSG S Y + MRD LK APD C+
Sbjct: 357 IGNGGVIIDSGTSVTRLIRPAY-----IAMRDAFRVGAKTLKRAPDFSLFDTCF-----D 406
Query: 285 LGQVTEYFKP-LALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNII 342
L + E P + L F + +P YL+ + C +G +II
Sbjct: 407 LSNMNEVKVPTVVLHF----RGADVSLPATNYLIPVDTNGKFCFAF----AGTMGGLSII 458
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDC 368
G I Q V+YD R+G+ P C
Sbjct: 459 GNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|297820902|ref|XP_002878334.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
gi|297324172|gb|EFH54593.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
Length = 362
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 87/212 (41%), Gaps = 32/212 (15%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----- 67
I Y+ L +G PP++F D+GS +T+V C + C C K P I+
Sbjct: 88 INGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQVMLSSPKDQILCLVSC 146
Query: 68 ---------------PCSNPRCAALHWP----NPPRCKHPNDQCDYEIEYGDGGSSIGAL 108
P P ++ + P C +QC YE EY + SS G L
Sbjct: 147 KVQIFKISYGLFDEDPKFQPELSSTYQPVKCNMDCNCDDDKEQCVYEREYAEHSSSKGVL 206
Query: 109 VTDLFPLRFSNGSVFN-VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYG 167
DL + F N S FGC G L G++GLG+G +S+V QL + G
Sbjct: 207 GEDL--ISFGNESHLTPQRAVFGC--KTVETGDLYSQRADGIIGLGQGDLSLVGQLVDKG 262
Query: 168 LIRNVIGHCIG--QNGRGVLFLGDGKVPSSGV 197
LI N G C G G G + +G PS +
Sbjct: 263 LISNSFGLCYGGLDVGGGSMIVGGFDYPSDMI 294
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 103/380 (27%), Positives = 153/380 (40%), Gaps = 53/380 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF + VG P DTGSD+ W+QC APC C + P ++ V C+
Sbjct: 140 YF-TKIGVGTPSTPALMVLDTGSDVVWLQC-APCRRCYDQSGPVFDPRRSSSYGAVDCAA 197
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFG 130
P C L C C Y++ YGDG + G T+ L F+ G+ V V L G
Sbjct: 198 PLCRRLDSGG---CDLRRRACLYQVAYGDGSVTAGDFATET--LTFAGGARVARVAL--G 250
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYG-------LIRNVIGHCIGQNGR 182
CG++ N G AG+LGLGRG +S +Q+ R YG + R +
Sbjct: 251 CGHD--NEGLFV--AAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAASRS 306
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK---SCGLKDLTL------ 233
+ G +S ++TPM++N Y + + G DL L
Sbjct: 307 RSSTVTFGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGR 366
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL-PICWRGPFKALGQVT 289
I DSG S Y + G L+L+P +L C+ G+
Sbjct: 367 GGVIVDSGTSVTRLARPSYSALRDAFRAAAAG--LRLSPGGFSLFDTCY----DLGGRKV 420
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
+++ F +PPE YL+ + R C G++ V +IIG I Q
Sbjct: 421 VKVPTVSMHFAG---GAEAALPPENYLIPVDSRGTFCFA-FAGTDGGV---SIIGNIQQQ 473
Query: 349 DKMVIYDNEKQRIGWKPEDC 368
V++D + QR+G+ P+ C
Sbjct: 474 GFRVVFDGDGQRVGFAPKGC 493
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 142/388 (36%), Gaps = 80/388 (20%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF L VG P + DTGSD+ W+QC APC C + + P K+ +PCS+
Sbjct: 142 YF-TRLGVGTPARYVYMVLDTGSDIVWLQC-APCRRCYSQSDPIFDPRKSKTYATIPCSS 199
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L C C Y++ YGDG ++G T+ L F V V L GC
Sbjct: 200 PHCRRLDSAG---CNTRRKTCLYQVSYGDGSFTVGDFSTET--LTFRRNRVKGVAL--GC 252
Query: 132 GYNQHN------------PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
G++ G LS P G +Q Y L +
Sbjct: 253 GHDNEGLFVGAAGLLGLGKGKLSFPGQTG---------HRFNQKFSYCL----VDRSASS 299
Query: 180 NGRGVLFLGDGKVPSSGVA-WTPMLQN-SADLKHYIL------------GPAELLYSGKS 225
V+F G S +A +TP+L N D +Y+ G A L+
Sbjct: 300 KPSSVVF---GNAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQ 356
Query: 226 CGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLI---GTPLKLAPDDKTLPICWRGPF 282
G + +I DSG S Y + MRD LK APD C+
Sbjct: 357 IG--NGGVIIDSGTSVTRLIRPAY-----IAMRDAFRVGAKALKRAPDFSLFDTCF---- 405
Query: 283 KALGQVTEYFKP-LALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENN 340
L + E P + L F + +P YL+ + C +G +
Sbjct: 406 -DLSNMNEVKVPTVVLHF----RGADVSLPATNYLIPVDTNGKFCFAF----AGTMGGLS 456
Query: 341 IIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
IIG I Q V+YD R+G+ P C
Sbjct: 457 IIGNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 104/390 (26%), Positives = 143/390 (36%), Gaps = 85/390 (21%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF + VG PPK DTGSD+ W+QC APC C + + P K+ V C
Sbjct: 42 YF-TRIGVGTPPKYVYMVLDTGSDIVWLQC-APCKNCYSQTDPVFNPVKSGSFAKVLCRT 99
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L P C C Y++ YGDG + G VT+ L F V V L GC
Sbjct: 100 PLCRRLESPG---CNQ-RQTCLYQVSYGDGSYTTGEFVTET--LTFRRTKVEQVAL--GC 151
Query: 132 GYNQHN------------PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
G++ G LS P AG +Q Y L+
Sbjct: 152 GHDNEGLFVGAAGLLGLGRGGLSFPSQAG---------RTFNQKFSYCLVD----RSASS 198
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQN-SADLKHYILGPAELLYSGKSCGLKDLT------ 232
V+F G+ V S +TP+L N D +Y+ ELL G S G ++
Sbjct: 199 KPSSVVF-GNSAV-SRTARFTPLLTNPRLDTFYYV----ELL--GISVGGTPVSGITASH 250
Query: 233 ----------LIFDSGASYAYFTSRVYQEIVSLIMRDLI---GTPLKLAPDDKTLPICWR 279
+I D G S Y + +RD + LK AP+ C+
Sbjct: 251 FKLDRTGNGGVIIDCGTSVTRLNKPAY-----IALRDAFRAGASSLKSAPEFSLFDTCY- 304
Query: 280 GPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGE 338
G+ T + L F + +P YL+ + G C +
Sbjct: 305 ---DLSGKTTVKVPTVVLHF----RGADVSLPASNYLIPVDGSGRFCFAFAGTTSGL--- 354
Query: 339 NNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
+IIG I Q V+YD R+G+ P C
Sbjct: 355 -SIIGNIQQQGFRVVYDLASSRVGFSPRGC 383
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 91/388 (23%), Positives = 156/388 (40%), Gaps = 57/388 (14%)
Query: 18 AVNLTVGKPPKLFDFDFDTGSDLTWVQCDA------PCTGCTKPPEKQYKPHKN----IV 67
++ + +G PP+ DTGSDL W QC ++ E Y+P ++ +
Sbjct: 85 SLTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFAYL 144
Query: 68 PCSNPRC--AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV 125
PCS+ C + N R N++C Y+ YG + G L ++ F F + ++
Sbjct: 145 PCSDRLCQEGQFSYKNCAR----NNRCMYDELYGSAEAG-GVLASETF--TFGVNAKVSL 197
Query: 126 PLTFGCGYNQHNPGPLSPPD---TAGVLGLGRGRISIVSQLR----EYGLI----RNVIG 174
PL FGCG LS D +G++GL G +S+VSQL Y L R
Sbjct: 198 PLGFGCG-------ALSAGDLVGASGLMGLSPGIMSLVSQLSVPRFSYCLTPFAERKTSP 250
Query: 175 HCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNS---ADLKHYILGPAELLYSGKSCGL--- 228
G + G V ++ + P ++ + L LG L S G+
Sbjct: 251 LLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSLGMIKP 310
Query: 229 -KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK--TLPICWRGPFKAL 285
I DSG++ +Y ++ + ++ + + P+ D+ +C+ P
Sbjct: 311 DGSGGTIVDSGSTMSYLEETAFRAVKKAVV-EAVRLPVANGTDEDYDDYELCFALP---T 366
Query: 286 GQVTEYFK--PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIG 343
G E K PL L F + +P + Y +CL + G+ + +IIG
Sbjct: 367 GVAMEAVKTPPLVLHFDG---GAAMTLPRDNYFQEPRAGLMCLAV--GTSPDGFGVSIIG 421
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
+ Q+ V++D Q+ + P C+ +
Sbjct: 422 NVQQQNMHVLFDVRNQKFSFAPTKCDDI 449
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 110/384 (28%), Positives = 166/384 (43%), Gaps = 52/384 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ +++ VG PP+ F DTGSDL W+QC APC C + + P ++N+ C +
Sbjct: 146 YLMDVYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASSSYRNLT-CGD 203
Query: 72 PRCAAL---HWPNPPRCKHP-NDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVP 126
PRC + P P C+ P D C Y YGD +S G L + F + + G+ V
Sbjct: 204 PRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNLTAPGASSRVD 263
Query: 127 -LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRGV 184
+ FGCG+ N G AG+LGLGRG +S SQLR YG + +C+ +G V
Sbjct: 264 GVVFGCGH--RNRGLFH--GAAGLLGLGRGPLSFASQLRAVYG--GHTFSYCLVDHGSDV 317
Query: 185 LF-LGDGKVPSSGVAWTPMLQNS--------ADLKHYILGPAELLYSGKSCGLKDLT--- 232
+ G+ + +A P L+ + AD +Y+ +L G+ + T
Sbjct: 318 ASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTG-VLVGGELLNISSDTWDA 376
Query: 233 -------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
I DSG + +YF YQ I + + G+ PD L C+
Sbjct: 377 SEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGS-YPPVPDFPVLSPCYNVSGVER 435
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGE 344
+V E L+L F + P E Y + + +CL +L + +IIG
Sbjct: 436 PEVPE----LSLLFA---DGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGM---SIIGN 485
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDC 368
Q+ V YD R+G+ P C
Sbjct: 486 FQQQNFHVAYDLHNNRLGFAPRRC 509
>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
Length = 419
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 93/389 (23%), Positives = 162/389 (41%), Gaps = 62/389 (15%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA-PCTGCTKPPEKQYKPHKN----IVPC 69
+++ N T+G PP+ D +L W QC A +GC K + P + C
Sbjct: 60 AHYVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQC 119
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEI--EYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
+P C ++ P R + +C YE +GD + G TD + + G L
Sbjct: 120 GSPLCKSI----PTRNCSGDGECGYEAPSMFGD---TFGIASTDAIAIGNAEGR-----L 167
Query: 128 TFGC--GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG-- 183
FGC + G + P +G +GLGR S+V Q +C+ +G G
Sbjct: 168 AFGCVVASDGSIDGAMDGP--SGFVGLGRTPWSLVGQSN-----VTAFSYCLALHGPGKK 220
Query: 184 -VLFLG-DGKVPSSGVAW--TPML----QNSAD--------LKHYILGPAELLYSGKSCG 227
LFLG K+ +G + TP+L N++D ++ + ++ + S G
Sbjct: 221 SALFLGASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSG 280
Query: 228 LKDLTLI-FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
+T++ ++ +Y YQ + ++ L G+P P + PF
Sbjct: 281 GGAITVLQLETFRPLSYLPDAAYQALEKVVTAAL-GSPSMANPPE---------PFDLCF 330
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN--VCLGILNGSEAEVGEN--NII 342
Q L FT + L P YL+ G N VCL IL+ + + ++ +I+
Sbjct: 331 QNAAVSGVPDLVFT-FQGGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSIL 389
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
G + ++ ++D EK+ + ++P DC++L
Sbjct: 390 GSLLQENVHFLFDLEKETLSFEPADCSSL 418
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 137/348 (39%), Gaps = 42/348 (12%)
Query: 34 FDTGSDLTWVQC-DAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWPNPPRCKHP 88
D+ SD+ WVQC P C + Y P ++ CS+P C AL P C
Sbjct: 33 LDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTAL-GPYANGCA-- 89
Query: 89 NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG-SVFNVPLTFGCGYNQHNPGPLSPPDTA 147
N+QC Y + Y DG S+ GA + DL L N S F FGC + + A
Sbjct: 90 NNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFK----FGCSHAEQGS---FDARAA 142
Query: 148 GVLGLGRGRISIVSQL-REYGLIRNVIGHCI--GQNGRGVLFLGDGKVPSSGVAWTPMLQ 204
G++ LG G S++SQ YG N +CI + G LG + SS TPM++
Sbjct: 143 GIMALGGGPESLLSQTASRYG---NAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVR 199
Query: 205 NSADLKHYILGPAELLYSGKSCGLKDLTL----IFDSGASYAYFTSRVYQEIVSLIMRDL 260
Y + + G+ G+ + DS + YQ + + +
Sbjct: 200 FRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPPTAYQALRAAFRSSM 259
Query: 261 IGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISG 320
T + AP L C+ G V ++L F RN+V L + P L
Sbjct: 260 --TMYRSAPPKGYLDTCY----DFTGVVNIRLPKISLVFD--RNAV-LPLDPSGILF--- 307
Query: 321 RKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
N CL S A+ ++G + Q V+YD +G++ C
Sbjct: 308 --NDCLAFT--SNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351
>gi|3036792|emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
Length = 335
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 69/234 (29%), Positives = 101/234 (43%), Gaps = 26/234 (11%)
Query: 34 FDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----VPCSNPRCAALHWPNP 82
DTGSDL WV CD AP G T E + Y P + V C+N CA +
Sbjct: 4 LDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCAQRN---- 59
Query: 83 PRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNGSVFNVP--LTFGCGYNQHNPG 139
+C C Y + Y +S G L+ D+ L + + V +TFGCG Q
Sbjct: 60 -QCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSF 118
Query: 140 -PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVA 198
++ P+ G+ GLG +IS+ S L GL+ + C G +G G + GD SS
Sbjct: 119 LDIAAPN--GLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKG--SSDQE 174
Query: 199 WTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEI 252
TP N + + I + G + + T +FD+G S+ Y +Y +
Sbjct: 175 ETPFNLNPSHPNYNI--TVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTV 226
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 84/375 (22%), Positives = 149/375 (39%), Gaps = 46/375 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRC 74
+++ VNLT+G PP+ D G +L W QC C C K + + + P
Sbjct: 49 AFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCG 108
Query: 75 AALHWPNPPRCKHPNDQCDYEIEYGDG-GSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
AA+ P R + E G ++G + TD + G+ L FGC
Sbjct: 109 AAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAI----GTAATARLAFGCAV 164
Query: 134 NQHNPGPLSPPDT----AGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG---VLF 186
DT +G +GLGR +S+ +Q+ +C+ G LF
Sbjct: 165 ASEM-------DTMWGSSGSVGLGRTNLSLAAQMNA-----TAFSYCLAPPDTGKSSALF 212
Query: 187 LG-DGKVPSS--GVAWTPMLQ-----NSADLKHYILGPAELLYSGKSCGLKDL--TLIFD 236
LG K+ + G TP ++ NS + Y+L + + + T+
Sbjct: 213 LGASAKLAGAGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAGNATIAMPQSGNTITVS 272
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
+ VY+++ + + P+ P + +C+ + G L
Sbjct: 273 TATPVTALVDSVYRDLRKAVADAVGAAPVP--PPVQNYDLCFPKASASGGA-----PDLV 325
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
L+F + + VP +YL +G C+ IL GS A +G +I+G + + +++D
Sbjct: 326 LAF---QGGAEMTVPVSSYLFDAGNDTACVAIL-GSPA-LGGVSILGSLQQVNIHLLFDL 380
Query: 357 EKQRIGWKPEDCNTL 371
+K+ + ++P DC+ L
Sbjct: 381 DKETLSFEPADCSAL 395
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 91/375 (24%), Positives = 158/375 (42%), Gaps = 54/375 (14%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSN 71
S + +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 80 SLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGT 137
Query: 72 PRCAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LT 128
C L + P C+ + C + + Y DG +S G L D L FS+ V +P +
Sbjct: 138 SMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPGFS 191
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL- 185
FGC + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 192 FGCNMDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKSERGFFS 246
Query: 186 ----FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-----LIF 235
+ GKV + + V +T M+ + + + + + G+ GL ++F
Sbjct: 247 KTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVF 306
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKL-APDDKTLPICWRGPFKALGQVTEYFKP 294
DSG+ +Y R ++S +R+L+ LK A ++++ C+ + V E P
Sbjct: 307 DSGSELSYIPDRAL-SVLSQRIRELL---LKRGAAEEESERNCY-----DMRSVDEGDMP 357
Query: 295 -LALSFTNRRNSVRLVVPPEAYLV---ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDK 350
++L F + R + V + + CL A +IIG + K
Sbjct: 358 AISLHFD---DGARFDLGSHGVFVERSVQEQDVWCLAF-----APTESVSIIGSLMQTSK 409
Query: 351 MVIYDNEKQRIGWKP 365
V+YD ++Q IG P
Sbjct: 410 EVVYDLKRQLIGIGP 424
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 94/348 (27%), Positives = 136/348 (39%), Gaps = 42/348 (12%)
Query: 34 FDTGSDLTWVQC-DAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALHWPNPPRCKHP 88
D+ SD+ WVQC P C + Y P ++ CS+P C AL P C
Sbjct: 163 LDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTAL-GPYANGCA-- 219
Query: 89 NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG-SVFNVPLTFGCGYNQHNPGPLSPPDTA 147
N+QC Y + Y DG S+ GA + DL L N S F FGC + + A
Sbjct: 220 NNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFK----FGCSHAEQGSFDAR---AA 272
Query: 148 GVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQNG--RGVLFLGDGKVPSSGVAWTPMLQ 204
G++ LG G S++SQ YG N +CI G LG + SS TPM++
Sbjct: 273 GIMALGGGPESLLSQTASRYG---NAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVR 329
Query: 205 NSADLKHYILGPAELLYSGKSCGLKDLTL----IFDSGASYAYFTSRVYQEIVSLIMRDL 260
Y + + G+ G+ + DS + YQ + S +
Sbjct: 330 FRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPPTAYQALRSAFRSSM 389
Query: 261 IGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISG 320
T + AP L C+ G V ++L F RN+V L + P L
Sbjct: 390 --TMYRSAPPKGYLDTCY----DFTGVVNIRLPKISLVFD--RNAV-LPLDPSGILF--- 437
Query: 321 RKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDC 368
N CL S A+ ++G + Q V+YD +G++ C
Sbjct: 438 --NDCLAFT--SNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 99/369 (26%), Positives = 145/369 (39%), Gaps = 47/369 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKNI----VPCS 70
+ V L G P DTGSD++WVQC PC P+K + P K+ + C+
Sbjct: 131 YVVTLGFGTPSVPQVLLMDTGSDVSWVQC-TPCNSTKCYPQKDPLFDPSKSSTYAPIACN 189
Query: 71 NPRCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
C L H+ N C QC Y +EY DG S G + L +
Sbjct: 190 TDACRKLGDHYHN--GCTSGGTQCGYSVEYADGSHSRGVYSNETLTLA---PGITVEDFH 244
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRGVLFL 187
FGCG +Q GP D G+LGLG +S+V Q YG +C+ FL
Sbjct: 245 FGCGRDQR--GPSDKYD--GLLGLGGAPVSLVVQTSSVYG---GAFSYCLPALNSEAGFL 297
Query: 188 GDGKVPS---SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----LIFDSGAS 240
G PS S +TPM Y++ + GK + +I DSG
Sbjct: 298 VLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFRGGMIIDSGTV 357
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
Y + + + + L PL + D T C+ F +T +A +F+
Sbjct: 358 DTELPETAYNALEAALRKALKAYPLVPSDDFDT---CYN--FTGYSNIT--VPRVAFTFS 410
Query: 301 NRRNSVRLVVPPEAYLVISGRKNVCLGIL-NGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
++ L V P LV N CL +G + +G IIG + + V+YD +
Sbjct: 411 GGA-TIDLDV-PNGILV-----NDCLAFQESGPDDGLG---IIGNVNQRTLEVLYDAGRG 460
Query: 360 RIGWKPEDC 368
+G++ C
Sbjct: 461 NVGFRAGAC 469
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 96/389 (24%), Positives = 153/389 (39%), Gaps = 68/389 (17%)
Query: 15 SYFAVNLTVGKP-PKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPC 69
S + ++L++G P + DTGSD+ W QC+ PC C P ++ + V C
Sbjct: 90 SEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCE-PCAECFTQPLPRFDTAASNTVRSVAC 148
Query: 70 SNPRCAALHWPNPPRCKHPN--DQCDYEIEYGDGGSSIGALVTDLFPL-RFSNGSVFNVP 126
S+P C A +H C Y YGDG S G + D F G VP
Sbjct: 149 SDPLCNA-------HSEHGCFLHGCTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVP 201
Query: 127 -LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG---QNGR 182
+ FGCG +N G +T G+ G GRG +S+ SQL+ +R +C +
Sbjct: 202 DIGFGCG--MYNAGRFLQTET-GIAGFGRGPLSLPSQLK----VRQ-FSYCFTTRFEAKS 253
Query: 183 GVLFL---GDGKVPSSG-VAWTPMLQN---SADLKHYILGPAELLYSGKSCGLKDL---- 231
+FL GD K ++G + TP +++ D HY+L + G + G L
Sbjct: 254 SPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLS-----FKGVTVGKTRLPVPE 308
Query: 232 -------TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKA 284
DSG F V++++ S + P+ D+ + W
Sbjct: 309 IKADGSGATFIDSGTDITTFPDAVFRQLKSAFIAQ-AALPVNKTADEDDICFSWD----- 362
Query: 285 LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN--VCLGILNGSEAEVGENNII 342
G+ T L +P E Y V R++ VC+ + + + +I
Sbjct: 363 -GKKTAAMPKLVFHL----EGADWDLPRENY-VTEDRESGQVCVAVSTSGQM---DRTLI 413
Query: 343 GEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
G Q+ ++YD ++ P C+ L
Sbjct: 414 GNFQQQNTHIVYDLAAGKLLLVPAQCDKL 442
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 85/375 (22%), Positives = 150/375 (40%), Gaps = 46/375 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRC 74
+++ VNLT+G PP+ D G +L W QC C C K + + + P
Sbjct: 49 AFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCG 108
Query: 75 AALHWPNPPRCKHPNDQCDYEIEYGDG-GSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
AA+ P R + E G ++G + TD + G+ L FGC
Sbjct: 109 AAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAI----GTAATARLAFGCAV 164
Query: 134 NQHNPGPLSPPDT----AGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG---VLF 186
DT +G +GLGR +S+ +Q+ +C+ G LF
Sbjct: 165 ASEM-------DTMWGSSGSVGLGRTNLSLAAQMNA-----TAFSYCLAPPDTGKSSALF 212
Query: 187 LG-DGKVPSS--GVAWTPMLQNS----ADLKHYILGPAELLYSGKSCGL---KDLTLIFD 236
LG K+ + G TP ++ S + L L E + +G + T++
Sbjct: 213 LGASAKLAGAGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAGNATIAMPQSGNTIMVS 272
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
+ VY+++ + + P+ P + +C+ + G L
Sbjct: 273 TATPVTALVDSVYRDLRKAVADAVGAAPVP--PPVQNYDLCFPKASASGGA-----PDLV 325
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
L+F + + VP +YL +G C+ IL GS A +G +I+G + + +++D
Sbjct: 326 LAF---QGGAEMTVPVSSYLFDAGNDTACVAIL-GSPA-LGGVSILGSLQQVNIHLLFDL 380
Query: 357 EKQRIGWKPEDCNTL 371
+K+ + ++P DC+ L
Sbjct: 381 DKETLSFEPADCSAL 395
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 144/380 (37%), Gaps = 64/380 (16%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF + VG P + DTGSD+ W+QC APC C + + P K+ +PC
Sbjct: 129 YF-TRIGVGTPARYVYMVLDTGSDVVWLQC-APCRKCYTQADPVFDPTKSRTYAGIPCGA 186
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L + P C + N C Y++ YGDG + G T+ L F V V L GC
Sbjct: 187 PLCRRL---DSPGCNNKNKVCQYQVSYGDGSFTFGDFSTE--TLTFRRTRVTRVAL--GC 239
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV----LFL 187
G++ N G LG GR + + R +C+ +
Sbjct: 240 GHD--NEGLFIGAAGLLGLGRGRLSFPVQTGRR----FNQKFSYCLVDRSASAKPSSVVF 293
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELL--------YSGKSCGLKDLT------L 233
GD V S +TP+++N Y L ELL G S L L +
Sbjct: 294 GDSAV-SRTARFTPLIKNPKLDTFYYL---ELLGISVGGSPVRGLSASLFRLDAAGNGGV 349
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLI---GTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG S T Y + +RD + LK A + C+ L +TE
Sbjct: 350 IIDSGTSVTRLTRPAY-----IALRDAFRVGASHLKRAAEFSLFDTCFD-----LSGLTE 399
Query: 291 YFKP-LALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQ 348
P + L F + +P YL+ + + C + +IIG I Q
Sbjct: 400 VKVPTVVLHF----RGADVSLPATNYLIPVDNSGSFCFAF----AGTMSGLSIIGNIQQQ 451
Query: 349 DKMVIYDNEKQRIGWKPEDC 368
V +D R+G+ P C
Sbjct: 452 GFRVSFDLAGSRVGFAPRGC 471
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 85/374 (22%), Positives = 150/374 (40%), Gaps = 40/374 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSN 71
Y+ N T+G PP+ D +L W QC A C C K + P+ + PC
Sbjct: 61 YYVANFTIGTPPQPASAIVDVAGELVWTQCSA-CRRCFKQDLPVFVPNASSTFKPEPCGT 119
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGD-GGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C ++ P D C Y+ G++ G TD F + V L FG
Sbjct: 120 AVCESI-----PTRSCSGDVCSYKGPPTQLRGNTSGFAATDTFAI-----GTATVRLAFG 169
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
C P +G +GLGR S+V+Q++ + G++ R LFLG
Sbjct: 170 CVVASDIDTMDGP---SGFIGLGRTPWSLVAQMKLTRFSYCLSPRNTGKSSR--LFLGSS 224
Query: 191 KVPSSG--VAWTPMLQNSA--DLKHYILGPAELLYSGKSCGLKDLT---LIFDSGASYAY 243
+ G + P ++ S D HY L + + +G + + L+ + + ++
Sbjct: 225 AKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATAQSGGILVMHTVSPFSL 284
Query: 244 FTSRVYQEIVSLIMRDLIG-TPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNR 302
Y+ + + G +A + +C++ KA G L +F
Sbjct: 285 LVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFK---KAAGFSRATAPDLVFTF--- 338
Query: 303 RNSVRLVVPPEAYLVISG--RKNVCLGILNGS---EAEVGENNIIGEIFMQDKMVIYDNE 357
+ + L VPP YL+ G + C IL+ + + +++G + +D +YD +
Sbjct: 339 QGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLK 398
Query: 358 KQRIGWKPEDCNTL 371
K+ + ++P DC++L
Sbjct: 399 KETLSFEPADCSSL 412
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 144/378 (38%), Gaps = 55/378 (14%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI--------VPCSNPRC 74
+G PP+ DTGS+L W QC C K KQ P+ N+ VPC++
Sbjct: 90 IGDPPQRAAALIDTGSNLIWTQCGTTCG--LKACAKQDLPYYNLSRSSTFAAVPCADS-- 145
Query: 75 AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC-GY 133
A L N + C + YG GS G+L T+ F F +G+ L FGC
Sbjct: 146 AKLCAANGVHLCGLDGSCTFAASYG-AGSVFGSLGTEAFT--FQSGA---AKLGFGCVSL 199
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+ G L+ +G++GLGRGR+S+VSQ + + LF+G
Sbjct: 200 TRITKGALN--GASGLIGLGRGRLSLVSQTGATKFSYCLTPYLRNHGASSHLFVGASASL 257
Query: 194 SSG---VAWTPMLQNSAD----------LKHYILGPAELLYSGKSCGLKDLT-------L 233
S G V P +++ D L +G +L + L+ + +
Sbjct: 258 SGGGGAVTSIPFVKSPEDYPYSTFYYLPLVGISVGETKLPIPSAAFELRRVAAGYWSGGV 317
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
I D+G+ Y + + R L L P D L +C A V +
Sbjct: 318 IIDTGSPVTSLAEAAYSALSDEVARQL-NRSLVQPPADTGLDLCV-----ARQDVDKVVP 371
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVI 353
L F + + V +Y + C+ I G G +IG QD ++
Sbjct: 372 VLVFHFGGGAD---MAVSAGSYWGPVDKSTACMLIEEG-----GYETVIGNFQQQDVHLL 423
Query: 354 YDNEKQRIGWKPEDCNTL 371
YD K + ++ DC+ L
Sbjct: 424 YDIGKGELSFQTADCSVL 441
>gi|168025647|ref|XP_001765345.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683398|gb|EDQ69808.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 879
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 88/392 (22%), Positives = 157/392 (40%), Gaps = 58/392 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP----PEKQYKP--HKNIVPC- 69
F V + +G PPK F F DTGS TWV C P P +++P + + C
Sbjct: 227 FHVEMKLGVPPKKFHFHMDTGSRDTWVYCQVSRNLDEPPIELGPNGKFEPRDESSYIQCI 286
Query: 70 --SNPRCAALHWPNPPRCKHPND-QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
+ C+ + P C + C ++ Y D + G LV + + + S +
Sbjct: 287 GHTASLCSEYQY-EPHLCNSVDKYHCVNDLNYADDSTYSGVLVNESLMVSTIDNSDMDAM 345
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVL 185
F C +P T G++GLG + ++ Q +I +NV+G C+ + V
Sbjct: 346 GLFWCINEASHPF----TGTDGIIGLGNCKKTLGDQWTTNKVISQNVLGVCLAKGPGPVG 401
Query: 186 FLGDG-----KVPSSGVAW---TPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-LIFD 236
++ G K S W TPM +SA Y A + + K+ T L FD
Sbjct: 402 YISLGVNFKKKFEESTSVWSKLTPM--SSAGECAYSSPLASISFHDKTFVFTSETNLGFD 459
Query: 237 SGASYAYFTSRVYQEIVSLI-----------MRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
+G+ Y + +Y+ ++ ++ + D + + ++ CW P K
Sbjct: 460 TGSDMMYLEAVIYEPLLDMLDSYATSRGYVRVEDSVAQSYYVHQSEQRQ--CWAPPAKMQ 517
Query: 286 GQV------TEYFKPLALSF------TNRRNSVRLVVPPEAYLVISG-RKNVCLGILNGS 332
+ +F L +F T + L+V P +YL + + +C I+
Sbjct: 518 RALLTKASPISHFHALTFTFKGIPRATGHSSDQNLIVEPASYLSWNAPERKLCANIILSP 577
Query: 333 EAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
+++ +G I M+ + ++D E Q++ WK
Sbjct: 578 -----KDSDLGAIGMKGHLFVFDVENQKVQWK 604
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 104/379 (27%), Positives = 153/379 (40%), Gaps = 62/379 (16%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF L VG P + DTGSD+ W+QC APC C + + P K+ +PC +
Sbjct: 147 YF-TRLGVGTPARYVFMVLDTGSDVVWIQC-APCKKCYSQTDPVFNPTKSRSFANIPCGS 204
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L + P C C Y++ YGDG + G T+ L F V V L GC
Sbjct: 205 PLCRRL---DSPGCSTKKHICLYQVSYGDGSFTYGEFSTET--LTFRGTRVGRVAL--GC 257
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI----GQNGRGVLF 186
G++ N G AG+LGLGRGR+S SQ+ R + +C+ + +
Sbjct: 258 GHD--NEGLF--IGAAGLLGLGRGRLSFPSQIGRRFS---RKFSYCLVDRSASSKPSYMV 310
Query: 187 LGDGKVPSSGVAWTPMLQN-SADLKHYIL------------GPAELLYSGKSCGLKDLTL 233
GD + S +TP++ N D +Y+ G L+ S G + +
Sbjct: 311 FGDSAI-SRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTG--NGGV 367
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLI---GTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG S T Y + +RD + LK AP+ C F G+
Sbjct: 368 IIDSGTSVTRLTRPAY-----VALRDAFRVGASNLKRAPEFSLFDTC----FDLSGKTEV 418
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQD 349
+ L F + +P YL+ + + C + +I+G I Q
Sbjct: 419 KVPTVVLHF----RGADVSLPASNYLIPVDNSGSFCFAF----AGTMSGLSIVGNIQQQG 470
Query: 350 KMVIYDNEKQRIGWKPEDC 368
V+YD R+G+ P C
Sbjct: 471 FRVVYDLAASRVGFAPRGC 489
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 66/240 (27%), Positives = 97/240 (40%), Gaps = 26/240 (10%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
+S + + L +G PP + DTGSDL W QC PC C + P K+ R
Sbjct: 58 YSIYLMRLQLGTPPFEIVAEIDTGSDLIWTQC-MPCPNCYTQFAPIFDPSKSST-FKEKR 115
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGCG 132
C H N C YEI Y D S G L T+ ++ ++G F + T GCG
Sbjct: 116 C------------HGN-SCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIGCG 162
Query: 133 YNQHN-PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG-DG 190
N N P ++G++GL G S++SQ+ I +I +C G + G +
Sbjct: 163 LNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDL--PIPGLISYCFSSQGTSKINFGTNA 220
Query: 191 KVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
V G M +Y+ +G + G +D + DSG +Y Y
Sbjct: 221 VVAGDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQDGNIFIDSGTTYTYL 280
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 94/375 (25%), Positives = 144/375 (38%), Gaps = 56/375 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + DT +D WV PC+GCT + P+ + + CS
Sbjct: 98 YVVRVKLGTPGQQMFMVLDTSNDAAWV----PCSGCTGCSSTTFLPNASTTLGSLDCSGA 153
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+C+ + + P + C + YG S LV D L +N + TFGC
Sbjct: 154 QCSQVRGFSCPATG--SSACLFNQSYGGDSSLTATLVQDAITL--ANDVIPG--FTFGC- 206
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG----RGVLFLG 188
N + G + P G+LGLGRG IS++SQ + V +C+ G L LG
Sbjct: 207 INAVSGGSIPP---QGLLGLGRGPISLISQ--AGAMYSGVFSYCLPSFKSYYFSGSLKLG 261
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILG-------------PAELLYSGKSCGLKDLTLIF 235
P S + TP+L+N Y + P+E L + G I
Sbjct: 262 PVGQPKS-IRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGT---II 317
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG F VY I + + G PI G F T +
Sbjct: 318 DSGTVITRFVQPVYFAIRDEFRKQVNG------------PISSLGAFDTCFAATNEAEAP 365
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
A++ + LV+P E L+ S ++ CL + N+I + Q+ +++
Sbjct: 366 AITL--HFEGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMF 423
Query: 355 DNEKQRIGWKPEDCN 369
D R+G E CN
Sbjct: 424 DTTNSRLGIARELCN 438
>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
Length = 397
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 87/381 (22%), Positives = 151/381 (39%), Gaps = 51/381 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSN 71
Y N T+G PP+ D +L W QC + C+ C K + P+ + PC
Sbjct: 42 YNVANFTIGTPPQPASAIIDVAGELVWTQC-SRCSRCFKQDLPLFIPNASSTFRPEPCGT 100
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYG---DGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
C + P D C YE D +++G + T+ F + + S L
Sbjct: 101 DAC-----KSTPTSNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGTATAS-----LA 150
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV---L 185
FGC + T+G +GLGR S+V+Q++ +C+ G G L
Sbjct: 151 FGCVVASDID---TMDGTSGFIGLGRTPRSLVAQMK-----LTKFSYCLSPRGTGKSSRL 202
Query: 186 FLGDGKVPSSG--VAWTPMLQNS--ADLKHYILGPAELLYSGKSCGLKDLT---LIFDSG 238
FLG + G + P ++ S D HY L + + +G + + L+ +
Sbjct: 203 FLGSSAKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATAQSGGILVMHTV 262
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIG-TPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
+ ++ Y+ + + G +A + +C FK + P L
Sbjct: 263 SPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLC----FKKAAGFSRATAP-DL 317
Query: 298 SFTNRRNSVRLVVPPEAYLVISG--RKNVCLGILNGSEAEVGEN-----NIIGEIFMQDK 350
FT + L VPP YL+ G + C IL S A + +++G + ++
Sbjct: 318 VFTFQGGGAALTVPPAKYLIDVGEEKDTACAAIL--SMARLNRTGLEGVSVLGSLQQENV 375
Query: 351 MVIYDNEKQRIGWKPEDCNTL 371
+YD +K+ + ++P DC++L
Sbjct: 376 HFLYDLKKETLSFEPADCSSL 396
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 59/125 (47%), Gaps = 12/125 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF+ + VG P + DTGSD+TWVQC PC C + + + P + V C N
Sbjct: 167 YFS-RVGVGSPARQLYMVLDTGSDVTWVQCQ-PCADCYQQSDPVFDPSLSTSYASVACDN 224
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
PRC H + C++ C YE+ YGDG ++G T+ L S + GC
Sbjct: 225 PRC---HDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTL---GDSAPVSSVAIGC 278
Query: 132 GYNQH 136
G++
Sbjct: 279 GHDNE 283
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 91/366 (24%), Positives = 147/366 (40%), Gaps = 43/366 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V +G P + DT +D W+ C C GC+ + P K+ + C P
Sbjct: 88 YIVRANIGTPAQAMLVALDTSNDAAWIPCSG-CVGCSS--SVLFDPSKSSSSRTLQCEAP 144
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+C PNP C + C + + YG GS+I A +T L + + N TFGC
Sbjct: 145 QCK--QAPNP-SCT-VSKSCGFNMTYG--GSAIEAYLTQ-DTLTLATDVIPN--YTFGC- 194
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFLG 188
N + G++GLGRG +S++SQ L ++ +C+ N G L LG
Sbjct: 195 ---INKASGTSLPAQGLMGLGRGPLSLISQ--SQNLYQSTFSYCLPNSKSSNFSGSLRLG 249
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFD--SGASYAYFTS 246
P + TP+L+N Y + + K + L FD +GA + +
Sbjct: 250 PKNQPIR-IKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSG 308
Query: 247 RVYQEIVS---LIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
VY +V + MR+ +K A + +L G F + F + F
Sbjct: 309 TVYTRLVEPAYVAMRNEFRRRVKNA-NATSL-----GGFDTCYSGSVVFPSVTFMFAG-- 360
Query: 304 NSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIG 362
+ + +PP+ L+ S N+ CL + N+I + Q+ V+ D R+G
Sbjct: 361 --MNVTLPPDNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNSRLG 418
Query: 363 WKPEDC 368
E C
Sbjct: 419 ISRETC 424
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 84/201 (41%), Gaps = 17/201 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSNP 72
+ + + +G P DTGSD++WVQC PC+ C + + P + CS+
Sbjct: 131 YVITVGIGSPAVTQTMSMDTGSDVSWVQCK-PCSQCHSEVDSLFDPSASSTYSPFSCSSA 189
Query: 73 RCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L C + QC Y + Y DG S+ G +D L GS FGC
Sbjct: 190 ACVQLSQSQQGNGCS--SSQCQYIVSYVDGSSTTGTYSSDTLTL----GSNAIKGFQFGC 243
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
+Q G S T G++GLG S+VSQ G +C+ FL G
Sbjct: 244 --SQSESGGFS-DQTDGLMGLGGDAQSLVSQTA--GTFGKAFSYCLPPTPGSSGFLTLGA 298
Query: 192 VPSSGVAWTPMLQNSADLKHY 212
SG TPML+++ +Y
Sbjct: 299 ASRSGFVKTPMLRSTQIPTYY 319
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 89/365 (24%), Positives = 146/365 (40%), Gaps = 41/365 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPRCA 75
+ V +G P + DT +D W+ C C GC+ K V C P+C
Sbjct: 96 YIVRAKIGTPAQTMLLAMDTSNDAAWIPCSG-CVGCSSTVFNNVKSTTFKTVGCEAPQCK 154
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGA-LVTDLFPLRFSNGSVFNVP-LTFGCGY 133
+ P K C + + YG SSI A L D+ L + ++P TFGC
Sbjct: 155 QV-----PNSKCGGSACAFNMTYGS--SSIAANLSQDVVTL-----ATDSIPSYTFGCL- 201
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFLGD 189
G PP G+LGLGRG +S++SQ + L ++ +C+ N G L LG
Sbjct: 202 -TEATGSSIPPQ--GLLGLGRGPMSLLSQTQN--LYQSTFSYCLPSFRSLNFSGSLRLGP 256
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFD--SGASYAYFTSR 247
P + TP+L+N Y + + + + L F+ +GA + +
Sbjct: 257 VGQPKR-IKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGT 315
Query: 248 VYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV-TEYFKPL-ALSFTNRRNS 305
V+ +V+ P A D +LG T Y P+ A + T +
Sbjct: 316 VFTRLVA---------PAYTAVRDAFRKRVGNATVTSLGGFDTCYTSPIVAPTITFMFSG 366
Query: 306 VRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
+ + +PP+ L+ S ++ CL + + N+I + Q+ +++D R+G
Sbjct: 367 MNVTLPPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRLGVA 426
Query: 365 PEDCN 369
E C
Sbjct: 427 REPCT 431
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 149/366 (40%), Gaps = 39/366 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHK----NIVPCSN 71
+ V + +G P + F FDTGS +TW QC PC G C E+++ P K N V CS+
Sbjct: 135 YVVTVGLGTPKEDFTLVFDTGSGITWTQCQ-PCLGSCYPQKEQKFDPTKSTSYNNVSCSS 193
Query: 72 PRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L P R C N C Y+I YGD S G T+ L S+ VF L FG
Sbjct: 194 ASCNLL--PTSERGCSASNSTCLYQIIYGDQSYSQGFFATE--TLTISSSDVFTNFL-FG 248
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG Q N G AG+LGL +S+ SQ E + +C+ +L G
Sbjct: 249 CG--QSNNGLFG--QAAGLLGLSSSSVSLPSQTAEK--YQKQFSYCLPSTPSSTGYLNFG 302
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAYFT 245
S +TP+ + A Y + + +G + I DSG
Sbjct: 303 GKVSQTAGFTPI--SPAFSSFYGIDIVGISVAGSQLPIDPSIFTTSGAIIDSGTVITRLP 360
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
Y+ + + P D+ L C+ F V+ F +++SF +
Sbjct: 361 PTAYKALKEAFDEKMSNYP--KTNGDELLDTCYD--FSNYTTVS--FPKVSVSF---KGG 411
Query: 306 VRLVVPPEAYL-VISGRKNVCLGI-LNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGW 363
V + + L +++G K VCL N ++E G I G + V+YD K IG+
Sbjct: 412 VEVDIDASGILYLVNGVKMVCLAFAANKDDSEFG---IFGNHQQKTYEVVYDGAKGMIGF 468
Query: 364 KPEDCN 369
C+
Sbjct: 469 AAGACS 474
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 98/370 (26%), Positives = 148/370 (40%), Gaps = 42/370 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKNI----VPCS 70
+ V L +G P DTGSDL+WVQC PC + P+K Y P + VPC
Sbjct: 127 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCNSSSCYPQKDPLYDPTASSTYAPVPCD 185
Query: 71 NPRCAAL---HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
+ C L + + C Y IEYG+ +++G T+ L + V
Sbjct: 186 SKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTL---SPQVSVKDF 242
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCI--GQNGRGV 184
FGCG Q + G+LGLG S+VSQ E YG +C+ G + G
Sbjct: 243 GFGCGLVQQG----TFDLFDGLLGLGGAPESLVSQTAETYG---GAFSYCLPPGNSTTGF 295
Query: 185 LFLG--DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----IFDSG 238
L LG ++G +TP+ Y++ + GK + L I DSG
Sbjct: 296 LALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVLSGGMIIDSG 355
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
Y + + + PL +D L C+ F + VT +AL+
Sbjct: 356 TIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYN--FTGIANVT--VPTVALT 411
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
F + ++ L VP + CL G A G+ IIG + + V+YD+ +
Sbjct: 412 F-DGGATIDLDVPSGVLI------QDCLAFAGG--ASDGDVGIIGNVNQRTFEVLYDSGR 462
Query: 359 QRIGWKPEDC 368
+G++P C
Sbjct: 463 GHVGFRPGAC 472
>gi|281200780|gb|EFA74998.1| putative aspartyl protease [Polysphondylium pallidum PN500]
Length = 394
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 146/368 (39%), Gaps = 36/368 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP--EKQYKPHKNIVPCSNPRC 74
+ +N + F DTGS L + C C P + + + +V C + C
Sbjct: 39 YQINTKIIVGNHTFTVQVDTGSSLMAIPM-VNCNTCHDRPSYDPTHSQYSKVVSCFSEHC 97
Query: 75 AALHWPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
PP+CK+ D CD+ I YGDG G + D+ L +G G
Sbjct: 98 LG-SGSAPPQCKNRAEDDCDFVILYGDGSRVSGKIYQDVVNLSGLSGIA-------NFGA 149
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIV-----SQLREYGLIRNVIGHCIGQNGRGVLFLG 188
N+ G P G++G GR + V S ++ +GL +N+ + GRG L LG
Sbjct: 150 NRIETGDFEYPRADGIVGFGRSCKTCVPTVFESLVQAHGL-KNIFAMSMDYEGRGTLSLG 208
Query: 189 DGKVPSSGVA---WTPMLQNSADLKHYILGPAELLYSGKSC--GLKDLTLIFDSGASYAY 243
+ PS+ + +TP+ + D Y + P L +I DSG+S
Sbjct: 209 ELN-PSNHIGEIQYTPLFE---DGPFYNIKPTNFKVDDTVILPRLLGRQVIVDSGSSALS 264
Query: 244 FTSRVYQEIVSLIMRDLIGTP-LKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNR 302
S Y +V ++ + +P IC+ + + L+F
Sbjct: 265 LASGAYDALVHHFRKNYCHVAGICDSPSILDGSICYNS-----ASSLDLLPTIYLTF--- 316
Query: 303 RNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIG 362
V++ VPP+ YL + N G + I+G++FM+ ++DNE++RIG
Sbjct: 317 EGGVKVAVPPKNYLTKAPLTNGASGYCWMIDRADPSTTILGDVFMRGYYTVFDNEEKRIG 376
Query: 363 WKPEDCNT 370
+ NT
Sbjct: 377 FAVNSRNT 384
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 152/374 (40%), Gaps = 60/374 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG--CTKPPEKQYKPHKNIV----PCS 70
+ + +T+G P DTGSD++WVQC APC C+ +K + P + C
Sbjct: 129 YVITVTIGTPAVTQVMSIDTGSDVSWVQC-APCAAQSCSSQKDKLFDPAMSATYSAFSCG 187
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ +CA L K QC Y ++YGDG ++ G +D L S+ FG
Sbjct: 188 SAQCAQLGDEGNGCLK---SQCQYIVKYGDGSNTAGTYGSDTLSLTSSDAV---KSFQFG 241
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCI---GQNGRGVLF 186
C + G + D G++GLG S+VSQ YG +C+ +G G L
Sbjct: 242 C--SHRAAGFVGELD--GLMGLGGDTESLVSQTAATYG---KAFSYCLPPPSSSGGGFLT 294
Query: 187 LG-DGKVPSSGVAWTPMLQNSA-----------DLKHYILGPAELLYSGKSCGLKDLTLI 234
LG G SS + TPM++ S + +L ++SG S +
Sbjct: 295 LGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVFSGAS--------V 346
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG YQ + + +++ P AP +L C+ F +T
Sbjct: 347 VDSGTVITQLPPTAYQALRTAFKKEMKAYP-SAAPVG-SLDTCFD--FSGFNTIT--VPT 400
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
+ L+F +R ++ L + Y CL + A G+ I+G + + +++
Sbjct: 401 VTLTF-SRGAAMDLDISGILYA-------GCLAFT--ATAHDGDTGILGNVQQRTFEMLF 450
Query: 355 DNEKQRIGWKPEDC 368
D + IG++ C
Sbjct: 451 DVGGRTIGFRSGAC 464
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 88/371 (23%), Positives = 150/371 (40%), Gaps = 44/371 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + LT+G PP DTGSDL W QC PC GC + ++P ++ +PC +
Sbjct: 50 YLMKLTLGTPPVDVYGLVDTGSDLVWAQC-TPCQGCYRQKSPMFEPLRSNTYTPIPCDSE 108
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV-PLTFGC 131
C +L + C P C Y Y D + G L + ++G V + FGC
Sbjct: 109 ECNSLFGHS---CS-PQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVGDIVFGC 164
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCI-----GQNGRGVL 185
G++ N G + D ++GLG G +S+VSQ YG R C+ + G +
Sbjct: 165 GHS--NSGTFNENDMG-IIGLGGGPLSLVSQFGNLYGSKR--FSQCLVPFHADPHTLGTI 219
Query: 186 FLGDGK-VPSSGVAWTPMLQNSADLKHYI----LGPAELLYSGKSCG-LKDLTLIFDSGA 239
GD V GVA TP++ + + + + S S L ++ DSG
Sbjct: 220 SFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSSEMLSKGNIMIDSGT 279
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV-TEYFKPLALS 298
Y Y +V + P+ PD T +C+R G + +F+ +
Sbjct: 280 PATYLPQEFYDRLVKELKVQSNMLPIDDDPDLGT-QLCYRSETNLEGPILIAHFEGADVQ 338
Query: 299 FTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEK 358
++ +PP+ + C + ++ E I G + ++ +D ++
Sbjct: 339 LM----PIQTFIPPKDGV-------FCFAMAGTTDGEY----IFGNFAQSNVLIGFDLDR 383
Query: 359 QRIGWKPEDCN 369
+ + +K DC+
Sbjct: 384 KTVSFKATDCS 394
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 87/372 (23%), Positives = 143/372 (38%), Gaps = 37/372 (9%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC--TGCTKPPEKQYKPHKNI----V 67
F Y + + +G PP+ DTGSDL WV+C T P Q+ P ++ V
Sbjct: 99 FEYL-MTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRSSTYGRV 157
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP- 126
C C AL C ++ C Y YGDG ++ G L T+ F F +G P
Sbjct: 158 SCQTDACEALGRAT---CDDGSN-CAYLYAYGDGSNTTGVLSTETF--TFDDGGSGRSPR 211
Query: 127 ------LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--- 177
+ FGC P G +S+V+QL + +C+
Sbjct: 212 QVRVGGVKFGCSTATAGSFPADGLVGLGGG-----AVSLVTQLGGATSLGRRFSYCLVPH 266
Query: 178 GQNGRGVLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFD 236
N L G V G A TP++ D + ++ + + + +I D
Sbjct: 267 SVNASSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVASAASSRIIVD 326
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
SG + + + IV + R + P++ +P D L +C+ + + + E L
Sbjct: 327 SGTTLTFLDPSLLGPIVDELSRRITLPPVQ-SP-DGLLQLCYNVAGREV-EAGESIPDLT 383
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDN 356
L F + + PE V +CL I+ +E + +I+G + Q+ V YD
Sbjct: 384 LEF---GGGAAVALKPENAFVAVQEGTLCLAIVATTEQQ--PVSILGNLAQQNIHVGYDL 438
Query: 357 EKQRIGWKPEDC 368
+ + + DC
Sbjct: 439 DAGTVTFAGADC 450
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 88/372 (23%), Positives = 147/372 (39%), Gaps = 46/372 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ N T+G PP+ D +L W QC C+ C + + P + PC P
Sbjct: 51 YVANFTIGTPPQPASAVIDLAGELVWTQCKQ-CSRCFEQDTPLFDPTASNTYRAEPCGTP 109
Query: 73 RCAALHWPNPPRCKHPNDQCDYE--IEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C ++ P+ R + C Y+ GD G +G TD F + + S L FG
Sbjct: 110 LCESI--PSDSR-NCSGNVCAYQASTNAGDTGGKVG---TDTFAVGTAKAS-----LAFG 158
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
C P +G++GLGR S+V+Q + H G+N LFLG
Sbjct: 159 CVVASDIDTMGGP---SGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGKN--SALFLGSS 213
Query: 191 KVPSSG--VAWTPMLQ---NSADLKHYILGPAELLYSGKSC---GLKDLTLIFDSGASYA 242
+ G A TP + N DL +Y E L +G + T++ D+ + +
Sbjct: 214 AKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVLLDTFSPIS 273
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLK--LAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
+ YQ + + + P+ + P D P A G + L +F
Sbjct: 274 FLVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFP-----KSGASGAAPD----LVFTF- 323
Query: 301 NRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEA-EVGENNIIGEIFMQDKMVIYDNEKQ 359
R + V YL+ VCL +L+ + E +++G + ++ ++D +K+
Sbjct: 324 --RGGAAMTVAASNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKE 381
Query: 360 RIGWKPEDCNTL 371
+ ++P DC L
Sbjct: 382 TLSFEPADCTKL 393
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 92/393 (23%), Positives = 154/393 (39%), Gaps = 45/393 (11%)
Query: 9 FFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP------------------CT 50
F+ F Y A + VG PP F DTGSDL W++C+
Sbjct: 75 LFYGDFEYLAA-VNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPP 133
Query: 51 GCTKPPEKQYKPHKNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIG 106
+ P + V C P C AL C + CD+ Y DG S+ G
Sbjct: 134 PPPPEAVVYFNPFDSSSYSRVGCDGPSCLAL--ATNASCNGDSHACDFRYSYRDGASATG 191
Query: 107 ALVTDLFPL--RFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL- 163
L D F +N + + FGC G D G++GLG G +S+ SQL
Sbjct: 192 LLAADTFTFGGNINNDTTSTASIDFGCATG--TAGREFQAD--GMVGLGAGPLSLASQLG 247
Query: 164 REYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSAD-LKHYILGPAELLYS 222
R++ + + I + F V G A TP++ +S++ +Y + L +
Sbjct: 248 RKFSFC--LTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVA 305
Query: 223 GKSC-GLKDLT-LIFDSGASYAYFT-SRVYQEIVSLIMRDLIGTPLKLA-PDDKTLPICW 278
G+ G ++ +I D+G + + + + + R + G L A P D+TL +C+
Sbjct: 306 GQPVPGTTSVSKVIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLPRAPPPDETLELCY 365
Query: 279 RGPFKALGQVTEYFKPLALSF-TNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVG 337
+ V + L VRL E V+ +CL ++ S E+
Sbjct: 366 D--VSRVKDVDGVIPDVTLVLGGGGGGEVRLT--GEGTFVLVKEGVLCLAVVTTSP-ELQ 420
Query: 338 ENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
+++G + +QD V D + + + +C++
Sbjct: 421 PLSVLGNVALQDLHVGIDLDARTATFATANCDS 453
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 59/125 (47%), Gaps = 12/125 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF+ + VG P + DTGSD+TWVQC PC C + + + P + V C N
Sbjct: 163 YFS-RVGVGSPARQLYMVLDTGSDVTWVQCQ-PCADCYQQSDPVFDPSLSTSYASVACDN 220
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
PRC H + C++ C YE+ YGDG ++G T+ L S + GC
Sbjct: 221 PRC---HDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTL---GDSAPVSSVAIGC 274
Query: 132 GYNQH 136
G++
Sbjct: 275 GHDNE 279
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 94/375 (25%), Positives = 144/375 (38%), Gaps = 56/375 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + DT +D WV PC+GCT + P+ + + CS
Sbjct: 98 YVVRVKLGTPGQQMFMVLDTSNDAAWV----PCSGCTGFSSTTFLPNASTTLGSLDCSGA 153
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+C+ + + P + C + YG S LV D L +N + TFGC
Sbjct: 154 QCSQVRGFSCPATG--SSACLFNQSYGGDSSLTATLVQDAITL--ANDVIPG--FTFGC- 206
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG----RGVLFLG 188
N + G + P G+LGLGRG IS++SQ + V +C+ G L LG
Sbjct: 207 INAVSGGSIPP---QGLLGLGRGPISLISQ--AGAMYSGVFSYCLPSFKSYYFSGSLKLG 261
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILG-------------PAELLYSGKSCGLKDLTLIF 235
P S + TP+L+N Y + P+E L + G I
Sbjct: 262 PVGQPKS-IRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGT---II 317
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG F VY I + + G PI G F T +
Sbjct: 318 DSGTVITRFVQPVYFAIRDEFRKQVNG------------PISSLGAFDTCFAATNEAEAP 365
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIY 354
A++ + LV+P E L+ S ++ CL + N+I + Q+ +++
Sbjct: 366 AITL--HFEGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMF 423
Query: 355 DNEKQRIGWKPEDCN 369
D R+G E CN
Sbjct: 424 DTTNSRLGIARELCN 438
>gi|308810200|ref|XP_003082409.1| Aspartyl protease (ISS) [Ostreococcus tauri]
gi|116060877|emb|CAL57355.1| Aspartyl protease (ISS) [Ostreococcus tauri]
Length = 455
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 84/367 (22%), Positives = 148/367 (40%), Gaps = 40/367 (10%)
Query: 30 FDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRC--AALHWPNPPRCKH 87
FD DTGS LT++ C P+ + + R A + + C+
Sbjct: 33 FDLFVDTGSPLTYLACWPASREFVDYCGVHEHPYYDARVSDDFRFLNATTNAEDDAFCRR 92
Query: 88 PND---------QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNP 138
+ C++ I Y D ++IG +V D+ + + + FGCG
Sbjct: 93 ASSLFILDDESGACEFGIPYMDNSTAIGVMVEDVMTV---GDELAGAKMIFGCGCLVEAN 149
Query: 139 GPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLFLGDGKVP---- 193
G D G+ G GRG + +QL G+I +V G C G L G+
Sbjct: 150 GEADRYD--GMAGFGRGETTFHTQLARTGVIDADVFGFCSEGAGTNTAMLSLGRYDFGRD 207
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
S ++WT ML + DL + + L + G ++ + DSG + +Y + +
Sbjct: 208 LSPLSWTRMLGDD-DLA--VRTMSWKLGAKIIAGSTNVYTVLDSGTTLVVLPPVMYGDFM 264
Query: 254 SLIMRDLIG-----TPLKLAPDDKTLPICWRGPFKALGQ--VTEYFKPLALSFTNRRNSV 306
++ ++ + + + D C+ AL + + L +++ +
Sbjct: 265 KELLDRIVDLNATYSDVHVFEDYSFSTFCFYSKSGALTNDIIRDALPKLTITYDP---DI 321
Query: 307 RLVVPPEAYLVISGR--KNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWK 364
LV+PPE YL S + C+GI+ G+E ++ I+G+ +++ V YD E +RIG
Sbjct: 322 ALVLPPENYLFSSWIVPREHCIGIMKGAEGQI----ILGQQTLRNTFVEYDLENERIGLA 377
Query: 365 PEDCNTL 371
C L
Sbjct: 378 VTHCENL 384
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 101/395 (25%), Positives = 153/395 (38%), Gaps = 67/395 (16%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
S F VNL++G PP DTGS L WVQC PC C + + P K++ + C
Sbjct: 102 SGFLVNLSIGSPPVTQLVVVDTGSSLLWVQC-LPCINCFQQSTSWFDPLKSVSFKTLGCG 160
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTD-LFPLRFSNGSVFNV---- 125
P ++ N +C N Q +Y++ Y G SS G L + L G VF
Sbjct: 161 FP---GYNYINGYKCNRFN-QAEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAIS 216
Query: 126 ---------PLTFGCGYNQHNPGPLSPPDTAGVLGLGRG-RISIVSQLREYGLIRNVIGH 175
+TFGCG+ N + GV GLG I++ +QL N +
Sbjct: 217 TQISKIKKSNITFGCGH--MNIKTNNDDAYNGVFGLGAYPHITMATQL------GNKFSY 268
Query: 176 CIGQNG-----RGVLFLGDGKVPSS---------GVAWTPMLQNSADLKHYILGPAELLY 221
CIG L LG G G + + S K + P
Sbjct: 269 CIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKI 328
Query: 222 SGKSCGLKDLTLIFDSGASYAYFTS----RVYQEIVSLIMRDLIGTPLKLAPDDKTLP-I 276
S G ++ DSG +Y + +Y EIV DL+ L+ P + +
Sbjct: 329 SSDGSG----GVLIDSGMTYTKLANGGFELLYDEIV-----DLMKGLLERIPTQRKFEGL 379
Query: 277 CWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEV 336
C++G + + F + F LV+ + G CL IL S +E+
Sbjct: 380 CFKG---VVSRDLVGFPAVTFHFA---GGADLVLESGSLFRQHGGDRFCLAILP-SNSEL 432
Query: 337 GENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
++IG + Q+ V +D E+ ++ ++ DC L
Sbjct: 433 LNLSVIGILAQQNYNVGFDLEQMKVFFRRIDCQLL 467
>gi|297819828|ref|XP_002877797.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323635|gb|EFH54056.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 530
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 156/381 (40%), Gaps = 53/381 (13%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP----------PEKQYKPH 63
F ++A N++VG P F DTGS+L W+ C+ T C + P Y P+
Sbjct: 101 FLHYA-NVSVGTPATWFLVALDTGSNLFWLPCNCGST-CIRDLKDIGLSQSRPLNLYSPN 158
Query: 64 KNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGS-SIGALVTDLFPLRFS 118
+ + C++ RC +C P C Y+I+Y + + G L D+ L
Sbjct: 159 TSSTSSSIRCNDDRCFGSS-----QCSSPASSCPYQIQYLSKDTFTTGTLFEDVLHLVTE 213
Query: 119 NGSVFNVP--LTFGCGYNQHNPGPL-SPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGH 175
+ + V +T GCG NQ G L S G+LGLG S+ S L + + N
Sbjct: 214 DVDLKPVKANITLGCGRNQ--TGFLQSSAAINGLLGLGMKDYSVPSILAKAKITANSFSM 271
Query: 176 CIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIF 235
C G + + G + TP+L Y + E+ G G++ L L F
Sbjct: 272 CFGNIIDVIGRISFGDKGYTDQMETPLLPTEPS-PTYAVNVTEVSVGGDVVGVQLLAL-F 329
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK-----ALGQVTE 290
D+G S+ + Y LI DK PI PF+ + T
Sbjct: 330 DTGTSFTHLLEPEY---------GLITKAFDDHVTDKRRPIDPEIPFEFCYDLSPNSTTI 380
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV---CLGILNGSEAEVGENNIIGEIFM 347
F +A++F S+ + P ++ N CLGIL + ++ NIIG+ FM
Sbjct: 381 LFPRVAMTFEG--GSLMFLRNP--LFIVWNEDNTAMYCLGILKSVDFKI---NIIGQNFM 433
Query: 348 QDKMVIYDNEKQRIGWKPEDC 368
V++D E+ +GWK DC
Sbjct: 434 SGYRVVFDRERMILGWKRSDC 454
>gi|125552953|gb|EAY98662.1| hypothetical protein OsI_20585 [Oryza sativa Indica Group]
Length = 429
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 98/430 (22%), Positives = 164/430 (38%), Gaps = 94/430 (21%)
Query: 12 PIFSY---FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA----PCTGCTKPPEKQ----- 59
P+ +Y + ++L +G PP++F DTGSDLTWV C C C
Sbjct: 17 PVTTYTDGYLLSLNLGMPPQVFQVYLDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIPS 76
Query: 60 ----------------------YKPHKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIE 97
+ + PC+ CA + + C P Y
Sbjct: 77 FSPSQSSSNMKELCGSRFCVDIHSSDNSHDPCAAVGCAIPSFMS-GLCTRPCPPFSY--T 133
Query: 98 YGDGGSSIGALVTDLFPLRFSNGSVFNVPL-------TFGC-GYNQHNPGPLSPPDTAGV 149
YG G +G+L D+ L +GS+F + + FGC G + P G+
Sbjct: 134 YGGGALVLGSLAKDIVTL---HGSIFGIAILLDVPGFCFGCVGSSIREP--------IGI 182
Query: 150 LGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGVLFLGDGKVPS-SGVAWTP 201
G G+G +S+ SQL G + HC N L +GD + + +TP
Sbjct: 183 AGFGKGILSLPSQL---GFLDKGFSHCFLGFRFARNPNFTSSLIMGDLALSAKDDFLFTP 239
Query: 202 MLQNSADLKHYILGPAELLYSGKSCGLK------------DLTLIFDSGASYAYFTSRVY 249
ML++ + Y +G E + G + + +I D+G +Y + Y
Sbjct: 240 MLKSITNPNFYYIG-LEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFY 298
Query: 250 QEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLV 309
I+S + ++ +C++ P + + F V+L
Sbjct: 299 TAILSSLASVILYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFL---GDVKLT 355
Query: 310 VPPEA--YLVISGRKNVCLGIL----NGSEAEVGENN-----IIGEIFMQDKMVIYDNEK 358
+P ++ Y V + + +V + L E +VG N ++G MQ+ V+YD E
Sbjct: 356 LPKDSCYYAVTAPKNSVVVKCLLFQRMDDEDDVGGANNGPGAVLGSFQMQNVEVVYDMEA 415
Query: 359 QRIGWKPEDC 368
RIG++P+DC
Sbjct: 416 GRIGFQPKDC 425
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 99/380 (26%), Positives = 152/380 (40%), Gaps = 57/380 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG--CTKPPEKQYKPHKNI----VPCS 70
+ V L +G P DTGSDL+WVQC PC C + + P + VPC
Sbjct: 91 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCGAGECYAQKDPLFDPSSSSSYASVPCD 149
Query: 71 NPRCAALHWPNPPR-CKHPNDQ----CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV 125
+ C L C + C+Y IEYG+ ++ G T+ L+ V
Sbjct: 150 SDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTLKP---GVVVA 206
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
FGCG +QH GP D G+LGLG S+VSQ +C+ G
Sbjct: 207 DFGFGCGDHQH--GPYEKFD--GLLGLGGAPESLVSQTSSQ--FGGPFSYCLPPTSGGAG 260
Query: 186 FLGDGKVP-------SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT------ 232
FL G P +SG+++TPM + + YI + +G S G L
Sbjct: 261 FLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYI-----VTLTGISVGGAPLAIPPSAF 315
Query: 233 ---LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
++ DSG + Y + S + L + L C+ F VT
Sbjct: 316 SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYD--FTGHANVT 373
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILN-GSEAEVGENNIIGEIFMQ 348
++L+F+ ++ L P A +++ G CL G++ +G IIG + +
Sbjct: 374 --VPTISLTFSGGA-TIDLAAP--AGVLVDG----CLAFAGAGTDNAIG---IIGNVNQR 421
Query: 349 DKMVIYDNEKQRIGWKPEDC 368
V+YD+ K +G++ C
Sbjct: 422 TFEVLYDSGKGTVGFRAGAC 441
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 100/385 (25%), Positives = 154/385 (40%), Gaps = 50/385 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L +G PP F DTGSDLTW QC PC C Y + VPC++
Sbjct: 95 YLMELAIGTPPVPFVALADTGSDLTWTQCK-PCKLCFPQDTPIYDTAASASFSPVPCASA 153
Query: 73 RCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP----- 126
C + W + C C Y Y DG S G L T+ L F+ GS P
Sbjct: 154 TCLPI-WRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTET--LTFA-GSSPGAPGPGVS 209
Query: 127 ---LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+ FGCG + G LS ++ G +GLGRG +S+V+QL + G
Sbjct: 210 VGGVAFGCGVDN---GGLS-YNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLGSP 265
Query: 184 VLF--LGDGKVPSS----GVAWTPMLQNSADLKHYI-------LGPAELLYSGKSCGLKD 230
VLF L + PS+ V TP++Q + Y LG A L + L+D
Sbjct: 266 VLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRD 325
Query: 231 ---LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQ 287
+I DSG + ++ +V+ + L + + D C+ P A Q
Sbjct: 326 DGSGGMIVDSGTIFTVLVESAFRVVVNHVAGVLNQPVVNASSLDSP---CF--PATAGEQ 380
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGR-KNVCLGILNGSEAEVGENNIIGEIF 346
+ L F + + + + Y+ + + CL I A +I+G
Sbjct: 381 QLPDMPDMLLHFAGGAD---MRLHRDNYMSFNQESSSFCLNIAGAPSA---YGSILGNFQ 434
Query: 347 MQDKMVIYDNEKQRIGWKPEDCNTL 371
Q+ +++D ++ + P DC+ L
Sbjct: 435 QQNIQMLFDITVGQLSFVPTDCSKL 459
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 61/123 (49%), Gaps = 8/123 (6%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ ++ +VG PP DTGSD+ W+QC PC C + P ++ +PCS+
Sbjct: 94 YLMSYSVGTPPFQILGIVDTGSDIIWLQCQ-PCEDCYNQTTPIFDPSQSKTYKTLPCSSN 152
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
C ++ + C ND+C+Y I YGD S G L + L ++GS P T GC
Sbjct: 153 ICQSVQ--SAASCSSNNDECEYTITYGDNSHSQGDLSVETLTLGSTDGSSVQFPKTVIGC 210
Query: 132 GYN 134
G+N
Sbjct: 211 GHN 213
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 99/385 (25%), Positives = 150/385 (38%), Gaps = 55/385 (14%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----V 67
P + + VG P DT SDLTW+QC PC C + P + +
Sbjct: 133 PTSGEYIAKIAVGTPGVEALLALDTASDLTWLQCQ-PCRRCYPQSGPVFDPRHSTSYREM 191
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP- 126
+ C AL K C Y + YGDG +++G + + L F+ G +P
Sbjct: 192 SFNAADCQALGRSGGGDAKR--GTCVYTVGYGDGSTTVGDFIEET--LTFAGG--VRLPR 245
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG--RGV 184
++ GCG++ N G P AG+LGLGRG +S +Q+ G + + G
Sbjct: 246 ISIGCGHD--NKGLFGAP-AAGILGLGRGLMSFPNQIDHNGTFSYCLVDFLSGPGSLSST 302
Query: 185 LFLGDGKVPSS-GVAWTPMLQNSADLKHYILGPAELLYSG---KSCGLKDLTL------- 233
L G G V +S V++TP + N Y + + G +DL L
Sbjct: 303 LTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDPYTGRG 362
Query: 234 --IFDSGASYAYFTSRVY---QEIVSLIMRDL----IGTPLKLAPDDKTLPICWRGPFKA 284
I DSG + Y ++ + DL IG P D + RG K
Sbjct: 363 GVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFF--DTCYTVGGRG-MKK 419
Query: 285 LGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIG 343
+ V+ +F SV + + P+ YL+ + VC + V +IIG
Sbjct: 420 VPTVSMHFA----------GSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSV---SIIG 466
Query: 344 EIFMQDKMVIYDNEKQRIGWKPEDC 368
I Q ++YD R+G+ P C
Sbjct: 467 NIQQQGFRIVYD-IGGRVGFAPNSC 490
>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
Length = 497
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 101/413 (24%), Positives = 167/413 (40%), Gaps = 79/413 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA--PCTGCTKPPEKQ---YKPHKN----IV 67
+A ++G PP+ DTGS LTWV C + C C+ P + P + +V
Sbjct: 103 YAFTASLGTPPQPLPVLLDTGSQLTWVPCTSNYDCRNCSSPFAAAVPVFHPKNSSSSRLV 162
Query: 68 PCSNPRCAALH-WPNPPRCKHP----------NDQC-DYEIEYGDGGSSIGALVTDLF-- 113
C NP C +H + +C+ P ++ C Y + YG GS+ G L+ D
Sbjct: 163 GCRNPSCLWVHSAEHVAKCRAPCSRGANCTPASNVCPPYAVVYGS-GSTAGLLIADTLRA 221
Query: 114 PLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVI 173
P R +G V L + H P +G+ G GRG S+ +QL ++
Sbjct: 222 PGRAVSGFVLGCSLV-----SVHQP-------PSGLAGFGRGAPSVPAQLGLSKFSYCLL 269
Query: 174 GHCIGQNG--RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL 231
N G L LG + G+ + P+++++A K L SG + G K +
Sbjct: 270 SRRFDDNAAVSGSLVLGGD---NDGMQYVPLVKSAAGDKQPYAVYYYLALSGVTVGGKAV 326
Query: 232 TL---------------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI 276
L I DSG ++ Y V+Q + ++ + G + ++ L +
Sbjct: 327 RLPARAFAANAAGSGGAIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDVEEGLGL 386
Query: 277 CWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNV------------ 324
P AL Q + LS + +V + +P E Y V++GR V
Sbjct: 387 ---HPCFALPQGAKSMALPELSLHFKGGAV-MQLPLENYFVVAGRAPVPGAGAGAGAAEA 442
Query: 325 -CLGILN------GSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
CL ++ + G I+G Q+ +V YD EK+R+G++ + C +
Sbjct: 443 ICLAVVTDFGGSGAGDEGGGPAIILGSFQQQNYLVEYDLEKERLGFRRQPCAS 495
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 86/371 (23%), Positives = 141/371 (38%), Gaps = 51/371 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT-GCTKPPEKQYKPHKNI-------VP 68
+ ++ +VG PP++ D SD W+QC A T G P P V
Sbjct: 97 YVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIREVR 156
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG--SSIGALVTDLFPLRFSNGSVFNVP 126
C+N C L P C + C Y YG G ++ G L D F +V
Sbjct: 157 CANRGCQRL---VPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAF----ATVRADG 209
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
+ FGC D GV+GLGRG +S VSQL+ + G +LF
Sbjct: 210 VIFGCAVATEG-------DIGGVIGLGRGELSPVSQLQIGRFSYYLAPDDAVDVGSFILF 262
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGAS------ 240
L D K +S TP++ + A Y + A + G+ + T + S
Sbjct: 263 LDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLS 322
Query: 241 ----YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQVTEYFK 293
+ + Y+ ++R + + ++L D + L +C+ A +V
Sbjct: 323 ITIPVTFLDAGAYK-----VVRQAMASKIELRAADGSELGLDLCYTSESLATAKVPS--- 374
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVI 353
+AL F +V + + + S CL IL + G+ +++G + +I
Sbjct: 375 -MALVFAG--GAVMELEMGNYFYMDSTTGLECLTIL---PSPAGDGSLLGSLIQVGTHMI 428
Query: 354 YDNEKQRIGWK 364
YD R+ ++
Sbjct: 429 YDISGSRLVFE 439
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 147/378 (38%), Gaps = 54/378 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF + L VG P DTGSD+ W+QC +PC C + + P K+ VPC +
Sbjct: 135 YF-MRLGVGTPATNVYMVLDTGSDVVWLQC-SPCKACYNQTDAIFDPKKSKTFATVPCGS 192
Query: 72 PRCAALHWPNPPRC-KHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L + C + C Y++ YGDG + G T+ L F V +VPL G
Sbjct: 193 RLCRRLD--DSSECVTRRSKTCLYQVSYGDGSFTEGDFSTE--TLTFHGARVDHVPL--G 246
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-----EYGLIRNVIGHCIGQNGRGVL 185
CG++ N G LG G ++ R Y L+ + ++
Sbjct: 247 CGHD--NEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIV 304
Query: 186 FLGDGKVPSSGVAWTPMLQN-SADLKHYI------LGPAELLYSGKSCGLKDLT----LI 234
F G+ VP + V +TP+L N D +Y+ +G + + +S D T +I
Sbjct: 305 F-GNAAVPKTSV-FTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVI 362
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRD---LIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
DSG S T Y + +RD L T LK AP C F G T
Sbjct: 363 IDSGTSVTRLTQPAY-----VALRDAFRLGATKLKRAPSYSLFDTC----FDLSGMTTVK 413
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDK 350
+ F S +P YL+ ++ C +G +IIG I Q
Sbjct: 414 VPTVVFHFGGGEVS----LPASNYLIPVNTEGRFCFAF----AGTMGSLSIIGNIQQQGF 465
Query: 351 MVIYDNEKQRIGWKPEDC 368
V YD R+G+ C
Sbjct: 466 RVAYDLVGSRVGFLSRAC 483
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 97/377 (25%), Positives = 153/377 (40%), Gaps = 51/377 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG--CTKPPEKQYKPHKNI----VPCS 70
+ V L +G P DTGSDL+WVQC PC C + + P + VPC
Sbjct: 171 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCGAGECYAQKDPLFDPSSSSSYASVPCD 229
Query: 71 NPRC---AALHWPNPPRCKHPNDQ----CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ C AA + + C + C+Y IEYG+ ++ G T+ L+ V
Sbjct: 230 SDACRKLAAGAYGH--GCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTLKP---GVV 284
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
FGCG +QH GP D G+LGLG S+VSQ +C+ G
Sbjct: 285 VADFGFGCGDHQH--GPYEKFD--GLLGLGGAPESLVSQTSSQ--FGGPFSYCLPPTSGG 338
Query: 184 VLFLGDGKVP-------SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLT 232
FL G P +SG+++TPM + + YI+ + G +
Sbjct: 339 AGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSG 398
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
++ DSG + Y + S + L + L C+ F VT
Sbjct: 399 MVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYD--FTGHANVT--V 454
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILN-GSEAEVGENNIIGEIFMQDKM 351
++L+F+ ++ L P A +++ G CL G++ +G IIG + +
Sbjct: 455 PTISLTFSGGA-TIDLAAP--AGVLVDG----CLAFAGAGTDNAIG---IIGNVNQRTFE 504
Query: 352 VIYDNEKQRIGWKPEDC 368
V+YD+ K +G++ C
Sbjct: 505 VLYDSGKGTVGFRAGAC 521
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 69/144 (47%), Gaps = 20/144 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSN 71
YF L VG PPK DTGSD+ W+QC APC C + + P K + + C +
Sbjct: 174 YF-TRLGVGTPPKYVYMVLDTGSDVVWIQC-APCRKCYSQTDPVFDPKKSGSFSSISCRS 231
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
P C L + P C + C Y++ YGDG + G T+ R + VP + G
Sbjct: 232 PLCLRL---DSPGC-NSRQSCLYQVAYGDGSFTFGEFSTETLTFRGT-----RVPKVALG 282
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGR 154
CG++ N G AG+LGLGR
Sbjct: 283 CGHD--NEGLFV--GAAGLLGLGR 302
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 97/382 (25%), Positives = 143/382 (37%), Gaps = 51/382 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V +G P + DT +D TW C +PC C P + P + +PCS+
Sbjct: 81 YVVRAGLGSPSQQLLLALDTSADATWAHC-SPCGTC--PSSSLFAPANSSSYASLPCSSS 137
Query: 73 RCAALHWPNPPRCKHPNDQ---------CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
C P + D C + + D S AL +D LR ++
Sbjct: 138 WCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADA-SFQAALASDT--LRLGKDAIP 194
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR- 182
N TFGC + GP + G+LGLGRG ++++SQ L V +C+
Sbjct: 195 N--YTFGCVSSVT--GPTTNMPRQGLLGLGRGPMALLSQAGS--LYNGVFSYCLPSYRSY 248
Query: 183 ---GVLFLGDGKVPSSGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDLT 232
G L LG G V +TPML+N Y + G A + S T
Sbjct: 249 YFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGSFAFDAAT 308
Query: 233 ---LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
+ DSG +T+ VY + R + AP T G F
Sbjct: 309 GAGTVVDSGTVITRWTAPVYAALREEFRRQVA------APSGYT----SLGAFDTCFNTD 358
Query: 290 EYFKPLALSFT-NRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFM 347
E A + T + V L +P E L+ S + CL + + N+I +
Sbjct: 359 EVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQ 418
Query: 348 QDKMVIYDNEKQRIGWKPEDCN 369
Q+ V++D RIG+ E CN
Sbjct: 419 QNIRVVFDVANSRIGFAKESCN 440
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 147/378 (38%), Gaps = 54/378 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF + L VG P DTGSD+ W+QC +PC C + + P K+ VPC +
Sbjct: 138 YF-MRLGVGTPATNVYMVLDTGSDVVWLQC-SPCKACYNQSDVIFDPKKSKTFATVPCGS 195
Query: 72 PRCAALHWPNPPRC-KHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L + C + C Y++ YGDG + G T+ L F V +VPL G
Sbjct: 196 RLCRRLD--DSSECVTRRSKTCLYQVSYGDGSFTEGDFSTE--TLTFHGARVDHVPL--G 249
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-----EYGLIRNVIGHCIGQNGRGVL 185
CG++ N G LG G ++ R Y L+ + ++
Sbjct: 250 CGHD--NEGLFVGAAGLLGLGRGGLSFPSQTKSRYNGKFSYCLVDRTSSGSSSKPPSTIV 307
Query: 186 FLGDGKVPSSGVAWTPMLQN-SADLKHYI------LGPAELLYSGKSCGLKDLT----LI 234
F G+ VP + V +TP+L N D +Y+ +G + + +S D T +I
Sbjct: 308 F-GNDAVPKTSV-FTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVI 365
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRD---LIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
DSG S T Y + +RD L T LK AP C F G T
Sbjct: 366 IDSGTSVTRLTQSAY-----VALRDAFRLGATKLKRAPSYSLFDTC----FDLSGMTTVK 416
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDK 350
+ F S +P YL+ ++ C +G +IIG I Q
Sbjct: 417 VPTVVFHFGGGEVS----LPASNYLIPVNTEGRFCFAF----AGTMGSLSIIGNIQQQGF 468
Query: 351 MVIYDNEKQRIGWKPEDC 368
V YD R+G+ C
Sbjct: 469 RVAYDLVGSRVGFLSRAC 486
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 69/265 (26%), Positives = 115/265 (43%), Gaps = 36/265 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS TWV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTTWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P TFG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPSFTFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL--- 185
C + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 113 CNLDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSERGFFSKT 167
Query: 186 --FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDS 237
+ GKV + + V +T M+ + + + + A + G+ GL ++FDS
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 238 GASYAYFTSR----VYQEIVSLIMR 258
G+ +Y R + Q I L++R
Sbjct: 228 GSELSYIPDRALSVLSQRIRELLLR 252
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 62/125 (49%), Gaps = 12/125 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF+ + VG+P + DTGSD+TW+QC PC C + Y P + V C +
Sbjct: 163 YFS-RVGVGRPARQLYMVLDTGSDVTWLQCQ-PCADCYAQSDPVYDPSVSTSYATVGCDS 220
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
PRC L + C++ C YE+ YGDG ++G T+ L S V NV + GC
Sbjct: 221 PRCRDL---DAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLGDS-APVSNVAI--GC 274
Query: 132 GYNQH 136
G++
Sbjct: 275 GHDNE 279
>gi|326490597|dbj|BAJ89966.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 450
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 97/390 (24%), Positives = 148/390 (37%), Gaps = 51/390 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA- 75
V++ VG PP+ DTGS+L+ + C+ P + V CS+P C
Sbjct: 65 LTVSVVVGTPPQNVTMVLDTGSELSGLLCNGSSLSPPAPFNASASLTYSAVDCSSPACVW 124
Query: 76 -ALHWPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC-- 131
P P C P+ C I Y D S+ G LV D F L VP FGC
Sbjct: 125 RGRDLPVRPFCDAPPSTSCRVSISYADASSADGHLVADTFIL-----GTQAVPALFGCIT 179
Query: 132 GYNQH---NPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
Y+ N P + A G+LG+ RG +S V+Q +R +CI + L
Sbjct: 180 SYSSSTAINSSATDPSEAATGLLGMNRGSLSFVTQ---TATLR--FAYCIAPGQGPGILL 234
Query: 188 GDGKVPSS-GVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGLKDLT--- 232
G ++ + +TP+++ S L ++ I + LL KS D T
Sbjct: 235 LGGDGGAAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGSALLQIPKSVLTPDHTGAG 294
Query: 233 -LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK-------TLPICWRGPFKA 284
+ DSG + + + Y + + + LAP + C+RGP +
Sbjct: 295 QTMVDSGTQFTFLLADAYAALKAEFLNQARSL---LAPLGEPGFVFQGAFDACFRGPEER 351
Query: 285 LGQVTEYFKPLALSFTNRRNSV---RLV--VPPEAYLVISGRKNVCLGILNGSEAEVGEN 339
+ + + L +V +L+ VP E CL N A +
Sbjct: 352 VSAASRLLPEVGLVLRGAEVAVAGEKLLYSVPGERRGEEGAEAVWCLTFGNSDMAGM-SA 410
Query: 340 NIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+IG QD V YD + R+G+ P C
Sbjct: 411 YVIGHHHQQDVWVEYDLQNGRVGFAPARCE 440
>gi|297724243|ref|NP_001174485.1| Os05g0511050 [Oryza sativa Japonica Group]
gi|222632192|gb|EEE64324.1| hypothetical protein OsJ_19161 [Oryza sativa Japonica Group]
gi|255676482|dbj|BAH93213.1| Os05g0511050 [Oryza sativa Japonica Group]
Length = 432
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 95/430 (22%), Positives = 160/430 (37%), Gaps = 91/430 (21%)
Query: 12 PIFSY---FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA----PCTGCTKPPEKQYKPHK 64
P+ +Y + ++L +G PP++F DTGSDLTWV C C C
Sbjct: 17 PVTTYTDGYLLSLNLGMPPQVFQVYLDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIPS 76
Query: 65 NIVP---------CSNPRCAALHWPNPPR-------CKHPNDQCD--------YEIEYGD 100
C + C +H + C P+ D + YG
Sbjct: 77 FSPSQSSSNMKELCGSRFCVDIHSSDNSHDPCAAVGCAIPSFMSDLCTRPCPPFSYTYGG 136
Query: 101 GGSSIGALVTDLFPLRFSNGSVFNVPL-------TFGC-GYNQHNPGPLSPPDTAGVLGL 152
G +G+L D+ L +GS+F + + FGC G + P G+ G
Sbjct: 137 GALVLGSLAKDIVTL---HGSIFGIAILLDVPGFCFGCVGSSIREP--------IGIAGF 185
Query: 153 GRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGVLFLGDGKVPS-SGVAWTPMLQ 204
G+G +S+ SQL G + HC N L +GD + + +TPML+
Sbjct: 186 GKGILSLPSQL---GFLDKGFSHCFLGFRFARNPNFTSSLIMGDLALSAKDDFLFTPMLK 242
Query: 205 NSADLKHYILGPAELLYSGKSCGLK------------DLTLIFDSGASYAYFTSRVYQEI 252
+ + Y +G E + G + + +I D+G +Y + Y I
Sbjct: 243 SITNPNFYYIG-LEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFYTAI 301
Query: 253 VSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPP 312
+S + ++ +C++ P + + F V+L +P
Sbjct: 302 LSSLASVILYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFL---GDVKLTLPK 358
Query: 313 EA--YLVISGRKNVCLGILNGSE------------AEVGENNIIGEIFMQDKMVIYDNEK 358
++ Y V + + +V + L A G ++G MQ+ V+YD E
Sbjct: 359 DSCYYAVTAPKNSVVVKCLLFQRMDNDDDDDDVGGANNGPGAVLGSFQMQNVEVVYDMEA 418
Query: 359 QRIGWKPEDC 368
RIG++P+DC
Sbjct: 419 GRIGFQPKDC 428
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 66/125 (52%), Gaps = 15/125 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSN 71
YF+ + +GKPP DTGSD+ WVQC APC C + + ++P + + C+
Sbjct: 149 YFS-RVGIGKPPSQAYLILDTGSDVNWVQC-APCADCYQQADPIFEPASSASFSTLSCNT 206
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
+C +L + C+ ND C YE+ YGDG ++G VT+ L + V NV + GC
Sbjct: 207 RQCRSL---DVSECR--NDTCLYEVSYGDGSYTVGDFVTETITL--GSAPVDNVAI--GC 257
Query: 132 GYNQH 136
G+N
Sbjct: 258 GHNNE 262
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 101/400 (25%), Positives = 166/400 (41%), Gaps = 58/400 (14%)
Query: 13 IFSYFAVNLTVGKP-PKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IV 67
I S + ++L++G P P+ DTGSDL W QC C C P + + V
Sbjct: 96 IDSEYLIHLSIGTPRPQRVALTLDTGSDLVWTQC--ACHVCFAQPFPTFDALASQTTLAV 153
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRF---SNGSVFN 124
PCS+P C + +P C ++ C Y +Y D + G +V D F R +NGS +
Sbjct: 154 PCSDPICTSGKYP-LSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAH 212
Query: 125 ----VP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--- 176
VP + FGCG Q+N G + + +G+ G RG +S+ SQL+ + R HC
Sbjct: 213 AGVAVPNVRFGCG--QYNKG-IFKSNESGIAGFSRGPMSLPSQLK---VAR--FSHCFTA 264
Query: 177 IGQNGRGVLFLGDGKVP-------SSGVAWTPMLQNSADLKHYILGPA----------EL 219
I +FLG P + V TP ++ L + L L
Sbjct: 265 IADARTSPVFLGGAPGPDNLGAHATGPVQSTPFANSNGSLYYLTLKGITVGKTRLPLNAL 324
Query: 220 LYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEI-VSLIMRDLIGTPLKLAPDDKTLPICW 278
++GK G I DSG +Y+ + + + R + + A D ++ +C+
Sbjct: 325 AFAGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARVKLPVANESAADAEST-LCF 383
Query: 279 RGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVI-------SGRKNVCLGILNG 331
++ E P +P E+Y++ SG +CL + +
Sbjct: 384 EAA-RSASLPPEAPAPALPKVVLHVAGADWDLPRESYVLDLLEDEDGSG-SGLCLVMNSA 441
Query: 332 SEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
++++ IIG Q+ V YD EK ++ + P C+ +
Sbjct: 442 GDSDL---TIIGNFQQQNMHVAYDLEKNKLVFVPARCDKM 478
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 93/399 (23%), Positives = 160/399 (40%), Gaps = 55/399 (13%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCD----------APCTGCTKPPEKQYKP 62
I YF V VG P + F DTGSDLTWV+C + + P + ++P
Sbjct: 92 IGQYF-VRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRP 150
Query: 63 HKNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS 118
K+ +PC++ C+ + C P C Y+ Y DG ++ G + T+ + S
Sbjct: 151 EKSKTWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALS 210
Query: 119 NGSVFN---------VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ-LREYG- 167
+ S + L GC + P S + GVL LG +S S +G
Sbjct: 211 SSSSSSKNKVKKAKLQGLVLGCTGSYTGP---SFEASDGVLSLGYSNVSFASHAASRFGG 267
Query: 168 -LIRNVIGHCIGQNGRGVLFLG-----DGKVPSS---GVAWTPMLQNSADLKHYILGPAE 218
++ H +N L G G P++ G TP++ +S Y +
Sbjct: 268 RFSYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKA 327
Query: 219 LLYSGKSCGL-KDL-------TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD 270
+ G+ + +D+ +I DSG S Y+ +V+ + + L P ++A D
Sbjct: 328 ISVDGELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFP-RVAMD 386
Query: 271 DKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKNVCLGILN 330
W P + + LA+ F S RL P ++Y++ + C+G+
Sbjct: 387 PFEYCYNWTSPSRK--DEGDDLPKLAVHFA---GSARLEPPSKSYVIDAAPGVKCIGVQE 441
Query: 331 GSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
G + ++IG I Q+ + +D + +R+ +K C
Sbjct: 442 GPWPGI---SVIGNILQQEHLWEFDLKNRRLRFKRSRCT 477
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 145/378 (38%), Gaps = 63/378 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKN----IVPCSN 71
F V + G P + FDTGSDL+W+QC PC+G C K + + P K+ +VPC
Sbjct: 112 FVVVVGFGSPAQTSATMFDTGSDLSWIQCQ-PCSGHCYKQHDPVFDPAKSSSYAVVPCGT 170
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
CAA C C Y +EYGDG S+ G L + L FS+ S F FGC
Sbjct: 171 TECAAAGG----ECN--GTTCVYGVEYGDGSSTTGVLARET--LTFSSSSEFT-GFIFGC 221
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G + N G D LG S+ + + +C+ +L G
Sbjct: 222 G--ETNLGDFGEVDGLLGLGR----GSLSLSSQAAPAFGGIFSYCLPSYNTTPGYLSIGA 275
Query: 192 VPSSG---VAWTPMLQN------------SADLKHYIL--GPAELLYSGKSCGLKDLTLI 234
P +G V +T M+ S ++ Y+L P+E +G +
Sbjct: 276 TPVTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKTGT---------L 326
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG Y Y + + G+ K AP L C+ GQ
Sbjct: 327 LDSGTILTYLPPPAYTALRDRFKFTMQGS--KPAPPYDELDTCY----DFTGQSGILIPG 380
Query: 295 LALSFTN----RRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDK 350
++ +F++ N ++ P+ G CL + S +++G +
Sbjct: 381 VSFNFSDGAVFNLNFFGIMTFPDDTKPAVG----CLAFV--SRPADMPFSVVGSTTQRSA 434
Query: 351 MVIYDNEKQRIGWKPEDC 368
VIYD Q+IG+ P C
Sbjct: 435 EVIYDVPAQKIGFIPASC 452
>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
Length = 467
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 103/413 (24%), Positives = 150/413 (36%), Gaps = 74/413 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN--------IVP 68
V + VG PP+ DTGS+L+W+ C+ T PP+ Q N
Sbjct: 59 LTVPVAVGAPPQNVTMVLDTGSELSWLLCNGSRVPST-PPQPQAPAAFNGSASSTYAAAH 117
Query: 69 C-SNPRCAALHW-----PNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS 121
C S+P C W P PP C P++ C + Y D S+ G L D F L G
Sbjct: 118 CSSSPEC---QWRGRDLPVPPFCAGPPSNSCRVSLSYADASSADGVLAADTFLL----GG 170
Query: 122 VFNVPLTFGC-------------GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGL 168
V FGC G S G+LG+ RG +S V+Q G
Sbjct: 171 APPVRALFGCITSYSSSSTADGNGNGNDASATNSSEAATGLLGMNRGSLSFVTQT---GT 227
Query: 169 IRNVIGHCIG-QNGRGVLFL---GDGKVPSSG--VAWTPMLQNSADLKHY---------- 212
+R +CI +G G+L L GDG S+ + +TP+++ S L ++
Sbjct: 228 LR--FAYCIAPGDGPGLLVLGGDGDGAALSAAPQLNYTPLIEMSQPLPYFDRVAYSVQLE 285
Query: 213 -ILGPAELLYSGKSCGLKDLT----LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKL 267
I A LL KS D T + DSG + + + Y + + L
Sbjct: 286 GIRVGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPL 345
Query: 268 APDD----KTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN 323
D C+R + T + R V + Y+V R+
Sbjct: 346 GEPDFVFQGAFDACFRASEARVAAATASQLLPEVGLVLRGAEVAVGGEKLLYMVPGERRG 405
Query: 324 V-------CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
CL N A + +IG Q+ V YD + R+G+ P C+
Sbjct: 406 EGGSEAVWCLTFGNSDMAGM-SAYVIGHHHQQNVWVEYDLQNSRVGFAPARCD 457
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 143/382 (37%), Gaps = 51/382 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V +G P + DT +D TW C +PC C P + P + +PCS+
Sbjct: 79 YVVRAGLGSPSQQLLLALDTSADATWAHC-SPCGTC--PSSSLFAPANSSSYASLPCSSS 135
Query: 73 RCAALHWPNPPRCKHPNDQ---------CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
C P + D C + + D S AL +D LR ++
Sbjct: 136 WCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADA-SFQAALASDT--LRLGKDAIP 192
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR- 182
N TFGC + GP + G+LGLGRG ++++SQ L V +C+
Sbjct: 193 N--YTFGCVSSVT--GPTTNMPRQGLLGLGRGPMALLSQAGS--LYNGVFSYCLPSYRSY 246
Query: 183 ---GVLFLGDGKVPSSGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDLT 232
G L LG G V +TPML+N Y + G A + S T
Sbjct: 247 YFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAAT 306
Query: 233 ---LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
+ DSG +T+ VY + R + AP T G F
Sbjct: 307 GAGTVVDSGTVITRWTAPVYAALREEFRRQVA------APSGYT----SLGAFDTCFNTD 356
Query: 290 EYFKPLALSFT-NRRNSVRLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFM 347
E A + T + V L +P E L+ S + CL + + N+I +
Sbjct: 357 EVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQ 416
Query: 348 QDKMVIYDNEKQRIGWKPEDCN 369
Q+ V++D R+G+ E CN
Sbjct: 417 QNIRVVFDVANSRVGFAKESCN 438
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 103/412 (25%), Positives = 161/412 (39%), Gaps = 80/412 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKPPEKQ---YKPHKN----IV 67
+A ++G PP+ DTGS LTWV C + C C+ P + P + +V
Sbjct: 99 YAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLV 158
Query: 68 PCSNPRCAALH--------------WPNPPRC-KHPNDQC-DYEIEYGDGGSSIGALVTD 111
C NP C +H P C ++ C Y + YG GS+ G L+ D
Sbjct: 159 GCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGS-GSTAGLLIAD 217
Query: 112 LF--PLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR----E 165
P R G V L + H P +G+ G GRG S+ +QL
Sbjct: 218 TLRAPGRAVPGFVLGCSLV-----SVHQP-------PSGLAGFGRGAPSVPAQLGLPKFS 265
Query: 166 YGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLK-----HYILGPAELL 220
Y L+ +G VL G+ + P+++++A K +Y L +
Sbjct: 266 YCLLSRRFDDNAAVSGSLVLGG---TGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVT 322
Query: 221 YSGKSCGLKDLTL----------IFDSGASYAYFTSRVYQEIVSLIMRDLIG--TPLKLA 268
GK+ L I DSG ++ Y V+Q + ++ + G K A
Sbjct: 323 VGGKAVRLPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDA 382
Query: 269 PDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGR---KNVC 325
D L C+ AL Q LSF +V + +P E Y V++GR + +C
Sbjct: 383 EDGLGLHPCF-----ALPQGARSMALPELSFHFEGGAV-MQLPVENYFVVAGRGAVEAIC 436
Query: 326 LGILN-------GSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNT 370
L ++ G I+G Q+ +V YD EK+R+G++ + C +
Sbjct: 437 LAVVTDFGGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCTS 488
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 149/376 (39%), Gaps = 61/376 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG---CTKPPEKQYKPHKN----IVPC 69
+ V ++G P + DTGSDL+WVQC PC+ C + + P ++ VPC
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCSAAPSCYSQKDPLFDPAQSSSYAAVPC 198
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
P CA L QC Y + YGDG ++ G +D L S+ F
Sbjct: 199 GGPVCAGLGIYA--ASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQG---FFF 253
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVL 185
GCG+ Q G + D G+LGLGR + S+V Q G V +C+ G L
Sbjct: 254 GCGHAQS--GLFNGVD--GLLGLGREQPSLVEQ--TAGTYGGVFSYCLPTKPSTAGYLTL 307
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------IFD 236
LG + G + T +L + +Y+ ++ +G S G + L++ + D
Sbjct: 308 GLGGPSGAAPGFSTTQLLPSPNAPTYYV-----VMLTGISVGGQQLSVPASAFAGGTVVD 362
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
+G Y + S + AP + L C+ F G VT +A
Sbjct: 363 TGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYN--FAGYGTVT--LPNVA 418
Query: 297 LSFTNRRNSVRLVVPPEAYLVISGRKNVCLGIL----NGSEAEVGENNIIGEIFMQDKMV 352
L+F + A +++ + G L +GS+ G I+G + + V
Sbjct: 419 LTFGS-----------GATVMLGADGILSFGCLAFAPSGSD---GGMAILGNVQQRSFEV 464
Query: 353 IYDNEKQRIGWKPEDC 368
D +G+KP C
Sbjct: 465 RIDGTS--VGFKPSSC 478
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 86/354 (24%), Positives = 144/354 (40%), Gaps = 44/354 (12%)
Query: 34 FDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALHWPNPPRCKHPN 89
FDTGSDL+W+QC PC C + P ++ VPC + C +P R +
Sbjct: 105 FDTGSDLSWLQC-TPCKTCYPQEAPLFDPTQSSTYVDVPCESQPCTL--FPQNQRECGSS 161
Query: 90 DQCDYEIEYGDGGSSIGALVTDLFPLRFSN------GSVFNVPLTFGCGYNQHNPGPLSP 143
QC Y +YG +IG L D + FS+ G+ F + FGC + + +S
Sbjct: 162 KQCIYLHQYGTDSFTIGRLGYDT--ISFSSTGMGQGGATFPKSV-FGCAFYSNFTFKIS- 217
Query: 144 PDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDGKVPSSGVAWT 200
G +GLG G +S+ SQL + I + +C+ G L G P++ V T
Sbjct: 218 TKANGFVGLGPGPLSLASQLGDQ--IGHKFSYCMVPFSSTSTGKLKFGS-MAPTNEVVST 274
Query: 201 PMLQNSADLKHYILGPAELLYSGKSC--GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMR 258
P + N + +Y+L + K G +I DS + +Y + +S +
Sbjct: 275 PFMINPSYPSYYVLNLEGITVGQKKVLTGQIGGNIIIDSVPILTHLEQGIYTDFISSVKE 334
Query: 259 DLIGTPLKLAPDDKT-LPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+ +++A D T C R P F FT +V+ P+ +
Sbjct: 335 AI---NVEVAEDAPTPFEYCVRNP------TNLNFPEFVFHFTG----ADVVLGPKNMFI 381
Query: 318 ISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCNTL 371
VC+ ++ +I G + V YD ++++ + P +C+T+
Sbjct: 382 ALDNNLVCMTVVPSKGI-----SIFGNWAQVNFQVEYDLGEKKVSFAPTNCSTI 430
>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 389
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 98/364 (26%), Positives = 151/364 (41%), Gaps = 44/364 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC-TKPPEKQYKPHKNIVPCSNPRCA 75
F + G P K DTGS LTW QC PC+ C + +Y+P +I + C
Sbjct: 58 FMAEIHFGSPQKKQFLHMDTGSSLTWTQC-FPCSDCYAQKIYPKYRPAASIT-YRDAMCE 115
Query: 76 ALH-WPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCG 132
H NP P + C Y+ Y D + G L ++ + +G V + FGC
Sbjct: 116 DSHPKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVHGVYFGC- 174
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ----NGRGVLFLG 188
N + G S G+LGLG G+ SI+ E+G + C+G+ L LG
Sbjct: 175 -NTLSDG--SYFTGTGILGLGVGKYSIIG---EFG---SKFSFCLGEISEPKASHNLILG 225
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIF-DSGASYAYFTSR 247
DG + V P + N + H I E + G+ L D +F D+G++ ++ ++
Sbjct: 226 DG----ANVQGHPTVINITE-GHTIF-QLESIIVGEEITLDDPVQVFVDTGSTLSHLSTN 279
Query: 248 VYQEIVSLIMRDLIGT-PLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSV 306
+Y + V DLIG+ PL P +C++ E + + + F +
Sbjct: 280 LYYKFVDA-FDDLIGSRPLSYEP-----TLCYK------ADTIERLEKMDVGFKFDVGA- 326
Query: 307 RLVVPPEAYLVISGRKNV-CLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQRIGWKP 365
L V + G + CL I N E+ + IIG I MQ V YD +
Sbjct: 327 ELSVNIHNIFIQQGPPEIRCLAIQNNKES--FSHVIIGVIAMQGYNVGYDLSAKTAYINK 384
Query: 366 EDCN 369
+DC+
Sbjct: 385 QDCD 388
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 96/372 (25%), Positives = 145/372 (38%), Gaps = 47/372 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
F V++ G PP+ F DTGS +TW QC A C C K + H + + S +
Sbjct: 127 FLVDVAFGTPPQKFKLILDTGSSITWTQCKA-CVHCLKDSHR----HFDSLASSTYSFGS 181
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
C Y + YGD +S+G D L S+ VF FGCG N
Sbjct: 182 --------CIPSTVGNTYNMTYGDKSTSVGNYGCDTMTLEPSD--VFQ-KFQFGCGRN-- 228
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDGKVP-S 194
N G G+LGLG+G++S VSQ + V +C+ +N G L G+ S
Sbjct: 229 NEGDFGS-GADGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEENSIGSLLFGEKATSQS 285
Query: 195 SGVAWTPMLQ--NSADLKHYILGPAELLYSGKSCGLKDLTL----------IFDSGASYA 242
S + +T ++ ++ L+ +LL S G K L + I DSG
Sbjct: 286 SSLKFTSLVNGPGTSGLEESGYYFVKLL--DISVGNKRLNIPSSVFASPGTIIDSGTVIT 343
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLK--LAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
R Y + + + + PL ++ L C+ + + E
Sbjct: 344 RLPQRAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHFGDGAD 403
Query: 301 NRRNSVRLVVPPEAYLVISGRKNVCLGILNGSEAEVG-ENNIIGEIFMQDKMVIYDNEKQ 359
R N R+V +A +CL S++ + E IIG V+YD +
Sbjct: 404 VRLNGKRVVWGNDA-------SRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGR 456
Query: 360 RIGWKPEDCNTL 371
RIG+ C+ L
Sbjct: 457 RIGFGGNGCSNL 468
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 93/364 (25%), Positives = 142/364 (39%), Gaps = 46/364 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
F Y + L V PP DTGS L W++C P P Y +PC
Sbjct: 74 FEYL-MALDVSTPPVRMLALADTGSSLVWLKCKLP--AAHTPASSSYAR----LPCDAFA 126
Query: 74 CAALHWPNPPRCKHP---NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C AL + C+ N+ C Y + DG + G + D F F+ L FG
Sbjct: 127 CKALG--DAASCRATGSGNNICVYRYAFADGSCTAGPVTVDAF--------TFSTRLDFG 176
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNGRGVL 185
C LS PD G++GL G IS+VSQL + +C+ + L
Sbjct: 177 CATRTEG---LSVPDD-GLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYSSSETVSSSL 232
Query: 186 FLGDGKVPSS--GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT--LIFDSGASY 241
G + SS G A TP++ + Y + + +GK L+ T LI DSG
Sbjct: 233 NFGSHAIVSSSPGAATTPLVAGR-NKSFYTIALDSIKVAGKPVPLQTTTTKLIVDSGTML 291
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
Y V +V+ + I P +P + +C+ +A V + + L
Sbjct: 292 TYLPKAVLDPLVA-ALTAAIKLPRVKSP-ETLYAVCYDVRRRAPEDVGKSIPDVTLVLGG 349
Query: 302 RRNSVRLVVPP--EAYLVISGRKNVCLGILNGSEAEVGENNIIGEIFMQDKMVIYDNEKQ 359
VRL P ++V + VCL ++ E I+G + Q+ V +D E++
Sbjct: 350 -GGEVRL---PWGNTFVVENKGTTVCLALVESHLPEF----ILGNVAQQNLHVGFDLERR 401
Query: 360 RIGW 363
+ +
Sbjct: 402 TVSF 405
>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
Length = 450
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 101/395 (25%), Positives = 149/395 (37%), Gaps = 76/395 (19%)
Query: 2 YVSWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYK 61
YV+ I A NLTV DTGSDLTWVQC PC+ C + +
Sbjct: 103 YVTTIALGGGGSSRAGAGNLTV---------IVDTGSDLTWVQCK-PCSVCYAQRDPLFD 152
Query: 62 PHKN----IVPCSNPRC-AALHWPN--PPRCK--------HPNDQCDYEIEYGDGGSSIG 106
P + VPC+ C A+L P C +++C Y + YGDG S G
Sbjct: 153 PSGSASYAAVPCNASACEASLKAATGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRG 212
Query: 107 ALVTDLFPLRFSNGSVFNVPLTFGCGYNQ---HNPGPLSPPDTA---GVLGLGRGRISIV 160
L TD L ++ F FGCG + PG + TA G G G +S+
Sbjct: 213 VLATDTVALGGASVDGF----VFGCGLSNRGLRRPGSAASSPTASPPGTSGDAAGSLSLG 268
Query: 161 SQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILG---PA 217
Y RN ++ V++T M+ + A Y + +
Sbjct: 269 GDTSSY---RN----------------------ATPVSYTRMIADPAQPPFYFMNVTGAS 303
Query: 218 ELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPIC 277
+ + GL ++ DSG VY+ + + R AP L C
Sbjct: 304 VGGAAVAAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDAC 363
Query: 278 WRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN---VCLGILNGSEA 334
+ L E PL T R + + A ++ RK+ VCL + + S
Sbjct: 364 YN-----LTGHDEVKVPL---LTLRLEAGADMTVDAAGMLFMARKDGSQVCLAMASLSFE 415
Query: 335 EVGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+ + IIG ++K V+YD R+G+ EDC+
Sbjct: 416 D--QTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 448
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 103/404 (25%), Positives = 164/404 (40%), Gaps = 76/404 (18%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGC-------TKPPE--KQYKPHKN 65
++V+L+ G P + F FDTGS L W+ C + C+GC T P +
Sbjct: 90 YSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSK 149
Query: 66 IVPCSNPRCAALHWPNPPRCK--HPNDQ-CD-----YEIEYGDGGSSIGALVTDLFPLRF 117
I+ C +P+C L+ PN +C+ PN + C Y ++YG GS+ G L+T+ L F
Sbjct: 150 IIGCQSPKCQFLYGPN-VQCRGCDPNTRNCTVGCPPYILQYGL-GSTAGVLITE--KLDF 205
Query: 118 SNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
+ +V + GC +S AG+ G GRG +S+ SQ+ L R HC+
Sbjct: 206 PDLTVPD--FVVGCSI-------ISTRQPAGIAGFGRGPVSLPSQMN---LKR--FSHCL 251
Query: 178 ------GQNGRGVLFLGDGKVPSS-----GVAWTPM-----LQNSADLKHYILGPAELLY 221
N L L G +S G+ +TP + N A L++Y L +
Sbjct: 252 VSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYV 311
Query: 222 SGKSCGLK----------DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIG-TPLKLAPD 270
K + D I DSG+++ + V++ + + T K
Sbjct: 312 GRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEK 371
Query: 271 DKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISGRKN-VCLGIL 329
+ L C+ G VT L F + +L +P Y G + VCL ++
Sbjct: 372 ETGLGPCFN--ISGKGDVT--VPELIFEF---KGGAKLELPLSNYFTFVGNTDTVCLTVV 424
Query: 330 NGSEAE----VGENNIIGEIFMQDKMVIYDNEKQRIGWKPEDCN 369
+ G I+G Q+ +V YD E R G+ + C+
Sbjct: 425 SDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|168002493|ref|XP_001753948.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694924|gb|EDQ81270.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 602
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 99/456 (21%), Positives = 166/456 (36%), Gaps = 107/456 (23%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPR- 73
+ V + +GK + + DTGS ++WV C T+ P +KP + V C
Sbjct: 155 FVKVPIGLGKERQEYYMHIDTGSGISWVNCKGRGPITTEGPHGLFKPKADSYVNCKKQEE 214
Query: 74 -CAALHWPNPPRC-KHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C RC K + +C ++ +YGDG G +V S+GS + FGC
Sbjct: 215 FCKGFQDGEEHRCDKKHHFRCIFDTQYGDGLIIEGYIVMIDLIFDLSDGSESQADVAFGC 274
Query: 132 G------------------------------YNQHNPGPLSPPD--TAGVLGLGRGRISI 159
N L T G++GLG S
Sbjct: 275 ASTCPKFQVVKNTPHLSVKIASSFSIMCADKVNDEETKKLGQNTALTDGLIGLGPHPGSW 334
Query: 160 VSQLREYGLIRN-VIGHC----IGQNGRGVL---------FLGDG---KVPSSGVAWT-- 200
+ QL G I VI C +G++ + FL G + WT
Sbjct: 335 LHQLNMLGYISEYVIAICFEPDLGKSRHAAIGPELPEPAGFLSFGNPYSAQAESTIWTAN 394
Query: 201 -----------PMLQNSADLKHYILGPAELLYSGKSCGLKDLTLI--------------- 234
P NS +L++Y + +Y+G+ ++ ++
Sbjct: 395 IPSPEEYANPHPHEANSTNLQYY-----DAMYTGRLVSIRYRDIVIQLRGNEKKRKRDHP 449
Query: 235 ------FDSGASYAYFTSRVYQEIVSLIMRDL--IGTPLKLAPDD---KTLPICWRGPFK 283
FD+G+ Y T + + V+++ + +G + D+ CWR
Sbjct: 450 EGVQMGFDTGSDLTYLTRKTFDAFVTILDEEAKHLGYEITRDADEFVKDEQRKCWRKKSG 509
Query: 284 ALGQVTEYFKPLAL---SFTNRRNSVRLVVPPEAYLVI--SGRKN-VCLGILNGSEAEVG 337
E F + L +F LV+ P+ Y+ SGR++ C +L +E + G
Sbjct: 510 GEEPSVEDFGDMILEFATFAEDDTKSELVINPKYYITSEGSGRQHRTCFNMLKETEFDFG 569
Query: 338 ENNIIGEIFMQDKMVIYDNEKQRIGWKPED-CNTLL 372
+G M+ ++++DNE RIGW+ D C+ +L
Sbjct: 570 N---LGAEVMRGHLLLFDNELNRIGWRRVDSCSRVL 602
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 96/384 (25%), Positives = 147/384 (38%), Gaps = 74/384 (19%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF+ + +G P + DTGSD+TWVQC PC C + + + P + V C +
Sbjct: 169 YFS-RVGIGSPARELYMVLDTGSDVTWVQCQ-PCADCYQQSDPVFDPSLSASYAAVSCDS 226
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
PRC L + C++ C YE+ YGDG ++G T+ L S V NV + GC
Sbjct: 227 PRCRDL---DTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDST-PVTNVAI--GC 280
Query: 132 GYNQHN------------PGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIG 178
G++ GPLS P I + Y L+ R+
Sbjct: 281 GHDNEGLFVGAAGLLALGGGPLSFPS------------QISASTFSYCLVDRDSPAASTL 328
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL------KDLT 232
Q F DG + A P++++ Y + + + G++ + D T
Sbjct: 329 Q------FGADGAEADTVTA--PLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDAT 380
Query: 233 -----LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTP-LKLAPDDKTLPICWRGPFKALG 286
+I DSG + S Y + +R GTP L C+ L
Sbjct: 381 SGSGGVIVDSGTAVTRLQSSAYAALRDAFVR---GTPSLPRTSGVSLFDTCYD-----LS 432
Query: 287 QVTEYFKP-LALSFTNRRNSVRLVVPPEAYLV-ISGRKNVCLGILNGSEAEVGENNIIGE 344
T P ++L F L +P + YL+ + G CL + A +IIG
Sbjct: 433 DRTSVEVPAVSLRF---EGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAV----SIIGN 485
Query: 345 IFMQDKMVIYDNEKQRIGWKPEDC 368
+ Q V +D K +G+ P C
Sbjct: 486 VQQQGTRVSFDTAKGVVGFTPNKC 509
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 65/125 (52%), Gaps = 14/125 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF+ + VG+P K F DTGSD+ W+QC PCT C + + + P + +PC +
Sbjct: 155 YFS-RVGVGQPAKPFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPRSSSSFASLPCES 212
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
+C AL C+ +C Y++ YGDG ++G VT+ L F N + N + GC
Sbjct: 213 QQCQALETSG---CRA--SKCLYQVSYGDGSFTVGEFVTE--TLTFGNSGMIN-DVAVGC 264
Query: 132 GYNQH 136
G++
Sbjct: 265 GHDNE 269
>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
Length = 443
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 71/147 (48%), Gaps = 12/147 (8%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALH 78
+G P L DTGS+L W+QC PCT C + P ++ V +P C A+
Sbjct: 63 LGVPSTLVYGIADTGSELIWLQC-LPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVR 121
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCGYNQHN 137
+ C+ + C Y+ YGDG ++ G L TD+F ++ V LTFGC H+
Sbjct: 122 RIS---CREGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGC---SHD 175
Query: 138 PGPLSPPDTAGVLGLGRGRISIVSQLR 164
AGV+GL R S+VSQL+
Sbjct: 176 TKARLKGHQAGVVGLNRHPNSLVSQLK 202
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.142 0.459
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,934,547,657
Number of Sequences: 23463169
Number of extensions: 326664272
Number of successful extensions: 553695
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 424
Number of HSP's successfully gapped in prelim test: 1414
Number of HSP's that attempted gapping in prelim test: 548848
Number of HSP's gapped (non-prelim): 2234
length of query: 378
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 234
effective length of database: 8,980,499,031
effective search space: 2101436773254
effective search space used: 2101436773254
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)