BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019819
(335 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 405
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 180/318 (56%), Positives = 230/318 (72%), Gaps = 6/318 (1%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCS 70
FP+ Y++V + +G PPK F FD DTGSDLTWVQCDAPC+GCT PP QYKP NI+PCS
Sbjct: 44 FPL-GYYSVLMQIGSPPKAFQFDIDTGSDLTWVQCDAPCSGCTLPPNLQYKPKGNIIPCS 102
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
NP C ALHWPN P C +P +QCDYE++Y D GSS+GALVTD FPL+ NGS P+ FG
Sbjct: 103 NPICTALHWPNKPHCPNPQEQCDYEVKYADQGSSMGALVTDQFPLKLVNGSFMQPPVAFG 162
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CGY+Q P PP TAGVLGLGRG+I +++QL GL RNV+GHC+ G G LF GD
Sbjct: 163 CGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSKGGGFLFFGDN 222
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
VPS GVAWTP+L HY GPA+LL++GK GLK L LIFD+G+SY YF S+ YQ
Sbjct: 223 LVPSIGVAWTPLLSQD---NHYTTGPADLLFNGKPTGLKGLKLIFDTGSSYTYFNSKAYQ 279
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRL 308
I++LI DL +PLK+A +DKTLPICW+G PFK++ +V +FK + ++FTN R + +L
Sbjct: 280 TIINLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLEVKNFFKTITINFTNGRRNTQL 339
Query: 309 VVPPEAYLVISVSTSIII 326
+ PE YL++S + ++ +
Sbjct: 340 YLAPELYLIVSKTGNVCL 357
>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
Length = 410
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 179/311 (57%), Positives = 225/311 (72%), Gaps = 6/311 (1%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCS 70
FP+ Y++V L +G PPK F+FD DTGSD+TWVQCDAPCTGC PP+ QYKP N VPCS
Sbjct: 49 FPL-GYYSVLLQIGNPPKAFEFDIDTGSDITWVQCDAPCTGCNLPPKLQYKPKGNTVPCS 107
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+P C ALH+PN P+C +P +QCDYE+ Y D GSS+GALV D FP + NGS L FG
Sbjct: 108 DPICLALHFPNNPQCPNPKEQCDYEVNYADQGSSMGALVIDQFPFKLLNGSAMQPRLAFG 167
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CGY+Q P PP TAGVLGLGRG+I +++QL GL RNV+GHC+ G G LF GD
Sbjct: 168 CGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSKGGGYLFFGDT 227
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
+PS GVAWTP+L HY GPAELL++GK GLK L LIFD+G+SY YF S+ YQ
Sbjct: 228 LIPSLGVAWTPLLPPD---NHYTTGPAELLFNGKPTGLKGLKLIFDTGSSYTYFNSKTYQ 284
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRL 308
IV+LI DL +PLK+A +DKTLPICW+G PFK++ +V +FK + ++FTN R + +L
Sbjct: 285 TIVNLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLEVKNFFKTITINFTNARRNTQL 344
Query: 309 VVPPEAYLVIS 319
+PPE+YL+IS
Sbjct: 345 QIPPESYLIIS 355
>gi|255558640|ref|XP_002520345.1| nucellin, putative [Ricinus communis]
gi|223540564|gb|EEF42131.1| nucellin, putative [Ricinus communis]
Length = 424
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 170/313 (54%), Positives = 222/313 (70%), Gaps = 4/313 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y++V+L +G PPKLF+ D DTGSDLTWVQCDAPCTGCTKP YKP N++ C +P C+
Sbjct: 66 YYSVSLYIGNPPKLFELDIDTGSDLTWVQCDAPCTGCTKPLHHLYKPRNNLLSCIDPLCS 125
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
A+ +C+ DQCDYEI+Y D GSS+G LVTD FPLR NGS +TFGCGY+Q
Sbjct: 126 AVQNSGTYQCQSATDQCDYEIQYADEGSSLGVLVTDYFPLRLMNGSFLRPKMTFGCGYDQ 185
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
+PGP++PP T GVLGLG G+ SI+SQL+ G++ NVIGHC+ + G G LF G VPS
Sbjct: 186 KSPGPVAPPPTTGVLGLGNGKTSIISQLQALGVMGNVIGHCLSRKGGGFLFFGQDPVPSF 245
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
G++W PM Q S D K+Y GPAELLY GK G K IFDSG+SY YF ++VYQ ++L
Sbjct: 246 GISWAPMSQKSLD-KYYASGPAELLYGGKPTGTKAEEFIFDSGSSYTYFNAQVYQSTLNL 304
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
I ++L G PL+ AP++K L ICW+G FK++ +V YFKP ALSFT + SV+L +PPE
Sbjct: 305 IRKELSGKPLRDAPEEKALAICWKGTKRFKSVNEVKSYFKPFALSFT-KAKSVQLQIPPE 363
Query: 314 AYLVISVSTSIII 326
YL+++ ++ +
Sbjct: 364 DYLIVTNDGNVCL 376
>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
Length = 410
Score = 355 bits (912), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 173/313 (55%), Positives = 223/313 (71%), Gaps = 5/313 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y++V L +G PPK FDFD DTGSDLTWVQCDAPC GCTKP +K YKP N+VPCSN C
Sbjct: 53 YYSVILNIGNPPKAFDFDIDTGSDLTWVQCDAPCKGCTKPRDKLYKPKNNLVPCSNSLCQ 112
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
A+ C P+DQCDYEIEY D GSSIG L++D FPLR SNG++ + FGCGY+Q
Sbjct: 113 AVSTGENYHCDAPDDQCDYEIEYADLGSSIGVLLSDSFPLRLSNGTLLQPKMAFGCGYDQ 172
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
+ GP PPDTAG+LGLGRG++SI+SQLR G+ +NV+GHC + G LF GD PSS
Sbjct: 173 KHLGPHPPPDTAGILGLGRGKVSILSQLRTLGITQNVVGHCFSRARGGFLFFGDHLFPSS 232
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
+ WTPML++S+D Y GPAELL+ GK G+K L LIFDSG+SY YF ++VYQ I++L
Sbjct: 233 RITWTPMLRSSSD-TLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQSILNL 291
Query: 256 IMRDLIGTPLKLAPDDKTLPICWR--GPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ +DL G PLK AP +K L +CW+ P K++ + YFKPL +SF N +N V+L + PE
Sbjct: 292 VRKDLAGKPLKDAP-EKELAVCWKTAKPIKSILDIKSYFKPLTISFMNAKN-VQLQLAPE 349
Query: 314 AYLVISVSTSIII 326
YL+I+ ++ +
Sbjct: 350 DYLIITKDGNVCL 362
>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 466
Score = 353 bits (906), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 170/308 (55%), Positives = 218/308 (70%), Gaps = 3/308 (0%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V L +G PPKLFD D DTGSDLTWVQCDAPC GCTKP KQYKP+ N +PCS+
Sbjct: 64 LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQYKPNHNTLPCSHIL 123
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C+ L P C P DQCDYEI Y D SSIGALVTD PL+ +NGS+ N+ LTFGCGY
Sbjct: 124 CSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGY 183
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q NPGP PP TAG+LGLGRG++ + +QL+ G+ +NVI HC+ G+G L +GD VP
Sbjct: 184 DQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVP 243
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SSGV WT + NS K+Y+ GPAELL++ K+ G+K + ++FDSG+SY YF + YQ I+
Sbjct: 244 SSGVTWTSLATNSPS-KNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAIL 302
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
LI +DL G PL DDK+LP+CW+G P K+L +V +YFK + L F N++N VP
Sbjct: 303 DLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVP 362
Query: 312 PEAYLVIS 319
PE+YL+I+
Sbjct: 363 PESYLIIT 370
>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 432
Score = 353 bits (905), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 170/308 (55%), Positives = 218/308 (70%), Gaps = 3/308 (0%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V L +G PPKLFD D DTGSDLTWVQCDAPC GCTKP KQYKP+ N +PCS+
Sbjct: 64 LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQYKPNHNTLPCSHIL 123
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C+ L P C P DQCDYEI Y D SSIGALVTD PL+ +NGS+ N+ LTFGCGY
Sbjct: 124 CSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGY 183
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q NPGP PP TAG+LGLGRG++ + +QL+ G+ +NVI HC+ G+G L +GD VP
Sbjct: 184 DQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVP 243
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SSGV WT + NS K+Y+ GPAELL++ K+ G+K + ++FDSG+SY YF + YQ I+
Sbjct: 244 SSGVTWTSLATNSPS-KNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAIL 302
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
LI +DL G PL DDK+LP+CW+G P K+L +V +YFK + L F N++N VP
Sbjct: 303 DLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVP 362
Query: 312 PEAYLVIS 319
PE+YL+I+
Sbjct: 363 PESYLIIT 370
>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
Length = 467
Score = 351 bits (901), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 171/315 (54%), Positives = 220/315 (69%), Gaps = 3/315 (0%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V L +G PPKLFD D DTGSDLTWVQCDAPC GCTKP KQYKP+ N +PCS+
Sbjct: 65 LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQYKPNHNTLPCSHLL 124
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C+ L C P DQCDYEI Y D SSIGALVTD FPL+ +NGS+ N LTFGCGY
Sbjct: 125 CSGLDLTQNRPCDDPEDQCDYEIGYSDHASSIGALVTDEFPLKLANGSIMNPHLTFGCGY 184
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q NPGP PP TAG+LGLGRG++ I +QL+ G+ +NVI HC+ G+G L +GD VP
Sbjct: 185 DQQNPGPHPPPPTAGILGLGRGKVGISTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVP 244
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SSGV WT + NSA K+Y+ GPAELL++ K+ G+K + ++FDSG+SY YF + YQ I+
Sbjct: 245 SSGVTWTSLATNSAS-KNYMTGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAIL 303
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
LI +DL G PL DDK+LP+CW+G P K+L +V +YFK + L F ++N VP
Sbjct: 304 DLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGYQKNGQLFQVP 363
Query: 312 PEAYLVISVSTSIII 326
PE+YL+I+ ++ +
Sbjct: 364 PESYLIITEKGNVCL 378
>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 452
Score = 351 bits (900), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 168/306 (54%), Positives = 218/306 (71%), Gaps = 4/306 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
++ V+L +G PPKL+D D D+GSDLTWVQCDAPC GCTKP ++ YKP+ N+V C + C+
Sbjct: 63 HYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKGCTKPRDQLYKPNHNLVQCVDQLCS 122
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+ C P+DQCDYE+EY D GSS+G LV D P +F+NGSV + FGCGY+Q
Sbjct: 123 EVQLSMEYTCASPDDQCDYEVEYADHGSSLGVLVRDYIPFQFTNGSVVRPRVAFGCGYDQ 182
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
G SPP T+GVLGLG GR SI+SQL GLI NV+GHC+ G G LF GD +PSS
Sbjct: 183 KYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIHNVVGHCLSARGGGFLFFGDDFIPSS 242
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
G+ WT ML +S++ KHY GPAEL+++GK+ +K L LIFDSG+SY YF S+ YQ +V L
Sbjct: 243 GIVWTSMLPSSSE-KHYSSGPAELVFNGKATVVKGLELIFDSGSSYTYFNSQAYQAVVDL 301
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ +DL G LK A DD +LPICW+G FK+L V +YFKPLALSFT + +++ +PPE
Sbjct: 302 VTQDLKGKQLKRATDDPSLPICWKGAKSFKSLSDVKKYFKPLALSFT-KTKILQMHLPPE 360
Query: 314 AYLVIS 319
AYL+I+
Sbjct: 361 AYLIIT 366
>gi|356509401|ref|XP_003523438.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 407
Score = 349 bits (895), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 174/310 (56%), Positives = 223/310 (71%), Gaps = 6/310 (1%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y++VNL +G PPK ++ D DTGSDLTWVQCDAPC GCT P ++QYKPH N+V C +P
Sbjct: 45 LGYYSVNLAIGNPPKAYELDIDTGSDLTWVQCDAPCKGCTLPRDRQYKPHGNLVKCVDPL 104
Query: 74 CAALH-WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
CAA+ PNPP C +PN+QCDYE+EY D GSS+G LV D+ PL+ +NG++ + L FGCG
Sbjct: 105 CAAIQSAPNPP-CVNPNEQCDYEVEYADQGSSLGVLVRDIIPLKLTNGTLTHSMLAFGCG 163
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
Y+Q + G PP AGVLGLG GR SI+SQL GLIRNV+GHC+ G G LF GD +
Sbjct: 164 YDQTHVGHNPPPSAAGVLGLGNGRASILSQLNSKGLIRNVVGHCLSGTGGGFLFFGDQLI 223
Query: 193 PSSGVAWTPMLQNSAD-LKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQE 251
P SGV WTP+LQ+S+ LKHY GPA++ ++GK+ +K L L FDSG+SY YF S ++
Sbjct: 224 PQSGVVWTPILQSSSSLLKHYKTGPADMFFNGKATSVKGLELTFDSGSSYTYFNSLAHKA 283
Query: 252 IVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLV 309
+V LI D+ G PL A +D +LPICW+G PFK+L VT FKPL LSFT +NS+
Sbjct: 284 LVDLITNDIKGKPLSRATEDPSLPICWKGPKPFKSLHDVTSNFKPLVLSFTKSKNSL-FQ 342
Query: 310 VPPEAYLVIS 319
VPPEAYL+++
Sbjct: 343 VPPEAYLIVT 352
>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 167/315 (53%), Positives = 217/315 (68%), Gaps = 4/315 (1%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V+L +G PPK++D D DTGSDLTWVQCDAPC GCT P + YKP+ N+V C +P
Sbjct: 61 LGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCQGCTIPRNRLYKPNGNLVKCGDPL 120
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C A+ C PN+QCDYE+EY D GSS+G L+ D PL+F+NGS+ L FGCGY
Sbjct: 121 CKAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNGSLARPILAFGCGY 180
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q + G TAGVLGLG G+ SI+SQL GLIRNV+GHC+ + G G LF GD VP
Sbjct: 181 DQKHVGHNPSASTAGVLGLGNGKTSILSQLHSLGLIRNVVGHCLSERGGGFLFFGDQLVP 240
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SGV WTP+LQ+S+ +HY GPA+L + K +K L LIFDSG+SY YF S+ ++ +V
Sbjct: 241 QSGVVWTPLLQSSS-TQHYKTGPADLFFDRKPTSVKGLQLIFDSGSSYTYFNSKAHKALV 299
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
+L+ DL G PL A +D +LPICWRG PFK+L VT FKPL LSFT +NS+ L +P
Sbjct: 300 NLVTNDLRGKPLSRATEDSSLPICWRGPKPFKSLHDVTSNFKPLLLSFTKSKNSL-LQLP 358
Query: 312 PEAYLVISVSTSIII 326
PEAYL+++ ++ +
Sbjct: 359 PEAYLIVTKHGNVCL 373
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 165/313 (52%), Positives = 221/313 (70%), Gaps = 7/313 (2%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
+++V L +G PPK FD D DTGSDLTWVQCDAPC GCTKP +K YKP N VPC++ C
Sbjct: 67 HYSVILNIGNPPKAFDLDIDTGSDLTWVQCDAPCKGCTKPLDKLYKPKNNRVPCASSLCQ 126
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
A+ N C P +QCDYE+EY D GSS+G L++D FPLR +NGS+ + FGCGY+Q
Sbjct: 127 AIQNNN---CDIPTEQCDYEVEYADLGSSLGVLLSDYFPLRLNNGSLLQPRIAFGCGYDQ 183
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
GP SPPDTAG+LGLGRG+ SI+SQLR G+ +NV+GHC + G LF GD +P S
Sbjct: 184 KYLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFSRVTGGFLFFGDHLLPPS 243
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
G+ WTPML++S+D Y GPAELL+ GK G+K L LIFDSG+SY YF ++VYQ I++L
Sbjct: 244 GITWTPMLRSSSD-TLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQSILNL 302
Query: 256 IMRDLIGTPLKLAPDDKTLPICWR--GPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ +DL G PLK AP++K L +CW+ P K++ + +FKPL ++F +N V+L + PE
Sbjct: 303 VRKDLSGMPLKDAPEEKALAVCWKTAKPIKSILDIKSFFKPLTINFIKAKN-VQLQLAPE 361
Query: 314 AYLVISVSTSIII 326
YL+I+ ++ +
Sbjct: 362 DYLIITKDGNVCL 374
>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
Length = 427
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 167/308 (54%), Positives = 215/308 (69%), Gaps = 8/308 (2%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V L +G PPKLFD D DTGSDLTWVQCDAPC GCTK YKP+ N +PCS+
Sbjct: 64 LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTK-----YKPNHNTLPCSHIL 118
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C+ L P C P DQCDYEI Y D SSIGALVTD PL+ +NGS+ N+ LTFGCGY
Sbjct: 119 CSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGY 178
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q NPGP PP TAG+LGLGRG++ + +QL+ G+ +NVI HC+ G+G L +GD VP
Sbjct: 179 DQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVP 238
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SSGV WT + NS K+Y+ GPAELL++ K+ G+K + ++FDSG+SY YF + YQ I+
Sbjct: 239 SSGVTWTSLATNSPS-KNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAIL 297
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
LI +DL G PL DDK+LP+CW+G P K+L +V +YFK + L F N++N VP
Sbjct: 298 DLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVP 357
Query: 312 PEAYLVIS 319
PE+YL+I+
Sbjct: 358 PESYLIIT 365
>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 454
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 167/315 (53%), Positives = 220/315 (69%), Gaps = 4/315 (1%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
++ V+L +G PPKL+D D D+GSDLTWVQCDAPC GCTKP ++ YKP+ N+V C +
Sbjct: 61 LGHYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKGCTKPRDQLYKPNHNLVQCVDQL 120
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C+ +H C P+D CDYE+EY D GSS+G LV D P +F+NGSV + FGCGY
Sbjct: 121 CSEVHLSMAYNCPSPDDPCDYEVEYADHGSSLGVLVRDYIPFQFTNGSVVRPRVAFGCGY 180
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q G SPP T+GVLGLG GR SI+SQL GLIRNV+GHC+ G G LF GD +P
Sbjct: 181 DQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIRNVVGHCLSAQGGGFLFFGDDFIP 240
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SSG+ WT ML +S+ KHY GPAEL+++GK+ +K L LIFDSG+SY YF S+ YQ +V
Sbjct: 241 SSGIVWTSMLSSSS-EKHYSSGPAELVFNGKATAVKGLELIFDSGSSYTYFNSQAYQAVV 299
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
L+ +DL G LK A DD +LPICW+G F++L V +YFKPLALSF N +++ +P
Sbjct: 300 DLVTKDLKGKQLKRATDDPSLPICWKGAKSFESLSDVKKYFKPLALSFKKSXN-LQMHLP 358
Query: 312 PEAYLVISVSTSIII 326
PE+YL+I+ ++ +
Sbjct: 359 PESYLIITKHGNVCL 373
>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 170/315 (53%), Positives = 218/315 (69%), Gaps = 4/315 (1%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V+L +G PPK++D D DTGSDLTWVQCDAPC GCT P + YKPH ++V C +P
Sbjct: 61 LGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCKGCTLPRNRLYKPHGDLVKCVDPL 120
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
CAA+ C PN+QCDYE+EY D GSS+G L+ D PL+F+NGS+ L FGCGY
Sbjct: 121 CAAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNGSLARPMLAFGCGY 180
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q + G PP TAGVLGLG GR SI+SQL GLIRNV+GHC+ G G LF GD +P
Sbjct: 181 DQTHHGQNPPPSTAGVLGLGNGRTSILSQLHSLGLIRNVVGHCLSGRGGGFLFFGDQLIP 240
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SGV WTP+LQ+S+ +HY GPA+L + K+ +K L LIFDSG+SY YF S+ ++ +V
Sbjct: 241 PSGVVWTPLLQSSS-AQHYKTGPADLFFDRKTTSVKGLELIFDSGSSYTYFNSQAHKALV 299
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
+LI DL G PL A D +LPICW+G PFK+L VT FKPL LSFT +NS L +P
Sbjct: 300 NLIANDLRGKPLSRATGDPSLPICWKGPKPFKSLHDVTSNFKPLLLSFTKSKNS-PLQLP 358
Query: 312 PEAYLVISVSTSIII 326
PEAYL+++ ++ +
Sbjct: 359 PEAYLIVTKHGNVCL 373
>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 334 bits (857), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 162/306 (52%), Positives = 213/306 (69%), Gaps = 5/306 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y++V++ +GK + F+FD D+GSDLTWVQCDAPCT CTKP E+ YKP+ N + C P C
Sbjct: 54 YYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLYKPNNNALNCFEPLCT 113
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+LH CK +DQC YEIEY D GSS+G LV D PL+ +NGS+ + FGCGY+
Sbjct: 114 SLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTNGSLAAPRIAFGCGYDH 173
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
P S P TAGVLGLG G +S +SQL G++RNV+GHC+ G G LF GD VPSS
Sbjct: 174 KYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSDEG-GFLFFGDEFVPSS 232
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
GV WT M S +Y GPAE+ +SGK+ G+KDLTL+FDSG+SY YF S+ Y I++L
Sbjct: 233 GVTWTSMSHESIG-SYYSSGPAEVYFSGKATGIKDLTLVFDSGSSYTYFNSQAYNSILAL 291
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ +L G PL+ AP+DK+LP+CW+G PFK+L V +YF PLAL FT +N+ ++ +PPE
Sbjct: 292 VKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNPLALRFTKTKNA-QIQLPPE 350
Query: 314 AYLVIS 319
YL+I+
Sbjct: 351 NYLIIT 356
>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 160/306 (52%), Positives = 211/306 (68%), Gaps = 5/306 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y++V++ +GK + F+FD D+GSDLTWVQCDAPCT CTKP E+ YKP+ N + C P C
Sbjct: 54 YYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLYKPNNNALNCFEPLCT 113
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+LH CK +DQC YEIEY D GSS+G LV D PL+ +NGS+ + FGCGY+
Sbjct: 114 SLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTNGSLAAPRIAFGCGYDH 173
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
P S P TAGVLGLG G +S +SQL G++RNV+GHC+ G G LF GD VPSS
Sbjct: 174 KYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSDEG-GFLFFGDEFVPSS 232
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
GV WT M S +Y GPAE+ + GK+ G+KDLTL+FDSG+SY YF S+ Y I++L
Sbjct: 233 GVTWTSMSHESIG-SYYSSGPAEVYFGGKATGIKDLTLVFDSGSSYTYFNSQAYNSILAL 291
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ +L G PL+ AP+DK+LP+CW+G PFK+L V +YF LAL FT +N+ ++ +PPE
Sbjct: 292 VKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNLLALRFTKTKNA-QIQLPPE 350
Query: 314 AYLVIS 319
YL+I+
Sbjct: 351 NYLIIT 356
>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 410
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 164/308 (53%), Positives = 211/308 (68%), Gaps = 6/308 (1%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
+F V++T+G PPK+F+ D DTGSDLTWVQCDAPCTGCT P ++ YKPH N+V C P
Sbjct: 52 LGHFTVSVTIGNPPKVFELDIDTGSDLTWVQCDAPCTGCTLPHDRLYKPHNNVVRCGEPL 111
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C+AL + CK+PNDQCDYE+EY D GSSIG LV D PLR +NG++ L FGCGY
Sbjct: 112 CSALFSASKSPCKNPNDQCDYEVEYADHGSSIGVLVKDPVPLRLTNGTILAPNLGFGCGY 171
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+QHN G PP TAGVLGLG + ++ +QL +RNV+GHC G G LF G VP
Sbjct: 172 DQHNGGSQLPPLTAGVLGLGNSKATMATQLSALSHVRNVLGHCFSGQGGGFLFFGGDLVP 231
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SSG++W P+L+ Y GPAE+ + G G++ L L FDSG+SY YF S+VY ++
Sbjct: 232 SSGMSWMPILRTPGG--KYSAGPAEVYFGGNPVGIRGLILTFDSGSSYTYFNSQVYGAVL 289
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
+L+ L G PL+ AP+DKTLPICW+G FK++ V +FKPLALSF N + V+ +P
Sbjct: 290 NLLRNGLKGQPLRDAPEDKTLPICWKGSKAFKSVADVRNFFKPLALSFGNSK--VQFQIP 347
Query: 312 PEAYLVIS 319
PEAYL+IS
Sbjct: 348 PEAYLIIS 355
>gi|449449755|ref|XP_004142630.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500674|ref|XP_004161165.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 413
Score = 313 bits (803), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 161/306 (52%), Positives = 206/306 (67%), Gaps = 5/306 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
+F V L +G P K+F+ D DTGSDLTWVQCD C GCT P + Y+PH N V +P CA
Sbjct: 52 HFTVLLNIGNPSKVFELDIDTGSDLTWVQCDVECIGCTLPRDMLYRPHNNAVSREDPLCA 111
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
AL K+PNDQC YE+EY D GSS+G LV DL P+R +NG + L FGCGY+Q
Sbjct: 112 ALSSLGKFIFKNPNDQCAYEVEYADHGSSVGVLVKDLVPMRLTNGKRISPNLGFGCGYDQ 171
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
N PP AGVLGL + +IVSQL + G + NV+GHC+ G G LF G VPSS
Sbjct: 172 ENGDLQQPPSIAGVLGLSSSKATIVSQLSDLGHVSNVVGHCLTGRGGGFLFFGGDVVPSS 231
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
G++WTP+L+NS Y GPAE+ ++G++ G+ LTL FDSG+SY YF S+VY+ I L
Sbjct: 232 GMSWTPILRNSE--GKYSSGPAEVYFNGRAVGIGGLTLTFDSGSSYTYFNSQVYRAIEKL 289
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ DL G PLKLA DDKTL +CW+G PF+++ V +FKPLA+SF N +N V+ +PPE
Sbjct: 290 LKNDLKGNPLKLASDDKTLELCWKGPKPFESVVDVRNFFKPLAMSFKNSKN-VQFQIPPE 348
Query: 314 AYLVIS 319
AYL+IS
Sbjct: 349 AYLIIS 354
>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
Length = 420
Score = 313 bits (802), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 157/313 (50%), Positives = 210/313 (67%), Gaps = 12/313 (3%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V + +G+PP+ + D DTGSDLTW+QCDAPC C + P Y+P +++PC++P
Sbjct: 35 LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPL 94
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C ALH + RC+ P +QCDYE+EY DGGSS+G LV D+F + ++ G L GCGY
Sbjct: 95 CKALHLNSNQRCETP-EQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGY 153
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q PG S GVLGLGRG++SI+SQL G ++NVIGHC+ G G+LF GD
Sbjct: 154 DQ-IPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYD 212
Query: 194 SSGVAWTPMLQNSADLKHYILGPA---ELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
SS V+WTPM + + KHY PA ELL+ G++ GLK+L +FDSG+SY YF S+ YQ
Sbjct: 213 SSRVSWTPMSREYS--KHY--SPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQ 268
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVR 307
+ L+ R+L G PLK A DD TLP+CW+G PF ++ +V +YFKPLALSF T R+
Sbjct: 269 AVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTL 328
Query: 308 LVVPPEAYLVISV 320
+PPEAYL+ISV
Sbjct: 329 FEIPPEAYLIISV 341
>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 453
Score = 313 bits (802), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 158/306 (51%), Positives = 219/306 (71%), Gaps = 6/306 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
+++V+L +G PPK + D D+GSDLTW+QCDAPC CTK P YKP+K + C++P C+
Sbjct: 67 FYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPPYKPNKGPITCNDPMCS 126
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
ALHWP+ P CK ++QCDYE+ Y D GSS+G LV D+F L+ +NG++ L FGCGY+Q
Sbjct: 127 ALHWPSKPPCKASHEQCDYEVSYADHGSSLGVLVHDIFSLQLTNGTLAAPRLAFGCGYDQ 186
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
PGP +PP GVLGLG G+ SIV+QLR GLIR+++GHC+ G G LFLGDG +
Sbjct: 187 SYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGGGFLFLGDGLSTTP 246
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
G+ WTPM + S + Y LGPA+LL++G++ G+K L L+FDSG+SY YF ++ Y+ +SL
Sbjct: 247 GIIWTPMSRKSGE-SAYALGPADLLFNGQNSGVKGLRLVFDSGSSYTYFNAQAYKTTLSL 305
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ + L G + A D++LP+CWRG PFK++ +V YFKP ALSFT + S +L +PPE
Sbjct: 306 VRKYLNGKLKETA--DESLPVCWRGAKPFKSIFEVKNYFKPFALSFT-KAKSAQLQLPPE 362
Query: 314 AYLVIS 319
+YL+IS
Sbjct: 363 SYLIIS 368
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 156/319 (48%), Positives = 213/319 (66%), Gaps = 12/319 (3%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V + +G+PP+ + D DTGSDLTW+QCDAPC C + P Y+P +++PC++P
Sbjct: 57 LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPL 116
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C ALH + RC+ P +QCDYE+EY DGGSS+G LV D+F + ++ G L GCGY
Sbjct: 117 CKALHLNSNQRCETP-EQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGY 175
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q PG S GVLGLGRG++SI+SQL G ++NVIGHC+ G G+LF GD
Sbjct: 176 DQ-IPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYD 234
Query: 194 SSGVAWTPMLQNSADLKHYILGPA---ELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
SS V+WTPM + + KHY PA ELL+ G++ GLK+L +FDSG+SY YF S+ YQ
Sbjct: 235 SSRVSWTPMSREYS--KHY--SPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQ 290
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVR 307
+ L+ R+L G PLK A DD TLP+CW+G PF ++ +V +YFKPLALSF T R+
Sbjct: 291 AVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTL 350
Query: 308 LVVPPEAYLVISVSTSIII 326
+PPEAYL+IS+ ++ +
Sbjct: 351 FEIPPEAYLIISMKGNVCL 369
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 156/319 (48%), Positives = 213/319 (66%), Gaps = 12/319 (3%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V + +G+PP+ + D DTGSDLTW+QCDAPC C + P Y+P +++PC++P
Sbjct: 57 LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPL 116
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C ALH + RC+ P +QCDYE+EY DGGSS+G LV D+F + ++ G L GCGY
Sbjct: 117 CKALHLNSNQRCETP-EQCDYEVEYADGGSSLGVLVRDVFSMNYTKGLRLTPRLALGCGY 175
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q PG S GVLGLGRG++SI+SQL G ++NVIGHC+ G G+LF GD
Sbjct: 176 DQ-IPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYD 234
Query: 194 SSGVAWTPMLQNSADLKHYILGPA---ELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
SS V+WTPM + + KHY PA ELL+ G++ GLK+L +FDSG+SY YF S+ YQ
Sbjct: 235 SSRVSWTPMSREYS--KHY--SPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQ 290
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVR 307
+ L+ R+L G PLK A DD TLP+CW+G PF ++ +V +YFKPLALSF T R+
Sbjct: 291 AVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTL 350
Query: 308 LVVPPEAYLVISVSTSIII 326
+PPEAYL+IS+ ++ +
Sbjct: 351 FEIPPEAYLIISMKGNVCL 369
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 156/319 (48%), Positives = 213/319 (66%), Gaps = 12/319 (3%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V + +G+PP+ + D DTGSDLTW+QCDAPC C + P Y+P +++PC++P
Sbjct: 45 LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPL 104
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C ALH + RC+ P +QCDYE+EY DGGSS+G LV D+F + ++ G L GCGY
Sbjct: 105 CKALHLNSNQRCETP-EQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGY 163
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q PG S GVLGLGRG++SI+SQL G ++NVIGHC+ G G+LF GD
Sbjct: 164 DQ-IPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYD 222
Query: 194 SSGVAWTPMLQNSADLKHYILGPA---ELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
SS V+WTPM + + KHY PA ELL+ G++ GLK+L +FDSG+SY YF S+ YQ
Sbjct: 223 SSRVSWTPMSREYS--KHY--SPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQ 278
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVR 307
+ L+ R+L G PLK A DD TLP+CW+G PF ++ +V +YFKPLALSF T R+
Sbjct: 279 AVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTL 338
Query: 308 LVVPPEAYLVISVSTSIII 326
+PPEAYL+IS+ ++ +
Sbjct: 339 FEIPPEAYLIISMKGNVCL 357
>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 158/306 (51%), Positives = 219/306 (71%), Gaps = 6/306 (1%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
+++V+L +G PPK + D D+GSDLTW+QCDAPC CTK P YKP+K + C++P C+
Sbjct: 34 FYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPPYKPNKGPITCNDPMCS 93
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
ALHWP+ P CK ++QCDYE+ Y D GSS+G LV D+F L+ +NG++ L FGCGY+Q
Sbjct: 94 ALHWPSKPPCKASHEQCDYEVSYADHGSSLGVLVHDIFSLQLTNGTLAAPRLAFGCGYDQ 153
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
PGP +PP GVLGLG G+ SIV+QLR GLIR+++GHC+ G G LFLGDG +
Sbjct: 154 SYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGGGFLFLGDGLSTTP 213
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
G+ WTPM + S + Y LGPA+LL++G++ G+K L L+FDSG+SY YF ++ Y+ +SL
Sbjct: 214 GIIWTPMSRKSGE-SAYALGPADLLFNGQNSGVKGLRLVFDSGSSYTYFNAQAYKTTLSL 272
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ + L G + A D++LP+CWRG PFK++ +V YFKP ALSFT + S +L +PPE
Sbjct: 273 VRKYLNGKLKETA--DESLPVCWRGAKPFKSIFEVKNYFKPFALSFT-KAKSAQLQLPPE 329
Query: 314 AYLVIS 319
+YL+IS
Sbjct: 330 SYLIIS 335
>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 401
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 156/319 (48%), Positives = 213/319 (66%), Gaps = 12/319 (3%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V + +G+PP+ + D DTGSDLTW+QCDAPC C + P Y+P +++PC++P
Sbjct: 54 LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPL 113
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C ALH + RC+ P +QCDYE+EY DGGSS+G LV D+F + ++ G L GCGY
Sbjct: 114 CKALHLNSNQRCETP-EQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGY 172
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q PG S GVLGLGRG++SI+SQL G ++NVIGHC+ G G+LF GD
Sbjct: 173 DQ-IPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYD 231
Query: 194 SSGVAWTPMLQNSADLKHYILGPA---ELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
SS V+WTPM + + KHY PA ELL+ G++ GLK+L +FDSG+SY YF S+ YQ
Sbjct: 232 SSRVSWTPMSREYS--KHY--SPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQ 287
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVR 307
+ L+ R+L G PLK A DD TLP+CW+G PF ++ +V +YFKPLALSF T R+
Sbjct: 288 AVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTL 347
Query: 308 LVVPPEAYLVISVSTSIII 326
+PPEAYL+IS+ ++ +
Sbjct: 348 FEIPPEAYLIISMKGNVCL 366
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 154/319 (48%), Positives = 213/319 (66%), Gaps = 12/319 (3%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V + +G+PP+ + D DTGSDLTW+QCDAPC C + P Y+P +++PC++P
Sbjct: 54 LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVHCLEAPHPLYQPSNDLIPCNDPL 113
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C ALH+ RC+ P +QCDYE+EY DGGSS+G LV D+F L ++ G L GCGY
Sbjct: 114 CKALHFNGNHRCETP-EQCDYEVEYADGGSSLGVLVRDVFSLNYTKGLRLTPRLALGCGY 172
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q PG GVLGLGRG++SI+SQL G ++NV+GHC+ G G+LF G+
Sbjct: 173 DQ-IPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVVGHCLSSLGGGILFFGNDLYD 231
Query: 194 SSGVAWTPMLQNSADLKHYILGPA---ELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
SS V+WTPM + ++ KHY PA ELL+ G++ GLK+L +FDSG+SY YF S+ YQ
Sbjct: 232 SSRVSWTPMARENS--KHY--SPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQ 287
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVR 307
+ L+ R+L G PLK A DD TLP+CW+G PF ++ +V +YFKPLALSF T R+
Sbjct: 288 AVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTL 347
Query: 308 LVVPPEAYLVISVSTSIII 326
+PPEAYL+IS+ ++ +
Sbjct: 348 FEIPPEAYLIISMKGNVCL 366
>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
Length = 426
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 161/314 (51%), Positives = 208/314 (66%), Gaps = 9/314 (2%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y+ V +G+PPK + D DTGSDLTW+QCDAPC CT P Y+P ++V C +P CA
Sbjct: 66 YYHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHPLYQPTNDLVVCKDPICA 125
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+LH P+ RC P DQCDYE+EY DGGSSIG LV DLFP+ ++G LT GCGY+Q
Sbjct: 126 SLH-PDNYRCDDP-DQCDYEVEYADGGSSIGVLVNDLFPVNLTSGMRARPRLTIGCGYDQ 183
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
P D GVLGLGRG SIV+QL GL+RNV+GHC + G G LF GD SS
Sbjct: 184 LPGIAYHPLD--GVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSRRGGGYLFFGDDIYDSS 241
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
V WTPM ++ LKHY G AEL+ +G+S GLK+L ++FDSG+SY YF ++ YQ ++S
Sbjct: 242 KVIWTPMSRDY--LKHYTPGFAELILNGRSSGLKNLLVVFDSGSSYTYFNTQTYQTLLSF 299
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVRLVVPP 312
I +DL G PLK A +D TLP+CWRG PFK++ +YFKPLALSF + + + +
Sbjct: 300 IKKDLHGKPLKEAVEDDTLPVCWRGKKPFKSIRDAKKYFKPLALSFGSGWKTKSQFEIQQ 359
Query: 313 EAYLVISVSTSIII 326
E+YL+IS S+ +
Sbjct: 360 ESYLIISSKGSVCL 373
>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 418
Score = 306 bits (785), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 158/320 (49%), Positives = 211/320 (65%), Gaps = 8/320 (2%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
++ V L VG+PPK + D DTGSDLTW+QCDAPC CT+ Y+P ++VPC +P C
Sbjct: 56 FYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCM 115
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+LH RC++P DQCDYE+EY DGGSS+G LV D+FPL +NG L GCGY+Q
Sbjct: 116 SLHSSMDHRCENP-DQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQ 174
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
+PG S G+LGLGRG +SIVSQL G++RNV+GHC G G LF GDG
Sbjct: 175 -DPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGIYDPY 233
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
+ WTPM ++ KHY G EL+++G+S GL++L ++FDSG+SY YF ++ YQ + SL
Sbjct: 234 RLVWTPMSRDYP--KHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSL 291
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN-RRNSVRLVVPP 312
+ R+L G PL+ A DD TLP+CWRG P K+L V +YFKPLALSF++ R+ +P
Sbjct: 292 LNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEIPT 351
Query: 313 EAYLVISVSTSIIIIAYLTG 332
E Y++IS S + + L G
Sbjct: 352 EGYMIIS-SMGNVCLGILNG 370
>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
Length = 393
Score = 306 bits (784), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 166/307 (54%), Positives = 205/307 (66%), Gaps = 9/307 (2%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y+ V L +G+P K + D DTGSDLTW+QCDAPC CT+ P Y+P N+VPC +P C
Sbjct: 33 YYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYYRPRNNLVPCMDPICQ 92
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+LH RC++P QCDYE+EY DGGSS G LVTD F L F++ + L GCGY+Q
Sbjct: 93 SLHSNGDHRCENPG-QCDYEVEYADGGSSFGVLVTDTFNLNFTSEKRHSPLLALGCGYDQ 151
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
G P D GVLGLG+G+ SIVSQL GL+RNVIGHC+ +G G LF GD SS
Sbjct: 152 FPGGSHHPID--GVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGHGGGFLFFGDDLYDSS 209
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
VAWTPM S D KHY G AEL + GK+ G K+L FDSGASY Y S+ YQ ++SL
Sbjct: 210 RVAWTPM---SPDAKHYSPGLAELTFDGKTTGFKNLLTTFDSGASYTYLNSQAYQGLISL 266
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNS-VRLVVPP 312
+ ++L G PL+ A DD+TLP+CW+G PFK++ V +YFK ALSFTN R S L PP
Sbjct: 267 LKKELSGKPLREALDDQTLPLCWKGRKPFKSIRDVKKYFKTFALSFTNERKSKTELEFPP 326
Query: 313 EAYLVIS 319
EAYL+IS
Sbjct: 327 EAYLIIS 333
>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
Length = 426
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 157/316 (49%), Positives = 210/316 (66%), Gaps = 9/316 (2%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V+L++G+PPK + D DTGSDL+W+QCDAPC CTK P Y+P+ N+V C +P
Sbjct: 64 LGYYYVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPCVRCTKAPHPLYRPNNNLVICKDPM 123
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
CA+LH P +C+HP +QCDYE+EY DGGSS+G LV D+FPL F+NG L GCGY
Sbjct: 124 CASLHPPG-YKCEHP-EQCDYEVEYADGGSSLGVLVKDVFPLNFTNGLRLAPRLALGCGY 181
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q P D GVLGLG+G+ SIVSQL G+IRNV+GHC+ G G LF GD
Sbjct: 182 DQIPGQSYHPLD--GVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSRGGGFLFFGDDLYD 239
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SS V WTPML++ HY G AEL+ GK+ K+L + FDSG+SY Y S YQ +V
Sbjct: 240 SSRVVWTPMLRDQH--THYSSGYAELILGGKTTVFKNLLVTFDSGSSYTYLNSLAYQALV 297
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVRLVV 310
L+ ++L P++ A DD+TLP+CWRG PFK++ V ++FKPLALSF R + +
Sbjct: 298 HLVRKELSEKPVREALDDQTLPLCWRGKRPFKSVRDVKKFFKPLALSFPGGGRTKTQYDI 357
Query: 311 PPEAYLVISVSTSIII 326
P E+YL+IS+ ++ +
Sbjct: 358 PLESYLIISLKGNVCL 373
>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
Length = 376
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 165/308 (53%), Positives = 204/308 (66%), Gaps = 10/308 (3%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y+ V L +G+P K + D DTGSDLTW+QCDAPC CT+ P Y+P N+VPC +P C
Sbjct: 19 YYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYYRPRNNLVPCMDPICQ 78
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG-CGYN 134
+LH RC++P QCDYE+EY DGGSS G LV D F L F++ + L G CGY+
Sbjct: 79 SLHSNGDHRCENPG-QCDYEVEYADGGSSFGVLVRDTFNLNFTSEKRHSPLLALGLCGYD 137
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPS 194
Q G P D GVLGLG+G+ SIVSQL GL+RNVIGHC+ +G G LF GD S
Sbjct: 138 QFPGGSHHPID--GVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGHGGGFLFFGDDLYDS 195
Query: 195 SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVS 254
S VAWTPM S D KHY G AEL + GK+ G K+L FDSGASY Y S+ YQ ++S
Sbjct: 196 SRVAWTPM---SPDAKHYSPGLAELTFDGKTTGFKNLLTTFDSGASYTYLNSQAYQGLIS 252
Query: 255 LIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNS-VRLVVP 311
L+ ++L G PL+ A DD+TLP+CW+G PFK++ V +YFK ALSFTN R S L P
Sbjct: 253 LLKKELSGKPLREALDDQTLPLCWKGRKPFKSIRDVKKYFKTFALSFTNERKSKTELEFP 312
Query: 312 PEAYLVIS 319
PEAYL+IS
Sbjct: 313 PEAYLIIS 320
>gi|147802609|emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
Length = 424
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 155/317 (48%), Positives = 207/317 (65%), Gaps = 9/317 (2%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
Y+ V+L++G+PP + D TGSDL+W+QCDAPC CTK Y+P+ N+V C +P
Sbjct: 64 LGYYYVSLSIGQPPXPYFLDPXTGSDLSWLQCDAPCVRCTKAXHXLYRPNNNLVICKDPM 123
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
CA LH P +C+HP +QCDYE+EY DGGSS+G LV D+FPL F+NG L GCGY
Sbjct: 124 CAXLHPPG-YKCEHP-EQCDYEVEYADGGSSLGVLVKDVFPLNFTNGLRLAPRLALGCGY 181
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q P D GVLGLG+G+ SIVSQL G+IRNV+GHC+ +G G LF GD
Sbjct: 182 DQIPGXSYHPLD--GVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSHGGGFLFFGDDLYD 239
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
SS V WTPML++ HY G AEL+ GK+ K+L + FDSG+SY Y S YQ +V
Sbjct: 240 SSRVVWTPMLRDQH--THYSSGYAELILGGKTTVFKNLLVTFDSGSSYTYLNSLAYQALV 297
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT-NRRNSVRLVV 310
L+ ++L P++ A DD+TLP+CWRG PFK++ V ++FKPLALSF R + +
Sbjct: 298 HLVRKELSEKPVREALDDQTLPLCWRGKRPFKSVRDVRKFFKPLALSFAGGGRTKTQYDI 357
Query: 311 PPEAYLVISVSTSIIII 327
P E+YL+IS + + I+
Sbjct: 358 PLESYLIISGNVCLGIL 374
>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 435
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 148/319 (46%), Positives = 213/319 (66%), Gaps = 15/319 (4%)
Query: 11 FPIFS------YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK 64
FPI+ ++ V L +G+PP+ + D DTGS+LTW+QCDAPC+ C++ P YKP
Sbjct: 62 FPIYGNVYPVGFYNVTLNIGQPPRPYFLDVDTGSELTWLQCDAPCSQCSETPHPLYKPSN 121
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+ +PC +P CA+L + C+ PN QCDYEI+Y D S++G L+ D++ L F+NG
Sbjct: 122 DFIPCKDPLCASLQPTDDYTCEDPN-QCDYEIKYADQYSTLGVLLNDVYLLNFTNGVQLK 180
Query: 125 VPLTFGCGYNQ-HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
V + GCGY+Q +P P D G+LGLGRG+ S++SQL GL+RNV+GHC+ G G
Sbjct: 181 VRMALGCGYDQIFSPSTYHPLD--GILGLGRGKASLISQLNSQGLVRNVMGHCLSSRGGG 238
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
+F G+ SS ++WTP+ + KHY GPAEL++ G+ G+ L +IFD+G+SY Y
Sbjct: 239 YIFFGN-VYDSSRMSWTPISSIDSG-KHYSAGPAELVFGGRKTGVGSLNIIFDTGSSYTY 296
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN 301
F S+ YQ ++SL+ ++L P+K APDD+TLP+CW G PF+++ +V +YFKPL LSFTN
Sbjct: 297 FNSQAYQAMISLLNKELHRKPIKAAPDDQTLPMCWHGKRPFRSINEVKKYFKPLTLSFTN 356
Query: 302 -RRNSVRLVVPPEAYLVIS 319
R + +PPEAYL+IS
Sbjct: 357 GGRVKPQFEIPPEAYLIIS 375
>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 438
Score = 293 bits (751), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 148/308 (48%), Positives = 204/308 (66%), Gaps = 9/308 (2%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
++ V L +G+PP+ + D DTGSDLTW+QCDAPC+ C++ P Y+P + VPC + CA
Sbjct: 76 FYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHPLYRPSNDFVPCRHSLCA 135
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+LH + C+ P+ QCDYE++Y D SS+G L+ D++ L F+NG V + GCGY+Q
Sbjct: 136 SLHHSDNYDCEVPH-QCDYEVQYADHYSSLGVLLHDVYTLNFTNGVQLKVRMALGCGYDQ 194
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
P P P G+LGLGRG+ S+ SQL GL+RNVIGHC+ G G +F GD SS
Sbjct: 195 IFPDPSHHP-LDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQGGGYIFFGD-VYDSS 252
Query: 196 GVAWTPMLQNSADLKHY-ILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVS 254
+ WTPM +S D KHY G AELL+ GK G+ L +FD+G+SY YF YQ ++S
Sbjct: 253 RLTWTPM--SSRDYKHYSAAGAAELLFGGKKSGIGSLHAVFDTGSSYTYFNPYAYQALIS 310
Query: 255 LIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT-NRRNSVRLVVP 311
+ ++ G PLK A DD+TLP+CWRG PF+++ +V +YFKP+ LSFT N R+ + +P
Sbjct: 311 WLGKESGGKPLKEAHDDQTLPLCWRGRRPFRSIYEVRKYFKPIVLSFTSNGRSKAQFEMP 370
Query: 312 PEAYLVIS 319
PEAYL+IS
Sbjct: 371 PEAYLIIS 378
>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 440
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 146/308 (47%), Positives = 206/308 (66%), Gaps = 9/308 (2%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
++ V L +G+PP+ + D DTGSDLTW+QCDAPC+ C++ P Y+P ++VPC + CA
Sbjct: 78 FYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHPLYRPSNDLVPCRHALCA 137
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+LH + C+ P+ QCDYE++Y D SS+G L+ D++ L F+NG V + GCGY+Q
Sbjct: 138 SLHLSDNYDCEVPH-QCDYEVQYADHYSSLGVLLHDVYTLNFTNGVQLKVRMALGCGYDQ 196
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
P P P G+LGLGRG+ S+ SQL GL+RNVIGHC+ G G +F GD S
Sbjct: 197 IFPDPSHHP-LDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQGGGYIFFGD-VYDSF 254
Query: 196 GVAWTPMLQNSADLKHY-ILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVS 254
+ WTPM +S D KHY + G AELL+ GK G+ +L +FD+G+SY YF S YQ ++S
Sbjct: 255 RLTWTPM--SSRDYKHYSVAGAAELLFGGKKSGVGNLHAVFDTGSSYTYFNSYAYQVLIS 312
Query: 255 LIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT-NRRNSVRLVVP 311
+ ++ G PLK A DD+TLP+CWRG PF+++ +V +YFKP+ LSFT N R+ + +
Sbjct: 313 WLKKESGGKPLKEAHDDQTLPLCWRGRRPFRSIYEVRKYFKPIVLSFTSNGRSKAQFEML 372
Query: 312 PEAYLVIS 319
PEAYL++S
Sbjct: 373 PEAYLIVS 380
>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 440
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 145/307 (47%), Positives = 204/307 (66%), Gaps = 13/307 (4%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
++ V + +G PP+ + D DTGSDLTW+QCDAPC+ C++ P Y+P ++VPC +P CA
Sbjct: 84 FYNVTINIGYPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHPLYRPSNDLVPCRHPLCA 143
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
++H + C+ + QCDYE+EY D SS+G LV D++ L F+NG V + GCGY+Q
Sbjct: 144 SVHQTDNYECEVEH-QCDYEVEYADHYSSLGVLVNDVYVLNFTNGVQLKVRMALGCGYDQ 202
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
P P G+LGLGRG+ S++SQL GL+RNV+GHC+ G G +F GD SS
Sbjct: 203 IFPDSSYHP-VDGMLGLGRGKSSLISQLNGQGLVRNVVGHCLSAQGGGYIFFGD-VYDSS 260
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
+AWTPM +S D KHY G AEL+ GK G +L +FD+G+SY YF S YQ
Sbjct: 261 RLAWTPM--SSRDYKHYSAGAAELVLGGKRTGFGNLLAVFDAGSSYTYFNSNAYQ----- 313
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF-TNRRNSVRLVVPP 312
+ ++L G P+K AP+D+TLP+CW G PF+++ +V +YFKP+ALSF +RR+ + +PP
Sbjct: 314 LTKELAGKPIKEAPEDQTLPLCWYGKRPFRSVYEVKKYFKPIALSFPGSRRSKAQFEIPP 373
Query: 313 EAYLVIS 319
EAYL+IS
Sbjct: 374 EAYLIIS 380
>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Cucumis sativus]
Length = 418
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 157/320 (49%), Positives = 210/320 (65%), Gaps = 8/320 (2%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
++ V L VG+PPK + D DTGSDLTW+QCDAPC CT+ Y+P ++VPC +P C
Sbjct: 56 FYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCM 115
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+LH RC++P DQCDYE+EY DGGSS+G LV D+FPL +NG L GCGY+Q
Sbjct: 116 SLHSSMDHRCENP-DQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQ 174
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
+PG S G+LGLGRG +SIVSQL G++RNV+GHC G G F GDG
Sbjct: 175 -DPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYXFFGDGIYDPY 233
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
+ WTPM ++ KHY G EL+++G+S GL++L ++FDSG+SY YF ++ YQ + SL
Sbjct: 234 RLVWTPMSRDYP--KHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSL 291
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN-RRNSVRLVVPP 312
+ R+L G PL+ A DD TLP+CWRG P K+L V +YFKPLALSF++ R+ +P
Sbjct: 292 LNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEIPT 351
Query: 313 EAYLVISVSTSIIIIAYLTG 332
E Y++IS S + + L G
Sbjct: 352 EGYMIIS-SMGNVCLGILNG 370
>gi|255563835|ref|XP_002522918.1| nucellin, putative [Ricinus communis]
gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis]
Length = 433
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 154/309 (49%), Positives = 202/309 (65%), Gaps = 10/309 (3%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y+ V L++G+P K + D DTGSDLTW+QCDAPC C + P Y+P N+V C +P CA
Sbjct: 70 YYNVTLSIGQPAKPYFLDVDTGSDLTWLQCDAPCRQCIEAPHPLYRPSNNLVICEDPLCA 129
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+L P C+ P DQCDYE+EY DGGSS+G LV D+F L F+NG N L GCGY+Q
Sbjct: 130 SLQPPGVHNCQDP-DQCDYEVEYADGGSSLGVLVKDVFVLNFTNGKRLNPLLALGCGYDQ 188
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
PG + P G+LGLGRG SI SQL GL+ NVIGHC+ G G LF G+ SS
Sbjct: 189 L-PGRSNHP-LDGILGLGRGISSIPSQLSSQGLVSNVIGHCLSGRGGGFLFFGEDIYDSS 246
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
GV WTPM ++ LKHY G AEL++ GKS G+++L ++FDSG+SY Y ++ YQ +V
Sbjct: 247 GVTWTPMSRDH--LKHYSPGFAELIFDGKSTGIRNLLVVFDSGSSYTYLNAQAYQHLVFS 304
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF---TNRRNSVRLVV 310
+ R+L P+ A DD+TLP+CW+G PFK++ V +YFKP AL F + R + +
Sbjct: 305 LKRELSRKPISEALDDQTLPLCWKGKRPFKSIRDVKKYFKPFALVFKTSSGRSSKTQFEF 364
Query: 311 PPEAYLVIS 319
PEAYL+IS
Sbjct: 365 SPEAYLIIS 373
>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
Length = 379
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 161/308 (52%), Positives = 202/308 (65%), Gaps = 10/308 (3%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
++ V L +G+P K + D DTGSDLTW+QCD P CT+ P YKP N+V C +P C
Sbjct: 19 FYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDVPRAQCTEAPHPYYKPSNNLVACKDPICQ 78
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG-CGYN 134
+LH RC++P QCDYE+EY DGGSS+G LV D F L F++ + L G CGY+
Sbjct: 79 SLHTGGDQRCENPG-QCDYEVEYADGGSSLGVLVKDAFNLNFTSEKRQSPLLALGLCGYD 137
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPS 194
Q G P D GVLGLGRG+ SIVSQL GL+RNVIGHC+ G G LF GD S
Sbjct: 138 QLPGGTYHPID--GVLGLGRGKPSIVSQLSGLGLVRNVIGHCLSGRGGGFLFFGDDLYDS 195
Query: 195 SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVS 254
S VAWTPM S + KHY G AEL + GK+ G K+L + FDSGASY Y S+VYQ ++S
Sbjct: 196 SRVAWTPM---SPNAKHYSPGFAELTFDGKTTGFKNLIVAFDSGASYTYLNSQVYQGLIS 252
Query: 255 LIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNR-RNSVRLVVP 311
LI R+L PL+ A DD+TLPICW+G PFK++ V +YFK ALSF N ++ +L P
Sbjct: 253 LIKRELSTKPLREALDDQTLPICWKGRKPFKSVRDVKKYFKTFALSFANDGKSKTQLEFP 312
Query: 312 PEAYLVIS 319
PEAYL++S
Sbjct: 313 PEAYLIVS 320
>gi|356527532|ref|XP_003532363.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 429
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 146/308 (47%), Positives = 200/308 (64%), Gaps = 10/308 (3%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
++ V L +G+P + + D DTGSDLTW+QCDAPCT C++ P Y+P + VPC +P CA
Sbjct: 68 FYNVTLNIGQPARPYFLDVDTGSDLTWLQCDAPCTHCSETPHPLYRPSNDFVPCRDPLCA 127
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+L C+HP DQCDYEI Y D S+ G L+ D++ L F+NG V + GCGY+Q
Sbjct: 128 SLQPTEDYNCEHP-DQCDYEINYADQYSTFGVLLNDVYLLNFTNGVQLKVRMALGCGYDQ 186
Query: 136 -HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPS 194
+P P D LG G+ S++SQL GL+RNVIGHC+ G G +F G+ S
Sbjct: 187 VFSPSSYHPLDGLLGLGRGKA--SLISQLNSQGLVRNVIGHCLSAQGGGYIFFGNA-YDS 243
Query: 195 SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVS 254
+ V WTP+ +S D KHY GPAEL++ G+ G+ LT +FD+G+SY YF S YQ ++S
Sbjct: 244 ARVTWTPI--SSVDSKHYSAGPAELVFGGRKTGVGSLTAVFDTGSSYTYFNSHAYQALLS 301
Query: 255 LIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN-RRNSVRLVVP 311
+ ++L G PLK+APDD+TLP+CW G PF +L +V +YFKP+AL FTN R + +
Sbjct: 302 WLKKELSGKPLKVAPDDQTLPLCWHGKRPFTSLREVRKYFKPVALGFTNGGRTKAQFEIL 361
Query: 312 PEAYLVIS 319
PEAYL+IS
Sbjct: 362 PEAYLIIS 369
>gi|356511197|ref|XP_003524315.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 431
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 146/308 (47%), Positives = 200/308 (64%), Gaps = 10/308 (3%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
++ V L +G+P + + D DTGSDLTW+QCDAPCT C++ P ++P + VPC +P CA
Sbjct: 70 FYNVTLNIGQPARPYFLDVDTGSDLTWLQCDAPCTHCSETPHPLHRPSNDFVPCRDPLCA 129
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+L C+HP DQCDYEI Y D S+ G L+ D++ L SNG V + GCGY+Q
Sbjct: 130 SLQPTEDYNCEHP-DQCDYEINYADQYSTYGVLLNDVYLLNSSNGVQLKVRMALGCGYDQ 188
Query: 136 -HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPS 194
+P P D LG G+ S++SQL GL+RNVIGHC+ G G +F G+ S
Sbjct: 189 VFSPSSYHPLDGLLGLGRGKA--SLISQLNSQGLVRNVIGHCLSSQGGGYIFFGNA-YDS 245
Query: 195 SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVS 254
+ V WTP+ +S D KHY GPAEL++ G+ G+ LT +FD+G+SY YF S YQ ++S
Sbjct: 246 ARVTWTPI--SSVDSKHYSAGPAELVFGGRKTGVGSLTAVFDTGSSYTYFNSHAYQALLS 303
Query: 255 LIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN-RRNSVRLVVP 311
+ ++L G PLK+APDD+TL +CW G PF +L +V +YFKP+ALSFTN R + +P
Sbjct: 304 WLNKELSGKPLKVAPDDQTLSLCWHGKRPFTSLREVRKYFKPVALSFTNGGRVKAQFEIP 363
Query: 312 PEAYLVIS 319
PEAYL+IS
Sbjct: 364 PEAYLIIS 371
>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 425
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 148/313 (47%), Positives = 200/313 (63%), Gaps = 14/313 (4%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCD---APCTGCTKPPEKQYKPH-KNIVPCSNP 72
+ V++ +G PPK ++ D DTGSDLTWVQCD APC GCT P +K YKP+ K +V CS+P
Sbjct: 62 YTVSINIGNPPKPYELDIDTGSDLTWVQCDGPDAPCKGCTMPKDKLYKPNGKQVVKCSDP 121
Query: 73 RCAALHWPN--PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C A + C + C Y ++Y D S++G LV D + + S + + FG
Sbjct: 122 ICVATQSTHVLGQICSKQSPPCVYNVQYADHASTLGVLVRDYMHIGSPSSSTKDPLVAFG 181
Query: 131 CGYNQHNPGPLSPPDT--AGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG 188
CGY Q GP +PP + AG+LGLG G+ SI+SQL G I NV+GHC+ G G LFLG
Sbjct: 182 CGYEQKFSGP-TPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVLGHCLSAEGGGYLFLG 240
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRV 248
D VPSSG+ WTP++Q+S + KHY GP +L ++GK K L +IFDSG+SY YF+S V
Sbjct: 241 DKFVPSSGIVWTPIIQSSLE-KHYNTGPVDLFFNGKPTPAKGLQIIFDSGSSYTYFSSPV 299
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSV 306
Y + +++ DL G PL D +LPICW+G PFK+L +V YFKPL LSFT +N +
Sbjct: 300 YTIVANMVNNDLKGKPLSRV-KDPSLPICWKGVKPFKSLNEVNNYFKPLTLSFTKSKN-L 357
Query: 307 RLVVPPEAYLVIS 319
+ +PP AYL+I+
Sbjct: 358 QFQLPPVAYLIIT 370
>gi|356507650|ref|XP_003522577.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 326
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 143/304 (47%), Positives = 193/304 (63%), Gaps = 27/304 (8%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAALH 78
+++T+ +L++ D DTGSDLTW Q DAPC GCT P +K KPH +V C + CAA+H
Sbjct: 1 MSITITSSSELYELDIDTGSDLTWFQWDAPCQGCTLPRDKLNKPHCKLVKCGDRLCAAIH 60
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNP 138
C P++QCDYE+EY D GSS+G LV D L+F++GS+ P+
Sbjct: 61 ---SEPCADPDEQCDYEVEYADQGSSLGVLVLDNIALKFTSGSLAR-PI----------- 105
Query: 139 GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVA 198
L+ PD +GL G+ SI+SQL GLIRNV+GHC+ + G G LF GD +P SGV
Sbjct: 106 --LAAPD----MGLATGKTSILSQLHSLGLIRNVVGHCLSRRGGGFLFFGDQLIPQSGVV 159
Query: 199 WTPMLQNSA---DLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
WTP+LQNS+ HY GPA++ ++GK+ +K L L FDSG+SY F S ++ +V L
Sbjct: 160 WTPLLQNSSVTYTRPHYKTGPADMFFNGKATSVKGLELTFDSGSSYTXFNSHAHKALVGL 219
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
I D+ G A +D +LPICW+ P FK+L VT YFKP+ALSFT +NS+ L +PPE
Sbjct: 220 ITNDIKGKSFSRATEDPSLPICWKNPKTFKSLHDVTNYFKPIALSFTKSKNSL-LQLPPE 278
Query: 314 AYLV 317
AYL+
Sbjct: 279 AYLI 282
>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 418
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 153/309 (49%), Positives = 199/309 (64%), Gaps = 14/309 (4%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCD---APCTGCTKPPEKQYKPHKN-IVPCSNP 72
+ V++ +G PP ++ D DTGSDLTWVQCD APC GCT P +K YKP+ N +V CS+P
Sbjct: 62 YTVSINIGNPPNPYELDIDTGSDLTWVQCDGPDAPCKGCTLPKDKLYKPNGNQLVKCSDP 121
Query: 73 RCAALHWPNP---PRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT- 128
CAA+ P +C P C Y++EY D S GAL D + +GS NVPL
Sbjct: 122 ICAAVQPPFSTFGQKCAKPIPPCVYKVEYADNAESTGALARDYMHIGSPSGS--NVPLVV 179
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG 188
FGCGY Q GP PP T GVLGLG G+ISI+SQL G I NV+GHC+ G G LFLG
Sbjct: 180 FGCGYEQKFSGPTPPPSTPGVLGLGNGKISILSQLHSMGFIHNVLGHCLSAEGGGYLFLG 239
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRV 248
D +PSSG+ WTP++Q+S + KHY GP +L ++GK K L +IFDSG+SY YF+ RV
Sbjct: 240 DKFIPSSGIFWTPIIQSSLE-KHYSTGPVDLFFNGKPTPAKGLQIIFDSGSSYTYFSPRV 298
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSV 306
Y + +++ DL G PL+ D +LPICW+G PFK+L +V YFKPL LSFT +N +
Sbjct: 299 YTIVANMVNNDLKGKPLRRETKDPSLPICWKGVKPFKSLNEVNNYFKPLTLSFTKSKN-L 357
Query: 307 RLVVPPEAY 315
+ +PP +
Sbjct: 358 QFQLPPVKF 366
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 138/308 (44%), Positives = 196/308 (63%), Gaps = 9/308 (2%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V +++G PP+ + D DTGSDLTW+QCDAPC C+K P Y+P KN +VPC + CA
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKLVPCVDQMCA 117
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
ALH +C P QCDYEI+Y D GSS+G LVTD F LR +N S+ L FGCGY
Sbjct: 118 ALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLAFGCGY 177
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q T GVLGLG G +S++SQL+++G+ +NV+GHC+ G G LF GD VP
Sbjct: 178 DQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGFLFFGDDIVP 237
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
S W PM + S +Y G A L + G+ G++ + ++FDSG+S+ YF+++ YQ +V
Sbjct: 238 YSRATWAPMAR-STSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYFSAQPYQALV 296
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
I DL LK P D +LP+CW+G PFK++ V + FK + LSF+N + ++ + +P
Sbjct: 297 DAIKGDL-SKNLKEVP-DHSLPLCWKGKKPFKSVLDVKKEFKTVVLSFSNGKKAL-MEIP 353
Query: 312 PEAYLVIS 319
PE YL+++
Sbjct: 354 PENYLIVT 361
>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
Length = 395
Score = 270 bits (690), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 137/308 (44%), Positives = 196/308 (63%), Gaps = 9/308 (2%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V +++G PP+ + D DTGSDLTW+QCDAPC C+K P Y+P KN +VPC + CA
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKLVPCVDQMCA 117
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
ALH +C P QCDYEI+Y D GSS+G LVTD F LR +N S+ L FGCGY
Sbjct: 118 ALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLAFGCGY 177
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q T GVLGLG G +S++SQL+++G+ +NV+GHC+ G G LF GD VP
Sbjct: 178 DQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGFLFFGDDIVP 237
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
S W PM + S +Y G A L + G+ G++ + ++FDSG+S+ YF+++ YQ +V
Sbjct: 238 YSRATWAPMAR-STSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYFSAQPYQALV 296
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
I DL LK P D +LP+CW+G PFK++ V + F+ + LSF+N + ++ + +P
Sbjct: 297 DAIKGDL-SKNLKEVP-DHSLPLCWKGKKPFKSVLDVKKEFRTVVLSFSNGKKAL-MEIP 353
Query: 312 PEAYLVIS 319
PE YL+++
Sbjct: 354 PENYLIVT 361
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 137/308 (44%), Positives = 196/308 (63%), Gaps = 9/308 (2%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V +++G PP+ + D DTGSDLTW+QCDAPC C+K P Y+P KN +VPC + CA
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKLVPCVDQMCA 117
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
ALH +C P QCDYEI+Y D GSS+G LVTD F LR +N S+ L FGCGY
Sbjct: 118 ALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLAFGCGY 177
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q T GVLGLG G +S++SQL+++G+ +NV+GHC+ G G LF GD VP
Sbjct: 178 DQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGFLFFGDDIVP 237
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
S W PM + S +Y G A L + G+ G++ + ++FDSG+S+ YF+++ YQ +V
Sbjct: 238 YSRATWAPMAR-STSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYFSAQPYQALV 296
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
I DL LK P D +LP+CW+G PFK++ V + F+ + LSF+N + ++ + +P
Sbjct: 297 DAIKGDL-SKNLKEVP-DHSLPLCWKGKKPFKSVLDVKKEFRTVVLSFSNGKKAL-MEIP 353
Query: 312 PEAYLVIS 319
PE YL+++
Sbjct: 354 PENYLIVT 361
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 270 bits (689), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 137/308 (44%), Positives = 196/308 (63%), Gaps = 9/308 (2%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V +++G PP+ + D DTGSDLTW+QCDAPC C+K P Y+P KN +VPC + CA
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKLVPCVDQMCA 117
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
ALH +C P QCDYEI+Y D GSS+G LVTD F LR +N S+ L FGCGY
Sbjct: 118 ALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLAFGCGY 177
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q T GVLGLG G +S++SQL+++G+ +NV+GHC+ G G LF GD VP
Sbjct: 178 DQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGGGFLFFGDDIVP 237
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
S W PM + S +Y G A L + G+ G++ + ++FDSG+S+ YF+++ YQ +V
Sbjct: 238 YSRATWAPMAR-STSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGSSFTYFSAQPYQALV 296
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
I DL LK P D +LP+CW+G PFK++ V + F+ + LSF+N + ++ + +P
Sbjct: 297 DAIKGDL-SKNLKEVP-DHSLPLCWKGKKPFKSVLDVKKEFRTVVLSFSNGKKAL-MEIP 353
Query: 312 PEAYLVIS 319
PE YL+++
Sbjct: 354 PENYLIVT 361
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 135/308 (43%), Positives = 197/308 (63%), Gaps = 9/308 (2%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V +++G PP+ + D DTGSDLTW+QCDAPC C K P Y+P KN IVPC + C+
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCNKVPHPLYRPTKNKIVPCVDQLCS 117
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
+LH +C P QCDYEI+Y D GSS+G L+TD F +R +N S+ L FGCGY
Sbjct: 118 SLHGGLSGKHKCDSPKQQCDYEIKYADQGSSLGVLLTDSFAVRLANSSIVRPSLAFGCGY 177
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVP 193
+Q T GVLGLG G IS++SQL+++G+ +NV+GHC+ G G LF GD VP
Sbjct: 178 DQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGHCLSIRGGGFLFFGDNLVP 237
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
S W PM++ SA +Y G A L + G+S G++ + ++ DSG+S+ YF ++ YQ +V
Sbjct: 238 YSRATWVPMVR-SAFKNYYSPGTASLYFGGRSLGVRPMEVVLDSGSSFTYFGAQPYQALV 296
Query: 254 SLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
+ + DL T ++ D +LP+CW+G PFK++ V + FK L LSF+N + ++ + +P
Sbjct: 297 TALKSDLSKTLKEVF--DPSLPLCWKGKKPFKSVLDVKKEFKSLVLSFSNGKKAL-MEIP 353
Query: 312 PEAYLVIS 319
PE YL+++
Sbjct: 354 PENYLIVT 361
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 138/309 (44%), Positives = 195/309 (63%), Gaps = 12/309 (3%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V + +G PPK + D DTGSDLTW+QCDAPC C K P Y+P KN +VPC + CA
Sbjct: 66 YYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTKNKLVPCVDQLCA 125
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
+LH +C P +QCDY I+Y D GSS G LV D F LR +NGSV L FGCGY
Sbjct: 126 SLHNGLNRKHKCDSPYEQCDYVIKYADQGSSTGVLVNDSFALRLANGSVVRPSLAFGCGY 185
Query: 134 NQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
+Q + G +SP D GVLGLG G +S++SQ +++G+ +NV+GHC+ G G LF GD V
Sbjct: 186 DQQVSSGEMSPTD--GVLGLGTGSVSLLSQFKQHGVTKNVVGHCLSLRGGGFLFFGDDLV 243
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEI 252
P V WTPM++ S +Y G A L + +S +K ++FDSG+S+ YF ++ YQ +
Sbjct: 244 PYQRVTWTPMVR-SPLRNYYSPGSASLYFGDQSLRVKLTEVVFDSGSSFTYFAAQPYQAL 302
Query: 253 VSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVV 310
V+ + DL T +++ D +LP+CW+G PFK++ V + FK L L+F N N + +
Sbjct: 303 VTALKGDLSRTLKEVS--DPSLPLCWKGKKPFKSVLDVKKEFKSLVLNFGN-GNKAFMEI 359
Query: 311 PPEAYLVIS 319
PP+ YL+++
Sbjct: 360 PPQNYLIVT 368
>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
Length = 383
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 140/322 (43%), Positives = 203/322 (63%), Gaps = 16/322 (4%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V + +G PPK + D D+GSDLTW+QCDAPC C + P Y+P K+ +VPC + CA
Sbjct: 66 YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKSKLVPCVHRLCA 125
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
+LH RC P++QCDY I+Y D GSS G L+ D F LR +NGSV + FGCGY
Sbjct: 126 SLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTNGSVARPSVAFGCGY 185
Query: 134 NQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
+Q G LS P T GVLGLG G +S++SQL++ G+ +NV+GHC+ G G LF GD V
Sbjct: 186 DQQVRSGDLSSP-TDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLSLRGGGFLFFGDDLV 244
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEI 252
P WTPM + SA +Y G A L + +S G++ ++FDSG+S+ YF ++ YQ +
Sbjct: 245 PYQRATWTPMAR-SAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGSSFTYFAAKPYQAL 303
Query: 253 VSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVV 310
V+ ++D + L+ P D +LP+CW+G PFK++ V + FK L L+F + + ++ + +
Sbjct: 304 VT-ALKDGLSRTLEEEP-DTSLPLCWKGQEPFKSVLDVRKEFKSLVLNFASGKKTL-MEI 360
Query: 311 PPEAYLVISVSTSIIIIAYLTG 332
PPE YL+++V+ IAY G
Sbjct: 361 PPENYLIVTVN-----IAYPDG 377
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 136/310 (43%), Positives = 198/310 (63%), Gaps = 12/310 (3%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V + +G PPK + D D+GSDLTW+QCDAPC C + P Y+P K+ +VPC + CA
Sbjct: 64 YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKSKLVPCVHRLCA 123
Query: 76 ALHWP---NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+LH RC+ P++QCDY I+Y D GSS G LV D F LR +NGSV + FGCG
Sbjct: 124 SLHNALTGGKHRCESPHEQCDYVIKYADQGSSTGVLVNDSFALRLTNGSVARPSVAFGCG 183
Query: 133 YNQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
Y+Q G LS P T GVLGLG G +S++SQL++ G+ +NV+GHC+ G G LF GD
Sbjct: 184 YDQQVRSGDLSSP-TDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLSLRGGGFLFFGDDL 242
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQE 251
VP WTPM + SA +Y G A L + +S G++ ++FDSG+S+ YF ++ YQ
Sbjct: 243 VPYQRATWTPMAR-SAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGSSFTYFAAKPYQA 301
Query: 252 IVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLV 309
+V+ ++D + L+ P D +LP+CW+G PFK++ V + FK L L+F + + ++ +
Sbjct: 302 LVT-ALKDGLSRTLEEEP-DTSLPLCWKGQEPFKSVLDVRKEFKSLVLNFASGKKTL-ME 358
Query: 310 VPPEAYLVIS 319
+PPE YL+++
Sbjct: 359 IPPENYLIVT 368
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 135/309 (43%), Positives = 197/309 (63%), Gaps = 11/309 (3%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V + +G PPK + D D+GSDLTW+QCDAPC C + P Y+P K+ +VPC + CA
Sbjct: 66 YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKSKLVPCVHRLCA 125
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
+LH RC P++QCDY I+Y D GSS G L+ D F LR +NGSV + FGCGY
Sbjct: 126 SLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTNGSVARPSVAFGCGY 185
Query: 134 NQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
+Q G LS P T GVLGLG G +S++SQL++ G+ +NV+GHC+ G G LF GD V
Sbjct: 186 DQQVRSGDLSSP-TDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLSLRGGGFLFFGDDLV 244
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEI 252
P WTPM + SA +Y G A L + +S G++ ++FDSG+S+ YF ++ YQ +
Sbjct: 245 PYQRATWTPMAR-SAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGSSFTYFAAKPYQAL 303
Query: 253 VSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVV 310
V+ ++D + L+ P D +LP+CW+G PFK++ V + FK L L+F + + ++ + +
Sbjct: 304 VT-ALKDGLSRTLEEEP-DTSLPLCWKGQEPFKSVLDVRKEFKSLVLNFASGKKTL-MEI 360
Query: 311 PPEAYLVIS 319
PPE YL+++
Sbjct: 361 PPENYLIVT 369
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 135/309 (43%), Positives = 197/309 (63%), Gaps = 11/309 (3%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCA 75
+ V + +G PPK + D D+GSDLTW+QCDAPC C + P Y+P K+ +VPC + CA
Sbjct: 57 YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKSKLVPCVHRLCA 116
Query: 76 ALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
+LH RC P++QCDY I+Y D GSS G L+ D F LR +NGSV + FGCGY
Sbjct: 117 SLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTNGSVARPSVAFGCGY 176
Query: 134 NQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
+Q G LS P T GVLGLG G +S++SQL++ G+ +NV+GHC+ G G LF GD V
Sbjct: 177 DQQVRSGDLSSP-TDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLSLRGGGFLFFGDDLV 235
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEI 252
P WTPM + SA +Y G A L + +S G++ ++FDSG+S+ YF ++ YQ +
Sbjct: 236 PYQRATWTPMAR-SAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGSSFTYFAAKPYQAL 294
Query: 253 VSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVV 310
V+ ++D + L+ P D +LP+CW+G PFK++ V + FK L L+F + + ++ + +
Sbjct: 295 VT-ALKDGLSRTLEEEP-DTSLPLCWKGQEPFKSVLDVRKEFKSLVLNFASGKKTL-MEI 351
Query: 311 PPEAYLVIS 319
PPE YL+++
Sbjct: 352 PPENYLIVT 360
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 137/313 (43%), Positives = 197/313 (62%), Gaps = 15/313 (4%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRC 74
++ V + +G P K + D DTGSDLTW+QCDAPC C K P Y+P N +VPC+N C
Sbjct: 53 HYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANSLVPCANALC 112
Query: 75 AALHWPNPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLF--PLRFSNGSVFNVPLTFGC 131
ALH + K P+ QCDY+I+Y D SS G L+ D F P+R SN LTFGC
Sbjct: 113 TALHSGHGSNNKCPSPKQCDYQIKYTDSASSQGVLINDNFSLPMRSSN---IRPGLTFGC 169
Query: 132 GYNQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
GY+Q T G+LGLGRG +S+VSQL++ G+ +NV+GHC+ NG G LF GD
Sbjct: 170 GYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCLSTNGGGFLFFGDD 229
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQ 250
VP+S V W PM + S + +Y G L + +S G+K + ++FDSG++Y YFT++ YQ
Sbjct: 230 IVPTSRVTWVPMAKISGN--YYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTYFTAQPYQ 287
Query: 251 EIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFKPLALSFTNRRNSVRL 308
+VS + L + +++ D +LP+CW+GP FK++ V + FK L LSF + +N+V +
Sbjct: 288 AVVSALKSGLSKSLKQVS--DPSLPLCWKGPKAFKSVFDVKKEFKSLFLSFASAKNAV-M 344
Query: 309 VVPPEAYLVISVS 321
+PPE YL+++V+
Sbjct: 345 EIPPENYLIVTVN 357
>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
Length = 354
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 117/193 (60%), Positives = 140/193 (72%), Gaps = 1/193 (0%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCS 70
FP+ Y++V L +G PPK F+FD DTGSDLTWVQCDAPCTGCT PP +QYKP N VPC
Sbjct: 49 FPL-GYYSVLLQIGTPPKAFEFDIDTGSDLTWVQCDAPCTGCTLPPIRQYKPKGNTVPCL 107
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+P C ALH+PN P+C +P +QCDYE+ Y D GSS+GALV D FPL+ NGS L FG
Sbjct: 108 DPICLALHFPNKPQCPNPKEQCDYEVNYADQGSSMGALVIDQFPLKLLNGSAMQPRLAFG 167
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CGY+Q P PP TAGVLGLGRG+I ++ QL GL RNV+GHC+ G G LF GD
Sbjct: 168 CGYDQILPKAHPPPATAGVLGLGRGKIGVLPQLVAAGLTRNVVGHCLSSKGGGYLFFGDT 227
Query: 191 KVPSSGVAWTPML 203
+P+ GVAWTP+L
Sbjct: 228 LIPTLGVAWTPLL 240
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 250 bits (638), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 138/325 (42%), Positives = 200/325 (61%), Gaps = 19/325 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
+P Y+ V + +G P K + D DTGSDLTW+QCDAPC C K P Y+P N +VPC
Sbjct: 48 YPTGHYY-VTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPC 106
Query: 70 SNPRCAALHWPNPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLF--PLRFSNGSVFNVP 126
+N C ALH K P+ QCDY+I+Y D SS G L+ D F P+R SN
Sbjct: 107 ANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN---IRPG 163
Query: 127 LTFGCGYNQH---NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
LTFGCGY+Q N + D G+LGLGRG +S+VSQL++ G+ +NV+GHC+ NG G
Sbjct: 164 LTFGCGYDQQVGKNGAVQAAID--GMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGG 221
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
LF GD VPSS V W PM Q ++ +Y G L + +S G+K + ++FDSG++Y Y
Sbjct: 222 FLFFGDDVVPSSRVTWVPMAQRTSG-NYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTY 280
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN 301
FT++ YQ +VS + L + +++ D TLP+CW+G FK++ V FK + LSF++
Sbjct: 281 FTAQPYQAVVSALKGGLSKSLKQVS--DPTLPLCWKGQKAFKSVFDVKNEFKSMFLSFSS 338
Query: 302 RRNSVRLVVPPEAYLVISVSTSIII 326
+N+ + +PPE YL+++ + ++ +
Sbjct: 339 AKNAA-MEIPPENYLIVTKNGNVCL 362
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 138/325 (42%), Positives = 199/325 (61%), Gaps = 19/325 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
+P Y+ V + +G P K + D DTGSDLTW+QCDAPC C K P Y+P N +VPC
Sbjct: 48 YPTGHYY-VTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPC 106
Query: 70 SNPRCAALHWPNPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLF--PLRFSNGSVFNVP 126
+N C ALH K P+ QCDY+I+Y D SS G L+ D F P+R SN
Sbjct: 107 ANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN---IRPG 163
Query: 127 LTFGCGYNQH---NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
LTFGCGY+Q N + D G+LGLGRG +S+VSQL++ G+ +NV+GHC+ NG G
Sbjct: 164 LTFGCGYDQQVGKNGAVQAAID--GMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGG 221
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
LF GD VPSS V W PM Q ++ +Y G L + +S G+K + ++FDSG++Y Y
Sbjct: 222 FLFFGDDVVPSSRVTWVPMAQRTSG-NYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTY 280
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN 301
FT++ YQ +VS + L + +++ D TLP+CW+G FK++ V FK + LSF +
Sbjct: 281 FTAQPYQAVVSALKGGLSKSLKQVS--DPTLPLCWKGQKAFKSVFDVKNEFKSMFLSFAS 338
Query: 302 RRNSVRLVVPPEAYLVISVSTSIII 326
+N+ + +PPE YL+++ + ++ +
Sbjct: 339 AKNAA-MEIPPENYLIVTKNGNVCL 362
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 131/316 (41%), Positives = 198/316 (62%), Gaps = 11/316 (3%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRC 74
++ V + +G P K + D DTGSDLTW+QCDAPC C K P Y+P KN +VPC+N C
Sbjct: 56 HYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKNKLVPCANSIC 115
Query: 75 AALHWPNPPRCK-HPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
ALH + P K QCDY+I+Y D SS+G LVTD F L N S L+FGCGY
Sbjct: 116 TALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSLPLRNKSNVRPSLSFGCGY 175
Query: 134 NQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
+Q +P T G+LGLGRG +S++SQL++ G+ +NV+GHC+ +G G LF GD V
Sbjct: 176 DQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTSGGGFLFFGDDMV 235
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEI 252
P+S V W PM+++++ +Y G A L + +S K + ++FDSG++Y YF+++ YQ
Sbjct: 236 PTSRVTWVPMVRSTSG-NYYSPGSATLYFDRRSLSTKPMEVVFDSGSTYTYFSAQPYQAT 294
Query: 253 VSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVV 310
+S I L + +++ D +LP+CW+G FK++ V + FK +L F +N+V + +
Sbjct: 295 ISAIKGSLSKSLKQVS--DPSLPLCWKGQKAFKSVSDVKKDFK--SLQFIFGKNAV-MEI 349
Query: 311 PPEAYLVISVSTSIII 326
PPE YL+++ + ++ +
Sbjct: 350 PPENYLIVTKNGNVCL 365
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 134/310 (43%), Positives = 189/310 (60%), Gaps = 13/310 (4%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRC 74
++ V + +G P K + D DTGSDLTW+QCDAPC C K P YKP KN +VPC+ C
Sbjct: 51 HYYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSCNKVPHPLYKPTKNKLVPCAASIC 110
Query: 75 AALHWPNPP--RCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
LH P +C P QCDY+I+Y D SS+G LVTD F L N S TFGCG
Sbjct: 111 TTLHSAQSPNKKCAVPQ-QCDYQIKYTDSASSLGVLVTDNFTLPLRNSSSVRPSFTFGCG 169
Query: 133 YNQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
Y+Q + T G+LGLG+G +S+VSQL+ G+ +NV+GHC+ NG G LF GD
Sbjct: 170 YDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCLSTNGGGFLFFGDNV 229
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQE 251
VP+S W PM+++++ +Y G L + +S G+K + ++FDSG++Y YF ++ YQ
Sbjct: 230 VPTSRATWVPMVRSTSG-NYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTYFAAQPYQA 288
Query: 252 IVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFKPLALSFTNRRNSVRLV 309
VS + L + +++ D +LP+CW+G FK++ V FK L LSF +NSV L
Sbjct: 289 TVSALKAGLSKSLQQVS--DPSLPLCWKGQKVFKSVSDVKNDFKSLFLSFV--KNSV-LE 343
Query: 310 VPPEAYLVIS 319
+PPE YL+++
Sbjct: 344 IPPENYLIVT 353
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 137/313 (43%), Positives = 192/313 (61%), Gaps = 14/313 (4%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
+PI Y+ V + +G P K + D DTGSDLTW+QCDAPC C K P YKP KN IVPC
Sbjct: 68 YPIGHYY-VTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPWYKPTKNKIVPC 126
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C +L PN +C P QCDY+I+Y D SS+G L+ D F L N S LTF
Sbjct: 127 AASLCTSLT-PNK-KCAVPQ-QCDYQIKYTDKASSLGVLIADNFTLSLRNSSTVRANLTF 183
Query: 130 GCGYNQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG 188
GCGY+Q T G+LGLG+G +S++SQL++ G+ +NV+GHC NG G LF G
Sbjct: 184 GCGYDQQVGKNGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCFSTNGGGFLFFG 243
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRV 248
D VP+S V W PM + ++ +Y G L + +S G+K + ++FDSG++YAYF +
Sbjct: 244 DDIVPTSRVTWVPMARTTSG-NYYSPGSGTLYFDRRSLGMKPMEVVFDSGSTYAYFAAEP 302
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFKPLALSFTNRRNSV 306
YQ VS + L + +++ D +LP+CW+G FK++ +V FK L LSF +NSV
Sbjct: 303 YQATVSALKAGLSKSLKEVS--DVSLPLCWKGQKVFKSVSEVKNDFKSLFLSFG--KNSV 358
Query: 307 RLVVPPEAYLVIS 319
+ +PPE YL+++
Sbjct: 359 -MEIPPENYLIVT 370
>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
Length = 357
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 135/313 (43%), Positives = 193/313 (61%), Gaps = 18/313 (5%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRCAALHWPN 81
+G P K + D DTGSDLTW+QCDAPC C K P Y+P N +VPC+N C ALH
Sbjct: 1 IGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPCANALCTALHSGQ 60
Query: 82 PPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLF--PLRFSNGSVFNVPLTFGCGYNQH-- 136
K P+ QCDY+I+Y D SS G L+ D F P+R SN LTFGCGY+Q
Sbjct: 61 GSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN---IRPGLTFGCGYDQQVG 117
Query: 137 -NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSS 195
N + D G+LGLGRG +S+VSQL++ G+ +NV+GHC+ NG G LF GD VPSS
Sbjct: 118 KNGAVQAAID--GMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGGFLFFGDDVVPSS 175
Query: 196 GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSL 255
V W PM Q ++ +Y G L + +S G+K + ++FDSG++Y YFT++ YQ +VS
Sbjct: 176 RVTWVPMAQRTSG-NYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTYFTAQPYQAVVSA 234
Query: 256 IMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
+ L + +++ D TLP+CW+G FK++ V FK + LSF + +N+ + +PPE
Sbjct: 235 LKGGLSKSLKQVS--DPTLPLCWKGQKAFKSVFDVKNEFKSMFLSFASAKNAA-MEIPPE 291
Query: 314 AYLVISVSTSIII 326
YL+++ + ++ +
Sbjct: 292 NYLIVTKNGNVCL 304
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 130/316 (41%), Positives = 196/316 (62%), Gaps = 11/316 (3%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPCSNPRC 74
++ V + +G P K + D DTGSDLTW+QCDAPC C K P Y+P KN +VPC+N C
Sbjct: 56 HYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKNKLVPCANSIC 115
Query: 75 AALHWPNPPRCK-HPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
ALH + P K QCDY+I+Y D SS+G LV D F L N S L+FGCGY
Sbjct: 116 TALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFSLPLRNKSNVRPSLSFGCGY 175
Query: 134 NQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
+Q +P T G+LGLGRG +S++SQL++ G+ +NV+GHC+ +G G LF GD V
Sbjct: 176 DQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTSGGGFLFFGDDMV 235
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEI 252
P+S V W M+++++ +Y G A L + +S K + ++FDSG++Y YF+++ YQ
Sbjct: 236 PTSRVTWVSMVRSTSG-NYYSPGSATLYFDRRSLSTKPMEVVFDSGSTYTYFSAQPYQAT 294
Query: 253 VSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVV 310
+S I L + +++ D +LP+CW+G FK++ V + FK +L F +N+V + +
Sbjct: 295 ISAIKGSLSKSLKQVS--DPSLPLCWKGQKAFKSVSDVKKDFK--SLQFIFGKNAV-MDI 349
Query: 311 PPEAYLVISVSTSIII 326
PPE YL+I+ + ++ +
Sbjct: 350 PPENYLIITKNGNVCL 365
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 127/326 (38%), Positives = 177/326 (54%), Gaps = 20/326 (6%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPRCA 75
+ + L +G PPKL+ D DTGSDLTW QCDAPC C P Y P K +V C P CA
Sbjct: 40 YYMALLLGSPPKLYFLDMDTGSDLTWAQCDAPCRNCAIGPHGLYNPKKAKVVDCHLPVCA 99
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+ C QCDYE+EY DG S++G LV D +R +NG++ GCGY+Q
Sbjct: 100 QIQQGGSYECNSDVKQCDYEVEYADGSSTMGVLVEDTLTVRLTNGTLIQTKAIIGCGYDQ 159
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDGKVP 193
SP T GV+GL ++++ +QL E G+I+NV+GHC+ G NG G LF GD VP
Sbjct: 160 QGTLAKSPASTDGVIGLSSSKVALPAQLAEKGIIKNVLGHCLADGSNGGGYLFFGDELVP 219
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL---KDLT-----LIFDSGASYAYFT 245
S G+ WTPM+ ++ Y + Y G S L +DLT ++FDSG S+ Y
Sbjct: 220 SWGMTWTPMM-GKPEMLGYQARLQSIRYGGDSLVLNNDEDLTRSTSSVMFDSGTSFTYLV 278
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRR 303
+ Y ++S + + + L D TLP CWRG PF+++ V +YFK L L F R
Sbjct: 279 PQAYASVLSAVTKQ---SGLLRVKSDTTLPYCWRGPSPFQSITDVHQYFKTLTLDFGGRN 335
Query: 304 ---NSVRLVVPPEAYLVISVSTSIII 326
L + P+ YL++S ++ +
Sbjct: 336 WFATDSTLDLSPQGYLIVSTQGNVCL 361
>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
Length = 446
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 127/328 (38%), Positives = 184/328 (56%), Gaps = 20/328 (6%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNP 72
+ + V + VG P K + D D+GS+LTW+QCDAPC C K P YK K ++VP +P
Sbjct: 76 YGLYYVTMLVGNPSKPYFLDVDSGSELTWIQCDAPCISCAKGPHPLYKLKKGSLVPSKDP 135
Query: 73 RCAAL-----HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
CAA+ H+ N K + +CDY++ Y D G S G LV D +N +V
Sbjct: 136 LCAAVQAGSGHYHNH---KEASQRCDYDVAYADHGYSEGFLVRDSVRALLTNKTVLTANS 192
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR--GVL 185
FGCGYNQ P+S T G+LGLG G S+ SQ + GLI+NVIGHCI GR G +
Sbjct: 193 VFGCGYNQRESLPVSDARTDGILGLGSGMASLPSQWAKQGLIKNVIGHCIFGAGRDGGYM 252
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGAS 240
F GD V +S + W PML + +KHY +G A++ + K G K +IFDSG++
Sbjct: 253 FFGDDLVSTSAMTWVPMLGRPS-IKHYYVGAAQMNFGNKPLDKDGDGKKLGGIIFDSGST 311
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFKPLALS 298
Y YFT++ Y +S++ +L G L+ D L +CWR F+++ + YFKPL L
Sbjct: 312 YTYFTNQAYGAFLSVVKENLSGKQLEQDSSDSFLSLCWRRKEGFRSVAEAAAYFKPLTLK 371
Query: 299 FTNRRNSVRLVVPPEAYLVISVSTSIII 326
F + + ++ + PE YLV++ ++ +
Sbjct: 372 FRSTKTK-QMEIFPEGYLVVNKKGNVCL 398
>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 381
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 129/336 (38%), Positives = 175/336 (52%), Gaps = 24/336 (7%)
Query: 4 SWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH 63
S + +P Y+ L +G P KL+ D DTGSDLTW+QCDAPC C P Y P
Sbjct: 11 SQLRGNIYPDGLYYMAML-IGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGPHGLYDPK 69
Query: 64 K-NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV 122
K +V C P CA + C P QCDY++EY DG S++G L+ D L +NG+
Sbjct: 70 KARLVDCRVPLCALVQQGGSYACGGPVRQCDYDVEYADGSSTMGVLMEDTITLLLTNGTR 129
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQN 180
GCGY+Q +P T GV+GL +IS+ SQL + G++RNVIGHC+ G N
Sbjct: 130 SKTTAIIGCGYDQQGTLAQTPASTDGVMGLSSAKISLPSQLAKKGIVRNVIGHCLAGGSN 189
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-----LIF 235
G G LF GD VP+ G+ WTP++ S I G GKS D T ++F
Sbjct: 190 GGGYLFFGDSLVPALGMTWTPIMGKS------ITGN----IGGKSGDADDKTGDIGGVMF 239
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFK 293
DSG S+ Y Y ++S + + + L D TLP CWRG PF+++ V YFK
Sbjct: 240 DSGTSFTYLVPEAYNAVLSAMEMQVEKSGLVRIKTDNTLPFCWRGPSPFESVADVQRYFK 299
Query: 294 PLALSFTNRR---NSVRLVVPPEAYLVISVSTSIII 326
+ L F R S L + PE YL++S ++ +
Sbjct: 300 TVTLDFGKRNWYSASRVLELSPEGYLIVSTQGNVCL 335
>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
Length = 410
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 129/320 (40%), Positives = 182/320 (56%), Gaps = 18/320 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH-KNIVPC 69
+PI +F V + +G P K + D DTGS LTW+QCD PC C K P YKP K V C
Sbjct: 33 YPIGHFF-VTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLYKPELKYAVKC 91
Query: 70 SNPRCAALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
+ RCA L+ P +C P +QC Y I+Y GGSSIG L+ D F L SNG+ +
Sbjct: 92 TEQRCADLYADLRKPMKCG-PKNQCHYGIQY-VGGSSIGVLIVDSFSLPASNGT-NPTSI 148
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLF 186
FGCGYNQ P G+LGLGRG+++++SQL+ G+I ++V+GHCI G+G LF
Sbjct: 149 AFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGFLF 208
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYI--LGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
GD KVP+SGV W+PM + + KHY G + + K + +IFDSGA+Y YF
Sbjct: 209 FGDAKVPTSGVTWSPM---NREHKHYSPRQGTLQFNSNSKPISAAPMEVIFDSGATYTYF 265
Query: 245 TSRVYQEIVSLIMRDLIGT---PLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF 299
+ Y +S++ L ++ D+ L +CW+G + + +V + F+ L+L F
Sbjct: 266 ALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVKKCFRSLSLKF 325
Query: 300 TNRRNSVRLVVPPEAYLVIS 319
+ L +PPE YL+IS
Sbjct: 326 ADGDKKATLEIPPEHYLIIS 345
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 122/323 (37%), Positives = 172/323 (53%), Gaps = 14/323 (4%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPRCA 75
+ + + +G P KL+ D DTGSDLTW+QCDAPC C P Y P + +V C P CA
Sbjct: 31 YYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPHGLYDPKRARVVDCRRPTCA 90
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
+ C QCDYE++Y DG S++G LV D L +NG+ F GCGY+Q
Sbjct: 91 QVQRGGQFTCSGDVRQCDYEVDYVDGSSTMGILVEDTITLVLTNGTRFQTRAVIGCGYDQ 150
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDGKVP 193
+P T GV+GL +IS+ SQL G+ NVIGHC+ G NG G LF GD VP
Sbjct: 151 QGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLAGGSNGGGYLFFGDTLVP 210
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-----LIFDSGASYAYFTSRV 248
+ G+ WTPM+ ++ Y + Y G+ L+ T +FDSG S+ Y
Sbjct: 211 ALGMTWTPMIGRPL-VEGYQARLRSIKYGGEVLELEGTTDDVGGAMFDSGTSFTYLVPNA 269
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF---TNRR 303
Y ++S ++R + L+ D TLP CWRG PF+++ V+ YFK + L F T
Sbjct: 270 YTAVLSAVVRQAQRSGLERIKTDTTLPFCWRGPSPFESVADVSAYFKTVTLDFGGSTWWS 329
Query: 304 NSVRLVVPPEAYLVISVSTSIII 326
+ L + PE YL++S ++ +
Sbjct: 330 SGKLLELSPEGYLIVSTQGNVCL 352
>gi|37542275|gb|AAK81698.1| aspartyl proteinase [Oryza sativa]
Length = 410
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 128/320 (40%), Positives = 181/320 (56%), Gaps = 18/320 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH-KNIVPC 69
+PI +F V + + P K + D DTGS LTW+QCD PC C K P YKP K V C
Sbjct: 33 YPIGHFF-VTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLYKPELKYAVKC 91
Query: 70 SNPRCAALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
+ RCA L+ P +C P +QC Y I+Y GGSSIG L+ D F L SNG+ +
Sbjct: 92 TEQRCADLYADLRKPMKCG-PKNQCHYGIQY-VGGSSIGVLIVDSFSLPASNGT-NPTSI 148
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLF 186
FGCGYNQ P G+LGLGRG+++++SQL+ G+I ++V+GHCI G+G LF
Sbjct: 149 AFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGFLF 208
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKS--CGLKDLTLIFDSGASYAYF 244
GD KVP+SGV W+PM + + KHY L ++ S + +IFDSGA+Y YF
Sbjct: 209 FGDAKVPTSGVTWSPM---NREHKHYSPRQGTLHFNSNSKPISAAPMEVIFDSGATYTYF 265
Query: 245 TSRVYQEIVSLIMRDLIGT---PLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSF 299
+ Y +S++ L ++ D+ L +CW+G + + +V + F+ L+L F
Sbjct: 266 ALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVKKCFRSLSLKF 325
Query: 300 TNRRNSVRLVVPPEAYLVIS 319
+ L +PPE YL+IS
Sbjct: 326 ADGDKKATLEIPPEHYLIIS 345
>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
Length = 583
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 173/321 (53%), Gaps = 15/321 (4%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
+P YF L VG PP+ + D DT SDLTW+QCDAPCT C K YKP + NIV
Sbjct: 203 YPDGLYFTYIL-VGNPPRPYYLDIDTASDLTWIQCDAPCTSCAKGANALYKPRRDNIVTP 261
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C LH QCDYEIEY D SS+G L D L +NGS N+ F
Sbjct: 262 KDSLCVELHRNQKAGYCETCQQCDYEIEYADHSSSMGVLARDELHLTMANGSSTNLKFNF 321
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFL 187
GC Y+Q + T G+LGL + ++S+ SQL G+I NV+GHC+ + G G +FL
Sbjct: 322 GCAYDQQGLLLNTLVKTDGILGLSKAKVSLPSQLANRGIINNVVGHCLANDVVGGGYMFL 381
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYA 242
GD VP G++W PML +S + Y +L Y L + ++FDSG+SY
Sbjct: 382 GDDFVPRWGMSWVPML-DSPSIDSYQTQIMKLNYGSGPLSLGGQERRVRRIVFDSGSSYT 440
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT 300
YFT Y E+V+ ++ + G L D TLP CWR P +++ V +YFK L L F
Sbjct: 441 YFTKEAYSELVA-SLKQVSGEALIQDTSDPTLPFCWRAKFPIRSVIDVKQYFKTLTLQFG 499
Query: 301 NR--RNSVRLVVPPEAYLVIS 319
++ S + +PPE YL+IS
Sbjct: 500 SKWWIISTKFRIPPEGYLIIS 520
>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
Length = 410
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 123/319 (38%), Positives = 176/319 (55%), Gaps = 18/319 (5%)
Query: 17 FAVNLTVGKPP--KLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPR 73
+ + VGKP + + D DTGS+LTW+QCDAPCT C K + YKP K N+V S
Sbjct: 30 YYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVRSSEAF 89
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C + QCDYEIEY D S+G L D F L+ NGS+ + FGCGY
Sbjct: 90 CVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVFGCGY 149
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFLGDGK 191
+Q + T G+LGL R +IS+ SQL G+I NV+GHC+ NG G +F+G
Sbjct: 150 DQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDL 209
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-----LIFDSGASYAYFTS 246
VPS G+ W PML +S L Y + ++ Y L ++FD+G+SY YF +
Sbjct: 210 VPSHGMTWVPMLHDSR-LDAYQMQVTKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYFPN 268
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG----PFKALGQVTEYFKPLALSFTNR 302
+ Y ++V+ ++++ G L D+TLPICWR PF +L V ++F+P+ L ++
Sbjct: 269 QAYSQLVT-SLQEVSGLELTRDDSDETLPICWRAKTNFPFSSLSDVKKFFRPITLQIGSK 327
Query: 303 --RNSVRLVVPPEAYLVIS 319
S +L++ PE YL+IS
Sbjct: 328 WLIISRKLLIQPEDYLIIS 346
>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
Length = 583
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 123/319 (38%), Positives = 176/319 (55%), Gaps = 18/319 (5%)
Query: 17 FAVNLTVGKPP--KLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPR 73
+ + VGKP + + D DTGS+LTW+QCDAPCT C K + YKP K N+V S
Sbjct: 203 YYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVRSSEAF 262
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C + QCDYEIEY D S+G L D F L+ NGS+ + FGCGY
Sbjct: 263 CVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVFGCGY 322
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFLGDGK 191
+Q + T G+LGL R +IS+ SQL G+I NV+GHC+ NG G +F+G
Sbjct: 323 DQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDL 382
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-----LIFDSGASYAYFTS 246
VPS G+ W PML +S L Y + ++ Y L ++FD+G+SY YF +
Sbjct: 383 VPSHGMTWVPMLHDSR-LDAYQMQVTKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYFPN 441
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG----PFKALGQVTEYFKPLALSFTNR 302
+ Y ++V+ ++++ G L D+TLPICWR PF +L V ++F+P+ L ++
Sbjct: 442 QAYSQLVT-SLQEVSGLELTRDDSDETLPICWRAKTNFPFSSLSDVKKFFRPITLQIGSK 500
Query: 303 --RNSVRLVVPPEAYLVIS 319
S +L++ PE YL+IS
Sbjct: 501 WLIISRKLLIQPEDYLIIS 519
>gi|37542277|gb|AAK81699.1| aspartyl proteinase [Oryza sativa]
Length = 411
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 127/321 (39%), Positives = 180/321 (56%), Gaps = 19/321 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH-KNIVPC 69
+PI +F V + + P K + D DTGS LTW+QCD PC C K P YKP K V C
Sbjct: 33 YPIGHFF-VTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLYKPELKYAVKC 91
Query: 70 SNPRCAALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
+ RCA L+ P +C P +QC Y I+Y GGSSIG L+ D F L SNG+ +
Sbjct: 92 TEQRCADLYADLRKPMKCG-PKNQCHYGIQY-VGGSSIGVLIVDSFSLPASNGT-NPTSI 148
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLF 186
FGCGYNQ P G+LGLGRG+++++SQL+ G+I ++V+GHCI G+G LF
Sbjct: 149 AFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGFLF 208
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKS---CGLKDLTLIFDSGASYAY 243
GD KVP+SGV W+PM + + KHY L ++ + +IFDSGA+Y Y
Sbjct: 209 FGDAKVPTSGVTWSPM---NREHKHYSPRQGTLHFNSNKQSPISAAPMEVIFDSGATYTY 265
Query: 244 FTSRVYQEIVSLIMRDLIGT---PLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALS 298
F + Y +S++ L ++ D+ L +CW+G + + +V + F+ L+L
Sbjct: 266 FALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVKKCFRSLSLK 325
Query: 299 FTNRRNSVRLVVPPEAYLVIS 319
F + L +PPE YL+IS
Sbjct: 326 FADGDKKATLEIPPEHYLIIS 346
>gi|115484513|ref|NP_001065918.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|122221757|sp|Q0IU52.1|ASP1_ORYSJ RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
gi|33340111|gb|AAQ14543.1|AF308691_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|33340113|gb|AAQ14544.1|AF308692_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|62954898|gb|AAY23267.1| nucellin-like protein [Oryza sativa Japonica Group]
gi|77548967|gb|ABA91764.1| Aspartic proteinase Asp1 precursor, putative, expressed [Oryza
sativa Japonica Group]
gi|113644622|dbj|BAF27763.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|215766817|dbj|BAG99045.1| unnamed protein product [Oryza sativa Japonica Group]
gi|385717694|gb|AFI71282.1| aspartic proteinase [Oryza sativa Japonica Group]
Length = 410
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 126/320 (39%), Positives = 181/320 (56%), Gaps = 18/320 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH-KNIVPC 69
+PI +F + + +G P K + D DTGS LTW+QCDAPCT C P YKP K +V C
Sbjct: 33 YPIGHFF-ITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLYKPTPKKLVTC 91
Query: 70 SNPRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
++ C L+ P RC QCDY I+Y D SS+G LV D F L SNG+ +
Sbjct: 92 ADSLCTDLYTDLGKPKRCG-SQKQCDYVIQYVD-SSSMGVLVIDRFSLSASNGT-NPTTI 148
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLF 186
FGCGY+Q P +LGL RG+++++SQL+ G+I ++V+GHCI G G LF
Sbjct: 149 AFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCISSKGGGFLF 208
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD--LTLIFDSGASYAYF 244
GD +VP+SGV WTPM + + K+Y G L + S + + +IFDSGA+Y YF
Sbjct: 209 FGDAQVPTSGVTWTPM---NREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDSGATYTYF 265
Query: 245 TSRVYQEIVSLIMRDLIGT---PLKLAPDDKTLPICWRGPFK--ALGQVTEYFKPLALSF 299
++ YQ +S++ L ++ D+ L +CW+G K + +V + F+ L+L F
Sbjct: 266 AAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGKDKIVTIDEVKKCFRSLSLEF 325
Query: 300 TNRRNSVRLVVPPEAYLVIS 319
+ L +PPE YL+IS
Sbjct: 326 ADGDKKATLEIPPEHYLIIS 345
>gi|222615640|gb|EEE51772.1| hypothetical protein OsJ_33215 [Oryza sativa Japonica Group]
Length = 775
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 124/315 (39%), Positives = 178/315 (56%), Gaps = 17/315 (5%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH-KNIVPCSNPRC 74
+F + + +G P K + D DTGS LTW+QCDAPCT C P YKP K +V C++ C
Sbjct: 402 HFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLYKPTPKKLVTCADSLC 461
Query: 75 AALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
L+ P RC QCDY I+Y D SS+G LV D F L SNG+ + FGCG
Sbjct: 462 TDLYTDLGKPKRCG-SQKQCDYVIQYVD-SSSMGVLVIDRFSLSASNGT-NPTTIAFGCG 518
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLFLGDGK 191
Y+Q P +LGL RG+++++SQL+ G+I ++V+GHCI G G LF GD +
Sbjct: 519 YDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCISSKGGGFLFFGDAQ 578
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD--LTLIFDSGASYAYFTSRVY 249
VP+SGV WTPM + + K+Y G L + S + + +IFDSGA+Y YF ++ Y
Sbjct: 579 VPTSGVTWTPM---NREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDSGATYTYFAAQPY 635
Query: 250 QEIVSLIMRDLIGT---PLKLAPDDKTLPICWRGPFK--ALGQVTEYFKPLALSFTNRRN 304
Q +S++ L ++ D+ L +CW+G K + +V + F+ L+L F +
Sbjct: 636 QATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGKDKIVTIDEVKKCFRSLSLEFADGDK 695
Query: 305 SVRLVVPPEAYLVIS 319
L +PPE YL+IS
Sbjct: 696 KATLEIPPEHYLIIS 710
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 97/236 (41%), Positives = 142/236 (60%), Gaps = 27/236 (11%)
Query: 91 QCDYEIEYGDGGSSIGALVTDLFPL-RFSNGSVFNVPLTFGCGYNQ---HNPGPLSPPDT 146
QCDYEI+Y DG S+IGAL+ D F L R + N+P FGCGYNQ N SP +
Sbjct: 28 QCDYEIKYADGASTIGALIVDQFSLPRIATRP--NLP--FGCGYNQGIGENFQQTSPVN- 82
Query: 147 AGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQN 205
G+LGL RG++S VSQL+ G+I ++V+GHC+ G G+LF+GDG +L +
Sbjct: 83 -GILGLDRGKVSFVSQLKMLGIITKHVVGHCLSSGGGGLLFVGDGD-------GNLVLLH 134
Query: 206 SADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPL 265
+ +Y G A L + S G+ + ++FDSG++Y YFT++ YQ V I L T L
Sbjct: 135 A---NYYSPGSATLYFDRHSLGMNPMDVVFDSGSTYTYFTAQPYQATVYAIKGGLSSTSL 191
Query: 266 KLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVIS 319
+ D +LP+CW+G F+++ V + FK L L+F N N+V + +PPE YL+++
Sbjct: 192 EQV-SDPSLPLCWKGQKAFESVFDVKKEFKSLQLNFGN--NAV-MEIPPENYLIVT 243
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 121/319 (37%), Positives = 174/319 (54%), Gaps = 18/319 (5%)
Query: 17 FAVNLTVGKPP--KLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPR 73
+ + VGKP + + D DTGSDLTW+QCDAPCT C K + YKP K N+V S P
Sbjct: 198 YYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKGANQLYKPRKDNLVRSSEPF 257
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
C + QCDYEIEY D S+G L D F L+ NGS+ + FGCGY
Sbjct: 258 CVEVQRNQLTEHCESCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVFGCGY 317
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFLGDGK 191
+Q + T G+LGL R +IS+ SQL G+I NV+GHC+ NG G +F+G
Sbjct: 318 DQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDL 377
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-----LIFDSGASYAYFTS 246
VPS G+ W PML + L+ Y + ++ Y L ++FD+G+SY YF +
Sbjct: 378 VPSHGMTWVPMLHH-PHLEVYQMQVTKMSYGNAMLSLDGENGRVGKVLFDTGSSYTYFPN 436
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG----PFKALGQVTEYFKPLALSFTNR 302
+ Y ++V+ ++++ L D+ LPICWR P +L V ++F+P+ L ++
Sbjct: 437 QAYSQLVT-SLQEVSDLELTRDDSDEALPICWRAKTNSPISSLSDVKKFFRPITLQIGSK 495
Query: 303 --RNSVRLVVPPEAYLVIS 319
S +L++ PE YL+IS
Sbjct: 496 WLIISKKLLIQPEDYLIIS 514
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 125/320 (39%), Positives = 179/320 (55%), Gaps = 14/320 (4%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
+P YF ++ VG PP+ + D DTGSDLTW+QCDAPCT C K P YKP K N+VP
Sbjct: 96 YPNGLYFT-HIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVPL 154
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C + +QCDYEIEY D SS+G L +D L +NGS+ + + F
Sbjct: 155 KDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKLGIMF 214
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFL 187
GC Y+Q S T G+LGL + ++S+ SQL +I NV+GHC+ + G G +FL
Sbjct: 215 GCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFL 274
Query: 188 GDGKVPSSGVAWTPMLQNSADLKH----YILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
GD VP G+AW PML + + H I + L G+ G + ++FD+G+SY Y
Sbjct: 275 GDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSLGRQDGRTE-RVVFDTGSSYTY 333
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN 301
F Y +V+ ++D+ L D TLP+CWR P +++ V ++F+PL L F +
Sbjct: 334 FPKEAYYALVA-SLKDVSDEGLIQDGSDPTLPVCWRAKFPIRSVIDVKQFFQPLTLQFRS 392
Query: 302 RR--NSVRLVVPPEAYLVIS 319
+ S + +PPE YL+IS
Sbjct: 393 KWWIVSTKFRIPPEGYLIIS 412
>gi|218185383|gb|EEC67810.1| hypothetical protein OsI_35379 [Oryza sativa Indica Group]
Length = 423
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 129/333 (38%), Positives = 182/333 (54%), Gaps = 31/333 (9%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-------------PE 57
+PI +F V + +G P K + D DTGS LTW+QCD PC C K P
Sbjct: 33 YPI-GHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKAHSLFYPRLIGSFVPH 91
Query: 58 KQYKPH-KNIVPCSNPRCAALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFP 114
YKP K V C+ RCA L+ P +C P +QC Y I+Y GGSSIG L+ D F
Sbjct: 92 GLYKPELKYAVKCTEQRCADLYADLRKPMKCG-PKNQCHYGIQY-VGGSSIGVLIVDSFS 149
Query: 115 LRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVI 173
L SNG+ + FGCGYNQ P G+LGLGRG+++++SQL+ G+I ++V+
Sbjct: 150 LPASNGT-NPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVL 208
Query: 174 GHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYI--LGPAELLYSGKSCGLKDL 231
GHCI G+G LF GD KVP+SGV W+PM + + KHY G + + K +
Sbjct: 209 GHCISSKGKGFLFFGDAKVPTSGVTWSPM---NREHKHYSPRQGTLQFNSNSKPISAAPM 265
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGT---PLKLAPDDKTLPICWRG--PFKALG 286
+IFDSGA+Y YF + Y +S++ L ++ D+ L +CW+G + +
Sbjct: 266 EVIFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTID 325
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVIS 319
+V + F+ L+L F + L +PPE YL+IS
Sbjct: 326 EVKKCFRSLSLKFADGDKKATLEIPPEHYLIIS 358
>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 686
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 125/320 (39%), Positives = 179/320 (55%), Gaps = 14/320 (4%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
+P YF ++ VG PP+ + D DTGSDLTW+QCDAPCT C K P YKP K N+VP
Sbjct: 309 YPNGLYFT-HIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVPL 367
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C + +QCDYEIEY D SS+G L +D L +NGS+ + + F
Sbjct: 368 KDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKLGIMF 427
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFL 187
GC Y+Q S T G+LGL + ++S+ SQL +I NV+GHC+ + G G +FL
Sbjct: 428 GCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFL 487
Query: 188 GDGKVPSSGVAWTPMLQNSADLKH----YILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
GD VP G+AW PML + + H I + L G+ G + ++FD+G+SY Y
Sbjct: 488 GDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSLGRQDGRTE-RVVFDTGSSYTY 546
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN 301
F Y +V+ ++D+ L D TLP+CWR P +++ V ++F+PL L F +
Sbjct: 547 FPKEAYYALVA-SLKDVSDEGLIQDGSDPTLPVCWRAKFPIRSVIDVKQFFQPLTLQFRS 605
Query: 302 RR--NSVRLVVPPEAYLVIS 319
+ S + +PPE YL+IS
Sbjct: 606 KWWIVSTKFRIPPEGYLIIS 625
>gi|356522749|ref|XP_003530008.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1336
Score = 210 bits (534), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 129/324 (39%), Positives = 177/324 (54%), Gaps = 21/324 (6%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
+P YF + L VG PPK + D DTGSDLTW+QCDAPC C K QYKP + N+V
Sbjct: 189 YPDGLYFTI-LRVGNPPKSYFLDVDTGSDLTWMQCDAPCRSCGKGAHVQYKPTRSNVVSS 247
Query: 70 SNPRCAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
+ C + N H QCDYEI+Y D SS+G LV D L +NGS + +
Sbjct: 248 VDSLCLDVQ-KNQKNGHHDESLLQCDYEIQYADHSSSLGVLVRDELHLVTTNGSKTKLNV 306
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR--GVL 185
FGCGY+Q + T G++GL R ++S+ QL GLI+NV+GHC+ +G G +
Sbjct: 307 VFGCGYDQEGLILNTLAKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHCLSNDGAGGGYM 366
Query: 186 FLGDGKVPSSGVAWTPMLQN-SADLKHYIL-----GPAELLYSGKSCGLKDLTLIFDSGA 239
FLGD VP G+ W PM + DL + G +L + G+S K + FDSG+
Sbjct: 367 FLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLKFDGQS---KVGKVFFDSGS 423
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPF--KALGQVTEYFKPLAL 297
SY YF Y ++V+ + ++ G L D TLPICW+ F +++ V +YFK L L
Sbjct: 424 SYTYFPKEAYLDLVA-SLNEVSGLGLVQDDSDTTLPICWQANFQIRSIKDVKDYFKTLTL 482
Query: 298 SFTNR--RNSVRLVVPPEAYLVIS 319
F ++ S +PPE YL+IS
Sbjct: 483 RFGSKWWILSTLFQIPPEGYLIIS 506
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 126/317 (39%), Positives = 176/317 (55%), Gaps = 21/317 (6%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA---PCTGCTKPPEKQYKPHKNIVPCSNP 72
+F V + +G+P K + D DTGS+LTW++C A PC C K P Y+P K +VPC++P
Sbjct: 39 HFYVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTCNKVPHPLYRP-KKLVPCADP 97
Query: 73 RCAALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C ALH C+ DQC Y+I Y DG +S+G L+ D F L GS N+ FG
Sbjct: 98 LCDALHKDLGTTKDCREEPDQCHYQINYADGTTSLGVLLLDKFSL--PTGSARNI--AFG 153
Query: 131 CGYNQHNPGPLSPPDTA---GVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLF 186
CGY+Q P+ G+LGLGRG + +VSQL+ G + +NVIGHC+ G G LF
Sbjct: 154 CGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVSKNVIGHCLSSKGGGYLF 213
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTS 246
+G+ VPSS + + S + HY G A L G K IFDSG++Y Y
Sbjct: 214 IGEENVPSSHLHIIYIYCISREPNHYSPGQATLHLGRNPIGTKPFKAIFDSGSTYTYLPE 273
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPD-DKTLPICWRG--PFKALGQVTEYFKPL-ALSFTNR 302
++ ++VS + LI + LKL D D L +CW+G PFK + + + FK L L F
Sbjct: 274 NLHAQLVSALKASLIKSSLKLVSDTDTRLHLCWKGPKPFKTVHDLPKEFKSLVTLKFD-- 331
Query: 303 RNSVRLVVPPEAYLVIS 319
+ V + +PPE YL+I+
Sbjct: 332 -HGVTMTIPPENYLIIT 347
>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
Length = 557
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 127/328 (38%), Positives = 172/328 (52%), Gaps = 19/328 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
FP Y+ ++ VG PP+ + D DTGSDLTW+QCDAPCT C K P YKP K IVP
Sbjct: 182 FPDGQYY-TSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPTKEKIVPP 240
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C L N C+ QCDYEIEY D SS+G L D L +NG + F
Sbjct: 241 RDLLCQELQ-GNQNYCETCK-QCDYEIEYADQSSSMGVLARDDMHLIATNGGREKLDFVF 298
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFL 187
GC Y+Q SP T G+LGL IS+ SQL +G+I N+ GHCI Q G G +FL
Sbjct: 299 GCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCITREQGGGGYMFL 358
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD-----LTLIFDSGASYA 242
GD VP G+ WT + +L H + Y + +++ + +IFDSG+SY
Sbjct: 359 GDDYVPRWGITWTSIRSGPDNLYH--TEAHHVKYGDQQLRMREQAGNTVQVIFDSGSSYT 416
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT 300
Y +Y+ +V+ I G D+TLP+CW+ P + L V ++FKPL L F
Sbjct: 417 YLPDEIYENLVAAIKYASPG--FVQDSSDRTLPLCWKADFPVRYLEDVKQFFKPLNLHFG 474
Query: 301 NRR--NSVRLVVPPEAYLVISVSTSIII 326
+ S + PE YL+IS ++ +
Sbjct: 475 KKWLFMSKTFTISPEDYLIISDKGNVCL 502
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 129/330 (39%), Positives = 172/330 (52%), Gaps = 23/330 (6%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
FP Y+ ++ VG PP+ + D DTGSDLTW+QCDAPCT C K P YKP K IVP
Sbjct: 189 FPDGQYY-TSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPP 247
Query: 70 SNPRCAALHWPNP--PRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
+ C L CK QCDYEIEY D SS+G L D + +NG +
Sbjct: 248 RDLLCQELQGDQNYCATCK----QCDYEIEYADRSSSMGVLAKDDMHMIATNGGREKLDF 303
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVL 185
FGC Y+Q SP T G+LGL IS+ SQL G+I NV GHCI + NG G +
Sbjct: 304 VFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCITKEPNGGGYM 363
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKH-----YILGPAELLYSGKSCGLKDLTLIFDSGAS 240
FLGD VP G+ W P+ +L H G +L G++ + +IFDSG+S
Sbjct: 364 FLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQQLRMHGQAG--SSIQVIFDSGSS 421
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPF--KALGQVTEYFKPLALS 298
Y Y +Y+++V+ I D D TLP+CW+ F + L V ++FKPL L
Sbjct: 422 YTYLPDEIYKKLVTAIKYDY--PSFVQDTSDTTLPLCWKADFDVRYLEDVKQFFKPLNLH 479
Query: 299 FTNRRNSV--RLVVPPEAYLVISVSTSIII 326
F NR + + P+ YL+IS ++ +
Sbjct: 480 FGNRWFVIPRTFTILPDDYLIISDKGNVCL 509
>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1388
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 128/324 (39%), Positives = 176/324 (54%), Gaps = 21/324 (6%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
+P YF + L VG PPK + D DTGSDLTW+QCDAPC C K YKP + N+V
Sbjct: 187 YPDGLYFTI-LRVGNPPKSYFLDVDTGSDLTWMQCDAPCISCGKGAHVLYKPTRSNVVSS 245
Query: 70 SNPRCAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
+ C + N H QCDYEI+Y D SS+G LV D L +NGS + +
Sbjct: 246 VDALCLDVQ-KNQKNGHHDESLLQCDYEIQYADHSSSLGVLVRDELHLVTTNGSKTKLNV 304
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR--GVL 185
FGCGY+Q + T G++GL R ++S+ QL GLI+NV+GHC+ +G G +
Sbjct: 305 VFGCGYDQAGLLLNTLGKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHCLSNDGAGGGYM 364
Query: 186 FLGDGKVPSSGVAWTPMLQN-SADLKHYIL-----GPAELLYSGKSCGLKDLTLIFDSGA 239
FLGD VP G+ W PM + DL + G +L + G+S K ++FDSG+
Sbjct: 365 FLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLRFDGQS---KVGKMVFDSGS 421
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLAL 297
SY YF Y ++V+ + ++ G L D TLPICW+ P K++ V +YFK L L
Sbjct: 422 SYTYFPKEAYLDLVA-SLNEVSGLGLVQDDSDTTLPICWQANFPIKSVKDVKDYFKTLTL 480
Query: 298 SFTNR--RNSVRLVVPPEAYLVIS 319
F ++ S + PE YL+IS
Sbjct: 481 RFGSKWWILSTLFQISPEGYLIIS 504
>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
Length = 603
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 127/344 (36%), Positives = 170/344 (49%), Gaps = 46/344 (13%)
Query: 20 NLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPRCAALH 78
NL PP+ + DFDTGSDLTW+QCDAPCT C K YKP + NIVP + C +
Sbjct: 193 NLYPDGPPQPYYLDFDTGSDLTWIQCDAPCTSCAKGANAWYKPRRGNIVPPKDLLCMEVQ 252
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNP 138
DQCDYEIEY D SS+G L TD L +NGS+ + FGC Y+Q
Sbjct: 253 RNQKAGYCETCDQCDYEIEYADHSSSMGVLATDKLLLMVANGSLTKLNFIFGCAYDQQGL 312
Query: 139 GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFLGDGKVPSSG 196
+ T G+LGL R ++S+ SQL G+I NVIGHC+ + G G +FLGD VP G
Sbjct: 313 LLKTLVKTDGILGLSRAKVSLPSQLASQGIINNVIGHCLTTDLGGGGYMFLGDDFVPRWG 372
Query: 197 VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-----LIFDSGASYAYFTSRVYQE 251
+AW PML +S ++ Y +L Y L + ++FDSG+SY YF Y E
Sbjct: 373 MAWVPML-DSPSMEFYHTEVVKLNYGSSPLSLGGMESRVKHILFDSGSSYTYFPKEAYSE 431
Query: 252 IVSLIMRDLIGTPLKLAPDDKTLPICWRGPF----------------------------- 282
+V+ + ++ G L + D TLP+CWR F
Sbjct: 432 LVA-SLNEVSGAGLVQSTSDTTLPLCWRANFPIRKFIYRTELTRPIRRRRRRRRRRRRRR 490
Query: 283 -----KALGQVTEYFKPLALSFTNR--RNSVRLVVPPEAYLVIS 319
G V ++FK L F + S + +PPE YL++S
Sbjct: 491 RRRRQHIKGDVKKFFKTLTFQFGTKWLVISTKFRIPPEGYLMMS 534
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 127/327 (38%), Positives = 172/327 (52%), Gaps = 17/327 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
FP Y+ ++ +G PP+ + D DTGSDLTW+QCDAPCT C K P YKP K IVP
Sbjct: 182 FPDGQYY-TSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPP 240
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C L N C+ QCDYEIEY D SS+G L D + +NG + F
Sbjct: 241 RDLLCQELQG-NQNYCETCK-QCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLDFVF 298
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFL 187
GC Y+Q SP T G+LGL IS SQL +G+I NV GHCI Q G G +FL
Sbjct: 299 GCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGGYMFL 358
Query: 188 GDGKVPSSGVAWTPMLQNSADL----KHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
GD VP GV WT + +L H++ + L + G + +IFDSG+SY Y
Sbjct: 359 GDDYVPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAG-STVQVIFDSGSSYTY 417
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN 301
+ +Y+ +V+ I G D+TLP+CW+ P + L V ++F+PL L F
Sbjct: 418 LPNEIYENLVAAIKYASPG--FVQDTSDRTLPLCWKADFPVRYLEDVKQFFEPLNLHFGK 475
Query: 302 RR--NSVRLVVPPEAYLVISVSTSIII 326
+ S + PE YL+IS ++ +
Sbjct: 476 KWLFMSKTFTISPEDYLIISDKGNVCL 502
>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 570
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 121/318 (38%), Positives = 173/318 (54%), Gaps = 16/318 (5%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPRCAALHW 79
+ VG+PP+ + D DTGSDLTWVQCDAPC+ C K YKP + N+V + C +
Sbjct: 203 IMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGKGRSPLYKPRRENVVSFKDSLCMEVQR 262
Query: 80 PNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPG 139
QC+YE++Y D SS+G LV D F LRFSNGS+ + FGC Y+Q
Sbjct: 263 NYDGDQCAACQQCNYEVQYADQSSSLGVLVKDEFTLRFSNGSLTKLNAIFGCAYDQQGLL 322
Query: 140 PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFLGDGKVPSSGV 197
+ T G+LGL R ++S+ SQL G+I NV+GHC+ + G G LFLGD VP G+
Sbjct: 323 LNTLSKTDGILGLSRAKVSLPSQLASRGIINNVVGHCLTGDPAGGGYLFLGDDFVPQWGM 382
Query: 198 AWTPMLQNSADLKHYILGPAELLY-----SGKSCGLKDLTLIFDSGASYAYFTSRVYQEI 252
AW ML +S + Y + Y S + G ++FDSG+SY YFT Y ++
Sbjct: 383 AWVAML-DSPSIDFYQTKVVRIDYGSIPLSLDTWGSSREQVVFDSGSSYTYFTKEAYYQL 441
Query: 253 VSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFKPLALSFTNR--RNSVRL 308
V+ + + L D + ICW+ +++ V +FKPL L F +R S +L
Sbjct: 442 VANLEE---VSAFGLILQDSSDTICWKTEQSIRSVKDVKHFFKPLTLQFGSRFWLVSTKL 498
Query: 309 VVPPEAYLVISVSTSIII 326
V+ PE YL+I+ ++ +
Sbjct: 499 VILPENYLLINKEGNVCL 516
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 128/327 (39%), Positives = 166/327 (50%), Gaps = 27/327 (8%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
FP Y+ ++ VG PP+ + D DTGSDLTW+QCDAPCT C K P YKP K IVP
Sbjct: 186 FPDGQYY-TSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPP 244
Query: 70 SNPRCAALHWPNP--PRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
+ C L CK QCDYEIEY D SS+G L D L +NG +
Sbjct: 245 RDSLCQELQGDQNYCETCK----QCDYEIEYADRSSSMGVLAKDDMHLIATNGGREKLDF 300
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVL 185
FGC Y+Q SP T G+LGL IS+ SQL G+I NV GHCI + NG G +
Sbjct: 301 VFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRETNGGGYM 360
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPA----ELLYSGKSCGLKDLTLIFDSGASY 241
FLGD VP G+ W P+ +L H + L++G S + +IFDSG+SY
Sbjct: 361 FLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGNS-----VQVIFDSGSSY 415
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
Y +Y+ ++ I D D TLP+CW+ F V +FKPL L F
Sbjct: 416 TYLPEEMYKNLIDAIKED--SPSFVQDSSDTTLPLCWKADF----SVRSFFKPLNLHFGR 469
Query: 302 RRNSV--RLVVPPEAYLVISVSTSIII 326
R V + P+ YL+IS ++ +
Sbjct: 470 RWFVVPKTFTIVPDDYLIISDKGNVCL 496
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 126/327 (38%), Positives = 171/327 (52%), Gaps = 17/327 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
FP Y+ ++ +G PP+ + D DTGSDLTW+QCDAPCT K P YKP K IVP
Sbjct: 182 FPDGQYY-TSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNFAKGPHPLYKPAKEKIVPP 240
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C L N C+ QCDYEIEY D SS+G L D + +NG + F
Sbjct: 241 RDLLCQELQG-NQNYCETCK-QCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLDFVF 298
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFL 187
GC Y+Q SP T G+LGL IS SQL +G+I NV GHCI Q G G +FL
Sbjct: 299 GCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGGYMFL 358
Query: 188 GDGKVPSSGVAWTPMLQNSADL----KHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
GD VP GV WT + +L H++ + L + G + +IFDSG+SY Y
Sbjct: 359 GDDYVPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAG-STVQVIFDSGSSYTY 417
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTN 301
+ +Y+ +V+ I G D+TLP+CW+ P + L V ++F+PL L F
Sbjct: 418 LPNEIYENLVAAIKYASPG--FVQDTSDRTLPLCWKADFPVRYLEDVKQFFEPLNLHFGK 475
Query: 302 RR--NSVRLVVPPEAYLVISVSTSIII 326
+ S + PE YL+IS ++ +
Sbjct: 476 KWLFMSKTFTISPEDYLIISDKGNVCL 502
>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
Length = 538
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 122/328 (37%), Positives = 169/328 (51%), Gaps = 19/328 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
FP Y+ ++ +G PP+ + D DTGSDLTW+QCDAPCT C K P YKP K N+VP
Sbjct: 154 FPDGQYY-TSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNVVPP 212
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C L + QCDYEI Y D SS+G L D L ++G N+ F
Sbjct: 213 RDSYCQELQGNQ--NYGDTSKQCDYEITYADRSSSMGILARDNMQLITADGERENLDFVF 270
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFL 187
GCGY+Q SP +T G+LGL IS+ +QL G+I NV GHCI + G +FL
Sbjct: 271 GCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPSNGGYMFL 330
Query: 188 GDGKVPSSGVAWTPMLQN-----SADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
GD VP G+ W P+ S +++ G +L K+ L +IFDSG+SY
Sbjct: 331 GDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLT--QVIFDSGSSYT 388
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT 300
Y Y +++ + + D+TLP C + P +++ V FKPL+L F
Sbjct: 389 YLPHDDYTNLIASLKSLSPSLLQDES--DRTLPFCMKPNFPVRSMDDVKHLFKPLSLVFK 446
Query: 301 NRRNSV--RLVVPPEAYLVISVSTSIII 326
R + V+PPE YL+IS +I +
Sbjct: 447 KRLFILPRTFVIPPEDYLIISDKNNICL 474
>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 538
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 122/328 (37%), Positives = 169/328 (51%), Gaps = 19/328 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
FP Y+ ++ +G PP+ + D DTGSDLTW+QCDAPCT C K P YKP K N+VP
Sbjct: 154 FPDGQYY-TSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNVVPP 212
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C L + QCDYEI Y D SS+G L D L ++G N+ F
Sbjct: 213 RDSYCQELQ--GNQNYGDTSKQCDYEITYADRSSSMGILARDNMQLITADGERENLDFVF 270
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFL 187
GCGY+Q SP +T G+LGL IS+ +QL G+I NV GHCI + G +FL
Sbjct: 271 GCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPSNGGYMFL 330
Query: 188 GDGKVPSSGVAWTPMLQN-----SADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
GD VP G+ W P+ S +++ G +L K+ L +IFDSG+SY
Sbjct: 331 GDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLT--QVIFDSGSSYT 388
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT 300
Y Y +++ + + D+TLP C + P +++ V FKPL+L F
Sbjct: 389 YLPHDDYTNLIASLKSLSPSLLQDES--DRTLPFCMKPNFPVRSMDDVKHLFKPLSLVFK 446
Query: 301 NRRNSV--RLVVPPEAYLVISVSTSIII 326
R + V+PPE YL+IS +I +
Sbjct: 447 KRLFILPRTFVIPPEDYLIISDKNNICL 474
>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 395
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 119/315 (37%), Positives = 162/315 (51%), Gaps = 18/315 (5%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPCSNPRCA 75
+ ++ +G PP+ + D DTGSD TW+ CDAPCT CTK P YKP + IV +P C
Sbjct: 16 YYTSINIGNPPRPYFLDIDTGSDFTWIHCDAPCTNCTKGPHPVYKPTEGKIVHPRDPLCE 75
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
L N C+ QCDYEI Y D SS G L D L ++G + NV FGC +NQ
Sbjct: 76 ELQG-NQNYCETCK-QCDYEITYADRSSSKGVLARDNMQLTTADGEMKNVDFVFGCAHNQ 133
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFLGDGKVP 193
SP T G+LGL G IS+ +QL G+I NV GHC+ + G +FLGD VP
Sbjct: 134 QGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMATDPSSGGYMFLGDDYVP 193
Query: 194 SSGVAWTPMLQN-----SADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRV 248
G+ W P+ S ++ G EL G++ L +IFDSG+SY YF +
Sbjct: 194 RWGMTWVPIRNGPGNVYSTEVPKVNYGAQELNLRGQAGKLTQ--VIFDSGSSYTYFPHEI 251
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSV 306
Y +++L+ G D+TLP C + P +++G V + F PL L R +
Sbjct: 252 YTNLIALLEDASPG--FVRDESDQTLPFCMKPNVPVRSVGDVEQLFNPLILQLRKRWFVI 309
Query: 307 --RLVVPPEAYLVIS 319
+ PE YL+IS
Sbjct: 310 PTTFAISPENYLIIS 324
>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
Length = 408
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 123/319 (38%), Positives = 173/319 (54%), Gaps = 23/319 (7%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQC---DAPCTGCTKPPEKQYK-PHKNIVPCSN 71
+F V + +G+P + + D DTGS TW++C D PC C K P Y+ K +VPC++
Sbjct: 38 HFYVTMNIGEPAEPYFLDIDTGSSFTWLECHAKDGPCKTCNKVPHPLYRLTRKKLVPCAD 97
Query: 72 PRCAALHWP--NPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
P C ALH +C +QCDY+++Y DG SS+G L+ D F L G N+
Sbjct: 98 PLCDALHKDLGTTKKCTDVRKNQCDYKVKYQDGLSSLGVLLLDKFSL--PTGGARNI--A 153
Query: 129 FGCGYNQHNPGPLSPPDTA---GVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGV 184
FGCGY+Q P+ G+LGLGRG + + SQL+ G + +NVIGHC+ G G
Sbjct: 154 FGCGYDQMKGSKKKAPEKVPVDGILGLGRGSVDLASQLKHSGAVSKNVIGHCLSSKGGGY 213
Query: 185 LFLGDGKVPSSGVAWTPMLQNS-ADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
LF+G+ VPSS V W PM + + HY G A L G K L IFDSG++Y Y
Sbjct: 214 LFIGEENVPSSHVTWVPMAPTTPGEPNHYSPGQATLHLDSNPIGTKPLKAIFDSGSTYTY 273
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPL-ALSFT 300
++ ++VS + L + LK D LP+CW+G PFK + + FK L L F
Sbjct: 274 LPENLHAQLVSALKASLSKSSLKQV-SDPALPLCWKGPKPFKTVHDTPKEFKSLVTLKFD 332
Query: 301 NRRNSVRLVVPPEAYLVIS 319
V +++PPE YL+I+
Sbjct: 333 ---LGVTMIIPPENYLIIT 348
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 129/336 (38%), Positives = 171/336 (50%), Gaps = 20/336 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
FP Y+ ++ VG PP+ + D DTGSDLTW+QCDAPCT C K P YKP K IVP
Sbjct: 198 FPDGQYY-TSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPP 256
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C L N C+ QCDYEIEY D SS+G L D + +NG + F
Sbjct: 257 KDLLCQELQG-NQNYCETCK-QCDYEIEYADRSSSMGVLARDDMHIITTNGGREKLDFVF 314
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFL 187
GC Y+Q SP T G+LGL IS+ SQL G+I NV GHCI + NG G +FL
Sbjct: 315 GCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDPNGGGYMFL 374
Query: 188 GDGKVPSSGVAWTPMLQNSADLKH-----YILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
GD VP G+ TP+ +L H G +L G S + +IFDSG+SY
Sbjct: 375 GDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASG--NSVQVIFDSGSSYT 432
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT 300
Y +Y+ +++ I D+TLP+C P + L V + FKPL L F
Sbjct: 433 YLPDEIYKNLIAAIKYAYPN--FVQDSSDRTLPLCLATDFPVRYLEDVKQLFKPLNLHFG 490
Query: 301 NRRNSV--RLVVPPEAYLVISVSTSIIIIAYLTGKS 334
R + + P+ YL+IS + + + +L GK
Sbjct: 491 KRWFVMPRTFTILPDNYLIISDKGN-VCLGFLNGKD 525
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 129/336 (38%), Positives = 171/336 (50%), Gaps = 20/336 (5%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-IVPC 69
FP Y+ ++ VG PP+ + D DTGSDLTW+QCDAPCT C K P YKP K IVP
Sbjct: 199 FPDGQYY-TSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPP 257
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C L N C+ QCDYEIEY D SS+G L D + +NG + F
Sbjct: 258 KDLLCQELQG-NQNYCETCK-QCDYEIEYADRSSSMGVLARDDMHIITTNGGREKLDFVF 315
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFL 187
GC Y+Q SP T G+LGL IS+ SQL G+I NV GHCI + NG G +FL
Sbjct: 316 GCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDPNGGGYMFL 375
Query: 188 GDGKVPSSGVAWTPMLQNSADLKH-----YILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
GD VP G+ TP+ +L H G +L G S + +IFDSG+SY
Sbjct: 376 GDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASG--NSVQVIFDSGSSYT 433
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT 300
Y +Y+ +++ I D+TLP+C P + L V + FKPL L F
Sbjct: 434 YLPDEIYKNLIAAIKYAYPN--FVQDSSDRTLPLCLATDFPVRYLEDVKQLFKPLNLHFG 491
Query: 301 NRRNSV--RLVVPPEAYLVISVSTSIIIIAYLTGKS 334
R + + P+ YL+IS + + + +L GK
Sbjct: 492 KRWFVMPRTFTILPDNYLIISDKGN-VCLGFLNGKD 526
>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 116/328 (35%), Positives = 175/328 (53%), Gaps = 25/328 (7%)
Query: 6 IEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT----KPPEKQYK 61
+E +P+ ++A L +G+P K + D DTGS+LTW++C P GC +PP Y
Sbjct: 28 LEGNVYPVGHFYAT-LNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYYT 86
Query: 62 PHKN--IVPCSNPRCAALHW--PNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPL 115
P V C +P C A+ P P C ND +C YEI+Y G S G L TD+ +
Sbjct: 87 PADGNLKVVCGSPLCVAVRRDVPGIPECSR-NDPHRCHYEIQYVTGKSE-GDLATDIISV 144
Query: 116 RFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIG 174
+ + FGCGY Q P P G+LGLG G+ + +QL+ + +I+ NVIG
Sbjct: 145 NGRDKKR----IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIG 200
Query: 175 HCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTL 233
HC+ G+GVL++GD P+ GV W PM ++ L +Y G AE+ + G
Sbjct: 201 HCLSSKGKGVLYVGDFNPPTRGVTWAPMRES---LFYYSPGLAEVFIDKQPIRGNPTFEA 257
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEY 291
+FDSG++Y + +++Y EIVS + L + L+ + LP+CW+G PF ++ V
Sbjct: 258 VFDSGSTYTHVPAQIYNEIVSKVRVTLSESSLEEV-KGRALPLCWKGKKPFGSVNDVKNQ 316
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVIS 319
FK L+L T+ R + L +PP+ YL +
Sbjct: 317 FKALSLKITHARGTSNLDIPPQNYLFVK 344
>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 120/325 (36%), Positives = 167/325 (51%), Gaps = 27/325 (8%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-NIVPC 69
P Y+ ++ +G P + + D DTGS LTW+QCDAPCT CTK P YKP K NIVP
Sbjct: 124 LPERQYY-TSINIGNPARPYFLDVDTGSALTWIQCDAPCTNCTKGPHPLYKPAKENIVPP 182
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C L N C QCDYEI Y D SS G L D L ++G N+ L F
Sbjct: 183 RDSHCQELQG-NQNYCDTCK-QCDYEIAYADRSSSAGVLARDNMELITADGERENMDLVF 240
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFL 187
GC ++Q SP + G+LGL G +S+ +QL + G+I NV GHCI + G +FL
Sbjct: 241 GCAHDQQGKLLGSPASSDGILGLSNGAMSLPTQLAKQGIISNVFGHCIATDPSGSAYMFL 300
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYIL-----GPAELLYSGKSCGLKDLTLIFDSGASYA 242
GD VP G+ W P+ D+ ++ G EL ++ L +IFDSG+SY
Sbjct: 301 GDDYVPRWGMTWVPVRNGPEDVYSTVVQKVNYGCQELNVREQAGKLTQ--VIFDSGSSYT 358
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFT 300
YF +Y +++ + + + D+TLP C + P +++ V + KPL L F+
Sbjct: 359 YFPHEIYTSLITSL--EAVSPGFVRDESDQTLPFCMKPNFPVRSVDDVKQLHKPLLLHFS 416
Query: 301 NRRNSVRLVVP------PEAYLVIS 319
LV+P PE YL+IS
Sbjct: 417 K----TWLVIPRTFEISPENYLIIS 437
>gi|2290202|gb|AAB96882.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|2290204|gb|AAB96883.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|45357050|gb|AAS58479.1| nucellin [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 116/328 (35%), Positives = 174/328 (53%), Gaps = 25/328 (7%)
Query: 6 IEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT----KPPEKQYK 61
+E +P+ ++A L +G+P K + D DTGS+LTW++C P GC +PP Y
Sbjct: 28 LEGNVYPVGHFYAT-LNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYYT 86
Query: 62 PHKN--IVPCSNPRCAALHW--PNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPL 115
P V C +P C A+ P P C ND +C YEI+Y G S G L TD+ +
Sbjct: 87 PADGNLKVVCGSPLCVAVRRDVPGIPECSR-NDPHRCHYEIQYVTGKSE-GDLATDIISV 144
Query: 116 RFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIG 174
+ + FGCGY Q P P G+LGLG G+ +QL+ + +I+ NVIG
Sbjct: 145 NGRDKKR----IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVIG 200
Query: 175 HCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTL 233
HC+ G+GVL++GD P+ GV W PM ++ L +Y G AE+ + G
Sbjct: 201 HCLSSKGKGVLYVGDFNPPTRGVTWAPMRES---LFYYSPGLAEVFIDKQPIRGNPTFEA 257
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEY 291
+FDSG++Y + +++Y EIVS + L + L+ + LP+CW+G PF ++ V
Sbjct: 258 VFDSGSTYTHVPAQIYNEIVSKVRGTLSESSLEEV-KGRALPLCWKGKKPFGSVNDVKNQ 316
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVIS 319
FK L+L T+ R + L +PP+ YL +
Sbjct: 317 FKALSLKITHARGTNNLDIPPQNYLFVK 344
>gi|388518245|gb|AFK47184.1| unknown [Lotus japonicus]
Length = 245
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 90/175 (51%), Positives = 125/175 (71%), Gaps = 6/175 (3%)
Query: 148 GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSA 207
G+LGLGRG+ S+VSQL GL+RNV+GHC+ G G +F GD SS + WTPM +S
Sbjct: 14 GMLGLGRGKSSLVSQLNSQGLVRNVVGHCLSAQGGGYIFFGD-VYDSSRLTWTPM--SSR 70
Query: 208 DLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKL 267
DLKHY+ G AEL++ GK G+ L +FD+G+SY YF S YQ ++S + ++L G PLK
Sbjct: 71 DLKHYVAGAAELIFGGKKTGIGGLLPVFDTGSSYTYFNSNAYQAVISWLKKELAGKPLKE 130
Query: 268 APDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNR-RNSVRLVVPPEAYLVIS 319
APDD+TLP+CW G PF+++ +V +YFK +ALSFT+ R + + +PPEAYL++S
Sbjct: 131 APDDQTLPLCWHGKRPFRSVYEVRKYFKSMALSFTSSGRTNTQFEIPPEAYLIVS 185
>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 535
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 116/333 (34%), Positives = 164/333 (49%), Gaps = 31/333 (9%)
Query: 10 FFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP-CTGCTKPPEKQYKPHK--NI 66
FP Y+ +++G PP+ + D DTGS TWVQCDAP C C K Y+P + +
Sbjct: 154 LFPEGLYYTA-ISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAHPLYRPARTADA 212
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
+P S+P C NP +QCDYEI Y DG SS+G V D +G N
Sbjct: 213 LPASDPLCEGAQHENP-------NQCDYEISYADGSSSMGVYVRDSMQFVGEDGERENAD 265
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN---GRG 183
+ FGCGY+Q + T GVLGL +S+ +QL G+I N GHC+ + G
Sbjct: 266 IVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPSGAGG 325
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSAD------LKHYILGPAELLYSGKSCGLKDLTLIFDS 237
LFLGD +P G+ W P+ AD +K G +L GK ++FD+
Sbjct: 326 YLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLT-----QVVFDT 380
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTLPICWRG--PFKALGQVTEYFKP 294
G++Y YF ++S + +P + D DKTLP C + P +++ V +FKP
Sbjct: 381 GSTYTYFPDEALTRLISSLKE--AASPRFVQDDSDKTLPFCMKSDFPVRSVEDVKHFFKP 438
Query: 295 LALSFTNRRNSVRLV-VPPEAYLVISVSTSIII 326
L+L F R R + PE YLVIS ++ +
Sbjct: 439 LSLQFEKRFFFSRTFNIRPEHYLVISDKGNVCL 471
>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 177 bits (448), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 116/328 (35%), Positives = 173/328 (52%), Gaps = 25/328 (7%)
Query: 6 IEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT----KPPEKQYK 61
+E +P+ ++A L +G+P K + D DTGS+LTW++C P GC +PP Y
Sbjct: 28 LEGNVYPVGHFYAT-LNIGEPAKPYFLDVDTGSNLTWLECHPPVHGCKGCHPRPPHPYYT 86
Query: 62 PH--KNIVPCSNPRCAALHW--PNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPL 115
P K V C +P C A+ P P C ND +C YEI+Y G S G L TD+ +
Sbjct: 87 PADGKLKVVCGSPLCVAVRRDVPGIPECSR-NDPHRCHYEIQYVTGKSE-GDLATDIISV 144
Query: 116 RFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIG 174
+ + FGCGY Q P P G+LGLG G+ +QL+ +I+ NVIG
Sbjct: 145 NGRDKK----RIAFGCGYKQEEPPDSPPSPVNGILGLGMGKAGFAAQLKGLKMIKENVIG 200
Query: 175 HCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTL 233
HC+ G+GVL++GD P+ GV W PM ++ L +Y G AE+ + G
Sbjct: 201 HCLSSKGKGVLYVGDFNPPTRGVTWAPMRES---LFYYSPGLAEVFIDKQPIRGNPTFEA 257
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEY 291
+FDSG++Y + +++Y EIVS + + L+ + LP+CW+G PF ++ V
Sbjct: 258 VFDSGSTYTHVPAQIYNEIVSKVRGTFSESSLEEV-KGRALPLCWKGKKPFGSVNDVKNQ 316
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVIS 319
FK L+L T+ R + L +PP+ YL +
Sbjct: 317 FKALSLKITHARGTNNLDIPPQNYLFVK 344
>gi|357152725|ref|XP_003576216.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like,
partial [Brachypodium distachyon]
Length = 354
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 110/310 (35%), Positives = 158/310 (50%), Gaps = 48/310 (15%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
+ V +++G+ K + D DTGS LTW++ + ++K
Sbjct: 35 HIYVTMSIGEQEKPYFLDIDTGSTLTWLE------------DVRFKHD------------ 70
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
CK +QCDY++ Y G SS+G L+ D F L G LTFGCGY+Q
Sbjct: 71 ---------CKENPNQCDYDVRYAGGESSLGVLIADKFSL---PGRDARPTLTFGCGYDQ 118
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLFLGDGKVPS 194
P D GVLG+GRG + SQL++ G I NVIGHC+ G G LF G KVPS
Sbjct: 119 EGGKAEMPVD--GVLGIGRGTRDLASQLKQQGAIAENVIGHCLRIQGGGYLFFGHEKVPS 176
Query: 195 SGVAWTPMLQNSADLKHYILGPAELLYS---GKSCGLKDLTLIFDSGASYAYFTSRVYQE 251
S V W PM+ N+ +Y G A L ++ G + + ++ DSG++Y Y + Y+
Sbjct: 177 SVVTWVPMVPNN---HYYSPGLAALHFNGNLGNPISVAPMEVVIDSGSTYTYMPTETYRR 233
Query: 252 IVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLV 309
+V +++ L + L L D LP+CW G PFK +G V + FKPL L+F + +
Sbjct: 234 LVFVVIASLSKSSLTLV-RDPALPVCWAGKEPFKXIGDVKDKFKPLELAFIQGTSQAIME 292
Query: 310 VPPEAYLVIS 319
+PPE YL+IS
Sbjct: 293 IPPENYLIIS 302
>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 406
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 90/247 (36%), Positives = 123/247 (49%), Gaps = 25/247 (10%)
Query: 10 FFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP-CTGCTKPPEKQYKPHK--NI 66
FP Y+ +++G PP+ + D DTGS TWVQCDAP C C K Y+P + +
Sbjct: 154 LFPEGLYYTA-ISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAHPLYRPARTADA 212
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
+P S+P C NP +QCDYEI Y DG SS+G V D +G N
Sbjct: 213 LPASDPLCEGAQHENP-------NQCDYEISYADGSSSMGVYVRDSMQFVGEDGERENAD 265
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV-- 184
+ FGCGY+Q + T GVLGL +S+ +QL G+I N GHC+ + G
Sbjct: 266 IVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPSGAGG 325
Query: 185 -LFLGDGKVPSSGVAWTPMLQNSAD------LKHYILGPAELLYSGKSCGLKDLTLIFDS 237
LFLGD +P G+ W P+ AD +K G +L GK ++FD+
Sbjct: 326 YLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLT-----QVVFDT 380
Query: 238 GASYAYF 244
G++Y YF
Sbjct: 381 GSTYTYF 387
>gi|62954897|gb|AAY23266.1| Similar to nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|77548966|gb|ABA91763.1| Aspartic proteinase Asp1 precursor, putative [Oryza sativa Japonica
Group]
Length = 307
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 93/252 (36%), Positives = 132/252 (52%), Gaps = 54/252 (21%)
Query: 91 QCDYEIEYGDGGSSIGALVTDLFPL-RFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGV 149
QCDYEI+Y DG S+IGAL+ D F L R + N+P FGCGYNQ
Sbjct: 28 QCDYEIKYADGASTIGALIVDQFSLPRIATRP--NLP--FGCGYNQ-------------- 69
Query: 150 LGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVLFLGDG------------------ 190
G+G S L+ G+I ++V+GHC+ G G+LF+GDG
Sbjct: 70 -GIGE-NFQQTSPLKMLGIITKHVVGHCLSSGGGGLLFVGDGDGNLVLLHASLGSLCPIA 127
Query: 191 -KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVY 249
PSS PML N +Y G A L + S G+ + ++FDSG++Y YFT++ Y
Sbjct: 128 ISTPSS--YNEPMLMN-----YYSPGSATLYFDRHSLGMNPMDVVFDSGSTYTYFTAQPY 180
Query: 250 QEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLALSFTNRRNSVR 307
Q V I L T L+ D +LP+CW+G F+++ V + FK L L+F N N+V
Sbjct: 181 QATVYAIKGGLSSTSLEQV-SDPSLPLCWKGQKAFESVFDVKKEFKSLQLNFGN--NAV- 236
Query: 308 LVVPPEAYLVIS 319
+ +PPE YL+++
Sbjct: 237 MEIPPENYLIVT 248
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 92/329 (27%), Positives = 137/329 (41%), Gaps = 43/329 (13%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKP 62
P + + +G P + F+ DTGSD+ WV C +PC GC +
Sbjct: 79 PFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTC-SPCDGCPDSSGLGIELNLFDTTKSS 137
Query: 63 HKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTD--LFPLRFSNG 120
++PC++P CAA+ +C D C Y Y D + G VTD F +
Sbjct: 138 SARVLPCTDPICAAVS-TTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGES 196
Query: 121 SVFN--VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI- 177
++ N + FGC Q+ + G+ G G+G S++SQL G+ V HC+
Sbjct: 197 TIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLK 256
Query: 178 -GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL----- 231
G+NG G+L LG+ PS + ++P++ + HY L + SG+ +
Sbjct: 257 GGENGGGILVLGEILEPS--IVYSPLIPSQ---PHYTLKLQSIALSGQLFPNPTMFPISN 311
Query: 232 --TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQ 287
I DSG + AY VY IVS+I A P RG F+
Sbjct: 312 AGETIIDSGTTLAYLVEEVYDWIVSVITS---------AVSQSATPTISRGSQCFRVSMS 362
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
V + F L +F +VV PE YL
Sbjct: 363 VADIFPVLRFNF---EGIASMVVTPEEYL 388
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 92/329 (27%), Positives = 137/329 (41%), Gaps = 43/329 (13%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKP 62
P + + +G P + F+ DTGSD+ WV C +PC GC +
Sbjct: 79 PFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTC-SPCDGCPDSSGLGIELNLFDTTKSS 137
Query: 63 HKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTD--LFPLRFSNG 120
++PC++P CAA+ +C D C Y Y D + G VTD F +
Sbjct: 138 SARVLPCTDPICAAVS-TTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGES 196
Query: 121 SVFN--VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI- 177
++ N + FGC Q+ + G+ G G+G S++SQL G+ V HC+
Sbjct: 197 TIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLK 256
Query: 178 -GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL----- 231
G+NG G+L LG+ PS + ++P++ + HY L + SG+ +
Sbjct: 257 GGENGGGILVLGEILEPS--IVYSPLIPSQ---PHYTLKLQSIALSGQLFPNPTMFPISN 311
Query: 232 --TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQ 287
I DSG + AY VY IVS+I A P RG F+
Sbjct: 312 AGETIIDSGTTLAYLVEEVYDWIVSVITS---------AVSQSATPTISRGSQCFRVSMS 362
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
V + F L +F +VV PE YL
Sbjct: 363 VADIFPVLRFNF---EGIASMVVTPEEYL 388
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 84/278 (30%), Positives = 129/278 (46%), Gaps = 37/278 (13%)
Query: 6 IEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP--------- 56
+ F+F + L +G PP+ F DTGSD+ WV C + C GC
Sbjct: 79 VGFYFGSFCRLYYTRLQLGSPPRDFYVQIDTGSDVLWVSCSS-CNGCPVSSGLHIPLNFF 137
Query: 57 EKQYKPHKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR 116
+ P +++ CS+ RC+ + C N+QC Y +YGDG + G V+DL L
Sbjct: 138 DPGSSPTASLISCSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDL--LH 195
Query: 117 FSN---GSVF---NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGL 168
F GSV + P+ FGC Q G L+ PD A G+ G G+ +S++SQL G+
Sbjct: 196 FDTILGGSVMKNSSAPIVFGCSTLQ--TGDLTKPDRAVDGIFGFGQQDMSVISQLASQGI 253
Query: 169 IRNVIGHCI--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC 226
V HC+ +G G+L LG+ P+ + +TP++ + HY L + +G++
Sbjct: 254 TPRVFSHCLKGDDSGGGILVLGEIVEPN--IVYTPLVPSQ---PHYNLNLQSIYVNGQTL 308
Query: 227 GL--------KDLTLIFDSGASYAYFTSRVYQEIVSLI 256
+ + I DSG + AY T Y +S I
Sbjct: 309 AIDPSVFATSSNQGTIIDSGTTLAYLTEAAYDPFISAI 346
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 89/314 (28%), Positives = 136/314 (43%), Gaps = 37/314 (11%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC---------TKPPEKQYKPH 63
I + + +G PP+ ++ DTGSDL WV C PC GC P + +
Sbjct: 32 IAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCH-PCIGCPAFSDLKIPIVPYDVKASAS 90
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ VPCS+P C + + C N QC Y +YGDG ++G LV D+ + +
Sbjct: 91 SSKVPCSDPSCTLITQISESGCNDQN-QCGYSFQYGDGSGTLGYLVEDVLHYMVNATAT- 148
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNG 181
+ FGCG+ Q S G++G G +S SQL + G NV HC+ G+ G
Sbjct: 149 ---VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERG 205
Query: 182 RGVLFLGDGKVPSSGVAWTP----MLQNSADLKHYILGPAELLYSGKSCGLKDLT-LIFD 236
G+L LG+ P + +TP M + L+ + A L K + IFD
Sbjct: 206 GGILVLGNVIEPD--IQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLFSNDVMQGTIFD 263
Query: 237 SGASYAYFTSRVYQ---EIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
SG + AY YQ + VSL++ + +L+ R +K V YF+
Sbjct: 264 SGTTLAYLPDEAYQAFTQAVSLVVAPFLLCDTRLS----------RFIYKLFPNVVLYFE 313
Query: 294 PLALSFTNRRNSVR 307
+++ T +R
Sbjct: 314 GASMTLTPAEYLIR 327
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 93/342 (27%), Positives = 148/342 (43%), Gaps = 48/342 (14%)
Query: 6 IEFFFFPIFSYFAVNL-----TVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQY 60
++F F F V L +G PP F+ DTGSD+ WV C++ C+GC + Q
Sbjct: 9 VDFSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNS-CSGCPQTSGLQI 67
Query: 61 KPH---------KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTD 111
+ + +++ CS+ RC + C N+QC Y +YGDG + G V+D
Sbjct: 68 QLNFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSD 127
Query: 112 LFPLR-FSNGSVFN---VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLRE 165
+ L GSV P+ FGC Q G L+ D A G+ G G+ +S++SQL
Sbjct: 128 MMHLNTIFEGSVTTNSTAPVVFGCSNQQ--TGDLTKSDRAVDGIFGFGQQEMSVISQLSS 185
Query: 166 YGLIRNVIGHCI--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSG 223
G+ V HC+ +G G+L LG+ P+ + +T ++ HY L + +G
Sbjct: 186 QGIAPRVFSHCLKGDSSGGGILVLGEIVEPN--IVYTSLV---PAQPHYNLNLQSIAVNG 240
Query: 224 KSCGLKDLTL--------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP 275
++ + I DSG + AY Y VS I + P +
Sbjct: 241 QTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASI---PQSVHTAVSRGN 297
Query: 276 ICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
C+ VTE F ++L+F +++ P+ YL+
Sbjct: 298 QCYL----ITSSVTEVFPQVSLNFA---GGASMILRPQDYLI 332
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 93/344 (27%), Positives = 148/344 (43%), Gaps = 52/344 (15%)
Query: 6 IEFFFFPIFSYFAVNL-----TVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQY 60
++F F F V L +G PP F+ DTGSD+ WV C++ C+GC + Q
Sbjct: 59 VDFSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNS-CSGCPQTSGLQI 117
Query: 61 KPH---------KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTD 111
+ + +++ CS+ RC + C N+QC Y +YGDG + G V+D
Sbjct: 118 QLNFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSD 177
Query: 112 LFPLR-FSNGSVFN---VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLRE 165
+ L GSV P+ FGC Q G L+ D A G+ G G+ +S++SQL
Sbjct: 178 MMHLNTIFEGSVTTNSTAPVVFGCSNQQ--TGDLTKSDRAVDGIFGFGQQEMSVISQLSS 235
Query: 166 YGLIRNVIGHCI--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSG 223
G+ V HC+ +G G+L LG+ P+ + +T ++ HY L + +G
Sbjct: 236 QGIAPRVFSHCLKGDSSGGGILVLGEIVEPN--IVYTSLVPAQ---PHYNLNLQSIAVNG 290
Query: 224 KSCGLKDLTL--------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP 275
++ + I DSG + AY Y VS I +
Sbjct: 291 QTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASI---------PQSVHT 341
Query: 276 ICWRGP--FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+ RG + VTE F ++L+F +++ P+ YL+
Sbjct: 342 VVSRGNQCYLITSSVTEVFPQVSLNFA---GGASMILRPQDYLI 382
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 90/329 (27%), Positives = 140/329 (42%), Gaps = 53/329 (16%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC---------TKPPEKQYKPH 63
I + + +G PP+ ++ DTGSDL WV C PC GC P + +
Sbjct: 32 IAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCH-PCIGCPAFSDLKIPIVPYDVKASAS 90
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ VPCS+P C + + C N QC Y +YGDG ++G LV D+ + +
Sbjct: 91 SSKVPCSDPSCTLITQISESGCNDQN-QCGYSFQYGDGSGTLGYLVEDVLHYMVNATAT- 148
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNG 181
+ FGCG+ Q S G++G G +S SQL + G NV HC+ G+ G
Sbjct: 149 ---VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERG 205
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-------- 233
G+L LG+ P + +TP++ + HY ++ S +LT+
Sbjct: 206 GGILVLGNVIEPD--IQYTPLVPY---MSHY-----NVVLQSISVNNANLTIDPKLFSND 255
Query: 234 -----IFDSGASYAYFTSRVYQ---EIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
IFDSG + AY YQ + VSL++ + +L+ R +K
Sbjct: 256 VMQGTIFDSGTTLAYLPDEAYQAFTQAVSLVVAPFLLCDTRLS----------RFIYKLF 305
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEA 314
V YF+ +++ T +R A
Sbjct: 306 PNVVLYFEGASMTLTPAEYLIRQASAANA 334
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 88/329 (26%), Positives = 144/329 (43%), Gaps = 48/329 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
YF + +G PP+ F+ DTGSD+ WV C + C+ C + + +
Sbjct: 81 YF-TRVKLGTPPREFNVQIDTGSDVLWVTCSS-CSNCPQTSGLGIQLNYFDTTSSSTARL 138
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF--- 123
VPCS+P C + +C ++QC Y +YGDG + G V+D F G
Sbjct: 139 VPCSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIAN 198
Query: 124 -NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIG-- 178
+ + FGC + + G L+ D A G+ G G+G +S++SQL +G+ V HC+
Sbjct: 199 SSAAIVFGC--STYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLKGE 256
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL--------KD 230
+G G+L LG+ P G+ ++P++ + HY L + SG+ + +
Sbjct: 257 DSGGGILVLGEILEP--GIVYSPLVPSQ---PHYNLDLQSIAVSGQLLPIDPAAFATSSN 311
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQV 288
I D+G + AY Y VS I A P +G + V
Sbjct: 312 RGTIIDTGTTLAYLVEEAYDPFVSAITA---------AVSQLATPTINKGNQCYLVSNSV 362
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+E F P++ +F +++ PE YL+
Sbjct: 363 SEVFPPVSFNFA---GGATMLLKPEEYLM 388
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 91/344 (26%), Positives = 146/344 (42%), Gaps = 52/344 (15%)
Query: 6 IEFFFFPIFSYFAVNL-----TVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQY 60
++F F F V L +G PP F+ DTGSD+ WV C++ C GC + Q
Sbjct: 62 VDFSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNS-CNGCPQTSGLQI 120
Query: 61 KPH---------KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTD 111
+ + +++ CS+ RC + C N+QC Y +YGDG + G V+D
Sbjct: 121 QLNFFDPGSSSTSSMIACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSD 180
Query: 112 LFPLR--FSNGSVFN--VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLRE 165
+ L F N P+ FGC Q G L+ D A G+ G G+ +S++SQL
Sbjct: 181 MMHLNTIFEGSMTTNSTAPVVFGCSNQQ--TGDLTKSDRAVDGIFGFGQQEMSVISQLSS 238
Query: 166 YGLIRNVIGHCI--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSG 223
G+ + HC+ +G G+L LG+ P+ + +T ++ HY L + +G
Sbjct: 239 QGIAPRIFSHCLKGDSSGGGILVLGEIVEPN--IVYTSLVPAQ---PHYNLNLQSISVNG 293
Query: 224 KSCGLKDLTL--------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP 275
++ + I DSG + AY Y VS I A
Sbjct: 294 QTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAI---------TAAIPQSVRT 344
Query: 276 ICWRGP--FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+ RG + VT+ F ++L+F +++ P+ YL+
Sbjct: 345 VVSRGNQCYLITSSVTDVFPQVSLNFA---GGASMILRPQDYLI 385
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 88/316 (27%), Positives = 141/316 (44%), Gaps = 38/316 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C K + +++P + + C N
Sbjct: 75 YYTTRLWIGTPPQEFALIVDTGSTVTYVPC-STCKQCGKHQDPKFQPELSTSYQALKC-N 132
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
P C C C YE Y + SS G L DL + F N S + FG
Sbjct: 133 PDC---------NCDDEGKLCVYERRYAEMSSSSGVLSEDL--ISFGNESQLSPQRAVFG 181
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C G L G++GLGRG++S+V QL + G+I +V C G + G G + LG
Sbjct: 182 C--ENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLG 239
Query: 189 DGKVPSSGVAWTPMLQNSADLK--HYILGPAELLYSGKSCGLKDLTL------IFDSGAS 240
P V +S + +Y + ++ +GKS L + DSG +
Sbjct: 240 KISPPPGMV-----FSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTT 294
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
YAYF + I +++++ PD +C+ G + + ++ +F +A+ F
Sbjct: 295 YAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFG 354
Query: 301 NRRNSVRLVVPPEAYL 316
N + +L++ PE YL
Sbjct: 355 NGQ---KLILSPENYL 367
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 101/340 (29%), Positives = 147/340 (43%), Gaps = 51/340 (15%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----------PEKQ 59
F + YF + +G PPK + DTGSD+ WV C +PCTGC P+
Sbjct: 86 FMVGLYF-TRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQLEFFNPDTS 143
Query: 60 YKPHKNIVPCSNPRCAALHWPNPPRCK-HPNDQCDYEIEYGDGGSSIGALVTDL--FPLR 116
K +PCS+ RC A + C+ N C Y YGDG + G V+D F
Sbjct: 144 STSSK--IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTV 201
Query: 117 FSNGSVFN--VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNV 172
N N + FGC +Q G L+ D A G+ G G+ ++S+VSQL G+ V
Sbjct: 202 MGNEQTANSSASIVFGCSNSQS--GDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKV 259
Query: 173 IGHCI--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD 230
HC+ NG G+L LG+ P G+ +TP++ + HY L ++ +G+ + D
Sbjct: 260 FSHCLKGSDNGGGILVLGEIVEP--GLVYTPLVPSQ---PHYNLNLESIVVNGQKLPI-D 313
Query: 231 LTL---------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP 281
+L I DSG + AY Y V+ I ++P ++L
Sbjct: 314 SSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAIT-------AAVSPSVRSLVSKGNQC 366
Query: 282 FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISVS 321
F V F ++L F V + V PE YL+ S
Sbjct: 367 FVTSSSVDSSFPTVSLYFM---GGVAMTVKPENYLLQQAS 403
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 88/316 (27%), Positives = 141/316 (44%), Gaps = 38/316 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C K + +++P + + C N
Sbjct: 75 YYTTRLWIGTPPQEFALIVDTGSTVTYVPC-STCKQCGKHQDPKFQPELSTSYQALKC-N 132
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
P C C C YE Y + SS G L DL + F N S + FG
Sbjct: 133 PDC---------NCDDEGKLCVYERRYAEMSSSSGVLSEDL--ISFGNESQLSPQRAVFG 181
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C G L G++GLGRG++S+V QL + G+I +V C G + G G + LG
Sbjct: 182 C--ENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLG 239
Query: 189 DGKVPSSGVAWTPMLQNSADLK--HYILGPAELLYSGKSCGLKDLTL------IFDSGAS 240
P V +S + +Y + ++ +GKS L + DSG +
Sbjct: 240 KISPPPGMV-----FSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTT 294
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
YAYF + I +++++ PD +C+ G + + ++ +F +A+ F
Sbjct: 295 YAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFG 354
Query: 301 NRRNSVRLVVPPEAYL 316
N + +L++ PE YL
Sbjct: 355 NGQ---KLILSPENYL 367
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 101/340 (29%), Positives = 147/340 (43%), Gaps = 51/340 (15%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----------PEKQ 59
F + YF + +G PPK + DTGSD+ WV C +PCTGC P+
Sbjct: 86 FMVGLYF-TRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQLEFFNPDTS 143
Query: 60 YKPHKNIVPCSNPRCAALHWPNPPRCK-HPNDQCDYEIEYGDGGSSIGALVTDL--FPLR 116
K +PCS+ RC A + C+ N C Y YGDG + G V+D F
Sbjct: 144 STSSK--IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSV 201
Query: 117 FSNGSVFN--VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNV 172
N N + FGC +Q G L+ D A G+ G G+ ++S+VSQL G+ V
Sbjct: 202 MGNEQTANSSASIVFGCSNSQS--GDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKV 259
Query: 173 IGHCI--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD 230
HC+ NG G+L LG+ P G+ +TP++ + HY L ++ +G+ + D
Sbjct: 260 FSHCLKGSDNGGGILVLGEIVEP--GLVYTPLVPSQ---PHYNLNLESIVVNGQKLPI-D 313
Query: 231 LTL---------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP 281
+L I DSG + AY Y V+ I ++P ++L
Sbjct: 314 SSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAIT-------AAVSPSVRSLVSKGNQC 366
Query: 282 FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISVS 321
F V F ++L F V + V PE YL+ S
Sbjct: 367 FVTSSSVDSSFPTVSLYFM---GGVAMTVKPENYLLQQAS 403
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 100/335 (29%), Positives = 145/335 (43%), Gaps = 51/335 (15%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----------PEKQYKPHK 64
YF + +G PPK + DTGSD+ WV C +PCTGC P+ K
Sbjct: 117 YF-TRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQLEFFNPDTSSTSSK 174
Query: 65 NIVPCSNPRCAALHWPNPPRCK-HPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGS 121
+PCS+ RC A + C+ N C Y YGDG + G V+D F N
Sbjct: 175 --IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQ 232
Query: 122 VFN--VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
N + FGC +Q G L+ D A G+ G G+ ++S+VSQL G+ V HC+
Sbjct: 233 TANSSASIVFGCSNSQS--GDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL 290
Query: 178 --GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-- 233
NG G+L LG+ P G+ +TP++ + HY L ++ +G+ + D +L
Sbjct: 291 KGSDNGGGILVLGEIVEP--GLVYTPLVPSQ---PHYNLNLESIVVNGQKLPI-DSSLFT 344
Query: 234 -------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
I DSG + AY Y V+ I ++P ++L F
Sbjct: 345 TSNTQGTIVDSGTTLAYLADGAYDPFVNAIT-------AAVSPSVRSLVSKGNQCFVTSS 397
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISVS 321
V F ++L F V + V PE YL+ S
Sbjct: 398 SVDSSFPTVSLYFM---GGVAMTVKPENYLLQQAS 429
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 90/327 (27%), Positives = 142/327 (43%), Gaps = 44/327 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
YF + +G PP+ F+ DTGSD+ WV C++ C C + +
Sbjct: 66 YFT-KVKLGSPPREFNVQIDTGSDVLWVCCNS-CNNCPRTSGLGIQLNFFDSSSSSTAGQ 123
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF--- 123
V CS+P C + +C DQC Y +YGDG + G V+D G
Sbjct: 124 VRCSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDN 183
Query: 124 -NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ + FGC + + G L+ D A G+ G G+G +S++SQL G+ V HC+ +
Sbjct: 184 SSALIVFGC--SAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCLKGD 241
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------- 233
G G L G++ G+ ++P++ + HY L + +G+ +
Sbjct: 242 GSGGGILVLGEILEPGIVYSPLVPSQ---PHYNLNLLSIAVNGQLLPIDPAAFATSNSQG 298
Query: 234 -IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTE 290
I DSG + AY + Y VS + + I +P PI +G + V++
Sbjct: 299 TIVDSGTTLAYLVAEAYDPFVSAV--NAIVSP-------SVTPITSKGNQCYLVSTSVSQ 349
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLV 317
F PLA SF N +V+ PE YL+
Sbjct: 350 MF-PLA-SF-NFAGGASMVLKPEDYLI 373
>gi|226530663|ref|NP_001146528.1| uncharacterized protein LOC100280120 [Zea mays]
gi|219887685|gb|ACL54217.1| unknown [Zea mays]
Length = 292
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 75/228 (32%), Positives = 108/228 (47%), Gaps = 20/228 (8%)
Query: 105 IGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR 164
+G V D +G N + FGCGY+Q + T GVLGL +S+ +QL
Sbjct: 1 MGVYVRDSMQFVGEDGERENADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLA 60
Query: 165 EYGLIRNVIGHCIGQN---GRGVLFLGDGKVPSSGVAWTPMLQNSAD------LKHYILG 215
G+I N GHC+ + G LFLGD +P G+ W P+ AD +K G
Sbjct: 61 SRGIISNAFGHCMSTDPSGAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHG 120
Query: 216 PAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTL 274
+L GK ++FD+G++Y YF ++S + +P + D DKTL
Sbjct: 121 DQQLNAQGKLT-----QVVFDTGSTYTYFPDEALTRLISSLKE--AASPRFVQDDSDKTL 173
Query: 275 PICWRG--PFKALGQVTEYFKPLALSFTNRRNSVRLV-VPPEAYLVIS 319
P C + P +++ V +FKPL+L F R R + PE YLVIS
Sbjct: 174 PFCMKSDFPVRSVEDVKHFFKPLSLQFEKRFFFSRTFNIRPEHYLVIS 221
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 88/316 (27%), Positives = 140/316 (44%), Gaps = 38/316 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C K + +++P + + C N
Sbjct: 79 YYTTRLWIGTPPQEFALIVDTGSTVTYVPC-STCKQCGKHQDPKFQPELSSSYKALKC-N 136
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
P C C C YE Y + SS G L DL + F N S FG
Sbjct: 137 PDC---------NCDDEGKLCVYERRYAEMSSSSGVLSEDL--ISFGNESQLTPQRAVFG 185
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C G L G++GLGRG++S+V QL + G+I +V C G + G G + LG
Sbjct: 186 C--ENVETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLG 243
Query: 189 DGKVPSSGVAWTPMLQNSADLK--HYILGPAELLYSGKSCGLKDLTL------IFDSGAS 240
P+ V +S + +Y + ++ +GKS L + DSG +
Sbjct: 244 KISPPAGMV-----FSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTT 298
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
YAYF + I I++++ PD +C+ G + + ++ +F + + F
Sbjct: 299 YAYFPKEAFIAIKDAIIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIDMEFG 358
Query: 301 NRRNSVRLVVPPEAYL 316
N + +L++ PE YL
Sbjct: 359 NGQ---KLILSPENYL 371
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 101/341 (29%), Positives = 146/341 (42%), Gaps = 51/341 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKNI---- 66
YF + +G P K F DTGSD+ WV C +PCTGC + + P +
Sbjct: 5 YF-TRVKLGNPAKEFFVQIDTGSDILWVTC-SPCTGCPTSSGLNIQLESFNPDSSSTASR 62
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQ---CDYEIEYGDGGSSIGALVTD--LFPLRFSNGS 121
+ CS+ RC A C+ N Q C Y YGDG + G V+D F N
Sbjct: 63 ITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ 122
Query: 122 VFN--VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
N + FGC +Q G L+ D A G+ G G+ ++S++SQL G+ V HC+
Sbjct: 123 TANSSASIVFGCSNSQ--SGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL 180
Query: 178 --GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-- 233
NG G+L LG+ P G+ +TP++ + HY L + +G+ + D +L
Sbjct: 181 KGSDNGGGILVLGEIVEP--GLVYTPLVPSQ---PHYNLNLESIAVNGQKLPI-DSSLFT 234
Query: 234 -------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
I DSG + AY Y VS I ++P ++L F
Sbjct: 235 TSNTQGTIVDSGTTLAYLADGAYDPFVSAI-------AAAVSPSVRSLVSKGSQCFITSS 287
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLV--ISVSTSII 325
V F + L F V + V PE YL+ SV S++
Sbjct: 288 SVDSSFPTVTLYF---MGGVAMSVKPENYLLQQASVDNSVL 325
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 141/333 (42%), Gaps = 27/333 (8%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
SYF L +G P + F DTGS +T++ C C+ C K + + P K+ + C
Sbjct: 11 SYFYTTLKLGTPERTFSVIIDTGSTITYIPC-KDCSHCGKHTAEWFDPDKSTTAKKLACG 69
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+P C P C ND+C Y Y + SS G ++ D F S+ V L FG
Sbjct: 70 DPLCNC----GTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPV---RLVFG 122
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
C G + G++G+G + SQL + +I +V C G G+L LGD
Sbjct: 123 C--ENGETGEIYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYPKDGILLLGDV 180
Query: 191 KVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL------KDLTLIFDSGASYAY 243
+P + +TP+L + L +Y + + +G++ + + DSG ++ Y
Sbjct: 181 TLPEGANTVYTPLLTH-LHLHYYNVKMDGITVNGQTLAFDASVFDRGYGTVLDSGTTFTY 239
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAP--DDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
+ ++ + + + L+ P D + ICW+G + +YF P F
Sbjct: 240 LPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAPDQFKDLDKYFPPAEFVFG- 298
Query: 302 RRNSVRLVVPPEAYLVISVSTSIIIIAYLTGKS 334
+L +PP YL +S + + G S
Sbjct: 299 --GGAKLTLPPLRYLFLSKPAEYCLGIFDNGNS 329
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 93/316 (29%), Positives = 139/316 (43%), Gaps = 34/316 (10%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPC 69
+ Y+ + +G PP+ F DTGS LT+V C + C C K + ++P + + C
Sbjct: 89 YGYYTTRIWIGTPPQTFALIVDTGSTLTYVPC-STCEQCGKHQDPNFQPDWSSTYQPLKC 147
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT- 128
S C C C Y+ +Y + SS G L D+ + F S T
Sbjct: 148 SM-ECT---------CDSEMMHCVYDRQYAEMSSSSGVLGEDI--VSFGKQSELKPQRTV 195
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLF 186
FGC G + G++GLGRG +SIV QL E G+I N C G G G +
Sbjct: 196 FGC--ENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMV 253
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGAS 240
LG G P +G+ +T + A +Y + E+ +GK + + I DSG +
Sbjct: 254 LG-GISPPAGMVFTH--SDPARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTT 310
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
YAY ++ IM++L L PD IC+ G + Q+++ F + L F+
Sbjct: 311 YAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFS 370
Query: 301 NRRNSVRLVVPPEAYL 316
N RL + PE YL
Sbjct: 371 NGN---RLSLSPENYL 383
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 101/329 (30%), Positives = 140/329 (42%), Gaps = 53/329 (16%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK------------QYKPH 63
YFA + +G P K + DTGSD+ WV C GC + P K +
Sbjct: 155 YFA-KIGIGTPSKDYYVQVDTGSDILWVNC----AGCDRCPTKSDLGVDLTLYDMKASTT 209
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ V C + C+ P P CK P QC Y + YGDG S+ G V D +G+
Sbjct: 210 SDAVGCDDNFCSLYDGP-LPGCK-PGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQ 267
Query: 124 NVP----LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
P + FGCG Q S G+LG G+ S++SQL G ++ V HC+
Sbjct: 268 TTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDN 327
Query: 180 -NGRGVLFLGDGKVPSSGVAWTPMLQNSAD----LKHYILG------PAELLYSGKSCGL 228
+G G+ +G+ P V TP++QN A +K +G P++ SG G
Sbjct: 328 VDGGGIFAIGEVVEPK--VNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKG- 384
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTP-LKLAPDDKTLPICWRGPFKALGQ 287
I DSG + AYF VY V LI + L P L+L ++ F G
Sbjct: 385 ----TIIDSGTTLAYFPQEVY---VPLIEKILSQQPDLRLHTVEQAFTC-----FDYTGN 432
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
V + F + L F S+ L V P YL
Sbjct: 433 VDDGFPTVTLHFD---KSISLTVYPHEYL 458
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 101/329 (30%), Positives = 140/329 (42%), Gaps = 53/329 (16%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK------------QYKPH 63
YFA + +G P K + DTGSD+ WV C GC + P K +
Sbjct: 74 YFA-KIGIGTPSKDYYVQVDTGSDILWVNC----AGCDRCPTKSDLGVDLTLYDMKASTT 128
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ V C + C+ P P CK P QC Y + YGDG S+ G V D +G+
Sbjct: 129 SDAVGCDDNFCSLYDGP-LPGCK-PGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQ 186
Query: 124 NVP----LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
P + FGCG Q S G+LG G+ S++SQL G ++ V HC+
Sbjct: 187 TTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDN 246
Query: 180 -NGRGVLFLGDGKVPSSGVAWTPMLQNSAD----LKHYILG------PAELLYSGKSCGL 228
+G G+ +G+ P V TP++QN A +K +G P++ SG G
Sbjct: 247 VDGGGIFAIGEVVEPK--VNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKG- 303
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTP-LKLAPDDKTLPICWRGPFKALGQ 287
I DSG + AYF VY V LI + L P L+L ++ F G
Sbjct: 304 ----TIIDSGTTLAYFPQEVY---VPLIEKILSQQPDLRLHTVEQAFTC-----FDYTGN 351
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
V + F + L F S+ L V P YL
Sbjct: 352 VDDGFPTVTLHFD---KSISLTVYPHEYL 377
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 101/341 (29%), Positives = 146/341 (42%), Gaps = 51/341 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKNI---- 66
YF + +G P K F DTGSD+ WV C +PCTGC + + P +
Sbjct: 91 YF-TRVKLGNPAKEFFVQIDTGSDILWVTC-SPCTGCPTSSGLNIQLESFNPDSSSTASR 148
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQ---CDYEIEYGDGGSSIGALVTD--LFPLRFSNGS 121
+ CS+ RC A C+ N Q C Y YGDG + G V+D F N
Sbjct: 149 ITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ 208
Query: 122 VFN--VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
N + FGC +Q G L+ D A G+ G G+ ++S++SQL G+ V HC+
Sbjct: 209 TANSSASIVFGCSNSQS--GDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL 266
Query: 178 --GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-- 233
NG G+L LG+ P G+ +TP++ + HY L + +G+ + D +L
Sbjct: 267 KGSDNGGGILVLGEIVEP--GLVYTPLVPSQ---PHYNLNLESIAVNGQKLPI-DSSLFT 320
Query: 234 -------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
I DSG + AY Y VS I ++P ++L F
Sbjct: 321 TSNTQGTIVDSGTTLAYLADGAYDPFVSAI-------AAAVSPSVRSLVSKGSQCFITSS 373
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLV--ISVSTSII 325
V F + L F V + V PE YL+ SV S++
Sbjct: 374 SVDSSFPTVTLYFM---GGVAMSVKPENYLLQQASVDNSVL 411
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 93/316 (29%), Positives = 139/316 (43%), Gaps = 34/316 (10%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPC 69
+ Y+ + +G PP+ F DTGS LT+V C + C C K + ++P + + C
Sbjct: 89 YGYYTTRIWIGTPPQTFALIVDTGSTLTYVPC-STCEQCGKHQDPNFQPDWSSTYQPLKC 147
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT- 128
S C C C Y+ +Y + SS G L D+ + F S T
Sbjct: 148 SM-ECT---------CDSEMMHCVYDRQYAEMSSSSGVLGEDI--VSFGKQSELKPQRTV 195
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLF 186
FGC G + G++GLGRG +SIV QL E G+I N C G G G +
Sbjct: 196 FGC--ENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMV 253
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGAS 240
LG G P +G+ +T + A +Y + E+ +GK + + I DSG +
Sbjct: 254 LG-GISPPAGMVFTH--SDPARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTT 310
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
YAY ++ IM++L L PD IC+ G + Q+++ F + L F+
Sbjct: 311 YAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFS 370
Query: 301 NRRNSVRLVVPPEAYL 316
N RL + PE YL
Sbjct: 371 NGN---RLSLSPENYL 383
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 101/341 (29%), Positives = 146/341 (42%), Gaps = 51/341 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKNI---- 66
YF + +G P K F DTGSD+ WV C +PCTGC + + P +
Sbjct: 89 YF-TRVKLGNPAKEFFVQIDTGSDILWVTC-SPCTGCPTSSGLNIQLESFNPDSSSTASR 146
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQ---CDYEIEYGDGGSSIGALVTD--LFPLRFSNGS 121
+ CS+ RC A C+ N Q C Y YGDG + G V+D F N
Sbjct: 147 ITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ 206
Query: 122 VFN--VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
N + FGC +Q G L+ D A G+ G G+ ++S++SQL G+ V HC+
Sbjct: 207 TANSSASIVFGCSNSQS--GDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL 264
Query: 178 --GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-- 233
NG G+L LG+ P G+ +TP++ + HY L + +G+ + D +L
Sbjct: 265 KGSDNGGGILVLGEIVEP--GLVYTPLVPSQ---PHYNLNLESIAVNGQKLPI-DSSLFT 318
Query: 234 -------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
I DSG + AY Y VS I ++P ++L F
Sbjct: 319 TSNTQGTIVDSGTTLAYLADGAYDPFVSAI-------AAAVSPSVRSLVSKGSQCFITSS 371
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLV--ISVSTSII 325
V F + L F V + V PE YL+ SV S++
Sbjct: 372 SVDSSFPTVTLYFM---GGVAMSVKPENYLLQQASVDNSVL 409
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 101/329 (30%), Positives = 140/329 (42%), Gaps = 53/329 (16%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK------------QYKPH 63
YFA + +G P K + DTGSD+ WV C GC + P K +
Sbjct: 155 YFA-KIGIGTPSKDYYVQVDTGSDILWVNC----AGCDRCPTKSDLGVDLTLYDMKASTT 209
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ V C + C+ P P CK P QC Y + YGDG S+ G V D +G+
Sbjct: 210 SDAVGCDDNFCSLYDGP-LPGCK-PGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQ 267
Query: 124 NVP----LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
P + FGCG Q S G+LG G+ S++SQL G ++ V HC+
Sbjct: 268 TTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDN 327
Query: 180 -NGRGVLFLGDGKVPSSGVAWTPMLQNSAD----LKHYILG------PAELLYSGKSCGL 228
+G G+ +G+ P V TP++QN A +K +G P++ SG G
Sbjct: 328 VDGGGIFAIGEVVEPK--VNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKG- 384
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTP-LKLAPDDKTLPICWRGPFKALGQ 287
I DSG + AYF VY V LI + L P L+L ++ F G
Sbjct: 385 ----TIIDSGTTLAYFPQEVY---VPLIEKILSQQPDLRLHTVEQAFTC-----FDYTGN 432
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
V + F + L F S+ L V P YL
Sbjct: 433 VDDGFPTVTLHFD---KSISLTVYPHEYL 458
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 96/321 (29%), Positives = 137/321 (42%), Gaps = 42/321 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYK---------PHKNI 66
YF + +G PPK + DTGSD+ W+ C PC C ++
Sbjct: 74 YFT-KIKLGSPPKEYHVQVDTGSDILWINC-KPCPKCPTKTNLNFRLSLFDMNASSTSKK 131
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
V C + C+ + + C+ P C Y I Y D +S G + D+ L G + P
Sbjct: 132 VGCDDDFCSFISQSDS--CQ-PALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGP 188
Query: 127 L----TFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
L FGCG +Q G L D+A GV+G G+ S++SQL G + V HC+ N
Sbjct: 189 LGQEVVFGCGSDQ--SGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL-DN 245
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIF 235
+G G V S V TPM+ N HY + + G S L ++ I
Sbjct: 246 VKGGGIFAVGVVDSPKVKTTPMVPNQM---HYNVMLMGMDVDGTSLDLPRSIVRNGGTIV 302
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + AYF +Y ++ I L P+KL ++T F V E F P+
Sbjct: 303 DSGTTLAYFPKVLYDSLIETI---LARQPVKLHIVEETFQC-----FSFSTNVDEAFPPV 354
Query: 296 ALSFTNRRNSVRLVVPPEAYL 316
+ F +SV+L V P YL
Sbjct: 355 SFEF---EDSVKLTVYPHDYL 372
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 96/321 (29%), Positives = 137/321 (42%), Gaps = 42/321 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYK---------PHKNI 66
YF + +G PPK + DTGSD+ W+ C PC C ++
Sbjct: 74 YFT-KIKLGSPPKEYHVQVDTGSDILWINC-KPCPKCPTKTNLNFRLSLFDMNASSTSKK 131
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
V C + C+ + + C+ P C Y I Y D +S G + D+ L G + P
Sbjct: 132 VGCDDDFCSFISQSDS--CQ-PALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGP 188
Query: 127 L----TFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
L FGCG +Q G L D+A GV+G G+ S++SQL G + V HC+ N
Sbjct: 189 LGQEVVFGCGSDQ--SGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL-DN 245
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIF 235
+G G V S V TPM+ N HY + + G S L ++ I
Sbjct: 246 VKGGGIFAVGVVDSPKVKTTPMVPNQM---HYNVMLMGMDVDGTSLDLPRSIVRNGGTIV 302
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + AYF +Y ++ I L P+KL ++T F V E F P+
Sbjct: 303 DSGTTLAYFPKVLYDSLIETI---LARQPVKLHIVEETFQC-----FSFSTNVDEAFPPV 354
Query: 296 ALSFTNRRNSVRLVVPPEAYL 316
+ F +SV+L V P YL
Sbjct: 355 SFEF---EDSVKLTVYPHDYL 372
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 75/270 (27%), Positives = 121/270 (44%), Gaps = 35/270 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHK 64
+ + + +G PP+ F DTGSD+ W+ C+ C+ C K +
Sbjct: 81 YGLYTTKVKMGTPPREFTVQIDTGSDILWINCNT-CSNCPKSSGLGIELNFFDTVGSSTA 139
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSV 122
+VPCS+P CA+ +C +QC Y +Y DG + G V+D F + +
Sbjct: 140 ALVPCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTP 199
Query: 123 FNVP----LTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHC 176
NV + FGC + + G L+ D A G+LG G G +S+VSQL G+ V HC
Sbjct: 200 ANVASSATIVFGC--STYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHC 257
Query: 177 I--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL- 233
+ NG G+L LG+ PS + ++P++ + HY L + +G+ +
Sbjct: 258 LKGDGNGGGILVLGEILEPS--IVYSPLVPSQ---PHYNLNLQSIAVNGQVLSINPAVFA 312
Query: 234 -------IFDSGASYAYFTSRVYQEIVSLI 256
I DSG + +Y Y +V+ +
Sbjct: 313 TSDKRGTIIDSGTTLSYLVQEAYDPLVNAV 342
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 90/315 (28%), Positives = 138/315 (43%), Gaps = 36/315 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ + +G PP+ F DTGS +T+V C + C C + + +++P + V C N
Sbjct: 89 YYTTRIWIGTPPQTFALIVDTGSTVTYVPC-STCEQCGRHQDPKFEPELSSTYQPVSC-N 146
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP--LTF 129
C C + QC YE +Y + SS G L D+ + F N S VP F
Sbjct: 147 IDCT---------CDNERKQCVYERQYAEMSSSSGVLGEDI--ISFGNQSEL-VPQRAIF 194
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFL 187
GC G L G++GLGRG +SIV QL E G+I + C G G G + L
Sbjct: 195 GC--ENQETGDLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGGGAMIL 252
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASY 241
G G P SG+ + + ++Y + + +GK L + DSG +Y
Sbjct: 253 G-GISPPSGMVFAE--SDPVRSQYYNIDLKAIHVAGKQLHLDPSIFDGKHGTVLDSGTTY 309
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
AY + +M++L PD IC+ G + Q++ F + + F+N
Sbjct: 310 AYLPEAAFTAFKDAMMKELTSLKQIHGPDPNYNDICFSGAESDVSQLSNTFPAVEMVFSN 369
Query: 302 RRNSVRLVVPPEAYL 316
+ +L + PE YL
Sbjct: 370 GQ---KLSLSPENYL 381
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 90/314 (28%), Positives = 136/314 (43%), Gaps = 34/314 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C K + +++P + V C N
Sbjct: 76 YYTTRLFIGTPPQEFALIVDTGSTVTYVPCSS-CEQCGKHQDPRFQPDLSSTYRPVKC-N 133
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
P C C QC YE Y + SS G + D+ + F N S FG
Sbjct: 134 PSC---------NCDDEGKQCTYERRYAEMSSSSGVIAEDV--VSFGNESELKPQRAVFG 182
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C G L G++GLGRGR+S+V QL + G+I + C G G G + LG
Sbjct: 183 C--ENVETGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMVLG 240
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYA 242
P + V N +Y + EL +GK LK + DSG +YA
Sbjct: 241 QISPPPNMVF---SHSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKHGTVLDSGTTYA 297
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNR 302
YF + + IM+++ PD IC+ G + + +++ F + + F +
Sbjct: 298 YFPEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVFGSG 357
Query: 303 RNSVRLVVPPEAYL 316
+ +L + PE YL
Sbjct: 358 Q---KLSLSPENYL 368
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 105/349 (30%), Positives = 156/349 (44%), Gaps = 53/349 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ ++L +G PP + DTGSDL W QC APC C P ++P ++ +VPC +P
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQC-APCVLCADQPTPYFRPARSATYRLVPCRSP 150
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFGC 131
CAAL + P C C Y+ YGD S+ G L ++ F +N S V + FGC
Sbjct: 151 LCAALPY---PACFQ-RSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGC 206
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR---GVLFLG 188
G N G L+ +++G++GLGRG +S+VSQL + + R GV
Sbjct: 207 G--NINSGQLA--NSSGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATL 262
Query: 189 DGKVPSSG---VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL---TLIF------- 235
+G SS V TP++ N+A Y + G S G K L L+F
Sbjct: 263 NGTNASSSGSPVQSTPLVVNAALPSLYFMS-----LKGISLGQKRLPIDPLVFAINDDGT 317
Query: 236 -----DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQ 287
DSG S + Y + R+L+ L P + T L C+ P+
Sbjct: 318 GGVFIDSGTSLTWLQQDAYDA----VRRELVSVLRPLPPTNDTEIGLETCF--PWPPPPS 371
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYL-TGKST 335
V + L F N + VPPE Y++I +T + +A + +G +T
Sbjct: 372 VAVTVPDMELHFDGGAN---MTVPPENYMLIDGATGFLCLAMIRSGDAT 417
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 87/275 (31%), Positives = 125/275 (45%), Gaps = 41/275 (14%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----------PEKQ 59
F + YF + +G PPK + DTGSD+ WV C +PCTGC P+
Sbjct: 86 FMVGLYF-TRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQLEFFNPDTS 143
Query: 60 YKPHKNIVPCSNPRCAALHWPNPPRCK-HPNDQCDYEIEYGDGGSSIGALVTDL--FPLR 116
K +PCS+ RC A + C+ N C Y YGDG + G V+D F
Sbjct: 144 STSSK--IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTV 201
Query: 117 FSNGSVFN--VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNV 172
N N + FGC +Q G L+ D A G+ G G+ ++S+VSQL G+ V
Sbjct: 202 MGNEQTANSSASIVFGCSNSQS--GDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKV 259
Query: 173 IGHCI--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD 230
HC+ NG G+L LG+ P G+ +TP++ + HY L ++ +G+ + D
Sbjct: 260 FSHCLKGSDNGGGILVLGEIVEP--GLVYTPLVPSQ---PHYNLNLESIVVNGQKLPI-D 313
Query: 231 LTL---------IFDSGASYAYFTSRVYQEIVSLI 256
+L I DSG + AY Y V+ I
Sbjct: 314 SSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAI 348
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 90/332 (27%), Positives = 142/332 (42%), Gaps = 49/332 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNIV 67
+ + +G PP+ F DTGSD+ WV C PC C + + + +
Sbjct: 41 YYTRIELGTPPRPFYVQIDTGSDILWVNC-KPCNACPLTSGLGVALNFFDPRGSSTASPL 99
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL-RFSNGSVFN-- 124
C + +C + + + C + C Y EYGDG ++G V+D F ++ N V N
Sbjct: 100 SCIDSKCVSSNQISESVCT-TDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNNA 158
Query: 125 -VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQ 179
+TFGC YNQ G L+ PD A G+ G G+ +S+VSQL GL + HC+
Sbjct: 159 SAKITFGCSYNQS--GDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGAD 216
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------ 233
G G+L LG+ P G+ +TP++ + HY L + +G+ +
Sbjct: 217 PGGGILVLGEITEP--GMVYTPIVPSQ---PHYNLNLQGIAVNGQQLSIDPQVFATTNTR 271
Query: 234 --IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVT 289
I D G + AY Y+ V+ I+ A T P +G F + +
Sbjct: 272 GTIIDCGTTLAYLAEEAYEPFVNTIIA---------AVSQSTQPFMLKGNPCFLTVHSID 322
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISVS 321
E F + L F + + P+ YL+ +S
Sbjct: 323 EIFPSVTLYF----EGAPMDLKPKDYLIQQLS 350
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 144/332 (43%), Gaps = 47/332 (14%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-----YKPHKNI- 66
+ + + +G PP+ F DTGSD+ WV C A C GC + Q + P ++
Sbjct: 77 VVGLYYTKIRLGSPPRDFYVQVDTGSDVLWVSC-ASCNGCPQTSGLQIQLNFFDPGSSVT 135
Query: 67 ---VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
V CS+ RC+ + C N+ C Y +YGDG + G V+D+ GS
Sbjct: 136 ATPVSCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 124 ----NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
P+ FGC +Q G L D A G+ G G+ +S++SQL GL V HC+
Sbjct: 196 VPNSTAPVVFGCSTSQ--TGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCL 253
Query: 178 -GQN-GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-- 233
G+N G G+L LG+ P+ + +TP++ + HY + + +G++ +
Sbjct: 254 KGENGGGGILVLGEIVEPN--MVFTPLVPSQ---PHYNVNLLSISVNGQALPINPSVFST 308
Query: 234 ------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKAL 285
I D+G + AY + Y V I A P+ +G +
Sbjct: 309 SNGQGTIIDTGTTLAYLSEAAYVPFVEAITN---------AVSQSVRPVVSKGNQCYVIA 359
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
V + F P++L+F + + P+ YL+
Sbjct: 360 TSVADIFPPVSLNFA---GGASMFLNPQDYLI 388
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 80/267 (29%), Positives = 119/267 (44%), Gaps = 37/267 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNIV 67
+ L +G PP+ F DTGSD+ WV C C GC + P +++
Sbjct: 52 YYTRLQLGTPPRDFYVQIDTGSDVLWVSC-GSCNGCPVNSGLHIPLNFFDPGSSPTASLI 110
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN---GSVFN 124
CS+ RC+ + C N+ C Y +YGDG + G V+DL L F GSV N
Sbjct: 111 SCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDL--LHFDTVLGGSVMN 168
Query: 125 ---VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI-- 177
P+ FGC Q G L+ D A G+ G G+ +S+VSQL G+ HC+
Sbjct: 169 NSSAPIVFGCSALQ--TGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKG 226
Query: 178 GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL--------K 229
+G G+L LG+ P+ + +TP++ + HY L + +G++ +
Sbjct: 227 DDSGGGILVLGEIVEPN--IVYTPLVPSQ---PHYNLNMQSISVNGQTLAIDPSVFGTSS 281
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLI 256
I DSG + AY Y +S I
Sbjct: 282 SQGTIIDSGTTLAYLAEAAYDPFISAI 308
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 74/194 (38%), Positives = 97/194 (50%), Gaps = 20/194 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V++ +G PP DTGSDL W QCDAPC C P Y P ++ V C +P
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL P RC P+ C Y YGDG S+ G L T+ F L S+ +V V FGCG
Sbjct: 152 MCQALQSPW-SRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLG-SDTAVRGV--AFGCG 207
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLGD 189
N G S +++G++G+GRG +S+VSQL G+ R +C LFLG
Sbjct: 208 --TENLG--STDNSSGLVGMGRGPLSLVSQL---GVTR--FSYCFTPFNATAASPLFLGS 258
Query: 190 GKVPSSGVAWTPML 203
SS TP +
Sbjct: 259 SARLSSAAKTTPFV 272
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 74/194 (38%), Positives = 97/194 (50%), Gaps = 20/194 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V++ +G PP DTGSDL W QCDAPC C P Y P ++ V C +P
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL P RC P+ C Y YGDG S+ G L T+ F L S+ +V V FGCG
Sbjct: 152 MCQALQSPW-SRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLG-SDTAVRGV--AFGCG 207
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLGD 189
N G S +++G++G+GRG +S+VSQL G+ R +C LFLG
Sbjct: 208 --TENLG--STDNSSGLVGMGRGPLSLVSQL---GVTR--FSYCFTPFNATAASPLFLGS 258
Query: 190 GKVPSSGVAWTPML 203
SS TP +
Sbjct: 259 SARLSSAAKTTPFV 272
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 96/327 (29%), Positives = 135/327 (41%), Gaps = 48/327 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKNI- 66
YFA + +G PPK + DTGSD+ WV C C K P K Y P +
Sbjct: 82 YFA-KIGLGNPPKDYYVQVDTGSDILWVNC----ANCDKCPTKSDLGVKLTLYDPQSSTS 136
Query: 67 ---VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG--- 120
+ C + CAA + C + C Y + YGDG S+ G V D G
Sbjct: 137 ATRIYCDDDFCAATYNGVLQGCT-KDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQ 195
Query: 121 -SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
S N + FGCG Q S G+LG G+ S++SQL G ++ V HC+
Sbjct: 196 TSSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCL-D 254
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG------PAELLYSGKSCGLK 229
N +G G+V S V TPM+ N + +K +G P ++ +G G
Sbjct: 255 NVKGGGIFAIGEVVSPKVNTTPMVPNQPHYNVVMKEIEVGGNVLELPTDIFDTGDRRG-- 312
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
I DSG + AY VY+ +++ I+ + G L + T F+ G V
Sbjct: 313 ---TIIDSGTTLAYLPEVVYESMMTKIVSEQPGLKLHTVEEQFTC-------FQYTGNVN 362
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYL 316
E F + F S+ L V P YL
Sbjct: 363 EGFPVVKFHF---NGSLSLTVNPHDYL 386
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 87/328 (26%), Positives = 139/328 (42%), Gaps = 49/328 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH---------KNI 66
YF + +G PP F+ DTGSD+ WV C + C+ C H
Sbjct: 105 YFT-KVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPHSSGLGIDLHFFDAPGSLTAGS 162
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF--- 123
V CS+P C+++ +C N+QC Y YGDG + G +TD F G
Sbjct: 163 VTCSDPICSSVFQTTAAQCSE-NNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVAN 221
Query: 124 -NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ P+ FGC + + G L+ D A G+ G G+G++S+VSQL G+ V HC+ +
Sbjct: 222 SSAPIVFGC--STYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 279
Query: 181 GR--GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----- 233
G GV LG+ VP G+ ++P++ + HY L + +G+ L
Sbjct: 280 GSGGGVFVLGEILVP--GMVYSPLVPSQ---PHYNLNLLSIGVNGQMLPLDAAVFEASNT 334
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQV 288
I D+G + Y Y DL + + PI G + +
Sbjct: 335 RGTIVDTGTTLTYLVKEAY---------DLFLNAISNSVSQLVTPIISNGEQCYLVSTSI 385
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYL 316
++ F ++L+F +++ P+ YL
Sbjct: 386 SDMFPSVSLNFA---GGASMMLRPQDYL 410
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 87/328 (26%), Positives = 139/328 (42%), Gaps = 49/328 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH---------KNI 66
YF + +G PP F+ DTGSD+ WV C + C+ C H
Sbjct: 100 YFT-KVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPHSSGLGIDLHFFDAPGSLTAGS 157
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF--- 123
V CS+P C+++ +C N+QC Y YGDG + G +TD F G
Sbjct: 158 VTCSDPICSSVFQTTAAQCSE-NNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVAN 216
Query: 124 -NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ P+ FGC + + G L+ D A G+ G G+G++S+VSQL G+ V HC+ +
Sbjct: 217 SSAPIVFGC--STYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 274
Query: 181 GR--GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----- 233
G GV LG+ VP G+ ++P++ + HY L + +G+ L
Sbjct: 275 GSGGGVFVLGEILVP--GMVYSPLVPSQ---PHYNLNLLSIGVNGQMLPLDAAVFEASNT 329
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQV 288
I D+G + Y Y DL + + PI G + +
Sbjct: 330 RGTIVDTGTTLTYLVKEAY---------DLFLNAISNSVSQLVTPIISNGEQCYLVSTSI 380
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYL 316
++ F ++L+F +++ P+ YL
Sbjct: 381 SDMFPSVSLNFA---GGASMMLRPQDYL 405
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 94/318 (29%), Positives = 132/318 (41%), Gaps = 34/318 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK--------PPEKQYKPHKNIV 67
YFA + +G P + F DTGSD+ WV C A C C + P + V
Sbjct: 85 YFA-KIGLGTPSRDFHVQVDTGSDILWVNC-AGCIRCPRKSDLVELTPYDADASSTAKSV 142
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS----VF 123
CS+ C+ + N H C Y I YGDG S+ G LV D+ L G+
Sbjct: 143 SCSDNFCS---YVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGST 199
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
N + FGCG Q S G++G G+ S +SQL G ++ HC+ N G
Sbjct: 200 NGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGG 259
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSA----DLKHYILGPAELLYSGKSCGL-KDLTLIFDSG 238
+F G+V S V TPML SA +L +G + L S + D +I DSG
Sbjct: 260 GIF-AIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLQLSSDAFDSGDDKGVIIDSG 318
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+ Y VY +++ I+ L D T F + ++ + F +
Sbjct: 319 TTLVYLPDAVYNPLMNQILASHQELNLHTVQDSFTC-------FHYIDRL-DRFPTVTFQ 370
Query: 299 FTNRRNSVRLVVPPEAYL 316
F SV L V P+ YL
Sbjct: 371 FD---KSVSLAVYPQEYL 385
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 88/332 (26%), Positives = 144/332 (43%), Gaps = 47/332 (14%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-----YKPHKNI- 66
+ + L +G PP+ F DTGSD+ WV C A C GC + Q + P ++
Sbjct: 77 VVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSC-ASCNGCPQTSGLQIQLNFFDPGSSVT 135
Query: 67 ---VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ CS+ RC+ + C N+ C Y +YGDG + G V+D+ GS
Sbjct: 136 ASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 124 ----NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
P+ FGC +Q G L D A G+ G G+ +S++SQL G+ V HC+
Sbjct: 196 VPNSTAPVVFGCSTSQ--TGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL 253
Query: 178 -GQN-GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-- 233
G+N G G+L LG+ P+ + +TP++ + HY + + +G++ +
Sbjct: 254 KGENGGGGILVLGEIVEPN--MVFTPLVPSQ---PHYNVNLLSISVNGQALPINPSVFST 308
Query: 234 ------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKAL 285
I D+G + AY + Y V I A P+ +G +
Sbjct: 309 SNGQGTIIDTGTTLAYLSEAAYVPFVEAITN---------AVSQSVRPVVSKGNQCYVIT 359
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
V + F P++L+F + + P+ YL+
Sbjct: 360 TSVGDIFPPVSLNFA---GGASMFLNPQDYLI 388
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 81/258 (31%), Positives = 115/258 (44%), Gaps = 22/258 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+AV + +G P K F FDTGSDLTW QC+ C K E + P K+ + CS+
Sbjct: 133 YAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTKSTSYKNISCSSA 192
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C L C P C Y+++YGDG SIG T+ L SN VF L FGCG
Sbjct: 193 FCKLLDTEGGESCSSPT--CLYQVQYGDGSYSIGFFATETLTLSSSN--VFKNFL-FGCG 247
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
Q N G AG+LGLGR ++S+ SQ + + + +C+ + +L G
Sbjct: 248 --QQNSGLFR--GAAGLLGLGRTKLSLPSQTAQK--YKKLFSYCLPASSSSKGYLSFGGQ 301
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYAYFTS 246
S V +TP+ ++ Y L EL G + D ++ + DSG S
Sbjct: 302 VSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSI-DASIFSTSGTVIDSGTVITRLPS 360
Query: 247 RVYQEIVSLIMRDLIGTP 264
Y + S + + P
Sbjct: 361 TAYSALSSAFQKLMTDYP 378
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 84/326 (25%), Positives = 140/326 (42%), Gaps = 45/326 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH---------KNI 66
YF + +G PP F+ DTGSD+ WV C + C+ C H
Sbjct: 100 YFT-KVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPHSSGLGIDLHFFDAPGSFTAGS 157
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF--- 123
V CS+P C+++ +C N+QC Y YGDG + G +TD F G
Sbjct: 158 VTCSDPICSSVFQTTAAQCSE-NNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVAN 216
Query: 124 -NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ P+ FGC + + G L+ D A G+ G G+G++S+VSQL G+ V HC+ +
Sbjct: 217 SSAPIVFGC--STYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 274
Query: 181 GR--GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----- 233
G GV LG+ VP G+ ++P+L + HY L + +G+ +
Sbjct: 275 GSGGGVFVLGEILVP--GMVYSPLLPSQ---PHYNLNLLSIGVNGQILPIDAAVFEASNT 329
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I D+G + Y Y ++ I + + + + + +++
Sbjct: 330 RGTIVDTGTTLTYLVKEAYDPFLNAISNSVSQLVTLIISNGEQC-------YLVSTSISD 382
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYL 316
F P++L+F +++ P+ YL
Sbjct: 383 MFPPVSLNFA---GGASMMLRPQDYL 405
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 92/337 (27%), Positives = 148/337 (43%), Gaps = 49/337 (14%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC-----TKPPEKQYKPHKN 65
F + YF + +G PPK F DTGSD+ WV C + C GC + P + P +
Sbjct: 79 FLVGLYFT-RVQLGSPPKDFYVQIDTGSDVLWVSCSS-CNGCPVTSGLQIPLTFFDPGSS 136
Query: 66 ----IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLF---PLRFS 118
+V CS+ RC A + C +QC Y +YGDG + G V DL L S
Sbjct: 137 TTAALVSCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLS 196
Query: 119 NGSV------FNVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIR 170
+G + ++ ++F C Q G L+ D A G+ G G+ +S++SQL G+
Sbjct: 197 SGELSQICQTYDSSVSFMCSTLQ--TGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITP 254
Query: 171 NVIGHCI--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL 228
V HC+ +G GVL LG+ P+ + +TP++ + HY L + +G++ +
Sbjct: 255 RVFSHCLKGDDSGGGVLVLGEIVEPN--IVYTPLVPSQ---PHYNLYLQSISVAGQTLAI 309
Query: 229 --------KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG 280
+ I DSG + AY Y VS I ++ + +T
Sbjct: 310 DPSVFGASSNQGTIVDSGTTLAYLAEGAYDPFVSAITS-------VVSLNARTYLSKGNQ 362
Query: 281 PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+ V + F ++L+F L++ P+ YL+
Sbjct: 363 CYLVTSSVNDVFPQVSLNFA---GGASLILNPQDYLL 396
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/268 (29%), Positives = 120/268 (44%), Gaps = 38/268 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN----I 66
Y+A + +G PP F DTGSD+ WV C C+ C K + + Y P + +
Sbjct: 73 YYA-RIGIGSPPNDFHVQVDTGSDILWVNC-VGCSNCPKKSDIGVDLQLYNPKSSSTSTL 130
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
+ C P C+A + P CK P+ C Y++ YGDG ++ G V D L+ + G S
Sbjct: 131 ITCDQPFCSATYDAPIPGCK-PDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSE 189
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
N + FGCG Q S G+LG G+ S++SQL G ++ + HC+
Sbjct: 190 TNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISG 249
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--------- 233
G +F G+V + TP++ N A HY ++ +G G L L
Sbjct: 250 GGIF-AIGEVVEPKLKTTPVVPNQA---HY-----NVVLNGVKVGDTALDLPLGLFETSY 300
Query: 234 ----IFDSGASYAYFTSRVYQEIVSLIM 257
I DSG + AY +Y ++ I+
Sbjct: 301 KRGAIIDSGTTLAYLPDSIYLPLMEKIL 328
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 88/332 (26%), Positives = 144/332 (43%), Gaps = 47/332 (14%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-----YKPHKNI- 66
+ + L +G PP+ F DTGSD+ WV C A C GC + Q + P ++
Sbjct: 77 VVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSC-ASCNGCPQTSGLQIQLNFFDPGSSVT 135
Query: 67 ---VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ CS+ RC+ + C N+ C Y +YGDG + G V+D+ GS
Sbjct: 136 ASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 124 ----NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
P+ FGC +Q G L D A G+ G G+ +S++SQL G+ V HC+
Sbjct: 196 VPNSTAPVVFGCSTSQ--TGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL 253
Query: 178 -GQN-GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-- 233
G+N G G+L LG+ P+ + +TP++ + HY + + +G++ +
Sbjct: 254 KGENGGGGILVLGEIVEPN--MVFTPLVPSQ---PHYNVNLLSISVNGQALPINPSVFST 308
Query: 234 ------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKAL 285
I D+G + AY + Y V I A P+ +G +
Sbjct: 309 SNGQGTIIDTGTTLAYLSEAAYVPFVEAITN---------AVSQSVRPVVSKGNQCYVIT 359
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
V + F P++L+F + + P+ YL+
Sbjct: 360 TSVGDIFPPVSLNFA---GGASMFLNPQDYLI 388
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 83/262 (31%), Positives = 115/262 (43%), Gaps = 29/262 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK----QYKPHK------- 64
YFA + +G P + F DTGSD+ WV C GC + P K + P+
Sbjct: 85 YFA-KIGLGTPSRDFHVQVDTGSDILWVNC----AGCIRCPRKSDLVELTPYDVDASSTA 139
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS--- 121
V CS+ C+ + N H C Y I YGDG S+ G LV D+ L G+
Sbjct: 140 KSVSCSDNFCS---YVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQT 196
Query: 122 -VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
N + FGCG Q S G++G G+ S +SQL G ++ HC+ N
Sbjct: 197 GSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNN 256
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSA----DLKHYILGPAEL-LYSGKSCGLKDLTLIF 235
G +F G+V S V TPML SA +L +G + L L S D +I
Sbjct: 257 NGGGIF-AIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVII 315
Query: 236 DSGASYAYFTSRVYQEIVSLIM 257
DSG + Y VY +++ I+
Sbjct: 316 DSGTTLVYLPDAVYNPLLNEIL 337
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/328 (26%), Positives = 139/328 (42%), Gaps = 49/328 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH---------KNI 66
YF + +G PP F+ DTGSD+ WV C + C+ C H
Sbjct: 100 YF-TKVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPHSSGLGIDLHFFDAPGSLTAGS 157
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF--- 123
V CS+P C+++ +C N+QC Y YGDG + G +TD F G
Sbjct: 158 VTCSDPICSSVFQTTAAQCSE-NNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVAN 216
Query: 124 -NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ P+ FGC + + G L+ D A G+ G G+G++S+VSQL G+ V HC+ +
Sbjct: 217 SSAPIVFGC--STYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 274
Query: 181 GR--GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----- 233
G GV LG+ VP G+ ++P++ + HY L + +G+ L
Sbjct: 275 GSGGGVFVLGEILVP--GMVYSPLVPSQ---PHYNLNLLSIGVNGQMLPLDAAVFEASNT 329
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQV 288
I D+G + Y Y DL + + PI G + +
Sbjct: 330 RGTIVDTGTTLTYLVKEAY---------DLFLNAISNSVSQLVTPIISNGEQCYLVSTSI 380
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYL 316
++ F ++L+F +++ P+ YL
Sbjct: 381 SDMFPSVSLNFA---GGASMMLRPQDYL 405
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 104/349 (29%), Positives = 155/349 (44%), Gaps = 53/349 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ ++L +G PP + DTGSDL W QC APC C P ++P ++ +VPC +P
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQC-APCVLCADQPTPYFRPARSATYRLVPCRSP 150
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFGC 131
CAAL + P C C Y+ YGD S+ G L ++ F +N S V + FGC
Sbjct: 151 LCAALPY---PACFQ-RSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGC 206
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR---GVLFLG 188
G N G L+ +++G++GLGRG +S+VSQL + + R GV
Sbjct: 207 G--NINSGQLA--NSSGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATL 262
Query: 189 DGKVPSSG---VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL---TLIF------- 235
+G SS V TP++ N+A Y + G S G K L L+F
Sbjct: 263 NGTNASSSGSPVQSTPLVVNAALPSLYFMS-----LKGISLGQKRLPIDPLVFAINDDGT 317
Query: 236 -----DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQ 287
DSG S + Y + +L+ L P + T L C+ P+
Sbjct: 318 GGVFIDSGTSLTWLQQDAYDA----VRHELVSVLRPLPPTNDTEIGLETCF--PWPPPPS 371
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYL-TGKST 335
V + L F N + VPPE Y++I +T + +A + +G +T
Sbjct: 372 VAVTVPDMELHFDGGAN---MTVPPENYMLIDGATGFLCLAMIRSGDAT 417
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 79/268 (29%), Positives = 120/268 (44%), Gaps = 38/268 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN----I 66
Y+A + +G PP F DTGSD+ WV C C+ C K + + Y P + +
Sbjct: 73 YYA-RIGIGSPPNDFHVQVDTGSDILWVNC-VGCSNCPKKSDIGVDLQLYNPKSSSTSTL 130
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
+ C P C+A + P CK P+ C Y++ YGDG ++ G V D L+ + G S
Sbjct: 131 ITCDQPFCSATYDAPIPGCK-PDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSE 189
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
N + FGCG Q S G+LG G+ S++SQL G ++ + HC+
Sbjct: 190 TNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISG 249
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--------- 233
G +F G+V + TP++ N A HY ++ +G G L L
Sbjct: 250 GGIF-AIGEVVEPKLXNTPVVPNQA---HY-----NVVLNGVKVGDTALDLPLGLFETSY 300
Query: 234 ----IFDSGASYAYFTSRVYQEIVSLIM 257
I DSG + AY +Y ++ I+
Sbjct: 301 KRGAIIDSGTTLAYLPESIYLPLMEKIL 328
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 88/332 (26%), Positives = 144/332 (43%), Gaps = 47/332 (14%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-----YKPHKNI- 66
+ + L +G PP+ F DTGSD+ WV C A C GC + Q + P ++
Sbjct: 77 VVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSC-ASCNGCPQTSGLQIQLNFFDPGSSVT 135
Query: 67 ---VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ CS+ RC+ + C N+ C Y +YGDG + G V+D+ GS
Sbjct: 136 ASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 124 ----NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
P+ FGC +Q G L D A G+ G G+ +S++SQL G+ V HC+
Sbjct: 196 VPNSTAPVVFGCSTSQ--TGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL 253
Query: 178 -GQN-GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-- 233
G+N G G+L LG+ P+ + +TP++ + HY + + +G++ +
Sbjct: 254 KGENGGGGILVLGEIVEPN--MVFTPLVPSQ---PHYNVNLLSISVNGQALPINPSVFST 308
Query: 234 ------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKAL 285
I D+G + AY + Y V I A P+ +G +
Sbjct: 309 SNGQGTIIDTGTTLAYLSEAAYVPFVEAITN---------AVSQSVRPVVSKGNQCYVIT 359
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
V + F P++L+F + + P+ YL+
Sbjct: 360 TSVGDIFPPVSLNFA---GGASMFLNPQDYLI 388
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 97/326 (29%), Positives = 138/326 (42%), Gaps = 49/326 (15%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE----------KQYKPHKN 65
YF + +G PPK + DTGSD+ WV C APC C + K KN
Sbjct: 77 YFT-KIKLGSPPKEYYVQVDTGSDILWVNC-APCPKCPVKTDLGIPLSLYDSKASSTSKN 134
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV 125
V C + C+ + K P C Y + YGDG +S G V D L G++
Sbjct: 135 -VGCEDAFCSFIMQSETCGAKKP---CSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTA 190
Query: 126 PLT----FGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI-G 178
PL FGCG NQ G L ++A G++G G+ S++SQL G ++ + HC+
Sbjct: 191 PLAQEVVFGCGKNQ--SGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDN 248
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL--------KD 230
NG G+ +G+ V S V TP++ N HY + + G+ L D
Sbjct: 249 MNGGGIFAIGE--VESPVVKTTPLVPNQV---HYNVILKGMDVDGEPIDLPPSLASTNGD 303
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG + AY +Y SLI + +KL +T F +
Sbjct: 304 GGTIIDSGTTLAYLPQNLYN---SLIEKITAKQQVKLHMVQETFAC-----FSFTSNTDK 355
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYL 316
F + L F +S++L V P YL
Sbjct: 356 AFPVVNLHF---EDSLKLSVYPHDYL 378
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 88/322 (27%), Positives = 143/322 (44%), Gaps = 29/322 (9%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT-KPPEKQYKPHKNI----VP 68
+ YF L +G P K F DTGS +T+V C + +GC + + P + +
Sbjct: 75 YGYFYATLYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGPNHQDAAFDPEASSTASRIS 134
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
C++P+C+ PRC QC Y Y + SS G L+ D+ L + + P+
Sbjct: 135 CTSPKCSC----GSPRCGCSTQQCTYTRSYAEQSSSSGILLEDVLAL---HDGLPGAPII 187
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGRGVLFL 187
FGC G + G+ GLG S+V+QL + G+I +V C G G G L L
Sbjct: 188 FGC--ETRETGEIFRQRADGLFGLGNSDASVVNQLVKAGVIDDVFSLCFGMVEGDGALLL 245
Query: 188 GDGKVPSS-GVAWTPMLQNSADLKHY------ILGPAELLYSGKSCGLKDLTLIFDSGAS 240
GD +VP S + +TP+L ++ +Y + +LL +S + + DSG +
Sbjct: 246 GDAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSLFDQGYGTVLDSGTT 305
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLPICW-RGP-FKALGQVTEYFKPLA 296
+ Y S V++ + + + LK PD + IC+ + P L ++ F +
Sbjct: 306 FTYMPSPVFKAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQAPSHDDLEALSSVFPSME 365
Query: 297 LSFTNRRNSVRLVVPPEAYLVI 318
+ F LV+ P YL +
Sbjct: 366 VQFD---QGTSLVLGPLNYLFV 384
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 99/324 (30%), Positives = 137/324 (42%), Gaps = 45/324 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE----------KQYKPHKN 65
YF + +G PPK + DTGSD+ WV C APC C + K KN
Sbjct: 78 YFT-KIKLGSPPKEYYVQVDTGSDILWVNC-APCPKCPVKTDLGIPLSLYDSKTSSTSKN 135
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV 125
V C + C+ + K P C Y + YGDG +S G + D L G++
Sbjct: 136 -VGCEDDFCSFIMQSETCGAKKP---CSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTA 191
Query: 126 PLT----FGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI-G 178
PL FGCG NQ G L D+A G++G G+ SI+SQL G + + HC+
Sbjct: 192 PLAQEVVFGCGKNQ--SGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDN 249
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILG------PAELLYSGKSCGLKDLT 232
NG G+ +G+ V S V TP++ N + G P +L S S D
Sbjct: 250 MNGGGIFAVGE--VESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTN-GDGG 306
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
I DSG + AY +Y SLI + +KL +T F + F
Sbjct: 307 TIIDSGTTLAYLPQNLYN---SLIEKITAKQQVKLHMVQETFAC-----FSFTSNTDKAF 358
Query: 293 KPLALSFTNRRNSVRLVVPPEAYL 316
+ L F +S++L V P YL
Sbjct: 359 PVVNLHF---EDSLKLSVYPHDYL 379
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 99/324 (30%), Positives = 137/324 (42%), Gaps = 45/324 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE----------KQYKPHKN 65
YF + +G PPK + DTGSD+ WV C APC C + K KN
Sbjct: 74 YF-TKIKLGSPPKEYYVQVDTGSDILWVNC-APCPKCPVKTDLGIPLSLYDSKTSSTSKN 131
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV 125
V C + C+ + K P C Y + YGDG +S G + D L G++
Sbjct: 132 -VGCEDDFCSFIMQSETCGAKKP---CSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTA 187
Query: 126 PLT----FGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI-G 178
PL FGCG NQ G L D+A G++G G+ SI+SQL G + + HC+
Sbjct: 188 PLAQEVVFGCGKNQ--SGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDN 245
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILG------PAELLYSGKSCGLKDLT 232
NG G+ +G+ V S V TP++ N + G P +L S S D
Sbjct: 246 MNGGGIFAVGE--VESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTN-GDGG 302
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
I DSG + AY +Y SLI + +KL +T F + F
Sbjct: 303 TIIDSGTTLAYLPQNLYN---SLIEKITAKQQVKLHMVQETFAC-----FSFTSNTDKAF 354
Query: 293 KPLALSFTNRRNSVRLVVPPEAYL 316
+ L F +S++L V P YL
Sbjct: 355 PVVNLHF---EDSLKLSVYPHDYL 375
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 82/273 (30%), Positives = 116/273 (42%), Gaps = 46/273 (16%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKN-- 65
YF + +G PPK + DTGSD+ WV C C K P K Y P +
Sbjct: 84 YF-TEIKLGTPPKRYYVQVDTGSDILWVNC----ISCEKCPRKSGLGLDLTFYDPKASSS 138
Query: 66 --IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
V C CAA + P C N C+Y + YGDG S+ G VTD G
Sbjct: 139 GSTVSCDQGFCAATYGGKLPGCT-ANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQ 197
Query: 124 ----NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
N +TFGCG Q S G+LG G+ S++SQL G ++ + HC+
Sbjct: 198 TQPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLDT 257
Query: 180 -NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILG-------------PAELLYSGKS 225
G G+ +G+ P V TP++ AD+ HY + PA + +G+
Sbjct: 258 IKGGGIFAIGNVVQPK--VKTTPLV---ADMPHYNVNLKSIDVGGTTLQLPAHVFETGER 312
Query: 226 CGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMR 258
G I DSG + Y V++E+++ I
Sbjct: 313 KG-----TIIDSGTTLTYLPELVFKEVMAAIFN 340
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 97/323 (30%), Positives = 137/323 (42%), Gaps = 46/323 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI--------- 66
YF + +G PPK + DTGSD+ WV C PC C P + H ++
Sbjct: 74 YFT-KIKLGSPPKEYHVQVDTGSDILWVNC-KPCPEC--PSKTNLNFHLSLFDVNASSTS 129
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
V C + C+ + + C+ P C Y I Y D +S G + D L G +
Sbjct: 130 KKVGCDDDFCSFISQSD--SCQ-PAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDLQT 186
Query: 125 VPL----TFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
PL FGCG +Q G L D+A GV+G G+ S++SQL G + V HC+
Sbjct: 187 GPLGQEVVFGCGSDQ--SGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL- 243
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTL 233
N +G G V S V TPM+ N HY + + G + L ++
Sbjct: 244 DNVKGGGIFAVGVVDSPKVKTTPMVPNQM---HYNVMLMGMDVDGTALDLPPSIMRNGGT 300
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
I DSG + AYF +Y ++ I L P+KL + T F V F
Sbjct: 301 IVDSGTTLAYFPKVLYDSLIETI---LARQPVKLHIVEDTFQC-----FSFSENVDVAFP 352
Query: 294 PLALSFTNRRNSVRLVVPPEAYL 316
P++ F +SV+L V P YL
Sbjct: 353 PVSFEF---EDSVKLTVYPHDYL 372
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 88/328 (26%), Positives = 135/328 (41%), Gaps = 45/328 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
YF + +G P K F DTGSD+ W+ C C+ C + +
Sbjct: 83 YFT-KVKLGSPAKDFYVQIDTGSDILWINC-ITCSNCPHSSGLGIELDFFDTAGSSTAAL 140
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLF---PLRFSNGSVF 123
V C++P C+ C +QC Y +YGDG + G V+D + V
Sbjct: 141 VSCADPICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVA 200
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQ 179
N T G + + G L+ D A G+ G G G +S++SQL G+ V HC+ G+
Sbjct: 201 NSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGE 260
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKS--------CGLKDL 231
NG GVL LG+ PS + ++P++ + L HY L + +G+ +
Sbjct: 261 NGGGVLVLGEILEPS--IVYSPLVPS---LPHYNLNLQSIAVNGQLLPIDSNVFATTNNQ 315
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVT 289
I DSG + AY Y V I A + PI +G + V
Sbjct: 316 GTIVDSGTTLAYLVQEAYNPFVDAITA---------AVSQFSKPIISKGNQCYLVSNSVG 366
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+ F ++L+F +V+ PE YL+
Sbjct: 367 DIFPQVSLNF---MGGASMVLNPEHYLM 391
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 90/327 (27%), Positives = 134/327 (40%), Gaps = 50/327 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKN--- 65
+ + +G PPK + DTGSD+ WV C C + P K Y P +
Sbjct: 86 YYTEIKLGTPPKHYYVQVDTGSDILWVNC----ITCEQCPHKSGLGLDLTLYDPKASSTG 141
Query: 66 -IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL----RFSNG 120
+V C CAA P+C N C+Y + YGDG S+IG+ VTD R
Sbjct: 142 SMVMCDQAFCAATFGGKLPKCG-ANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQT 200
Query: 121 SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N + FGCG Q S G+LG G S++SQL G ++ + HC+
Sbjct: 201 QPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDTI 260
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG------PAELLYSGKSCGLK 229
G G+ +GD P V TP++ + + +LK +G PA + G+ G
Sbjct: 261 KGGGIFSIGDVVQPK--VKTTPLVADKPHYNVNLKTIDVGGTTLQLPAHIFEPGEKKG-- 316
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
I DSG + Y V++E +M + + D +C++ P G V
Sbjct: 317 ---TIIDSGTTLTYLPELVFKE----VMLAVFNKHQDITFHDVQGFLCFQYP----GSVD 365
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYL 316
+ F + F + + L V P Y
Sbjct: 366 DGFPTITFHF---EDDLALHVYPHEYF 389
>gi|213998812|gb|ACJ60773.1| nucellin [Hordeum euclaston]
Length = 154
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 57/147 (38%), Positives = 81/147 (55%), Gaps = 5/147 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMKES---LFYYSAGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDD 271
+++Y EIVS + L + L+ D
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLEEVKGD 152
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 63/153 (41%), Positives = 84/153 (54%), Gaps = 12/153 (7%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
+ + V++ +G PP DTGSDL W QCDAPC C P Y P ++ V C
Sbjct: 90 ATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCR 149
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+P C AL P RC P+ C Y YGDG S+ G L T+ F L S+ +V V FG
Sbjct: 150 SPMCQALQSPW-SRCSPPDTGCAYYFSYGDGTSTDGVLATETFTL-GSDTAVRGV--AFG 205
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
CG N G S +++G++G+GRG +S+VSQL
Sbjct: 206 CG--TENLG--STDNSSGLVGMGRGPLSLVSQL 234
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 97/328 (29%), Positives = 133/328 (40%), Gaps = 51/328 (15%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKNI- 66
YF + +G P K + DTGSD+ WV C C P K Y P +
Sbjct: 81 YF-TQIGIGTPAKSYYVQVDTGSDILWVNC----VFCDTCPRKSGLGIELTLYDPSGSSS 135
Query: 67 ---VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG--- 120
V C C A H P C P C Y I YGDG S+ G VTD +G
Sbjct: 136 GTGVTCGQDFCVATHGGVIPSCV-PAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQ 194
Query: 121 -SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
++ N +TFGCG S G+LG G+ S++SQL G +R V HC+
Sbjct: 195 TTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCLDT 254
Query: 180 -NGRGVLFLGDGKVPSSGVAWTPML----QNSADLKHYILG------PAELLYSGKSCGL 228
NG G+ +GD P V+ TP++ + +L+ +G P + G+S G
Sbjct: 255 INGGGIFAIGDVVQPK--VSTTPLVPGMPHYNVNLEAIDVGGVKLQLPTNIFDIGESKG- 311
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
I DSG + AY VY I+S + PLK D + F+ G V
Sbjct: 312 ----TIIDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKNDQDFQC--------FRYSGSV 359
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYL 316
+ F + F + L + P YL
Sbjct: 360 DDGFPIITFHF---EGGLPLNIHPHDYL 384
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 89/326 (27%), Positives = 140/326 (42%), Gaps = 43/326 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----PEKQYKP----HKNI 66
YF + +G PPK F DTGSD+ WV C + C GC + P + P ++
Sbjct: 68 YFT-RVLLGSPPKEFYVQIDTGSDVLWVSCGS-CNGCPQSSGLHIPLNFFDPGSSSTASL 125
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF--- 123
+ CS+ RC+ + C +QC Y +YGDG + G V+DL GS
Sbjct: 126 ISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNS 185
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
+ + FGC +Q G L+ D A G+ G G+ +S++SQ+ G+ V HC+ +G
Sbjct: 186 SASIVFGCSISQ--TGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDG 243
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK--------DLTL 233
G L G++ + ++P++ + HY L + +GKS + +
Sbjct: 244 GGGGILVLGEIVEEDIVYSPLVPSQ---PHYNLNLQSISVNGKSLAIDPEVFATSTNRGT 300
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEY 291
I DSG + AY Y VS I A P+ +G + V
Sbjct: 301 IVDSGTTLAYLAEEAYDPFVSAITE---------AVSQSVRPLLSKGTQCYLITSSVKGI 351
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLV 317
F ++L+F V + + PE YL+
Sbjct: 352 FPTVSLNFA---GGVSMNLKPEDYLL 374
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 89/326 (27%), Positives = 140/326 (42%), Gaps = 43/326 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----PEKQYKP----HKNI 66
YF + +G PPK F DTGSD+ WV C + C GC + P + P ++
Sbjct: 83 YFT-RVLLGSPPKEFYVQIDTGSDVLWVSCGS-CNGCPQSSGLHIPLNFFDPGSSSTASL 140
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF--- 123
+ CS+ RC+ + C +QC Y +YGDG + G V+DL GS
Sbjct: 141 ISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNS 200
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
+ + FGC +Q G L+ D A G+ G G+ +S++SQ+ G+ V HC+ +G
Sbjct: 201 SASIVFGCSISQ--TGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDG 258
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK--------DLTL 233
G L G++ + ++P++ + HY L + +GKS + +
Sbjct: 259 GGGGILVLGEIVEEDIVYSPLVPSQ---PHYNLNLQSISVNGKSLAIDPEVFATSTNRGT 315
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEY 291
I DSG + AY Y VS I A P+ +G + V
Sbjct: 316 IVDSGTTLAYLAEEAYDPFVSAITE---------AVSQSVRPLLSKGTQCYLITSSVKGI 366
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLV 317
F ++L+F V + + PE YL+
Sbjct: 367 FPTVSLNFA---GGVSMNLKPEDYLL 389
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 95/330 (28%), Positives = 138/330 (41%), Gaps = 55/330 (16%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN----I 66
YF + +G P K + DTGSD+ WV C C GC + Y P + +
Sbjct: 90 YF-TRIGIGTPAKRYYVQVDTGSDILWVNC-VSCDGCPRKSNLGIELTMYDPRGSQSGEL 147
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
V C C A + P C + C+Y I YGDG S+ G VTD +G +
Sbjct: 148 VTCDQQFCVANYGGVLPSCTSTS-PCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTP 206
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N ++FGCG G L + A G+LG G+ S++SQL G +R + HC+
Sbjct: 207 ANASVSFGCGAKL--GGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTV 264
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY------------ILG-PAELLYSGKSC 226
NG G+ +G+ P V TP++ +D+ HY LG P + SG S
Sbjct: 265 NGGGIFAIGNVVQPK--VKTTPLV---SDMPHYNVILKGIDVGGTALGLPTNIFDSGNSK 319
Query: 227 GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
G I DSG + AY VY+ + +++ ++ D F+ G
Sbjct: 320 G-----TIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFSC--------FQYSG 366
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
V + F + F V L+V P YL
Sbjct: 367 SVDDGFPEVTFHF---EGDVSLIVSPHDYL 393
>gi|213998798|gb|ACJ60766.1| nucellin [Hordeum brevisubulatum subsp. violaceum]
Length = 141
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 55/136 (40%), Positives = 78/136 (57%), Gaps = 5/136 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I+ NVIGHC+ G+GVL
Sbjct: 1 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMIKENVIGHCLSSKGKGVL 60
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 61 YVGDFNPPSRGVTWVPMRES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 117
Query: 245 TSRVYQEIVSLIMRDL 260
+++Y EIVS + L
Sbjct: 118 PAQIYNEIVSKVRGTL 133
>gi|213998826|gb|ACJ60780.1| nucellin [Hordeum intercedens]
Length = 148
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 80/142 (56%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 VAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMKES---LFYYSAGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS + L + L+
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLE 147
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 93/323 (28%), Positives = 136/323 (42%), Gaps = 40/323 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE---------KQYKPHKNI 66
YFA + +G P + + DTGSD+ WV C A CT C K + N
Sbjct: 74 YFA-KIGLGTPVQDYYVQVDTGSDILWVNC-AGCTNCPKKSDLGIELSLYSPSSSSTSNR 131
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV---- 122
V C+ C + + P C P C+Y + YGDG S+ G V D L G+
Sbjct: 132 VTCNQDFCTSTYDGPIPGCT-PELLCEYRVAYGDGSSTAGYFVRDHVVLDRVTGNFQTTS 190
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NG 181
N + FGCG Q + G+LG G+ S++SQL G ++ V HC+ NG
Sbjct: 191 TNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLDNING 250
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL------KDLT--L 233
G+ +G+ P V TP++ A HY + + + L DL
Sbjct: 251 GGIFAIGEVVQPK--VRTTPLVPQQA---HYNVFMKAIEVDNEVLNLPTDVFDTDLRKGT 305
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
I DSG + AYF +Y+ ++S I + LKL ++ F+ G V + F
Sbjct: 306 IIDSGTTLAYFPDVIYEPLISKIFARQ--STLKLHTVEEQFTC-----FEYDGNVDDGFP 358
Query: 294 PLALSFTNRRNSVRLVVPPEAYL 316
+ F +S+ L V P YL
Sbjct: 359 TVTFHF---EDSLSLTVYPHEYL 378
>gi|213998842|gb|ACJ60788.1| nucellin [Hordeum cordobense]
Length = 154
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 81/142 (57%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G ++FDSG++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEVVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS + L + L+
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLE 147
>gi|213998836|gb|ACJ60785.1| nucellin [Hordeum bogdanii]
Length = 154
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 80/142 (56%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK-DLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMRES---LFYYSPGLAELLIDNQPIGGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS + L + L+
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLE 147
>gi|213998830|gb|ACJ60782.1| nucellin [Hordeum pusillum]
Length = 147
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 81/142 (57%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 2 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 61
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 62 YVGDFNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 118
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS ++ L + L+
Sbjct: 119 PAQIYNEIVSKVIGTLSESSLE 140
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 88/327 (26%), Positives = 140/327 (42%), Gaps = 44/327 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
YF + +G PP F DTGSD+ WV C++ C GC + + ++
Sbjct: 79 YFT-KVKLGTPPMEFTVQIDTGSDILWVNCNS-CNGCPRSSGLGIQLNFFDASSSSSSSL 136
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTD--LFPLRFSNGSVFN 124
V CS+P C + +C ++QC Y +YGDG + G V++ F + + N
Sbjct: 137 VSCSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVMGQSMIAN 196
Query: 125 --VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ FGC + + G L+ D A G+ G G G +S++SQL G+ V HC+
Sbjct: 197 SSASVVFGC--STYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKGE 254
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK--------DLT 232
G G L G+V G+ ++P++ + HY L + +G++ + +
Sbjct: 255 GNGGGILVLGEVLEPGIVYSPLVPSQ---PHYNLYLQSISVNGQTLPIDPSVFATSINRG 311
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTE 290
I DSG + AY Y VS I A P +G + V E
Sbjct: 312 TIIDSGTTLAYLVEEAYTPFVSAITA---------AVSQSVTPTISKGNQCYLVSTSVGE 362
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLV 317
F ++L+F S +V+ PE YL+
Sbjct: 363 IFPLVSLNFA---GSASMVLKPEEYLM 386
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 87/328 (26%), Positives = 133/328 (40%), Gaps = 45/328 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
YF + +G P K F DTGSD+ W+ C C+ C + +
Sbjct: 83 YF-TKVKLGSPAKEFYVQIDTGSDILWINC-ITCSNCPHSSGLGIELDFFDTAGSSTAAL 140
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLF---PLRFSNGSVF 123
V C +P C+ C +QC Y +YGDG + G V+D + V
Sbjct: 141 VSCGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVA 200
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQ 179
N T G + + G L+ D A G+ G G G +S++SQL G+ V HC+ G+
Sbjct: 201 NSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGE 260
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKS--------CGLKDL 231
NG GVL LG+ PS + ++P++ + HY L + +G+ +
Sbjct: 261 NGGGVLVLGEILEPS--IVYSPLVPSQ---PHYNLNLQSIAVNGQLLPIDSNVFATTNNQ 315
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVT 289
I DSG + AY Y V I A + PI +G + V
Sbjct: 316 GTIVDSGTTLAYLVQEAYNPFVKAITA---------AVSQFSKPIISKGNQCYLVSNSVG 366
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+ F ++L+F +V+ PE YL+
Sbjct: 367 DIFPQVSLNF---MGGASMVLNPEHYLM 391
>gi|357461293|ref|XP_003600928.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355489976|gb|AES71179.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 295
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 131/320 (40%), Gaps = 92/320 (28%)
Query: 8 FFFFP----IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH 63
FF+ P I + V+L +G P + FD DTGSDLTW K YK H
Sbjct: 5 FFYDPLKISIVGGYTVSLKIGYPGQSFDVFIDTGSDLTW------------DKYKLYKLH 52
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
N V Y DG + G LV D PL S+ ++
Sbjct: 53 NNFVYVRIKLAI----------------------YVDGLQTKGFLVQDNIPLESSDRTLQ 90
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGR 182
T P P+S G+LGLG G SI+SQL+ GLI+NV+GHC G+ G+
Sbjct: 91 RPKCTNILKVTDKKPKPISK----GILGLGHGETSILSQLKSKGLIKNVVGHCFSGKEGQ 146
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
G G+ K+ G Y PA L++ K +KDL LIFDSG + +
Sbjct: 147 G----GNTKIDLEG--------------RYFSEPANLIFDEKLTFIKDLQLIFDSGTTLS 188
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNR 302
F S+ ++ +V P+++ +Y KP+ + F+N
Sbjct: 189 AFNSKDHKVLVD--------------PENEV--------------SKDYLKPIIMRFSNN 220
Query: 303 RNSVRLVVPPEAYLVISVST 322
LV E Y++IS S+
Sbjct: 221 VQCQLLV---EDYIIISCSS 237
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 71/262 (27%), Positives = 113/262 (43%), Gaps = 28/262 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE------KQYKPHKN----I 66
+ + +G PP + DTGSD+TW+ C APCT C + Y P ++
Sbjct: 37 YYTKIYLGTPPVGYYVQVDTGSDVTWLNC-APCTSCVTETQLPSIKLTTYDPSRSSTDGA 95
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR-FSNGSVFN- 124
+ C + C A N C C Y YGDG S+ G + D+ + N + N
Sbjct: 96 LSCRDSNCGAALGSNEVSCTSAG-YCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVNG 154
Query: 125 -VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+ FGCG Q +S G++G G+ +SI SQL G + N HC+ + +G
Sbjct: 155 TASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQGDNQG 214
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK---DLT------LI 234
+ G V +++TP++ HY +G + +G++ D T +I
Sbjct: 215 GGTIVIGSVSEPNISYTPIVSR----NHYAVGMQNIAVNGRNVTTPASFDTTSTSAGGVI 270
Query: 235 FDSGASYAYFTSRVYQEIVSLI 256
DSG + AY Y + V+ +
Sbjct: 271 MDSGTTLAYLVDPAYTQFVNAV 292
>gi|213998804|gb|ACJ60769.1| nucellin [Hordeum muticum]
gi|213998808|gb|ACJ60771.1| nucellin [Hordeum erectifolium]
gi|213998820|gb|ACJ60777.1| nucellin [Hordeum patagonicum subsp. mustersii]
gi|213998822|gb|ACJ60778.1| nucellin [Hordeum patagonicum subsp. santacrucense]
gi|333069937|gb|AEF13570.1| nucellin, partial [Hordeum pubiflorum]
Length = 154
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 80/142 (56%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS + L + L+
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLE 147
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 88/325 (27%), Positives = 140/325 (43%), Gaps = 40/325 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
YF + +G PP+ F+ DTGSD+ WV C++ C C + + +
Sbjct: 66 YFT-KVKLGSPPREFNVQIDTGSDVLWVCCNS-CNNCPRTSGLGIQLNFFDSSSSSTAGL 123
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSVFN 124
V CS+P C + +C +QC Y +Y DG + G V+D F V N
Sbjct: 124 VHCSDPICTSAVQTTVTQCSPQTNQCSYTFQYEDGSGTSGYYVSDTLYFDAILGESLVVN 183
Query: 125 VP--LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
+ FGC Q ++ G+ G G+G +S++SQL +G+ V HC+ G
Sbjct: 184 SSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCLKGEGI 243
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL--------KDLTLI 234
G L G++ G+ ++P++ + HY L + +GK + I
Sbjct: 244 GGGILVLGEILEPGMVYSPLVPSQ---PHYNLNLQSIAVNGKLLPIDPSVFATSNSQGTI 300
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYF 292
DSG + AY + Y VS + ++I +P PI +G + V++ F
Sbjct: 301 VDSGTTLAYLVAEAYDPFVSAV--NVIVSP-------SVTPIISKGNQCYLVSTSVSQMF 351
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLV 317
PLA SF N +V+ PE YL+
Sbjct: 352 -PLA-SF-NFAGGASMVLKPEDYLI 373
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 84/301 (27%), Positives = 127/301 (42%), Gaps = 26/301 (8%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----PEKQYKPHKN----I 66
YF + +G P K + DTGSD+ WV C PC+GC + P Y P ++ +
Sbjct: 2 YF-TQVGLGNPVKHYIVQVDTGSDVLWVNC-RPCSGCPRKSALNIPLTMYDPRESSTTSL 59
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRF--SNG-SVF 123
V CS+P C +C + C+Y YGDG +S G V D SNG +
Sbjct: 60 VSCSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANT 119
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+ FGC Q S G++G G+ +S+ +QL I V HC+ RG
Sbjct: 120 TSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRG 179
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSAD----LKHYILGPAELLYSGKS-CGLKDLTLIFDSG 238
L G + G+ +TP++ +S L+ + L + D +I DSG
Sbjct: 180 GGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSG 239
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+ AYF S Y V I TP+++ D F G++++ F + L+
Sbjct: 240 TTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQC-------FLVSGRLSDLFPNVTLN 292
Query: 299 F 299
F
Sbjct: 293 F 293
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 95/330 (28%), Positives = 137/330 (41%), Gaps = 55/330 (16%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN----I 66
YF + +G P K + DTGSD+ WV C C GC + Y P + +
Sbjct: 90 YF-TRIGIGTPAKRYYVQVDTGSDILWVNC-VSCDGCPRKSNLGIELTMYDPRGSQSGEL 147
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
V C C A + P C + C+Y I YGDG S+ G VTD +G +
Sbjct: 148 VTCDQQFCVANYGGVLPSCTSTS-PCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTP 206
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N ++FGCG G L + A G+LG G+ S++SQL G +R + HC+
Sbjct: 207 ANASVSFGCGAKL--GGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTV 264
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY------------ILG-PAELLYSGKSC 226
NG G+ +G+ P V TP++ D+ HY LG P + SG S
Sbjct: 265 NGGGIFAIGNVVQPK--VKTTPLV---PDMPHYNVILKGIDVGGTALGLPTNIFDSGNSK 319
Query: 227 GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
G I DSG + AY VY+ + +++ ++ D F+ G
Sbjct: 320 G-----TIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFSC--------FQYSG 366
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
V + F + F V L+V P YL
Sbjct: 367 SVDDGFPEVTFHF---EGDVSLIVSPHDYL 393
>gi|213998834|gb|ACJ60784.1| nucellin [Hordeum bulbosum]
Length = 154
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/136 (38%), Positives = 78/136 (57%), Gaps = 5/136 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QLR + +I+ NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLRGHKMIKENVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD P+ GV W PM ++ L +Y G AE+ + G +FDSG++Y +
Sbjct: 69 YVGDFNPPTRGVTWVPMRES---LFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDL 260
+++Y EIVS + L
Sbjct: 126 PAQIYSEIVSKVRGTL 141
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 95/330 (28%), Positives = 137/330 (41%), Gaps = 55/330 (16%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN----I 66
YF + +G P K + DTGSD+ WV C C GC + Y P + +
Sbjct: 90 YF-TRIGIGTPAKRYYVQVDTGSDILWVNC-VSCDGCPRKSNLGIELTMYDPRGSQSGEL 147
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
V C C A + P C + C+Y I YGDG S+ G VTD +G +
Sbjct: 148 VTCDQQFCVANYGGVLPSCTSTS-PCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTP 206
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N ++FGCG G L + A G+LG G+ S++SQL G +R + HC+
Sbjct: 207 ANASVSFGCGAKL--GGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTV 264
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY------------ILG-PAELLYSGKSC 226
NG G+ +G+ P V TP++ D+ HY LG P + SG S
Sbjct: 265 NGGGIFAIGNVVQPK--VKTTPLV---PDMPHYNVILKGIDVGGTALGLPTNIFDSGNSK 319
Query: 227 GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
G I DSG + AY VY+ + +++ ++ D F+ G
Sbjct: 320 G-----TIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFSC--------FQYSG 366
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
V + F + F V L+V P YL
Sbjct: 367 SVDDGFPEVTFHF---EGDVSLIVSPHDYL 393
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 98/293 (33%), Positives = 132/293 (45%), Gaps = 48/293 (16%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPC 69
YFAV + VG PP DTGSDL W+QC PC C + Y P H+ I PC
Sbjct: 87 EYFAV-INVGDPPTRALVVIDTGSDLIWLQC-VPCRHCYRQVTPLYDPRSSSTHRRI-PC 143
Query: 70 SNPRCA-ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTD--LFPLRFSNGSVFNVP 126
++PRC L +P C C Y + YGDG +S G L TD +FP + V NV
Sbjct: 144 ASPRCRDVLRYPG---CDARTGGCVYMVVYGDGSASSGDLATDRLVFP---DDTHVHNV- 196
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCIG------Q 179
T GCG++ N G L AG+LG+GRG++S +QL YG +V +C+G Q
Sbjct: 197 -TLGCGHD--NVGLLE--SAAGLLGVGRGQLSFPTQLAPAYG---HVFSYCLGDRLSRAQ 248
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNS-------ADLKHYILGPAELL-YSGKSCGLKDL 231
NG L G P S A+TP+ N D+ + +G + +S S L
Sbjct: 249 NGSSYLVFGRTPEPPS-TAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPA 307
Query: 232 T----LIFDSGASYAYFTSRVYQEIVSLI--MRDLIGTPLKLAPDDKTLPICW 278
T ++ DSG + + F Y + GT KLA C+
Sbjct: 308 TGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACY 360
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 90/308 (29%), Positives = 130/308 (42%), Gaps = 36/308 (11%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH----KNIVPCSNPRCAALH 78
+G PP+ F DTGS +T+V C++ C C + +++P + V C NP C
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNS-CDQCGNHQDPKFQPDLSDTYHPVKC-NPDCT--- 56
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFGCGYNQHN 137
C NDQC YE +Y + SS G L DL + F N S FGC
Sbjct: 57 ------CDTENDQCTYERQYAEMSSSSGILGEDL--VSFGNMSELKPQRAVFGC--ENAE 106
Query: 138 PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLGDGKVPSS 195
G L G++GLGRG +SIV QL E G+I + C G + G G + LG PS
Sbjct: 107 TGDLFSQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISPPSD 166
Query: 196 GVAWTPMLQNSADLK-HYILGPAELLYSGKSCGLKDLTL------IFDSGASYAYFTSRV 248
V + D +Y + L +GK + I DSG +YAY
Sbjct: 167 MV----FSHSDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEAA 222
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRL 308
+ + I +L G PD +C+ G + ++ + F + + F N +
Sbjct: 223 FLPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGE---KY 279
Query: 309 VVPPEAYL 316
+ PE YL
Sbjct: 280 SLSPENYL 287
>gi|213998816|gb|ACJ60775.1| nucellin [Hordeum patagonicum subsp. patagonicum]
Length = 152
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 80/142 (56%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 7 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKVITGNVIGHCLSSKGKGVL 66
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 67 YVGDFNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 123
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS + L + L+
Sbjct: 124 PAQIYNEIVSKVRGTLSESSLE 145
>gi|213998838|gb|ACJ60786.1| nucellin [Hordeum vulgare subsp. vulgare]
Length = 154
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 81/142 (57%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ + +I+ NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD P+ GV W PM ++ L +Y G AE+ + G +FDSG++Y +
Sbjct: 69 YVGDFNPPTRGVTWAPMRES---LFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS + L + L+
Sbjct: 126 PAQIYNEIVSKVRVTLSESSLE 147
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 89/321 (27%), Positives = 132/321 (41%), Gaps = 38/321 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN--IVPC 69
+ + +G PPK + DTGSD+ WV C C GC QY P + V C
Sbjct: 84 YYTRIEIGSPPKGYYVQVDTGSDILWVNC-IRCDGCPTRSGLGIELTQYDPAGSGTTVGC 142
Query: 70 SNPRCAALHWPN-PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SVFN 124
C A PP C + C + I YGDG ++ G VTD +G + N
Sbjct: 143 EQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQTTTSN 202
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGRG 183
+TFGCG S G+LG G+ S++SQL +R + HC+ G G
Sbjct: 203 ASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGG 262
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--------IF 235
+ +G+ P V TP++ N + HY + + G + L T I
Sbjct: 263 IFAIGNVVQPK--VKTTPLVPN---VTHYNVNLQGISVGGATLQLPTSTFDSGDSKGTII 317
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + AY VY+ +++ + PL D +C F+ G + + F +
Sbjct: 318 DSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQD----FVC----FQFSGSIDDGFPVI 369
Query: 296 ALSFTNRRNSVRLVVPPEAYL 316
SF + + L V P+ YL
Sbjct: 370 TFSF---KGDLTLNVYPDDYL 387
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 87/319 (27%), Positives = 134/319 (42%), Gaps = 30/319 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----PEKQYKPHKN----I 66
YF + +G P K + DTGSD+ WV C PC+GC + P Y P ++ +
Sbjct: 29 YF-TQVGLGNPVKHYIVQVDTGSDVLWVNC-RPCSGCPRKSALNIPLTMYDPRESSTTSL 86
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRF--SNG-SVF 123
V CS+P C +C + C+Y YGDG +S G V D SNG +
Sbjct: 87 VSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANT 146
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+ FGC Q S G++G G+ +S+ +QL I V HC+ RG
Sbjct: 147 TSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRG 206
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSAD----LKHYILGPAELLYSGKS-CGLKDLTLIFDSG 238
L G + G+ +TP++ +S L+ + L + D +I DSG
Sbjct: 207 GGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSG 266
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+ AYF S Y V I TP+++ D F G++++ F + L+
Sbjct: 267 TTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQC-------FLVSGRLSDLFPNVTLN 319
Query: 299 FTNRRNSVRLVVPPEAYLV 317
F + + P+ YL+
Sbjct: 320 FEGG----AMELQPDNYLM 334
>gi|213998824|gb|ACJ60779.1| nucellin [Hordeum chilense]
Length = 140
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 55/142 (38%), Positives = 78/142 (54%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 1 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 60
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
+ GD PS GV W PM ++ +Y G AELL + G +FDSG++Y +
Sbjct: 61 YFGDFNPPSRGVTWVPMKESXX---YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 117
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS + L + L+
Sbjct: 118 PAQIYNEIVSKVRGTLSESSLE 139
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 83/321 (25%), Positives = 139/321 (43%), Gaps = 49/321 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----PEKQYKPHKNI----V 67
+ + +G PP+ F DTGSD+ WV C PCT C + P + P K+ +
Sbjct: 48 YYTRIYLGTPPQQFYVHVDTGSDVAWVNC-VPCTNCKRASNVALPISIFDPEKSTSKTSI 106
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLF-----PLRFSNGSV 122
C++ C + + +C + C Y YGDG S+ G L+ D+ P S +
Sbjct: 107 SCTDEEC---YLASNSKCSFNSMSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATS 163
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
LTFGCG NQ T G++G G+ +S+ SQL + + N+ HC+ + +
Sbjct: 164 GTARLTFGCGSNQTGTWL-----TDGLVGFGQAEVSLPSQLSKQNVSVNIFAHCLQGDNK 218
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK---DLT----LIF 235
G L G + G+ +TP++ + HY + + SG + DL+ +I
Sbjct: 219 GSGTLVIGHIREPGLVYTPIVPKQS---HYNVELLNIGVSGTNVTTPTAFDLSNSGGVIM 275
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + Y Y + + + RD + + + LP+ F+ + YF +
Sbjct: 276 DSGTTLTYLVQPAYDQFQAKV-RDCMRSGV--------LPVA----FQFFCTIEGYFPNV 322
Query: 296 ALSFTNRRNSVRLVVPPEAYL 316
L F +++ P +YL
Sbjct: 323 TLYFA---GGAAMLLSPSSYL 340
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 90/310 (29%), Positives = 132/310 (42%), Gaps = 40/310 (12%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH----KNIVPCSNPRCAALH 78
+G PP+ F DTGS +T+V C++ C C + +++P + V C NP C
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNS-CDQCGNHQDPKFQPDLSDTYHPVKC-NPDCT--- 56
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFGCGYNQHN 137
C NDQC YE +Y + SS G L DL + F N S FGC
Sbjct: 57 ------CDTENDQCTYERQYAEMSSSSGILGEDL--VSFGNMSELKPQRAVFGC--ENAE 106
Query: 138 PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLGDGKVPSS 195
G L G++GLGRG +SIV QL E G+I + C G + G G + LG PS
Sbjct: 107 TGDLFSQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISPPSD 166
Query: 196 GVAWTPMLQNSADLK---HYILGPAELLYSGKSCGLKDLTL------IFDSGASYAYFTS 246
M+ + +D +Y + L +GK + I DSG +YAY
Sbjct: 167 ------MVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPE 220
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSV 306
+ + I +L G PD +C+ G + ++ + F + + F N
Sbjct: 221 AAFLPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGE--- 277
Query: 307 RLVVPPEAYL 316
+ + PE YL
Sbjct: 278 KYSLSPENYL 287
>gi|213998818|gb|ACJ60776.1| nucellin [Hordeum patagonicum subsp. setifolium]
Length = 149
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 55/142 (38%), Positives = 80/142 (56%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EI+S + L + L+
Sbjct: 126 PAQIYNEILSKVRGTLSESSLE 147
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 89/323 (27%), Positives = 135/323 (41%), Gaps = 39/323 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN----I 66
YF + +G P K + DTGSD+ WV C CT C + + Y P ++
Sbjct: 69 YFT-KIGLGSPSKDYYVQVDTGSDILWVNC-VECTRCPRKSDIGIGLTLYDPKRSKTSEF 126
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
V C + C++ + CK N C Y I YGDG ++ G V D NG +
Sbjct: 127 VSCEHNFCSSTYEGRILGCKAEN-PCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTAT 185
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
N + FGCG Q S + G++G G+ S++SQL G ++ + HC+ N
Sbjct: 186 QNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNV 245
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-------- 233
G +F G+V V TP++ N A HY + + G L T
Sbjct: 246 GGGIF-SIGEVVEPKVKTTPLVPNMA---HYNVILKNIEVDGDILQLPSDTFDSENGKGT 301
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
+ DSG + AY VY +++S ++ + L + + F+ G V F
Sbjct: 302 VIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQYSC-------FQYTGNVDSGFP 354
Query: 294 PLALSFTNRRNSVRLVVPPEAYL 316
+ L F +S+ L V P YL
Sbjct: 355 IVKLHF---EDSLSLTVYPHDYL 374
>gi|308080924|ref|NP_001183009.1| uncharacterized protein LOC100501329 [Zea mays]
gi|238008766|gb|ACR35418.1| unknown [Zea mays]
Length = 205
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 56/128 (43%), Positives = 70/128 (54%), Gaps = 4/128 (3%)
Query: 6 IEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN 65
I+ FP Y+ ++ +G PP+ + D DTGSDLTW+QCDAPCT C K P YKP K
Sbjct: 80 IKGNVFPDGQYY-TSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKE 138
Query: 66 -IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
IVP + C L N C+ QCDYEIEY D SS+G L D + +NG
Sbjct: 139 KIVPPRDLLCQELQG-NQNYCETCK-QCDYEIEYADQSSSMGVLARDDMHMIATNGGREK 196
Query: 125 VPLTFGCG 132
+ FGC
Sbjct: 197 LDFVFGCA 204
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 89/323 (27%), Positives = 131/323 (40%), Gaps = 41/323 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +G P ++F DTGSDLTWVQC +PC C + + P+ + + C
Sbjct: 3 YLATVRLGTPERVFSVIVDTGSDLTWVQC-SPCGTCYSQNDSLFIPNTSTSFTKLACGTE 61
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C L + P C C Y YGDG S G V D + NG VP FGC
Sbjct: 62 LCNGLPY---PMCNQTT--CVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGC 116
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-----NGRGVLF 186
G++ N G + D G+LGLG+G +S SQL+ + +C+ L
Sbjct: 117 GHD--NEGSFAGAD--GILGLGQGPLSFPSQLKT--VFNGKFSYCLVDWLAPPTQTSPLL 170
Query: 187 LGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----------IF 235
GD VP+ GV + +L N +Y + + GK + IF
Sbjct: 171 FGDAAVPTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIF 230
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + V+QE+++ + + P K + D L +C LG E P
Sbjct: 231 DSGTTVTQLAGEVHQEVLAAMNASTMDYPRK-SDDSSGLDLC-------LGGFAEGQLPT 282
Query: 296 ALSFTNRRNSVRLVVPPEAYLVI 318
S T + +PP Y +
Sbjct: 283 VPSMTFHFEGGDMELPPSNYFIF 305
>gi|213998840|gb|ACJ60787.1| nucellin [Hordeum patagonicum subsp. magellanicum]
Length = 154
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 55/142 (38%), Positives = 80/142 (56%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EI+S + L + L+
Sbjct: 126 PAQIYNEILSKVRGTLSESSLE 147
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 89/321 (27%), Positives = 131/321 (40%), Gaps = 38/321 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN--IVPC 69
+ + +G PPK + DTGSD+ WV C C GC QY P + V C
Sbjct: 84 YYTRIEIGSPPKGYYVQVDTGSDILWVNC-IRCDGCPTRSGLGIELTQYDPAGSGTTVGC 142
Query: 70 SNPRCAALHWPN-PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SVFN 124
C A PP C + C + I YGDG ++ G VTD +G + N
Sbjct: 143 EQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQTTTSN 202
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGRG 183
+TFGCG S G+LG G+ S++SQL +R + HC+ G G
Sbjct: 203 ASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGG 262
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--------IF 235
+ +G+ P V TP++ N + HY + + G + L T I
Sbjct: 263 IFAIGNVVQPK--VKTTPLVPN---VTHYNVNLQGISVGGATLQLPTSTFDSGDSKGTII 317
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + AY VY+ +++ + PL D +C F+ G + + F +
Sbjct: 318 DSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQD----FVC----FQFSGSIDDGFPVI 369
Query: 296 ALSFTNRRNSVRLVVPPEAYL 316
SF + L V P+ YL
Sbjct: 370 TFSF---EGDLTLNVYPDDYL 387
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 75/269 (27%), Positives = 118/269 (43%), Gaps = 43/269 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPH----KNIV 67
+ + +G PPK + DTGSD+ WV C C C + + + Y P + V
Sbjct: 83 YYTEIEIGTPPKQYHVQVDTGSDILWVNC-ISCNKCPRKSDLGIDLRLYDPKGSSSGSTV 141
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS----VF 123
C CAA + P C N C+Y + YGDG S+ G V+D +G
Sbjct: 142 SCDQKFCAATYGGKLPGCA-KNIPCEYSVMYGDGSSTTGYFVSDSLQYNQVSGDGQTRHA 200
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-N 180
N + FGCG Q G L + A G++G G+ S++SQL G ++ + HC+
Sbjct: 201 NASVIFGCGAQQ--GGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCLDTIK 258
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILG-------------PAELLYSGKSCG 227
G G+ +GD P V TP++ D+ HY + P+ + +G+ G
Sbjct: 259 GGGIFAIGDVVQPK--VKSTPLV---PDMPHYNVNLESINVGGTTLQLPSHMFETGEKKG 313
Query: 228 LKDLTLIFDSGASYAYFTSRVYQEIVSLI 256
I DSG + Y VY+++++ +
Sbjct: 314 -----TIIDSGTTLTYLPELVYKDVLAAV 337
>gi|172034220|gb|ACB69715.1| putative nucellin-like aspartic protease [Hordeum vulgare]
Length = 310
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 68/208 (32%), Positives = 94/208 (45%), Gaps = 13/208 (6%)
Query: 116 RFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGH 175
R++ G G ++Q SP T+G+LGL IS+ SQL G+I NV GH
Sbjct: 5 RYNGGR--KASFVLGVTFDQQGQLLSSPAKTSGILGLSSAAISLPSQLASKGIISNVFGH 62
Query: 176 CIGQ--NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL 233
CI + NG G +FLGD VP G+ W P+ +L H G+ + +
Sbjct: 63 CITRETNGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGIP-VQV 121
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
I G SY Y +Y+ ++ I D D TLP+CW+ F V +FK
Sbjct: 122 ISRCGTSYTYLPEEMYKNLIDAIKED--SPSFVQDSSDTTLPLCWKADF----SVRSFFK 175
Query: 294 PLALSFTNRRNSV--RLVVPPEAYLVIS 319
PL L F R V + P+ YL+IS
Sbjct: 176 PLNLHFGRRWFVVPKTFTIVPDDYLIIS 203
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 76/285 (26%), Positives = 122/285 (42%), Gaps = 28/285 (9%)
Query: 1 MYVSWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQY 60
M I+ P + +NL +G PP DTGSDLTW QC PCT C K +
Sbjct: 76 MTSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQC-RPCTHCYKQVVPLF 134
Query: 61 KPHKNIV----PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR 116
P + C C AL R +C + Y DG + G L ++ +
Sbjct: 135 DPKNSSTYRDSSCGTSFCLAL---GKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVD 191
Query: 117 FSNGSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGH 175
+ G + P FGCG H+ G + ++G++GLG G +S++SQL+ I + +
Sbjct: 192 STAGKPVSFPGFAFGCG---HSSGGIFDKSSSGIVGLGGGELSLISQLKS--TINGLFSY 246
Query: 176 CI------GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSG 223
C+ + F G+V G TP++Q S D +Y+ +G L Y G
Sbjct: 247 CLLPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKG 306
Query: 224 --KSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLK 266
K +++ +I DSG +Y + Y ++ + + G ++
Sbjct: 307 YSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVR 351
>gi|213998806|gb|ACJ60770.1| nucellin [Hordeum flexuosum]
Length = 136
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 54/130 (41%), Positives = 75/130 (57%), Gaps = 5/130 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSLVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDSG++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHV 125
Query: 245 TSRVYQEIVS 254
+++Y EIVS
Sbjct: 126 PAQIYNEIVS 135
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 94/327 (28%), Positives = 137/327 (41%), Gaps = 47/327 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPH----KNI 66
YF L +G PPK + DTGSD+ WV C C+ C + + Y P +
Sbjct: 70 YFT-KLGLGSPPKDYYVQVDTGSDILWVNC-VKCSRCPRKSDLGIDLTLYDPKGSETSEL 127
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
+ C C+A + P CK C Y I YGDG ++ G V D N ++ P
Sbjct: 128 ISCDQEFCSATYDGPIPGCK-SEIPCPYSITYGDGSATTGYYVQDYLTYNHVNDNLRTAP 186
Query: 127 ----LTFGCGYNQHNPGPLSPPDTA---GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
+ FGCG Q G LS G++G G+ S++SQL G ++ + HC+
Sbjct: 187 QNSSIIFGCGAVQ--SGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCL-D 243
Query: 180 NGRGVLFLGDGKVPSSGVAWTPM---------LQNSADLKHYILG-PAELLYSGKSCGLK 229
N RG G+V V+ TP+ + S ++ IL P+++ SG G
Sbjct: 244 NIRGGGIFAIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSGNGKG-- 301
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
I DSG + AY + VY E++ +M L L + F+ G V
Sbjct: 302 ---TIIDSGTTLAYLPAIVYDELIPKVMARQPRLKLYLVEQQFSC-------FQYTGNVD 351
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYL 316
F + L F +S+ L V P YL
Sbjct: 352 RGFPVVKLHF---EDSLSLTVYPHDYL 375
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 91/329 (27%), Positives = 136/329 (41%), Gaps = 44/329 (13%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN-- 65
I + + +G P K + DTGSD+ WV C C C K Y +++
Sbjct: 74 ILGLYYAKIGIGTPTKDYYVQVDTGSDIMWVNC-IQCRECPKTSSLGIDLTLYNINESDT 132
Query: 66 --IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG--- 120
+VPC C ++ P C N C Y YGDG S+ G V D+ +G
Sbjct: 133 GKLVPCDQEFCYEINGGQLPGCT-ANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLK 191
Query: 121 -SVFNVPLTFGCGYNQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI- 177
+ N + FGCG Q + G + G+LG G+ S++SQL G ++ + HC+
Sbjct: 192 TTAANGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCLD 251
Query: 178 GQNGRGVLFLGDGKVPSSGVAWTPMLQN---------SADLKHYILG-PAELLYSGKSCG 227
G NG G+ +G P V TP++ N + + H L P ++ +G G
Sbjct: 252 GTNGGGIFVIGHVVQPK--VNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVFEAGDRKG 309
Query: 228 LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQ 287
I DSG + AY VY+ +VS I+ + D+ T F+
Sbjct: 310 -----AIIDSGTTLAYLPEMVYKPLVSKIISQQPDLKVHTVRDEYTC-------FQYSDS 357
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
+ + F + F NSV L V P YL
Sbjct: 358 LDDGFPNVTFHF---ENSVILKVYPHEYL 383
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 78/271 (28%), Positives = 117/271 (43%), Gaps = 41/271 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPH----K 64
+ + +G PPK F DTGSD+ WV C C K P K Y P
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVN----CVSCDKCPTKSGLGIDLALYDPKGSSSG 142
Query: 65 NIVPCSNPRCAALHWPNP--PRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV 122
+ V C N CAA + P C C+Y EYGDG S+ G+ V+D +G+
Sbjct: 143 SAVSCDNKFCAATYGSGEKLPGCT-AGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNA 201
Query: 123 ----FNVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+ FGCG Q G L + A G++G G+ S +SQL G ++ + HC
Sbjct: 202 QTRHAKANVIFGCGAQQ--GGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHC 259
Query: 177 IGQ-NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL------- 228
+ G G+ +G+ P V TP+L N + HY + + +G + L
Sbjct: 260 LDTIKGGGIFAIGEVVQPK--VKSTPLLPN---MSHYNVNLQSIDVAGNALQLPPHIFET 314
Query: 229 -KDLTLIFDSGASYAYFTSRVYQEIVSLIMR 258
+ I DSG + Y VY++I++ + +
Sbjct: 315 SEKRGTIIDSGTTLTYLPELVYKDILAAVFQ 345
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 85/327 (25%), Positives = 140/327 (42%), Gaps = 44/327 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
YF + +G PP+ F+ DTGSD+ WV C++ C C + + ++
Sbjct: 86 YFT-KVKLGSPPREFNVQIDTGSDILWVTCNS-CNDCPRTSGLGIELSFFDPSSSSTTSL 143
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSVFN 124
V CS+P C +L C ++QC Y YGDG + G V+D+ F + + N
Sbjct: 144 VSCSHPICTSLVQTTAAECSPQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTVLGDSLIAN 203
Query: 125 --VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ FGC + + G L+ D A G+ G G+ +S+VSQL G+ V HC+
Sbjct: 204 SSASIVFGC--STYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCLKGE 261
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL--------KDLT 232
G G L G++ + ++P++ + + HY L + +G+ + +
Sbjct: 262 GDGGGKLVLGEILEPNIIYSPLVPSQS---HYNLNLQSISVNGQLLPIDPAVFATSNNQG 318
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTE 290
I DSG + Y Y VS I + T P+ +G + V E
Sbjct: 319 TIVDSGTTLTYLVETAYDPFVSAITATV---------SSSTTPVLSKGNQCYLVSTSVDE 369
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLV 317
F P++L+F +V+ P YL+
Sbjct: 370 IFPPVSLNFA---GGASMVLKPGEYLM 393
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 102/359 (28%), Positives = 152/359 (42%), Gaps = 65/359 (18%)
Query: 9 FFFPIFS--------YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQY 60
F PIFS YFAV + VG P + DTGSD+TW+QC APCT C K + +
Sbjct: 1 FEAPIFSGLAFGTGEYFAV-VGVGTPRRDMYLVVDTGSDITWLQC-APCTNCYKQKDALF 58
Query: 61 KPHKN----IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL- 115
P + ++ CS+ C L C +++C Y+ +YGDG ++G LVTD L
Sbjct: 59 NPSSSSSFKVLDCSSSLCLNLDVMG---CL--SNKCLYQADYGDGSFTMGELVTDNVVLD 113
Query: 116 -RFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIG 174
F G V + GCG++ N G AG+LGLGRG +S + L RN+
Sbjct: 114 DAFGPGQVVLTNIPLGCGHD--NEGTFGT--AAGILGLGRGPLSFPNNLDAS--TRNIFS 167
Query: 175 HCIGQ-----NGRGVLFLGDGKVPSSG---VAWTPMLQNSADLKHYILGPAELLYSGKSC 226
+C+ N + L GD +P + V + P L+N +Y + +G S
Sbjct: 168 YCLPDRESDPNHKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYY-----VQITGISV 222
Query: 227 GLKDLT----------------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD 270
G LT IFDSG + +R Y + + L A D
Sbjct: 223 GGNLLTNIPASVFQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAFRAATM--HLTSAAD 280
Query: 271 DKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAY 329
K C+ F + ++ + F + V + +PP Y+V + +I A+
Sbjct: 281 FKIFDTCYD--FTGMNSIS--VPTVTFHF---QGDVDMRLPPSNYIVPVSNNNIFCFAF 332
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 86/263 (32%), Positives = 125/263 (47%), Gaps = 33/263 (12%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---- 66
F YFA+ + VG P DTGSDL W+QC +PC C + + P ++
Sbjct: 81 FESGEYFAL-VGVGTPSTKAMLVIDTGSDLVWLQC-SPCRRCYAQRGQVFDPRRSSTYRR 138
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
VPCS+P+C AL +P C Y + YGDG SS G L TD L F+N + N
Sbjct: 139 VPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATD--KLAFANDTYVN-N 195
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCIG-QNGRGV 184
+T GCG + N G AG+LG+GRG+ISI +Q+ YG +V +C+G + R
Sbjct: 196 VTLGCG--RDNEGLFD--SAAGLLGVGRGKISISTQVAPAYG---SVFEYCLGDRTSRST 248
Query: 185 L--FLGDGKVPS-SGVAWTPMLQNS-------ADLKHYILGPAELL-YSGKSCGLKDLT- 232
+L G+ P A+T +L N D+ + +G + +S S L T
Sbjct: 249 RSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATG 308
Query: 233 ---LIFDSGASYAYFTSRVYQEI 252
++ DSG + + F Y +
Sbjct: 309 RGGVVVDSGTAISRFARDAYAAL 331
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 85/295 (28%), Positives = 129/295 (43%), Gaps = 46/295 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
V+L VG PP+ DTGS+L+W+ C AP K ++P + VPC++
Sbjct: 85 LTVSLAVGTPPQNVTMVLDTGSELSWLLC-APAGARNKFSAMSFRPRASSTFAAVPCASA 143
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+C + P+PP C + +C + Y DG SS GAL TD+F + GS + FGC
Sbjct: 144 QCRSRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAV----GSGPPLRAAFGCM 199
Query: 133 YNQHNPGPLSPPD---TAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-QNGRGVLFLG 188
+ + S PD +AG+LG+ RG +S VSQ +CI ++ GVL LG
Sbjct: 200 SSAFD----SSPDGVASAGLLGMNRGALSFVSQAST-----RRFSYCISDRDDAGVLLLG 250
Query: 189 DGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-------------- 233
+P+ + +TPM Q + L ++ + G G K L +
Sbjct: 251 HSDLPTFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQ 310
Query: 234 -IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD------KTLPICWRGP 281
+ DSG + + Y + + R PL A DD + C+R P
Sbjct: 311 TMVDSGTQFTFLLGDAYSALKAEFTRQ--ARPLLPALDDPSFAFQEAFDTCFRVP 363
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 90/322 (27%), Positives = 133/322 (41%), Gaps = 40/322 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN--IVPC 69
+ + +G P K + DTGSD+ WV C C GC QY P + V C
Sbjct: 85 YYTQIEIGSPSKGYYVQVDTGSDILWVNC-IRCDGCPTTSGLGIELTQYDPAGSGTTVGC 143
Query: 70 SNPRCAALHWPN--PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP- 126
C A + PN PP C + C + I YGDG S+ G V+D +G+ P
Sbjct: 144 DQEFCVA-NSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQTTPS 202
Query: 127 ---LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGR 182
+TFGCG S G+LG G+ S++SQL +R + HC+ +G
Sbjct: 203 NASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCLDTVHGG 262
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--------I 234
G+ +G+ P V TP++QN + HY + + G + L T I
Sbjct: 263 GIFAIGNVVQPK--VKTTPLVQN---VTHYNVNLQGISVGGATLQLPSSTFDSGDSKGTI 317
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG + AY VY+ +++ + L D +C F+ G + + F
Sbjct: 318 IDSGTTLAYLPREVYRTLLTAVFDKYQDLALHNYQD----FVC----FQFSGSIDDGFPV 369
Query: 295 LALSFTNRRNSVRLVVPPEAYL 316
+ SF + L V P YL
Sbjct: 370 VTFSF---EGEITLNVYPHDYL 388
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/334 (27%), Positives = 145/334 (43%), Gaps = 51/334 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYK---------PHKNIV 67
+ + +G PP+ F+ DTGSD+ WV C + C GC K E Q + ++V
Sbjct: 84 YYTKVKLGTPPREFNVQIDTGSDVLWVSCTS-CNGCPKTSELQIQLSFFDPGVSSSASLV 142
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV-- 125
CS+ RC + ++ C PN+ C Y +YGDG + G ++D S +
Sbjct: 143 SCSDRRCYS-NFQTESGCS-PNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINS 200
Query: 126 --PLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQ 179
P FGC Q G L P A G+ GLG+G +S++SQL GL V HC+ +
Sbjct: 201 SAPFVFGCSNLQ--TGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDK 258
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK---------SCGLKD 230
+G G++ LG K P + +TP++ + HY + + +G+ + D
Sbjct: 259 SGGGIMVLGQIKRPDT--VYTPLVPSQ---PHYNVNLQSIAVNGQILPIDPSVFTIATGD 313
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQV 288
T+I D+G + AY Y + I A PI + F+
Sbjct: 314 GTII-DTGTTLAYLPDEAYSPFIQAIAN---------AVSQYGRPITYESYQCFEITAGD 363
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISVST 322
+ F ++LSF +V+ P AYL I S+
Sbjct: 364 VDVFPEVSLSFA---GGASMVLRPHAYLQIFSSS 394
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/328 (28%), Positives = 135/328 (41%), Gaps = 56/328 (17%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPH----KNIVPCS 70
+G PK + DTGSD WV C GCT P+K Y P+ VPC
Sbjct: 80 IGLGPKDYYVQVDTGSDTLWVNC----VGCTACPKKSGLGMDLTLYDPNLSKTSKAVPCD 135
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP---- 126
+ C + + C C Y I YGDG ++ G+ + D G + VP
Sbjct: 136 DEFCTSTYDGQISGCTKGM-SCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTS 194
Query: 127 LTFGCGYNQHNPGPLSPP-DTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+ FGCG Q G LS DT+ G++G G+ S++SQL G ++ + HC+ G
Sbjct: 195 VIFGCGSKQ--SGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLDSISGG 252
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHY-------------ILGPAELLYSGKSCGLKD 230
+F G+V V TP+LQ A HY I P+++L S G
Sbjct: 253 GIF-AIGEVVQPKVKTTPLLQGMA---HYNVVLKDIEVAGDPIQLPSDILDSSSGRG--- 305
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG + AY +Y +++ I+ G L L D T C+ + V +
Sbjct: 306 --TIIDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYLVEDQFT---CFH--YSDEESVDD 358
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVI 318
F + +F + L P YL +
Sbjct: 359 LFPTVKFTF---EEGLTLTTYPRDYLFL 383
>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 71/214 (33%), Positives = 108/214 (50%), Gaps = 27/214 (12%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----V 67
PI + L +G PP+ F+ DTGSD+ WV C + C GC + P + +
Sbjct: 77 PISRIYYTTLQIGTPPREFNVVIDTGSDVLWVSCIS-CVGCPLQNVTFFDPGASSSAVKL 135
Query: 68 PCSNPRC-AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV- 125
CS+ RC + LH K +Y++EY DG + G ++DL S V
Sbjct: 136 ACSDKRCFSDLHK------KSGCSPLEYKVEYSDGSFTSGYYISDLISFETVMSSNLTVK 189
Query: 126 ---PLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--G 178
P FGC N H G +S P+T+ G++GLG+GR+ +VSQL L V C+ G
Sbjct: 190 SSAPFVFGCS-NLH-AGLISLPETSIHGIVGLGKGRLLVVSQLSSQRLAPEVFSLCLSGG 247
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY 212
Q G GV+ LG+ ++P++ +TP++++ HY
Sbjct: 248 QEGGGVIILGENRLPNT--VYTPLVRSQT---HY 276
>gi|213998848|gb|ACJ60790.1| nucellin [Psathyrostachys stoloniformis]
Length = 154
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 54/142 (38%), Positives = 78/142 (54%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD P+ GV W PM ++ L +Y G A L + G +FDSG++Y Y
Sbjct: 69 YVGDFNPPTRGVTWVPMRES---LFYYSPGLAALFIDKQPIRGNPTFEAVFDSGSTYTYM 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y E+VS I L + L+
Sbjct: 126 PAQIYNELVSKIRGTLSESSLE 147
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 81/287 (28%), Positives = 129/287 (44%), Gaps = 26/287 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L +G PPK + DTGS L+W+QC C + ++P + + CS+
Sbjct: 120 YYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSS 179
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTF 129
C+ L N P C + C Y YGD S+G L DL L S +P T+
Sbjct: 180 ECSLLKAATLNDPLCT-ASGVCVYTASYGDASYSMGYLSRDLLTLTPSQ----TLPSFTY 234
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCI-GQNGRGVLFL 187
GCG Q N G AG++GL R ++S+++QL +YG +C+ G FL
Sbjct: 235 GCG--QDNEGLFG--KAAGIVGLARDKLSMLAQLSPKYGY---AFSYCLPTSTSSGGGFL 287
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIFDSGASYAY 243
GK+ S +TPM++NS + Y L A + +G+ G+ + I DSG
Sbjct: 288 SIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPTIIDSGTVVTR 347
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
+Y + ++ ++ + AP L C++G K++ E
Sbjct: 348 LPISIYAALREAFVK-IMSRRYEQAPAYSILDTCFKGSLKSMSGAPE 393
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 89/327 (27%), Positives = 137/327 (41%), Gaps = 50/327 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKNI-- 66
+ + +G P K + DTGSD+ WV C C + P K Y P +
Sbjct: 4 YYTEIGIGTPTKRYYVQVDTGSDILWVNC----ISCDRCPRKSGLGLELTLYDPKDSSTG 59
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV-- 122
V C CAA + P C + C+Y + YGDG S+ G V+DL +G
Sbjct: 60 SKVSCDQGFCAATYGGLLPGCT-TSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQT 118
Query: 123 --FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N +TFGCG Q S G++G G+ S++SQL G ++ + HC+
Sbjct: 119 RPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTI 178
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG------PAELLYSGKSCGLK 229
NG G+ +G+ P V TP++ N + +LK +G P+ + +G+ G
Sbjct: 179 NGGGIFAIGNVVQPK--VKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKG-- 234
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
I DSG + Y VY+E IM + + + +C F+ +G+V
Sbjct: 235 ---TIIDSGTTLTYLPEIVYKE----IMLAVFAKHKDITFHNVQEFLC----FQYVGRVD 283
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYL 316
+ F + F N + L V P Y
Sbjct: 284 DDFPKITFHF---ENDLPLNVYPHDYF 307
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 94/323 (29%), Positives = 136/323 (42%), Gaps = 39/323 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPH----KNI 66
YF L +G PP+ + DTGSD+ WV C C+ C + + Y P ++
Sbjct: 70 YFT-KLGLGSPPRDYYVQVDTGSDILWVNC-VECSRCPRKSDLGIDLTLYDPKGSETSDV 127
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
V C C+A P CK C Y I YGDG ++ G V D NG++ P
Sbjct: 128 VSCDQDFCSATFDGPIPGCK-SEIPCPYSITYGDGSATTGYYVQDYLTYNRINGNLRTSP 186
Query: 127 ----LTFGCGYNQHNP-GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
+ FGCG Q G S G++G G+ S++SQL G ++ + HC+ N
Sbjct: 187 QNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL-DNV 245
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHY--ILGPAEL------LYSGKSCGLKDLTL 233
RG G+V V+ TP++ A HY +L E+ L S +
Sbjct: 246 RGGGIFAIGEVVEPKVSTTPLVPRMA---HYNVVLKSIEVDTDILQLPSDIFDSVNGKGT 302
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
+ DSG + AY VY E++ ++ G L L +R F G V F
Sbjct: 303 VIDSGTTLAYLPDIVYDELIQKVLARQPGLKLYLVEQQ------FR-CFLYTGNVDRGFP 355
Query: 294 PLALSFTNRRNSVRLVVPPEAYL 316
+ L F ++S+ L V P YL
Sbjct: 356 VVKLHF---KDSLSLTVYPHDYL 375
>gi|213998845|gb|ACJ60789.1| nucellin [Psathyrostachys fragilis subsp. fragilis]
Length = 150
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 54/142 (38%), Positives = 78/142 (54%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 7 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSKGKGVL 66
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD P+ GV W PM ++ L +Y G A L + G +FDSG++Y Y
Sbjct: 67 YVGDFNPPTRGVTWVPMRES---LFYYSPGLAALFIDKQPIRGNPTFEAVFDSGSTYTYV 123
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y E+VS I L + L+
Sbjct: 124 PAQIYNELVSKIRGTLSESSLE 145
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 92/334 (27%), Positives = 145/334 (43%), Gaps = 51/334 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYK---------PHKNIV 67
+ + +G PP+ F+ DTGSD+ WV C + C GC K E Q + ++V
Sbjct: 84 YYTKVKLGTPPREFNVQIDTGSDVLWVSCTS-CNGCPKTSELQIQLSFFDPGVSSSASLV 142
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV-- 125
CS+ RC + ++ C PN+ C Y +YGDG + G ++D S +
Sbjct: 143 SCSDRRCYS-NFQTESGCS-PNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINS 200
Query: 126 --PLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQ 179
P FGC Q G L P A G+ GLG+G +S++SQL GL V HC+ +
Sbjct: 201 SAPFVFGCSNLQS--GDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDK 258
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK---------SCGLKD 230
+G G++ LG K P + +TP++ + HY + + +G+ + D
Sbjct: 259 SGGGIMVLGQIKRPDT--VYTPLVPSQ---PHYNVNLQSIAVNGQILPIDPSVFTIATGD 313
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQV 288
T+I D+G + AY Y + + A PI + F+
Sbjct: 314 GTII-DTGTTLAYLPDEAYSPFIQAVAN---------AVSQYGRPITYESYQCFEITAGD 363
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISVST 322
+ F ++LSF +V+ P AYL I S+
Sbjct: 364 VDVFPQVSLSFA---GGASMVLGPRAYLQIFSSS 394
>gi|213998810|gb|ACJ60772.1| nucellin [Hordeum comosum]
Length = 154
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 55/142 (38%), Positives = 79/142 (55%), Gaps = 5/142 (3%)
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVL 185
+ FGCGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL
Sbjct: 9 IAFGCGYKQEEPADSPPSLVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 68
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYF 244
++GD PS GV W PM ++ L +Y G AELL + G +FDS ++Y +
Sbjct: 69 YVGDFNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSDSTYTHV 125
Query: 245 TSRVYQEIVSLIMRDLIGTPLK 266
+++Y EIVS + L + L+
Sbjct: 126 PAQIYNEIVSKVRGTLSESSLE 147
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 77/264 (29%), Positives = 117/264 (44%), Gaps = 39/264 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKNI-- 66
+ + +G P K + DTGSD+ WV C + C + P K Y P +
Sbjct: 33 YYTEIGIGTPTKRYYVQVDTGSDILWVNCIS----CDRCPRKSGLGLELTLYDPKDSSTG 88
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV-- 122
V C CAA + P C + C+Y + YGDG S+ G V+DL +G
Sbjct: 89 SKVSCDQGFCAATYGGLLPGCT-TSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQT 147
Query: 123 --FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N +TFGCG Q S G++G G+ S++SQL G ++ + HC+
Sbjct: 148 RPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTI 207
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG------PAELLYSGKSCGLK 229
NG G+ +G+ P V TP++ N + +LK +G P+ + +G+ G
Sbjct: 208 NGGGIFAIGNVVQPK--VKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKG-- 263
Query: 230 DLTLIFDSGASYAYFTSRVYQEIV 253
I DSG + Y VY+EI+
Sbjct: 264 ---TIIDSGTTLTYLPEIVYKEIM 284
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/327 (27%), Positives = 137/327 (41%), Gaps = 50/327 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKNI-- 66
+ + +G P K + DTGSD+ WV C C + P K Y P +
Sbjct: 89 YYTEIGIGTPTKRYYVQVDTGSDILWVNC----ISCDRCPRKSGLGLELTLYDPKDSSTG 144
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV-- 122
V C CAA + P C + C+Y + YGDG S+ G V+DL +G
Sbjct: 145 SKVSCDQGFCAATYGGLLPGCT-TSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQT 203
Query: 123 --FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N +TFGCG Q S G++G G+ S++SQL G ++ + HC+
Sbjct: 204 RPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTI 263
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG------PAELLYSGKSCGLK 229
NG G+ +G+ P V TP++ N + +LK +G P+ + +G+ G
Sbjct: 264 NGGGIFAIGNVVQPK--VKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKG-- 319
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
I DSG + Y VY+E IM + + + +C F+ +G+V
Sbjct: 320 ---TIIDSGTTLTYLPEIVYKE----IMLAVFAKHKDITFHNVQEFLC----FQYVGRVD 368
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYL 316
+ F + F N + L V P Y
Sbjct: 369 DDFPKITFHF---ENDLPLNVYPHDYF 392
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 88/316 (27%), Positives = 138/316 (43%), Gaps = 38/316 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C K + +++P ++ V C N
Sbjct: 87 YYTTRLWIGTPPQEFALIVDTGSTVTYVPC-SDCEHCGKHQDPRFQPDESSTYHPVKC-N 144
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFG 130
C C H C YE Y + SS G L D+ + F N S V FG
Sbjct: 145 MDC---------NCDHDGVNCVYERRYAEMSSSSGVLGEDI--ISFGNQSEVVPQRAVFG 193
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
C G L G++GLGRG++SIV QL + +I + C G + +G G
Sbjct: 194 C--ENVETGDLYSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGG-----MHVGGG 246
Query: 191 KVPSSGVAWTP-MLQNSAD---LKHYILGPAELLYSGKSCGLKDLTL------IFDSGAS 240
+ G+ P M+ + +D +Y + E+ +GK L T + DSG +
Sbjct: 247 AMVLGGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDSGTT 306
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
YAY + I++ PD IC+ G + + Q+++ F + + F+
Sbjct: 307 YAYLPEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEVDMVFS 366
Query: 301 NRRNSVRLVVPPEAYL 316
N + +L + PE YL
Sbjct: 367 NGQ---KLSLTPENYL 379
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 131/297 (44%), Gaps = 25/297 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT-GCTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P + F FDTGSDLTWVQC PCT C + E + P K+ VPC
Sbjct: 126 YVVTIGIGTPARNFTVLFDTGSDLTWVQCK-PCTDSCYQQQEPLFDPSKSSTYVDVPCGT 184
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P+C + C C+Y ++YGD + G L + F L S V FGC
Sbjct: 185 PQC-KIGGGQDLTCG--GTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAAGV--VFGC 239
Query: 132 G--YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR--GVLFL 187
Y+ G AG+LGLGRG SI+SQ R G +V +C+ G G L +
Sbjct: 240 SHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRR-GNSGDVFSYCLPPRGSSAGYLTI 298
Query: 188 GDGKVPSSGVAWTPMLQNSADLKH-YILGPAELLYSGKSCGLKD----LTLIFDSGASYA 242
G P S +++TP++ +++ L Y++ + SG + + + + DSG
Sbjct: 299 GAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYIGTVIDSGTVIT 358
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
+ + Y + R + G + ++L C+ G P+AL F
Sbjct: 359 HMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCY----DVTGHDVVTAPPVALEF 411
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/319 (27%), Positives = 141/319 (44%), Gaps = 44/319 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F D+GS +T+V C A C C + +++P + V C N
Sbjct: 87 YYTTRLHIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQPDLSSTYSPVKC-N 144
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
C C +QC YE +Y + SS G L D+ + F S FG
Sbjct: 145 VDCT---------CDSDKNQCTYERQYAEMSSSSGVLGEDI--VSFGTESELKPQRAVFG 193
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C G L G++GLGRG++SI+ QL + G+I + C G G G + LG
Sbjct: 194 C--ENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLG 251
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYA 242
P G+ +T N+ +Y + E+ +GK+ + + DSG +YA
Sbjct: 252 AMPAP-PGMIYT--HSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYA 308
Query: 243 YFTSRVYQEIVSLIMRDLIGT---PLK--LAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
Y + + + +D + + PLK PD IC+ G + + Q++E F + +
Sbjct: 309 YLPEQAF-----VAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKVDM 363
Query: 298 SFTNRRNSVRLVVPPEAYL 316
F N + +L + PE YL
Sbjct: 364 VFGNGQ---KLSLSPENYL 379
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 75/258 (29%), Positives = 115/258 (44%), Gaps = 36/258 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----PEKQYKPHKNI----V 67
+ +++G PP+ F D DTGS++ WV+C APCTGC P + P K+ +
Sbjct: 41 YYTRISLGTPPQQFYVDVDTGSNVAWVKC-APCTGCEHSGDVPVPMSTFDPRKSTTKISI 99
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLF-----PLRFSNGSV 122
C++ C L+ +C C Y + YGDG S+ G + D+F P S
Sbjct: 100 SCTDAECGVLN--KKLQCSPERLSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSDNSTAKS 157
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN-- 180
L FGCG Q + G+LG G +S+ +QL + + N+ HC+ +
Sbjct: 158 GTARLVFGCGGTQTGSWSVD-----GLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVS 212
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK---DLT----L 233
GRG L +G + P + +TPM+ HY + + SG++ DL +
Sbjct: 213 GRGSLVIGTIREPD--LVYTPMVFGE---DHYNVQLLNIGISGRNVTTPASFDLEYTGGV 267
Query: 234 IFDSGASYAYFTSRVYQE 251
I DSG + Y Y E
Sbjct: 268 IIDSGTTLTYLVQPAYDE 285
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 137/317 (43%), Gaps = 40/317 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C + +++P + V C N
Sbjct: 88 YYTTRLWIGSPPQEFALIVDTGSTVTYVPC-SNCVQCGNHQDPRFQPELSSTYQPVKC-N 145
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP--LTF 129
C C QC YE Y + +S G L D+ + F S VP F
Sbjct: 146 ADC---------NCDENGVQCTYERRYAEMSTSSGVLAEDV--MSFGKESEL-VPQRAVF 193
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD 189
GC G L G++GLGRG +S++ QL G++ N C G + +G
Sbjct: 194 GC--ETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGG-----MDVGG 246
Query: 190 GKVPSSGVAWTP-MLQNSADLK---HYILGPAELLYSGKSCGLKDLTL------IFDSGA 239
G + G++ P M+ + +D +Y + E+ +GK L T I DSG
Sbjct: 247 GAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGT 306
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
+YAYF + Y IM+ + PD IC+ G + + ++ + F + + F
Sbjct: 307 TYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVF 366
Query: 300 TNRRNSVRLVVPPEAYL 316
N + ++ + PE YL
Sbjct: 367 ANGQ---KISLSPENYL 380
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 75/264 (28%), Positives = 112/264 (42%), Gaps = 39/264 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKN--- 65
+ + +G PPK F DTGSD+ WV C C + P K Y P +
Sbjct: 88 YYTEVRLGTPPKRFYVQVDTGSDILWVNC----ITCDQCPHKSGLGLDLTLYDPKASSTG 143
Query: 66 -IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV-- 122
V C CA P+C N C+Y + YGDG S++G+ V D G
Sbjct: 144 STVMCDQGFCADTFGGRLPKCS-ANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGDGQT 202
Query: 123 --FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N + FGCG Q S G+LG G S++SQL G ++ + HC+
Sbjct: 203 QPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLDTI 262
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG------PAELLYSGKSCGLK 229
G G+ +GD P V TP++ + + +LK +G PA++ G+ G
Sbjct: 263 KGGGIFAIGDVVQPK--VKTTPLVADKPHYNVNLKTIDVGGTTLELPADIFKPGEKRG-- 318
Query: 230 DLTLIFDSGASYAYFTSRVYQEIV 253
I DSG + Y V+++++
Sbjct: 319 ---TIIDSGTTLTYLPELVFKKVM 339
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 85/263 (32%), Positives = 124/263 (47%), Gaps = 33/263 (12%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---- 66
F YFA+ + VG P DTGSDL W+QC +PC C + + P ++
Sbjct: 81 FESGEYFAL-VGVGTPSTKAMLVIDTGSDLVWLQC-SPCRRCYAQRGQVFDPRRSSTYRR 138
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
VPCS+P+C AL +P C Y + YGDG SS G L TD L F+N + N
Sbjct: 139 VPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATD--KLAFANDTYVN-N 195
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCIG-QNGRGV 184
+T GCG + N G AG+LG+ RG+ISI +Q+ YG +V +C+G + R
Sbjct: 196 VTLGCG--RDNEGLFD--SAAGLLGVARGKISISTQVAPAYG---SVFEYCLGDRTSRST 248
Query: 185 L--FLGDGKVPS-SGVAWTPMLQNS-------ADLKHYILGPAELL-YSGKSCGLKDLT- 232
+L G+ P A+T +L N D+ + +G + +S S L T
Sbjct: 249 RSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATG 308
Query: 233 ---LIFDSGASYAYFTSRVYQEI 252
++ DSG + + F Y +
Sbjct: 309 RGGVVVDSGTAISRFARDAYAAL 331
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 86/276 (31%), Positives = 120/276 (43%), Gaps = 28/276 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + F FDTGSDLTW QC+ C E + P K+ + CS+P
Sbjct: 138 YVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSP 197
Query: 73 RCAALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L N P C C Y I+YGD S+G D L ++ VFN L FG
Sbjct: 198 TCDELKSGTGNSPSCSAST--CVYGIQYGDQSYSVGFFAQD--KLALTSTDVFNNFL-FG 252
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI--GQNGRGVLFL 187
CG Q+N G AG++GLGR +S+VSQ ++YG + +C+ + G L
Sbjct: 253 CG--QNNRGLF--VGVAGLIGLGRNALSLVSQTAQKYG---KLFSYCLPSTSSSTGYLTF 305
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCG-----LKDLTLIFDSGASYA 242
G G S V +TP L NS Y L + G+ I DSG +
Sbjct: 306 GSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFSTAGTIIDSGTVIS 365
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
Y ++ + + + P K AP L C+
Sbjct: 366 RLPPTAYSDLRASFQQQMSKYP-KAAP-ASILDTCY 399
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 137/317 (43%), Gaps = 40/317 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C + +++P + V C N
Sbjct: 88 YYTTRLWIGSPPQEFALIVDTGSTVTYVPC-SNCVQCGNHQDPRFQPELSSTYQPVKC-N 145
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP--LTF 129
C C QC YE Y + +S G L D+ + F S VP F
Sbjct: 146 ADC---------NCDENGVQCTYERRYAEMSTSSGVLAEDV--MSFGKESEL-VPQRAVF 193
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD 189
GC G L G++GLGRG +S++ QL G++ N C G + +G
Sbjct: 194 GC--ETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGG-----MDVGG 246
Query: 190 GKVPSSGVAWTP-MLQNSADLK---HYILGPAELLYSGKSCGLKDLTL------IFDSGA 239
G + G++ P M+ + +D +Y + E+ +GK L T I DSG
Sbjct: 247 GAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGT 306
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
+YAYF + Y IM+ + PD IC+ G + + ++ + F + + F
Sbjct: 307 TYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVF 366
Query: 300 TNRRNSVRLVVPPEAYL 316
N + ++ + PE YL
Sbjct: 367 ANGQ---KISLSPENYL 380
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 94/328 (28%), Positives = 140/328 (42%), Gaps = 41/328 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC----------TKPPEKQYKPHKN 65
Y+A + +G P K + DTG+D+ WV C C C T K+ K
Sbjct: 73 YYA-KIGIGTPSKDYYLQVDTGTDMMWVNC-IQCKECPTRSNLGMDLTLYNIKESSSGK- 129
Query: 66 IVPCSNPRCAALHWPNPPRC-KHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV-- 122
+VPC C ++ C ND C Y YGDG S+ G V D+ +G +
Sbjct: 130 LVPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKT 189
Query: 123 --FNVPLTFGCGYNQHNPGPLSPPDTA---GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
N + FGCG Q G LS + G+LG G+ S++SQL G ++ + HC+
Sbjct: 190 ASANGSVIFGCGARQ--SGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCL 247
Query: 178 -GQNGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILGPAELLYSGKSCGLKDLT 232
G NG G+ +G P+ V TP+L + S ++ +G L S + +D
Sbjct: 248 NGVNGGGIFAIGHVVQPT--VNTTPLLPDQPHYSVNMTAIQVGHTFLNLSTDASEQRDSK 305
Query: 233 -LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
I DSG + AY +YQ +V I+ ++ D+ T F+ G V +
Sbjct: 306 GTIIDSGTTLAYLPDGIYQPLVYKILSQQPNLKVQTLHDEYTC-------FQYSGSVDDG 358
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVIS 319
F + F N + L V P YL +S
Sbjct: 359 FPNVTFYF---ENGLSLKVYPHDYLFLS 383
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 90/265 (33%), Positives = 120/265 (45%), Gaps = 39/265 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P + F FDTGSDLTW QC+ PC G C + E + P ++ V C +
Sbjct: 147 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCE-PCVGYCYQQREHIFDPSTSLSYSNVSCDS 205
Query: 72 PRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
P C L N P C + C Y I YGDG SIG + L ++ VFN F
Sbjct: 206 PSCEKLESATGNSPGCS--SSTCLYGIRYGDGSYSIGFFARE--KLSLTSTDVFN-NFQF 260
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI--GQNGRGVLF 186
GCG Q+N G TAG+LGL R +S+VSQ ++YG V +C+ + G L
Sbjct: 261 GCG--QNNRGLFG--GTAGLLGLARNPLSLVSQTAQKYG---KVFSYCLPSSSSSTGYLS 313
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----------IFD 236
G G S V +TP NS Y L G S G + L + I D
Sbjct: 314 FGSGDGDSKAVKFTPSEVNSDYPSFYFLDMV-----GISVGERKLPIPKSVFSTAGTIID 368
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLI 261
SG + VY V + R+L+
Sbjct: 369 SGTVISRLPPTVYSS-VQKVFRELM 392
>gi|213998814|gb|ACJ60774.1| nucellin [Hordeum cf. pusillum GP-2003]
Length = 142
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 54/138 (39%), Positives = 78/138 (56%), Gaps = 5/138 (3%)
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVLFLGD 189
CGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL++GD
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGD 60
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYFTSRV 248
PS GV W PM ++ L +Y G AELL + G +FDSG++Y + +++
Sbjct: 61 FNPPSRGVTWVPMKES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPAQI 117
Query: 249 YQEIVSLIMRDLIGTPLK 266
Y EIVS ++ L + L+
Sbjct: 118 YNEIVSKVIGTLSESSLE 135
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 87/324 (26%), Positives = 140/324 (43%), Gaps = 47/324 (14%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC-----TKPPEKQYKP----HKNIVPCSN 71
+ +G PPK F DTGSD+ WV C++ C GC + P + P ++V CS+
Sbjct: 87 VQLGNPPKDFYVQIDTGSDVLWVSCNS-CNGCPATSGLQIPLNFFDPGSSTTASLVSCSD 145
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRF----SNGSVFNVPL 127
CA + C ++QC Y +YGDG + G V D+ L S S + +
Sbjct: 146 QICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSNSSASV 205
Query: 128 TFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRG 183
FGC +Q G L+ D A G+ G G+ +S++SQL G+ V HC+ +G G
Sbjct: 206 VFGCSTSQ--TGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDDSGGG 263
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--------IF 235
+L LG+ P+ V +TP++ + HY L + +G+ + I
Sbjct: 264 ILVLGEIVEPN--VVYTPLVPSQ---PHYNLNLQSISVNGQVLPISPAVFATSSSQGTII 318
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP--FKALGQVTEYFK 293
DSG + AY Y V + + T + +G + V++ F
Sbjct: 319 DSGTTLAYLAEEAYNAFVVAVTNIV---------SQSTQSVVLKGNRCYVTSSSVSDIFP 369
Query: 294 PLALSFTNRRNSVRLVVPPEAYLV 317
++L+F LV+ + YL+
Sbjct: 370 QVSLNFA---GGASLVLGAQDYLI 390
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 89/319 (27%), Positives = 141/319 (44%), Gaps = 44/319 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F D+GS +T+V C A C C + +++P + V C N
Sbjct: 87 YYTTRLHIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQPDLSSTYSPVKC-N 144
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
C C +QC YE +Y + SS G L D+ + F S FG
Sbjct: 145 VDCT---------CDSDKNQCTYERQYAEMSSSSGVLGEDI--VSFGTESELKPQRAVFG 193
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C G L G++GLGRG++SI+ QL + G+I + C G G G + LG
Sbjct: 194 C--ENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLG 251
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYA 242
P G+ +T N+ +Y + E+ +GK+ + + DSG +YA
Sbjct: 252 AMPAP-PGMIYT--HSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYA 308
Query: 243 YFTSRVYQEIVSLIMRDLIGT---PLK--LAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
Y + + + +D + + PLK PD IC+ G + + Q++E F + +
Sbjct: 309 YLPEQAF-----VAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKVDM 363
Query: 298 SFTNRRNSVRLVVPPEAYL 316
F N + +L + PE YL
Sbjct: 364 VFGNGQ---KLSLSPENYL 379
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 94/328 (28%), Positives = 135/328 (41%), Gaps = 59/328 (17%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPH----KNIVPCS 70
+G P K F DTGSD+ WV C GCT P+K Y P+ N VPC
Sbjct: 78 LGSPAKEFYVQVDTGSDILWVNC----AGCTACPKKSGLGMDLTLYDPNGSKTSNAVPCG 133
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP---- 126
+ C + CK + C Y I YGDG ++ G+ V D +G++ P
Sbjct: 134 DGFCTDTYSGPISGCKQ-DMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSS 192
Query: 127 LTFGCGYNQHNPGPLSP-PDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+ FGCG Q G LS D A G++G G+ S++SQL G ++ + HC+ + G
Sbjct: 193 VIFGCGAKQ--SGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDSHHGG 250
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHY-------------ILGPAELLYSGKSCGLKD 230
+F G+V TP++ A HY IL P L SG G
Sbjct: 251 GIF-SIGQVMEPKFNTTPLVPRMA---HYNVILKDMDVDGEPILLPLYLFDSGSGRG--- 303
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG + AY +Y +++ ++ G L + D T F ++ E
Sbjct: 304 --TIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQFTC-------FHYSDKLDE 354
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYLVI 318
F + F + L V P YL +
Sbjct: 355 GFPVVKFHF----EGLSLTVHPHDYLFL 378
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 86/316 (27%), Positives = 132/316 (41%), Gaps = 38/316 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F D+GS +T+V C A C C + +++P + V C N
Sbjct: 88 YYTTRLYIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQPDLSSSYSPVKC-N 145
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C C QC YE +Y + SS G L D+ + F S FG
Sbjct: 146 VDCT---------CDSDKKQCTYERQYAEMSSSSGVLGEDI--VSFGRESELKAQRAVFG 194
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C G L G++GLGRG++SI+ QL E G+I + C G G G + LG
Sbjct: 195 C--ENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGAMVLG 252
Query: 189 DGKVPSSGVAWTPMLQNSADLK--HYILGPAELLYSGKSCGLKDLTL------IFDSGAS 240
PS V S L+ +Y + E+ +GK+ + + DSG +
Sbjct: 253 GVPTPSDMV-----FSRSDPLRSPYYNIELKEIHVAGKALRVDSRIFDSKHGTVLDSGTT 307
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
YAY + + + + PD IC+ G + + ++ E F + + F
Sbjct: 308 YAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSYKDICFAGARRNVSKLHEVFPDVDMVFG 367
Query: 301 NRRNSVRLVVPPEAYL 316
N + +L + PE YL
Sbjct: 368 NGQ---KLSLTPENYL 380
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 92/325 (28%), Positives = 134/325 (41%), Gaps = 43/325 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQC----DAPCTGCTKPPEKQYKPHKNI----V 67
Y+A + +G P K + DTGSD+ WV C + P T Y ++ V
Sbjct: 86 YYA-KVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLV 144
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV----F 123
PC C ++ C N C Y YGDG S+ G V D+ +G +
Sbjct: 145 PCDEEFCYEVNGGPLSGCT-ANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTSS 203
Query: 124 NVPLTFGCGYNQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNG 181
N + FGCG Q + GP S G+LG G+ S++SQL ++ + HC+ G NG
Sbjct: 204 NGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCLDGING 263
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADL----------KHYILGPAELLYSGKSCGLKDL 231
G+ +G P V TP++ N + ++ P E +G G
Sbjct: 264 GGIFAIGHVVQPK--VNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEFEAGDRKG---- 317
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
I DSG + AY VY+ +VS I+ + + D+ T F+ G V +
Sbjct: 318 -AIIDSGTTLAYLPEIVYEPLVSKIISQQPDLKVHIVRDEYTC-------FQYSGSVDDG 369
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYL 316
F + F NSV L V P YL
Sbjct: 370 FPNVTFHF---ENSVFLKVHPHEYL 391
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 82/272 (30%), Positives = 118/272 (43%), Gaps = 43/272 (15%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP-----------PEKQYKPHK 64
YF + +G P K + DTGSD+ WV C +PCTGC P+ +
Sbjct: 89 YF-TRVKLGNPAKEYFVQIDTGSDILWVAC-SPCTGCPTSSGLNIQLEFFNPDSSSTSSR 146
Query: 65 NIVPCSNPRCAALHWPNPPRCKH---PNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSN 119
+PCS+ RC A C+ P+ C Y YGDG + G V+D F N
Sbjct: 147 --IPCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGN 204
Query: 120 GSVFN--VPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGH 175
N + FGC +Q G L D A G+ G G+ ++S+VSQL G+ H
Sbjct: 205 EQTANSSASVVFGCSNSQS--GDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSH 262
Query: 176 CI--GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL 233
C+ NG G+L LG+ P G+ +TP++ + HY L + SG+ + D +L
Sbjct: 263 CLKGSDNGGGILVLGEIVEP--GLVFTPLVPSQ---PHYNLNLESIAVSGQKLPI-DSSL 316
Query: 234 ---------IFDSGASYAYFTSRVYQEIVSLI 256
I DSG + Y Y ++ I
Sbjct: 317 FATSNTQGTIVDSGTTLVYLVDGAYDPFINAI 348
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 97/335 (28%), Positives = 137/335 (40%), Gaps = 57/335 (17%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK------------QYKPH 63
YFA + +G P K + DTGSD+ WV C GC + P K +
Sbjct: 78 YFA-KIGIGTPSKDYYVQVDTGSDILWVNC----AGCDRCPTKSDLGVDLTLYDMKASTT 132
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ V C + C+ P P CK P QC Y + YGDG S+ G V D +G+
Sbjct: 133 SDAVGCDDNFCSLYDGP-LPGCK-PGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQ 190
Query: 124 NVP----LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
P + FGCG Q S G+LG G+ S++SQL G ++ V HC+
Sbjct: 191 TTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDN 250
Query: 180 -NGRGVLFLGDGKVPS------SGVAWTPMLQNSAD----LKHYILG------PAELLYS 222
+G G+ +G+ P + V + + A +K +G P++ S
Sbjct: 251 VDGGGIFAIGEVVEPKVRFLLMNSVMIVVLFLSRAHYNVVMKEIEVGGDPLDVPSDAFES 310
Query: 223 GKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTP-LKLAPDDKTLPICWRGP 281
G G I DSG + AYF VY V LI + L P L+L ++
Sbjct: 311 GDRKG-----TIIDSGTTLAYFPQEVY---VPLIEKILSQQPDLRLHTVEQAFTC----- 357
Query: 282 FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
F G V + F + L F S+ L V P YL
Sbjct: 358 FDYTGNVDDGFPTVTLHFD---KSISLTVYPHEYL 389
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 80/296 (27%), Positives = 125/296 (42%), Gaps = 47/296 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP------EKQYKPHKNIVPCS 70
+++TVG PP+ DTGS+L+W+ C+ T P Y P + CS
Sbjct: 66 LTISITVGTPPQNMSMVIDTGSELSWLHCNTNTTATIPYPFFNPNISSSYTP----ISCS 121
Query: 71 NPRCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
+P C +P P C N+ C + Y D SS G L +D F GS FN +
Sbjct: 122 SPTCTTRTRDFPIPASCDS-NNLCHATLSYADASSSEGNLASDTFGF----GSSFNPGIV 176
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFL 187
FGC + ++ S +T G++G+ G +S+VSQL+ +CI G + G+L L
Sbjct: 177 FGCMNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIPKF-----SYCISGSDFSGILLL 231
Query: 188 GDGKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------------- 233
G+ G + +TP++Q S L ++ + G K L +
Sbjct: 232 GESNFSWGGSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHTGAG 291
Query: 234 --IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT------LPICWRGP 281
+FD G ++Y VY + + GT L DD + +C+R P
Sbjct: 292 QTMFDLGTQFSYLLGPVYNALRDEFLNQTNGTLRAL--DDPNFVFQIAMDLCYRVP 345
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 88/314 (28%), Positives = 131/314 (41%), Gaps = 34/314 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C C C K + +++P + + C N
Sbjct: 87 YYTTRLFIGTPPQEFALIVDTGSTVTYVPCST-CEQCGKHQDPRFQPESSSTYKPMQC-N 144
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
P C C QC YE Y + SS G L D+ L F N S FG
Sbjct: 145 PSC---------NCDDEGKQCTYERRYAEMSSSSGLLAEDV--LSFGNESELTPQRAIFG 193
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR--GVLFLG 188
C + G L G++GLGRG +S+V QL ++ N C G G + LG
Sbjct: 194 CETVE--TGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGAMVLG 251
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYA 242
+ P V SA +Y + EL +GK L + DSG +YA
Sbjct: 252 NIPPPPDMVFAHSDPYRSA---YYNIELKELHVAGKRLKLNPRVFDGKHGTVLDSGTTYA 308
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNR 302
Y + I++++ PD IC+ G + + Q+++ F + + F N
Sbjct: 309 YLPEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVNMVFGNG 368
Query: 303 RNSVRLVVPPEAYL 316
+ +L + PE YL
Sbjct: 369 Q---KLSLSPENYL 379
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 86/316 (27%), Positives = 135/316 (42%), Gaps = 38/316 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F D+GS +T+V C A C C + +++P + V C N
Sbjct: 88 YYTTRLYIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQPDLSSSYSPVKC-N 145
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
C C QC YE +Y + SS G L D+ + F S FG
Sbjct: 146 VDCT---------CDSDKKQCTYERQYAEMSSSSGVLGEDI--VSFGRESELKPQRAVFG 194
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C ++ G L G++GLGRG++SI+ QL E G+I + C G G G + LG
Sbjct: 195 CENSE--TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLG 252
Query: 189 DGKVPSSGVAWTPMLQNSADLK--HYILGPAELLYSGKSCGLKDLTL------IFDSGAS 240
PS V +S L+ +Y + E+ +GK+ + + DSG +
Sbjct: 253 GVPAPSDMV-----FSHSDPLRSPYYNIELKEIHVAGKALRVDSRVFNSKHGTVLDSGTT 307
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
YAY + + + + PD IC+ G + + ++ E F + + F
Sbjct: 308 YAYLPEQAFVAFKDAVTSKVHSLKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDVDMVFG 367
Query: 301 NRRNSVRLVVPPEAYL 316
N + +L + PE YL
Sbjct: 368 NGQ---KLSLTPENYL 380
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 81/299 (27%), Positives = 127/299 (42%), Gaps = 49/299 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKNI----V 67
V+L VG PP+ DTGS+L+W+ C G + ++P + V
Sbjct: 63 LTVSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAV 122
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
PC + +C++ P PP C + QC + Y DG +S GAL TD+F + G +
Sbjct: 123 PCGSTQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGALATDVFAV----GEAPPLRS 178
Query: 128 TFGCGYNQHNPGPLSPPD---TAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-QNGRG 183
FGC ++ S PD TAG+LG+ RG +S V+Q +CI ++ G
Sbjct: 179 AFGCMSTAYD----SSPDGVATAGLLGMNRGTLSFVTQAST-----RRFSYCISDRDDAG 229
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------- 233
VL LG +P + +TP+ Q + L ++ + G G K L +
Sbjct: 230 VLLLGHSDLPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHT 289
Query: 234 -----IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD------KTLPICWRGP 281
+ DSG + + Y + + ++ PL A DD + L C+R P
Sbjct: 290 GAGQTMVDSGTQFTFLLGDAYSALKAEFLKQT--KPLLRALDDPSFAFQEALDTCFRVP 346
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 80/270 (29%), Positives = 120/270 (44%), Gaps = 32/270 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +VG PP DTGSD+ W+QC+ PC C + P K+ +PCS+
Sbjct: 87 YLMTYSVGTPPTKIYGIADTGSDIVWLQCE-PCEQCYNQTTPIFNPSKSSSYKNIPCSSK 145
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C H C N C Y+I YGD S G L D L ++GS + P + GC
Sbjct: 146 LC---HSVRDTSCSDQN-SCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKIVIGC 201
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRGVL 185
G + N G ++G++GLG G +S+++QL I +C+ N +L
Sbjct: 202 GTD--NAGTFGGA-SSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSIL 256
Query: 186 FLGDGKVPS-SGVAWTPMLQNS-----ADLKHYILGPAELLYSGKSCGLKDL-TLIFDSG 238
GD V S GV TP+++ L+ + +G + + G S G D +I DSG
Sbjct: 257 SFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSG 316
Query: 239 ASYAYFTSRVYQE----IVSLIMRDLIGTP 264
+ S VY +V L+ D + P
Sbjct: 317 TTLTLIPSDVYTNLESAVVDLVKLDRVDDP 346
>gi|213998802|gb|ACJ60768.1| nucellin [Hordeum murinum subsp. glaucum]
Length = 142
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 53/138 (38%), Positives = 76/138 (55%), Gaps = 5/138 (3%)
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVLFLGD 189
CGY Q P P G+LGLG G+ QL+ +I+ N+IGHC+ G+GVL++GD
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAVQLKGQKMIKENIIGHCLSSKGKGVLYVGD 60
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYFTSRV 248
PS GV W PM ++ L +Y G AELL + G +FDSG++Y + + +
Sbjct: 61 FNPPSRGVTWVPMRES---LFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPAHI 117
Query: 249 YQEIVSLIMRDLIGTPLK 266
Y EIVS + L + L+
Sbjct: 118 YSEIVSKVRGTLSESSLE 135
>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
Length = 632
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 77/267 (28%), Positives = 112/267 (41%), Gaps = 32/267 (11%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP---------------HKN 65
+ +G P F D+GSDL W+ C+ C C Y
Sbjct: 101 IDIGTPSVSFLVALDSGSDLLWIPCN--CVQCAPLSSAYYSSLATKDLNEFDPSASTTSK 158
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYG-DGGSSIGALVTDLFPLRFSNGSVFN 124
+ PCS+ C + P C+ P +QC Y + Y + SS G LV D+ L +S + +
Sbjct: 159 VFPCSHKLCES-----APACESPKEQCPYTVTYASENTSSSGLLVEDVLHLAYSANASSS 213
Query: 125 VP--LTFGCGYNQHNPGPLS-PPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
V + GCG Q PD GV+GLG G IS+ S L + GL+RN C +
Sbjct: 214 VKARVVVGCGEKQSGEFLKGIAPD--GVMGLGPGEISVPSFLAKAGLMRNSFSMCFDEED 271
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGAS 240
G ++ GD V S T L + Y +G E+ G SC T + DSG S
Sbjct: 272 SGRIYFGD--VGPSTQQSTRFLPYKNEFVAYFVG-VEVCCVGNSCLKQSSFTTLIDSGQS 328
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKL 267
+ + +Y+E+ I + T K+
Sbjct: 329 FTFLPEEIYREVALEIDSHINATVKKI 355
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 90/320 (28%), Positives = 132/320 (41%), Gaps = 39/320 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ------YKPHKN---- 65
Y+ + +G P + F DTGS +T+V PC+ CT Q +KP +
Sbjct: 98 YYTSRVFIGTPAQEFALIVDTGSTVTYV----PCSSCTHCGHHQACFDPRFKPDNSSSYQ 153
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN- 124
V C++P C C QC YE Y + SS G L DL L F NGS
Sbjct: 154 TVSCNSPDCIT------KMCDARVHQCKYERVYAEMSSSKGVLGKDL--LGFGNGSRLQP 205
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGR 182
PL FGC G L G++GLGRG +SIV QL G + + C G G
Sbjct: 206 HPLLFGC--ETAETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGG 263
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD------LTLIFD 236
G + LG P + + N ++ +Y L +E+ G S + L + D
Sbjct: 264 GSMVLG-AIPPPPAMVFAKSDPNRSN--YYNLELSEIQVQGVSLNVPSEVFNGRLGTVLD 320
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
SG +YAY + + I + L PD +C+ G + ++F P+
Sbjct: 321 SGTTYAYLPDKAFDAFKDAITQQLGSLQAVPGPDPSYPDVCFAGAGSDSKALGKHFPPVD 380
Query: 297 LSFTNRRNSVRLVVPPEAYL 316
F+ + ++ + PE YL
Sbjct: 381 FVFSGNQ---KVFLAPENYL 397
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 85/314 (27%), Positives = 131/314 (41%), Gaps = 34/314 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G P + F D+GS +T+V C A C C + +++P + V C N
Sbjct: 90 YYTTRLYIGTPSQEFALIVDSGSTVTYVPC-ATCEQCGNHQDPRFQPDLSSTYSPVKC-N 147
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
C C + QC YE +Y + SS G L D+ + F S FG
Sbjct: 148 VDCT---------CDNERSQCTYERQYAEMSSSSGVLGEDI--MSFGKESELKPQRAVFG 196
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C + G L G++GLGRG++SI+ QL E G+I + C G G G + LG
Sbjct: 197 CENTE--TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLG 254
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYA 242
P V N +Y + E+ +GK+ L + DSG +YA
Sbjct: 255 GMPAPPDMVFSH---SNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYA 311
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNR 302
Y + + + + PD IC+ G + + Q++E F + + F N
Sbjct: 312 YLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNG 371
Query: 303 RNSVRLVVPPEAYL 316
+ +L + PE YL
Sbjct: 372 Q---KLSLSPENYL 382
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 77/266 (28%), Positives = 114/266 (42%), Gaps = 34/266 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----------- 65
+ + +G PPK F+ DTGSD+ WV C+ C+ C P Q N
Sbjct: 78 YYTKVKMGTPPKEFNVQIDTGSDILWVNCNT-CSNC--PQSSQLGIELNFFDTVGSSTAA 134
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSVF 123
++PCS+P C + C +QC Y +YGDG + G V+D F L
Sbjct: 135 LIPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAV 194
Query: 124 NVPLT--FGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
N T FGC +Q G L+ D A G+ G G G +S+VSQL G+ V HC+
Sbjct: 195 NSSATIVFGCSISQS--GDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKG 252
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK---------SCGLKD 230
+G G L G++ + ++P++ + HY L + +G+ S
Sbjct: 253 DGDGGGVLVLGEILEPSIVYSPLVPSQ---PHYNLNLQSIAVNGQLLPINPAVFSISNNR 309
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLI 256
I D G + AY Y +V+ I
Sbjct: 310 GGTIVDCGTTLAYLIQEAYDPLVTAI 335
>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 533
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 77/251 (30%), Positives = 109/251 (43%), Gaps = 28/251 (11%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK----PPEKQ-----YKPH----KN 65
N+++G P + DTGSDL W+ CD +GC + P +Q Y+P+
Sbjct: 115 ANVSIGTPSLSYLVALDTGSDLFWLPCDCTNSGCVQGLQFPSGEQIDFNIYRPNASSTSQ 174
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGS--V 122
+PC+N C+ RC C Y+++Y +G SS G LV DL L +
Sbjct: 175 TIPCNNTLCS-----RQSRCPSAQSTCPYQVQYLSNGTSSTGVLVEDLLHLTTDDAQSRA 229
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
+ + FGCG Q L G+ GLG IS+ S L G N C G++G
Sbjct: 230 LDAKIIFGCGRVQTG-SFLDGAAPNGLFGLGMTNISVPSTLAREGYTSNSFSMCFGRDGI 288
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLK-HYILGPAELLYSGKSCGLKDLTLIFDSGASY 241
G + GD SSG TP N L Y + ++ G+ L + + IFDSG S+
Sbjct: 289 GRISFGD--TGSSGQGETPF--NLRQLHPTYNVSITKINVGGRDADL-EFSAIFDSGTSF 343
Query: 242 AYFTSRVYQEI 252
Y Y I
Sbjct: 344 TYLNDPAYTLI 354
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 81/259 (31%), Positives = 115/259 (44%), Gaps = 45/259 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
+ +NL++G P + F DTGSDL W QC PCT C + P + +PCS+
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPCSSQ 153
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL P C N+ C Y YGDG + G++ T+ L F + S+ N+ TFGCG
Sbjct: 154 LCQALQ---SPTCS--NNSCQYTYGYGDGSETQGSMGTE--TLTFGSVSIPNI--TFGCG 204
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLGD 189
N G + AG++G+GRG +S+ SQL +C IG + L LG
Sbjct: 205 ENNQGFG---QGNGAGLVGMGRGPLSLPSQLD-----VTKFSYCMTPIGSSNSSTLLLGS 256
Query: 190 -GKVPSSGVAWTPMLQNSA-------DLKHYILGPAEL--------LYSGKSCGLKDLTL 233
++G T ++Q+S L +G L L S G +
Sbjct: 257 LANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTG----GI 312
Query: 234 IFDSGASYAYFTSRVYQEI 252
I DSG + YF YQ +
Sbjct: 313 IIDSGTTLTYFVDNAYQAV 331
>gi|213998832|gb|ACJ60783.1| nucellin [Hordeum vulgare subsp. spontaneum]
Length = 127
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 75/128 (58%), Gaps = 5/128 (3%)
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVLFLGD 189
CGY Q P P G+LGLG G+ + +QL+ + +I+ NVIGHC+ G+GVL++GD
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIGHCLSSKGKGVLYVGD 60
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYFTSRV 248
P+ GV W PM ++ L +Y G AE+ + G +FDSG++Y + +++
Sbjct: 61 FNPPTRGVTWVPMRES---LFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTHVPAQI 117
Query: 249 YQEIVSLI 256
Y EIVS +
Sbjct: 118 YNEIVSKV 125
>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 529
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 96/314 (30%), Positives = 135/314 (42%), Gaps = 35/314 (11%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA-PCTGCTKPPEKQ-----YKPHKNI- 66
F ++AV + +G P F DTGSDL WV CD C + P Y P K+
Sbjct: 106 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCDCLKCAPLSSPDYGNLKFDVYSPRKSST 164
Query: 67 ---VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNG-- 120
VPCS+ C C ++ C Y+IEY D SS G LV D+ L +G
Sbjct: 165 SRKVPCSSNMCDL-----QTECSAASNSCPYKIEYLSDNTSSKGVLVEDVMYLATESGHS 219
Query: 121 SVFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
+ P+TFGCG Q G +P G+LGLG S+ S L G+ N C G
Sbjct: 220 KITQAPITFGCGQVQTGSFLGSAAP---NGLLGLGMDSKSVPSLLASQGVAANSFSMCFG 276
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSG 238
++G G + GD S+ TP L +Y + + GK+ K + + DSG
Sbjct: 277 EDGHGRINFGD--TGSADQLETP-LNIYKHNPYYNISIVGAMAGGKTFSTK-FSAVVDSG 332
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
S+ + +Y EI S + + K P D +LP + + G V+ P +S
Sbjct: 333 TSFTALSDPMYTEITSAFDKQV---KEKRNPADSSLPFEYCYTISSKGAVS----PPNIS 385
Query: 299 FTNRRNSVRLVVPP 312
T + SV V P
Sbjct: 386 LTAKGGSVFPVKDP 399
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 93/326 (28%), Positives = 136/326 (41%), Gaps = 36/326 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
F +NL +G PP+ + DTGSDL W QC PCT C P + P K+ +
Sbjct: 100 FLMNLAIGTPPETYSAIMDTGSDLIWTQC-KPCTQCFDQPSPIFDPKKSSSFSKLSCSSQ 158
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
L P +D C+Y YGD S+ G + T+ F F S+ NV FGCG +
Sbjct: 159 LCKALPQ--SSCSDSCEYLYTYGDYSSTQGTMATETF--TFGKVSIPNVG--FGCGEDNE 212
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV---P 193
G +G++GLGRG +S+VSQL+E + I L +G
Sbjct: 213 GDG---FTQGSGLVGLGRGPLSLVSQLKEAKFSYCLTS--IDDTKTSTLLMGSLASVNGT 267
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDSGASYAY 243
S+ + TP++QN Y L + G +K+ T LI DSG + Y
Sbjct: 268 SAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITY 327
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP-LALSFTNR 302
+ ++V +G P+ L +C+ P +E P L L FT
Sbjct: 328 LEESAF-DLVKKEFTSQMGLPVD-NSGATGLELCYNLP----SDTSELEVPKLVLHFTG- 380
Query: 303 RNSVRLVVPPEAYLVISVSTSIIIIA 328
L +P E Y++ S +I +A
Sbjct: 381 ---ADLELPGENYMIADSSMGVICLA 403
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 74/197 (37%), Positives = 98/197 (49%), Gaps = 23/197 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P + F FDTGSDLTW QC+ PC G C + E + P ++ V C +
Sbjct: 89 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCE-PCVGYCYQQREHIFDPSTSLSYSNVSCDS 147
Query: 72 PRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
P C L N P C + C Y I YGDG SIG + L ++ VFN F
Sbjct: 148 PSCEKLESATGNSPGCS--SSTCLYGIRYGDGSYSIGFFARE--KLSLTSTDVFN-NFQF 202
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI--GQNGRGVLF 186
GCG Q+N G TAG+LGL R +S+VSQ ++YG V +C+ + G L
Sbjct: 203 GCG--QNNRGLFG--GTAGLLGLARNPLSLVSQTAQKYG---KVFSYCLPSSSSSTGYLS 255
Query: 187 LGDGKVPSSGVAWTPML 203
G G S V +TP L
Sbjct: 256 FGSGDGDSKAVKFTPRL 272
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 95/335 (28%), Positives = 142/335 (42%), Gaps = 39/335 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +++ +G PP+ F DTGSDL W QC APC C + P ++P K+ +PCS+
Sbjct: 85 YLMDVGIGSPPRYFSAMIDTGSDLIWTQC-APCLLCVEQPTPYFEPAKSTSYASLPCSSA 143
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C AL+ P C + C Y+ YGD SS G L + F +N + VP ++FGC
Sbjct: 144 MCNALY---SPLCFQ--NACVYQAFYGDSASSAGVLANETFTFG-TNSTRVAVPRVSFGC 197
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR----GVLFL 187
G N G L + +G++G GRG +S+VSQL + R L
Sbjct: 198 G--NMNAGTLF--NGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPATSRLYFGAYATL 253
Query: 188 GDGKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGK---------SCGLKDLT--LIF 235
SSG V TP + N A Y L + +G + D T +I
Sbjct: 254 NSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVII 313
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + + Y + + +G P A T C++ P VT +
Sbjct: 314 DSGTTVTFLAQPAYAMVQGAFVA-WVGLPRANATPSDTFDTCFKWPPPPRRMVT--LPEM 370
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYL 330
L F + + +P E Y+V+ T + +A L
Sbjct: 371 VLHF----DGADMELPLENYMVMDGGTGNLCLAML 401
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 100/339 (29%), Positives = 143/339 (42%), Gaps = 52/339 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V+L +G PP + DTGSDL W QC APC C P + ++ +PC +
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCAAQPTPYFDVKRSATYRALPCRSS 147
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL-RFSNGSVFNVPLTFGC 131
RCAAL + P C C Y+ YGD S+ G L + F S+ V ++FGC
Sbjct: 148 RCAAL---SSPSCFK--KMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAANISFGC 202
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR---GVLFLG 188
G N G L+ +++G++G GRG +S+VSQL + + R GV
Sbjct: 203 G--SLNAGELA--NSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSPTPSRLYFGVFANL 258
Query: 189 DGKVPSSG--VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------------- 233
+ SSG V TP + N A Y L G S G K L +
Sbjct: 259 NSTNTSSGSPVQSTPFVINPALPNMYFLS-----VKGISLGTKRLPIDPLVFAINDDGTG 313
Query: 234 --IFDSGASYAYFTSRVYQEIVSLIMRDLIGT-PLKLAPD-DKTLPICWRGPFKALGQVT 289
I DSG S + Y+ + R L T PL D D L C++ P VT
Sbjct: 314 GVIIDSGTSITWLQQDAYEA----VRRGLASTIPLPAMNDTDIGLDTCFQWPPPPNVTVT 369
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIA 328
F + + +PPE Y++I+ +T + +A
Sbjct: 370 ------VPDFVFHFDGANMTLPPENYMLIASTTGYLCLA 402
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 122/276 (44%), Gaps = 55/276 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ ++L VG PP+ DTGSDL W QCD CT C + P+ + P + + C+
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDT-CTACLRQPDPLFSPRMSSSYEPMRCAGQ 156
Query: 73 RCA-ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C LH C P D C Y YGDG +++G T+ F S+G +VPL FGC
Sbjct: 157 LCGDILHHS----CVRP-DTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGC 211
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G N G L+ + +G++G GR +S+VSQL IR +C+ + + L G
Sbjct: 212 G--TMNVGSLN--NASGIVGFGRDPLSLVSQLS----IRR-FSYCLTPYASSRKSTLQFG 262
Query: 189 ---------DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------ 233
D P V TP+LQ++ + Y + ++G + G + L +
Sbjct: 263 SLADVGLYDDATGP---VQTTPILQSAQNPTFYYVA-----FTGVTVGARRLRIPASAFA 314
Query: 234 ---------IFDSGASYAYFTSRVYQEIVSLIMRDL 260
I DSG + F + V E+V L
Sbjct: 315 LRPDGSGGVIIDSGTALTLFPAAVLAEVVRAFRSQL 350
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/316 (26%), Positives = 135/316 (42%), Gaps = 38/316 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F D+GS +T+V C + C C + +++P + V C N
Sbjct: 87 YYTTRLYIGTPPQEFALIVDSGSTVTYVPCSS-CEQCGNHQDPRFQPDLSSSYSPVKC-N 144
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
C C QC YE +Y + SS G L D+ + F S FG
Sbjct: 145 VDCT---------CDSDKKQCTYERQYAEMSSSSGVLGEDI--VSFGRESELKPQHAIFG 193
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
C ++ G L G++GLGRG++SI+ QL E G+I + C G G G + LG
Sbjct: 194 CENSE--TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLG 251
Query: 189 DGKVPSSGVAWTPMLQNSADLK--HYILGPAELLYSGKSCGLKDLTL------IFDSGAS 240
P + NS L+ +Y + E+ +GK+ ++ + DSG +
Sbjct: 252 GMLAPPDMI-----FSNSDPLRSPYYNIELKEIHVAGKALRVESRIFNSKHGTVLDSGTT 306
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFT 300
YAY + + + + PD IC+ G + + ++ E F + + F
Sbjct: 307 YAYLPEQAFVAFKEAVTSKVHSLKKIRGPDPSYKDICFAGAGRNVSKLHEVFPDVDMVFG 366
Query: 301 NRRNSVRLVVPPEAYL 316
N + +L + PE YL
Sbjct: 367 NGQ---KLSLTPENYL 379
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 95/335 (28%), Positives = 142/335 (42%), Gaps = 39/335 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +++ +G PP+ F DTGSDL W QC APC C + P ++P K+ +PCS+
Sbjct: 88 YLMDVGIGSPPRYFSAMIDTGSDLIWTQC-APCLLCVEQPTPYFEPAKSTSYASLPCSSA 146
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C AL+ P C + C Y+ YGD SS G L + F +N + VP ++FGC
Sbjct: 147 MCNALY---SPLCFQ--NACVYQAFYGDSASSAGVLANETFTFG-TNSTRVAVPRVSFGC 200
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR----GVLFL 187
G N G L + +G++G GRG +S+VSQL + R L
Sbjct: 201 G--NMNAGTLF--NGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPATSRLYFGAYATL 256
Query: 188 GDGKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGK---------SCGLKDLT--LIF 235
SSG V TP + N A Y L + +G + D T +I
Sbjct: 257 NSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVII 316
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG + + Y + + +G P A T C++ P VT +
Sbjct: 317 DSGTTVTFLAQPAYAMVQGAFVA-WVGLPRANATPSDTFDTCFKWPPPPRRMVT--LPEM 373
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYL 330
L F + + +P E Y+V+ T + +A L
Sbjct: 374 VLHF----DGADMELPLENYMVMDGGTGNLCLAML 404
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/287 (29%), Positives = 130/287 (45%), Gaps = 40/287 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ +++G P K+F DTGSDL W+QC PC C + + P + + C +
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIWIQC-KPCQACFNQKDPIFDPEGSSSYTTMSCGDT 98
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C +L PR K + CDY YGDG + G L ++ L + G + FGC
Sbjct: 99 LCDSL-----PR-KSCSPDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGC 152
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNGRGVLF 186
G+ N G + D +G++GLGRG +S VSQL + L + +C+ + +F
Sbjct: 153 GH--LNRGSFN--DASGLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDAPSKTSPMF 206
Query: 187 LGD-GKVPSSG----VAWTPMLQNSADLKHYILGPAELLYSGKS----CGLKDLT----- 232
GD SSG A+TPM+ N A Y + ++ +G++ G D+
Sbjct: 207 FGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSG 266
Query: 233 -LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
+IFDSG + YQ IV +R I P K+ L +C+
Sbjct: 267 GMIFDSGTTLTLLPDAPYQ-IVLRALRSKISFP-KIDGSSAGLDLCY 311
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 92/324 (28%), Positives = 129/324 (39%), Gaps = 42/324 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN--IVPC 69
+ + +G PPK + DTGSD+ WV C GC QY P + V C
Sbjct: 85 YYTRIEIGSPPKGYYVQVDTGSDILWVN-GISCDGCPTRSGLGIELTQYDPAGSGTTVGC 143
Query: 70 SNPRCAALHWPN--PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV----F 123
C A + PP C C + I YGDG S+ G VTD +G+
Sbjct: 144 EQEFCVANSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQYNQVSGNGQTTPS 203
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGR 182
NV +TFGCG S G+LG G+ S++SQL +R + HC+ G
Sbjct: 204 NVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCLDTVRGG 263
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILG----------PAELLYSGKSCGLKDLT 232
G+ +G+ P V TP++ N+ + G P SG S G
Sbjct: 264 GIFAIGNVVQPPI-VKTTPLVPNATHYNVNLQGISVGGATLQLPTSTFDSGDSKG----- 317
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
I DSG + AY VY+ +++ + LA + IC F+ G + E F
Sbjct: 318 TIIDSGTTLAYLPREVYRTLLTAVFD----KHPDLAVRNYEDFIC----FQFSGSLDEEF 369
Query: 293 KPLALSFTNRRNSVRLVVPPEAYL 316
+ SF + L V P YL
Sbjct: 370 PVITFSF---EGDLTLNVYPHDYL 390
>gi|213998800|gb|ACJ60767.1| nucellin [Hordeum marinum subsp. marinum]
Length = 142
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 54/138 (39%), Positives = 76/138 (55%), Gaps = 5/138 (3%)
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVLFLGD 189
CGY Q P P G+LGLG G+ +QL+ +I NVIGHC+ G+GVL++G+
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGN 60
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYFTSRV 248
PS GV W PM ++S +Y G AELL + G +FDSG++Y S++
Sbjct: 61 FNPPSRGVTWVPMRESSF---YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTLVPSQI 117
Query: 249 YQEIVSLIMRDLIGTPLK 266
Y EIVS + L + L+
Sbjct: 118 YNEIVSKVRGTLSESSLE 135
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 143/347 (41%), Gaps = 66/347 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F + L++G P + DTGSDL W QC PCT C P + P K+ V CS+
Sbjct: 107 FLMELSIGNPAVKYSAIVDTGSDLIWTQC-KPCTECFDQPTPIFDPEKSSSYSKVGCSSG 165
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL N C D C+Y YGD S+ G L T+ F N S+ + FGCG
Sbjct: 166 LCNALPRSN---CNEDKDACEYLYTYGDYSSTRGLLATETFTFEDEN-SISGIG--FGCG 219
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFLG 188
G +G++GLGRG +S++SQL+E +C+ LF+G
Sbjct: 220 VENEGDG---FSQGSGLVGLGRGPLSLISQLKE-----TKFSYCLTSIEDSEASSSLFIG 271
Query: 189 ---DGKVPSSGVAWTPMLQNSADLKHYILGPA--ELLYSGKSCGLKDLT----------- 232
G V +G + + + L P+ L G + G K L+
Sbjct: 272 SLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAED 331
Query: 233 ----LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK---TLPICWRGPFKAL 285
+I DSG + Y ++ ++++ + + L DD L +C++ P A
Sbjct: 332 GTGGMIIDSGTTITYLEETAFK-----VLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAK 386
Query: 286 G----QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIA 328
++ +FK L +P E Y+V ST ++ +A
Sbjct: 387 NIAVPKMIFHFK-----------GADLELPGENYMVADSSTGVLCLA 422
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 86/320 (26%), Positives = 135/320 (42%), Gaps = 49/320 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
V+L VG PP+ DTGS+L+W+ C TG ++P + VPC +
Sbjct: 61 LTVSLAVGTPPQNVTMVLDTGSELSWLLC---ATGRAAAAAADSFRPRASATFAAVPCGS 117
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
RC++ P PP C + +C + Y DG +S GAL TD+F + G + FGC
Sbjct: 118 ARCSSRDLPAPPSCDAASRRCRVSLSYADGSASDGALATDVFAV----GDAPPLRSAFGC 173
Query: 132 GYNQHNPGPLSPPD---TAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-QNGRGVLFL 187
++ S PD TAG+LG+ RG +S V+Q +CI ++ GVL L
Sbjct: 174 MSAAYD----SSPDAVATAGLLGMNRGALSFVTQAST-----RRFSYCISDRDDAGVLLL 224
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-------------- 233
G +P + +TP+ Q + L ++ + G G K L +
Sbjct: 225 GHSDLPFLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQ 284
Query: 234 -IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD------KTLPICWRGPFKALG 286
+ DSG + + Y + + ++ PL A +D + C+R P K
Sbjct: 285 TMVDSGTQFTFLLGDAYSAVKAEFLKQT--KPLLPALEDPSFAFQEAFDTCFRVP-KGRP 341
Query: 287 QVTEYFKPLALSFTNRRNSV 306
+ P+ L F + SV
Sbjct: 342 PPSARLPPVTLLFNGAQMSV 361
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 76/263 (28%), Positives = 118/263 (44%), Gaps = 29/263 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH---------KNIV 67
+ + +G PP+ F DTGSD+ WV C + C GC + Q + + +++
Sbjct: 77 YYTKVKLGTPPREFYVQIDTGSDVLWVSCGS-CNGCPQTSGLQIQLNYFDPRSSSTSSLI 135
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSVFN- 124
CS+ RC + + C N+QC Y +YGDG + G V+DL F F N
Sbjct: 136 SCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGTLTTNS 195
Query: 125 -VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN-G 181
+ FGC Q S G+ G G+ +S++SQL G+ V HC+ G N G
Sbjct: 196 SASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCLKGDNSG 255
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL--------KDLTL 233
GVL LG+ P+ + ++P++Q+ HY L + +G+ + +
Sbjct: 256 GGVLVLGEIVEPN--IVYSPLVQSQ---PHYNLNLQSISVNGQIVPIAPAVFATSNNRGT 310
Query: 234 IFDSGASYAYFTSRVYQEIVSLI 256
I DSG + AY Y V+ I
Sbjct: 311 IVDSGTTLAYLAEEAYNPFVNAI 333
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 79/272 (29%), Positives = 121/272 (44%), Gaps = 28/272 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT-KPPEKQYKPHKNI----VPCS 70
YF V++ +G PP+ DTGSDLTWV+C A T C+ PP + + C
Sbjct: 83 YF-VSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTHCF 141
Query: 71 NPRCAALHWPNPPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV-PL 127
+ C + PNP C H + C YE Y DG + G + L S+G + +
Sbjct: 142 SSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKLKSI 201
Query: 128 TFGCGYNQHNPGPL--SPPDTAGVLGLGRGRISIVSQL-REYGLIRN--VIGHCIGQNGR 182
FGCG++ P + S +GV+GLGRG IS SQL R +G + ++ + +
Sbjct: 202 AFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSYCLLDYTLSPPPT 261
Query: 183 GVLFLGD----GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC----------GL 228
L +GD K S +++TP+L N Y + + G L
Sbjct: 262 SYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDPSVWSLDEL 321
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDL 260
+ + DSG + + T Y+EI+S R++
Sbjct: 322 GNGGTVIDSGTTLTFLTEPAYREILSAFKREV 353
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/331 (27%), Positives = 132/331 (39%), Gaps = 55/331 (16%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
Y+A + +G P K + DTGSD+ WV C C C + +
Sbjct: 80 YYA-KIGIGTPAKSYYVQVDTGSDIMWVNC-IQCKQCPRRSTLGIELTLYNIDESDSGKL 137
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL---------FPLRF 117
V C + C + CK N C Y YGDG S+ G V D+ +
Sbjct: 138 VSCDDDFCYQISGGPLSGCK-ANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQT 196
Query: 118 SNGSVFNVPLTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+NGSV FGCG Q S + G+LG G+ S++SQL G ++ + HC
Sbjct: 197 ANGSVI-----FGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHC 251
Query: 177 I-GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADL----------KHYILGPAELLYSGKS 225
+ G+NG G+ + G+V V TP++ N + ++ PA+L G
Sbjct: 252 LDGRNGGGIFAI--GRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDR 309
Query: 226 CGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
G I DSG + AY +Y+ +V I + + D F+
Sbjct: 310 KG-----AIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYKC-------FQYS 357
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
G+V E F + F NSV L V P YL
Sbjct: 358 GRVDEGFPNVTFHF---ENSVFLRVYPHDYL 385
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 83/279 (29%), Positives = 123/279 (44%), Gaps = 40/279 (14%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKN----IVPCS 70
+G P + DTGSD WV C GCT P+K Y P+ + +VPC
Sbjct: 81 IGLGPNDYYVQVDTGSDTLWVNC----VGCTTCPKKSGLGMELTLYDPNSSKTSKVVPCD 136
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP---- 126
+ C + + CK + C Y I YGDG ++ G+ + D G + VP
Sbjct: 137 DEFCTSTYDGPISGCKK-DMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTS 195
Query: 127 LTFGCGYNQHNPGPLSPP-DTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGR 182
+ FGCG Q G LS DT+ G++G G+ S++SQL G ++ V HC+ NG
Sbjct: 196 VIFGCGSKQ--SGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCLDTVNGG 253
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLT----LI 234
G+ +G+ P V TP++ A HY + ++ +G L D T I
Sbjct: 254 GIFAIGEVVQPK--VKTTPLVPRMA---HYNVVLKDIEVAGDPIQLPTDIFDSTSGRGTI 308
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT 273
DSG + AY +Y +++ + G L L D T
Sbjct: 309 IDSGTTLAYLPVSIYDQLLEKTLAQRSGMELYLVEDQFT 347
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 83/287 (28%), Positives = 130/287 (45%), Gaps = 40/287 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ +++G P K+F DTGSDL W+QC PC C + + P + + C +
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIWIQC-KPCQACFNQKDPIFDPEGSSSYTTMSCGDT 98
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C +L P + PN CDY YGDG + G L ++ L + G + FGC
Sbjct: 99 LCDSL----PRKSCSPN--CDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGC 152
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNGRGVLF 186
G+ N G + D +G++GLGRG +S VSQL + L + +C+ + +F
Sbjct: 153 GH--LNRGSFN--DASGLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDAPSKTSPMF 206
Query: 187 LGD-GKVPSSG----VAWTPMLQNSADLKHYILGPAELLYSGKS----CGLKDLT----- 232
GD SSG A+TPM+ N A Y + ++ +G++ G D+
Sbjct: 207 FGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSG 266
Query: 233 -LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
+IFDSG + YQ IV +R + P ++ L +C+
Sbjct: 267 GMIFDSGTTLTLLPDAPYQ-IVLRALRSKVSFP-EIDGSSAGLDLCY 311
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/152 (38%), Positives = 81/152 (53%), Gaps = 15/152 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP--HKNIVP--CSNP 72
+ ++L +G PP+ DTGSDL W QC APC C P+ + P + VP CS
Sbjct: 103 YLIDLAIGTPPQPVSALLDTGSDLIWTQC-APCASCLAQPDPLFAPAASSSYVPMRCSGQ 161
Query: 73 RCA-ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C LH C+ P D C Y YGDG +++G T+ F S+G +VPL FGC
Sbjct: 162 LCNDILHHS----CQRP-DTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVPLGFGC 216
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
G N G L+ + +G++G GR +S+VSQL
Sbjct: 217 G--TMNVGSLN--NGSGIVGFGRDPLSLVSQL 244
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 84/292 (28%), Positives = 130/292 (44%), Gaps = 33/292 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNI----V 67
+ V L +G PPK + DTGS L+W+QC C + Y P +K + V
Sbjct: 125 YYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASV 184
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP- 126
CS + A L N P C+ ++ C Y YGD SIG L DL L S +P
Sbjct: 185 ECSRLKAATL---NDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQ----TLPQ 237
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCI---GQNGR 182
T+GCG Q N G AG++GL R ++S+++QL +YG + +C+
Sbjct: 238 FTYGCG--QDNQGLFG--RAAGIIGLARDKLSMLAQLSTKYG---HAFSYCLPTANSGSS 290
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK----SCGLKDLTLIFDSG 238
G FL G + + +TPML +S + Y L + SG+ + + + + DSG
Sbjct: 291 GGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSG 350
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
+Y + ++ ++ T AP L C++G K++ V E
Sbjct: 351 TVITRLPMSMYAALRQAFVK-IMSTKYAKAPAYSILDTCFKGSLKSISAVPE 401
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 85/318 (26%), Positives = 133/318 (41%), Gaps = 32/318 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR-- 73
Y+ L +G P + F D+GS +T+V PC C + Q + NI+ +PR
Sbjct: 91 YYTTRLYIGTPSQEFALIVDSGSTVTYV----PCATCEQCGNHQSE-SPNIIEAHDPRFQ 145
Query: 74 --CAALHWPNPPR----CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VP 126
++ + P C + QC YE +Y + SS G L D+ + F S
Sbjct: 146 PDLSSTYSPVKCNVDCTCDNERSQCTYERQYAEMSSSSGVLGEDI--MSFGKESELKPQR 203
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGV 184
FGC + G L G++GLGRG++SI+ QL E G+I + C G G G
Sbjct: 204 AVFGCENTE--TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT 261
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSG 238
+ LG P V N +Y + E+ +GK+ L + DSG
Sbjct: 262 MVLGGMPAPPDMVFSH---SNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSG 318
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+YAY + + + + PD IC+ G + + Q++E F + +
Sbjct: 319 TTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMV 378
Query: 299 FTNRRNSVRLVVPPEAYL 316
F N + +L + PE YL
Sbjct: 379 FGNGQ---KLSLSPENYL 393
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 89/264 (33%), Positives = 126/264 (47%), Gaps = 44/264 (16%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPC 69
YFAV + VG PP DTGSDL W+QC PC C + Y P H+ I PC
Sbjct: 91 EYFAV-IGVGDPPTHALVVIDTGSDLIWLQC-LPCRRCYRQVTPLYDPRNSKTHRRI-PC 147
Query: 70 SNPRC-AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
++P+C L +P C C Y + YGDG +S G L TD L + V NV T
Sbjct: 148 ASPQCRGVLRYPG---CDARTGGCVYMVVYGDGSASSGDLATDTLVLP-DDTRVHNV--T 201
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCIG------QNG 181
GCG++ N G L+ AG+LG GRG++S +QL YG +V +C+G +N
Sbjct: 202 LGCGHD--NEGLLA--SAAGLLGAGRGQLSFPTQLAPAYG---HVFSYCLGDRMSRARNS 254
Query: 182 RGVLFLGDG-KVPSSGVAWTPMLQNS-------ADLKHYILGPAELL-YSGKSCGLKDLT 232
L G ++PS+ A+TP+ N D+ + +G + +S S L T
Sbjct: 255 SSYLVFGRTPELPST--AFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPAT 312
Query: 233 ----LIFDSGASYAYFTSRVYQEI 252
++ DSG + + FT Y +
Sbjct: 313 GRGGVVVDSGTAISRFTRDAYAAV 336
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 121/276 (43%), Gaps = 55/276 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ ++L VG PP+ DTGSDL W QCD CT C + P+ + P + + C+
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDT-CTACLRQPDPLFSPRMSSSYEPMRCAGQ 156
Query: 73 RCA-ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C LH C P D C Y YGDG +++G T+ F S+G +VPL FGC
Sbjct: 157 LCGDILHHS----CVRP-DTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGC 211
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G N G L+ + +G++G GR +S+VSQL IR +C+ + + L G
Sbjct: 212 G--TMNVGSLN--NASGIVGFGRDPLSLVSQLS----IRR-FSYCLTPYASSRKSTLQFG 262
Query: 189 ---------DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------ 233
D P V TP+LQ++ + Y + ++G + G + L +
Sbjct: 263 SLADVGLYDDATGP---VQTTPILQSAQNPTFYYVA-----FTGVTVGARRLRIPASAFA 314
Query: 234 ---------IFDSGASYAYFTSRVYQEIVSLIMRDL 260
I DSG + F V E+V L
Sbjct: 315 LRPDGSGGVIIDSGTALTLFPVAVLAEVVRAFRSQL 350
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 133/317 (41%), Gaps = 34/317 (10%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH----KNIVP 68
I Y+ L +G PP+ F DTGS +T+V C + C C + + +++P V
Sbjct: 9 INGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSS-CEQCGRHQDPKFQPDLSSTYQSVK 67
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPL 127
C N C C QC YE +Y + +S G L D+ + F N S
Sbjct: 68 C-NIDC---------NCDDEKQQCVYERQYAEMSTSSGVLGEDI--ISFGNLSALAPQRA 115
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL 185
FGC G L G++G+GRG +SIV L + G+I + C G G +
Sbjct: 116 VFGC--ENMETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAM 173
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGA 239
LG G P S + ++ + +Y + E+ +GK L I DSG
Sbjct: 174 VLG-GISPPSNMVFSQ--SDPVRSPYYNIDLKEIHVAGKPLPLNPTVFDGKHGTILDSGT 230
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
+YAY + IM++L PD IC+ G + Q++ F + + F
Sbjct: 231 TYAYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSSSFPAVEMVF 290
Query: 300 TNRRNSVRLVVPPEAYL 316
N + +L++ PE YL
Sbjct: 291 GNGQ---KLLLSPENYL 304
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 85/318 (26%), Positives = 133/318 (41%), Gaps = 32/318 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR-- 73
Y+ L +G P + F D+GS +T+V PC C + Q + NI+ +PR
Sbjct: 90 YYTTRLYIGTPSQEFALIVDSGSTVTYV----PCATCEQCGNHQSE-SPNIIEAHDPRFQ 144
Query: 74 --CAALHWPNPPR----CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VP 126
++ + P C + QC YE +Y + SS G L D+ + F S
Sbjct: 145 PDLSSTYSPVKCNVDCTCDNERSQCTYERQYAEMSSSSGVLGEDI--MSFGKESELKPQR 202
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGV 184
FGC + G L G++GLGRG++SI+ QL E G+I + C G G G
Sbjct: 203 AVFGCENTE--TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT 260
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSG 238
+ LG P V N +Y + E+ +GK+ L + DSG
Sbjct: 261 MVLGGMPAPPDMVFSH---SNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSG 317
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+YAY + + + + PD IC+ G + + Q++E F + +
Sbjct: 318 TTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMV 377
Query: 299 FTNRRNSVRLVVPPEAYL 316
F N + +L + PE YL
Sbjct: 378 FGNGQ---KLSLSPENYL 392
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 136/317 (42%), Gaps = 34/317 (10%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VP 68
I Y+ L +G PP+ F DTGS +T+V C + C C + + +++P + V
Sbjct: 85 INGYYTTRLWIGTPPQRFALIVDTGSTVTYVPC-STCEHCGRHQDPKFQPDLSETYQPVK 143
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPL 127
C+ P C C +QC Y+ +Y + SS G L D+ + F N S
Sbjct: 144 CT-PDC---------NCDGDTNQCMYDRQYAEMSSSSGVLGEDV--VSFGNLSELAPQRA 191
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVL 185
FGC ++ G L G++GLGRG +SI+ QL + +I + C G G G +
Sbjct: 192 VFGCENDE--TGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAM 249
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGA 239
LG G P + +T + + +Y + E+ +GK L + DSG
Sbjct: 250 ILG-GISPPEDMVFTHSDPDRS--PYYNINLKEMHVAGKKLQLNPKVFDGKHGTVLDSGT 306
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
+YAY + IM++ PD IC+ G + Q+ + F + + F
Sbjct: 307 TYAYLPETAFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVF 366
Query: 300 TNRRNSVRLVVPPEAYL 316
N +L + PE YL
Sbjct: 367 ---ENGHKLSLSPENYL 380
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 87/312 (27%), Positives = 125/312 (40%), Gaps = 45/312 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT--GCTKPPEKQYKPHK----NIVPCS 70
+ V++ +G P + FDTGSDL+WVQC PC+ GC + + P + V C
Sbjct: 85 YVVSVGLGTPARDLTVVFDTGSDLSWVQC-GPCSSGGCYHQQDPLFAPSSSSTFSAVRCG 143
Query: 71 NPRCAALHWPNPPRCKHP------NDQCDYEIEYGDGGSSIGALVTDLFPL------RFS 118
P C PR + +D+C YE+ YGD ++G L D L S
Sbjct: 144 EPEC--------PRARQSCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNAS 195
Query: 119 NGSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
+ +P FGCG N N G D G+ GLGRG++S+ SQ G +C+
Sbjct: 196 ENNSNKLPGFVFGCGEN--NTGLFGKAD--GLFGLGRGKVSLSSQ--AAGKYGEGFSYCL 249
Query: 178 ---GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD---- 230
N G L LG + +TPML S Y + + +G++ +
Sbjct: 250 PSSSSNAHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPAL 309
Query: 231 --LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
LI DSG R Y + + + + K AP L C+ F A
Sbjct: 310 WPAGLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTCY--DFTAHANA 367
Query: 289 TEYFKPLALSFT 300
T +AL F
Sbjct: 368 TVSIPAVALVFA 379
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 129/290 (44%), Gaps = 34/290 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ ++ +VG PP DTGSD+ W+QC PC C + + P K+ I+P S+
Sbjct: 86 YLISYSVGIPPFQLYGIIDTGSDMIWLQC-KPCEKCYNQTTRIFDPSKSNTYKILPFSST 144
Query: 73 RCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FG 130
C ++ C N + C+Y I YGDG S G L + L +NGS T G
Sbjct: 145 TCQSVE---DTSCSSDNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTVIG 201
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREY-GLIRNVIGHCIG--QNGRGVLFL 187
CG N ++G++GLG G +S+++QLR I +C+ N L
Sbjct: 202 CG---RNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSNISSKLNF 258
Query: 188 GDGKVPS-SGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDL-TLIFDSGA 239
GD V S G TP++ + + +Y+ +G + ++ S + +I DSG
Sbjct: 259 GDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGEKGNIIIDSGT 318
Query: 240 SYAYFTSRVYQEIVS----LIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
+ + +Y ++ S L+ D + PL K L +C+R F L
Sbjct: 319 TLTLLPNDIYSKLESAVADLVELDRVKDPL------KQLSLCYRSTFDEL 362
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 80/259 (30%), Positives = 115/259 (44%), Gaps = 45/259 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
+ +NL++G P + F DTGSDL W QC PCT C + P + +PCS+
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPCSSQ 153
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL P C N+ C Y YGDG + G++ T+ L F + S+ N+ TFGCG
Sbjct: 154 LCQALQ---SPTCS--NNSCQYTYGYGDGSETQGSMGTE--TLTFGSVSIPNI--TFGCG 204
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLGD 189
N G + AG++G+GRG +S+ SQL +C IG + L LG
Sbjct: 205 ENNQGFG---QGNGAGLVGMGRGPLSLPSQLD-----VTKFSYCMTPIGSSTSSTLLLGS 256
Query: 190 -GKVPSSGVAWTPMLQNSA-------DLKHYILGPAEL--------LYSGKSCGLKDLTL 233
++G T ++++S L +G L L S G +
Sbjct: 257 LANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTG----GI 312
Query: 234 IFDSGASYAYFTSRVYQEI 252
I DSG + YF YQ +
Sbjct: 313 IIDSGTTLTYFADNAYQAV 331
>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 525
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 88/335 (26%), Positives = 134/335 (40%), Gaps = 58/335 (17%)
Query: 8 FFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE---------- 57
FF ++ + +G P F D GSD+ WV CD C C
Sbjct: 96 FFGNALYWLHYTWIDIGTPNVSFLVALDAGSDMLWVPCD--CIECASLSAGNYNVLDRDL 153
Query: 58 KQYKPH----KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDL 112
QY+P +PC + C + CK D C YE++Y SS G + D
Sbjct: 154 NQYRPSLSNTSRHLPCGHKLCDVHSF-----CKGSKDPCPYEVQYASANTSSSGYVFEDK 208
Query: 113 FPL----RFSNGSVFNVPLTFGCGYNQ-----HNPGPLSPPDTAGVLGLGRGRISIVSQL 163
L + + + + GCG Q H GP GVLGLG G IS+ S L
Sbjct: 209 LHLTSDGKHAEQNSVQASIILGCGRKQTGDYLHGAGP------DGVLGLGPGNISVPSLL 262
Query: 164 REYGLIRNVIGHCIGQNGRGVLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYS 222
+ GLI+N C+ +N G + GD G V + P++ ++ + +G
Sbjct: 263 AKAGLIQNSFSICLDENESGRIIFGDQGHVTQHSTPFLPIIAYMVGVESFCVG------- 315
Query: 223 GKSCGLKD--LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG 280
S LK+ + DSG+S+ + + VYQ++V+ + + + + L W
Sbjct: 316 --SLCLKETRFQALIDSGSSFTFLPNEVYQKVVTEFDKQVNASRIVLQSS-------WEY 366
Query: 281 PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAY 315
+ A Q PL L+F+ RN L+ P Y
Sbjct: 367 CYNASSQELVNIPPLKLAFS--RNQTFLIQNPIFY 399
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 91/342 (26%), Positives = 140/342 (40%), Gaps = 60/342 (17%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRC 74
+ L++G P + DTGSDL W QC PCT C P + P K+ V CS+ C
Sbjct: 1 MELSIGNPAVKYSAIVDTGSDLIWTQC-KPCTECFDQPTPIFDPEKSSSYSKVGCSSGLC 59
Query: 75 AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYN 134
AL N C D C+Y YGD S+ G L T+ F N S+ + FGCG
Sbjct: 60 NALPRSN---CNEDKDACEYLYTYGDYSSTRGLLATETFTFEDEN-SISGIG--FGCGVE 113
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFLG-- 188
G +G++GLGRG +S++SQL+E +C+ LF+G
Sbjct: 114 NEGDG---FSQGSGLVGLGRGPLSLISQLKE-----TKFSYCLTSIEDSEASSSLFIGSL 165
Query: 189 -DGKVPSSGVAW-------TPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-------- 232
G V +G + +L+N Y L + K ++ T
Sbjct: 166 ASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGT 225
Query: 233 --LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK---TLPICWRGPFKALGQ 287
+I DSG + Y ++ ++++ + + L DD L +C++ P A
Sbjct: 226 GGMIIDSGTTITYLEETAFK-----VLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAA--- 277
Query: 288 VTEYFKPLAL-SFTNRRNSVRLVVPPEAYLVISVSTSIIIIA 328
K +A+ L +P E Y+V ST ++ +A
Sbjct: 278 -----KNIAVPKMIFHFKGADLELPGENYMVADSSTGVLCLA 314
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 142/347 (40%), Gaps = 66/347 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F + L++G P + DTGSDL W QC PCT C P + P K+ V CS+
Sbjct: 108 FLMELSIGNPAVKYAAIVDTGSDLIWTQC-KPCTECFDQPTPIFDPEKSSSYSKVGCSSG 166
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL N C D C+Y YGD S+ G L T+ F N S+ + FGCG
Sbjct: 167 LCNALPRSN---CNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDEN-SISGIG--FGCG 220
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFLG 188
G +G++GLGRG +S++SQL+E +C+ LF+G
Sbjct: 221 VENEGDG---FSQGSGLVGLGRGPLSLISQLKE-----TKFSYCLTSIEDSEASSSLFIG 272
Query: 189 ---DGKVPSSGVAWTPMLQNSADLKHYILGPA--ELLYSGKSCGLKDLT----------- 232
G V +G + + L P+ L G + G K L+
Sbjct: 273 SLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSED 332
Query: 233 ----LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK---TLPICWRGPFKAL 285
+I DSG + Y ++ ++++ + + L DD L +C++ P A
Sbjct: 333 GTGGMIIDSGTTITYLEETAFK-----VLKEEFTSRMSLPVDDSGSTGLDLCFKLPNAAK 387
Query: 286 G----QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIA 328
++ +FK L +P E Y+V ST ++ +A
Sbjct: 388 NIAVPKLIFHFK-----------GADLELPGENYMVADSSTGVLCLA 423
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 91/331 (27%), Positives = 132/331 (39%), Gaps = 55/331 (16%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
Y+A + +G P K + DTGSD+ WV C C C + +
Sbjct: 80 YYA-KIGIGTPAKSYYVQVDTGSDIMWVNC-IQCKQCPRRSTLGIELTLYNIDESDSGKL 137
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL---------FPLRF 117
V C + C + CK N C Y YGDG S+ G V D+ +
Sbjct: 138 VSCDDDFCYQISGGPLSGCK-ANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQT 196
Query: 118 SNGSVFNVPLTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+NGSV FGCG Q S + G+LG G+ S++SQL G ++ + HC
Sbjct: 197 ANGSVI-----FGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHC 251
Query: 177 I-GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADL----------KHYILGPAELLYSGKS 225
+ G+NG G+ + G+V V TP++ N + ++ PA+L G
Sbjct: 252 LDGRNGGGIFAI--GRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLFQPGDR 309
Query: 226 CGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
G I DSG + AY +Y+ +V I + + D F+
Sbjct: 310 KG-----AIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYKC-------FQYS 357
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
G+V E F + F NSV L V P YL
Sbjct: 358 GRVDEGFPNVTFHF---ENSVFLRVYPHDYL 385
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 74/267 (27%), Positives = 114/267 (42%), Gaps = 37/267 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH---------KNIV 67
+ + +G PP+ DTGSD+ WV C C GC + Q + + +++
Sbjct: 77 YYTKVKLGTPPRELYVQIDTGSDVLWVSC-GSCNGCPQTSGLQIQLNYFDPGSSSTSSLI 135
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
C + RC + + C N+QC Y +YGDG + G V+DL S+F L
Sbjct: 136 SCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHF----ASIFEGTL 191
Query: 128 T--------FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-G 178
T FGC Q S G+ G G+ +S++SQL G+ V HC+ G
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKG 251
Query: 179 QN-GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL--------K 229
N G GVL LG+ P+ + ++P++ + HY L + +G+ +
Sbjct: 252 DNSGGGVLVLGEIVEPN--IVYSPLVPSQ---PHYNLNLQSISVNGQIVRIAPSVFATSN 306
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLI 256
+ I DSG + AY Y V I
Sbjct: 307 NRGTIVDSGTTLAYLAEEAYNPFVIAI 333
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 86/287 (29%), Positives = 124/287 (43%), Gaps = 42/287 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ + L +G PP + DTGSDL W QC PCT C K P + P + V C +
Sbjct: 108 YLIELAIGTPPVSYPAVLDTGSDLIWTQC-KPCTRCYKQPTPIFDPKKSSSFSKVSCGSS 166
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+AL C +D C+Y YGD + G L T+ F S V + FGCG
Sbjct: 167 LCSALPSST---C---SDGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCG 220
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLGD 189
+ G +G++GLGRG +S+VSQL+E +C I VL LG
Sbjct: 221 EDNEGDG---FEQASGLVGLGRGPLSLVSQLKE-----QRFSYCLTPIDDTKESVLLLGS 272
Query: 190 -GKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDS 237
GKV + V TP+L+N Y L + ++ T +I DS
Sbjct: 273 LGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDS 332
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGP 281
G + Y + Y+ + ++ I + KLA D + L +C+ P
Sbjct: 333 GTTITYVQQKAYEA----LKKEFI-SQTKLALDKTSSTGLDLCFSLP 374
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 79/254 (31%), Positives = 116/254 (45%), Gaps = 35/254 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
+ +NL++G P + F DTGSDL W QC PCT C + P + +PCS+
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPCSSQ 153
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL + P C N+ C Y YGDG + G++ T+ L F + S+ N+ TFGCG
Sbjct: 154 LCQAL---SSPTCS--NNFCQYTYGYGDGSETQGSMGTE--TLTFGSVSIPNI--TFGCG 204
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQL--REYGLIRNVIGHCIGQNGRGVLFLGD- 189
N G + AG++G+GRG +S+ SQL ++ IG N L LG
Sbjct: 205 ENNQGFG---QGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTPSN----LLLGSL 257
Query: 190 GKVPSSGVAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDLT----LIFDSG 238
++G T ++Q+S L +G L + L +I DSG
Sbjct: 258 ANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSG 317
Query: 239 ASYAYFTSRVYQEI 252
+ YF + YQ +
Sbjct: 318 TTLTYFVNNAYQSV 331
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 71/257 (27%), Positives = 104/257 (40%), Gaps = 30/257 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
YF + +G P K F DTGSD+ W+ C+ C C K + +
Sbjct: 71 YFT-KVKMGSPAKEFYVQIDTGSDILWLNCNT-CNNCPKSSGLGIDLNYFDTASSSTAAL 128
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG-SVFN- 124
V CS+P C+ +C +QC Y +YGDG + G V D G SVF+
Sbjct: 129 VSCSDPVCSYAVQTATSQCSSQANQCSYTFQYGDGSGTSGYYVYDAMYFDVIMGQSVFSN 188
Query: 125 --VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
+ FGC Q + G+ G G G +S+VSQ+ G+ V HC+ G
Sbjct: 189 SSSTVVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHCLKGQGS 248
Query: 183 GVLFLGDGKVPSSGVAWTPM--LQNSADLKHYILGPAELLYSGKSCGL--------KDLT 232
G L G++ + +TP+ LQ HY L + +G+ + +
Sbjct: 249 GGGILVLGEILEPNIVYTPLVPLQ-----PHYNLNLQSIAVNGQILPIDQDVFATGNNRG 303
Query: 233 LIFDSGASYAYFTSRVY 249
I DSG + AY Y
Sbjct: 304 TIVDSGTTLAYLVQEAY 320
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 80/270 (29%), Positives = 119/270 (44%), Gaps = 32/270 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +VG PP DTGSD+ W+QC+ PC C + P K+ +PC +
Sbjct: 87 YLMTYSVGTPPTKIYGIADTGSDIVWLQCE-PCEQCYNQTTPIFNPSKSSSYKNIPCLSK 145
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
C H C N C Y+I YGD S G L D L ++GS + P T GC
Sbjct: 146 LC---HSVRDTSCSDQN-SCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKTVIGC 201
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRGVL 185
G + N G ++G++GLG G +S+++QL I +C+ N +L
Sbjct: 202 GTD--NAGTFGGA-SSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSIL 256
Query: 186 FLGDGKVPS-SGVAWTPMLQNS-----ADLKHYILGPAELLYSGKSCGLKDL-TLIFDSG 238
GD V S GV TP+++ L+ + +G + + G S G D +I DSG
Sbjct: 257 SFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSG 316
Query: 239 ASYAYFTSRVYQE----IVSLIMRDLIGTP 264
+ S VY +V L+ D + P
Sbjct: 317 TTLTLIPSDVYTNLESAVVDLVKLDRVDDP 346
>gi|168021169|ref|XP_001763114.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685597|gb|EDQ71991.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 641
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 122/309 (39%), Gaps = 64/309 (20%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC--TKPPEKQYKPHKNI-VPCSNPR 73
+ V + VGK KLF F DTGS +W+ C P P Y P K + V C +P
Sbjct: 126 YYVKMRVGKSKKLFHFLIDTGSQPSWLHCKWPAIEKHPVAGPNGMYVPEKEVQVDCRSPE 185
Query: 74 CAALHW--------PNPPRCKHPND-QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
C +L N C PND +C Y+I Y D G V D+ L G +
Sbjct: 186 CLSLQRIPSNFNNIRNLFPCNEPNDWRCTYDITYLDRSHLRGFYVQDVVSLATLEGEQLD 245
Query: 125 VPLTFGCGYNQHNPGPL-------------------SPPDTAGVLGLGRGRISIVSQLRE 165
+T G H P SP T G+LGL +G S VSQL+
Sbjct: 246 AKITLGYATPNHRAAPFGFCSWHASSDRYGEEELERSPLTTDGLLGLNKGTESFVSQLKR 305
Query: 166 YGLI-RNVIGHCIG-------QNGRGVLFLGDGKVPSS-GVAWTPMLQNSAD-----LKH 211
G I +V+GHC + G +F G K+ S + W+PM ++D +K
Sbjct: 306 QGAISSHVVGHCFRSLDTTDFETNSGFMFFGKSKLLDSLPITWSPMASPTSDGFILVVKL 365
Query: 212 YILGP---------AELLYS--GKSCGLKDLTL--------IFDSGASYAYFTSRVYQEI 252
+ P AE LY K L +L+L I DSG++ + +Y I
Sbjct: 366 KVPLPLKRDGQSSIAEYLYKVYVKKIKLGELSLEMTDKSNIIIDSGSTTTHILDSIYNPI 425
Query: 253 VSLIMRDLI 261
+ + +
Sbjct: 426 RDEVAKQAL 434
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 74/268 (27%), Positives = 106/268 (39%), Gaps = 37/268 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
+ + + +G PPK F+ DTGSDL W+QC PC+ C + Y P + +
Sbjct: 4 YTMEIELGSPPKKFNAIVDTGSDLVWIQCK-PCSQCYSQSDPIYDPSASSTFAKTSCSTS 62
Query: 77 LHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCGYN 134
P C C Y +YGD S+ G + LR S GS P FGCG
Sbjct: 63 SCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCG-- 120
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNGRGVLFLGD 189
+ N G AG++GLG+G+IS+ +QL I N +C+ + L G
Sbjct: 121 RLNSGSFG--GAAGIVGLGQGKISLSTQLGS--AINNKFSYCLVDFDDDSSKTSPLIFGS 176
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------------- 233
SG TP++ NS +Y +G + GK L +
Sbjct: 177 SASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRAL 236
Query: 234 -------IFDSGASYAYFTSRVYQEIVS 254
IFDSG + VY ++ S
Sbjct: 237 EVNSGGTIFDSGTTLTLLDDAVYSKVKS 264
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 96/327 (29%), Positives = 132/327 (40%), Gaps = 47/327 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKP----H 63
YF + +G P K + DTGSD+ WV C C P K Y P
Sbjct: 89 YF-TQIGIGTPSKGYYVQVDTGSDILWVNC----ISCDSCPRKSGLGIDLTLYDPTASAS 143
Query: 64 KNIVPCSNPRCA-ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG-- 120
V C CA A + PP C N C Y I YGDG S+ G V D +G
Sbjct: 144 SKTVTCGQEFCATATNGGVPPSCA-ANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGDG 202
Query: 121 --SVFNVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHC 176
++ N +TFGCG G L + A G+LG G+ S++SQL G + + HC
Sbjct: 203 QTNLANASVTFGCGAKI--GGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHC 260
Query: 177 IGQ-NGRGVLFLGDGKVPSSGVAWTPML----QNSADLKHYILGPAELLYSGK--SCGLK 229
+ NG G+ +G+ P V TP++ + LK +G + L G
Sbjct: 261 LDTVNGGGIFAIGNVVQPK--VKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGGG 318
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
I DSG + AY VY+ ++S + + LK D +C F+ G V
Sbjct: 319 SRGTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQD----FLC----FQYSGSVD 370
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYL 316
F + F + LVV P YL
Sbjct: 371 NGFPEVTFHF---DGDLPLVVYPHDYL 394
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 83/271 (30%), Positives = 120/271 (44%), Gaps = 45/271 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V+L +G PP+ DTGSDL W QC APC C P+ + P ++ + C+
Sbjct: 102 YVVDLAIGTPPQPVSALLDTGSDLIWTQC-APCASCLAQPDPLFAPGESASYEPMRCAGQ 160
Query: 73 RCA-ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFG 130
C+ LH C+ P D C Y YGDG ++G T+ F S G + VPL FG
Sbjct: 161 LCSDILHHG----CEMP-DTCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMTVPLGFG 215
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG----VLF 186
CG N G L+ + +G++G GR +S+VSQL IR +C+ G G +LF
Sbjct: 216 CG--SMNVGSLN--NGSGIVGFGRNPLSLVSQLS----IRR-FSYCLTSYGSGRKSTLLF 266
Query: 187 -------LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT------- 232
GD P V TP+LQ+ + Y + A L + + +
Sbjct: 267 GSLSGGVYGDATGP---VQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDG 323
Query: 233 ---LIFDSGASYAYFTSRVYQEIVSLIMRDL 260
+I DSG + V E+V + L
Sbjct: 324 SGGVIVDSGTALTLLPGAVLAEVVRAFRQQL 354
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 87/315 (27%), Positives = 134/315 (42%), Gaps = 36/315 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C + + ++ P + + C N
Sbjct: 82 YYTTRLWIGTPPQQFALIVDTGSTVTYVPC-STCEQCGRHQDPKFDPESSSTYKPIKC-N 139
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP--LTF 129
C C QC YE +Y + +S G L D+ + F N S +P F
Sbjct: 140 IDCI---------CDSDGVQCVYERQYAEMSTSSGVLGEDV--ISFGNQSEL-IPQRAVF 187
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFL 187
GC G L G++GLG G +S+V QL E G I + C G G G + L
Sbjct: 188 GC--ENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVL 245
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK----SCGLKD--LTLIFDSGASY 241
G G P S + +T + +Y + E+ +GK S G+ D + DSG +Y
Sbjct: 246 G-GISPPSDMIFT--YSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTY 302
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
AY + + IM ++ PD IC+ G +++ F + + F N
Sbjct: 303 AYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFEN 362
Query: 302 RRNSVRLVVPPEAYL 316
+ +L + PE Y
Sbjct: 363 GQ---KLSLTPENYF 374
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 86/318 (27%), Positives = 134/318 (42%), Gaps = 35/318 (11%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VP 68
I Y+ L +G PP++F D+GS +T+V C + C C K + +++P + V
Sbjct: 89 INGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQDPKFQPEMSSTYQPVK 147
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPL 127
C N C C +QC YE EY + SS G L DL + F N S
Sbjct: 148 C-NMDC---------NCDDDREQCVYEREYAEHSSSKGVLGEDL--ISFGNESQLTPQRA 195
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVL 185
FGC + G L G++GLG+G +S+V QL + GLI N G C G G G +
Sbjct: 196 VFGCETVE--TGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSM 253
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGA 239
LG PS V S +Y + + +GK L + DSG
Sbjct: 254 ILGGFDYPSDMVFTDSDPDRSP---YYNIDLTGIRVAGKQLSLHSRVFDGEHGAVLDSGT 310
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR-GPFKALGQVTEYFKPLALS 298
+YAY + +MR++ PD C++ + ++++ F + +
Sbjct: 311 TYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKIFPSVEMV 370
Query: 299 FTNRRNSVRLVVPPEAYL 316
F ++ ++ PE Y+
Sbjct: 371 F---KSGQSWLLSPENYM 385
>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
Length = 473
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 80/254 (31%), Positives = 111/254 (43%), Gaps = 36/254 (14%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE---------KQYKPHKNI--- 66
N+TVG P F DTGSDL W+ CD CT C + + Y P+ +
Sbjct: 57 ANVTVGTPSDWFMVALDTGSDLFWLPCD--CTNCVRELKAPGGSSLDLNIYSPNASSTST 114
Query: 67 -VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSVFN 124
VPC++ C RC P C Y+I Y +G SS G LV D+ L ++ S
Sbjct: 115 KVPCNSTLCT-----RGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKA 169
Query: 125 VP--LTFGCGYNQ----HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
+P +TFGCG Q H+ + P+ G+ GLG IS+ S L + G+ N C G
Sbjct: 170 IPARVTFGCGQVQTGVFHDG---AAPN--GLFGLGLEDISVPSVLAKEGIAANSFSMCFG 224
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSG 238
+G G + GD S TP+ + I + G + G + +FDSG
Sbjct: 225 NDGAGRISFGDKG--SVDQRETPLNIRQPHPTYNI--TVTKISVGGNTGDLEFDAVFDSG 280
Query: 239 ASYAYFTSRVYQEI 252
S+ Y T Y I
Sbjct: 281 TSFTYLTDAAYTLI 294
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 87/315 (27%), Positives = 134/315 (42%), Gaps = 36/315 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C + + ++ P + + C N
Sbjct: 82 YYTTRLWIGTPPQQFALIVDTGSTVTYVPC-STCEQCGRHQDPKFDPESSSTYKPIKC-N 139
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP--LTF 129
C C QC YE +Y + +S G L D+ + F N S +P F
Sbjct: 140 IDCI---------CDSDGVQCVYERQYAEMSTSSGVLGEDV--ISFGNQSEL-IPQRAVF 187
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFL 187
GC G L G++GLG G +S+V QL E G I + C G G G + L
Sbjct: 188 GC--ENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVL 245
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK----SCGLKD--LTLIFDSGASY 241
G G P S + +T + +Y + E+ +GK S G+ D + DSG +Y
Sbjct: 246 G-GISPPSDMIFT--YSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTY 302
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
AY + + IM ++ PD IC+ G +++ F + + F N
Sbjct: 303 AYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFEN 362
Query: 302 RRNSVRLVVPPEAYL 316
+ +L + PE Y
Sbjct: 363 GQ---KLSLTPENYF 374
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 79/289 (27%), Positives = 119/289 (41%), Gaps = 55/289 (19%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHK--- 64
YF + +G PPK + DTGSD+ WV C C+K P K Y P
Sbjct: 87 YF-TEIKLGTPPKRYYVQVDTGSDILWVN----CISCSKCPRKSGLGLDLTFYDPKASSS 141
Query: 65 -NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ V C CAA + P C N C+Y + YGDG S+ G +TD G
Sbjct: 142 GSTVSCDQGFCAATYGGKLPGCT-ANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQ 200
Query: 124 ----NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
N +TFGCG Q S G+LG G+ S++SQL G + + HC+
Sbjct: 201 TQPGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDT 260
Query: 180 -NGRGVLFLGDGKVP--------SSGVAWTPML----------QNSADLKHYILG----- 215
G G+ +G+ P + G+ P+ + +LK +G
Sbjct: 261 IKGGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQ 320
Query: 216 -PAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIM---RDL 260
PA + +G+ G I DSG + Y V+++++ ++ RD+
Sbjct: 321 LPAHVFETGEKKG-----TIIDSGTTLTYLPELVFKQVMDVVFSKHRDI 364
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 86/318 (27%), Positives = 133/318 (41%), Gaps = 42/318 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK------PPEKQ--YKPHKNIV 67
Y+ L +G PP++F DTGS +T+V C + C C + PE Y+P K +
Sbjct: 83 YYTTRLWIGTPPQMFALIVDTGSTVTYVPC-STCEQCGRHQDPKFQPESSSTYQPVKCTI 141
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VP 126
C+ C QC YE +Y + +S G L DL + F N S
Sbjct: 142 DCN--------------CDSDRMQCVYERQYAEMSTSSGVLGEDL--ISFGNQSELAPQR 185
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGV 184
FGC G L G++GLGRG +SI+ QL + +I + C G G G
Sbjct: 186 AVFGC--ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVGGGA 243
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSG 238
+ LG G P S +A+ + +Y + E+ +GK L + DSG
Sbjct: 244 MVLG-GISPPSDMAFA--YSDPVRSPYYNIDLKEIHVAGKRLPLNANVFDGKHGTVLDSG 300
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+YAY + I+++L PD IC+ G + Q+++ F + +
Sbjct: 301 TTYAYLPEAAFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMV 360
Query: 299 FTNRRNSVRLVVPPEAYL 316
F N + + + PE Y+
Sbjct: 361 FENGQ---KYTLSPENYM 375
>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 513
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 83/260 (31%), Positives = 116/260 (44%), Gaps = 39/260 (15%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE---------KQYKPHK 64
F ++A N+TVG P F DTGSDL W+ CD CT C + + Y P+
Sbjct: 102 FLHYA-NVTVGTPSDWFMVALDTGSDLFWLPCD--CTNCVRELKAPGGSSLDLNIYSPNA 158
Query: 65 NI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSN 119
+ VPC++ C RC P C Y+I Y +G SS G LV D+ L ++
Sbjct: 159 SSTSTKVPCNSTLCT-----RGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSND 213
Query: 120 GSVFNVP--LTFGCGYNQ----HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVI 173
S +P +TFGCG Q H+ + P+ G+ GLG IS+ S L + G+ N
Sbjct: 214 KSSKAIPARVTFGCGQVQTGVFHDG---AAPN--GLFGLGLEDISVPSVLAKEGIAANSF 268
Query: 174 GHCIGQNGRGVLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT 232
C G +G G + GD G V TP+ + I + G + G +
Sbjct: 269 SMCFGNDGAGRISFGDKGSVDQRE---TPLNIRQPHPTYNI--TVTKISVGGNTGDLEFD 323
Query: 233 LIFDSGASYAYFTSRVYQEI 252
+FDSG S+ Y T Y I
Sbjct: 324 AVFDSGTSFTYLTDAAYTLI 343
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 88/322 (27%), Positives = 127/322 (39%), Gaps = 35/322 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y+ + +G PP F DTGS +T+V PC+ CT Q + + C +PR
Sbjct: 39 YYTSRVFIGTPPNEFALIVDTGSTVTYV----PCSSCTHCGHHQASFSTHRLFCRDPRFK 94
Query: 76 ALHWPNPPR------------CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ + + C + QC YE Y + +S G L DL L F S
Sbjct: 95 PENSSSYQKIGCRSSDCITGLCDSNSHQCKYERMYAEMSTSKGVLGKDL--LDFGPASRL 152
Query: 124 NVPL-TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QN 180
L +FGC G L G++GLGRG +SIV QL G I + C G
Sbjct: 153 QSQLLSFGC--ETAESGDLYLQVADGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDE 210
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD------LTLI 234
G G + LG PS V + S +Y L E+ G S L I
Sbjct: 211 GGGSMVLGAIPAPSGMVFAKSDPRRS---NYYNLELTEIQVQGASLKLDSNVFNGKFGTI 267
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG +YAY R ++ ++ L PD IC+ G ++ ++F
Sbjct: 268 LDSGTTYAYLPDRAFEAFTDAVVAQLGSLQAVDGPDPNYPDICYAGAGTDTKELGKHFPL 327
Query: 295 LALSFTNRRNSVRLVVPPEAYL 316
+ F + ++ + PE YL
Sbjct: 328 VDFVFAENQ---KVSLAPENYL 346
>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 523
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 74/249 (29%), Positives = 102/249 (40%), Gaps = 23/249 (9%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR------- 73
+ +G P F D GSDL WV CD C C Y + NP
Sbjct: 107 IDLGTPSVPFLVALDVGSDLLWVPCD--CIQCAPLSANYYSVLDRDLSEYNPALSSTSKH 164
Query: 74 --CAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPL----RFSNGSVFNVP 126
C CK ND C Y+ +Y D S+ G ++ D L + S+
Sbjct: 165 LFCGHQLCAWSTTCKSANDPCTYKRDYYSDNTSTSGFMIEDKLQLTSFSKHGTHSLLQAS 224
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG-VL 185
+ FGCG Q L GV+GLG G IS+ + L + GL+RN C NG G +L
Sbjct: 225 VVFGCGRKQSG-SYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSGRIL 283
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD-LTLIFDSGASYAYF 244
F DG + P+ + Y +G E G SC + + DSG+S+ Y
Sbjct: 284 FGDDGPATQQTTQFLPLF---GEFAAYFIG-VESFCVGSSCLQRSGFQALVDSGSSFTYL 339
Query: 245 TSRVYQEIV 253
+ VY++IV
Sbjct: 340 PAEVYKKIV 348
>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
Length = 469
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 77/271 (28%), Positives = 115/271 (42%), Gaps = 23/271 (8%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----VPC 69
+ VG P F DTGSDL WV CD AP +G ++ Y+P ++ +PC
Sbjct: 100 VDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSRHLPC 159
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSV-FNVPL 127
S+ C ++ P C +P C Y I+Y + +S G L+ D L + V N +
Sbjct: 160 SHELCQSV-----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASV 214
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
GCG Q L G+LGLG IS+ S L GL++N C ++ G +F
Sbjct: 215 IIGCGQKQSG-DYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFF 273
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSR 247
GD VPS TP + L+ Y + + K + DSG S+
Sbjct: 274 GDQGVPSQQS--TPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPFD 331
Query: 248 VYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
VY+ + + T ++ +D T C+
Sbjct: 332 VYKAFTMEFDKQMNAT--RVPYEDTTWKYCY 360
>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
gi|194704920|gb|ACF86544.1| unknown [Zea mays]
gi|223949445|gb|ACN28806.1| unknown [Zea mays]
gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
Length = 515
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 77/271 (28%), Positives = 115/271 (42%), Gaps = 23/271 (8%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----VPC 69
+ VG P F DTGSDL WV CD AP +G ++ Y+P ++ +PC
Sbjct: 100 VDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSRHLPC 159
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSV-FNVPL 127
S+ C ++ P C +P C Y I+Y + +S G L+ D L + V N +
Sbjct: 160 SHELCQSV-----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASV 214
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
GCG Q L G+LGLG IS+ S L GL++N C ++ G +F
Sbjct: 215 IIGCGQKQSG-DYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFF 273
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSR 247
GD VPS TP + L+ Y + + K + DSG S+
Sbjct: 274 GDQGVPSQQS--TPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPFD 331
Query: 248 VYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
VY+ + + T ++ +D T C+
Sbjct: 332 VYKAFTMEFDKQMNAT--RVPYEDTTWKYCY 360
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 65/203 (32%), Positives = 98/203 (48%), Gaps = 18/203 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK--QYKPHKNI----VPCS 70
V+L VG PP+ DTGS+L+W+ C AP G ++P ++ VPC
Sbjct: 66 LTVSLAVGTPPQNVTMVLDTGSELSWLLC-APGGGGGGGGRSALSFRPRASLTFASVPCD 124
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ +C + P+PP C + QC + Y DG SS GAL T++F + G + FG
Sbjct: 125 SAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTV----GQGPPLRAAFG 180
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-QNGRGVLFLGD 189
C + P TAG+LG+ RG +S VSQ +CI ++ GVL LG
Sbjct: 181 CMATAFDTSP-DGVATAGLLGMNRGALSFVSQAST-----RRFSYCISDRDDAGVLLLGH 234
Query: 190 GKVPSSGVAWTPMLQNSADLKHY 212
+P + +TP+ Q + L ++
Sbjct: 235 SDLPFLPLNYTPLYQPAMPLPYF 257
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 81/257 (31%), Positives = 116/257 (45%), Gaps = 44/257 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHK----NIVPCSN 71
F V + G P + + FDTGSD++W+QC PC+G C K + + P K ++VPC +
Sbjct: 135 FVVTVGFGTPAQTYTVIFDTGSDVSWIQC-LPCSGHCYKQHDPIFDPTKSATYSVVPCGH 193
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
P+CAA + +C N C Y++EYGDG SS G L + L S +P FG
Sbjct: 194 PQCAAA---DGSKCS--NGTCLYKVEYGDGSSSAGVLSHETLSLT----STRALPGFAFG 244
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG Q N G D G++GLGRG++S+ SQ +C+ + +L G
Sbjct: 245 CG--QTNLGDFG--DVDGLIGLGRGQLSLSSQAA--ASFGGTFSYCLPSDNTTHGYLTIG 298
Query: 191 -KVPSSG--VAWTPMLQN------------SADLKHYILGPAELLYSGKSCGLKDLTLIF 235
P+S V +T M+Q S D+ YIL L++ D
Sbjct: 299 PTTPASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFT-------DDGTFL 351
Query: 236 DSGASYAYFTSRVYQEI 252
DSG Y Y +
Sbjct: 352 DSGTILTYLPPEAYTAL 368
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 96/328 (29%), Positives = 139/328 (42%), Gaps = 42/328 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V+L +G PP + DTGSDL W QC APC C P + K+ +PC +
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCADQPTPYFDVKKSATYRALPCRSS 147
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFGC 131
RCA+L + P C C Y+ YGD S+ G L + F +N + V + FGC
Sbjct: 148 RCASL---SSPSCFK--KMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGC 202
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR---GVLFLG 188
G N G L+ +++G++G GRG +S+VSQL + + R GV
Sbjct: 203 G--SLNAGDLA--NSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLYFGVYANL 258
Query: 189 DGKVPSSG--VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFD 236
SSG V TP + N A Y L + K + L +I D
Sbjct: 259 SSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIID 318
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGT-PLKLAPD-DKTLPICWRGPFKALGQVTEYFKP 294
SG S + Y+ + R L+ PL D D L C++ P VT
Sbjct: 319 SGTSITWLQQDAYEA----VRRGLVSAIPLPAMNDTDIGLDTCFQWPPPP--NVTVTVPD 372
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISVST 322
L F +S + + PE Y++I+ +T
Sbjct: 373 LVFHF----DSANMTLLPENYMLIASTT 396
>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
Length = 367
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 81/170 (47%), Gaps = 17/170 (10%)
Query: 10 FFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN---- 65
P + V L +G PP F DT SDL W QC PCTGC + + P +
Sbjct: 82 IMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYA 140
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+PCS+ C L + RC H +D+ C Y Y ++ G L D + G
Sbjct: 141 ALPCSSDTCDEL---DVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVI----GEDAF 193
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL--REYGLIRNV 172
+ FGC + P PP +GV+GLGRG +S+VSQL R YG+I ++
Sbjct: 194 RGVAFGCSTSSTGGAP--PPQASGVVGLGRGPLSLVSQLSVRRYGMIIDI 241
>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 525
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 74/247 (29%), Positives = 111/247 (44%), Gaps = 22/247 (8%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPE-KQYKPH-------KNIVP 68
+ +G P F DTGSDL W+ C+ AP + +K P Q P+ V
Sbjct: 115 IDIGTPNVQFLVVLDTGSDLLWIPCECESCAPLSAESKDPRTSQLNPYTPSLSSTAKPVL 174
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTD-LFPLRFSNGSVFNVP 126
CS+P C C P DQC YEI Y +S GAL D ++ +R S G+ +P
Sbjct: 175 CSDPLCEM-----SSTCMAPTDQCPYEINYVSANTSTSGALYEDYMYFMRESGGNPVKLP 229
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
+ GCG Q L G++GLG IS+ ++L G + + CI G G L
Sbjct: 230 VYLGCGKVQTG-SLLKGAAPNGLMGLGTTDISVPNKLASTGQLADSFSLCISPGGSGTLT 288
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTS 246
GD + P++ TP++ S + + + + G + L +FD+G S+ Y +
Sbjct: 289 FGD-EGPAAQRT-TPIIPKSVSMLDTYIVEIDSITVGNTNLLMASHALFDTGTSFTYLSK 346
Query: 247 RVYQEIV 253
VY + V
Sbjct: 347 TVYPQFV 353
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 65/203 (32%), Positives = 98/203 (48%), Gaps = 18/203 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK--QYKPHKNI----VPCS 70
V+L VG PP+ DTGS+L+W+ C AP G ++P ++ VPC
Sbjct: 65 LTVSLAVGTPPQNVTMVLDTGSELSWLLC-APGGGGGGGGRSALSFRPRASLTFASVPCG 123
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ +C + P+PP C + QC + Y DG SS GAL T++F + G + FG
Sbjct: 124 SAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTV----GQGPPLRAAFG 179
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-QNGRGVLFLGD 189
C + P TAG+LG+ RG +S VSQ +CI ++ GVL LG
Sbjct: 180 CMATAFDTSP-DGVATAGLLGMNRGALSFVSQAST-----RRFSYCISDRDDAGVLLLGH 233
Query: 190 GKVPSSGVAWTPMLQNSADLKHY 212
+P + +TP+ Q + L ++
Sbjct: 234 SDLPFLPLNYTPLYQPAMPLPYF 256
>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
Length = 485
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 77/271 (28%), Positives = 115/271 (42%), Gaps = 23/271 (8%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----VPC 69
+ VG P F DTGSDL WV CD AP +G ++ Y+P ++ +PC
Sbjct: 70 VDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSRHLPC 129
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSV-FNVPL 127
S+ C ++ P C +P C Y I+Y + +S G L+ D L + V N +
Sbjct: 130 SHELCQSV-----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASV 184
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
GCG Q L G+LGLG IS+ S L GL++N C ++ G +F
Sbjct: 185 IIGCGQKQSG-DYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFF 243
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSR 247
GD VPS TP + L+ Y + + K + DSG S+
Sbjct: 244 GDQGVPSQQS--TPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPLD 301
Query: 248 VYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
VY+ + + T ++ +D T C+
Sbjct: 302 VYKAFTMEFDKQMNAT--RVPYEDTTWKYCY 330
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 91/328 (27%), Positives = 138/328 (42%), Gaps = 39/328 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC-TKPP--------EKQYKPHKNI 66
Y+A + +G PPK + DTGSD+ WV C C C T+ + + +
Sbjct: 83 YYA-KIGIGTPPKNYYLQVDTGSDIMWVNC-IQCKECPTRSSLGMDLTLYDIKESSSGKL 140
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV---- 122
VPC C ++ C N C Y YGDG S+ G V D+ +G +
Sbjct: 141 VPCDQEFCKEINGGLLTGCT-ANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDS 199
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTA---GVLGLGRGRISIVSQLREYGLIRNVIGHCI-G 178
N + FGCG Q G LS + G+LG G+ S++SQL G ++ + HC+ G
Sbjct: 200 ANGSIVFGCGARQ--SGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCLNG 257
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILGPAELLYSGKSCGLKDLT-L 233
NG G+ +G P V TP+L + S ++ +G L S + D
Sbjct: 258 VNGGGIFAIGHVVQPK--VNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGDRKGT 315
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
I DSG + AY +Y+ +V ++ ++ D+ T F+ V + F
Sbjct: 316 IIDSGTTLAYLPEGIYEPLVYKMISQHPDLKVQTLHDEYTC-------FQYSESVDDGFP 368
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISVS 321
+ F N + L V P YL SV+
Sbjct: 369 AVTFFF---ENGLSLKVYPHDYLFPSVN 393
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 88/320 (27%), Positives = 137/320 (42%), Gaps = 39/320 (12%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VP 68
I Y+ L +G PP++F D+GS +T+V C + C C K + +++P + V
Sbjct: 90 INGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQDPKFQPELSSTYQPVK 148
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPL 127
C N C C +QC YE EY + SS G L DL + F N S
Sbjct: 149 C-NMDC---------NCDDDKEQCVYEREYAEHSSSKGVLGEDL--ISFGNESQLTPQRA 196
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVL 185
FGC + G L G++GLG+G +S+V QL + GLI N G C G G G +
Sbjct: 197 VFGCETVE--TGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSM 254
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGA 239
LG PS + S +Y + + +GK L + DSG
Sbjct: 255 ILGGFDYPSDMIFTDSDPDRSP---YYNIDLTGIRVAGKKLSLNSRVFDGEHGAVLDSGT 311
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLPICWR-GPFKALGQVTEYFKPLA 296
+YAY + +MR++ +PLK PD C+ + ++++ F +
Sbjct: 312 TYAYLPDAAFAAFEEAVMREV--SPLKQIDGPDPNFKDTCFLVAASNDVSELSKIFPSVE 369
Query: 297 LSFTNRRNSVRLVVPPEAYL 316
+ F ++ ++ PE Y+
Sbjct: 370 MIF---KSGQSWLLSPENYM 386
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 90/323 (27%), Positives = 132/323 (40%), Gaps = 39/323 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
Y+A + +G PPK + DTGSD+ WV C C C + +
Sbjct: 85 YYA-KIGIGTPPKNYYLQVDTGSDIMWVNC-IQCKECPTRSNLGMDLTLYDIKESSSGKF 142
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV---- 122
VPC C ++ C N C Y YGDG S+ G V D+ +G +
Sbjct: 143 VPCDQEFCKEINGGLLTGCT-ANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDS 201
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDT---AGVLGLGRGRISIVSQLREYGLIRNVIGHCI-G 178
N + FGCG Q G LS + G+LG G+ S++SQL G ++ + HC+ G
Sbjct: 202 ANGSIVFGCGARQ--SGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCLNG 259
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPML----QNSADLKHYILGPAELLYSGKSCGLKDLT-L 233
NG G+ +G P V TP+L S ++ +G A L S + D
Sbjct: 260 VNGGGIFAIGHVVQPK--VNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGDRKGT 317
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
I DSG + AY +Y+ +V I+ ++ D+ T F+ V + F
Sbjct: 318 IIDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRTLHDEYTC-------FQYSESVDDGFP 370
Query: 294 PLALSFTNRRNSVRLVVPPEAYL 316
+ F N + L V P YL
Sbjct: 371 AVTFYF---ENGLSLKVYPHDYL 390
>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 530
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 102/337 (30%), Positives = 144/337 (42%), Gaps = 50/337 (14%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPE------KQYKPHKN 65
F ++AV + +G P F DTGSDL WV CD C C P+ Y P K+
Sbjct: 97 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CIKCAPLASPDYGDLKFDMYSPRKS 153
Query: 66 I----VPCSNPRCAALHWPNP-PRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSN 119
VPCS+ C +P C ++ C Y I+Y + SS G LV D+ L +
Sbjct: 154 STSRKVPCSSSLC------DPQADCSAASNSCPYSIQYLSENTSSKGVLVEDVLYLTTES 207
Query: 120 GS--VFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGH 175
G + P+TFGCG Q G +P G+LGLG S+ S L G+ N
Sbjct: 208 GQSKITQAPITFGCGQVQSGSFLGSAAP---NGLLGLGMDSKSVPSLLASKGIAANSFSM 264
Query: 176 CIGQNGRGVLFLGDGKVPSSGVAWTPM---LQNSADLKHYILGPAELLYSGKSCGLKDLT 232
C G++G G + GD SS TP+ QN +Y + + GKS K +
Sbjct: 265 CFGEDGHGRINFGD--TGSSDQLETPLNIYKQN----PYYNISITGAMVGGKSFDTK-FS 317
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
+ DSG S+ + +Y EI S + + L D ++P + A G V
Sbjct: 318 AVVDSGTSFTALSDPMYTEITSTFNAQVKESRKHL---DASMPFEYCYSISAQGAV---- 370
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAY 329
P +S T + S+ V P ++ TS IAY
Sbjct: 371 NPPNISLTAKGGSIFPVNGP---IITITDTSSRPIAY 404
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/270 (30%), Positives = 112/270 (41%), Gaps = 24/270 (8%)
Query: 24 GKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAA--- 76
G P DTGSDLTWVQC PC+ C + + P + V C+ CAA
Sbjct: 197 GSPAANLTVIVDTGSDLTWVQCK-PCSACYAQRDPLFDPAGSATYAAVRCNASACAASLK 255
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
P C N++C Y + YGDG S G L TD L ++ F FGCG +
Sbjct: 256 AATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGASLDGF----VFGCGLS-- 309
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQ--LREYGLIRNVIGHCIGQNGRGVLFLGDGKVP- 193
N G TAG++GLGR +S+VSQ LR G+ + + G L LG
Sbjct: 310 NRGLFG--GTAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSLGGDASSY 367
Query: 194 --SSGVAWTPMLQNSADLKHYILGPAELLYSGKSC---GLKDLTLIFDSGASYAYFTSRV 248
++ VA+T M+ + A Y L G + GL ++ DSG V
Sbjct: 368 RNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSV 427
Query: 249 YQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
Y+ + + R AP L C+
Sbjct: 428 YRGVRAEFTRQFAAAGYPTAPGFSILDTCY 457
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/282 (29%), Positives = 120/282 (42%), Gaps = 35/282 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ VN+ +G P K FDTGSDLTW QC C + + P + + C++
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKTYSNISCTSA 213
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C++L N P C N C Y I+YGD +IG D L + VF+ FG
Sbjct: 214 ACSSLKSATGNSPGCSSSN--CVYGIQYGDSSFTIGFFAKD--KLTLTQNDVFD-GFMFG 268
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYG--------LIRNVIGHCIGQNG 181
CG Q+N G TAG++GLGR +SIV Q +++G R GH NG
Sbjct: 269 CG--QNNKGLFGK--TAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSNGHLTFGNG 324
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFD 236
GV K +G+ +TP +S +Y + + GK+ + ++ I D
Sbjct: 325 NGV---KASKAVKNGITFTP-FASSQGTAYYFIDVLGISVGGKALSISPMLFQNAGTIID 380
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
SG S Y + S + + P AP L C+
Sbjct: 381 SGTVITRLPSTAYGSLKSAFKQFMSKYP--TAPALSLLDTCY 420
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 75/261 (28%), Positives = 110/261 (42%), Gaps = 38/261 (14%)
Query: 24 GKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN-----------IVPCSNP 72
G F+ DTGSD+ WV C+ C+ C P Q N ++PCS+
Sbjct: 75 GXXXXXFNVQIDTGSDILWVNCNT-CSNC--PQSSQLGIELNFFDTVGSSTAALIPCSDL 131
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSVFN--VPLT 128
C + C +QC Y +YGDG + G V+D F L N +
Sbjct: 132 ICTSGVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFNLIMGQPPAVNSTATIV 191
Query: 129 FGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGV 184
FGC +Q G L+ D A G+ G G G +S+VSQL G+ V HC+ NG G+
Sbjct: 192 FGCSISQS--GDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKGDGNGGGI 249
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------IF 235
L LG+ PS + ++P++ + HY L + +G+ + I
Sbjct: 250 LVLGEILEPS--IVYSPLVPSQ---PHYNLNLQSIAVNGQPLPINPAVFSISNNRGGTIV 304
Query: 236 DSGASYAYFTSRVYQEIVSLI 256
D G + AY Y +V+ I
Sbjct: 305 DCGTTLAYLIQEAYDPLVTAI 325
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 85/315 (26%), Positives = 138/315 (43%), Gaps = 36/315 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNP-RC 74
Y+ L +G PP+ F D+GS +T+V C A C C + +++P ++ +P +C
Sbjct: 84 YYTTRLYIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQP--DLSSTYSPVKC 140
Query: 75 AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFGCGY 133
+A C QC YE +Y + SS G L D+ + F S FGC
Sbjct: 141 SA-----DCTCDSDKSQCTYERQYAEMSSSSGVLGEDI--VSFGTESELKPQRAVFGC-- 191
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLGDGK 191
G L G++GLGRG++SI+ QL + G+I + C G G G + LG
Sbjct: 192 ENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMP 251
Query: 192 VPSSGVAWTPMLQNSADLK--HYILGPAELLYSGKSCGLKDLTL------IFDSGASYAY 243
P V S ++ +Y + E+ +GK+ L + DSG +YAY
Sbjct: 252 APPDMV-----FSRSDPVRSPYYNIELKEIHVAGKALRLDPRIFDSKHGTVLDSGTTYAY 306
Query: 244 FTSRVYQEIVSLIMRDLIGTPLK--LAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTN 301
+ + + + PLK PD IC+ G + + Q+++ F + + F +
Sbjct: 307 LPEQAFVAFKDAVTSKV--RPLKKIRGPDPNYKDICFAGAGRNVSQLSQAFPDVDMVFGD 364
Query: 302 RRNSVRLVVPPEAYL 316
+ +L + PE YL
Sbjct: 365 GQ---KLSLSPENYL 376
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 90/326 (27%), Positives = 133/326 (40%), Gaps = 47/326 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
Y+A + +G P + + DTGSD+ WV C C C K + + +
Sbjct: 98 YYA-KIGIGTPARDYYVQVDTGSDIMWVNC-IQCNECPKKSSLGMELTLYDIKESLTGKL 155
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV---- 122
V C C A++ P C N C Y Y DG SS G V D+ +G +
Sbjct: 156 VSCDQDFCYAINGGPPSYCI-ANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTS 214
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
N + FGC Q G LS + G+LG G+ S++SQL G +R + HC+ G N
Sbjct: 215 ANGSVIFGCSATQ--SGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLN 272
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG------PAELLYSGKSCGLKD 230
G G+ +G P V TP++ N + ++K +G P ++ G G
Sbjct: 273 GGGIFAIGHIVQPK--VNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKG--- 327
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG + AY VY +++S I + D T F+ + +
Sbjct: 328 --TIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFTC-------FQYSESLDD 378
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYL 316
F + F NS+ L V P YL
Sbjct: 379 GFPAVTFHF---ENSLYLKVHPHEYL 401
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 85/284 (29%), Positives = 118/284 (41%), Gaps = 31/284 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P ++ V C+ P
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAP 238
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L + C C Y ++YGDG SIG D L S ++ F G
Sbjct: 239 ACSDL---DTRGCS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 288
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
+ N G + AG+LGLGRG+ S+ V +YG V HC+ G G L G
Sbjct: 289 CGERNEGLFG--EAAGLLGLGRGKTSLPVQTYDKYG---GVFAHCLPARSTGTGYLDFGA 343
Query: 190 GKVPSSGVAWTPMLQNSADLKHY-----ILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
G P++ + TPML ++ +Y I LLY +S I DSG
Sbjct: 344 GS-PAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSV-FATAGTIVDSGTVITRL 401
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
Y + S + K AP L C+ F + QV
Sbjct: 402 PPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCY--DFAGMSQV 443
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 118/285 (41%), Gaps = 31/285 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC+ C + EK + P ++ + C+ P
Sbjct: 186 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANISCAAP 245
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L+ C C Y ++YGDG SIG D L S ++ F G
Sbjct: 246 ACSDLYTKG---CS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAIKGFRFG 295
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
+ N G + AG+LGLGRG+ S+ V +YG V HC +G G L G
Sbjct: 296 CGERNEGLFG--EAAGLLGLGRGKTSLPVQAYDKYG---GVFAHCFPARSSGTGYLDFGP 350
Query: 190 GKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYAY 243
G P+ S TPML ++ L Y +G + GK + I DSG
Sbjct: 351 GSSPAVSTKLTTPMLVDNG-LTFYYVGLTGIRVGGKLLSIPPSVFTTAGTIVDSGTVITR 409
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
Y + S + K AP L C+ F + QV
Sbjct: 410 LPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCY--DFTGMSQV 452
>gi|449533544|ref|XP_004173734.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like, partial [Cucumis sativus]
Length = 408
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 70/251 (27%), Positives = 104/251 (41%), Gaps = 27/251 (10%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCT----GCTKPPEKQYKPHKNI----VP 68
+ +G P F D GSDL WV C+ AP + G +Y+P + +
Sbjct: 107 IDIGTPSVSFLVALDAGSDLLWVPCNCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHIS 166
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPL----RFSNGSVF 123
CS+ C + C+ P C Y I+Y + SS G L+ D+ L S+
Sbjct: 167 CSHNLCDS-----GQSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCENSSNCTI 221
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
P+ GCG Q G LS G+ GLG G IS++S L + L++N C ++G G
Sbjct: 222 QAPVILGCGMKQSG-GYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSG 280
Query: 184 VLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
+F GD G ++ P+ + YI+G + DSG S+
Sbjct: 281 RIFFGDEGPASQQTTSFVPL---DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFT 337
Query: 243 YFTSRVYQEIV 253
Y Y+ IV
Sbjct: 338 YLPEEAYENIV 348
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 90/326 (27%), Positives = 132/326 (40%), Gaps = 47/326 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
Y+A + +G P + + DTGSD+ WV C C C K + + +
Sbjct: 98 YYA-KIGIGTPARDYYVQVDTGSDIMWVNC-IQCNECPKKSSLGMELTLYDIKESLTGKL 155
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV---- 122
V C C A++ P C N C Y Y DG SS G V D+ +G +
Sbjct: 156 VSCDQDFCYAINGGPPSYCI-ANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTS 214
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
N + FGC Q G LS + G+LG G+ S++SQL G +R + HC+ G N
Sbjct: 215 ANGSVIFGCSATQ--SGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLN 272
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQN---------SADLKHYILG-PAELLYSGKSCGLKD 230
G G+ +G P V TP++ N + ++ Y L P ++ G G
Sbjct: 273 GGGIFAIGHIVQPK--VNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKG--- 327
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG + AY VY +++S I + D T F+ + +
Sbjct: 328 --TIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFTC-------FQYSESLDD 378
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYL 316
F + F NS+ L V P YL
Sbjct: 379 GFPAVTFHF---ENSLYLKVHPHEYL 401
>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
Length = 492
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 74/248 (29%), Positives = 103/248 (41%), Gaps = 21/248 (8%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR------- 73
+ +G P F D+GSDL WV CD C C Y + +P
Sbjct: 102 IDIGTPHVSFMVALDSGSDLFWVPCD--CVQCAPLSASHYSSLDRDLSEYSPSQSSTSKQ 159
Query: 74 --CAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSVFN----VP 126
C+ P CK+P C Y I Y + SS G LV D+ L N P
Sbjct: 160 LSCSHRLCDMGPNCKNPKQSCPYSINYYTESTSSSGLLVEDIIHLASGGDDTLNTSVKAP 219
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
+ GCG Q G L G+LGLG IS+ S L + GLI+N C ++ G +F
Sbjct: 220 VIIGCGMKQSG-GYLDGVAPDGLLGLGLQEISVPSFLAKAGLIQNSFSMCFNEDDSGRIF 278
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYFT 245
GD + A P L+ + + YI+G E+ G SC + + DSG S+ +
Sbjct: 279 FGDQGPATQQSA--PFLKLNGNYTTYIVG-VEVCCVGTSCLKQSSFSALVDSGTSFTFLP 335
Query: 246 SRVYQEIV 253
V++ I
Sbjct: 336 DDVFEMIA 343
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/326 (27%), Positives = 131/326 (40%), Gaps = 49/326 (15%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---------EKQYKPHKNI 66
Y+A + +G P K + DTGSD+ WV C C C + +
Sbjct: 80 YYA-KIGIGTPAKSYYVQVDTGSDIMWVNC-IQCKQCPRRSTLGIELTLYNIDESDSGKL 137
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV---- 122
V C + C + CK N C Y YGDG S+ G V D+ G +
Sbjct: 138 VSCDDDFCYQISGGPLSGCK-ANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQT 196
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
N + FGCG Q S + G+LG G+ S++SQL G ++ + HC+ G+N
Sbjct: 197 ANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRN 256
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADL----------KHYILGPAELLYSGKSCGLKD 230
G G+ + G+V V TP++ N + ++ PA+L G G
Sbjct: 257 GGGIFAI--GRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKG--- 311
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG + AY +Y+ +V LK+ DK F+ G+V E
Sbjct: 312 --AIIDSGTTLAYLPEIIYEPLVKK------EPALKVHIVDKDYKC-----FQYSGRVDE 358
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYL 316
F + F NSV L V P YL
Sbjct: 359 GFPNVTFHF---ENSVFLRVYPHDYL 381
>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 529
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 109/264 (41%), Gaps = 32/264 (12%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT----------KPPEKQYKPHKNI---- 66
+ +G P F D GSDL WV CD C C +Y P +++
Sbjct: 104 IDIGTPSTSFLVALDAGSDLLWVPCD--CIHCAPLSASFYSNLDRDLNEYSPSRSLSSKH 161
Query: 67 VPCSNPRCAALHWPNPPRCK-HPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSVFN 124
+ CS+ C CK QC Y I Y D SS G LV D+F L+ +GS N
Sbjct: 162 LSCSHRLCDM-----GSNCKTSKQQQCPYTINYLSDNTSSSGLLVEDIFHLQSGDGSTSN 216
Query: 125 ----VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
P+ GCG Q + G L G++GLG G S+ S L + GLIR+ C ++
Sbjct: 217 SSVQAPVVVGCGMKQ-SGGYLDGTAPDGLIGLGPGESSVPSFLAKSGLIRDSFSLCFNED 275
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGA 239
G LF GD S+ TP L YI+G E G SC + FDSG
Sbjct: 276 DSGRLFFGDQG--STVQQSTPFLLVDGMFSTYIVG-VETCCIGNSCPKVTSFNAQFDSGT 332
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGT 263
S+ + Y I + + T
Sbjct: 333 SFTFLPGHAYGAIAEEFDKQVNAT 356
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 73/256 (28%), Positives = 108/256 (42%), Gaps = 31/256 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F + L +G PP+ F DTGSDL W QC PC C + P ++ + CS+
Sbjct: 366 FLMKLAIGSPPRSFSAIMDTGSDLIWTQC-KPCQQCFDQSTPIFDPKQSSSFYKISCSSE 424
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C AL P +D C+Y YGD S+ G L + F S ++P L FGC
Sbjct: 425 LCGAL-----PTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGC 479
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD-- 189
G + + G AG++GLGRG +S+VSQL+E + I + L LG
Sbjct: 480 GNDNNGDG---FSQGAGLVGLGRGPLSLVSQLKEQKFAYCLTA--IDDSKPSSLLLGSLA 534
Query: 190 ---GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFD 236
K + TP+++N + Y L + G + T +I D
Sbjct: 535 NITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIID 594
Query: 237 SGASYAYFTSRVYQEI 252
SG + Y + + +
Sbjct: 595 SGTTITYVENSAFTSL 610
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 89/308 (28%), Positives = 116/308 (37%), Gaps = 51/308 (16%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVP 68
+ + + V+L VG PP+ DTGSDL W QC APC C P + +P
Sbjct: 88 VTNEYLVHLAVGTPPRPVALTLDTGSDLVWTQC-APCRDCFHQGLPLLDPAASSTYAALP 146
Query: 69 CSNPRCAAL-----------HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRF 117
C PRC AL W N N C Y YGD ++G + TD F
Sbjct: 147 CGAPRCRALPFTSCGGGGRSSWGN------GNRSCAYIYHYGDKSVTVGEIATDRFTFGG 200
Query: 118 SNGS----VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVI 173
NG + LTFGCG+ N G +T G+ G GRGR S+ SQL
Sbjct: 201 DNGDGDSRLPTRRLTFGCGH--FNKGVFQSNET-GIAGFGRGRWSLPSQLNV-----TTF 252
Query: 174 GHC-------------IGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELL 220
+C +G L S V TP+L+N + Y L +
Sbjct: 253 SYCFTSMFESKSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGIS 312
Query: 221 YSGKSCGLKDLTL---IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPIC 277
+ + L I DSGAS VY E V +G P + L +C
Sbjct: 313 VGKTRLAVPEAKLRSTIIDSGASITTLPEAVY-EAVKAEFAAQVGLPPTGVVEGSALDLC 371
Query: 278 WRGPFKAL 285
+ P AL
Sbjct: 372 FALPVTAL 379
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 83/282 (29%), Positives = 119/282 (42%), Gaps = 35/282 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ VN+ +G P K FDTGSDLTW QC C + + P + + C++
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKTYSNISCTST 213
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C+ L N P C N C Y I+YGD ++G D L + VF+ FG
Sbjct: 214 ACSGLKSATGNSPGCSSSN--CVYGIQYGDSSFTVGFFAKD--TLTLTQNDVFD-GFMFG 268
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYG--------LIRNVIGHCIGQNG 181
CG Q+N G TAG++GLGR +SIV Q +++G R GH NG
Sbjct: 269 CG--QNNRGLFGK--TAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSNGHLTFGNG 324
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFD 236
GV K +G+ +TP +S Y + + GK+ + ++ I D
Sbjct: 325 NGV---KTSKAVKNGITFTP-FASSQGATFYFIDVLGISVGGKALSISPMLFQNAGTIID 380
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
SG S VY + S + + P AP L C+
Sbjct: 381 SGTVITRLPSTVYGSLKSTFKQFMSKYP--TAPALSLLDTCY 420
>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
Length = 515
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 76/271 (28%), Positives = 114/271 (42%), Gaps = 23/271 (8%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----VPC 69
+ VG P F DTGSDL WV CD AP +G ++ Y+P ++ +PC
Sbjct: 100 VDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSRHLPC 159
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSV-FNVPL 127
S+ C ++ P C +P C Y I+Y + +S G L+ D L + V N +
Sbjct: 160 SHELCQSV-----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASV 214
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
GCG Q L G+L LG IS+ S L GL++N C ++ G +F
Sbjct: 215 IIGCGQKQSG-DYLDGIAPDGLLALGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFF 273
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSR 247
GD VPS TP + L+ Y + + K + DSG S+
Sbjct: 274 GDQGVPSQQS--TPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPFD 331
Query: 248 VYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
VY+ + + T ++ +D T C+
Sbjct: 332 VYKAFTMEFDKQMNAT--RVPYEDTTWKYCY 360
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 89/345 (25%), Positives = 140/345 (40%), Gaps = 49/345 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V+ ++G P + F DTGSDL +VQC APC C + Y+P + VPC +
Sbjct: 34 YFVDFSLGTPEQKFHLIVDTGSDLAFVQC-APCDLCYEQDGPLYQPSNSSTFTPVPCDSA 92
Query: 73 RCAALHWPNPPRCKH------PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
C + P C P C YE YGD S++G + + G +
Sbjct: 93 ECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETATV----GGIRVNH 148
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-----NG 181
+ FGCG N G GVLGLG+G +S SQ N +C+ +
Sbjct: 149 VAFGCG--NRNQGSFV--SAGGVLGLGQGALSFTSQAGY--AFENKFAYCLTSYLSPTSV 202
Query: 182 RGVLFLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-------- 232
L GD + + + +TP++ N + Y + + + G++ + D
Sbjct: 203 FSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKIDSVGN 262
Query: 233 --LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA-PDDKTLPICWRGPFKALGQVT 289
IFDSG + Y++ + Y I++ + + P A P + LP+C V+
Sbjct: 263 GGTIFDSGTTVTYWSPQAYARIIAAFEKSV---PYPRAPPSPQGLPLCVN--------VS 311
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYLTGKS 334
P+ SFT + P + I VS +I +A L S
Sbjct: 312 GIDHPIYPSFTIEFDQGATYRPNQGNYFIEVSPNIDCLAMLESSS 356
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 81/284 (28%), Positives = 123/284 (43%), Gaps = 26/284 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC-TGCTKPPEKQYKPHKN----IVPCSN 71
+ + +G P K + DTGS LTW+QC +PC C + + P + V CS+
Sbjct: 117 YVTRMGLGTPAKPYIMVVDTGSSLTWLQC-SPCRVSCHRQSGPVFDPKTSSSYAAVSCSS 175
Query: 72 PRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
P+C L NP C P++ C Y+ YGD S+G L D + F SV N +
Sbjct: 176 PQCDGLSTATLNPAVCS-PSNVCIYQASYGDSSFSVGYLSKDT--VSFGANSVPN--FYY 230
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD 189
GCG Q N G +AG++GL R ++S++ QL + +C+ +L
Sbjct: 231 GCG--QDNEGLFG--RSAGLMGLARNKLSLLYQLAP--TLGYSFSYCLPSTSSSG-YLSI 283
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASYAYF 244
G G ++TPM+ N+ D Y + + + +GK S L I DSG
Sbjct: 284 GSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPTIIDSGTVITRL 343
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
+ VY + + + G+ K A L C+ G L V
Sbjct: 344 PTSVYTALSKAVAAAMKGS-TKRAAAYSILDTCFEGQASKLRAV 386
>gi|449451627|ref|XP_004143563.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 532
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 70/251 (27%), Positives = 104/251 (41%), Gaps = 27/251 (10%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCT----GCTKPPEKQYKPHKNI----VP 68
+ +G P F D GSDL WV C+ AP + G +Y+P + +
Sbjct: 107 IDIGTPSVSFLVALDAGSDLLWVPCNCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHIS 166
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPL----RFSNGSVF 123
CS+ C + C+ P C Y I+Y + SS G L+ D+ L S+
Sbjct: 167 CSHNLCDSGQ-----SCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCENSSNCTI 221
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
P+ GCG Q G LS G+ GLG G IS++S L + L++N C ++G G
Sbjct: 222 QAPVILGCGMKQSG-GYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSG 280
Query: 184 VLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
+F GD G ++ P+ + YI+G + DSG S+
Sbjct: 281 RIFFGDEGPASQQTTSFVPL---DGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFT 337
Query: 243 YFTSRVYQEIV 253
Y Y+ IV
Sbjct: 338 YLPEEAYENIV 348
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 76/266 (28%), Positives = 122/266 (45%), Gaps = 42/266 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +++G PP+ F DTGSDL WVQC APC C + P+ + P + C++
Sbjct: 8 YVLQISLGTPPQQFSAIVDTGSDLCWVQC-APCARCFEQPDPLFIPLASSSYSNASCTDS 66
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL P C N C Y YGDG ++ G + L NGS + FGCG
Sbjct: 67 LCDALPRPT---CSMRN-TCTYSYSYGDGSNTRGDFAFETVTL---NGSTL-ARIGFGCG 118
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC-IGQNGRGV---LFLG 188
+NQ G + D G++GLG+G +S+ SQL ++ +C + Q+ G + G
Sbjct: 119 HNQE--GTFAGAD--GLIGLGQGPLSLPSQLNSS--FTHIFSYCLVDQSTTGTFSPITFG 172
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILG--------------PAELLYSGKSCGLKDLTLI 234
+ +S ++TP+LQN + +Y +G P+ G +I
Sbjct: 173 NA-AENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVG----GVI 227
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDL 260
DSG + Y+ + I++ + R +
Sbjct: 228 LDSGTTITYWRLAAFIPILAELRRQI 253
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 73/256 (28%), Positives = 108/256 (42%), Gaps = 31/256 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F + L +G PP+ F DTGSDL W QC PC C + P ++ + CS+
Sbjct: 111 FLMKLAIGSPPRSFSAIMDTGSDLIWTQC-KPCQQCFDQSTPIFDPKQSSSFYKISCSSE 169
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C AL P +D C+Y YGD S+ G L + F S ++P L FGC
Sbjct: 170 LCGAL-----PTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGC 224
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD-- 189
G + + G AG++GLGRG +S+VSQL+E + I + L LG
Sbjct: 225 GNDNNGDG---FSQGAGLVGLGRGPLSLVSQLKEQKFAYCLT--AIDDSKPSSLLLGSLA 279
Query: 190 ---GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFD 236
K + TP+++N + Y L + G + T +I D
Sbjct: 280 NITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIID 339
Query: 237 SGASYAYFTSRVYQEI 252
SG + Y + + +
Sbjct: 340 SGTTITYVENSAFTSL 355
>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
Length = 520
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 78/280 (27%), Positives = 114/280 (40%), Gaps = 30/280 (10%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCT----------GCTKPPEKQYKPHKNI 66
+ VG P F DTGSDL WV CD AP + G KP E H
Sbjct: 106 VDVGTPNTSFLVALDTGSDLFWVPCDCIQCAPLSSYHGSLDRDLGIYKPSESTTSRH--- 162
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSV-FN 124
+PCS+ C+ C +P C Y I+Y + +S G L+ D+ L G N
Sbjct: 163 LPCSHELCSPASG-----CTNPKQPCPYNIDYFSENTTSSGLLIEDMLHLDSREGHAPVN 217
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV 184
+ GCG Q L G+LGLG IS+ S L GL+RN C ++ G
Sbjct: 218 ASVIIGCGKKQSG-SYLEGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGR 276
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
+F GD VP+ TP + + L+ Y + + K + D+G S+
Sbjct: 277 IFFGDQGVPTQ--QSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGAGFQALVDTGTSFTSL 334
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR-GPFK 283
Y+ I + + + + + DD + C+ GP +
Sbjct: 335 PLDAYKSITMEFDKQINAS--RASSDDYSFEYCYSTGPLE 372
>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
Length = 520
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 78/280 (27%), Positives = 114/280 (40%), Gaps = 30/280 (10%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCT----------GCTKPPEKQYKPHKNI 66
+ VG P F DTGSDL WV CD AP + G KP E H
Sbjct: 106 VDVGTPNTSFLVALDTGSDLFWVPCDCIQCAPLSSYHGSLDRDLGIYKPSESTTSRH--- 162
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSV-FN 124
+PCS+ C+ C +P C Y I+Y + +S G L+ D+ L G N
Sbjct: 163 LPCSHELCSPASG-----CTNPKQPCPYNIDYFSENTTSSGLLIEDMLHLDSREGHAPVN 217
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV 184
+ GCG Q L G+LGLG IS+ S L GL+RN C ++ G
Sbjct: 218 ASVIIGCGKKQSG-SYLEGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGR 276
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
+F GD VP+ TP + + L+ Y + + K + D+G S+
Sbjct: 277 IFFGDQGVPTQ--QSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGAGFQALVDTGTSFTSL 334
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR-GPFK 283
Y+ I + + + + + DD + C+ GP +
Sbjct: 335 PLDAYKSITMEFDKQINAS--RASSDDYSFEYCYSTGPLE 372
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 75/273 (27%), Positives = 120/273 (43%), Gaps = 27/273 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
+ V++ +G P + FDTGSDL+WVQC PC+ C + + + P + + VPC++P
Sbjct: 146 YVVSMGLGTPARDMTVVFDTGSDLSWVQC-TPCSDCYEQKDPLFDPARSSTYSAVPCASP 204
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C L + R K +C YE+ YGD + GAL D L S+ +P FGC
Sbjct: 205 ECQGLDSRSCSRDK----KCRYEVVYGDQSQTDGALARDTLTLTQSD----VLPGFVFGC 256
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCIGQNGRGVLFLGDG 190
G + + G D G++GLGR ++S+ SQ +YG +C+ + +L G
Sbjct: 257 G--EQDTGLFGRAD--GLVGLGREKVSLSSQAASKYGA---GFSYCLPSSPSAAGYLSLG 309
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAYFT 245
+ +T M Y + + +G++ + + + DSG
Sbjct: 310 GPAPANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAAGTVIDSGTVITRLP 369
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
RVY + S R + K AP L C+
Sbjct: 370 PRVYAALRSAFARSMGRYGYKRAPALSILDTCY 402
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 83/277 (29%), Positives = 117/277 (42%), Gaps = 33/277 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC+ C K EK + P ++ + C+ P
Sbjct: 161 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPARSSTYANISCAAP 220
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV--PLTFG 130
C+ L+ C C Y ++YGDG SIG D L S ++ FG
Sbjct: 221 ACSDLYIKG---CS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAIKGFRFG 270
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFL 187
CG + N G + AG+LGLGRG+ S+ V +YG V HC +G G L
Sbjct: 271 CG--ERNEGLYG--EAAGLLGLGRGKTSLPVQAYDKYG---GVFAHCFPARSSGTGYLDF 323
Query: 188 GDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASY 241
G G +P+ S TPML ++ +Y+ G + GK + I DSG
Sbjct: 324 GPGSLPAVSAKLTTPMLVDNGPTFYYV-GLTGIRVGGKLLSIPQSVFTTSGTIVDSGTVI 382
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
Y + S + K AP L C+
Sbjct: 383 TRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCY 419
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 83/290 (28%), Positives = 125/290 (43%), Gaps = 34/290 (11%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
SY+ ++ ++G PP DTGSD W QC PC C + P K+ + CS
Sbjct: 88 SYYVMSYSIGTPPFQLYGVVDTGSDGIWFQC-KPCKPCLNQTSPIFNPSKSSTYKNIRCS 146
Query: 71 NPRCAALHWPNPPRC-KHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LT 128
+P C RC + +C+YEI Y D S G + D L ++GS + P +
Sbjct: 147 SPICKR---GEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPISFPKIV 203
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-----NGRG 183
GCG H + +G++G GRG SIVSQL I +C+ N
Sbjct: 204 IGCG---HKNSLTTEGLASGIIGFGRGNFSIVSQLGSS--IGGKFSYCLASLFSKANISS 258
Query: 184 VLFLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLI-------- 234
L+ GD V S GV TP++Q S + +Y LKD +LI
Sbjct: 259 KLYFGDMAVVSGHGVVSTPLIQ-SFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDNEGNAV 317
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTLPICWRGPFK 283
DSG++ + VY ++ + ++ + LK D + L +C++ K
Sbjct: 318 IDSGSTITQLPNDVYSQLETAVISMV---KLKRVKDPTQQLSLCYKTTLK 364
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 113/275 (41%), Gaps = 29/275 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C K EK + P ++ V C+ P
Sbjct: 182 YVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSSTYANVSCAAP 241
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L+ C C Y ++YGDG SIG D L S ++ F G
Sbjct: 242 ACSDLYTRG---CS--GGHCLYSVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 291
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
+ N G + AG+LGLGRG+ S+ V +YG V HC+ +G G L G
Sbjct: 292 CGERNEGLFG--EAAGLLGLGRGKTSLPVQTYDKYG---GVFAHCLPARSSGTGYLDFGP 346
Query: 190 GKVPSSGVAW-TPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAY 243
G + G TPML ++ +Y+ G + G+ + I DSG
Sbjct: 347 GSPAAVGARQTTPMLTDNGPTFYYV-GMTGIRVGGQLLSIPQSVFSTAGTIVDSGTVITR 405
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
Y + S + K AP L C+
Sbjct: 406 LPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCY 440
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 74/286 (25%), Positives = 119/286 (41%), Gaps = 29/286 (10%)
Query: 1 MYVSWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQY 60
M I+ P + +NL++G PP DTGSDLTW QC PCT C K +
Sbjct: 76 MTSDGIQSRLVPSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQC-RPCTHCYKQVVPFF 134
Query: 61 KPHKNIV----PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR 116
P + C C AL N C++ +C + Y DG + G L + +
Sbjct: 135 DPKNSSTYRDSSCGTSFCLAL--GNDRSCRN-GKKCTFMYSYADGSFTGGNLAVETLTVA 191
Query: 117 FSNGSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGH 175
+ G + P FGC H G + ++G++GLG +S++SQL+ I +
Sbjct: 192 STAGKPVSFPGFAFGC---VHRSGGIFDEHSSGIVGLGVAELSMISQLKS--TINGRFSY 246
Query: 176 CI------GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYIL-------GPAELLYS 222
C+ + F G V +G TP++ D +Y++ G L Y
Sbjct: 247 CLLPVFTDSSMSSRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYK 306
Query: 223 G--KSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLK 266
G K +++ +I DSG +Y Y Y ++ + + G ++
Sbjct: 307 GFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVR 352
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 82/266 (30%), Positives = 122/266 (45%), Gaps = 42/266 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F +NL +G P + + DTGSDL W QC PC C P + P K+ +PCS+
Sbjct: 97 FLMNLAIGTPAETYSAIMDTGSDLIWTQCK-PCKVCFDQPTPIFDPEKSSSFSKLPCSSD 155
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL + C +D C+Y YGD S+ G L T+ F F + SV + FGCG
Sbjct: 156 LCVALPISS---C---SDGCEYRYSYGDHSSTQGVLATETF--TFGDASVSKI--GFGCG 205
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFLG 188
+ N G + AG++GLGRG +S++SQL G+ + +C+ G L +G
Sbjct: 206 --EDNRGR-AYSQGAGLVGLGRGPLSLISQL---GVPK--FSYCLTSIDDSKGISTLLVG 257
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDL---TLIFDSG 238
S + TP++QN + Y L G L + ++D LI DSG
Sbjct: 258 SEATVKSAIP-TPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSG 316
Query: 239 ASYAYFTSRVY----QEIVSLIMRDL 260
+ Y + +E +S + D+
Sbjct: 317 TTITYLKDNAFAALKKEFISQMKLDV 342
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 74/252 (29%), Positives = 111/252 (44%), Gaps = 26/252 (10%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKN----IV 67
+ YF L +G P + F DTGS +T+V C A C P K + P + ++
Sbjct: 59 YGYFYATLHLGTPARQFAVIVDTGSTITYVPC-ASCGRNCGPHHKDAAFDPASSSSSAVI 117
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
C + +C PP +C Y+ Y + SS G LV+D LR +G+V +
Sbjct: 118 GCDSDKCIC---GRPPCGCSEKRECTYQRTYAEQSSSAGLLVSDQLQLR--DGAV---EV 169
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGRGVLF 186
FGC G + + G+LGLG +S+V+QL G+I +V C G G G L
Sbjct: 170 VFGC--ETKETGEIYNQEADGILGLGNSEVSLVNQLAGSGVIDDVFALCFGSVEGDGALM 227
Query: 187 LGDGKVPSSGVA--WTPMLQNSADLKHYILGPAELLYSGKSCGLK------DLTLIFDSG 238
LGD VA +T +L + A +Y + L G+ +K + DSG
Sbjct: 228 LGDVDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKPERYEEGYGTVLDSG 287
Query: 239 ASYAYFTSRVYQ 250
++ Y S +Q
Sbjct: 288 TTFTYLPSEAFQ 299
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 82/266 (30%), Positives = 122/266 (45%), Gaps = 42/266 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F +NL +G P + + DTGSDL W QC PC C P + P K+ +PCS+
Sbjct: 97 FLMNLAIGTPAETYSAIMDTGSDLIWTQC-KPCKVCFDQPTPIFDPEKSSSFSKLPCSSD 155
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL + C +D C+Y YGD S+ G L T+ F F + SV + FGCG
Sbjct: 156 LCVALPISS---C---SDGCEYRYSYGDHSSTQGVLATETF--TFGDASVSKI--GFGCG 205
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFLG 188
+ N G + AG++GLGRG +S++SQL G+ + +C+ G L +G
Sbjct: 206 --EDNRGR-AYSQGAGLVGLGRGPLSLISQL---GVPK--FSYCLTSIDDSKGISTLLVG 257
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDL---TLIFDSG 238
S + TP++QN + Y L G L + ++D LI DSG
Sbjct: 258 SEATVKSAIP-TPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSG 316
Query: 239 ASYAYFTSRVY----QEIVSLIMRDL 260
+ Y + +E +S + D+
Sbjct: 317 TTITYLKDSAFAALKKEFISQMKLDV 342
>gi|224097210|ref|XP_002334633.1| predicted protein [Populus trichocarpa]
gi|222873871|gb|EEF11002.1| predicted protein [Populus trichocarpa]
Length = 143
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 43/83 (51%), Positives = 58/83 (69%), Gaps = 3/83 (3%)
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG--PFKALGQVTEYFKPLAL 297
SY Y S+ YQ ++SLI R+L PL+ A DD+TLPICW+G PFK++ V +YFK AL
Sbjct: 1 SYTYLNSQAYQGLISLIKRELSTKPLREALDDQTLPICWKGRKPFKSVHDVKKYFKTFAL 60
Query: 298 SFTNR-RNSVRLVVPPEAYLVIS 319
SF N ++ +L PPEAYL++S
Sbjct: 61 SFANDGKSKTQLEFPPEAYLIVS 83
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 76/258 (29%), Positives = 115/258 (44%), Gaps = 35/258 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC--TGCTKPPEKQYKPHKNI----VPCS 70
+ V + +G PP+ F FDTGSDLTWVQC PC + C E + P K+ VPCS
Sbjct: 122 YVVTIGIGTPPRNFTVLFDTGSDLTWVQC-LPCPDSSCYPQQEPLFDPSKSSTYVDVPCS 180
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR-------FSNGSVF 123
P C H + + C+Y ++YGD + G+L + F L + G VF
Sbjct: 181 APEC---HIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAATGVVF 237
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGR 182
+ +N G AG+LGLGRG SI+SQ R V +C+ G
Sbjct: 238 GCSHEYISVFNDTGMG------VAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPRGS 291
Query: 183 --GVLFLGDGKVPS----SGVAWTPMLQNSADLKH-YILGPAELLYSGKSCGLK----DL 231
G L +G G S +++TP++ + L+ Y++ A + +G + + L
Sbjct: 292 STGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSL 351
Query: 232 TLIFDSGASYAYFTSRVY 249
+ DSG + + Y
Sbjct: 352 GAVIDSGTVVTHMPAAAY 369
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 89/308 (28%), Positives = 138/308 (44%), Gaps = 41/308 (13%)
Query: 17 FAVNLTVGKPP-KLFDFDFDTGSDLTWVQCDAPCTGCTKPP----EKQYKPHKNIVPCSN 71
F +++T+G PP K+F DTGSDLTWVQC PC C K +K+ PC +
Sbjct: 85 FFMSITIGTPPIKVFAIA-DTGSDLTWVQC-KPCQQCYKENGPIFDKKKSSTYKSEPCDS 142
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FG 130
C AL C N+ C Y YGD S G + T+ + ++GS + P T FG
Sbjct: 143 RNCQALS-STERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFG 201
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-----NGRGVL 185
CGYN G +G++GLG G +S++SQL I +C+ NG V+
Sbjct: 202 CGYNN---GGTFDETGSGIIGLGGGHLSLISQLGSS--ISKKFSYCLSHKSATTNGTSVI 256
Query: 186 FLGDGKVPS-----SGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDL--- 231
LG +PS SGV TP++ +Y+ +G ++ Y+G S D
Sbjct: 257 NLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGIL 316
Query: 232 -----TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
+I DSG + + + + S + + G +++ L C++ +G
Sbjct: 317 SETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAK-RVSDPQGLLSHCFKSGSAEIG 375
Query: 287 --QVTEYF 292
++T +F
Sbjct: 376 LPEITVHF 383
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 136/306 (44%), Gaps = 60/306 (19%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPC 69
YF +++ VG PPK DTGSDL+W+QCD PC C + Y P ++NI C
Sbjct: 170 EYF-LDMFVGTPPKHVWLILDTGSDLSWIQCD-PCYDCFEQNGSHYYPKDSSTYRNI-SC 226
Query: 70 SNPRCAALHWPNP-PRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS--NG-SVFN- 124
+PRC + +P CK N C Y +Y DG ++ G ++ F + + NG F
Sbjct: 227 YDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQ 286
Query: 125 -VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGH----CI-- 177
V + FGCG+ N G +G+LGLGRG IS SQ I+++ GH C+
Sbjct: 287 VVDVMFGCGH--WNKGFFYG--ASGLLGLGRGPISFPSQ------IQSIYGHSFSYCLTD 336
Query: 178 ---GQNGRGVLFLGDGK--VPSSGVAWTPML--QNSADLKHYILGPAELLYSGKSCGLKD 230
+ L G+ K + + + +T +L + + D Y L ++ G+ + +
Sbjct: 337 LFSNTSVSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISE 396
Query: 231 LT---------------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKL---APDDK 272
T I DSG++ +F Y I+++ +KL A DD
Sbjct: 397 QTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYD-----IIKEAFEKKIKLQQIAADDF 451
Query: 273 TLPICW 278
+ C+
Sbjct: 452 VMSPCY 457
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 88/303 (29%), Positives = 127/303 (41%), Gaps = 28/303 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT--GCTKPPEKQYKPHKNIVPCSNPRC 74
+ V++ +G P + FDTGSDL+WVQC PC+ GC K + + P + S RC
Sbjct: 154 YVVSVGLGTPARDLTVVFDTGSDLSWVQC-GPCSSGGCYKQQDPLFAPSDSST-FSAVRC 211
Query: 75 AALHWPNPPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRF---SNGSVFN---VP 126
A C +D+C YE+ YGD + G L D L +N S N +P
Sbjct: 212 GARECRARQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAENDNKLP 271
Query: 127 -LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGR 182
FGCG N N G D G+ GLGRG++S+ SQ G +C+ +
Sbjct: 272 GFVFGCGEN--NTGLFGQAD--GLFGLGRGKVSLSSQ--AAGKFGEGFSYCLPSSSSSAP 325
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD----LTLIFDSG 238
G L LG + +TPML + Y + + +G++ + L LI DSG
Sbjct: 326 GYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPLIVDSG 385
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
R Y+ + + + + K AP L C+ F A T +AL
Sbjct: 386 TVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCY--DFTAHANATVSIPAVALV 443
Query: 299 FTN 301
F
Sbjct: 444 FAG 446
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 90/340 (26%), Positives = 139/340 (40%), Gaps = 41/340 (12%)
Query: 4 SWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH 63
S IE + + +N+ +G P F DTGSDL W QC+ PCT C P + P
Sbjct: 83 SGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCE-PCTQCFSQPTPIFNPQ 141
Query: 64 K----NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN 119
+ +PC + C L P N++C Y YGDG ++ G + T+ F F
Sbjct: 142 DSSSFSTLPCESQYCQDL-----PSETCNNNECQYTYGYGDGSTTQGYMATETF--TFET 194
Query: 120 GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-- 177
SV N+ FGCG + G + AG++G+G G +S+ SQL +C+
Sbjct: 195 SSVPNI--AFGCGEDNQGFG---QGNGAGLIGMGWGPLSLPSQLG-----VGQFSYCMTS 244
Query: 178 -GQNGRGVLFLGDGK--VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-- 232
G + L LG VP G T ++ +S + +Y + + G + G+ T
Sbjct: 245 YGSSSPSTLALGSAASGVP-EGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQ 303
Query: 233 --------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK- 283
+I DSG + Y Y V+ D I P + L C++ P
Sbjct: 304 LQDDGTGGMIIDSGTTLTYLPQDAY-NAVAQAFTDQINLP-TVDESSSGLSTCFQQPSDG 361
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTS 323
+ QV E N L+ P E + +++ +S
Sbjct: 362 STVQVPEISMQFDGGVLNLGEQNILISPAEGVICLAMGSS 401
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 87/326 (26%), Positives = 132/326 (40%), Gaps = 45/326 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK---------PPEKQYKPHKNI 66
Y+A + +G P K + DTGSD+ WV C C C + P + + +
Sbjct: 87 YYA-KIGIGTPSKDYYVQVDTGSDIVWVNC-IQCRECPRTSSLGMELTPYDLEESTTGKL 144
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
V C C ++ C N C Y YGDG S+ G V D +G +
Sbjct: 145 VSCDEQFCLEVNGGPLSGCT-TNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTA 203
Query: 123 FNVPLTFGCGYNQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
N + FGCG Q + G G+LG G+ SI+SQL ++ + HC+ G N
Sbjct: 204 ANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTN 263
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNS---------ADLKHYILG-PAELLYSGKSCGLKD 230
G G+ +G P V TP++ N + H IL A++ +G G
Sbjct: 264 GGGIFAMGHVVQPK--VNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRKG--- 318
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG + AY +Y+ +V+ I+ ++ + F+ +V +
Sbjct: 319 --TIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYKC-------FQYSERVDD 369
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYL 316
F P+ F NS+ L V P YL
Sbjct: 370 GFPPVIFHF---ENSLLLKVYPHEYL 392
>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
Length = 523
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 90/284 (31%), Positives = 117/284 (41%), Gaps = 45/284 (15%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC--------------TKPPEKQ 59
F ++AV + +G P F DTGSDL WV CD C C T P+K
Sbjct: 102 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CINCAPLVSPNYRDLKFDTYSPQKS 158
Query: 60 YKPHKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPL--R 116
K VPCS+ C P Y IEY D SS G LV D+ L
Sbjct: 159 STSRK--VPCSSNLCDLQSACRSASSSCP-----YSIEYLSDNTSSTGVLVEDVLYLITE 211
Query: 117 FSNGSVFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIG 174
+ + P+TFGCG Q G +P G+LGLG IS+ S L G+ N
Sbjct: 212 YGQPKIVTAPITFGCGRIQTGSFLGSAAP---NGLLGLGMDSISVPSLLASEGVAANSFS 268
Query: 175 HCIGQNGRGVLFLGDGKVPSSGVAWTPM---LQNSADLKHYILGPAELLYSGKSCGLKDL 231
C G +GRG + GD SS TP+ QN +Y + + KS +
Sbjct: 269 MCFGDDGRGRINFGD--TGSSDQQETPLNIYKQN----PYYNISITGAMVGSKSFN-TNF 321
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP 275
I DSG S+ + +Y EI S + P +L D +LP
Sbjct: 322 NAIVDSGTSFTALSDPMYSEITSSFNSQVQDKPTQL---DSSLP 362
>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 82/259 (31%), Positives = 115/259 (44%), Gaps = 37/259 (14%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE---------KQYKPHK 64
F ++A N+TVG P F DTGSDL W+ CD CT C + + Y P+
Sbjct: 102 FLHYA-NVTVGTPSDWFLVALDTGSDLFWLPCD--CTNCVRELKAPGGSSLDLNIYSPNA 158
Query: 65 NI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSN 119
+ VPC++ C RC P C Y+I Y +G SS G LV D+ L ++
Sbjct: 159 SSTSTKVPCNSTLCT-----RGDRCASPESNCPYQIRYLSNGTSSTGVLVEDVLHLVSND 213
Query: 120 GSVFNVP--LTFGCGYNQ----HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVI 173
S +P +T GCG Q H+ + P+ G+ GLG IS+ S L + G+ N
Sbjct: 214 KSSKAIPARVTLGCGQVQTGVFHDG---AAPN--GLFGLGLEDISVPSVLAKEGIAANSF 268
Query: 174 GHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL 233
C G +G G + GD S TP L Y + ++ G + L +
Sbjct: 269 SMCFGNDGAGRISFGDKG--SVDQRETP-LNIRQPHPTYNITVTKISVEGNTGDL-EFDA 324
Query: 234 IFDSGASYAYFTSRVYQEI 252
+FDSG S+ Y T Y I
Sbjct: 325 VFDSGTSFTYLTDAAYTLI 343
>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
Length = 530
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 78/262 (29%), Positives = 109/262 (41%), Gaps = 29/262 (11%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCT----GCTKPPEKQYKPH----KNIVP 68
+ +G P F D GSDL W+ CD AP + G QY P +
Sbjct: 104 IDIGTPNISFLVALDAGSDLLWIPCDCIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLS 163
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLR-----FSNGSV 122
CS+ C + P C P C Y I Y + SS G L+ D+ L SN SV
Sbjct: 164 CSHQLCES-----SPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASNSSV 218
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
P+ GCG Q G L G++GLG G IS+ S L + GL++N C +
Sbjct: 219 -RAPVIIGCGMRQTG-GYLDGVAPDGLMGLGLGEISVPSFLSKAGLVKNSFSLCFNDDDS 276
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASY 241
G +F GD + + T L + + YI+G E G SC + DSGAS+
Sbjct: 277 GRIFFGDQGLATQQT--TLFLPSDGKYETYIVG-VEACCIGSSCIKQTSFRALVDSGASF 333
Query: 242 AYFTSRVYQEIVSLIMRDLIGT 263
+ Y+ +V + + T
Sbjct: 334 TFLPDESYRNVVDEFDKQVNAT 355
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 82/285 (28%), Positives = 119/285 (41%), Gaps = 31/285 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P ++ + C+ P
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANISCAAP 239
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L + C N C Y ++YGDG SIG D L S ++ F G
Sbjct: 240 ACSDL---DTRGCSGGN--CLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 289
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
+ N G + AG+LGLGRG+ S+ V +YG V HC+ +G G L G
Sbjct: 290 CGERNEGLFG--EAAGLLGLGRGKTSLPVQTYDKYG---GVFAHCLPARSSGTGYLDFGP 344
Query: 190 GKVPSSGVAW-TPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAY 243
G ++G TPML ++ +Y+ G + G+ + I DSG
Sbjct: 345 GSPAAAGARLTTPMLTDNGPTFYYV-GMTGIRVGGQLLSIPQSVFTTAGTIVDSGTVITR 403
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
Y + S + K AP L C+ F + QV
Sbjct: 404 LPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCY--DFTGMSQV 446
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 74/287 (25%), Positives = 126/287 (43%), Gaps = 25/287 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V + +G P + + DTGS L+W+QC C + + P + + C++
Sbjct: 13 YYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 72
Query: 73 RCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTF 129
+C++L N P C+ ++ C Y YGD S+G L DL L S +P +
Sbjct: 73 QCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ----TLPGFVY 128
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI-GQNGRGVLFL 187
GCG Q + G AG+LGLGR ++S++ Q+ ++G +C+ + G G L +
Sbjct: 129 GCG--QDSEGLFG--RAAGILGLGRNKLSMLGQVSSKFGY---AFSYCLPTRGGGGFLSI 181
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIFDSGASYAY 243
G + S +TPM + + Y L + G++ G+ + I DSG
Sbjct: 182 GKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVITR 241
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
VY ++ ++ + AP L C++G K + V E
Sbjct: 242 LPMSVYTPFQQAFVK-IMSSKYARAPGFSILDTCFKGNLKDMQSVPE 287
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 87/326 (26%), Positives = 132/326 (40%), Gaps = 45/326 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK---------PPEKQYKPHKNI 66
Y+A + +G P K + DTGSD+ WV C C C + P + + +
Sbjct: 87 YYA-KIGIGTPSKDYYVQVDTGSDIVWVNC-IQCRECPRTSSLGMELTPYDLEESTTGKL 144
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
V C C ++ C N C Y YGDG S+ G V D +G +
Sbjct: 145 VSCDEQFCLEVNGGPLSGCT-TNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTA 203
Query: 123 FNVPLTFGCGYNQH-NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
N + FGCG Q + G G+LG G+ SI+SQL ++ + HC+ G N
Sbjct: 204 ANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTN 263
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNS---------ADLKHYILG-PAELLYSGKSCGLKD 230
G G+ +G P V TP++ N + H IL A++ +G G
Sbjct: 264 GGGIFAMGHVVQPK--VNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRKG--- 318
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG + AY +Y+ +V+ I+ ++ + F+ +V +
Sbjct: 319 --TIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYKC-------FQYSERVDD 369
Query: 291 YFKPLALSFTNRRNSVRLVVPPEAYL 316
F P+ F NS+ L V P YL
Sbjct: 370 GFPPVIFHF---ENSLLLKVYPHEYL 392
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 68/206 (33%), Positives = 100/206 (48%), Gaps = 28/206 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F + L +G P + + DTGSDL W QC PC C P + P K+ +PCS+
Sbjct: 97 FLMKLAIGTPAETYSAIMDTGSDLIWTQC-KPCKDCFDQPTPIFDPKKSSSFSKLPCSSD 155
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
CAAL + C +D C+Y YGD S+ G L T+ F F + SV + FGCG
Sbjct: 156 LCAALPISS---C---SDGCEYLYSYGDYSSTQGVLATETFA--FGDASVSKI--GFGCG 205
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGV--LFLG 188
+ G AG++GLGRG +S++SQL E +C+ + +G+ L +G
Sbjct: 206 EDNDGSG---FSQGAGLVGLGRGPLSLISQLGE-----PKFSYCLTSMDDSKGISSLLVG 257
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYIL 214
+ + TP++QN + Y L
Sbjct: 258 SEATMKNAIT-TPLIQNPSQPSFYYL 282
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/152 (36%), Positives = 80/152 (52%), Gaps = 15/152 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V+L +G PP + DTGSDL W QC APC C P + K+ +PC +
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCADQPTPYFDVKKSATYRALPCRSS 147
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFGC 131
RCA+L + P C C Y+ YGD S+ G L + F +N + V + FGC
Sbjct: 148 RCASL---SSPSCFK--KMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGC 202
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
G N G L+ +++G++G GRG +S+VSQL
Sbjct: 203 G--SLNAGDLA--NSSGMVGFGRGPLSLVSQL 230
>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 511
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 78/262 (29%), Positives = 109/262 (41%), Gaps = 29/262 (11%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCT----GCTKPPEKQYKPH----KNIVP 68
+ +G P F D GSDL W+ CD AP + G QY P +
Sbjct: 85 IDIGTPNISFLVALDAGSDLLWIPCDCIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLS 144
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLR-----FSNGSV 122
CS+ C + P C P C Y I Y + SS G L+ D+ L SN SV
Sbjct: 145 CSHQLCES-----SPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASNSSV 199
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
P+ GCG Q G L G++GLG G IS+ S L + GL++N C +
Sbjct: 200 -RAPVIIGCGMRQTG-GYLDGVAPDGLMGLGLGEISVPSFLSKAGLVKNSFSLCFNDDDS 257
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASY 241
G +F GD + + T L + + YI+G E G SC + DSGAS+
Sbjct: 258 GRIFFGDQGLATQQT--TLFLPSDGKYETYIVG-VEACCIGSSCIKQTSFRALVDSGASF 314
Query: 242 AYFTSRVYQEIVSLIMRDLIGT 263
+ Y+ +V + + T
Sbjct: 315 TFLPDESYRNVVDEFDKQVNAT 336
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 87/325 (26%), Positives = 132/325 (40%), Gaps = 47/325 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +G P ++F DTGSDLTWVQC +PC C + + P+ + + C +
Sbjct: 13 YLATVRLGTPERVFSVIVDTGSDLTWVQC-SPCGKCYSQNDALFLPNTSTSFTKLACGSA 71
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C L + P C C Y YGDG + G V D + NG VP FGC
Sbjct: 72 LCNGLPF---PMCNQTT--CVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPNFAFGC 126
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-----NGRGVLF 186
G++ N G + D G+LGLG+G +S SQL+ + +C+ L
Sbjct: 127 GHD--NEGSFAGAD--GILGLGQGPLSFHSQLKS--VYNGKFSYCLVDWLAPPTQTSPLL 180
Query: 187 LGDGKVPS-SGVAWTPMLQNSADLKHY------------ILGPAELLYSGKSCGLKDLTL 233
GD VP V + P+L N +Y +L + ++ S G
Sbjct: 181 FGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVG--GAGT 238
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG-PFKALGQVTEYF 292
IFDSG + Y+E+++ + + K+ D L +C G P L
Sbjct: 239 IFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKI-DDISRLDLCLSGFPKDQL------- 290
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLV 317
P + T +V+PP Y +
Sbjct: 291 -PTVPAMTFHFEGGDMVLPPSNYFI 314
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 85/319 (26%), Positives = 134/319 (42%), Gaps = 44/319 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE--------KQYKPHKNIV 67
Y+ + +G PP F DTGS +T+V C + CT C + YKP +
Sbjct: 34 YYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSS-CTHCGNHQDPRFSPALSSSYKPLECGS 92
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVP 126
CS C + Y+ +Y + +S G L D+ + FSN S +
Sbjct: 93 ECSTGFC--------------DGSRKYQRQYAEKSTSSGVLGKDV--IGFSNSSDLGGQR 136
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGV 184
L FGC G L G++GLGRG +SI+ QL E + +V C G G G
Sbjct: 137 LVFGC--ETAETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGA 194
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK------DLTLIFDSG 238
+ LG + P V S +Y L + G LK + DSG
Sbjct: 195 MILGGFQPPKDMVFTASDPHRSP---YYNLMLKGIRVGGSPLRLKPEVFDGKYGTVLDSG 251
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKL-APDDKTLPICWRGPFKALGQVTEYFKPLAL 297
+YAYF +Q S + ++ +G+ ++ PD+K IC+ G + ++++F +
Sbjct: 252 TTYAYFPGAAFQAFKSAV-KEQVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDF 310
Query: 298 SFTNRRNSVRLVVPPEAYL 316
F + ++ + + PE YL
Sbjct: 311 VFGDGQS---VTLSPENYL 326
>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 880
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 89/314 (28%), Positives = 123/314 (39%), Gaps = 44/314 (14%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE----------KQYKPH----KNI 66
+ +G P F D GSD+ WV CD C C QY+P
Sbjct: 109 IDIGTPNVSFLVALDAGSDMLWVPCD--CIECASLSAGNYNVLDRDLNQYRPSLSNTSRH 166
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPL----RFSNGS 121
+PC + C CK D C Y ++Y SS G + D L + + +
Sbjct: 167 LPCGHKLCDV-----HSVCKGSKDPCPYAVQYSSANTSSSGYVFEDKLHLTSNGKHAEQN 221
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
+ GCG Q L GVLGLG G IS+ S L + GLI+N C +N
Sbjct: 222 SVQASIILGCGRKQTGE-YLRGAGPDGVLGLGPGNISVPSLLAKAGLIQNSFSICFEENE 280
Query: 182 RGVLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD--LTLIFDSG 238
G + GD G V TP L YI+G E G C LK+ + DSG
Sbjct: 281 SGRIIFGDQGHVTQHS---TPFLPIDGKFNAYIVG-VESFCVGSLC-LKETRFQALIDSG 335
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+S+ + + VYQ++V + + T + L W + A Q PL L+
Sbjct: 336 SSFTFLPNEVYQKVVIEFDKQVNATSI-------VLQNSWEYCYNASSQELISIPPLNLA 388
Query: 299 FTNRRNSVRLVVPP 312
F+ RN L+ P
Sbjct: 389 FS--RNQTYLIQNP 400
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 79/310 (25%), Positives = 132/310 (42%), Gaps = 26/310 (8%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
Y+ L +G PP++F DTGS +T+V C + C C + + +++P ++ P
Sbjct: 80 YYTTRLWIGTPPQMFALIVDTGSTVTYVPC-STCEQCGRHQDPKFQP--DLSSTYQPVKC 136
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFGCGYN 134
L C + QC YE +Y + +S G L D+ + F N S FGC
Sbjct: 137 TLDC----NCDNDRMQCVYERQYAEMSTSSGVLGEDV--VSFGNQSELAPQRAVFGC--E 188
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLGDGKV 192
G L G++GLGRG +SI+ QL + ++ + C G G G + LG G
Sbjct: 189 NVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLG-GIS 247
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYAYFTS 246
P S + + + +Y + E+ +GK L + DSG +YAY
Sbjct: 248 PPSDMVFAQ--SDPVRSPYYNIDLKEIHVAGKRLPLNPSVFDGKHGSVLDSGTTYAYLPE 305
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSV 306
+ I+++L PD +C+ G + Q+++ F + + F N
Sbjct: 306 EAFLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFPVVDMIFGNGH--- 362
Query: 307 RLVVPPEAYL 316
+ + PE Y+
Sbjct: 363 KYSLSPENYM 372
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 96/200 (48%), Gaps = 15/200 (7%)
Query: 34 FDTGSDLTWVQCDAPCTG--CTKPPEKQYKPHKN----IVPCSNPRCAALHWPNPPRCKH 87
DT SD+ WVQC APC C + Y P K+ PCS+P C L P C
Sbjct: 160 IDTASDVPWVQC-APCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNL-GPYANGCTP 217
Query: 88 PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTA 147
DQC Y ++Y DG +S G ++D+ L + + FGC + PG S T+
Sbjct: 218 AGDQCQYRVQYPDGSASAGTYISDVLTLNPAKPASAISEFRFGCSHALLQPGSFS-NKTS 276
Query: 148 GVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQN--GRGVLFLGDGKVPSSGVAWTPMLQ 204
G++ LGRG S+ +Q + YG +V +C+ G LG +V +S A TPML+
Sbjct: 277 GIMALGRGAQSLPTQTKATYG---DVFSYCLPPTPVHSGFFILGVPRVAASRYAVTPMLR 333
Query: 205 NSADLKHYILGPAELLYSGK 224
+ A Y++ + +GK
Sbjct: 334 SKAAPMLYLVRLIAIEVAGK 353
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 84/318 (26%), Positives = 131/318 (41%), Gaps = 42/318 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK------PPEKQ--YKPHKNIV 67
Y+ L +G PP++F DTGS +T+V C + C C + PE Y+P K +
Sbjct: 111 YYTTRLWIGTPPQMFALIVDTGSTVTYVPC-STCEQCGRHQDPKFQPESSSTYQPVKCTI 169
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VP 126
C+ C QC YE +Y + +S G L D+ + F N S
Sbjct: 170 DCN--------------CDGDRMQCVYERQYAEMSTSSGVLGEDV--ISFGNQSELAPQR 213
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGV 184
FGC G L G++GLGRG +SI+ QL + +I + C G G G
Sbjct: 214 AVFGC--ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGA 271
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSG 238
+ LG G P S + + + +Y + E+ +GK L + DSG
Sbjct: 272 MVLG-GISPPSDMTFA--YSDPDRSPYYNIDLKEMHVAGKRLPLNANVFDGKHGTVLDSG 328
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
+YAY + I+++L PD IC+ G + Q+++ F + +
Sbjct: 329 TTYAYLPEAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPVVDMV 388
Query: 299 FTNRRNSVRLVVPPEAYL 316
F N + + PE Y+
Sbjct: 389 FGNGH---KYSLSPENYM 403
>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 417
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 77/253 (30%), Positives = 107/253 (42%), Gaps = 24/253 (9%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKN- 65
+S + +G P F DTGSDL WV CD AP G + + Y P K+
Sbjct: 1 YSLHYTTVQLGTPGTKFMVALDTGSDLFWVPCDCSRCAPTEGSPYASDFELSVYSPKKSS 60
Query: 66 ---IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDG-GSSIGALVTDLFPLRFSN-- 119
VPC+N CA +C C Y + Y S+ G L+ DL L+ N
Sbjct: 61 TSKTVPCNNSLCAQRD-----QCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTENKH 115
Query: 120 GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
+TFGCG Q L G+ GLG +IS+ S L GL+ N C
Sbjct: 116 SEPIQAYITFGCGQVQSG-SFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSD 174
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
+G G + GD S TP N + I + + G + D+T +FDSG
Sbjct: 175 DGVGRINFGDKG--SLEQEETPFNLNQLHPNYNITVTS--IRVGTTLIDADITALFDSGT 230
Query: 240 SYAYFTSRVYQEI 252
S++YFT +Y ++
Sbjct: 231 SFSYFTDPIYSKL 243
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/329 (27%), Positives = 138/329 (41%), Gaps = 55/329 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
F +++++G P + DTGSDL W QC PC C K + P + VPCS+
Sbjct: 105 FLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPCSSA 163
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C+ L P +C Y YGD S+ G L T+ F L S +P + FGC
Sbjct: 164 SCSDL----PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKS-----KLPGVVFGC 214
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLG 188
G G AG++GLGRG +S+VSQL GL + +C + L LG
Sbjct: 215 GDTNEGDG---FSQGAGLVGLGRGPLSLVSQL---GLDK--FSYCLTSLDDTNNSPLLLG 266
Query: 189 D------GKVPSSGVAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDL---T 232
+S V TP+++N + LK +G + + ++D
Sbjct: 267 SLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGG 326
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQVT 289
+I DSG S Y + Y+ ++ + L D + L +C+R P K + QV
Sbjct: 327 VIVDSGTSITYLEVQGYRA-----LKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVE 381
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVI 318
L F + L +P E Y+V+
Sbjct: 382 --VPRLVFHFDGGAD---LDLPAENYMVL 405
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/253 (32%), Positives = 112/253 (44%), Gaps = 32/253 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC------TKPPE--KQYKPHKN 65
F Y+A +TVG P + DTGSDL W+ CD C C T+ P Y P+ +
Sbjct: 105 FLYYA-EVTVGTPGVPYLVALDTGSDLFWLPCD--CVNCITGLNTTQGPVNFNIYSPNNS 161
Query: 66 I----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSN- 119
V CS+ C+ L +C P+D C Y++ Y D SS G LV D+ L ++
Sbjct: 162 STSKEVQCSSSLCSHLD-----QCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDV 216
Query: 120 -GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
N +T GCG +Q LS G+ GLG +S+ S L GLI N C G
Sbjct: 217 QSKPVNARITLGCGKDQSG-AFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFG 275
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKH--YILGPAELLYSGKSCGLKDLTLIFD 236
G + GD P G TP + +H Y + ++ G L D+ +IFD
Sbjct: 276 PARMGRIEFGDKGSP--GQNETPF---NLGRRHPTYNVSITQIGVGGHISDL-DVAVIFD 329
Query: 237 SGASYAYFTSRVY 249
SG S+ Y Y
Sbjct: 330 SGTSFTYLNDPAY 342
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/329 (27%), Positives = 138/329 (41%), Gaps = 55/329 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
F +++++G P + DTGSDL W QC PC C K + P + VPCS+
Sbjct: 74 FLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPCSSA 132
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C+ L P +C Y YGD S+ G L T+ F L S +P + FGC
Sbjct: 133 SCSDL----PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKS-----KLPGVVFGC 183
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLG 188
G G AG++GLGRG +S+VSQL GL + +C + L LG
Sbjct: 184 GDTNEGDG---FSQGAGLVGLGRGPLSLVSQL---GLDK--FSYCLTSLDDTNNSPLLLG 235
Query: 189 D------GKVPSSGVAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDL---T 232
+S V TP+++N + LK +G + + ++D
Sbjct: 236 SLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGG 295
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQVT 289
+I DSG S Y + Y+ ++ + L D + L +C+R P K + QV
Sbjct: 296 VIVDSGTSITYLEVQGYRA-----LKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVE 350
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVI 318
L F + L +P E Y+V+
Sbjct: 351 --VPRLVFHFDGGAD---LDLPAENYMVL 374
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 77/236 (32%), Positives = 106/236 (44%), Gaps = 20/236 (8%)
Query: 35 DTGSDLTWVQCDAPC--TGCTKPPEKQYKPHKN----IVPCSNPRCAALHWPNPPRCKHP 88
DTGSDLTWVQC +PC T C Y P + ++PC + C L + + C
Sbjct: 114 DTGSDLTWVQC-SPCDNTKCFAQNTPLYDPLNSSTFTLLPCDSQPCTQLPY-SQYVCSDY 171
Query: 89 NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAG 148
D C Y YGD S G L +D L +N + FGCG+ S T G
Sbjct: 172 GD-CIYAYTYGDNSYSYGGLSSDSIRLMLLQLH-YNSKICFGCGFQNKFTADKS-GKTTG 228
Query: 149 VLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDGK-VPSSGVAWTPMLQ 204
++GLG G +S+VSQL + I + +C+ N L G+ V +GV TP++
Sbjct: 229 IVGLGAGPLSLVSQLGDE--IGHKFSYCLLPFSSNSNSKLKFGEAAIVQGNGVVSTPLII 286
Query: 205 NSADLKHYILGPAELLYSGKSC--GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMR 258
DL Y L + K+ G D +I DSG++ Y Y E VSL+
Sbjct: 287 K-PDLPFYYLNLEGITVGAKTVKTGQTDGNIIIDSGSTLTYLEESFYNEFVSLVKE 341
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/329 (27%), Positives = 132/329 (40%), Gaps = 42/329 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
F + L +G PP+ + DTGSDL W QC PCT C P + P K+ +
Sbjct: 97 FLMKLAIGTPPETYSAIMDTGSDLIWTQC-KPCTQCFDQPTPIFDPKKSSSFSKLSCSSK 155
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
L P +D C+Y YGD S+ G L ++ L F SV V FGCG +
Sbjct: 156 LCEALPQST--CSDGCEYLYGYGDYSSTQGMLASE--TLTFGKVSVPEV--AFGCGEDNE 209
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQLRE----YGLIRNVIGHCIGQNGRGVLFLG---D 189
G +G++GLGRG +S+VSQL+E Y L + L +G
Sbjct: 210 GSG---FSQGSGLVGLGRGPLSLVSQLKEPKFSYCLTS------VDDTKASTLLMGSLAS 260
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDSGA 239
K S + TP++QNSA Y L + S +K T LI DSG
Sbjct: 261 VKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGT 320
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
+ Y + ++V+ I P+ L +C+ P G L F
Sbjct: 321 TITYLEQSAF-DLVAKEFTSQINLPVD-NSGSTGLEVCFTLPS---GSTDIEVPKLVFHF 375
Query: 300 TNRRNSVRLVVPPEAYLVISVSTSIIIIA 328
+ L +P E Y++ S + +A
Sbjct: 376 ----DGADLELPAENYMIADASMGVACLA 400
>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 542
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 80/265 (30%), Positives = 113/265 (42%), Gaps = 35/265 (13%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT----------KPPEKQYKPHKNI---- 66
+ +G P F D GSDL WV CD C C +Y P +
Sbjct: 117 IDIGTPHVSFLVALDAGSDLLWVPCD--CLQCAPLSASYYSSLDRDLNEYSPSHSSTSKH 174
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGS---- 121
+ CS+ C P C P C Y ++Y + SS G LV D+ L SNG
Sbjct: 175 LSCSHQLCEL-----GPNCNSPKQPCPYSMDYYTENTSSSGLLVEDILHLA-SNGDNALS 228
Query: 122 -VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
P+ GCG Q G L G++GLG IS+ S L + GLIRN C ++
Sbjct: 229 YSVRAPVVIGCGMKQSG-GYLDGVAPDGLMGLGLAEISVPSFLAKAGLIRNSFSMCFDED 287
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--IFDSG 238
G +F GD + P++ + TP L + Y++G E G SC LK + + D+G
Sbjct: 288 DSGRIFFGD-QGPTTQQS-TPFLTLDGNYTTYVVG-VEGFCVGSSC-LKQTSFRALVDTG 343
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGT 263
S+ + + VY+ I R + T
Sbjct: 344 TSFTFLPNGVYERITEEFDRQVNAT 368
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/329 (27%), Positives = 138/329 (41%), Gaps = 55/329 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
F +++++G P + DTGSDL W QC PC C K + P + VPCS+
Sbjct: 95 FLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPCSSA 153
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C+ L P +C Y YGD S+ G L T+ F L S +P + FGC
Sbjct: 154 SCSDL----PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKS-----KLPGVVFGC 204
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLG 188
G G AG++GLGRG +S+VSQL GL + +C + L LG
Sbjct: 205 GDTNEGDG---FSQGAGLVGLGRGPLSLVSQL---GLDK--FSYCLTSLDDTNNSPLLLG 256
Query: 189 D------GKVPSSGVAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDL---T 232
+S V TP+++N + LK +G + + ++D
Sbjct: 257 SLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGG 316
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQVT 289
+I DSG S Y + Y+ ++ + L D + L +C+R P K + QV
Sbjct: 317 VIVDSGTSITYLEVQGYRA-----LKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVE 371
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVI 318
L F + L +P E Y+V+
Sbjct: 372 --VPRLVFHFDGGAD---LDLPAENYMVL 395
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 66/203 (32%), Positives = 90/203 (44%), Gaps = 22/203 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P + V C+ P
Sbjct: 183 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAP 242
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L C C Y ++YGDG SIG D L S ++ F G
Sbjct: 243 ACSDLDVSG---CS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 292
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDG 190
+ N G + AG+LGLGRG+ S+ Q YG V HC+ G G L G G
Sbjct: 293 CGERNDGLFG--EAAGLLGLGRGKTSLPVQ--TYGKYGGVFAHCLPARSTGTGYLDFGAG 348
Query: 191 KVPSSGVAWTPMLQNSADLKHYI 213
P++ TPML + +Y+
Sbjct: 349 SPPAT--TTTPMLTGNGPTFYYV 369
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 76/272 (27%), Positives = 119/272 (43%), Gaps = 32/272 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPEKQYKPHKNI---VPCS 70
YF V+L +G+PP+ DTGSDL WV+C A C C+ P + H + C
Sbjct: 83 YF-VDLRIGQPPQSLLLIADTGSDLVWVKCSA-CRNCSHHSPATVFFPRHSSTFSPAHCY 140
Query: 71 NPRCAALHWP-NPPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP- 126
+P C + P PRC H + C YE Y DG + G + L+ S+G +
Sbjct: 141 DPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKS 200
Query: 127 LTFGCGY--NQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQNG-- 181
+ FGCG+ + + S GV+GLGRG IS SQL R +G N +C+
Sbjct: 201 VAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFG---NKFSYCLMDYTLS 257
Query: 182 ---RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK--------- 229
L +GDG S + +TP+L N Y + + +G +
Sbjct: 258 PPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDS 317
Query: 230 -DLTLIFDSGASYAYFTSRVYQEIVSLIMRDL 260
+ + DSG + A+ Y+ +++ + + +
Sbjct: 318 GNGGTVMDSGTTLAFLADPAYRLVIAAVKQRI 349
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/253 (32%), Positives = 112/253 (44%), Gaps = 32/253 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC------TKPPE--KQYKPHKN 65
F Y+A +TVG P + DTGSDL W+ CD C C T+ P Y P+ +
Sbjct: 128 FLYYA-EVTVGTPGVPYLVALDTGSDLFWLPCD--CVNCITGLNTTQGPVNFNIYSPNNS 184
Query: 66 I----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSN- 119
V CS+ C+ L +C P+D C Y++ Y D SS G LV D+ L ++
Sbjct: 185 STSKEVQCSSSLCSHLD-----QCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDV 239
Query: 120 -GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
N +T GCG +Q LS G+ GLG +S+ S L GLI N C G
Sbjct: 240 QSKPVNARITLGCGKDQSG-AFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFG 298
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKH--YILGPAELLYSGKSCGLKDLTLIFD 236
G + GD P G TP + +H Y + ++ G L D+ +IFD
Sbjct: 299 PARMGRIEFGDKGSP--GQNETPF---NLGRRHPTYNVSITQIGVGGHISDL-DVAVIFD 352
Query: 237 SGASYAYFTSRVY 249
SG S+ Y Y
Sbjct: 353 SGTSFTYLNDPAY 365
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/203 (32%), Positives = 91/203 (44%), Gaps = 22/203 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P + V C+ P
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAP 239
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L C C Y ++YGDG SIG D L S ++ F G
Sbjct: 240 ACSDLDVSG---CS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 289
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDG 190
+ N G + AG+LGLGRG+ S+ ++ YG V HC+ G G L G G
Sbjct: 290 CGERNDGLFG--EAAGLLGLGRGKTSL--PVQTYGKYGGVFAHCLPPRSTGTGYLDFGAG 345
Query: 191 KVPSSGVAWTPMLQNSADLKHYI 213
P++ TPML + +Y+
Sbjct: 346 SPPAT--TTTPMLTGNGPTFYYV 366
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/313 (26%), Positives = 136/313 (43%), Gaps = 32/313 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C + +++P + V C+
Sbjct: 92 YYTARLWIGTPPQRFALIVDTGSTVTYVPC-STCRHCGSHQDPKFRPEDSETYQPVKCTW 150
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
+C C + QC YE Y + +S GAL D+ + F N + + FG
Sbjct: 151 -QC---------NCDNDRKQCTYERRYAEMSTSSGALGEDV--VSFGNQTELSPQRAIFG 198
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
C ++ G + G++GLGRG +SI+ QL E +I + C G G G + G
Sbjct: 199 CENDE--TGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVLG 256
Query: 191 KV-PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYAY 243
+ P + + +T + +Y + E+ +GK L + DSG +YAY
Sbjct: 257 GISPPADMVFT--RSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAY 314
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
+ IM++ PD + IC+ G + Q+++ F + + F N
Sbjct: 315 LPESAFLAFKHAIMKETHSLKRISGPDPRYNDICFSGAEIDVSQISKSFPVVEMVFGNGH 374
Query: 304 NSVRLVVPPEAYL 316
+L + PE YL
Sbjct: 375 ---KLSLSPENYL 384
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 93/342 (27%), Positives = 130/342 (38%), Gaps = 77/342 (22%)
Query: 2 YVSWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-- 59
+V W+ +F I +G P K + DTGSD+ WV C GC K P K
Sbjct: 20 FVHWLSLYFAKI--------GLGNPSKDYYVQVDTGSDILWVN----CIGCDKCPTKSDL 67
Query: 60 ------YKPHKNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALV 109
Y P ++ V C + C + + P CK C Y + YGDG S+ G V
Sbjct: 68 GIKLTLYDPASSVSATRVSCDDDFCTSTYNGLLPDCKKEL-PCQYNVVYGDGSSTAGYFV 126
Query: 110 TDLFPLRFSNGS----VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE 165
+D G+ + N +TFGCG Q S G+LG
Sbjct: 127 SDAVQFERVTGNLQTGLSNGTVTFGCGAQQSGGLGTSGEALDGILG-------------- 172
Query: 166 YGLIRNVIGHCIGQ-NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILG--------- 215
HC+ NG G+ + G++ S V TPM+ N A Y+
Sbjct: 173 ------AFAHCLDNVNGGGIFAI--GELVSPKVNTTPMVPNQAHYNVYMKEIEVGGTVLE 224
Query: 216 -PAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL 274
P ++ SG G I DSG + AY VY +++ I G L +
Sbjct: 225 LPTDVFDSGDRRG-----TIIDSGTTLAYLPEVVYDSMMNEIRSQQPGLSLHTVEEQF-- 277
Query: 275 PICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
IC FK G V + F + F ++S+ L V P YL
Sbjct: 278 -IC----FKYSGNVDDGFPDIKFHF---KDSLTLTVYPHDYL 311
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 83/258 (32%), Positives = 108/258 (41%), Gaps = 39/258 (15%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT-------KPPEK---QYKPHKNI---- 66
+ +G P F D GSDL+WV CD C C KP ++ +Y+P +
Sbjct: 106 IDIGTPNVSFLVALDAGSDLSWVPCD--CIQCAPLSASLYKPLDRDLSEYRPSLSTTSRH 163
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGD-GGSSIGALVTDLFPL------RFSN 119
+ C++ C CK+ D C Y +Y D SS G LV D+ L S
Sbjct: 164 LSCNHQLCEL-----GSHCKNLKDPCPYIADYADPNTSSSGFLVEDILHLASVSDDSNST 218
Query: 120 GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
+ GCG Q G L GV+GLG G IS+ S L + GLIR C
Sbjct: 219 QKRVQASVILGCGRKQ-TGGYLDGAAPDGVMGLGPGSISVPSLLAKAGLIRKSFSLCFDV 277
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC----GLKDLTLIF 235
NG G + GD S TP+L + Y++ E G SC G K L
Sbjct: 278 NGSGTILFGDQGHTSQKS--TPLLPTQGNYDAYLI-EVESYCVGNSCLKQSGFKALV--- 331
Query: 236 DSGASYAYFTSRVYQEIV 253
DSGAS+ Y VY +IV
Sbjct: 332 DSGASFTYLPIDVYNKIV 349
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/203 (32%), Positives = 91/203 (44%), Gaps = 22/203 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P + V C+ P
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAP 238
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L C C Y ++YGDG SIG D L S ++ F G
Sbjct: 239 ACSDLDVSG---CS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 288
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDG 190
+ N G + AG+LGLGRG+ S+ ++ YG V HC+ G G L G G
Sbjct: 289 CGERNDGLFG--EAAGLLGLGRGKTSL--PVQTYGKYGGVFAHCLPARSTGTGYLDFGAG 344
Query: 191 KVPSSGVAWTPMLQNSADLKHYI 213
P++ TPML + +Y+
Sbjct: 345 SPPAT--TTTPMLTGNGPTFYYV 365
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 70/216 (32%), Positives = 97/216 (44%), Gaps = 32/216 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK----PPEKQYKPHKNIVPCSNP 72
+ V+L +G PP+ DTGSDL W QC PC C P + +++PCS+P
Sbjct: 415 YLVHLAIGTPPQPVQLILDTGSDLVWTQCR-PCPVCFSRALGPLDPSNSSTFDVLPCSSP 473
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVP-LTFG 130
C L W + + N C Y Y DG + G L + F ++G+ VP L FG
Sbjct: 474 VCDNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFG 533
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLF 186
CG N G + +T G+ G GRG +S+ SQL+ + HC G VL
Sbjct: 534 CGL--FNNGIFTSNET-GIAGFGRGALSLPSQLKV-----DNFSHCFTAITGSEPSSVLL 585
Query: 187 --------LGDGKVPSSGVAWTPMLQNSADLKHYIL 214
DG V S TP++QN + L+ Y L
Sbjct: 586 GLPANLYSDADGAVQS-----TPLVQNFSSLRAYYL 616
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 82/285 (28%), Positives = 118/285 (41%), Gaps = 31/285 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P ++ V C+ P
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSSTYANVSCAAP 238
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C L + C C Y ++YGDG SIG D L S ++ F G
Sbjct: 239 ACFDL---DTRGCS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 288
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
+ N G + AG+LGLGRG+ S+ V +YG V HC+ +G G L G
Sbjct: 289 CGERNEGLFG--EAAGLLGLGRGKTSLPVQTYDKYG---GVFAHCLPARSSGTGYLDFGP 343
Query: 190 GKVPSSGVAW-TPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAY 243
G ++G TPML ++ +Y+ G + G+ + I DSG
Sbjct: 344 GSPAAAGARLTTPMLTDNGPTFYYV-GMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITR 402
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
Y + S + + K AP L C+ F + QV
Sbjct: 403 LPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCY--DFTGMSQV 445
>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 568
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 94/336 (27%), Positives = 136/336 (40%), Gaps = 41/336 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC---------TKPPEKQYKPH- 63
F Y+A N++VG P F DTGSDL W+ C+ C+ C K Y P+
Sbjct: 102 FLYYA-NVSVGTPSLDFLVALDTGSDLFWLPCE--CSSCFTYLNTSNGGKFMLNHYSPND 158
Query: 64 ---KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSN 119
+ VPC++ C RC + C YE+ Y SSIG LV D+ L +
Sbjct: 159 STTSSTVPCTSSLCN--------RCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLATDD 210
Query: 120 GSV--FNVPLTFGCGYNQHNP-GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+ +TFGCG Q + P+ G++GLG +IS+ S L + GL N C
Sbjct: 211 SLLKPVEAKITFGCGTVQTGIFATTAAPN--GLIGLGMEKISVPSFLADQGLTSNSFSMC 268
Query: 177 IGQNGRGVLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIF 235
G +G G + GD G + ML+ + + ++ G T IF
Sbjct: 269 FGADGYGRIDFGDTGPADQKQTPFNTMLEYQSYNVTF-----NVINVGGEPNDVPFTAIF 323
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
DSG S+ Y T Y I + + L + C+ P A + F+ L
Sbjct: 324 DSGTSFTYLTEPAYSTITKQMDAGMKLKRYSLFGPNFPFEYCYEIPPGA-----KEFQYL 378
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYLT 331
L+FT + +L + VST II T
Sbjct: 379 TLNFTMKGGDEFTPTDIFVFLPVDVSTMNIIFEETT 414
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 92/340 (27%), Positives = 138/340 (40%), Gaps = 44/340 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ + + +G P + + DTGSDL W QC APC C P + P + + CS P
Sbjct: 92 YLMEMGIGTPARFYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDPANSSTYRSLGCSAP 150
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL++ P C C Y+ YGD S+ G L + F ++ V ++FGCG
Sbjct: 151 ACNALYY---PLCYQ--KTCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISFGCG 205
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG---QNGRGVLFLGD 189
N G L+ + +G++G GRG +S+VSQL G R +C+ R L+ G
Sbjct: 206 --NLNAGSLA--NGSGMVGFGRGSLSLVSQL---GSPR--FSYCLTSFLSPVRSRLYFGA 256
Query: 190 ----GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----------I 234
+S V TP + N A Y L + G + L I
Sbjct: 257 YATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTI 316
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGT-PLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
DSG + Y Y + + L T PL + L C++ P VT
Sbjct: 317 IDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVT--LP 374
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYLTGK 333
L L F + +P + Y+++ ST + +A T
Sbjct: 375 QLVLHF----DGADWELPLQNYMLVDPSTGGLCLAMATSS 410
>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 530
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 77/303 (25%), Positives = 119/303 (39%), Gaps = 31/303 (10%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP--------------HKNI 66
+ +G P F DTGSD+ WV CD C C Y
Sbjct: 106 IDIGTPNVSFLVALDTGSDMFWVPCD--CIECAPLSAAFYNALDRDLNQYSPSLSSSSRH 163
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGS--VF 123
+PC + C CK D+C Y EY D SS G L+ D L +N +
Sbjct: 164 LPCGHQLCN-----QNSNCKGFKDRCPYIKEYTSDNTSSSGFLIEDKLHLASNNATKNSI 218
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+ GCG Q L G+LGLG G IS+ + L + GLIRN I C+ + G G
Sbjct: 219 QASVILGCGRKQSGYF-LEGAAPNGMLGLGPGSISVPALLAKAGLIRNSISICLNEKGSG 277
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
+ GD + + TP L + +L +Y +G + D+G S+ Y
Sbjct: 278 RILFGDQGHATQRRS-TPFLLDDGELLNYFVGVERFCVGSFCYKETEFKAFIDTGTSFTY 336
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
VY+ +V+ + + T + + C + A + + F P+ +F+ +
Sbjct: 337 LPKGVYETVVAEFEKQVHATRIT-SQIQSDFNCC----YNASSRESNNFPPMKFTFSKNQ 391
Query: 304 NSV 306
+ +
Sbjct: 392 SFI 394
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 86/300 (28%), Positives = 118/300 (39%), Gaps = 36/300 (12%)
Query: 3 VSWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP 62
S I + ++ + G P DTGSDLTWVQC PC+ C + + P
Sbjct: 134 TSGIRLQTLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCK-PCSACYAQRDPLFDP 192
Query: 63 HKN----IVPCSNPRCA---ALHWPNPPRCKHP---NDQCDYEIEYGDGGSSIGALVTDL 112
+ V C+ CA P C +++C Y + YGDG S G L TD
Sbjct: 193 AGSATYAAVRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDT 252
Query: 113 FPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ-LREYGLIRN 171
L G FGCG + N G TAG++GLGR +S+VSQ YG
Sbjct: 253 VAL----GGASLGGFVFGCGLS--NRGLFG--GTAGLMGLGRTELSLVSQTASRYG---G 301
Query: 172 VIGHCI----GQNGRGVLFLGDGKVPSSG------VAWTPMLQNSADLKHYILGPAELLY 221
V +C+ + G L LG G +S VA+T M+ + A Y L
Sbjct: 302 VFSYCLPAATSGDASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAV 361
Query: 222 SGKSC---GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
G + GL ++ DSG VY+ + + MR AP L C+
Sbjct: 362 GGTALAAQGLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTCY 421
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 95/321 (29%), Positives = 139/321 (43%), Gaps = 44/321 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + +G P KL DTGSD+ W+QC +PC C K + + P + + CS
Sbjct: 14 YF-VRVGIGSPTKLQYLVMDTGSDVPWIQC-SPCKSCYKQNDAVFDPRASSSFRRLSCST 71
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P+C L + C +++C Y++ YGDG ++G L +D F + S P+ FGC
Sbjct: 72 PQCKLL---DVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRGRTS----PVVFGC 124
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G++ N G AG+LGLG G++S SQL ++ G L GD
Sbjct: 125 GHD--NEGLF--VGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLFGDSA 180
Query: 192 VPSSG-VAWTPMLQN-------SADLKHYILGPAELLYSGKSCGLKDLT----LIFDSGA 239
+P+S A+T +L+N A L +G L + L T +I DSG
Sbjct: 181 LPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGT 240
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTP---LKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
S + Y +MRD + L A D C+ F AL VT ++
Sbjct: 241 SVTRLPTYAYT-----VMRDAFRSATQKLPRAADFSLFDTCY--DFSALTSVT--IPTVS 291
Query: 297 LSFTNRRNSVRLVVPPEAYLV 317
F + +PP YLV
Sbjct: 292 FHF---EGGASVQLPPSNYLV 309
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 78/303 (25%), Positives = 133/303 (43%), Gaps = 31/303 (10%)
Query: 12 PIFSYFAVNLTVGKPP-KLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---- 66
P + ++ +VG PP K++ F DTGS++ W+QC PC C + P K+
Sbjct: 84 PELGEYLISYSVGTPPFKVYGF-MDTGSNIVWLQCQ-PCNTCFNQTSPIFNPSKSSSYKN 141
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
+PC++ C + + C + D C+Y I YG S G L D L ++GS P
Sbjct: 142 IPCTSSTCKDTNDTH-ISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFP 200
Query: 127 -LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQN 180
+ GCG H ++GV+G+GRG +S++ Q+ + + +C+ N
Sbjct: 201 NIVIGCG---HINVLQDNSQSSGVVGMGRGPMSLIKQVGSSS-VGSKFSYCLIPYNSDSN 256
Query: 181 GRGVLFLGDGKVPSSG-VAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDLT 232
L G+ V S V TPM++ + +Y L G + Y G+
Sbjct: 257 SSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEY-GERSNASTQN 315
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG--QVTE 290
++ DSG + ++VS + ++ + P ++ P D L +C+ K L +T
Sbjct: 316 ILIDSGTPLTMLPNLFLSKLVSYVAQE-VKLP-RIEPPDHHLSLCYNTTGKQLNVPDITA 373
Query: 291 YFK 293
+F
Sbjct: 374 HFN 376
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 91/335 (27%), Positives = 145/335 (43%), Gaps = 42/335 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ +++ +G PP+ + DTGSDL W QC APC C P + P ++ +PC++P
Sbjct: 89 YLMSMGIGTPPRYYSAILDTGSDLIWTQC-APCMLCVDQPTPFFDPAQSPSYAKLPCNSP 147
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL++P R + C Y+ YGD ++ G L + F ++ V + FGCG
Sbjct: 148 MCNALYYPLCYR-----NVCVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRIAFGCG 202
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQL---REYGLIRNVIGHCIGQNGRGVLFLGD 189
N G L + +G++G GRG +S+VSQL R + + + + G +
Sbjct: 203 --NLNAGSLF--NGSGMVGFGRGPLSLVSQLGSPRFSYCLTSFMSPVPSRLYFGAYATLN 258
Query: 190 GKVPSSG--VAWTPMLQNSADLKHYILGPAELLYSGK---------SCGLKDLT--LIFD 236
S+G V TP + N Y L + G+ + D T +I D
Sbjct: 259 STSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGTGGVIID 318
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTLPIC--WRGPFKALGQVTEYFK 293
SG++ Y Y ++V D +G PL A L C W P + + + E
Sbjct: 319 SGSTITYLARAAY-DMVHQAFADQVGLPLTNATSLADVLDTCFVWPPPPRKIVTMPE--- 374
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIA 328
LA F + +P E Y++I T + +A
Sbjct: 375 -LAFHF----EGANMELPLENYMLIDGDTGNLCLA 404
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 65/204 (31%), Positives = 89/204 (43%), Gaps = 20/204 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + +G PP D+GSD+ WVQC PC C + + P + VPC +
Sbjct: 127 YF-VRVGIGSPPTEQYLVVDSGSDVIWVQCK-PCLECYAQADPLFDPATSATFSAVPCGS 184
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L C + CDYE+ YGDG + GAL + L G + GC
Sbjct: 185 AVCRTLRTSG---CGD-SGGCDYEVSYGDGSYTKGALALETLTL----GGTAVEGVAIGC 236
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G+ N G AG+LGLG G +S+V QL +C+ G G L LG +
Sbjct: 237 GH--RNRGLFV--GAAGLLGLGWGPMSLVGQLGG--AAGGAFSYCLASRGAGSLVLGRSE 290
Query: 192 VPSSGVAWTPMLQNSADLKHYILG 215
G W P+++N Y +G
Sbjct: 291 AVPEGAVWVPLVRNPQAPSFYYVG 314
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 55/159 (34%), Positives = 74/159 (46%), Gaps = 15/159 (9%)
Query: 10 FFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN---- 65
P + V L +G PP F DT SDL W QC PCTGC + + P +
Sbjct: 82 IMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYA 140
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+PCS+ C L + RC H +D+ C Y Y ++ G L D + G
Sbjct: 141 ALPCSSDTCDEL---DVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVI----GEDAF 193
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
+ FGC + P PP +GV+GLGRG +S+VSQL
Sbjct: 194 RGVAFGCSTSSTGGAP--PPQASGVVGLGRGPLSLVSQL 230
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 55/159 (34%), Positives = 74/159 (46%), Gaps = 15/159 (9%)
Query: 10 FFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN---- 65
P + V L +G PP F DT SDL W QC PCTGC + + P +
Sbjct: 82 IMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYA 140
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+PCS+ C L + RC H +D+ C Y Y ++ G L D + G
Sbjct: 141 ALPCSSDTCDEL---DVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVI----GEDAF 193
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
+ FGC + P PP +GV+GLGRG +S+VSQL
Sbjct: 194 RGVAFGCSTSSTGGAP--PPQASGVVGLGRGPLSLVSQL 230
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 74/255 (29%), Positives = 110/255 (43%), Gaps = 34/255 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVP----CSNP 72
+ + L +G PP + DTGSDL W QC PCT C K P + P K+ C +
Sbjct: 108 YLMELAIGTPPVSYPAVLDTGSDLIWTQC-KPCTQCYKQPTPIFDPKKSSSFSKVSCGSS 166
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+A+ C +D C+Y YGD + G L T+ F S V + FGCG
Sbjct: 167 LCSAVPSST---C---SDGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCG 220
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGD 189
+ G +G++GLGRG +S+VSQL+E +C+ +L LG
Sbjct: 221 EDNEGDG---FEQASGLVGLGRGPLSLVSQLKE-----PRFSYCLTPMDDTKESILLLGS 272
Query: 190 -GKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDS 237
GKV + V TP+L+N Y L + ++ T +I DS
Sbjct: 273 LGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDS 332
Query: 238 GASYAYFTSRVYQEI 252
G + Y + ++ +
Sbjct: 333 GTTITYIEQKAFEAL 347
>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 544
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 80/256 (31%), Positives = 113/256 (44%), Gaps = 32/256 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ---------YKPHK 64
F +FA N++VG PP F DTGSDL W+ C+ CT C + + Q Y+ K
Sbjct: 111 FLHFA-NVSVGTPPLWFLVALDTGSDLFWLPCN--CTSCVRGLKTQNGKVIDLNIYELDK 167
Query: 65 NI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSN 119
+ VPC++ C +C C YE+EY + SS G LV D+ L N
Sbjct: 168 SSTRKNVPCNSNMCKQ------TQCHSSGSSCRYEVEYLSNDTSSSGFLVEDVLHLITDN 221
Query: 120 GSV--FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
+ +T GCG Q L+ G+ GLG +S+ S L + GLI + C
Sbjct: 222 DQTKDIDTQITIGCGQVQTGVF-LNGAAPNGLFGLGMENVSVPSILAQKGLISDSFSMCF 280
Query: 178 GQNGRGVLFLGDGKVPSSGVAWTPM-LQNSADLKHYILGPAELLYSGKSCGLKDLTLIFD 236
G +G G + GD SS TP L+ S Y + +++ G + + IFD
Sbjct: 281 GSDGSGRITFGD--TGSSDQGKTPFNLRESHPT--YNVTITQIIVGGYAAD-HEFHAIFD 335
Query: 237 SGASYAYFTSRVYQEI 252
SG S+ Y Y I
Sbjct: 336 SGTSFTYLNDPAYTLI 351
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 78/236 (33%), Positives = 115/236 (48%), Gaps = 34/236 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG--CTKPPEKQYKPHK----NIVPCS 70
+ V +++G P + DTGSD++WVQC PC C + + P + + VPC+
Sbjct: 142 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCK-PCPSPPCYSQRDPLFDPTRSSSYSAVPCA 200
Query: 71 NPRCAALH-WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
C+ L + N C QC Y + YGDG ++ G +D L SN F
Sbjct: 201 AASCSQLALYSN--GCS--GGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNA---LKGFLF 253
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCI--GQNGRGVLF 186
GCG+ Q G + D G+LGLGR S+VSQ YG V +C+ QN G +
Sbjct: 254 GCGHAQQ--GLFAGVD--GLLGLGRQGQSLVSQASSTYG---GVFSYCLPPTQNSVGYIS 306
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---IFDSGA 239
LG G ++G + TP+L S D +YI ++ +G S G + L++ +F SGA
Sbjct: 307 LG-GPSSTAGFSTTPLLTASNDPTYYI-----VMLAGISVGGQPLSIDASVFASGA 356
>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
Length = 829
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 79/262 (30%), Positives = 113/262 (43%), Gaps = 40/262 (15%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-YKPHKNI------ 66
F +FA N++VG PP F DTGSDL W+ C+ CT C + E K NI
Sbjct: 100 FLHFA-NVSVGTPPLSFLVALDTGSDLFWLPCN--CTKCVRGVESNGEKIAFNIYDLKGS 156
Query: 67 -----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNG 120
V C++ C +C + C YE+ Y +G S+ G LV D+ L +
Sbjct: 157 STSQTVLCNSNLCELQR-----QCPSSDSICPYEVNYLSNGTSTTGFLVEDVLHLITDDD 211
Query: 121 SV--FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
+ +TFGCG Q L G+ GLG G S+ S L + GL N C G
Sbjct: 212 ETKDADTRITFGCGQVQ-TGAFLDGAAPNGLFGLGMGNESVPSILAKEGLTSNSFSMCFG 270
Query: 179 QNGRGVLFLGD------GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT 232
+G G + GD GK P + A P Y + +++ G + L +
Sbjct: 271 SDGLGRITFGDNSSLVQGKTPFNLRALHPT---------YNITVTQIIVGGNAADL-EFH 320
Query: 233 LIFDSGASYAYFTSRVYQEIVS 254
IFDSG S+ + Y++I +
Sbjct: 321 AIFDSGTSFTHLNDPAYKQITN 342
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 56/162 (34%), Positives = 75/162 (46%), Gaps = 15/162 (9%)
Query: 7 EFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN- 65
E P + V L +G PP F DT SDL W QC PCTGC + + P +
Sbjct: 79 ETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSS 137
Query: 66 ---IVPCSNPRCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGS 121
+PCS+ C L + RC H +D+ C Y Y ++ G L D + G
Sbjct: 138 TYAALPCSSDTCDEL---DVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVI----GE 190
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
+ FGC + P PP +GV+GLGRG +S+VSQL
Sbjct: 191 DAFRGVAFGCSTSSTGGAP--PPQASGVVGLGRGPLSLVSQL 230
>gi|6579210|gb|AAF18253.1|AC011438_15 T23G18.7 [Arabidopsis thaliana]
Length = 566
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 74/262 (28%), Positives = 118/262 (45%), Gaps = 50/262 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYK---------PHKNIV 67
+ + +G PP+ F+ DTGSD+ WV C + C GC K E Q + ++V
Sbjct: 132 YYTKVKLGTPPREFNVQIDTGSDVLWVSCTS-CNGCPKTSELQIQLSFFDPGVSSSASLV 190
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
CS+ RC + ++ C PN+ C Y +YGDG + G ++D
Sbjct: 191 SCSDRRCYS-NFQTESGCS-PNNLCSYSFKYGDGSGTSGYYISD---------------- 232
Query: 128 TFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRG 183
F C Q G L P A G+ GLG+G +S++SQL GL V HC+ ++G G
Sbjct: 233 -FMCSNLQS--GDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGG 289
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK---------SCGLKDLTLI 234
++ LG K P + +TP++ + HY + + +G+ + D T+I
Sbjct: 290 IMVLGQIKRPDT--VYTPLVPSQP---HYNVNLQSIAVNGQILPIDPSVFTIATGDGTII 344
Query: 235 FDSGASYAYFTSRVYQEIVSLI 256
D+G + AY Y + +
Sbjct: 345 -DTGTTLAYLPDEAYSPFIQAV 365
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 88/308 (28%), Positives = 139/308 (45%), Gaps = 41/308 (13%)
Query: 17 FAVNLTVGKPP-KLFDFDFDTGSDLTWVQCDAPCTGCTKPP----EKQYKPHKNIVPCSN 71
F +++T+G PP K+F DTGSDLTWVQC PC C K +K+ PC +
Sbjct: 85 FFMSITIGTPPMKVFAI-ADTGSDLTWVQC-KPCQQCYKENGPIFDKKKSSTYKSEPCDS 142
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FG 130
C AL + C + C Y YGD S G + T+ + ++GS + P T FG
Sbjct: 143 RNCHALS-SSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSPVSFPGTVFG 201
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-----NGRGVL 185
CGYN G +G++GLG G +S++SQL I +C+ NG V+
Sbjct: 202 CGYNN---GGTFDETGSGIIGLGGGHLSLISQLGSS--ISKKFSYCLSHKSATTNGTSVI 256
Query: 186 FLGDGKVPS-----SGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKD---- 230
LG +PS SGV TP++ +Y+ +G ++ Y+G S D
Sbjct: 257 NLGTNSIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIPYTGSSYNPNDGGIF 316
Query: 231 ----LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
+I DSG + S + + + + +L+ +++ L C++ +G
Sbjct: 317 SETSGNIIIDSGTTLTLLDSGFFDKFGAAV-EELVTGAKRVSDPQGLLSHCFKSGSAEIG 375
Query: 287 --QVTEYF 292
++T +F
Sbjct: 376 LPEITVHF 383
>gi|255637574|gb|ACU19113.1| unknown [Glycine max]
Length = 290
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 61/206 (29%), Positives = 95/206 (46%), Gaps = 26/206 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH---------KNIV 67
+ + +G PP+ DTGSD+ WV C + C GC + Q + + +++
Sbjct: 77 YYTKVKLGTPPRELYVQIDTGSDVLWVSCGS-CNGCPQTSGLQIQLNYFDPGSSSTSSLI 135
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
C + RC + + C N+QC Y +YGDG + G V+DL S+F L
Sbjct: 136 SCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHF----ASIFEGTL 191
Query: 128 T--------FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-G 178
T FGC Q S G+ G G+ +S++SQL G+ V HC+ G
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKG 251
Query: 179 QN-GRGVLFLGDGKVPSSGVAWTPML 203
N G GVL LG+ P+ + ++P++
Sbjct: 252 DNSGGGVLVLGEIVEPN--IVYSPLV 275
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 96/208 (46%), Gaps = 19/208 (9%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPC 69
+YF V + +G P + FDTGSDLTW QC+ PC G C K + + P K+ + C
Sbjct: 135 NYFVV-VGLGTPKRDLSLVFDTGSDLTWTQCE-PCAGSCYKQQDAIFDPSKSSSYINITC 192
Query: 70 SNPRCAALHWPN-PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
++ C L RC C Y I+YGD +S+G L + + ++
Sbjct: 193 TSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLTITATD---IVDDFL 249
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLF 186
FGCG Q N G S +AG++GLGR IS V Q + + +C+ + G L
Sbjct: 250 FGCG--QDNEGLFS--GSAGLIGLGRHPISFVQQTSS--IYNKIFSYCLPSTSSSLGHLT 303
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYIL 214
G ++ + +TP+ S D Y L
Sbjct: 304 FGASAATNANLKYTPLSTISGDNTFYGL 331
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 142/341 (41%), Gaps = 54/341 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
F +++++G P + DTGSDL W QC PC C + P + +PCS+
Sbjct: 118 FLMDMSIGTPALAYAAIVDTGSDLVWTQCK-PCVECFNQSTPVFDPSSSSTYSTLPCSSS 176
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C+ L C C Y YGD S+ G L + F L + +P + FGC
Sbjct: 177 LCSDLPTST---CTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKT-----KLPGVAFGC 228
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLG 188
G G AG++GLGRG +S+VSQL GL + +C + + L LG
Sbjct: 229 GDTNEGDG---FTQGAGLVGLGRGPLSLVSQL---GLGK--FSYCLTSLDDTSKSPLLLG 280
Query: 189 D------GKVPSSGVAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDL---T 232
++ + TP+++N + LK +G + G + ++D
Sbjct: 281 SLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGG 340
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQVT 289
+I DSG S Y + Y+ ++ +KL D + L +C++ P + V
Sbjct: 341 VIVDSGTSITYLELQGYRP-----LKKAFAAQMKLPVADGSAVGLDLCFKAPASGVDDVE 395
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYL 330
L L F + L +P E Y+V+ ++ + + +
Sbjct: 396 --VPKLVLHFDGGAD---LDLPAENYMVLDSASGALCLTVM 431
>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
Length = 506
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 80/265 (30%), Positives = 111/265 (41%), Gaps = 32/265 (12%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGC-----TKPPEKQYKPHKN----IV 67
+ +G P F DTGSDL W+ C+ AP T +Y P + +
Sbjct: 104 IDIGTPSVSFLVALDTGSDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVF 163
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPL------RFSNG 120
CS+ C + C+ P +QC Y + Y G SS G LV D+ L R NG
Sbjct: 164 LCSHKLCDS-----ASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNG 218
Query: 121 SV-FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
S + GCG Q L G++GLG IS+ S L + GL+RN C +
Sbjct: 219 SSSVKARVVIGCGKKQSG-DYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDE 277
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSG 238
G ++ GD + S TP LQ + YI+G E G SC T DSG
Sbjct: 278 EDSGRIYFGD--MGPSIQQSTPFLQLENN-SGYIVG-VEACCIGNSCLKQTSFTTFIDSG 333
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGT 263
S+ Y +Y+++ I R + T
Sbjct: 334 QSFTYLPEEIYRKVALEIDRHINAT 358
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 77/272 (28%), Positives = 117/272 (43%), Gaps = 32/272 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +VG PP DTGSD+ W+QC PC C K + P K+ +PCS+
Sbjct: 87 YLMTYSVGTPPFNVYGVVDTGSDIVWLQC-KPCEQCYKQTTPIFNPSKSSSYKNIPCSSN 145
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
C ++ + + C N C+Y I + D S G L + L + G + P T GC
Sbjct: 146 LCQSVRYTS---CNKQN-SCEYTINFSDQSYSQGELSVETLTLDSTTGHSVSFPKTVIGC 201
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC-----IGQNGRGVLF 186
G HN + +T+G++GLG G +S+ +QL+ I +C + N L
Sbjct: 202 G---HNNRGMFQGETSGIVGLGIGPVSLTTQLKSS--IGGKFSYCLLPLLVDSNKTSKLN 256
Query: 187 LGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL------TLIFDSGA 239
GD V S GV TP ++ +Y+ A K + L +I DSG
Sbjct: 257 FGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEA-FSVGNKRIEFEVLDDSEEGNIILDSGT 315
Query: 240 SYAYFTSRVYQEIVS----LIMRDLIGTPLKL 267
+ S VY + S L+ D + P +L
Sbjct: 316 TLTLLPSHVYTNLESAVAQLVKLDRVDDPNQL 347
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 71/279 (25%), Positives = 113/279 (40%), Gaps = 37/279 (13%)
Query: 1 MYVSWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQY 60
M I+ P + +NL +G PP DTGSDLTW QC PCT C K +
Sbjct: 76 MTSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQC-RPCTHCYKQVVPLF 134
Query: 61 KPHKNIV----PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR 116
P + C C AL R +C + Y DG + G L ++ +
Sbjct: 135 DPKNSSTYRDSSCGTSFCLAL---GKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVD 191
Query: 117 FSNGSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGH 175
+ G + P FGCG H+ G + ++G++GLG G +S++SQL+ I + +
Sbjct: 192 STAGKPVSFPGFAFGCG---HSSGGIFDKSSSGIVGLGGGELSLISQLKS--TINGLFSY 246
Query: 176 CI------GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSG--KSCG 227
C+ + F G+V G TP+ L Y G K
Sbjct: 247 CLLPVSTDSSISSRINFGASGRVSGYGTVSTPL---------------RLPYKGYSKKTE 291
Query: 228 LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLK 266
+++ +I DSG +Y + Y ++ + + G ++
Sbjct: 292 VEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVR 330
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 91/329 (27%), Positives = 140/329 (42%), Gaps = 50/329 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC-TGCTKPPEKQYKPHKN----IVPCSN 71
+ + L +G PP + DTGSDL W QC APC + C K + Y P + ++PC++
Sbjct: 88 YIMTLAIGTPPLSYPAIADTGSDLIWTQC-APCGSQCFKQAGQPYNPSSSTTFGVLPCNS 146
Query: 72 --PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LT 128
CAAL P+PP P C Y YG G ++ G + F + VP +
Sbjct: 147 SVSMCAALAGPSPP----PGCSCMYNQTYGTGWTA-GIQSVETFTFGSTPADQTRVPGIA 201
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG 188
FGC N +AG++GLGRG +S+VSQL G+ + N L LG
Sbjct: 202 FGC----SNASSDDWNGSAGLVGLGRGSMSLVSQLGA-GMFSYCLTPFQDANSTSTLLLG 256
Query: 189 -DGKVPSSGVAWTPMLQ--NSADLKHYILGPAELLYSGKSCGLKDLT------------- 232
+ +GV TP + + A + Y L +G S G L+
Sbjct: 257 PSAALNGTGVLTTPFVASPSKAPMSTYYY----LNLTGISIGTTALSIPPNAFALRTDGT 312
Query: 233 --LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
LI DSG + YQ++ + I L+ P+ D L +C+ +E
Sbjct: 313 GGLIIDSGTTITSLVDAAYQQVRAAI-ESLVTLPVADGSDSTGLDLCF-------ALTSE 364
Query: 291 YFKPLAL-SFTNRRNSVRLVVPPEAYLVI 318
P ++ S T + +V+P + Y+++
Sbjct: 365 TSTPPSMPSMTFHFDGADMVLPVDNYMIL 393
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 95/321 (29%), Positives = 139/321 (43%), Gaps = 44/321 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + +G P KL DTGSD+ W+QC +PC C K + + P + + CS
Sbjct: 14 YF-VRVGIGSPTKLQYLVMDTGSDVPWIQC-SPCKSCYKQNDAVFDPRASSSFRRLSCST 71
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P+C L + C +++C Y++ YGDG ++G L +D F + S P+ FGC
Sbjct: 72 PQCKLL---DVKACASTDNRCLYQVSYGDGSFTVGDLASDSFLVSRGRTS----PVVFGC 124
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G++ N G AG+LGLG G++S SQL ++ G L GD
Sbjct: 125 GHD--NEGLF--VGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLFGDSA 180
Query: 192 VPSSG-VAWTPMLQN-------SADLKHYILGPAELLYSGKSCGLKDLT----LIFDSGA 239
+P+S A+T +L+N A L +G L + L T +I DSG
Sbjct: 181 LPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGT 240
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTP---LKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
S + Y +MRD + L A D C+ F AL VT ++
Sbjct: 241 SVTRLPTYAYT-----VMRDAFRSATQKLPRAADFSLFDTCY--DFSALTSVT--IPTVS 291
Query: 297 LSFTNRRNSVRLVVPPEAYLV 317
F + +PP YLV
Sbjct: 292 FHF---EGGASVQLPPSNYLV 309
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 81/275 (29%), Positives = 122/275 (44%), Gaps = 51/275 (18%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V+L +G PP+ DTGSDL W QC APC C P+ + P ++ + C+
Sbjct: 96 YVVDLAIGTPPQPVSALLDTGSDLIWTQC-APCASCLSQPDPLFAPGQSASYEPMRCAGT 154
Query: 73 RCA-ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS---NGSVFNVPLT 128
C+ LH C+ P D C Y YGDG ++G T+ F S + VPL
Sbjct: 155 LCSDILHHS----CERP-DTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLG 209
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGV 184
FGCG N G L+ + +G++G GR +S+VSQL IR +C+ + +
Sbjct: 210 FGCG--SVNVGSLN--NGSGIVGFGRNPLSLVSQLS----IRR-FSYCLTSYASRRQSTL 260
Query: 185 LF--LGDGKV--PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------- 233
LF L DG + V TP+LQ+ + Y + ++G + G + L +
Sbjct: 261 LFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVH-----FTGLTVGARRLRIPESAFAL 315
Query: 234 --------IFDSGASYAYFTSRVYQEIVSLIMRDL 260
I DSG + + V E+V + L
Sbjct: 316 RPDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQL 350
>gi|413936885|gb|AFW71436.1| hypothetical protein ZEAMMB73_738128, partial [Zea mays]
Length = 320
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 62/202 (30%), Positives = 87/202 (43%), Gaps = 16/202 (7%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN--IVPC 69
+ + +G PPK + DTGSD+ WV C C GC QY P + V C
Sbjct: 84 YYTRIEIGSPPKGYYVQVDTGSDILWVNC-IRCDGCPTRSGLGIELTQYDPAGSGTTVGC 142
Query: 70 SNPRCAALHWPN-PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SVFN 124
C A PP C + C + I YGDG ++ G VTD +G + N
Sbjct: 143 EQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQTTTSN 202
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGRG 183
+TFGCG S G+LG G+ S++SQL +R + HC+ G G
Sbjct: 203 ASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGG 262
Query: 184 VLFLGDGKVPSSGVAWTPMLQN 205
+ +G+ P V TP++ N
Sbjct: 263 IFAIGNVVQPK--VKTTPLVPN 282
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 79/272 (29%), Positives = 122/272 (44%), Gaps = 32/272 (11%)
Query: 12 PIFS---YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI-- 66
PIF+ + V ++VG PP DTGSD+ W QC PC+ C + + P K+
Sbjct: 75 PIFNNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQC-KPCSNCYQQNAPMFDPSKSTTY 133
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
V CS+P C+ + + C + +C Y I YGD S G L D ++ ++G
Sbjct: 134 KNVACSSPVCS--YSGDGSSCSD-DSECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVA 190
Query: 125 VPLT-FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-------EYGLIRNVIGHC 176
P T GCG++ N G + + +G++GLGRG S+V+QL Y LI IG
Sbjct: 191 FPRTVIGCGHD--NAGTFN-ANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIP--IGTG 245
Query: 177 IGQNGRGVLFLGDGKVPSSGVAWTPMLQN-------SADLKHYILGPAELLY-SGKSCGL 228
+ + F + V SG TP+ + S L+ +G + + G S
Sbjct: 246 STNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLG 305
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDL 260
+ +I DSG + Y S + S I + +
Sbjct: 306 GESNIIIDSGTTLTYLPSALLNSFGSAISQSM 337
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 89/335 (26%), Positives = 142/335 (42%), Gaps = 46/335 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
F +++ +G P + DTGSDL W QC PC C K + P + VPCS+
Sbjct: 100 FLMDVAIGTPALSYAAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPCSSA 158
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L P +C Y YGD S+ G L ++ F L + V FGCG
Sbjct: 159 LCSDL----PTSTCTSASKCGYTYTYGDASSTQGVLASETFTLGKEKKKLPGV--AFGCG 212
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ----NGRGVLFLG 188
G AG++GLGRG +S+VSQL GL + +C+ +G+ L LG
Sbjct: 213 DTNEGDG---FTQGAGLVGLGRGPLSLVSQL---GLDK--FSYCLTSLDDGDGKSPLLLG 264
Query: 189 DGKVPSSG------VAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDL---T 232
S V TP+++N + L +G + + ++D
Sbjct: 265 GSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGG 324
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
+I DSG S Y + Y+ + + + P + + L +C++GP K + +V
Sbjct: 325 VIVDSGTSITYLELQGYRALKKAFVAQM-ALP-TVDGSEIGLDLCFQGPAKGVDEV--QV 380
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLVISVSTSIIII 327
L L F + L +P E Y+V+ ++ + +
Sbjct: 381 PKLVLHFDGGAD---LDLPAENYMVLDSASGALCL 412
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 74/280 (26%), Positives = 123/280 (43%), Gaps = 31/280 (11%)
Query: 12 PIFSY---FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI-- 66
PI++Y + + L++G PP DTGSDLTW C PC C K + P K+
Sbjct: 64 PIYAYLGHYLMELSIGTPPFKIYGIADTGSDLTWTSC-VPCNNCYKQRNPMFDPQKSTTY 122
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+ C + C H + C P +C+Y Y + G L + L + G +
Sbjct: 123 RNISCDSKLC---HKLDTGVCS-PQKRCNYTYAYASAAITRGVLAQETITLSSTKGK--S 176
Query: 125 VPL---TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRN----VIGHC 176
VPL FGCG+N N G + + G++GLG G +S++SQ+ +G R V H
Sbjct: 177 VPLKGIVFGCGHN--NTGGFNDHE-MGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHT 233
Query: 177 IGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKD 230
+ F KV GV TP++ +++ + L ++G S ++
Sbjct: 234 DVSVSSKMSFGKGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNVEK 293
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD 270
+ DSG +++Y ++V+ + ++ P+ PD
Sbjct: 294 GNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPD 333
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 90/296 (30%), Positives = 116/296 (39%), Gaps = 48/296 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V+L VG PP+ DTGSDL W QC APC C P + +PC P
Sbjct: 86 YLVHLAVGTPPRPVALTLDTGSDLVWTQC-APCRDCFDQGIPLLDPAASSTYAALPCGAP 144
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL-----RFSNGSV-FNVP 126
RC AL P C Y YGD ++G + TD F R +GS+
Sbjct: 145 RCRAL-----PFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRR 199
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ---NGRG 183
LTFGCG+ N G +T G+ G GRGR S+ SQL +C +
Sbjct: 200 LTFGCGH--FNKGVFQSNET-GIAGFGRGRWSLPSQLNA-----TSFSYCFTSMFDSKSS 251
Query: 184 VLFLGDG------KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL------ 231
++ LG S V TP+ +N + Y L G S G L
Sbjct: 252 IVTLGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLS-----LKGISVGKTRLPVPETK 306
Query: 232 --TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
+ I DSGAS VY E V +G P + L +C+ P AL
Sbjct: 307 FRSTIIDSGASITTLPEEVY-EAVKAEFAAQVGLPPS-GVEGSALDVCFALPVSAL 360
>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 525
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 75/248 (30%), Positives = 104/248 (41%), Gaps = 24/248 (9%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKN----IV 67
+ +G P F DTGSDL WV CD AP G + + Y P K+ V
Sbjct: 114 TTVQLGTPGTKFMVALDTGSDLFWVPCDCSRCAPTEGSPYASDFELSVYSPKKSSTSKTV 173
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDG-GSSIGALVTDLFPLR--FSNGSVFN 124
PC+N CA +C C Y + Y S+ G L+ DL L+ +
Sbjct: 174 PCNNNLCAQRD-----QCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTEHKHSEPIQ 228
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV 184
+TFGCG Q L G+ GLG +IS+ S L GL+ N C +G G
Sbjct: 229 AYITFGCGQVQSG-SFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSDDGVGR 287
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
+ GD S TP N + I + G + D+T +FDSG S++YF
Sbjct: 288 INFGDKG--SLEQEETPFNLNQLHPNYNIT--VTSIRVGTTLIDADITALFDSGTSFSYF 343
Query: 245 TSRVYQEI 252
T +Y ++
Sbjct: 344 TDPIYSKL 351
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/236 (33%), Positives = 115/236 (48%), Gaps = 34/236 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG--CTKPPEKQYKPHK----NIVPCS 70
+ V +++G P + DTGSD++WVQC PC C + + P + + VPC+
Sbjct: 131 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCK-PCPSPPCYSQRDPLFDPTRSSSYSAVPCA 189
Query: 71 NPRCAALH-WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
C+ L + N C QC Y + YGDG ++ G +D L SN F
Sbjct: 190 AASCSQLALYSN--GCS--GGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNA---LKGFLF 242
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCI--GQNGRGVLF 186
GCG+ Q G + D G+LGLGR S+VSQ YG V +C+ QN G +
Sbjct: 243 GCGHAQQ--GLFAGVD--GLLGLGRQGQSLVSQASSTYG---GVFSYCLPPTQNSVGYIS 295
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---IFDSGA 239
LG G ++G + TP+L S D +YI ++ +G S G + L++ +F SGA
Sbjct: 296 LG-GPSSTAGFSTTPLLTASNDPTYYI-----VMLAGISVGGQPLSIDASVFASGA 345
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 99/338 (29%), Positives = 137/338 (40%), Gaps = 42/338 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ V+L +G PP+ DTGSDL W QC PC C + P ++ C +
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLSLTSCDST 140
Query: 73 RCAALHWPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L + K PN C Y YGD + G L D F + SV V FGC
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGV--AFGC 198
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G N G +T G+ G GRG +S+ SQL+ G + G VL
Sbjct: 199 GL--FNNGVFKSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTAVNGLKPSTVLLDLPAD 254
Query: 192 VPSSG---VAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDLT--LIFDSGA 239
+ SG V TP++QN A+ LK +G L LK+ T I DSG
Sbjct: 255 LYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGT 314
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGPFKALGQVTEYFKPLA 296
+ +RVY+ ++RD +KL + T P C P +A Y L
Sbjct: 315 AMTSLPTRVYR-----LVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRA----KPYVPKLV 365
Query: 297 LSFTNRRNSVRLVVPPEAYL--VISVSTSIIIIAYLTG 332
L F + +P E Y+ V +SI+ +A + G
Sbjct: 366 LHF----EGATMDLPRENYVFEVEDAGSSILCLAIIEG 399
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 69/262 (26%), Positives = 115/262 (43%), Gaps = 37/262 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
+ ++++ G PP+ DTGSDL W QC PC C + P K + V C++
Sbjct: 80 YLIDISFGSPPQKASVIVDTGSDLIWTQC-LPCETCNAAASVIFDPVKSSTYDTVSCASN 138
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C++L + + C Y+ YGDG S+ GAL +P + FGC
Sbjct: 139 FCSSLPF------QSCTTSCKYDYMYGDGSSTSGAL-----STETVTVGTGTIPNVAFGC 187
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLG 188
G+ N G + AG++GLG+G +S++SQ + +C +G + +G
Sbjct: 188 GHT--NLGSFA--GAAGIVGLGQGPLSLISQASS--ITSKKFSYCLVPLGSTKTSPMLIG 241
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDSG 238
D + GVA+T +L N+A+ Y + SGK+ T I DSG
Sbjct: 242 D-SAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSG 300
Query: 239 ASYAYFTSRVYQEIVSLIMRDL 260
+ Y + + +V+ + ++
Sbjct: 301 TTLTYLETGAFNALVAALKAEV 322
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 80/279 (28%), Positives = 117/279 (41%), Gaps = 33/279 (11%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAA 76
+TVG + DTGSDLTWVQC PC C E + P + +PC++P C A
Sbjct: 68 VTVGIGGQNSTLIVDTGSDLTWVQC-LPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVA 126
Query: 77 LH--WPNPPRCKHPND-QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
L + C + N CDY+I+YGDG S G L + L G FGCG
Sbjct: 127 LQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTL----GKTEIDNFIFGCGR 182
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDG 190
N N G +G++GL R +S+VSQ L +V +C+ G G L LG
Sbjct: 183 N--NKGLFG--GASGLMGLARSELSLVSQTSS--LFGSVFSYCLPTTGVGSSGSLTLGGA 236
Query: 191 KVPS----SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT------LIFDSGAS 240
+ S +++T M+QN Y L + G + + L+ + DSG
Sbjct: 237 DFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTV 296
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR 279
+ +Y+ + + G + P L C+
Sbjct: 297 ITRLSPSIYKAFKAEFEKQFSG--YRTTPGFSILNTCFN 333
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 70/259 (27%), Positives = 112/259 (43%), Gaps = 31/259 (11%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT-GCTKPPEKQ--YKPHKNIVPCSNPRCA 75
++L++G PP+ +F S +WV C + C CT Q +PC +P C+
Sbjct: 1 MDLSLGTPPQPLNFTLAVDSGFSWVACSSSCAINCTTASLFQPGLSTSHTKLPCGSPSCS 60
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQ 135
A + C P+ C Y YG SS G LV+D+ + L+ GCG +
Sbjct: 61 AFSAVST-SCG-PSSSCSYNTSYGTNFSSAGDLVSDIATMDSVRNRKVAANLSLGCG--R 116
Query: 136 HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDGKVP- 193
+ G L DT+G +G +G +S + QL G R+ +C+ RG L +G+ K+
Sbjct: 117 DSGGLLELLDTSGFVGFDKGNVSFMGQLSALGY-RSKFIYCLPSDTFRGKLVIGNYKLRN 175
Query: 194 ---SSGVAWTPMLQNSADLKHYILG-------------PAELLYSGKSCGLKDLTLIFDS 237
SS +A+TPM+ N + Y + P + S + G + D+
Sbjct: 176 ASISSSMAYTPMITNPQAAELYFINLSTISIDKNKFQVPIQGFLSNGTGG-----TVIDT 230
Query: 238 GASYAYFTSRVYQEIVSLI 256
+Y TS Y ++V I
Sbjct: 231 TTFLSYLTSDFYTQLVQAI 249
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/313 (26%), Positives = 133/313 (42%), Gaps = 32/313 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
Y+ L +G PP+ F DTGS +T+V C + C C + +++P + V C+
Sbjct: 92 YYTTRLWIGTPPQRFALIVDTGSTVTYVPC-STCKHCGSHQDPKFRPEASETYQPVKCT- 149
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTFG 130
+C C QC YE Y + +S G L D+ + F N S + FG
Sbjct: 150 WQC---------NCDDDRKQCTYERRYAEMSTSSGVLGEDV--VSFGNQSELSPQRAIFG 198
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
C ++ G + G++GLGRG +SI+ QL E +I + C G G G + G
Sbjct: 199 CENDE--TGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLG 256
Query: 191 KV-PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------IFDSGASYAY 243
+ P + + +T + +Y + E+ +GK L + DSG +YAY
Sbjct: 257 GISPPADMVFTH--SDPVRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAY 314
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRR 303
+ IM++ PD IC+ G + Q+++ F + + F N
Sbjct: 315 LPESAFLAFKHAIMKETHSLKRISGPDPHYNDICFSGAEINVSQLSKSFPVVEMVFGNGH 374
Query: 304 NSVRLVVPPEAYL 316
+L + PE YL
Sbjct: 375 ---KLSLSPENYL 384
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 91/341 (26%), Positives = 144/341 (42%), Gaps = 57/341 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
F +++++G P + DTGSDL W QC PC C + P + +PCS+
Sbjct: 102 FLMDMSIGTPAVAYAAIIDTGSDLVWTQCK-PCVECFNQSTPVFDPSSSSTYAALPCSST 160
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C+ L P K + +C Y YGD S+ G L + F L + +P + FGC
Sbjct: 161 LCSDL-----PSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTLAKT-----KLPDVAFGC 210
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLG 188
G G AG++GLGRG +S+VSQL GL N +C + + L LG
Sbjct: 211 GDTNEGDG---FTQGAGLVGLGRGPLSLVSQL---GL--NKFSYCLTSLDDTSKSPLLLG 262
Query: 189 D------GKVPSSGVAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDL---T 232
+S V TP+++N + +LK +G + + ++D
Sbjct: 263 SLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGG 322
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQVT 289
+I DSG S Y + Y+ ++ +KL D + L C+ P + QV
Sbjct: 323 VIVDSGTSITYLELQGYRA-----LKKAFAAQMKLPAADGSGIGLDTCFEAPASGVDQV- 376
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYL 330
E K L F + L +P E Y+V+ + + + +
Sbjct: 377 EVPK---LVF--HLDGADLDLPAENYMVLDSGSGALCLTVM 412
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 86/306 (28%), Positives = 136/306 (44%), Gaps = 38/306 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP----EKQYKPHKNIVPCSN 71
YF +++++G PP F DTGSDLTWVQC PC C K +K+ C +
Sbjct: 85 YF-MSISIGTPPSKFLAIADTGSDLTWVQC-KPCQQCYKQNTPLFDKKKSSTYKTESCDS 142
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FG 130
C AL + C + C Y YGD + G + T+ + S+GS + P T FG
Sbjct: 143 ITCNALS-EHEEGCDESRNACKYRYSYGDESFTKGEVATETISIDSSSGSPVSFPGTAFG 201
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-----NGRGVL 185
CGYN G +G++GLG G +S+VSQL I +C+ NG V+
Sbjct: 202 CGYNN---GGTFEETGSGIIGLGGGPLSLVSQLGSS--IGKKFSYCLSHTSATTNGTSVI 256
Query: 186 FLGDGKVPS-----SGVAWTPMLQNSADLKHYI------LGPAELLYSG------KSCGL 228
LG + S S + TP++Q + +++ +G +L Y+G
Sbjct: 257 NLGTNSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKTKLPYTGGGGYSLNRKSK 316
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG-- 286
K +I DSG + S Y + +++ + G +++ L C++ K +G
Sbjct: 317 KTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAK-RVSDPQGILTHCFKSGDKEIGLP 375
Query: 287 QVTEYF 292
+T +F
Sbjct: 376 TITMHF 381
>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
Length = 478
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 80/302 (26%), Positives = 131/302 (43%), Gaps = 26/302 (8%)
Query: 27 PKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAALHWPNPPRCK 86
+ F+ DTGS T++ C C C +Y + S C+A +C
Sbjct: 44 AQTFELIVDTGSSRTYLPCKG-CASCGAHEAGRYYDYDASADFSRVECSACAGIGG-KCG 101
Query: 87 HPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDT 146
+ C Y++ Y +G S G LV D+ L GSV N + FGC + G +
Sbjct: 102 -TSGVCRYDVHYLEGSGSEGYLVRDVVSL---GGSVGNATVVFGC--EERELGSIKQQSA 155
Query: 147 AGVLGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGVLFLG--DGKVPSSGV 197
G+ G GR ++ +QL +I ++ C+ G++ G+L LG D + +
Sbjct: 156 DGLFGFGRQAYALRAQLASASVIDDLFSMCVEGYEKLSGEHVGGLLTLGNFDFGADAPAL 215
Query: 198 AWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIM 257
+TPM+ S+ + + + + L + G + + I DSG SY Y ++ + L
Sbjct: 216 VYTPMV--SSAMYYQVTTTSWTLGNSVVEGSRGVLTIIDSGTSYTYVPGNMHARFLQLAE 273
Query: 258 RDLIGTPL-KLAPDDKTLPICWRGPFKALG--QVTEYFKPLALSFTNRRNSVRLVVPPEA 314
+ L K+AP + +C+ G LG V+EYF L + + S RL + PE
Sbjct: 274 DAARESGLEKVAPPEDYPDLCF-GNSGGLGWSTVSEYFPALKIEY---HGSARLTLSPET 329
Query: 315 YL 316
YL
Sbjct: 330 YL 331
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 80/279 (28%), Positives = 117/279 (41%), Gaps = 33/279 (11%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAA 76
+TVG + DTGSDLTWVQC PC C E + P + +PC++P C A
Sbjct: 147 VTVGIGGQNSTLIVDTGSDLTWVQC-LPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVA 205
Query: 77 LH--WPNPPRCKHPND-QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
L + C + N CDY+I+YGDG S G L + L G FGCG
Sbjct: 206 LQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTL----GKTEIDNFIFGCGR 261
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDG 190
N N G +G++GL R +S+VSQ L +V +C+ G G L LG
Sbjct: 262 N--NKGLFG--GASGLMGLARSELSLVSQTSS--LFGSVFSYCLPTTGVGSSGSLTLGGA 315
Query: 191 KVPS----SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT------LIFDSGAS 240
+ S +++T M+QN Y L + G + + L+ + DSG
Sbjct: 316 DFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTV 375
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR 279
+ +Y+ + + G + P L C+
Sbjct: 376 ITRLSPSIYKAFKAEFEKQFSG--YRTTPGFSILNTCFN 412
>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
Length = 297
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 65/204 (31%), Positives = 93/204 (45%), Gaps = 23/204 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-----KQYKPHKN----I 66
YF + +G P K + DTGSD+ WV C C GC + Y P + +
Sbjct: 90 YF-TRIGIGTPAKRYYVQVDTGSDILWVNC-VSCDGCPRKSNLGIELTMYDPRGSQSGEL 147
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG----SV 122
V C C A + P C + C+Y I YGDG S+ G VTD +G +
Sbjct: 148 VTCDQQFCVANYGGVLPSCTSTS-PCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTP 206
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTA--GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ- 179
N ++FGCG G L + A G+LG G+ S++SQL G +R + HC+
Sbjct: 207 ANASVSFGCGAKLG--GDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTV 264
Query: 180 NGRGVLFLGDGKVPSSGVAWTPML 203
NG G+ +G+ P V TP++
Sbjct: 265 NGGGIFAIGNVVQPK--VKTTPLV 286
>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 488
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 77/290 (26%), Positives = 116/290 (40%), Gaps = 48/290 (16%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ---------YKP-- 62
F ++A N+T+G P + F DTGSDL W+ C+ T C + E Y P
Sbjct: 87 FLHYA-NVTIGTPAQWFLVALDTGSDLFWLPCNCNST-CVRSMETDQGERIKLNIYNPSK 144
Query: 63 --HKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSN 119
+ V C++ CA + RC P C Y I Y G S G LV D+ +
Sbjct: 145 SKSSSKVTCNSTLCALRN-----RCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEE 199
Query: 120 GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
G + +TFGC +Q G G++GL I++ + L + G+ + C G
Sbjct: 200 GEARDARITFGCSESQ--LGLFKEVAVNGIMGLAIADIAVPNMLVKAGVASDSFSMCFGP 257
Query: 180 NGRGVLFLGDG------KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL 233
NG+G + GD + P SG +PM + + K + GK + T
Sbjct: 258 NGKGTISFGDKGSSDQLETPLSGTI-SPMFYDVSITKFKV---------GKVTVDTEFTA 307
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK 283
FDSG + + Y + T L+ D+ L PF+
Sbjct: 308 TFDSGTAVTWLIEPYYTALT---------TNFHLSVPDRRLSKSVDSPFE 348
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 75/272 (27%), Positives = 119/272 (43%), Gaps = 32/272 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPEKQYKPHKNI---VPCS 70
YF V+L +G+PP+ DTGSDL WV+C A C C+ P + H + C
Sbjct: 84 YF-VDLRIGQPPQSLLLIADTGSDLVWVKCSA-CRNCSHHSPATVFFPRHSSTFSPAHCY 141
Query: 71 NPRCAALHWPN-PPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP- 126
+P C + P+ P C H + C YE Y DG + G + L+ S+G +
Sbjct: 142 DPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKS 201
Query: 127 LTFGCGY--NQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQNG-- 181
+ FGCG+ + + S GV+GLGRG IS SQL R +G N +C+
Sbjct: 202 VAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFG---NKFSYCLMDYTLS 258
Query: 182 ---RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK--------- 229
L +G+G S + +TP+L N Y + + +G +
Sbjct: 259 PPPTSYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDS 318
Query: 230 -DLTLIFDSGASYAYFTSRVYQEIVSLIMRDL 260
+ + DSG + A+ Y+ +++ + R +
Sbjct: 319 GNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRV 350
>gi|297807039|ref|XP_002871403.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317240|gb|EFH47662.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 529
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/265 (29%), Positives = 109/265 (41%), Gaps = 32/265 (12%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGC-----TKPPEKQYKPHK----NIV 67
+ +G P F DTGSDL W+ C+ AP T +Y P +
Sbjct: 104 IDIGTPSVSFLVALDTGSDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSSSKVF 163
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPL------RFSNG 120
CS+ C + C P +QC Y ++Y G SS G LV D+ L R NG
Sbjct: 164 LCSHKLCGS-----ASDCDSPKEQCTYTVKYLSGNTSSSGLLVEDILHLTYNTNNRLMNG 218
Query: 121 SV-FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
S + GCG Q L G++GLG IS+ S L + GL+RN C +
Sbjct: 219 SSSVKARVVVGCGKKQSG-DYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDE 277
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSG 238
G ++ GD A L+N++ YI+G E G SC T DSG
Sbjct: 278 EDSGRIYFGDMGPSIQQSAPFLQLENNSG---YIVG-VEACCIGNSCLKQTSFTTFIDSG 333
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGT 263
S+ Y +Y+++ I R + T
Sbjct: 334 QSFTYLPEEIYRKVALEIDRHINAT 358
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 70/205 (34%), Positives = 96/205 (46%), Gaps = 26/205 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P + F FDTGSD TWVQC PC C + E + P K+ + CS+
Sbjct: 96 YVVPVRLGTPAERFTVVFDTGSDTTWVQCQ-PCVAYCYRQKEPLFDPTKSATYANISCSS 154
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C+ L+ C C Y I+YGDG +IG D L + F FGC
Sbjct: 155 SYCSDLYVSG---CS--GGHCLYGIQYGDGSYTIGFYAQDTLTLAYDTIKNFR----FGC 205
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFLG 188
G + N G AG+LGLGRG+ S+ V +YG V +C+ G G L LG
Sbjct: 206 G--EKNRGLFG--RAAGLLGLGRGKTSLPVQAYDKYG---GVFAYCLPATSAGTGFLDLG 258
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYI 213
G P++ TPML + +Y+
Sbjct: 259 PG-APAANARLTPMLVDRGPTFYYV 282
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 70/205 (34%), Positives = 96/205 (46%), Gaps = 26/205 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P + F FDTGSD TWVQC PC C + E + P K+ + CS+
Sbjct: 161 YVVPVRLGTPAERFTVVFDTGSDTTWVQCQ-PCVAYCYRQKEPLFDPTKSATYANISCSS 219
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C+ L+ C C Y I+YGDG +IG D L + F FGC
Sbjct: 220 SYCSDLYVSG---CS--GGHCLYGIQYGDGSYTIGFYAQDTLTLAYDTIKNFR----FGC 270
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFLG 188
G + N G AG+LGLGRG+ S+ V +YG V +C+ G G L LG
Sbjct: 271 G--EKNRGLFG--RAAGLLGLGRGKTSLPVQAYDKYG---GVFAYCLPATSAGTGFLDLG 323
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYI 213
G P++ TPML + +Y+
Sbjct: 324 PG-APAANARLTPMLVDRGPTFYYV 347
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 66/205 (32%), Positives = 93/205 (45%), Gaps = 23/205 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P ++ V C+ P
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAP 239
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L N C C Y ++YGDG SIG D L S ++ F G
Sbjct: 240 ACSDL---NIHGCS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 289
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
+ N G + AG+LGLGRG+ S+ V +YG V HC+ G G L G
Sbjct: 290 CGERNEGLFG--EAAGLLGLGRGKTSLPVQTYDKYG---GVFAHCLPARSTGTGYLDFGA 344
Query: 190 GKVPSSGVAW-TPMLQNSADLKHYI 213
G + ++ TPML + +Y+
Sbjct: 345 GSLAAARARLTTPMLTENGPTFYYV 369
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 100/198 (50%), Gaps = 15/198 (7%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA--A 76
V+LTVG PP+ DTGS+L+W+ C+ + T + ++ I PCS+P C
Sbjct: 33 VSLTVGTPPQNVSMVIDTGSELSWLHCNKTLSYPTTFDPTRSTSYQTI-PCSSPTCTNRT 91
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
+P P C N+ C + Y D SS G L +D+F + S+ S L FGC +
Sbjct: 92 QDFPIPASCDS-NNLCHATLSYADASSSDGNLASDVFHIGSSDIS----GLVFGCMDSVF 146
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDGKVP-S 194
+ + G++G+ RG +S VSQL G + +CI G + G+L LG+ + S
Sbjct: 147 SSNSDEDSKSTGLMGMNRGSLSFVSQL---GFPK--FSYCISGTDFSGLLLLGESNLTWS 201
Query: 195 SGVAWTPMLQNSADLKHY 212
+ +TP++Q S L ++
Sbjct: 202 VPLNYTPLIQISTPLPYF 219
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 73/271 (26%), Positives = 112/271 (41%), Gaps = 48/271 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V L VG P DTGSD++W+QC PC C + P + +PC++
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQC-VPCKDCVPALRPPFNPRHSSSFFKLPCASS 197
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLF-----------PLRFSNGS 121
C ++ P C C + I+YGDG S G L + P++ SN
Sbjct: 198 TCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSN-- 255
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-- 179
+T GC P +G+LG+ R IS SQL + HC
Sbjct: 256 -----ITLGCADIDREGLPTG---ASGLLGMDRRPISFPSQLSSRYARK--FSHCFPDKI 305
Query: 180 ---NGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG-------PAELLYSGKS 225
N G++F G+ + S + +TP++QN SA L +Y +G + L S K+
Sbjct: 306 AHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKN 365
Query: 226 CGLKDLT----LIFDSGASYAYFTSRVYQEI 252
+ +T I DSG ++ Y +Q +
Sbjct: 366 FDIDKVTGSGGTIIDSGTAFTYLKKPAFQAM 396
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/291 (27%), Positives = 121/291 (41%), Gaps = 25/291 (8%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP-----EKQYKPHKNIV 67
+ + + V+L+VG PP+ DTGSDL W QC APC C + V
Sbjct: 90 VTNEYLVHLSVGTPPRPVALTLDTGSDLVWTQC-APCLNCFDQGAIPVLDPAASSTHAAV 148
Query: 68 PCSNPRCAALHWPNPPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPL----RFSNGS 121
C P C AL + + R C Y YGD ++G L +D F G
Sbjct: 149 RCDAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGG 208
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
V LTFGCG+ N G +T G+ G GRGR S+ SQL +
Sbjct: 209 VSERRLTFGCGH--FNKGIFQANET-GIAGFGRGRWSLPSQLGVTSFSYCFTSMFESTSS 265
Query: 182 RGVLFLGDGKVPSSG-VAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDLTL 233
L + ++ +G V TP+L++ + LK +G + + L++ +
Sbjct: 266 LVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLREASA 325
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKA 284
I DSGAS VY+ + + + +G P+ A + L +C+ P A
Sbjct: 326 IIDSGASITTLPEDVYEAVKAEFVAQ-VGLPVS-AVEGSALDLCFALPSAA 374
>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
Length = 528
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 79/266 (29%), Positives = 112/266 (42%), Gaps = 32/266 (12%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGC-----TKPPEKQYKPHKN----IV 67
+ +G P F DTGS+L W+ C+ AP T +Y P + +
Sbjct: 104 IDIGTPSVSFLVALDTGSNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVF 163
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPL------RFSNG 120
CS+ C + C+ P +QC Y + Y G SS G LV D+ L R NG
Sbjct: 164 LCSHKLCDS-----ASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNG 218
Query: 121 SV-FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
S + GCG Q L G++GLG IS+ S L + GL+RN C +
Sbjct: 219 SSSVKARVVIGCGKKQSG-DYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDE 277
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQ-NSADLKHYILGPAELLYSGKSC-GLKDLTLIFDS 237
G ++ GD + S TP LQ ++ YI+G E G SC T DS
Sbjct: 278 EDSGRIYFGD--MGPSIQQSTPFLQLDNNKYSGYIVG-VEACCIGNSCLKQTSFTTFIDS 334
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGT 263
G S+ Y +Y+++ I R + T
Sbjct: 335 GQSFTYLPEEIYRKVALEIDRHINAT 360
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/151 (33%), Positives = 78/151 (51%), Gaps = 14/151 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + + +G P + + DTGSDL W QC APC C P + P ++ + C++P
Sbjct: 90 YLMEMGIGTPTRYYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDPARSATYRSLGCASP 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL++ P C C Y+ YGD S+ G L + F + V ++FGCG
Sbjct: 149 ACNALYY---PLCYQ--KVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCG 203
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
N G L+ + +G++G GRG +S+VSQL
Sbjct: 204 --NLNAGSLA--NGSGMVGFGRGSLSLVSQL 230
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 99/338 (29%), Positives = 137/338 (40%), Gaps = 42/338 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ V+L +G PP+ DTGSDL W QC PC C + P ++ C +
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLSLTSCDST 140
Query: 73 RCAALHWPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L + K PN C Y YGD + G L D F + SV V FGC
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGV--AFGC 198
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G N G +T G+ G GRG +S+ SQL+ G + G VL
Sbjct: 199 GL--FNNGVFKSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTAVNGLKPSTVLLDLPAD 254
Query: 192 VPSSG---VAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDLT--LIFDSGA 239
+ SG V TP++QN A+ LK +G L LK+ T I DSG
Sbjct: 255 LYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLKNGTGGTIIDSGT 314
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGPFKALGQVTEYFKPLA 296
+ +RVY+ ++RD +KL + T P C P +A Y L
Sbjct: 315 AMTSLPTRVYR-----LVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRA----KPYVPKLV 365
Query: 297 LSFTNRRNSVRLVVPPEAYL--VISVSTSIIIIAYLTG 332
L F + +P E Y+ V +SI+ +A + G
Sbjct: 366 LHF----EGATMDLPRENYVFEVEDAGSSILCLAIIEG 399
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/280 (27%), Positives = 116/280 (41%), Gaps = 25/280 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ L +G P + DTGS LTW+QC C + Y P + VPCS
Sbjct: 134 YVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPCSAS 193
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+C L NP C N C Y+ YGD S+G L D + F +GS N +G
Sbjct: 194 QCDELQAATLNPSACSVRN-VCIYQASYGDSSFSVGYLSRDT--VSFGSGSYPN--FYYG 248
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG Q N G +AG++GL R ++S++ QL + +C+ +L G
Sbjct: 249 CG--QDNEGLFG--RSAGLIGLARNKLSLLYQLAPS--LGYSFSYCL-PTPASTGYLSIG 301
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYAYFT 245
S ++TPM +S D Y + + + G + L I DSG
Sbjct: 302 PYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEYSSLPTIIDSGTVITRLP 361
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
+ VY + + ++G ++ AP L C++G L
Sbjct: 362 TAVYTALSKAVAAAMVG--VQSAPAFSILDTCFQGQASQL 399
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/293 (27%), Positives = 132/293 (45%), Gaps = 34/293 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT-GCTKPPEKQYKPHKN----IVPCSN 71
+ V + +G P K + DTGS +W+QC PCT C + + P + VPCS+
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQ-PCTIYCHIQEDPVFNPSASKTYKTVPCSS 161
Query: 72 PRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG-SVFNVPLT 128
+C++L N P C ++ C Y+ YGD S+G L D+ L S S F
Sbjct: 162 SQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSF----V 217
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCIGQN------- 180
+GCG Q N G D G++GL +S++SQL +YG N +C+ +
Sbjct: 218 YGCG--QDNQGLFGRTD--GIIGLANNELSMLSQLSGKYG---NAFSYCLPTSFSTPNSP 270
Query: 181 GRGVLFLGDGKV-PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIF 235
G L +G + PSS +TP+L+N + Y + + +G+ G+ + I
Sbjct: 271 KEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTII 330
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
DSG + VY + + + ++ + AP L C++G + +V
Sbjct: 331 DSGTVITRLPTPVYTTLKNAYVT-ILSKKYQQAPGISLLDTCFKGSLAGISEV 382
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/293 (27%), Positives = 132/293 (45%), Gaps = 34/293 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT-GCTKPPEKQYKPHKN----IVPCSN 71
+ V + +G P K + DTGS +W+QC PCT C + + P + VPCS+
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQ-PCTIYCHIQEDPVFNPSASKTYKTVPCSS 161
Query: 72 PRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG-SVFNVPLT 128
+C++L N P C ++ C Y+ YGD S+G L D+ L S S F
Sbjct: 162 SQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSF----V 217
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCIGQN------- 180
+GCG Q N G D G++GL +S++SQL +YG N +C+ +
Sbjct: 218 YGCG--QDNQGLFGRTD--GIIGLANNELSMLSQLSGKYG---NAFSYCLPTSFSTPNSP 270
Query: 181 GRGVLFLGDGKV-PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIF 235
G L +G + PSS +TP+L+N + Y + + +G+ G+ + I
Sbjct: 271 KEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTII 330
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
DSG + VY + + + ++ + AP L C++G + +V
Sbjct: 331 DSGTVITRLPTPVYTTLKNAYVT-ILSKKYQQAPGISLLDTCFKGSLAGISEV 382
>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
gi|194693730|gb|ACF80949.1| unknown [Zea mays]
gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
Length = 519
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/274 (28%), Positives = 108/274 (39%), Gaps = 29/274 (10%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCT----------GCTKPPEKQYKPHKNI 66
+ VG P F DTGSDL WV CD AP + G KP E H
Sbjct: 104 VDVGTPTTSFLVALDTGSDLFWVPCDCIQCAPLSSYRGNLDRDLGIYKPAESTTSRH--- 160
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGSV-FN 124
+PCS+ C C +P C Y I+Y + +S G L+ D L G N
Sbjct: 161 LPCSHELCQPGSG-----CTNPKQPCTYNIDYFSENTTSSGLLIEDSLHLNSREGHAPVN 215
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV 184
+ GCG Q L G+LGLG IS+ S L GL+RN C ++ G
Sbjct: 216 ASVIIGCGRKQSG-DYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKEDSSGR 274
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
+F GD V S TP + L+ Y + + K + DSG S+
Sbjct: 275 IFFGDQGVSSQQS--TPFVPLYGKLQTYAVNVDKSCIGHKCLEGSSFQALVDSGTSFTSL 332
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
VY+ + + + + ++ +D T C+
Sbjct: 333 PPDVYKAFTTEFDKQINAS--RVPYEDSTWKYCY 364
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 60/166 (36%), Positives = 81/166 (48%), Gaps = 21/166 (12%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCD-APCTGCTKPPEKQYKPHKN---- 65
FP F+ + V+L G PP+ DTGSD+TW QC P + C + P +
Sbjct: 83 FP-FTEYLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFA 141
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQ----CDYEIEYGDGGSSIGALVTDLFPLR--FSN 119
+PCS+P C P C ND C+Y I YGDG S G + ++F
Sbjct: 142 SLPCSSPACETT-----PPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGE 196
Query: 120 GSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR 164
GS VP L FGCG+ N G + +T G+ G GRG +S+ SQL+
Sbjct: 197 GSSAAVPGLVFGCGH--ANRGVFTSNET-GIAGFGRGSLSLPSQLK 239
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/318 (28%), Positives = 132/318 (41%), Gaps = 61/318 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ + ++G+PP L + DTGSDL WV+C +PC GC PP Y P ++ +PCS+
Sbjct: 87 YIMQFSIGEPPLLIWAEVDTGSDLMWVKC-SPCNGCNPPPSPLYDPARSRSSGKLPCSSQ 145
Query: 73 RCAALHWPN--PPRCKHPNDQCDYEIEYGDGG--SSIGALVTDLFPLRFSNGSVFNVPLT 128
C AL +C C Y YG G S+ G L T+ F T
Sbjct: 146 LCQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETF--------------T 191
Query: 129 FGCGYNQHNP--GPLSPPD------TAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
FG GY +N G D TAG++GLGRG +S+VSQL G R +C+ +
Sbjct: 192 FGDGYVANNVSFGRSDTIDGSQFGGTAGLVGLGRGHLSLVSQL---GAGR--FAYCLAAD 246
Query: 181 GR---GVLF--LGDGKVPSSGVAWTPMLQNSADLK--HYILGPAELLYSGKSCGLKDLT- 232
+LF L + V+ TP++ N + HY + + G +KD T
Sbjct: 247 PNVYSTILFGSLAALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTF 306
Query: 233 ---------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK 283
+ FDSGA YQ ++R I + ++ D C+
Sbjct: 307 AINSDGSGGVFFDSGAIDTSLKDAAYQ-----VVRQAITSEIQRLGYDAGDDTCF---VA 358
Query: 284 ALGQVTEYFKPLALSFTN 301
A Q PL L F +
Sbjct: 359 ANQQAVAQMPPLVLHFDD 376
>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 545
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/266 (28%), Positives = 109/266 (40%), Gaps = 41/266 (15%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCD---------APCTGCTKPPEKQYKPHKNI 66
Y+A + +G P F DTGSDL WV CD A TG PP + Y P ++
Sbjct: 110 YYA-EVELGTPNATFLVALDTGSDLFWVPCDCRQCATIPSANATGPDAPPLRPYSPRRSS 168
Query: 67 ----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSN-- 119
V C NP C + + N C YE++Y SS G LV D+ L
Sbjct: 169 TSEQVACDNPLCGRRNGCS----AATNGSCPYEVQYVSANTSSSGVLVQDVLHLTRERPG 224
Query: 120 ----GSVFNVPLTFGCGYNQHNP------GPLSPPDTAGVLGLGRGRISIVSQLREYGLI 169
G P+ FGCG Q G + G++GLG G++S+ S L GL+
Sbjct: 225 PGAAGEALQAPVVFGCGQVQTGAFLDDGGGAVD-----GLMGLGMGKVSVPSALAASGLV 279
Query: 170 -RNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL 228
+ C G +G G + GD S G A TP S + + + + G
Sbjct: 280 ASDSFSMCFGDDGVGRVNFGDAG--SRGQAETPFTVRSLNPTYNV--SFTSIGIGSESVA 335
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVS 254
+ + DSG S+ Y + Y ++ +
Sbjct: 336 AEFAAVMDSGTSFTYLSDPEYTQLAT 361
>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
Length = 284
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 65/192 (33%), Positives = 88/192 (45%), Gaps = 22/192 (11%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VP 68
I Y+ L +G PP++F D+GS +T+V C + C C K + +++P + V
Sbjct: 89 INGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQDPKFQPEMSSTYQPVK 147
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPL 127
C N C C +QC YE EY + SS G L DL + F N S
Sbjct: 148 C-NMDC---------NCDDDREQCVYEREYAEHSSSKGVLGEDL--ISFGNESQLTPQRA 195
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVL 185
FGC G L G++GLG+G +S+V QL + GLI N G C G G G +
Sbjct: 196 VFGC--ETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSM 253
Query: 186 FLGDGKVPSSGV 197
LG PS V
Sbjct: 254 ILGGFDYPSDMV 265
>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
Group]
Length = 476
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/263 (31%), Positives = 113/263 (42%), Gaps = 42/263 (15%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPE------KQYKPHKN 65
F ++AV + +G P F DTGSDL WV CD C C + P Y P ++
Sbjct: 60 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQS 116
Query: 66 I----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRF--S 118
VPCS+ C + C+ ++ C Y I+Y D SS G LV D+ L +
Sbjct: 117 TTSRKVPCSSNLCDLQN-----ACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSA 171
Query: 119 NGSVFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+ P+ FGCG Q G +P G+LGLG S+ S L GL N C
Sbjct: 172 QSKIVTAPIMFGCGQVQTGSFLGSAAP---NGLLGLGMDSKSVPSLLASKGLAANSFSMC 228
Query: 177 IGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGP-AELLYSGKSCGLK----DL 231
G +G G + GD SS TP L Y P + +G + G K +
Sbjct: 229 FGDDGHGRINFGD--TGSSDQKETP-------LNVYKQNPYYNITITGITVGSKSISTEF 279
Query: 232 TLIFDSGASYAYFTSRVYQEIVS 254
+ I DSG S+ + +Y +I S
Sbjct: 280 SAIVDSGTSFTALSDPMYTQITS 302
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 70/197 (35%), Positives = 94/197 (47%), Gaps = 25/197 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + VG PP+ D+GSD+ WVQC+ PCT C + + P + V C++
Sbjct: 134 YF-VRIGVGSPPRNQYVVIDSGSDIIWVQCE-PCTQCYHQSDPVFNPADSSSYAGVSCAS 191
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C+ H N C +C YE+ YGDG + G L L L F + NV + GC
Sbjct: 192 TVCS--HVDNAG-CH--EGRCRYEVSYGDGSYTKGTLA--LETLTFGRTLIRNVAI--GC 242
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G+ HN G AG+LGLG G +S V QL G +C+ G G+L G
Sbjct: 243 GH--HNQGMFV--GAAGLLGLGSGPMSFVGQLG--GQAGGTFSYCLVSRGIQSSGLLQFG 296
Query: 189 DGKVPSSGVAWTPMLQN 205
VP G AW P++ N
Sbjct: 297 REAVP-VGAAWVPLIHN 312
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 84/331 (25%), Positives = 139/331 (41%), Gaps = 47/331 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI--VPCSNPRC 74
+ + + +G P DTGSDL W +C+ PCT C+ V C + C
Sbjct: 42 YLIQMAIGTPALSLSAIMDTGSDLVWTKCN-PCTDCSTSSIYDPSSSSTYSKVLCQSSLC 100
Query: 75 AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYN 134
P+ C + D C+Y YGD S+ G L + F + S+ S+ N+ TFGCG++
Sbjct: 101 ---QPPSIFSCNNDGD-CEYVYPYGDRSSTSGILSDETFSI--SSQSLPNI--TFGCGHD 152
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFLGD- 189
+ G++G GRG +S+VSQL + N +C+ + LF+G+
Sbjct: 153 NQGFDKV-----GGLVGFGRGSLSLVSQLGPS--MGNKFSYCLVSRTDSSKTSPLFIGNT 205
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDSGA 239
+ ++ V TP++Q+S+ HY L + G+S + T LI DSG
Sbjct: 206 ASLEATTVGSTPLVQSSS-TNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGT 264
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
+ + Y + ++ + + + L D L +C F G F + F
Sbjct: 265 TLTFLQQTAYDAV-----KEAMVSSINLPQADGQLDLC----FNQQGSSNPGFPSMTFHF 315
Query: 300 TNRRNSVRLVVPPEAYLVISVSTSIIIIAYL 330
VP E YL ++ I+ +A +
Sbjct: 316 ----KGADYDVPKENYLFPDSTSDIVCLAMM 342
>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
Length = 490
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 82/263 (31%), Positives = 113/263 (42%), Gaps = 42/263 (15%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPE------KQYKPHKN 65
F ++AV + +G P F DTGSDL WV CD C C + P Y P ++
Sbjct: 74 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQS 130
Query: 66 I----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRF--S 118
VPCS+ C + C+ ++ C Y I+Y D SS G LV D+ L +
Sbjct: 131 TTSRKVPCSSNLCDLQN-----ACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSA 185
Query: 119 NGSVFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+ P+ FGCG Q G +P G+LGLG S+ S L GL N C
Sbjct: 186 QSKIVTAPIMFGCGQVQTGSFLGSAAP---NGLLGLGMDSKSVPSLLASKGLAANSFSMC 242
Query: 177 IGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGP-AELLYSGKSCGLK----DL 231
G +G G + GD SS TP L Y P + +G + G K +
Sbjct: 243 FGDDGHGRINFGD--TGSSDQKETP-------LNVYKQNPYYNITITGITVGSKSISTEF 293
Query: 232 TLIFDSGASYAYFTSRVYQEIVS 254
+ I DSG S+ + +Y +I S
Sbjct: 294 SAIVDSGTSFTALSDPMYTQITS 316
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/151 (33%), Positives = 78/151 (51%), Gaps = 14/151 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + + +G P + + DTGSDL W QC APC C P + P ++ + C++P
Sbjct: 90 YLMEMGIGTPTRYYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDPARSATYRSLGCASP 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C AL++ P C C Y+ YGD S+ G L + F + V ++FGCG
Sbjct: 149 ACNALYY---PLCYQ--KVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCG 203
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
N G L+ + +G++G GRG +S+VSQL
Sbjct: 204 --NLNAGLLA--NGSGMVGFGRGSLSLVSQL 230
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 75/159 (47%), Gaps = 19/159 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V+ +G PP DTGSDL W QCDAPC C P Y P +++ V C +
Sbjct: 100 YLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVSCGSR 159
Query: 73 RCAAL--------HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
C AL + C Y YGDG S+ G L T+ F F G+ +
Sbjct: 160 LCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFT--FGAGTTVH 217
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
L FGCG + +++G++G+GRG +S+VSQL
Sbjct: 218 -DLAFGCGTDNLG----GTDNSSGLVGMGRGPLSLVSQL 251
>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
Length = 671
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 82/263 (31%), Positives = 113/263 (42%), Gaps = 42/263 (15%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPE------KQYKPHKN 65
F ++AV + +G P F DTGSDL WV CD C C + P Y P ++
Sbjct: 33 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQS 89
Query: 66 I----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRF--S 118
VPCS+ C + C+ ++ C Y I+Y D SS G LV D+ L +
Sbjct: 90 TTSRKVPCSSNLCDLQNA-----CRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSA 144
Query: 119 NGSVFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+ P+ FGCG Q G +P G+LGLG S+ S L GL N C
Sbjct: 145 QSKIVTAPIMFGCGQVQTGSFLGSAAP---NGLLGLGMDSKSVPSLLASKGLAANSFSMC 201
Query: 177 IGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGP-AELLYSGKSCGLK----DL 231
G +G G + GD SS TP L Y P + +G + G K +
Sbjct: 202 FGDDGHGRINFGD--TGSSDQKETP-------LNVYKQNPYYNITITGITVGSKSISTEF 252
Query: 232 TLIFDSGASYAYFTSRVYQEIVS 254
+ I DSG S+ + +Y +I S
Sbjct: 253 SAIVDSGTSFTALSDPMYTQITS 275
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp. melo]
Length = 412
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 81/291 (27%), Positives = 119/291 (40%), Gaps = 40/291 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKP-PEKQYKPHKNIVPCSNPR 73
V+LTVG PP+ DTGS+L+W+ C T P Y P +PCS+P
Sbjct: 40 LTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPLSSSSYSP----IPCSSPV 95
Query: 74 C--AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C PNP C P C + Y D S G L +D F GS FGC
Sbjct: 96 CRTRTRDLPNPVTCD-PKKLCHAIVSYADASSLEGNLASD----NFRIGSSALPGTLFGC 150
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDG 190
+ + T G++G+ RG +S V+QL GL + +CI G++ GVL GD
Sbjct: 151 MDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQL---GLPK--FSYCISGRDSSGVLLFGDS 205
Query: 191 KVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------------I 234
+ G + +TP++Q S L ++ + G G K L L +
Sbjct: 206 HLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTM 265
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD----KTLPICWRGP 281
DSG + + VY + + + G L + + +C+R P
Sbjct: 266 VDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVP 316
>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
Length = 513
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 82/263 (31%), Positives = 113/263 (42%), Gaps = 42/263 (15%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPE------KQYKPHKN 65
F ++AV + +G P F DTGSDL WV CD C C + P Y P ++
Sbjct: 97 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CLKCAPLQSPNYGSLKFDVYSPAQS 153
Query: 66 I----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRF--S 118
VPCS+ C + C+ ++ C Y I+Y D SS G LV D+ L +
Sbjct: 154 TTSRKVPCSSNLCDLQN-----ACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSA 208
Query: 119 NGSVFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+ P+ FGCG Q G +P G+LGLG S+ S L GL N C
Sbjct: 209 QSKIVTAPIMFGCGQVQTGSFLGSAAP---NGLLGLGMDSKSVPSLLASKGLAANSFSMC 265
Query: 177 IGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGP-AELLYSGKSCGLK----DL 231
G +G G + GD SS TP L Y P + +G + G K +
Sbjct: 266 FGDDGHGRINFGD--TGSSDQKETP-------LNVYKQNPYYNITITGITVGSKSISTEF 316
Query: 232 TLIFDSGASYAYFTSRVYQEIVS 254
+ I DSG S+ + +Y +I S
Sbjct: 317 SAIVDSGTSFTALSDPMYTQITS 339
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 73/252 (28%), Positives = 116/252 (46%), Gaps = 24/252 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT-GCTKPPEKQYKPHKNI----VPCSN 71
+AV + +G P K F FDTGSDLTW QC+ PC+ GC ++++ P K+ + CS+
Sbjct: 132 YAVTVGLGTPKKDFSLLFDTGSDLTWTQCE-PCSGGCFPQNDEKFDPTKSTSYKNLSCSS 190
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C ++ + C N C Y ++YG G ++G L T+ + S+ VF GC
Sbjct: 191 EPCKSIGKESAQGCSSSN-SCLYGVKYGT-GYTVGFLATETLTITPSD--VFE-NFVIGC 245
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G + N G S TAG+LGLGR +++ SQ +N+ +C+ + L G
Sbjct: 246 G--ERNGGRFS--GTAGLLGLGRSPVALPSQTSS--TYKNLFSYCLPASSSSTGHLSFGG 299
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYAYFTS 246
S +TP+ +L Y L + + G+ + + I DSG + Y S
Sbjct: 300 GVSQAAKFTPITSKIPEL--YGLDVSGISVGGRKLPIDPSVFRTAGTIIDSGTTLTYLPS 357
Query: 247 RVYQEIVSLIMR 258
+ + S
Sbjct: 358 TAHSALSSAFQE 369
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 94/326 (28%), Positives = 141/326 (43%), Gaps = 44/326 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG--CTKPPEKQYKPHKN----IVPCS 70
+ + L++G PP + DTGSDL W QC APC+G C P Y P + ++PC+
Sbjct: 92 YLMTLSIGTPPLSYPAIADTGSDLIWTQC-APCSGDQCFAQPAPLYNPASSTTFGVLPCN 150
Query: 71 N--PRCAA-LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP- 126
+ CA L PP P C Y YG G ++ G ++ F + VP
Sbjct: 151 SSLSMCAGVLAGKAPP----PGCACMYNQTYGTGWTA-GVQGSETFTFGSAAADQARVPG 205
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
+ FGC N +AG++GLGRG +S+VSQL G + N L
Sbjct: 206 IAFGC----SNASSSDWNGSAGLVGLGRGSLSLVSQLGA-GRFSYCLTPFQDTNSTSTLL 260
Query: 187 LG-DGKVPSSGVAWTPMLQNSA----------DLKHYILGPAELLYSGKSCGLK-DLT-- 232
LG + +GV TP + + A +L LG L S + LK D T
Sbjct: 261 LGPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGG 320
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
LI DSG + + YQ++ + + + L+ P D L +C+ P T
Sbjct: 321 LIIDSGTTITSLVNAAYQQVRAAV-QSLVTLPAIDGSDSTGLDLCYALP-------TPTS 372
Query: 293 KPLAL-SFTNRRNSVRLVVPPEAYLV 317
P A+ S T + +V+P ++Y++
Sbjct: 373 APPAMPSMTLHFDGADMVLPADSYMI 398
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 95/336 (28%), Positives = 130/336 (38%), Gaps = 46/336 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V L VG P + DTGSDL W QC APC C P + +PC
Sbjct: 84 YLVRLAVGTPRRPVALTLDTGSDLVWTQC-APCRDCFDQDLPVLDPAASSTYAALPCGAA 142
Query: 73 RCAALHWPN-PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG---SVFNVPLT 128
RC AL + + R + C Y YGD ++G + TD F S G S+ LT
Sbjct: 143 RCRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRRLT 202
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG---QNGRGVL 185
FGCG+ N G +T G+ G GRGR S+ SQL +C ++ ++
Sbjct: 203 FGCGH--LNKGVFQSNET-GIAGFGRGRWSLPSQLNV-----TSFSYCFTSMFESKSSLV 254
Query: 186 FLGD------GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL-------- 231
LG S V TP+L+N + Y L G S G L
Sbjct: 255 TLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLS-----LKGISVGKTRLPVPETKFR 309
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
+ I DSGAS VY E V +G P + L +C+ P AL +
Sbjct: 310 STIIDSGASITTLPEEVY-EAVKAEFAAQVGLPPS-GVEGSALDLCFALPVTAL-----W 362
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLVISVSTSIIII 327
+P S T +P Y+ + ++ I
Sbjct: 363 RRPAVPSLTLHLEGADWELPRSNYVFEDLGARVMCI 398
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 78/273 (28%), Positives = 115/273 (42%), Gaps = 27/273 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P F FDTGSD TWVQC PC C + E + P K+ + C++
Sbjct: 165 YVVPIRLGTPAARFTVVFDTGSDTTWVQCQ-PCVAYCYQQKEPLFTPTKSATYANISCTS 223
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C+ L + C C Y ++YGDG ++G D L + F FGC
Sbjct: 224 SYCSDL---DTRGCS--GGHCLYAVQYGDGSYTVGFYAQDTLTLGYDTVKDFR----FGC 274
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
G + N G AG++GLGRG+ S+ Q Y V +CI +G G L G
Sbjct: 275 G--EKNRGLFG--KAAGLMGLGRGKTSVPVQ--AYDKYSGVFAYCIPATSSGTGFLDFGP 328
Query: 190 GKVPSSGVAWTPMLQNSADLKHYI----LGPAELLYSGKSCGLKDLTLIFDSGASYAYFT 245
G ++ TPML ++ +Y+ + L S + D + DSG
Sbjct: 329 GAPAAANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITRLP 388
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
Y+ + S + + G K AP L C+
Sbjct: 389 PSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCY 421
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 118/265 (44%), Gaps = 44/265 (16%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALHWPNPPRCKHP-- 88
DTGSD+TW+QCD PC C K + ++P + +PC++ C L H
Sbjct: 6 DTGSDITWIQCD-PCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQ-----SFSHSCL 59
Query: 89 NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCGYNQHNPGPLSPPDTA 147
N C+Y + YGD ++ G + LR + + +VP FGCG+ N G + A
Sbjct: 60 NSSCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGH--ANKGLFN--GAA 115
Query: 148 GVLGLGRGRISIVSQLR-EYGLIRNVIGHCIGQNG----RGVLFLGDGKVPSSGVAWTPM 202
G++GLG+ I +Q +G V +C+ G+L G+ + V +TP+
Sbjct: 116 GLMGLGKSSIGFPAQTSVAFG---KVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPL 172
Query: 203 LQNSADLKHYILGPAELLYSGKSCGLKD------LTLIFDSGASYAYFTSRVYQEIVSLI 256
+ +S+ GP++ S + D T++ DSG + F Y+ +
Sbjct: 173 VDSSS-------GPSQYFVSMTGINVGDELLPISATVMVDSGTVISRFEQSAYERLRDAF 225
Query: 257 MRDLIG--TPLKLAPDDKTLPICWR 279
+ L G T + +AP D C+R
Sbjct: 226 TQILPGLQTAVSVAPFDT----CFR 246
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 89/289 (30%), Positives = 131/289 (45%), Gaps = 39/289 (13%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSN 71
P + F +N ++G+PP DTGS LTWV C PC+ C++ + P K+ SN
Sbjct: 88 PRYVVFLMNFSIGEPPIPQLAVMDTGSSLTWVMCH-PCSSCSQQSVPIFDPSKS-STYSN 145
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C+ + +C N +C Y +EY GSS G + L + S+ VP L FG
Sbjct: 146 LSCSECN-----KCDVVNGECPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFG 200
Query: 131 CGYN---QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV--- 184
CG N P + GV GLG GR S+ L +G +CIG N R
Sbjct: 201 CGRKFSISSNGYPYQGIN--GVFGLGSGRFSL---LPSFG---KKFSYCIG-NLRNTNYK 251
Query: 185 ---LFLGDGKVPSSGVAWTPMLQNS---ADLKHYILGPAEL-----LYSGKSCGLKDLTL 233
L LGD K G + T + N +L+ +G +L L+ +S + +
Sbjct: 252 FNRLVLGD-KANMQGDSTTLNVINGLYYVNLEAISIGGRKLDIDPTLFE-RSITDNNSGV 309
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP--ICWRG 280
I DSGA + + T + E++S + +L+ L LA DK P +C+ G
Sbjct: 310 IIDSGADHTWLTKYGF-EVLSFEVENLLEGVLVLAQQDKHNPYTLCYSG 357
>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 75/286 (26%), Positives = 108/286 (37%), Gaps = 41/286 (14%)
Query: 9 FFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI-- 66
FF Y N+T+G P + F DTGSDL W+ C+ T Q + H N
Sbjct: 105 LFFNYLHY--ANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGETHMNAQR 162
Query: 67 ----------------VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALV 109
V C++ CA + RC P C Y I Y G S G LV
Sbjct: 163 IRLNIYNPSISTSSSKVTCNSTLCALRN-----RCISPLSDCPYRIRYLSPGSKSTGVLV 217
Query: 110 TDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI 169
D+ + G + +TFGC Q G G++GL I++ + L + G+
Sbjct: 218 EDVIHMSTEEGEARDARITFGCSETQ--LGLFQEVAVNGIMGLAMADIAVPNMLVKAGVA 275
Query: 170 RNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK 229
+ C G NG+G + GD SS TP+ + L + + GK
Sbjct: 276 SDSFSMCFGPNGKGTISFGDKG--SSDQHETPLGGTISPLFYDV--SITKFKVGKVTVET 331
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP 275
+ IFDSG + + Y + T L+ D+ LP
Sbjct: 332 KFSAIFDSGTAVTWLLDPYYTALT---------TNFHLSVPDRRLP 368
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 95/342 (27%), Positives = 135/342 (39%), Gaps = 42/342 (12%)
Query: 4 SWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH 63
S IE P F + L +G PP+ + DTGSDL W QC PCT C + P
Sbjct: 84 SEIEAPVLPGNGEFLMKLAIGTPPETYSAILDTGSDLIWTQCK-PCTQCFHQSTPIFDPK 142
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
K+ + L P N+ C+Y YGD S+ G L ++ L F SV
Sbjct: 143 KSSSFSKLSCSSQLCEALPQ--SSCNNGCEYLYSYGDYSSTQGILASE--TLTFGKASVP 198
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE----YGLIRNVIGHCIGQ 179
NV FGCG + G AG++GLGRG +S+VSQL+E Y L +
Sbjct: 199 NV--AFGCGADNEGSG---FSQGAGLVGLGRGPLSLVSQLKEPKFSYCLTT------VDD 247
Query: 180 NGRGVLFLG---DGKVPSSGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLK 229
L +G SS + TP++ + A Y L G L + L+
Sbjct: 248 TKTSTLLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQ 307
Query: 230 DL---TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
D LI DSG + Y + +V+ I P+ + L +C+ P G
Sbjct: 308 DDGSGGLIIDSGTTITYLEESAFN-LVAKEFTAKINLPVD-SSGSTGLDVCFTLPS---G 362
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIA 328
L F + L +P E Y++ S + +A
Sbjct: 363 STNIEVPKLVFHF----DGADLELPAENYMIGDSSMGVACLA 400
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 87/327 (26%), Positives = 142/327 (43%), Gaps = 47/327 (14%)
Query: 12 PIFSY---FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI-- 66
PI +Y + + L +G PP DTGSDL WVQC PC GC + P K+
Sbjct: 56 PINAYIGQYLMELYIGTPPIKISGTVDTGSDLIWVQC-VPCLGCYNQINPMFDPLKSSTY 114
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+ C +P C + P C P +CDY Y D + G L + L + G +
Sbjct: 115 TNISCDSPLC---YKPYIGECS-PEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPIS 170
Query: 125 VP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL------REYG-----LIRNV 172
+ + FGCG+N N G + + G++GLG G S+VSQ+ +++ + ++
Sbjct: 171 LQGILFGCGHN--NTGNFNDHE-MGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDI 227
Query: 173 IGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY---ILG-PAELLYSGKSCGL 228
G+G LG+ GV TP++Q D+ Y +LG E Y + +
Sbjct: 228 TISSQMSFGKGSEVLGE------GVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTI 281
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL--PICWRGPFKALG 286
+ ++ DSG ++Y + + + PL+ DD +L +C+R G
Sbjct: 282 EKGNMLVDSGTPPNILPQQLYDRVYVEVKNKV---PLEPITDDPSLGPQLCYRTQTNLKG 338
Query: 287 -QVTEYFKPLALSFTNRRNSVRLVVPP 312
+T +F+ L T ++ +PP
Sbjct: 339 PTLTYHFEGANLLLT----PIQTFIPP 361
>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza sativa Japonica Group]
Length = 732
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 82/263 (31%), Positives = 113/263 (42%), Gaps = 42/263 (15%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPE------KQYKPHKN 65
F ++AV + +G P F DTGSDL WV CD C C + P Y P ++
Sbjct: 97 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQS 153
Query: 66 I----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRF--S 118
VPCS+ C + C+ ++ C Y I+Y D SS G LV D+ L +
Sbjct: 154 TTSRKVPCSSNLCDLQN-----ACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSA 208
Query: 119 NGSVFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+ P+ FGCG Q G +P G+LGLG S+ S L GL N C
Sbjct: 209 QSKIVTAPIMFGCGQVQTGSFLGSAAP---NGLLGLGMDSKSVPSLLASKGLAANSFSMC 265
Query: 177 IGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGP-AELLYSGKSCGLK----DL 231
G +G G + GD SS TP L Y P + +G + G K +
Sbjct: 266 FGDDGHGRINFGD--TGSSDQKETP-------LNVYKQNPYYNITITGITVGSKSISTEF 316
Query: 232 TLIFDSGASYAYFTSRVYQEIVS 254
+ I DSG S+ + +Y +I S
Sbjct: 317 SAIVDSGTSFTALSDPMYTQITS 339
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 82/297 (27%), Positives = 127/297 (42%), Gaps = 37/297 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC--------TKPPEKQYKPHKNIVP 68
+ V + +G P K F DTGS L+W+QC C T K YK +P
Sbjct: 113 YYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKA----LP 168
Query: 69 CSNPRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
CS+ +C++L N P C + C Y+ YGD SIG L D+ L S +
Sbjct: 169 CSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSEAP--SSG 226
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQNG---- 181
+GCG Q N G ++G++GL +IS++ QL ++YG N +C+ +
Sbjct: 227 FVYGCG--QDNQGLFG--RSSGIIGLANDKISMLGQLSKKYG---NAFSYCLPSSFSAPN 279
Query: 182 ----RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTL 233
G L +G + SS +TP+++N Y L + +GK G+ ++
Sbjct: 280 SSSLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPT 339
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
I DSG VY + + ++ AP L C++G K + V E
Sbjct: 340 IIDSGTVITRLPVAVYNALKKSFVL-IMSKKYAQAPGFSILDTCFKGSVKEMSTVPE 395
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 61/194 (31%), Positives = 88/194 (45%), Gaps = 16/194 (8%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
+YF V + +G P + FDTGSDLTW QC+ C K + + P K+ + C+
Sbjct: 144 NYFVV-VGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAIFDPSKSTSYSNITCT 202
Query: 71 NPRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
+ C L N P C C Y I+YGD S+G + + ++ V N
Sbjct: 203 STLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERLSVTATD-IVDN--FL 259
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG 188
FGCG Q+N G +AG++GLGR IS V Q + R + +C+ L
Sbjct: 260 FGCG--QNNQGLFG--GSAGLIGLGRHPISFVQQTA--AVYRKIFSYCLPATSSSTGRLS 313
Query: 189 DGKVPSSGVAWTPM 202
G +S V +TP
Sbjct: 314 FGTTTTSYVKYTPF 327
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 84/337 (24%), Positives = 136/337 (40%), Gaps = 43/337 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPC 69
+ F V + +G PP+ DTGSDLTW+Q + PC C + + + P K N + C
Sbjct: 22 YGEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSE-PCRACFEQADPIFDPSKSSTYNKIAC 80
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
S+ CA L C + C Y YGDG + G + + G + F
Sbjct: 81 SSSACADLLGTQ--TCSAAAN-CIYAYGYGDGSVTRGYFSKETITATDTAGE----EVKF 133
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNGRGV 184
G + +N G G+LGLG+G +S+ SQL ++ N +C+ +
Sbjct: 134 GA--SVYNTGTFGDTGGEGILGLGQGPVSMPSQLGS--VLGNKFSYCLVDWLSAGSETST 189
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LI 234
++ GD VPS V +TP++ N+ +Y + + G + I
Sbjct: 190 MYFGDAAVPSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTI 249
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG + Y V+ +V+ + P + L RG P
Sbjct: 250 IDSGTTITYLQQEVFNALVAAYTSQ-VRYPTTTSATGLDLCFNTRGT----------GSP 298
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYLT 331
+ + T + V L + P A IS+ T+II +A+ +
Sbjct: 299 VFPAMTIHLDGVHLEL-PTANTFISLETNIICLAFAS 334
>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 520
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 76/264 (28%), Positives = 108/264 (40%), Gaps = 34/264 (12%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE----------KQYKPHKNI---- 66
+ +G P F D GSDL W+ CD C C +Y P +++
Sbjct: 100 IDIGTPSTSFLVALDAGSDLLWIPCD--CVQCAPLSSSYYSNLDRDLNEYSPSRSLSSKH 157
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLR----FSNGS 121
+ CS+ C CK QC Y + Y + SS G LV D+ L+ SN S
Sbjct: 158 LSCSHQLC-----DKGSNCKSSQQQCPYMVSYLSENTSSSGLLVEDILHLQSGGSLSNSS 212
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
V P+ GCG Q G L G+LGLG G S+ S L + GLI + C ++
Sbjct: 213 V-QAPVVLGCGMKQSG-GYLDGVAPDGLLGLGPGESSVPSFLAKSGLIHDSFSLCFNEDD 270
Query: 182 RGVLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGA 239
G +F GD G ++ P+ YI+G E G SC + + DSG
Sbjct: 271 SGRIFFGDQGPTIQQSTSFLPL---DGLYSTYIIG-VESCCVGNSCLKMTSFKVQVDSGT 326
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGT 263
S+ + VY I + + G+
Sbjct: 327 SFTFLPGHVYGAIAEEFDQQVNGS 350
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 98/325 (30%), Positives = 137/325 (42%), Gaps = 42/325 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V L VG P + DTGSDL W+QC PC C K + + P + +PC +
Sbjct: 129 YF-VRLGVGTPARSLFMVVDTGSDLPWLQCQ-PCKSCYKQADPIFDPRNSSSFQRIPCLS 186
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C AL + + +C Y++ YGDG S+G +DLF L + + + + FGC
Sbjct: 187 PLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKA---MSVAFGC 243
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL---REYGLIRNVIGHCIGQNGR------ 182
G++ AG+LGLG G++S SQ+ N +C+
Sbjct: 244 GFDNEG----LFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSS 299
Query: 183 GVLFLGDGKVPSSGVAWTPMLQN-SADLKHYI------LGPAELLYSGKSCGLKDL---T 232
L G +PS+ A +P+L+N D +Y +G A+L S KS L
Sbjct: 300 SSLIFGAAAIPSTA-ALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGG 358
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
+I DSG S F + VY I T L AP C+ KA V
Sbjct: 359 VIIDSGTSVTRFPTSVYATIRDAFRN--ATTNLPSAPRYSLFDTCYNFSGKASVDV---- 412
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLV 317
L L F N L +PP YL+
Sbjct: 413 PALVLHF---ENGADLQLPPTNYLI 434
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 97/325 (29%), Positives = 137/325 (42%), Gaps = 42/325 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V L +G P + DTGSDL W+QC PC C K + + P + +PC +
Sbjct: 54 YF-VRLGLGTPARSLFMVVDTGSDLPWLQCQ-PCKSCYKQADPIFDPRNSSSFQRIPCLS 111
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C AL + + +C Y++ YGDG S+G +DLF L + + + + FGC
Sbjct: 112 PLCKALEVHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKA---MSVAFGC 168
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL---REYGLIRNVIGHCIGQNGR------ 182
G++ AG+LGLG G++S SQ+ N +C+
Sbjct: 169 GFDNEG----LFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSS 224
Query: 183 GVLFLGDGKVPSSGVAWTPMLQN-SADLKHYI------LGPAELLYSGKSCGLKDL---T 232
L G +PS+ A +P+L+N D +Y +G A+L S KS L
Sbjct: 225 SSLIFGVAAIPST-AALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGG 283
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYF 292
+I DSG S F + VY I I P AP C+ KA V
Sbjct: 284 VIIDSGTSVTRFPTSVYATIRDAFRNATINLP--SAPRYSLFDTCYNFSGKASVDV---- 337
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLV 317
L L F N L +PP YL+
Sbjct: 338 PALVLHF---ENGADLQLPPTNYLI 359
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 72/271 (26%), Positives = 111/271 (40%), Gaps = 48/271 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V L +G P DTGSD++W+QC PC C + P + +PC++
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQC-VPCKDCVPALRPPFNPRHSSSFFKLPCASS 196
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLF-----------PLRFSNGS 121
C ++ P C C + I+YGDG S G L + P++ SN
Sbjct: 197 TCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSN-- 254
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-- 179
+T GC P +G+LG+ R IS SQL HC
Sbjct: 255 -----ITLGCADIDREGLPTG---ASGLLGMDRRPISFPSQLSSR--YARKFSHCFPDKI 304
Query: 180 ---NGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG-------PAELLYSGKS 225
N G++F G+ + S + +TP++QN SA L +Y +G + L S K+
Sbjct: 305 AHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKN 364
Query: 226 CGLKDLT----LIFDSGASYAYFTSRVYQEI 252
+ +T I DSG ++ Y +Q +
Sbjct: 365 FDIDKVTGSGGTIIDSGTAFTYLKKPAFQAM 395
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 83/303 (27%), Positives = 133/303 (43%), Gaps = 45/303 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +N+++G PP DTGSDL W QC+ PC C + + P ++ V CS+
Sbjct: 86 YLMNISIGTPPVPILAIADTGSDLIWTQCN-PCEDCYQQTSPLFDPKESSTYRKVSCSSS 144
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF--- 129
+C AL C + C Y I YGD + G + D + GS P++
Sbjct: 145 QCRALE---DASCSTDENTCSYTITYGDNSYTKGDVAVDTVTM----GSSGRRPVSLRNM 197
Query: 130 --GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---------- 177
GCG+ N G P +G++GLG G S+VSQLR+ I +C+
Sbjct: 198 IIGCGH--ENTGTFDPA-GSGIIGLGGGSTSLVSQLRKS--INGKFSYCLVPFTSETGLT 252
Query: 178 -----GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT 232
G NG + GDG V +S V P +L+ +G ++ ++ G +
Sbjct: 253 SKINFGTNG---IVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGN 309
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR--GPFKALGQVTE 290
++ DSG + S Y E+ S++ + ++ D L +C+R FK + +T
Sbjct: 310 IVIDSGTTLTLLPSNFYYELESVVASTIKAE--RVQDPDGILSLCYRDSSSFK-VPDITV 366
Query: 291 YFK 293
+FK
Sbjct: 367 HFK 369
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium distachyon]
Length = 450
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 84/273 (30%), Positives = 115/273 (42%), Gaps = 37/273 (13%)
Query: 5 WIEFFFFPIFSYFAV-------NLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT-GCTKPP 56
W+ P+ S +V L +G P + D+GS LTW+QC APC C
Sbjct: 89 WVAASSVPLASGASVGVGNYITRLGLGTPTTTYVMVVDSGSSLTWLQC-APCAVSCHPQA 147
Query: 57 EKQYKPHKN----IVPCSNPRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVT 110
Y P + VPCS P+CA L NP C + C Y+ YGDG S G L
Sbjct: 148 GPLYDPRASSTYAAVPCSAPQCAELQAATLNPSSCSG-SGVCQYQASYGDGSFSFGYLSK 206
Query: 111 DLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR 170
D L S+GS +GCG Q N G AG++GL R ++S++SQL +
Sbjct: 207 DTVSLS-SSGSFPG--FYYGCG--QDNVGLFG--RAAGLIGLARNKLSLLSQLAPS--VG 257
Query: 171 NVIGHCI---GQNGRGVLFLG---DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK 224
N +C+ G L G D K P ++T M+ +S D Y + A + +G
Sbjct: 258 NSFAYCLPTSAAASAGYLSFGSNSDNKNPGK-YSYTSMVSSSLDASLYFVSLAGMSVAGS 316
Query: 225 -----SCGLKDLTLIFDSGASYAYFTSRVYQEI 252
S L I DSG + VY +
Sbjct: 317 PLAVPSSEYGSLPTIIDSGTVITRLPTPVYTAL 349
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 90/294 (30%), Positives = 128/294 (43%), Gaps = 42/294 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V+L +G PP+ DTGSDL W QC PC C P + ++ ++PC +
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCK-PCVSCFDQPLPYFDTSRSSTNALLPCEST 93
Query: 73 RCAALHWPNPPRCKHPN---DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LT 128
+C P C N C Y YGD +IG L D F F G+ ++P +T
Sbjct: 94 QCKL--DPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKF--TFVAGT--SLPGVT 147
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG 188
FGCG N N G + +T G+ G GRG +S+ SQL+ G + G VL
Sbjct: 148 FGCGLN--NTGVFNSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTTITGAIPSTVLLDL 203
Query: 189 DGKVPSSG---VAWTPMLQ---NSAD-------LKHYILGPAELLYSGKSCGLKDLT--L 233
+ S+G V TP++Q N A+ LK +G L + L + T
Sbjct: 204 PADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGT 263
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGPFKA 284
I DSG S +VYQ ++RD +KL P + T C+ P +A
Sbjct: 264 IIDSGTSITSLPPQVYQ-----VVRDEFAAQIKLPVVPGNATGHYTCFSAPSQA 312
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 115/275 (41%), Gaps = 39/275 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKP-PEKQYKPHKNIVPCSNPR 73
V+LTVG PP+ DTGS+L+W+ C T P Y P +PCS+P
Sbjct: 1000 LTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPLSSSSYSP----IPCSSPI 1055
Query: 74 C--AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C PNP C P C + Y D S G L +D F + GS FGC
Sbjct: 1056 CRTRTRDLPNPVTCD-PKKLCHAIVSYADASSLEGNLASDNFRI----GSSALPGTLFGC 1110
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDG 190
+ + T G++G+ RG +S V+QL GL + +CI G++ GVL GD
Sbjct: 1111 MDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQL---GLPK--FSYCISGRDSSGVLLFGDL 1165
Query: 191 KVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------------I 234
+ G + +TP++Q S L ++ + G G K L L +
Sbjct: 1166 HLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTM 1225
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAP 269
DSG + + VY + + + G LAP
Sbjct: 1226 VDSGTQFTFLLGPVYTALRNEFLEQTKGV---LAP 1257
>gi|213998828|gb|ACJ60781.1| nucellin [Hordeum brachyantherum subsp. californicum]
Length = 133
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 52/129 (40%), Positives = 73/129 (56%), Gaps = 7/129 (5%)
Query: 140 PLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVLFLGDGKVPSSGVA 198
P SP D G+LGLG G+ QL+ +I NVIGHC+ G+GVL++GD PS GV
Sbjct: 5 PPSPVD--GILGLGMGKAGFAVQLKGQKMITGNVIGHCLSSQGKGVLYVGDFNPPSRGVT 62
Query: 199 WTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYFTSRVYQEIVSLIM 257
W PM ++ L +Y G AE L + G +FDSG++Y + ++VY EIVS +
Sbjct: 63 WVPMKES---LFYYSPGLAEPLIDNQPIRGNPTFEAVFDSGSTYTHVPAQVYNEIVSKVR 119
Query: 258 RDLIGTPLK 266
L + L+
Sbjct: 120 GTLSESSLE 128
>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 543
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 76/263 (28%), Positives = 108/263 (41%), Gaps = 35/263 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCD---------APCTGCTKPPEKQYKPHKNI 66
Y+A + +G P F DTGSDL WV CD A TG P + Y P ++
Sbjct: 108 YYA-EVELGTPNATFLVALDTGSDLFWVPCDCRQCATIPSANGTGQDAPSLRPYSPRRSS 166
Query: 67 ----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSN-- 119
V C NP C + + N C YE++Y SS G LV D+ L
Sbjct: 167 TSKQVACDNPLCGQRNGCS----AATNGSCPYEVQYVSANTSSSGVLVQDVLHLTRERPG 222
Query: 120 ----GSVFNVPLTFGCGYNQHNP---GPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RN 171
G P+ FGCG Q G D G++GLG G++S+ S L GL+ +
Sbjct: 223 PGAAGEALQAPVVFGCGQVQTGAFLDGGGGAVD--GLMGLGMGKVSVPSALAASGLVASD 280
Query: 172 VIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL 231
C G +G G + GD S G A TP S + + + + G +
Sbjct: 281 SFSMCFGDDGVGRVNFGDAG--SRGQAETPFTVRSLNPTYNV--SFTSIGVGSESVAAEF 336
Query: 232 TLIFDSGASYAYFTSRVYQEIVS 254
+ DSG S+ Y + Y ++ +
Sbjct: 337 AAVMDSGTSFTYLSDPEYTQLAT 359
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 75/272 (27%), Positives = 120/272 (44%), Gaps = 27/272 (9%)
Query: 34 FDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALHWP--NPPRCKH 87
DTGS L+W+QC C + Y P + + C++ C+ L N P C+
Sbjct: 3 LDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLCET 62
Query: 88 PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCGYNQHNPGPLSPPDT 146
++ C Y YGD SIG L DL L S +P T+GCG Q N G
Sbjct: 63 DSNACLYTASYGDTSFSIGYLSQDLLTLTSSQ----TLPQFTYGCG--QDNQGLFG--RA 114
Query: 147 AGVLGLGRGRISIVSQLR-EYGLIRNVIGHCI---GQNGRGVLFLGDGKVPSSGVAWTPM 202
AG++GL R ++S+++QL +YG + +C+ G FL G + + +TPM
Sbjct: 115 AGIIGLARDKLSMLAQLSTKYG---HAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPM 171
Query: 203 LQNSADLKHYILGPAELLYSGK----SCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMR 258
L +S + Y L + SG+ + + + + DSG +Y + ++
Sbjct: 172 LTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLPMSMYAALRQAFVK 231
Query: 259 DLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
++ T AP L C++G K++ V E
Sbjct: 232 -IMSTKYAKAPAYSILDTCFKGSLKSISAVPE 262
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 78/290 (26%), Positives = 118/290 (40%), Gaps = 41/290 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQC-DAPCTGCTKPP--EKQYKPHKNIVPCSNPR 73
V L VG PP+ DTGS+L+W+ C +P G P Y P VPCS+P
Sbjct: 61 LTVTLAVGSPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSP----VPCSSPI 116
Query: 74 C--AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C P P C C I Y D S G L D F + GSV FGC
Sbjct: 117 CRTRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVI----GSVTRPGTLFGC 172
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDG 190
+ + + G++G+ RG +S V+QL G + +CI G + G+L LGD
Sbjct: 173 MDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQL---GFSK--FSYCISGSDSSGILLLGDA 227
Query: 191 KVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------------I 234
G + +TP++ + L ++ + G G K L+L +
Sbjct: 228 SYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTM 287
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-----DKTLPICWR 279
DSG + + VY + + + + L++ D T+ +C+R
Sbjct: 288 VDSGTQFTFLMGPVYTALKNEFIAQ-TKSVLRIVDDPNFVFQGTMDLCYR 336
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 74/273 (27%), Positives = 110/273 (40%), Gaps = 28/273 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C K E + P K+ V C++
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVSCTDS 222
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
CA L + C C Y ++YGDG ++G D + F FGCG
Sbjct: 223 ACADL---DTNGCT--GGHCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFR----FGCG 273
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFLGDG 190
+ N G TAG++GLGRG+ S+ Q Y +C+ G G L G G
Sbjct: 274 --EKNNGLFG--KTAGLMGLGRGKTSLTVQ--AYNKYGGAFAYCLPALTTGTGYLDFGPG 327
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAYFT 245
+ TPML + +Y+ G + G+ + + + DSG
Sbjct: 328 SA-GNNARLTPMLTDKGQTFYYV-GMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLP 385
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
+ Y + S + ++ K AP L C+
Sbjct: 386 ATAYTALSSAFDKVMLARGYKKAPGYSILDTCY 418
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 84/320 (26%), Positives = 128/320 (40%), Gaps = 47/320 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQC-DAPCTGCTKPP--EKQYKPHKNIVPCSNPR 73
V L VG PP+ DTGS+L+W+ C +P G P Y P VPCS+P
Sbjct: 65 LTVTLAVGDPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSP----VPCSSPI 120
Query: 74 C--AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C P P C C I Y D S G L + F + GSV FGC
Sbjct: 121 CRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVI----GSVTRPGTLFGC 176
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDG 190
+ + + G++G+ RG +S V+QL G + +CI G + G L LGD
Sbjct: 177 MDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQL---GFSK--FSYCISGSDSSGFLLLGDA 231
Query: 191 KVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------------I 234
G + +TP++ S L ++ + G G K L+L +
Sbjct: 232 SYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTM 291
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-----DKTLPICW------RGPFK 283
DSG + + VY + + + + L+L D T+ +C+ R F
Sbjct: 292 VDSGTQFTFLMGPVYTALKNEFITQ-TKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFS 350
Query: 284 ALGQVTEYFKPLALSFTNRR 303
L V+ F+ +S + ++
Sbjct: 351 GLPMVSLMFRGAEMSVSGQK 370
>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
Length = 575
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 72/246 (29%), Positives = 102/246 (41%), Gaps = 27/246 (10%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH----KNIVPCSNPRC 74
+ VG P F DTGSDL W+ C+ C C K Y P VPC +P C
Sbjct: 123 AEVEVGTPSSKFLVALDTGSDLFWLPCE--CKLCAKNGSTMYSPSLSSTSKTVPCGHPLC 180
Query: 75 AALHWPNPPRCK---HPNDQCDYEIEY--GDGGSSIGALVTDLFPL----RFSNGSVFNV 125
P C + C YE++Y + GSS G LV D+ L G
Sbjct: 181 E-----RPDACATAGKSSSSCPYEVKYVSANTGSS-GVLVEDVLHLVDGGGGGGGKAVQA 234
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCIGQNGRGV 184
P+ FGCG Q L G++GLG ++S+ S L GL+ + C ++G G
Sbjct: 235 PIVFGCGQVQTG-AFLRGAAAGGLMGLGLDKVSVPSALASSGLVASDSFSMCFSRDGVGR 293
Query: 185 LFLGDGKVPSSGVAWTPML-QNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
+ GD P A TP++ S +Y + + K+ + + T + DSG S+ Y
Sbjct: 294 INFGDAGSPDQ--AETPLIAAGSLQPSYYNISVGAITVDSKAMAV-EFTAVVDSGTSFTY 350
Query: 244 FTSRVY 249
Y
Sbjct: 351 LDDPAY 356
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 77/251 (30%), Positives = 113/251 (45%), Gaps = 31/251 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHK----NIVPCSN 71
F V + G P + + FDTGSD++W+QC PC+G C K + + P K + VPC +
Sbjct: 120 FVVTVGFGTPAQTYTLMFDTGSDVSWIQC-LPCSGHCYKQHDPIFDPTKSATYSAVPCGH 178
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
P+CAA +C N C Y+++YGDG S+ G L + L S +P FG
Sbjct: 179 PQCAAAGG----KCSS-NGTCLYKVQYGDGSSTAGVLSHETLSLT----SARALPGFAFG 229
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG + N G D G++GLGRG++S+ SQ G L +G
Sbjct: 230 CG--ETNLGDFG--DVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSHGYLTIGT- 284
Query: 191 KVPSS---GVAWTPMLQNSADLKHYILGPAELLYSGKSCGL------KDLTLIFDSGASY 241
P+S GV +T M+Q Y + ++ G + +D TL+ DSG
Sbjct: 285 TTPASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDGTLL-DSGTVL 343
Query: 242 AYFTSRVYQEI 252
Y Y +
Sbjct: 344 TYLPPEAYTAL 354
>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 83/317 (26%), Positives = 132/317 (41%), Gaps = 34/317 (10%)
Query: 28 KLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK-------NIVPCSNPRCAALHWP 80
+ +D DTGS T+V PC GC + E + + + C A L
Sbjct: 49 QTYDLIVDTGSARTYV----PCKGCARCGEHAHGYYDYDRSMEFERLDCGEASDATLCEE 104
Query: 81 NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGP 140
+ +C Y + Y +G SS G +V D +R G++ + L FGC + N
Sbjct: 105 TMKGTCQSDGRCSYVVSYAEGSSSRGYVVRD--RVRLGEGTL-SAMLAFGCEEAETNAIY 161
Query: 141 LSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG--DGKVPSS 195
D G+ G GRG ++ +QL GLI NV C+ G NG GVL LG D +
Sbjct: 162 EQKAD--GLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGANG-GVLTLGRFDFGADAP 218
Query: 196 GVAWTPMLQNSAD-LKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVS 254
+A TP++ + A+ H + + L L T DSG ++ + V+ +
Sbjct: 219 ALARTPLVADPANPAFHNVRTSSWKLGDSLIEHLNSYTTTLDSGTTFTFVPRSVWVSFKT 278
Query: 255 LIMRDLIGTPLKL--APDDKTLPICWRGPFKALGQ------VTEYFKPLALSFTNRRNSV 306
+ L++ PD + +C+ A+ V+E+F PL +++ V
Sbjct: 279 RLDTQATQAGLEIVAGPDPQYDDVCYGVSAAAMNMTLSQSTVSEWFPPLTIAY---EGGV 335
Query: 307 RLVVPPEAYLVISVSTS 323
L + PE YL + S
Sbjct: 336 SLTLGPENYLFAHETNS 352
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 80/276 (28%), Positives = 114/276 (41%), Gaps = 43/276 (15%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT--GCTKPP----EKQYKPHKNIVPC 69
YF V L VG P K F DTGSDLTW+QC+ P T + PP +K +PC
Sbjct: 27 YF-VELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPC 85
Query: 70 SNPRCAALHWPNPPRC--KHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS------ 121
++ C L P C K P+ CDY Y D + G L + ++ S
Sbjct: 86 TDDECLFLPAPIGSSCSIKSPS-PCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAGN 144
Query: 122 -------VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIG 174
+ NV L GC L +GVLGLG+G IS+ +Q R L +
Sbjct: 145 HKTRTIRIKNVAL--GCSRESVGASFLG---ASGVLGLGQGPISLATQTRHTAL-GGIFS 198
Query: 175 HCIGQNGRG---VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC----- 226
+C+ RG FL G+ +A TP+++N A Y + + GK
Sbjct: 199 YCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIAS 258
Query: 227 ------GLKDLTLIFDSGASYAYFTSRVYQEIVSLI 256
G + IFDSG + +Y Y +++ +
Sbjct: 259 SDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGAL 294
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 73/234 (31%), Positives = 105/234 (44%), Gaps = 30/234 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ + + G P K FDTGS++ W+QC C E + P ++NI C++
Sbjct: 16 YVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTYRNI-SCTS 74
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L + C C Y + YGDG S++G L T+ F L + G+VFN FGC
Sbjct: 75 AACTGL---SSRGCS--GSTCVYGVTYGDGSSTVGFLATETFTL--AAGNVFN-NFIFGC 126
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G Q+N G + AG++GLGR S+ SQL + N+ +C+ +L G
Sbjct: 127 G--QNNQGLFT--GAAGLIGLGRSPYSLNSQLATS--LGNIFSYCLPSTSSATGYLNIGN 180
Query: 192 VPSSGVAWTPMLQNS-------ADLKHYILGPAELLYSGKSCGLKDLTLIFDSG 238
P +T ML NS DL +G L S S + + I DSG
Sbjct: 181 -PLRTPGYTAMLTNSRAPTLYFIDLIGISVGGTRLALS--STVFQSVGTIIDSG 231
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 86/266 (32%), Positives = 115/266 (43%), Gaps = 43/266 (16%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV--- 67
F YFA ++ VG PP DTGSD+ W+QC PC C + Y P +
Sbjct: 94 FASGEYFA-SVGVGTPPTPALLVIDTGSDVVWLQCK-PCVHCYRQLSPLYDPRGSSTYAQ 151
Query: 68 -PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG-SVFNV 125
PCS P+C NP C C Y I YGD S+ G L TD L FSN SV NV
Sbjct: 152 TPCSPPQCR-----NPQTCDGTTGGCGYRIVYGDASSTSGNLATDR--LVFSNDTSVGNV 204
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRG- 183
T GCG++ N G AG+LG+ RG S +Q+ + YG +C+G R
Sbjct: 205 --TLGCGHD--NEGLFG--SAAGLLGVARGNNSFATQVADSYG---RYFAYCLGDRTRSG 255
Query: 184 -----VLFLGDGKVPSSGVAWTPMLQNS-------ADLKHYILGPAELL-YSGKSCGLKD 230
++F P S V +TP+ N D+ + +G + +S S L
Sbjct: 256 SSSSYLVFGRTAPEPPSSV-FTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDP 314
Query: 231 LT----LIFDSGASYAYFTSRVYQEI 252
T ++ DSG S F Y +
Sbjct: 315 ATGRGGVVVDSGTSITRFARDAYGAL 340
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 91/335 (27%), Positives = 139/335 (41%), Gaps = 55/335 (16%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALH 78
+G P + DTGSDL W QC PC C K + P + VPCS+ C+ L
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLP 231
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCGYNQHN 137
+C + +C Y YGD S+ G L T+ F L S +P + FGCG
Sbjct: 232 T---SKCTSAS-KCGYTYTYGDSSSTQGVLATETFTLAKS-----KLPGVVFGCGDTNEG 282
Query: 138 PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLGD----- 189
G AG++GLGRG +S+VSQL GL + +C + L LG
Sbjct: 283 DG---FSQGAGLVGLGRGPLSLVSQL---GL--DKFSYCLTSLDDTNNSPLLLGSLAGIS 334
Query: 190 -GKVPSSGVAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDL---TLIFDSG 238
+S V TP+++N + LK +G + + ++D +I DSG
Sbjct: 335 EASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSG 394
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALGQVTEYFKPL 295
S Y + Y+ ++ + L D + L +C+R P K + QV L
Sbjct: 395 TSITYLEVQGYRA-----LKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVE--VPRL 447
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYL 330
F + L +P E Y+V+ + + + +
Sbjct: 448 VFHFDGGAD---LDLPAENYMVLDGGSGALCLTVM 479
>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 75/265 (28%), Positives = 107/265 (40%), Gaps = 22/265 (8%)
Query: 4 SWIEFFFFPIFSYFA--VNLTVGKPPKLFDFDFDTGSDLTWVQCD-APCTGCTKPPEKQ- 59
S + + +F Y N++VG P F DTGS+L W+ CD + C + P
Sbjct: 47 SCVSLYSNGLFGYILHYANVSVGTPSVSFLVALDTGSNLLWLPCDCSSCVHSLRSPSGTV 106
Query: 60 ----YKPHKNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVT 110
Y P+ + VPC++ C+ RC C Y++ Y +G S+ G +V
Sbjct: 107 DLNIYSPNTSSTSEKVPCNSTLCSQTQR---DRCPSDQSNCPYQVVYLSNGTSTTGYIVQ 163
Query: 111 DLFPL--RFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGL 168
DL L S + +TFGCG Q L+ G+ GLG IS+ S L G
Sbjct: 164 DLLHLISDDSQSKAVDAKITFGCGKVQTG-SFLTGGAPNGLFGLGMSNISVPSTLAHNGY 222
Query: 169 IRNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL 228
C NG G + GD S+G T Q Y + + G++ L
Sbjct: 223 TSGSFSMCFSPNGIGRISFGDKG--STGQGETSFNQGQPRSSLYNISITQTSIGGQASDL 280
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIV 253
+ IFDSG S+ Y Y I
Sbjct: 281 V-YSAIFDSGTSFTYLNDPAYTLIA 304
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 88/308 (28%), Positives = 131/308 (42%), Gaps = 41/308 (13%)
Query: 12 PIFSYFAVNLTVGKPP-KLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---- 66
P + + +VG PP KL+ DTGSD+ W+QC+ PC C + P K+
Sbjct: 82 PDIGEYLMTYSVGTPPFKLYGI-VDTGSDIVWLQCE-PCQECYNQTTPMFNPSKSSSYKN 139
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
+PC + C ++ C N C+Y YGD S G L D L +NG + P
Sbjct: 140 IPCPSKLCQSME---DTSCNDKN-YCEYSTYYGDNSHSGGDLSVDTLTLESTNGLTVSFP 195
Query: 127 -LTFGCGYNQHNPGPLS-PPDTAGVLGLGRGRISIVSQLR-------EYGLIRNVIGHCI 177
+ GCG N LS ++G++G G G S ++QL Y L I
Sbjct: 196 NIVIGCGTNN----ILSYEGASSGIVGFGSGPASFITQLGSSTGGKFSYCLTPLFSVTNI 251
Query: 178 GQNGRGVLFLGDGK-VPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKD 230
N L GD V GV TP+L+ + +Y+ +G + G G +
Sbjct: 252 QSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGVPNGDNE 311
Query: 231 LTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTLPICWRGPFKALGQ-- 287
+I DSG + T Y + S ++ DL+ L+ D +TL +C+ KA G
Sbjct: 312 GNIIIDSGTTLTSLTKDDYSFLESAVV-DLV--KLERVDDPTQTLNLCYS--VKAEGYDF 366
Query: 288 --VTEYFK 293
+T +FK
Sbjct: 367 PIITMHFK 374
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 87/288 (30%), Positives = 118/288 (40%), Gaps = 32/288 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ V+L +G PP+ DTGSDL W QC PC C + P ++ C +
Sbjct: 82 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLSLTSCDST 140
Query: 73 RCAALHWPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L + K PN C Y YGD + G L D F + SV V FGC
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGV--AFGC 198
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G N G +T G+ G GRG +S+ SQL+ G + G VL
Sbjct: 199 GL--FNNGVFKSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTAVNGLKPSTVLLDLPAD 254
Query: 192 VPSSG---VAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDLT--LIFDSGA 239
+ SG V TP++QN A+ LK +G L LK+ T I DSG
Sbjct: 255 LYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGT 314
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGPFKA 284
+ +RVY+ ++RD +KL + T P C P +A
Sbjct: 315 AMTSLPTRVYR-----LVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRA 357
>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 508
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 84/292 (28%), Positives = 128/292 (43%), Gaps = 40/292 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC----TKPPE-----KQYKPHKNI 66
Y+A N+++G P F DTGSDL W+ C+ CT C TK Y + +
Sbjct: 104 YYA-NVSIGTPGLYFLVALDTGSDLFWLPCE--CTKCPTYLTKRDNGKFWLNHYSSNASS 160
Query: 67 ----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFSNGS 121
VPCS+ C + +C C Y+ Y + SS G LV D+ + +
Sbjct: 161 TSIRVPCSSSLCELAN-----QCSSNKSSCPYQTHYLSENSSSAGYLVQDILHMATDDSQ 215
Query: 122 V--FNVPLTFGCGYNQHNP-GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
+ +V +T GCG Q ++ P+ G++GLG G++S+ S L GL + C G
Sbjct: 216 LKPVDVKVTLGCGKVQTGKFSNVTAPN--GLIGLGMGKVSVPSFLASQGLTTDSFSMCFG 273
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSG 238
G G + GD + G TP N A L Y + +++ + + + LT I DSG
Sbjct: 274 YYGYGRIDFGD--IGPVGQRETPF--NPASLS-YNVTILQIIVTNRPTNVH-LTAIIDSG 327
Query: 239 ASYAYFTSRVYQEIVSLIMRDL-IGTPLKLAPDDKTLPI--CWRGPFKALGQ 287
AS+ Y T Y S+I ++ L+ D P C+R + Q
Sbjct: 328 ASFTYLTDPFY----SIITENMDAAMELERIKSDSDFPFEYCYRLSLATIFQ 375
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 80.5 bits (197), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 76/277 (27%), Positives = 114/277 (41%), Gaps = 27/277 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P + FDTGSDLTW QC+ PC G C K + + P K+ + C++
Sbjct: 46 YVVVVGLGTPKRDLSLVFDTGSDLTWTQCE-PCAGSCYKQQDAIFDPSKSSSYTNITCTS 104
Query: 72 PRCAALHWPN-PPRCKHPND-QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
C L C D C Y+ +YGD +S+G L + + ++ F
Sbjct: 105 SLCTQLTSDGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQERLTITATD---IVDDFLF 161
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFL 187
GCG Q N G + +AG++GLGR ISIV Q + +C+ + G L
Sbjct: 162 GCG--QDNEGLFNG--SAGLMGLGRHPISIVQQTSSN--YNKIFSYCLPATSSSLGHLTF 215
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSG------KSCGLKDLTLIFDSGASY 241
G ++ + +TP+ S D Y L + G S I DSG
Sbjct: 216 GASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTVI 275
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
VY + S R + P +A + L C+
Sbjct: 276 TRLAPTVYAALRSAFRRXMEKYP--VANEAGLLDTCY 310
>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 500
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/252 (30%), Positives = 105/252 (41%), Gaps = 25/252 (9%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
F ++A+ +TVG P + F DTGSDL W+ C C GCT P +P +
Sbjct: 107 FLHYAL-VTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPATAASGSATFYIPGMSST 163
Query: 74 CAALHWPNPPRCKHPND-----QCDYEIEYGDGG-SSIGALVTDLFPLRFSNG--SVFNV 125
A+ N C + QC Y++ Y G SS G LV D+ L N +
Sbjct: 164 SKAVPC-NSNFCDLQKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKA 222
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
+ GCG Q L G+ GLG +S+ S L + GL N C G++G G +
Sbjct: 223 QIMLGCGQTQTG-SFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRI 281
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIFDSGASY 241
GD + SS TP+ N + I SG + G K D IFD+G S+
Sbjct: 282 SFGDQE--SSDQEETPLDINRQHPTYAI------TISGITVGNKPTDMDFITIFDTGTSF 333
Query: 242 AYFTSRVYQEIV 253
Y Y I
Sbjct: 334 TYLADPAYTYIT 345
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 91/316 (28%), Positives = 132/316 (41%), Gaps = 42/316 (13%)
Query: 34 FDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALHWPNPPRCKHPN 89
DTGSDL W QC APC C P + K+ +PC + RCA+L + P C
Sbjct: 1 MDTGSDLIWTQC-APCLLCADQPTPYFDVKKSATYRALPCRSSRCASL---SSPSCFK-- 54
Query: 90 DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFGCGYNQHNPGPLSPPDTAG 148
C Y+ YGD S+ G L + F +N + V + FGCG N G L+ +++G
Sbjct: 55 KMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCG--SLNAGDLA--NSSG 110
Query: 149 VLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR---GVLFLGDGKVPSSG--VAWTPML 203
++G GRG +S+VSQL + + R GV SSG V TP +
Sbjct: 111 MVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFV 170
Query: 204 QNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDSGASYAYFTSRVYQEIV 253
N A Y L + K + L +I DSG S + Y+
Sbjct: 171 INPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEA-- 228
Query: 254 SLIMRDLIGT-PLKLAPD-DKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVP 311
+ R L+ PL D D L C++ P VT L F +S + +
Sbjct: 229 --VRRGLVSAIPLPAMNDTDIGLDTCFQWPPPP--NVTVTVPDLVFHF----DSANMTLL 280
Query: 312 PEAYLVISVSTSIIII 327
PE Y++I+ +T + +
Sbjct: 281 PENYMLIASTTGYLCL 296
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 76/272 (27%), Positives = 118/272 (43%), Gaps = 29/272 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--KPPEKQYKPHK---NIVPCS 70
YF V+L +G PP+ DTGSDL WV+C +PC C+ P + H + + C
Sbjct: 86 YF-VSLRIGTPPQTLLLVADTGSDLIWVKC-SPCRNCSHRSPGSAFFARHSTTYSAIHCY 143
Query: 71 NPRCAALHWPNPPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-L 127
+P+C + P+P C + C Y+ Y D ++ G + L S G V + L
Sbjct: 144 SPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGKVKKLNGL 203
Query: 128 TFGCGYNQHNPG--PLSPPDTAGVLGLGRGRISIVSQL-REYG--LIRNVIGHCIGQNGR 182
+FGCG+ P S GV+GLGR IS SQL R +G ++ + +
Sbjct: 204 SFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYCLMDYTLSPPPT 263
Query: 183 GVLFLGDGK---VPSSGV-AWTPMLQNSADLKHYILGPAELLYSGKSC-------GLKDL 231
L +G + V G+ ++TP+L N Y + + +G + DL
Sbjct: 264 SFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWSIDDL 323
Query: 232 ---TLIFDSGASYAYFTSRVYQEIVSLIMRDL 260
I DSG + + T Y EI+ + +
Sbjct: 324 GNGGTIIDSGTTLTFITEPAYTEILKAFKKRV 355
>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 498
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/252 (30%), Positives = 105/252 (41%), Gaps = 25/252 (9%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
F ++A+ +TVG P + F DTGSDL W+ C C GCT P +P +
Sbjct: 107 FLHYAL-VTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPATAASGSATFYIPGMSST 163
Query: 74 CAALHWPNPPRCKHPND-----QCDYEIEYGDGG-SSIGALVTDLFPLRFSNG--SVFNV 125
A+ N C + QC Y++ Y G SS G LV D+ L N +
Sbjct: 164 SKAVPC-NSNFCDLQKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKA 222
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
+ GCG Q L G+ GLG +S+ S L + GL N C G++G G +
Sbjct: 223 QIMLGCGQTQTG-SFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRI 281
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIFDSGASY 241
GD + SS TP+ N + I SG + G K D IFD+G S+
Sbjct: 282 SFGDQE--SSDQEETPLDINRQHPTYAI------TISGITVGNKPTDMDFITIFDTGTSF 333
Query: 242 AYFTSRVYQEIV 253
Y Y I
Sbjct: 334 TYLADPAYTYIT 345
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 89/313 (28%), Positives = 137/313 (43%), Gaps = 41/313 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP----EKQYKPHKNIVPCSN 71
YF +++++G PP DTGSDLTWVQC PC C K +K+ C +
Sbjct: 85 YF-MSISIGTPPSKVFAIADTGSDLTWVQC-KPCQQCYKQNSPLFDKKKSSTYKTESCDS 142
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FG 130
C AL + C D C Y YGD + G + T+ + S+GS + P T FG
Sbjct: 143 KTCQALS-EHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSFPGTVFG 201
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-----NGRGVL 185
CGYN G +G++GLG G +S+VSQL I +C+ NG V+
Sbjct: 202 CGYNN---GGTFEETGSGIIGLGGGPLSLVSQLGSS--IGKKFSYCLSHTAATTNGTSVI 256
Query: 186 FLGDGKVPS-----SGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLT-- 232
LG +PS S TP++Q + +++ +G +L Y+G GL +
Sbjct: 257 NLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSK 316
Query: 233 ----LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
+I DSG + S Y + + + + G +++ L C++ K +G
Sbjct: 317 RTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAK-RVSDPQGLLTHCFKSGDKEIG-- 373
Query: 289 TEYFKPLALSFTN 301
+ + FTN
Sbjct: 374 ---LPAITMHFTN 383
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 83/294 (28%), Positives = 130/294 (44%), Gaps = 45/294 (15%)
Query: 12 PIFSYFAVNLT---VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK---- 64
PI +Y +L +G PP DTGSDL W+QC APC GC K + + P K
Sbjct: 60 PINAYIGQHLMEIYIGTPPIKITGLVDTGSDLIWIQC-APCLGCYKQIKPMFDPLKSSTY 118
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
N + C +P C H + C P +C+Y YGD + G L D + G +
Sbjct: 119 NNISCDSPLC---HKLDTGVCS-PEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVS 174
Query: 125 VP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGR 182
+ FGCG+N N G + + G++GLG G S++SQ+ +G + C+
Sbjct: 175 LSRFLFGCGHN--NTGGFNDHE-MGLIGLGGGPTSLISQIGPLFGGKK--FSQCL----- 224
Query: 183 GVLFLGDGKVPS------------SGVAWTPMLQNSADLKHYI--LG-PAELLYSGKSCG 227
V FL D K+ S +GV TP++ D +++ LG E Y +
Sbjct: 225 -VPFLTDIKISSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYFPMNST 283
Query: 228 LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL--PICWR 279
+ ++ DSG ++Y ++ + + + LK DD +L +C+R
Sbjct: 284 IGKANMLVDSGTPPILLPQQLYDKVFAEVRNKV---ALKPITDDPSLGTQLCYR 334
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 74/281 (26%), Positives = 118/281 (41%), Gaps = 50/281 (17%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT-------------KPPEKQYK 61
+Y+A + VG P + + DTGSD+ W +C C GC+ + P Y
Sbjct: 87 TYYA-QIGVGHPVQFLNAIVDTGSDILWFKCKL-CQGCSSKKNVIVCSSIIMQGPITLYD 144
Query: 62 PHKNIVP----CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRF 117
P +I CS+P C+ C+ N+ C Y+I Y D SS G D+ L
Sbjct: 145 PELSITASPATCSDPLCS-----EGGSCRGNNNSCAYDISYEDTSSSTGIYFRDVVHL-- 197
Query: 118 SNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
+ + N + GC + P+ G++G GR ++S+ +QL N+ HC+
Sbjct: 198 GHKASLNTTMFLGCATSISGLWPVD-----GIMGFGRSKVSVPNQLAAQAGSYNIFYHCL 252
Query: 178 G--QNGRGVLFLG-DGKVPSSGVAWTPMLQN-----------SADLKHYILGPAELLYSG 223
+ G G+L LG + + P + +TPML N S + K + +E Y+
Sbjct: 253 SGEKEGGGILVLGKNDEFPE--MVYTPMLANDIVYNVKLVSLSVNSKALPIEASEFEYNA 310
Query: 224 KSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTP 264
+ + I DSG S A F S+ V + + P
Sbjct: 311 T---VGNGGTIIDSGTSSATFPSKALALFVKAVSKFTTAIP 348
>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
gi|219887047|gb|ACL53898.1| unknown [Zea mays]
gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 416
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 75/245 (30%), Positives = 100/245 (40%), Gaps = 24/245 (9%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAALHWP 80
+TVG P + F DTGSDL W+ C C GCT P +P + A+
Sbjct: 11 VTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPATAASGSATFYIPGMSSTSKAVPC- 67
Query: 81 NPPRCKHPND-----QCDYEIEYGDGG-SSIGALVTDLFPLRFSNG--SVFNVPLTFGCG 132
N C + QC Y++ Y G SS G LV D+ L N + + GCG
Sbjct: 68 NSNFCDLQKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKAQIMLGCG 127
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
Q L G+ GLG +S+ S L + GL N C G++G G + GD +
Sbjct: 128 QTQTG-SFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISFGDQE- 185
Query: 193 PSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIFDSGASYAYFTSRV 248
SS TP+ N + I SG + G K D IFD+G S+ Y
Sbjct: 186 -SSDQEETPLDINRQHPTYAI------TISGITVGNKPTDMDFITIFDTGTSFTYLADPA 238
Query: 249 YQEIV 253
Y I
Sbjct: 239 YTYIT 243
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 68/222 (30%), Positives = 95/222 (42%), Gaps = 20/222 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + VG PP D+GSD+ WVQC PC C + + P + V C +
Sbjct: 130 YF-VRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSCGS 187
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L +CDY + YGDG + G L + L G + GC
Sbjct: 188 AICRTLSGTGCGG-GGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIGC 242
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G+ N G AG+LGLG G +S+V QL G V +C+ G G G L LG
Sbjct: 243 GH--RNSGLFV--GAAGLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAGGAGSLVLG 296
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD 230
+ G W P+++N+ Y +G + G+ L+D
Sbjct: 297 RTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQD 338
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/343 (26%), Positives = 139/343 (40%), Gaps = 48/343 (13%)
Query: 4 SWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH 63
S IE + + +N+ +G P DTGSDL W QC+ PCT C P + P
Sbjct: 83 SGIETPVYAGSGEYLMNVAIGTPASSLSAIMDTGSDLIWTQCE-PCTQCFSQPTPIFNPQ 141
Query: 64 K----NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN 119
+ +PC + C L P ND C Y YGDG S+ G + T+ F F
Sbjct: 142 DSSSFSTLPCESQYCQDL-----PSESCYND-CQYTYGYGDGSSTQGYMATETF--TFET 193
Query: 120 GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-- 177
SV N+ FGCG + G + AG++G+G G +S+ SQL +C+
Sbjct: 194 SSVPNI--AFGCGEDNQGFG---QGNGAGLIGMGWGPLSLPSQLG-----VGQFSYCMTS 243
Query: 178 -GQNGRGVLFLGDGK--VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-- 232
G + L LG VP G T ++ +S + +Y + + G + G+ T
Sbjct: 244 SGSSSPSTLALGSAASGVP-EGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQ 302
Query: 233 --------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGP 281
+I DSG + Y Y + + L+P D++ L C++ P
Sbjct: 303 LQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQ-----INLSPVDESSSGLSTCFQLP 357
Query: 282 FK-ALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTS 323
+ QV E N L+ P E + +++ +S
Sbjct: 358 SDGSTVQVPEISMQFDGGVLNLGEENVLISPAEGVICLAMGSS 400
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 115/273 (42%), Gaps = 27/273 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L +GKPP F DTGSDLTW QC PC C Y P + +PCS+
Sbjct: 71 YLMELAIGKPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPLPCSSA 129
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C + W R P+ C Y YGDG S G L T+ L S+ V + FGCG
Sbjct: 130 TCLPI-WS---RNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAFGCG 185
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKV 192
+ ++ G +GLGRG +S+++QL G + LG
Sbjct: 186 TDNGG----DSLNSTGTVGLGRGTLSLLAQL-GVGKFSYCLTDFFNSALDSPFLLGTLAE 240
Query: 193 PSSG---VAWTPMLQNSADLKHYI-------LGPAELLYSGKSCGLK-DLT--LIFDSGA 239
+ G V TP+LQ+ + Y LG L + L+ D T +I DSG
Sbjct: 241 LAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMIVDSGT 300
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK 272
++ ++E+V + R L P+ + D
Sbjct: 301 TFTILAESGFREVVGRVARVLGQPPVNASSLDA 333
>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 521
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 78/263 (29%), Positives = 108/263 (41%), Gaps = 32/263 (12%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE----------KQYKPHKNI---- 66
+ +G P F D GSDL W+ CD C C +Y P +++
Sbjct: 101 IDIGTPSTSFLVALDAGSDLLWIPCD--CVQCAPLSSSYYSNLDRDLNEYSPSRSLSSKH 158
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLR----FSNGS 121
+ CS+ C CK QC Y + Y + SS G LV D+ L+ SN S
Sbjct: 159 LSCSHRLC-----DKGSNCKSSQQQCPYMVSYLSENTSSSGLLVEDILHLQSGGTLSNSS 213
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
V P+ GCG Q G L G+LGLG G S+ S L + GLI C ++
Sbjct: 214 V-QAPVVLGCGMKQSG-GYLDGVAPDGLLGLGPGESSVPSFLAKSGLIHYSFSLCFNEDD 271
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGAS 240
G +F GD + P+S + T L YI+G E G SC + DSG S
Sbjct: 272 SGRMFFGD-QGPTSQQS-TSFLPLDGLYSTYIIG-VESCCIGNSCLKMTSFKAQVDSGTS 328
Query: 241 YAYFTSRVYQEIVSLIMRDLIGT 263
+ + VY I + + G+
Sbjct: 329 FTFLPGHVYGAITEEFDQQVNGS 351
>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 260
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 57/167 (34%), Positives = 79/167 (47%), Gaps = 17/167 (10%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK--PPEKQYKPHKNIVPCS- 70
+ Y+A L +G PP+ F DTGS++T+V C C K P Q + P +
Sbjct: 47 YGYYATKLYIGTPPQEFTLVVDTGSNMTFVPCCGSEEYCGKHEDPAFQTESSSTYQPVNC 106
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN-VPLTF 129
+P C C + QC Y++ YGDG S G L D+ + F N S F L F
Sbjct: 107 HPSC---------DCDYLRSQCSYKMHYGDGSYSRGVLAEDI--ISFGNESEFAPQRLVF 155
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
GC + G L G++GLGRGR +IV QL + G+I + C
Sbjct: 156 GCELDA--IGSLYSLRADGIIGLGRGRSTIVDQLVDKGVISDSFSLC 200
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 79/257 (30%), Positives = 103/257 (40%), Gaps = 35/257 (13%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT-----------KPPEKQYKPH----KN 65
+ +G P F D GSDL WV CD C C +Y P
Sbjct: 111 IDIGTPNVSFLVALDAGSDLLWVPCD--CIQCAPLSASYYNISLDRDLSEYSPSLSSTSR 168
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGD--GGSSIGALVTDLFPLR----FSN 119
+ C + C W + CK+P D C Y Y D +S G LV D L +
Sbjct: 169 HLSCDHQLC---EWGS--NCKNPKDPCPYIFNYDDFENTTSAGFLVEDKLHLASVGDHTA 223
Query: 120 GSVFNVPLTFGCGYNQHNPG-PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
+ + GCG Q + PD GV+GLG G IS+ S L + GLI+N C
Sbjct: 224 RKMLQASVVLGCGRKQGGSFFDGAAPD--GVMGLGPGDISVPSLLAKAGLIQNCFSLCFD 281
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD-LTLIFDS 237
+N G + GD S TP L Y +G E G SC + + DS
Sbjct: 282 ENDSGRILFGDRGHASQQS--TPFLPIQGTYVAYFVG-VESYCVGNSCLKRSGFKALVDS 338
Query: 238 GASYAYFTSRVYQEIVS 254
G+S+ Y S VY E+VS
Sbjct: 339 GSSFTYLPSEVYNELVS 355
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 78/275 (28%), Positives = 112/275 (40%), Gaps = 41/275 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT--GCTKPP----EKQYKPHKNIVPC 69
YF V L VG P K F DTGSDLTW+QC+ P T + PP +K +PC
Sbjct: 59 YF-VELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPC 117
Query: 70 SNPRCAALHWPNPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS------- 121
++ C L P C + CDY Y D + G L + ++ S
Sbjct: 118 TDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAGNH 177
Query: 122 ------VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGH 175
+ NV L GC L +GVLGLG+G IS+ +Q R L + +
Sbjct: 178 KTRRIRIKNVAL--GCSRESVGASFLG---ASGVLGLGQGPISLATQTRHTAL-GGIFSY 231
Query: 176 CIGQNGRG---VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC------ 226
C+ RG FL G+ +A TP+++N A Y + + GK
Sbjct: 232 CLVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASS 291
Query: 227 -----GLKDLTLIFDSGASYAYFTSRVYQEIVSLI 256
G + IFDSG + +Y Y +++ +
Sbjct: 292 DWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGAL 326
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 69/130 (53%), Gaps = 14/130 (10%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPC 69
YF +++ VG PPK F DTGSDL W+QC PC C + Y P +KNI C
Sbjct: 154 EYF-MDVLVGSPPKHFSLILDTGSDLNWIQC-LPCHDCFQQNGAFYDPKASASYKNIT-C 210
Query: 70 SNPRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS----NGSVFN 124
++PRC + P+PP+ CK N C Y YGD ++ G + F + + + ++N
Sbjct: 211 NDPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYN 270
Query: 125 VP-LTFGCGY 133
V + FGCG+
Sbjct: 271 VENMMFGCGH 280
>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
Length = 499
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/252 (30%), Positives = 104/252 (41%), Gaps = 25/252 (9%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
F ++A+ +TVG P + F DTGSDL W+ C C GCT P +P +
Sbjct: 106 FLHYAL-VTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPATAASGSATFYIPGMSST 162
Query: 74 CAALHWPNPPRCKHPND-----QCDYEIEYGDGG-SSIGALVTDLFPLRFSNG--SVFNV 125
A+ N C + QC Y++ Y G SS G LV D+ L N +
Sbjct: 163 SKAVPC-NSNFCDLQKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKA 221
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
+ GCG Q L G+ GLG +S+ S L + GL N C G++G G +
Sbjct: 222 QIMLGCGQTQTG-SFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRI 280
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIFDSGASY 241
GD SS TP+ N + I SG + G K D IFD+G S+
Sbjct: 281 SFGDQG--SSDQEETPLNINQQHPTYAI------TISGITIGNKPTDLDFITIFDTGTSF 332
Query: 242 AYFTSRVYQEIV 253
Y Y I
Sbjct: 333 TYLADPAYTYIT 344
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/168 (33%), Positives = 82/168 (48%), Gaps = 18/168 (10%)
Query: 1 MYVSWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQY 60
+ V W ++ +YF +L +G P + DTGSD +W+QC PC C + E +
Sbjct: 121 LQVGWGKYL--DTTNYF-TSLRLGTPATDLLVELDTGSDQSWIQCK-PCPDCYEQHEALF 176
Query: 61 KPHKNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR 116
P K+ + CS+ C L + C + +C YEI Y D ++G L D L
Sbjct: 177 DPSKSSTYSDITCSSRECQELGSSHKHNCSS-DKKCPYEITYADDSYTVGNLARDTLTLS 235
Query: 117 FSNGSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
++ VP FGCG+N N G D G+LGLGRG+ S+ SQ+
Sbjct: 236 PTDA----VPGFVFGCGHN--NAGSFGEID--GLLGLGRGKASLSSQV 275
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 76/274 (27%), Positives = 114/274 (41%), Gaps = 31/274 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V++ +G P K + FDTGSDL+WVQC PC C + + + P + V C P
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCK-PCADCYEQQDPLFDPSLSSTYAAVACGAP 207
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C L C + +C YE++YGD + G LV D L S+ +P FGC
Sbjct: 208 ECQELDASG---CSS-DSRCRYEVQYGDQSQTDGNLVRDTLTLSASD----TLPGFVFGC 259
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCIGQNGRGVLFLGDG 190
G N G D G+ GLGR ++S+ SQ YG +C+ + G +L G
Sbjct: 260 G--DQNAGLFGQVD--GLFGLGREKVSLPSQGAPSYG---PGFTYCLPSSSSGRGYLSLG 312
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL------KDLTLIFDSGASYAYF 244
P + +T L + A Y + + G++ + + DSG
Sbjct: 313 GAPPANAQFT-ALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRL 371
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
R Y + + R + K AP L C+
Sbjct: 372 PPRAYAPLRAAFARSM--AQYKKAPALSILDTCY 403
>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
Length = 515
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 78/258 (30%), Positives = 109/258 (42%), Gaps = 34/258 (13%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKPPEKQ------YKPH-- 63
F ++A N+TVG P F DTGSDL W+ CD C K P Y P+
Sbjct: 102 FLHYA-NVTVGTPSDWFLVALDTGSDLFWLPCDCSTNCVRELKAPGGSSLDLNIYSPNAS 160
Query: 64 --KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPL--RFS 118
+ VPC++ C + RC P C Y+I Y +G SS G LV D+ L
Sbjct: 161 STSSKVPCNSTLCTRVD-----RCASPLSDCPYQIRYLSNGTSSTGVLVEDVLHLVSMEK 215
Query: 119 NGSVFNVPLTFGCGYNQ----HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIG 174
N +T GCG Q H+ + P+ G+ GLG IS+ S L + G+ N
Sbjct: 216 NSKPIRARITLGCGLVQTGVFHDG---AAPN--GLFGLGLEDISVPSVLAKEGIAANSFS 270
Query: 175 HCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLI 234
C G +G G + GD S TP+ + + + G + G + +
Sbjct: 271 MCFGDDGAGRISFGDKG--SVDQRETPLNIRQPHPTYNV--TVTQISVGGNTGDLEFDAV 326
Query: 235 FDSGASYAYFTSRVYQEI 252
FD+G S+ Y T Y I
Sbjct: 327 FDTGTSFTYLTDAPYTLI 344
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/222 (30%), Positives = 95/222 (42%), Gaps = 20/222 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + VG PP D+GSD+ WVQC PC C + + P + V C +
Sbjct: 130 YF-VRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSCGS 187
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L +CDY + YGDG + G L + L G + GC
Sbjct: 188 AICRTLSGTGCGG-GGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIGC 242
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G+ N G AG+LGLG G +S++ QL G V +C+ G G G L LG
Sbjct: 243 GH--RNSGLFV--GAAGLLGLGWGAMSLIGQLG--GAAGGVFSYCLASRGAGGAGSLVLG 296
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD 230
+ G W P+++N+ Y +G + G+ L+D
Sbjct: 297 RTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQD 338
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 76/274 (27%), Positives = 114/274 (41%), Gaps = 31/274 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V++ +G P K + FDTGSDL+WVQC PC C + + + P + V C P
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCK-PCADCYEQQDPLFDPSLSSTYAAVACGAP 207
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C L C + +C YE++YGD + G LV D L S+ +P FGC
Sbjct: 208 ECQELDASG---CSS-DSRCRYEVQYGDQSQTDGNLVRDTLTLSASD----TLPGFVFGC 259
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCIGQNGRGVLFLGDG 190
G N G D G+ GLGR ++S+ SQ YG +C+ + G +L G
Sbjct: 260 G--DQNAGLFGQVD--GLFGLGREKVSLPSQGAPSYG---PGFTYCLPSSSSGRGYLSLG 312
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL------KDLTLIFDSGASYAYF 244
P + +T L + A Y + + G++ + + DSG
Sbjct: 313 GAPPANAQFT-ALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRL 371
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
R Y + + R + K AP L C+
Sbjct: 372 PPRAYAPLRAAFARSM--AQYKKAPALSILDTCY 403
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 70/265 (26%), Positives = 113/265 (42%), Gaps = 33/265 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +N+++G PP DTGSDL W QC PC C + + P + V CS+
Sbjct: 94 YLMNISLGTPPFPIMAIADTGSDLLWTQC-KPCDDCYTQVDPLFDPKASSTYKDVSCSSS 152
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF--- 129
+C AL N C ++ C Y YGD + G + D L GS P+
Sbjct: 153 QCTALE--NQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTL----GSTDTRPVQLKNI 206
Query: 130 --GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGR- 182
GCG+N N G + + V G +S+++QL + I +C+ +N R
Sbjct: 207 IIGCGHN--NAGTFNKKGSGIVGLGGGA-VSLITQLGDS--IDGKFSYCLVPLTSENDRT 261
Query: 183 -GVLFLGDGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIF 235
+ F + V +GV TP++ S + +Y+ +G E+ Y G G + +I
Sbjct: 262 SKINFGTNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGNIII 321
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDL 260
DSG + + Y E+ + +
Sbjct: 322 DSGTTLTLLPTEFYSELEDAVASSI 346
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 72/255 (28%), Positives = 111/255 (43%), Gaps = 21/255 (8%)
Query: 35 DTGSDLTWVQC-DAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALHWPNPPRCKHPN 89
DT SD+ WVQC P C + Y P K+ +PC +P C L C
Sbjct: 174 DTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSPTT 233
Query: 90 DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGV 149
D+C Y + YGDG ++ G VTD + + ++ FGC + G S + AG+
Sbjct: 234 DECKYIVNYGDGKATTGTYVTDTLTM---SPTIVVKDFRFGCSHAVR--GSFSNQN-AGI 287
Query: 150 LGLGRGRISIVSQLRE-YGLIRNVIGHCIGQ-NGRGVLFLGDGKVPSSGVAWTPMLQNSA 207
L LG GR S++ Q + YG N +CI + + G L LG S ++TP+++N
Sbjct: 288 LALGGGRGSLLEQTADAYG---NAFSYCIPKPSSAGFLSLGGPVEASLKFSYTPLIKNKH 344
Query: 208 DLKHYILGPAELLYSGKSCGLKDLTL----IFDSGASYAYFTSRVYQEIVSLIMRDLIGT 263
YI+ ++ +GK + + DSGA +VY + + R +
Sbjct: 345 APTFYIVHLEAIIVAGKQLAVPPTAFATGAVMDSGAVVTQLPPQVYAALRA-AFRSAMAA 403
Query: 264 PLKLAPDDKTLPICW 278
LA + L C+
Sbjct: 404 YGPLAAPVRNLDTCY 418
>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 518
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 80/279 (28%), Positives = 110/279 (39%), Gaps = 33/279 (11%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----V 67
+ +G P F DTGSDL WV CD AP G + + Y P ++ V
Sbjct: 103 TTVELGTPGMKFMVALDTGSDLFWVPCDCSKCAPTQGVAYASDFELSIYDPKQSSTSKKV 162
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPL--RFSNGSVFN 124
C+N CA + RC C Y + Y +S G LV D+ L SN
Sbjct: 163 TCNNNLCAHRN-----RCLGTFSSCPYMVSYVSAQTSTSGILVEDVLHLTSEDSNQESIK 217
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV 184
+TFGCG Q L+ G+ GLG +IS+ S L GL + C G +G G
Sbjct: 218 AYVTFGCGQVQSG-SFLNTAAPNGLFGLGMDQISVPSILSREGLTADSFSMCFGHDGVGR 276
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
+ GD P TP N + + I + G + D T +FDSG S+ Y
Sbjct: 277 ISFGDKGSPDQ--EETPFNSNPSHPSYNI--SVTQVRVGTTLVDVDFTALFDSGTSFTYL 332
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK 283
+ +Y ++ DK P R PF+
Sbjct: 333 INPIYA---------MVSENFHAQAQDKRRPPDPRIPFE 362
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 61/167 (36%), Positives = 81/167 (48%), Gaps = 22/167 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHK----NIVPCSN 71
+ V + +G P FDTGSDLTW QC+ PC G C E ++ P + V CS+
Sbjct: 134 YIVTIGIGTPKHDISLMFDTGSDLTWTQCE-PCLGSCYSQKEPKFNPSSSSSYHNVSCSS 192
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C NP C N C Y I YGDG ++G L + F L +N V + + FGC
Sbjct: 193 PMCG-----NPESCSASN--CLYGIGYGDGSVTVGFLAKEKFTL--TNSDVLD-DIYFGC 242
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
G N N G +AG+LGLG G+ S L+ N+ +C G
Sbjct: 243 GEN--NKGVF--IGSAGILGLGPGKFSF--PLQTTTTYNNIFSYCCG 283
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 75/263 (28%), Positives = 110/263 (41%), Gaps = 28/263 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L+VG PP DTGSD+ W QC+ PCT C + + P K+ V CS+P
Sbjct: 85 YLMKLSVGTPPFPIIAVADTGSDIIWTQCE-PCTNCYQQDLPMFNPSKSTTYRKVSCSSP 143
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
C+ N C D C Y I YGD S G D + ++G V P T GC
Sbjct: 144 VCSFTGEDN--SCSFKPD-CTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGC 200
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRG---VL 185
G++ N G + +G++GLG G S++ Q+ + +C IG + G +
Sbjct: 201 GHD--NAGSFD-ANVSGIVGLGLGPASLIKQMGS--AVGGKFSYCLTPIGNDDGGSNKLN 255
Query: 186 FLGDGKVPSSGVAWTPMLQN-------SADLKHYILGPAELLYSGKSCGL-KDLTLIFDS 237
F + V SG TP+ + S LK +G YS + L +I DS
Sbjct: 256 FGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIIIDS 315
Query: 238 GASYAYFTSRVYQEIVSLIMRDL 260
G + +Y I +
Sbjct: 316 GTTLTLLPVDLYHNFAKAISNSI 338
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 98/323 (30%), Positives = 135/323 (41%), Gaps = 51/323 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKN----IVPCS 70
+ V L +G P DTGSDL+WVQC PC P+K + P K+ +PC+
Sbjct: 125 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCNASDCYPQKDPLFDPSKSSTFATIPCA 183
Query: 71 NPRCAAL---HWPNPPRCKHPND----QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
+ C L + N C + QC Y IEYG+G + G T+ L S
Sbjct: 184 SDACKQLPVDGYDN--GCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLAL---GSSAV 238
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIG--QN 180
FGCG +QH GP D G+LGLG S+VSQ YG +C+ +
Sbjct: 239 VKSFRFGCGSDQH--GPYDKFD--GLLGLGGAPESLVSQTASVYG---GAFSYCLPPLNS 291
Query: 181 GRGVLFLG---DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---- 233
G G L LG +SG +TPM S + + + + +G S G K L +
Sbjct: 292 GAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYV----VTLTGISVGGKALDIPPAV 347
Query: 234 -----IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
I DSG + Y+ + + + PL L P D L C+ F G V
Sbjct: 348 FAKGNIVDSGTVITGIPTTAYKALRTAFRSAMAEYPL-LPPADSALDTCYN--FTGHGTV 404
Query: 289 TEYFKPLALSFTNRRNSVRLVVP 311
T +AL+F +V L VP
Sbjct: 405 T--VPKVALTFVGGA-TVDLDVP 424
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 87/325 (26%), Positives = 135/325 (41%), Gaps = 49/325 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQC-DAPCTGCTKPPEK--QYKPHKNIVPCSNPR 73
V+LTVG PP+ DTGS+L+W+ C AP P + Y P +PC++P
Sbjct: 63 LTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHSVFDPLRSSSYSP----IPCTSPT 118
Query: 74 C--AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FG 130
C + P C C I Y D S G L +D F + S +P T FG
Sbjct: 119 CRTRTRDFSIPVSCDK-KKLCHAIISYADASSIEGNLASDTFHIGNS-----AIPATIFG 172
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGD 189
C + + T G++G+ RG +S V+Q+ GL + +CI GQ+ G+L G+
Sbjct: 173 CMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQM---GLQK--FSYCISGQDSSGILLFGE 227
Query: 190 GKVP-SSGVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGLKDLT----L 233
+ +TP++Q S L ++ I +L KS D T
Sbjct: 228 SSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQT 287
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-----DKTLPICWRGPFKA---- 284
+ DSG + + VY + + +R + LK+ D + +C+R P
Sbjct: 288 MVDSGTQFTFLLGPVYTALKNEFVRQTKAS-LKVLEDPNFVFQGAMDLCYRVPLTRRTLP 346
Query: 285 -LGQVTEYFKPLALSFTNRRNSVRL 308
L VT F+ +S + R R+
Sbjct: 347 PLPTVTLMFRGAEMSVSAERLMYRV 371
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 75/258 (29%), Positives = 112/258 (43%), Gaps = 19/258 (7%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IV 67
PI Y + +G PP DTGSDL WVQC APC C + P K+ V
Sbjct: 88 PITEYL-MRFYIGTPPVERFAIADTGSDLIWVQC-APCEKCVPQNAPLFDPRKSSTFKTV 145
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
PC + C L P+ C + QC Y+ YGD G L + N ++ L
Sbjct: 146 PCDSQPCTLLP-PSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKL 204
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGV 184
TFGC ++ ++ S + G++GLG G +S++SQL Y + R +C + N
Sbjct: 205 TFGCTFSNNDTVDESKRNM-GLVGLGVGPLSLISQL-GYQIGRK-FSYCFPPLSSNSTSK 261
Query: 185 LFLGDGKVPSS--GVAWTPMLQNSADLKHYILGPAELLYSGK----SCGLKDLTLIFDSG 238
+ G+ + GV TP++ S +Y L + K S D ++ DSG
Sbjct: 262 MRFGNDAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTSESQTDGNILIDSG 321
Query: 239 ASYAYFTSRVYQEIVSLI 256
S+ Y + V+L+
Sbjct: 322 TSFTILKQSFYNKFVALV 339
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 84/289 (29%), Positives = 130/289 (44%), Gaps = 37/289 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPC 69
+ + ++L +G PP+ DTGSDL W QC PC C Y ++ + C
Sbjct: 88 MTEYLLHLAIGTPPQPVQLTLDTGSDLVWTQCQ-PCAVCFNQSLPYYDASRSSTFALPSC 146
Query: 70 SNPRCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-L 127
+ +C P+ C + Q C + YGD ++IG L D+ + F G+ +VP +
Sbjct: 147 DSTQCKL--DPSVTMCVNQTVQTCAFSYSYGDKSATIGFL--DVETVSFVAGA--SVPGV 200
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
FGCG N N G +T G+ G GRG +S+ SQL+ G + G+ VLF
Sbjct: 201 VFGCGLN--NTGIFRSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTAVSGRKPSTVLFD 256
Query: 188 GDGKVPSSG---VAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDLT--LIF 235
+ +G V TP+++N A LK +G L + LK+ T I
Sbjct: 257 LPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTII 316
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGP 281
DSG ++ RVY+ ++ D +KL P ++T P +C+ P
Sbjct: 317 DSGTAFTSLPPRVYR-----LVHDEFAAHVKLPVVPSNETGPLLCFSAP 360
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 87/325 (26%), Positives = 134/325 (41%), Gaps = 49/325 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQC-DAPCTGCTKPP--EKQYKPHKNIVPCSNPR 73
V+LTVG PP+ DTGS+L+W+ C AP P Y P +PC++P
Sbjct: 56 LTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHSVFDPLRSSSYSP----IPCTSPT 111
Query: 74 C--AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FG 130
C + P C C I Y D S G L +D F + S +P T FG
Sbjct: 112 CRTRTRDFSIPVSCDK-KKLCHAIISYADASSIEGNLASDTFHIGNS-----AIPATIFG 165
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGD 189
C + + T G++G+ RG +S V+Q+ GL + +CI GQ+ G+L G+
Sbjct: 166 CMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQM---GLQK--FSYCISGQDSSGILLFGE 220
Query: 190 GKVP-SSGVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGLKDLT----L 233
+ +TP++Q S L ++ I +L KS D T
Sbjct: 221 SSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQT 280
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-----DKTLPICWRGPFKA---- 284
+ DSG + + VY + + +R + LK+ D + +C+R P
Sbjct: 281 MVDSGTQFTFLLGPVYTALKNEFVRQTKAS-LKVLEDPNFVFQGAMDLCYRVPLTRRTLP 339
Query: 285 -LGQVTEYFKPLALSFTNRRNSVRL 308
L VT F+ +S + R R+
Sbjct: 340 PLPTVTLMFRGAEMSVSAERLMYRV 364
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 57/152 (37%), Positives = 74/152 (48%), Gaps = 15/152 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P FDTGSDLTW QC C E + P K+ V CS+
Sbjct: 104 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSA 163
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C +L N C N C Y I+YGD S+G L + F L +N VF+ + FG
Sbjct: 164 ACGSLSSATGNAGSCSASN--CIYGIQYGDQSFSVGFLAKEKFTL--TNSDVFD-GVYFG 218
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ 162
CG N N G + AG+LGLGR ++S SQ
Sbjct: 219 CGEN--NQGLFT--GVAGLLGLGRDKLSFPSQ 246
>gi|213998796|gb|ACJ60765.1| nucellin [Hordeum marinum subsp. gussoneanum]
Length = 133
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/115 (41%), Positives = 67/115 (58%), Gaps = 6/115 (5%)
Query: 142 SPP-DTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQNGRGVLFLGDGKVPSSGVAW 199
SPP G+LGLG G+ +QL+ +I NVIGHC+ G+GVL++G+ PS GV W
Sbjct: 4 SPPLPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGNFNPPSRGVTW 63
Query: 200 TPMLQNSADLKHYILGPAELLYSGKSC-GLKDLTLIFDSGASYAYFTSRVYQEIV 253
PM ++S +Y G AELL + G +FDSG++Y S++Y EIV
Sbjct: 64 VPMRESSF---YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTLVPSQIYNEIV 115
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 71/251 (28%), Positives = 114/251 (45%), Gaps = 23/251 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +++++G PP + DTGSDLTW QC PC C + + P K+ VPC+
Sbjct: 92 YLMSVSIGTPPVDYLGIADTGSDLTWAQC-LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQ 150
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C H + C CDY YGD S G DL + + GS +V GCG
Sbjct: 151 TC---HAVDDGHCG-VQGVCDYSYTYGDRTYSKG----DLGFEKITIGSS-SVKSVIGCG 201
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLGD 189
+ + +GV+GLG G++S+VSQ+ + I +C + + G + G+
Sbjct: 202 HASSGGFGFA----SGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGE 257
Query: 190 GKVPSS-GVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-KDLTLIFDSGASYAYFTSR 247
V S GV TP++ + +YI A + + + K +I DSG +
Sbjct: 258 NAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQGNVIIDSGTTLTILPKE 317
Query: 248 VYQEIVSLIMR 258
+Y +VS +++
Sbjct: 318 LYDGVVSSLLK 328
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/131 (37%), Positives = 67/131 (51%), Gaps = 16/131 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
YF +++ VG PPK F DTGSDL W+QC PC C + Y P + + C
Sbjct: 196 EYF-MDVFVGTPPKHFSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKDSSSFRNISCH 253
Query: 71 NPRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS--NGS-----V 122
+PRC + P+PP+ CK N C Y YGDG ++ G + F + + NG+ V
Sbjct: 254 DPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKHV 313
Query: 123 FNVPLTFGCGY 133
NV FGCG+
Sbjct: 314 ENV--MFGCGH 322
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 95/348 (27%), Positives = 143/348 (41%), Gaps = 53/348 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
F ++L+VG P + DTGSDL W QC PC C + P + +PCS+
Sbjct: 116 FLMDLSVGTPALPYAAIVDTGSDLVWTQCK-PCVECFNQTTPVFDPAASSTYAALPCSSA 174
Query: 73 RCAALHWPNPPRCKHPNDQCD---YEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LT 128
CA L + Y YGD S+ G L T+ F L VP +
Sbjct: 175 LCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLARQ-----KVPGVA 229
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGV 184
FGCG G AG++GLGRG +S+VSQL G+ R +C+ GR
Sbjct: 230 FGCGDTNEGDGFTQ---GAGLVGLGRGPLSLVSQL---GIDR--FSYCLTSLDDAAGRSP 281
Query: 185 LFLGDGKVPSSG-----VAWTPMLQNSADLKHY-------ILGPAELLYSGKSCGLKDL- 231
L LG S+ TP+++N + Y +G L + ++D
Sbjct: 282 LLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDG 341
Query: 232 --TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT---LPICWRGPFKALG 286
+I DSG S Y R Y+ +R + L D + L +C++GP A+
Sbjct: 342 TGGVIVDSGTSITYLELRAYRA-----LRKAFVAHMSLPTVDASEIGLDLCFQGPAGAVD 396
Query: 287 QVTEYFKP-LALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYLTGK 333
Q + P L L F + L +P E Y+V+ ++ + + + +
Sbjct: 397 QDVQVQVPKLVLHFDGGAD---LDLPAENYMVLDSASGALCLTVMASR 441
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 149/356 (41%), Gaps = 90/356 (25%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKPHKN--- 65
+ + L++G PP + DTGSDL W QC APC + Q Y P +
Sbjct: 87 YIMTLSIGTPPLSYRAIADTGSDLIWTQC-APCGDTVTDTDNQCFKQSGCLYNPSSSTTF 145
Query: 66 -IVPCSNP--RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS- 121
++PC++P CAA+ P+PP P C Y YG G + A V + F + S
Sbjct: 146 GVLPCNSPLSMCAAMAGPSPP----PGCACMYNQTYGTGWT---AGVQSVETFTFGSSST 198
Query: 122 --VFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI- 177
VP + FGC N +AG++GLGRG +S+VSQL +C+
Sbjct: 199 PPAVRVPNIAFGCSNASSNDW----NGSAGLVGLGRGSMSLVSQLGA-----GAFSYCLT 249
Query: 178 ---GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKH--YILGPAE--------LLYSGK 224
N L LG PS+ A L+ + ++ ++ GP++ L +G
Sbjct: 250 PFQDANSTSTLLLG----PSAAAA----LKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGI 301
Query: 225 SCGLKDLT---------------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA- 268
S G L LI DSG + YQ++ + + R L+ T L LA
Sbjct: 302 SVGETALAIPPDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAV-RSLLVTRLPLAH 360
Query: 269 -PDDKT-LPICW----RGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVI 318
PD T L +C+ P A+ +T +F+ +V+P E Y+++
Sbjct: 361 GPDHSTGLDLCFALKASTPPPAMPSMTLHFE----------GGADMVLPVENYMIL 406
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/129 (37%), Positives = 66/129 (51%), Gaps = 14/129 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH-----KNIVPCS 70
YF +++ VG PPK F DTGSDL W+QC PC C E Y P KNI C+
Sbjct: 162 YF-MDVLVGTPPKHFSLILDTGSDLNWLQC-LPCYDCFHQNEAFYDPKTSASFKNIT-CN 218
Query: 71 NPRCAALHWPNPP-RCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN----GSVFNV 125
+PRC+ + P PP +CK N C Y YGD ++ G + F + + S + V
Sbjct: 219 DPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKV 278
Query: 126 P-LTFGCGY 133
+ FGCG+
Sbjct: 279 ENMMFGCGH 287
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 74/152 (48%), Gaps = 15/152 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P FDTGSDLTW QC C E + P K+ V CS+
Sbjct: 133 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSA 192
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C +L N C N C Y I+YGD S+G L D F L S+ VF+ + FG
Sbjct: 193 ACGSLSSATGNAGSCSASN--CIYGIQYGDQSFSVGFLAKDKFTLTSSD--VFD-GVYFG 247
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ 162
CG N N G + AG+LGLGR ++S SQ
Sbjct: 248 CGEN--NQGLFT--GVAGLLGLGRDKLSFPSQ 275
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 56/153 (36%), Positives = 72/153 (47%), Gaps = 17/153 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G PP F FDTGSD TWVQC C K ++ + P K+ V C++P
Sbjct: 163 YVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTYANVSCADP 222
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
CA L + C C Y I+YGDG ++G D + F FGCG
Sbjct: 223 ACADL---DASGCN--AGHCLYGIQYGDGSYTVGFFAKDTLAVAQDAIKGFK----FGCG 273
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE 165
+ N G TAG+LGLGRG SI Q E
Sbjct: 274 --EKNRGLFG--QTAGLLGLGRGPTSITVQAYE 302
>gi|159463556|ref|XP_001690008.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158283996|gb|EDP09746.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 547
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 68/213 (31%), Positives = 93/213 (43%), Gaps = 22/213 (10%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK-PPEK--QYKPH----K 64
P Y+ LT+G P + DTGS L PC+GCT+ P K +KP
Sbjct: 76 PELGYYYTYLTIGTPGQTVSGILDTGSTLPAF----PCSGCTRCGPSKTGMFKPELSSTS 131
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+ CS+ RC + C N+QC Y I Y +G S+ G L D+ + G N
Sbjct: 132 STFGCSDARC----FCGANSCSCNNEQCGYSIRYLEGSSTSGFLAEDMLAVG-DGGPAAN 186
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV 184
FGC Q G L GV G+GR S+ QL + G+I + C G GV
Sbjct: 187 --FVFGCA--QSESGLLYSQIADGVFGMGRTPASLYGQLVQQGVIDDAFSMCFGAPREGV 242
Query: 185 LFLGDGKVPSSGVA--WTPMLQNSADLKHYILG 215
L LG+ +P+ A TP++ N+ I G
Sbjct: 243 LLLGNVALPADAPAPVVTPVVGNTNKFNIQIEG 275
>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 520
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 80/259 (30%), Positives = 106/259 (40%), Gaps = 37/259 (14%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--------YKP--- 62
F ++A+ +TVG P + F DTGSDL W+ C C GCT P Y P
Sbjct: 107 FLHYAL-VTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPATAASGSFQATFYIPGMS 163
Query: 63 -HKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSNG 120
VPC++ C C QC Y++ Y G SS G LV D+ L N
Sbjct: 164 STSKAVPCNSNFCDLQK-----ECSTAL-QCPYKMVYVSAGTSSSGFLVEDVLYLSTENA 217
Query: 121 --SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
+ + GCG Q L G+ GLG +S+ S L + GL N C G
Sbjct: 218 HPQILKAQIMLGCGQTQTGSF-LDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFG 276
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLI 234
++G G + GD + SS TP+ N + I SG + G K D I
Sbjct: 277 RDGIGRISFGDQE--SSDQEETPLDINRQHPTYAI------TISGITVGNKPTDMDFITI 328
Query: 235 FDSGASYAYFTSRVYQEIV 253
FD+G S+ Y Y I
Sbjct: 329 FDTGTSFTYLADPAYTYIT 347
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 75/263 (28%), Positives = 109/263 (41%), Gaps = 28/263 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L+VG PP DTGSD+ W QC PCT C + + P K+ V CS+P
Sbjct: 85 YLMKLSVGTPPFPIIAVADTGSDIIWTQC-VPCTNCYQQDLPMFNPSKSTTYRKVSCSSP 143
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
C+ N C D C Y I YGD S G D + ++G V P T GC
Sbjct: 144 VCSFTGEDN--SCSFKPD-CTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGC 200
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRG---VL 185
G++ N G + +G++GLG G S++ Q+ + +C IG + G +
Sbjct: 201 GHD--NAGSFD-ANVSGIVGLGLGPASLIKQMGS--AVGGKFSYCLTPIGNDDGGSNKLN 255
Query: 186 FLGDGKVPSSGVAWTPMLQN-------SADLKHYILGPAELLYSGKSCGL-KDLTLIFDS 237
F + V SG TP+ + S LK +G YS + L +I DS
Sbjct: 256 FGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIIIDS 315
Query: 238 GASYAYFTSRVYQEIVSLIMRDL 260
G + +Y I +
Sbjct: 316 GTTLTLLPVDLYHNFAKAISNSI 338
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 71/246 (28%), Positives = 111/246 (45%), Gaps = 25/246 (10%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALH 78
+G PP + DTGSDLTW QC PC C + + P K+ VPC+ C H
Sbjct: 86 IGTPPVDYLGIADTGSDLTWAQC-LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTC---H 141
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNP 138
+ C CDY YGD S G DL + + GS +V GCG+
Sbjct: 142 AVDDGHCG-VQGVCDYSYTYGDRTYSKG----DLGFEKITIGSS-SVKSVIGCGHASSGG 195
Query: 139 GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNGRGVLFLGDGKVP 193
+ +GV+GLG G++S+VSQ+ + I +C+ NG+ + F + V
Sbjct: 196 FGFA----SGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGK-INFGQNAVVS 250
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-KDLTLIFDSGASYAYFTSRVYQEI 252
GV TP++ + +YI A + + + K +I DSG + ++ +Y +
Sbjct: 251 GPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQGNVIIDSGTTLSFLPKELYDGV 310
Query: 253 VSLIMR 258
VS +++
Sbjct: 311 VSSLLK 316
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 141/339 (41%), Gaps = 60/339 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT----------KPPEKQYKPHKNI 66
+ + L++G PP+L DTGSDL W++CD C C YK
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDN-CDHCDLDHHGETIFFSDASSSYKK---- 59
Query: 67 VPCSNPRCAALHWPN-PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS---- 121
+PC++ C+ + PRC+ + C Y+ EYGDG + G + +D R S+G+
Sbjct: 60 LPCNSTHCSGMSSAGIGPRCE---ETCKYKYEYGDGSRTSGDVGSDRISFR-SHGAGEDH 115
Query: 122 -VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE---YGLIRNVIGHCI 177
F FGCG T G++GLG+ S++ QL + Y ++ +
Sbjct: 116 RSFFDGFLFGCGRKLKGDWNF----TQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDS 171
Query: 178 GQNGRGVLFLG-DGKVPSSGVAWTPMLQNS--------ADLKHYILGPAELLYSGKSCG- 227
+ + LFLG + V TP+L DL+ +G ++ K G
Sbjct: 172 PPSAKSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGH 231
Query: 228 -------LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG 280
L + T+I DSG +Y T VY+ + I +I L + L +C
Sbjct: 232 NTSVGPFLANKTVI-DSGTTYTLLTPPVYEAMRKSIEEQVI---LPTLGNSAGLDLC--- 284
Query: 281 PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVIS 319
F + G + F + F N+ V+LV+P E ++
Sbjct: 285 -FNSSGDTSYGFPSVTFYFANQ---VQLVLPFENIFQVT 319
>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 56/152 (36%), Positives = 76/152 (50%), Gaps = 15/152 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + F FDTGSDLTW QC+ C E + P K+ + CS+P
Sbjct: 138 YVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSP 197
Query: 73 RCAALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L N P C C Y I+YGD S+G D L ++ VFN L FG
Sbjct: 198 TCDELKSGTGNSPSCSAST--CVYGIQYGDQSYSVGFFAQD--KLALTSTDVFNNFL-FG 252
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ 162
CG Q+N G AG++GLGR +S++S+
Sbjct: 253 CG--QNNRGLF--VGVAGLIGLGRNALSLMSK 280
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 74/279 (26%), Positives = 120/279 (43%), Gaps = 27/279 (9%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ---YKPHKNI---- 66
F Y + + VG PP DTGSDL WV C + G ++P ++
Sbjct: 101 FEYL-MYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSSTYSQ 159
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNV 125
+ C + C AL + C + +C Y+ YGDG +IG L T+ F G V
Sbjct: 160 LSCQSNACQALSQAS---CD-ADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQVRV 215
Query: 126 P-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQN 180
P + FGC + + G + G++GLG G S+VSQL I + +C+ N
Sbjct: 216 PRVNFGC--STASAGTFR---SDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDAN 270
Query: 181 GRGVLFLGDGKVPSS-GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
L G V S G A TP++ + D +Y + + G+ D +I DSG
Sbjct: 271 SSSTLNFGSRAVVSEPGAASTPLVPSDVD-SYYTVALESVAVGGQEVATHDSRIIVDSGT 329
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
+ + + +V+ + R + ++ P ++ L +C+
Sbjct: 330 TLTFLDPALLGPLVTELERRI--KLQRVQPPEQLLQLCY 366
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 69/240 (28%), Positives = 103/240 (42%), Gaps = 30/240 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
++ + + L VG PP + + DTGSDL W QC PCT C QY P I SN
Sbjct: 58 YNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQC-MPCTNC----YSQYAP---IFDPSNSS 109
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCG 132
RC + C Y+I Y D S G L T+ + ++G F +P T GCG
Sbjct: 110 TF-----KEKRCN--GNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCG 162
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG-DGK 191
+N P +G++GL G S+++Q+ G ++ +C G + G +
Sbjct: 163 HNS----SWFKPTFSGMVGLSWGPSSLITQMG--GEYPGLMSYCFASQGTSKINFGTNAI 216
Query: 192 VPSSGVAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
V GV T M +A +L +G + G + + +I DSG + YF
Sbjct: 217 VAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYF 276
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 112/274 (40%), Gaps = 48/274 (17%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWP--------NP 82
DT S+LTWVQC APC C + P + VPC +P C AL P
Sbjct: 159 DTASELTWVQC-APCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQQLATGAGAGAP 217
Query: 83 PRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLS 142
P C Y + Y DG S G L D L G V + FGCG + P P
Sbjct: 218 PCDAGRPAACSYALSYRDGSYSRGVLAHDRLSL---AGEVID-GFVFGCGTSNQGP-PFG 272
Query: 143 PPDTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCI----GQNGRGVLFLGDGKVP---S 194
T+G++GLGR ++S+VSQ + ++G V +C+ + G L LGD S
Sbjct: 273 --GTSGLMGLGRSQLSLVSQTVDQFG---GVFSYCLPLSRESDASGSLVLGDDPSAYRNS 327
Query: 195 SGVAWTPMLQNS----------ADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
+ V +T M+ NS +L +G E+ +G S I DSG
Sbjct: 328 TPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQEVESTGFSA-----RAIVDSGTVITSL 382
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
VY + + M L P AP L C+
Sbjct: 383 VPSVYNAVRAEFMSQLAEYP--QAPGFSILDTCF 414
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 55/156 (35%), Positives = 76/156 (48%), Gaps = 19/156 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V+L VG PP+ DTGSDL W QC APC C P+ + P + + C+
Sbjct: 104 YLVDLAVGTPPQPVSALLDTGSDLIWTQC-APCASCLPQPDPIFSPGASSSYEPMRCAGE 162
Query: 73 RCA-ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL----RFSNGSVFNVPL 127
C LH C+ P D C Y YGDG ++ G T+ F + + PL
Sbjct: 163 LCNDILHHS----CQRP-DTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPL 217
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
FGCG N G L+ + +G++G GR +S+VSQL
Sbjct: 218 GFGCG--TMNKGSLN--NGSGIVGFGRAPLSLVSQL 249
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 87/185 (47%), Gaps = 26/185 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F VN +VG+PP DTGSDL WVQC PC C + + P K+ + +P
Sbjct: 91 FLVNFSVGRPPVPQLVGIDTGSDLLWVQC-RPCADCFRQSTPIFDPSKSSTYVDLSYDSP 149
Query: 73 RCAALHWPNPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVPLTFG 130
C PN P+ K+ + +QC Y Y DG +S G L T+ S+ G+V + FG
Sbjct: 150 IC-----PNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFG 204
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-----QNGRGVL 185
CG++ N G +G+LGL G SIVS+L +CIG L
Sbjct: 205 CGHS--NRGRFDGQQ-SGILGLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHNQL 255
Query: 186 FLGDG 190
LGDG
Sbjct: 256 VLGDG 260
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 69/240 (28%), Positives = 103/240 (42%), Gaps = 30/240 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
++ + + L VG PP + + DTGSDL W QC PCT C QY P I SN
Sbjct: 58 YNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQC-MPCTNC----YSQYAP---IFDPSNSS 109
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCG 132
RC + C Y+I Y D S G L T+ + ++G F +P T GCG
Sbjct: 110 TF-----KEKRCN--GNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCG 162
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG-DGK 191
+N P +G++GL G S+++Q+ G ++ +C G + G +
Sbjct: 163 HNS----SWFKPTFSGMVGLSWGPSSLITQMG--GEYPGLMSYCFASQGTSKINFGTNAI 216
Query: 192 VPSSGVAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
V GV T M +A +L +G + G + + +I DSG + YF
Sbjct: 217 VAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYF 276
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 78/274 (28%), Positives = 112/274 (40%), Gaps = 28/274 (10%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNPRCAA 76
+T+G DTGSDLTWVQC+ PC C +KP V C++ C +
Sbjct: 67 VTMGLGSTNMTVIIDTGSDLTWVQCE-PCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQS 125
Query: 77 LHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYN 134
L + N C C+Y + YGDG + G L + + S G V FGCG N
Sbjct: 126 LQFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVE----QLSFGGVSVSDFVFGCGRN 181
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDGK 191
N G +G++GLGR +S+VSQ V +C+ G L +G+
Sbjct: 182 --NKGLFG--GVSGLMGLGRSYLSLVSQTN--ATFGGVFSYCLPTTESGASGSLVMGNES 235
Query: 192 VPSSGV---AWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL---TLIFDSGASYAYFT 245
V +T ML N YIL + G + + ++ DSG
Sbjct: 236 SVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQVPSFGNGGVLIDSGTVITRLP 295
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR 279
S VY+ + +L ++ G P AP L C+
Sbjct: 296 SSVYKALKALFLKQFTGFP--SAPGFSILDTCFN 327
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 57/152 (37%), Positives = 74/152 (48%), Gaps = 15/152 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P FDTGSDLTW QC C E + P K+ V CS+
Sbjct: 132 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSA 191
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C +L N C N C Y I+YGD S+G L + F L +N VF+ + FG
Sbjct: 192 ACGSLSSATGNAGSCSASN--CIYGIQYGDQSFSVGFLAKEKFTL--TNSDVFD-GVYFG 246
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ 162
CG N N G + AG+LGLGR ++S SQ
Sbjct: 247 CGEN--NQGLFT--GVAGLLGLGRDKLSFPSQ 274
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 51/155 (32%), Positives = 77/155 (49%), Gaps = 11/155 (7%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
+ V++ +G PP+ DTGSDLTW QC APC C + ++ P + +++PC
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQC-APCVSCFRQSLPRFNPSRSMTFSVLPCDLR 169
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV--FNVP-LTF 129
C L W + N C Y Y D + G L +D F ++ ++ +VP LTF
Sbjct: 170 ICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTF 229
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR 164
GCG N G +T G+ G RG +S+ +QL+
Sbjct: 230 GCGL--FNNGIFVSNET-GIAGFSRGALSMPAQLK 261
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 85/284 (29%), Positives = 128/284 (45%), Gaps = 45/284 (15%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
YF +++ +G PPK + DTGSDL W+QC PC C + Y P ++ + C
Sbjct: 191 EYF-MDVFIGTPPKHYSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKESSSFENITCH 248
Query: 71 NPRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS--NGS-----V 122
+PRC + P+PP+ CK N C Y YGD ++ G + F + + NG V
Sbjct: 249 DPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKHV 308
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNG 181
NV FGCG+ N G AG+LGLGRG +S SQL+ YG + +C+
Sbjct: 309 ENV--MFGCGH--WNRGLFH--GAAGLLGLGRGPLSFASQLQSIYG---HSFSYCLVDRN 359
Query: 182 RGV-----LFLGDGK--VPSSGVAWTPML---QNSADLKHYILGPAELLYSGKSCGLKDL 231
L G+ K + + +T + +NS D +Y+ G ++ G+ + +
Sbjct: 360 SDTSVSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYV-GIKSIMVDGEVLKIPEE 418
Query: 232 T----------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPL 265
T I DSG + YF Y+ I M+ + G L
Sbjct: 419 TWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYEL 462
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 66/246 (26%), Positives = 109/246 (44%), Gaps = 24/246 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V + G P + + DTGS L+W+QC C + + P + + C++
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 177
Query: 73 RCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTF 129
+C++L N P C+ ++ C Y YGD S+G L DL L S +P +
Sbjct: 178 QCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ----TLPGFVY 233
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI-GQNGRGVLFL 187
GCG Q + G AG+LGLGR ++S++ Q+ ++G +C+ + G G L +
Sbjct: 234 GCG--QDSDGLFG--RAAGILGLGRNKLSMLGQVSSKFGY---AFSYCLPTRGGGGFLSI 286
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIFDSGASYAY 243
G + S +TPM + + Y L + G++ G+ + I DSG
Sbjct: 287 GKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVITR 346
Query: 244 FTSRVY 249
VY
Sbjct: 347 LPMSVY 352
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 51/155 (32%), Positives = 77/155 (49%), Gaps = 11/155 (7%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
+ V++ +G PP+ DTGSDLTW QC APC C + ++ P + +++PC
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQC-APCVSCFRQSLPRFNPSRSMTFSVLPCDLR 169
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV--FNVP-LTF 129
C L W + N C Y Y D + G L +D F ++ ++ +VP LTF
Sbjct: 170 ICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTF 229
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR 164
GCG N G +T G+ G RG +S+ +QL+
Sbjct: 230 GCGL--FNNGIFVSNET-GIAGFSRGALSMPAQLK 261
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 78/255 (30%), Positives = 114/255 (44%), Gaps = 26/255 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC----TKPPEKQYKPHKNIVPCSNP 72
+ + L +G PP F DTGSDLTW QC PC C T + + VPC++
Sbjct: 93 YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPIYDTAVSSSFSPVPCASA 151
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C + W + C + C Y YGDG S G L T+ + G V + FGCG
Sbjct: 152 TCLPI-W-SSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPG-VSVGGIAFGCG 208
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF--LGDG 190
+ G LS ++ G +GLGRG +S+V+QL + G VLF L +
Sbjct: 209 VDN---GGLS-YNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLGSPVLFGALAEL 264
Query: 191 KVPSSGVA--WTPMLQNS-------ADLKHYILGPAELLYSGKSCGLKDL---TLIFDSG 238
PS+G A TP++Q+ L+ LG A L + L+D +I DSG
Sbjct: 265 AAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVDSG 324
Query: 239 ASYAYFTSRVYQEIV 253
++ + ++ +V
Sbjct: 325 TTFTFLVESAFRVVV 339
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 78/281 (27%), Positives = 120/281 (42%), Gaps = 27/281 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC-TGCTKPPEKQYKPHKN----IVPCSN 71
+ + +G P K + DTGS LTW+QC +PC C + + P + V CS
Sbjct: 137 YVTRMGLGTPAKPYIMVVDTGSSLTWLQC-SPCRVSCHRQSGPVFDPKTSSSYAAVSCST 195
Query: 72 PRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
P+C L NP C +D C Y+ YGD S+G L D + F + SV N +
Sbjct: 196 PQCNDLSTATLNPAACSS-SDVCIYQASYGDSSFSVGYLSKDT--VSFGSNSVPN--FYY 250
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD 189
GCG Q N G +AG++GL R ++S++ QL + +C+ +
Sbjct: 251 GCG--QDNEGLFG--RSAGLMGLARNKLSLLYQLAP--TLGYSFSYCLPSSSSSGYLSIG 304
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASYAYF 244
P ++TPM+ ++ D Y + + + +GK S L I DSG
Sbjct: 305 SYNPGQ-YSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEYSSLPTIIDSGTVITRL 363
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
+ VY + + + GT K A L C+ G +L
Sbjct: 364 PTTVYDALSKAVAGAMKGT--KRADAYSILDTCFVGQASSL 402
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 77/259 (29%), Positives = 117/259 (45%), Gaps = 23/259 (8%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNI 66
P + V + +G P K F FDTGSDLTW QC+ GC + ++ P +KN
Sbjct: 135 PTGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDPTTSTSYKN- 193
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
V CS+ C + N P ++ C Y I+YG G +IG L T+ L ++ VF
Sbjct: 194 VSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYGS-GYTIGFLATET--LAIASSDVFKNF 250
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
L FGC ++ + G + T G+LGLGR I++ SQ +N+ +C+ +
Sbjct: 251 L-FGC--SEESRGTFN--GTTGLLGLGRSPIALPSQTTNK--YKNLFSYCLPASPSSTGH 303
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKH-YILGPAELLYSGKSCGLKDLT--LIFDSGASYAY 243
L G S TP+ S LK Y L + G+ + I DSG ++ +
Sbjct: 304 LSFGVEVSQAAKSTPI---SPKLKQLYGLNTVGISVRGRELPINGSISRTIIDSGTTFTF 360
Query: 244 FTSRVYQEIVSLIMRDLIG 262
S Y + S R+++
Sbjct: 361 LPSPTYSALGS-AFREMMA 378
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 127/320 (39%), Gaps = 47/320 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQC-DAPCTGCTKPP--EKQYKPHKNIVPCSNPR 73
V L VG PP+ DTGS+L+W+ C +P G P Y P VPCS+P
Sbjct: 65 LTVTLAVGDPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSP----VPCSSPI 120
Query: 74 C--AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C P P C C I Y D S G L + F + GSV FGC
Sbjct: 121 CRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVI----GSVTRPGTLFGC 176
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDG 190
+ + + G++G+ RG +S V+QL G + +CI G + L LGD
Sbjct: 177 MDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQL---GFSK--FSYCISGSDSSVFLLLGDA 231
Query: 191 KVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------------I 234
G + +TP++ S L ++ + G G K L+L +
Sbjct: 232 SYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTM 291
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-----DKTLPICW------RGPFK 283
DSG + + VY + + + + L+L D T+ +C+ R F
Sbjct: 292 VDSGTQFTFLMGPVYTALKNEFITQ-TKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFS 350
Query: 284 ALGQVTEYFKPLALSFTNRR 303
L V+ F+ +S + ++
Sbjct: 351 GLPMVSLMFRGAEMSVSGQK 370
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 51/155 (32%), Positives = 77/155 (49%), Gaps = 11/155 (7%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
+ V++ +G PP+ DTGSDLTW QC APC C + ++ P + +++PC
Sbjct: 85 YLVHMAIGTPPQPVQLILDTGSDLTWTQC-APCVSCFRQSLPRFNPSRSMTFSVLPCDLR 143
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV--FNVP-LTF 129
C L W + N C Y Y D + G L +D F ++ ++ +VP LTF
Sbjct: 144 ICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTF 203
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR 164
GCG N G +T G+ G RG +S+ +QL+
Sbjct: 204 GCGL--FNNGIFVSNET-GIAGFSRGALSMPAQLK 235
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 74/298 (24%), Positives = 118/298 (39%), Gaps = 46/298 (15%)
Query: 12 PIFSY---FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI-- 66
P+ +Y + + L++G PP + DTGSDL W QC PCT C K + P +
Sbjct: 52 PVSAYDCEYLMELSIGTPPIKIYAEADTGSDLVWFQC-IPCTKCYKQQNPMFDPRSSSSY 110
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VF 123
+ C C L + C C+Y Y D + G L + L + G V
Sbjct: 111 TNITCGTESCNKL---DSSLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVA 167
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI----- 177
+ FGCG+N G++GLGRG +S++SQ+ G N+ C+
Sbjct: 168 FQGIIFGCGHNNSGFNDRE----MGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNT 223
Query: 178 -------GQNGRGVLFLGDGKVPSSGVAWTPMLQNS-----ADLKHYILGPAELLYS-GK 224
G+G LG+G V TP++ A L + L +S G
Sbjct: 224 DPSITSQMNFGKGSEVLGNGTVS------TPLISKDGTGYFATLLGISVEDINLPFSNGS 277
Query: 225 SCG-LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP 281
S G + ++ DSG + Y Y ++ + + P ++ +C++ P
Sbjct: 278 SLGTITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRI----DGYELCYQTP 331
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 79/315 (25%), Positives = 126/315 (40%), Gaps = 41/315 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA-PCTGCTKPP--EKQYKPHKNIVPCSNPR 73
V+LTVG PP+ DTGS+L+W+ C P T P Y P PC++
Sbjct: 60 LTVSLTVGSPPQNVTMVLDTGSELSWLHCKKLPNLNSTFNPLLSSSYTP----TPCNSSI 115
Query: 74 CA--ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C P C N C + Y D S+ G L + F L FGC
Sbjct: 116 CTTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSL----AGAAQPGTLFGC 171
Query: 132 GYNQHNPGPLSP-PDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGD 189
+ ++ T G++G+ RG +S+V+Q+ +CI G++ GVL LGD
Sbjct: 172 MDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQMS-----LPKFSYCISGEDALGVLLLGD 226
Query: 190 GKVPSSGVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGLKDLT----LI 234
G S + +TP++ + ++ I +LL KS + D T +
Sbjct: 227 GTDAPSPLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTM 286
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA-PD---DKTLPICWRGP--FKALGQV 288
DSG + + VY + + G ++ P+ + + +C+ P F A+ V
Sbjct: 287 VDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASFAAVPAV 346
Query: 289 TEYFKPLALSFTNRR 303
T F + + R
Sbjct: 347 TLVFSGAEMRVSGER 361
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 77/278 (27%), Positives = 118/278 (42%), Gaps = 40/278 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT-KPPEKQYKPHKNI----VPCS 70
YF V++ +G PP+ DTGSDL WV+C A C C+ PP + P + C
Sbjct: 88 YF-VDIRLGTPPQSLLLVADTGSDLVWVKCSA-CRNCSHHPPSSAFLPRHSSSFSPFHCF 145
Query: 71 NPRCAALHWPNPPR--CKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
+P C L P+ P C H + C + Y DG S G + L+ +GS ++
Sbjct: 146 DPHCRLL--PHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHLK 203
Query: 127 -LTFGCGYNQHNPGPLSPP--DTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQNG- 181
L+FGCG+ P GV+GLGRG IS SQL R +G N +C+
Sbjct: 204 GLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFG---NKFSYCLMDYTL 260
Query: 182 ----RGVLFLGDG--KVP---SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT 232
L +G G +P ++ +++TP+ N Y + + G +
Sbjct: 261 SPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAV 320
Query: 233 ----------LIFDSGASYAYFTSRVYQEIVSLIMRDL 260
+ DSG + Y T Y+E++ + R +
Sbjct: 321 WEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRV 358
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 506
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 64/181 (35%), Positives = 88/181 (48%), Gaps = 14/181 (7%)
Query: 35 DTGSDLTWVQCDAPCTG--CTKPPEKQYKPHKNIV----PCSNPRCAAL-HWPNPPRCKH 87
DT SD+ WVQC APC C + Y P K+I+ PCS+P+C +L + N
Sbjct: 179 DTASDVPWVQC-APCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCTGAG 237
Query: 88 PNDQCDYEIEYGDGGSSIGALVTDLFPLRFS-NGSVFNVPLTFGCGYNQHNPGPLSPPDT 146
C Y + Y DG + G V+DL L G+V FGC + PG + T
Sbjct: 238 NTGTCQYRVLYPDGSGTSGTYVSDLLTLNADPKGAVSK--FQFGCSHALLRPGSFNN-KT 294
Query: 147 AGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG--RGVLFLGDGKVPSSGVAWTPMLQ 204
AG + LGRG S+ SQ + NV +C+ G +G L LG + +S A TPML+
Sbjct: 295 AGFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQHAASRYAVTPMLK 354
Query: 205 N 205
+
Sbjct: 355 S 355
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 73/273 (26%), Positives = 110/273 (40%), Gaps = 28/273 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C K + P K+ V C++
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANVSCTDS 222
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
CA L + C C Y ++YGDG ++G D + F FGCG
Sbjct: 223 ACADL---DTNGCT--GGHCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFR----FGCG 273
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFLGDG 190
+ N G TAG++GLGRG+ S+ Q Y +C+ G G L G G
Sbjct: 274 --EKNNGLFG--KTAGLMGLGRGKTSLTVQ--AYNKYGGAFAYCLPALTTGTGYLDFGPG 327
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAYFT 245
++ TPML + +Y+ G + G+ + + + DSG
Sbjct: 328 SAGNN-ARLTPMLTDKGQTFYYV-GMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLP 385
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
+ Y + S + ++ K AP L C+
Sbjct: 386 ATAYTALSSAFDKVMLARGYKKAPGYSILDTCY 418
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 76/290 (26%), Positives = 120/290 (41%), Gaps = 36/290 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ ++ +G PP DT +D W QC+ PC C + P K+ +PCS+P
Sbjct: 89 YIISFLIGTPPFQLYGVMDTANDNIWFQCN-PCKPCFNTTSPMFDPSKSSTYKTIPCSSP 147
Query: 73 RCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF-- 129
+C + C + + C+Y YG S G L D L +N + P++F
Sbjct: 148 KCKNVE---NTHCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNN----DTPISFKN 200
Query: 130 ---GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNG 181
GCG+ N GPL +G +GLGRG +S +SQL I +C+ +
Sbjct: 201 IVIGCGH--RNKGPLEGY-VSGNIGLGRGPLSFISQLNSS--IGGKFSYCLVPLFSNEGI 255
Query: 182 RGVLFLGDGKVPSS-GVAWTPMLQN----SADLKHYILGPAELLYSGKSCGLKDL-TLIF 235
G L GD V S G TP+ S L +G + + + +L I
Sbjct: 256 SGKLHFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFENSTSKNDNLGNTII 315
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
DSG + VY + S I+ ++ +P+ + +C++ K L
Sbjct: 316 DSGTTLTILPENVYSRLES-IVTSMVKLERAKSPNQQ-FKLCYKATLKNL 363
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 53/152 (34%), Positives = 72/152 (47%), Gaps = 13/152 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
F VN ++G+P DTGS++ WV+C APC CT+ P K+ +PC+N
Sbjct: 99 FLVNFSMGQPATPQLAIMDTGSNILWVRC-APCKRCTQQNGPLLDPSKSSTYASLPCTNT 157
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C H+ C N QC Y + Y G SS G L T+ S+ V VP + FGC
Sbjct: 158 MC---HYAPSAYCNRLN-QCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVFGC 213
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
H G GV GLG+G S V+++
Sbjct: 214 ---SHENGDYKDRRFTGVFGLGKGITSFVTRM 242
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 81/296 (27%), Positives = 120/296 (40%), Gaps = 35/296 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC--------TKPPEKQYKPHKNIVP 68
+ V + VG P K F DTGS L+W+QC C T K YK
Sbjct: 107 YYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKALSCSS- 165
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
S N P C + C Y+ YGD SIG L D+ L S +
Sbjct: 166 -SQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSAAP--SSGFV 222
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI--------GQ 179
+GCG Q N G +AG++GL ++S++ QL +YG N +C+
Sbjct: 223 YGCG--QDNQGLFG--RSAGIIGLANDKLSMLGQLSNKYG---NAFSYCLPSSFSAQPNS 275
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK----DLTLIF 235
+ G L +G + SS +TP+++N Y LG + +GK G+ ++ I
Sbjct: 276 SVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVPTII 335
Query: 236 DSGASYAYFTSRVYQEI-VSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTE 290
DSG +Y + S +M ++ AP L C++G K + V E
Sbjct: 336 DSGTVITRLPVAIYNALKKSFVM--IMSKKYAQAPGFSILDTCFKGSVKEMSTVPE 389
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 80/176 (45%), Gaps = 20/176 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P ++ V C+ P
Sbjct: 180 YVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAP 239
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L N C C Y ++YGDG SIG D L S ++ F G
Sbjct: 240 ACSDL---NIHGCS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 289
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCIGQNGRGVLFL 187
+ N G + AG+LGLGRG+ S+ V +YG V HC+ G +L
Sbjct: 290 CGERNEGLFG--EAAGLLGLGRGKTSLPVQTYDKYG---GVFAHCLPARSTGTGYL 340
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 87/185 (47%), Gaps = 26/185 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F VN +VG+PP DTGSDL WVQC PC C + + P K+ + +P
Sbjct: 59 FLVNFSVGRPPVPQLVGIDTGSDLLWVQC-RPCADCFRQSTPIFDPSKSSTYVDLSYDSP 117
Query: 73 RCAALHWPNPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVPLTFG 130
C PN P+ K+ + +QC Y Y DG +S G L T+ S+ G+V + FG
Sbjct: 118 IC-----PNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFG 172
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-----QNGRGVL 185
CG++ N G +G+LGL G SIVS+L +CIG L
Sbjct: 173 CGHS--NRGRFDGQ-QSGILGLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHNQL 223
Query: 186 FLGDG 190
LGDG
Sbjct: 224 VLGDG 228
>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
Length = 530
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 78/253 (30%), Positives = 105/253 (41%), Gaps = 27/253 (10%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ------YKPH---- 63
F ++A+ +TVG P + F DTGSDL W+ C C GCT P Y P
Sbjct: 114 FLHYAL-VTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPASAASGSASFYIPSMSST 170
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSNG-- 120
VPC++ C C QC Y++ Y SS G LV D+ L +
Sbjct: 171 SQAVPCNSQFCELRK-----ECS-TTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIP 224
Query: 121 SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ + FGCG Q L G+ GLG ISI S L + GL N C ++
Sbjct: 225 QILKAQILFGCGQVQTG-SFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRD 283
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGAS 240
G G + GD SS TP+ N Y + +E+ G S + + IFD+G S
Sbjct: 284 GIGRISFGDQG--SSDQEETPLDVNPQH-PTYTISISEMTV-GNSLTDLEFSTIFDTGTS 339
Query: 241 YAYFTSRVYQEIV 253
+ Y Y I
Sbjct: 340 FTYLADPAYTYIT 352
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 72/153 (47%), Gaps = 14/153 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V L +G P F DT SDL W QC PC C K + + P + +VPC++
Sbjct: 88 YLVKLGLGTPQHCFTAAIDTASDLIWTQCQ-PCVKCYKQLDPVFNPVASTSYAVVPCNSD 146
Query: 73 RCAALHWPNPPRCKHPNDQ--CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L R +D+ C Y YG ++ G L D R + G + FG
Sbjct: 147 TCDELDTHRCARDGDSDDEDACQYTYSYGGNATTRGILAVD----RLAIGDDVFRGVVFG 202
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
C + GP PP +GV+GLGRG +S+VSQL
Sbjct: 203 CSSSSVG-GP--PPQVSGVVGLGRGALSLVSQL 232
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 87/185 (47%), Gaps = 26/185 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
F VN +VG+PP DTGSDL WVQC PC C + + P K+ + +P
Sbjct: 59 FLVNFSVGRPPVPQLVGIDTGSDLLWVQC-RPCADCFRQSTPIFDPSKSSTYVDLSYDSP 117
Query: 73 RCAALHWPNPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVPLTFG 130
C PN P+ K+ + +QC Y Y DG +S G L T+ S+ G+V + FG
Sbjct: 118 IC-----PNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFG 172
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-----QNGRGVL 185
CG++ N G +G+LGL G SIVS+L +CIG L
Sbjct: 173 CGHS--NRGRFDGQ-QSGILGLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHNQL 223
Query: 186 FLGDG 190
LGDG
Sbjct: 224 VLGDG 228
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 80/176 (45%), Gaps = 20/176 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G P + FDTGSD TWVQC C + EK + P ++ V C+ P
Sbjct: 178 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSSTYANVSCAAP 237
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C+ L N C C Y ++YGDG SIG D L S ++ F G
Sbjct: 238 ACSDL---NIHGCS--GGHCLYGVQYGDGSYSIGFFAMDTLTL-----SSYDAVKGFRFG 287
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISI-VSQLREYGLIRNVIGHCIGQNGRGVLFL 187
+ N G + AG+LGLGRG+ S+ V +YG V HC+ G +L
Sbjct: 288 CGERNEGLFG--EAAGLLGLGRGKTSLPVQTYDKYG---GVFAHCLPARSTGTGYL 338
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 84/289 (29%), Positives = 129/289 (44%), Gaps = 37/289 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPC 69
+ + ++L +G PP+ DTGS L W QC PC C Y ++ + C
Sbjct: 32 MTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQ-PCAVCFNQSLPYYDASRSSTFALPSC 90
Query: 70 SNPRCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-L 127
+ +C P+ C + Q C Y YGD ++IG L D+ + F G+ +VP +
Sbjct: 91 DSTQCKL--DPSVTMCVNQTVQTCAYSYSYGDKSATIGFL--DVETVSFVAGA--SVPGV 144
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
FGCG N N G +T G+ G GRG +S+ SQL+ G + G+ VLF
Sbjct: 145 VFGCGLN--NTGIFRSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTAVSGRKPSTVLFD 200
Query: 188 GDGKVPSSG---VAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDLT--LIF 235
+ +G V TP+++N A LK +G L + LK+ T I
Sbjct: 201 LPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTII 260
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGP 281
DSG ++ RVY+ ++ D +KL P ++T P +C+ P
Sbjct: 261 DSGTAFTSLPPRVYR-----LVHDEFAAHVKLPVVPSNETGPLLCFSAP 304
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 88/336 (26%), Positives = 137/336 (40%), Gaps = 38/336 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V L VG P + F DTGSDLTWV+C PP + ++P + +PCS+
Sbjct: 116 YF-VKLRVGTPVQEFTLVADTGSDLTWVKCAG-----ASPPGRVFRPKTSRSWAPIPCSS 169
Query: 72 PRCAALHWP-NPPRCKHPNDQCDYEIEYGDGGSSIGALV-TDLFPLRFSNGSVFNVP-LT 128
C L P C P C Y+ Y +G + +V T+ + G V + +
Sbjct: 170 DTC-KLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKDVV 228
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREY---GLIRNVIGHCIGQNGRGVL 185
GC + H+ D GVL LG +IS +Q ++ H +N G L
Sbjct: 229 LGCS-SSHDGQSFRSAD--GVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNATGYL 285
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-------KDLTLIFDSG 238
G G+VP + T + + ++ Y + + +GK+ + K +I DSG
Sbjct: 286 AFGPGQVPRTPATQTKLFLDP-EMPFYGVKVDAIHVAGKALDIPAEVWDAKSGGVILDSG 344
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTP-LKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
+ + Y+ +V+ + + L G P + P + R P E LA+
Sbjct: 345 NTLTVLAAPAYKAVVAALSKHLDGVPKVSFPPFEHCYNWTARRP-----GAPEIIPKLAV 399
Query: 298 SFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYLTGK 333
F S RL P ++Y VI V + I G+
Sbjct: 400 QFA---GSARLEPPAKSY-VIDVKPGVKCIGVQEGE 431
>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 457
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 66/257 (25%), Positives = 105/257 (40%), Gaps = 37/257 (14%)
Query: 35 DTGSDLTWVQCDAP-CTGCTKPPEKQYKPHKNIV----PCSNPRCAALHWPNPPRCKHPN 89
D+GS L W+QC P C C + + P K++ C+ C RCK PN
Sbjct: 119 DSGSSLVWLQCGTPYCRNCYRQKIPLFNPSKSVTYMKRLCNTAECRVALGDEYWRCKKPN 178
Query: 90 DQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTA 147
C Y +Y D + G + TD+ FP S + + + FGCGYN +P PP
Sbjct: 179 QICKYHEDYLDDSYTEGVISTDIFTFPEHISGFGNYTLRIIFGCGYNNSDPQHFYPP--- 235
Query: 148 GVLGLGRGRISIVSQLREYGLIRNVIGHCIG----QNGRGVLFLGDGKVPSSGVAWTPML 203
G++GL + S+V Q+ + +C+ QN +G + + G S T ++
Sbjct: 236 GLVGLTNNKASLVGQMD-----VDQFSYCVSIDTEQNLKGSMEIRFGLAASISGHSTQLV 290
Query: 204 QNSADLKHYILGPAELLYSGK--------------SCGLKDLTLIFDSGASYAYFTSRVY 249
NS YI + +Y + G LT+ D+G +Y + V
Sbjct: 291 PNSDGW--YIFKNVDGIYVNEFEVEGYPAWVFKYTEGGQGGLTM--DTGTTYTELHNSVM 346
Query: 250 QEIVSLIMRDLIGTPLK 266
++ L+ + P K
Sbjct: 347 DPLIKLLEEHITIVPEK 363
>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 73/249 (29%), Positives = 103/249 (41%), Gaps = 26/249 (10%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----V 67
+ +G P F DTGSDL WV CD AP G + + + Y P ++ V
Sbjct: 99 TTVELGTPGVKFMVALDTGSDLFWVPCDCSRCAPTHGASYASDFELSIYNPRESSTSKKV 158
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNG--SVFN 124
C+N CA + RC C Y + Y +S G LV D+ L +G
Sbjct: 159 TCNNDMCAQRN-----RCLGTFSSCPYIVSYVSAQTSTSGILVKDVLHLTTEDGGREFVE 213
Query: 125 VPLTFGCGYNQHNPG-PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+TFGCG Q ++ P+ G+ GLG +IS+ S L GLI + C G +G G
Sbjct: 214 AYVTFGCGQVQSGSFLDIAAPN--GLFGLGMEKISVPSVLSREGLIADSFSMCFGHDGIG 271
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
+ GD P TP N A + + + G + T +FDSG S+ Y
Sbjct: 272 RISFGDKGSPDQ--EETPFNVNPAHPTYNVTVTQARV--GTMLIDVEFTALFDSGTSFTY 327
Query: 244 FTSRVYQEI 252
Y +
Sbjct: 328 MVDPAYSRV 336
>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
Length = 530
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 78/253 (30%), Positives = 105/253 (41%), Gaps = 27/253 (10%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ------YKPH---- 63
F ++A+ +TVG P + F DTGSDL W+ C C GCT P Y P
Sbjct: 114 FLHYAL-VTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPASAASGSASFYIPSMSST 170
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSNG-- 120
VPC++ C C QC Y++ Y SS G LV D+ L +
Sbjct: 171 SQAVPCNSQFCELRK-----ECS-TTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIP 224
Query: 121 SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ + FGCG Q L G+ GLG ISI S L + GL N C ++
Sbjct: 225 QILKAQILFGCGQVQTG-SFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRD 283
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGAS 240
G G + GD SS TP+ N Y + +E+ G S + + IFD+G S
Sbjct: 284 GIGRISFGDQG--SSDQEETPLDVNPQH-PTYTISISEITV-GNSLTDLEFSTIFDTGTS 339
Query: 241 YAYFTSRVYQEIV 253
+ Y Y I
Sbjct: 340 FTYLADPAYTYIT 352
>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
Length = 530
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 78/253 (30%), Positives = 105/253 (41%), Gaps = 27/253 (10%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ------YKPH---- 63
F ++A+ +TVG P + F DTGSDL W+ C C GCT P Y P
Sbjct: 114 FLHYAL-VTVGTPGQTFMVALDTGSDLFWLPCQ--CDGCTPPASAASGSASFYIPSMSST 170
Query: 64 KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSNG-- 120
VPC++ C C QC Y++ Y SS G LV D+ L +
Sbjct: 171 SQAVPCNSQFCELRK-----ECS-TTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIP 224
Query: 121 SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
+ + FGCG Q L G+ GLG ISI S L + GL N C ++
Sbjct: 225 QILKAQILFGCGQVQTGSF-LDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRD 283
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGAS 240
G G + GD SS TP+ N Y + +E+ G S + + IFD+G S
Sbjct: 284 GIGRISFGDQG--SSDQEETPLDVNPQH-PTYTISISEITV-GNSLTDLEFSTIFDTGTS 339
Query: 241 YAYFTSRVYQEIV 253
+ Y Y I
Sbjct: 340 FTYLADPAYTYIT 352
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 84/289 (29%), Positives = 129/289 (44%), Gaps = 37/289 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPC 69
+ + ++L +G PP+ DTGS L W QC PC C Y ++ + C
Sbjct: 88 MTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQ-PCAVCFNQSLPYYDASRSSTFALPSC 146
Query: 70 SNPRCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-L 127
+ +C P+ C + Q C Y YGD ++IG L D+ + F G+ +VP +
Sbjct: 147 DSTQCKL--DPSVTMCVNQTVQTCAYSYSYGDKSATIGFL--DVETVSFVAGA--SVPGV 200
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
FGCG N N G +T G+ G GRG +S+ SQL+ G + G+ VLF
Sbjct: 201 VFGCGLN--NTGIFRSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTAVSGRKPSTVLFD 256
Query: 188 GDGKVPSSG---VAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDLT--LIF 235
+ +G V TP+++N A LK +G L + LK+ T I
Sbjct: 257 LPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTII 316
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGP 281
DSG ++ RVY+ ++ D +KL P ++T P +C+ P
Sbjct: 317 DSGTAFTSLPPRVYR-----LVHDEFAAHVKLPVVPSNETGPLLCFSAP 360
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 77/276 (27%), Positives = 113/276 (40%), Gaps = 30/276 (10%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNPRCAA 76
+T+G K DTGSDLTWVQC+ PC C +KP V C++ C +
Sbjct: 67 VTMGLGSKNMTVIIDTGSDLTWVQCE-PCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQS 125
Query: 77 LHWP--NPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
L + N C N C+Y + YGDG + G L + G V FGCG
Sbjct: 126 LQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSF----GGVSVSDFVFGCGR 181
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDG 190
N N G +G++GLGR +S+VSQ V +C+ G L +G+
Sbjct: 182 N--NKGLFG--GVSGLMGLGRSYLSLVSQTN--ATFGGVFSYCLPTTEAGSSGSLVMGNE 235
Query: 191 KV---PSSGVAWTPMLQNSADLKHYILGPAELLYSGKS----CGLKDLTLIFDSGASYAY 243
++ + +T ML N YIL + G + + ++ DSG
Sbjct: 236 SSVFKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLSFGNGGILIDSGTVITR 295
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR 279
S VY+ + + ++ G P AP L C+
Sbjct: 296 LPSSVYKALKAEFLKKFTGFP--SAPGFSILDTCFN 329
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 106/235 (45%), Gaps = 26/235 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L++G PP DTGSDL W+QC PCT C K + + + C +
Sbjct: 59 YLMELSIGTPPVKIYAQADTGSDLIWLQC-IPCTNCYKQLNPMFDSQSSSTFSNIACGSE 117
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFGC 131
C+ L+ C C Y Y DG + G L + L + G V + FGC
Sbjct: 118 SCSKLY---STSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAFKGVIFGC 174
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNGRGVLF 186
G+N N G + + G++GLGRG +S+VSQ+ L N+ C+ + +
Sbjct: 175 GHN--NNGAFNDKE-MGIIGLGRGPLSLVSQIGS-SLGGNMFSQCLVPFNTNPSISSPMS 230
Query: 187 LGDG-KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGAS 240
G G +V +GV TP++ + Y + LL ++D+ L F++G+S
Sbjct: 231 FGKGSEVLGNGVVSTPLVSKTTYQSFYFV---TLL----GISVEDINLPFNAGSS 278
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 65/130 (50%), Gaps = 16/130 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF +++ VG PPK F DTGSDL W+QC PC C + Y P + + C +
Sbjct: 195 YF-MDVFVGTPPKHFSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKDSSSFRNISCHD 252
Query: 72 PRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS--NGS-----VF 123
PRC + P+PP CK N C Y YGDG ++ G + F + + NG V
Sbjct: 253 PRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHVE 312
Query: 124 NVPLTFGCGY 133
NV FGCG+
Sbjct: 313 NV--MFGCGH 320
>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
Length = 518
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 73/250 (29%), Positives = 103/250 (41%), Gaps = 26/250 (10%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPH----KNIV 67
+++G P K F DTGSDL WV CD AP G T + + Y P V
Sbjct: 105 TTVSLGTPGKKFLVALDTGSDLFWVPCDCSRCAPTEGTTYASDFELSIYNPKGSSTSRKV 164
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNG--SVFN 124
C N CA + RC C Y + Y +S G LV D+ L +
Sbjct: 165 TCDNSLCAHRN-----RCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTTEDNRQEFVE 219
Query: 125 VPLTFGCGYNQHNPG-PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+TFGCG Q ++ P+ G+ GLG +IS+ S L + G + C G +G G
Sbjct: 220 AYVTFGCGQVQTGSFLDIAAPN--GLFGLGLEKISVPSILSKEGFTADSFSMCFGPDGIG 277
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
+ GD P TP N+ + I + G + D T +FDSG S+ Y
Sbjct: 278 RISFGDKGSPDQ--EETPFNLNALHPTYNI--TVTQVRVGTTLIDLDFTALFDSGTSFTY 333
Query: 244 FTSRVYQEIV 253
+Y ++
Sbjct: 334 LVDPIYTNVL 343
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 66/193 (34%), Positives = 91/193 (47%), Gaps = 29/193 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V++ +G P K FDTGSDLTW +C A T + P K+ V CS P
Sbjct: 134 YIVSIGLGSPKKDLMLIFDTGSDLTWARCSAAET---------FDPTKSTSYANVSCSTP 184
Query: 73 RCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C+++ NP RC C Y I+YGDG SIG L + L + +FN FG
Sbjct: 185 LCSSVISATGNPSRCAAST--CVYGIQYGDGSYSIGFLGKE--RLTIGSTDIFN-NFYFG 239
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCIGQNGRGVLFLGD 189
CG Q G AG+LGLGR ++S+VSQ +Y + +C+ + FL
Sbjct: 240 CG--QDVDGLFGKA--AGLLGLGRDKLSVVSQTAPKY---NQLFSYCL-PSSSSTGFLSF 291
Query: 190 GKVPSSGVAWTPM 202
G S +TP+
Sbjct: 292 GSSQSKSAKFTPL 304
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 66/132 (50%), Gaps = 17/132 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
YF +++ +G PPK F DTGSDL W+QC PC C + Y P +I + C+
Sbjct: 195 EYF-IDVFIGSPPKHFSLILDTGSDLNWIQC-VPCFDCFEQNGPYYDPKDSISFRNITCN 252
Query: 71 NPRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-------- 121
+PRC + P+PPR CK C Y YGD ++ G + F + ++ +
Sbjct: 253 DPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRR 312
Query: 122 VFNVPLTFGCGY 133
V NV FGCG+
Sbjct: 313 VENV--MFGCGH 322
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 66/132 (50%), Gaps = 17/132 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
YF +++ +G PPK F DTGSDL W+QC PC C + Y P +I + C+
Sbjct: 195 EYF-IDVFIGSPPKHFSLILDTGSDLNWIQC-VPCFDCFEQNGPYYDPKDSISFRNITCN 252
Query: 71 NPRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-------- 121
+PRC + P+PPR CK C Y YGD ++ G + F + ++ +
Sbjct: 253 DPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRR 312
Query: 122 VFNVPLTFGCGY 133
V NV FGCG+
Sbjct: 313 VENV--MFGCGH 322
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 87/333 (26%), Positives = 138/333 (41%), Gaps = 60/333 (18%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT----------KPPEKQYKPHKNI 66
+ + L++G PP+L DTGSDL W++CD C C YK
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCDN-CDHCDLDHHGETIFFSDASSSYKK---- 59
Query: 67 VPCSNPRCAALHWPN-PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS---- 121
+PC++ C+ + PRC+ + C Y+ EYGDG + G + +D R S+G+
Sbjct: 60 LPCNSTHCSGMSSAGIGPRCE---ETCKYKYEYGDGSRTSGDVGSDRISFR-SHGAGEDH 115
Query: 122 -VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE---YGLIRNVIGHCI 177
F FGC T G++GLG+ S++ QL + Y ++ +
Sbjct: 116 RSFFDGFLFGCARKLKGDWNF----TQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDS 171
Query: 178 GQNGRGVLFLG-DGKVPSSGVAWTPMLQNS--------ADLKHYILGPAELLYSGKSCG- 227
+ + LFLG + V TP+L DL+ +G ++ K G
Sbjct: 172 PPSAKSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGH 231
Query: 228 -------LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG 280
L + T+I DSG +Y T VY+ + I +I L + L +C
Sbjct: 232 NTSVGPFLANKTVI-DSGTTYTLLTPPVYEAMRKSIEEQVI---LPTLGNSAGLDLC--- 284
Query: 281 PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPE 313
F + G + F + F N+ V+LV+P E
Sbjct: 285 -FNSSGDTSYGFPSVTFYFANQ---VQLVLPFE 313
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/129 (37%), Positives = 68/129 (52%), Gaps = 14/129 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCS 70
YF +++ VG PPK F DTGSDL W+QC PC C + Y P +KNI C+
Sbjct: 170 YF-MDVLVGSPPKHFSLILDTGSDLNWIQC-LPCYDCFQQNGAFYDPKASASYKNIT-CN 226
Query: 71 NPRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRF-SNG---SVFNV 125
+ RC + P+PP CK N C Y YGD ++ G + F + +NG ++NV
Sbjct: 227 DQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNV 286
Query: 126 P-LTFGCGY 133
+ FGCG+
Sbjct: 287 ENMMFGCGH 295
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 73/251 (29%), Positives = 110/251 (43%), Gaps = 34/251 (13%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWPNPPRCKHPND 90
DT S+LTWVQC+ PC C E + P + VPC++ C AL + +D
Sbjct: 129 DTASELTWVQCE-PCDACHDQQEPLFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACDD 187
Query: 91 Q---CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTA 147
Q C Y + Y DG S G L D L + F FGCG + N GP T+
Sbjct: 188 QPAACSYTLSYRDGSYSRGVLAHDRLSLAGEDIQGF----VFGCGTS--NQGPFG--GTS 239
Query: 148 GVLGLGRGRISIVSQ-LREYGLIRNVIGHCI---GQNGRGVLFLGDGKV---PSSGVAWT 200
G++GLGR ++S++SQ + ++G V +C+ G L LGD S+ + +T
Sbjct: 240 GLMGLGRSQLSLISQTMDQFG---GVFSYCLPPKESGSSGSLVLGDDASVYRNSTPIVYT 296
Query: 201 PMLQNS-------ADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIV 253
M+ + A+L +G ++ G S G ++ DSG VY +
Sbjct: 297 AMVSDPLQGPFYLANLTGITVGGEDVQSPGFSAGGGGKAIV-DSGTIITSLVPSVYAAVR 355
Query: 254 SLIMRDLIGTP 264
+ + L P
Sbjct: 356 AEFVSQLAEYP 366
>gi|414888271|tpg|DAA64285.1| TPA: hypothetical protein ZEAMMB73_923514, partial [Zea mays]
Length = 335
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 84/269 (31%), Positives = 109/269 (40%), Gaps = 48/269 (17%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC--------------TKPPEKQ 59
F ++AV + +G P F DTGSDL WV CD C C T P+K
Sbjct: 86 FLHYAV-VALGTPNVTFLVALDTGSDLFWVPCD--CINCAPLVSPNYRDLKFDTYSPQKS 142
Query: 60 YKPHKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFS 118
K VPCS+ C P Y I+Y D SS G LV D+ L
Sbjct: 143 STSRK--VPCSSNLCDEQSACRSASSSCP-----YSIQYLSDNTSSTGVLVEDVLYLVTE 195
Query: 119 NG---SVFNVPLTFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQLREYGL-IRNV 172
G + P+TFGCG Q G +P G+LGLG IS+ S L G+ N
Sbjct: 196 YGRQPKIVTAPITFGCGRTQTGSFLGTAAP---NGLLGLGMDTISVPSLLASQGVAAANS 252
Query: 173 IGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGP-AELLYSGKSCGLKDL 231
C Q+G G + GD SS TP L Y P + +G + G K +
Sbjct: 253 FSMCFAQDGHGRINFGD--TGSSDQQETP-------LNMYKQNPYYNISITGATVGSKSI 303
Query: 232 ----TLIFDSGASYAYFTSRVYQEIVSLI 256
I DSG S+ + +Y +I S +
Sbjct: 304 HTKFNAIVDSGTSFTALSDPMYTQITSSV 332
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 79/279 (28%), Positives = 114/279 (40%), Gaps = 41/279 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ + + G P + FDTGSD+ W+QC C E + P ++N V C+
Sbjct: 16 YVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYRN-VSCTE 74
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL----RFSNGSVFNVPL 127
P C L C + C Y + YGDG S+IG L D F L +F N
Sbjct: 75 PACVGLSTRG---CS--SSTCLYGVFYGDGSSTIGFLAMDTFMLTPAQKFKN-------F 122
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRI-SIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
FGCG Q+N G TAG++GLGR S+ SQ+ + NV +C+ +
Sbjct: 123 IFGCG--QNNTGLFQ--GTAGLVGLGRSSTYSLNSQVAPS--LGNVFSYCLPSTSSATGY 176
Query: 187 LGDGKVPSSGVAWTPMLQNS-------ADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
L G P + +T ML ++ DL +G L S S + + I DSG
Sbjct: 177 LNIGN-PQNTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLS--STVFQSVGTIIDSGT 233
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
Y + + + + T LAP L C+
Sbjct: 234 VITRLPPTAYSALKTAVRAAM--TQYTLAPAVTILDTCY 270
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 69/252 (27%), Positives = 114/252 (45%), Gaps = 25/252 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +++++G PP + DTGSDL W QC PC C K + P K+ VPC++
Sbjct: 92 YLMSVSIGTPPVDYIGMADTGSDLMWAQC-LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQ 150
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C A+ + C CDY YGD + G DL + + GS +V GCG
Sbjct: 151 NCKAI---DDSHCG-AQGVCDYSYTYGDQTYTKG----DLGFEKITIGSS-SVKSVIGCG 201
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNGRGVLFL 187
+ +GV+GLG G++S+VSQ+ + I +C+ NG+ + F
Sbjct: 202 HESG----GGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGK-INFG 256
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYI-LGPAELLYSGKSCGLKDLTLIFDSGASYAYFTS 246
+ V GV TP++ + +Y+ L + K +I DSG + ++
Sbjct: 257 QNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAKQGNVIIDSGTTLSFLPK 316
Query: 247 RVYQEIVSLIMR 258
+Y +VS +++
Sbjct: 317 ELYDGVVSSLLK 328
>gi|147839328|emb|CAN63378.1| hypothetical protein VITISV_015700 [Vitis vinifera]
Length = 585
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 73/250 (29%), Positives = 104/250 (41%), Gaps = 26/250 (10%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPH----KNIV 67
+++G P K F DTGSDL WV CD AP G T + + Y P V
Sbjct: 105 TTVSLGTPGKKFLVALDTGSDLFWVPCDCSRCAPTEGTTYASDFELSIYNPKGSSTSRKV 164
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNG--SVFN 124
C+N CA + RC C Y + Y +S G LV D+ L +
Sbjct: 165 TCNNSLCA-----HRNRCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTTEDNRQEFVE 219
Query: 125 VPLTFGCGYNQHNPG-PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+TFGCG Q ++ P+ G+ GLG +IS+ S L + G + C G +G G
Sbjct: 220 AYVTFGCGQVQTGSFLDIAAPN--GLFGLGLEKISVPSILSKEGFTADSFSMCFGPDGIG 277
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
+ GD P TP N+ + I + G + D T +FDSG S+ Y
Sbjct: 278 RISFGDKGGPDQ--EETPFNLNALHPTYNI--TVTQVRVGTTLIDLDFTALFDSGTSFTY 333
Query: 244 FTSRVYQEIV 253
+Y ++
Sbjct: 334 LVDPIYTNVL 343
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 138/341 (40%), Gaps = 61/341 (17%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKN----I 66
P + + L +G PP + DTGSDL W QC APCT C + P Y P + +
Sbjct: 87 PTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFRQPTPLYNPSSSTTFAV 145
Query: 67 VPCSNPRCAALHWPN------PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG 120
+PC++ PP C C Y + YG G +S+ ++ F +
Sbjct: 146 LPCNSSLSVCAAALAGTGTAPPPGCA-----CTYNVTYGSGWTSVFQ-GSETFTFGSTPA 199
Query: 121 SVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-- 177
VP + FGC + +G++GLGRGR+S+VSQL G+ + +C+
Sbjct: 200 GHARVPGIAFGCSTASSG---FNASSASGLVGLGRGRLSLVSQL---GVPK--FSYCLTP 251
Query: 178 --GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLY----SGKSCGLKDL 231
N L LG PS+ + T + ++ + P Y +G S G L
Sbjct: 252 YQDTNSTSTLLLG----PSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTAL 307
Query: 232 T---------------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI 276
+ LI DSG + + YQ++ + ++ L+ P D L +
Sbjct: 308 SIPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVV-SLVTLPTTDGSADTGLDL 366
Query: 277 CWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
C+ P + P S T N +V+P ++Y++
Sbjct: 367 CFMLP------SSTSAPPAMPSMTLHFNGADMVLPADSYMM 401
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/162 (32%), Positives = 78/162 (48%), Gaps = 16/162 (9%)
Query: 7 EFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN- 65
E P + V L +G P F DT SDL W+QC PC C + + + P +
Sbjct: 78 EAPLVPRGGEYLVKLGIGTPQHYFSAAIDTASDLVWLQCQ-PCVSCYRQLDPIFNPRLSS 136
Query: 66 ---IVPCSNPRCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGS 121
+VPCS+ C+ L + RC +DQ C Y +Y + G L D + G+
Sbjct: 137 SYAVVPCSSDTCSQL---DGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLAV---GGN 190
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
VF+ + GC + GP PP +G++GL RG +S++SQL
Sbjct: 191 VFHA-VVLGCS-DSSVGGP--PPQASGLVGLARGPLSLLSQL 228
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 76/281 (27%), Positives = 120/281 (42%), Gaps = 25/281 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +G PP DT SDL WVQC +PC C ++PHK+ + C +
Sbjct: 90 YLMRFYIGTPPVERLAIADTASDLIWVQC-SPCETCFPQDTPLFEPHKSSTFANLSCDSQ 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C + N C + C Y YGDG S+ G L T+ + F + +V FGCG
Sbjct: 149 PCTS---SNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTE--SIHFGSQTVTFPKTIFGCG 203
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVLFLG 188
N +S T G++GLG G +S+VSQL + I + +C+ + + F
Sbjct: 204 SNNDFMHQISNKVT-GIVGLGAGPLSLVSQLGDQ--IGHKFSYCLLPFTSTSTIKLKFGN 260
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL-----TLIFDSGASYAY 243
D + +GV TP++ + +Y L + K ++ +I D G Y
Sbjct: 261 DTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTNGNIIIDLGTVLTY 320
Query: 244 FTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKA 284
Y V+L +R+ +G + DD P + P +A
Sbjct: 321 LEVNFYHNFVTL-LREALG--ISETKDDIPYPFDFCFPNQA 358
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/168 (33%), Positives = 81/168 (48%), Gaps = 18/168 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V++ +G P K FDTGSDLTW QC C + + P ++ + CS+P
Sbjct: 131 YIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQSTTYSNISCSSP 190
Query: 73 RCAALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C+ L N P C C Y I+YGD S+G + L S + N FG
Sbjct: 191 DCSQLESGTGNQPGCSAAR-ACIYGIQYGDQSFSVGYFAKETLTLT-STDVIEN--FLFG 246
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI 177
CG Q+N G AG++GLG+ +ISIV Q ++YG V +C+
Sbjct: 247 CG--QNNRGLFG--SAAGLIGLGQDKISIVKQTAQKYG---QVFSYCL 287
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 78/270 (28%), Positives = 111/270 (41%), Gaps = 29/270 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK--PPEKQYKPHKNIVP---CS 70
YF V+L +G PP+ DTGSDL WV+C A C CT+ P H C
Sbjct: 89 YF-VDLRLGTPPQKLLLVADTGSDLVWVKCSA-CRNCTRHTPGSAFLARHSTTFSPNHCY 146
Query: 71 NPRCAALHWPNPPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-L 127
+ C + P RC H + C YE YGDG + G + L S+G + +
Sbjct: 147 DSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGI 206
Query: 128 TFGCGYNQHNP--GPLSPPDTAGVLGLGRGRISIVSQL-REYG--LIRNVIGHCIGQNGR 182
FGC + P S GV+GLGRG IS+ SQL +G ++ H I +
Sbjct: 207 AFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDISPSPT 266
Query: 183 GVLFLG----DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-------GLKDL 231
L +G D + +TP+ N Y +G + G L +L
Sbjct: 267 SYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPINPSVWALDEL 326
Query: 232 ---TLIFDSGASYAYFTSRVYQEIVSLIMR 258
I DSG + + Y +I+++I R
Sbjct: 327 GNGGTIVDSGTTLTFLPEPAYLQILTVIKR 356
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 138/341 (40%), Gaps = 61/341 (17%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKN----I 66
P + + L +G PP + DTGSDL W QC APCT C + P Y P + +
Sbjct: 27 PTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFRQPTPLYNPSSSTTFAV 85
Query: 67 VPCSNPRCAALHWPN------PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG 120
+PC++ PP C C Y + YG G +S+ ++ F +
Sbjct: 86 LPCNSSLSVCAAALAGTGTAPPPGCA-----CTYNVTYGSGWTSV-FQGSETFTFGSTPA 139
Query: 121 SVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-- 177
VP + FGC + +G++GLGRGR+S+VSQL G+ + +C+
Sbjct: 140 GHARVPGIAFGCSTASSG---FNASSASGLVGLGRGRLSLVSQL---GVPK--FSYCLTP 191
Query: 178 --GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLY----SGKSCGLKDL 231
N L LG PS+ + T + ++ + P Y +G S G L
Sbjct: 192 YQDTNSTSTLLLG----PSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTAL 247
Query: 232 T---------------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI 276
+ LI DSG + + YQ++ + ++ L+ P D L +
Sbjct: 248 SIPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVS-LVTLPTTDGSADTGLDL 306
Query: 277 CWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
C+ P + P S T N +V+P ++Y++
Sbjct: 307 CFMLP------SSTSAPPAMPSMTLHFNGADMVLPADSYMM 341
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 118/283 (41%), Gaps = 43/283 (15%)
Query: 22 TVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAAL 77
TVG DT S+LTWVQC PC C + + P + VPC++ C AL
Sbjct: 123 TVGLGAAEATVVVDTASELTWVQCQ-PCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDAL 181
Query: 78 H---WPNPPRCKHPNDQ---CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C N+Q C Y + Y DG S G L D LR + + FGC
Sbjct: 182 RVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARD--KLRLAGQDIEG--FVFGC 237
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCI---GQNGRGVLFL 187
G + P T+G++GLGR +S+VSQ + ++G V +C+ G L L
Sbjct: 238 GTSNQG-APFG--GTSGLMGLGRSHVSLVSQTMDQFG---GVFSYCLPMRESGSSGSLVL 291
Query: 188 GDGKVP---SSGVAWTPMLQNSADLKHYILGPAELL-YSGKSCGLKDLT--------LIF 235
GD S+ + +T M+ +S L+ GP L +G + G +++ +I
Sbjct: 292 GDDSSAYRNSTPIVYTAMVSDSGPLQ----GPFYFLNLTGITVGGQEVESPWFSAGRVII 347
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
DSG VY + + + L P AP L C+
Sbjct: 348 DSGTIITTLVPSVYNAVRAEFLSQLAEYP--QAPAFSILDTCF 388
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 118/279 (42%), Gaps = 37/279 (13%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAA 76
+T+G + DTGSDLTWVQC+ PC C +KP + + C++ C +
Sbjct: 124 VTMGLGSQNMSVIVDTGSDLTWVQCE-PCRSCYNQNGPLFKPSTSPSYQPILCNSTTCQS 182
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
L + CDY + YGDG + G L + L F SV N FGCG N
Sbjct: 183 LELGACGSDPSTSATCDYVVNYGDGSYTSGEL--GIEKLGFGGISVSN--FVFGCGRN-- 236
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNG-RGVLFLGDGKV 192
N G +G++GLGR +S++SQ V +C+ Q G G L +G+
Sbjct: 237 NKGLFG--GASGLMGLGRSELSMISQTN--ATFGGVFSYCLPSTDQAGASGSLVMGN--- 289
Query: 193 PSSGV-------AWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-----LIFDSGAS 240
SGV A+T ML N YIL + G S ++ + +I DSG
Sbjct: 290 -QSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQASSFGNGGVILDSGTV 348
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR 279
+ VY+ + + + G P AP L C+
Sbjct: 349 ISRLAPSVYKALKAKFLEQFSGFP--SAPGFSILDTCFN 385
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 133/329 (40%), Gaps = 39/329 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSN 71
Y N T+G PP+ D +L W QC + C C K + P+ + PC
Sbjct: 53 YNVANFTIGTPPQAASAFIDLTGELVWTQC-SQCIHCFKQDLPVFVPNASSTFKPEPCGT 111
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C ++ P K +D C Y+ G GG ++G + TD F + G+ L FGC
Sbjct: 112 DVCKSIPTP-----KCASDVCAYDGVTGLGGHTVGIVATDTFAI----GTAAPASLGFGC 162
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
P +G +GLGR S+V+Q++ + H G+N R LFLG
Sbjct: 163 VVASDIDTMGGP---SGFIGLGRTPWSLVAQMKLTRFSYCLAPHDTGKNSR--LFLGASA 217
Query: 192 VPSSGVAWTPMLQNSAD--LKHYILGPAELLYSGKSCGL----KDLTLIFDSGASYAYFT 245
+ G AWTP ++ S + + Y E + +G + ++ L+ + +
Sbjct: 218 KLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRNTVLVQTAVVRVSLLV 277
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
VYQE +M + P P +C+ P + + L FT + +
Sbjct: 278 DSVYQEFKKAVMASVGAAPTA-TPVGAPFEVCF--PKAGVSGAPD------LVFTFQAGA 328
Query: 306 VRLVVPPEAYLV----ISVSTSIIIIAYL 330
L VPP YL +V S++ IA L
Sbjct: 329 A-LTVPPANYLFDVGNDTVCLSVMSIALL 356
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 79/277 (28%), Positives = 116/277 (41%), Gaps = 30/277 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V++ +G P + FDTGSDL+WVQC PC GC + + + P ++ VPC
Sbjct: 138 YIVSVGLGTPKRDLLVVFDTGSDLSWVQCK-PCDGCYQQHDPLFDPSQSTTYSAVPCGAQ 196
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTD---LFPLRFSNGSVFNVPLTF 129
C L + C + +C YE+ YGD + G L D L P S+ S F
Sbjct: 197 ECRRL---DSGSCS--SGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVF 251
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCI--GQNGRGVLF 186
GCG + G D G+ GLGR R+S+ SQ +YG +C+ G L
Sbjct: 252 GCG--DDDTGLFGKAD--GLFGLGRDRVSLASQAAAKYGA---GFSYCLPSSSTAEGYLS 304
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASY 241
LG P++ +T M+ S Y L + +G++ + + DSG
Sbjct: 305 LGSAAPPNA--RFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVIDSGTVI 362
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
SR Y + S + K AP L C+
Sbjct: 363 TRLPSRAYAALRSSFAGLMRRYSYKRAPALSILDTCY 399
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 114/267 (42%), Gaps = 28/267 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L +G PP F DTGSDLTW QC PC C Y P + VPCS+
Sbjct: 66 YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPVPCSSA 124
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS-NGSVFNV-PLTFG 130
C W C +P+ C Y Y DG S+G L T+ + S G +V + FG
Sbjct: 125 TCLP-TW-RSRNCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFG 182
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG + ++ G +GLGRG +S+++QL G + FLG
Sbjct: 183 CGTDNGG----DSLNSTGTVGLGRGTLSLLAQL-GVGKFSYCLTDFFNSTMDSPFFLGTL 237
Query: 191 KVPSSG---VAWTPMLQNSADLKHYI-------LGPAELLYSGKSCGLK---DLTLIFDS 237
+ G V TP+LQ+ + Y LG L + L+ + ++ DS
Sbjct: 238 AELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDS 297
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTP 264
G ++ ++E+V + + L+G P
Sbjct: 298 GTTFTILAKSGFREVVDRVAQ-LLGQP 323
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 65/228 (28%), Positives = 96/228 (42%), Gaps = 23/228 (10%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCS 70
S F V + VG PP+ F FD +D TW+QC PC C P+ + P ++ ++ C
Sbjct: 185 SNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQ-PCIKCYDQPDSIFDPSQSSSYTLLSCE 243
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L P + C Y I Y DG ++ G L+ + S+G V V L G
Sbjct: 244 TKHCNLL----PNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFE-SSGWVDRVSL--G 296
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLG 188
C + N GP D G GLGRG +S S++ + +C+ ++G L
Sbjct: 297 C--SNKNQGPFVGSD--GTFGLGRGSLSFPSRINASSM-----SYCLVESKDGYSSSTLE 347
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFD 236
P SG +LQN Y +G + G+ + + T D
Sbjct: 348 FNSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTID 395
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 75/152 (49%), Gaps = 16/152 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V L G P F DT SDL W+QC PC C + + + P + +VPC++
Sbjct: 92 YLVKLGTGTPQHFFSAAIDTASDLVWMQCQ-PCVSCYRQLDPVFNPKLSSSYAVVPCTSD 150
Query: 73 RCAALHWPNPPRCKHPND-QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
CA L + RC +D C Y +Y G + G L D + G VF+ + FGC
Sbjct: 151 TCAQL---DGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAI---GGDVFHA-VVFGC 203
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
+ GP + +G++GLGRG +S+VSQL
Sbjct: 204 S-DSSVGGPAA--QASGLVGLGRGPLSLVSQL 232
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 78/282 (27%), Positives = 125/282 (44%), Gaps = 25/282 (8%)
Query: 6 IEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---EKQYKP 62
+E P + ++++VG P K F DTGSDL WVQ + PCTGC+ +Q
Sbjct: 44 VESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDLVWVQSE-PCTGCSGGTIFDPRQSST 102
Query: 63 HKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV 122
+ + CS+ C L P C+ + C Y EYG G + G D L ++G
Sbjct: 103 FREM-DCSSQLCTEL----PGSCEPGSSACSYSYEYGS-GETEGEFARDTISLGTTSGGS 156
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----G 178
P +F G N G G++GLG+G +S+ SQL I + +C+
Sbjct: 157 QKFP-SFAVGCGMVNSG---FDGVDGLVGLGQGPVSLTSQLS--AAIDSKFSYCLVDINS 210
Query: 179 QNGRGVLFLG-DGKVPSSGVAWTPMLQNSADL-KHYILGPAELLYSGKSCGLKDLTLIFD 236
Q+ L G + +G+ T + S +Y+L + +G++ G T+I D
Sbjct: 211 QSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPGTTII-D 269
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
SG + Y S VY ++S M ++ P ++ L +C+
Sbjct: 270 SGTTLTYVPSGVYGRVLSR-MESMVTLP-RVDGSSMGLDLCY 309
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/165 (32%), Positives = 78/165 (47%), Gaps = 14/165 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ + LT+G PP+ FD DTGSDL WVQC PC C + P ++ P K+ C++
Sbjct: 39 YLMTLTLGSPPQSFDVIVDTGSDLNWVQC-LPCRVCYQQPGPKFDPSKSRSFRKAACTDN 97
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C P C + C Y+ YGD ++ G L + L G+ FGCG
Sbjct: 98 LCNVSALP-LKACAA--NVCQYQYTYGDQSNTNGDLAFETISLNNGAGTQSVPNFAFGCG 154
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
N G + AG++GLG+G +S+ SQL N +C+
Sbjct: 155 --TQNLGTFA--GAAGLVGLGQGPLSLNSQLSH--TFANKFSYCL 193
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/270 (28%), Positives = 117/270 (43%), Gaps = 41/270 (15%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSN 71
YF V+ +G PP+ F D+GSDL WVQC APC C Y P N VPC +
Sbjct: 65 YF-VDFFLGTPPQKFSLIVDSGSDLLWVQC-APCLQCYAQDTPLYAPSNSSTFNPVPCLS 122
Query: 72 PRCAALHWPNPPRCK-HPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV---PL 127
P C + C H C YE Y D S G + + +V +V +
Sbjct: 123 PECLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFA-------YESATVDDVRIDKV 175
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQ-----NG 181
FGCG + N G + GVLGLG+G +S SQ+ YG N +C+ +
Sbjct: 176 AFGCG--RDNQGSFAA--AGGVLGLGQGPLSFGSQVGYAYG---NKFAYCLVNYLDPTSV 228
Query: 182 RGVLFLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------- 233
L GD + + + +TP++ NS + Y + +++ G+S +
Sbjct: 229 SSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGN 288
Query: 234 ---IFDSGASYAYFTSRVYQEIVSLIMRDL 260
IFDSG + Y+ Y+ I++ +++
Sbjct: 289 GGSIFDSGTTVTYWLPPAYRNILAAFDKNV 318
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 91/337 (27%), Positives = 145/337 (43%), Gaps = 51/337 (15%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKN----I 66
P + + L +G PP + DTGSDL W QC APC+ C + P Y P + +
Sbjct: 81 PTAGEYLMTLAIGTPPVSYQAIADTGSDLIWTQC-APCSSQCFQQPTPLYNPSSSTTFAV 139
Query: 67 VPCSNPR---CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSV 122
+PC++ AAL PP P C Y + YG G +S+ ++ F S +
Sbjct: 140 LPCNSSLSMCAAALAGTTPP----PGCTCMYNMTYGSGWTSV-YQGSETFTFGSSTPANQ 194
Query: 123 FNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---- 177
VP + FGC + G + +G++GLGRG +S+VSQL G+ + +C+
Sbjct: 195 TGVPGIAFGC---SNASGGFNTSSASGLVGLGRGSLSLVSQL---GVPK--FSYCLTPYQ 246
Query: 178 GQNGRGVLFLGDGKV--PSSGVAWTPMLQNSAD----------LKHYILGPAELLYSGKS 225
N L LG + GV+ TP + + +D L LG L +
Sbjct: 247 DTNSTSTLLLGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTA 306
Query: 226 CGLK-DLT--LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT-LPICWRGP 281
LK D T I DSG + + YQ++ + ++ L+ P T L +C+ P
Sbjct: 307 LSLKADGTGGFIIDSGTTITLLGNTAYQQVRAAVV-SLVTLPTTDGGSAATGLDLCFELP 365
Query: 282 FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVI 318
+ P S T + +V+P ++Y+++
Sbjct: 366 ------SSTSAPPTMPSMTLHFDGADMVLPADSYMML 396
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 84/329 (25%), Positives = 134/329 (40%), Gaps = 39/329 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSN 71
Y N T+G PP+ D +L W QC + C C K + P+ + PC
Sbjct: 23 YNVANFTIGTPPQAASAFIDLTGELVWTQC-SQCIHCFKQDLPVFVPNASSTFKPEPCGT 81
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C ++ P K +D C ++ G GG ++G + TD F + G+ L FGC
Sbjct: 82 DVCKSIPTP-----KCASDVCAFDGVTGLGGHTVGIVATDTFAI----GTAAPASLGFGC 132
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
P +G +GLGR S+V+Q++ + H G+N R LFLG
Sbjct: 133 VVASDIDTMGGP---SGFIGLGRTPWSLVAQMKLTRFSYCLAPHDTGKNSR--LFLGASA 187
Query: 192 VPSSGVAWTPMLQNSAD--LKHYILGPAELLYSGKSCGL----KDLTLIFDSGASYAYFT 245
+ G AWTP ++ S + + Y E + +G + ++ L+ + +
Sbjct: 188 KLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRNTVLVQTAVVRVSLLV 247
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNS 305
VYQE +M + P P + +C+ P + + L FT + +
Sbjct: 248 DSVYQEFKKAVMASVGAAPTA-TPVGEPFEVCF--PKAGVSGAPD------LVFTFQAGA 298
Query: 306 VRLVVPPEAYLV----ISVSTSIIIIAYL 330
L VPP YL +V S++ IA L
Sbjct: 299 A-LTVPPANYLFDVGNDTVCLSVMSIALL 326
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 125/301 (41%), Gaps = 50/301 (16%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPC 69
F + ++ +G P + DTGS+LTW++C PC C + Y +++ V C
Sbjct: 97 FGEYYTSIKLGSPGQEAILIVDTGSELTWLKC-LPCKVCAPSVDTIYDAARSVSYKPVTC 155
Query: 70 SNPR-CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS--VFNVP 126
+N + C+ C QC + YGDG S G+L TD + G V
Sbjct: 156 NNSQLCSNSSQGTYAYCAR-GSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQD 214
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQ-----N 180
FGC L P +G+LGL G++++ QL + +G HC N
Sbjct: 215 FAFGCAQGDLE---LVPTGASGILGLNAGKMALPMQLGQRFGW---KFSHCFPDRSSHLN 268
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------- 233
GV+F G+ ++P V +T + +++L+ + G S +L L
Sbjct: 269 STGVVFFGNAELPHEQVQYTSVALTNSELQRKFY---HVALKGVSINSHELVLLPRGSVV 325
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDL--------------IGTPLKLAPDD-----KTL 274
I DSG+S++ F + ++ ++ +GT K++ DD +TL
Sbjct: 326 ILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTL 385
Query: 275 P 275
P
Sbjct: 386 P 386
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 49/131 (37%), Positives = 65/131 (49%), Gaps = 9/131 (6%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---- 66
F YFA+ + VG P DTGSDL W+QC +PC C + + P ++
Sbjct: 81 FESGEYFAL-VGVGTPSTKAMLVIDTGSDLVWLQC-SPCRRCYAQRGQVFDPRRSSTYRR 138
Query: 67 VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
VPCS+P+C AL +P C Y + YGDG SS G L TD L F+N + N
Sbjct: 139 VPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATD--KLAFANDTYVNN- 195
Query: 127 LTFGCGYNQHN 137
+T GCG +
Sbjct: 196 VTLGCGRDNEG 206
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 65/239 (27%), Positives = 100/239 (41%), Gaps = 30/239 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRC 74
S + + L VG PP DTGS++TW QC PC C + + P K+ RC
Sbjct: 63 SVYLMKLQVGTPPFEIQAIIDTGSEITWTQC-LPCVHCYEQNAPIFDPSKSST-FKEKRC 120
Query: 75 AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGCGY 133
C YE++Y D ++G L T+ L ++G F +P T GCG+
Sbjct: 121 DG-------------HSCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCGH 167
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDGKV 192
N P +G++GL G S+++Q+ G ++ +C GQ + F + V
Sbjct: 168 NN----SWFKPSFSGMVGLNWGPSSLITQMG--GEYPGLMSYCFSGQGTSKINFGANAIV 221
Query: 193 PSSGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
GV T M +A Y L G + G + + ++ DSG + YF
Sbjct: 222 AGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALEGNIVIDSGTTLTYF 280
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 55/169 (32%), Positives = 79/169 (46%), Gaps = 16/169 (9%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
+YF V + +G P + FDTGSDLTW QC+ C K + + P K+ + C+
Sbjct: 145 NYFVV-VGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDVIFDPSKSTSYSNITCT 203
Query: 71 NPRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
+ C L N P C C Y I+YGD S+G + + ++ V N
Sbjct: 204 SALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRERLTVTATD-VVDN--FL 260
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
FGCG Q+N G +AG++GLGR IS V Q R + +C+
Sbjct: 261 FGCG--QNNQGLFG--GSAGLIGLGRHPISFVQQTA--AKYRKIFSYCL 303
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 73/259 (28%), Positives = 111/259 (42%), Gaps = 23/259 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ V + +G P K F FDTGSD+TW QC+ C K E + P +KNI CS+
Sbjct: 119 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNI-SCSS 177
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C + + C Y+++YGDG SIG T+ L SN VF L FGC
Sbjct: 178 ALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN--VFKNFL-FGC 234
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
G Q+N G+ R ++++ SQ + + + +C+ + +G L LG
Sbjct: 235 G-QQNNGLFGGAAGLLGLG---RTKLALPSQTAK--TYKKLFSYCLPASSSSKGYLSLG- 287
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----IFDSGASYAYFT 245
G+V S V +TP+ + Y L L G+ + + + DSG +
Sbjct: 288 GQVSKS-VKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLS 346
Query: 246 SRVYQEIVSLIMRDLIGTP 264
Y E+ S + P
Sbjct: 347 PTAYSELSSAFQNLMTDYP 365
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 81/284 (28%), Positives = 126/284 (44%), Gaps = 29/284 (10%)
Query: 6 IEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP---EKQYKP 62
+E P + ++++VG P K F DTGSDL WVQ + PCTGC+ +Q
Sbjct: 44 VESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDLVWVQSE-PCTGCSGGTIFDPRQSST 102
Query: 63 HKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL-RFSNGS 121
+ + CS+ CA L P C+ + C Y EYG G + G D L S+GS
Sbjct: 103 FREM-DCSSQLCAEL----PGSCEPGSSTCSYSYEYGS-GETEGEFARDTISLGTTSDGS 156
Query: 122 VFNVPLTFGCGY-NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--- 177
GCG N G G++GLG+G +S+ SQL I + +C+
Sbjct: 157 QKFPSFAVGCGMVNSGFDG------VDGLVGLGQGPVSLTSQLS--AAIDSKFSYCLVDI 208
Query: 178 -GQNGRGVLFLG-DGKVPSSGVAWTPMLQNSADL-KHYILGPAELLYSGKSCGLKDLTLI 234
Q+ L G + +G+ T + S +Y+L + +G++ G T+I
Sbjct: 209 NSQSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPGTTII 268
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
DSG + Y S VY ++S M ++ P ++ L +C+
Sbjct: 269 -DSGTTLTYVPSGVYGRVLSR-MESMVTLP-RVDGSSMGLDLCY 309
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 72/246 (29%), Positives = 106/246 (43%), Gaps = 22/246 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
+ V++ +G P + FDTGSDL+WVQC PC C K + + P ++ + P C A
Sbjct: 188 YIVSVGLGTPRRDLLVVFDTGSDLSWVQCK-PCNNCYKQHDPLFDPSQSTTYSAVP-CGA 245
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
+ C + +C YE+ YGD + G L D L S+ + FGCG
Sbjct: 246 QECLDSGTCS--SGKCRYEVVYGDMSQTDGNLARDTLTLGPSSDQLQG--FVFGCG--DD 299
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCIGQNGR--GVLFLGDGKVP 193
+ G D G+ GLGR R+S+ SQ YG +C+ + R G L LG P
Sbjct: 300 DTGLFGRAD--GLFGLGRDRVSLASQAAARYGA---GFSYCLPSSWRAEGYLSLGSAAAP 354
Query: 194 SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYAYFTSRV 248
+T M+ S Y L + +G++ + K + DSG SR
Sbjct: 355 PH-AQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAPGTVIDSGTVITRLPSRA 413
Query: 249 YQEIVS 254
Y + S
Sbjct: 414 YSALRS 419
>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 524
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 72/247 (29%), Positives = 106/247 (42%), Gaps = 26/247 (10%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----VPC 69
+ +G P F DTGSDL WV CD AP G T E + Y P + V C
Sbjct: 111 VKLGTPGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNKKVTC 170
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNGSVFNVP-- 126
+N CA + +C C Y + Y +S G L+ D+ L + + V
Sbjct: 171 NNSLCAQRN-----QCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAY 225
Query: 127 LTFGCGYNQHNPG-PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
+TFGCG Q ++ P+ G+ GLG +IS+ S L GL+ + C G +G G +
Sbjct: 226 VTFGCGQVQSGSFLDIAAPN--GLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRI 283
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFT 245
GD SS TP N + + I + G + + T +FD+G S+ Y
Sbjct: 284 SFGDKG--SSDQEETPFNLNPSHPNYNI--TVTRVRVGTTLIDDEFTALFDTGTSFTYLV 339
Query: 246 SRVYQEI 252
+Y +
Sbjct: 340 DPMYTTV 346
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 89/201 (44%), Gaps = 33/201 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ + + +G P K DTGSD++WVQC PC+ C + + P + CS+
Sbjct: 133 YLITVRLGSPGKSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCSSA 191
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC- 131
CA L C + QC Y + YGDG S+ G +D L GS FGC
Sbjct: 192 ACAQLGQEGNG-CS--SSQCQYTVTYGDGSSTTGTYSSDTLAL----GSNAVRKFQFGCS 244
Query: 132 ----GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVL 185
G+N T G++GLG G S+VSQ G +C+ + G L
Sbjct: 245 NVESGFNDQ---------TDGLMGLGGGAQSLVSQ--TAGTFGAAFSYCLPATSSSSGFL 293
Query: 186 FLGDGKVPSSGVAWTPMLQNS 206
LG G +SG TPML++S
Sbjct: 294 TLGAG---TSGFVKTPMLRSS 311
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/128 (35%), Positives = 67/128 (52%), Gaps = 9/128 (7%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V+L VG PP+ F DTGSDL W+QC APC C + + P ++ V C +P
Sbjct: 152 YLVDLYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASLSYRNVTCGDP 210
Query: 73 RCAALHWPNPPR-CKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNV-PLT 128
RC + P PR C+ P+ D C Y YGD ++ G L + F + + G+ V +
Sbjct: 211 RCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVV 270
Query: 129 FGCGYNQH 136
FGCG++
Sbjct: 271 FGCGHSNR 278
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 79/256 (30%), Positives = 117/256 (45%), Gaps = 27/256 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V++ +G PP+ F DTGSDL W+QC APC C + + P +I V C +
Sbjct: 149 YLVDVYLGTPPRRFRMIMDTGSDLNWLQC-APCLDCFEQSGPIFDPAASISYRNVTCGDD 207
Query: 73 RCAALHWP---NPPRCKHP-NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-L 127
RC + P P C+ P +D C Y YGD ++ G L + F + + V +
Sbjct: 208 RCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGV 267
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRGV-- 184
FGCG+ N G AG+LGLGRG +S SQLR YG + +C+ ++G
Sbjct: 268 AFGCGHR--NRGLFH--GAAGLLGLGRGPLSFASQLRGVYG--GHAFSYCLVEHGSAAGS 321
Query: 185 -LFLG--DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFD 236
+ G D + + +T + Y L +L G++ + TL I D
Sbjct: 322 KIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLSAGGTIID 381
Query: 237 SGASYAYFTSRVYQEI 252
SG + +YF YQ I
Sbjct: 382 SGTTLSYFPEPAYQAI 397
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 76/298 (25%), Positives = 124/298 (41%), Gaps = 44/298 (14%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPC 69
F + ++ +G P + DTGS+LTW+QC PC C + Y ++ V C
Sbjct: 97 FGEYYTSIKLGSPGQEAILIVDTGSELTWLQC-LPCKVCAPSVDTIYDAARSASYRPVTC 155
Query: 70 SNPR-CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS--VFNVP 126
+N + C+ C QC + YGDG S G+L TD + G V
Sbjct: 156 NNSQLCSNSSQGTYAYCAR-GSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQD 214
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQ-----N 180
FGC L P +G+LGL G++++ QL + +G HC N
Sbjct: 215 FAFGCAQGDLE---LVPTGASGILGLNAGKMALPMQLGQRFGW---KFSHCFPDRSSHLN 268
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL----KDLTLIFD 236
GV+F G+ ++P V +T + +++L+ A S S L + +I D
Sbjct: 269 STGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVFLPRGSVVILD 328
Query: 237 SGASYAYFTSRVYQEIVSLIMRDL--------------IGTPLKLAPDD-----KTLP 275
SG+S++ F + ++ ++ +GT K++ DD +TLP
Sbjct: 329 SGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLP 386
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 73/259 (28%), Positives = 111/259 (42%), Gaps = 23/259 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ V + +G P K F FDTGSD+TW QC+ C K E + P +KNI CS+
Sbjct: 71 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNI-SCSS 129
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C + + C Y+++YGDG SIG T+ L SN VF L FGC
Sbjct: 130 ALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN--VFKNFL-FGC 186
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
G Q+N G+ R ++++ SQ + + + +C+ + +G L LG
Sbjct: 187 G-QQNNGLFGGAAGLLGLG---RTKLALPSQTAK--TYKKLFSYCLPASSSSKGYLSLG- 239
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----IFDSGASYAYFT 245
G+V S V +TP+ + Y L L G+ + + + DSG +
Sbjct: 240 GQVSKS-VKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAFSAGTVIDSGTVITRLS 298
Query: 246 SRVYQEIVSLIMRDLIGTP 264
Y E+ S + P
Sbjct: 299 PTAYSELSSAFQNLMTDYP 317
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/128 (35%), Positives = 67/128 (52%), Gaps = 9/128 (7%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V+L VG PP+ F DTGSDL W+QC APC C + + P ++ V C +P
Sbjct: 152 YLVDLYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPATSLSYRNVTCGDP 210
Query: 73 RCAALHWPNPPR-CKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNV-PLT 128
RC + P PR C+ P+ D C Y YGD ++ G L + F + + G+ V +
Sbjct: 211 RCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVV 270
Query: 129 FGCGYNQH 136
FGCG++
Sbjct: 271 FGCGHSNR 278
>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 406
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 82/300 (27%), Positives = 121/300 (40%), Gaps = 55/300 (18%)
Query: 51 GCTKPPEKQ--------YKPH----KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY 98
GCT P+K Y P+ N VPC + C + CK + C Y I Y
Sbjct: 32 GCTACPKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQ-DMSCPYSITY 90
Query: 99 GDGGSSIGALVTDLFPLRFSNGSVFNVP----LTFGCGYNQHNPGPLSP-PDTA--GVLG 151
GDG ++ G+ V D +G++ P + FGCG Q G LS D A G++G
Sbjct: 91 GDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQ--SGSLSSNSDEALDGIIG 148
Query: 152 LGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKH 211
G+ S++SQL G ++ + HC+ + G +F G+V TP++ A H
Sbjct: 149 FGQANSSVLSQLAASGKVKRIFSHCLDSHHGGGIF-SIGQVMEPKFNTTPLVPRMA---H 204
Query: 212 Y-------------ILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMR 258
Y IL P L SG G I DSG + AY +Y +++ ++
Sbjct: 205 YNVILKDMDVDGEPILLPLYLFDSGSGRG-----TIIDSGTTLAYLPLSIYNQLLPKVLG 259
Query: 259 DLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVI 318
G L + D T F ++ E F + F + L V P YL +
Sbjct: 260 RQPGLKLMIVEDQFTC-------FHYSDKLDEGFPVVKFHF----EGLSLTVHPHDYLFL 308
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 76/316 (24%), Positives = 129/316 (40%), Gaps = 43/316 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA-PCTGCTKPP--EKQYKPHKNIVPCSNPR 73
++LT+G PP+ DTGS+L+W+ C P T P Y P PC++
Sbjct: 59 LTISLTIGSPPQNVTMVLDTGSELSWLHCKKLPNLNSTFNPLLSSSYTP----TPCNSSV 114
Query: 74 CA--ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN--GSVFNVPLTF 129
C P C N C + Y D S+ G L + F L + G++F +
Sbjct: 115 CMTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTLFGCMDSA 174
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLG 188
G + + T G++G+ RG +S+V+Q ++ +CI G++ GVL LG
Sbjct: 175 GYTSDINEDA-----KTTGLMGMNRGSLSLVTQ-----MVLPKFSYCISGEDAFGVLLLG 224
Query: 189 DGKVPSSGVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGLKDLT----L 233
DG S + +TP++ + ++ I +LL KS + D T
Sbjct: 225 DGPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQT 284
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA-PD---DKTLPICWRGP--FKALGQ 287
+ DSG + + VY + + G ++ P+ + + +C+ P A+
Sbjct: 285 MVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASLAAVPA 344
Query: 288 VTEYFKPLALSFTNRR 303
VT F + + R
Sbjct: 345 VTLVFSGAEMRVSGER 360
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 118/273 (43%), Gaps = 32/273 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ +N+++G PP DTGSDL W QC PC C + E + P K+ I+ C
Sbjct: 95 YLMNISLGTPPVSMHGIADTGSDLLWRQC-KPCDSCYEQIEPIFDPAKSKTYQILSCEGK 153
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C+ L C N C Y YGDG + G L D + + G +VP + FGC
Sbjct: 154 SCSNLGGQG--GCSDDN-TCIYSYSYGDGSHTSGDLAVDTLTIGSTTGRPVSVPKVVFGC 210
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG------VL 185
G HN G +G++GLG G +S++SQLR LI +C+ G +
Sbjct: 211 G---HNNGGTFELHGSGLVGLGGGPLSMISQLRP--LIGGRFSYCLVPLGNDPSVSSKMH 265
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKS------CGLKDLTL 233
F G V +G TP+ D +Y+ +G +L Y G S + +
Sbjct: 266 FGSRGIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPLADADEGNI 325
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLK 266
I DSG + Y + S ++ + G P++
Sbjct: 326 IIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVR 358
>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
Length = 455
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 72/249 (28%), Positives = 106/249 (42%), Gaps = 26/249 (10%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----V 67
+ +G P F DTGSDL WV CD AP G T E + Y P + V
Sbjct: 109 TTVKLGTPGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNKKV 168
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNGSVFNVP 126
C+N CA + +C C Y + Y +S G L+ D+ L + + V
Sbjct: 169 TCNNSLCAQRN-----QCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVE 223
Query: 127 --LTFGCGYNQHNPG-PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+TFGCG Q ++ P+ G+ GLG +IS+ S L GL+ + C G +G G
Sbjct: 224 AYVTFGCGQVQSGSFLDIAAPN--GLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVG 281
Query: 184 VLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAY 243
+ GD SS TP N + + I + G + + T +FD+G S+ Y
Sbjct: 282 RISFGDKG--SSDQEETPFNLNPSHPNYNI--TVTRVRVGTTLIDDEFTALFDTGTSFTY 337
Query: 244 FTSRVYQEI 252
+Y +
Sbjct: 338 LVDPMYTTV 346
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 63/184 (34%), Positives = 85/184 (46%), Gaps = 22/184 (11%)
Query: 34 FDTGSDLTWVQC-DAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWPNPPRCKHP 88
DT SD+ WVQC P + C + Y P K+ CS+P C L P C
Sbjct: 186 LDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQL-GPYANGCSSS 244
Query: 89 ND---QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCGYNQHNPGPLSPP 144
++ QC Y + Y DG ++ G LV D L ++ VP FGC + G S
Sbjct: 245 SNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTS----QVPKFEFGCSHAAR--GSFSRS 298
Query: 145 DTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCI--GQNGRGVLFLGDGKVPSSGVAWTP 201
TAG++ LGRG S+VSQ +YG V +C + +G LG + SS A TP
Sbjct: 299 KTAGIMALGRGVQSLVSQTSTKYG---QVFSYCFPPTASHKGFFVLGVPRRSSSRYAVTP 355
Query: 202 MLQN 205
ML+
Sbjct: 356 MLKT 359
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 50/135 (37%), Positives = 67/135 (49%), Gaps = 15/135 (11%)
Query: 34 FDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWP--NPPRCKH 87
DTGSDLTWVQC+ PC C +KP + +PC++ C +L N C+
Sbjct: 160 IDTGSDLTWVQCE-PCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACES 218
Query: 88 PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTA 147
C Y + YGDG + G L + L F SV N FGCG N N G +
Sbjct: 219 NPSNCSYAVNYGDGSYTNGELGAE--HLSFGGISVSN--FVFGCGKN--NKGLFG--GVS 270
Query: 148 GVLGLGRGRISIVSQ 162
G++GLGR +S++SQ
Sbjct: 271 GLMGLGRSNLSLISQ 285
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 66/239 (27%), Positives = 103/239 (43%), Gaps = 23/239 (9%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
++ F++ L +G K DTGS+ VQC + P Q VPC +
Sbjct: 97 YALFSMQLGIGSLQKNLSAIIDTGSEAVLVQCGSRSRPVFDPAASQSYRQ---VPCISQL 153
Query: 74 CAALHWP----NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP--- 126
C A+ + C + + C Y + YGD +S G D+ L +N S V
Sbjct: 154 CLAVQQQTSNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRD 213
Query: 127 LTFGCGYNQHNP-GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN----- 180
+ FGC H+P G L + G++G RG +S+ SQL++ L + +C
Sbjct: 214 VAFGCA---HSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDR-LGGSKFSYCFPSQPWQPR 269
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQN---SADLKHYILGPAELLYSGKSCGLKDLTLIFD 236
GV+FLGD + S V +TP+L N A + Y +G + GK+ + + D
Sbjct: 270 ATGVIFLGDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLD 328
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 66/129 (51%), Gaps = 12/129 (9%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
YF +++ VG PPK DTGSDL+W+QCD PC C + Y P+++ + C
Sbjct: 169 EYF-IDMFVGTPPKHVWLILDTGSDLSWIQCD-PCYDCFEQNGPHYNPNESSSYRNISCY 226
Query: 71 NPRCAALHWPNP-PRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS--NGSV---FN 124
+PRC + P+P CK N C Y +Y DG ++ G + F + + NG
Sbjct: 227 DPRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHV 286
Query: 125 VPLTFGCGY 133
V + FGCG+
Sbjct: 287 VDVMFGCGH 295
>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 531
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 71/254 (27%), Positives = 106/254 (41%), Gaps = 33/254 (12%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAALHWP 80
+ +G P F D GSDL WV CD C C Y + +P ++ P
Sbjct: 107 IDIGTPNVSFLVALDAGSDLLWVPCD--CMQCAPLSASYYDRLGRDLNEYSPSLSSTSKP 164
Query: 81 NP---------PRCKHPNDQCDYEIEY-GDGGSSIGALVTDL-----FPLRFSNGSVFNV 125
CK D C Y Y + SS G L+ D F S SV+
Sbjct: 165 LSCNDQLCELGSDCKSSKDPCPYLASYYSENTSSSGLLIEDRLHLAPFSEHASRSSVW-A 223
Query: 126 PLTFGCGYNQHNP-GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV 184
+ GCG Q + PD G++GLG G +S+ S L + GL+RN C N G
Sbjct: 224 SVIIGCGRKQSGAFSDGAAPD--GLMGLGPGDLSVPSLLAKAGLVRNTFSICFDDNHSGT 281
Query: 185 LFLGD-GKVPSSGVAWTPM----LQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
+ GD G V ++ P+ + +++ Y++G + L K+ G + L DSG
Sbjct: 282 ILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGSSSL----KTAGFQALV---DSGT 334
Query: 240 SYAYFTSRVYQEIV 253
S+ + +Y++IV
Sbjct: 335 SFTFLPYEIYEKIV 348
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 73/259 (28%), Positives = 111/259 (42%), Gaps = 23/259 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ V + +G P K F FDTGSD+TW QC+ C K E + P +KNI CS+
Sbjct: 131 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNI-SCSS 189
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C + + C Y+++YGDG SIG T+ L SN VF L FGC
Sbjct: 190 ALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN--VFKNFL-FGC 246
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
G Q+N G+ R ++++ SQ + + + +C+ + +G L LG
Sbjct: 247 G-QQNNGLFGGAAGLLGLG---RTKLALPSQTAK--TYKKLFSYCLPASSSSKGYLSLG- 299
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----IFDSGASYAYFT 245
G+V S V +TP+ + Y L L G+ + + + DSG +
Sbjct: 300 GQVSKS-VKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLS 358
Query: 246 SRVYQEIVSLIMRDLIGTP 264
Y E+ S + P
Sbjct: 359 PTAYSELSSAFQNLMTDYP 377
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 75/158 (47%), Gaps = 11/158 (6%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP-----EKQYKPHKNIV 67
+ + + ++++VG PP+ DTGSDL W QC APC C + + +
Sbjct: 86 VTNEYLMHVSVGTPPRPVALTLDTGSDLVWTQC-APCLDCFEQGAAPVLDPAASSTHAAL 144
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN--GSVFNV 125
PC P C AL + + + C Y YGD ++G L TD F + G +
Sbjct: 145 PCDAPLCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAAR 204
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
+TFGCG+ N G +T G+ G GRGR S+ SQL
Sbjct: 205 RVTFGCGHI--NKGIFQANET-GIAGFGRGRWSLPSQL 239
>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 71/254 (27%), Positives = 106/254 (41%), Gaps = 33/254 (12%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAALHWP 80
+ +G P F D GSDL WV CD C C Y + +P ++ P
Sbjct: 97 IDIGTPNVSFLVALDAGSDLLWVPCD--CMQCAPLSASYYDRLGRDLNEYSPSLSSTSKP 154
Query: 81 NP---------PRCKHPNDQCDYEIEY-GDGGSSIGALVTDL-----FPLRFSNGSVFNV 125
CK D C Y Y + SS G L+ D F S SV+
Sbjct: 155 LSCNDQLCELGSDCKSSKDPCPYLASYYSENTSSSGLLIEDRLHLAPFSEHASRSSVW-A 213
Query: 126 PLTFGCGYNQHNP-GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV 184
+ GCG Q + PD G++GLG G +S+ S L + GL+RN C N G
Sbjct: 214 SVIIGCGRKQSGAFSDGAAPD--GLMGLGPGDLSVPSLLAKAGLVRNTFSICFDDNHSGT 271
Query: 185 LFLGD-GKVPSSGVAWTPM----LQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
+ GD G V ++ P+ + +++ Y++G + L K+ G + L DSG
Sbjct: 272 ILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGSSSL----KTAGFQALV---DSGT 324
Query: 240 SYAYFTSRVYQEIV 253
S+ + +Y++IV
Sbjct: 325 SFTFLPYEIYEKIV 338
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 127/284 (44%), Gaps = 45/284 (15%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH-----KNIVPC 69
YF +++ VG PPK F DTGSDL W+QC PC C + Y P KNI C
Sbjct: 194 EYF-MDVFVGTPPKHFSLILDTGSDLNWIQC-VPCYACFEQNGPYYDPKDSSSFKNIT-C 250
Query: 70 SNPRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS------- 121
+PRC + P+PP+ CK C Y YGD ++ G + F + +
Sbjct: 251 HDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKI 310
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC-IGQN 180
V NV FGCG+ N G AG+LGLGRG +S +QL+ L + +C + +N
Sbjct: 311 VENV--MFGCGH--WNRGLFH--GAAGLLGLGRGPLSFATQLQ--SLYGHSFSYCLVDRN 362
Query: 181 GRGV----LFLGDGK--VPSSGVAWTPML---QNSADLKHYILGPAELLYSGKSCGLKDL 231
L G+ K + + +T + +N D +Y+L + ++ G+ + +
Sbjct: 363 SNSSVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKS-IMVGGEVLKIPEE 421
Query: 232 T----------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPL 265
T I DSG + YF Y+ I MR + G PL
Sbjct: 422 TWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPL 465
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 75/281 (26%), Positives = 124/281 (44%), Gaps = 29/281 (10%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-YKPHKN----IVP 68
F Y + + VG PP DTGSDL WV C + G + P ++ ++
Sbjct: 98 FEYL-MYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRSTTYSLLS 156
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV---FNV 125
C + C AL + C + +C Y+ YGDG +IG L T+ F + G V
Sbjct: 157 CQSAACQALSQAS---CDA-DSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVRV 212
Query: 126 P-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQ 179
P ++FGC + G + G++GLG G +S+VSQL I +C+
Sbjct: 213 PRVSFGC-----STGSAGSFRSDGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAA 267
Query: 180 NGRGVLFLGDGKVPSS-GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT-LIFDS 237
N L G V S G A TP++ + D +Y + + +G+ + + +I DS
Sbjct: 268 NSSSTLSFGARAVVSDPGAASTPLVPSEVD-SYYTVALESVAVAGQDVASANSSRIIVDS 326
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
G + + + + +V+ + R I P + P ++ L +C+
Sbjct: 327 GTTLTFLDPALLRPLVAELERR-IRLP-RAQPPEQLLQLCY 365
>gi|242094226|ref|XP_002437603.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
gi|241915826|gb|EER88970.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
Length = 541
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 77/260 (29%), Positives = 102/260 (39%), Gaps = 34/260 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQYKPH-------K 64
Y+AV + VG P F DTGSDL WV CD A T P +P+
Sbjct: 111 YYAV-VEVGTPNATFLVALDTGSDLFWVPCDCKQCASIANVTGQPATALRPYSPRESSTS 169
Query: 65 NIVPCSNPRCAALHWPNPPRCKHP-NDQCDYEIEYGDGGSSI-GALVTDLFPLRFSN--- 119
V C N C P C N C YE++Y +S G LV D+ L
Sbjct: 170 KQVTCDNALC-----DRPNGCSAATNGSCPYEVQYLSANTSTSGVLVQDVLHLTRERPGA 224
Query: 120 ----GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIG 174
G P+ FGCG Q L G++GLGR +S+ S L GL+ +
Sbjct: 225 AAEAGEALQAPVVFGCGQVQTGTF-LDGAAFDGLMGLGRENVSVPSVLASSGLVASDSFS 283
Query: 175 HCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLI 234
C G +G G + GD SSG TP + Y + + KS + +
Sbjct: 284 MCFGDDGVGRINFGDSG--SSGQGETPF---TGRRTLYNVSFTAVNVETKSVA-AEFAAV 337
Query: 235 FDSGASYAYFTSRVYQEIVS 254
DSG S+ Y Y E+ +
Sbjct: 338 IDSGTSFTYLADPEYTELAT 357
>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 522
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 72/247 (29%), Positives = 105/247 (42%), Gaps = 26/247 (10%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPH----KNIVPC 69
+ +G P F DTGSDL WV CD AP G T E + Y P V C
Sbjct: 109 VKLGTPGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKISTTNKKVTC 168
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNGSVFNVP-- 126
+N CA + +C C Y + Y +S G L+ D+ L + + V
Sbjct: 169 NNSLCAQRN-----QCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAY 223
Query: 127 LTFGCGYNQHNPG-PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
+TFGCG Q ++ P+ G+ GLG +IS+ S L GL+ + C G +G G +
Sbjct: 224 VTFGCGQVQSGSFLDIAAPN--GLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRI 281
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFT 245
GD SS TP N + + I + G + + T +FD+G S+ Y
Sbjct: 282 SFGDKG--SSDQEETPFNLNPSHPNYNI--TVTRVRVGTTLIDDEFTALFDTGTSFTYLV 337
Query: 246 SRVYQEI 252
+Y +
Sbjct: 338 DPMYTTV 344
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 92/368 (25%), Positives = 151/368 (41%), Gaps = 71/368 (19%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA--PCTGCTKP-----PEKQYKPHKN- 65
+ ++V+L G PP+ F FDTGS L W C A C+ C+ P ++ P +
Sbjct: 129 YGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSS 188
Query: 66 ---IVPCSNPRCAALHWPN-PPRCKHPN-------DQC-DYEIEYGDGGSSIGALVTDLF 113
+V C NP+CA + PN RC++ N D C Y ++YG G ++ G L+++
Sbjct: 189 SVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGATA-GILLSETL 247
Query: 114 PLRFSNGSVFNVPLTFGCG-YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNV 172
L F V GC + H P AG+ G GRG S+ SQ+R L R
Sbjct: 248 DLENKRVPDFLV----GCSVMSVHQP--------AGIAGFGRGPESLPSQMR---LKR-- 290
Query: 173 IGHCIGQNG------RGVLFLGDGKVPSSGVAWT---------PMLQNSADLKHYILGPA 217
HC+ G L L G + P + N+A ++Y L
Sbjct: 291 FSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLR 350
Query: 218 ELLYSGKSCG------LKDLT----LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTP-LK 266
+L GK + D T I DSG+++ + +++ I + + L+ P K
Sbjct: 351 RILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAK 410
Query: 267 LAPDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTSIII 326
L C+ P + + + F + L F + +L + E YL + ++
Sbjct: 411 DVEAQSGLRPCFNIPKE---EESAEFPDVVLKF---KGGGKLSLAAENYLAMVTDEGVVC 464
Query: 327 IAYLTGKS 334
+ +T ++
Sbjct: 465 LTMMTDEA 472
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 128/296 (43%), Gaps = 45/296 (15%)
Query: 12 PIFSY---FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI-- 66
PI++Y + + +++G PP DTGSDLTW C PC C K + P K+
Sbjct: 17 PIYAYLGHYLMEVSIGTPPFKIYGIADTGSDLTWTSC-VPCNKCYKQRNPIFDPQKSTSY 75
Query: 67 --VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+ C + C H + C P C+Y Y + G L + L + G +
Sbjct: 76 RNISCDSKLC---HKLDTGVCS-PQKHCNYTYAYASAAITQGVLAQETITLSSTKGE--S 129
Query: 125 VPL---TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI--- 177
VPL FGCG+N N G + + G++GLG G +S +SQ+ +G R C+
Sbjct: 130 VPLKGIVFGCGHN--NTGGFNDRE-MGIIGLGGGPVSFISQIGSSFGGKR--FSQCLVPF 184
Query: 178 --GQNGRGVLFLGDG-KVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSC-G 227
+ + LG G +V GV TP++ +++ +G L ++G S
Sbjct: 185 HTDVSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQS 244
Query: 228 LKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTP----LKLAPDDKTLPICWR 279
++ + DSG +++Y +V+ + ++ P L L P +C+R
Sbjct: 245 VEKGNVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQ-----LCYR 295
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 72/266 (27%), Positives = 112/266 (42%), Gaps = 18/266 (6%)
Query: 7 EFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN- 65
E P + + +G PP DTGS L W+QC +PC C ++P K+
Sbjct: 79 ESLLIPDKGEYLMRFYIGSPPVERLAMVDTGSSLIWLQC-SPCHNCFPQETPLFEPLKSS 137
Query: 66 ---IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS- 121
C + C L P+ C QC Y I YGD S+G L T+ + G+
Sbjct: 138 TYKYATCDSQPCTLLQ-PSQRDCGKLG-QCIYGIMYGDKSFSVGILGTETLSFGSTGGAQ 195
Query: 122 VFNVPLT-FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--- 177
+ P T FGCG + +N + G+ GLG G +S+VSQL I + +C+
Sbjct: 196 TVSFPNTIFGCGVD-NNFTIYTSNKVMGIAGLGAGPLSLVSQLG--AQIGHKFSYCLLPY 252
Query: 178 -GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK--SCGLKDLTLI 234
+ + F + + ++GV TP++ + +Y L + K S G D ++
Sbjct: 253 DSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTGQTDGNIV 312
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDL 260
DSG Y + Y V+ + L
Sbjct: 313 IDSGTPLTYLENTFYNNFVASLQETL 338
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 73/260 (28%), Positives = 115/260 (44%), Gaps = 40/260 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
V+LTVG PP+ DTGS+L+W++C+ T+ + + P+++ VPCS+
Sbjct: 85 LTVSLTVGTPPQNVSMVLDTGSELSWLRCNK-----TQTFQTTFDPNRSSSYSPVPCSSL 139
Query: 73 RCA--ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-F 129
C +P P C N C + Y D SS G L +D F + S ++P T F
Sbjct: 140 TCTDRTRDFPIPASCD-SNQLCHAILSYADASSSEGNLASDTFYIGNS-----DMPGTIF 193
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG-RGVLFLG 188
GC + + G++G+ RG +S VSQ+ +CI + GVL LG
Sbjct: 194 GCMDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMD-----FPKFSYCISDSDFSGVLLLG 248
Query: 189 DGKVP-SSGVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGLKDLT---- 232
D + +TP++Q S L ++ I ++LL KS + D T
Sbjct: 249 DANFSWLMPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQ 308
Query: 233 LIFDSGASYAYFTSRVYQEI 252
+ DSG + + VY +
Sbjct: 309 TMVDSGTQFTFLLGPVYSAL 328
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 105/242 (43%), Gaps = 29/242 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSNP 72
+ ++L++G PP DTGSDL W QC PC C K + + P + C
Sbjct: 95 YLMSLSLGTPPFKIMGIADTGSDLIWTQC-KPCERCYKQVDPLFDPKSSKTYRDFSCDAR 153
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
+C+ L + C + C Y+ YGD ++G + +D L + GS + P T GC
Sbjct: 154 QCSLL---DQSTCS--GNICQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVIGC 208
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRGVL 185
G+ N G S +G++GLG G +S++SQ+ + +C+ N +
Sbjct: 209 GH--ENDGTFSDKG-SGIVGLGAGPLSLISQMGSS--VGGKFSYCLVPLSSRAGNSSKLN 263
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDLTLIFDSG 238
F + V GV TP+L + Y L G + + S G + +I DSG
Sbjct: 264 FGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGNIIIDSG 323
Query: 239 AS 240
+
Sbjct: 324 TT 325
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 70/230 (30%), Positives = 95/230 (41%), Gaps = 27/230 (11%)
Query: 3 VSWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP 62
S I + ++ + G P DTGSDLTWVQC PC+ C + + P
Sbjct: 82 TSGIRLQTLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCK-PCSACYAQRDPLFDP 140
Query: 63 HKN----IVPCSNPRCA---ALHWPNPPRCKHP---NDQCDYEIEYGDGGSSIGALVTDL 112
+ V C+ CA P C +++C Y + YGDG S G L TD
Sbjct: 141 AGSATYAAVRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDT 200
Query: 113 FPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL--REYGLIR 170
L G FGCG + N G TAG++GLGR +S+VSQ R G+
Sbjct: 201 VAL----GGASLGGFVFGCGLS--NRGLFG--GTAGLMGLGRTELSLVSQTASRYGGVFS 252
Query: 171 NVIGHCIGQNGRGVLFLGDGKVPSSG------VAWTPMLQNSADLKHYIL 214
+ + G L LG G +S VA+T M+ + A Y L
Sbjct: 253 YCLPAATSGDASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFL 302
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 80/281 (28%), Positives = 129/281 (45%), Gaps = 42/281 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +++ +G PPK + DTGSDL W+QC PC C + Y P ++ + C +P
Sbjct: 90 YFMDVFIGTPPKHYSLILDTGSDLNWIQC-VPCHDCFEQNGPYYDPKESSSFRNIGCHDP 148
Query: 73 RCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-------GSVFN 124
RC + P+PP CK N C Y YGD ++ G T+ F + ++ V N
Sbjct: 149 RCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRVEN 208
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQ 179
V FGCG+ N G +G+LGLGRG +S SQL+ L + +C+
Sbjct: 209 V--MFGCGH--WNRGLFH--GASGLLGLGRGPLSFSSQLQ--SLYGHSFSYCLVDRNSDT 260
Query: 180 NGRGVLFLGDGK--VPSSGVAWTPML---QNSADLKHYILGPAELLYSGKSCGLKDLT-- 232
N L G+ K + + +T ++ +N D +Y+ + ++ G+ + + T
Sbjct: 261 NVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKS-IMVGGEVLNIPESTWN 319
Query: 233 --------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPL 265
I DSG + +YFT YQ I ++ + G P+
Sbjct: 320 MTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPI 360
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 71/266 (26%), Positives = 115/266 (43%), Gaps = 27/266 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + VG P + F DTGS+LTWV+ C G PP ++P + VPCS+
Sbjct: 91 YF-VKVLVGTPAQEFTLVADTGSELTWVK----CAGGASPPGLVFRPEASKSWAPVPCSS 145
Query: 72 PRCAALHWP-NPPRCKHPNDQCDYEIEYGDGGS-SIGALVTDLFPLRFSNGSVFNVP-LT 128
C L P + C C Y+ Y +G + ++G + TD + G V + +
Sbjct: 146 DTC-KLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQLQDVV 204
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREY---GLIRNVIGHCIGQNGRGVL 185
GC + H+ D GVL LG +IS S+ ++ H +N G L
Sbjct: 205 LGCS-STHDGQSFKSVD--GVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPRNATGYL 261
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-------KDLTLIFDSG 238
G G+VP + T + + A + Y + + +G++ + K +I DSG
Sbjct: 262 AFGPGQVPRTPATQTKLFLDPA-MPFYGVKVDAVHVAGQALDIPAEVWDPKSGGVILDSG 320
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTP 264
+ + Y+ +V+ + + L G P
Sbjct: 321 TTLTVLATPAYKAVVAALTKLLAGVP 346
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 83/328 (25%), Positives = 129/328 (39%), Gaps = 54/328 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-------YKPHKNI--- 66
+ ++ +G P + DTGS WV C C P E Y P ++
Sbjct: 83 YYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQC--PHESDILRKLTFYDPRSSVSSK 139
Query: 67 -VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR--FSNGSV- 122
V C + C + P C + +C Y Y DGG ++G L TDL + NG
Sbjct: 140 EVKCDDTICTS-----RPPC-NMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQ 193
Query: 123 -FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
+ +TFGCG Q S G++G G + +SQL G + + HC+ N
Sbjct: 194 PTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTN 253
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNS-----ADLKHYILG------PAELLYSGKSCGLK 229
G G+ +G+ P V TP+++N+ +LK + PA + + K+ G
Sbjct: 254 GGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKG-- 309
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
DSG++ Y +Y E++ + PD + F LG V
Sbjct: 310 ---TFIDSGSTLVYLPEIIYSELILAVFAK--------HPDITMGAMYNFQCFHFLGSVD 358
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+ F + F N + L V P YL+
Sbjct: 359 DKFPKITFHF---ENDLTLDVYPYDYLL 383
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 72/261 (27%), Positives = 109/261 (41%), Gaps = 33/261 (12%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALH------WPNPPR 84
DT S+LTWVQC APC C + P + ++PC++ C AL
Sbjct: 143 DTASELTWVQC-APCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACGG 201
Query: 85 CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPP 144
+ P+ C Y + Y DG S G L D L G V + FGCG + N GP
Sbjct: 202 GEQPS--CSYTLSYRDGSYSQGVLAHDKLSL---AGEVID-GFVFGCGTS--NQGPFG-- 251
Query: 145 DTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCI---GQNGRGVLFLGDGKV---PSSGV 197
T+G++GLGR ++S++SQ + ++G V +C+ G L LGD S+ +
Sbjct: 252 GTSGLMGLGRSQLSLISQTMDQFG---GVFSYCLPLKESESSGSLVLGDDTSVYRNSTPI 308
Query: 198 AWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIM 257
+T M+ + Y + + G+ +I DSG VY + + +
Sbjct: 309 VYTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVIVDSGTIITSLVPSVYNAVKAEFL 368
Query: 258 RDLIGTPLKLAPDDKTLPICW 278
P AP L C+
Sbjct: 369 SQFAEYP--QAPGFSILDTCF 387
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 88/326 (26%), Positives = 130/326 (39%), Gaps = 44/326 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC--TKPPEKQYKPHKNI----VPCS 70
+ +N+++G PP F DTGS+L W QC APCT C P +P ++ +PC+
Sbjct: 91 YNMNISLGTPPLDFPVIVDTGSNLIWAQC-APCTRCFPRPTPAPVLQPARSSTFSRLPCN 149
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L + PR + C Y YG G + G L T+ L +G+ V FG
Sbjct: 150 GSFCQYLPTSSRPRTCNATAACAYNYTYGS-GYTAGYLATET--LTVGDGTFPKV--AFG 204
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
C +++G++GLGRG +S+VSQL G + + G + G
Sbjct: 205 CSTEN------GVDNSSGIVGLGRGPLSLVSQL-AVGRFSYCLRSDMADGGASPILFGSL 257
Query: 191 KVPSSG--VAWTPMLQNS---------ADLKHYILGPAELLYSGKSCGLKDLTL----IF 235
+ G V TP+L+N +L + EL +G + G L I
Sbjct: 258 AKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIV 317
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIG----TPLKLAPDDKTLPICWRGPFKALGQVTEY 291
DSG + Y Y + + TP AP D L +C++ P G
Sbjct: 318 DSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYD--LDLCYK-PSAGGGGKAVR 374
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLV 317
LAL F + VP + Y
Sbjct: 375 VPRLALRFA---GGAKYNVPVQNYFA 397
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 78/267 (29%), Positives = 119/267 (44%), Gaps = 40/267 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPH-----KNIVPCSN 71
+ +++ VG PPK F DTGSDL W+QC PC C Y P KNI C++
Sbjct: 160 YFMDVLVGTPPKHFSLILDTGSDLNWLQC-LPCYDCFHQNGMFYDPKTSASFKNIT-CND 217
Query: 72 PRCAALHWPNPP-RCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN----GSVFNV- 125
PRC+ + P+PP +C+ N C Y YGD ++ G + F + + S + V
Sbjct: 218 PRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVG 277
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQN 180
+ FGCG+ N G S LG G S SQL+ L + +C+ N
Sbjct: 278 NMMFGCGH--WNRGLFSGASGLLGLGRGPLSFS--SQLQ--SLYGHSFSYCLVDRNSNTN 331
Query: 181 GRGVLFLGDGK--VPSSGVAWTPML---QNSADLKHYILGPAELLYSGKSCGLKDLT--- 232
L G+ K + + + +T + +NS + +YI + +L GK+ + + T
Sbjct: 332 VSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKS-ILVGGKALDIPEETWNI 390
Query: 233 -------LIFDSGASYAYFTSRVYQEI 252
I DSG + +YF Y+ I
Sbjct: 391 SSDGDGGTIIDSGTTLSYFAEPAYEII 417
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 92/339 (27%), Positives = 135/339 (39%), Gaps = 48/339 (14%)
Query: 3 VSWIEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC--TGCTKPPEKQY 60
V W E S + +G PP+ + DTGS+L W QC + C GC Y
Sbjct: 64 VHWAE-------SQYIAEYLIGDPPQQAEAIIDTGSNLIWTQC-STCQPAGCFSQNLSFY 115
Query: 61 KPHKNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR 116
P ++ V C++ CA + RC N C YG G G L T+ F +
Sbjct: 116 DPSRSRTARPVACNDTACA---LGSETRCARDNKACAVLTAYG-AGVIGGVLGTEAFTFQ 171
Query: 117 FSNGSVFNVPLTFGC-GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGH 175
+ NV L FGC + PG L +G++GLGRG +S+VSQL + + +
Sbjct: 172 PQSE---NVSLAFGCIAATRLTPGSLD--GASGIIGLGRGNLSLVSQLGDNKFSYCLTPY 226
Query: 176 CIGQNGRGVLFLGDGKVPSSGVA---WTPMLQN-SAD---------LKHYILGPAELLYS 222
LF+G SSG A P L+N D L +G A+L
Sbjct: 227 FSQSTNTSRLFVGASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVP 286
Query: 223 GKSCGLKDLT------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI 276
+ L+ + + DSG+ + YQ + +++ L + + + L +
Sbjct: 287 EAAFDLRQVATGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDL 346
Query: 277 CWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAY 315
C A G V + PL L F + V VPPE Y
Sbjct: 347 C---AAVAHGDVGKLVPPLVLHFGSGGGDV--AVPPENY 380
>gi|356496606|ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 508
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 78/263 (29%), Positives = 108/263 (41%), Gaps = 41/263 (15%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP--EKQYKPHKNI----- 66
F +FA N++VG PP F DTGSDL W+ C+ CT C K NI
Sbjct: 99 FLHFA-NVSVGTPPLSFLVALDTGSDLFWLPCN--CTKCVHGIGLSNGEKIAFNIYDLKG 155
Query: 67 ------VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPL--RF 117
V C++ C +C + C YE+ Y +G S+ G LV D+ L
Sbjct: 156 SSTSQPVLCNSSLCELQR-----QCPSSDTICPYEVNYLSNGTSTTGFLVEDVLHLITDD 210
Query: 118 SNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
+ +TFGCG Q L G+ GLG S+ S L + GL N C
Sbjct: 211 DKTKDADTRITFGCGQVQ-TGAFLDGAAPNGLFGLGMSNESVPSILAKEGLTSNSFSMCF 269
Query: 178 GQNGRGVLFLGD------GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL 231
G +G G + GD GK P + A P Y + +++ K L +
Sbjct: 270 GSDGLGRITFGDNSSLVQGKTPFNLRALHPT---------YNITVTQIIVGEKVDDL-EF 319
Query: 232 TLIFDSGASYAYFTSRVYQEIVS 254
IFDSG S+ Y Y++I +
Sbjct: 320 HAIFDSGTSFTYLNDPAYKQITN 342
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 72/261 (27%), Positives = 109/261 (41%), Gaps = 33/261 (12%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALH------WPNPPR 84
DT S+LTWVQC APC C + P + ++PC++ C AL
Sbjct: 142 DTASELTWVQC-APCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACGG 200
Query: 85 CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPP 144
+ P+ C Y + Y DG S G L D L G V + FGCG + N GP
Sbjct: 201 GEQPS--CSYTLSYRDGSYSQGVLAHDKLSL---AGEVID-GFVFGCGTS--NQGPFG-- 250
Query: 145 DTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCI---GQNGRGVLFLGDGKV---PSSGV 197
T+G++GLGR ++S++SQ + ++G V +C+ G L LGD S+ +
Sbjct: 251 GTSGLMGLGRSQLSLISQTMDQFG---GVFSYCLPLKESESSGSLVLGDDTSVYRNSTPI 307
Query: 198 AWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIM 257
+T M+ + Y + + G+ +I DSG VY + + +
Sbjct: 308 VYTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVIVDSGTIITSLVPSVYNAVKAEFL 367
Query: 258 RDLIGTPLKLAPDDKTLPICW 278
P AP L C+
Sbjct: 368 SQFAEYP--QAPGFSILDTCF 386
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 83/328 (25%), Positives = 129/328 (39%), Gaps = 54/328 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-------YKPHKNI--- 66
+ ++ +G P + DTGS WV C C P E Y P ++
Sbjct: 83 YYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQC--PHESDILRKLTFYDPRSSVSSK 139
Query: 67 -VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR--FSNGSV- 122
V C + C + P C + +C Y Y DGG ++G L TDL + NG
Sbjct: 140 EVKCDDTICTS-----RPPC-NMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQ 193
Query: 123 -FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
+ +TFGCG Q S G++G G + +SQL G + + HC+ N
Sbjct: 194 PTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTN 253
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNS-----ADLKHYILG------PAELLYSGKSCGLK 229
G G+ +G+ P V TP+++N+ +LK + PA + + K+ G
Sbjct: 254 GGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKG-- 309
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
DSG++ Y +Y E++ + PD + F LG V
Sbjct: 310 ---TFIDSGSTLVYLPEIIYSELILAVFAK--------HPDITMGAMYNFQCFHFLGSVD 358
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+ F + F N + L V P YL+
Sbjct: 359 DKFPKITFHF---ENDLTLDVYPYDYLL 383
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 83/281 (29%), Positives = 116/281 (41%), Gaps = 41/281 (14%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAA 76
+TV K DTGSDLTWVQC PC C Y P + V C++ C
Sbjct: 140 VTVELGGKNMSLIVDTGSDLTWVQCQ-PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQD 198
Query: 77 L--HWPNPPRCKHPN----DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
L N C N C+Y + YGDG + G L ++ L G L FG
Sbjct: 199 LVAATGNSGPCGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVL----GDTKLENLVFG 254
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFL 187
CG N N G +G++GLGR +S+VSQ + V +C + G L
Sbjct: 255 CGRN--NKGLFGGA--SGLMGLGRSSVSLVSQTLK--TFNGVFSYCLPSLEDGASGTLSF 308
Query: 188 GDG---KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCG---LKDLT----LIFDS 237
G+ S+ V +TP++QN YIL +G S G LK L+ ++ DS
Sbjct: 309 GNDFSVYKNSTSVFYTPLVQNPQLRSFYILN-----LTGASIGGVELKTLSFGRGILIDS 363
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
G +Y+ + + ++ G P AP L C+
Sbjct: 364 GTVITRLPPSIYKAVKTEFLKQFSGFP--SAPGYSILDTCF 402
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 83/302 (27%), Positives = 128/302 (42%), Gaps = 40/302 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT--GCTKPPEKQYKPHK----NIVPCS 70
+ V +++G P + DTGSD++WVQC PC+ C ++ + P K + VPC
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCK-PCSAPACNSQRDQLFDPAKSSTYSAVPCG 201
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C+ L QC Y + YGDG ++ G +D L + G+ L FG
Sbjct: 202 ADACSELRIYEA---GCSGSQCGYVVSYGDGSNTTGVYGSDTLAL--APGNTVGTFL-FG 255
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLG 188
CG+ Q G + D G+L LGR +S+ SQ G V +C+ Q+ G L LG
Sbjct: 256 CGHAQA--GMFAGID--GLLALGRQSMSLKSQ--AAGAYGGVFSYCLPSKQSAAGYLTLG 309
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------IFDSGA 239
G +SG A T +L A Y+ ++ +G S G + + + + D+G
Sbjct: 310 -GPTSASGFATTGLLTAWAAPTFYM-----VMLTGISVGGQQVAVPASAFAGGTVVDTGT 363
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
Y + S + AP + L C+ F G VT +AL+F
Sbjct: 364 VITRLPPTAYAALRSAFRGAIAPYGYPSAPANGILDTCYD--FSRYGVVT--LPTVALTF 419
Query: 300 TN 301
+
Sbjct: 420 SG 421
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 83/328 (25%), Positives = 129/328 (39%), Gaps = 54/328 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-------YKPHKNI--- 66
+ ++ +G P + DTGS WV C C P E Y P ++
Sbjct: 83 YYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQC--PHESDILRKLTFYDPRSSVSSK 139
Query: 67 -VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR--FSNGSV- 122
V C + C + P C + +C Y Y DGG ++G L TDL + NG
Sbjct: 140 EVKCDDTICTS-----RPPC-NMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQ 193
Query: 123 -FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
+ +TFGCG Q S G++G G + +SQL G + + HC+ N
Sbjct: 194 PTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTN 253
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNS-----ADLKHYILG------PAELLYSGKSCGLK 229
G G+ +G+ P V TP+++N+ +LK + PA + + K+ G
Sbjct: 254 GGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKG-- 309
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
DSG++ Y +Y E++ + PD + F LG V
Sbjct: 310 ---TFIDSGSTLVYLPEIIYSELILAVFAK--------HPDITMGAMYNFQCFHFLGSVD 358
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+ F + F N + L V P YL+
Sbjct: 359 DKFPKITFHF---ENDLTLDVYPYDYLL 383
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 85/341 (24%), Positives = 137/341 (40%), Gaps = 61/341 (17%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKN----I 66
P + + L +G PP + DTGSDL W QC APCT C + P Y P + +
Sbjct: 85 PTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFRQPTPLYNPSSSTTFAV 143
Query: 67 VPCSNPRCAALHWPN------PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG 120
+PC++ PP C C Y + YG G +S+ ++ F +
Sbjct: 144 LPCNSSLSVCAAALAGTGTAPPPGCA-----CTYNVTYGSGWTSVFQ-GSETFTFGSTPA 197
Query: 121 SVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-- 177
VP + FGC + +G++GLGRGR+S+VSQL G+ + +C+
Sbjct: 198 GQSRVPGIAFGCSTASSG---FNASSASGLVGLGRGRLSLVSQL---GVPK--FSYCLTP 249
Query: 178 --GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLY----SGKSCGLKDL 231
N L LG PS+ + T + ++ + P Y +G S G L
Sbjct: 250 YQDTNSTSTLLLG----PSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTAL 305
Query: 232 T---------------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI 276
+ LI DSG + + YQ++ + ++ L+ P L +
Sbjct: 306 SIPPDAFLLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVV-SLVTLPTTDGSAATGLDL 364
Query: 277 CWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
C+ P + P S T N +V+P ++Y++
Sbjct: 365 CFMLP------SSTSAPPAMPSMTLHFNGADMVLPADSYMM 399
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 77/284 (27%), Positives = 112/284 (39%), Gaps = 37/284 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSN 71
YF V + +G PP D+GSD+ WVQC PC C + + P + V C +
Sbjct: 125 YF-VRVGIGSPPTEQYLVVDSGSDVIWVQCK-PCLECYAQADPLFDPASSATFSAVSCGS 182
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L C + C+YE+ YGDG + G L + L G + GC
Sbjct: 183 AICRTLRTSG---CGD-SGGCEYEVSYGDGSYTKGTLALETLTL----GGTAVEGVAIGC 234
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG---------R 182
G+ N G AG+LGLG G +S+V QL +C+ G
Sbjct: 235 GH--RNRGLFV--GAAGLLGLGWGPMSLVGQLGG--AAGGAFSYCLASRGGSGSGAADAA 288
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKD--LTLIFDSGAS 240
G L LG + G W P+++N Y +G + + + L+D L D G
Sbjct: 289 GSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGG 348
Query: 241 YAYFT----SRVYQEIVSLIMRDLIGT--PLKLAPDDKTLPICW 278
T +R+ QE + + +G L AP L C+
Sbjct: 349 VVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCY 392
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 75/156 (48%), Gaps = 15/156 (9%)
Query: 14 FSYFAVNLTVGKP-PKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VP 68
++ + ++ +G P P+ + DTGSD+ W QC PC C P ++ + V
Sbjct: 89 YTEYLIHFGIGTPRPQQVALEVDTGSDVVWTQC-RPCFDCFTQPLPRFDTSASDTVHGVL 147
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-L 127
C++P C AL P C C Y++ YGD +IG L D F G VP L
Sbjct: 148 CTDPICRAL---RPHACFLGG--CTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDL 202
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
FGCG Q+N G +T G+ G GRG +S+ QL
Sbjct: 203 VFGCG--QYNTGNFHSNET-GIAGFGRGPLSLPRQL 235
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 113/270 (41%), Gaps = 39/270 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA--------PCTGCTKPPE--KQYKPHKNI 66
+ V++ G PP+ DTGSDL W+QC P C++ P ++
Sbjct: 54 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSV 113
Query: 67 VPCSNPRCAALHWP--NPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
VPCS +C + P + P C C Y +Y DG S+ G L D + SNG+
Sbjct: 114 VPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTATI--SNGTSG 171
Query: 124 NVP---LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--- 177
+ FGCG ++ G S T GV+GLG+G++S +Q L +C+
Sbjct: 172 GAAVRGVAFGCG-TRNQGGSFS--GTGGVIGLGQGQLSFPAQ--SGSLFAQTFSYCLLDL 226
Query: 178 --GQNGRGVLFLGDGK-VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-------- 226
G+ GR FL G+ + A+TP++ N Y +G + +
Sbjct: 227 EGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWA 286
Query: 227 --GLKDLTLIFDSGASYAYFTSRVYQEIVS 254
L + + DSG++ Y Y +VS
Sbjct: 287 IDVLGNGGTVIDSGSTLTYLRLGAYLHLVS 316
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 83/302 (27%), Positives = 128/302 (42%), Gaps = 40/302 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT--GCTKPPEKQYKPHK----NIVPCS 70
+ V +++G P + DTGSD++WVQC PC+ C ++ + P K + VPC
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCK-PCSAPACNSQRDQLFDPAKSSTYSAVPCG 201
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C+ L QC Y + YGDG ++ G +D L + G+ L FG
Sbjct: 202 ADACSELRIYEA---GCSGSQCGYVVSYGDGSNTTGVYGSDTLAL--APGNTVGTFL-FG 255
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLG 188
CG+ Q G + D G+L LGR +S+ SQ G V +C+ Q+ G L LG
Sbjct: 256 CGHAQA--GMFAGID--GLLALGRQSMSLKSQ--AAGAYGGVFSYCLPSKQSAAGYLTLG 309
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------IFDSGA 239
G +SG A T +L A Y+ ++ +G S G + + + + D+G
Sbjct: 310 -GPSSASGFATTGLLTAWAAPTFYM-----VMLTGISVGGQQVAVPASAFAGGTVVDTGT 363
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALSF 299
Y + S + AP + L C+ F G VT +AL+F
Sbjct: 364 VITRLPPTAYAALRSAFRGAIAPCGYPSAPANGILDTCYD--FSRYGVVT--LPTVALTF 419
Query: 300 TN 301
+
Sbjct: 420 SG 421
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 70/262 (26%), Positives = 115/262 (43%), Gaps = 26/262 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +N+++G PP DTGSDL W QC APC C + + P + V CS+
Sbjct: 90 YLMNVSIGTPPFPIMAIADTGSDLLWTQC-APCDDCYTQVDPLFDPKTSSTYKDVSCSSS 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
+C AL N C ++ C Y + YGD + G + D L S+ + + GC
Sbjct: 149 QCTALE--NQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGC 206
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRGVL 185
G+N N G + +G++GLG G +S++ QL + I +C+ +
Sbjct: 207 GHN--NAGTFNKK-GSGIVGLGGGPVSLIKQLGDS--IDGKFSYCLVPLTSKKDQTSKIN 261
Query: 186 FLGDGKVPSSGVAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDLTLIFDSG 238
F + V SGV TP++ ++ LK +G ++ YSG + +I DSG
Sbjct: 262 FGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSG 321
Query: 239 ASYAYFTSRVYQEIVSLIMRDL 260
+ + Y E+ + +
Sbjct: 322 TTLTLLPTEFYSELEDAVASSI 343
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 63/242 (26%), Positives = 92/242 (38%), Gaps = 49/242 (20%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCS 70
F + Y + L +G PP + DTGS+L W QC PC C + P K+
Sbjct: 60 FDTYEYL-MKLQIGTPPFEVEAVLDTGSELIWTQC-LPCLHCYDQKAPIFDPSKS----- 112
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-F 129
RC P+ C Y++ Y D + G L T+ + ++G F +P T
Sbjct: 113 -------STFKETRCNTPDHSCPYKLVYDDKSYTQGTLATETVTIHSTSGVPFVMPETII 165
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD 189
GC N N G P ++G++GL RG +S++SQ+
Sbjct: 166 GCSRN--NSGSGFRPSSSGIVGLSRGSLSLISQM-------------------------G 198
Query: 190 GKVPSSGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDLTLIFDSGASYA 242
G P GV T M +A Y L G + G + ++ DSG
Sbjct: 199 GAYPGDGVVSTTMFAKTAKRGQYYLNLDAVSVGDTRIETVGTPFHALNGNIVIDSGTPLT 258
Query: 243 YF 244
YF
Sbjct: 259 YF 260
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 64/239 (26%), Positives = 98/239 (41%), Gaps = 30/239 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRC 74
S + + L VG PP + DTGS++TW QC PC C K + P K+
Sbjct: 378 SVYLMKLQVGTPPFEIEAVIDTGSEITWTQC-LPCVHCYKQNAPIFDPSKS--------- 427
Query: 75 AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGCGY 133
RC + C YE++Y D + G L TD + ++G F + T GCG
Sbjct: 428 ---STFKEKRCH--DHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIGCGR 482
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG-DGKV 192
N P G +GL G +S+++Q+ G ++ +C NG + G + V
Sbjct: 483 NN----SWFRPSFEGFVGLNWGPLSLITQMG--GEYPGLMSYCFAGNGTSKINFGTNAIV 536
Query: 193 PSSGVAWTPMLQNSA-------DLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
GV T M +A +L +G + G + ++ DSG + YF
Sbjct: 537 GGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSGTTLTYF 595
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 70/262 (26%), Positives = 115/262 (43%), Gaps = 26/262 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +N+++G PP DTGSDL W QC APC C + + P + V CS+
Sbjct: 90 YLMNVSIGTPPFPIMAIADTGSDLLWTQC-APCDDCYTQVDPLFDPKTSSTYKDVSCSSS 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
+C AL N C ++ C Y + YGD + G + D L S+ + + GC
Sbjct: 149 QCTALE--NQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGC 206
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRGVL 185
G+N N G + +G++GLG G +S++ QL + I +C+ +
Sbjct: 207 GHN--NAGTFNKK-GSGIVGLGGGPVSLIKQLGDS--IDGKFSYCLVPLTSKKDQTSKIN 261
Query: 186 FLGDGKVPSSGVAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDLTLIFDSG 238
F + V SGV TP++ ++ LK +G ++ YSG + +I DSG
Sbjct: 262 FGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSG 321
Query: 239 ASYAYFTSRVYQEIVSLIMRDL 260
+ + Y E+ + +
Sbjct: 322 TTLTLLPTEFYSELEDAVASSI 343
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 79/302 (26%), Positives = 133/302 (44%), Gaps = 38/302 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L++G PP DTGSDL W+QC PCT C K + P + + +
Sbjct: 59 YLMELSIGTPPVKTYAQVDTGSDLIWLQC-IPCTNCYKQLNPMFDPQSSSTYSNIAYGSE 117
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C+ L+ + C + C+Y Y D + G L + L + G + + FGC
Sbjct: 118 SCSKLYSTS---CSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALKGVIFGC 174
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCI-----GQNGRGVL 185
G+N N G + + G++GLGRG +S+VSQ+ +G + C+ + +
Sbjct: 175 GHN--NNGVFNDKE-MGIIGLGRGPLSLVSQIGSSFG--GKMFSQCLVPFHTNPSITSPM 229
Query: 186 FLGDG-KVPSSGVAWTPMLQNSADLKHY---ILGPA----ELLYSGKSCGLKDLT---LI 234
G G +V +GV TP++ + Y +LG + L ++ S L+ +T ++
Sbjct: 230 SFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVEDINLPFNDGS-SLEPITKGNMV 288
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL--PICWRGPFKALG-QVTEY 291
DSG Y +V + + P+ P D TL +C+R P G +T +
Sbjct: 289 IDSGTPTTLLPEDFYHRLVEEVRNKVALDPI---PIDPTLGYQLCYRTPTNLKGTTLTAH 345
Query: 292 FK 293
F+
Sbjct: 346 FE 347
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 73/275 (26%), Positives = 112/275 (40%), Gaps = 29/275 (10%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP----EKQYKPHKNIVPCSNPRCAA 76
+T+G + DTGSDLTWVQCD PC C N + C++ C
Sbjct: 135 VTIGLGNQNMTVIIDTGSDLTWVQCD-PCMSCYSQQGPVFNPSNSSSYNSLLCNSSTCQN 193
Query: 77 LHWP--NPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
L + N C+ N C++ + YGDG + G L + L F SV N FGCG
Sbjct: 194 LQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVE--HLSFGGISVSN--FVFGCGR 249
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDG 190
N N G +G++GLGR +S++SQ V +C+ G L +G+
Sbjct: 250 N--NKGLFGG--VSGIMGLGRSNLSMISQTNT--TFGGVFSYCLPTTDSGASGSLVIGNE 303
Query: 191 KVPSSG---VAWTPMLQNSADLKHYILGPAELLYSG---KSCGLKDLTLIFDSGASYAYF 244
+A+T M+ N Y+L + G + + ++ DSG
Sbjct: 304 SSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQDTSFGNGGILIDSGTVITRL 363
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR 279
+Y + + ++ G P +AP L C+
Sbjct: 364 APSLYNALKAEFLKQFSGYP--IAPALSILDTCFN 396
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 84/328 (25%), Positives = 130/328 (39%), Gaps = 54/328 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-------YKPHKNI--- 66
+ ++ +G P + DTGS WV C C P E Y P ++
Sbjct: 59 YYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQC--PHESDILRKLTFYDPRSSVSSK 115
Query: 67 -VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR--FSNGSV- 122
V C + C + PP C + +C Y Y DGG ++G L TDL + NG
Sbjct: 116 EVKCDDTICTS----RPP-C-NMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQ 169
Query: 123 -FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
+ +TFGCG Q S G++G G + +SQL G + + HC+ N
Sbjct: 170 PTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTN 229
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNS-----ADLKHYILG------PAELLYSGKSCGLK 229
G G+ +G+ P V TP+++N+ +LK + PA + + K+ G
Sbjct: 230 GGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKG-- 285
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
DSG++ Y +Y E++ + PD + F LG V
Sbjct: 286 ---TFIDSGSTLVYLPEIIYSELILAVFAK--------HPDITMGAMYNFQCFHFLGSVD 334
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+ F + F N + L V P YL+
Sbjct: 335 DKFPKITFHF---ENDLTLDVYPYDYLL 359
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 113/270 (41%), Gaps = 39/270 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA--------PCTGCTKPPE--KQYKPHKNI 66
+ V++ G PP+ DTGSDL W+QC P C++ P ++
Sbjct: 53 YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSV 112
Query: 67 VPCSNPRCAALHWP--NPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
VPCS +C + P + P C C Y +Y DG S+ G L D + SNG+
Sbjct: 113 VPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTATI--SNGTSG 170
Query: 124 NVP---LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--- 177
+ FGCG ++ G S T GV+GLG+G++S +Q L +C+
Sbjct: 171 GAAVRGVAFGCG-TRNQGGSFS--GTGGVIGLGQGQLSFPAQ--SGSLFAQTFSYCLLDL 225
Query: 178 --GQNGRGVLFLGDGK-VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC-------- 226
G+ GR FL G+ + A+TP++ N Y +G + +
Sbjct: 226 EGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWA 285
Query: 227 --GLKDLTLIFDSGASYAYFTSRVYQEIVS 254
L + + DSG++ Y Y +VS
Sbjct: 286 IDVLGNGGTVIDSGSTLTYLRLGAYLHLVS 315
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 78/261 (29%), Positives = 117/261 (44%), Gaps = 35/261 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ ++++ G PP+ DTGSDL WVQC PC C + ++ P K+ + C +
Sbjct: 90 YLIDISYGNPPQKSTAIVDTGSDLNWVQC-LPCKSCYETLSAKFDPSKSASYKTLGCGSN 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C L + + C Y+ YGDG S+ GAL TD + G + NV FGCG
Sbjct: 149 FCQDLPFQSCAA------SCQYDYMYGDGSSTSGALSTD--DVTIGTGKIPNV--AFGCG 198
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRGVLFLGD 189
N G + G++GLG+G +S+VSQL G +C +G L++GD
Sbjct: 199 --NSNLGTFAG--AGGLVGLGKGPLSLVSQLG--GTATKKFSYCLVPLGSTKTSPLYIGD 252
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------LIFDSGA 239
+ + GVA+TPML N+ Y + GK+ T LI DSG
Sbjct: 253 STL-AGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGT 311
Query: 240 SYAYFTSRVYQEIVSLIMRDL 260
+ Y + +V+ + L
Sbjct: 312 TLTYLDVDAFNPMVAALKAAL 332
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 71/263 (26%), Positives = 112/263 (42%), Gaps = 27/263 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ ++L++G PP DTGSDL W QC PC C K + P + + C
Sbjct: 93 YLMSLSLGTPPFEILAIADTGSDLIWTQC-TPCDKCYKQIAPLFDPKSSKTYRDLSCDTR 151
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
+C L + + C Y YGD + G L D L +NG P T GC
Sbjct: 152 QCQNLGESSSCSSEQ---LCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIGC 208
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGV 184
G + N G D +G++GLG G +S++SQ+ + +C+ N +
Sbjct: 209 G--RRNNGTFDKKD-SGIIGLGGGPMSLISQMGSS--VGGKFSYCLVPFSSESAGNSSKL 263
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIFDSG 238
F + V SGV TP++ + D +Y+ +G ++ + G S G + +I DSG
Sbjct: 264 HFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIEFGGSSFGGSEGNIIIDSG 323
Query: 239 ASYAYFTSRVYQEIVSLIMRDLI 261
S F + E + + +I
Sbjct: 324 TSLTLFPVNFFTEFATAVENAVI 346
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 63/125 (50%), Gaps = 13/125 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF L VG PPK DTGSD+ W+QC PCT C ++ + P K+ +PC +
Sbjct: 130 YF-TRLGVGTPPKYLYMVLDTGSDVVWLQCK-PCTKCYSQTDQIFDPSKSKSFAGIPCYS 187
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L + P C N+ C Y++ YGDG + G T+ L F +V V + GC
Sbjct: 188 PLCRRL---DSPGCSLKNNLCQYQVSYGDGSFTFGDFSTET--LTFRRAAVPRVAI--GC 240
Query: 132 GYNQH 136
G++
Sbjct: 241 GHDNE 245
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 90/308 (29%), Positives = 131/308 (42%), Gaps = 35/308 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +G PP DTGSDL WVQC +PC C ++P K+ C +
Sbjct: 90 YLMRFYIGTPPVERLATADTGSDLIWVQC-SPCASCFPQSTPLFQPLKSSTFMPTTCRSQ 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGS-SIGALVTDLFPLRF-SNGSVFNVPLT-- 128
C L P C + +C Y +YGD S S G L T+ LRF S G V V
Sbjct: 149 PCTLL-LPEQKGCGK-SGECIYTYKYGDQYSFSEGLLSTET--LRFDSQGGVQTVAFPNS 204
Query: 129 -FGCG-YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC---IGQNGRG 183
FGCG YN P G++GLG G +S+VSQ+ + I + +C +G
Sbjct: 205 FFGCGLYNNITVFP--SYKLTGIMGLGAGPLSLVSQIGDQ--IGHKFSYCLLPLGSTSTS 260
Query: 184 VLFLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSC--GLKDLTLIFDSGAS 240
L G+ + + GV TPM+ +Y L + + K+ G D +I DSG
Sbjct: 261 KLKFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVPTGSTDGNVIIDSGTL 320
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT-LPICWRGPFKALGQVTEYFKPLALSF 299
Y Y + + L ++L D + LP C+ P++ F +A F
Sbjct: 321 LTYLGESFYYNFAASLQESL---AVELVQDVLSPLPFCF--PYRD----NFVFPEIAFQF 371
Query: 300 TNRRNSVR 307
T R S++
Sbjct: 372 TGARVSLK 379
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 90/326 (27%), Positives = 133/326 (40%), Gaps = 44/326 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC--TKPPEKQYKPHKNI----VPCS 70
+ +N+++G PP F DTGS+L W QC APCT C P +P ++ +PC+
Sbjct: 91 YNMNISLGTPPLDFPVIVDTGSNLIWAQC-APCTRCFPRPTPAPVLQPARSSTFSRLPCN 149
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L + PR + C Y YG G ++ G L T+ L +G+ V FG
Sbjct: 150 GSFCQYLPTSSRPRTCNATAACAYNYTYGSGYTA-GYLATET--LTVGDGTFPKV--AFG 204
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG-VLFLGD 189
C +++G++GLGRG +S+VSQL G + + G +LF
Sbjct: 205 CSTEN------GVDNSSGIVGLGRGPLSLVSQL-AVGRFSYCLRSDMADGGASPILFGSL 257
Query: 190 GKVPS-SGVAWTPMLQNS---------ADLKHYILGPAELLYSGKSCGLKDLTL----IF 235
K+ S V TP+L+N +L + EL +G + G L I
Sbjct: 258 AKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIV 317
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIG----TPLKLAPDDKTLPICWRGPFKALGQVTEY 291
DSG + Y Y + + TP AP D L +C++ P G
Sbjct: 318 DSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYD--LDLCYK-PSAGGGGKAVR 374
Query: 292 FKPLALSFTNRRNSVRLVVPPEAYLV 317
LAL F + VP + Y
Sbjct: 375 VPRLALRFA---GGAKYNVPVQNYFA 397
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 83/328 (25%), Positives = 129/328 (39%), Gaps = 54/328 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-------YKPHKNI--- 66
+ ++ +G P + DTGS WV C C P E Y P ++
Sbjct: 59 YYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQC--PHESDILRKLTFYDPRSSVSSK 115
Query: 67 -VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLR--FSNGSV- 122
V C + C + P C + +C Y Y DGG ++G L TDL + NG
Sbjct: 116 EVKCDDTICTS-----RPPC-NMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQ 169
Query: 123 -FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQN 180
+ +TFGCG Q S G++G G + +SQL G + + HC+ N
Sbjct: 170 PTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTN 229
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNS-----ADLKHYILG------PAELLYSGKSCGLK 229
G G+ +G+ P V TP+++N+ +LK + PA + + K+ G
Sbjct: 230 GGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKG-- 285
Query: 230 DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVT 289
DSG++ Y +Y E++ + PD + F LG V
Sbjct: 286 ---TFIDSGSTLVYLPEIIYSELILAVFAK--------HPDITMGAMYNFQCFHFLGSVD 334
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+ F + F N + L V P YL+
Sbjct: 335 DKFPKITFHF---ENDLTLDVYPYDYLL 359
>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 77/272 (28%), Positives = 113/272 (41%), Gaps = 26/272 (9%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----VPC 69
+ VG P F DTGSDL WV CD AP G + ++ YKP ++ +PC
Sbjct: 147 VDVGTPNTSFMVALDTGSDLFWVPCDCIECAPLAGYRETLDRDLGIYKPAESTTSRHLPC 206
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPL--RFSNGSVFNVP 126
S+ C P C P C Y +Y + +S G L+ D+ L R S+ V
Sbjct: 207 SHELC-----PPGSGCSSPKQPCPYSTDYLQENTTSSGLLIEDILHLDSRESHAPV-KAS 260
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
+ GCG Q L G+LGLG IS+ S L GL+RN C ++ G +F
Sbjct: 261 VVIGCGRKQSG-SYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKEDS-GRIF 318
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTS 246
GD V S TP + + Y + + K + DSG S+
Sbjct: 319 FGDQGV--SIQQSTPFVPLYGKYQTYAVNVDKSCVGHKCFEATSFEALVDSGTSFTALPL 376
Query: 247 RVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
VY+ V++ + P ++ +D + C+
Sbjct: 377 NVYK-AVAVEFDKQVHAP-RITQEDASFEYCY 406
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 91/322 (28%), Positives = 134/322 (41%), Gaps = 34/322 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC-TGCTKPPEKQYKPHK----NIVPCSN 71
+ + L +G PP + DTGSDL W QC APC T C + P Y P +++PC N
Sbjct: 114 YLMTLAIGTPPLPYAAVADTGSDLIWTQC-APCGTQCFEQPAPLYNPASSTTFSVLPC-N 171
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
+ P C Y YG G ++ G ++ F S VP + FG
Sbjct: 172 SSLSMCAGALAGAAPPPGCACMYYQTYGTGWTA-GVQGSETFTFGSSAADQARVPGVAFG 230
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG-D 189
C N +AG++GLGRG +S+VSQL G + N L LG
Sbjct: 231 C----SNASSSDWNGSAGLVGLGRGSLSLVSQLGA-GRFSYCLTPFQDTNSTSTLLLGPS 285
Query: 190 GKVPSSGVAWTPMLQNSA----------DLKHYILGPAELLYSGKSCGLK-DLT--LIFD 236
+ +GV TP + + A +L LG L S + LK D T LI D
Sbjct: 286 AALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIID 345
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGT-PLKLAPDDKTLPICWRGPFKALGQVTEYFKPL 295
SG + + YQ++ + + L+ T P D L +C+ AL T +
Sbjct: 346 SGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCF-----ALPAPTSAPPAV 400
Query: 296 ALSFTNRRNSVRLVVPPEAYLV 317
S T + +V+P ++Y++
Sbjct: 401 LPSMTLHFDGADMVLPADSYMI 422
>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 527
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 73/261 (27%), Positives = 107/261 (40%), Gaps = 32/261 (12%)
Query: 13 IFSYFA-VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE-------------- 57
+F Y N++VG P + DTGSDL W+ C+ CT C +
Sbjct: 108 LFGYLHFANVSVGTPASSYLVALDTGSDLFWLPCN--CTKCVHGIQLSTGQKIAFNIYDN 165
Query: 58 KQYKPHKNIVPCSNPRCAALHWPNPPRCKHPND-QCDYEIEY-GDGGSSIGALVTDLFPL 115
K+ KN V C++ C +C + C Y++EY + S+ G LV D+ L
Sbjct: 166 KESSTSKN-VACNSSLC-----EQKTQCSSSSGGTCPYQVEYLSENTSTTGFLVEDVLHL 219
Query: 116 RFSNGSVF---NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNV 172
N N +TFGCG Q L G+ GLG +S+ S L + GL N
Sbjct: 220 ITDNDDQTQHANPLITFGCGQVQ-TGAFLDGAAPNGLFGLGMSDVSVPSILAKQGLTSNS 278
Query: 173 IGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT 232
C +G G + GD S TP Y + +++ G S L +
Sbjct: 279 FSMCFAADGLGRITFGDNN-SSLDQGKTP-FNIRPSHSTYNITVTQIIVGGNSADL-EFN 335
Query: 233 LIFDSGASYAYFTSRVYQEIV 253
IFD+G S+ Y + Y++I
Sbjct: 336 AIFDTGTSFTYLNNPAYKQIT 356
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 75/154 (48%), Gaps = 16/154 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +++G PP +DTGSDL W QC PC C K + P K+ V C +
Sbjct: 91 YLMKISIGTPPFDVYGIYDTGSDLMWTQC-LPCLSCYKQKNPMFDPSKSTSFKEVSCESQ 149
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG---SVFNVPLTF 129
+C L + C P CD+ YGDG + G + T+ L ++G S+ N+ F
Sbjct: 150 QCRLLDTVS---CSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPTSILNI--VF 204
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
GCG+N N G + + G+ G G +S+ SQ+
Sbjct: 205 GCGHN--NSGTFN-ENEMGLFGTGGRPLSLTSQI 235
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 72/283 (25%), Positives = 124/283 (43%), Gaps = 38/283 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ ++ +VG P DTGSD+ W+QC PC C + + K+ +PC +
Sbjct: 89 YLISYSVGTPSLQVFGILDTGSDIIWLQCQ-PCKKCYEQTTPIFDSSKSQTYKTLPCPSN 147
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
C ++ KH C Y I Y DG S+G L + L +NGS P T GC
Sbjct: 148 TCQSVQGTFCSSRKH----CLYSIHYVDGSQSLGDLSVETLTLGSTNGSPVQFPGTVIGC 203
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-------EYGLIRNVIGHCIGQNGRGV 184
G ++N + + +G++GLGRG +S+++QL Y L+ +
Sbjct: 204 G--RYNAIGIEEKN-SGIVGLGRGPMSLITQLSPSTGGKFSYCLVPGL------STASSK 254
Query: 185 LFLGDGKVPSS-GVAWTPMLQNSA------DLKHYILGPAELLYSGKSCGLKDLTLIFDS 237
L G+ V S G TP+ + L+ + +G + + G K +I DS
Sbjct: 255 LNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPGSGGKG-NIIIDS 313
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTLPICWR 279
G + + VY ++ + + + +I L+ D ++ L +C++
Sbjct: 314 GTTLTALPNGVYSKLEAAVAKTVI---LQRVRDPNQVLGLCYK 353
>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
Length = 454
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 95/213 (44%), Gaps = 29/213 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK------QYKPHKNIVPCS 70
V + VG PP+ DTGS+L+W++C+ T PP+ CS
Sbjct: 62 LTVPVAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCS 121
Query: 71 NPRCAALHW-----PNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+P C W P PP C P++ C + Y D S+ G L D F L G
Sbjct: 122 SPEC---QWRGRDLPVPPFCAGPPSNSCRVSLSYADASSADGILAADTFLL----GGAPP 174
Query: 125 VPLTFGCGYNQHNPGPLSPPDT---AGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-QN 180
V FGC + + + D+ G+LG+ RG +S V+Q +R +CI +
Sbjct: 175 VRALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQT---ATLR--FAYCIAPGD 229
Query: 181 GRGVLFL-GDGKVPSSGVAWTPMLQNSADLKHY 212
G G+L L GDG + + +TP++Q S L ++
Sbjct: 230 GPGLLVLGGDGAALAPQLNYTPLIQISRPLPYF 262
>gi|297820902|ref|XP_002878334.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
gi|297324172|gb|EFH54593.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
Length = 362
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 63/212 (29%), Positives = 88/212 (41%), Gaps = 32/212 (15%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----- 67
I Y+ L +G PP++F D+GS +T+V C + C C K P I+
Sbjct: 88 INGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQVMLSSPKDQILCLVSC 146
Query: 68 ---------------PCSNPRCAALHWP----NPPRCKHPNDQCDYEIEYGDGGSSIGAL 108
P P ++ + P C +QC YE EY + SS G L
Sbjct: 147 KVQIFKISYGLFDEDPKFQPELSSTYQPVKCNMDCNCDDDKEQCVYEREYAEHSSSKGVL 206
Query: 109 VTDLFPLRFSNGSVFN-VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYG 167
DL + F N S FGC + G L G++GLG+G +S+V QL + G
Sbjct: 207 GEDL--ISFGNESHLTPQRAVFGCKTVE--TGDLYSQRADGIIGLGQGDLSLVGQLVDKG 262
Query: 168 LIRNVIGHCIG--QNGRGVLFLGDGKVPSSGV 197
LI N G C G G G + +G PS +
Sbjct: 263 LISNSFGLCYGGLDVGGGSMIVGGFDYPSDMI 294
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 82/271 (30%), Positives = 111/271 (40%), Gaps = 32/271 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ V+L +G PP+ DTGSDL W QC PC C + P ++ C +
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLSLTSCDST 93
Query: 73 RCAALHWPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L + K PN C Y YGD + G L D F + SV V FGC
Sbjct: 94 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGV--AFGC 151
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G N G +T G+ G GRG +S+ SQL+ G + G VL
Sbjct: 152 GL--FNNGVFKSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTTITGAIPSTVLLDLPAD 207
Query: 192 VPSSG---VAWTPMLQ---NSAD-------LKHYILGPAELLYSGKSCGLKDLT--LIFD 236
+ S+G V TP++Q N A+ LK +G L + L + T I D
Sbjct: 208 LFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTIID 267
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKL 267
SG S +VYQ ++RD +KL
Sbjct: 268 SGTSITSLPPQVYQ-----VVRDEFAAQIKL 293
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 74/288 (25%), Positives = 114/288 (39%), Gaps = 55/288 (19%)
Query: 9 FFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK---- 64
F F +NL +G PP+ DTGS L+W+QC +PP + P
Sbjct: 67 FSFKYSMALIINLPIGTPPQTQPMVLDTGSQLSWIQCHK-----KQPPTASFDPSLSSTF 121
Query: 65 NIVPCSNPRCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSV 122
+I+PC++P C + P C N C Y Y DG + G LV + F + SV
Sbjct: 122 SILPCTHPLCKPRIPDFTLPTSCDQ-NRLCHYSYFYADGTYAEGNLVREKFTF---SRSV 177
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----- 177
PL GC +P G+LG+ GR+S Q + +C+
Sbjct: 178 STPPLILGCATESTDP--------RGILGMNLGRLSFAKQSKI-----TKFSYCVPPRQT 224
Query: 178 --GQNGRGVLFLGDGKVPSS-GVAWTPMLQNSA------DLKHYILGPAELLYSGKSCGL 228
G G +LG+ PSS G + M+ +S D Y + + +GK +
Sbjct: 225 RPGFTPTGSFYLGNN--PSSKGFKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNI 282
Query: 229 KDLTL----------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLK 266
+ DSG+ + Y S Y ++ + ++R +G LK
Sbjct: 283 SPAVFRADAGGSGQTMIDSGSEFTYLVSEAYDKVRAQVVR-AVGPRLK 329
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 73/287 (25%), Positives = 110/287 (38%), Gaps = 38/287 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V ++VG PP D+GSD+ WVQC PC C + + P + V C +
Sbjct: 171 YLVRVSVGSPPTEQYLVVDSGSDVMWVQCK-PCLECYVQADPLFDPATSATFSGVSCGSA 229
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C L P C+YE+ Y DG + GAL + L G + GCG
Sbjct: 230 ICRIL--PTSACGDGELGGCEYEVSYADGSYTKGALALETLTL----GGTAVEGVVIGCG 283
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG----------R 182
+ N G AG++GLG G +S+V QL G + +C+ G
Sbjct: 284 H--RNRGLFV--GAAGLMGLGWGPMSLVGQLG--GEVGGAFSYCLASRGGYGSGAADDDA 337
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK----SCGLKDLT------ 232
G L LG + G W P+++N Y +G + + + GL LT
Sbjct: 338 GWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGD 397
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGT-PLKLAPDDKTLPICW 278
++ D+G + Y + + L G P L C+
Sbjct: 398 VVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCY 444
>gi|223946655|gb|ACN27411.1| unknown [Zea mays]
Length = 378
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/229 (26%), Positives = 93/229 (40%), Gaps = 15/229 (6%)
Query: 54 KPPEKQYKPHKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDL 112
+P E H +PCS+ C ++ P C +P C Y I+Y + +S G L+ D
Sbjct: 10 RPAESTTSRH---LPCSHELCQSV-----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDT 61
Query: 113 FPLRFSNGSV-FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRN 171
L + V N + GCG Q L G+LGLG IS+ S L GL++N
Sbjct: 62 LHLNYREDHVPVNASVIIGCGQKQSG-DYLDGIAPDGLLGLGMADISVPSFLARAGLVQN 120
Query: 172 VIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL 231
C ++ G +F GD VPS TP + L+ Y + + K
Sbjct: 121 SFSMCFKEDSSGRIFFGDQGVPSQ--QSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSF 178
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG 280
+ DSG S+ VY+ + + T ++ +D T C+
Sbjct: 179 KALVDSGTSFTSLPFDVYKAFTMEFDKQMNAT--RVPYEDTTWKYCYSA 225
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 64/197 (32%), Positives = 95/197 (48%), Gaps = 27/197 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +NL++G PP F DTGS L W QC APCT C P ++P + +PC++
Sbjct: 90 YNMNLSIGTPPVTFSVLADTGSSLIWTQC-APCTECAARPAPPFQPASSSTFSKLPCASS 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C L P C C Y YG G ++ G L T+ + G+ F +TFGC
Sbjct: 149 LCQFLTSPY-RTCNATG--CVYYYPYGMGFTA-GYLATETLHV---GGASFP-GVTFGCS 200
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG----VLFLG 188
++ G ++G++GLGR +S+VSQ+ G+ R +C+ N +LF
Sbjct: 201 -TENGVG----NSSSGIVGLGRSPLSLVSQV---GVAR--FSYCLRSNADAGDSPILFGS 250
Query: 189 DGKVPSSGVAWTPMLQN 205
KV V TP+L+N
Sbjct: 251 LAKVTGGNVQSTPLLEN 267
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/250 (28%), Positives = 106/250 (42%), Gaps = 26/250 (10%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALHWPNPPRCK-HPN 89
DTGSDL W QC PC C + + P + + CS +C L C N
Sbjct: 110 DTGSDLIWTQC-KPCDQCYEQDAPLFDPKSSSTYRDISCSTKQCDLLK--EGASCSGEGN 166
Query: 90 DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCGYNQHNPGPLSPPDTAG 148
C Y YGD + G + D L ++G +P GCG HN G +G
Sbjct: 167 KTCHYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLPKAIIGCG---HNNGGSFTEKGSG 223
Query: 149 VLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRGVLFLGDGKVPSSGVAWTPM 202
++GLG G IS++SQL I +C+ N + F +G V GV TP+
Sbjct: 224 IVGLGGGPISLISQLGS--TIDGKFSYCLVPLSSNATNSSKLNFGSNGIVSGGGVQSTPL 281
Query: 203 LQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLI 256
+ D +++ +G + + G S G + +I DSG + F + E+ S +
Sbjct: 282 ISKDPDTFYFLTLEAVSVGSERIKFPGSSFGTSEGNIIIDSGTTLTLFPEDFFSELSSAV 341
Query: 257 MRDLIGTPLK 266
+ GTP++
Sbjct: 342 QDAVAGTPVE 351
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 89/342 (26%), Positives = 145/342 (42%), Gaps = 54/342 (15%)
Query: 18 AVNLTVGKPPKLFDFDFDTGSDLTWVQC--DAPCTGCT---KPPEK------QYKPHKNI 66
+++L+ G PP+ F DTGSD+ W C D CT C+ P+K + I
Sbjct: 79 SISLSFGTPPQKLSFLVDTGSDVVWAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKI 138
Query: 67 VPCSNPRCAALHWP----NPPRC----KHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFS 118
+ C NP+C + ++P PRC KH + C Y +YG G SS L+ + L+F
Sbjct: 139 LDCRNPKCVSTYFPYVHLGCPRCNGNSKHCSYACPYSTQYGTGASSGYFLLEN---LKFP 195
Query: 119 NGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL--REYGLIRNVIGHC 176
++ N L GC + + G GR S+ Q+ +++ N +
Sbjct: 196 RKTIRNFLL--GC-----TTSAARELSSDALAGFGRSMFSLPIQMGVKKFAYCLNSHDYD 248
Query: 177 IGQN-GRGVLFLGDGKVPSSGVAWTPMLQN-SADLKHYILGPAELLYSGKSCGLKDLTL- 233
+N G+ +L DGK + G+++TP L++ A +Y LG ++ K + L
Sbjct: 249 DTRNSGKLILDYRDGK--TKGLSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSKYLA 306
Query: 234 ---------IFDSGASYA-YFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT-LPICWRGPF 282
I DSG A Y T V++ + + + + + L + +T L C+
Sbjct: 307 PGSDGRSGVIIDSGYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEAETQTGLTPCY---- 362
Query: 283 KALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTSI 324
G + PL F R +VVP + Y IS S+
Sbjct: 363 NFTGHKSIKIPPLIYQF---RGGANMVVPGKNYFGISPQESL 401
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 72/161 (44%), Gaps = 17/161 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP-CTGCTKPPEKQYKPHKN----IVPCSN 71
+ + +G PP DTGS++ W+QC +P CT C K + P K+ I C +
Sbjct: 108 YVMKFNIGSPPVETYAIPDTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLCGH 167
Query: 72 PRCAALHW--PNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSVFNVPL 127
C W CK C Y I Y D S G + TD+ FP + +++ +
Sbjct: 168 RECKQALWGLGEYLGCKSSVQVCRYHISYEDHSFSEGTISTDIITFPEHIAEFGNYSLRM 227
Query: 128 TFGCGYNQ-----HNPGPLSPPDTAGVLGLGRGRISIVSQL 163
FGCGYN +P + P GV+GLG S+V QL
Sbjct: 228 FFGCGYNNSETPGQDPNSFTAP---GVVGLGNEMASLVGQL 265
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 64/204 (31%), Positives = 93/204 (45%), Gaps = 25/204 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + VG P DTGSD+TW+QC PC C + P + + P
Sbjct: 134 YMAKIAVGTPAVEALLAMDTGSDITWLQCQ-PCRRCYPQSGPVFDPRHSTSYREMGYDAP 192
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSS-IGALVTDLFPLRFSNGSVFNVP-LTFG 130
C AL K C Y + YGD GS+ +G + + L F+ G VP ++ G
Sbjct: 193 DCQALGRSGGGDAKRMT--CVYAVGYGDDGSTTVGDFIEET--LTFAGG--VQVPHMSIG 246
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--------GQNGR 182
CG++ N G + P AG+LGLGRG+IS SQ+ G +C+ G++
Sbjct: 247 CGHD--NKGLFAAP-AAGILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLSSPGRSVS 303
Query: 183 GVLFLGDGKVPSS-GVAWTPMLQN 205
L +GDG S ++TP +QN
Sbjct: 304 STLTIGDGAAAGSPPPSFTPTVQN 327
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 88/320 (27%), Positives = 126/320 (39%), Gaps = 38/320 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF V + VG PP D+GSD+ W+QC PC C + + + P + VPC +
Sbjct: 133 YF-VRVGVGSPPTEQYLVVDSGSDVIWIQCR-PCAECYQQADPLFDPAASASFTAVPCDS 190
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L P + C Y++ YGDG + G L + L F + + + GC
Sbjct: 191 GVCRTL--PGGSSGCADSGACRYQVSYGDGSYTQGVLAME--TLTFGDSTPVQ-GVAIGC 245
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN----GRGVLFL 187
G+ N G AG+LGLG G +S+V QL +C+ G G L
Sbjct: 246 GH--RNRGLFV--GAAGLLGLGWGPMSLVGQLGG--AAGGAFSYCLASRGADAGAGSLVF 299
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC----GLKDLT------LIFDS 237
G G W P+L+N+ Y +G L G+ GL DLT ++ D+
Sbjct: 300 GRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMDT 359
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
G + Y + IG L AP L C + G + +AL
Sbjct: 360 GTAVTRLPPDAYAALRDAFA-STIGGDLPRAPGVSLLDTC----YDLSGYASVRVPTVAL 414
Query: 298 SFTNRRNSVRLVVPPEAYLV 317
F R+ L +P LV
Sbjct: 415 YFG--RDGAALTLPARNLLV 432
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 83/304 (27%), Positives = 123/304 (40%), Gaps = 43/304 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT--GCTKPPEKQYKPHKN----IVPCS 70
+ V +++G P + DTGSDL+WVQC PC C + + P ++ VPC
Sbjct: 140 YVVTVSLGTPGVAQTLEVDTGSDLSWVQCT-PCAAPACYSQKDPLFDPAQSSSYAAVPCG 198
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
P C L C QC Y + YGDG + G +D L N +V FG
Sbjct: 199 GPVCGGLGI-YASSCSAA--QCGYVVSYGDGSKTTGVYSSDTLTLS-PNDAVRG--FFFG 252
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFLG 188
CG+ Q + D G+LGLGR S+V Q G V +C+ + G L LG
Sbjct: 253 CGHAQSG---FTGND--GLLGLGREEASLVEQ--TAGTYGGVFSYCLPTRPSTTGYLTLG 305
Query: 189 --DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------IFDS 237
G P G + T +L + +Y+ ++ +G S G + L++ + D+
Sbjct: 306 GPSGAAP-PGFSTTQLLSSPNAATYYV-----VMLTGISVGGQQLSVPSSVFAGGTVVDT 359
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
G Y + S + AP L C+ F G VT +AL
Sbjct: 360 GTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYN--FSGYGTVT--LPNVAL 415
Query: 298 SFTN 301
+F+
Sbjct: 416 TFSG 419
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 74/281 (26%), Positives = 112/281 (39%), Gaps = 26/281 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ L +G P + DTGS LTW+QC C + + P + V CS
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYASVRCSAS 193
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+C L NP C N C Y+ YGD S+G+L TD S GS +G
Sbjct: 194 QCDELQAATLNPSACSASN-VCIYQASYGDSSFSVGSLSTD----TVSFGSTRYPSFYYG 248
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGD 189
CG Q N G +AG++GL R ++S++ QL + +C+ G L +G
Sbjct: 249 CG--QDNEGLFG--RSAGLIGLARNKLSLLYQLAPS--LGYSFSYCLPTAASTGYLSIGP 302
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYAYF 244
++TPM +S D Y + + + G + L I DSG
Sbjct: 303 YNT-GHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRL 361
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
+ V+ + + + + G + AP L C+ G L
Sbjct: 362 PTAVHTALSKAVAQAMAGA--QRAPAFSILDTCFEGQASQL 400
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 74/152 (48%), Gaps = 12/152 (7%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +++G PP +DTGSDL W QC PC C K + P K+ V C +
Sbjct: 91 YLMKISIGTPPFDVYGIYDTGSDLMWTQC-LPCLSCYKQKNPMFDPSKSTSFKEVSCESQ 149
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
+C L + C P CD+ YGDG + G + T+ L ++G ++ + FGC
Sbjct: 150 QCRLL---DTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPXSIXNIVFGC 206
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
G+N N G + + G+ G G +S+ SQ+
Sbjct: 207 GHN--NSGTFN-ENEMGLFGTGGRPLSLTSQI 235
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 89/321 (27%), Positives = 132/321 (41%), Gaps = 33/321 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC-TGCTKPPEKQYKPHK----NIVPCSN 71
+ + L +G PP + DTGSDL W QC APC T C + P Y P +++PC N
Sbjct: 112 YLMTLAIGTPPLPYAAVADTGSDLIWTQC-APCGTQCFEQPAPLYNPASSTTFSVLPC-N 169
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
+ P C Y YG G ++ G ++ F S VP + FG
Sbjct: 170 SSLSMCAGALAGAAPPPGCACMYNQTYGTGWTA-GVQGSETFTFGSSAADQARVPGVAFG 228
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG-D 189
C N +AG++GLGRG +S+VSQL G + N L LG
Sbjct: 229 C----SNASSSDWNGSAGLVGLGRGSLSLVSQLGA-GRFSYCLTPFQDTNSTSTLLLGPS 283
Query: 190 GKVPSSGVAWTPMLQNSA----------DLKHYILGPAELLYSGKSCGLK-DLT--LIFD 236
+ +GV TP + + A +L LG L S + LK D T LI D
Sbjct: 284 AALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIID 343
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
SG + + YQ++ + + + P D L +C+ AL T +
Sbjct: 344 SGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCF-----ALPAPTSAPPAVL 398
Query: 297 LSFTNRRNSVRLVVPPEAYLV 317
S T + +V+P ++Y++
Sbjct: 399 PSMTLHFDGADMVLPADSYMI 419
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 75/275 (27%), Positives = 115/275 (41%), Gaps = 24/275 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSN----P 72
+ + +G P K + DTGS LTW+QC C + + P + S P
Sbjct: 121 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSCSAP 180
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+C AL NP C N C Y+ YGD S+G L D + F + SV N +G
Sbjct: 181 QCDALTTATLNPSTCSTSN-VCIYQASYGDSSFSVGYLSKDT--VSFGSTSVPN--FYYG 235
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG Q N G +AG++GL R ++S++ QL + +C+ + +L G
Sbjct: 236 CG--QDNEGLFG--QSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSGYLSIG 289
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASYAYFT 245
++TPM ++S D Y + + +GK + L I DSG
Sbjct: 290 SYNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYSSLPTIIDSGTVITRLP 349
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG 280
+ VY + + + GTP A L C++G
Sbjct: 350 TDVYSALSKAVAGAMKGTPRASA--FSILDTCFQG 382
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 71/153 (46%), Gaps = 13/153 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L +G PP F DTGSDLTW QC PC C Y P + VPCS+
Sbjct: 77 YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPVPCSSA 135
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVP-LTFG 130
C L C P+ C Y Y DG S G L T+ L S G +V + FG
Sbjct: 136 TC--LPVLRSRNCSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAFG 193
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
CG + ++ G +GLGRG +S+++QL
Sbjct: 194 CGTDNGG----DSLNSTGTVGLGRGTLSLLAQL 222
>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
Length = 452
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 94/213 (44%), Gaps = 29/213 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK------QYKPHKNIVPCS 70
V + VG PP+ DTGS+L+W++C+ T PP+ CS
Sbjct: 60 LTVPVAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCS 119
Query: 71 NPRCAALHW-----PNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+P C W P PP C P+ C + Y D S+ G L D F L G
Sbjct: 120 SPEC---QWRGRDLPVPPFCAGPPSXSCRVSLSYADASSADGILAADTFLL----GGAPP 172
Query: 125 VPLTFGCGYNQHNPGPLSPPDT---AGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-QN 180
V FGC + + + D+ G+LG+ RG +S V+Q +R +CI +
Sbjct: 173 VXALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQT---ATLR--FAYCIAPGD 227
Query: 181 GRGVLFL-GDGKVPSSGVAWTPMLQNSADLKHY 212
G G+L L GDG + + +TP++Q S L ++
Sbjct: 228 GPGLLVLGGDGAALAPQLNYTPLIQISRPLPYF 260
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 65/247 (26%), Positives = 101/247 (40%), Gaps = 22/247 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ L +G P + DTGS LTW+QC C + + P + V CS+
Sbjct: 131 YVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAGPVFDPRASGTYAAVQCSSS 190
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L NP C N C Y+ YGD S+G L D + F +GS +G
Sbjct: 191 ECGELQAATLNPSACSVSN-VCIYQASYGDSSYSVGYLSKDT--VSFGSGSFPG--FYYG 245
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG Q N G +AG++GL + ++S++ QL + +C+ + +L G
Sbjct: 246 CG--QDNEGLFG--RSAGLIGLAKNKLSLLYQLAPS--LGYAFSYCLPTSSAAAGYLSIG 299
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYAYFT 245
++TPM +S D Y + + + +G + + L I DSG
Sbjct: 300 SYNPGQYSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYRSLPTIIDSGTVITRLP 359
Query: 246 SRVYQEI 252
VY +
Sbjct: 360 PNVYTAL 366
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 68/247 (27%), Positives = 100/247 (40%), Gaps = 25/247 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V G P K DTGSD+TW+QC PC+ C + ++P ++ + C +
Sbjct: 138 YIVTAGFGTPAKNSLLIIDTGSDVTWIQCK-PCSDCYSQVDPIFEPQQSSSYKHLSCLSS 196
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C L N R C YEI YGDG S G + L GS FGCG
Sbjct: 197 ACTELTTMNHCRL----GGCVYEINYGDGSRSQGDFSQETLTL----GSDSFPSFAFGCG 248
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREY--GLIRNVIGHCIGQNGRGVLFLGDG 190
+ N G +AG+LGLGR +S SQ + G + + G +G G
Sbjct: 249 HT--NTGLF--KGSAGLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTSTGSFSVGQG 304
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-----IFDSGASYAYFT 245
+P++ + P++ NS Y +G + G+ + L I DSG
Sbjct: 305 SIPATAT-FVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGGTIVDSGTVITRLV 363
Query: 246 SRVYQEI 252
+ Y +
Sbjct: 364 PQAYDAL 370
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/132 (36%), Positives = 65/132 (49%), Gaps = 18/132 (13%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPC 69
YF +++ VG PPK F DTGSDL W+QC PC C + Y P ++NI C
Sbjct: 180 EYF-IDVFVGTPPKHFSLILDTGSDLNWIQC-VPCYECFEQNGPHYDPGQSSSYRNI-GC 236
Query: 70 SNPRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS------- 121
+ RC + P+PP+ CK N C Y YGD ++ G + F + + S
Sbjct: 237 HDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRR 296
Query: 122 VFNVPLTFGCGY 133
V NV FGCG+
Sbjct: 297 VENV--MFGCGH 306
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 69/260 (26%), Positives = 112/260 (43%), Gaps = 28/260 (10%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWP--NPPRCKHP 88
DTGSDLTWVQC PC C + + P + + C++ C +L + N C
Sbjct: 83 DTGSDLTWVQCQ-PCRLCYNQQDPLFNPSGSPSYQTILCNSSTCQSLQYATGNLGVCGSN 141
Query: 89 NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAG 148
C+Y + YGDG + G L + L ++ S F FGCG N N G +G
Sbjct: 142 TPTCNYVVNYGDGSYTRGDLGMEQLNLGTTHVSNF----IFGCGRN--NKGLFG--GASG 193
Query: 149 VLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDGKV---PSSGVAWTPM 202
++GLG+ +S+VSQ + V +C+ + G L LG ++ +++T M
Sbjct: 194 LMGLGKSDLSLVSQTS--AIFEGVFSYCLPTTAADASGSLILGGNSSVYKNTTPISYTRM 251
Query: 203 LQNSADLKHYILGPAELLYSG---KSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRD 259
+ N Y L + G ++ + ++ DSG VY+++ + ++
Sbjct: 252 IANPQLPTFYFLNLTGISIGGVALQAPNYRQSGILIDSGTVITRLPPPVYRDLKAEFLKQ 311
Query: 260 LIGTPLKLAPDDKTLPICWR 279
G P AP L C+
Sbjct: 312 FSGFP--SAPPFSILDTCFN 329
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 76/256 (29%), Positives = 110/256 (42%), Gaps = 27/256 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKN----IVPCS 70
+ V ++ G P DTGSD++W+QC PC+ P+K Y P + VPC+
Sbjct: 79 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK-PCSSGQCFPQKDPLYDPSHSSTYSAVPCA 137
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ C L QC + I Y DG S++GA D L + G++ FG
Sbjct: 138 SDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQD--KLTLAPGAIVQ-NFYFG 194
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLG 188
CG+ +H L GVLGLGR R S+ ++ YG V +C+ + G L LG
Sbjct: 195 CGHGKHAVRGLFD----GVLGLGRLRESLGAR---YG---GVFSYCLPSVSSKPGFLALG 244
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----LIFDSGASYAYF 244
GK P SG +TPM + A + GK L+ +I DSG
Sbjct: 245 AGKNP-SGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITGL 303
Query: 245 TSRVYQEIVSLIMRDL 260
S Y+ + S + +
Sbjct: 304 QSTAYRALRSAFRKAM 319
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 76/256 (29%), Positives = 110/256 (42%), Gaps = 27/256 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKN----IVPCS 70
+ V ++ G P DTGSD++W+QC PC+ P+K Y P + VPC+
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK-PCSSGQCFPQKDPLYDPSHSSTYSAVPCA 171
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ C L QC + I Y DG S++GA D L + G++ FG
Sbjct: 172 SDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQD--KLTLAPGAIVQ-NFYFG 228
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLG 188
CG+ +H L GVLGLGR R S+ ++ YG V +C+ + G L LG
Sbjct: 229 CGHGKHAVRGL----FDGVLGLGRLRESLGAR---YG---GVFSYCLPSVSSKPGFLALG 278
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----LIFDSGASYAYF 244
GK P SG +TPM + A + GK L+ +I DSG
Sbjct: 279 AGKNP-SGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITGL 337
Query: 245 TSRVYQEIVSLIMRDL 260
S Y+ + S + +
Sbjct: 338 QSTAYRALRSAFRKAM 353
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 78/322 (24%), Positives = 127/322 (39%), Gaps = 50/322 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP------EKQYKPHKNIVPCS 70
V+LTVG PP+ DTGS+L+W+ C+ + Y P +PCS
Sbjct: 73 LTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSSSYSP----IPCS 128
Query: 71 NPRCA--ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
+ C +P P C N C + Y D SS G L TD F + GS +
Sbjct: 129 SSTCTDQTRDFPIRPSCDS-NQFCHATLSYADASSSEGNLATDTFYI----GSSGIPNVV 183
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGRGVLFL 187
FGC + + G++G+ RG +S VSQ+ G + +CI + + G+L L
Sbjct: 184 FGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQM---GFPK--FSYCISEYDFSGLLLL 238
Query: 188 GDGKVP-SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------------- 233
GD + + +TP+++ S L ++ + G K L +
Sbjct: 239 GDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAG 298
Query: 234 --IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK-----TLPICWRGPFKA-- 284
+ DSG + + Y + + G+ L++ D + +C+R P
Sbjct: 299 QTMVDSGTQFTFLLGPAYTALRDHFLNKTAGS-LRVYEDSNFVFQGAMDLCYRVPTNQTR 357
Query: 285 ---LGQVTEYFKPLALSFTNRR 303
L VT F+ ++ T R
Sbjct: 358 LPPLPSVTLVFRGAEMTVTGDR 379
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 65/234 (27%), Positives = 100/234 (42%), Gaps = 23/234 (9%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAALH 78
+ L +G K DTGS+ VQC + P Q VPC + C A+
Sbjct: 1 MQLGIGSLQKNLSAIIDTGSEAVLVQCGSRSRPVFDPAASQSYRQ---VPCISQLCLAVQ 57
Query: 79 WP----NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP---LTFGC 131
+ C + + C Y + YGD +S G D+ L +N S V + FGC
Sbjct: 58 QQTSNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVAFGC 117
Query: 132 GYNQHNP-GPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN-----GRGVL 185
H+P G L + G++G RG +S+ SQL++ L + +C GV+
Sbjct: 118 A---HSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDR-LGGSKFSYCFPSQPWQPRATGVI 173
Query: 186 FLGDGKVPSSGVAWTPMLQN---SADLKHYILGPAELLYSGKSCGLKDLTLIFD 236
FLGD + S V++TP+L N A + Y +G + GK+ + + D
Sbjct: 174 FLGDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLD 227
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 88/288 (30%), Positives = 133/288 (46%), Gaps = 36/288 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ +++ VG PP+ F DTGSDL W+QC APC C + + P ++N+ C +
Sbjct: 149 YLIDVYVGTPPRRFRMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASSSYRNVT-CGD 206
Query: 72 PRCAALHWPNPPR-CKHP-NDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVP-L 127
RC + P PR C+ P D C Y YGD ++ G L + F + + G+ V +
Sbjct: 207 QRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGV 266
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRGV-- 184
FGCG+ N G AG+LGLGRG +S SQLR YG + +C+ ++G
Sbjct: 267 VFGCGH--RNRGLFH--GAAGLLGLGRGPLSFASQLRAVYG---HTFSYCLVEHGSDAGS 319
Query: 185 --------LFLGDGKVPSSGVAWTPMLQNS---ADLKHYILGPAELLYSGKSCGL-KDLT 232
L L ++ + A T ++ LK ++G L S + + KD +
Sbjct: 320 KVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVGKDGS 379
Query: 233 --LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
I DSG + +YF YQ ++ DL+ L PD L C+
Sbjct: 380 GGTIIDSGTTLSYFVEPAYQ-VIRQAFVDLMSRLYPLIPDFPVLNPCY 426
>gi|3036792|emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
Length = 335
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 69/234 (29%), Positives = 101/234 (43%), Gaps = 26/234 (11%)
Query: 34 FDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----VPCSNPRCAALHWPNP 82
DTGSDL WV CD AP G T E + Y P + V C+N CA +
Sbjct: 4 LDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCAQRN---- 59
Query: 83 PRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNGSVFNVP--LTFGCGYNQHNPG 139
+C C Y + Y +S G L+ D+ L + + V +TFGCG Q
Sbjct: 60 -QCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSF 118
Query: 140 -PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVA 198
++ P+ G+ GLG +IS+ S L GL+ + C G +G G + GD SS
Sbjct: 119 LDIAAPN--GLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKG--SSDQE 174
Query: 199 WTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEI 252
TP N + + I + G + + T +FD+G S+ Y +Y +
Sbjct: 175 ETPFNLNPSHPNYNI--TVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTV 226
>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
Length = 649
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 87/340 (25%), Positives = 141/340 (41%), Gaps = 49/340 (14%)
Query: 16 YFAVNLTVGKP-PKLFDFDFDTGSDLTWVQCDAPCTGC-TKPPEKQYKPHKNIVPCSNPR 73
Y+ N+ +G P P+ F DTGS LT+V C A C C T ++ P + C +
Sbjct: 111 YYYANIALGDPSPRTFQVIVDTGSTLTYVPC-ATCAKCGTHTGGTRFDPTGKWLTCQEKQ 169
Query: 74 CAALHWPN---PPRCKHPNDQCDYEIEYGDGGSSIGALVTDL--FPLRFSNGSVFNVPLT 128
C A P R N +C Y Y +G G LV D F + + + +
Sbjct: 170 CKAAGGPGICAGGRGAAAN-RCTYSRTYAEGSGVSGDLVRDKMHFGGDIAPATNGTLDVV 228
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRI-SIVSQLREYGLIRNVIGHCIGQ-NGRGVLF 186
FGC G + + G++GLG + SI +QL + + V C G G G L
Sbjct: 229 FGC--TNAESGTIHDQEADGLIGLGNNQFASIPNQLADTHGLPRVFSLCFGSFEGGGALS 286
Query: 187 LGDGKVPSS----GVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-KDLTL----IFDS 237
G++P++ + +T M N A +Y++ A + + DL + + DS
Sbjct: 287 F--GRLPATPHTPPLVYTDMRVNEAHPAYYVVSTAAMKIGDVAVATPSDLAVGYGTVMDS 344
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTP---LKLA---------PDDKTLPICWR------ 279
G ++ Y ++V+ + + + KLA PDD +C++
Sbjct: 345 GTTFTYVPTKVFHATAAALDAAVTTNAKPEKKLAKVPGPDPSYPDD----VCFQREGATE 400
Query: 280 -GPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVI 318
P + + EY+ PL ++F S LV+PP YL +
Sbjct: 401 IEPIVTMANLGEYYPPLTIAFDGEGAS--LVLPPSNYLFV 438
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 59/125 (47%), Gaps = 12/125 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF+ + VG P + DTGSD+TWVQC PC C + + + P + V C N
Sbjct: 167 YFS-RVGVGSPARQLYMVLDTGSDVTWVQCQ-PCADCYQQSDPVFDPSLSTSYASVACDN 224
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
PRC H + C++ C YE+ YGDG ++G T+ L S + GC
Sbjct: 225 PRC---HDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTL---GDSAPVSSVAIGC 278
Query: 132 GYNQH 136
G++
Sbjct: 279 GHDNE 283
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 99/243 (40%), Gaps = 29/243 (11%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCS 70
F + Y + L +G PP + DTGS+ W QC PC C + P K+
Sbjct: 54 FDTYEYL-MKLQIGTPPFEIEAVLDTGSEHIWTQC-LPCVHCYNQTAPIFDPSKS----- 106
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-F 129
RC + C YE+ YG + G LVT+ + ++G F +P T
Sbjct: 107 -------STFKEIRCDTHDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETII 159
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG- 188
GCG N N G P AGV+GL RG S+++Q+ G ++ +C G + G
Sbjct: 160 GCGRN--NSG--FKPGFAGVVGLDRGPKSLITQMG--GEYPGLMSYCFAGKGTSKINFGA 213
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDLTLIFDSGASY 241
+ V GV T + +A Y L G + G ++ DSG++
Sbjct: 214 NAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGSTL 273
Query: 242 AYF 244
YF
Sbjct: 274 TYF 276
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 99/243 (40%), Gaps = 29/243 (11%)
Query: 11 FPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCS 70
F + Y + L +G PP + DTGS+ W QC PC C + P K+
Sbjct: 60 FDTYEYL-MKLQIGTPPFEIEAVLDTGSEHIWTQC-LPCVHCYNQTAPIFDPSKS----- 112
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-F 129
RC + C YE+ YG + G LVT+ + ++G F +P T
Sbjct: 113 -------STFKEIRCDTHDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETII 165
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG- 188
GCG N N G P AGV+GL RG S+++Q+ G ++ +C G + G
Sbjct: 166 GCGRN--NSG--FKPGFAGVVGLDRGPKSLITQMG--GEYPGLMSYCFAGKGTSKINFGA 219
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYIL-------GPAELLYSGKSCGLKDLTLIFDSGASY 241
+ V GV T + +A Y L G + G ++ DSG++
Sbjct: 220 NAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGSTL 279
Query: 242 AYF 244
YF
Sbjct: 280 TYF 282
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 77/275 (28%), Positives = 109/275 (39%), Gaps = 31/275 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKNI----VPCS 70
+ V L G P DTGSD++WVQC APC P+K + P K+ + C
Sbjct: 125 YMVTLGFGTPSVPQVLLMDTGSDVSWVQC-APCNSTECYPQKDPLFDPSKSSTYAPIACG 183
Query: 71 NPRCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
C L H+ N C QC Y +EYGDG S+ G + + F+ G
Sbjct: 184 ADACNKLGDHYRN--GCTSGGTQCGYRVEYGDGSSTRGVYSNET--ITFAPGITVK-DFH 238
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRGVLFL 187
FGCG++Q GP D G+LGLG S+V Q YG +C+ FL
Sbjct: 239 FGCGHDQR--GPSDKFD--GLLGLGGAPESLVVQTASVYG---GAFSYCLPALNSEAGFL 291
Query: 188 GDGKVPS-----SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----LIFDSG 238
G PS S +TPM D Y++ + GK + ++ DSG
Sbjct: 292 ALGVRPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAFRGGMLIDSG 351
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT 273
Y + + + + P+ + D T
Sbjct: 352 TIVTELPETAYNALNAALRKAFAAYPMVASEDFDT 386
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 74/281 (26%), Positives = 111/281 (39%), Gaps = 26/281 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ L +G P + DTGS LTW+QC C + + P + V CS
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYTSVRCSAS 193
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+C L NP C N C Y+ YGD S+G L TD S GS +G
Sbjct: 194 QCDELQAATLNPSACSASN-VCIYQASYGDSSFSVGYLSTD----TVSFGSTSYPSFYYG 248
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGD 189
CG Q N G +AG++GL R ++S++ QL + +C+ G L +G
Sbjct: 249 CG--QDNEGLFG--RSAGLIGLARNKLSLLYQLAPS--LGYSFSYCLPTAASTGYLSIGP 302
Query: 190 GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDSGASYAYF 244
++TPM +S D Y + + + G + L I DSG
Sbjct: 303 YNT-GHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRL 361
Query: 245 TSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKAL 285
+ V+ + + + + G + AP L C+ G L
Sbjct: 362 PTAVHTALSKAVAQAMAGA--QRAPAFSILDTCFEGQASQL 400
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 66/240 (27%), Positives = 97/240 (40%), Gaps = 26/240 (10%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
+S + + L +G PP + DTGSDL W QC PC C + P K+ R
Sbjct: 58 YSIYLMRLQLGTPPFEIVAEIDTGSDLIWTQC-MPCPNCYTQFAPIFDPSKSST-FKEKR 115
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGCG 132
C H N C YEI Y D S G L T+ ++ ++G F + T GCG
Sbjct: 116 C------------HGN-SCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIGCG 162
Query: 133 YNQHN-PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLG-DG 190
N N P ++G++GL G S++SQ+ I +I +C G + G +
Sbjct: 163 LNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDL--PIPGLISYCFSSQGTSKINFGTNA 220
Query: 191 KVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIFDSGASYAYF 244
V G M +Y+ +G + G +D + DSG +Y Y
Sbjct: 221 VVAGDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQDGNIFIDSGTTYTYL 280
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 74/280 (26%), Positives = 119/280 (42%), Gaps = 44/280 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + ++G PP+ DTGSDL W +C A CT C Y P+K+ +PCS
Sbjct: 82 YDMTFSIGTPPQELSALADTGSDLIWAKCGA-CTRCVPQGSPSYYPNKSSSFSKLPCSGS 140
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC- 131
C+ L P+ +C +CDY+ YG L +D P ++ G + + T G
Sbjct: 141 LCSDL--PS-SQCSAGGAECDYKYSYG--------LASD--PHHYTQGYLGSETFTLGSD 187
Query: 132 -----GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV-- 184
G+ +G++GLGRG +S+VSQL +C+ +
Sbjct: 188 AVPGIGFGCTTMSEGGYGSGSGLVGLGRGPLSLVSQLN-----VGAFSYCLTSDAAKTSP 242
Query: 185 LFLGDGKVPSSGVAWTPMLQNSA-----DLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
L G G + +GV TP+L+ S +L+ +G A +G S +IFDSG
Sbjct: 243 LLFGSGALTGAGVQSTPLLRTSTYYYTVNLESISIGAATTAGTGSS------GIIFDSGT 296
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR 279
+ A+ Y ++ T L +A +C++
Sbjct: 297 TVAFLAEPAYTLAKEAVLSQT--TNLTMASGRDGYEVCFQ 334
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 74/256 (28%), Positives = 114/256 (44%), Gaps = 39/256 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +N++VG P F DTGSDL W QC APCT C + P ++P + +PC++
Sbjct: 86 YNMNISVGTPLLTFSVVADTGSDLIWTQC-APCTKCFQQPAPPFQPASSSTFSKLPCTSS 144
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C L PN R + C Y +YG G ++ G L T+ L+ + S +V FGC
Sbjct: 145 FCQFL--PNSIRTCNATG-CVYNYKYGSGYTA-GYLATE--TLKVGDASFPSV--AFGCS 196
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG----VLFLG 188
++ G T+G+ GLGRG +S++ QL G+ R +C+ +LF
Sbjct: 197 -TENGVG----NSTSGIAGLGRGALSLIPQL---GVGR--FSYCLRSGSAAGASPILFGS 246
Query: 189 DGKVPSSGVAWTPMLQNSA--------DLKHYILGPAELLYSGKSCGLKDLTL----IFD 236
+ V TP + N A +L +G +L + + G L I D
Sbjct: 247 LANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVD 306
Query: 237 SGASYAYFTSRVYQEI 252
SG + Y Y+ +
Sbjct: 307 SGTTLTYLAKDGYEMV 322
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 59/125 (47%), Gaps = 12/125 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF+ + VG P + DTGSD+TWVQC PC C + + + P + V C N
Sbjct: 163 YFS-RVGVGSPARQLYMVLDTGSDVTWVQCQ-PCADCYQQSDPVFDPSLSTSYASVACDN 220
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
PRC H + C++ C YE+ YGDG ++G T+ L S + GC
Sbjct: 221 PRC---HDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTL---GDSAPVSSVAIGC 274
Query: 132 GYNQH 136
G++
Sbjct: 275 GHDNE 279
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 67/270 (24%), Positives = 107/270 (39%), Gaps = 49/270 (18%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNPRC 74
V+L +G PP++ DTGS L+W+QC PP + P + +PC++P C
Sbjct: 99 VDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPA-KPPPTASFDPSLSSTFSTLPCTHPVC 157
Query: 75 AAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+ P C N C Y Y DG + G LV + F + S+F PL GC
Sbjct: 158 KPRIPDFTLPTSCDQ-NRLCHYSYFYADGTYAEGNLVREKFTF---SRSLFTPPLILGCA 213
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGVL 185
+P G+LG+ RGR+S SQ + +C+ G G
Sbjct: 214 TESTDP--------RGILGMNRGRLSFASQSKI-----TKFSYCVPTRVTRPGYTPTGSF 260
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAE--LLYSGKSCGLKDLTL---------- 233
+LG S+ + ML + + L P + G G + L +
Sbjct: 261 YLGHNP-NSNTFRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAG 319
Query: 234 -----IFDSGASYAYFTSRVYQEIVSLIMR 258
+ DSG+ + Y + Y ++ + ++R
Sbjct: 320 GSGQTMLDSGSEFTYLVNEAYDKVRAEVVR 349
>gi|302783208|ref|XP_002973377.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
gi|300159130|gb|EFJ25751.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
Length = 472
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 68/258 (26%), Positives = 109/258 (42%), Gaps = 19/258 (7%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC---TKPP--EKQYKPHKNIVPCSN 71
FA+NL +G PP +F S+ W C +PC C T P +PC++
Sbjct: 88 FAMNLNLGTPPVQHNFTMALNSEFFWAAC-SPCVDCNVSTNDPLFSSASSTSYTRIPCTS 146
Query: 72 PRCAALHWPNPPRCKHP---NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
P C+ + C + C Y Y SS G + +D+ ++ + N L
Sbjct: 147 PFCSTSPGFSTNACGSSAVGSTTCLYNFSYSTDYSSAGEMASDVVAMKTPRKTRGNKSLR 206
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFL 187
G + + L +T+G++G + S + QL E I +C+ G + L
Sbjct: 207 MSLGCGRESTTLLGILNTSGLVGFAKTDKSFIGQLAEMDYTSKFI-YCVPSDTFSGKIVL 265
Query: 188 GDGKVPS-SGVAWTPMLQNSADLKHYI----LGPAELLYSGKSCGLKDLT--LIFDSGAS 240
G+ K+ S S +++TPM+ NS L +YI + + L L D T I DS +
Sbjct: 266 GNYKISSHSSLSYTPMIVNSTAL-YYIGLRSISITDTLTFPVQGILADGTGGTIIDSTFA 324
Query: 241 YAYFTSRVYQEIVSLIMR 258
++YFT Y +V I
Sbjct: 325 FSYFTPDSYTPLVQAIQN 342
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 91/200 (45%), Gaps = 26/200 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V G P K DTGSDLTW+QC PC C + ++P ++ +PC +
Sbjct: 137 YIVTAGFGTPAKNSLLIIDTGSDLTWIQCK-PCADCYSQVDAIFEPKQSSSYKTLPCLSA 195
Query: 73 RCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L NP C C YEI YGDG SS G + L GS FG
Sbjct: 196 TCTELITSESNPTPCLLGG--CVYEINYGDGSSSQGDFSQETLTL----GSDSFQNFAFG 249
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-EYGLIRNVIGHCI----GQNGRGVL 185
CG+ N G ++G+LGLG+ +S SQ + +YG +C+ G
Sbjct: 250 CGHT--NTGLFK--GSSGLLGLGQNSLSFPSQSKSKYG---GQFAYCLPDFGSSTSTGSF 302
Query: 186 FLGDGKVPSSGVAWTPMLQN 205
+G G +P+S V +TP++ N
Sbjct: 303 SVGKGSIPASAV-FTPLVSN 321
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 82/304 (26%), Positives = 127/304 (41%), Gaps = 40/304 (13%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG--CTKPPEKQYKPHKN----IVP 68
S + V ++G P + DTGSDL+WVQC PC C + + + P ++ VP
Sbjct: 135 SNYVVTASLGTPGMAQTLEVDTGSDLSWVQCK-PCAAPSCYRQKDPLFDPAQSSSYAAVP 193
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
C CA L C QC Y + YGDG ++ G +D L +N +V
Sbjct: 194 CGRSACAGLGI-YASACSAA--QCGYVVSYGDGSNTTGVYSSDTLTLA-ANATVQG--FL 247
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLF 186
FGCG+ Q G + D G+LG GR + S+V Q G V +C+ + G L
Sbjct: 248 FGCGHAQSG-GLFTGID--GLLGFGREQPSLVQQ--TAGAYGGVFSYCLPTKSSTTGYLT 302
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------IFDS 237
LG + G + T +L + +Y+ ++ +G S G + L++ + D+
Sbjct: 303 LGGPSGVAPGFSTTQLLPSPNAPTYYV-----VMLTGISVGGQPLSVPASAFAAGTVVDT 357
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLAL 297
G Y + S + P AP L C+ F G V +AL
Sbjct: 358 GTVITRLPPAAYAALRSAFRSGMASYP--SAPPIGILDTCYS--FAGYGTVN--LTSVAL 411
Query: 298 SFTN 301
+F++
Sbjct: 412 TFSS 415
>gi|302853254|ref|XP_002958143.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
gi|300256504|gb|EFJ40768.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
Length = 475
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 62/247 (25%), Positives = 104/247 (42%), Gaps = 32/247 (12%)
Query: 89 NDQCDYEIEYGDGGSSIGALVTDLF-------PLRFSNGSVFNVPLTFGCGYNQHNPGPL 141
N++C Y Y + SS G +V D F P+R + FGC G +
Sbjct: 4 NEKCYYSRTYAERSSSEGWMVEDAFGFPDDQPPVR----------MVFGC--ENGETGEI 51
Query: 142 SPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPS-SGVAWT 200
G++G+G + SQL G+I +V C G G+L LGD +P + +T
Sbjct: 52 YRQLADGIMGMGNNHNAFQSQLVARGVIEDVFSLCFGYPKDGILLLGDVPMPKGANTVYT 111
Query: 201 PMLQNSADLKHYILGPAELLYSGKSCGL------KDLTLIFDSGASYAYFTSRVYQEIVS 254
P+L N+ L +Y + + +G L + ++ DSG ++ Y + + + +
Sbjct: 112 PLL-NNLHLHYYNVRMDGIAVNGVELSLNARIFTRGYGVVLDSGTTFTYLPTEAFNAMAA 170
Query: 255 LIMRDLIGTPLKLAP--DDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPP 312
I + L+ P D + ICW+G + +F F ++ RL +PP
Sbjct: 171 AIGSYALSHGLQSTPGADPQYNDICWKGAPDNFQGLENHFPSAEFVFG---DNARLSLPP 227
Query: 313 EAYLVIS 319
YL +S
Sbjct: 228 LRYLFVS 234
>gi|357168101|ref|XP_003581483.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 510
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 74/252 (29%), Positives = 103/252 (40%), Gaps = 25/252 (9%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
F ++A+ +TVG P F DTGSDL W+ C C GC P +P +
Sbjct: 100 FLHYAL-VTVGTPGHTFMVALDTGSDLFWLPCQ--CDGCPPPASGASGSASFYIPSMSST 156
Query: 74 CAALHWPNPPRCKHPND-----QCDYEIEYGDGG-SSIGALVTDLFPLRFSNG--SVFNV 125
A+ N C H D C Y++ Y SS G LV D+ L + +
Sbjct: 157 SQAVPC-NSDFCDHRKDCSTTSSCPYKMVYVSADTSSSGFLVEDVLYLSTEDNHPQILKA 215
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
+ FGCG Q L G+ GLG IS+ S L GL + C G++G G +
Sbjct: 216 QIMFGCGQVQ-TGSFLDAAAPNGLFGLGIDMISVPSILAHKGLTSDSFSMCFGRDGIGRI 274
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----IFDSGASY 241
GD SS TP+ N KH + +G + G + + L IFD+G ++
Sbjct: 275 SFGDQG--SSDQEETPLDINQ---KHPTYA---ITITGITVGTEPMDLEFSTIFDTGTTF 326
Query: 242 AYFTSRVYQEIV 253
Y Y I
Sbjct: 327 TYLADPAYTYIT 338
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 82/183 (44%), Gaps = 24/183 (13%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDF------DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN 65
P + +TVG P + D F D GSD+TW+QC PC C P Y K+
Sbjct: 120 PTSGEYIAKITVGTPYE-NDSSFEALLSPDMGSDVTWLQC-MPCFRCYHQPGPVYNRLKS 177
Query: 66 I----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS 121
V C P C AL + C ++C Y++EYGDG SS G + L F G
Sbjct: 178 SSASDVGCYAPACRALG--SSGGCVQFLNECQYKVEYGDGSSSAGDFGVET--LTFPPG- 232
Query: 122 VFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
VP + GCG + L P AG+LGLGRG +S SQ+ G +C+
Sbjct: 233 -VRVPGVAIGCGSDNQG---LFPAPAAGILGLGRGSLSFPSQIA--GRYGRSFSYCLAGQ 286
Query: 181 GRG 183
G G
Sbjct: 287 GTG 289
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 82/195 (42%), Gaps = 17/195 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSNP 72
+ + + +G P DTGSD++WVQC PC+ C + + P + CS+
Sbjct: 131 YVITVGIGSPAVTQTMSMDTGSDVSWVQCK-PCSQCHSEVDSLFDPSASSTYSPFSCSSA 189
Query: 73 RCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L C + QC Y + Y DG S+ G +D L GS FGC
Sbjct: 190 ACVQLSQSQQGNGCS--SSQCQYIVSYVDGSSTTGTYSSDTLTL----GSNAIKGFQFGC 243
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
+Q G S T G++GLG S+VSQ G +C+ FL G
Sbjct: 244 --SQSESGGFS-DQTDGLMGLGGDAQSLVSQTA--GTFGKAFSYCLPPTPGSSGFLTLGA 298
Query: 192 VPSSGVAWTPMLQNS 206
SG TPML+++
Sbjct: 299 ASRSGFVKTPMLRST 313
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 90/294 (30%), Positives = 130/294 (44%), Gaps = 43/294 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ +++ VG PP+ F DTGSDL W+QC APC C + P ++N+ C +
Sbjct: 151 YLMDVYVGTPPRRFRMIMDTGSDLNWLQC-APCLDCFDQVGPVFDPAASSSYRNVT-CGD 208
Query: 72 PRCAALHWPNPPR-CKHP-NDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNV-PL 127
RC + P PPR C+ P D C Y YGD ++ G L + F + + G+ V +
Sbjct: 209 QRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDDV 268
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRGV-- 184
FGCG+ N G AG+LGLGRG +S SQLR YG + +C+ +G V
Sbjct: 269 VFGCGH--WNRGLFH--GAAGLLGLGRGPLSFASQLRAVYG---HTFSYCLVDHGSDVAS 321
Query: 185 -LFLGDGKVPSSG--------VAWTPMLQNSADLKHY-----ILGPAELL------YSGK 224
+ G+ + A+ P + AD +Y +L ELL +
Sbjct: 322 KVVFGEDDALALAAAHPQLNYTAFAPA-SSPADTFYYVKLKGVLVGGELLNISSDTWGVG 380
Query: 225 SCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
I DSG + +YF YQ I + D +G L PD L C+
Sbjct: 381 EGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFI-DRMGRSYPLIPDFPVLSPCY 433
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 61/222 (27%), Positives = 87/222 (39%), Gaps = 24/222 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP--------HKNIVP 68
+ ++ +VG PP++ D SD W+QC A T P P V
Sbjct: 97 YVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIREVR 156
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGS--SIGALVTDLFPLRFSNGSVFNVP 126
C+N C L P C + C Y YG G + + G L D F +V
Sbjct: 157 CANRGCQRLV---PQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAF----ATVRADG 209
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
+ FGC D GV+GLGRG +S+VSQL+ + G +LF
Sbjct: 210 VIFGCAVATEG-------DIGGVIGLGRGELSLVSQLQIGRFSYYLAPDDAVDVGSFILF 262
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL 228
L D K +S TP++ N A Y + A + G+ +
Sbjct: 263 LDDAKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAI 304
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 55/158 (34%), Positives = 70/158 (44%), Gaps = 13/158 (8%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----V 67
P + + VG P DT SDLTW+QC PC C + P + +
Sbjct: 129 PTSGEYMAKIAVGTPAVQALLALDTASDLTWLQCQ-PCRRCYPQSGPVFDPRHSTSYGEM 187
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFP--LRFSNGSVFNV 125
P C AL K C Y ++YGDG S V DL L F+ G V
Sbjct: 188 NYDAPDCQALGRSGGGDAKR--GTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGG-VRQA 244
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
L+ GCG++ N G P AG+LGLGRG+ISI Q+
Sbjct: 245 YLSIGCGHD--NKGLFGAP-AAGILGLGRGQISIPHQI 279
>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Brachypodium distachyon]
Length = 509
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 89/334 (26%), Positives = 128/334 (38%), Gaps = 54/334 (16%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC-----TKPPEKQYKPHKNI----VPC 69
+ +G P F DTGSDL WV CD C C T K Y P ++ V C
Sbjct: 85 AKVALGTPNATFVVALDTGSDLFWVPCD--CKRCAPIANTSELLKPYSPRQSSTSKPVTC 142
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSN--------- 119
S+ C P C + N C Y ++Y SS G LV D+ + +
Sbjct: 143 SHSLC-----DRPNACGNGNGSCPYTVKYVSANTSSSGVLVEDVLYMTRQSSSSRSGNGG 197
Query: 120 --GSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHC 176
G + FGCG Q L G+LGLG R+S+ S L GL+ + C
Sbjct: 198 NVGEAVGARVVFGCGQEQTG-AFLDGAAMEGLLGLGMDRVSVPSLLAAAGLVGSDSFSMC 256
Query: 177 IGQNGRGVLFLGDGKVPSSGVAW--TPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLI 234
+G G + G+ PS A TP + S Y + + GK + +
Sbjct: 257 FSPDGNGRINFGE---PSDAGAQNETPFIV-SKTRPTYNISVTAVNVKGKGAMAAEFAAV 312
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK-----ALGQVT 289
DSG S+ Y Y L+ T +K + PF+ + GQ T
Sbjct: 313 VDSGTSFTYLNDPAYS---------LLATSFNSQVREKRANLSASIPFEYCYALSRGQ-T 362
Query: 290 EYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTS 323
E P +S T R +V V P +++++ T+
Sbjct: 363 EVLMP-EVSLTTRGGAVFPVTRP--FVIVAGETT 393
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 64/131 (48%), Gaps = 16/131 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
YF +++ +G PP+ F DTGSDL W+QC PC C Y P ++ + C
Sbjct: 191 EYF-MDVFIGTPPRHFSLILDTGSDLNWIQC-VPCYDCFVQNGPYYDPKESSSFKNIGCH 248
Query: 71 NPRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-------GSV 122
+PRC + P+PP+ CK N C Y YGD ++ G + F + ++ V
Sbjct: 249 DPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRV 308
Query: 123 FNVPLTFGCGY 133
NV FGCG+
Sbjct: 309 ENV--MFGCGH 317
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 88/332 (26%), Positives = 130/332 (39%), Gaps = 62/332 (18%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNPRC 74
++L +G PP+ DTGS L+W+QC P+ + P + +PCS+P C
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCHR--KKLPPKPKTSFDPSLSSSFSTLPCSHPLC 131
Query: 75 AAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+ P C N C Y Y DG + G LV + + FSN + PL GC
Sbjct: 132 KPRIPDFTLPTSCDS-NRLCHYSYFYADGTFAEGNLVKE--KITFSNTEI-TPPLILGCA 187
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGVL 185
D G+LG+ RGR+S VSQ + + +CI G G
Sbjct: 188 TESS--------DDRGILGMNRGRLSFVSQAKI-----SKFSYCIPPKSNRPGFTPTGSF 234
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYS----GKSCGLKDLTL-------- 233
+LGD S G + +L + L P L Y+ G GLK L +
Sbjct: 235 YLGDNP-NSHGFKYVSLLTFPESQRMPNLDP--LAYTVPMIGIRFGLKKLNISGSVFRPD 291
Query: 234 -------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA-PDDKTLPICWRGPFKAL 285
+ DSG+ + + Y ++ + IM +G LK T +C+ G +
Sbjct: 292 AGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTR-VGRRLKKGYVYGGTADMCFDG---NV 347
Query: 286 GQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+ L FT V ++VP E LV
Sbjct: 348 AMIPRLIGDLVFVFT---RGVEILVPKERVLV 376
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 61/182 (33%), Positives = 83/182 (45%), Gaps = 18/182 (9%)
Query: 34 FDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKN----IVPCSNPRCAALHWPNPPRCKH 87
DT SD+TWVQC +PC P+K Y P K+ + C++P C L P C +
Sbjct: 173 LDTASDVTWVQC-SPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLG-PYANGCTN 230
Query: 88 PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTA 147
N+QC Y + Y DG S+ G ++DL + + FGC + A
Sbjct: 231 -NNQCQYRVRYPDGTSTAGTYISDLLTITPATAVRS---FQFGCSHGVQGSFSFG-SSAA 285
Query: 148 GVLGLGRGRISIVSQLRE-YGLIRNVIGHCI-GQNGRGVLFLGDGKVPSSGVAWTPMLQN 205
G++ LG G S+VSQ YG V HC RG LG +V + TPML+N
Sbjct: 286 GIMALGGGPESLVSQTAATYG---RVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKN 342
Query: 206 SA 207
A
Sbjct: 343 PA 344
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 66/125 (52%), Gaps = 15/125 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSN 71
YF+ + +GKPP DTGSD+ WVQC APC C + + ++P + + C+
Sbjct: 149 YFS-RVGIGKPPSQAYLILDTGSDVNWVQC-APCADCYQQADPIFEPASSASFSTLSCNT 206
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
+C +L + C+ ND C YE+ YGDG ++G VT+ L + V NV + GC
Sbjct: 207 RQCRSL---DVSECR--NDTCLYEVSYGDGSYTVGDFVTETITL--GSAPVDNVAI--GC 257
Query: 132 GYNQH 136
G+N
Sbjct: 258 GHNNE 262
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 56/172 (32%), Positives = 80/172 (46%), Gaps = 28/172 (16%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWPN------PPR 84
DT S+LTWVQC APC C + + P + VPC++ C AL
Sbjct: 169 DTASELTWVQC-APCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALQLATGGTSGGAAA 227
Query: 85 CKHPNDQ---CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPL 141
C+ + C Y + Y DG S G L D L G V + FGCG + P P
Sbjct: 228 CQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSL---AGEVID-GFVFGCGTSNQGP-PF 282
Query: 142 SPPDTAGVLGLGRGRISIVSQ-LREYGLIRNVIGHCI---GQNGRGVLFLGD 189
T+G++GLGR ++S+VSQ + ++G V +C+ + G L +GD
Sbjct: 283 G--GTSGLMGLGRSQLSLVSQTMDQFG---GVFSYCLPLKESDSSGSLVIGD 329
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 124/286 (43%), Gaps = 33/286 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + VG PP+ F DTGSDL W+QC APC C + P + V C +
Sbjct: 150 YLVEVYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFDQRGPVFDPMASTSYRNVTCGDT 208
Query: 73 RCAALHWPNPPR-CKHP-NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTF 129
RC + P PR C+ +D C Y YGD ++ G L + F + + S V +
Sbjct: 209 RCGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVDGVVL 268
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRGV---L 185
GCG+ N G AG+LGLGRG +S SQLR YG + +C+ +G V +
Sbjct: 269 GCGH--RNRGLFHG--AAGLLGLGRGPLSFASQLRAVYG---HAFSYCLVDHGSAVGSKI 321
Query: 186 FLGDGKVPSS--GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----------- 232
GD V S + +T ++A+ Y + +L G+ + T
Sbjct: 322 VFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGG 381
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
I DSG + +YF Y+ I + D + L D L C+
Sbjct: 382 TIIDSGTTLSYFPEPAYKAIRQAFV-DRMDKAYPLIADFPVLSPCY 426
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 62/128 (48%), Gaps = 15/128 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF L VG PP+ DTGSD+ W+QC +PC C + + P+K+ +PCS+
Sbjct: 110 YF-TRLGVGTPPRYLYMVLDTGSDVVWLQC-SPCRKCYSQSDPIFNPYKSKSFAGIPCSS 167
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L + C C Y++ YGDG + G T+ L F + V L GC
Sbjct: 168 PLCRRL---DSSGCSTRRHTCLYQVSYGDGSFTTGDFATE--TLTFRGNKIAKVAL--GC 220
Query: 132 GYNQHNPG 139
G+ HN G
Sbjct: 221 GH--HNEG 226
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/319 (24%), Positives = 129/319 (40%), Gaps = 48/319 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
V+LTVG PP+ DTGS+L+W+ C + + PH + +PC +P
Sbjct: 70 LTVSLTVGTPPQSVTMVLDTGSELSWLHCKK-----QQNINSVFNPHLSSSYTPIPCMSP 124
Query: 73 RCA--ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C + P C N+ C + Y D S G L +D F + S + FG
Sbjct: 125 ICKTRTRDFLIPVSCDS-NNLCHVTVSYADFTSLEGNLASDTFAISGSG----QPGIIFG 179
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGD 189
+ + T G++G+ RG +S V+Q+ G + +CI G++ GVL GD
Sbjct: 180 SMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQM---GFPK--FSYCISGKDASGVLLFGD 234
Query: 190 GKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--------------- 233
G + +TP+++ + L ++ + G G K L +
Sbjct: 235 ATFKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQT 294
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-----DKTLPICWR----GPFKA 284
+ DSG + + VY + + + G L L D + + +C+R G A
Sbjct: 295 MVDSGTRFTFLLGSVYTALRNEFVAQTRGV-LTLLEDPNFVFEGAMDLCFRVRRGGVVPA 353
Query: 285 LGQVTEYFKPLALSFTNRR 303
+ VT F+ +S + R
Sbjct: 354 VPAVTMVFEGAEMSVSGER 372
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 62/125 (49%), Gaps = 12/125 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF+ + VG+P + DTGSD+TW+QC PC C + Y P + V C +
Sbjct: 163 YFS-RVGVGRPARQLYMVLDTGSDVTWLQCQ-PCADCYAQSDPVYDPSVSTSYATVGCDS 220
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
PRC L + C++ C YE+ YGDG ++G T+ L S V NV + GC
Sbjct: 221 PRCRDL---DAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLGDS-APVSNVAI--GC 274
Query: 132 GYNQH 136
G++
Sbjct: 275 GHDNE 279
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 87/289 (30%), Positives = 121/289 (41%), Gaps = 51/289 (17%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF L VG P + DTGSD+ W+QC APC C + + P K+ +PC +
Sbjct: 145 YF-TRLGVGTPARYVYMVLDTGSDIVWIQC-APCIKCYSQTDPVFDPTKSRSFANIPCGS 202
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L +P C C Y++ YGDG ++G T+ L F V V L GC
Sbjct: 203 PLCRRLDYPG---CSTKKQICLYQVSYGDGSFTVGEFSTE--TLTFRGTRVGRVVL--GC 255
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR----GVLFL 187
G++ N G LG GR+S SQ+ + +C+G +
Sbjct: 256 GHD--NEGLFVGAAGLLGLGR--GRLSFPSQIGRR--FNSKFSYCLGDRSASSRPSSIVF 309
Query: 188 GDGKVPSSGVAWTPMLQN-SADLKHYILGPAELL--------YSGKSCGLKDLT------ 232
GD + S +TP+L N D +Y+ ELL SG S L L
Sbjct: 310 GDSAI-SRTTRFTPLLSNPKLDTFYYV----ELLGISVGGTRVSGISASLFKLDSTGNGG 364
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRD--LIG-TPLKLAPDDKTLPICW 278
+I DSG S T Y + +RD L+G + LK AP+ C+
Sbjct: 365 VIIDSGTSVTRLTRAAY-----VALRDAFLVGASNLKRAPEFSLFDTCF 408
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 61/182 (33%), Positives = 83/182 (45%), Gaps = 18/182 (9%)
Query: 34 FDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKN----IVPCSNPRCAALHWPNPPRCKH 87
DT SD+TWVQC +PC P+K Y P K+ + C++P C L P C +
Sbjct: 148 LDTASDVTWVQC-SPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLG-PYANGCTN 205
Query: 88 PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTA 147
N+QC Y + Y DG S+ G ++DL + + FGC + A
Sbjct: 206 -NNQCQYRVRYPDGTSTAGTYISDLLTITPATAVRS---FQFGCSHGVQGSFSFG-SSAA 260
Query: 148 GVLGLGRGRISIVSQLRE-YGLIRNVIGHCI-GQNGRGVLFLGDGKVPSSGVAWTPMLQN 205
G++ LG G S+VSQ YG V HC RG LG +V + TPML+N
Sbjct: 261 GIMALGGGPESLVSQTAATYG---RVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKN 317
Query: 206 SA 207
A
Sbjct: 318 PA 319
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 61/124 (49%), Gaps = 8/124 (6%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ ++ +VG PP DTGSD+ W+QC PC C + P ++ +PCS+
Sbjct: 94 YLMSYSVGTPPFQILGIVDTGSDIIWLQCQ-PCEDCYNQTTPIFDPSQSKTYKTLPCSSN 152
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
C ++ + C ND+C+Y I YGD S G L + L ++GS P T GC
Sbjct: 153 ICQSVQ--SAASCSSNNDECEYTITYGDNSHSQGDLSVETLTLGSTDGSSVQFPKTVIGC 210
Query: 132 GYNQ 135
G+N
Sbjct: 211 GHNN 214
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 86/331 (25%), Positives = 127/331 (38%), Gaps = 60/331 (18%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNPRC 74
++L +G PP+ DTGS L+W+QC P+ + P + +PCS+P C
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCHR--KKLPPKPKTSFDPSLSSSFSTLPCSHPLC 131
Query: 75 AAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+ P C N C Y Y DG + G LV + + FSN + PL GC
Sbjct: 132 KPRIPDFTLPTSCDS-NRLCHYSYFYADGTFAEGNLVKE--KITFSNTEI-TPPLILGCA 187
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGVL 185
D G+LG+ RGR+S VSQ + + +CI G G
Sbjct: 188 TESS--------DDRGILGMNRGRLSFVSQAKI-----SKFSYCIPPKSNRPGFTPTGSF 234
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYS----GKSCGLKDLTL-------- 233
+LGD S G + +L + L P L Y+ G GLK L +
Sbjct: 235 YLGDNP-NSHGFKYVSLLTFPESQRMPNLDP--LAYTVPMIGIRFGLKKLNISGSVFRPD 291
Query: 234 -------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
+ DSG+ + + Y ++ + IM + K T +C+ G +
Sbjct: 292 AGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDG---NVA 348
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+ L FT V + VP E LV
Sbjct: 349 MIPRLIGDLVFVFT---RGVEIFVPKERVLV 376
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 69/144 (47%), Gaps = 20/144 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSN 71
YF L VG PPK DTGSD+ W+QC APC C + + P K + + C +
Sbjct: 174 YF-TRLGVGTPPKYVYMVLDTGSDVVWIQC-APCRKCYSQTDPVFDPKKSGSFSSISCRS 231
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
P C L + P C + C Y++ YGDG + G T+ R + VP + G
Sbjct: 232 PLCLRL---DSPGC-NSRQSCLYQVAYGDGSFTFGEFSTETLTFRGT-----RVPKVALG 282
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGR 154
CG++ N G AG+LGLGR
Sbjct: 283 CGHD--NEGLFV--GAAGLLGLGR 302
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 59/126 (46%), Gaps = 13/126 (10%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
YF + VG P + DTGSD+ W+QC APC C + + P K+ +PC
Sbjct: 117 EYF-TRIGVGTPARYVYMVLDTGSDVVWLQC-APCRKCYTQTDHVFDPTKSRTYAGIPCG 174
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
P C L + P C + N C Y++ YGDG + G T+ L F V V L G
Sbjct: 175 APLCRRL---DSPGCSNKNKVCQYQVSYGDGSFTFGDFSTE--TLTFRRNRVTRVAL--G 227
Query: 131 CGYNQH 136
CG++
Sbjct: 228 CGHDNE 233
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/165 (33%), Positives = 82/165 (49%), Gaps = 19/165 (11%)
Query: 13 IFSYFAVNLTVGKP-PKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IV 67
I S + ++L++G P P+ DTGSDL W QC C C P + + V
Sbjct: 96 IDSEYLIHLSIGTPRPQRVALTLDTGSDLVWTQC--ACHVCFAQPFPTFDALASQTTLAV 153
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRF---SNGSVFN 124
PCS+P C + +P C ++ C Y +Y D + G +V D F R +NGS +
Sbjct: 154 PCSDPICTSGKYP-LSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAH 212
Query: 125 ----VP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR 164
VP + FGCG Q+N G + + +G+ G RG +S+ SQL+
Sbjct: 213 AGVAVPNVRFGCG--QYNKG-IFKSNESGIAGFSRGPMSLPSQLK 254
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 74/261 (28%), Positives = 109/261 (41%), Gaps = 30/261 (11%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWP--NPPRCKHP 88
DTGSDL+WVQC PC C + + P + V CS+P C +L N C
Sbjct: 151 DTGSDLSWVQCQ-PCKRCYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATGNLGVCGSN 209
Query: 89 NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAG 148
C+Y + YGDG + G L T+ L N + N FGCG N N G +G
Sbjct: 210 PPSCNYVVNYGDGSYTRGELGTE--HLDLGNSTAVN-NFIFGCGRN--NQGLFG--GASG 262
Query: 149 VLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDGKV---PSSGVAWTPM 202
++GLGR +S++SQ + V +C+ G L +G ++ +++T M
Sbjct: 263 LVGLGRSSLSLISQTS--AMFGGVFSYCLPITETEASGSLVMGGNSSVYKNTTPISYTRM 320
Query: 203 LQNSADLKHYILGPAELLYSGKSCGL----KDLTLIFDSGASYAYFTSRVYQEIVSLIMR 258
+ N L Y L + + KD +I DSG +YQ + ++
Sbjct: 321 IPN-PQLPFYFLNLTGITVGSVAVQAPSFGKDGMMI-DSGTVITRLPPSIYQALKDEFVK 378
Query: 259 DLIGTPLKLAPDDKTLPICWR 279
G P AP L C+
Sbjct: 379 QFSGFP--SAPAFMILDTCFN 397
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 84/322 (26%), Positives = 129/322 (40%), Gaps = 40/322 (12%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDA-PCTGCTKPPEKQYKPHKNI----VPCSNPRCAAL 77
+G PP+ DTGS+L W QC GC Y P ++ V C++ C
Sbjct: 90 IGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVACNDTACL-- 147
Query: 78 HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC-GYNQH 136
+ RC C YG G+ G L T++F S NV L FGC ++
Sbjct: 148 -LGSETRCARDGKACAVLTAYG-AGAIGGFLGTEVFTFGHGQSSENNVSLAFGCITASRL 205
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL------GDG 190
PG L +G++GLGRG++S+ SQL + + + LF+ G
Sbjct: 206 TPGSLD--GASGIIGLGRGKLSLPSQLGDNKFSYCLTPYFSDAANTSTLFVGASAGLSGG 263
Query: 191 KVPSSGVAWTPMLQNSAD----------LKHYILGPAELLYSGKSCGLKDLT------LI 234
P++ V P L+N D L +G A+L + L+++ +
Sbjct: 264 GAPATSV---PFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVAPAKWGGTL 320
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKP 294
DSG+ + YQ + ++R L + + + L +C G A G + P
Sbjct: 321 IDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCVGG--VAPGDAGKLVPP 378
Query: 295 LALSFTNRRNSVR-LVVPPEAY 315
L L F + +VVPPE Y
Sbjct: 379 LVLHFGSGGGGGGDVVVPPENY 400
>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 119/283 (42%), Gaps = 53/283 (18%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP---------HKNIV 67
V+L +G PP+ D DTGS L+W+QC PP + K +++
Sbjct: 66 LVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKIKKRLPPLPKPKTTSFDPSLSSSFSLL 125
Query: 68 PCSNPRCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV 125
PC++P C + P C N C Y Y DG + G LV + F + S+
Sbjct: 126 PCNHPICKPRIPDFTLPTSCDQ-NRLCHYSYFYADGTLAEGNLVREKFTF---SKSLSTP 181
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNG 181
P+ GC + + G+LG+ RGR+S +SQ + + +C+ G N
Sbjct: 182 PVILGCAQ--------ASTENRGILGMNRGRLSFISQAK-----ISKFSYCVPSRTGSNP 228
Query: 182 RGVLFLGDGKVPSSGVAWTPML-----QNSADLK--HYILGPAELLYSGK---------- 224
G+ +LGD SS + ML Q+S +L Y L + +GK
Sbjct: 229 TGLFYLGDNP-NSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNVPPAAFK 287
Query: 225 -SCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLK 266
G T+I DSG+ Y Y+++ ++R L+G +K
Sbjct: 288 PDAGGSGQTMI-DSGSDLTYLVDEAYEKVKEEVVR-LVGAMMK 328
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 90/199 (45%), Gaps = 25/199 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG--CTKPPEKQYKPHKNIV----PCS 70
+ + +T+G P DTGSD++WVQC APC C+ +K + P + C
Sbjct: 129 YVITVTIGTPAVTQVMSIDTGSDVSWVQC-APCAAQSCSSQKDKLFDPAMSATYSAFSCG 187
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ +CA L K QC Y ++YGDG ++ G +D L S+ FG
Sbjct: 188 SAQCAQLGDEGNGCLK---SQCQYIVKYGDGSNTAGTYGSDTLSLTSSDAV---KSFQFG 241
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCI---GQNGRGVLF 186
C + G + D G++GLG S+VSQ YG +C+ +G G L
Sbjct: 242 C--SHRAAGFVGELD--GLMGLGGDTESLVSQTAATYG---KAFSYCLPPPSSSGGGFLT 294
Query: 187 LG-DGKVPSSGVAWTPMLQ 204
LG G SS + TPM++
Sbjct: 295 LGAAGGASSSRYSHTPMVR 313
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 74/256 (28%), Positives = 114/256 (44%), Gaps = 39/256 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +N++VG P F DTGSDL W QC APCT C + P ++P + +PC++
Sbjct: 86 YNMNISVGTPLLTFPVVADTGSDLIWTQC-APCTKCFQQPAPPFQPASSSTFSKLPCTSS 144
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C L PN R + C Y +YG G ++ G L T+ L+ + S +V FGC
Sbjct: 145 FCQFL--PNSIRTCNATG-CVYNYKYGSGYTA-GYLATE--TLKVGDASFPSV--AFGCS 196
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG----VLFLG 188
++ G T+G+ GLGRG +S++ QL G+ R +C+ +LF
Sbjct: 197 -TENGVG----NSTSGIAGLGRGALSLIPQL---GVGR--FSYCLRSGSAAGASPILFGS 246
Query: 189 DGKVPSSGVAWTPMLQNSA--------DLKHYILGPAELLYSGKSCGLKDLTL----IFD 236
+ V TP + N A +L +G +L + + G L I D
Sbjct: 247 LANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVD 306
Query: 237 SGASYAYFTSRVYQEI 252
SG + Y Y+ +
Sbjct: 307 SGTTLTYLAKDGYEMV 322
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/196 (30%), Positives = 88/196 (44%), Gaps = 22/196 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSNP 72
+ + + +G P K D+GSD++WVQC PC C + + P + CS+
Sbjct: 131 YLITVRLGSPAKTQTVLIDSGSDVSWVQCK-PCLQCHSQVDPLFDPSLSSTYSPFSCSSA 189
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
CA L + C + QC Y + Y DG S+ G +D L + S F FGC
Sbjct: 190 ACAQLGQ-DGNGCSS-SSQCQYIVRYADGSSTTGTYSSDTLALGSNTISNFQ----FGCS 243
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDG 190
+ + L T G++GLG G S+ SQ G +C+ + G L LG G
Sbjct: 244 HVESGFNDL----TDGLMGLGGGAPSLASQ--TAGTFGTAFSYCLPPTPSSSGFLTLGAG 297
Query: 191 KVPSSGVAWTPMLQNS 206
+SG TPML++S
Sbjct: 298 ---TSGFVKTPMLRSS 310
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 75/286 (26%), Positives = 120/286 (41%), Gaps = 41/286 (14%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPC-TGCTKPPEKQYKPHKNI----VPCSNPRCA 75
L+VG PP F DTGSDLTW QC APC T C P Y P ++ +PC++P C
Sbjct: 100 LSVGTPPLAFPAIIDTGSDLTWTQC-APCTTACFAQPTPLYDPARSSTFSKLPCASPLCQ 158
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL----RFSNGSVFNVPLTFGC 131
AL P+ R + C Y+ Y G ++ G L D + + S + FGC
Sbjct: 159 AL--PSAFRACNATG-CVYDYRYAVGFTA-GYLAADTLAIGDGDGDGDASSSFAGVAFGC 214
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG----VLFL 187
+ N G + +G++GLGR +S++SQ+ G+ R +C+ + +LF
Sbjct: 215 --STANGGDMD--GASGIVGLGRSALSLLSQI---GVGR--FSYCLRSDADAGASPILFG 265
Query: 188 GDGKVPSSGVAWTPMLQNS-----------ADLKHYILGPAELLYSGKSCGLKDL---TL 233
V V T +L+N +L +G +L + + G +
Sbjct: 266 ALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGV 325
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR 279
I DSG ++ Y Y + + G +++ +C+
Sbjct: 326 IVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFE 371
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 69/265 (26%), Positives = 115/265 (43%), Gaps = 36/265 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS TWV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTTWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P TFG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPSFTFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL--- 185
C + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 113 CNLDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSERGFFSKT 167
Query: 186 --FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDS 237
+ GKV + + V +T M+ + + + + A + G+ GL ++FDS
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 238 GASYAYFTSR----VYQEIVSLIMR 258
G+ +Y R + Q I L++R
Sbjct: 228 GSELSYIPDRALSVLSQRIRELLLR 252
>gi|66817422|ref|XP_642564.1| hypothetical protein DDB_G0277581 [Dictyostelium discoideum AX4]
gi|60470632|gb|EAL68608.1| hypothetical protein DDB_G0277581 [Dictyostelium discoideum AX4]
Length = 492
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 81/337 (24%), Positives = 134/337 (39%), Gaps = 41/337 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKP----HKNIVP 68
+++ +N+ V + F DTGS LT + P GC + + Y P ++P
Sbjct: 94 NFYQINVNVLIGQQKFILQVDTGSTLTAI----PLKGCNSCKDNRPVYDPALSSSSQLIP 149
Query: 69 CSNPRCAALHWPNPPRCKHPNDQ--CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
CS+ +C +P H N + CD+ I YGDG G + +D +V V
Sbjct: 150 CSSDKCLGSGSASPSCKLHQNAKSTCDFIILYGDGSKIKGKVFSDEI-------TVSGVS 202
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGR--GRISIV-----SQLREYGLIRNVIGHCIGQ 179
T G N G P G++GLGR ++V S +R I+N+ G +
Sbjct: 203 STIYFGANVEEVGAFEYPRADGIMGLGRTSNNKNLVPTIFDSMVRSNSSIKNIFGIYLDY 262
Query: 180 NGRGVLFLG--DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL-TLIFD 236
+G+G L LG + + +TP +Q + Y + P S + +I D
Sbjct: 263 HGQGYLSLGKINHHYYIGSIQYTP-IQPAGPF--YAIKPTSFRVDNTSFPANSMGQVIVD 319
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA-PDDKTLPICWRGPFKALGQVTEYFKPL 295
SG S TSRVY ++ + + + P + +C+ + E F
Sbjct: 320 SGTSDLILTSRVYDHLIQYFRKHYCHIDMVCSYPSIFSSRVCF--------EKEEDFATF 371
Query: 296 ALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYLTG 332
VR+ +PP+ Y++ + S + Y G
Sbjct: 372 PWLHFGFEGGVRIAIPPKNYMIKTESNQQGVYGYCWG 408
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 78/320 (24%), Positives = 131/320 (40%), Gaps = 34/320 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK-----PPEKQYKPHKNIVPCSN 71
+ + +G P + DTGSD+ WV+C +PC C PP Y + +
Sbjct: 83 YYTEIGLGNPVQKLKVIVDTGSDILWVKC-SPCRSCLSKQDIIPPLSIYNLSASSTSSVS 141
Query: 72 PRCAALHWPNPPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
L C N C Y I Y D +SIGA V D G+ + F
Sbjct: 142 SCSDPLCTGEQAVCSRSGSNSACAYGISYQDKSTSIGAYVKDDMHYVLQGGNATTSHIFF 201
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFL 187
GC N P G++G G+ ++ +Q+ + V HC+G ++G G+L
Sbjct: 202 GCAINITGSWP-----ADGIMGFGQISKTVPNQIATQRNMSRVFSHCLGGEKHGGGILEF 256
Query: 188 GDGKVPSSGVAWTPMLQN---------SADLKHYILGPAELLYSGKSCGLKDLTLIFDSG 238
G+ + ++ + +TP+L S + +L +S S + +I DSG
Sbjct: 257 GE-EPNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPIDSKEFSYVSNSTNETGVIIDSG 315
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLALS 298
S+A ++ + + S I ++L T KL P + L + K+ V F + L+
Sbjct: 316 TSFALLATKANRILFSEI-KNL--TTAKLGPKLEGLQCFY---LKSGLTVETSFPNVTLT 369
Query: 299 FTNRRNSVRLVVPPEAYLVI 318
F+ + + P+ YLV+
Sbjct: 370 FS---GGSTMKLKPDNYLVM 386
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 76/265 (28%), Positives = 112/265 (42%), Gaps = 43/265 (16%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V+ +G PP+ F D+GSDL WVQC +PC C Y P + VPC +
Sbjct: 64 YF-VDFFLGTPPQKFSLIVDSGSDLLWVQC-SPCRQCYAQDSPLYVPSNSSTFSPVPCLS 121
Query: 72 PRCAALHWPN--PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV---P 126
C + P ++P C YE Y D SS G + + +V V
Sbjct: 122 SDCLLIPATEGFPCDFRYPG-ACAYEYLYADTSSSKGVFA-------YESATVDGVRIDK 173
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQ-----N 180
+ FGCG + N G + GVLGLG+G +S SQ+ YG N +C+ +
Sbjct: 174 VAFGCGSD--NQGSFAA--AGGVLGLGQGPLSFGSQVGYAYG---NKFAYCLVNYLDPTS 226
Query: 181 GRGVLFLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL------ 233
L GD + + + +TP++ N Y + ++ GKS + D
Sbjct: 227 VSSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLG 286
Query: 234 ----IFDSGASYAYFTSRVYQEIVS 254
IFDSG + Y+ Y I++
Sbjct: 287 NGGSIFDSGTTLTYWFPSAYSHILA 311
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 75/234 (32%), Positives = 112/234 (47%), Gaps = 33/234 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF+ + VG P K DTGSD+ W+QC+ PC C + + + P + + CS
Sbjct: 162 YFS-RIGVGTPAKEMYLVLDTGSDVNWIQCE-PCADCYQQSDPVFNPTSSSTYKSLTCSA 219
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVPLTFG 130
P+C+ L C+ +++C Y++ YGDG ++G L TD + F N G + NV L G
Sbjct: 220 PQCSLLE---TSACR--SNKCLYQVSYGDGSFTVGELATD--TVTFGNSGKINNVAL--G 270
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR----EYGLIRNVIGHCIGQNGRGVLF 186
CG++ N G + AG+LGLG G +SI +Q++ Y L+ G + V
Sbjct: 271 CGHD--NEGLFTG--AAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQL 326
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGAS 240
G G A P+L+N Y +G + G+ L D IFD AS
Sbjct: 327 GG-------GDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPD--AIFDVDAS 371
>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 627
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 72/247 (29%), Positives = 102/247 (41%), Gaps = 24/247 (9%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTKPPEKQ---YKPHKNI----VPC 69
+ VG P F DTGSDL W+ CD AP +G ++ YKP ++ +PC
Sbjct: 212 VDVGTPNTSFMVALDTGSDLFWIPCDCIECAPLSGYHGSLDRDLGIYKPAESTTSRHLPC 271
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPL--RFSNGSVFNVP 126
S+ C C + C Y +Y + +S G LV D+ L R S+ V
Sbjct: 272 SHELCLL-----GSDCTNQKQPCPYNTKYLQENTTSSGLLVEDILHLDSRESHAPV-KAS 325
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
+ GCG Q L G+LGLG IS+ S L GL+RN C ++ G +F
Sbjct: 326 VIIGCGRKQSG-SYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFTKDS-GRIF 383
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYAYFTS 246
GD V S TP + L+ Y + + K I DSG S+
Sbjct: 384 FGDQGV--STQQSTPFVPLYGKLQTYTVNVDKSCVGHKCFESTSFQAIVDSGTSFTALPL 441
Query: 247 RVYQEIV 253
+Y+ +
Sbjct: 442 DIYKAVA 448
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 70/260 (26%), Positives = 112/260 (43%), Gaps = 29/260 (11%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWP--NPPRCKHP 88
DTGSDL+WVQC PC C + + P K+ V C++ C +L N C
Sbjct: 82 DTGSDLSWVQCQ-PCNRCYNQQDPVFNPSKSPSYRTVLCNSLTCRSLQLATGNSGVCGSN 140
Query: 89 NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAG 148
C+Y + YGDG + G + + L N +V N FGCG + N G +G
Sbjct: 141 PPTCNYVVNYGDGSYTSGEV--GMEHLNLGNTTVNN--FIFGCG--RKNQGLFG--GASG 192
Query: 149 VLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDGKV---PSSGVAWTPM 202
++GLGR +S++SQ+ + V +C+ G L +G ++ +++T M
Sbjct: 193 LVGLGRTDLSLISQISP--MFGGVFSYCLPTTEAEASGSLVMGGNSSVYKNTTPISYTRM 250
Query: 203 LQNSADLKHYILGPAELLYSG---KSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRD 259
+ N L Y L + G ++ +I DSG + +YQ + + ++
Sbjct: 251 IHNPL-LPFYFLNLTGITVGGVEVQAPSFGKDRMIIDSGTVISRLPPSIYQALKAEFVKQ 309
Query: 260 LIGTPLKLAPDDKTLPICWR 279
G P AP L C+
Sbjct: 310 FSGYP--SAPSFMILDSCFN 327
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 63/211 (29%), Positives = 88/211 (41%), Gaps = 34/211 (16%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQC-----DAPCTGCTKPPEKQYKPHKNIVPCSN 71
V + VG PP+ DTGS+L+W+ C DAP Y P VPCS+
Sbjct: 63 LTVPVAVGTPPQNVTMVLDTGSELSWLLCNGSRHDAPFDASAS---SSYAP----VPCSS 115
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L P R + C + Y D S+ G L D F L S +P FGC
Sbjct: 116 PACTWLGRDLPVRPFCDSSACRVSLSYADASSADGLLAADTFLLGSS-----PMPALFGC 170
Query: 132 --GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ-NGRGVLFLG 188
Y+ +PP G+LG+ RG +S V+Q +CI G G+L LG
Sbjct: 171 ITSYSSSTDPSETPP--TGLLGMNRGGLSFVTQ-----TATRRFAYCIAAGQGPGILLLG 223
Query: 189 DGKV-------PSSGVAWTPMLQNSADLKHY 212
P + +TP+++ S L ++
Sbjct: 224 GNDTETPLTSPPQQQLNYTPLVEISQPLPYF 254
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 79/284 (27%), Positives = 123/284 (43%), Gaps = 40/284 (14%)
Query: 1 MYVSWIEFFFFPIFS---YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPE 57
M V ++ P+++ F + + +G P F DTGSDLTW QC PCT C P
Sbjct: 96 MSVDEVKAVEAPVYAGNGEFLMKMAIGTPSLSFSAILDTGSDLTWTQC-KPCTDCYPQPT 154
Query: 58 KQYKPHKNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLF 113
Y P ++ VPCS+ C AL P C+Y YGD S+ G L + F
Sbjct: 155 PIYDPSQSSTYSKVPCSSSMCQAL-----PMYSCSGANCEYLYSYGDQSSTQGILSYESF 209
Query: 114 PLRFSNGSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNV 172
L ++P + FGCG Q N G G++G GRG +S++SQL + + N
Sbjct: 210 TLTSQ-----SLPHIAFGCG--QENEG-GGFSQGGGLVGFGRGPLSLISQLGQS--LGNK 259
Query: 173 IGHCI-----GQNGRGVLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSC 226
+C+ + LF+G + + V+ TP++Q+ + Y L + G+
Sbjct: 260 FSYCLVSITDSPSKTSPLFIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLL 319
Query: 227 GLKDLT----------LIFDSGASYAYFTSRVYQEIVSLIMRDL 260
+ D T +I DSG + Y Y + ++ +
Sbjct: 320 DIADGTFDLQLDGTGGVIIDSGTTVTYLEQSGYDVVKKAVISSI 363
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 65/125 (52%), Gaps = 14/125 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF+ + VG+P K F DTGSD+ W+QC PCT C + + + P + +PC +
Sbjct: 155 YFS-RVGVGQPAKPFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPRSSSSFASLPCES 212
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
+C AL C+ +C Y++ YGDG ++G VT+ L F N + N + GC
Sbjct: 213 QQCQALETSG---CRA--SKCLYQVSYGDGSFTVGEFVTE--TLTFGNSGMIN-DVAVGC 264
Query: 132 GYNQH 136
G++
Sbjct: 265 GHDNE 269
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 121/297 (40%), Gaps = 45/297 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC------TKPPEKQYKPHKNIVPCS 70
V+LTVG PP+ DTGS+L+W+ C+ T + Y+P +PCS
Sbjct: 31 LTVSLTVGTPPQNVSMVIDTGSELSWLYCNKTTTTTSYPTTFNQTRSISYRP----IPCS 86
Query: 71 NPRCA--ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-L 127
+ C + P C N C + Y D SS G L +D F + S ++P +
Sbjct: 87 SSTCTNQTRDFSIPASCDS-NSLCHATLSYADASSSEGNLASDTFHMGAS-----DIPGM 140
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLF 186
FGC + + G++G+ RG +S VSQ+ G + +CI G + G+L
Sbjct: 141 VFGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQM---GFPK--FSYCISGTDFSGMLL 195
Query: 187 LGDGKVP-SSGVAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGLKDLT-- 232
LG+ + + +TP++Q S L ++ I LL KS D T
Sbjct: 196 LGESNFTWAVPLNYTPLVQISTPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGA 255
Query: 233 --LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDD----KTLPICWRGPFK 283
+ DSG + + Y + S + G L D + +C+R P
Sbjct: 256 GQTMVDSGTQFTFLLGPAYTALRSEFLNQTTGFLRVLEDPDFVFQGAMDLCYRVPIS 312
>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 537
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 92/334 (27%), Positives = 129/334 (38%), Gaps = 43/334 (12%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT----------KPPEKQYKPHKN--- 65
+ VG P F DTGSDL WV CD C C P + Y P K+
Sbjct: 109 AEVAVGTPNATFLVALDTGSDLFWVPCD--CKQCAPIANASDLRGGPDLRPYSPGKSSTS 166
Query: 66 -IVPCSNPRCAALHWPNP-PRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPL-RFSNG- 120
V C + C PN + + C Y + Y SS G LV D+ L R + G
Sbjct: 167 KAVTCEHALC---ERPNACAAAGNSSTSCPYTVRYVSANTSSSGVLVEDVLHLSREAAGG 223
Query: 121 --SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLI-RNVIGHCI 177
+ P+ GCG Q L G+LGLG ++S+ S L GL+ + C
Sbjct: 224 ASTAVTAPVVLGCGQVQTG-AFLDGAAVDGLLGLGMDKVSVPSVLHAAGLVASDSFSMCF 282
Query: 178 GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDS 237
+G G + GD G A TP + Y + + SGK + I DS
Sbjct: 283 SPDGFGRINFGDSG--RRGQAETPFTVRNTH-PTYNISVTAMSVSGKEVA-AEFAAIVDS 338
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPI--CWRGPFKALGQ-VTEYFKP 294
G S+ Y Y E+ + ++ L+ ++P C+ LG+ TE F P
Sbjct: 339 GTSFTYLNDPAYTELATGFNSEVRERRANLS---ASIPFEYCYE-----LGRGQTELFVP 390
Query: 295 LALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIA 328
+S T R +V V P + S I+ A
Sbjct: 391 -EVSLTTRGGAVFPVTRPIVVIYGETSDGRIVAA 423
>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 320
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/169 (32%), Positives = 77/169 (45%), Gaps = 10/169 (5%)
Query: 98 YGDGGSSIGALVTDLFPLRFSNGS----VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLG 153
YGDG S+ G LV D+ L G+ N + FGCG Q S G++G G
Sbjct: 2 YGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFG 61
Query: 154 RGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSA----DL 209
+ S +SQL G ++ HC+ N G +F G+V S V TPML SA +L
Sbjct: 62 QSNSSFISQLASQGKVKRSFAHCLDNNNGGGIF-AIGEVVSPKVKTTPMLSKSAHYSVNL 120
Query: 210 KHYILGPAEL-LYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIM 257
+G + L L S D +I DSG + Y VY +++ I+
Sbjct: 121 NAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEIL 169
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 75/234 (32%), Positives = 112/234 (47%), Gaps = 33/234 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF+ + VG P K DTGSD+ W+QC+ PC C + + + P + + CS
Sbjct: 162 YFS-RIGVGTPAKDMYLVLDTGSDVNWIQCE-PCADCYQQSDPVFNPTSSSTYKSLTCSA 219
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVPLTFG 130
P+C+ L C+ +++C Y++ YGDG ++G L TD + F N G + NV L G
Sbjct: 220 PQCSLLE---TSACR--SNKCLYQVSYGDGSFTVGELATD--TVTFGNSGKINNVAL--G 270
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR----EYGLIRNVIGHCIGQNGRGVLF 186
CG++ N G + AG+LGLG G +SI +Q++ Y L+ G + V
Sbjct: 271 CGHD--NEGLFTG--AAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQL 326
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGAS 240
G G A P+L+N Y +G + G+ L D IFD AS
Sbjct: 327 GG-------GDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPD--AIFDVDAS 371
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 58/125 (46%), Gaps = 13/125 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF + VG PP+ DTGSD+ W+QC APC C + + P K+ + C +
Sbjct: 126 YF-TRIGVGTPPRYVYMVLDTGSDIVWIQC-APCKRCYAQSDPVFDPRKSRSFASIACRS 183
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C H + P C C Y++ YGDG + G T+ L F V V L GC
Sbjct: 184 PLC---HRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTE--TLTFRRTRVARVAL--GC 236
Query: 132 GYNQH 136
G++
Sbjct: 237 GHDNE 241
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 79/284 (27%), Positives = 131/284 (46%), Gaps = 33/284 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ ++ +VG PP DTGSD+ W+QC+ PC C ++ P K+ + CS+
Sbjct: 87 YIMSYSVGTPPIKSYGIVDTGSDIVWLQCE-PCEQCYNQTTPKFNPSKSSSYKNISCSSK 145
Query: 73 RCAALHWPNPPRCKHPNDQ--CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-F 129
C ++ R ND+ C+Y I YG+ S G L + L + G + P T
Sbjct: 146 LCQSV------RDTSCNDKKNCEYSINYGNQSHSQGDLSLETLTLESTTGRPVSFPKTVI 199
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR-------EYGLIRNVIGHCIGQNGR 182
GCG N N G ++GV+GLG G S+++QL Y L+R I G
Sbjct: 200 GCGTN--NIGSF-KRVSSGVVGLGGGPASLITQLGPSIGGKFSYCLVRMSITLKNMSMGS 256
Query: 183 GVLFLGDGKVPS-SGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIF 235
L GD + S V TP+++ +Y+ +G + ++G S G+++ +I
Sbjct: 257 SKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGDKRVEFAGSSKGVEEGNIII 316
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR 279
DS + S VY ++ S I+ DL+ T ++ ++ +C+
Sbjct: 317 DSSTIVTFVPSDVYTKLNSAIV-DLV-TLERVDDPNQQFSLCYN 358
>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 516
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 73/257 (28%), Positives = 101/257 (39%), Gaps = 31/257 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCT--------------KPPEKQ 59
F +FA N++VG PP F DTGSDL W+ CD C C +
Sbjct: 103 FLHFA-NVSVGTPPLWFLVALDTGSDLFWLPCD--CISCVHGGLRTRTGKILKFNTYDLD 159
Query: 60 YKPHKNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDLFPLRFS 118
N V C+N + +C C Y+++Y + SS G +V D+ L
Sbjct: 160 KSSTSNEVSCNN----STFCRQRQQCPSAGSTCRYQVDYLSNDTSSRGFVVEDVLHLITD 215
Query: 119 NGSV--FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
+ + + FGCG Q L+ G+ GLG IS+ S L GLI N C
Sbjct: 216 DDQTKDADTRIAFGCGQVQTGVF-LNGAAPNGLFGLGMDNISVPSILAREGLISNSFSMC 274
Query: 177 IGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLK-HYILGPAELLYSGKSCGLKDLTLIF 235
G + G + GD P TP N L Y + +++ L+ IF
Sbjct: 275 FGSDSAGRITFGDTGSPDQ--RKTPF--NVRKLHPTYNITITKIIVEDSVADLE-FHAIF 329
Query: 236 DSGASYAYFTSRVYQEI 252
DSG S+ Y Y I
Sbjct: 330 DSGTSFTYINDPAYTRI 346
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 62/125 (49%), Gaps = 12/125 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF+ + +G P + DTGSD+TWVQC PC C + + + P + V C +
Sbjct: 169 YFS-RVGIGSPARELYMVLDTGSDVTWVQCQ-PCADCYQQSDPVFDPSLSASYAAVSCDS 226
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
PRC L + C++ C YE+ YGDG ++G T+ L S V NV + GC
Sbjct: 227 PRCRDL---DTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDST-PVTNVAI--GC 280
Query: 132 GYNQH 136
G++
Sbjct: 281 GHDNE 285
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 72/261 (27%), Positives = 112/261 (42%), Gaps = 45/261 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +NL++G PP F DTGS L W QC APCT C P ++P + +PC++
Sbjct: 90 YNMNLSIGTPPVTFSVLADTGSSLIWTQC-APCTECAARPAPPFQPASSSTFSKLPCASS 148
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C + P C Y YG G ++ G L T+ + G+ F + FGC
Sbjct: 149 LC---QFLTSPYLTCNATGCVYYYPYGMGFTA-GYLATETLHV---GGASFP-GVAFGCS 200
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG----VLFLG 188
++ G ++G++GLGR +S+VSQ+ G+ R +C+ + +LF
Sbjct: 201 -TENGVG----NSSSGIVGLGRSPLSLVSQV---GVGR--FSYCLRSDADAGDSPILFGS 250
Query: 189 DGKVPSSGVAWTPMLQN---------SADLKHYILGPAEL--------LYSGKSCGLKDL 231
KV V TP+L+N +L +G +L G GL
Sbjct: 251 LAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLVGG 310
Query: 232 TLIFDSGASYAYFTSRVYQEI 252
T++ DSG + Y Y +
Sbjct: 311 TIV-DSGTTLTYLVKEGYAMV 330
>gi|302774304|ref|XP_002970569.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
gi|300162085|gb|EFJ28699.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
Length = 490
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 66/239 (27%), Positives = 107/239 (44%), Gaps = 25/239 (10%)
Query: 92 CD----YEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFGCGYNQHNPGPLSPPDT 146
CD Y+ +Y + +S G L D+ + FSN S + L FGC G L
Sbjct: 97 CDGSRKYQRQYAEKSTSSGVLGKDV--ISFSNSSDLGGQRLVFGC--ETAETGDLYDQTA 152
Query: 147 AGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLGDGKVPSSGVAWTPMLQ 204
G++GLGRG +SI+ QL E + +V C G G G + LG + P V +
Sbjct: 153 DGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQPPKDMVFTSSDPH 212
Query: 205 NSADLKHYILGPAELLYSGKSCGLK------DLTLIFDSGASYAYFTSRVYQEIVSLIMR 258
S +Y L + G LK + DSG +YAYF +Q S + +
Sbjct: 213 RSP---YYNLMLKGIRVGGSPLRLKPEVFDGKYGTVLDSGTTYAYFPGAAFQAFKSAV-K 268
Query: 259 DLIGTPLKL-APDDKTLPICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
+ +G+ ++ PD+K IC+ G + ++++F + F + ++ + + PE YL
Sbjct: 269 EQVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFGDGQS---VTLSPENYL 324
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 62/124 (50%), Gaps = 10/124 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ + +VG PP DTGSD+ W+QC+ PC C K + P K+ +PCS+
Sbjct: 91 YLMRYSVGSPPFQVLGIVDTGSDILWLQCE-PCEDCYKQTTPIFDPSKSKTYKTLPCSSN 149
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
C +L C N C+Y I+YGDG S G L + L ++GS + P T GC
Sbjct: 150 TCESLR---NTACSSDN-VCEYSIDYGDGSHSDGDLSVETLTLGSTDGSSVHFPKTVIGC 205
Query: 132 GYNQ 135
G+N
Sbjct: 206 GHNN 209
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 69/267 (25%), Positives = 116/267 (43%), Gaps = 36/267 (13%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSN 71
S + +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 80 SLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGT 137
Query: 72 PRCAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LT 128
C L + P C+ + C + + Y DG +S G L D L FS+ V +P T
Sbjct: 138 SMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPSFT 191
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL- 185
FGC + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 192 FGCNLDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSERGFFS 246
Query: 186 ----FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIF 235
+ GKV + + V +T M+ + + + + A + G+ GL ++F
Sbjct: 247 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 306
Query: 236 DSGASYAYFTSR----VYQEIVSLIMR 258
DSG+ +Y R + Q I L++R
Sbjct: 307 DSGSELSYIPDRALSVLSQRIRELLLR 333
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 79/304 (25%), Positives = 127/304 (41%), Gaps = 43/304 (14%)
Query: 26 PPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRC--AALHW 79
PP+ DTGS+L+W++C+ P + P ++ +PCS+P C +
Sbjct: 82 PPQNISMVIDTGSELSWLRCNR---SSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDF 138
Query: 80 PNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPG 139
P C + C + Y D SS G L ++F F N S + L FGC +
Sbjct: 139 LIPASCD-SDKLCHATLSYADASSSEGNLAAEIF--HFGN-STNDSNLIFGCMGSVSGSD 194
Query: 140 PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR--GVLFLGDGKVP-SSG 196
P T G+LG+ RG +S +SQ+ G + +CI G L LGD +
Sbjct: 195 PEEDTKTTGLLGMNRGSLSFISQM---GFPK--FSYCISGTDDFPGFLLLGDSNFTWLTP 249
Query: 197 VAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGLKDLT----LIFDSGASY 241
+ +TP+++ S L ++ I +LL KS L D T + DSG +
Sbjct: 250 LNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTMVDSGTQF 309
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK-----TLPICWR-GPFKALGQVTEYFKPL 295
+ VY + S + G L + D + T+ +C+R PF+ + +
Sbjct: 310 TFLLGPVYTALRSDFLNQTNGI-LTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLPTV 368
Query: 296 ALSF 299
+L F
Sbjct: 369 SLVF 372
>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
Length = 443
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 71/147 (48%), Gaps = 12/147 (8%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALH 78
+G P L DTGS+L W+QC PCT C + P ++ V +P C A+
Sbjct: 63 LGVPSTLVYGIADTGSELIWLQC-LPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVR 121
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCGYNQHN 137
+ C+ + C Y+ YGDG ++ G L TD+F ++ V LTFGC H+
Sbjct: 122 RIS---CREGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGC---SHD 175
Query: 138 PGPLSPPDTAGVLGLGRGRISIVSQLR 164
AGV+GL R S+VSQL+
Sbjct: 176 TKARLKGHQAGVVGLNRHPNSLVSQLK 202
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 67/190 (35%), Positives = 88/190 (46%), Gaps = 32/190 (16%)
Query: 35 DTGSDLTWVQCDAPCTG--CTKPPEKQYKPHKN----IVPCSNPRCAAL---HWPNPPRC 85
DTGSDLTWVQC+ PC G C + + P + VPC +P CAA P C
Sbjct: 199 DTGSDLTWVQCE-PCPGSSCYAQRDPLFDPAASPTFAAVPCGSPACAASLKDATGAPGSC 257
Query: 86 K----HPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCGYNQHNPGP 140
+ +C Y + YGDG S G L D L G+ + FGCG + N G
Sbjct: 258 ARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGL----GTTTKLDGFVFGCGLS--NRGL 311
Query: 141 LSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDGKVPSSG-- 196
TAG++GLGR +S+VSQ V +C+ G L LG G PSS
Sbjct: 312 FG--GTAGLMGLGRTDLSLVSQ--TAARFGGVFSYCLPATTTSTGSLSLGPG--PSSSFP 365
Query: 197 -VAWTPMLQN 205
+A+T M+ +
Sbjct: 366 NMAYTRMIAD 375
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 59/125 (47%), Gaps = 13/125 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF + VG P + DTGSD+ W+QC APC C + + P K+ +PC
Sbjct: 129 YF-TRIGVGTPARYVYMVLDTGSDVVWLQC-APCRKCYTQADPVFDPTKSRTYAGIPCGA 186
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L + P C + N C Y++ YGDG + G T+ L F V V L GC
Sbjct: 187 PLCRRL---DSPGCNNKNKVCQYQVSYGDGSFTFGDFSTE--TLTFRRTRVTRVAL--GC 239
Query: 132 GYNQH 136
G++
Sbjct: 240 GHDNE 244
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 88/213 (41%), Gaps = 27/213 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP-----EKQYKPHKNIVPCSN 71
V + VG PP+ DTGS+L+W+ C+ G PP VPC +
Sbjct: 55 LTVPVAVGTPPQNVTMVLDTGSELSWLLCN----GSYAPPLTPAFNASGSSSYGAVPCPS 110
Query: 72 PRCA--ALHWPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
C P PP C P++ C + Y D S+ G L TD F L V
Sbjct: 111 TACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTF-LLTGGAPPVAVGAY 169
Query: 129 FGC--------GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-Q 179
FGC N + G G+LG+ RG +S V+Q G R +CI
Sbjct: 170 FGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQT---GTRR--FAYCIAPG 224
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY 212
G GVL LGD + + +TP+++ S L ++
Sbjct: 225 EGPGVLLLGDDGGVAPPLNYTPLIEISQPLPYF 257
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 58/190 (30%), Positives = 85/190 (44%), Gaps = 23/190 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
F V++ G PP+ F DTGS +TW QC PC C K + + P A+
Sbjct: 162 FLVDVAFGTPPQKFTLILDTGSSITWTQC-KPCVRCLKASRRHFDPS-----------AS 209
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
L + + C Y + YGD +S+G D L S+ VF FGCG N
Sbjct: 210 LTY-SLGSCIPSTVGNTYNMTYGDKSTSVGNYGCDTMTLEHSD--VF-PKFQFGCGRN-- 263
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDGKVP-S 194
N G G+LGLG+G++S VSQ + V +C+ ++ G L G+ S
Sbjct: 264 NEGDFG-SGADGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEEDSIGSLLFGEKATSQS 320
Query: 195 SGVAWTPMLQ 204
S + +T ++
Sbjct: 321 SSLKFTSLVN 330
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 57/204 (27%), Positives = 84/204 (41%), Gaps = 14/204 (6%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
+YF V + +G P K F FDTGSDLTW QC+ C E + P ++ + C
Sbjct: 152 NYF-VTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEAIFNPSQSTSYANISCG 210
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ C +L + C Y I+YGD SIG + L ++ VFN FG
Sbjct: 211 STLCDSLASATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLSLTATD--VFN-DFYFG 267
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG N + R ++S+VSQ + + +C+ + FL G
Sbjct: 268 CGQNNKGLFGGAAGLLGLG----RDKLSLVSQTAQR--YNKIFSYCLPSSSSSTGFLTFG 321
Query: 191 KVPSSGVAWTPMLQNSADLKHYIL 214
S ++TP+ S Y L
Sbjct: 322 GSTSKSASFTPLATISGGSSFYGL 345
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 60/126 (47%), Gaps = 8/126 (6%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF+ + +G P + DTGSD+TW+QC APC C + + P + VPC +
Sbjct: 196 YFS-RIGIGSPARQLYMVLDTGSDVTWLQC-APCADCYAQSDPLFDPALSSSYATVPCDS 253
Query: 72 PRCAALHWPN-PPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
P C AL + N C YE+ YGDG ++G T+ L +GS + G
Sbjct: 254 PHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLG-GDGSAAVHDVAIG 312
Query: 131 CGYNQH 136
CG++
Sbjct: 313 CGHDNE 318
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 88/213 (41%), Gaps = 27/213 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPP-----EKQYKPHKNIVPCSN 71
V + VG PP+ DTGS+L+W+ C+ G PP VPC +
Sbjct: 55 LTVPVAVGTPPQNVTMVLDTGSELSWLLCN----GSYAPPLTPAFNASGSSSYGAVPCPS 110
Query: 72 PRCA--ALHWPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
C P PP C P++ C + Y D S+ G L TD F L V
Sbjct: 111 TACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTF-LLTGGAPPVAVGAY 169
Query: 129 FGC--------GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-Q 179
FGC N + G G+LG+ RG +S V+Q G R +CI
Sbjct: 170 FGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQT---GTRR--FAYCIAPG 224
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHY 212
G GVL LGD + + +TP+++ S L ++
Sbjct: 225 EGPGVLLLGDDGGVAPPLNYTPLIEISQPLPYF 257
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 67/212 (31%), Positives = 94/212 (44%), Gaps = 27/212 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + VG PP+ D+GSD+ WVQC PCT C + + P + V CS+
Sbjct: 43 YF-VRIGVGSPPRSQYMVIDSGSDIVWVQCK-PCTQCYHQTDPLFDPADSASFMGVSCSS 100
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C + + C + +C YE+ YGDG S+ G L + L G + GC
Sbjct: 101 AVCDQV---DNAGCN--SGRCRYEVSYGDGSSTKGTLALETLTL----GRTVVQNVAIGC 151
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQ---NGRGVLFL 187
G+ N G LG G +S V QL RE G N +C+ N G L
Sbjct: 152 GH--MNQGMFVGAAGLLGLGG--GSMSFVGQLSRERG---NAFSYCLVSRVTNSNGFLEF 204
Query: 188 GDGKVPSSGVAWTPMLQNSADLKHYILGPAEL 219
G +P G AW P+++N +Y +G + L
Sbjct: 205 GSEAMP-VGAAWIPLIRNPHSPSYYYIGLSGL 235
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 51/129 (39%), Positives = 63/129 (48%), Gaps = 14/129 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHK----NIVPCSN 71
+ V + +G P + F FDTGS +TW QC PC G C E+++ P K N V CS+
Sbjct: 135 YVVTVGLGTPKEDFTLVFDTGSGITWTQCQ-PCLGSCYPQKEQKFDPTKSTSYNNVSCSS 193
Query: 72 PRCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C L P R C N C Y+I YGD S G T+ L S+ VF L FG
Sbjct: 194 ASCNLL--PTSERGCSASNSTCLYQIIYGDQSYSQGFFATE--TLTISSSDVFTNFL-FG 248
Query: 131 CGYNQHNPG 139
CG Q N G
Sbjct: 249 CG--QSNNG 255
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 78/260 (30%), Positives = 115/260 (44%), Gaps = 47/260 (18%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHK----NIVPCSN 71
F V + G P + + DTGSD++W+QC PC+G C K + + P K + VPC +
Sbjct: 161 FVVTVGFGSPAQNYTLSIDTGSDVSWIQC-LPCSGHCYKQHDPVFDPTKSATYSAVPCGH 219
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
P+CAA +C + + C Y++ YGDG S+ G L + L S ++P FG
Sbjct: 220 PQCAAAGG----KCSN-SGTCLYKVTYGDGSSTAGVLSHETLSLS----STRDLPGFAFG 270
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG--QNGRGVLFLG 188
CG Q N G G++GLGRG +S+ SQ +C+ G L +G
Sbjct: 271 CG--QTNLGEFG--GVDGLVGLGRGALSLPSQAA--ATFGATFSYCLPSYDTTHGYLTMG 324
Query: 189 DGKVPSSG----VAWTPMLQN------------SADLKHYILGPAELLYSGKSCGLKDLT 232
+S V +T M+Q S D+ YIL +++ +D T
Sbjct: 325 STTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFT------RDGT 378
Query: 233 LIFDSGASYAYFTSRVYQEI 252
L FDSG Y Y +
Sbjct: 379 L-FDSGTILTYLPPEAYASL 397
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 67/258 (25%), Positives = 104/258 (40%), Gaps = 36/258 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQC-DAPCTGCTKPPEKQYKPHKNIVPCSNPRCA 75
V+LT G P + DTGS+L+W+ C P P K +PCS+P C
Sbjct: 67 LTVSLTAGTPLQNITMVLDTGSELSWLHCKKEPNFNSIFNPLASKTYTK--IPCSSPTCE 124
Query: 76 --ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
P P C P C + I Y D S G L + F + GSV FGC
Sbjct: 125 TRTRDLPLPVSCD-PAKLCHFIISYADASSVEGNLAFETFRV----GSVTGPATVFGCMD 179
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQL--REYGLIRNVIGHCIG-QNGRGVLFLGDG 190
+ + T G++G+ RG +S V+Q+ R++ +CI ++ GVL LG+
Sbjct: 180 SGFSSNSEEDAKTTGLMGMNRGSLSFVNQMGFRKF-------SYCISDRDSSGVLLLGEA 232
Query: 191 KVP-SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------------I 234
+ +TP+++ S L ++ + G K L+L +
Sbjct: 233 SFSWLKPLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTM 292
Query: 235 FDSGASYAYFTSRVYQEI 252
DSG + + VY +
Sbjct: 293 VDSGTQFTFLLGPVYSAL 310
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 128/295 (43%), Gaps = 31/295 (10%)
Query: 6 IEFFFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN 65
+E+ P + + +++G PP DTGSDL WVQC PC C K + P ++
Sbjct: 83 LEYDIIPGGGEYFMRISIGTPPIEVLVIADTGSDLIWVQCQ-PCQECYKQKSPIFNPKQS 141
Query: 66 I----VPCSNPRCAALHWPNPPRCKHP-NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG 120
V C C AL+ H C Y YGD ++G L T+ F + +N
Sbjct: 142 STYRRVLCETRYCNALNSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNN 201
Query: 121 SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--- 177
S+ L FGCG N G +G++GLG G +S++SQL I N +C+
Sbjct: 202 SI--QELAFGCG--NSNGGNFDEV-GSGIVGLGGGSLSLISQLGTK--IDNKFSYCLVPI 254
Query: 178 ---GQNGRGVLFLGDGKVPSSGVAW--TPMLQNSADLKHYI------LGPAELLY--SGK 224
G + GD S + TP++ + +Y+ +G L Y S
Sbjct: 255 LEKSNFSLGKIVFGDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENSRN 314
Query: 225 SCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR 279
++ +I DSG + + S++Y ++ ++ + + G +++ + IC+R
Sbjct: 315 DGNVEKGNIIIDSGTTLTFLDSKLYNKLELVLEKAVEGE--RVSDPNGIFSICFR 367
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 65/125 (52%), Gaps = 15/125 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF + + +GKPP DTGSD++W+QC APC+ C + + + P + + C
Sbjct: 149 YF-LRVGIGKPPSQAYVVLDTGSDVSWIQC-APCSECYQQSDPIFDPISSNSYSPIRCDE 206
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P+C +L C+ N C YE+ YGDG ++G T+ L + +V NV + GC
Sbjct: 207 PQCKSLDL---SECR--NGTCLYEVSYGDGSYTVGEFATETVTL--GSAAVENVAI--GC 257
Query: 132 GYNQH 136
G+N
Sbjct: 258 GHNNE 262
>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 115/265 (43%), Gaps = 36/265 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ ++ +G P K + DTGS ++WV C+ C GC P + V C
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSISWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P TFG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPSFTFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL--- 185
C + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 113 CNLDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSERGFFSKT 167
Query: 186 --FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDS 237
+ GKV + + V +T M+ + + + + A + G+ GL ++FDS
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 238 GASYAYFTSR----VYQEIVSLIMR 258
G+ +Y R + Q I L++R
Sbjct: 228 GSELSYIPDRALSVLSQRIRELLLR 252
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 67/274 (24%), Positives = 111/274 (40%), Gaps = 52/274 (18%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P TFG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPGFTFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVS---------------QLREYGLIRNVIGH 175
C + D G+LG+G G++S++ Q+ E G G+
Sbjct: 113 CNMDSFGANEFGNVD--GLLGMGAGQMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGY 170
Query: 176 CIGQNGRGVLFLGDGKVPS--SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL----- 228
F GK+ + + V +T M+ + + + + + G+ GL
Sbjct: 171 ----------FSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF 220
Query: 229 KDLTLIFDSGASYAYFTSR----VYQEIVSLIMR 258
++FDSG+ +Y R + Q I L++R
Sbjct: 221 SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLR 254
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 126/297 (42%), Gaps = 51/297 (17%)
Query: 17 FAVNLTVGKPP-KLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCS 70
+ ++ ++G PP K+F F DTGSDL W+QC+ PC C + P ++NI PC
Sbjct: 88 YLMSYSIGTPPFKVFGF-VDTGSDLVWLQCE-PCKQCYPQITPIFDPSLSSSYQNI-PCL 144
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF- 129
+ C ++ CD G L + L + G + P T
Sbjct: 145 SDTCHSMR----------TTSCDVR----------GYLSVETLTLDSTTGYSVSFPKTMI 184
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG---QNGRGVLF 186
GCGY N G P ++G++GLG G +S+ SQL I +C+G N L
Sbjct: 185 GCGY--RNTGTFHGP-SSGIVGLGSGPMSLPSQLGT--SIGGKFSYCLGPWLPNSTSKLN 239
Query: 187 LGDGK-VPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIFDSGA 239
GD V G TP+++ A +Y+ +G + + G + G + ++ DSG
Sbjct: 240 FGDAAIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYGGNEGNILIDSGT 299
Query: 240 SYAYFTSRVYQEIVSLIMRDLIGTPLKLAPD-DKTLPICWRGPFKALGQ--VTEYFK 293
++ + VY S + + L+ D + T +C+ + +T +FK
Sbjct: 300 TFTFLPYDVYYRFESAVAEYI---NLEHVEDPNGTFKLCYNVAYHGFEAPLITAHFK 353
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 67/274 (24%), Positives = 111/274 (40%), Gaps = 52/274 (18%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P TFG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPGFTFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVS---------------QLREYGLIRNVIGH 175
C + D G+LG+G G++S++ Q+ E G G+
Sbjct: 113 CNMDSFGANEFGNVD--GLLGMGAGQMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGY 170
Query: 176 CIGQNGRGVLFLGDGKVPS--SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL----- 228
F GK+ + + V +T M+ + + + + + G+ GL
Sbjct: 171 ----------FSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF 220
Query: 229 KDLTLIFDSGASYAYFTSR----VYQEIVSLIMR 258
++FDSG+ +Y R + Q I L++R
Sbjct: 221 SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLR 254
>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 115/265 (43%), Gaps = 36/265 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P TFG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPSFTFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL--- 185
C + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 113 CNLDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSERGFFSKT 167
Query: 186 --FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDS 237
+ GKV + + V +T M+ + + + + A + G+ GL ++FDS
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 238 GASYAYFTSR----VYQEIVSLIMR 258
G+ +Y R + Q I L++R
Sbjct: 228 GSELSYIPDRALSVLSQRIRELLLR 252
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 65/125 (52%), Gaps = 14/125 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF+ + VG P K DTGSD+ W+QC+ PC+ C + + + P + + CS
Sbjct: 162 YFS-RIGVGTPAKEMYLVLDTGSDVNWIQCE-PCSDCYQQSDPVFNPTSSSTYKSLTCSA 219
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P+C+ L C+ +++C Y++ YGDG ++G L TD + F N N + GC
Sbjct: 220 PQCSLLE---TSACR--SNKCLYQVSYGDGSFTVGELATD--TVTFGNSGKIN-DVALGC 271
Query: 132 GYNQH 136
G++
Sbjct: 272 GHDNE 276
>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 531
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 73/282 (25%), Positives = 117/282 (41%), Gaps = 28/282 (9%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP----------PEKQYKPHK 64
S + N++VG PP F DTGSDL W+ C+ T C + P Y P+
Sbjct: 100 SLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTT-CIRDLEDIGVPQSVPLNLYTPNA 158
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+ S+ RC+ +C P C Y+I Y + + G L+ D+ L + ++
Sbjct: 159 STT-SSSIRCSDKRCFGSKKCSSPKSICPYQISYSNSTGTTGTLLQDVLHLATEDENLTP 217
Query: 125 VP--LTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
V +T GCG Q G ++ GVLGLG S+ S L + + + C G+
Sbjct: 218 VKTNVTLGCG--QKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITADSFSMCFGRVI 275
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASY 241
V + G + TP + + A Y L + G G + L FD+G+S+
Sbjct: 276 GNVGRISFGDKGYTDQEETPFI-SVAPSTAYGLNVTGVSVGGDPVGTR-LFAKFDTGSSF 333
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFK 283
+ Y +++ DL+ +DK P+ PF+
Sbjct: 334 THLMEPAYG-VLTKSFDDLV--------EDKRRPVDPELPFE 366
>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 115/265 (43%), Gaps = 36/265 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSASWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P TFG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPSFTFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL--- 185
C + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 113 CNLDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSERGFFSKT 167
Query: 186 --FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDS 237
+ GKV + + V +T M+ + + + + A + G+ GL ++FDS
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 238 GASYAYFTSR----VYQEIVSLIMR 258
G+ +Y R + Q I L++R
Sbjct: 228 GSELSYIPDRALSVLSQRIRELLLR 252
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 84/331 (25%), Positives = 132/331 (39%), Gaps = 51/331 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNP 72
V+LTVG PP+ DTGS+L+W+ C T+ + P + VPC +P
Sbjct: 69 LTVSLTVGSPPQNVTMVLDTGSELSWLHCKK-----TQFLNSVFNPLSSKTYSKVPCLSP 123
Query: 73 RCA--ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C P C C + Y D S G L + F L GS+ FG
Sbjct: 124 TCKTRTRDLTIPVSCD-ATKLCHVIVSYADATSIEGNLAFETFRL----GSLTKPATIFG 178
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGD 189
C + + T G++G+ RG +S V+Q+ G + +CI G + GVL LG+
Sbjct: 179 CMDSGFSSNSEEDSKTTGLIGMNRGSLSFVNQM---GYPK--FSYCISGFDSAGVLLLGN 233
Query: 190 GKVP-SSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL--------------- 233
P +++TP++Q S L ++ + G K L+L
Sbjct: 234 ASFPWLKPLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQT 293
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDK-----TLPICW-----RGPFK 283
+ DSG + + VY + + + G LK+ DD + +C+ R +
Sbjct: 294 MVDSGTQFTFLLGPVYTALKNEFLSQTRGI-LKVLNDDNFVFQGAMDLCYLLDSSRPNLQ 352
Query: 284 ALGQVTEYFKPLALSFTNRRNSVRLVVPPEA 314
L V+ F+ +S + R R VP E
Sbjct: 353 NLPVVSLMFQGAEMSVSGERLLYR--VPGEV 381
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 59/125 (47%), Gaps = 13/125 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF L VG P + DTGSD+ W+QC APC C + + P K+ +PCS+
Sbjct: 142 YF-TRLGVGTPARYVYMVLDTGSDIVWLQC-APCRRCYSQSDPIFDPRKSKTYATIPCSS 199
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L + C C Y++ YGDG ++G T+ L F V V L GC
Sbjct: 200 PHCRRL---DSAGCNTRRKTCLYQVSYGDGSFTVGDFSTET--LTFRRNRVKGVAL--GC 252
Query: 132 GYNQH 136
G++
Sbjct: 253 GHDNE 257
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 115/265 (43%), Gaps = 36/265 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSASWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P TFG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPSFTFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL--- 185
C + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 113 CNLDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSERGFFSKT 167
Query: 186 --FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDS 237
+ GKV + + V +T M+ + + + + A + G+ GL ++FDS
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 238 GASYAYFTSR----VYQEIVSLIMR 258
G+ +Y R + Q I L++R
Sbjct: 228 GSELSYIPDRALSVLSQRIRELLLR 252
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 59/125 (47%), Gaps = 13/125 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF L VG P + DTGSD+ W+QC APC C + + P K+ +PCS+
Sbjct: 142 YF-TRLGVGTPARYVYMVLDTGSDIVWLQC-APCRRCYSQSDPIFDPRKSKTYATIPCSS 199
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L + C C Y++ YGDG ++G T+ L F V V L GC
Sbjct: 200 PHCRRL---DSAGCNTRRKTCLYQVSYGDGSFTVGDFSTET--LTFRRNRVKGVAL--GC 252
Query: 132 GYNQH 136
G++
Sbjct: 253 GHDNE 257
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 59/125 (47%), Gaps = 13/125 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF L VG P + DTGSD+ W+QC APC C + + P K+ +PCS+
Sbjct: 142 YF-TRLGVGTPARYVYMVLDTGSDIVWLQC-APCRRCYSQSDPIFDPRKSKTYATIPCSS 199
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L + C C Y++ YGDG ++G T+ L F V V L GC
Sbjct: 200 PHCRRL---DSAGCNTRRKTCLYQVSYGDGSFTVGDFSTET--LTFRRNRVKGVAL--GC 252
Query: 132 GYNQH 136
G++
Sbjct: 253 GHDNE 257
>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 76/254 (29%), Positives = 103/254 (40%), Gaps = 28/254 (11%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-------YKPH--- 63
F ++A+ +TVG P F DTGSDL W+ C C GCT PP Y P
Sbjct: 96 FLHYAL-VTVGTPGHTFMVALDTGSDLFWLPCQ--CDGCTPPPSSAASAPASFYIPSLSS 152
Query: 64 -KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSNG- 120
VPC++ C C C Y++ Y SS G LV D+ L +
Sbjct: 153 TSQAVPCNSDFCGLR-----KECS-KTSSCPYKMVYVSADTSSSGFLVEDVLYLSTEDTH 206
Query: 121 -SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
+ FGCG Q L G+ GLG IS+ S L + GL N C G+
Sbjct: 207 PQFLKAQIMFGCGEVQ-TGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCFGR 265
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
+G G + GD SS TP+ N + I + G + +++ IFD+G
Sbjct: 266 DGIGRISFGDQG--SSDQEETPLDINQKHPTYAITITG--IAVGNNLMDLEVSTIFDTGT 321
Query: 240 SYAYFTSRVYQEIV 253
S+ Y Y I
Sbjct: 322 SFTYLADPAYTYIT 335
>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 76/254 (29%), Positives = 103/254 (40%), Gaps = 28/254 (11%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-------YKPH--- 63
F ++A+ +TVG P F DTGSDL W+ C C GCT PP Y P
Sbjct: 96 FLHYAL-VTVGTPGHTFMVALDTGSDLFWLPCQ--CDGCTPPPSSAASAPASFYIPSLSS 152
Query: 64 -KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSNG- 120
VPC++ C C C Y++ Y SS G LV D+ L +
Sbjct: 153 TSQAVPCNSDFCGLR-----KECS-KTSSCPYKMVYVSADTSSSGFLVEDVLYLSTEDTH 206
Query: 121 -SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
+ FGCG Q L G+ GLG IS+ S L + GL N C G+
Sbjct: 207 PQFLKAQIMFGCGEVQ-TGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCFGR 265
Query: 180 NGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
+G G + GD SS TP+ N + I + G + +++ IFD+G
Sbjct: 266 DGIGRISFGDQG--SSDQEETPLDINQKHPTYAITITG--IAVGNNLMDLEVSTIFDTGT 321
Query: 240 SYAYFTSRVYQEIV 253
S+ Y Y I
Sbjct: 322 SFTYLADPAYTYIT 335
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 68/143 (47%), Gaps = 24/143 (16%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRC-AALHWPN--PPRCK- 86
DTGSDLTWVQC PC+ C + + P + VPC+ C A+L P C
Sbjct: 181 DTGSDLTWVQCK-PCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCAT 239
Query: 87 -------HPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPG 139
+++C Y + YGDG S G L TD L G FGCG + N G
Sbjct: 240 VGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL----GGASVDGFVFGCGLS--NRG 293
Query: 140 PLSPPDTAGVLGLGRGRISIVSQ 162
TAG++GLGR +S+VSQ
Sbjct: 294 LFG--GTAGLMGLGRTELSLVSQ 314
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 68/143 (47%), Gaps = 24/143 (16%)
Query: 35 DTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRC-AALHWPN--PPRCK- 86
DTGSDLTWVQC PC+ C + + P + VPC+ C A+L P C
Sbjct: 182 DTGSDLTWVQCK-PCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCAT 240
Query: 87 -------HPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPG 139
+++C Y + YGDG S G L TD L G FGCG + N G
Sbjct: 241 VGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL----GGASVDGFVFGCGLS--NRG 294
Query: 140 PLSPPDTAGVLGLGRGRISIVSQ 162
TAG++GLGR +S+VSQ
Sbjct: 295 LFG--GTAGLMGLGRTELSLVSQ 315
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 92/346 (26%), Positives = 137/346 (39%), Gaps = 65/346 (18%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGC-TKPPEKQYKPHKNI--------VPCSNPR 73
VG PP+ + DTGS L W Q CT C K +Q P+ N VPC +
Sbjct: 92 VGDPPQRAEALIDTGSSLIWTQ----CTACLRKVCVRQDLPYFNASSSGSFAPVPCQDKA 147
Query: 74 CAA--LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
CA LH+ C + C + + YG GG IG L TD F + S G+ L FGC
Sbjct: 148 CAGNYLHF-----CAL-DGTCTFRVTYGAGG-IIGFLGTDAFTFQ-SGGAT----LAFGC 195
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
P +G++GLGRGR+S+ SQ + + LF+G
Sbjct: 196 VSFTRFAAPDVLHGASGLIGLGRGRLSLASQTGAKRFSYCLTPYFHNNGASSHLFVGAAA 255
Query: 192 VPSSG---VAWTPMLQNSAD----------LKHYILGPAELLYSGKSCGLKDLT------ 232
S G V +++ D L +G +L + L+++
Sbjct: 256 SLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGFWEG 315
Query: 233 -LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAP----DDKTLPICWRGPFKALGQ 287
+I DSG+ + Y+ ++ + R L G+ L P DD + +C A G
Sbjct: 316 GVIIDSGSPFTSLVEDAYEPLMGELARQLNGS---LVPPPGEDDGGMALC-----VARGD 367
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAY---LVISVSTSIIIIAYL 330
+ L L F+ + + +PPE Y L S + I+ YL
Sbjct: 368 LDRVVPTLVLHFSGGAD---MALPPENYWAPLEKSTACMAIVRGYL 410
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 64/125 (51%), Gaps = 15/125 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF + + +GKPP DTGSD++W+QC APC+ C + + + P + + C
Sbjct: 149 YF-LRVGIGKPPSQAYVVLDTGSDVSWIQC-APCSECYQQSDPIFDPVSSNSYSPIRCDA 206
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P+C +L C+ N C YE+ YGDG ++G T+ L +V NV + GC
Sbjct: 207 PQCKSLDL---SECR--NGTCLYEVSYGDGSYTVGEFATETVTL--GTAAVENVAI--GC 257
Query: 132 GYNQH 136
G+N
Sbjct: 258 GHNNE 262
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 78/326 (23%), Positives = 128/326 (39%), Gaps = 44/326 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK-----PPEKQYKPHKNIVPCSN 71
+ + +G P + DTGSD+ WV+C +PC C PP Y + +
Sbjct: 83 YYTEIGLGNPVQKLKVIVDTGSDILWVKC-SPCRSCLSKQDIIPPLSIYNLSASSTSSVS 141
Query: 72 PRCAALHWPNPPRCKHP--NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
L C N C Y Y D +S+GA V D G+ + F
Sbjct: 142 SCSDPLCTGEEVVCSRSGNNSACAYVSSYQDKSASVGAYVRDDMHYVLHGGNATTSRIFF 201
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGD 189
GC N P+ G++G G ++ +Q+ + V HC+G G L
Sbjct: 202 GCATNITGSWPVD-----GIMGFGLISKTVPNQIATQRNMSRVFSHCLGGEKHGGGILEF 256
Query: 190 GKVP-SSGVAWTPMLQN-----------SADLKHYILGPAELLYSGKSCGLKDLTLIFDS 237
G+ P ++ + +TP+L S + K + P E Y S + +I DS
Sbjct: 257 GEAPNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPIDPKEFSYVRNST--NNTGVIIDS 314
Query: 238 GASYAYFTSR----VYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
G ++ T++ ++QEI SL T KL P + L + K+ + F
Sbjct: 315 GTTFVLLTTKANRMLFQEIKSL-------TTAKLGPKLEGLECFY---LKSGLTMETSFP 364
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVIS 319
+ L+F+ + + P+ YLV++
Sbjct: 365 NVTLTFS---GGSTMKLKPDNYLVMA 387
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 60/222 (27%), Positives = 86/222 (38%), Gaps = 24/222 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP--------HKNIVP 68
+ ++ +VG PP++ D SD W+QC A T P P V
Sbjct: 97 YVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIREVR 156
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG--SSIGALVTDLFPLRFSNGSVFNVP 126
C+N C L P C + C Y YG G ++ G L D F +V
Sbjct: 157 CANRGCQRLV---PQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAF----ATVRADG 209
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLF 186
+ FGC D GV+GLGRG +S VSQL+ + G +LF
Sbjct: 210 VIFGCAVATEG-------DIGGVIGLGRGELSPVSQLQIGRFSYYLAPDDAVDVGSFILF 262
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL 228
L D K +S TP++ + A Y + A + G+ +
Sbjct: 263 LDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAI 304
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 56/192 (29%), Positives = 90/192 (46%), Gaps = 20/192 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ V + G P + + DTGS L+W+QC C + + P + + C++
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSS 177
Query: 73 RCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTF 129
+C++L N P C+ ++ C Y YGD S+G L DL L S +P +
Sbjct: 178 QCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ----TLPGFVY 233
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL-REYGLIRNVIGHCIGQNGRGVLFLG 188
GCG Q + G AG+LGLGR ++S++ Q+ ++G +C+ G G FL
Sbjct: 234 GCG--QDSDGLFG--RAAGILGLGRNKLSMLGQVSSKFGY---AFSYCLPTRGGGG-FLS 285
Query: 189 DGKVPSSGVAWT 200
GK +G A+
Sbjct: 286 IGKASLAGSAYN 297
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 75/252 (29%), Positives = 105/252 (41%), Gaps = 35/252 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + VG PP+ D+GSD+ WVQC PCT C + + P + V CS+
Sbjct: 140 YF-VRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCTQCYHQSDPVFDPADSASFTGVSCSS 197
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L C +C YE+ YGDG + G L L L F V +V + GC
Sbjct: 198 SVCDRLENAG---CH--AGRCRYEVSYGDGSYTKGTLA--LETLTFGRTMVRSVAI--GC 248
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G+ N G LG G +S V QL G +C+ G + G L G
Sbjct: 249 GH--RNRGMFVGAAGLLGLGG--GSMSFVGQLG--GQTGGAFSYCLVSRGTDSSGSLVFG 302
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSG----------KSCGLKDLTLIFDSG 238
+P +G AW P+++N Y +G A L G + L D ++ D+G
Sbjct: 303 REALP-AGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTG 361
Query: 239 ASYAYFTSRVYQ 250
+ + YQ
Sbjct: 362 TAVTRLPTLAYQ 373
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 74/294 (25%), Positives = 122/294 (41%), Gaps = 28/294 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + LT+G PP DTGSDL W QC PC GC + ++P ++ +PC +
Sbjct: 82 YLMKLTLGSPPVDIYGLVDTGSDLVWAQC-TPCGGCYRQKSPMFEPLRSKTYSPIPCESE 140
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFGC 131
+C+ + C P C Y Y D + G L + ++G V + FGC
Sbjct: 141 QCSFFGY----SCS-PQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVGDIIFGC 195
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRN----VIGHCIGQNGRGVLF 186
G++ N G + D ++G+G G +S+VSQ+ YG R V H + F
Sbjct: 196 GHS--NSGTFNENDMG-IIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTINF 252
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIFDSGAS 240
+ V GV TP+ + + +G + ++ L ++ DSG
Sbjct: 253 GEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSSET-LSKGNIMIDSGTP 311
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV-TEYFK 293
Y Y+ +V + P++ PD T +C+R G + T +F+
Sbjct: 312 ATYIPQEFYERLVEELKVQSSLLPIEDDPDLGT-QLCYRSETNLEGPILTAHFE 364
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 66/266 (24%), Positives = 116/266 (43%), Gaps = 33/266 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSN 71
S + +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 80 SLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGT 137
Query: 72 PRCAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LT 128
C L + P C+ + C + + Y DG +S G L D L FS+ V +P +
Sbjct: 138 SMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPGFS 191
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL- 185
FGC + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 192 FGCNMDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKSERGFFS 246
Query: 186 ----FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIF 235
+ GKV + + V +T M+ + + + + + G+ GL ++F
Sbjct: 247 KTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVF 306
Query: 236 DSGASYAYFTSRVYQEIVSLIMRDLI 261
DSG+ +Y R ++S +R+L+
Sbjct: 307 DSGSELSYIPDRAL-SVLSQRIRELL 331
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 81/285 (28%), Positives = 120/285 (42%), Gaps = 45/285 (15%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI--------VPCSNPR- 73
+G PP+ + DTGSDL W QC C K KQ P+ N+ VPC++
Sbjct: 92 IGSPPQRTEALIDTGSDLIWTQCATTCL--PKSCAKQGLPYYNLSQSSTFVPVPCADKAG 149
Query: 74 -CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC- 131
CAA N + C + YG G IG+L T+ F F +G+ L FGC
Sbjct: 150 FCAA----NGVHLCGLDGSCTFIASYG-AGRVIGSLGTESFA--FESGT---TSLAFGCV 199
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
+ G L+ D +G++GLGRGR+S+VSQ+ + + LF+G
Sbjct: 200 SLTRITSGALN--DASGLIGLGRGRLSLVSQIGATRFSYCLTPYFHSSGASSHLFVGASA 257
Query: 192 VPSSGVAWTPMLQNSADLKH---YILGPAELLYSGK---------SCGLKDL-------T 232
G A P +++ D + Y L P E + GK + L+ L
Sbjct: 258 SLGGGGASMPFVKSPKDYPYSTFYYL-PLEGITVGKTRLPAVNSTTFQLRQLFKGYWAGG 316
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPIC 277
+I D+G+ S Y+ + + L L AP+D L +C
Sbjct: 317 VIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLELC 361
>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 75/283 (26%), Positives = 118/283 (41%), Gaps = 53/283 (18%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP---------HKNIV 67
V+L +G PP+ D DTGS L+W+QC PP + K +++
Sbjct: 66 LVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLSSSFSLL 125
Query: 68 PCSNPRCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV 125
PC++P C + P C N C Y Y DG + G LV + F + S+
Sbjct: 126 PCNHPICKPRIPDFTLPTSCDQ-NRLCHYSYFYADGTLAEGNLVREKFTF---SKSLSTP 181
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNG 181
P+ GC + + G+LG+ GR+S +SQ + + +C+ G N
Sbjct: 182 PVILGCAQ--------ASTENRGILGMNHGRLSFISQAK-----ISKFSYCVPSRTGSNP 228
Query: 182 RGVLFLGDGKVPSSGVAWTPML-----QNSADLK--HYILGPAELLYSGK---------- 224
G+ +LGD SS + ML Q+S +L Y L + +GK
Sbjct: 229 TGLFYLGDNP-NSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFK 287
Query: 225 -SCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLK 266
G T+I DSG+ Y Y+++ ++R L+G +K
Sbjct: 288 PDAGGSGQTMI-DSGSDLTYLVDEAYEKVKEEVVR-LVGAMMK 328
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 69/273 (25%), Positives = 118/273 (43%), Gaps = 44/273 (16%)
Query: 15 SYFAVNLTVGKP-PKLFDFDFDTGSDLTWVQCDAPCTGCTKP---PEKQYKPHKN----I 66
S + V++ +G P P+ F DTGSDLTW+ C+ C C KP P + ++ + +
Sbjct: 117 SQYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRANDSSSFRT 176
Query: 67 VPCSNPRCAA--LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS--- 121
+PCS+ C + + C +PN C ++ Y +G +IG + + ++
Sbjct: 177 IPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTVGLNDHKKIR 236
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---- 177
+F+V + +N+ N P GV+GLG + S+ +L E + N +C+
Sbjct: 237 LFDVLIGCTESFNETNGFP------DGVMGLGYRKHSLALRLAE--IFGNKFSYCLVDHL 288
Query: 178 -GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT---- 232
N + L GD +P + P +Q++ L YI + SG S G L+
Sbjct: 289 SSSNHKNFLSFGD--IPEMKL---PKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSD 343
Query: 233 ---------LIFDSGASYAYFTSRVYQEIVSLI 256
+I DSG S Y ++V +
Sbjct: 344 IWNVTGVGGMIVDSGTSLTMLAGEAYDKVVDAL 376
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 91/205 (44%), Gaps = 25/205 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKP-PEKQYKPHKNIVPCSNPR 73
+LT+G PP+ DTGS+L+W++C T P K Y +PCS+
Sbjct: 67 LTASLTIGTPPQNITMVLDTGSELSWLRCKKEPNFTSIFNPLASKTYTK----IPCSSQT 122
Query: 74 CAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C P C P C + I Y D S G L + F RF GS+ FGC
Sbjct: 123 CKTRTSDLTLPVTCD-PAKLCHFIISYADASSVEGHLAFETF--RF--GSLTRPATVFGC 177
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL--REYGLIRNVIGHCI-GQNGRGVLFLG 188
+ + T G++G+ RG +S V+Q+ R++ +CI G + G L LG
Sbjct: 178 MDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQMGFRKF-------SYCISGLDSTGFLLLG 230
Query: 189 DGKVP-SSGVAWTPMLQNSADLKHY 212
+ + + +TP++Q S L ++
Sbjct: 231 EARYSWLKPLNYTPLVQISTPLPYF 255
>gi|449517142|ref|XP_004165605.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 430
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 75/283 (26%), Positives = 111/283 (39%), Gaps = 29/283 (10%)
Query: 58 KQYKPH----KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEY-GDGGSSIGALVTDL 112
Y P+ + VPC++ C RC + C YE+ Y SSIG LV D+
Sbjct: 4 NHYSPNDSTTSSTVPCTSSLCN--------RCTSNQNVCPYEMRYLSANTSSIGYLVEDV 55
Query: 113 FPLRFSNGSV--FNVPLTFGCGYNQHNP-GPLSPPDTAGVLGLGRGRISIVSQLREYGLI 169
L + + +TFGCG Q + P+ G++GLG +IS+ S L + GL
Sbjct: 56 LHLATDDSLLKPVEAKITFGCGTVQTGIFATTAAPN--GLIGLGMEKISVPSFLADQGLT 113
Query: 170 RNVIGHCIGQNGRGVLFLGD-GKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL 228
N C G +G G + GD G + ML+ + + ++ G
Sbjct: 114 SNSFSMCFGADGYGRIDFGDTGPADQKQTPFNTMLEYQSYNVTF-----NVINVGGEPND 168
Query: 229 KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV 288
T IFDSG S+ Y T Y I + + L + C+ P A
Sbjct: 169 VPFTAIFDSGTSFTYLTEPAYSTITKQMDAGMKLKRYSLFGPNFPFEYCYEIPPGA---- 224
Query: 289 TEYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYLT 331
+ F+ L L+FT + +L + VST II T
Sbjct: 225 -KEFQYLTLNFTMKGGDEFTPTDIFVFLPVDVSTMNIIFEETT 266
>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 115/265 (43%), Gaps = 36/265 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P TFG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPSFTFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL--- 185
C + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 113 CNLDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSERGFFSKT 167
Query: 186 --FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDS 237
+ GKV + + V +T M+ + + + + A + G+ GL ++FDS
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 238 GASYAYFTSR----VYQEIVSLIMR 258
G+ +Y R + Q I L++R
Sbjct: 228 GSELSYIPDRALSVLSQRIRELLLR 252
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 79/269 (29%), Positives = 112/269 (41%), Gaps = 38/269 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L +G PP F DTGSDLTW QC PC C Y + VPC++
Sbjct: 95 YLMELAIGTPPVPFVALADTGSDLTWTQCK-PCKLCFPQDTPIYDTAASASFSPVPCASA 153
Query: 73 RCAALHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP----- 126
C + W + C C Y Y DG S G L T+ L F+ GS P
Sbjct: 154 TCLPI-WRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTET--LTFA-GSSPGAPGPGVS 209
Query: 127 ---LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRG 183
+ FGCG + G LS ++ G +GLGRG +S+V+QL + G
Sbjct: 210 VGGVAFGCGVDN---GGLS-YNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLGSP 265
Query: 184 VLF--LGDGKVPSS----GVAWTPMLQNSADLKHYI-------LGPAELLYSGKSCGLKD 230
VLF L + PS+ V TP++Q + Y LG A L + L+D
Sbjct: 266 VLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRD 325
Query: 231 L---TLIFDSGASYAYFTSRVYQEIVSLI 256
+I DSG + ++ +V+ +
Sbjct: 326 DGSGGMIVDSGTIFTVLVESAFRVVVNHV 354
>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 115/265 (43%), Gaps = 36/265 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P TFG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPSFTFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL--- 185
C + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 113 CNLDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSERGFFSKT 167
Query: 186 --FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDS 237
+ GKV + + V +T M+ + + + + A + G+ GL ++FDS
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 238 GASYAYFTSR----VYQEIVSLIMR 258
G+ +Y R + Q I L++R
Sbjct: 228 GSELSYIPDRALSVLSQRIRELLLR 252
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 81/303 (26%), Positives = 123/303 (40%), Gaps = 41/303 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG---CTKPPEKQYKPHKN----IVPC 69
+ V ++G P + DTGSDL+WVQC PC+ C + + P ++ VPC
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCSAAPSCYSQKDPLFDPAQSSSYAAVPC 198
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
P CA L QC Y + YGDG ++ G +D L S+ F
Sbjct: 199 GGPVCAGLGIYA--ASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQG---FFF 253
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVL 185
GCG+ Q G + D G+LGLGR + S+V Q G V +C+ G L
Sbjct: 254 GCGHAQS--GLFNGVD--GLLGLGREQPSLVEQ--TAGTYGGVFSYCLPTKPSTAGYLTL 307
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL---------IFD 236
LG + G + T +L + +Y+ ++ +G S G + L++ + D
Sbjct: 308 GLGGPSGAAPGFSTTQLLPSPNAPTYYV-----VMLTGISVGGQQLSVPASAFAGGTVVD 362
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFKPLA 296
+G Y + S + AP + L C+ F G VT +A
Sbjct: 363 TGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYN--FAGYGTVT--LPNVA 418
Query: 297 LSF 299
L+F
Sbjct: 419 LTF 421
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 65/125 (52%), Gaps = 14/125 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF+ + +G PPK DTGSD+ WVQC APC C + + ++P + + C
Sbjct: 155 YFS-RVGIGSPPKHVYMVVDTGSDVNWVQC-APCADCYQQADPIFEPSFSSSYAPLTCET 212
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
+C +L + C+ ND C YE+ YGDG ++G T+ L S S+ NV + GC
Sbjct: 213 HQCKSL---DVSECR--NDSCLYEVSYGDGSYTVGDFATETITLDGS-ASLNNVAI--GC 264
Query: 132 GYNQH 136
G++
Sbjct: 265 GHDNE 269
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 63/125 (50%), Gaps = 15/125 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF+ + +G+PP DTGSD++WVQC APC C + + ++P + + C
Sbjct: 151 YFS-RVGIGRPPSPVYMVLDTGSDVSWVQC-APCAECYEQTDPXFEPTSSASFTSLSCET 208
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
+C +L + C+ N C YE+ YGDG ++G VT+ L GS + GC
Sbjct: 209 EQCKSL---DVSECR--NGTCLYEVSYGDGSYTVGDFVTETVTL----GSTSLGNIAIGC 259
Query: 132 GYNQH 136
G+N
Sbjct: 260 GHNNE 264
>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 115/265 (43%), Gaps = 36/265 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P TFG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPSFTFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL--- 185
C + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 113 CNLDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSERGFFSKT 167
Query: 186 --FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDS 237
+ GKV + + V +T M+ + + + + A + G+ GL ++FDS
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 238 GASYAYFTSR----VYQEIVSLIMR 258
G+ +Y R + Q I L++R
Sbjct: 228 GSELSYIPDRALSVLSQRIRELLLR 252
>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 114/265 (43%), Gaps = 36/265 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ ++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P TFG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPSFTFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL--- 185
C + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 113 CNLDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSERGFFSKT 167
Query: 186 --FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDS 237
+ GKV + + V +T M+ + + + + A + G+ GL ++FDS
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 238 GASYAYFTSR----VYQEIVSLIMR 258
G+ +Y R + Q I L++R
Sbjct: 228 GSELSYIPDRALSVLSQRIRELLLR 252
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 47/121 (38%), Positives = 60/121 (49%), Gaps = 15/121 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKN----IVPCSN 71
F V + G P + FDTGSDL+W+QC PC+G C K + + P K+ +VPC
Sbjct: 112 FVVVVGFGSPAQTSATMFDTGSDLSWIQCQ-PCSGHCYKQHDPVFDPAKSSSYAVVPCGT 170
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
CAA C C Y +EYGDG S+ G L + L FS+ S F FGC
Sbjct: 171 TECAAAGG----ECN--GTTCVYGVEYGDGSSTTGVLARET--LTFSSSSEFT-GFIFGC 221
Query: 132 G 132
G
Sbjct: 222 G 222
>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 66/265 (24%), Positives = 114/265 (43%), Gaps = 36/265 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P +FG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPGFSFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL--- 185
C + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 113 CNMDSFGANEFGNVD--GLLGMGAGAMSV---LKQSSPTFDCFSYCLPLQKSERGFFSKT 167
Query: 186 --FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDS 237
+ GKV + + V +T M+ + + + + + G+ GL ++FDS
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 238 GASYAYFTSR----VYQEIVSLIMR 258
G+ +Y R + Q I L++R
Sbjct: 228 GSELSYIPDRALSVLSQRIRELLLR 252
>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/265 (24%), Positives = 114/265 (43%), Gaps = 36/265 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P +FG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPGFSFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL--- 185
C + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 113 CNMDSFGANEFGNVD--GLLGMGAGAMSV---LKQSSPTFDCFSYCLPLQKSERGFFSKT 167
Query: 186 --FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDS 237
+ GKV + + V +T M+ + + + + + G+ GL ++FDS
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 238 GASYAYFTSR----VYQEIVSLIMR 258
G+ +Y R + Q I L++R
Sbjct: 228 GSELSYIPDRALSVLSQRIRELLLR 252
>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 115/265 (43%), Gaps = 36/265 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P TFG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPSFTFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL--- 185
C + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 113 CNLDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSERGFFSKT 167
Query: 186 --FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDS 237
+ GKV + + V +T M+ + + + + A + G+ GL ++FDS
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 238 GASYAYFTSR----VYQEIVSLIMR 258
G+ +Y R + Q I L++R
Sbjct: 228 GSELSYIPDRALSVLSQRIRELLLR 252
>gi|330842955|ref|XP_003293432.1| hypothetical protein DICPUDRAFT_158270 [Dictyostelium purpureum]
gi|325076242|gb|EGC30045.1| hypothetical protein DICPUDRAFT_158270 [Dictyostelium purpureum]
Length = 484
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 84/339 (24%), Positives = 137/339 (40%), Gaps = 45/339 (13%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCS 70
+++ +N V + F DTGS LT + C C + Y P + ++PCS
Sbjct: 80 NFYQINANVYIGGQKFILQVDTGSTLTAIPL-KNCNNC-RGERPVYNPEISNSSILIPCS 137
Query: 71 NPRCAALHWPNPPRCKHPNDQ--CDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
+ C P H + + CD+ I YGDG G + +D + NG V
Sbjct: 138 SDHCLGSGSAAPSCRLHQSSKSSCDFVILYGDGSKVRGKIYSDEITM---NG----VKSI 190
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGR--GRISIV-----SQLREYGLIRNVIGHCIGQNG 181
G N G P G++GLGR ++V S +R ++NV G + G
Sbjct: 191 GFFGANVEEVGTFEYPRADGIMGLGRTGNNKNLVPTIFESMVRANSSMKNVFGIYLDYQG 250
Query: 182 RGVLFLG--DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL-TLIFDSG 238
+G L LG + + +TP++QN Y + P S S L +I DSG
Sbjct: 251 QGHLSLGRINPNFYVGEIEYTPVVQNGP---FYSIKPTSFRISNTSFLASSLGQVIVDSG 307
Query: 239 ASYAYFTSRVYQEIVSLIMR-----DLIGTPLKLAPDDKTLPICWRGPFKALGQVTEYFK 293
S + ++Y +++ R D++ P+ + T C+ + E F
Sbjct: 308 TSDIILSGKIYDHLIAFFRRHYCHIDMVCDPISIF----TGRACFERE-----EDFESFP 358
Query: 294 PLALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYLTG 332
L F+ VR+ +PP+ Y++ + ST + Y G
Sbjct: 359 WLHFGFS---GGVRIAIPPKNYMIKTQSTQPGVYGYCWG 394
>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 252
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 69/149 (46%), Gaps = 16/149 (10%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNPRCAA 76
+T+G K DT SDLTWVQC+ PC C +KP V C++ C +
Sbjct: 67 VTMGLGSKNMTVIIDTRSDLTWVQCE-PCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQS 125
Query: 77 LHWP--NPPRCKHPN-DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGY 133
L + N C N C+Y + YGDG + G DL S G V FGCG
Sbjct: 126 LQFATGNTGACGSSNPSTCNYVVNYGDGSYTNG----DLGVEALSFGGVSVSDFVFGCGR 181
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQ 162
N N G +G++GLGR +S+VSQ
Sbjct: 182 N--NKGLFGG--VSGLMGLGRSYLSLVSQ 206
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 114/265 (43%), Gaps = 36/265 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ ++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P TFG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPSFTFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL--- 185
C + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 113 CNLDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSERGFFSKT 167
Query: 186 --FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDS 237
+ GKV + + V +T M+ + + + + A + G+ GL ++FDS
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 238 GASYAYFTSR----VYQEIVSLIMR 258
G+ +Y R + Q I L++R
Sbjct: 228 GSELSYIPDRALSVLSQRIRELLLR 252
>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 114/265 (43%), Gaps = 36/265 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ ++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVTSVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P TFG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPSFTFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL--- 185
C + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 113 CNLDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSERGFFSKT 167
Query: 186 --FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDS 237
+ GKV + + V +T M+ + + + + A + G+ GL ++FDS
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227
Query: 238 GASYAYFTSR----VYQEIVSLIMR 258
G+ +Y R + Q I L++R
Sbjct: 228 GSELSYIPDRALSVLSQRIRELLLR 252
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 63/223 (28%), Positives = 100/223 (44%), Gaps = 20/223 (8%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ V + +G PP DTGSD+ WVQC +PC+ C + + P + VPC++
Sbjct: 123 YLVRVGIGSPPLEQHLVADTGSDVIWVQC-SPCSDCYAQGDPLFDPANSASFSPVPCNSG 181
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C A + C +C+Y++ YGD + G L + L +G + GCG
Sbjct: 182 VCRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTL---DGGTEVQGVAMGCG 238
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG------QNGRGVLF 186
+ N G + + AG+LGLG G +S+V QL +C+ +G G L
Sbjct: 239 H--ENRGLFA--EAAGLLGLGWGPMSLVGQLGG--AAGGAFSYCLAGYYSGEGSGSGSLV 292
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLK 229
LG +G W P+++N Y +G L +G+ L+
Sbjct: 293 LGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQLQ 335
>gi|449533387|ref|XP_004173657.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 254
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 60/204 (29%), Positives = 88/204 (43%), Gaps = 33/204 (16%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYK---------PHKN 65
S V+L +G PP+ D DTGS L+W+QC PP + K +
Sbjct: 65 SALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFS 124
Query: 66 IVPCSNPRCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVF 123
++PC++P C + P C N C Y Y DG + G LV + F FSN S+
Sbjct: 125 LLPCNHPICKPRIPDFTLPTSCDQ-NRLCHYSYFYADGTLAEGNLVREKF--TFSN-SLS 180
Query: 124 NVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQ 179
P+ GC + G+LG+ GR+S +SQ + + +C+ G
Sbjct: 181 TPPVILGCAQGST--------ENRGILGMNHGRLSFISQAK-----ISKFSYCVPSRTGP 227
Query: 180 NGRGVLFLGDGKVPSSGVAWTPML 203
N G+ +LGD SS + ML
Sbjct: 228 NPTGLFYLGDNPN-SSKFKYVTML 250
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 82/280 (29%), Positives = 112/280 (40%), Gaps = 33/280 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEK--QYKPHKNI----VPCS 70
+ + +G P DTGS LTWVQC PC P++ + P+ + VPC
Sbjct: 129 YVATVGLGTPAVPQTLILDTGSSLTWVQCK-PCNSSQCYPQRLPLFDPNTSSSYSPVPCD 187
Query: 71 NPRCAALHWP-NPPRCKHPND-QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
+ C AL + C D C YEI YG G + G TD L G++
Sbjct: 188 SQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTD--ALTLGPGAIVKR-FH 244
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ--LREYGLIRNVIGHCIGQNGRGVLF 186
FGCG++Q G D GVLGLGR S+ Q R G V HC+ G F
Sbjct: 245 FGCGHHQQR-GKFDMAD--GVLGLGRLPQSLAWQASARRGG---GVFSHCLPPTGVSTGF 298
Query: 187 LGDGK-VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL-------TLIFDSG 238
L G +S +TP+L Y L P + +G+ L D+ +I DSG
Sbjct: 299 LALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQ---LLDIPPAVFREGVITDSG 355
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
+ Y + + + P LAP L C+
Sbjct: 356 TVLSALQETAYTALRTAFRSAMAEYP--LAPPVGHLDTCF 393
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 64/125 (51%), Gaps = 14/125 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF+ + VG+P K F DTGSD+ W+QC PCT C + + + P + +PC +
Sbjct: 155 YFS-RVGVGQPAKPFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPRSSSSFASLPCES 212
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
+C AL C+ +C Y++ YGDG ++G V + L F N + N + GC
Sbjct: 213 QQCQALETSG---CRA--SKCLYQVSYGDGSFTVGEFVIE--TLTFGNSGMIN-NVAVGC 264
Query: 132 GYNQH 136
G++
Sbjct: 265 GHDNE 269
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 64/125 (51%), Gaps = 15/125 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF+ + +GKP DTGSD+ W+QC APC C + ++P + + C
Sbjct: 144 YFS-RVGIGKPSSPVYMVLDTGSDVNWIQC-APCADCYHQADPIFEPASSTSYSPLSCDT 201
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
+C +L + C+ N+ C YE+ YGDG ++G VT+ L + SV NV + GC
Sbjct: 202 KQCQSL---DVSECR--NNTCLYEVSYGDGSYTVGDFVTETITL--GSASVDNVAI--GC 252
Query: 132 GYNQH 136
G+N
Sbjct: 253 GHNNE 257
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 67/153 (43%), Gaps = 14/153 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
F VN+++G PP DT SDL W+QC PC C + P ++ N C
Sbjct: 85 FLVNISIGSPPVTQLLHMDTASDLLWLQC-RPCINCYAQSLPIFDPSRSYTH-RNESCRT 142
Query: 77 LHWPNPP-RCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL-----RFSNGSVFNVPLTFG 130
+ P R C+Y + Y DG S G L ++ S+ ++ +V FG
Sbjct: 143 SQYSMPSLRFNAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALHDV--VFG 200
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
CG++ + P G+LGLG G S+V +
Sbjct: 201 CGHDNYG----EPLVGTGILGLGYGEFSLVHRF 229
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 131/335 (39%), Gaps = 44/335 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV--------- 67
+ + +G P K + DTGS LTW+QC C + + P +
Sbjct: 129 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSAQ 188
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
CS+ A L NP C N C Y+ YGD S+G L D + F + SV N
Sbjct: 189 QCSDLTTATL---NPASCSTSN-VCIYQASYGDSSFSVGYLSKDT--VSFGSTSVPN--F 240
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
+GCG Q N G +AG++GL R ++S++ QL + +C+ +
Sbjct: 241 YYGCG--QDNEGLFG--QSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGY 294
Query: 188 GDGKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASY 241
+ G ++TPM +S D Y + + +GK S L I DSG
Sbjct: 295 LSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVI 354
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG--QVTEYFKPLALSF 299
+ VY + + + GTP A L C++G L +VT F A
Sbjct: 355 TRLPTGVYSALSKAVAGAMKGTP--RASAFSILDTCFQGQAARLRVPEVTMAFAGGAALK 412
Query: 300 TNRRNSVRLVVPPEAYLVISVSTSIIIIAYLTGKS 334
RN L++ V ++ +A+ +S
Sbjct: 413 LAARN-----------LLVDVDSATTCLAFAPARS 436
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 92/341 (26%), Positives = 138/341 (40%), Gaps = 58/341 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ +NL++G PP DTGS+L W QC PC C + + P + V CS+
Sbjct: 94 YLMNLSLGTPPSPIMAVADTGSNLIWTQC-KPCDDCYTQVDPLFDPKASSTYKDVSCSSS 152
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF--- 129
+C AL N C + C Y + Y DG ++G D L GS N P+
Sbjct: 153 QCTALE--NQASCSTEDKTCSYLVSYADGSYTMGKFAVDTLTL----GSTDNRPVQLKNI 206
Query: 130 --GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGV 184
GCG N ++GV+GLG G +S++ QL + I +C+ +
Sbjct: 207 IIGCG---QNNAVTFRNKSSGVVGLGGGAVSLIKQLGDS--IDGKFSYCLVPENDQTSKI 261
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDL---------TLIF 235
F + V G TP++ S D +Y L S G K++ ++
Sbjct: 262 NFGTNAVVSGPGTVSTPLVVKSRDTFYY------LTLKSISVGSKNMQTPDSNIKGNMVI 315
Query: 236 DSGASYAYFTSRVYQEI----VSLIMRD-----LIGTPLKL-APDDKTLPIC---WRG-- 280
DSG + + Y EI SLI D IG+ L A D +P+ + G
Sbjct: 316 DSGTTLTLLPVKYYIEIENAVASLINADKSKDERIGSSLCYNATADLNIPVITMHFEGAD 375
Query: 281 ----PFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
P+ + +VTE LA + RN + V + +LV
Sbjct: 376 VKLYPYNSFFKVTEDLVCLAFGMSFYRNGIYGNVAQKNFLV 416
>gi|357491945|ref|XP_003616260.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517595|gb|AES99218.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 441
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 82/336 (24%), Positives = 128/336 (38%), Gaps = 63/336 (18%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----------HKN 65
V L +G PP+L DTGS ++W+ CD K P+K+ P
Sbjct: 69 LVVTLPIGTPPQLQQMVLDTGSQVSWIHCDN-----KKGPQKKQPPTTSSFDPSLSSSFF 123
Query: 66 IVPCSNPRCAALHWPNPPRCKHPND-----QCDYEIEYGDGGSSIGALVTDLFPLRFSNG 120
+PC++P C P P P D C Y Y DG G LV + L +
Sbjct: 124 ALPCNHPLCK----PQVPDISLPTDCDANRLCHYSFSYTDGTVVEGNLVRENIAL---SP 176
Query: 121 SVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN 180
S+ P+ GC NQ + D G+LG+ GR+S +Q + + Q
Sbjct: 177 SLTTPPIILGCA-NQSD-------DARGILGMNLGRLSFPNQAK-ITKFSYFVPVKQTQP 227
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYS----GKSCGLKDLTL--- 233
G G L+LG+ SS + +L S + L ++ G S G K L +
Sbjct: 228 GSGSLYLGNNP-NSSCFRYVKLLTFSKSQSQRMPNLDPLAFTLPMQGISIGGKKLNIPPS 286
Query: 234 ------------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGP 281
I DSG+ ++Y + Y I + +++ + K IC+ G
Sbjct: 287 VFKPDTTGFGQTIIDSGSEFSYMVDKAYNVIRNELVKKVGSKIKKDYIYGGVADICFDGD 346
Query: 282 FKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+G++ + F V +V+P E L+
Sbjct: 347 ATEIGRLV---GDMVFEF---EKGVEIVIPKERVLI 376
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 63/119 (52%), Gaps = 15/119 (12%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALH 78
VG P + F DTGSD+ W+QC PCT C + + + P + V C + +C++L
Sbjct: 167 VGNPARQFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPTASSTYAPVTCQSQQCSSLE 225
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVPLTFGCGYNQH 136
+ C+ + QC Y++ YGDG + G T+ + F N GSV NV L GCG++
Sbjct: 226 MSS---CR--SGQCLYQVNYGDGSYTFGDFATE--SVSFGNSGSVKNVAL--GCGHDNE 275
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 63/125 (50%), Gaps = 15/125 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF+ + +G+PP DTGSD++WVQC APC C + + ++P + + C
Sbjct: 151 YFS-RVGIGRPPSPVYMVLDTGSDVSWVQC-APCAECYEQTDPIFEPTSSASFTSLSCET 208
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
+C +L + C+ N C YE+ YGDG ++G VT+ L GS + GC
Sbjct: 209 EQCKSL---DVSECR--NGTCLYEVSYGDGSYTVGDFVTETVTL----GSTSLGNIAIGC 259
Query: 132 GYNQH 136
G+N
Sbjct: 260 GHNNE 264
>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
Length = 256
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/118 (34%), Positives = 61/118 (51%), Gaps = 13/118 (11%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALH 78
+G PPK DTGSD+ WVQC APC C + + ++P + + C +C +L
Sbjct: 59 IGSPPKHVYMVVDTGSDVNWVQC-APCADCYQQADPIFEPSFSSSYAPLTCETHQCKSL- 116
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
+ C+ ND C YE+ YGDG ++G T+ L S S+ NV + GCG++
Sbjct: 117 --DVSECR--NDSCLYEVSYGDGSYTVGDFATETITLDGS-ASLNNVAI--GCGHDNE 167
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 88/198 (44%), Gaps = 25/198 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
YF + + VG PP+ D+GSD+ WVQC PCT C + + P + VPCS
Sbjct: 141 EYF-IRIGVGSPPREQYVVIDSGSDIVWVQCQ-PCTQCYHQTDPVFDPADSASFMGVPCS 198
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ C + C C YE+ YGDG + G L L L F V NV + G
Sbjct: 199 SSVCERIENAG---CHAGG--CRYEVMYGDGSYTKGTLA--LETLTFGRTVVRNVAI--G 249
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFL 187
CG+ N G LG G +S+V QL G +C+ G + G L
Sbjct: 250 CGH--RNRGMFVGAAGLLGLGG--GSMSLVGQLG--GQTGGAFSYCLVSRGTDSAGSLEF 303
Query: 188 GDGKVPSSGVAWTPMLQN 205
G G +P G AW P+++N
Sbjct: 304 GRGAMP-VGAAWIPLIRN 320
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 61/125 (48%), Gaps = 12/125 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF+ + +G P + DTGSD+TWVQC PC C + + + P + V C +
Sbjct: 166 YFS-RVGIGSPARQLYMVLDTGSDVTWVQCQ-PCADCYQQSDPVFDPSLSASYAAVSCDS 223
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
RC L + C++ C YE+ YGDG ++G T+ L S V NV + GC
Sbjct: 224 QRCRDL---DTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDST-PVGNVAI--GC 277
Query: 132 GYNQH 136
G++
Sbjct: 278 GHDNE 282
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/269 (28%), Positives = 115/269 (42%), Gaps = 34/269 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
S + VN+ +G P K FDTGS L W QC PC C P + P K+ +PCS
Sbjct: 130 SDYIVNVGIGTPKKEMPLIFDTGSGLIWTQC-KPCKACY-PKVPVFDPTKSASFKGLPCS 187
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ C ++ C P +C Y Y D SS G L T+ + FS+ + G
Sbjct: 188 SKLCQSIRQ----GCSSP--KCTYLTAYVDNSSSTGTLATET--ISFSHLKYDFKNILIG 239
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQN--GRGVLFLG 188
C +Q + L +G++GL R IS+ SQ + + +CI G L G
Sbjct: 240 CS-DQVSGESLGE---SGIMGLNRSPISLASQTAN--IYDKLFSYCIPSTPGSTGHLTFG 293
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLTLIFDSGASYA 242
GKVP+ V ++P+ + + + I +G +LL + + DSGA
Sbjct: 294 -GKVPND-VRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIAS---TIDSGAVLT 348
Query: 243 YFTSRVYQEIVSLIMRDLIGTPLKLAPDD 271
+ Y + S+ + G PL L DD
Sbjct: 349 RLPPKAYSALRSVFREMMKGYPL-LDQDD 376
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 89/330 (26%), Positives = 138/330 (41%), Gaps = 63/330 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCT-GCTKPPEKQYKPHKNI----VPCSN 71
F + L +G PP F DTGSDL W QC APC+ C + P Y P + +PC++
Sbjct: 85 FLMTLAIGTPPLPFLAIADTGSDLIWTQC-APCSRQCFQQPTPLYNPSSSTTFSALPCNS 143
Query: 72 P--RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVP-L 127
CA P C C Y + YG G + + T+ F S VP +
Sbjct: 144 SLGLCA-------PACA-----CMYNMTYGSGWTYVFQ-GTETFTFGSSTPADQVRVPGI 190
Query: 128 TFGC-----GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----G 178
FGC G+N + +G++GLGRG +S+VSQL +C+
Sbjct: 191 AFGCSNASSGFNASS--------ASGLVGLGRGSLSLVSQLGAPKF-----SYCLTPYQD 237
Query: 179 QNGRGVLFLG-DGKVPSSG-VAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLK- 229
N L LG + +G V+ TP + + + + +Y+ LG L + LK
Sbjct: 238 TNSTSTLLLGPSASLNDTGVVSSTPFVASPSSIYYYLNLTGISLGTTALPIPPNAFSLKA 297
Query: 230 DLT--LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQ 287
D T LI DSG + + YQ++ + ++ L+ P L +C+ P
Sbjct: 298 DGTGGLIIDSGTTITMLGNTAYQQVRAAVL-SLVTLPTTDGSAATGLDLCFELP------ 350
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+ P S T + +V+P + Y++
Sbjct: 351 SSTSAPPSMPSMTLHFDGADMVLPADNYMM 380
>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/323 (23%), Positives = 124/323 (38%), Gaps = 79/323 (24%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTW----------VQCD------------------AP 48
+ ++L++G PP++ DTGSDLTW ++CD +
Sbjct: 80 YLISLSIGTPPQVIQVYMDTGSDLTWAPCGNISFDCIECDNYRNNRMMASFSPSHSSSSH 139
Query: 49 CTGCTKPPEKQYKPHKN-IVPCSNPRC-------AALHWPNPPRCKHPNDQCDYEIEYGD 100
CT P N + PC+ C A WP PP + YG
Sbjct: 140 RDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPP----------FAYTYGA 189
Query: 101 GGSSIGALVTDLFPLRFSN-GSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRIS 158
GG G L D + N G +P FGC + + + G+ G GRG +S
Sbjct: 190 GGVVTGTLTRDTLRVHGRNLGVTQEIPRFCFGCVASSYR-------EPIGIAGFGRGALS 242
Query: 159 IVSQLREYGLIRNVIGHCI-------GQNGRGVLFLGDGKVPSS-GVAWTPMLQNSADLK 210
+ SQL G +R HC N L +GD + S + +TPML++
Sbjct: 243 LPSQL---GFLRKGFSHCFLAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPMYPN 299
Query: 211 HYILGPAELLYSGKSC-----------GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRD 259
+Y +G + S L + ++ DSG +Y + Y +++S +++
Sbjct: 300 YYYVGLEAITVGNVSATEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLS-VLQS 358
Query: 260 LIGTPLKLAPDDKT-LPICWRGP 281
+I P + +T +C++ P
Sbjct: 359 IINYPRATDMEMRTGFDLCYKVP 381
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 57/125 (45%), Gaps = 14/125 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF + VG PPK DTGSD+ W+QC APC C + + P K+ V C
Sbjct: 129 YF-TRIGVGTPPKYVYMVLDTGSDIVWLQC-APCKNCYSQTDPVFNPVKSGSFAKVLCRT 186
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L P C C Y++ YGDG + G VT+ L F V V L GC
Sbjct: 187 PLCRRLESPG---CNQ-RQTCLYQVSYGDGSYTTGEFVTET--LTFRRTKVEQVAL--GC 238
Query: 132 GYNQH 136
G++
Sbjct: 239 GHDNE 243
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 69/157 (43%), Gaps = 13/157 (8%)
Query: 17 FAVNLTVGKP-PKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
+ ++ +G P P+ DTGSDL W QC PC C P + P + V C +
Sbjct: 87 YLIHFNIGTPRPQRVALTMDTGSDLVWTQC-TPCPVCFDQPFPLFDPSVSSTFRAVACPD 145
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS----VFNVPL 127
P C + C +C Y YGD + G + D F NG V L
Sbjct: 146 PICRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMSPNGEGAPPVAVSGL 205
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR 164
FGCG +N G + + +G+ G GRG +S+ SQLR
Sbjct: 206 AFGCG--DYNTGVFA-SNESGIAGFGRGPLSLPSQLR 239
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 71/275 (25%), Positives = 112/275 (40%), Gaps = 25/275 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ + +G P + DTGS LTW+QC C + + P + V CS
Sbjct: 122 YVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQ 181
Query: 73 RCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+C+ L NP C N C Y+ YGD S+G L D + F + S+ N +G
Sbjct: 182 QCSDLPSATLNPSACSSSN-VCIYQASYGDSSFSVGYLSKDT--VSFGSTSLPN--FYYG 236
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG Q N G +AG++GL R ++S++ QL + +C+ +
Sbjct: 237 CG--QDNEGLFG--RSAGLIGLARNKLSLLYQLAPS--LGYSFTYCLPSSSSSGYLSLGS 290
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASYAYFT 245
P ++TPM+ +S D Y + + + +G S L I DSG
Sbjct: 291 YNPGQ-YSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLP 349
Query: 246 SRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG 280
+ VY + + + GT A L C++G
Sbjct: 350 TSVYSALSKAVAAAMKGT--SRASAYSILDTCFKG 382
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 63/119 (52%), Gaps = 15/119 (12%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALH 78
VG P + F DTGSD+ W+QC PCT C + + + P + V C + +C++L
Sbjct: 26 VGNPARQFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPTASSTYAPVTCQSQQCSSLE 84
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVPLTFGCGYNQH 136
+ C+ + QC Y++ YGDG + G T+ + F N GSV NV L GCG++
Sbjct: 85 MSS---CR--SGQCLYQVNYGDGSYTFGDFATE--SVSFGNSGSVKNVAL--GCGHDNE 134
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 77/180 (42%), Gaps = 20/180 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + VG PP D+GSD+ WVQC PC C + + P + V C +
Sbjct: 130 YF-VRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSCGS 187
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L +CDY + YGDG + G L + L G + GC
Sbjct: 188 AICRTLSGTGCGG-GGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIGC 242
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G+ N G AG+LGLG G +S+V QL G V +C+ G G G L LG
Sbjct: 243 GH--RNSGLFV--GAAGLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAGGAGSLVLG 296
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 124/291 (42%), Gaps = 37/291 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCS 70
S F NL++G PP DTGSDL W+QC+ PC C K + Y K+ + C+
Sbjct: 91 SAFLANLSIGNPPTNVYVVLDTGSDLFWIQCE-PCDVCYKQKDPIYNRTKSDSYTEMLCN 149
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTD--LFPLRFSNGSVFNVPLT 128
P C +L +C + C Y+ Y DG + G L + F +S+ +
Sbjct: 150 EPPCVSLGREG--QCSD-SGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDK-TAQVG 205
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ----NGRGV 184
FGCG N ++ GVLGLG G +S+VSQL G + +C G N G
Sbjct: 206 FGCGL--QNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPNAGGF 263
Query: 185 LFLGDGKVPSSGVAWTPMLQNS---ADLKHYILGPAELLYSGKSCGLKDL-----TLIFD 236
L GD + + TPM+ +L LG E S + +I D
Sbjct: 264 LVFGDATYLNGDM--TPMVIAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIID 321
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIG----TPLKLAPDDKTLPICWRGPFK 283
SG++ + F VY+ + + ++ L +PL +PD C+ G +
Sbjct: 322 SGSTLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD------CFEGKIE 366
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 71/271 (26%), Positives = 111/271 (40%), Gaps = 25/271 (9%)
Query: 21 LTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAA 76
+ +G P + DTGS LTW+QC C + + P + V CS +C+
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSD 60
Query: 77 L--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYN 134
L NP C N C Y+ YGD S+G L D + F + S+ N +GCG
Sbjct: 61 LPSATLNPSACSSSN-VCIYQASYGDSSFSVGYLSKD--TVSFGSTSLPN--FYYGCG-- 113
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGKVPS 194
Q N G +AG++GL R ++S++ QL + +C+ + P
Sbjct: 114 QDNEGLFG--RSAGLIGLARNKLSLLYQLAPS--LGYSFTYCLPSSSSSGYLSLGSYNPG 169
Query: 195 SGVAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASYAYFTSRVY 249
++TPM+ +S D Y + + + +G S L I DSG + VY
Sbjct: 170 Q-YSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTSVY 228
Query: 250 QEIVSLIMRDLIGTPLKLAPDDKTLPICWRG 280
+ + + GT A L C++G
Sbjct: 229 SALSKAVAAAMKGT--SRASAYSILDTCFKG 257
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/278 (24%), Positives = 106/278 (38%), Gaps = 50/278 (17%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNPRC 74
V+L +G PP+ DTGS L+W+QC PP + P +++PC++P C
Sbjct: 84 VSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPR-KPPPSSVFDPSLSSSFSVLPCNHPLC 142
Query: 75 AAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+ P C N C Y Y DG + G LV + S + PL GC
Sbjct: 143 KPRIPDFTLPTSCDQ-NRLCHYSYFYADGTLAEGNLVREKITFSRSQST---PPLILGCA 198
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGVL 185
D G+LG+ GR+S SQ + +C+ G G
Sbjct: 199 EESS--------DAKGILGMNLGRLSFASQAK-----LTKFSYCVPTRQVRPGFTPTGSF 245
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYILGPAE--LLYSGKSCGLKDLTL---------- 233
+LG+ S G + +L S + L P + G G + L +
Sbjct: 246 YLGENP-NSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPS 304
Query: 234 -----IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLK 266
+ DSG+ + Y Y ++ ++R L+G LK
Sbjct: 305 GAGQTMIDSGSEFTYLVDEAYNKVREEVVR-LVGARLK 341
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 131/335 (39%), Gaps = 44/335 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV--------- 67
+ + +G P K + DTGS LTW+QC C + + P +
Sbjct: 127 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQ 186
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
CS+ A L NP C N C Y+ YGD S+G L D + F + SV N
Sbjct: 187 QCSDLTTATL---NPASCSTSN-VCIYQASYGDSSFSVGYLSKDT--VSFGSTSVPN--F 238
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
+GCG Q N G +AG++GL R ++S++ QL + +C+ +
Sbjct: 239 YYGCG--QDNEGLFG--QSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGY 292
Query: 188 GDGKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASY 241
+ G ++TPM +S D Y + + +GK S L I DSG
Sbjct: 293 LSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVI 352
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG--QVTEYFKPLALSF 299
+ VY + + + GTP A L C++G L +VT F A
Sbjct: 353 TRLPTGVYSALSKAVAGAMKGTPRASA--FSILDTCFQGQAARLRVPEVTMAFAGGAALK 410
Query: 300 TNRRNSVRLVVPPEAYLVISVSTSIIIIAYLTGKS 334
RN L++ V ++ +A+ +S
Sbjct: 411 LAARN-----------LLVDVDSATTCLAFAPARS 434
>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 528
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 106/248 (42%), Gaps = 19/248 (7%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP----------PEKQYKPHK 64
S + N++VG PP F DTGSDL W+ C+ T C + P Y P+
Sbjct: 100 SLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTT-CIRDLEDIGVPQSVPLNLYTPNA 158
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+ S+ RC+ +C P+ C Y+I Y + + G L+ D+ L + ++
Sbjct: 159 STTS-SSIRCSDKRCFGSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENLTP 217
Query: 125 VP--LTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG 181
V +T GCG Q G ++ GVLGLG S+ S L + + N C G+
Sbjct: 218 VKANVTLGCG--QKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGRVI 275
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASY 241
V + G + TP + + A Y + + + +G ++ L FD+G+S+
Sbjct: 276 GNVGRISFGDRGYTDQEETPFI-SVAPSTAYGVNISGVSVAGDPVDIR-LFAKFDTGSSF 333
Query: 242 AYFTSRVY 249
+ Y
Sbjct: 334 THLREPAY 341
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 131/335 (39%), Gaps = 44/335 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV--------- 67
+ + +G P K + DTGS LTW+QC C + + P +
Sbjct: 127 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQ 186
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
CS+ A L NP C N C Y+ YGD S+G L D + F + SV N
Sbjct: 187 QCSDLTTATL---NPASCSTSN-VCIYQASYGDSSFSVGYLSKDT--VSFGSTSVPN--F 238
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
+GCG Q N G +AG++GL R ++S++ QL + +C+ +
Sbjct: 239 YYGCG--QDNEGLFG--QSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGY 292
Query: 188 GDGKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASY 241
+ G ++TPM +S D Y + + +GK S L I DSG
Sbjct: 293 LSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVI 352
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG--QVTEYFKPLALSF 299
+ VY + + + GTP A L C++G L +VT F A
Sbjct: 353 TRLPTGVYSALSKAVAGAMKGTPRASA--FSILDTCFQGQAARLRVPEVTMAFAGGAALK 410
Query: 300 TNRRNSVRLVVPPEAYLVISVSTSIIIIAYLTGKS 334
RN L++ V ++ +A+ +S
Sbjct: 411 LAARN-----------LLVDVDSATTCLAFAPARS 434
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 78/184 (42%), Gaps = 21/184 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + VG PP D+GSD+ WVQC PC C + + P + V C +
Sbjct: 130 YF-VRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSCGS 187
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L +CDY + YGDG + G L + L G + GC
Sbjct: 188 AICRTLSGTGCGG-GGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIGC 242
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDGK 191
G+ N G AG+LGLG G +S+V QL G V +C+ G G G G
Sbjct: 243 GH--RNSGLFV--GAAGLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAG----GAGS 292
Query: 192 VPSS 195
+ SS
Sbjct: 293 LASS 296
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 60/123 (48%), Gaps = 8/123 (6%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ ++ +VG PP DTGS +TW+QC C C + + P K+ +PCS+
Sbjct: 97 YLMSYSVGTPPFEILGVVDTGSGITWMQCQR-CEDCYEQTTPIFDPSKSKTYKTLPCSSN 155
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGC 131
C ++ + P C C Y I+YGDG S G L + L +NGS P T GC
Sbjct: 156 MCQSV--ISTPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLTLGSTNGSSVQFPNTVIGC 213
Query: 132 GYN 134
G+N
Sbjct: 214 GHN 216
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 67/125 (53%), Gaps = 14/125 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF+ + VG P K DTGSD+ W+QC PC+ C + + + P + + CS+
Sbjct: 164 YFS-RIGVGTPAKEMYVVLDTGSDVNWIQC-LPCSECYQQSDPIFDPTSSSTFKSLTCSD 221
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P+CA+L + C+ +++C Y++ YGDG ++G TD S G V +V L GC
Sbjct: 222 PKCASL---DVSACR--SNKCLYQVSYGDGSFTVGNYATDTVTFGES-GKVNDVAL--GC 273
Query: 132 GYNQH 136
G++
Sbjct: 274 GHDNE 278
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/272 (26%), Positives = 110/272 (40%), Gaps = 42/272 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKNI----VPCS 70
+ V + +G P DTGSDL+WVQC APC T P+K + P ++ +PC+
Sbjct: 120 YVVTVGLGTPAVSQVLLIDTGSDLSWVQC-APCNSTTCYPQKDPLFDPSRSSTYAPIPCN 178
Query: 71 NPRCAAL----HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP 126
C L + + QC Y I YGDG + G +SN ++ P
Sbjct: 179 TDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGV---------YSNETLTMAP 229
Query: 127 ------LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCI-- 177
FGCG++Q GP D G+LGLG S+V Q YG +C+
Sbjct: 230 GVTVKDFHFGCGHDQD--GPNDKYD--GLLGLGGAPESLVVQTSSVYG---GAFSYCLPA 282
Query: 178 GQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----L 233
+ G L LG +SG +TPM++ Y++ + G+ + +
Sbjct: 283 ANDQAGFLALGAPVNDASGFVFTPMVREQQTF--YVVNMTGITVGGEPIDVPPSAFSGGM 340
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPL 265
I DSG Y + + + + PL
Sbjct: 341 IIDSGTVVTELQHTAYAALQAAFRKAMAAYPL 372
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/128 (35%), Positives = 68/128 (53%), Gaps = 17/128 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC---TGCTKPPEKQYKPHKNIVP---- 68
YFA + VG+P + + F DTGSD++W+QC PC GC K + P +
Sbjct: 184 YFA-RIGVGQPVQSYFFVPDTGSDVSWLQCQ-PCDGENGCYKQIGPIFDPKSSSSYSPLS 241
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
C + +C H + C + C YE+EYGDG ++G L T+ F R SN S+ N+P+
Sbjct: 242 CDSEQC---HLLDEAACDA--NSCIYEVEYGDGSFTVGELATETFSFRHSN-SIPNLPI- 294
Query: 129 FGCGYNQH 136
GCG++
Sbjct: 295 -GCGHDNE 301
>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 115/264 (43%), Gaps = 33/264 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P +FG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPGFSFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL--- 185
C + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 113 CNMDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKSERGFFSKT 167
Query: 186 --FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDS 237
+ GKV + + V +T M+ + + + + + G+ GL ++FDS
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVFDS 227
Query: 238 GASYAYFTSRVYQEIVSLIMRDLI 261
G+ +Y R ++S +R+L+
Sbjct: 228 GSELSYIPDRALS-VLSQRIRELL 250
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 70/152 (46%), Gaps = 14/152 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG--CTKPPEKQYKPHKN----IVPCS 70
+ +++ +G P DTGSD++WVQC+ PC C + P K+ V C+
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCN-PCPNPPCYAQTGALFDPAKSSTYRAVSCA 185
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
CA L C N +C Y ++YGDG ++ G D L ++ +V FG
Sbjct: 186 AAECAQLEQQGNG-CGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKG--FQFG 242
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ 162
C + + T G++GLG G S+VSQ
Sbjct: 243 CSHVESG----FSDQTDGLMGLGGGAQSLVSQ 270
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 95/201 (47%), Gaps = 24/201 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSN 71
YF + ++VG PP+ DTGSD+ W+QC APC C ++ + P+K + + C++
Sbjct: 37 YF-IRVSVGTPPRGMYLVMDTGSDILWLQC-APCVSCYHQCDEVFDPYKSSTYSTLGCNS 94
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS---VFN-VPL 127
+C L C ++C Y+++YGDG S G TD L ++G V N +PL
Sbjct: 95 RQCLNLDVGG---CV--GNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIPL 149
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR---NVIGHCIGQNGRGV 184
GCG++ N G LG+G +S +Q+ R + G R
Sbjct: 150 --GCGHD--NEGYFVGAAGLLG--LGKGPLSFPNQINSENGGRFSYCLTGRDTDSTERSS 203
Query: 185 LFLGDGKVPSSGVAWTPMLQN 205
L GD VP +GV +TP N
Sbjct: 204 LIFGDAAVPPAGVRFTPQASN 224
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 77/303 (25%), Positives = 125/303 (41%), Gaps = 41/303 (13%)
Query: 26 PPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRC--AALHW 79
PP+ DTGS+L+W++C+ P + P ++ +PCS+P C +
Sbjct: 82 PPQNISMVIDTGSELSWLRCNR---SSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDF 138
Query: 80 PNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQHNPG 139
P C + C + Y D SS G L ++F F N S + L FGC +
Sbjct: 139 LIPASCD-SDKLCHATLSYADASSSEGNLAAEIF--HFGN-STNDSNLIFGCMGSVSGSD 194
Query: 140 PLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR--GVLFLGDGKVP-SSG 196
P T G+LG+ RG +S +SQ+ G + +CI G L LGD +
Sbjct: 195 PEEDTKTTGLLGMNRGSLSFISQM---GFPK--FSYCISGTDDFPGFLLLGDSNFTWLTP 249
Query: 197 VAWTPMLQNSADLKHY-----------ILGPAELLYSGKSCGLKDLT----LIFDSGASY 241
+ +TP+++ S L ++ I +LL KS + D T + DSG +
Sbjct: 250 LNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQF 309
Query: 242 AYFTSRVYQEIVSLIMRDLIGT-PLKLAPD---DKTLPICWR-GPFKALGQVTEYFKPLA 296
+ VY + S + G + PD T+ +C+R P + + ++
Sbjct: 310 TFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVS 369
Query: 297 LSF 299
L F
Sbjct: 370 LVF 372
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 70/152 (46%), Gaps = 14/152 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG--CTKPPEKQYKPHKN----IVPCS 70
+ +++ +G P DTGSD++WVQC+ PC C + P K+ V C+
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCN-PCPNPPCHAQTGALFDPAKSSTYRAVSCA 185
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
CA L C N +C Y ++YGDG ++ G D L ++ +V FG
Sbjct: 186 AAECAQLEQQGNG-CGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKG--FQFG 242
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ 162
C + + T G++GLG G S+VSQ
Sbjct: 243 CSHLESG----FSDQTDGLMGLGGGAQSLVSQ 270
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/128 (35%), Positives = 68/128 (53%), Gaps = 17/128 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPC---TGCTKPPEKQYKPHKNIVP---- 68
YFA + VG+P + + F DTGSD++W+QC PC GC K + P +
Sbjct: 184 YFA-RIGVGQPVQSYFFVPDTGSDVSWLQCQ-PCDGENGCYKQIGPIFDPKSSSSYSPLS 241
Query: 69 CSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
C + +C H + C + C YE+EYGDG ++G L T+ F R SN S+ N+P+
Sbjct: 242 CDSEQC---HLLDEAACDA--NSCIYEVEYGDGSFTVGELATETFSFRHSN-SIPNLPI- 294
Query: 129 FGCGYNQH 136
GCG++
Sbjct: 295 -GCGHDNE 301
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 65/251 (25%), Positives = 95/251 (37%), Gaps = 28/251 (11%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI------------ 66
+ +G P F DTGSDL WV CD CT C + ++
Sbjct: 102 TTVQIGTPGVKFMVALDTGSDLFWVPCD--CTRCAASDSTAFASDFDLNVYNPNGSSTSK 159
Query: 67 -VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNG--SV 122
V C+N C + +C C Y + Y +S G LV D+ L + +
Sbjct: 160 KVTCNNSLCT-----HRSQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDL 214
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
+ FGCG Q + L G+ GLG +IS+ S L G + C G++G
Sbjct: 215 VEANVIFGCGQIQ-SGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGI 273
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
G + GD S TP N + + I + G + + T +FDSG S+
Sbjct: 274 GRISFGDKG--SFDQDETPFNLNPSHPTYNI--TVTQVRVGTTVIDVEFTALFDSGTSFT 329
Query: 243 YFTSRVYQEIV 253
Y Y +
Sbjct: 330 YLVDPTYTRLT 340
>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 115/264 (43%), Gaps = 33/264 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P +FG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPGFSFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC--IGQNGRGVL--- 185
C + D G+LG+G G +S+ L++ + +C + ++ RG
Sbjct: 113 CNMDSFGANEFGNVD--GLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKSERGFFSKT 167
Query: 186 --FLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----KDLTLIFDS 237
+ GKV + + V +T M+ + + + + + G+ GL ++FDS
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLIAISVDGERLGLSPSVFSRKGVVFDS 227
Query: 238 GASYAYFTSRVYQEIVSLIMRDLI 261
G+ +Y R ++S +R+L+
Sbjct: 228 GSELSYIPDRALS-VLSQRIRELL 250
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 57/125 (45%), Gaps = 14/125 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF + VG PPK DTGSD+ W+QC APC C + + P K+ V C
Sbjct: 42 YF-TRIGVGTPPKYVYMVLDTGSDIVWLQC-APCKNCYSQTDPVFNPVKSGSFAKVLCRT 99
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L P C C Y++ YGDG + G VT+ L F V V L GC
Sbjct: 100 PLCRRLESPG---CNQ-RQTCLYQVSYGDGSYTTGEFVTE--TLTFRRTKVEQVAL--GC 151
Query: 132 GYNQH 136
G++
Sbjct: 152 GHDNE 156
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 68/266 (25%), Positives = 109/266 (40%), Gaps = 51/266 (19%)
Query: 11 FPIFSYFAVNLTVGKPP-KLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI--- 66
P + + +VG PP KL+ DTGSD+ W+QC+ PC C ++KP K+
Sbjct: 81 IPDHGEYLMTYSVGTPPFKLYGIA-DTGSDIVWLQCE-PCKECYNQTTPKFKPSKSSTYK 138
Query: 67 -VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV 125
+PCS+ C + G L D L S G +
Sbjct: 139 NIPCSSDLCKSGQQ--------------------------GNLSVDTLTLESSTGHPISF 172
Query: 126 PLT-FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC-----IGQ 179
P T GCG + + ++G++GLG G S+++QL I +C +
Sbjct: 173 PKTVIGCGTDNTVSFEGA---SSGIVGLGGGPASLITQLGSS--IDAKFSYCLLPNPVES 227
Query: 180 NGRGVLFLGDGKVPS-SGVAWTPMLQNSADLKHYI------LGPAELLYSGKSCGLKDLT 232
N L GD V S GV TP+++ + +Y+ +G + + G S G +
Sbjct: 228 NTTSKLNFGDTAVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGNKRIEFEGSSNGGHEGN 287
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMR 258
+I DSG + + VY + S ++
Sbjct: 288 IIIDSGTTLTVIPTDVYNNLESAVLE 313
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 58/125 (46%), Gaps = 13/125 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF L VG P + DTGSD+ W+QC APC C + + P K+ +PC +
Sbjct: 147 YF-TRLGVGTPARYVFMVLDTGSDVVWIQC-APCKKCYSQTDPVFNPTKSRSFANIPCGS 204
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L + P C C Y++ YGDG + G T+ L F V V L GC
Sbjct: 205 PLCRRL---DSPGCSTKKHICLYQVSYGDGSFTYGEFSTET--LTFRGTRVGRVAL--GC 257
Query: 132 GYNQH 136
G++
Sbjct: 258 GHDNE 262
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/273 (24%), Positives = 109/273 (39%), Gaps = 52/273 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P +FG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPSFSFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVS---------------QLREYGLIRNVIGH 175
C + D G+LG+G G +S++ Q+ E G G
Sbjct: 113 CNMDSFGANEFGNVD--GLLGMGAGPMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTG- 169
Query: 176 CIGQNGRGVLFLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----K 229
+ GKV + + V +T M+ + + + + + G+ GL
Sbjct: 170 ----------YFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFS 219
Query: 230 DLTLIFDSGASYAYFTSR----VYQEIVSLIMR 258
++FDSG+ +Y R + Q I L++R
Sbjct: 220 RKGVVFDSGSELSYIPDRALSVLSQRIRELLLR 252
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 76/273 (27%), Positives = 105/273 (38%), Gaps = 29/273 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKNI----VPCS 70
+ V L G P DTGSD++WVQC PC P+K + P K+ + C+
Sbjct: 131 YVVTLGFGTPSVPQVLLMDTGSDVSWVQC-TPCNSTKCYPQKDPLFDPSKSSTYAPIACN 189
Query: 71 NPRCAAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
C L H+ N C QC Y +EY DG S G + L +
Sbjct: 190 TDACRKLGDHYHN--GCTSGGTQCGYSVEYADGSHSRGVYSNETLTLA---PGITVEDFH 244
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRGVLFL 187
FGCG +Q GP D G+LGLG +S+V Q YG +C+ FL
Sbjct: 245 FGCGRDQR--GPSDKYD--GLLGLGGAPVSLVVQTSSVYG---GAFSYCLPALNSEAGFL 297
Query: 188 GDGKVPS---SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT----LIFDSGAS 240
G PS S +TPM Y++ + GK + +I DSG
Sbjct: 298 VLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFRGGMIIDSGTV 357
Query: 241 YAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT 273
Y + + + + L PL + D T
Sbjct: 358 DTELPETAYNALEAALRKALKAYPLVPSDDFDT 390
>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 323
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/274 (24%), Positives = 110/274 (40%), Gaps = 52/274 (18%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P +FG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPGFSFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVS---------------QLREYGLIRNVIGH 175
C + D G+LG+G G +S++ Q+ E G G+
Sbjct: 113 CNMDSFGANEFGNVD--GLLGMGAGPMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGY 170
Query: 176 CIGQNGRGVLFLGDGKVPS--SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL----- 228
F GK+ + + V +T M+ + + + + + G+ GL
Sbjct: 171 ----------FSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF 220
Query: 229 KDLTLIFDSGASYAYFTSR----VYQEIVSLIMR 258
++FDSG+ +Y R + Q I L++R
Sbjct: 221 SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLR 254
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 58/125 (46%), Gaps = 14/125 (11%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF L VG PP+ DTGSD+ W+QC PC C + + P + VPC+
Sbjct: 153 YF-TRLGVGTPPRYTYMVLDTGSDIMWIQC-LPCAKCYGQTDPLFNPAASSTYRKVPCAT 210
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C L C++ C+Y++ YGDG ++G T+ R G V + GC
Sbjct: 211 PLCKKLDISG---CRNKR-YCEYQVSYGDGSFTVGDFSTETLTFR---GQVIR-RVALGC 262
Query: 132 GYNQH 136
G++
Sbjct: 263 GHDNE 267
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 83/288 (28%), Positives = 123/288 (42%), Gaps = 37/288 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCS 70
S F NL++G PP DTGSDL W+QC+ PC C K + Y K+ + C+
Sbjct: 104 SAFLANLSIGNPPTNVYVVLDTGSDLFWIQCE-PCDVCYKQKDPIYNRTKSDSYTEMLCN 162
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTD--LFPLRFSNGSVFNVPLT 128
P C +L +C + C Y+ Y DG + G L + F +S+ +
Sbjct: 163 EPPCLSLGREG--QCSD-SGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDK-TAQVG 218
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ----NGRGV 184
FGCG N ++ GVLGLG G +S+VSQL G + +C G N G
Sbjct: 219 FGCGL--QNLNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNLSNPNAGGF 276
Query: 185 LFLGDGKVPSSGVAWTPMLQNS---ADLKHYILGPAELLYSGKSCGLKDL-----TLIFD 236
L GD + + TPM+ +L LG E S + +I D
Sbjct: 277 LVFGDATYLNGDM--TPMVIAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGGVIID 334
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIG----TPLKLAPDDKTLPICWRG 280
SG++ + F VY+ + + ++ L +PL +PD C+ G
Sbjct: 335 SGSTLSIFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD------CFEG 376
>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
Length = 475
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 79/178 (44%), Gaps = 17/178 (9%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP----------PEKQYKPHK 64
S + N++VG PP F DTGSDL W+ C+ T C + P Y P+
Sbjct: 100 SLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTT-CIRDLEDIGVPQSVPLNLYTPNA 158
Query: 65 NIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFN 124
+ S+ RC+ +C P+ C Y+I Y + + G L+ D+ L + ++
Sbjct: 159 STTS-SSIRCSDKRCFGSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENLTP 217
Query: 125 VP--LTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ 179
V +T GCG Q G ++ GVLGLG S+ S L + + N C G+
Sbjct: 218 VKANVTLGCG--QKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGR 273
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 68/297 (22%), Positives = 108/297 (36%), Gaps = 46/297 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
+ + LT+G PP DT SDL W QC PC GC K + P K + C+
Sbjct: 31 YLMKLTLGTPPVDVYGLVDTDSDLVWAQC-TPCQGCYKQKNPMFDPLKECNSFFDHSCS- 88
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
P CDY Y D ++ G L ++ ++G + FGCG+N
Sbjct: 89 -----------PEKACDYVYAYADDSATKGMLAKEIATFSSTDGKPIVESIIFGCGHN-- 135
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNGRGVLFLGDGK 191
N G + D + G + YG R C+ + G + LG+
Sbjct: 136 NTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKR--FSQCLVPFHADPHTSGTISLGEAS 193
Query: 192 -VPSSGVAWTPMLQNSADLKHYI-------------LGPAELLYSGKSCGLKDLTLIFDS 237
V GV TP++ + + +E+L G ++ DS
Sbjct: 194 DVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFNSSEMLSKGN--------IMIDS 245
Query: 238 GASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQV-TEYFK 293
G Y Y +V + + P+ + PD T +C++ G + T +F+
Sbjct: 246 GTPETYLPQEFYDRLVEELKVQINLPPIHVDPDLGT-QLCYKSETNLEGPILTAHFE 301
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 46/121 (38%), Positives = 60/121 (49%), Gaps = 15/121 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKN----IVPCSN 71
F V + G P + DTGSDL+W+QC PC+G C + + + P K+ VPC
Sbjct: 137 FVVVVGFGTPAQTAAIILDTGSDLSWIQCK-PCSGHCYRQHDPDFDPAKSSSYAAVPCGT 195
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P CAA C C Y ++YGDG S+ G L D L F++ S F TFGC
Sbjct: 196 PVCAAAGG----MCN--GTTCLYGVQYGDGSSTTGVLSRDT--LTFNSSSKFTG-FTFGC 246
Query: 132 G 132
G
Sbjct: 247 G 247
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 83/318 (26%), Positives = 129/318 (40%), Gaps = 46/318 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDA--PCT-GCT----KPPEKQYKPHKNIVPC 69
F + +++G PP + TGSDL W+ C + PCT C P E +KN VPC
Sbjct: 98 FLMKISIGIPPTELLVNVATGSDLVWIPCLSFKPCTHNCDLRFFDPMES--STYKN-VPC 154
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSS-IGALVTDLFPLRFSNGSVFNVPLT 128
+ RC N C+ + C Y + S G L D L + G F +P T
Sbjct: 155 DSYRC---QITNAATCQFSD--CFYSCDPRHQDSCPDGDLAMDTLTLNSTTGKSFMLPNT 209
Query: 129 -FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGV 184
F CG P G+LGLG G +S+++++ LI HCI N
Sbjct: 210 GFICGNRIGGDYP-----GVGILGLGHGSLSLLNRISH--LIDGKFSHCIVPYSSNQTSK 262
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT------------ 232
L GD V S ++ L + Y L + G S G K ++
Sbjct: 263 LSFGDKAVVSGSAMFSTRLDMTGGPYSYTLS-----FYGISVGNKSISAGGIGSDYYMNG 317
Query: 233 LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR-GPFKALGQVTEY 291
L DSG + YF Y ++ + + PL P + L +C+R P + +T +
Sbjct: 318 LGMDSGTMFTYFPEYFYSQLEYDVRYAIQQEPLYPDP-TRRLRLCYRYSPDFSPPTITMH 376
Query: 292 FKPLALSFTNRRNSVRLV 309
F+ ++ ++ + +R+
Sbjct: 377 FEGGSVELSSSNSFIRMT 394
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 55/118 (46%), Gaps = 14/118 (11%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALH 78
+GKP + DTGSD+ W+QC PC C E ++P + + C P+C AL
Sbjct: 154 IGKPAREVYMVLDTGSDVNWLQC-TPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNALE 212
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
C+ N C YE+ YGDG ++G T+ + GS + GCG++
Sbjct: 213 V---SECR--NATCLYEVSYGDGSYTVGDFATETLTI----GSTLVQNVAVGCGHSNE 261
>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
Length = 551
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 72/253 (28%), Positives = 97/253 (38%), Gaps = 31/253 (12%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTK-----PPEKQYKPHKNIVPC 69
+ VG P F DTGSDL WV CD AP T PE +
Sbjct: 107 AEVAVGTPNTTFLVALDTGSDLFWVPCDCKQCAPLGNLTAVDGGGGPELRQYSPSKSSTS 166
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSN-------GS 121
CA+ P C C Y + Y SS G LV D+ L G+
Sbjct: 167 KTVTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGELVEDVLYLTREKGAAAAAAGA 226
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQN 180
P+ FGCG Q L G++GLG ++S+ S L G+++ N C ++
Sbjct: 227 AVRTPVVFGCGQVQTG-SFLDGAAADGLMGLGMEKVSVPSILASTGVVKSNSFSMCFSKD 285
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIF----D 236
G G + GD S+ + TP + S + I + S G K+L L F D
Sbjct: 286 GLGRINFGD--TGSADQSETPFIVKSTHSYYNI------SITSMSVGDKNLPLGFYAIAD 337
Query: 237 SGASYAYFTSRVY 249
SG S+ Y Y
Sbjct: 338 SGTSFTYLNDPAY 350
>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
Length = 280
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 66/125 (52%), Gaps = 15/125 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF+ + +G+PP DTGSD++WVQC APC C + + ++P + + C
Sbjct: 132 YFS-RIGIGEPPSQAYMVLDTGSDISWVQC-APCADCYRQADPIFEPTASASYAPLSCEA 189
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
+C L + +C++ N C Y++ YGDG ++G VT+ + + V NV L GC
Sbjct: 190 AQCRYL---DQSQCRNGN--CLYQVSYGDGSYTVGDFVTETVTIGVNK--VKNVAL--GC 240
Query: 132 GYNQH 136
G+N
Sbjct: 241 GHNNE 245
>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
Japonica Group]
gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
Length = 551
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 72/253 (28%), Positives = 97/253 (38%), Gaps = 31/253 (12%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCD----APCTGCTK-----PPEKQYKPHKNIVPC 69
+ VG P F DTGSDL WV CD AP T PE +
Sbjct: 107 AEVAVGTPNTTFLVALDTGSDLFWVPCDCKQCAPLGNLTAVDGGGGPELRQYSPSKSSTS 166
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGG-SSIGALVTDLFPLRFSN-------GS 121
CA+ P C C Y + Y SS G LV D+ L G+
Sbjct: 167 KTVTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGELVEDVLYLTREKGAAAAAAGA 226
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIR-NVIGHCIGQN 180
P+ FGCG Q L G++GLG ++S+ S L G+++ N C ++
Sbjct: 227 AVRTPVVFGCGQVQTG-SFLDGAAADGLMGLGMEKVSVPSILASTGVVKSNSFSMCFSKD 285
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIF----D 236
G G + GD S+ + TP + S + I + S G K+L L F D
Sbjct: 286 GLGRINFGD--TGSADQSETPFIVKSTHSYYNI------SITSMSVGDKNLPLGFYAIAD 337
Query: 237 SGASYAYFTSRVY 249
SG S+ Y Y
Sbjct: 338 SGTSFTYLNDPAY 350
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 58/190 (30%), Positives = 83/190 (43%), Gaps = 23/190 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
F V++ G PP+ F DTGS +TW QC A C C K + H + + S +
Sbjct: 127 FLVDVAFGTPPQKFKLILDTGSSITWTQCKA-CVHCLKDSHR----HFDSLASSTYSFGS 181
Query: 77 LHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
C Y + YGD +S+G D L S+ VF FGCG N
Sbjct: 182 --------CIPSTVGNTYNMTYGDKSTSVGNYGCDTMTLEPSD--VFQ-KFQFGCGRN-- 228
Query: 137 NPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLGDGKVP-S 194
N G G+LGLG+G++S VSQ + V +C+ +N G L G+ S
Sbjct: 229 NEGDFGS-GADGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEENSIGSLLFGEKATSQS 285
Query: 195 SGVAWTPMLQ 204
S + +T ++
Sbjct: 286 SSLKFTSLVN 295
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 59/126 (46%), Gaps = 16/126 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSN 71
YF L VG PPK DTGSD+ W+QC APC C + + P K + + C +
Sbjct: 147 YF-TRLGVGTPPKYVYMVLDTGSDVVWIQC-APCRKCYSQTDPVFDPKKSGSFSSISCRS 204
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
P C L + P C + C Y++ YGDG + G T+ R + VP + G
Sbjct: 205 PLCLRL---DSPGC-NSRQSCLYQVAYGDGSFTFGEFSTETLTFRGT-----RVPKVALG 255
Query: 131 CGYNQH 136
CG++
Sbjct: 256 CGHDNE 261
>gi|297838267|ref|XP_002887015.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297332856|gb|EFH63274.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 324
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 68/152 (44%), Gaps = 20/152 (13%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNPRC 74
++L +G PP+ DTGS L+W+QC P+ + P + +PCS+P C
Sbjct: 76 ISLPIGTPPQAQQMVLDTGSQLSWIQCHR--KKLPPKPKTSFDPSLSSSFSTLPCSHPLC 133
Query: 75 AAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+ P C N C Y Y DG + G LV + + FSN + PL GC
Sbjct: 134 KPRIPDFTLPTSCDS-NRLCHYSYFYADGTFAEGNLVKE--KITFSNTEI-TPPLILGCA 189
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR 164
D G+LG+ RGR+S VSQ +
Sbjct: 190 TESS--------DDRGILGMNRGRLSFVSQAK 213
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 76/288 (26%), Positives = 116/288 (40%), Gaps = 44/288 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + LT+G PP DTGSDL W QC PC GC + ++P ++ +PC +
Sbjct: 50 YLMKLTLGTPPVDVYGLVDTGSDLVWAQC-TPCQGCYRQKSPMFEPLRSNTYTPIPCDSE 108
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGS-VFNVPLTFGC 131
C +L + C P C Y Y D + G L + ++G V + FGC
Sbjct: 109 ECNSLFGHS---CS-PQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVGDIVFGC 164
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCI-----GQNGRGVL 185
G++ N G + D ++GLG G +S+VSQ YG R C+ + G +
Sbjct: 165 GHS--NSGTFNENDMG-IIGLGGGPLSLVSQFGNLYGSKR--FSQCLVPFHADPHTLGTI 219
Query: 186 FLGDGK-VPSSGVAWTPMLQNSADLKHYI-------------LGPAELLYSGKSCGLKDL 231
GD V GVA TP++ + + +E+L G
Sbjct: 220 SFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSSEMLSKGN------- 272
Query: 232 TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWR 279
++ DSG Y Y +V + P+ PD T +C+R
Sbjct: 273 -IMIDSGTPATYLPQEFYDRLVKELKVQSNMLPIDDDPDLGT-QLCYR 318
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 71/255 (27%), Positives = 100/255 (39%), Gaps = 34/255 (13%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
F Y + L V PP DTGS L W++C P P Y +PC
Sbjct: 74 FEYL-MALDVSTPPVRMLALADTGSSLVWLKCKLP--AAHTPASSSYAR----LPCDAFA 126
Query: 74 CAALHWPNPPRCKHP---NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C AL + C+ N+ C Y + DG + G + D F F+ L FG
Sbjct: 127 CKALG--DAASCRATGSGNNICVYRYAFADGSCTAGPVTVDAF--------TFSTRLDFG 176
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-----GQNGRGVL 185
C LS PD G++GL G IS+VSQL + +C+ + L
Sbjct: 177 CATRTEG---LSVPDD-GLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYSSSETVSSSL 232
Query: 186 FLGDGKVPSS--GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLT--LIFDSGASY 241
G + SS G A TP++ + Y + + +GK L+ T LI DSG
Sbjct: 233 NFGSHAIVSSSPGAATTPLVAGR-NKSFYTIALDSIKVAGKPVPLQTTTTKLIVDSGTML 291
Query: 242 AYFTSRVYQEIVSLI 256
Y V +V+ +
Sbjct: 292 TYLPKAVLDPLVAAL 306
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 79/298 (26%), Positives = 122/298 (40%), Gaps = 52/298 (17%)
Query: 13 IFSYFAVNLTVGKP-PKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IV 67
+ S + ++L++G P + DTGSD+ W QC+ PC C P ++ + V
Sbjct: 88 VNSEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCE-PCAECFTQPLPRFDTAASNTVRSV 146
Query: 68 PCSNPRCAALHWPNPPRCKHPN--DQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFN 124
CS+P C A +H C Y YGDG S G + D F G
Sbjct: 147 ACSDPLCNA-------HSEHGCFLHGCTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVT 199
Query: 125 VP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG---QN 180
VP + FGCG +N G +T G+ G GRG +S+ SQL+ +R +C +
Sbjct: 200 VPDIGFGCG--MYNAGRFLQTET-GIAGFGRGPLSLPSQLK----VRQ-FSYCFTTRFEA 251
Query: 181 GRGVLFL---GDGKVPSSG-VAWTPMLQN---SADLKHYILGPAELLYSGKSCGLKDL-- 231
+FL GD K ++G + TP +++ D HY+L + G + G L
Sbjct: 252 KSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLS-----FKGVTVGKTRLPV 306
Query: 232 ---------TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRG 280
DSG F V++++ S + P+ D+ + W G
Sbjct: 307 PEIKADGSGATFIDSGTDITTFPDAVFRQLKSAFIAQ-AALPVNKTADEDDICFSWDG 363
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 79/291 (27%), Positives = 115/291 (39%), Gaps = 47/291 (16%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCS 70
S F VNL++G PP DTGS L WVQC PC C + + P K++ + C
Sbjct: 102 SGFLVNLSIGSPPVTQLVVVDTGSSLLWVQC-LPCINCFQQSTSWFDPLKSVSFKTLGCG 160
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTD-LFPLRFSNGSVFNVPLTF 129
P ++ N +C N Q +Y++ Y G SS G L + L G + +TF
Sbjct: 161 FP---GYNYINGYKCNRFN-QAEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKSNITF 216
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRG-RISIVSQLREYGLIRNVIGHCIGQNG-----RG 183
GCG+ N + GV GLG I++ +QL N +CIG
Sbjct: 217 GCGH--MNIKTNNDDAYNGVFGLGAYPHITMATQL------GNKFSYCIGDINNPLYTHN 268
Query: 184 VLFLGDGKVPSS---------GVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLI 234
L LG G G + + S K + P S G ++
Sbjct: 269 HLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSG----GVL 324
Query: 235 FDSGASYAYFTS----RVYQEIVSLIMRDLIGTPLKLAPDDKTLP-ICWRG 280
DSG +Y + +Y EIV DL+ L+ P + +C++G
Sbjct: 325 IDSGMTYTKLANGGFELLYDEIV-----DLMKGLLERIPTQRKFEGLCFKG 370
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 72/158 (45%), Gaps = 13/158 (8%)
Query: 15 SYFAVNLTVGKP-PKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPC 69
S + ++L +G P P+ DTGSDL W QC CT C P ++ + VPC
Sbjct: 92 SEYLIHLGIGTPRPQRVVLHLDTGSDLVWTQC--ACTVCFDQPVPVFRASVSHTFSRVPC 149
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN--GSVFNVP- 126
S+P C + C + C Y Y D + G + D F + + + VP
Sbjct: 150 SDPLCGHAVYLPLSGCAARDRSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPN 209
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR 164
+ FGCG + L P+ +G+ G G G +S+ SQL+
Sbjct: 210 IRFGCGMMNYG---LFTPNQSGIAGFGTGPLSLPSQLK 244
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 65/199 (32%), Positives = 89/199 (44%), Gaps = 29/199 (14%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + +G PP+ D+GSD+ WVQC PCT C + + P + V CS+
Sbjct: 43 YF-VRIGLGSPPRSQYMVIDSGSDIVWVQCK-PCTQCYHQTDPLFDPADSASFMGVSCSS 100
Query: 72 PRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
C + N RC+ YE+ YGDG + G L L L F V NV +
Sbjct: 101 AVCDRVENAGCNSGRCR-------YEVSYGDGSYTKGTLA--LETLTFGRTVVRNVAI-- 149
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLF 186
GCG++ N G LG G +S + QL G N +C+ G N G L
Sbjct: 150 GCGHS--NRGMFVGAAGLLGLGG--GSMSFMGQLS--GQTGNAFSYCLVSRGTNTNGFLE 203
Query: 187 LGDGKVPSSGVAWTPMLQN 205
G +P G AW P+++N
Sbjct: 204 FGSEAMP-VGAAWIPLVRN 221
>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
Length = 321
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 66/273 (24%), Positives = 109/273 (39%), Gaps = 52/273 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI---VPCSNPR 73
+ +++ +G P K + DTGS +WV C+ C GC P + V C
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 74 CAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
C L + P C+ + C + + Y DG +S G L D L FS+ V +P TFG
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQD--TLTFSD--VQKIPGFTFG 112
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVS---------------QLREYGLIRNVIGH 175
C + D G+LG+G G +S++ Q+ E G G
Sbjct: 113 CNLDSFGANEFGNVD--GLLGMGAGPMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTG- 169
Query: 176 CIGQNGRGVLFLGDGKVPS-SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL-----K 229
+ GKV + + V +T M+ + + + + + G+ GL
Sbjct: 170 ----------YFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFS 219
Query: 230 DLTLIFDSGASYAYFTSR----VYQEIVSLIMR 258
++FDSG+ +Y R + Q I L+++
Sbjct: 220 RKGVVFDSGSELSYIPDRALSVLRQRIRELLLK 252
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 65/251 (25%), Positives = 95/251 (37%), Gaps = 28/251 (11%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI------------ 66
+ +G P F DTGSDL WV CD CT C + ++
Sbjct: 98 TTVQIGTPGVKFMVALDTGSDLFWVPCD--CTRCAATDSSAFASDFDLNVYNPNGSSTSK 155
Query: 67 -VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRFSNG--SV 122
V C+N C + +C C Y + Y +S G LV D+ L + +
Sbjct: 156 KVTCNNSLCM-----HRSQCLGTLSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDL 210
Query: 123 FNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR 182
+ FGCG Q + L G+ GLG +IS+ S L G + C G++G
Sbjct: 211 VEANVIFGCGQIQ-SGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGI 269
Query: 183 GVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
G + GD S TP N + + I + G + + T +FDSG S+
Sbjct: 270 GRISFGDKG--SFDQDETPFNLNPSHPTYNI--TVTQVRVGTTLIDVEFTALFDSGTSFT 325
Query: 243 YFTSRVYQEIV 253
Y Y +
Sbjct: 326 YLVDPTYTRLT 336
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 71/263 (26%), Positives = 99/263 (37%), Gaps = 39/263 (14%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKP--HKNIVPCS 70
S F VN +VG+PP DTGS L W+QC PC C+ + P V CS
Sbjct: 66 SLFFVNFSVGQPPVPQFTIMDTGSSLLWIQCH-PCKHCSSNHMIHPVFNPALSSTFVECS 124
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG-SVFNVPLTF 129
C P +++C YE Y G S G L + NG +V P+ F
Sbjct: 125 ---CDDRFCRYAPNGHCSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAF 181
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-----QNGRGV 184
GCG H G + G+LGLG S+ QL + +CIG G
Sbjct: 182 GCG---HENGEQLESEFTGILGLGAKPTSLAVQL------GSKFSYCIGDLANKNYGYNQ 232
Query: 185 LFLGDGK-----------VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL 233
L LG+ +G+ + + S K + P G G +
Sbjct: 233 LVLGEDADILGDPTPIEFETENGIYYMNLEGISVGDKQLNIEPVVFKRRGSRTG-----V 287
Query: 234 IFDSGASYAYFTSRVYQEIVSLI 256
I D+G Y + Y+E+ + I
Sbjct: 288 ILDTGTLYTWLADIAYRELYNEI 310
>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
Length = 458
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 69/276 (25%), Positives = 121/276 (43%), Gaps = 46/276 (16%)
Query: 18 AVNLTVGKPPKLFDFDFDTGSDLTWVQCDA--PCTGCT-KPPEK------QYKPHKNIVP 68
+ L+ G PP+ F DTGS + W C CT C+ P+K + I+
Sbjct: 88 TIPLSFGTPPQKLSFLMDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILG 147
Query: 69 CSNPRCAALHWPN----PPRCKHPNDQC-----DYEIEYGDGGSSIGALVTDL-FPLRFS 118
C +P+CA PB PRC + +C Y ++YG G +S L+ +L FP
Sbjct: 148 CRDPKCADTSSPBVHLGXPRCNGNSKKCSHACPQYTLQYGTGAASGFFLLENLDFP---- 203
Query: 119 NGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL--REYGLIRNVIGHC 176
G + L GC + P + + G GR S+ Q+ +++ N +
Sbjct: 204 -GKTIHKFLV-GCTTSADR-----EPSSDALAGFGRTMFSLPMQMGVKKFAYCLNSHDYD 256
Query: 177 IGQN-GRGVLFLGDGKVPSSGVAWTPMLQNSADLK-HYILGPAELLYSGKSCGL--KDLT 232
+N G+ +L DG+ + G+++ P +N D +Y LG ++ K + K LT
Sbjct: 257 DTRNSGKLILDYSDGE--TQGLSYAPFXKNPPDYPIYYYLGVKDMKIGNKVLRIPGKYLT 314
Query: 233 --------LIFDSGASYAYFTSRVYQEIVSLIMRDL 260
++ DSG +Y+Y T V++ + + + + +
Sbjct: 315 PGSDSRGGVVIDSGFAYSYMTLPVFKIVTNELKKQM 350
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 91/339 (26%), Positives = 131/339 (38%), Gaps = 66/339 (19%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCS 70
+ + V+L +G PP+ DTGSDL W QC PC C + P ++ C
Sbjct: 87 TEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLSLTSCD 145
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ C L + PR +D F + SV V FG
Sbjct: 146 STLCQGLPVASLPR-------------------------SDKFTFVGAGASVPGV--AFG 178
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG N G +T G+ G GRG +S+ SQL+ G + G VL
Sbjct: 179 CGL--FNNGVFKSNET-GIAGFGRGPLSLPSQLK-VGNFSHCFTTITGAIPSTVLLDLPA 234
Query: 191 KVPSSG---VAWTPMLQNSAD-------LKHYILGPAELLYSGKSCGLKDLT--LIFDSG 238
+ S+G V TP++QN A+ LK +G L LK+ T I DSG
Sbjct: 235 DLFSNGQGAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDSG 294
Query: 239 ASYAYFTSRVYQEIVSLIMRDLIGTPLKL--APDDKTLP-ICWRGPFKALGQVTEYFKPL 295
+ +RVY+ ++RD +KL + T P C P +A Y L
Sbjct: 295 TAMTSLPTRVYR-----LVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRA----KPYVPKL 345
Query: 296 ALSFTNRRNSVRLVVPPEAYL--VISVSTSIIIIAYLTG 332
L F + +P E Y+ V +SI+ +A + G
Sbjct: 346 VLHF----EGATMDLPRENYVFEVEDAGSSILCLAIIEG 380
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 75/300 (25%), Positives = 119/300 (39%), Gaps = 39/300 (13%)
Query: 9 FFFPIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG----------------- 51
F+ F Y A + VG PP F DTGSDL W++C+
Sbjct: 75 LFYGDFEYLAA-VNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPP 133
Query: 52 -CTKPPEKQYKPHKNI----VPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIG 106
+ P + V C P C AL C + CD+ Y DG S+ G
Sbjct: 134 PPPPEAVVYFNPFDSSSYSRVGCDGPSCLAL--ATNASCNGDSHACDFRYSYRDGASATG 191
Query: 107 ALVTDLFPL--RFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL- 163
L D F +N + + FGC G D G++GLG G +S+ SQL
Sbjct: 192 LLAADTFTFGGNINNDTTSTASIDFGCATG--TAGREFQAD--GMVGLGAGPLSLASQLG 247
Query: 164 REYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSAD-LKHYILGPAELLYS 222
R++ + + I + F V G A TP++ +S++ +Y + L +
Sbjct: 248 RKFSFC--LTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVA 305
Query: 223 GKSC-GLKDLT-LIFDSGASYAYFT-SRVYQEIVSLIMRDLIGTPLKLA-PDDKTLPICW 278
G+ G ++ +I D+G + + + + + R + G L A P D+TL +C+
Sbjct: 306 GQPVPGTTSVSKVIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLPRAPPPDETLELCY 365
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 52/155 (33%), Positives = 74/155 (47%), Gaps = 18/155 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + ++G PP+ DTGSDL W +CDA Y P+ + +PCS+
Sbjct: 100 YDMEFSIGTPPQKLTALADTGSDLIWTKCDAGGG-AAWGGSSSYHPNASSTFTRLPCSDR 158
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGS---SIGALVTDLFPLRFSNGSVFNVP-LT 128
CAAL + RC +CDY+ YG G + G L ++ F L G VP +
Sbjct: 159 LCAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTL---GGDA--VPGVG 213
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
FGC + AG++GLGRG +S+VSQL
Sbjct: 214 FGCTTALEG----DYGEGAGLVGLGRGPLSLVSQL 244
>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
Length = 137
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 59/128 (46%), Gaps = 16/128 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
F + L +GKP + DTGSDLTW QC PC+ C K P Y P + V C +
Sbjct: 21 FLMQLAIGKPSLAYSAILDTGSDLTWTQC-MPCSDCYKQPTPIYDPSLSSTYGTVSCKSS 79
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C AL P + C+Y YGD S+ G L + F L S ++P + FGC
Sbjct: 80 LCLAL-----PASACISATCEYLYTYGDYSSTQGILSYETFTL-----SSQSIPHIAFGC 129
Query: 132 GYNQHNPG 139
G + G
Sbjct: 130 GQDNEGSG 137
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 62/205 (30%), Positives = 90/205 (43%), Gaps = 22/205 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ + + +G P DTGSD++WVQC PC+ C + + P + CS+
Sbjct: 122 YVITVGIGSPAVTQTMSMDTGSDVSWVQCK-PCSQCHSEVDSLFDPSSSSTYSPFSCSSA 180
Query: 73 RCAALHWPNPPR-CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
CA L C + QC Y + YGD S+ G +D L GS FGC
Sbjct: 181 PCAQLSQSQEGNGCM--SSQCQYIVNYGDSSSTTGTYSSDTLTL----GSSAMTDFQFGC 234
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGD 189
+Q G + T G++GLG G S+ SQ G +C+ G L LG
Sbjct: 235 --SQSESGGFN-DQTDGLMGLGGGAQSLASQ--TAGTFGTAFSYCLPPTSGSSGFLTLGT 289
Query: 190 GKVPSSGVAWTPMLQNSADLKHYIL 214
G SSG TPML+++ +Y++
Sbjct: 290 G---SSGFVKTPMLRSTQIPTYYVV 311
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 58/123 (47%), Gaps = 15/123 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG---CTKPPEKQYKPHKN----IVPC 69
F V + +G P + FDTGSDL+WVQC PC C + + P K+ V C
Sbjct: 149 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQ-PCGSSGHCHPQQDPLFDPSKSSTYAAVHC 207
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
P+CAA C N C Y + YGDG S+ G L D L S+ ++ P F
Sbjct: 208 GEPQCAAAGG----LCSEDNTTCLYLVHYGDGSSTTGVLSRDTLALT-SSRALAGFP--F 260
Query: 130 GCG 132
GCG
Sbjct: 261 GCG 263
>gi|147859621|emb|CAN83119.1| hypothetical protein VITISV_043393 [Vitis vinifera]
Length = 431
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 74/270 (27%), Positives = 107/270 (39%), Gaps = 41/270 (15%)
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL-------RFS 118
+V C C A++ P C N C Y Y DG SS G V +
Sbjct: 130 LVSCDQDFCYAINGGPPSYCI-ANMSCSYTEIYADGSSSFGYFVKGYCTASKYNSIPHLN 188
Query: 119 NGSVFNVPLTFGCGYNQHNPGPLSPPDTA-GVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
N + VPL C Q G LS + G+LG G+ S++SQL G +R + HC+
Sbjct: 189 NNPLLEVPLR--CSATQ--SGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL 244
Query: 178 -GQNGRGVLFLGDGKVPSSGVAWTPMLQN----SADLKHYILG------PAELLYSGKSC 226
G NG G+ +G P V TP++ N + ++K +G P ++ G
Sbjct: 245 DGLNGGGIFAIGHIVQPK--VNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKK 302
Query: 227 GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG 286
G I DSG + AY VY +++S I + D T F+
Sbjct: 303 G-----TIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFTC-------FQYSE 350
Query: 287 QVTEYFKPLALSFTNRRNSVRLVVPPEAYL 316
+ + F + F NS+ L V P YL
Sbjct: 351 SLDDGFPAVTFHF---ENSLYLKVHPHEYL 377
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 84/335 (25%), Positives = 131/335 (39%), Gaps = 44/335 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV--------- 67
+ + +G P K + DTGS LTW+QC C + + P +
Sbjct: 129 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSAQ 188
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
CS+ A L +P C N C Y+ YGD S+G L D + F + SV N
Sbjct: 189 QCSDLTTATL---SPASCSTSN-VCIYQASYGDSSFSVGYLSKDT--VSFGSTSVPN--F 240
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFL 187
+GCG Q N G +AG++GL R ++S++ QL + +C+ +
Sbjct: 241 YYGCG--QDNEGLFG--QSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGY 294
Query: 188 GDGKVPSSG-VAWTPMLQNSADLKHYILGPAELLYSGK-----SCGLKDLTLIFDSGASY 241
+ G ++TPM +S D Y + + +GK S L I DSG
Sbjct: 295 LSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVI 354
Query: 242 AYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALG--QVTEYFKPLALSF 299
+ VY + + + GTP A L C++G L +VT F A
Sbjct: 355 TRLPTGVYSALSKAVAGAMKGTP--RASAFSILDTCFQGQAARLRVPEVTMAFAGGAALK 412
Query: 300 TNRRNSVRLVVPPEAYLVISVSTSIIIIAYLTGKS 334
RN L++ V ++ +A+ +S
Sbjct: 413 LAARN-----------LLVDVDSATTCLAFAPARS 436
>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
Length = 137
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 59/128 (46%), Gaps = 16/128 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
F + L +GKP + DTGSDLTW QC PC+ C K P Y P + V C +
Sbjct: 21 FLMQLAIGKPSLAYSAILDTGSDLTWTQC-IPCSDCYKQPTPIYDPSLSSTYGTVSCKSS 79
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C AL P + C+Y YGD S+ G L + F L S ++P + FGC
Sbjct: 80 LCLAL-----PASACISATCEYLYTYGDYSSTQGILSYETFTL-----SSQSIPHIAFGC 129
Query: 132 GYNQHNPG 139
G + G
Sbjct: 130 GQDNEGSG 137
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 72/288 (25%), Positives = 118/288 (40%), Gaps = 61/288 (21%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKP---PEK------QYKPHKN 65
++++L +G PP+ F DTGS L W C + C+ C P P K +
Sbjct: 88 YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSSTAK 147
Query: 66 IVPCSNPRCAALHWPNP----PRCKHPNDQC------DYEIEYGDGGSSIGALVTDL-FP 114
++ C NP+C L P+ P+CK P Q Y I+YG G ++ L+ +L FP
Sbjct: 148 LLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGATAGFLLLDNLNFP 207
Query: 115 LRFSNGSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR----EYGLI 169
+ VP GC LS +G+ G GRG+ S+ SQ+ Y L+
Sbjct: 208 GK-------TVPQFLVGCSI-------LSIRQPSGIAGFGRGQESLPSQMNLKRFSYCLV 253
Query: 170 RNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSAD-----------LKHYILG--- 215
+ + + G ++G+++TP N ++ L+ I+G
Sbjct: 254 SHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVD 313
Query: 216 ---PAELLYSGKSCGLKDLTLIFDSGASYAYFTSRVYQEIVSLIMRDL 260
P + L G + I DSG+++ + VY + +R L
Sbjct: 314 VKIPYKFLEPGSD---GNGGTIVDSGSTFTFMERPVYNLVAQEFLRQL 358
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 71/258 (27%), Positives = 103/258 (39%), Gaps = 28/258 (10%)
Query: 34 FDTGSDLTWVQC-DAPCTGCTKPPEKQYKPHKN----IVPCSNPRCAALHWPNPPRCKHP 88
D+ SD+ WVQC P C + Y P ++ CS+P C AL P C
Sbjct: 33 LDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTAL-GPYANGCA-- 89
Query: 89 NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG-SVFNVPLTFGCGYNQHNPGPLSPPDTA 147
N+QC Y + Y DG S+ GA + DL L N S F FGC + + A
Sbjct: 90 NNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFK----FGCSHAEQGS---FDARAA 142
Query: 148 GVLGLGRGRISIVSQ-LREYGLIRNVIGHCI--GQNGRGVLFLGDGKVPSSGVAWTPMLQ 204
G++ LG G S++SQ YG N +CI + G LG + SS TPM++
Sbjct: 143 GIMALGGGPESLLSQTASRYG---NAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVR 199
Query: 205 NSADLKHYILGPAELLYSGKSCGLKDLTL----IFDSGASYAYFTSRVYQEIVSLIMRDL 260
Y + + G+ G+ + DS + YQ + + +
Sbjct: 200 FRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPPTAYQALRAAFRSSM 259
Query: 261 IGTPLKLAPDDKTLPICW 278
T + AP L C+
Sbjct: 260 --TMYRSAPPKGYLDTCY 275
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 77/308 (25%), Positives = 127/308 (41%), Gaps = 49/308 (15%)
Query: 18 AVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP--------HKNIVPC 69
++ + VG PP+ D GSDL W QC P KQ +P +++PC
Sbjct: 108 SLTVGVGTPPQPSKVILDLGSDLLWTQC-----SLVGPTAKQLEPVFDAARSSSFSVLPC 162
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
+ C A + N C + +C YE +YG ++ G L T+ F +G N LTF
Sbjct: 163 DSKLCEAGTFTN-KTCT--DRKCAYENDYGI-MTATGVLATETFTFGAHHGVSAN--LTF 216
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGVL 185
GCG + + + +G+LGL G +S++ Q L +C+ + V+
Sbjct: 217 GCGKLANG----TIAEASGILGLSPGPLSMLKQ-----LAITKFSYCLTPFADRKTSPVM 267
Query: 186 F--LGD-GKVPSSGVAWT-PMLQNSADLKHYILGPAELLYSGKSCGLKDLTL-------- 233
F + D GK ++G T P+L+N + +Y + + K + TL
Sbjct: 268 FGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTG 327
Query: 234 --IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKALGQVTEY 291
+ DS + AY + E+ +M + + DD P+C+ P + +
Sbjct: 328 GTVLDSATTLAYLVEPAFTELKKAVMEGIKLPVANRSVDD--YPVCFELP-RGMSMEGVQ 384
Query: 292 FKPLALSF 299
PL L F
Sbjct: 385 VPPLVLHF 392
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 74/281 (26%), Positives = 115/281 (40%), Gaps = 55/281 (19%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNPRC 74
V+L +G PP+ DTGS L+W+QC PP + P +++PC++P C
Sbjct: 82 VSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHPLC 141
Query: 75 AAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+ P C N C Y Y DG + G+LV + S + PL GC
Sbjct: 142 KPRIPDFTLPTTCDQ-NRLCHYSYFYADGTYAEGSLVREKITFSSSQST---PPLILGCA 197
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGVL 185
+ D G+LG+ GR S SQ + + +C+ G + G
Sbjct: 198 E--------ASTDEKGILGMNLGRRSFASQAKI-----SKFSYCVPTRQARAGLSSTGSF 244
Query: 186 FLGDGKVPSSG-------VAWTPMLQNSADLK--HYILGPAELLYSGKSCGLKDLTL--- 233
+LG+ P+SG + +TP Q S +L Y + P + + G + TL
Sbjct: 245 YLGNN--PNSGRFQYINLLTFTPS-QRSPNLDPLAYTI-PMQGIRMGNARLNISATLFRP 300
Query: 234 --------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLK 266
I DSG+ + Y Y ++ ++R L+G LK
Sbjct: 301 DPSGAGQTIIDSGSEFTYLVDEAYNKVREEVVR-LVGPKLK 340
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 66/247 (26%), Positives = 100/247 (40%), Gaps = 25/247 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ--YKPHKNIVPCSNPRC 74
+ + + +G P DTGSD++WV+C++ P K Y P CS+ C
Sbjct: 129 YVITVGIGSPAVTQTMMIDTGSDVSWVRCNSTDGLTLFDPSKSTTYAPFS----CSSAAC 184
Query: 75 AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYN 134
A L N C N C Y ++YGDG ++ G +D L S+ FGC ++
Sbjct: 185 AQL-GNNGDGCS--NSGCQYRVQYGDGSNTTGTYSSDTLALSASD---TVTDFHFGCSHH 238
Query: 135 QHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGR--GVLFLGDGK 191
+ + G++GLG S+VSQ YG +C+ R G L G
Sbjct: 239 EED---FDGEKIDGLMGLGGDAQSLVSQTAATYG---KSFSYCLPPTNRTSGFLTFGAPN 292
Query: 192 VPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL----IFDSGASYAYFTSR 247
S G TPML+ Y + ++ G G++ L + DSG + R
Sbjct: 293 GTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVLSNGSVMDSGTVITWLPRR 352
Query: 248 VYQEIVS 254
Y + S
Sbjct: 353 AYSALSS 359
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 70/258 (27%), Positives = 101/258 (39%), Gaps = 34/258 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P FDTGSDLTW QC+ PC G C E ++ P + V CS+
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCE-PCLGSCYSQKEPKFNPSSSSTYQNVSCSS 190
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C + C N C Y I YGD + G L + F L +N V + FGC
Sbjct: 191 PMC-----EDAESCSASN--CVYSIVYGDKSFTQGFLAKEKFTL--TNSDVLE-DVYFGC 240
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G N N G G + + N+ +C+ N G L G
Sbjct: 241 GEN--NQGLFDGVAGLLG----LGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFG 294
Query: 189 DGKVPSSGVAWTPM------LQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
+ S V +TP+ D+ +G EL + S + I DSG +
Sbjct: 295 SAGISES-VKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEG--AIIDSGTVFT 351
Query: 243 YFTSRVYQEIVSLIMRDL 260
++VY E+ S+ +
Sbjct: 352 RLPTKVYAELRSVFKEKM 369
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 10/151 (6%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
F VN+++G PP DT SDL W+QC PC C + P ++ N C
Sbjct: 85 FLVNISIGSPPITQLLHMDTASDLLWIQC-LPCINCYAQSLPIFDPSRSYTH-RNETCRT 142
Query: 77 LHWPNPPRCKHPNDQ-CDYEIEYGDGGSSIGALVTD--LFPLRFSNGSVFNV-PLTFGCG 132
+ P + N + C+Y + Y D S G L + LF + S + + FGCG
Sbjct: 143 SQYSMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALHDVVFGCG 202
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
++ + P G+LGLG G S+V +
Sbjct: 203 HDNYG----EPLVGTGILGLGYGEFSLVHRF 229
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 38/118 (32%), Positives = 61/118 (51%), Gaps = 13/118 (11%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALH 78
VG P K + DTGSD+ W+QC PC+ C + + + P + + C + +C +L
Sbjct: 165 VGNPAKSYYMVLDTGSDINWIQCQ-PCSDCYQQSDPIFTPAASSSYSPLTCDSQQCNSLQ 223
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
+ C+ N QC Y++ YGDG + G VT+ S G+V ++ L GCG++
Sbjct: 224 MSS---CR--NGQCRYQVNYGDGSFTFGDFVTETMSFGGS-GTVNSIAL--GCGHDNE 273
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 88/206 (42%), Gaps = 21/206 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P + FDTGS LTW QC+ PC G C K + + P K+ + C++
Sbjct: 140 YYVVVGLGTPKRDLSLIFDTGSYLTWTQCE-PCAGSCYKQQDPIFDPSKSSSYTNIKCTS 198
Query: 72 PRCAALHWPNPPRCKHPND-QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
C C D C Y+++YGD S G L + + ++ FG
Sbjct: 199 SLCTQFRSAG---CSSSTDASCIYDVKYGDNSISRGFLSQERLTITATD---IVHDFLFG 252
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQ--NGRGVLFLG 188
CG Q N G TAG++GL R IS V Q + + +C+ + G L G
Sbjct: 253 CG--QDNEGLFR--GTAGLMGLSRHPISFVQQTSS--IYNKIFSYCLPSTPSSLGHLTFG 306
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYIL 214
++ + +TP S + Y L
Sbjct: 307 ASAATNANLKYTPFSTISGENSFYGL 332
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 65/242 (26%), Positives = 100/242 (41%), Gaps = 30/242 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
++ + + L VG PP + DTGSDL W QC PC C + + P K+
Sbjct: 79 YNIYLMKLQVGTPPFEIAAEIDTGSDLIWTQC-MPCPDCYSQFDPIFDPSKS-------- 129
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGCG 132
N RC C YEI Y D S G L T+ + ++G F + T GCG
Sbjct: 130 ----STFNEQRCH--GKSCHYEIIYEDNTYSKGILATETVTIHSTSGEPFVMAETTIGCG 183
Query: 133 YNQ---HNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLG 188
+ N G S ++G++GL G S++SQ+ +I +C GQ + F
Sbjct: 184 LHNTDLDNSGFAS--SSSGIVGLNMGPRSLISQMDL--PYPGLISYCFSGQGTSKINFGT 239
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL------KDLTLIFDSGASYA 242
+ V G M + +Y+ A + + L +D ++ DSG++
Sbjct: 240 NAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPFHAEDGNIVIDSGSTVT 299
Query: 243 YF 244
YF
Sbjct: 300 YF 301
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 99/242 (40%), Gaps = 30/242 (12%)
Query: 14 FSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPR 73
+S + + L VG PP + DTGSD+ W QC PC C + P K+ S R
Sbjct: 418 YSIYLMKLQVGTPPFEIVAEIDTGSDIIWTQC-MPCPNCYSQFAPIFDPSKS----STFR 472
Query: 74 CAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT-FGCG 132
RC + C YEI Y D S G L T+ + ++G F + T GCG
Sbjct: 473 --------EQRCN--GNSCHYEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKIGCG 522
Query: 133 YNQHN---PGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-GQNGRGVLFLG 188
+ N G S ++G++GL G +S++SQ+ +I +C GQ + F
Sbjct: 523 LDNTNLQYSGFAS--SSSGIVGLNMGPLSLISQMDL--PYPGLISYCFSGQGTSKINFGT 578
Query: 189 DGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGL------KDLTLIFDSGASYA 242
+ V G M + +Y+ A + L +D + DSG +
Sbjct: 579 NAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPFHAEDGNIFIDSGTTLT 638
Query: 243 YF 244
YF
Sbjct: 639 YF 640
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 72/170 (42%), Gaps = 27/170 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ + + +G P DTGSD++WVQC PC+ C + + P + C +
Sbjct: 52 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 110
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC- 131
CA L C + QC Y + YGDG S+ G +D L GS FGC
Sbjct: 111 DCAQLGQEGNG-CSS-SSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRSFQFGCS 164
Query: 132 ----GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
G+N T G++GLG G S+VSQ G + +C+
Sbjct: 165 NVESGFNDQ---------TDGLMGLGGGAQSLVSQ--TAGTLGRAFSYCL 203
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 70/258 (27%), Positives = 101/258 (39%), Gaps = 34/258 (13%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG-CTKPPEKQYKPHKNI----VPCSN 71
+ V + +G P FDTGSDLTW QC+ PC G C E ++ P + V CS+
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCE-PCLGSCYSQKEPKFNPSSSSTYQNVSCSS 190
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
P C + C N C Y I YGD + G L + F L +N V + FGC
Sbjct: 191 PMC-----EDAESCSASN--CVYSIGYGDKSFTQGFLAKEKFTL--TNSDVLE-DVYFGC 240
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G N N G G + + N+ +C+ N G L G
Sbjct: 241 GEN--NQGLFDGVAGLLG----LGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFG 294
Query: 189 DGKVPSSGVAWTPM------LQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGASYA 242
+ S V +TP+ D+ +G EL + S + I DSG +
Sbjct: 295 SAGISES-VKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEG--AIIDSGTVFT 351
Query: 243 YFTSRVYQEIVSLIMRDL 260
++VY E+ S+ +
Sbjct: 352 RLPTKVYAELRSVFKEKM 369
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 64/125 (51%), Gaps = 15/125 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSN 71
YF+ + VG+P K F DTGSD+ W+QC PC+ C + + + P N + C
Sbjct: 157 YFS-RVGVGQPSKPFYMVLDTGSDVNWLQC-KPCSDCYQQSDPIFDPTASSSYNPLTCDA 214
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
+C L C+ N +C Y++ YGDG ++G VT+ + F GSV V + GC
Sbjct: 215 QQCQDLEM---SACR--NGKCLYQVSYGDGSFTVGEYVTE--TVSFGAGSVNRVAI--GC 265
Query: 132 GYNQH 136
G++
Sbjct: 266 GHDNE 270
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 67/129 (51%), Gaps = 17/129 (13%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSN 71
YF + ++VG PP+ DTGSD+ W+QC APC C + + P+K + + CS
Sbjct: 58 YF-IRISVGTPPRRMYLVMDTGSDILWLQC-APCVNCYHQSDAIFDPYKSSTYSTLGCST 115
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG---SVFN-VPL 127
+C L C+ ++C Y+++YGDG + G TD L ++G V N +PL
Sbjct: 116 RQCLNLDIGT---CQA--NKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPL 170
Query: 128 TFGCGYNQH 136
GCG++
Sbjct: 171 --GCGHDNE 177
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 72/258 (27%), Positives = 103/258 (39%), Gaps = 28/258 (10%)
Query: 34 FDTGSDLTWVQC-DAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALHWPNPPRCKHP 88
D+ SD+ WVQC P C + Y P ++ CS+P C AL P C
Sbjct: 163 LDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTAL-GPYANGCA-- 219
Query: 89 NDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG-SVFNVPLTFGCGYNQHNPGPLSPPDTA 147
N+QC Y + Y DG S+ GA + DL L N S F FGC + + A
Sbjct: 220 NNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFK----FGCSHAEQGSFDAR---AA 272
Query: 148 GVLGLGRGRISIVSQL-REYGLIRNVIGHCI--GQNGRGVLFLGDGKVPSSGVAWTPMLQ 204
G++ LG G S++SQ YG N +CI + G LG + SS TPM++
Sbjct: 273 GIMALGGGPESLLSQTASRYG---NAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVR 329
Query: 205 NSADLKHYILGPAELLYSGKSCGLKDLTL----IFDSGASYAYFTSRVYQEIVSLIMRDL 260
Y + + G+ G+ + DS + YQ + S +
Sbjct: 330 FRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPPTAYQALRSAFRSSM 389
Query: 261 IGTPLKLAPDDKTLPICW 278
T + AP L C+
Sbjct: 390 --TMYRSAPPKGYLDTCY 405
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 66/285 (23%), Positives = 119/285 (41%), Gaps = 37/285 (12%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSNPRC---A 75
+G PP+ DT S+LTWVQ CT C+ + P + PC++ C +
Sbjct: 5 IGTPPREVLLLVDTASELTWVQ-GTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCLGRS 63
Query: 76 ALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNV-PLTFGCGYN 134
L + + C C +++ Y DG + G + ++F L+ +G+ + + FGC
Sbjct: 64 KLGFQSA--CNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCASK 121
Query: 135 QHNPGPLSPPD-TAGVLGLGRGRISIVSQL--REYGLIRNVIGHCIGQ-----NGRGVLF 186
P D ++G LGL RG S +Q+ R + + +C N GV+
Sbjct: 122 DLQ----RPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVII 177
Query: 187 LGDGKVPSSGVAWTPMLQN---SADLKHYILG------PAELLYSGKSC----GLKDLTL 233
GD +P+ + + Q ++ + Y +G ELL+ +S L +
Sbjct: 178 FGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGT 237
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICW 278
FDSG + ++ + +V R ++ + + D T +C+
Sbjct: 238 YFDSGTTVSFLVEPAHTALVEAFGRRVLHLN-RTSGSDFTKELCY 281
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 85/353 (24%), Positives = 137/353 (38%), Gaps = 52/353 (14%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCS 70
S + +G PP+ + DTGS+L W QC C + Y P ++ V C+
Sbjct: 69 SQYIAEYLIGDPPQRAEAIIDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVGCN 128
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ CA + +C N C YG G+ G L T+ + V L FG
Sbjct: 129 DAACA---LGSETQCLSDNKTCAVVTGYG-AGNIAGTLATENLTFQSE-----TVSLVFG 179
Query: 131 C-GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE----YGL---IRNVI--GHCIGQN 180
C + +PG L+ +G++GLGRG++S+ SQL + Y L + I H +
Sbjct: 180 CIVVTKLSPGSLN--GASGIIGLGRGKLSLPSQLGDTRFSYCLTPYFEDTIEPSHMVVGA 237
Query: 181 GRGVLFLGDGKVPSSGVAWTPMLQNSAD----------LKHYILGPAELLYSGKSCGLKD 230
G++ +G S+ V P +++ +D L G +L + L+
Sbjct: 238 SAGLI---NGSASSTPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQ 294
Query: 231 LT------LIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLPICWRGPFKA 284
+ DSGA YQ + + + R L ++ +C A
Sbjct: 295 VAPGMWTGTFIDSGAPLTSLVDVAYQALRAELARQLGAALVQPLAGTTGFDLC-----VA 349
Query: 285 LGQVTEYFKPLALSFTNRRNS-VRLVVPPEAYL--VISVSTSIIIIAYLTGKS 334
L PL L F + LVVPP Y V S + +++ + + KS
Sbjct: 350 LKDAERLVPPLVLHFGGGSGTGTDLVVPPANYWAPVDSATACMVVFSSVDRKS 402
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/197 (32%), Positives = 89/197 (45%), Gaps = 25/197 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF V + VG PP+ D+GSD+ WVQC+ PCT C + + P + V C++
Sbjct: 136 YF-VRIGVGSPPRNQYVVMDSGSDIIWVQCE-PCTQCYHQSDPVFNPADSSSFSGVSCAS 193
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C+ H N C +C YE+ YGDG + G L L + F + NV + GC
Sbjct: 194 TVCS--HVDNAA-CH--EGRCRYEVSYGDGSYTKGTLA--LETITFGRTLIRNVAI--GC 244
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G+ HN G LG G +S V QL G +C+ G G+L G
Sbjct: 245 GH--HNQGMFVGAAGLLGLGG--GPMSFVGQLG--GQTGGAFSYCLVSRGIESSGLLEFG 298
Query: 189 DGKVPSSGVAWTPMLQN 205
+P G AW P++ N
Sbjct: 299 REAMP-VGAAWVPLIHN 314
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 58/123 (47%), Gaps = 15/123 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG---CTKPPEKQYKPHKN----IVPC 69
F V + +G P + FDTGSDL+WVQC PC C + + P K+ V C
Sbjct: 144 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQ-PCGSSGHCHPQQDPLFDPSKSSTYAAVHC 202
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
P+CAA C N C Y + YGDG S+ G L D L S+ ++ P F
Sbjct: 203 GEPQCAAAG----DLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLALT-SSRALTGFP--F 255
Query: 130 GCG 132
GCG
Sbjct: 256 GCG 258
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 61/124 (49%), Gaps = 12/124 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YFA + +G P + + + DTGSD+TW+QC APC+ C + Y P + V C +
Sbjct: 45 YFA-RMGIGSPQRSYYLELDTGSDVTWIQC-APCSSCYSQVDPIYDPSNSSSYRRVYCGS 102
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C AL + C+ C Y + YGD +S G L + F L N S + FGC
Sbjct: 103 ALCQALDYSA---CQGMG--CSYRVVYGDSSASSGDLGIESFYLG-PNSSTAMRNIAFGC 156
Query: 132 GYNQ 135
G++
Sbjct: 157 GHSN 160
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 72/170 (42%), Gaps = 27/170 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ + + +G P DTGSD++WVQC PC+ C + + P + C +
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 186
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC- 131
CA L C + QC Y + YGDG S+ G +D L GS FGC
Sbjct: 187 ACAQLGQEG-NGCSS-SSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVKSFQFGCS 240
Query: 132 ----GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
G+N T G++GLG G S+VSQ G + +C+
Sbjct: 241 NVESGFNDQ---------TDGLMGLGGGAQSLVSQ--TAGTLGRAFSYCL 279
>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 553
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 68/254 (26%), Positives = 96/254 (37%), Gaps = 32/254 (12%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ-------------YKPH-- 63
+ +G P F DTGSDL WV CD CT C+ Y P+
Sbjct: 103 TTIELGTPGVKFMVALDTGSDLFWVPCD--CTRCSATRSSAFASALASDFDLSVYNPNGS 160
Query: 64 --KNIVPCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSI-GALVTDLFPLRF--S 118
V C+N C + +C C Y + Y +S G LV D+ L
Sbjct: 161 STSKKVTCNNSLCT-----HRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDD 215
Query: 119 NGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG 178
N + + FGCG Q + L G+ GLG +IS+ S L G + C G
Sbjct: 216 NHDLVEANVIFGCGQVQ-SGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFG 274
Query: 179 QNGRGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSG 238
++G G + GD S TP N + + I + G + + T +FDSG
Sbjct: 275 RDGIGRISFGDKG--SLDQDETPFNVNPSHPTYNI--TINQVRVGTTLIDVEFTALFDSG 330
Query: 239 ASYAYFTSRVYQEI 252
S+ Y Y +
Sbjct: 331 TSFTYLVDPTYSRL 344
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 76/293 (25%), Positives = 121/293 (41%), Gaps = 62/293 (21%)
Query: 34 FDTGSDLTWVQCDAPCT---GCTKPPEK---------QYKPHKNIVPCSNPRCAALHWPN 81
DTGSDL WV PCT C PE + ++V C++ C L+ N
Sbjct: 1 MDTGSDLVWV----PCTRNYSCINCPEDSASNGVFLPRMSSSLHLVTCADSNCKTLYGNN 56
Query: 82 PP----RCKHPNDQCD-----YEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
C C Y I+YG G S+ G L+T+ L NG F G
Sbjct: 57 TELLCQSCAGSLKNCSETCPPYGIQYGRG-STAGLLLTETLNLPLENGEGARAITHFAVG 115
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG------QNGRGVLF 186
+ +S +G+ G GRG +S+ SQL E+ + ++ +C+ +N + ++
Sbjct: 116 CS-----IVSSQQPSGIAGFGRGALSMPSQLGEH-IGKDRFAYCLQSHRFDEENKKSLMV 169
Query: 187 LGDGKVPSS-GVAWTPMLQNSAD------LKHYILGPAELLYSGKSCGLKDL-------- 231
LGD +P++ + +TP L NS +Y +G + GK LK L
Sbjct: 170 LGDKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKR--LKQLPSKLLRFD 227
Query: 232 -----TLIFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT-LPICW 278
I DSG ++ F+ +++ I + IG +DKT + +C+
Sbjct: 228 TKGNGGTIIDSGTTFTVFSDEIFKHIAAGFASQ-IGYRRAGEVEDKTGMGLCY 279
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 72/170 (42%), Gaps = 27/170 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ + + +G P DTGSD++WVQC PC+ C + + P + C +
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 186
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC- 131
CA L C + QC Y + YGDG S+ G +D L GS FGC
Sbjct: 187 DCAQLGQEG-NGCSS-SSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRSFQFGCS 240
Query: 132 ----GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
G+N T G++GLG G S+VSQ G + +C+
Sbjct: 241 NVESGFNDQ---------TDGLMGLGGGAQSLVSQ--TAGTLGRAFSYCL 279
>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
Length = 458
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 70/276 (25%), Positives = 121/276 (43%), Gaps = 46/276 (16%)
Query: 18 AVNLTVGKPPKLFDFDFDTGSDLTWVQCDA--PCTGCT-KPPEK------QYKPHKNIVP 68
+ L+ G PP+ F DTGS + W C CT C+ P+K + I+
Sbjct: 88 TIPLSFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILG 147
Query: 69 CSNPRCAALHWPNP----PRCKHPNDQC-----DYEIEYGDGGSSIGALVTDL-FPLRFS 118
C +P+CA P+ PRC + +C Y ++YG G +S L+ +L FP
Sbjct: 148 CRDPKCANTSSPDVHLGCPRCNGNSKKCSHACPQYTLQYGTGAASGFFLLENLDFP---- 203
Query: 119 NGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL--REYGLIRNVIGHC 176
G + L GC + P + + G GR S+ Q+ +++ N +
Sbjct: 204 -GKTIHKFLV-GCTTSADR-----EPSSDALAGFGRTMFSLPMQMGVKKFAYCLNSHDYD 256
Query: 177 IGQN-GRGVLFLGDGKVPSSGVAWTPMLQNSADLK-HYILGPAELLYSGKSCGL--KDLT 232
+N G+ +L DG+ + G+++ P L+N D +Y LG ++ K + K LT
Sbjct: 257 DTRNSGKLILDYSDGE--TQGLSYAPFLKNPPDYPFYYYLGVKDMKIGNKLLRIPGKYLT 314
Query: 233 --------LIFDSGASYAYFTSRVYQEIVSLIMRDL 260
++ DSG +Y Y T V++ + + + + +
Sbjct: 315 PGSDSRGGVMIDSGFAYGYMTLPVFKIVTNELKKQM 350
>gi|357152658|ref|XP_003576193.1| PREDICTED: F-box/FBD/LRR-repeat protein At5g22660-like
[Brachypodium distachyon]
Length = 594
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 51/101 (50%), Gaps = 7/101 (6%)
Query: 56 PEKQYKPHK-NIVPCSNPRCAALH--WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDL 112
P YKP + N + C + RC +H C +QCDYEIEY +G +S+G L+ D
Sbjct: 383 PHDLYKPRRMNKLLCGDERCVKVHKDLDIEQDCTLDPNQCDYEIEYTNGENSMGVLLADT 442
Query: 113 FPLRFSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLG 153
F L + N L FGCGY ++P D GVL +G
Sbjct: 443 FSLPTTTNDRLN--LAFGCGYGHQGGQEVTPVD--GVLRIG 479
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/256 (26%), Positives = 108/256 (42%), Gaps = 27/256 (10%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF + +++G P DTGSDLTWVQC PC C + + P ++ + C +
Sbjct: 94 YF-MKMSIGTPLVEVIVIADTGSDLTWVQC-LPCDPCYRQKSPLFDPSRSSSYRHMLCGS 151
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPL-RFSNGSVFNVPLTFG 130
C AL + C + C+Y YGD + G L T+ F + S+ V P+ FG
Sbjct: 152 RFCNALDV-SEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGSTSSRPVHLSPIVFG 210
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRGV 184
CG N G + V G +S+VSQL +I+ +C+ +
Sbjct: 211 CGTG--NGGTFDELGSGIVGLGGGA-LSLVSQLS--SIIKGKFSYCLVPLSEQSNVTSKI 265
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGK--SCGLKDLTLIFD 236
F D + V TP++ D +Y+ +G L Y+ + ++ +I D
Sbjct: 266 KFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGNVEKGNVIID 325
Query: 237 SGASYAYFTSRVYQEI 252
SG + + S + E+
Sbjct: 326 SGTTLTFLDSEFFTEL 341
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 71/162 (43%), Gaps = 19/162 (11%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----V 67
P + + VG P DT SDLTW+QC PC C + P + +
Sbjct: 136 PTSGDYIAKIAVGTPAVEALLALDTASDLTWLQCQ-PCRRCYPQSGPVFDPRHSTSYGEM 194
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDG------GSSIGALVTDLFPLRFSNGS 121
P C AL K C Y + YGDG +S+G LV + L F+ G
Sbjct: 195 NYDAPDCQALGRSGGGDAK--RGTCIYTVLYGDGDGHGSTSTSVGDLVEET--LTFAGG- 249
Query: 122 VFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
V L+ GCG++ N G P AG+LGL RG+ISI Q+
Sbjct: 250 VRQAYLSIGCGHD--NKGLFGAP-AAGILGLSRGQISIPHQI 288
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 70/158 (44%), Gaps = 38/158 (24%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + L +G PP F DTGSDLTW QC PC C Y + +PCS+
Sbjct: 83 YLMELAIGTPPVPFIALADTGSDLTWTQC-KPCKLCFGQDTPIYDTTTSSSFSPLPCSSA 141
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDG-------GSSIGALVTDLFPLRFSNGSVFNV 125
C + W + RC P+ C Y Y DG G S+G +
Sbjct: 142 TCLPI-WSS--RCSTPSATCRYRYAYDDGAYSPECAGISVGGIA---------------- 182
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
FGCG + G LS ++ G +GLGRG +S+V+QL
Sbjct: 183 ---FGCGVDN---GGLS-YNSTGTVGLGRGSLSLVAQL 213
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 72/170 (42%), Gaps = 27/170 (15%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNP 72
+ + + +G P DTGSD++WVQC PC+ C + + P + C +
Sbjct: 198 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 256
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC- 131
CA L C + QC Y + YGDG S+ G +D L GS FGC
Sbjct: 257 DCAQLGQEGNG-CS-SSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRSFQFGCS 310
Query: 132 ----GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI 177
G+N T G++GLG G S+VSQ G + +C+
Sbjct: 311 NVESGFNDQ---------TDGLMGLGGGAQSLVSQ--TAGTLGRAFSYCL 349
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/223 (26%), Positives = 92/223 (41%), Gaps = 22/223 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
+ V + +G P K FDTGSD+TW QC C K E+ + P ++ + ++
Sbjct: 149 YIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSS 208
Query: 77 LHWP------NPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFG 130
+ N P C + C Y I+YGD S+G T+ L ++ FN + FG
Sbjct: 209 ICNSLTSATGNTPGC--ASSACVYGIQYGDSSFSVGFFGTE--KLTLTSTDAFN-NIYFG 263
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVLFLGDG 190
CG N S R ++S+VSQ + + +C+ + FL G
Sbjct: 264 CGQNNQGLFGGSAGLLGLG----RDKLSVVSQTAQK--YNKIFSYCLPSSSSSTGFLTFG 317
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTL 233
S +TP+ SA Y L ++G S G K L +
Sbjct: 318 GSASKNAKFTPLSTISAGPSFY-----GLDFTGISVGGKKLAI 355
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 71/149 (47%), Gaps = 21/149 (14%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI--------VPCSNPRC 74
+G PP+ DTGS+L W QC C K KQ P+ N+ VPC++
Sbjct: 90 IGDPPQRAAALIDTGSNLIWTQCGTTCG--LKACAKQDLPYYNLSRSSTFAAVPCADS-- 145
Query: 75 AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC-GY 133
A L N + C + YG GS G+L T+ F F +G+ L FGC
Sbjct: 146 AKLCAANGVHLCGLDGSCTFAASYG-AGSVFGSLGTEAFT--FQSGA---AKLGFGCVSL 199
Query: 134 NQHNPGPLSPPDTAGVLGLGRGRISIVSQ 162
+ G L+ +G++GLGRGR+S+VSQ
Sbjct: 200 TRITKGALN--GASGLIGLGRGRLSLVSQ 226
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 70/273 (25%), Positives = 106/273 (38%), Gaps = 40/273 (14%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCSNPRC 74
V+L +G PP+ DTGS L+W+QC PP + P +++PC++P C
Sbjct: 79 VSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPR-KPPPSTVFDPSLSSSFSVLPCNHPLC 137
Query: 75 AAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+ P C N C Y Y DG + G LV + S + PL GC
Sbjct: 138 KPRIPDFTLPTSCDL-NRLCHYSYFYADGTLAEGNLVREKITFSTSQST---PPLILGCA 193
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI--GQNGRGVLFLGDG 190
+ D G+LG+ GR+S SQ + V + G G +LG+
Sbjct: 194 EDAS--------DDKGILGMNLGRLSFASQAKITKFSYCVPTRQVRPGFTPTGSFYLGEN 245
Query: 191 KVPSSGVAWTPMLQNSADLKHYILGP--AELLYSGKSCGLKDLTL--------------- 233
S+G + +L S + L P + G G K L +
Sbjct: 246 P-NSAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQS 304
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLK 266
+ DSG+ + Y Y ++ ++R L G LK
Sbjct: 305 MIDSGSEFTYLVDVAYNKVREEVVR-LAGPRLK 336
>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
Length = 472
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/238 (27%), Positives = 100/238 (42%), Gaps = 26/238 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ---YKPHKNI----VPC 69
F + +++GKPP + DTGS L+WVQC C K + P ++ V C
Sbjct: 114 FLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRC 173
Query: 70 SNPRCAALHWP---NPPRCKHPNDQCDYEIEYGDGGS-SIGALVTDLFPLRFSNGSVFNV 125
S+ +C L + C D C Y + YG+G + S+G +VTD + G F +
Sbjct: 174 SSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRI----GDSF-M 228
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYG--LIRNVIGHCI--GQNG 181
L FGC + AG+ G G S QL Y L + +C+ +
Sbjct: 229 DLMFGCSMDVKY-----SEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCLPTDETK 283
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
G + LG + +TP+ + S + Y L L+ +G+ +I DSGA
Sbjct: 284 PGYMILGRYDRAAMDGGYTPLFR-SINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 340
>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
Length = 474
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/238 (27%), Positives = 100/238 (42%), Gaps = 26/238 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ---YKPHKNI----VPC 69
F + +++GKPP + DTGS L+WVQC C K + P ++ V C
Sbjct: 116 FLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRC 175
Query: 70 SNPRCAALHWP---NPPRCKHPNDQCDYEIEYGDGGS-SIGALVTDLFPLRFSNGSVFNV 125
S+ +C L + C D C Y + YG+G + S+G +VTD + G F +
Sbjct: 176 SSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRI----GDSF-M 230
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYG--LIRNVIGHCI--GQNG 181
L FGC + AG+ G G S QL Y L + +C+ +
Sbjct: 231 DLMFGCSMDVKY-----SEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCLPTDETK 285
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
G + LG + +TP+ + S + Y L L+ +G+ +I DSGA
Sbjct: 286 PGYMILGRYDRAAMDGGYTPLFR-SINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 342
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 51/153 (33%), Positives = 67/153 (43%), Gaps = 22/153 (14%)
Query: 13 IFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNP 72
+FSY +G PP+ D SDL W C G T P VPC++
Sbjct: 101 VFSY-----GIGTPPQQVSGALDISSDLVWTAC-----GATAPFNPVRSTTVADVPCTDD 150
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGS-SIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C P C +C Y YG G + + G L T+ F F + + V FGC
Sbjct: 151 ACQQFA---PQTCGAGASECAYTYMYGGGAANTTGLLGTEAF--TFGDTRIDGV--VFGC 203
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLR 164
G N G S +GV+GLGRG +S+VSQL+
Sbjct: 204 GLK--NVGDFS--GVSGVIGLGRGNLSLVSQLQ 232
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 88/197 (44%), Gaps = 25/197 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF V + VG PP+ D+GSD+ WVQC PC C K + + P K+ V C +
Sbjct: 131 YF-VRIGVGSPPRDQYMVIDSGSDMVWVQCQ-PCKLCYKQSDPVFDPAKSGSYTGVSCGS 188
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C + C + C YE+ YGDG + G L L L F+ V NV + GC
Sbjct: 189 SVCDRIENSG---CH--SGGCRYEVMYGDGSYTKGTLA--LETLTFAKTVVRNVAM--GC 239
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G+ N G +G G +S V QL G G+C+ G + G L G
Sbjct: 240 GH--RNRGMFIGAAGLLGIGG--GSMSFVGQLS--GQTGGAFGYCLVSRGTDSTGSLVFG 293
Query: 189 DGKVPSSGVAWTPMLQN 205
+P G +W P+++N
Sbjct: 294 REALP-VGASWVPLVRN 309
>gi|222624645|gb|EEE58777.1| hypothetical protein OsJ_10300 [Oryza sativa Japonica Group]
Length = 431
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 61/206 (29%), Positives = 86/206 (41%), Gaps = 29/206 (14%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAA 76
V + VG PP+ DTGS+L+W+ C+ G PP + S R
Sbjct: 55 LTVPVAVGTPPQNVTMVLDTGSELSWLLCN----GSYAPP---------LTRRSTRRWRG 101
Query: 77 LHWPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC---- 131
P PP C P++ C + Y D S+ G L TD F L V FGC
Sbjct: 102 RDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTF-LLTGGAPPVAVGAYFGCITSY 160
Query: 132 ----GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIG-QNGRGVLF 186
N + G G+LG+ RG +S V+Q G R +CI G GVL
Sbjct: 161 SSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQT---GTRR--FAYCIAPGEGPGVLL 215
Query: 187 LGDGKVPSSGVAWTPMLQNSADLKHY 212
LGD + + +TP+++ S L ++
Sbjct: 216 LGDDGGVAPPLNYTPLIEISQPLPYF 241
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/358 (22%), Positives = 144/358 (40%), Gaps = 64/358 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKPPEK---------QYKPHKN 65
++ L+ G P + FDTGS L W C + C+ C+ P +
Sbjct: 81 YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSK 140
Query: 66 IVPCSNPRCAALHWPN-PPRCKHPNDQCD--------YEIEYGDGGSSIGALVTDLFPLR 116
+V C NP+C+ + P+ +C+ N + + Y ++YG GS+ G L+++ L
Sbjct: 141 LVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGS-GSTAGLLLSET--LD 197
Query: 117 FSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
F + + N GC + LS +G+ G GRG S+ SQ+ GL + +C
Sbjct: 198 FPDKKIPN--FVVGCSF-------LSIHQPSGIAGFGRGSESLPSQM---GLKK--FAYC 243
Query: 177 IGQNG------RGVLFLGDGKVPSSGVAWTPMLQ-----NSADLKHYILGPAELLYSGKS 225
+ G L L V SSG+ +TP Q N+A ++Y L +++ ++
Sbjct: 244 LASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQA 303
Query: 226 CGLKDLTL----------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP 275
+ L I DSG+++ + V + + + L A D +TL
Sbjct: 304 VKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLAN--WTRATDVETL- 360
Query: 276 ICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYLTGK 333
R F + + F L F + + +P Y + S+ + + +T +
Sbjct: 361 TGLRPCFDISKEKSVKFPELIFQF---KGGAKWALPLNNYFALVSSSGVACLTVVTHQ 415
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 54/118 (45%), Gaps = 14/118 (11%)
Query: 23 VGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALH 78
+G P + DTGSD+ W+QC PC C E ++P + + C P+C AL
Sbjct: 157 IGNPAREVYMVLDTGSDVNWLQC-TPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNALE 215
Query: 79 WPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
C+ N C YE+ YGDG ++G T+ + GS + GCG++
Sbjct: 216 V---SECR--NATCLYEVSYGDGSYTVGDFATETLTI----GSTLVQNVAVGCGHSNE 264
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 83/325 (25%), Positives = 129/325 (39%), Gaps = 57/325 (17%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIV----PCSN 71
Y N T+G PP+ D +L W QC C+ C K + P+ + PC
Sbjct: 66 YNVANFTIGTPPQPASAIIDVAGELVWTQCSM-CSRCFKQDLPLFVPNASSTFRPEPCGT 124
Query: 72 PRCAALHWPNPPRCKHPNDQCDYE--IEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
C ++ P ++ C YE I GG ++G + TD F + + S L F
Sbjct: 125 DACKSI-----PTSNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIGTATAS-----LGF 174
Query: 130 GC----GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGVL 185
GC G + GP +G++GLGR S+VSQ+ + H G+N R L
Sbjct: 175 GCVVASGIDTMG-GP------SGLIGLGRAPSSLVSQMNITKFSYCLTPHDSGKNSR--L 225
Query: 186 FLGDGKVPSSG--VAWTPMLQNS--ADLKHYILGPAELLYSGKSCGLKDL-------TLI 234
LG + G TP ++ S D+ Y P +L G G + T++
Sbjct: 226 LLGSSAKLAGGGNSTTTPFVKTSPGDDMSQYY--PIQL--DGIKAGDAAIALPPSGNTVL 281
Query: 235 FDSGASYAYFTSRVYQEIVSLIMRDLIGTPLK--LAPDDKTLPICWRGPFKALGQVTEYF 292
+ A ++ YQ + + + + P L P D +C+ P L +
Sbjct: 282 VQTLAPMSFLVDSAYQALKKEVTKAVGAAPTATPLQPFD----LCF--PKAGLSNASAP- 334
Query: 293 KPLALSFTNRRNSVRLVVPPEAYLV 317
L FT ++ + L VPP YL+
Sbjct: 335 ---DLVFTFQQGAAALTVPPPKYLI 356
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 61/124 (49%), Gaps = 12/124 (9%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YFA + +G P + + + DTGSD+TW+QC APC+ C + Y P + V C +
Sbjct: 12 YFA-RMGIGNPQRSYYLELDTGSDVTWIQC-APCSSCYSQVDPIYDPSNSSSYRRVYCGS 69
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C AL + C+ C Y + YGD +S G L + F L N S + FGC
Sbjct: 70 ALCQALDY---SACQGMG--CSYRVVYGDSSASSGDLGIESFYLG-PNSSTAMRNIAFGC 123
Query: 132 GYNQ 135
G++
Sbjct: 124 GHSN 127
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/358 (22%), Positives = 144/358 (40%), Gaps = 64/358 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKPPEK---------QYKPHKN 65
++ L+ G P + FDTGS L W C + C+ C+ P +
Sbjct: 81 YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSK 140
Query: 66 IVPCSNPRCAALHWPN-PPRCKHPNDQCD--------YEIEYGDGGSSIGALVTDLFPLR 116
+V C NP+C+ + P+ +C+ N + + Y ++YG GS+ G L+++ L
Sbjct: 141 LVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGS-GSTAGLLLSET--LD 197
Query: 117 FSNGSVFNVPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
F + + N GC + LS +G+ G GRG S+ SQ+ GL + +C
Sbjct: 198 FPDKXIPN--FVVGCSF-------LSIHQPSGIAGFGRGSESLPSQM---GLKK--FAYC 243
Query: 177 IGQNG------RGVLFLGDGKVPSSGVAWTPMLQ-----NSADLKHYILGPAELLYSGKS 225
+ G L L V SSG+ +TP Q N+A ++Y L +++ ++
Sbjct: 244 LASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQA 303
Query: 226 CGLKDLTL----------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTLP 275
+ L I DSG+++ + V + + + L A D +TL
Sbjct: 304 VKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLAN--WTRATDVETL- 360
Query: 276 ICWRGPFKALGQVTEYFKPLALSFTNRRNSVRLVVPPEAYLVISVSTSIIIIAYLTGK 333
R F + + F L F + + +P Y + S+ + + +T +
Sbjct: 361 TGLRPCFDISKEKSVKFPELIFQF---KGGAKWALPLNNYFALVSSSGVACLTVVTHQ 415
>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 417
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 89/350 (25%), Positives = 142/350 (40%), Gaps = 74/350 (21%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVP-------- 68
+ ++L +G PPK+ DTGSDLTWV C C Y+ +K +
Sbjct: 12 YLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDC--NDYRNNKLMSTYSPSYSSS 69
Query: 69 -----CSNPRCAALHWPNPP-----------------RCKHPNDQCDYEIEYGDGGSSIG 106
C +P C+ +H + C P Y YG GG IG
Sbjct: 70 SLRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAY--TYGAGGVVIG 127
Query: 107 ALVTDLFPLRFSNGS-VFNVP-LTFGC-GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
L D S+ S VP FGC G P G+ G GRG +S+ SQL
Sbjct: 128 TLTRDTLTTHGSSPSFTREVPNFCFGCVGSTYREP--------IGIAGFGRGVLSLPSQL 179
Query: 164 REYGLIRNVIGHCI-------GQNGRGVLFLGDGKVPSSG-VAWTPMLQNSADLKHYILG 215
G ++ HC N L +GD + S+ + +T +L+N +Y +G
Sbjct: 180 ---GFLQKGFSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIG 236
Query: 216 PAELLYSGKSCGLK------------DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIGT 263
E + G + ++ + +I DSG +Y + Y +++S+ ++ +I
Sbjct: 237 -LEAITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSM-LQSIITY 294
Query: 264 PLKLAPDDKT-LPICWRGPFKALGQVTEYFKPL-ALSFTNRRNSVRLVVP 311
P + +T +C+R P VT++ L ++SF + N+V LV+P
Sbjct: 295 PRAQEQEARTGFDLCYRIPCPN-NVVTDHDHLLPSISF-HFSNNVSLVLP 342
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 112/264 (42%), Gaps = 29/264 (10%)
Query: 34 FDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALHWPNPPRCKHPN 89
FDTGSDL+W+QC PC C + P ++ VPC + C +P R +
Sbjct: 105 FDTGSDLSWLQC-TPCKTCYPQEAPLFDPTQSSTYVDVPCESQPCTL--FPQNQRECGSS 161
Query: 90 DQCDYEIEYGDGGSSIGALVTDLFPLRFSN------GSVFNVPLTFGCGYNQHNPGPLSP 143
QC Y +YG +IG L D + FS+ G+ F + FGC + + +S
Sbjct: 162 KQCIYLHQYGTDSFTIGRLGYDT--ISFSSTGMGQGGATFPKSV-FGCAFYSNFTFKIS- 217
Query: 144 PDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLGDGKVPSSGVAWT 200
G +GLG G +S+ SQL + I + +C+ G L G P++ V T
Sbjct: 218 TKANGFVGLGPGPLSLASQLGDQ--IGHKFSYCMVPFSSTSTGKLKFGS-MAPTNEVVST 274
Query: 201 PMLQNSADLKHYILGPAELLYSGKSC--GLKDLTLIFDSGASYAYFTSRVYQEIVSLIMR 258
P + N + +Y+L + K G +I DS + +Y + +S +
Sbjct: 275 PFMINPSYPSYYVLNLEGITVGQKKVLTGQIGGNIIIDSVPILTHLEQGIYTDFISSVKE 334
Query: 259 DLIGTPLKLAPDDKT-LPICWRGP 281
+ +++A D T C R P
Sbjct: 335 AI---NVEVAEDAPTPFEYCVRNP 355
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 88/202 (43%), Gaps = 18/202 (8%)
Query: 12 PIFSYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----V 67
P + + VG P DT SDLTW+QC PC C + P + +
Sbjct: 133 PTSGEYIAKIAVGTPGVEALLALDTASDLTWLQCQ-PCRRCYPQSGPVFDPRHSTSYREM 191
Query: 68 PCSNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP- 126
+ C AL K C Y + YGDG +++G + + L F+ G +P
Sbjct: 192 SFNAADCQALGRSGGGDAKR--GTCVYTVGYGDGSTTVGDFIEET--LTFAGG--VRLPR 245
Query: 127 LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNG--RGV 184
++ GCG++ N G P AG+LGLGRG +S +Q+ G + + G
Sbjct: 246 ISIGCGHD--NKGLFGAP-AAGILGLGRGLMSFPNQIDHNGTFSYCLVDFLSGPGSLSST 302
Query: 185 LFLGDGKVPSS-GVAWTPMLQN 205
L G G V +S V++TP + N
Sbjct: 303 LTFGAGAVDTSPPVSFTPTVLN 324
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 59/196 (30%), Positives = 82/196 (41%), Gaps = 19/196 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTK---PPEKQ--YKPHKNIVPCSN 71
+ + +++G P DTGSD++WV C A + P K Y P CS+
Sbjct: 125 YVITVSIGTPAMTQAVMIDTGSDVSWVHCHARAGAGSSLFFDPGKSSTYTPFS----CSS 180
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L + C N C Y + YGDG ++ G +D L S V N FGC
Sbjct: 181 AACTRLEGRD-NGCSL-NSTCQYTVRYGDGSNTTGTYGSDTLALN-STEKVEN--FQFGC 235
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLRE-YGLIRNVIGHCIGQNGRGVLFLGDG 190
L T G++GLG G S+VSQ YG + +C+ R FL G
Sbjct: 236 SETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYG---SAFSYCLPATTRSSGFLTLG 292
Query: 191 -KVPSSGVAWTPMLQN 205
+SG TPM ++
Sbjct: 293 ASTGTSGFVTTPMFRS 308
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 88/197 (44%), Gaps = 25/197 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF V + VG PP+ D+GSD+ WVQC PC C K + + P K+ V C +
Sbjct: 132 YF-VRIGVGSPPRDQYMVIDSGSDMVWVQCQ-PCKLCYKQSDPVFDPAKSGSYTGVSCGS 189
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C + C + C YE+ YGDG + G L L L F+ V NV + GC
Sbjct: 190 SVCDRIENSG---CH--SGGCRYEVMYGDGSYTKGTLA--LETLTFAKTVVRNVAM--GC 240
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI---GQNGRGVLFLG 188
G+ N G +G G +S V QL G G+C+ G + G L G
Sbjct: 241 GH--RNRGMFIGAAGLLGIGG--GSMSFVGQLS--GQTGGAFGYCLVSRGTDSTGSLVFG 294
Query: 189 DGKVPSSGVAWTPMLQN 205
+P G +W P+++N
Sbjct: 295 REALP-VGASWVPLVRN 310
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 80/330 (24%), Positives = 128/330 (38%), Gaps = 60/330 (18%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP----HKNIVPCSNPRC 74
V+L +G PP+ DTGS L+W+QC P K P + P +++PC++ C
Sbjct: 80 VSLPIGTPPQTQQMVLDTGSQLSWIQCKVP----PKTPPTAFDPLLSSSFSVLPCNHSLC 135
Query: 75 AAL--HWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCG 132
+ P C N C Y Y DG + G LV + F + S PL GC
Sbjct: 136 KPRVPDYTLPTSCDQ-NRLCHYSYFYADGTYAEGNLVREKFTF---SSSQTTPPLILGCA 191
Query: 133 YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI-------GQNGRGVL 185
+ DT G+LG+ GR+S S + + +C+ G + G
Sbjct: 192 TDSS--------DTQGILGMNLGRLSFSSLAK-----ISKFSYCVPPRRSQSGSSPTGSF 238
Query: 186 FLGDGKVPSSGVAWTPML-----QNSADLK--HYILGPAELLYSGKSCGLKDLTL----- 233
+LG S+G + ++ Q +L Y L + +GK +
Sbjct: 239 YLGPNP-SSAGFKYVNLMTYRQSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPS 297
Query: 234 -----IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLA-PDDKTLPICWRGPFKALGQ 287
+ DSG + + Y ++ I++ L G LK +L +C+ G +G+
Sbjct: 298 GAGQTLIDSGTWFTFLVDEAYSKVKEEIVK-LAGPKLKKGYVYGGSLDMCFDGDAMVIGR 356
Query: 288 VTEYFKPLALSFTNRRNSVRLVVPPEAYLV 317
+ +A F N V +VV E L
Sbjct: 357 M---IGNMAFEF---ENGVEIVVEREKMLA 380
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 78/300 (26%), Positives = 124/300 (41%), Gaps = 53/300 (17%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTKPP---------EKQYKPHKN 65
+A ++++G PP+ DTGS L+WV C + C C+ P +
Sbjct: 91 YAFSVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAMAVFHPKNSSSSR 150
Query: 66 IVPCSNPRCAALHWPNPPRCKHPNDQCD------YEIEYGDGGSSIGALVTDLFPLRFSN 119
+V C NP C +H +P C + + Y + YG G +S G L++D LR S
Sbjct: 151 LVGCRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGSTS-GLLISDT--LRLSP 207
Query: 120 GSVFNVPLTF-----GCG-YNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVI 173
S + P F GC + H P +G+ G GRG S+ SQL+ ++
Sbjct: 208 SSSSSAPAPFRNFAIGCSIVSVHQP-------PSGLAGFGRGAPSVPSQLKVPKFSYCLL 260
Query: 174 GHCIGQNG--RGVLFLGDGKVPS----SGVAWTPMLQNSADLKHYILGPAELLYSGKSCG 227
N G L LGD VP+ + + + P+L N+A Y + L +G S G
Sbjct: 261 SRRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVY-YYLALTGISVG 319
Query: 228 LKDLTL-------------IFDSGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKTL 274
K + L I DSG ++ Y V++ + + + + G + P + L
Sbjct: 320 GKPVNLPSRAFVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVEDAL 379
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 53/154 (34%), Positives = 64/154 (41%), Gaps = 11/154 (7%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKP----PEKQYKPHKNIVPCS 70
S F VN +VG+PP DTGS L W+QC PC C+ P V CS
Sbjct: 94 SLFLVNFSVGQPPVPQLTIMDTGSSLLWIQCQ-PCKHCSSDHMIHPVFNPALSSTFVECS 152
Query: 71 NPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG-SVFNVPLTF 129
+ PN C N +C YE Y G S G L + NG +V P+ F
Sbjct: 153 CDDRFCRYAPN-GHCGSSN-KCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAF 210
Query: 130 GCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
GCGY G G+LGLG S+ QL
Sbjct: 211 GCGYEN---GEQLESHFTGILGLGAKPTSLAVQL 241
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 68/277 (24%), Positives = 110/277 (39%), Gaps = 50/277 (18%)
Query: 19 VNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCAALH 78
N+++G+PP DTGSD+ WV C PCT C + P K+
Sbjct: 103 ANISIGQPPIPQLVVMDTGSDILWVMC-TPCTNCDNDLGLLFDPSKSST----------- 150
Query: 79 WPNPPRCKHPND----QCD---YEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFG 130
P CK P D +CD + + Y D ++ G D ++ + + FG
Sbjct: 151 --FSPLCKTPCDFEGCRCDPIPFTVTYADNSTASGTFGRDTVVFETTDEGTSRISDVLFG 208
Query: 131 CGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGR-----GVL 185
CG HN G + P G+LGL G S+V++L + +CIG L
Sbjct: 209 CG---HNIGHDTDPGHNGILGLNNGPDSLVTKLGQK------FSYCIGNLADPYYNYHQL 259
Query: 186 FLGDGKVPS---------SGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFD 236
LG+G +G + M S K + P G +I D
Sbjct: 260 ILGEGADLEGYSTPFEVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAG----GVIID 315
Query: 237 SGASYAYFTSRVYQEIVSLIMRDLIGTPLKLAPDDKT 273
+G++ + V++ ++S +R+L+G + A +K+
Sbjct: 316 TGSTITFLVDSVHK-LLSKEVRNLLGWSFRQATIEKS 351
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 65/126 (51%), Gaps = 16/126 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSN 71
YF + + +G+P K F DTGSD+ W+QC PC C + + + P + + C
Sbjct: 160 YF-LRVGIGRPSKTFYMVIDTGSDVNWLQC-KPCDDCYQQVDPIFDPASSSSFSRLGCQT 217
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVPLTFG 130
P+C L + C+ ND C Y++ YGDG ++G T+ + F N GSV V + G
Sbjct: 218 PQCRNL---DVFACR--NDSCLYQVSYGDGSYTVGDFATE--TVSFGNSGSVDKVAI--G 268
Query: 131 CGYNQH 136
CG++
Sbjct: 269 CGHDNE 274
>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
Length = 434
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 89/351 (25%), Positives = 143/351 (40%), Gaps = 76/351 (21%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVP-------- 68
+ ++L +G PPK+ DTGSDLTWV C C Y+ +K +
Sbjct: 29 YLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDC--NDYRNNKLMSTYSPSYSSS 86
Query: 69 -----CSNPRCAALHWPNPP-----------------RCKHPNDQC-DYEIEYGDGGSSI 105
C +P C+ +H + C P C + YG GG I
Sbjct: 87 SLRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRP---CPSFAYTYGAGGVVI 143
Query: 106 GALVTDLFPLRFSNGS-VFNVP-LTFGC-GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQ 162
G L D S+ S VP FGC G P G+ G GRG +S+ SQ
Sbjct: 144 GTLTRDTLTTHGSSPSFTREVPNFCFGCVGSTYREP--------IGIAGFGRGVLSLPSQ 195
Query: 163 LREYGLIRNVIGHCI-------GQNGRGVLFLGDGKVPSSG-VAWTPMLQNSADLKHYIL 214
L G ++ HC N L +GD + S+ + +T +L+N +Y +
Sbjct: 196 L---GFLQKGFSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYI 252
Query: 215 GPAELLYSGKSCGLK------------DLTLIFDSGASYAYFTSRVYQEIVSLIMRDLIG 262
G E + G + ++ + +I DSG +Y + Y +++S+ ++ +I
Sbjct: 253 G-LEAITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSM-LQSIIT 310
Query: 263 TPLKLAPDDKT-LPICWRGPFKALGQVTEYFKPL-ALSFTNRRNSVRLVVP 311
P + +T +C+R P VT++ L ++SF + N+V LV+P
Sbjct: 311 YPRAQEQEARTGFDLCYRIPCPN-NVVTDHDHLLPSISF-HFSNNVSLVLP 359
>gi|326490597|dbj|BAJ89966.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 450
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/155 (31%), Positives = 66/155 (42%), Gaps = 14/155 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNIVPCSNPRCA- 75
V++ VG PP+ DTGS+L+ + C+ P + V CS+P C
Sbjct: 65 LTVSVVVGTPPQNVTMVLDTGSELSGLLCNGSSLSPPAPFNASASLTYSAVDCSSPACVW 124
Query: 76 -ALHWPNPPRCKH-PNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC-- 131
P P C P+ C I Y D S+ G LV D F L VP FGC
Sbjct: 125 RGRDLPVRPFCDAPPSTSCRVSISYADASSADGHLVADTFIL-----GTQAVPALFGCIT 179
Query: 132 GYNQH---NPGPLSPPDTA-GVLGLGRGRISIVSQ 162
Y+ N P + A G+LG+ RG +S V+Q
Sbjct: 180 SYSSSTAINSSATDPSEAATGLLGMNRGSLSFVTQ 214
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 64/269 (23%), Positives = 107/269 (39%), Gaps = 32/269 (11%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSNP 72
+ +N+++G PP DTGSDL W QC PC C + E + P ++ + C N
Sbjct: 94 YLMNISLGTPPVPMLGIADTGSDLIWRQC-LPCPNCYEQVEPLFDPKESETYKTLDCDNE 152
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVP-LTFGC 131
C L C N C Y YGD + G L +D + + G + P + FGC
Sbjct: 153 FCQDLGQQG--SCDDDN-TCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDPASFPGIAFGC 209
Query: 132 GYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI------GQNGRGVL 185
G++ N G + D + G ++ E G +C+ +
Sbjct: 210 GHD--NGGTFNEKDGGLIGLGGGPLSLVMQLSSEVG---GQFSYCLVPLSSDSTVSSKIN 264
Query: 186 FLGDGKVPSSGVAWTPMLQNSADLKHYI------LGPAELLYSGKS------CGLKDLTL 233
F G V SG TP+++ + D +Y+ +G + + G S +++ +
Sbjct: 265 FGKSGVVSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEEGNI 324
Query: 234 IFDSGASYAYFTSRVYQEIVSLIMRDLIG 262
I DSG + Y ++ S + + G
Sbjct: 325 IIDSGTTLTLLPQDFYTDVESALTNAIGG 353
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 56/211 (26%), Positives = 90/211 (42%), Gaps = 27/211 (12%)
Query: 15 SYFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHK----NIVPCS 70
S + + L G PP+ F DTGS++ W+ C+ PC+GC+ ++ ++P K N + C+
Sbjct: 122 SNYIIKLGFGTPPQSFYTVLDTGSNIAWIPCN-PCSGCSS-KQQPFEPSKSSTYNYLTCA 179
Query: 71 NPRCAALHWPNPPRCKHPND--QCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLT 128
+ +C L C ++ C YGD L ++ + GS
Sbjct: 180 SQQCQLLR-----VCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSV----GSQQVENFV 230
Query: 129 FGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCI----GQNGRGV 184
FGC N T ++G GR +S VSQ L + +C+ G
Sbjct: 231 FGC----SNAARGLIQRTPSLVGFGRNPLSFVSQTAT--LYDSTFSYCLPSLFSSAFTGS 284
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILG 215
L LG + + G+ +TP+L NS Y +G
Sbjct: 285 LLLGKEALSAQGLKFTPLLSNSRYPSFYYVG 315
>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
Length = 472
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 65/238 (27%), Positives = 99/238 (41%), Gaps = 26/238 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ---YKPHKNI----VPC 69
F + +++GKPP + DTGS L+WVQC C K + P ++ V C
Sbjct: 114 FLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRC 173
Query: 70 SNPRCAALHWP---NPPRCKHPNDQCDYEIEYGDGGS-SIGALVTDLFPLRFSNGSVFNV 125
S+ +C L + C D C Y + YG+G + S+G +VTD + G F +
Sbjct: 174 SSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRI----GDSF-M 228
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYG--LIRNVIGHCI--GQNG 181
L FGC + AG+ G G S QL Y L +C+ +
Sbjct: 229 DLMFGCSMDVKY-----SEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPTDETK 283
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
G + LG + +TP+ + S + Y L L+ +G+ +I DSGA
Sbjct: 284 PGYMILGRYDRAAMDGGYTPLFR-SINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 340
>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
Length = 472
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 65/238 (27%), Positives = 99/238 (41%), Gaps = 26/238 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ---YKPHKNI----VPC 69
F + +++GKPP + DTGS L+WVQC C K + P ++ V C
Sbjct: 114 FLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRC 173
Query: 70 SNPRCAALHWP---NPPRCKHPNDQCDYEIEYGDGGS-SIGALVTDLFPLRFSNGSVFNV 125
S+ +C L + C D C Y + YG+G + S+G +VTD + G F +
Sbjct: 174 SSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRI----GDSF-M 228
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYG--LIRNVIGHCI--GQNG 181
L FGC + AG+ G G S QL Y L +C+ +
Sbjct: 229 DLMFGCSMDVKY-----SEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPTDETK 283
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
G + LG + +TP+ + S + Y L L+ +G+ +I DSGA
Sbjct: 284 PGYMILGRYDRAAMDGGYTPLFR-SINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 340
>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
Length = 474
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 65/238 (27%), Positives = 99/238 (41%), Gaps = 26/238 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ---YKPHKNI----VPC 69
F + +++GKPP + DTGS L+WVQC C K + P ++ V C
Sbjct: 116 FLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRC 175
Query: 70 SNPRCAALHWP---NPPRCKHPNDQCDYEIEYGDGGS-SIGALVTDLFPLRFSNGSVFNV 125
S+ +C L + C D C Y + YG+G + S+G +VTD + G F +
Sbjct: 176 SSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRI----GDSF-M 230
Query: 126 PLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYG--LIRNVIGHCI--GQNG 181
L FGC + AG+ G G S QL Y L +C+ +
Sbjct: 231 DLMFGCSMDVKY-----SEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPTDETK 285
Query: 182 RGVLFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGKSCGLKDLTLIFDSGA 239
G + LG + +TP+ + S + Y L L+ +G+ +I DSGA
Sbjct: 286 PGYMILGRYDRAAMDGGYTPLFR-SINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 342
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 65/131 (49%), Gaps = 13/131 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKP-----HKNIVPCSN 71
+ +++ VG PP+ F DTGSDL W+QC APC C + + P ++N+ C +
Sbjct: 146 YLMDVYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASSSYRNLT-CGD 203
Query: 72 PRCAAL---HWPNPPRCKHP-NDQCDYEIEYGDGGSSIGALVTDLFPLRFSN-GSVFNVP 126
PRC + P P C+ P D C Y YGD +S G L + F + + G+ V
Sbjct: 204 PRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNLTAPGASSRVD 263
Query: 127 -LTFGCGYNQH 136
+ FGCG+
Sbjct: 264 GVVFGCGHRNR 274
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 62/125 (49%), Gaps = 15/125 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF V+L VG PP+ + DTGSD+ W+QC PC C + + P + + C +
Sbjct: 81 YF-VSLGVGTPPRTVNMVADTGSDVLWLQC-LPCQSCYGQTDPLFNPSFSSTFQSITCGS 138
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L R +QC Y++ YGDG ++G T+ L F + +V +V + GC
Sbjct: 139 SLCQQLLIRGCRR-----NQCLYQVSYGDGSFTVGEFSTE--TLSFGSNAVNSVAI--GC 189
Query: 132 GYNQH 136
G+N
Sbjct: 190 GHNNQ 194
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 70/282 (24%), Positives = 116/282 (41%), Gaps = 56/282 (19%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP-----CTGC-------TKPP--EKQYKP 62
++V ++G PP+ DTGS L W C P C C TK P +
Sbjct: 74 YSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSS 133
Query: 63 HKNIVPCSNPRC-----AALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRF 117
+PC +P+C + L+ RC + Y +EYG GS+ G LV+D+ L
Sbjct: 134 TVQSLPCRSPKCNWVFGSDLNCSTTKRCPY------YGLEYGL-GSTTGQLVSDVLGLSK 186
Query: 118 SNGSVFNVP-LTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHC 176
N +P FGC +S G+ G GRG SI +QL ++ H
Sbjct: 187 LN----RIPDFLFGCSL-------VSNRQPEGIAGFGRGLASIPAQLGLTKFSYCLVSHR 235
Query: 177 IG---QNGRGVLFLG--DGKVPSSGVAWTPMLQNSA---DLKHYILGPAELLYSGKSCGL 228
Q+G VL G ++GVA+ P ++ A ++Y + +++L GK +
Sbjct: 236 FDDTPQSGDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPI 295
Query: 229 ----------KDLTLIFDSGASYAYFTSRVYQEIVSLIMRDL 260
D +I DSG+++ + ++ + + + +
Sbjct: 296 PPRYLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHM 337
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 65/127 (51%), Gaps = 16/127 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG---CTKPPEKQYKPH----KNIVPC 69
+ + VG+P KLF DTGSD+TW+QC PC C K + + P + + C
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ-PCASENTCYKQFDPIFDPKSSSSYSPLSC 206
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
++ +C L N C +D C Y++ YGDG + G L T+ SN S+ N+P+
Sbjct: 207 NSQQCKLLDKAN---CN--SDTCIYQVHYGDGSFTTGELATETLSFGNSN-SIPNLPI-- 258
Query: 130 GCGYNQH 136
GCG++
Sbjct: 259 GCGHDNE 265
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 65/127 (51%), Gaps = 16/127 (12%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTG---CTKPPEKQYKPH----KNIVPC 69
+ + VG+P KLF DTGSD+TW+QC PC C K + + P + + C
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ-PCASENTCYKQFDPIFDPKSSSSYSPLSC 206
Query: 70 SNPRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTF 129
++ +C L N C +D C Y++ YGDG + G L T+ SN S+ N+P+
Sbjct: 207 NSQQCKLLDKAN---CN--SDTCIYQVHYGDGSFTTGELATETLSFGNSN-SIPNLPI-- 258
Query: 130 GCGYNQH 136
GCG++
Sbjct: 259 GCGHDNE 265
>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
Length = 491
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 72/279 (25%), Positives = 114/279 (40%), Gaps = 57/279 (20%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAP--CTGCTK----PPEKQYKPHKN----I 66
+A +++G PP+ DTGS L+WV C + C C+ P + P + +
Sbjct: 89 YAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAASPLHVFHPKNSSSSRL 148
Query: 67 VPCSNPRCAALHWPN----------------PPRCKHPNDQC-DYEIEYGDGGSSIGALV 109
+ C NP C +H P+ PR + N+ C Y + YG GS+ G L+
Sbjct: 149 IGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGS-GSTAGLLI 207
Query: 110 TDLFPLRFSNGSVFNVPLTFGCGYNQ-HNPGPLSPPDTAGVLGLGRGRISIVSQLR---- 164
+D LR +V N GC H P +G+ G GRG S+ SQL
Sbjct: 208 SDT--LRTPGRAVRN--FVIGCSLASVHQP-------PSGLAGFGRGAPSVPSQLGLTKF 256
Query: 165 EYGLIRNVIGHCIGQNGRGVLFLGDGKVPSSGVAWTPMLQNSADLK----HYILGPAELL 220
Y L+ +G +L GK G+ + P+ ++++ +Y L +
Sbjct: 257 SYCLLSRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAIT 316
Query: 221 YSGKSCGLKDLTL---------IFDSGASYAYFTSRVYQ 250
GKS L + I DSG +++YF V++
Sbjct: 317 VGGKSVQLPERAFVAGGAGGGAIVDSGTTFSYFDRTVFE 355
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 62/125 (49%), Gaps = 15/125 (12%)
Query: 16 YFAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKN----IVPCSN 71
YF V+L VG PP+ + DTGSD+ W+QC PC C + + P + + C +
Sbjct: 81 YF-VSLGVGTPPRTVNMVADTGSDVLWLQC-LPCQSCYGQTDPLFNPSFSSTFQSITCGS 138
Query: 72 PRCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGC 131
C L R +QC Y++ YGDG ++G T+ L F + +V +V + GC
Sbjct: 139 SLCQQLLIRGCRR-----NQCLYQVSYGDGSFTVGEFSTE--TLSFGSNAVNSVAI--GC 189
Query: 132 GYNQH 136
G+N
Sbjct: 190 GHNNQ 194
>gi|91806508|gb|ABE65981.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 203
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 53/108 (49%), Gaps = 7/108 (6%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNP 72
+ + +G PP+ D DTGSDL WV C++ C GC + P + + CS+
Sbjct: 78 YYTTVQIGTPPRELDVVIDTGSDLVWVSCNS-CVGCPLHNVTFFDPGASSSAVKLACSDK 136
Query: 73 RCAALHWPNPPRCKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNG 120
RC++ RC + C Y++EYGDG + G ++DL +G
Sbjct: 137 RCSS-DLQKKSRCSLL-ESCTYKVEYGDGSVTSGYYISDLISFDTMSG 182
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 66/156 (42%), Gaps = 17/156 (10%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGCTKPPEKQ---YKPHKNI----VPC 69
+ + +++G PP DTGS L+WVQC C K + P+ + V C
Sbjct: 6 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGC 65
Query: 70 SNPRCAALHWPNPPR--CKHPNDQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPL 127
S C +H C +D C Y + YG G S+G L D L SN S+ N
Sbjct: 66 STEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLA-SNRSIDN--F 122
Query: 128 TFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQL 163
FGCG + G AG++G G S +Q+
Sbjct: 123 IFGCGEDNLYNGV-----NAGIIGFGTKSYSFFNQV 153
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 53/107 (49%), Gaps = 11/107 (10%)
Query: 34 FDTGSDLTWVQCDAPCTGCTKPPEKQYKPHKNI----VPCSNPRCAALHWPNPPRCKHPN 89
DTGSD+TWVQC PC C + + + P + V C + RC L + C++
Sbjct: 3 LDTGSDVTWVQCQ-PCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDL---DTAACRNAT 58
Query: 90 DQCDYEIEYGDGGSSIGALVTDLFPLRFSNGSVFNVPLTFGCGYNQH 136
C YE+ YGDG ++G T+ L S V NV + GCG++
Sbjct: 59 GACLYEVAYGDGSYTVGDFATETLTLGDST-PVGNVAI--GCGHDNE 102
>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Brachypodium distachyon]
Length = 429
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 62/220 (28%), Positives = 91/220 (41%), Gaps = 20/220 (9%)
Query: 17 FAVNLTVGKPPKLFDFDFDTGSDLTWVQCDAPCTGC-TKPPE--KQYKPHKN----IVPC 69
F +++++G PP DTGS L+WV C C T PE + P K+ +V C
Sbjct: 75 FFMDISLGTPPVANLVTVDTGSTLSWVVCQRCQISCHTTAPEAGSVFDPDKSTTYELVGC 134
Query: 70 SNPRCAALHWP--NPPRCKHPNDQCDYEIEYGDGGS---SIGALVTDLFPLRFSNGSVFN 124
S+ CA + P C D C Y + YG G S S G L TD L S+ S+ +
Sbjct: 135 SSRDCADVQRSLVAPFGCIEETDTCLYSLRYGSGPSGQYSAGRLGTDKLTLA-SSSSIID 193
Query: 125 VPLTFGCGYNQHNPGPLSPPDTAGVLGLGRGRISIVSQLREYGLIRNVIGHCIGQNGRGV 184
FGC + G S GV+G G S +Q+ R +C +
Sbjct: 194 -GFIFGCSGDDSFKGYES-----GVIGFGGANFSFFNQVARQTNYR-AFSYCFPGDHTAE 246
Query: 185 LFLGDGKVPSSGVAWTPMLQNSADLKHYILGPAELLYSGK 224
FL G P + +T ++ + D Y L +++ G
Sbjct: 247 GFLSIGAYPKDELVYTNLIPHFGDRSVYSLQQIDMMVDGN 286
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.142 0.458
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,123,562,338
Number of Sequences: 23463169
Number of extensions: 288215770
Number of successful extensions: 479441
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 428
Number of HSP's successfully gapped in prelim test: 1302
Number of HSP's that attempted gapping in prelim test: 475762
Number of HSP's gapped (non-prelim): 1855
length of query: 335
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 192
effective length of database: 9,003,962,200
effective search space: 1728760742400
effective search space used: 1728760742400
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 77 (34.3 bits)