BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 008104
(577 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
Length = 583
Score = 796 bits (2055), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/593 (66%), Positives = 465/593 (78%), Gaps = 26/593 (4%)
Query: 1 MDSDESPSPPPQLTGVVIITLPPPNNPSLGKTITAYTLTDNS-----PQSQQTR------ 49
M+SD+ S VVII+LPPPNNPSLGKTITA+TLTD+ PQS Q
Sbjct: 1 MESDDQSSH----VKVVIISLPPPNNPSLGKTITAFTLTDDDHDATYPQSHQNHEQEPSI 56
Query: 50 ---HRQQQEHPLPPQLHPPQNSQFNFSLPMLFPGLPRKLFLFLAISIFALILYGSVFSYT 106
HR+ Q P L PPQN Q FS L+ PRKL L IS+FA+I+Y S+FS T
Sbjct: 57 IQTHRESQLPVQSPSL-PPQNPQIQFSFSGLYFSTPRKLLFLLCISLFAVIVYRSLFSNT 115
Query: 107 LQDRYKSNND-DENKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDG-I 164
L + S++D DE +SF+FPLYHKFGIRE+SQ + E K R V ES+VASVND +
Sbjct: 116 LLELKVSDDDNDEKTKSFIFPLYHKFGIREISQSNLEHKSIRSVY--KESLVASVNDDDV 173
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSD 224
I P++ N KL SSNA AVDSSS+FP+RGN+YPDGLYFTY++VGNPPRPYYLD+DT SD
Sbjct: 174 IVPNR---NYKLASSNAAAVDSSSVFPVRGNVYPDGLYFTYILVGNPPRPYYLDIDTASD 230
Query: 225 LTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY 284
LTWIQCDAPC+SCAKGAN LYKPR NI+ KDSLC+E+ RN K GYCETCQQCDYEIEY
Sbjct: 231 LTWIQCDAPCTSCAKGANALYKPRRDNIVTPKDSLCVELHRNQKAGYCETCQQCDYEIEY 290
Query: 285 ADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVS 344
ADHSSSMGVLARDELHLT+ NGS T FGCAYDQQGLLLNTLVKTDGILGLS+AKVS
Sbjct: 291 ADHSSSMGVLARDELHLTMANGSSTNLKFNFGCAYDQQGLLLNTLVKTDGILGLSKAKVS 350
Query: 345 LPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEIL 404
LPSQLA++GII NVVGHCL + GGGYMFLG D VP WGM+WVPMLDSP ++ Y T+I+
Sbjct: 351 LPSQLANRGIINNVVGHCLANDVVGGGYMFLGDDFVPRWGMSWVPMLDSPSIDSYQTQIM 410
Query: 405 KINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT 464
K+NYGS PL+LG + +V +FD+GSSYTYFTK+AYSEL+ASLK+VS + L+ D SDPT
Sbjct: 411 KLNYGSGPLSLGGQERRVRRIVFDSGSSYTYFTKEAYSELVASLKQVSGEALIQDTSDPT 470
Query: 465 LPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGIL 524
LP CWRAKFPIRS++DVKQ+FKTLTL FGSKW I+STKF I PEGYL+IS KGN+CLGIL
Sbjct: 471 LPFCWRAKFPIRSVIDVKQYFKTLTLQFGSKWWIISTKFRIPPEGYLIISNKGNVCLGIL 530
Query: 525 DGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPFLEG 577
DGS+VH+GS+IILGDISLRGQL++YDNVN +IGW +S C+ P F +LPF +G
Sbjct: 531 DGSDVHDGSSIILGDISLRGQLIIYDNVNNKIGWTQSDCIKPKTFSTLPFFQG 583
>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
Length = 603
Score = 750 bits (1937), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/614 (63%), Positives = 452/614 (73%), Gaps = 54/614 (8%)
Query: 1 MDSDESPSPPPQLTGVVIITLPPPNNPSLGKTITAYTLTDNS-PQSQQT--RHRQQQEHP 57
M+SD+ SP QL GVVII+LPPP+NPSLGKTITA+TLT+N PQS QT H++ Q
Sbjct: 1 MESDDDQSP--QLKGVVIISLPPPDNPSLGKTITAFTLTNNDYPQSHQTPQTHQEDQLPI 58
Query: 58 LPPQLHPPQNSQFNFSLPMLFPGLPRKLFLFLAISIFALILYGSVFSYTLQDRYKSNNDD 117
P P QNSQ F LF G PRKL F+ IS+FAL +Y S+F+ T Q+ +NNDD
Sbjct: 59 SSPPPPPSQNSQLQFPSSRLFLGTPRKLLSFVFISLFALAIYSSLFTNTFQELKSNNNDD 118
Query: 118 ENKE--SFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKK 175
++++ S+VFPLYHK GIRE+ D E L RFV E++VASV D + PHK I+K
Sbjct: 119 DDQKPKSYVFPLYHKLGIREIPLNDLENHLRRFVY--KENLVASV-DHLNGPHK--ISKL 173
Query: 176 LVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS 235
S+ A A+DSS+IFP+RGN+YPDG PP+PYYLD DTGSDLTWIQCDAPC+
Sbjct: 174 ASSNAAAAMDSSAIFPVRGNLYPDG----------PPQPYYLDFDTGSDLTWIQCDAPCT 223
Query: 236 SCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLA 295
SCAKGAN YKPR GNI+P KD LCME+QRN K GYCETC QCDYEIEYADHSSSMGVLA
Sbjct: 224 SCAKGANAWYKPRRGNIVPPKDLLCMEVQRNQKAGYCETCDQCDYEIEYADHSSSMGVLA 283
Query: 296 RDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII 355
D+L L + NGSLTK N +FGCAYDQQGLLL TLVKTDGILGLSRAKVSLPSQLASQGII
Sbjct: 284 TDKLLLMVANGSLTKLNFIFGCAYDQQGLLLKTLVKTDGILGLSRAKVSLPSQLASQGII 343
Query: 356 KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL 415
NV+GHCLTT+ GGGGYMFLG D VP WGMAWVPMLDSP ME YHTE++K+NYGSSPL+L
Sbjct: 344 NNVIGHCLTTDLGGGGYMFLGDDFVPRWGMAWVPMLDSPSMEFYHTEVVKLNYGSSPLSL 403
Query: 416 GARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPI 475
G S+V LFD+GSSYTYF K+AYSEL+ASL EVS GLV SD TLP+CWRA FPI
Sbjct: 404 GGMESRVKHILFDSGSSYTYFPKEAYSELVASLNEVSGAGLVQSTSDTTLPLCWRANFPI 463
Query: 476 RSIV--------------------------------DVKQFFKTLTLHFGSKWQIVSTKF 503
R + DVK+FFKTLT FG+KW ++STKF
Sbjct: 464 RKFIYRTELTRPIRRRRRRRRRRRRRRRRRRQHIKGDVKKFFKTLTFQFGTKWLVISTKF 523
Query: 504 HISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I PEGYL++S KGN+CLGIL+GS+VH+GSTIILGDISLRGQLVVYDNVNK+IGW S C
Sbjct: 524 RIPPEGYLMMSDKGNVCLGILEGSKVHDGSTIILGDISLRGQLVVYDNVNKKIGWTPSDC 583
Query: 564 MNPGRFKSLPFLEG 577
P R SL F +G
Sbjct: 584 AKPKRSDSLQFFDG 597
>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 686
Score = 702 bits (1812), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/582 (62%), Positives = 438/582 (75%), Gaps = 36/582 (6%)
Query: 11 PQLTGVVIITLPPPNNPSLGKTITAYTLTD---NSPQSQQTRHRQQQE------------ 55
PQL GVVIITLPPP+NPSLGKTITA+TL+D + P + ++QQ
Sbjct: 124 PQLKGVVIITLPPPDNPSLGKTITAFTLSDPPLDRPHHTHQQLQRQQHQEEEEEEEEEEE 183
Query: 56 --HPLPPQLHPPQNSQFNFSLPMLFPGLPRKLFLFLAISIFALILYGSVFSYTLQDRYKS 113
H LP P N FS+ L G PR L FL +S+F +L+ S L + +
Sbjct: 184 EPHQLPSP--SPPNPALQFSVRKLSLGNPRILMGFLGVSLFVFLLWNFASSSPLVE-LRR 240
Query: 114 NNDDENKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKIN 173
NDD SF+ PLY K G R + D E KLG+FVD VND ++P IN
Sbjct: 241 KNDDREPTSFILPLYPKLGSRSLG--DLELKLGKFVDF-------HVND--MKP--GGIN 287
Query: 174 KKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP 233
K ++++ A DSS+IFP+RG++YP+GLYFT++ VG+PPR Y+LDMDTGSDLTWIQCDAP
Sbjct: 288 K--LATSVSAFDSSTIFPVRGDVYPNGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAP 345
Query: 234 CSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGV 293
C+SCAKG NPLYKP+ GN++P KDSLC+E+QRN K GYCETC+QCDYEIEYADHSSSMGV
Sbjct: 346 CTSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGV 405
Query: 294 LARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQG 353
LA D+LHL + NGSLTK ++FGCAYDQQGLLLN+L KTDGILGLS+AKVSLPSQLASQ
Sbjct: 406 LASDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQR 465
Query: 354 IIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPL 413
II NV+GHCLT++A GGGYMFLG D VP WGMAWVPML+S YH++I+KI++GS L
Sbjct: 466 IINNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPMLNS-HSPNYHSQIMKISHGSRQL 524
Query: 414 NLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKF 473
+LG ++ + +FDTGSSYTYF K+AY L+ASLK+VS +GL+ D SDPTLPVCWRAKF
Sbjct: 525 SLGRQDGRTERVVFDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVCWRAKF 584
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGS 533
PIRS++DVKQFF+ LTL F SKW IVSTKF I PEGYL+IS KGN+CLGILDGS VH+GS
Sbjct: 585 PIRSVIDVKQFFQPLTLQFRSKWWIVSTKFRIPPEGYLIISNKGNVCLGILDGSNVHDGS 644
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPFL 575
TIILGDISLRG+LVVYDNVN++IGWA+S C+ P + KSLPF
Sbjct: 645 TIILGDISLRGKLVVYDNVNQKIGWAQSTCVKPQKIKSLPFF 686
>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 570
Score = 660 bits (1702), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/587 (56%), Positives = 424/587 (72%), Gaps = 29/587 (4%)
Query: 1 MDSDESPSPPPQLTGVVIITLPPPNNPSLGKTITAYTLTDNSPQ--------SQQTRHRQ 52
MDSD ++ GVV+ITLPPP+NPSLGK++TA+TLTD+ P+ Q+ +
Sbjct: 1 MDSD-------KIKGVVVITLPPPDNPSLGKSVTAFTLTDDFPEPPGESVAVDQEVQQPN 53
Query: 53 QQEHPLPPQLHPPQNSQFNFSLPM---LFPGLPRKLFLFLAISIFALILYGSVFSYTLQD 109
LPP L P Q S+P+ LF G PRKL L I++ A+ LY S F T+++
Sbjct: 54 NDHLTLPPNL-PIQAPLSQRSIPLSRELFAGTPRKLVFVLGIALAAVYLYASNFPETIRE 112
Query: 110 -RYKSNNDDENKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPH 168
R NDD+ SF+FPLY + + + S D + KLGR V ++ + + ND + P
Sbjct: 113 LRRSERNDDDRPSSFLFPLYFQSELGDSS--DFQLKLGRTVRVNKDDLGVRFNDVLGVPK 170
Query: 169 KSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWI 228
SK+ S ++ DSS++FP+RG+IYPDGLY+TY++VG PPRPY+LD+DTGSDLTW+
Sbjct: 171 PSKL-----ISASLKSDSSAVFPVRGDIYPDGLYYTYIMVGEPPRPYFLDIDTGSDLTWV 225
Query: 229 QCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHS 288
QCDAPCSSC KG +PLYKPR N++ +KDSLCME+QRN+ C CQQC+YE++YAD S
Sbjct: 226 QCDAPCSSCGKGRSPLYKPRRENVVSFKDSLCMEVQRNYDGDQCAACQQCNYEVQYADQS 285
Query: 289 SSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQ 348
SS+GVL +DE L NGSLTK N +FGCAYDQQGLLLNTL KTDGILGLSRAKVSLPSQ
Sbjct: 286 SSLGVLVKDEFTLRFSNGSLTKLNAIFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQ 345
Query: 349 LASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINY 408
LAS+GII NVVGHCLT + GGGY+FLG D VP WGMAWV MLDSP ++ Y T++++I+Y
Sbjct: 346 LASRGIINNVVGHCLTGDPAGGGYLFLGDDFVPQWGMAWVAMLDSPSIDFYQTKVVRIDY 405
Query: 409 GSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVC 468
GS PL+L S +FD+GSSYTYFTK+AY +L+A+L+EVS+ GL+L S T +C
Sbjct: 406 GSIPLSLDTWGSSREQVVFDSGSSYTYFTKEAYYQLVANLEEVSAFGLILQDSSDT--IC 463
Query: 469 WRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSE 528
W+ + IRS+ DVK FFK LTL FGS++ +VSTK I PE YL+I+K+GN+CLGILDGS+
Sbjct: 464 WKTEQSIRSVKDVKHFFKPLTLQFGSRFWLVSTKLVILPENYLLINKEGNVCLGILDGSQ 523
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPFL 575
VH+GSTIILGD +LRG+LVVYDNVN+RIGW S C NP + K LP
Sbjct: 524 VHDGSTIILGDNALRGKLVVYDNVNQRIGWTSSDCHNPRKIKHLPLF 570
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 652 bits (1683), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/488 (65%), Positives = 389/488 (79%), Gaps = 17/488 (3%)
Query: 88 FLAISIFALILYGSVFSYTLQDRYKSNNDDENKESFVFPLYHKFGIREVSQRDAEFKLGR 147
FL +S+F +L+ S L + + NDD SF+ PLY K G R + D E KLG+
Sbjct: 3 FLGVSLFVFLLWNFASSSPLVE-LRRKNDDREPTSFILPLYPKLGSRSLG--DLELKLGK 59
Query: 148 FVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMI 207
FVD VND ++P INK ++++ A DSS+IFP+RG++YP+GLYFT++
Sbjct: 60 FVDF-------HVND--MKP--GGINK--LATSVSAFDSSTIFPVRGDVYPNGLYFTHIF 106
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNH 267
VG+PPR Y+LDMDTGSDLTWIQCDAPC+SCAKG NPLYKP+ GN++P KDSLC+E+QRN
Sbjct: 107 VGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNL 166
Query: 268 KPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLN 327
K GYCETC+QCDYEIEYADHSSSMGVLA D+LHL + NGSLTK ++FGCAYDQQGLLLN
Sbjct: 167 KTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKLGIMFGCAYDQQGLLLN 226
Query: 328 TLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAW 387
+L KTDGILGLS+AKVSLPSQLASQ II NV+GHCLT++A GGGYMFLG D VP WGMAW
Sbjct: 227 SLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFLGDDFVPYWGMAW 286
Query: 388 VPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIAS 447
VPML+S YH++I+KI++GS L+LG ++ + +FDTGSSYTYF K+AY L+AS
Sbjct: 287 VPMLNS-HSPNYHSQIMKISHGSRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALVAS 345
Query: 448 LKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISP 507
LK+VS +GL+ D SDPTLPVCWRAKFPIRS++DVKQFF+ LTL F SKW IVSTKF I P
Sbjct: 346 LKDVSDEGLIQDGSDPTLPVCWRAKFPIRSVIDVKQFFQPLTLQFRSKWWIVSTKFRIPP 405
Query: 508 EGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPG 567
EGYL+IS KGN+CLGILDGS VH+GSTIILGDISLRG+LVVYDNVN++IGWA+S C+ P
Sbjct: 406 EGYLIISNKGNVCLGILDGSNVHDGSTIILGDISLRGKLVVYDNVNQKIGWAQSTCVKPQ 465
Query: 568 RFKSLPFL 575
+ KSLPF
Sbjct: 466 KIKSLPFF 473
>gi|356522749|ref|XP_003530008.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1336
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/580 (57%), Positives = 409/580 (70%), Gaps = 36/580 (6%)
Query: 1 MDSDESPSPPPQLTGVVIITLPPPNNPSLGKTITAYTLTDNS---PQ--SQQTRHRQQQE 55
M+ D+S Q+ GVVII+LPPP+NPSLGKTITA+ ++N PQ Q +H+ QQ
Sbjct: 1 MEDDQST----QIKGVVIISLPPPDNPSLGKTITAFAFSNNPSPPPQLFIQPHQHQSQQT 56
Query: 56 HPL------PPQLHPPQNSQFNFSLPMLFPGLPRKLFLFLAISIFALILYGSVFSYTLQD 109
HP PP P N Q +FS LF P KLF F +FAL LYGSV S T D
Sbjct: 57 HPNAQHNTDPPLQSYPSNPQLSFSFRRLFHSTPVKLFSFFGTLLFALFLYGSVSSTTTVD 116
Query: 110 RY--KSNNDDENKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRP 167
K++ DD+ SF+FPL+ KFG+ + Q+D + +LG+ V + V DG
Sbjct: 117 LRGRKNDGDDDKATSFLFPLFPKFGV--LGQKDLKLQLGKLVQKEKFLTQRDVGDG---- 170
Query: 168 HKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTW 227
S VAVDSSS+FP+ GN+YPDGLYFT + VGNPP+ Y+LD+DTGSDLTW
Sbjct: 171 -----------SGVVAVDSSSVFPVSGNVYPDGLYFTILRVGNPPKSYFLDVDTGSDLTW 219
Query: 228 IQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYC-ETCQQCDYEIEYAD 286
+QCDAPC SC KGA+ YKP N++ DSLC+++Q+N K G+ E+ QCDYEI+YAD
Sbjct: 220 MQCDAPCRSCGKGAHVQYKPTRSNVVSSVDSLCLDVQKNQKNGHHDESLLQCDYEIQYAD 279
Query: 287 HSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLP 346
HSSS+GVL RDELHL NGS TK NVVFGC YDQ+GL+LNTL KTDGI+GLSRAKVSLP
Sbjct: 280 HSSSLGVLVRDELHLVTTNGSKTKLNVVFGCGYDQEGLILNTLAKTDGIMGLSRAKVSLP 339
Query: 347 SQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKI 406
QLAS+G+IKNVVGHCL+ + GGGYMFLG D VP WGM WVPM + +LY TEIL I
Sbjct: 340 YQLASKGLIKNVVGHCLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGI 399
Query: 407 NYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP 466
NYG+ L + S+VG FD+GSSYTYF K+AY +L+ASL EVS GLV D SD TLP
Sbjct: 400 NYGNRQLKFDGQ-SKVGKVFFDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLP 458
Query: 467 VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDG 526
+CW+A F IRSI DVK +FKTLTL FGSKW I+ST F I PEGYL+IS KG++CLGILDG
Sbjct: 459 ICWQANFQIRSIKDVKDYFKTLTLRFGSKWWILSTLFQIPPEGYLIISNKGHVCLGILDG 518
Query: 527 SEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNP 566
S+V++GS+IILGDISLRG VVYDNV ++IGW ++ C P
Sbjct: 519 SKVNDGSSIILGDISLRGYSVVYDNVKQKIGWKRADCGMP 558
>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1388
Score = 648 bits (1672), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/577 (58%), Positives = 415/577 (71%), Gaps = 34/577 (5%)
Query: 1 MDSDESPSPPPQLTGVVIITLPPPNNPSLGKTITAYTLTDNSPQS--QQTRHRQQQEHPL 58
M+ DESP Q+ GVVII+LPPP+NPSLGKTITA+T ++ SPQ Q +H+ Q HP
Sbjct: 1 MEDDESP----QIKGVVIISLPPPDNPSLGKTITAFTFSNPSPQPSIQPHQHQSQPTHPN 56
Query: 59 ------PPQLHPPQNSQFNFSLPMLFPGLPRKLFLFLAISIFALILYGSVFSYT---LQD 109
PP P N Q +FS LF P KLF F I +FAL LYGSV S T L+
Sbjct: 57 AQHNTDPPLQSYPSNPQLSFSFRRLFHSTPVKLFSFFGILLFALFLYGSVSSTTTVELRG 116
Query: 110 RYKSNNDDENKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHK 169
R ++DD+ SF+FPL+ KFG+ + Q+D + +LG+ E + +DG
Sbjct: 117 RNNDDDDDDKATSFLFPLFPKFGV--LGQKDLKLQLGKLSQ--KEKFLTHRDDGD----- 167
Query: 170 SKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQ 229
S VAVDSSS+FP+ GN+YPDGLYFT + VGNPP+ Y+LD+DTGSDLTW+Q
Sbjct: 168 --------GSGVVAVDSSSVFPVSGNVYPDGLYFTILRVGNPPKSYFLDVDTGSDLTWMQ 219
Query: 230 CDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYC-ETCQQCDYEIEYADHS 288
CDAPC SC KGA+ LYKP N++ D+LC+++Q+N K G+ E+ QCDYEI+YADHS
Sbjct: 220 CDAPCISCGKGAHVLYKPTRSNVVSSVDALCLDVQKNQKNGHHDESLLQCDYEIQYADHS 279
Query: 289 SSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQ 348
SS+GVL RDELHL NGS TK NVVFGC YDQ GLLLNTL KTDGI+GLSRAKVSLP Q
Sbjct: 280 SSLGVLVRDELHLVTTNGSKTKLNVVFGCGYDQAGLLLNTLGKTDGIMGLSRAKVSLPYQ 339
Query: 349 LASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINY 408
LAS+G+IKNVVGHCL+ + GGGYMFLG D VP WGM WVPM + +LY TEIL INY
Sbjct: 340 LASKGLIKNVVGHCLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINY 399
Query: 409 GSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVC 468
G+ L + S+VG +FD+GSSYTYF K+AY +L+ASL EVS GLV D SD TLP+C
Sbjct: 400 GNRQLRFDGQ-SKVGKMVFDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPIC 458
Query: 469 WRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSE 528
W+A FPI+S+ DVK +FKTLTL FGSKW I+ST F ISPEGYL+IS KG++CLGILDGS
Sbjct: 459 WQANFPIKSVKDVKDYFKTLTLRFGSKWWILSTLFQISPEGYLIISNKGHVCLGILDGSN 518
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
V++GS+IILGDISLRG VVYDNV ++IGW ++ C++
Sbjct: 519 VNDGSSIILGDISLRGYSVVYDNVKQKIGWKRADCVD 555
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 612 bits (1578), Expect = e-172, Method: Compositional matrix adjust.
Identities = 313/577 (54%), Positives = 401/577 (69%), Gaps = 19/577 (3%)
Query: 12 QLTGVVIITLPPPNNPSLGKTITAYTLTDNSPQSQQTRHRQQQEHPLPPQLHPPQNSQFN 71
+L VVIITLPP ++PS GKTI+A+TL D+ Q P LH Q S+
Sbjct: 10 RLHSVVIITLPPSDDPSQGKTISAFTLNDHDYPLQIPPEDNPNPSFQPDPLHQNQQSRLL 69
Query: 72 FSLPMLFPGLPRKLFLFLAISIFALILYGSVFSYTLQ----DRYKSNNDDENKE--SFVF 125
FS L G PR + L S+ A+ Y SVF ++Q ++ +DD ++E SFVF
Sbjct: 70 FS--DLSMGSPRLVLGLLGFSLLAVAFYASVFPNSVQMFRVSDERNRDDDSSRETTSFVF 127
Query: 126 PLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVD 185
P+YHK RE +R LG L+ V S++ ++ P K+N L +S
Sbjct: 128 PVYHKLRAREFHERILAEDLG----LENGKFVESMDLELVNP--VKVNDVLSTSAGSIDS 181
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPP--RPYYLDMDTGSDLTWIQCDAPCSSCAKGANP 243
S++IFP+ GN+YPDGLY+T ++VG P + Y+LD+DTGSDLTWIQCDAPC+SCAKGAN
Sbjct: 182 STTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKGANQ 241
Query: 244 LYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTI 303
LYKPR N++ + C+E+QRN +CE+C QCDYEIEYADHS SMGVL +D+ HL +
Sbjct: 242 LYKPRKDNLVRSSEPFCVEVQRNQLTEHCESCHQCDYEIEYADHSYSMGVLTKDKFHLKL 301
Query: 304 ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
NGSL + ++VFGC YDQQGLLLNTL+KTDGILGLSRAK+SLPSQLAS+GII NVVGHCL
Sbjct: 302 HNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCL 361
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG 423
++ G GY+F+G DLVPS GM WVPML P +E+Y ++ K++YG++ L+L N +VG
Sbjct: 362 ASDLNGEGYIFMGSDLVPSHGMTWVPMLHHPHLEVYQMQVTKMSYGNAMLSLDGENGRVG 421
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKF--PIRSIVDV 481
LFDTGSSYTYF QAYS+L+ SL+EVS L D SD LP+CWRAK PI S+ DV
Sbjct: 422 KVLFDTGSSYTYFPNQAYSQLVTSLQEVSDLELTRDDSDEALPICWRAKTNSPISSLSDV 481
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
K+FF+ +TL GSKW I+S K I PE YL+IS KGN+CLGILDGS VH+GSTII+GDIS
Sbjct: 482 KKFFRPITLQIGSKWLIISKKLLIQPEDYLIISNKGNVCLGILDGSNVHDGSTIIIGDIS 541
Query: 542 LRGQLVVYDNVNKRIGWAKSHCMNPGRFK-SLPFLEG 577
+RG+L+VYDNV +RIGW KS C+ P F ++PF +G
Sbjct: 542 MRGRLIVYDNVKQRIGWMKSDCVRPSEFDHNVPFFQG 578
>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
Length = 583
Score = 600 bits (1547), Expect = e-169, Method: Compositional matrix adjust.
Identities = 309/579 (53%), Positives = 395/579 (68%), Gaps = 21/579 (3%)
Query: 12 QLTGVVIITLPPPNNPSLGKTITAYTLTDNSPQSQQTRHRQQQEHPLPPQLHPPQNSQFN 71
++ VVIITLPP ++PS GKTI+A+TLTD+ + P LH Q S+
Sbjct: 13 RVHSVVIITLPPSDDPSQGKTISAFTLTDHDYPLEIPPEDNPNPSFQPDPLHRNQQSRLL 72
Query: 72 FSLPMLFPGLPRKLFLFLAISIFALILYGSVFSYTLQ--------DRYKSNNDDENKESF 123
FS L PR + L IS+ A+ Y SVF ++Q +++ SF
Sbjct: 73 FS--DLSMNSPRLVLGLLGISLLAVAFYASVFPNSVQMFRVSPDERNRDDDDNLRETASF 130
Query: 124 VFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVA 183
VFP+YHK RE +R E LG L+ E+ V S++ ++ P K+N L +S
Sbjct: 131 VFPVYHKLRAREFHERILEEDLG----LENENFVESMDLELVNP--VKVNDVLSTSAGSI 184
Query: 184 VDSSSIFPLRGNIYPDGLYFTYMIVGNPP--RPYYLDMDTGSDLTWIQCDAPCSSCAKGA 241
S++IFP+ GN+YPDGLY+T ++VG P + Y+LD+DTGS+LTWIQCDAPC+SCAKGA
Sbjct: 185 DSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGA 244
Query: 242 NPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL 301
N LYKPR N++ ++ C+E+QRN +CE C QCDYEIEYADHS SMGVL +D+ HL
Sbjct: 245 NQLYKPRKDNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHL 304
Query: 302 TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
+ NGSL + ++VFGC YDQQGLLLNTL+KTDGILGLSRAK+SLPSQLAS+GII NVVGH
Sbjct: 305 KLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGH 364
Query: 362 CLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQ 421
CL ++ G GY+F+G DLVPS GM WVPML ++ Y ++ K++YG L+L N +
Sbjct: 365 CLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGR 424
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK--FPIRSIV 479
VG LFDTGSSYTYF QAYS+L+ SL+EVS L D SD TLP+CWRAK FP S+
Sbjct: 425 VGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICWRAKTNFPFSSLS 484
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
DVK+FF+ +TL GSKW I+S K I PE YL+IS KGN+CLGILDGS VH+GSTIILGD
Sbjct: 485 DVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGD 544
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHCMNPGRFK-SLPFLEG 577
IS+RG L+VYDNV +RIGW KS C+ P ++PF +G
Sbjct: 545 ISMRGHLIVYDNVKRRIGWMKSDCVRPREIDHNVPFFQG 583
>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
Length = 557
Score = 523 bits (1346), Expect = e-145, Method: Compositional matrix adjust.
Identities = 274/583 (46%), Positives = 359/583 (61%), Gaps = 62/583 (10%)
Query: 13 LTGVVIITLPPPNNPSLGKTITAYTLTDNS-------------PQSQQTRHRQQQEHPLP 59
L GVVIITLPP + PS GKTITA+T TD++ P + Q R R
Sbjct: 16 LHGVVIITLPPSDQPSKGKTITAFTYTDDAPPPPRPPEPVMGYPAATQVRRR-------- 67
Query: 60 PQLHPPQNSQFNFSLPMLFPGLPRKLFLFLAISIFALILYGSVFS-----YTLQDRYKSN 114
P R L + A+ Y +S + ++ ++
Sbjct: 68 ---------------PRRVLSTRRVAAAALVLGALAVAAYYCFYSDVAVQFLGMEQEEAQ 112
Query: 115 NDDENKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVAS--VNDGIIRPHKSKI 172
D SF+ PL+ K + GR + G+ +A+ ++DG K++
Sbjct: 113 KDRNETRSFLLPLHPKA------------RQGRALREFGDVKLAARRIDDGW---RKARN 157
Query: 173 NKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDA 232
++ + A +S+++ P++GN++PDG Y+T + VGNPPRPY+LD+DTGSDLTWIQCDA
Sbjct: 158 KMEVAKAAAAGTNSTALLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDA 217
Query: 233 PCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMG 292
PC++CAKG +PLYKP I+P +D LC E+Q N YCETC+QCDYEIEYAD SSSMG
Sbjct: 218 PCTNCAKGPHPLYKPTKEKIVPPRDLLCQELQGNQN--YCETCKQCDYEIEYADQSSSMG 275
Query: 293 VLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQ 352
VLARD++HL NG K + VFGCAYDQQG LL++ KTDGILGLS A +SLPSQLAS
Sbjct: 276 VLARDDMHLIATNGGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASH 335
Query: 353 GIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSP 412
GII N+ GHC+T GGGGYMFLG D VP WG+ W + P LYHTE + YG
Sbjct: 336 GIISNIFGHCITREQGGGGYMFLGDDYVPRWGITWTSIRSGP-DNLYHTEAHHVKYGDQQ 394
Query: 413 LNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
L + + +FD+GSSYTY + Y L+A++K +S G V D+SD TLP+CW+A
Sbjct: 395 LRMREQAGNTVQVIFDSGSSYTYLPDEIYENLVAAIK-YASPGFVQDSSDRTLPLCWKAD 453
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNG 532
FP+R + DVKQFFK L LHFG KW +S F ISPE YL+IS KGN+CLG+L+G+E+++G
Sbjct: 454 FPVRYLEDVKQFFKPLNLHFGKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHG 513
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPFL 575
STII+GD+SLRG+LVVYDN ++IGW S C P K PF
Sbjct: 514 STIIVGDVSLRGKLVVYDNQRRQIGWTNSDCTKPQSQKGFPFF 556
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 522 bits (1345), Expect = e-145, Method: Compositional matrix adjust.
Identities = 271/570 (47%), Positives = 354/570 (62%), Gaps = 34/570 (5%)
Query: 11 PQLTGVVIITLPPPNNPSLGKTITAYTLTDNSPQSQQTRHRQQQEHPLPPQLHPPQNSQF 70
PQL GVVIITLPP + PS GKT+TA+ T N P ++ +P +
Sbjct: 16 PQLHGVVIITLPPADQPSKGKTVTAFAYT-NDPPPPRSPPDPVMGYPAATEAR------- 67
Query: 71 NFSLPMLFPGLPRKLFLFLAISIFALILYGSVFSYTLQDRYKSNNDDENK---ESFVFPL 127
P R L + A+ Y +S ++E + SF+ PL
Sbjct: 68 --RRPRRALSTRRVATAALVLGALAVAAYYCFYSDVAVQFLGMEQEEEQRNETRSFLLPL 125
Query: 128 YHKFGIREVSQRDAEFKLGRFVDLDGESVVAS--VNDGIIRPHKSKINKKLVSSNAVAVD 185
Y K + GR + G+ +A+ V+DG K++ ++ + +
Sbjct: 126 YPKA------------RQGRALREFGDVKLAARRVDDG---GRKARNRMEVAKAATARTN 170
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
S+++ P++GN++PDG Y+T + +GNPPRPY+LD+DTGSDLTWIQCDAPC++CAKG +PLY
Sbjct: 171 STALLPIKGNVFPDGQYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLY 230
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN 305
KP I+P +D LC E+Q N YCETC+QCDYEIEYAD SSSMGVLARD++H+ N
Sbjct: 231 KPAKEKIVPPRDLLCQELQGNQN--YCETCKQCDYEIEYADQSSSMGVLARDDMHMIATN 288
Query: 306 GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
G K + VFGCAYDQQG LL++ KTDGILGLS A +S PSQLAS GII NV GHC+T
Sbjct: 289 GGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITR 348
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
GGGGYMFLG D VP WG+ W + P LYHT+ + YG L +
Sbjct: 349 EQGGGGYMFLGDDYVPRWGVTWTSIRSGP-DNLYHTQAHHVKYGDQQLRRPEQAGSTVQV 407
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
+FD+GSSYTY + Y L+A++K +S G V D SD TLP+CW+A FP+R + DVKQFF
Sbjct: 408 IFDSGSSYTYLPNEIYENLVAAIK-YASPGFVQDTSDRTLPLCWKADFPVRYLEDVKQFF 466
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQ 545
+ L LHFG KW +S F ISPE YL+IS KGN+CLG+L+G+E+++GSTII+GD+SLRG+
Sbjct: 467 EPLNLHFGKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGK 526
Query: 546 LVVYDNVNKRIGWAKSHCMNPGRFKSLPFL 575
LVVYDN K+IGWA S C P K PF
Sbjct: 527 LVVYDNQRKQIGWADSDCTKPQSQKGFPFF 556
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 518 bits (1334), Expect = e-144, Method: Compositional matrix adjust.
Identities = 283/578 (48%), Positives = 361/578 (62%), Gaps = 57/578 (9%)
Query: 12 QLTGVVIITLPPPNNPSLGKTITAYTLTDNSPQSQQTRHRQQQEHPLPPQLH--PPQNSQ 69
QL GVVIITLPPP+ PS GKTITA+T TD P PP H PP +
Sbjct: 16 QLHGVVIITLPPPDQPSKGKTITAFTYTDEPGAGA----------PSPPHPHRGPPMAAA 65
Query: 70 FNFSLPMLFPGLPRKLFLFLAISIFALIL-YGSVFS-----YTLQDRYKSNNDDENKESF 123
+ G PR+ + + Y S +S + + ++ + +SF
Sbjct: 66 GREARRSRRAGSPRRAAAMVLALGALALAAYYSFYSDVAVQFLGMEEEEAQRERNETKSF 125
Query: 124 VFPLYHKF----GIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSS 179
+F LY K G+RE + KL V+DG K+ KKL
Sbjct: 126 LFQLYPKAHQGRGLREF----GDIKL----------AAKRVDDG-----GRKVTKKLDVK 166
Query: 180 NAVAV--DSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC 237
A + +S+ + P++GN++PDG Y+T + VGNPPRPY+LD+DTGSDLTWIQCDAPC++C
Sbjct: 167 GAASAGTNSTVLLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNC 226
Query: 238 AKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARD 297
AKG +PLYKP I+P +DSLC E+Q + YCETC+QCDYEIEYAD SSSMGVLA+D
Sbjct: 227 AKGPHPLYKPAKEKIVPPRDSLCQELQGDQN--YCETCKQCDYEIEYADRSSSMGVLAKD 284
Query: 298 ELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKN 357
++HL NG K + VFGCAYDQQG LL++ KTDGILGLS A +SLPSQLAS+GII N
Sbjct: 285 DMHLIATNGGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISN 344
Query: 358 VVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA 417
V GHC+T GGGYMFLG D VP WGM W P+ P LYHTE K+NYG L+ G
Sbjct: 345 VFGHCITRETNGGGYMFLGDDYVPRWGMTWAPIRGGP-DNLYHTEAQKVNYGDQELHAG- 402
Query: 418 RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRS 477
+ QV +FD+GSSYTY ++ Y LI ++KE S V D+SD TLP+CW+A F +RS
Sbjct: 403 NSVQV---IFDSGSSYTYLPEEMYKNLIDAIKE-DSPSFVQDSSDTTLPLCWKADFSVRS 458
Query: 478 IVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIIL 537
FFK L LHFG +W +V F I P+ YL+IS KGN+CLG+L+G+E+++GSTII+
Sbjct: 459 ------FFKPLNLHFGRRWFVVPKTFTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIV 512
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPFL 575
GD+SLRG+LVVYDN ++IGWA S C P K PF
Sbjct: 513 GDVSLRGKLVVYDNERRQIGWANSECTKPQSQKGFPFF 550
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 518 bits (1333), Expect = e-144, Method: Compositional matrix adjust.
Identities = 270/570 (47%), Positives = 353/570 (61%), Gaps = 34/570 (5%)
Query: 11 PQLTGVVIITLPPPNNPSLGKTITAYTLTDNSPQSQQTRHRQQQEHPLPPQLHPPQNSQF 70
PQL GVVIITLPP + PS GKT+TA+ T N P ++ +P +
Sbjct: 16 PQLHGVVIITLPPADQPSKGKTVTAFAYT-NDPPPPRSPPDPVMGYPAATEAR------- 67
Query: 71 NFSLPMLFPGLPRKLFLFLAISIFALILYGSVFSYTLQDRYKSNNDDENK---ESFVFPL 127
P R L + A+ Y +S ++E + SF+ PL
Sbjct: 68 --RRPRRALSTRRVATAALVLGALAVAAYYCFYSDVAVQFLGMEQEEEQRNETRSFLLPL 125
Query: 128 YHKFGIREVSQRDAEFKLGRFVDLDGESVVAS--VNDGIIRPHKSKINKKLVSSNAVAVD 185
Y K + GR + G+ +A+ V+DG K++ ++ + +
Sbjct: 126 YPKA------------RQGRALREFGDVKLAARRVDDG---GRKARNRMEVAKAATARTN 170
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
S+++ P++GN++PDG Y+T + +GNPPRPY+LD+DTGSDLTWIQCDAPC++ AKG +PLY
Sbjct: 171 STALLPIKGNVFPDGQYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNFAKGPHPLY 230
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN 305
KP I+P +D LC E+Q N YCETC+QCDYEIEYAD SSSMGVLARD++H+ N
Sbjct: 231 KPAKEKIVPPRDLLCQELQGNQN--YCETCKQCDYEIEYADQSSSMGVLARDDMHMIATN 288
Query: 306 GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
G K + VFGCAYDQQG LL++ KTDGILGLS A +S PSQLAS GII NV GHC+T
Sbjct: 289 GGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITR 348
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
GGGGYMFLG D VP WG+ W + P LYHT+ + YG L +
Sbjct: 349 EQGGGGYMFLGDDYVPRWGVTWTSIRSGP-DNLYHTQAHHVKYGDQQLRRPEQAGSTVQV 407
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
+FD+GSSYTY + Y L+A++K +S G V D SD TLP+CW+A FP+R + DVKQFF
Sbjct: 408 IFDSGSSYTYLPNEIYENLVAAIK-YASPGFVQDTSDRTLPLCWKADFPVRYLEDVKQFF 466
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQ 545
+ L LHFG KW +S F ISPE YL+IS KGN+CLG+L+G+E+++GSTII+GD+SLRG+
Sbjct: 467 EPLNLHFGKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGK 526
Query: 546 LVVYDNVNKRIGWAKSHCMNPGRFKSLPFL 575
LVVYDN K+IGWA S C P K PF
Sbjct: 527 LVVYDNQRKQIGWADSDCTKPQSQKGFPFF 556
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 510 bits (1314), Expect = e-142, Method: Compositional matrix adjust.
Identities = 274/576 (47%), Positives = 357/576 (61%), Gaps = 33/576 (5%)
Query: 12 QLTGVVIITLPPPNNPSLGKTITAYTLTDNS--PQSQQTRHRQQQEHPLPPQLHPP---- 65
QL GVVIITLPPP+ PS GKTITA+T TD+ P L P
Sbjct: 18 QLHGVVIITLPPPDQPSKGKTITAFTYTDDDVTPPPPTPPPTHLPTRALVPAGAGAGAEA 77
Query: 66 QNSQFNFSLPMLFPGLPRKLF-LFLAISIFALILYGSVFSYT----LQDRYKSNNDDENK 120
+ S+ FS PR+ + L + A+ Y S +S L + ++ N+
Sbjct: 78 RRSRRGFS--------PRRAAAMVLVLGALAVAAYYSFYSDVAVQFLGMQEEAQNERNET 129
Query: 121 ESFVFPLYHKFGIREVSQRDAEFKLG-RFVDLDGESVVASVNDGIIRPHKSKINKKLVSS 179
+SF+ PLY K + + KL R D DG G+ R ++K+ K +
Sbjct: 130 KSFLLPLYPKARQGRALREFGDIKLAARRFDNDG-------GGGVGRKSRNKLEVK--KA 180
Query: 180 NAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK 239
A +S+++ P++GN++PDG Y+T + VGNPPRPY+LD+DTGSDLTWIQCDAPC++CAK
Sbjct: 181 AAAGTNSTALLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAK 240
Query: 240 GANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
G +PLYKP I+P KD LC E+Q N YCETC+QCDYEIEYAD SSSMGVLARD++
Sbjct: 241 GPHPLYKPAKEKIVPPKDLLCQELQGNQN--YCETCKQCDYEIEYADRSSSMGVLARDDM 298
Query: 300 HLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVV 359
H+ NG K + VFGCAYDQQG LL + KTDGILGLS A +SLPSQLA+QGII NV
Sbjct: 299 HIITTNGGREKLDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVF 358
Query: 360 GHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARN 419
GHC+T + GGGYMFLG D VP WGM P+ +P L+HTE K+ YG L++ +
Sbjct: 359 GHCITRDPNGGGYMFLGDDYVPRWGMTSTPIRSAP-DNLFHTEAQKVYYGDQQLSMRGAS 417
Query: 420 SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV 479
+FD+GSSYTY + Y LIA++K + V D+SD TLP+C FP+R +
Sbjct: 418 GNSVQVIFDSGSSYTYLPDEIYKNLIAAIKYAYPN-FVQDSSDRTLPLCLATDFPVRYLE 476
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
DVKQ FK L LHFG +W ++ F I P+ YL+IS KGN+CLG L+G ++ +GST+I+GD
Sbjct: 477 DVKQLFKPLNLHFGKRWFVMPRTFTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVGD 536
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPFL 575
+LRG+LVVYDN ++IGW S C P K PF
Sbjct: 537 NALRGKLVVYDNQQRQIGWTNSDCTKPQTQKGFPFF 572
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 510 bits (1314), Expect = e-142, Method: Compositional matrix adjust.
Identities = 274/576 (47%), Positives = 357/576 (61%), Gaps = 33/576 (5%)
Query: 12 QLTGVVIITLPPPNNPSLGKTITAYTLTDNS--PQSQQTRHRQQQEHPLPPQLHPP---- 65
QL GVVIITLPPP+ PS GKTITA+T TD+ P L P
Sbjct: 19 QLHGVVIITLPPPDQPSKGKTITAFTYTDDDVTPPPPTPPPTHLPTRALVPAGAGAGAEA 78
Query: 66 QNSQFNFSLPMLFPGLPRKLF-LFLAISIFALILYGSVFSYT----LQDRYKSNNDDENK 120
+ S+ FS PR+ + L + A+ Y S +S L + ++ N+
Sbjct: 79 RRSRRGFS--------PRRAAAMVLVLGALAVAAYYSFYSDVAVQFLGMQEEAQNERNET 130
Query: 121 ESFVFPLYHKFGIREVSQRDAEFKLG-RFVDLDGESVVASVNDGIIRPHKSKINKKLVSS 179
+SF+ PLY K + + KL R D DG G+ R ++K+ K +
Sbjct: 131 KSFLLPLYPKARQGRALREFGDIKLAARRFDNDG-------GGGVGRKSRNKLEVK--KA 181
Query: 180 NAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK 239
A +S+++ P++GN++PDG Y+T + VGNPPRPY+LD+DTGSDLTWIQCDAPC++CAK
Sbjct: 182 AAAGTNSTALLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAK 241
Query: 240 GANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
G +PLYKP I+P KD LC E+Q N YCETC+QCDYEIEYAD SSSMGVLARD++
Sbjct: 242 GPHPLYKPAKEKIVPPKDLLCQELQGNQN--YCETCKQCDYEIEYADRSSSMGVLARDDM 299
Query: 300 HLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVV 359
H+ NG K + VFGCAYDQQG LL + KTDGILGLS A +SLPSQLA+QGII NV
Sbjct: 300 HIITTNGGREKLDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVF 359
Query: 360 GHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARN 419
GHC+T + GGGYMFLG D VP WGM P+ +P L+HTE K+ YG L++ +
Sbjct: 360 GHCITRDPNGGGYMFLGDDYVPRWGMTSTPIRSAP-DNLFHTEAQKVYYGDQQLSMRGAS 418
Query: 420 SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV 479
+FD+GSSYTY + Y LIA++K + V D+SD TLP+C FP+R +
Sbjct: 419 GNSVQVIFDSGSSYTYLPDEIYKNLIAAIKYAYPN-FVQDSSDRTLPLCLATDFPVRYLE 477
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
DVKQ FK L LHFG +W ++ F I P+ YL+IS KGN+CLG L+G ++ +GST+I+GD
Sbjct: 478 DVKQLFKPLNLHFGKRWFVMPRTFTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVGD 537
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPFL 575
+LRG+LVVYDN ++IGW S C P K PF
Sbjct: 538 NALRGKLVVYDNQQRQIGWTNSDCTKPQTQKGFPFF 573
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 508 bits (1309), Expect = e-141, Method: Compositional matrix adjust.
Identities = 267/570 (46%), Positives = 350/570 (61%), Gaps = 29/570 (5%)
Query: 11 PQLTGVVIITLPPPNNPSLGKTITAYTLTDNSPQSQQTRHRQQQEHPLPPQLHPPQNSQF 70
PQL GVVIITLPPP+ PS GKTITAYT TD+ ++ + P +
Sbjct: 18 PQLHGVVIITLPPPDQPSKGKTITAYTYTDDPGTPPTPPPPPRRPRS---GMDPAAARRP 74
Query: 71 NFSLPMLFPGLPRKLFLFLAISIFALILYGSVFS-----YTLQDRYKSNNDDENKESFVF 125
+ + L + FAL Y +S + + + + SF+
Sbjct: 75 RRVVSPRRAAA-----MVLVLGAFALAAYYCFYSDVAVQFLGVEEEEVEKERNETRSFLL 129
Query: 126 PLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVD 185
PLY K + + KL ++DG +R +K+ K +S +
Sbjct: 130 PLYPKTRQGRALREFGDIKL----------AAKKIDDGGVRKGVNKLEAKRATS--AGTN 177
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
S+ + P++GN++PDG Y+T + VGNPPRPY+LD+DTGSDLTWIQCDAPC++CAKG +PLY
Sbjct: 178 STVLLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLY 237
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN 305
KP I+P +D LC E+Q + YC TC+QCDYEIEYAD SSSMGVLA+D++H+ N
Sbjct: 238 KPAKEKIVPPRDLLCQELQGDQN--YCATCKQCDYEIEYADRSSSMGVLAKDDMHMIATN 295
Query: 306 GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
G K + VFGCAYDQQG LL + KTDGILGLS A +SLPSQLASQGII NV GHC+T
Sbjct: 296 GGREKLDFVFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCITK 355
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
GGGYMFLG D VP WGM W P+ P LYHTE K+NYG L + +
Sbjct: 356 EPNGGGYMFLGDDYVPRWGMTWAPIRGGP-DNLYHTEAQKVNYGDQQLRMHGQAGSSIQV 414
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
+FD+GSSYTY + Y +L+ ++K V D SD TLP+CW+A F +R + DVKQFF
Sbjct: 415 IFDSGSSYTYLPDEIYKKLVTAIK-YDYPSFVQDTSDTTLPLCWKADFDVRYLEDVKQFF 473
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQ 545
K L LHFG++W ++ F I P+ YL+IS KGN+CLG+L+G+E+ + ST+I+GD+SLRG+
Sbjct: 474 KPLNLHFGNRWFVIPRTFTILPDDYLIISDKGNVCLGLLNGAEIDHASTLIVGDVSLRGK 533
Query: 546 LVVYDNVNKRIGWAKSHCMNPGRFKSLPFL 575
LVVYDN ++IGWA S C P K PF
Sbjct: 534 LVVYDNERRQIGWADSECTKPQPQKGFPFF 563
>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
Length = 410
Score = 504 bits (1298), Expect = e-140, Method: Compositional matrix adjust.
Identities = 238/382 (62%), Positives = 293/382 (76%), Gaps = 5/382 (1%)
Query: 201 LYFTYMIVGNPP--RPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
LY+T ++VG P + Y+LD+DTGS+LTWIQCDAPC+SCAKGAN LYKPR N++ ++
Sbjct: 29 LYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVRSSEA 88
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
C+E+QRN +CE C QCDYEIEYADHS SMGVL +D+ HL + NGSL + ++VFGC
Sbjct: 89 FCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVFGCG 148
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
YDQQGLLLNTL+KTDGILGLSRAK+SLPSQLAS+GII NVVGHCL ++ G GY+F+G D
Sbjct: 149 YDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSD 208
Query: 379 LVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTK 438
LVPS GM WVPML ++ Y ++ K++YG L+L N +VG LFDTGSSYTYF
Sbjct: 209 LVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYFPN 268
Query: 439 QAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK--FPIRSIVDVKQFFKTLTLHFGSKW 496
QAYS+L+ SL+EVS L D SD TLP+CWRAK FP S+ DVK+FF+ +TL GSKW
Sbjct: 269 QAYSQLVTSLQEVSGLELTRDDSDETLPICWRAKTNFPFSSLSDVKKFFRPITLQIGSKW 328
Query: 497 QIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRI 556
I+S K I PE YL+IS KGN+CLGILDGS VH+GSTIILGDIS+RG L+VYDNV +RI
Sbjct: 329 LIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGDISMRGHLIVYDNVKRRI 388
Query: 557 GWAKSHCMNPGRF-KSLPFLEG 577
GW KS C+ P ++PF +G
Sbjct: 389 GWMKSDCVRPREIDHNVPFFQG 410
>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
Length = 538
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 215/391 (54%), Positives = 277/391 (70%), Gaps = 4/391 (1%)
Query: 185 DSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPL 244
+SS++ P+RGN++PDG Y+T M +GNPPRPY+LD+DTGSDLTWIQCDAPC++CAKG +PL
Sbjct: 142 NSSALLPIRGNVFPDGQYYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPL 201
Query: 245 YKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIE 304
YKP N++P +DS C E+Q N Y +T +QCDYEI YAD SSSMG+LARD + L
Sbjct: 202 YKPEKPNVVPPRDSYCQELQGNQN--YGDTSKQCDYEITYADRSSSMGILARDNMQLITA 259
Query: 305 NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
+G + VFGC YDQQG LL++ TDGILGLS A +SLP+QLASQGII NV GHC+
Sbjct: 260 DGERENLDFVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIA 319
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGW 424
+ GGYMFLG D VP WGM W+P+ + P LY TE+ K+NYG LN+ + ++
Sbjct: 320 ADPSNGGYMFLGDDYVPRWGMTWMPIRNGP-ENLYSTEVQKVNYGDQQLNVRRKAGKLTQ 378
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+FD+GSSYTY Y+ LIASLK +S L+ D SD TLP C + FP+RS+ DVK
Sbjct: 379 VIFDSGSSYTYLPHDDYTNLIASLKSLSP-SLLQDESDRTLPFCMKPNFPVRSMDDVKHL 437
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
FK L+L F + I+ F I PE YL+IS K NICLG+LDG+E+ + S I++GD+SLRG
Sbjct: 438 FKPLSLVFKKRLFILPRTFVIPPEDYLIISDKNNICLGVLDGTEIGHDSAIVIGDVSLRG 497
Query: 545 QLVVYDNVNKRIGWAKSHCMNPGRFKSLPFL 575
+LVVY+N K+IGW +S C P + PFL
Sbjct: 498 KLVVYNNDEKQIGWVQSDCAKPQKQSGFPFL 528
>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 538
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 215/391 (54%), Positives = 277/391 (70%), Gaps = 4/391 (1%)
Query: 185 DSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPL 244
+SS++ P+RGN++PDG Y+T M +GNPPRPY+LD+DTGSDLTWIQCDAPC++CAKG +PL
Sbjct: 142 NSSALLPIRGNVFPDGQYYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPL 201
Query: 245 YKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIE 304
YKP N++P +DS C E+Q N Y +T +QCDYEI YAD SSSMG+LARD + L
Sbjct: 202 YKPEKPNVVPPRDSYCQELQGNQN--YGDTSKQCDYEITYADRSSSMGILARDNMQLITA 259
Query: 305 NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
+G + VFGC YDQQG LL++ TDGILGLS A +SLP+QLASQGII NV GHC+
Sbjct: 260 DGERENLDFVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIA 319
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGW 424
+ GGYMFLG D VP WGM W+P+ + P LY TE+ K+NYG LN+ + ++
Sbjct: 320 ADPSNGGYMFLGDDYVPRWGMTWMPIRNGP-ENLYSTEVQKVNYGDQQLNVRRKAGKLTQ 378
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+FD+GSSYTY Y+ LIASLK +S L+ D SD TLP C + FP+RS+ DVK
Sbjct: 379 VIFDSGSSYTYLPHDDYTNLIASLKSLSP-SLLQDESDRTLPFCMKPNFPVRSMDDVKHL 437
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
FK L+L F + I+ F I PE YL+IS K NICLG+LDG+E+ + S I++GD+SLRG
Sbjct: 438 FKPLSLVFKKRLFILPRTFVIPPEDYLIISDKNNICLGVLDGTEIGHDSAIVIGDVSLRG 497
Query: 545 QLVVYDNVNKRIGWAKSHCMNPGRFKSLPFL 575
+LVVY+N K+IGW +S C P + PFL
Sbjct: 498 KLVVYNNDEKQIGWVQSDCAKPQKQSGFPFL 528
>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 210/399 (52%), Positives = 271/399 (67%), Gaps = 8/399 (2%)
Query: 177 VSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236
VS A A + S + P+ Y+T + +GNP RPY+LD+DTGS LTWIQCDAPC++
Sbjct: 108 VSFKAAAAEEGST----AAVLPERQYYTSINIGNPARPYFLDVDTGSALTWIQCDAPCTN 163
Query: 237 CAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLAR 296
C KG +PLYKP NI+P +DS C E+Q N YC+TC+QCDYEI YAD SSS GVLAR
Sbjct: 164 CTKGPHPLYKPAKENIVPPRDSHCQELQGNQN--YCDTCKQCDYEIAYADRSSSAGVLAR 221
Query: 297 DELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK 356
D + L +G ++VFGCA+DQQG LL + +DGILGLS +SLP+QLA QGII
Sbjct: 222 DNMELITADGERENMDLVFGCAHDQQGKLLGSPASSDGILGLSNGAMSLPTQLAKQGIIS 281
Query: 357 NVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG 416
NV GHC+ T+ G YMFLG D VP WGM WVP+ + P ++Y T + K+NYG LN+
Sbjct: 282 NVFGHCIATDPSGSAYMFLGDDYVPRWGMTWVPVRNGP-EDVYSTVVQKVNYGCQELNVR 340
Query: 417 ARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIR 476
+ ++ +FD+GSSYTYF + Y+ LI SL+ VS G V D SD TLP C + FP+R
Sbjct: 341 EQAGKLTQVIFDSGSSYTYFPHEIYTSLITSLEAVSP-GFVRDESDQTLPFCMKPNFPVR 399
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
S+ DVKQ K L LHF W ++ F ISPE YL+IS KGN+CLG+LDG+E+ + STI+
Sbjct: 400 SVDDVKQLHKPLLLHFSKTWLVIPRTFEISPENYLIISGKGNVCLGVLDGTEIGHSSTIV 459
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPFL 575
+GD+SLRG+LV YDN +IGWA+S C P + +PF
Sbjct: 460 IGDVSLRGKLVAYDNDANQIGWAQSDCARPQKASMVPFF 498
>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 395
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 203/380 (53%), Positives = 265/380 (69%), Gaps = 4/380 (1%)
Query: 196 IYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPY 255
+ P+ Y+T + +GNPPRPY+LD+DTGSD TWI CDAPC++C KG +P+YKP G I+
Sbjct: 10 VVPERQYYTSINIGNPPRPYFLDIDTGSDFTWIHCDAPCTNCTKGPHPVYKPTEGKIVHP 69
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
+D LC E+Q N YCETC+QCDYEI YAD SSS GVLARD + LT +G + + VF
Sbjct: 70 RDPLCEELQGNQN--YCETCKQCDYEITYADRSSSKGVLARDNMQLTTADGEMKNVDFVF 127
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GCA++QQG LL++ TDGILGLS +SL +QLA+ GII NV GHC+ T+ GGYMFL
Sbjct: 128 GCAHNQQGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMATDPSSGGYMFL 187
Query: 376 GHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTY 435
G D VP WGM WVP+ + P +Y TE+ K+NYG+ LNL + ++ +FD+GSSYTY
Sbjct: 188 GDDYVPRWGMTWVPIRNGP-GNVYSTEVPKVNYGAQELNLRGQAGKLTQVIFDSGSSYTY 246
Query: 436 FTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSK 495
F + Y+ LIA L++ +S G V D SD TLP C + P+RS+ DV+Q F L L +
Sbjct: 247 FPHEIYTNLIALLED-ASPGFVRDESDQTLPFCMKPNVPVRSVGDVEQLFNPLILQLRKR 305
Query: 496 WQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKR 555
W ++ T F ISPE YL+IS KGN+CLG+LDG+E+ + STII+GD SLRG+ VVYDN R
Sbjct: 306 WFVIPTTFAISPENYLIISDKGNVCLGVLDGTEIGHSSTIIIGDASLRGKFVVYDNDENR 365
Query: 556 IGWAKSHCMNPGRFKSLPFL 575
IGW +S C P + +PF
Sbjct: 366 IGWVQSDCTRPQKQSRVPFF 385
>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 535
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 201/393 (51%), Positives = 269/393 (68%), Gaps = 16/393 (4%)
Query: 186 SSSIFP--LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP-CSSCAKGAN 242
+S++FP L GN++P+GLY+T + +G+PPRPY+LD+DTGS TW+QCDAP C+SCAKGA+
Sbjct: 142 NSTLFPHSLAGNLFPEGLYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAH 201
Query: 243 PLYKP-RMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL 301
PLY+P R + LP D LC Q E QCDYEI YAD SSSMGV RD +
Sbjct: 202 PLYRPARTADALPASDPLCEGAQH-------ENPNQCDYEISYADGSSSMGVYVRDSMQF 254
Query: 302 TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
E+G ++VFGC YDQQG+LLN L TDG+LGL+ +SLP+QLAS+GII N GH
Sbjct: 255 VGEDGERENADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGH 314
Query: 362 CLTTN-AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNS 420
C++T+ +G GGY+FLG D +P WGM WVP+ D P ++ ++ +IN+G LN + +
Sbjct: 315 CMSTDPSGAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLT 374
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
QV +FDTGS+YTYF +A + LI+SLKE +S V D SD TLP C ++ FP+RS+ D
Sbjct: 375 QV---VFDTGSTYTYFPDEALTRLISSLKEAASPRFVQDDSDKTLPFCMKSDFPVRSVED 431
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDI 540
VK FFK L+L F ++ S F+I PE YLVIS KGN+CLG+L+G+ + S +I+GD+
Sbjct: 432 VKHFFKPLSLQFEKRF-FFSRTFNIRPEHYLVISDKGNVCLGVLNGTTIGYDSVVIVGDV 490
Query: 541 SLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
SLRG+LV YDN +GW C NP + +P
Sbjct: 491 SLRGKLVAYDNDKNEVGWVDFDCTNPRKRSRIP 523
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 183/391 (46%), Positives = 248/391 (63%), Gaps = 2/391 (0%)
Query: 187 SSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYK 246
++ +P+ GNIYPDGLY+ M +GNP + YYLDMDTGSDLTW+QCDAPC SCA G + LY
Sbjct: 16 TAAYPIGGNIYPDGLYYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPHGLYD 75
Query: 247 PRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENG 306
P+ ++ + C ++QR + +QCDYE++Y D SS+MG+L D + L + NG
Sbjct: 76 PKRARVVDCRRPTCAQVQRGGQFTCSGDVRQCDYEVDYVDGSSTMGILVEDTITLVLTNG 135
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
+ + V GC YDQQG L TDG++GLS +K+SLPSQLA++GI NV+GHCL
Sbjct: 136 TRFQTRAVIGCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLAGG 195
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWAL 426
+ GGGY+F G LVP+ GM W PM+ P +E Y + I YG L L VG A+
Sbjct: 196 SNGGGYLFFGDTLVPALGMTWTPMIGRPLVEGYQARLRSIKYGGEVLELEGTTDDVGGAM 255
Query: 427 FDTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
FD+G+S+TY AY+ ++ A +++ GL +D TLP CWR P S+ DV +F
Sbjct: 256 FDSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTDTTLPFCWRGPSPFESVADVSAYF 315
Query: 486 KTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
KT+TL F GS W +SPEGYL++S +GN+CLG+LD S T ILGDIS+RG
Sbjct: 316 KTVTLDFGGSTWWSSGKLLELSPEGYLIVSTQGNVCLGVLDASVASLEVTNILGDISMRG 375
Query: 545 QLVVYDNVNKRIGWAKSHCMNPGRFKSLPFL 575
LVVYDN+ ++IGW + +C N R + L
Sbjct: 376 YLVVYDNMREQIGWVRRNCYNRPRTATSQIL 406
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 188/404 (46%), Positives = 259/404 (64%), Gaps = 12/404 (2%)
Query: 170 SKINKKLVSSNAVAVDSSSI------FPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGS 223
++I + L+ + + SS+ F + GNIYPDGLY+ +++G+PP+ Y+LDMDTGS
Sbjct: 2 TQIRRTLLERDLSRLGKSSVGNHSVRFHVGGNIYPDGLYYMALLLGSPPKLYFLDMDTGS 61
Query: 224 DLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIE 283
DLTW QCDAPC +CA G + LY P+ ++ +C +IQ+ +QCDYE+E
Sbjct: 62 DLTWAQCDAPCRNCAIGPHGLYNPKKAKVVDCHLPVCAQIQQGGSYECNSDVKQCDYEVE 121
Query: 284 YADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKV 343
YAD SS+MGVL D L + + NG+L + + GC YDQQG L + TDG++GLS +KV
Sbjct: 122 YADGSSTMGVLVEDTLTVRLTNGTLIQTKAIIGCGYDQQGTLAKSPASTDGVIGLSSSKV 181
Query: 344 SLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEI 403
+LP+QLA +GIIKNV+GHCL + GGGY+F G +LVPSWGM W PM+ P M Y +
Sbjct: 182 ALPAQLAEKGIIKNVLGHCLADGSNGGGYLFFGDELVPSWGMTWTPMMGKPEMLGYQARL 241
Query: 404 LKINYGSSPLNLGARNS---QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDA 460
I YG L L +FD+G+S+TY QAY+ +++++ + S GL+
Sbjct: 242 QSIRYGGDSLVLNNDEDLTRSTSSVMFDSGTSFTYLVPQAYASVLSAVTKQS--GLLRVK 299
Query: 461 SDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSK-WQIVSTKFHISPEGYLVISKKGNI 519
SD TLP CWR P +SI DV Q+FKTLTL FG + W + +SP+GYL++S +GN+
Sbjct: 300 SDTTLPYCWRGPSPFQSITDVHQYFKTLTLDFGGRNWFATDSTLDLSPQGYLIVSTQGNV 359
Query: 520 CLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
CLGILD S T I+GD+S+RG LVVYDNV RIGW + +C
Sbjct: 360 CLGILDASGASLEVTNIIGDVSMRGYLVVYDNVRDRIGWIRRNC 403
>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
Length = 446
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 185/380 (48%), Positives = 252/380 (66%), Gaps = 6/380 (1%)
Query: 187 SSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYK 246
++IF L+GN+ P GLY+ M+VGNP +PY+LD+D+GS+LTWIQCDAPC SCAKG +PLYK
Sbjct: 64 TAIFSLKGNVVPYGLYYVTMLVGNPSKPYFLDVDSGSELTWIQCDAPCISCAKGPHPLYK 123
Query: 247 PRMGNILPYKDSLCMEIQ--RNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIE 304
+ G+++P KD LC +Q H + E Q+CDY++ YADH S G L RD + +
Sbjct: 124 LKKGSLVPSKDPLCAAVQAGSGHYHNHKEASQRCDYDVAYADHGYSEGFLVRDSVRALLT 183
Query: 305 NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
N ++ N VFGC Y+Q+ L + +TDGILGL SLPSQ A QG+IKNV+GHC+
Sbjct: 184 NKTVLTANSVFGCGYNQRESLPVSDARTDGILGLGSGMASLPSQWAKQGLIKNVIGHCIF 243
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGW 424
GGYMF G DLV + M WVPML P ++ Y+ ++N+G+ PL+ ++G
Sbjct: 244 GAGRDGGYMFFGDDLVSTSAMTWVPMLGRPSIKHYYVGAAQMNFGNKPLDKDGDGKKLGG 303
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKE-VSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
+FD+GS+YTYFT QAY ++ +KE +S L D+SD L +CWR K RS+ +
Sbjct: 304 IIFDSGSTYTYFTNQAYGAFLSVVKENLSGKQLEQDSSDSFLSLCWRRKEGFRSVAEAAA 363
Query: 484 FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
+FK LTL F S + + I PEGYLV++KKGN+CLGIL+G+ + T +LGDIS +
Sbjct: 364 YFKPLTLKFRS---TKTKQMEIFPEGYLVVNKKGNVCLGILNGTAIGIVDTNVLGDISFQ 420
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
GQLVVYDN +IGWA+S C
Sbjct: 421 GQLVVYDNEKNQIGWARSDC 440
>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 381
Score = 377 bits (968), Expect = e-101, Method: Compositional matrix adjust.
Identities = 178/386 (46%), Positives = 249/386 (64%), Gaps = 12/386 (3%)
Query: 183 AVDSSSIFP-LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGA 241
A ++++F LRGNIYPDGLY+ M++G P + YYLDMDTGSDLTW+QCDAPC SCA G
Sbjct: 3 ADKNATVFSQLRGNIYPDGLYYMAMLIGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGP 62
Query: 242 NPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL 301
+ LY P+ ++ + LC +Q+ +QCDY++EYAD SS+MGVL D + L
Sbjct: 63 HGLYDPKKARLVDCRVPLCALVQQGGSYACGGPVRQCDYDVEYADGSSTMGVLMEDTITL 122
Query: 302 TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
+ NG+ +K + GC YDQQG L T TDG++GLS AK+SLPSQLA +GI++NV+GH
Sbjct: 123 LLTNGTRSKTTAIIGCGYDQQGTLAQTPASTDGVMGLSSAKISLPSQLAKKGIVRNVIGH 182
Query: 362 CLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQ 421
CL + GGGY+F G LVP+ GM W P++ N G + +
Sbjct: 183 CLAGGSNGGGYLFFGDSLVPALGMTWTPIMGKSITG---------NIGGKSGDADDKTGD 233
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
+G +FD+G+S+TY +AY+ ++++++ +V GLV +D TLP CWR P S+ D
Sbjct: 234 IGGVMFDSGTSFTYLVPEAYNAVLSAMEMQVEKSGLVRIKTDNTLPFCWRGPSPFESVAD 293
Query: 481 VKQFFKTLTLHFGSK-WQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
V+++FKT+TL FG + W S +SPEGYL++S +GN+CLGILD S T I+GD
Sbjct: 294 VQRYFKTVTLDFGKRNWYSASRVLELSPEGYLIVSTQGNVCLGILDASGASLEVTNIIGD 353
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHCMN 565
+S+RG LVVYDN +IGW + +C N
Sbjct: 354 VSMRGYLVVYDNARNQIGWVRRNCHN 379
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 349 bits (895), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 182/407 (44%), Positives = 253/407 (62%), Gaps = 23/407 (5%)
Query: 177 VSSNAVAVD---SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP 233
SS+ V+ SS++FPL G++YP GLY+ M +GNPP+PY+LD+DTGSDLTW+QCDAP
Sbjct: 38 ASSSVAGVETEASSAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAP 97
Query: 234 CSSCAKGANPLYKPRMGNILPYKDSLCMEIQ----RNHKPGYCET-CQQCDYEIEYADHS 288
C SC K +PLY+P ++P D LC + R HK C++ +QCDY I+YAD
Sbjct: 98 CRSCNKVPHPLYRPTKNKLVPCVDQLCASLHNGLNRKHK---CDSPYEQCDYVIKYADQG 154
Query: 289 SSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQ 348
SS GVL D L + NGS+ +P++ FGC YDQQ + + TDG+LGL VSL SQ
Sbjct: 155 SSTGVLVNDSFALRLANGSVVRPSLAFGCGYDQQ-VSSGEMSPTDGVLGLGTGSVSLLSQ 213
Query: 349 LASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINY 408
G+ KNVVGHCL+ GGG++F G DLVP + W PM+ SP Y + +
Sbjct: 214 FKQHGVTKNVVGHCLSLR--GGGFLFFGDDLVPYQRVTWTPMVRSPLRNYYSPGSASLYF 271
Query: 409 GSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVC 468
G L + ++ +FD+GSS+TYF Q Y L+ +LK S L + SDP+LP+C
Sbjct: 272 GDQSLRV-----KLTEVVFDSGSSFTYFAAQPYQALVTALKGDLSRTLK-EVSDPSLPLC 325
Query: 469 WRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSE 528
W+ K P +S++DVK+ FK+L L+FG+ + I P+ YL+++K GN CLGIL+GSE
Sbjct: 326 WKGKKPFKSVLDVKKEFKSLVLNFGNGNKAF---MEIPPQNYLIVTKYGNACLGILNGSE 382
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPFL 575
V ILGDI+++ Q+V+YDN +IGW ++ C +F S L
Sbjct: 383 VGLKDLSILGDITMQDQMVIYDNEKGQIGWIRAPCDRIPKFGSSALL 429
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 347 bits (891), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 182/400 (45%), Positives = 250/400 (62%), Gaps = 19/400 (4%)
Query: 177 VSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236
+S A A +SS++FPL G++YP GLY+ M +GNPPRPY+LD+DTGSDLTW+QCDAPC S
Sbjct: 33 LSVTAGAEESSAVFPLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVS 92
Query: 237 CAKGANPLYKPRMGNILPYKDSLCMEIQRN----HKPGYCET-CQQCDYEIEYADHSSSM 291
C+K +PLY+P ++P D +C + HK C++ QQCDYEI+YAD SS+
Sbjct: 93 CSKVPHPLYRPTKNKLVPCVDQMCAALHGGLTGRHK---CDSPKQQCDYEIKYADQGSSL 149
Query: 292 GVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLAS 351
GVL D L + N S+ +P + FGC YDQQ + TDG+LGL VSL SQL
Sbjct: 150 GVLVTDSFALRLANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQ 209
Query: 352 QGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSS 411
GI KNVVGHCL+T GGG++F G D+VP W PM S Y + +G
Sbjct: 210 HGITKNVVGHCLSTR--GGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGR 267
Query: 412 PLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRA 471
P LG R +V +FD+GSS+TYF+ Q Y L+ ++K S L + D +LP+CW+
Sbjct: 268 P--LGVRPMEV---VFDSGSSFTYFSAQPYQALVDAIKGDLSKNLK-EVPDHSLPLCWKG 321
Query: 472 KFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHN 531
K P +S++DVK+ FKT+ L F + + + I PE YL+++K GN CLGIL+GSEV
Sbjct: 322 KKPFKSVLDVKKEFKTVVLSFSNGKKAL---MEIPPENYLIVTKYGNACLGILNGSEVGL 378
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKS 571
I+GDI+++ Q+V+YDN +IGW ++ C +F S
Sbjct: 379 KDLNIVGDITMQDQMVIYDNERGQIGWIRAPCDRIPKFGS 418
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 181/400 (45%), Positives = 250/400 (62%), Gaps = 19/400 (4%)
Query: 177 VSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236
+S A A +SS++FPL G++YP GLY+ M +GNPPRPY+LD+DTGSDLTW+QCDAPC S
Sbjct: 33 LSVTAGAEESSAVFPLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVS 92
Query: 237 CAKGANPLYKPRMGNILPYKDSLCMEIQRN----HKPGYCET-CQQCDYEIEYADHSSSM 291
C+K +PLY+P ++P D +C + HK C++ QQCDYEI+YAD SS+
Sbjct: 93 CSKVPHPLYRPTKNKLVPCVDQMCAALHGGLTGRHK---CDSPKQQCDYEIKYADQGSSL 149
Query: 292 GVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLAS 351
GVL D L + N S+ +P + FGC YDQQ + TDG+LGL VSL SQL
Sbjct: 150 GVLVTDSFALRLANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQ 209
Query: 352 QGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSS 411
GI KNVVGHCL+T GGG++F G D+VP W PM S Y + +G
Sbjct: 210 HGITKNVVGHCLSTR--GGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGR 267
Query: 412 PLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRA 471
P LG R +V +FD+GSS+TYF+ Q Y L+ ++K S L + D +LP+CW+
Sbjct: 268 P--LGVRPMEV---VFDSGSSFTYFSAQPYQALVDAIKGDLSKNLK-EVPDHSLPLCWKG 321
Query: 472 KFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHN 531
K P +S++DVK+ F+T+ L F + + + I PE YL+++K GN CLGIL+GSEV
Sbjct: 322 KKPFKSVLDVKKEFRTVVLSFSNGKKAL---MEIPPENYLIVTKYGNACLGILNGSEVGL 378
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKS 571
I+GDI+++ Q+V+YDN +IGW ++ C +F S
Sbjct: 379 KDLNIVGDITMQDQMVIYDNERGQIGWIRAPCDRIPKFGS 418
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 345 bits (885), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 179/392 (45%), Positives = 247/392 (63%), Gaps = 19/392 (4%)
Query: 177 VSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236
+S A A +SS++FPL G++YP GLY+ M +GNPPRPY+LD+DTGSDLTW+QCDAPC S
Sbjct: 33 LSVTAGAEESSAVFPLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVS 92
Query: 237 CAKGANPLYKPRMGNILPYKDSLCMEIQRN----HKPGYCET-CQQCDYEIEYADHSSSM 291
C+K +PLY+P ++P D +C + HK C++ QQCDYEI+YAD SS+
Sbjct: 93 CSKVPHPLYRPTKNKLVPCVDQMCAALHGGLTGRHK---CDSPKQQCDYEIKYADQGSSL 149
Query: 292 GVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLAS 351
GVL D L + N S+ +P + FGC YDQQ + TDG+LGL VSL SQL
Sbjct: 150 GVLVTDSFALRLANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQ 209
Query: 352 QGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSS 411
GI KNVVGHCL+T GGG++F G D+VP W PM S Y + +G
Sbjct: 210 HGITKNVVGHCLSTR--GGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGR 267
Query: 412 PLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRA 471
P LG R +V +FD+GSS+TYF+ Q Y L+ ++K S L + D +LP+CW+
Sbjct: 268 P--LGVRPMEV---VFDSGSSFTYFSAQPYQALVDAIKGDLSKNLK-EVPDHSLPLCWKG 321
Query: 472 KFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHN 531
K P +S++DVK+ F+T+ L F + + + I PE YL+++K GN CLGIL+GSEV
Sbjct: 322 KKPFKSVLDVKKEFRTVVLSFSNGKKAL---MEIPPENYLIVTKYGNACLGILNGSEVGL 378
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+GDI+++ Q+V+YDN +IGW ++ C
Sbjct: 379 KDLNIVGDITMQDQMVIYDNERGQIGWIRAPC 410
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 343 bits (879), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 179/399 (44%), Positives = 249/399 (62%), Gaps = 19/399 (4%)
Query: 178 SSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC 237
++ A +SS++F L G++YP GLY+ M +GNPPRPY+LD+DTGSDLTW+QCDAPC SC
Sbjct: 34 AAEAEPEESSAVFQLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSC 93
Query: 238 AKGANPLYKPRMGNILPYKDSLCMEIQ----RNHKPGYCET-CQQCDYEIEYADHSSSMG 292
K +PLY+P I+P D LC + HK C++ QQCDYEI+YAD SS+G
Sbjct: 94 NKVPHPLYRPTKNKIVPCVDQLCSSLHGGLSGKHK---CDSPKQQCDYEIKYADQGSSLG 150
Query: 293 VLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQ 352
VL D + + N S+ +P++ FGC YDQQ + TDG+LGL +SL SQL
Sbjct: 151 VLLTDSFAVRLANSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQH 210
Query: 353 GIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSP 412
GI KNVVGHCL+ GGG++F G +LVP WVPM+ S F Y + +G
Sbjct: 211 GITKNVVGHCLSIR--GGGFLFFGDNLVPYSRATWVPMVRSAFKNYYSPGTASLYFGGR- 267
Query: 413 LNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
+LG R +V + D+GSS+TYF Q Y L+ +LK S L + DP+LP+CW+ K
Sbjct: 268 -SLGVRPMEV---VLDSGSSFTYFGAQPYQALVTALKSDLSKTLK-EVFDPSLPLCWKGK 322
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNG 532
P +S++DVK+ FK+L L F + + + I PE YL+++K GN CLGIL+GSE+
Sbjct: 323 KPFKSVLDVKKEFKSLVLSFSNGKKAL---MEIPPENYLIVTKFGNACLGILNGSEIGLK 379
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKS 571
I+GDI+++ Q+V+YDN +IGW ++ C +F S
Sbjct: 380 DLNIVGDITMQDQMVIYDNERGQIGWIRAPCDRIPKFGS 418
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 339 bits (870), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 173/398 (43%), Positives = 248/398 (62%), Gaps = 14/398 (3%)
Query: 177 VSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236
+++ A SS++FPL G++YP GLY+ M +GNPP+PY+LD+D+GSDLTW+QCDAPC S
Sbjct: 39 IAAGAETEPSSAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRS 98
Query: 237 CAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPG--YCET-CQQCDYEIEYADHSSSMGV 293
C + +PLY+P ++P LC + G CE+ +QCDY I+YAD SS GV
Sbjct: 99 CNEVPHPLYRPTKSKLVPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKYADQGSSTGV 158
Query: 294 LARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQG 353
L D L + NGS+ +P+V FGC YDQQ + TDG+LGL VSL SQL +G
Sbjct: 159 LVNDSFALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRG 218
Query: 354 IIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPL 413
+ KNVVGHCL+ GGG++F G DLVP W PM S F Y + +G
Sbjct: 219 VTKNVVGHCLSLR--GGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDR-- 274
Query: 414 NLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKF 473
+LG R ++V +FD+GSS+TYF + Y L+ +LK+ S L + D +LP+CW+ +
Sbjct: 275 SLGVRLAKV---VFDSGSSFTYFAAKPYQALVTALKDGLSRTLE-EEPDTSLPLCWKGQE 330
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGS 533
P +S++DV++ FK+L L+F S + T I PE YL++++ GN CLGIL+GSE+
Sbjct: 331 PFKSVLDVRKEFKSLVLNFASGKK---TLMEIPPENYLIVTENGNACLGILNGSEIGLKD 387
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKS 571
I+GDI+++ +V+YDN +IGW ++ C +F S
Sbjct: 388 LSIIGDITMQDHMVIYDNEKGKIGWIRAPCDRAPKFGS 425
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 337 bits (863), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 171/397 (43%), Positives = 248/397 (62%), Gaps = 13/397 (3%)
Query: 177 VSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236
+++ A SS++FPL G++YP GLY+ M +GNPP+PY+LD+D+GSDLTW+QCDAPC S
Sbjct: 41 IAAGAETEPSSAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRS 100
Query: 237 CAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGY-CET-CQQCDYEIEYADHSSSMGVL 294
C + +PLY+P ++P LC + + C++ +QCDY I+YAD SS GVL
Sbjct: 101 CNEVPHPLYRPTKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVL 160
Query: 295 ARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI 354
D L + NGS+ +P+V FGC YDQQ + TDG+LGL VSL SQL +G+
Sbjct: 161 INDSFALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGV 220
Query: 355 IKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN 414
KNVVGHCL+ GGG++F G DLVP W PM S F Y + +G +
Sbjct: 221 TKNVVGHCLSLR--GGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDR--S 276
Query: 415 LGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFP 474
LG R ++V +FD+GSS+TYF + Y L+ +LK+ S L + D +LP+CW+ + P
Sbjct: 277 LGVRLAKV---VFDSGSSFTYFAAKPYQALVTALKDGLSRTLE-EEPDTSLPLCWKGQEP 332
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGST 534
+S++DV++ FK+L L+F S + T I PE YL++++ GN CLGIL+GSE+
Sbjct: 333 FKSVLDVRKEFKSLVLNFASGKK---TLMEIPPENYLIVTENGNACLGILNGSEIGLKDL 389
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKS 571
I+GDI+++ +V+YDN +IGW ++ C +F S
Sbjct: 390 SIIGDITMQDHMVIYDNEKGKIGWIRAPCDRAPKFGS 426
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 332 bits (852), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 168/387 (43%), Positives = 243/387 (62%), Gaps = 13/387 (3%)
Query: 187 SSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYK 246
+++FPL G++YP GLY+ M +GNPP+PY+LD+D+GSDLTW+QCDAPC SC + +PLY+
Sbjct: 42 AAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYR 101
Query: 247 PRMGNILPYKDSLCMEIQRNHKPGY-CET-CQQCDYEIEYADHSSSMGVLARDELHLTIE 304
P ++P LC + + C++ +QCDY I+YAD SS GVL D L +
Sbjct: 102 PTKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLT 161
Query: 305 NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
NGS+ +P+V FGC YDQQ + TDG+LGL VSL SQL +G+ KNVVGHCL+
Sbjct: 162 NGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLS 221
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGW 424
GGG++F G DLVP W PM S F Y + +G +LG R ++V
Sbjct: 222 LR--GGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDR--SLGVRLAKV-- 275
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+FD+GSS+TYF + Y L+ +LK+ S L + D +LP+CW+ + P +S++DV++
Sbjct: 276 -VFDSGSSFTYFAAKPYQALVTALKDGLSRTLE-EEPDTSLPLCWKGQEPFKSVLDVRKE 333
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
FK+L L+F S + T I PE YL++++ GN CLGIL+GSE+ I+GDI+++
Sbjct: 334 FKSLVLNFASGKK---TLMEIPPENYLIVTENGNACLGILNGSEIGLKDLSIIGDITMQD 390
Query: 545 QLVVYDNVNKRIGWAKSHCMNPGRFKS 571
+V+YDN +IGW ++ C +F S
Sbjct: 391 HMVIYDNEKGKIGWIRAPCDRAPKFGS 417
>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 405
Score = 332 bits (850), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 170/401 (42%), Positives = 249/401 (62%), Gaps = 17/401 (4%)
Query: 164 IIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGS 223
I+ KS I K + S+ SS +FPL GN++P G Y M +G+PP+ + D+DTGS
Sbjct: 15 IVPLSKSSIFKTFIKSSP----SSVVFPLSGNVFPLGYYSVLMQIGSPPKAFQFDIDTGS 70
Query: 224 DLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIE 283
DLTW+QCDAPCS C N YKP+ GNI+P + +C + +KP +QCDYE++
Sbjct: 71 DLTWVQCDAPCSGCTLPPNLQYKPK-GNIIPCSNPICTALHWPNKPHCPNPQEQCDYEVK 129
Query: 284 YADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKV 343
YAD SSMG L D+ L + NGS +P V FGC YDQ + T G+LGL R K+
Sbjct: 130 YADQGSSMGALVTDQFPLKLVNGSFMQPPVAFGCGYDQSYPSAHPPPATAGVLGLGRGKI 189
Query: 344 SLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEI 403
L +QL S G+ +NVVGHCL++ GGG++F G +LVPS G+AW P+L Y T
Sbjct: 190 GLLTQLVSAGLTRNVVGHCLSSK--GGGFLFFGDNLVPSIGVAWTPLLSQD--NHYTTGP 245
Query: 404 LKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASD 462
+ + P L +FDTGSSYTYF +AY +I + ++ L + D
Sbjct: 246 ADLLFNGKPTGLKGLK-----LIFDTGSSYTYFNSKAYQTIINLIGNDLKVSPLKVAKED 300
Query: 463 PTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLG 522
TLP+CW+ P +S+++VK FFKT+T++F + + +T+ +++PE YL++SK GN+CLG
Sbjct: 301 KTLPICWKGAKPFKSVLEVKNFFKTITINFTNGRR--NTQLYLAPELYLIVSKTGNVCLG 358
Query: 523 ILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+L+GSEV ++ ++GDIS++G +++YDN +++GW S C
Sbjct: 359 LLNGSEVGLQNSNVIGDISMQGLMMIYDNEKQQLGWVSSDC 399
>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 452
Score = 331 bits (849), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 176/393 (44%), Positives = 247/393 (62%), Gaps = 16/393 (4%)
Query: 174 KKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP 233
KKL S N + SS++F ++GN+YP G Y + +G PP+ Y LD+D+GSDLTW+QCDAP
Sbjct: 36 KKLSSDNHHRLSSSAVFKVQGNVYPLGHYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAP 95
Query: 234 CSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGV 293
C C K + LYKP N++ D LC E+Q + + QCDYE+EYADH SS+GV
Sbjct: 96 CKGCTKPRDQLYKPNH-NLVQCVDQLCSEVQLSMEYTCASPDDQCDYEVEYADHGSSLGV 154
Query: 294 LARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQG 353
L RD + NGS+ +P V FGC YDQ+ N+ T G+LGL + S+ SQL S G
Sbjct: 155 LVRDYIPFQFTNGSVVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLG 214
Query: 354 IIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPL 413
+I NVVGHCL+ A GGG++F G D +PS G+ W ML S + Y + G + L
Sbjct: 215 LIHNVVGHCLS--ARGGGFLFFGDDFIPSSGIVWTSMLPSSSEKHYSS-------GPAEL 265
Query: 414 NLGARNSQV-GWAL-FDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWR 470
+ + V G L FD+GSSYTYF QAY ++ + +++ L DP+LP+CW+
Sbjct: 266 VFNGKATVVKGLELIFDSGSSYTYFNSQAYQAVVDLVTQDLKGKQLKRATDDPSLPICWK 325
Query: 471 AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVH 530
+S+ DVK++FK L L F +K +I+ + H+ PE YL+I+K GN+CLGILDG+EV
Sbjct: 326 GAKSFKSLSDVKKYFKPLALSF-TKTKIL--QMHLPPEAYLIITKHGNVCLGILDGTEVG 382
Query: 531 NGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ I+GDISL+ ++V+YDN ++IGW S+C
Sbjct: 383 LENLNIIGDISLQDKMVIYDNEKQQIGWVSSNC 415
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 327 bits (837), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 168/379 (44%), Positives = 236/379 (62%), Gaps = 20/379 (5%)
Query: 189 IFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR 248
+F L G++YP G Y+ M +G+P +PY+LD+DTGSDLTW+QCDAPC SC K +PLY+P
Sbjct: 44 VFLLSGDVYPTGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPT 103
Query: 249 MGNILPYKDSLCMEIQRNHKPG-YCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS 307
++P +S+C + P C T QQCDY+I+Y D +SS+GVL D L + N S
Sbjct: 104 KNKLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSLPLRNKS 163
Query: 308 LTKPNVVFGCAYDQQ-GLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
+P++ FGC YDQQ G TDG+LGL R VSL SQL QGI KNV+GHCL+T+
Sbjct: 164 NVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTS 223
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR--NSQVGW 424
GGG++F G D+VP+ + WVPM+ S Y GS+ L R +++
Sbjct: 224 --GGGFLFFGDDMVPTSRVTWVPMVRSTSGNYYSP-------GSATLYFDRRSLSTKPME 274
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+FD+GS+YTYF+ Q Y I+++K S L SDP+LP+CW+ + +S+ DVK+
Sbjct: 275 VVFDSGSTYTYFSAQPYQATISAIKGSLSKSLK-QVSDPSLPLCWKGQKAFKSVSDVKKD 333
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
FK+L FG + I PE YL+++K GN+CLGILDGS +II GDI+++
Sbjct: 334 FKSLQFIFGK-----NAVMEIPPENYLIVTKNGNVCLGILDGSAAKLSFSII-GDITMQD 387
Query: 545 QLVVYDNVNKRIGWAKSHC 563
Q+V+YDN ++GW + C
Sbjct: 388 QMVIYDNEKAQLGWIRGSC 406
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 324 bits (831), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 166/385 (43%), Positives = 237/385 (61%), Gaps = 15/385 (3%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
S+++F L+G++YP G Y+ M +GNP +PY+LD+DTGSDLTW+QCDAPC SC K +PLY
Sbjct: 37 STAVFQLQGDVYPTGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLY 96
Query: 246 KPRMGNILPYKDSLCMEIQRNHKP-GYCETCQQCDYEIEYADHSSSMGVLARDELHLTIE 304
+P ++P ++LC + C + +QCDY+I+Y D +SS GVL D L +
Sbjct: 97 RPTANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMR 156
Query: 305 NGSLTKPNVVFGCAYDQQ-GLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
+ ++ +P + FGC YDQQ G DG+LGL R VSL SQL QGI KNVVGHCL
Sbjct: 157 SSNI-RPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL 215
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG 423
+TN GGG++F G D+VPS + WVPM Y + + +LG + +V
Sbjct: 216 STN--GGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRR--SLGVKPMEV- 270
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
+FD+GS+YTYFT Q Y ++++LK S L SDPTLP+CW+ + +S+ DVK
Sbjct: 271 --VFDSGSTYTYFTAQPYQAVVSALKGGLSKSL-KQVSDPTLPLCWKGQKAFKSVFDVKN 327
Query: 484 FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
FK++ L F S + I PE YL+++K GN+CLGILDG+ +I GDI+++
Sbjct: 328 EFKSMFLSFASA---KNAAMEIPPENYLIVTKNGNVCLGILDGTAAKLSFNVI-GDITMQ 383
Query: 544 GQLVVYDNVNKRIGWAKSHCMNPGR 568
Q+V+YDN ++GWA+ C +
Sbjct: 384 DQMVIYDNEKSQLGWARGACTRSAK 408
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 324 bits (830), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 166/385 (43%), Positives = 237/385 (61%), Gaps = 15/385 (3%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
S+++F L+G++YP G Y+ M +GNP +PY+LD+DTGSDLTW+QCDAPC SC K +PLY
Sbjct: 37 STAVFQLQGDVYPTGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLY 96
Query: 246 KPRMGNILPYKDSLCMEIQRNHKP-GYCETCQQCDYEIEYADHSSSMGVLARDELHLTIE 304
+P ++P ++LC + C + +QCDY+I+Y D +SS GVL D L +
Sbjct: 97 RPTANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMR 156
Query: 305 NGSLTKPNVVFGCAYDQQ-GLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
+ ++ +P + FGC YDQQ G DG+LGL R VSL SQL QGI KNVVGHCL
Sbjct: 157 SSNI-RPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL 215
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG 423
+TN GGG++F G D+VPS + WVPM Y + + +LG + +V
Sbjct: 216 STN--GGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRR--SLGVKPMEV- 270
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
+FD+GS+YTYFT Q Y ++++LK S L SDPTLP+CW+ + +S+ DVK
Sbjct: 271 --VFDSGSTYTYFTAQPYQAVVSALKGGLSKSL-KQVSDPTLPLCWKGQKAFKSVFDVKN 327
Query: 484 FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
FK++ L F S + I PE YL+++K GN+CLGILDG+ +I GDI+++
Sbjct: 328 EFKSMFLSFSSA---KNAAMEIPPENYLIVTKNGNVCLGILDGTAAKLSFNVI-GDITMQ 383
Query: 544 GQLVVYDNVNKRIGWAKSHCMNPGR 568
Q+V+YDN ++GWA+ C +
Sbjct: 384 DQMVIYDNEKSQLGWARGACTRSAK 408
>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 166/375 (44%), Positives = 229/375 (61%), Gaps = 12/375 (3%)
Query: 190 FPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRM 249
F ++GN+YP G Y + +GNPP+ Y LD+DTGSDLTW+QCDAPC C N LYKP
Sbjct: 52 FQIKGNVYPLGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCQGCTIPRNRLYKPN- 110
Query: 250 GNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLT 309
GN++ D LC IQ +QCDYE+EYAD SS+GVL RD + L NGSL
Sbjct: 111 GNLVKCGDPLCKAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNGSLA 170
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG 369
+P + FGC YDQ+ + N T G+LGL K S+ SQL S G+I+NVVGHCL+ G
Sbjct: 171 RPILAFGCGYDQKHVGHNPSASTAGVLGLGNGKTSILSQLHSLGLIRNVVGHCLSER--G 228
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDT 429
GG++F G LVP G+ W P+L S + Y T + + P ++ +FD+
Sbjct: 229 GGFLFFGDQLVPQSGVVWTPLLQSSSTQHYKTGPADLFFDRKPTSVKGLQ-----LIFDS 283
Query: 430 GSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
GSSYTYF +A+ L+ + ++ L D +LP+CWR P +S+ DV FK L
Sbjct: 284 GSSYTYFNSKAHKALVNLVTNDLRGKPLSRATEDSSLPICWRGPKPFKSLHDVTSNFKPL 343
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
L F + ++ + PE YL+++K GN+CLGILDG+E+ G+T I+GDISL+ +LV+
Sbjct: 344 LLSF---TKSKNSLLQLPPEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDISLQDKLVI 400
Query: 549 YDNVNKRIGWAKSHC 563
YDN ++IGWA ++C
Sbjct: 401 YDNEKQQIGWASANC 415
>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
Length = 426
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 180/414 (43%), Positives = 250/414 (60%), Gaps = 27/414 (6%)
Query: 168 HKSKINKKLVSSNAVA------VDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDT 221
H+ K K + A + + SS +FPL GN+YP G Y+ + +G PP+PY+LD DT
Sbjct: 27 HQHKRKKAVFPEPAASSSLINIIQSSVVFPLYGNVYPLGYYYVSLSIGQPPKPYFLDPDT 86
Query: 222 GSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGY-CETCQQCDY 280
GSDL+W+QCDAPC C K +PLY+P N++ KD +C + H PGY CE +QCDY
Sbjct: 87 GSDLSWLQCDAPCVRCTKAPHPLYRPN-NNLVICKDPMCASL---HPPGYKCEHPEQCDY 142
Query: 281 EIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSR 340
E+EYAD SS+GVL +D L NG P + GC YDQ + + DG+LGL +
Sbjct: 143 EVEYADGGSSLGVLVKDVFPLNFTNGLRLAPRLALGCGYDQ--IPGQSYHPLDGVLGLGK 200
Query: 341 AKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYH 400
K S+ SQL SQG+I+NVVGHC+++ GGG++F G DL S + W PML H
Sbjct: 201 GKSSIVSQLHSQGVIRNVVGHCVSSR--GGGFLFFGDDLYDSSRVVWTPMLRD-----QH 253
Query: 401 TEILKINYGSSPLNLGARNSQVGWAL--FDTGSSYTYFTKQAYSELIASLKEVSSDGLVL 458
T + G + L LG + + L FD+GSSYTY AY L+ +++ S+ V
Sbjct: 254 TH---YSSGYAELILGGKTTVFKNLLVTFDSGSSYTYLNSLAYQALVHLVRKELSEKPVR 310
Query: 459 DA-SDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKG 517
+A D TLP+CWR K P +S+ DVK+FFK L L F + T++ I E YL+IS KG
Sbjct: 311 EALDDQTLPLCWRGKRPFKSVRDVKKFFKPLALSFPGGGR-TKTQYDIPLESYLIISLKG 369
Query: 518 NICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKS 571
N+CLGIL+G+E ++GDIS++ ++VVYDN +IGWA ++C +FK+
Sbjct: 370 NVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNEKNQIGWAPTNCDRLPKFKA 423
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 323 bits (828), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 168/379 (44%), Positives = 235/379 (62%), Gaps = 20/379 (5%)
Query: 189 IFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR 248
+F L G++YP G Y+ M +G+P +PY+LD+DTGSDLTW+QCDAPC SC K +PLY+P
Sbjct: 44 VFLLSGDVYPTGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPT 103
Query: 249 MGNILPYKDSLCMEIQRNHKPG-YCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS 307
++P +S+C + P C T QQCDY+I+Y D +SS+GVL D L + N S
Sbjct: 104 KNKLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFSLPLRNKS 163
Query: 308 LTKPNVVFGCAYDQQ-GLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
+P++ FGC YDQQ G TDG+LGL R VSL SQL QGI KNV+GHCL+T+
Sbjct: 164 NVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTS 223
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR--NSQVGW 424
GGG++F G D+VP+ + WV M+ S Y GS+ L R +++
Sbjct: 224 --GGGFLFFGDDMVPTSRVTWVSMVRSTSGNYYSP-------GSATLYFDRRSLSTKPME 274
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+FD+GS+YTYF+ Q Y I+++K S L SDP+LP+CW+ + +S+ DVK+
Sbjct: 275 VVFDSGSTYTYFSAQPYQATISAIKGSLSKSLK-QVSDPSLPLCWKGQKAFKSVSDVKKD 333
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
FK+L FG + I PE YL+I+K GN+CLGILDGS +II GDI+++
Sbjct: 334 FKSLQFIFGK-----NAVMDIPPENYLIITKNGNVCLGILDGSAAKLSFSII-GDITMQD 387
Query: 545 QLVVYDNVNKRIGWAKSHC 563
Q+V+YDN ++GW + C
Sbjct: 388 QMVIYDNEKAQLGWIRGSC 406
>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 440
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 175/389 (44%), Positives = 243/389 (62%), Gaps = 21/389 (5%)
Query: 180 NAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK 239
N SS +FP+ GN+YP G Y + +G PPRPY+LD+DTGSDLTW+QCDAPCS C++
Sbjct: 57 NRFRAGSSVVFPVHGNVYPVGFYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQ 116
Query: 240 GANPLYKPRMGNILPYKDSLC--MEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARD 297
+PLY+P +++P + +LC + + N+ CE QCDYE++YADH SS+GVL D
Sbjct: 117 TPHPLYRPS-NDLVPCRHALCASLHLSDNYD---CEVPHQCDYEVQYADHYSSLGVLLHD 172
Query: 298 ELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKN 357
L NG K + GC YDQ + DG+LGL R K SL SQL SQG+++N
Sbjct: 173 VYTLNFTNGVQLKVRMALGCGYDQI-FPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRN 231
Query: 358 VVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA 417
V+GHCL+ A GGGY+F G D+ S+ + W PM + H + G++ L G
Sbjct: 232 VIGHCLS--AQGGGYIFFG-DVYDSFRLTWTPMSSRDYK---HYSVA----GAAELLFGG 281
Query: 418 RNSQVG--WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDA-SDPTLPVCWRAKFP 474
+ S VG A+FDTGSSYTYF AY LI+ LK+ S + +A D TLP+CWR + P
Sbjct: 282 KKSGVGNLHAVFDTGSSYTYFNSYAYQVLISWLKKESGGKPLKEAHDDQTLPLCWRGRRP 341
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGST 534
RSI +V+++FK + L F S + +F + PE YL++S GN+CLGIL+GSEV G
Sbjct: 342 FRSIYEVRKYFKPIVLSFTSNGR-SKAQFEMLPEAYLIVSNMGNVCLGILNGSEVGMGDL 400
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHC 563
++GDIS+ +++V+DN + IGWA + C
Sbjct: 401 NLIGDISMLNKVMVFDNDKQLIGWAPADC 429
>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
Length = 467
Score = 322 bits (824), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 165/379 (43%), Positives = 233/379 (61%), Gaps = 11/379 (2%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
SS +FP+ GN+YP G Y+ + +GNPP+ + LD+DTGSDLTW+QCDAPC+ C K Y
Sbjct: 52 SSVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQY 111
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN 305
KP N LP LC + + QCDYEI Y+DH+SS+G L DE L + N
Sbjct: 112 KPNH-NTLPCSHLLCSGLDLTQNRPCDDPEDQCDYEIGYSDHASSIGALVTDEFPLKLAN 170
Query: 306 GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
GS+ P++ FGC YDQQ + T GILGL R KV + +QL S GI KNV+ HCL+
Sbjct: 171 GSIMNPHLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGISTQLKSLGITKNVIVHCLSH 230
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
G G++ +G +LVPS G+ W + + + Y T ++ + + N
Sbjct: 231 T--GKGFLSIGDELVPSSGVTWTSLATNSASKNYMTGPAELLFNDKTTGVKGIN-----V 283
Query: 426 LFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+FD+GSSYTYF +AY ++ + K+++ L D +LPVCW+ K P++S+ +VK++
Sbjct: 284 VFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKY 343
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
FKT+TL FG +Q F + PE YL+I++KGN+CLGIL+G+EV S I+GDIS +G
Sbjct: 344 FKTITLRFG--YQKNGQLFQVPPESYLIITEKGNVCLGILNGTEVGLDSYNIVGDISFQG 401
Query: 545 QLVVYDNVNKRIGWAKSHC 563
+V+YDN +RIGW S C
Sbjct: 402 IMVIYDNEKQRIGWISSDC 420
>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 438
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 175/388 (45%), Positives = 235/388 (60%), Gaps = 19/388 (4%)
Query: 180 NAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK 239
N SS +FP+ GN+YP G Y + +G PPRPY+LD+DTGSDLTW+QCDAPCS C++
Sbjct: 55 NRFRAGSSVVFPVHGNVYPVGFYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQ 114
Query: 240 GANPLYKPRMGNILPYKDSLCMEIQRNHKPGY-CETCQQCDYEIEYADHSSSMGVLARDE 298
+PLY+P + +P + SLC + +H Y CE QCDYE++YADH SS+GVL D
Sbjct: 115 TPHPLYRPS-NDFVPCRHSLCASL--HHSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDV 171
Query: 299 LHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNV 358
L NG K + GC YDQ + DG+LGL R K SL SQL SQG+++NV
Sbjct: 172 YTLNFTNGVQLKVRMALGCGYDQI-FPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNV 230
Query: 359 VGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR 418
+GHCL+ A GGGY+F G D+ S + W PM + G++ L G +
Sbjct: 231 IGHCLS--AQGGGYIFFG-DVYDSSRLTWTPMSSRDYKHYSAA-------GAAELLFGGK 280
Query: 419 NSQVG--WALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPI 475
S +G A+FDTGSSYTYF AY LI+ L KE L D TLP+CWR + P
Sbjct: 281 KSGIGSLHAVFDTGSSYTYFNPYAYQALISWLGKESGGKPLKEAHDDQTLPLCWRGRRPF 340
Query: 476 RSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI 535
RSI +V+++FK + L F S + +F + PE YL+IS GN+CLGIL+GSEV G
Sbjct: 341 RSIYEVRKYFKPIVLSFTSNGR-SKAQFEMPPEAYLIISNMGNVCLGILNGSEVGMGDLN 399
Query: 536 ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
++GDIS+ +++V+DN + IGW + C
Sbjct: 400 LIGDISMLNKVMVFDNDKQLIGWTPADC 427
>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 418
Score = 319 bits (818), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 169/378 (44%), Positives = 240/378 (63%), Gaps = 17/378 (4%)
Query: 189 IFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR 248
+ PL+GN+YP+G Y + VG PP+PY+LD DTGSDLTW+QCDAPC C + +PLY+P
Sbjct: 44 VLPLQGNVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPS 103
Query: 249 MGNILPYKDSLCMEIQR--NHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENG 306
+++P KD LCM + +H+ CE QCDYE+EYAD SS+GVL RD L + NG
Sbjct: 104 -NDLVPCKDPLCMSLHSSMDHR---CENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNG 159
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
+P + GC YDQ ++ DGILGL R VS+ SQL +QGI++NVVGHC N
Sbjct: 160 DPIRPRLALGCGYDQDP-GSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCF--N 216
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWAL 426
+ GGGY+F G + + + W PM + + Y ++ + + G RN V +
Sbjct: 217 SKGGGYLFFGDGIYDPYRLVWTPM-SRDYPKHYSPGFGELIFNGR--STGLRNLFV---V 270
Query: 427 FDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
FD+GSSYTYF QAY L + L +E++ L D TLP+CWR + PI+S+ DV+++F
Sbjct: 271 FDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYF 330
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQ 545
K L L F S + + F I EGY++IS GN+CLGIL+G++V ++ I+GDIS++ +
Sbjct: 331 KPLALSFSSGGRSKAV-FEIPTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDK 389
Query: 546 LVVYDNVNKRIGWAKSHC 563
+VVY+N + IGWA ++C
Sbjct: 390 MVVYNNEKQAIGWATANC 407
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 319 bits (818), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 169/382 (44%), Positives = 240/382 (62%), Gaps = 17/382 (4%)
Query: 184 VDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP 243
V SS F + GN+YP G Y + +GNPP+ + LD+DTGSDLTW+QCDAPC C K +
Sbjct: 50 VGSSVFFRVTGNVYPTGHYSVILNIGNPPKAFDLDIDTGSDLTWVQCDAPCKGCTKPLDK 109
Query: 244 LYKPRMGNILPYKDSLCMEIQRNHKPGYCET-CQQCDYEIEYADHSSSMGVLARDELHLT 302
LYKP+ N +P SLC IQ N+ C+ +QCDYE+EYAD SS+GVL D L
Sbjct: 110 LYKPK-NNRVPCASSLCQAIQNNN----CDIPTEQCDYEVEYADLGSSLGVLLSDYFPLR 164
Query: 303 IENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC 362
+ NGSL +P + FGC YDQ+ L ++ T GILGL R K S+ SQL + GI +NVVGHC
Sbjct: 165 LNNGSLLQPRIAFGCGYDQKYLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHC 224
Query: 363 LTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV 422
+ GG++F G L+P G+ W PML S LY + ++ +G P G + Q+
Sbjct: 225 FSRVT--GGFLFFGDHLLPPSGITWTPMLRSSSDTLYSSGPAELLFGGKP--TGIKGLQL 280
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDA-SDPTLPVCWRAKFPIRSIVDV 481
+FD+GSSYTYF Q Y ++ +++ S + DA + L VCW+ PI+SI+D+
Sbjct: 281 ---IFDSGSSYTYFNAQVYQSILNLVRKDLSGMPLKDAPEEKALAVCWKTAKPIKSILDI 337
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
K FFK LT++F + + + ++PE YL+I+K GN+CLGIL+G E G+ ++GDI
Sbjct: 338 KSFFKPLTINF---IKAKNVQLQLAPEDYLIITKDGNVCLGILNGGEQGLGNLNVIGDIF 394
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
++ ++VVYDN ++IGW ++C
Sbjct: 395 MQDRVVVYDNERQQIGWFPTNC 416
>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
Length = 395
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 167/362 (46%), Positives = 227/362 (62%), Gaps = 19/362 (5%)
Query: 177 VSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236
+S A A +SS++FPL G++YP GLY+ M +GNPPRPY+LD+DTGSDLTW+QCDAPC S
Sbjct: 33 LSVTAGAEESSAVFPLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVS 92
Query: 237 CAKGANPLYKPRMGNILPYKDSLCMEIQRN----HKPGYCET-CQQCDYEIEYADHSSSM 291
C+K +PLY+P ++P D +C + HK C++ QQCDYEI+YAD SS+
Sbjct: 93 CSKVPHPLYRPTKNKLVPCVDQMCAALHGGLTGRHK---CDSPKQQCDYEIKYADQGSSL 149
Query: 292 GVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLAS 351
GVL D L + N S+ +P + FGC YDQQ + TDG+LGL VSL SQL
Sbjct: 150 GVLVTDSFALRLANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQ 209
Query: 352 QGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSS 411
GI KNVVGHCL+T GGG++F G D+VP W PM S Y + +G
Sbjct: 210 HGITKNVVGHCLSTR--GGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGR 267
Query: 412 PLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRA 471
P LG R +V +FD+GSS+TYF+ Q Y L+ ++K S L + D +LP+CW+
Sbjct: 268 P--LGVRPMEV---VFDSGSSFTYFSAQPYQALVDAIKGDLSKNLK-EVPDHSLPLCWKG 321
Query: 472 KFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHN 531
K P +S++DVK+ F+T+ L F + + + I PE YL+++K GN CLGIL+GSE+
Sbjct: 322 KKPFKSVLDVKKEFRTVVLSFSNGKKAL---MEIPPENYLIVTKYGNACLGILNGSELPQ 378
Query: 532 GS 533
GS
Sbjct: 379 GS 380
>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 435
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 174/382 (45%), Positives = 235/382 (61%), Gaps = 13/382 (3%)
Query: 183 AVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN 242
A SS +FP+ GN+YP G Y + +G PPRPY+LD+DTGS+LTW+QCDAPCS C++ +
Sbjct: 55 AAGSSIVFPIYGNVYPVGFYNVTLNIGQPPRPYFLDVDTGSELTWLQCDAPCSQCSETPH 114
Query: 243 PLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLT 302
PLYKP + +P KD LC +Q CE QCDYEI+YAD S++GVL D L
Sbjct: 115 PLYKPS-NDFIPCKDPLCASLQPTDDY-TCEDPNQCDYEIKYADQYSTLGVLLNDVYLLN 172
Query: 303 IENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC 362
NG K + GC YDQ +T DGILGL R K SL SQL SQG+++NV+GHC
Sbjct: 173 FTNGVQLKVRMALGCGYDQI-FSPSTYHPLDGILGLGRGKASLISQLNSQGLVRNVMGHC 231
Query: 363 LTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV 422
L++ GGGY+F G ++ S M+W P+ + Y ++ +G +G+ N
Sbjct: 232 LSSR--GGGYIFFG-NVYDSSRMSWTPISSIDSGKHYSAGPAELVFGGRKTGVGSLN--- 285
Query: 423 GWALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
+FDTGSSYTYF QAY +I+ L KE+ + D TLP+CW K P RSI +V
Sbjct: 286 --IIFDTGSSYTYFNSQAYQAMISLLNKELHRKPIKAAPDDQTLPMCWHGKRPFRSINEV 343
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
K++FK LTL F + + V +F I PE YL+IS GN+CLGIL+G EV G ++GDIS
Sbjct: 344 KKYFKPLTLSFTNGGR-VKPQFEIPPEAYLIISNMGNVCLGILNGPEVGLGELNLIGDIS 402
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+ +++V+DN + IGW + C
Sbjct: 403 MLDKVMVFDNEKQLIGWGPADC 424
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 318 bits (814), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 166/379 (43%), Positives = 234/379 (61%), Gaps = 18/379 (4%)
Query: 188 SIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP 247
++F L G++YP G Y+ M +G+P +PY+LD+DTGSDLTW+QCDAPC SC K +PLYKP
Sbjct: 38 AVFQLNGDVYPTGHYYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSCNKVPHPLYKP 97
Query: 248 RMGNILPYKDSLCMEIQRNHKPG-YCETCQQCDYEIEYADHSSSMGVLARDELHLTIENG 306
++P S+C + P C QQCDY+I+Y D +SS+GVL D L + N
Sbjct: 98 TKNKLVPCAASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPLRNS 157
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVK--TDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
S +P+ FGC YDQQ + N +V+ TDG+LGL + VSL SQL GI KNV+GHCL+
Sbjct: 158 SSVRPSFTFGCGYDQQ-VGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCLS 216
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGW 424
TN GGG++F G ++VP+ WVPM+ S Y + + +LG + +V
Sbjct: 217 TN--GGGFLFFGDNVVPTSRATWVPMVRSTSGNYYSPGSGTLYFDRR--SLGVKPMEV-- 270
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+FD+GS+YTYF Q Y +++LK S L SDP+LP+CW+ + +S+ DVK
Sbjct: 271 -VFDSGSTYTYFAAQPYQATVSALKAGLSKSL-QQVSDPSLPLCWKGQKVFKSVSDVKND 328
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
FK+L L F + ++ I PE YL+++K GN CLGILDGS II GDI+++
Sbjct: 329 FKSLFLSF-----VKNSVLEIPPENYLIVTKNGNACLGILDGSAAKLTFNII-GDITMQD 382
Query: 545 QLVVYDNVNKRIGWAKSHC 563
QL++YDN ++GW + C
Sbjct: 383 QLIIYDNERGQLGWIRGSC 401
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 317 bits (813), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 172/390 (44%), Positives = 236/390 (60%), Gaps = 18/390 (4%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
SS +FP+ GN+YP G Y + +G PPRPYYLD+DTGSDLTW+QCDAPC C + +PLY
Sbjct: 44 SSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLY 103
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN 305
+P +++P D LC + N CET +QCDYE+EYAD SS+GVL RD +
Sbjct: 104 QPS-SDLIPCNDPLCKALHLNSNQ-RCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTK 161
Query: 306 GSLTKPNVVFGCAYDQ-QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
G P + GC YDQ G ++ DG+LGL R KVS+ SQL SQG +KNV+GHCL+
Sbjct: 162 GLRLTPRLALGCGYDQIPG--ASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLS 219
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG- 423
+ GGG +F G DL S ++W PM + + Y + L G R + +
Sbjct: 220 SL--GGGILFFGDDLYDSSRVSWTPM-SREYSKHYSPAM------GGELLFGGRTTGLKN 270
Query: 424 -WALFDTGSSYTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
+FD+GSSYTYF +AY + LK E+S L D TLP+CW+ + P SI +V
Sbjct: 271 LLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEV 330
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
K++FK L L F + W+ T F I PE YL+IS KGN+CLGIL+G+E+ + ++GDIS
Sbjct: 331 KKYFKPLALSFKTGWR-SKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDIS 389
Query: 542 LRGQLVVYDNVNKRIGWAKSHCMNPGRFKS 571
++ Q+++YDN + IGW + C K+
Sbjct: 390 MQDQMIIYDNEKQSIGWMPADCDELASLKA 419
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 317 bits (813), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 177/398 (44%), Positives = 237/398 (59%), Gaps = 34/398 (8%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
SS +FP+ GN+YP G Y + +G PPRPYYLD+DTGSDLTW+QCDAPC C + +PLY
Sbjct: 41 SSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVHCLEAPHPLY 100
Query: 246 KPRMGNILPYKDSLC--MEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTI 303
+P +++P D LC + NH+ CET +QCDYE+EYAD SS+GVL RD L
Sbjct: 101 QPS-NDLIPCNDPLCKALHFNGNHR---CETPEQCDYEVEYADGGSSLGVLVRDVFSLNY 156
Query: 304 ENGSLTKPNVVFGCAYDQ-QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC 362
G P + GC YDQ G + DG+LGL R KVS+ SQL SQG +KNVVGHC
Sbjct: 157 TKGLRLTPRLALGCGYDQIPG--ASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVVGHC 214
Query: 363 LTTNAGGGGYMFLGHDLVPSWGMAWVPMLD------SPFMELYHTEILKINYGSSPLNLG 416
L++ GGG +F G+DL S ++W PM SP M L G
Sbjct: 215 LSSL--GGGILFFGNDLYDSSRVSWTPMARENSKHYSPAM-------------GGELLFG 259
Query: 417 ARNSQVG--WALFDTGSSYTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKF 473
R + + +FD+GSSYTYF +AY + LK E+S L D TLP+CW+ +
Sbjct: 260 GRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRR 319
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGS 533
P SI +VK++FK L L F + W+ T F I PE YL+IS KGN+CLGIL+G+E+ +
Sbjct: 320 PFMSIEEVKKYFKPLALSFKTGWR-SKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQN 378
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKS 571
++GDIS++ Q+++YDN + IGW + C K+
Sbjct: 379 LNLIGDISMQDQMIIYDNEKQSIGWIPADCDEIASLKA 416
>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
Length = 410
Score = 317 bits (811), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 158/379 (41%), Positives = 232/379 (61%), Gaps = 13/379 (3%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
SS + L GN++P G Y + +GNPP+ + D+DTGSD+TW+QCDAPC+ C Y
Sbjct: 38 SSVVLLLSGNVFPLGYYSVLLQIGNPPKAFEFDIDTGSDITWVQCDAPCTGCNLPPKLQY 97
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN 305
KP+ GN +P D +C+ + + P +QCDYE+ YAD SSMG L D+ + N
Sbjct: 98 KPK-GNTVPCSDPICLALHFPNNPQCPNPKEQCDYEVNYADQGSSMGALVIDQFPFKLLN 156
Query: 306 GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
GS +P + FGC YDQ + T G+LGL R K+ L +QL S G+ +NVVGHCL++
Sbjct: 157 GSAMQPRLAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSS 216
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
GGGY+F G L+PS G+AW P+L P Y T ++ + P L
Sbjct: 217 K--GGGYLFFGDTLIPSLGVAWTPLL--PPDNHYTTGPAELLFNGKPTGLKGLK-----L 267
Query: 426 LFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+FDTGSSYTYF + Y ++ + ++ L + D TLP+CW+ P +S+++VK F
Sbjct: 268 IFDTGSSYTYFNSKTYQTIVNLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLEVKNF 327
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
FKT+T++F + + +T+ I PE YL+ISK GN CLG+L+GSEV ++ ++GDIS++G
Sbjct: 328 FKTITINFTNARR--NTQLQIPPESYLIISKTGNACLGLLNGSEVGLQNSNVIGDISMQG 385
Query: 545 QLVVYDNVNKRIGWAKSHC 563
L++YDN +++GW S+C
Sbjct: 386 LLIIYDNEKQQLGWVSSNC 404
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 317 bits (811), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 173/395 (43%), Positives = 236/395 (59%), Gaps = 18/395 (4%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
SS +FP+ GN+YP G Y + +G PPRPYYLD+DTGSDLTW+QCDAPC C + +PLY
Sbjct: 44 SSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLY 103
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN 305
+P +++P D LC + N CET +QCDYE+EYAD SS+GVL RD +
Sbjct: 104 QPS-SDLIPCNDPLCKALHLNSNQ-RCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQ 161
Query: 306 GSLTKPNVVFGCAYDQ-QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
G P + GC YDQ G ++ DG+LGL R KVS+ SQL SQG +KNV+GHCL+
Sbjct: 162 GLRLTPRLALGCGYDQIPG--ASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLS 219
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG- 423
+ GGG +F G DL S ++W PM + + Y + L G R + +
Sbjct: 220 SL--GGGILFFGDDLYDSSRVSWTPM-SREYSKHYSPAM------GGELLFGGRTTGLKN 270
Query: 424 -WALFDTGSSYTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
+FD+GSSYTYF +AY + LK E+S L D TLP+CW+ + P SI +V
Sbjct: 271 LLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEV 330
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
K++FK L L F + W+ T F I PE YL+IS KGN+CLGIL+G+E+ + ++GDIS
Sbjct: 331 KKYFKPLALSFKTGWR-SKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDIS 389
Query: 542 LRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPFLE 576
++ Q+++YDN + IGW C K+ E
Sbjct: 390 MQDQMIIYDNEKQSIGWMPVDCDELASLKAAQVYE 424
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 317 bits (811), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 173/395 (43%), Positives = 236/395 (59%), Gaps = 18/395 (4%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
SS +FP+ GN+YP G Y + +G PPRPYYLD+DTGSDLTW+QCDAPC C + +PLY
Sbjct: 32 SSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLY 91
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN 305
+P +++P D LC + N CET +QCDYE+EYAD SS+GVL RD +
Sbjct: 92 QPS-SDLIPCNDPLCKALHLNSNQ-RCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQ 149
Query: 306 GSLTKPNVVFGCAYDQ-QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
G P + GC YDQ G ++ DG+LGL R KVS+ SQL SQG +KNV+GHCL+
Sbjct: 150 GLRLTPRLALGCGYDQIPG--ASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLS 207
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG- 423
+ GGG +F G DL S ++W PM + + Y + L G R + +
Sbjct: 208 SL--GGGILFFGDDLYDSSRVSWTPM-SREYSKHYSPAM------GGELLFGGRTTGLKN 258
Query: 424 -WALFDTGSSYTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
+FD+GSSYTYF +AY + LK E+S L D TLP+CW+ + P SI +V
Sbjct: 259 LLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEV 318
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
K++FK L L F + W+ T F I PE YL+IS KGN+CLGIL+G+E+ + ++GDIS
Sbjct: 319 KKYFKPLALSFKTGWR-SKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDIS 377
Query: 542 LRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPFLE 576
++ Q+++YDN + IGW C K+ E
Sbjct: 378 MQDQMIIYDNEKQSIGWMPVDCDELASLKAAQVYE 412
>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
Length = 410
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 163/380 (42%), Positives = 234/380 (61%), Gaps = 11/380 (2%)
Query: 184 VDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP 243
V SS F + GN+YP G Y + +GNPP+ + D+DTGSDLTW+QCDAPC C K +
Sbjct: 36 VGSSVFFRVTGNVYPTGYYSVILNIGNPPKAFDFDIDTGSDLTWVQCDAPCKGCTKPRDK 95
Query: 244 LYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTI 303
LYKP+ N++P +SLC + QCDYEIEYAD SS+GVL D L +
Sbjct: 96 LYKPK-NNLVPCSNSLCQAVSTGENYHCDAPDDQCDYEIEYADLGSSIGVLLSDSFPLRL 154
Query: 304 ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
NG+L +P + FGC YDQ+ L + T GILGL R KVS+ SQL + GI +NVVGHC
Sbjct: 155 SNGTLLQPKMAFGCGYDQKHLGPHPPPDTAGILGLGRGKVSILSQLRTLGITQNVVGHCF 214
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG 423
+ GG++F G L PS + W PML S LY + ++ +G P G + Q+
Sbjct: 215 SR--ARGGFLFFGDHLFPSSRITWTPMLRSSSDTLYSSGPAELLFGGKP--TGIKGLQL- 269
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
+FD+GSSYTYF Q Y ++ +++ + + DA + L VCW+ PI+SI+D+K
Sbjct: 270 --IFDSGSSYTYFNAQVYQSILNLVRKDLAGKPLKDAPEKELAVCWKTAKPIKSILDIKS 327
Query: 484 FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
+FK LT+ F + + + ++PE YL+I+K GN+CLGIL+GSE G+ ++GDI ++
Sbjct: 328 YFKPLTISFMNAKNV---QLQLAPEDYLIITKDGNVCLGILNGSEQQLGNFNVIGDIFMQ 384
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
++V+YDN ++IGW ++C
Sbjct: 385 DRVVIYDNEKQQIGWFPANC 404
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 162/379 (42%), Positives = 233/379 (61%), Gaps = 18/379 (4%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
S+++F L+G +YP G Y+ M +G+P +PY+LD+DTGSDLTW+QCDAPC SC K +P Y
Sbjct: 57 STAVFQLQGAVYPIGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPWY 116
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN 305
KP I+P SLC + N K C QQCDY+I+Y D +SS+GVL D L++ N
Sbjct: 117 KPTKNKIVPCAASLCTSLTPNKK---CAVPQQCDYQIKYTDKASSLGVLIADNFTLSLRN 173
Query: 306 GSLTKPNVVFGCAYDQQ-GLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
S + N+ FGC YDQQ G TDG+LGL + VSL SQL QG+ KNV+GHC +
Sbjct: 174 SSTVRANLTFGCGYDQQVGKNGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCFS 233
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGW 424
TN GGG++F G D+VP+ + WVPM + Y + + +LG + +V
Sbjct: 234 TN--GGGFLFFGDDIVPTSRVTWVPMARTTSGNYYSPGSGTLYFDRR--SLGMKPMEV-- 287
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+FD+GS+Y YF + Y +++LK S L + SD +LP+CW+ + +S+ +VK
Sbjct: 288 -VFDSGSTYAYFAAEPYQATVSALKAGLSKSLK-EVSDVSLPLCWKGQKVFKSVSEVKND 345
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
FK+L L FG ++ I PE YL+++K GN+CLGILDG+ II GDI+++
Sbjct: 346 FKSLFLSFGK-----NSVMEIPPENYLIVTKYGNVCLGILDGTTAKLKFNII-GDITMQD 399
Query: 545 QLVVYDNVNKRIGWAKSHC 563
Q+++YDN ++GW + C
Sbjct: 400 QMIIYDNEKGQLGWIRGSC 418
>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 454
Score = 315 bits (807), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 172/402 (42%), Positives = 242/402 (60%), Gaps = 17/402 (4%)
Query: 166 RPHKSKINKKLVS-SNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSD 224
+P +K K S +N + SS++F L+GN+YP G Y + +G PP+ Y LD+D+GSD
Sbjct: 27 QPRNAKKPKTPYSDNNHHRLSSSAVFKLQGNVYPLGHYTVSLNIGYPPKLYDLDIDSGSD 86
Query: 225 LTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY 284
LTW+QCDAPC C K + LYKP N++ D LC E+ + CDYE+EY
Sbjct: 87 LTWVQCDAPCKGCTKPRDQLYKPNH-NLVQCVDQLCSEVHLSMAYNCPSPDDPCDYEVEY 145
Query: 285 ADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVS 344
ADH SS+GVL RD + NGS+ +P V FGC YDQ+ N+ T G+LGL + S
Sbjct: 146 ADHGSSLGVLVRDYIPFQFTNGSVVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRAS 205
Query: 345 LPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEIL 404
+ SQL S G+I+NVVGHCL+ A GGG++F G D +PS G+ W ML S +
Sbjct: 206 ILSQLHSLGLIRNVVGHCLS--AQGGGFLFFGDDFIPSSGIVWTSMLSS-------SSEK 256
Query: 405 KINYGSSPLNLGARNSQV-GWAL-FDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDAS 461
+ G + L + + V G L FD+GSSYTYF QAY ++ + K++ L
Sbjct: 257 HYSSGPAELVFNGKATAVKGLELIFDSGSSYTYFNSQAYQAVVDLVTKDLKGKQLKRATD 316
Query: 462 DPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICL 521
DP+LP+CW+ S+ DVK++FK L L F + + H+ PE YL+I+K GN+CL
Sbjct: 317 DPSLPICWKGAKSFESLSDVKKYFKPLALSFKKSXNL---QMHLPPESYLIITKHGNVCL 373
Query: 522 GILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
GILDG+EV + I+GDI+L+ ++V+YDN ++IGW S+C
Sbjct: 374 GILDGTEVGLENLNIIGDITLQDKMVIYDNEKQQIGWVSSNC 415
>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 432
Score = 311 bits (798), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 164/385 (42%), Positives = 230/385 (59%), Gaps = 19/385 (4%)
Query: 184 VDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP 243
+ S+ +FP+ GN+YP G Y+ + +GNPP+ + LD+DTGSDLTW+QCDAPC+ C K
Sbjct: 49 LSSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAK 108
Query: 244 LYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTI 303
YKP N LP LC + + QCDYEI Y+DH+SS+G L DE+ L +
Sbjct: 109 QYKPNH-NTLPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKL 167
Query: 304 ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
NGS+ + FGC YDQQ + T GILGL R KV L +QL S GI KNV+ HCL
Sbjct: 168 ANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCL 227
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG 423
+ G G++ +G +LVPS G+ W + T NY + P L + G
Sbjct: 228 SHT--GKGFLSIGDELVPSSGVTWTSL---------ATNSPSKNYMAGPAELLFNDKTTG 276
Query: 424 W----ALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSI 478
+FD+GSSYTYF +AY ++ + K+++ L D +LPVCW+ K P++S+
Sbjct: 277 VKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSL 336
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILG 538
+VK++FKT+TL FG+ Q F + PE YL+I++KG +CLGIL+G+E+ I+G
Sbjct: 337 DEVKKYFKTITLRFGN--QKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEGYNIIG 394
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHC 563
DIS +G +V+YDN +RIGW S C
Sbjct: 395 DISFQGIMVIYDNEKQRIGWISSDC 419
>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 466
Score = 311 bits (796), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 164/385 (42%), Positives = 230/385 (59%), Gaps = 19/385 (4%)
Query: 184 VDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP 243
+ S+ +FP+ GN+YP G Y+ + +GNPP+ + LD+DTGSDLTW+QCDAPC+ C K
Sbjct: 49 LSSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAK 108
Query: 244 LYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTI 303
YKP N LP LC + + QCDYEI Y+DH+SS+G L DE+ L +
Sbjct: 109 QYKPNH-NTLPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKL 167
Query: 304 ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
NGS+ + FGC YDQQ + T GILGL R KV L +QL S GI KNV+ HCL
Sbjct: 168 ANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCL 227
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG 423
+ G G++ +G +LVPS G+ W + T NY + P L + G
Sbjct: 228 SHT--GKGFLSIGDELVPSSGVTWTSL---------ATNSPSKNYMAGPAELLFNDKTTG 276
Query: 424 W----ALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSI 478
+FD+GSSYTYF +AY ++ + K+++ L D +LPVCW+ K P++S+
Sbjct: 277 VKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSL 336
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILG 538
+VK++FKT+TL FG+ Q F + PE YL+I++KG +CLGIL+G+E+ I+G
Sbjct: 337 DEVKKYFKTITLRFGN--QKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEGYNIIG 394
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHC 563
DIS +G +V+YDN +RIGW S C
Sbjct: 395 DISFQGIMVIYDNEKQRIGWISSDC 419
>gi|147802609|emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
Length = 424
Score = 310 bits (794), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 174/393 (44%), Positives = 241/393 (61%), Gaps = 25/393 (6%)
Query: 184 VDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP 243
+ SS +FPL GN+YP G Y+ + +G PP PY+LD TGSDL+W+QCDAPC C K +
Sbjct: 49 IQSSVVFPLYGNVYPLGYYYVSLSIGQPPXPYFLDPXTGSDLSWLQCDAPCVRCTKAXHX 108
Query: 244 LYKPRMGNILPYKDSLCMEIQRNHKPGY-CETCQQCDYEIEYADHSSSMGVLARDELHLT 302
LY+P N++ KD +C + H PGY CE +QCDYE+EYAD SS+GVL +D L
Sbjct: 109 LYRPN-NNLVICKDPMCAXL---HPPGYKCEHPEQCDYEVEYADGGSSLGVLVKDVFPLN 164
Query: 303 IENGSLTKPNVVFGCAYDQ-QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
NG P + GC YDQ G + L DG+LGL + K S+ SQL SQG+I+NVVGH
Sbjct: 165 FTNGLRLAPRLALGCGYDQIPGXSYHPL---DGVLGLGKGKSSIVSQLHSQGVIRNVVGH 221
Query: 362 CLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQ 421
C++++ GGG++F G DL S + W PML HT + G + L LG + +
Sbjct: 222 CVSSH--GGGFLFFGDDLYDSSRVVWTPMLRDQ-----HTH---YSSGYAELILGGKTTV 271
Query: 422 VGWAL--FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDA-SDPTLPVCWRAKFPIRSI 478
L FD+GSSYTY AY L+ +++ S+ V +A D TLP+CWR K P +S+
Sbjct: 272 FKNLLVTFDSGSSYTYLNSLAYQALVHLVRKELSEKPVREALDDQTLPLCWRGKRPFKSV 331
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILG 538
DV++FFK L L F + T++ I E YL+IS GN+CLGIL+G+E ++G
Sbjct: 332 RDVRKFFKPLALSFAGGGR-TKTQYDIPLESYLIIS--GNVCLGILNGTEAGLQDFNLIG 388
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKS 571
DIS++ ++VVYDN +IGWA ++C +FK+
Sbjct: 389 DISMQDKMVVYDNEKNQIGWAPTNCDRLPKFKA 421
>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 440
Score = 310 bits (794), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 172/389 (44%), Positives = 234/389 (60%), Gaps = 27/389 (6%)
Query: 180 NAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK 239
N SS +FP+ GN+YP G Y + +G PPRPY+LD+DTGSDLTW+QCDAPCS C++
Sbjct: 63 NRFRSGSSVVFPVHGNVYPVGFYNVTINIGYPPRPYFLDIDTGSDLTWLQCDAPCSRCSQ 122
Query: 240 GANPLYKPRMGNILPYKDSLCMEIQR--NHKPGYCETCQQCDYEIEYADHSSSMGVLARD 297
+PLY+P +++P + LC + + N++ CE QCDYE+EYADH SS+GVL D
Sbjct: 123 TPHPLYRPS-NDLVPCRHPLCASVHQTDNYE---CEVEHQCDYEVEYADHYSSLGVLVND 178
Query: 298 ELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKN 357
L NG K + GC YDQ ++ DG+LGL R K SL SQL QG+++N
Sbjct: 179 VYVLNFTNGVQLKVRMALGCGYDQI-FPDSSYHPVDGMLGLGRGKSSLISQLNGQGLVRN 237
Query: 358 VVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA 417
VVGHCL+ A GGGY+F G D+ S +AW PM + + G++ L LG
Sbjct: 238 VVGHCLS--AQGGGYIFFG-DVYDSSRLAWTPMSSRDYKHY--------SAGAAELVLGG 286
Query: 418 RNSQVG--WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPI 475
+ + G A+FD GSSYTYF AY KE++ + D TLP+CW K P
Sbjct: 287 KRTGFGNLLAVFDAGSSYTYFNSNAYQ----LTKELAGKPIKEAPEDQTLPLCWYGKRPF 342
Query: 476 RSIVDVKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGST 534
RS+ +VK++FK + L F GS+ +F I PE YL+IS GN+CLGILDGSEV
Sbjct: 343 RSVYEVKKYFKPIALSFPGSRRS--KAQFEIPPEAYLIISNMGNVCLGILDGSEVGVEDL 400
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHC 563
++GDIS+ +++V+DN + IGW + C
Sbjct: 401 NLIGDISMLDKVMVFDNEKQLIGWTAADC 429
>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
Length = 426
Score = 309 bits (791), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 168/382 (43%), Positives = 233/382 (60%), Gaps = 23/382 (6%)
Query: 190 FPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRM 249
PL GN+YP G Y +G PP+PY+LD DTGSDLTW+QCDAPC C +PLY+P
Sbjct: 55 LPLYGNVYPSGYYHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHPLYQP-T 113
Query: 250 GNILPYKDSLCMEIQRNHKPGY-CETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSL 308
+++ KD +C + H Y C+ QCDYE+EYAD SS+GVL D + + +G
Sbjct: 114 NDLVVCKDPICASL---HPDNYRCDDPDQCDYEVEYADGGSSIGVLVNDLFPVNLTSGMR 170
Query: 309 TKPNVVFGCAYDQ-QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
+P + GC YDQ G+ + L DG+LGL R S+ +QL+SQG+++NVVGHC +
Sbjct: 171 ARPRLTIGCGYDQLPGIAYHPL---DGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSRR- 226
Query: 368 GGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG--WA 425
GGGY+F G D+ S + W PM +++ Y G + L L R+S +
Sbjct: 227 -GGGYLFFGDDIYDSSKVIWTPM-SRDYLKHYTP-------GFAELILNGRSSGLKNLLV 277
Query: 426 LFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+FD+GSSYTYF Q Y L++ + K++ L D TLPVCWR K P +SI D K++
Sbjct: 278 VFDSGSSYTYFNTQTYQTLLSFIKKDLHGKPLKEAVEDDTLPVCWRGKKPFKSIRDAKKY 337
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
FK L L FGS W+ ++F I E YL+IS KG++CLGIL+G+EV + I+GDIS++
Sbjct: 338 FKPLALSFGSGWK-TKSQFEIQQESYLIISSKGSVCLGILNGTEVGLQNYNIIGDISMQE 396
Query: 545 QLVVYDNVNKRIGWAKSHCMNP 566
+LV+YDN + IGW S+C P
Sbjct: 397 KLVIYDNEKQVIGWQPSNCDRP 418
>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
Length = 427
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 164/385 (42%), Positives = 230/385 (59%), Gaps = 24/385 (6%)
Query: 184 VDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP 243
+ S+ +FP+ GN+YP G Y+ + +GNPP+ + LD+DTGSDLTW+QCDAPC+ C K
Sbjct: 49 LSSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTK---- 104
Query: 244 LYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTI 303
YKP N LP LC + + QCDYEI Y+DH+SS+G L DE+ L +
Sbjct: 105 -YKPNH-NTLPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKL 162
Query: 304 ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
NGS+ + FGC YDQQ + T GILGL R KV L +QL S GI KNV+ HCL
Sbjct: 163 ANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCL 222
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG 423
+ G G++ +G +LVPS G+ W + T NY + P L + G
Sbjct: 223 SHT--GKGFLSIGDELVPSSGVTWTSL---------ATNSPSKNYMAGPAELLFNDKTTG 271
Query: 424 W----ALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSI 478
+FD+GSSYTYF +AY ++ + K+++ L D +LPVCW+ K P++S+
Sbjct: 272 VKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSL 331
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILG 538
+VK++FKT+TL FG+ Q F + PE YL+I++KG +CLGIL+G+E+ I+G
Sbjct: 332 DEVKKYFKTITLRFGN--QKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEGYNIIG 389
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHC 563
DIS +G +V+YDN +RIGW S C
Sbjct: 390 DISFQGIMVIYDNEKQRIGWISSDC 414
>gi|356509401|ref|XP_003523438.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 407
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 173/394 (43%), Positives = 239/394 (60%), Gaps = 19/394 (4%)
Query: 176 LVSSNAVAVDSSSI-FPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPC 234
L+S+ +V +SSI F ++GN+YP G Y + +GNPP+ Y LD+DTGSDLTW+QCDAPC
Sbjct: 21 LLSAISVLSHASSIAFQIKGNVYPLGYYSVNLAIGNPPKAYELDIDTGSDLTWVQCDAPC 80
Query: 235 SSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVL 294
C + YKP GN++ D LC IQ P +QCDYE+EYAD SS+GVL
Sbjct: 81 KGCTLPRDRQYKPH-GNLVKCVDPLCAAIQSAPNPPCVNPNEQCDYEVEYADQGSSLGVL 139
Query: 295 ARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI 354
RD + L + NG+LT + FGC YDQ + N G+LGL + S+ SQL S+G+
Sbjct: 140 VRDIIPLKLTNGTLTHSMLAFGCGYDQTHVGHNPPPSAAGVLGLGNGRASILSQLNSKGL 199
Query: 355 IKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN 414
I+NVVGHCL+ GG++F G L+P G+ W P+L S L H Y + P +
Sbjct: 200 IRNVVGHCLSGTG--GGFLFFGDQLIPQSGVVWTPILQSSSSLLKH-------YKTGPAD 250
Query: 415 L---GARNSQVGWAL-FDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCW 469
+ G S G L FD+GSSYTYF A+ L+ + ++ L DP+LP+CW
Sbjct: 251 MFFNGKATSVKGLELTFDSGSSYTYFNSLAHKALVDLITNDIKGKPLSRATEDPSLPICW 310
Query: 470 RAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEV 529
+ P +S+ DV FK L L F ++ F + PE YL+++K GN+CLGILDG+E+
Sbjct: 311 KGPKPFKSLHDVTSNFKPLVLSFTKS---KNSLFQVPPEAYLIVTKHGNVCLGILDGTEI 367
Query: 530 HNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G+T I+GDISL+ +LV+YDN +RIGWA ++C
Sbjct: 368 GLGNTNIIGDISLQDKLVIYDNEKQRIGWASANC 401
>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
Length = 420
Score = 307 bits (787), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 177/422 (41%), Positives = 242/422 (57%), Gaps = 36/422 (8%)
Query: 176 LVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS 235
+V S +AV SS +FP+ GN+YP G Y + +G PPRPYYLD+DTGSDLTW+QCDAPC
Sbjct: 13 MVMSLVLAV-SSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCV 71
Query: 236 SCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLA 295
C + +PLY+P +++P D LC + N CET +QCDYE+EYAD SS+GVL
Sbjct: 72 RCLEAPHPLYQPS-SDLIPCNDPLCKALHLNSNQ-RCETPEQCDYEVEYADGGSSLGVLV 129
Query: 296 RDELHLTIENGSLTKPNVVFGCAYDQ-QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI 354
RD + G P + GC YDQ G ++ DG+LGL R KVS+ SQL SQG
Sbjct: 130 RDVFSMNYTQGLRLTPRLALGCGYDQIPG--ASSHHPLDGVLGLGRGKVSILSQLHSQGY 187
Query: 355 IKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN 414
+KNV+GHCL++ GGG +F G DL S ++W PM + + Y + L
Sbjct: 188 VKNVIGHCLSSL--GGGILFFGDDLYDSSRVSWTPM-SREYSKHYSPAM------GGELL 238
Query: 415 LGARNSQVG--WALFDTGSSYTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRA 471
G R + + +FD+GSSYTYF +AY + LK E+S L D TLP+CW+
Sbjct: 239 FGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQG 298
Query: 472 KFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVIS----------------- 514
+ P SI +VK++FK L L F + W+ T F I PE YL+IS
Sbjct: 299 RRPFMSIEEVKKYFKPLALSFKTGWR-SKTLFEIPPEAYLIISVWFSHTMLKGRFIKMLQ 357
Query: 515 KKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPF 574
KGN+CLGIL+G+E+ + ++GDIS++ Q+++YDN + IGW C K+
Sbjct: 358 MKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPVDCDELASLKAAQV 417
Query: 575 LE 576
E
Sbjct: 418 YE 419
>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 425
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 163/385 (42%), Positives = 236/385 (61%), Gaps = 18/385 (4%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQC---DAPCSSCAKGAN 242
SS ++ ++GN+YPDGLY + +GNPP+PY LD+DTGSDLTW+QC DAPC C +
Sbjct: 46 SSLVYTIKGNVYPDGLYTVSINIGNPPKPYELDIDTGSDLTWVQCDGPDAPCKGCTMPKD 105
Query: 243 PLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELH 300
LYKP ++ D +C+ Q H G + Q C Y ++YADH+S++GVL RD +H
Sbjct: 106 KLYKPNGKQVVKCSDPICVATQSTHVLGQICSKQSPPCVYNVQYADHASTLGVLVRDYMH 165
Query: 301 LTIENGSLTKPNVVFGCAYDQQ-GLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVV 359
+ + S P V FGC Y+Q+ K GILGL K S+ SQL S G I NV+
Sbjct: 166 IGSPSSSTKDPLVAFGCGYEQKFSGPTPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVL 225
Query: 360 GHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARN 419
GHCL+ A GGGY+FLG VPS G+ W P++ S + Y+T + + + P A+
Sbjct: 226 GHCLS--AEGGGYLFLGDKFVPSSGIVWTPIIQSSLEKHYNTGPVDLFFNGKPTP--AKG 281
Query: 420 SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLD-ASDPTLPVCWRAKFPIRSI 478
Q+ +FD+GSSYTYF+ Y+ ++A++ G L DP+LP+CW+ P +S+
Sbjct: 282 LQI---IFDSGSSYTYFSSPVYT-IVANMVNNDLKGKPLSRVKDPSLPICWKGVKPFKSL 337
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILG 538
+V +FK LTL F + +F + P YL+I+K GN+CLGIL+G+E G+ ++G
Sbjct: 338 NEVNNYFKPLTLSFTKSKNL---QFQLPPVAYLIITKYGNVCLGILNGNEAGLGNRNVVG 394
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHC 563
DISL+ ++VVYDN ++IGWA ++C
Sbjct: 395 DISLQDKVVVYDNEKQQIGWASANC 419
>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 162/392 (41%), Positives = 235/392 (59%), Gaps = 19/392 (4%)
Query: 176 LVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS 235
L N+ + SS +FPL+GN+YP G Y + +G + D+D+GSDLTW+QCDAPC+
Sbjct: 29 LRKKNSDRLLSSVVFPLKGNVYPLGYYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCT 88
Query: 236 SCAKGANPLYKPRMGNILPYKDSLCMEIQ--RNHKPGYCETCQ-QCDYEIEYADHSSSMG 292
C K LYKP N L + LC + NH +C++ QC YEIEYADH SS+G
Sbjct: 89 HCTKPREQLYKPN-NNALNCFEPLCTSLHPITNH---HCKSADDQCQYEIEYADHGSSLG 144
Query: 293 VLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQ 352
VL D + L + NGSL P + FGC YD + + ++ T G+LGL +VS SQL+S
Sbjct: 145 VLVNDHVPLKLTNGSLAAPRIAFGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSM 204
Query: 353 GIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSP 412
G+++NVVGHCL+ GG++F G + VPS G+ W M Y + ++ +G
Sbjct: 205 GVVRNVVGHCLSDE---GGFLFFGDEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFGGKA 261
Query: 413 LNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDA-SDPTLPVCWRA 471
G ++ + +FD+GSSYTYF QAY+ ++A +K + DA D +LPVCW+
Sbjct: 262 --TGIKDLTL---VFDSGSSYTYFNSQAYNSILALVKNNLRGKPLEDAPEDKSLPVCWKG 316
Query: 472 KFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHN 531
P +S+ DVK++F L L F + + + + PE YL+I+K GN+C GIL+G+EV
Sbjct: 317 TRPFKSLRDVKKYFNLLALRFT---KTKNAQIQLPPENYLIITKYGNVCFGILNGTEVGL 373
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G I+GDISL+ ++V+YDN +RIGW ++C
Sbjct: 374 GDLNIIGDISLKDKMVIYDNERRRIGWFPTNC 405
>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 164/396 (41%), Positives = 232/396 (58%), Gaps = 27/396 (6%)
Query: 176 LVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS 235
L N+ + SS +FPL+GN+YP G Y + +G + D+D+GSDLTW+QCDAPC+
Sbjct: 29 LRKKNSDRLLSSVVFPLKGNVYPLGYYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCT 88
Query: 236 SCAKGANPLYKPRMGNILPYKDSLCMEIQ--RNHKPGYCETCQ-QCDYEIEYADHSSSMG 292
C K LYKP N L + LC + NH +C++ QC YEIEYADH SS+G
Sbjct: 89 HCTKPREQLYKPN-NNALNCFEPLCTSLHPITNH---HCKSADDQCQYEIEYADHGSSLG 144
Query: 293 VLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQ 352
VL D + L + NGSL P + FGC YD + + ++ T G+LGL +VS SQL+S
Sbjct: 145 VLVNDHVPLKLTNGSLAAPRIAFGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSM 204
Query: 353 GIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSP 412
G+++NVVGHCL+ GG++F G + VPS G+ W M E + Y S P
Sbjct: 205 GVVRNVVGHCLSDE---GGFLFFGDEFVPSSGVTWTSM---------SHESIGSYYSSGP 252
Query: 413 LNLGARNSQVGWA----LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDA-SDPTLPV 467
+ G +FD+GSSYTYF QAY+ ++A +K + DA D +LPV
Sbjct: 253 AEVYFSGKATGIKDLTLVFDSGSSYTYFNSQAYNSILALVKNNLRGKPLEDAPEDKSLPV 312
Query: 468 CWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS 527
CW+ P +S+ DVK++F L L F + + + + PE YL+I+K GN+C GIL+G+
Sbjct: 313 CWKGTRPFKSLRDVKKYFNPLALRFT---KTKNAQIQLPPENYLIITKYGNVCFGILNGT 369
Query: 528 EVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
EV G I+GDISL+ ++V+YDN +RIGW ++C
Sbjct: 370 EVGLGDLNIIGDISLKDKMVIYDNERRRIGWFPTNC 405
>gi|255563835|ref|XP_002522918.1| nucellin, putative [Ricinus communis]
gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis]
Length = 433
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 176/403 (43%), Positives = 243/403 (60%), Gaps = 19/403 (4%)
Query: 166 RPHKSKINKKLVSSNAV-AVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSD 224
R K+ ++ ++ SS + SS +FPL GN+YP G Y + +G P +PY+LD+DTGSD
Sbjct: 34 RWRKAVLSGEITSSMMINRAGSSLVFPLHGNVYPAGYYNVTLSIGQPAKPYFLDVDTGSD 93
Query: 225 LTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGY--CETCQQCDYEI 282
LTW+QCDAPC C + +PLY+P N++ +D LC +Q PG C+ QCDYE+
Sbjct: 94 LTWLQCDAPCRQCIEAPHPLYRPS-NNLVICEDPLCASLQ---PPGVHNCQDPDQCDYEV 149
Query: 283 EYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAK 342
EYAD SS+GVL +D L NG P + GC YDQ L + DGILGL R
Sbjct: 150 EYADGGSSLGVLVKDVFVLNFTNGKRLNPLLALGCGYDQ--LPGRSNHPLDGILGLGRGI 207
Query: 343 VSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTE 402
S+PSQL+SQG++ NV+GHCL+ GG++F G D+ S G+ W PM ++ Y
Sbjct: 208 SSIPSQLSSQGLVSNVIGHCLSGRG--GGFLFFGEDIYDSSGVTWTPM-SRDHLKHYSPG 264
Query: 403 ILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLK-EVSSDGLVLDAS 461
++ + + G RN V +FD+GSSYTY QAY L+ SLK E+S +
Sbjct: 265 FAELIFDGK--STGIRNLLV---VFDSGSSYTYLNAQAYQHLVFSLKRELSRKPISEALD 319
Query: 462 DPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFG-SKWQIVSTKFHISPEGYLVISKKGNIC 520
D TLP+CW+ K P +SI DVK++FK L F S + T+F SPE YL+IS KGN C
Sbjct: 320 DQTLPLCWKGKRPFKSIRDVKKYFKPFALVFKTSSGRSSKTQFEFSPEAYLIISSKGNAC 379
Query: 521 LGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
LGIL+G+EV ++GD+S+ +LV+Y+N + IGWA + C
Sbjct: 380 LGILNGTEVGLRDLNVIGDVSMLDRLVIYNNEKQMIGWAAASC 422
>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
Length = 357
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 157/363 (43%), Positives = 221/363 (60%), Gaps = 15/363 (4%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNH 267
+GNP +PY+LD+DTGSDLTW+QCDAPC SC K +PLY+P ++P ++LC +
Sbjct: 1 IGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPCANALCTALHSGQ 60
Query: 268 KP-GYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQ-GLL 325
C + +QCDY+I+Y D +SS GVL D L + + ++ +P + FGC YDQQ G
Sbjct: 61 GSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSNI-RPGLTFGCGYDQQVGKN 119
Query: 326 LNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGM 385
DG+LGL R VSL SQL QGI KNVVGHCL+TN GGG++F G D+VPS +
Sbjct: 120 GAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTN--GGGFLFFGDDVVPSSRV 177
Query: 386 AWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELI 445
WVPM Y + + +LG + +V +FD+GS+YTYFT Q Y ++
Sbjct: 178 TWVPMAQRTSGNYYSPGSGTLYFDRR--SLGVKPMEV---VFDSGSTYTYFTAQPYQAVV 232
Query: 446 ASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHI 505
++LK S L SDPTLP+CW+ + +S+ DVK FK++ L F S + I
Sbjct: 233 SALKGGLSKSLK-QVSDPTLPLCWKGQKAFKSVFDVKNEFKSMFLSFASA---KNAAMEI 288
Query: 506 SPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
PE YL+++K GN+CLGILDG+ +I GDI+++ Q+V+YDN ++GWA+ C
Sbjct: 289 PPENYLIVTKNGNVCLGILDGTAAKLSFNVI-GDITMQDQMVIYDNEKSQLGWARGACTR 347
Query: 566 PGR 568
+
Sbjct: 348 SAK 350
>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 302 bits (773), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 167/377 (44%), Positives = 228/377 (60%), Gaps = 16/377 (4%)
Query: 190 FPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRM 249
F ++GN+YP G Y + +GNPP+ Y LD+DTGSDLTW+QCDAPC C N LYKP
Sbjct: 52 FQIKGNVYPLGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCKGCTLPRNRLYKPH- 110
Query: 250 GNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLT 309
G+++ D LC IQ +QCDYE+EYAD SS+GVL RD + L NGSL
Sbjct: 111 GDLVKCVDPLCAAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNGSLA 170
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG 369
+P + FGC YDQ N T G+LGL + S+ SQL S G+I+NVVGHCL+
Sbjct: 171 RPMLAFGCGYDQTHHGQNPPPSTAGVLGLGNGRTSILSQLHSLGLIRNVVGHCLSGRG-- 228
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV-GWAL-F 427
GG++F G L+P G+ W P+L S + Y T G + L + + V G L F
Sbjct: 229 GGFLFFGDQLIPPSGVVWTPLLQSSSAQHYKT-------GPADLFFDRKTTSVKGLELIF 281
Query: 428 DTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
D+GSSYTYF QA+ L+ + ++ L DP+LP+CW+ P +S+ DV FK
Sbjct: 282 DSGSSYTYFNSQAHKALVNLIANDLRGKPLSRATGDPSLPICWKGPKPFKSLHDVTSNFK 341
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
L L F ++ + PE YL+++K GN+CLGILDG+E+ G+T I+GDISL+ +L
Sbjct: 342 PLLLSFTKS---KNSPLQLPPEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDISLQDKL 398
Query: 547 VVYDNVNKRIGWAKSHC 563
V+YDN ++IGWA ++C
Sbjct: 399 VIYDNEKQQIGWASANC 415
>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 453
Score = 301 bits (771), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 165/377 (43%), Positives = 222/377 (58%), Gaps = 16/377 (4%)
Query: 189 IFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR 248
+FPL+GN+YP G Y + +GNPP+PY LD+D+GSDLTW+QCDAPC SC K +P YKP
Sbjct: 55 VFPLQGNVYPQGFYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPPYKPN 114
Query: 249 MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSL 308
G I D +C + KP + +QCDYE+ YADH SS+GVL D L + NG+L
Sbjct: 115 KGPIT-CNDPMCSALHWPSKPPCKASHEQCDYEVSYADHGSSLGVLVHDIFSLQLTNGTL 173
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
P + FGC YDQ N DG+LGL K S+ +QL S G+I+++VGHCL+ G
Sbjct: 174 AAPRLAFGCGYDQSYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGG 233
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV-GWAL- 426
G ++ G P G+ W PM Y G + L +NS V G L
Sbjct: 234 GFLFLGDGLSTTP--GIIWTPMSRKSGESAYA-------LGPADLLFNGQNSGVKGLRLV 284
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
FD+GSSYTYF QAY + SL +G + + +D +LPVCWR P +SI +VK +FK
Sbjct: 285 FDSGSSYTYFNAQAYKTTL-SLVRKYLNGKLKETADESLPVCWRGAKPFKSIFEVKNYFK 343
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
L F + S + + PE YL+ISK GN CLGIL+GSEV G + ++GDI+ + ++
Sbjct: 344 PFALSFT---KAKSAQLQLPPESYLIISKHGNACLGILNGSEVGLGDSNVIGDIAFQDKM 400
Query: 547 VVYDNVNKRIGWAKSHC 563
V+YDN ++IGW C
Sbjct: 401 VIYDNERQQIGWVPKDC 417
>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Cucumis sativus]
Length = 418
Score = 301 bits (771), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 169/378 (44%), Positives = 239/378 (63%), Gaps = 17/378 (4%)
Query: 189 IFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR 248
+ PL+GN+YP+G Y + VG PP+PY+LD DTGSDLTW+QCDAPC C + +PLY+P
Sbjct: 44 VLPLQGNVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPS 103
Query: 249 MGNILPYKDSLCMEIQR--NHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENG 306
+++P KD LCM + +H+ CE QCDYE+EYAD SS+GVL RD L + NG
Sbjct: 104 -NDLVPCKDPLCMSLHSSMDHR---CENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNG 159
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
+P + GC YDQ ++ DGILGL R VS+ SQL +QGI++NVVGHC N
Sbjct: 160 DPIRPRLALGCGYDQDP-GSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCF--N 216
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWAL 426
+ GGGY F G + + + W PM + + Y ++ + + G RN V +
Sbjct: 217 SKGGGYXFFGDGIYDPYRLVWTPM-SRDYPKHYSPGFGELIFNGR--STGLRNLFV---V 270
Query: 427 FDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
FD+GSSYTYF QAY L + L +E++ L D TLP+CWR + PI+S+ DV+++F
Sbjct: 271 FDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYF 330
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQ 545
K L L F S + + F I EGY++IS GN+CLGIL+G++V ++ I+GDIS++ +
Sbjct: 331 KPLALSFSSGGRSKAV-FEIPTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDK 389
Query: 546 LVVYDNVNKRIGWAKSHC 563
+VVY+N + IGWA ++C
Sbjct: 390 MVVYNNEKQAIGWATANC 407
>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 301 bits (770), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 165/377 (43%), Positives = 222/377 (58%), Gaps = 16/377 (4%)
Query: 189 IFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR 248
+FPL+GN+YP G Y + +GNPP+PY LD+D+GSDLTW+QCDAPC SC K +P YKP
Sbjct: 22 VFPLQGNVYPQGFYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPPYKPN 81
Query: 249 MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSL 308
G I D +C + KP + +QCDYE+ YADH SS+GVL D L + NG+L
Sbjct: 82 KGPIT-CNDPMCSALHWPSKPPCKASHEQCDYEVSYADHGSSLGVLVHDIFSLQLTNGTL 140
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
P + FGC YDQ N DG+LGL K S+ +QL S G+I+++VGHCL+ G
Sbjct: 141 AAPRLAFGCGYDQSYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGG 200
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV-GWAL- 426
G ++ G P G+ W PM Y G + L +NS V G L
Sbjct: 201 GFLFLGDGLSTTP--GIIWTPMSRKSGESAYA-------LGPADLLFNGQNSGVKGLRLV 251
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
FD+GSSYTYF QAY + SL +G + + +D +LPVCWR P +SI +VK +FK
Sbjct: 252 FDSGSSYTYFNAQAYKTTL-SLVRKYLNGKLKETADESLPVCWRGAKPFKSIFEVKNYFK 310
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
L F + S + + PE YL+ISK GN CLGIL+GSEV G + ++GDI+ + ++
Sbjct: 311 PFALSFT---KAKSAQLQLPPESYLIISKHGNACLGILNGSEVGLGDSNVIGDIAFQDKM 367
Query: 547 VVYDNVNKRIGWAKSHC 563
V+YDN ++IGW C
Sbjct: 368 VIYDNERQQIGWVPKDC 384
>gi|255558640|ref|XP_002520345.1| nucellin, putative [Ricinus communis]
gi|223540564|gb|EEF42131.1| nucellin, putative [Ricinus communis]
Length = 424
Score = 300 bits (768), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 163/409 (39%), Positives = 231/409 (56%), Gaps = 13/409 (3%)
Query: 156 VVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPY 215
V+A+ +G + K S+ SS + P+ GN+YP G Y + +GNPP+ +
Sbjct: 22 VLAATFEGSFSAASQRCTLK-KSTQHSCFGSSLVLPVFGNVYPLGYYSVSLYIGNPPKLF 80
Query: 216 YLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETC 275
LD+DTGSDLTW+QCDAPC+ C K + LYKPR N+L D LC +Q +
Sbjct: 81 ELDIDTGSDLTWVQCDAPCTGCTKPLHHLYKPR-NNLLSCIDPLCSAVQNSGTYQCQSAT 139
Query: 276 QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGI 335
QCDYEI+YAD SS+GVL D L + NGS +P + FGC YDQ+ T G+
Sbjct: 140 DQCDYEIQYADEGSSLGVLVTDYFPLRLMNGSFLRPKMTFGCGYDQKSPGPVAPPPTTGV 199
Query: 336 LGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPF 395
LGL K S+ SQL + G++ NV+GHCL+ GGG++F G D VPS+G++W PM
Sbjct: 200 LGLGNGKTSIISQLQALGVMGNVIGHCLSRK--GGGFLFFGQDPVPSFGISWAPMSQKSL 257
Query: 396 MELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAY-SELIASLKEVSSD 454
+ Y + ++ YG P A +FD+GSSYTYF Q Y S L KE+S
Sbjct: 258 DKYYASGPAELLYGGKPTGTKAEE-----FIFDSGSSYTYFNAQVYQSTLNLIRKELSGK 312
Query: 455 GLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVIS 514
L + L +CW+ +S+ +VK +FK L F + S + I PE YL+++
Sbjct: 313 PLRDAPEEKALAICWKGTKRFKSVNEVKSYFKPFALSFT---KAKSVQLQIPPEDYLIVT 369
Query: 515 KKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
GN+CLGIL+GSEV G+ ++GD + +LV+YD+ +IGW ++C
Sbjct: 370 NDGNVCLGILNGSEVGLGNFNVIGDNLFQDKLVIYDSDKHQIGWIPANC 418
>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 410
Score = 298 bits (763), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 164/392 (41%), Positives = 240/392 (61%), Gaps = 14/392 (3%)
Query: 178 SSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC 237
SS DSS + P++GN+YP G + + +GNPP+ + LD+DTGSDLTW+QCDAPC+ C
Sbjct: 31 SSAVNPFDSSILLPVKGNVYPLGHFTVSVTIGNPPKVFELDIDTGSDLTWVQCDAPCTGC 90
Query: 238 AKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARD 297
+ LYKP N++ + LC + K QCDYE+EYADH SS+GVL +D
Sbjct: 91 TLPHDRLYKPH-NNVVRCGEPLCSALFSASKSPCKNPNDQCDYEVEYADHGSSIGVLVKD 149
Query: 298 ELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKN 357
+ L + NG++ PN+ FGC YDQ T G+LGL +K ++ +QL++ ++N
Sbjct: 150 PVPLRLTNGTILAPNLGFGCGYDQHNGGSQLPPLTAGVLGLGNSKATMATQLSALSHVRN 209
Query: 358 VVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA 417
V+GHC + GG++F G DLVPS GM+W+P+L +P + Y ++ +G +P +G
Sbjct: 210 VLGHCFSGQG--GGFLFFGGDLVPSSGMSWMPILRTPGGK-YSAGPAEVYFGGNP--VGI 264
Query: 418 RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDA-SDPTLPVCWRAKFPIR 476
R + FD+GSSYTYF Q Y ++ L+ + DA D TLP+CW+ +
Sbjct: 265 RGLIL---TFDSGSSYTYFNSQVYGAVLNLLRNGLKGQPLRDAPEDKTLPICWKGSKAFK 321
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
S+ DV+ FFK L L FG+ +F I PE YL+IS GN+CLGIL+GS+V G+ +
Sbjct: 322 SVADVRNFFKPLALSFGNS----KVQFQIPPEAYLIISNLGNVCLGILNGSQVGLGNVNL 377
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHCMNPGR 568
+GDIS+ +++VYDN ++IGWA ++C P R
Sbjct: 378 IGDISMLDKMMVYDNERQQIGWAPANCSKPPR 409
>gi|226530663|ref|NP_001146528.1| uncharacterized protein LOC100280120 [Zea mays]
gi|219887685|gb|ACL54217.1| unknown [Zea mays]
Length = 292
Score = 297 bits (760), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 140/284 (49%), Positives = 191/284 (67%), Gaps = 5/284 (1%)
Query: 291 MGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLA 350
MGV RD + E+G ++VFGC YDQQG+LLN L TDG+LGL+ +SLP+QLA
Sbjct: 1 MGVYVRDSMQFVGEDGERENADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLA 60
Query: 351 SQGIIKNVVGHCLTTN-AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYG 409
S+GII N GHC++T+ +G GGY+FLG D +P WGM WVP+ D P ++ ++ +IN+G
Sbjct: 61 SRGIISNAFGHCMSTDPSGAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHG 120
Query: 410 SSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCW 469
LN + +QV +FDTGS+YTYF +A + LI+SLKE +S V D SD TLP C
Sbjct: 121 DQQLNAQGKLTQV---VFDTGSTYTYFPDEALTRLISSLKEAASPRFVQDDSDKTLPFCM 177
Query: 470 RAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEV 529
++ FP+RS+ DVK FFK L+L F ++ S F+I PE YLVIS KGN+CLG+L+G+ +
Sbjct: 178 KSDFPVRSVEDVKHFFKPLSLQFEKRF-FFSRTFNIRPEHYLVISDKGNVCLGVLNGTTI 236
Query: 530 HNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
S +I+GD+SLRG+LV YDN +GW C NP + +P
Sbjct: 237 GYDSVVIVGDVSLRGKLVAYDNDKNEVGWVDFDCTNPRKRSRIP 280
>gi|449449755|ref|XP_004142630.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500674|ref|XP_004161165.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 413
Score = 292 bits (747), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 164/383 (42%), Positives = 227/383 (59%), Gaps = 21/383 (5%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
SS +FP+RGN+YP G + + +GNP + + LD+DTGSDLTW+QCD C C + LY
Sbjct: 37 SSVLFPVRGNVYPLGHFTVLLNIGNPSKVFELDIDTGSDLTWVQCDVECIGCTLPRDMLY 96
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN 305
+P N + +D LC + K + QC YE+EYADH SS+GVL +D + + + N
Sbjct: 97 RPH-NNAVSREDPLCAALSSLGKFIFKNPNDQCAYEVEYADHGSSVGVLVKDLVPMRLTN 155
Query: 306 GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
G PN+ FGC YDQ+ L G+LGLS +K ++ SQL+ G + NVVGHCLT
Sbjct: 156 GKRISPNLGFGCGYDQENGDLQQPPSIAGVLGLSSSKATIVSQLSDLGHVSNVVGHCLTG 215
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
GG++F G D+VPS GM+W P+L + + Y S P + VG
Sbjct: 216 RG--GGFLFFGGDVVPSSGMSWTPILRNS----------EGKYSSGPAEVYFNGRAVGIG 263
Query: 426 ----LFDTGSSYTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
FD+GSSYTYF Q Y + LK ++ + L L + D TL +CW+ P S+VD
Sbjct: 264 GLTLTFDSGSSYTYFNSQVYRAIEKLLKNDLKGNPLKLASDDKTLELCWKGPKPFESVVD 323
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDI 540
V+ FFK L + F + + +F I PE YL+IS+ GN+CLGILDGS+ G+ I+GDI
Sbjct: 324 VRNFFKPLAMSFKNSKNV---QFQIPPEAYLIISEFGNVCLGILDGSKEGMGNVNIIGDI 380
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
S+ ++VVYDN +RIGWA S+C
Sbjct: 381 SMLNKIVVYDNERERIGWASSNC 403
>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
Length = 393
Score = 291 bits (745), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 168/392 (42%), Positives = 236/392 (60%), Gaps = 19/392 (4%)
Query: 184 VDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP 243
V SS + PL GN+YP+G Y + +G P +PY+LD+DTGSDLTW+QCDAPC C + +P
Sbjct: 16 VPSSIVLPLHGNVYPNGYYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHP 75
Query: 244 LYKPRMGNILPYKDSLCMEIQRN--HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL 301
Y+PR N++P D +C + N H+ CE QCDYE+EYAD SS GVL D +L
Sbjct: 76 YYRPR-NNLVPCMDPICQSLHSNGDHR---CENPGQCDYEVEYADGGSSFGVLVTDTFNL 131
Query: 302 TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
+ P + GC YDQ + DG+LGL + K S+ SQL+S G+++NV+GH
Sbjct: 132 NFTSEKRHSPLLALGCGYDQ--FPGGSHHPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGH 189
Query: 362 CLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQ 421
CL+ + GG++F G DL S +AW PM SP + Y + ++ + G +N
Sbjct: 190 CLSGHG--GGFLFFGDDLYDSSRVAWTPM--SPDAKHYSPGLAELTFDGK--TTGFKNL- 242
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
FD+G+SYTY QAY LI+ L KE+S L D TLP+CW+ + P +SI D
Sbjct: 243 --LTTFDSGASYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKGRKPFKSIRD 300
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDI 540
VK++FKT L F ++ + T+ PE YL+IS KGN CLGIL+G+EV ++GDI
Sbjct: 301 VKKYFKTFALSFTNERK-SKTELEFPPEAYLIISSKGNACLGILNGTEVGLNDLNVIGDI 359
Query: 541 SLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSL 572
S++ ++V+YDN +RIGWA +C + KS
Sbjct: 360 SMQDRVVIYDNEKERIGWAPGNCNRLPKSKSF 391
>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 401
Score = 289 bits (740), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 159/348 (45%), Positives = 212/348 (60%), Gaps = 18/348 (5%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
SS +FP+ GN+YP G Y + +G PPRPYYLD+DTGSDLTW+QCDAPC C + +PLY
Sbjct: 41 SSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLY 100
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN 305
+P +++P D LC + N CET +QCDYE+EYAD SS+GVL RD +
Sbjct: 101 QPS-SDLIPCNDPLCKALHLNSNQ-RCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQ 158
Query: 306 GSLTKPNVVFGCAYDQ-QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
G P + GC YDQ G ++ DG+LGL R KVS+ SQL SQG +KNV+GHCL+
Sbjct: 159 GLRLTPRLALGCGYDQIPG--ASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLS 216
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG- 423
+ GGG +F G DL S ++W PM + + Y + L G R + +
Sbjct: 217 SL--GGGILFFGDDLYDSSRVSWTPM-SREYSKHYSPAM------GGELLFGGRTTGLKN 267
Query: 424 -WALFDTGSSYTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
+FD+GSSYTYF +AY + LK E+S L D TLP+CW+ + P SI +V
Sbjct: 268 LLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEV 327
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEV 529
K++FK L L F + W+ T F I PE YL+IS KGN+CLGIL+G+E+
Sbjct: 328 KKYFKPLALSFKTGWR-SKTLFEIPPEAYLIISMKGNVCLGILNGTEI 374
>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
Length = 376
Score = 289 bits (740), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 167/384 (43%), Positives = 234/384 (60%), Gaps = 20/384 (5%)
Query: 184 VDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP 243
V SS + PL GN+YP+G Y + +G P +PY+LD+DTGSDLTW+QCDAPC C + +P
Sbjct: 2 VPSSIVLPLHGNVYPNGYYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHP 61
Query: 244 LYKPRMGNILPYKDSLCMEIQRN--HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL 301
Y+PR N++P D +C + N H+ CE QCDYE+EYAD SS GVL RD +L
Sbjct: 62 YYRPR-NNLVPCMDPICQSLHSNGDHR---CENPGQCDYEVEYADGGSSFGVLVRDTFNL 117
Query: 302 TIENGSLTKPNVVFG-CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVG 360
+ P + G C YDQ + DG+LGL + K S+ SQL+S G+++NV+G
Sbjct: 118 NFTSEKRHSPLLALGLCGYDQ--FPGGSHHPIDGVLGLGKGKSSIVSQLSSLGLVRNVIG 175
Query: 361 HCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNS 420
HCL+ + GG++F G DL S +AW PM SP + Y + ++ + G +N
Sbjct: 176 HCLSGHG--GGFLFFGDDLYDSSRVAWTPM--SPDAKHYSPGLAELTFDGK--TTGFKNL 229
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIV 479
FD+G+SYTY QAY LI+ L KE+S L D TLP+CW+ + P +SI
Sbjct: 230 ---LTTFDSGASYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKGRKPFKSIR 286
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
DVK++FKT L F ++ + T+ PE YL+IS KGN CLGIL+G+EV ++GD
Sbjct: 287 DVKKYFKTFALSFTNERK-SKTELEFPPEAYLIISSKGNACLGILNGTEVGLNDLNVIGD 345
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
IS++ ++V+YDN +RIGWA +C
Sbjct: 346 ISMQDRVVIYDNEKERIGWAPGNC 369
>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 418
Score = 285 bits (729), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 156/387 (40%), Positives = 234/387 (60%), Gaps = 28/387 (7%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQC---DAPCSSCAKGAN 242
SS ++ ++GN+YPDG+Y + +GNPP PY LD+DTGSDLTW+QC DAPC C +
Sbjct: 46 SSLVYTIKGNVYPDGIYTVSINIGNPPNPYELDIDTGSDLTWVQCDGPDAPCKGCTLPKD 105
Query: 243 PLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQ----CDYEIEYADHSSSMGVLARDE 298
LYKP ++ D +C +Q + + C + C Y++EYAD++ S G LARD
Sbjct: 106 KLYKPNGNQLVKCSDPICAAVQPPFS-TFGQKCAKPIPPCVYKVEYADNAESTGALARDY 164
Query: 299 LHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNV 358
+H+ +GS P VVFGC Y+Q+ T G+LGL K+S+ SQL S G I NV
Sbjct: 165 MHIGSPSGS-NVPLVVFGCGYEQKFSGPTPPPSTPGVLGLGNGKISILSQLHSMGFIHNV 223
Query: 359 VGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR 418
+GHCL+ A GGGY+FLG +PS G+ W P++ S + Y T + + + P A+
Sbjct: 224 LGHCLS--AEGGGYLFLGDKFIPSSGIFWTPIIQSSLEKHYSTGPVDLFFNGKP--TPAK 279
Query: 419 NSQVGWALFDTGSSYTYFTKQAYSELIASL--KEVSSDGLVLDASDPTLPVCWRAKFPIR 476
Q+ +FD+GSSYTYF+ + Y+ ++A++ ++ L + DP+LP+CW+ P +
Sbjct: 280 GLQI---IFDSGSSYTYFSPRVYT-IVANMVNNDLKGKPLRRETKDPSLPICWKGVKPFK 335
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
S+ +V +FK LTL F + +F + P K GN+CLGIL+G+E G+ +
Sbjct: 336 SLNEVNNYFKPLTLSFTKSKNL---QFQLPP------VKFGNVCLGILNGNEAGLGNRNV 386
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+GDISL+ ++VVYDN ++IGWA ++C
Sbjct: 387 VGDISLQDKVVVYDNEKQQIGWASANC 413
>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
Length = 383
Score = 285 bits (729), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 148/343 (43%), Positives = 212/343 (61%), Gaps = 19/343 (5%)
Query: 177 VSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236
+++ A SS++FPL G++YP GLY+ M +GNPP+PY+LD+D+GSDLTW+QCDAPC S
Sbjct: 41 IAAGAETEPSSAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRS 100
Query: 237 CAKGANPLYKPRMGNILPYKDSLCMEIQR----NHKPGYCET-CQQCDYEIEYADHSSSM 291
C + +PLY+P ++P LC + H+ C++ +QCDY I+YAD SS
Sbjct: 101 CNEVPHPLYRPTKSKLVPCVHRLCASLHNGLTGKHR---CDSPHEQCDYVIKYADQGSST 157
Query: 292 GVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLAS 351
GVL D L + NGS+ +P+V FGC YDQQ + TDG+LGL VSL SQL
Sbjct: 158 GVLINDSFALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQ 217
Query: 352 QGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSS 411
+G+ KNVVGHCL+ GGG++F G DLVP W PM S F Y + +G
Sbjct: 218 RGVTKNVVGHCLSLR--GGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDR 275
Query: 412 PLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRA 471
+LG R ++V +FD+GSS+TYF + Y L+ +LK+ S L + D +LP+CW+
Sbjct: 276 --SLGVRLAKV---VFDSGSSFTYFAAKPYQALVTALKDGLSRTLE-EEPDTSLPLCWKG 329
Query: 472 KFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVIS 514
+ P +S++DV++ FK+L L+F S + T I PE YL+++
Sbjct: 330 QEPFKSVLDVRKEFKSLVLNFASGKK---TLMEIPPENYLIVT 369
>gi|356527532|ref|XP_003532363.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 429
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 166/383 (43%), Positives = 230/383 (60%), Gaps = 22/383 (5%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
SS + PL GN+YP G Y + +G P RPY+LD+DTGSDLTW+QCDAPC+ C++ +PLY
Sbjct: 53 SSIVLPLYGNVYPVGFYNVTLNIGQPARPYFLDVDTGSDLTWLQCDAPCTHCSETPHPLY 112
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN 305
+P + +P +D LC +Q CE QCDYEI YAD S+ GVL D L N
Sbjct: 113 RPS-NDFVPCRDPLCASLQPTEDYN-CEHPDQCDYEINYADQYSTFGVLLNDVYLLNFTN 170
Query: 306 GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
G K + GC YDQ ++ DG+LGL R K SL SQL SQG+++NV+GHCL+
Sbjct: 171 GVQLKVRMALGCGYDQV-FSPSSYHPLDGLLGLGRGKASLISQLNSQGLVRNVIGHCLS- 228
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL--GARNSQVG 423
A GGGY+F G + S + W P+ + + +Y + P L G R + VG
Sbjct: 229 -AQGGGYIFFG-NAYDSARVTWTPI----------SSVDSKHYSAGPAELVFGGRKTGVG 276
Query: 424 --WALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
A+FDTGSSYTYF AY L++ L KE+S L + D TLP+CW K P S+ +
Sbjct: 277 SLTAVFDTGSSYTYFNSHAYQALLSWLKKELSGKPLKVAPDDQTLPLCWHGKRPFTSLRE 336
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDI 540
V+++FK + L F + + +F I PE YL+IS GN+CLGIL+GSEV ++GDI
Sbjct: 337 VRKYFKPVALGFTNGGR-TKAQFEILPEAYLIISNLGNVCLGILNGSEVGLEELNLIGDI 395
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
S++ +++V++N + IGW + C
Sbjct: 396 SMQDKVMVFENEKQLIGWGPADC 418
>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 406
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 137/264 (51%), Positives = 183/264 (69%), Gaps = 15/264 (5%)
Query: 186 SSSIFP--LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP-CSSCAKGAN 242
+S++FP L GN++P+GLY+T + +G+PPRPY+LD+DTGS TW+QCDAP C+SCAKGA+
Sbjct: 142 NSTLFPHSLAGNLFPEGLYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAH 201
Query: 243 PLYKP-RMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL 301
PLY+P R + LP D LC Q E QCDYEI YAD SSSMGV RD +
Sbjct: 202 PLYRPARTADALPASDPLCEGAQH-------ENPNQCDYEISYADGSSSMGVYVRDSMQF 254
Query: 302 TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
E+G ++VFGC YDQQG+LLN L TDG+LGL+ +SLP+QLAS+GII N GH
Sbjct: 255 VGEDGERENADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGH 314
Query: 362 CLTTN-AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNS 420
C++T+ +G GGY+FLG D +P WGM WVP+ D P ++ ++ +IN+G LN + +
Sbjct: 315 CMSTDPSGAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLT 374
Query: 421 QVGWALFDTGSSYTYFTKQAYSEL 444
QV +FDTGS+YTYF +A + L
Sbjct: 375 QV---VFDTGSTYTYFPDEALTRL 395
>gi|356511197|ref|XP_003524315.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 431
Score = 278 bits (711), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 164/383 (42%), Positives = 230/383 (60%), Gaps = 22/383 (5%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
SS +FPL GN+YP G Y + +G P RPY+LD+DTGSDLTW+QCDAPC+ C++ +PL+
Sbjct: 55 SSIVFPLYGNVYPVGFYNVTLNIGQPARPYFLDVDTGSDLTWLQCDAPCTHCSETPHPLH 114
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN 305
+P + +P +D LC +Q CE QCDYEI YAD S+ GVL D L N
Sbjct: 115 RPS-NDFVPCRDPLCASLQPTEDYN-CEHPDQCDYEINYADQYSTYGVLLNDVYLLNSSN 172
Query: 306 GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
G K + GC YDQ ++ DG+LGL R K SL SQL SQG+++NV+GHCL++
Sbjct: 173 GVQLKVRMALGCGYDQV-FSPSSYHPLDGLLGLGRGKASLISQLNSQGLVRNVIGHCLSS 231
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL--GARNSQVG 423
GGGY+F G + S + W P+ + + +Y + P L G R + VG
Sbjct: 232 Q--GGGYIFFG-NAYDSARVTWTPI----------SSVDSKHYSAGPAELVFGGRKTGVG 278
Query: 424 --WALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
A+FDTGSSYTYF AY L++ L KE+S L + D TL +CW K P S+ +
Sbjct: 279 SLTAVFDTGSSYTYFNSHAYQALLSWLNKELSGKPLKVAPDDQTLSLCWHGKRPFTSLRE 338
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDI 540
V+++FK + L F + + V +F I PE YL+IS GN+CLGIL+G EV ++GDI
Sbjct: 339 VRKYFKPVALSFTNGGR-VKAQFEIPPEAYLIISNLGNVCLGILNGFEVGLEELNLVGDI 397
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
S++ +++V++N + IGW + C
Sbjct: 398 SMQDKVMVFENEKQLIGWGPADC 420
>gi|172034220|gb|ACB69715.1| putative nucellin-like aspartic protease [Hordeum vulgare]
Length = 310
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 142/275 (51%), Positives = 179/275 (65%), Gaps = 13/275 (4%)
Query: 299 LHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNV 358
+H NG K + V G +DQQG LL++ KT GILGLS A +SLPSQLAS+GII NV
Sbjct: 1 MHFNRYNGG-RKASFVLGVTFDQQGQLLSSPAKTSGILGLSSAAISLPSQLASKGIISNV 59
Query: 359 VGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR 418
GHC+T GGGYMFLG D VP WGM W P+ P LYHTE K+NYG L+ G
Sbjct: 60 FGHCITRETNGGGYMFLGDDYVPRWGMTWAPIRGGP-DNLYHTEAQKVNYGDQELHAGIP 118
Query: 419 NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSI 478
+ G+SYTY ++ Y LI ++KE S V D+SD TLP+CW+A F +RS
Sbjct: 119 VQVIS----RCGTSYTYLPEEMYKNLIDAIKE-DSPSFVQDSSDTTLPLCWKADFSVRS- 172
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILG 538
FFK L LHFG +W +V F I P+ YL+IS KGN+CLG+L+G+E+++GSTII+G
Sbjct: 173 -----FFKPLNLHFGRRWFVVPKTFTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIVG 227
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
D+SLRG+LVVYDN ++IGWA S C P K P
Sbjct: 228 DVSLRGKLVVYDNERRQIGWANSECTKPQSQKGFP 262
>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
Length = 379
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 161/384 (41%), Positives = 228/384 (59%), Gaps = 20/384 (5%)
Query: 184 VDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP 243
V SS + PL GN+YP G Y + +G P +PY+LD+DTGSDLTW+QCD P + C + +P
Sbjct: 2 VPSSIVLPLHGNVYPTGFYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDVPRAQCTEAPHP 61
Query: 244 LYKPRMGNILPYKDSLCMEIQRNHKPG--YCETCQQCDYEIEYADHSSSMGVLARDELHL 301
YKP N++ KD +C + H G CE QCDYE+EYAD SS+GVL +D +L
Sbjct: 62 YYKPS-NNLVACKDPICQSL---HTGGDQRCENPGQCDYEVEYADGGSSLGVLVKDAFNL 117
Query: 302 TIENGSLTKPNVVFG-CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVG 360
+ P + G C YDQ L T DG+LGL R K S+ SQL+ G+++NV+G
Sbjct: 118 NFTSEKRQSPLLALGLCGYDQ--LPGGTYHPIDGVLGLGRGKPSIVSQLSGLGLVRNVIG 175
Query: 361 HCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNS 420
HCL+ GG++F G DL S +AW PM SP + Y ++ + G +N
Sbjct: 176 HCLSGRG--GGFLFFGDDLYDSSRVAWTPM--SPNAKHYSPGFAELTFDGK--TTGFKNL 229
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKFPIRSIV 479
V FD+G+SYTY Q Y LI+ +K E+S+ L D TLP+CW+ + P +S+
Sbjct: 230 IVA---FDSGASYTYLNSQVYQGLISLIKRELSTKPLREALDDQTLPICWKGRKPFKSVR 286
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
DVK++FKT L F + + T+ PE YL++S KGN CLG+L+G+EV ++GD
Sbjct: 287 DVKKYFKTFALSFANDGK-SKTQLEFPPEAYLIVSSKGNACLGVLNGTEVGLNDLNVIGD 345
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
IS++ ++V+YDN + IGWA +C
Sbjct: 346 ISMQDRVVIYDNEKQLIGWAPRNC 369
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 275 bits (702), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 149/336 (44%), Positives = 205/336 (61%), Gaps = 31/336 (9%)
Query: 189 IFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR 248
IF L+GN+YP G Y+ M +GNP +PY+LD+DTGSDLTW+QCDAPC SC K +PLY+P
Sbjct: 41 IFQLQGNVYPTGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPT 100
Query: 249 MGNILPYKDSLCMEIQRNHKP-GYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS 307
+++P ++LC + H C + +QCDY+I+Y D +SS GVL D L + + +
Sbjct: 101 ANSLVPCANALCTALHSGHGSNNKCPSPKQCDYQIKYTDSASSQGVLINDNFSLPMRSSN 160
Query: 308 LTKPNVVFGCAYDQQ-GLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
+ +P + FGC YDQQ G TDG+LGL R VSL SQL QGI KNV+GHCL+TN
Sbjct: 161 I-RPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCLSTN 219
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINY---GSSPL-----NLGAR 418
GGG++F G D+VP+ + WVPM + NY GS L +LG +
Sbjct: 220 --GGGFLFFGDDIVPTSRVTWVPMAK-----------ISGNYYSPGSGTLYFDRRSLGVK 266
Query: 419 NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSI 478
+V +FD+GS+YTYFT Q Y ++++LK S L SDP+LP+CW+ +S+
Sbjct: 267 PMEV---VFDSGSTYTYFTAQPYQAVVSALKSGLSKSLK-QVSDPSLPLCWKGPKAFKSV 322
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVIS 514
DVK+ FK+L L F S V I PE YL+++
Sbjct: 323 FDVKKEFKSLFLSFASAKNAV---MEIPPENYLIVT 355
>gi|115484513|ref|NP_001065918.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|122221757|sp|Q0IU52.1|ASP1_ORYSJ RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
gi|33340111|gb|AAQ14543.1|AF308691_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|33340113|gb|AAQ14544.1|AF308692_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|62954898|gb|AAY23267.1| nucellin-like protein [Oryza sativa Japonica Group]
gi|77548967|gb|ABA91764.1| Aspartic proteinase Asp1 precursor, putative, expressed [Oryza
sativa Japonica Group]
gi|113644622|dbj|BAF27763.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|215766817|dbj|BAG99045.1| unnamed protein product [Oryza sativa Japonica Group]
gi|385717694|gb|AFI71282.1| aspartic proteinase [Oryza sativa Japonica Group]
Length = 410
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 156/391 (39%), Positives = 229/391 (58%), Gaps = 29/391 (7%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
S+ + L GN+YP G +F M +G+P + Y+LD+DTGS LTW+QCDAPC++C + LY
Sbjct: 22 SAVVLELHGNVYPIGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLY 81
Query: 246 KPRMGNILPYKDSLCMEIQRN-HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIE 304
KP ++ DSLC ++ + KP C + +QCDY I+Y D SSSMGVL D L+
Sbjct: 82 KPTPKKLVTCADSLCTDLYTDLGKPKRCGSQKQCDYVIQYVD-SSSMGVLVIDRFSLSAS 140
Query: 305 NGSLTKP-NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII-KNVVGHC 362
NG T P + FGC YDQ N + D ILGLSR KV+L SQL SQG+I K+V+GHC
Sbjct: 141 NG--TNPTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHC 198
Query: 363 LTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV 422
+++ GGG++F G VP+ G+ W PM + E + G L+ + + +
Sbjct: 199 ISSK--GGGFLFFGDAQVPTSGVTWTPM---------NREHKYYSPGHGTLHFDSNSKAI 247
Query: 423 GWA----LFDTGSSYTYFTKQAYSELIASLKE-VSSDGLVL---DASDPTLPVCWRAKFP 474
A +FD+G++YTYF Q Y ++ +K ++S+ L D L VCW+ K
Sbjct: 248 SAAPMAVIFDSGATYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGKDK 307
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVH--NG 532
I +I +VK+ F++L+L F + I PE YL+IS++G++CLGILDGS+ H
Sbjct: 308 IVTIDEVKKCFRSLSLEFADGDK--KATLEIPPEHYLIISQEGHVCLGILDGSKEHLSLA 365
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
T ++G I++ Q+V+YD+ +GW C
Sbjct: 366 GTNLIGGITMLDQMVIYDSERSLLGWVNYQC 396
>gi|222615640|gb|EEE51772.1| hypothetical protein OsJ_33215 [Oryza sativa Japonica Group]
Length = 775
Score = 254 bits (649), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 149/375 (39%), Positives = 219/375 (58%), Gaps = 29/375 (7%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCM 261
+F M +G+P + Y+LD+DTGS LTW+QCDAPC++C + LYKP ++ DSLC
Sbjct: 403 FFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLYKPTPKKLVTCADSLCT 462
Query: 262 EIQRN-HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP-NVVFGCAY 319
++ + KP C + +QCDY I+Y D SSSMGVL D L+ NG T P + FGC Y
Sbjct: 463 DLYTDLGKPKRCGSQKQCDYVIQYVD-SSSMGVLVIDRFSLSASNG--TNPTTIAFGCGY 519
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII-KNVVGHCLTTNAGGGGYMFLGHD 378
DQ N + D ILGLSR KV+L SQL SQG+I K+V+GHC+++ GGG++F G
Sbjct: 520 DQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCISSK--GGGFLFFGDA 577
Query: 379 LVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA----LFDTGSSYT 434
VP+ G+ W PM + E + G L+ + + + A +FD+G++YT
Sbjct: 578 QVPTSGVTWTPM---------NREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDSGATYT 628
Query: 435 YFTKQAYSELIASLKE-VSSDGLVL---DASDPTLPVCWRAKFPIRSIVDVKQFFKTLTL 490
YF Q Y ++ +K ++S+ L D L VCW+ K I +I +VK+ F++L+L
Sbjct: 629 YFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGKDKIVTIDEVKKCFRSLSL 688
Query: 491 HFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVH--NGSTIILGDISLRGQLVV 548
F + I PE YL+IS++G++CLGILDGS+ H T ++G I++ Q+V+
Sbjct: 689 EFADGDK--KATLEIPPEHYLIISQEGHVCLGILDGSKEHLSLAGTNLIGGITMLDQMVI 746
Query: 549 YDNVNKRIGWAKSHC 563
YD+ +GW C
Sbjct: 747 YDSERSLLGWVNYQC 761
Score = 184 bits (468), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 111/292 (38%), Positives = 170/292 (58%), Gaps = 35/292 (11%)
Query: 277 QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQ-GLLLNTLVKTDGI 335
QCDYEI+YAD +S++G L D+ L T+PN+ FGC Y+Q G +GI
Sbjct: 28 QCDYEIKYADGASTIGALIVDQFSLP---RIATRPNLPFGCGYNQGIGENFQQTSPVNGI 84
Query: 336 LGLSRAKVSLPSQLASQGII-KNVVGHCLTTNAGGGGYMFLGH---DLVPSWGMAWVPML 391
LGL R KVS SQL GII K+VVGHCL++ GGGG +F+G +LV + P
Sbjct: 85 LGLDRGKVSFVSQLKMLGIITKHVVGHCLSS--GGGGLLFVGDGDGNLVLLHANYYSPGS 142
Query: 392 DSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEV 451
+ + + + + G +P+++ +FD+GS+YTYFT Q Y + ++K
Sbjct: 143 ATLYFDRH-------SLGMNPMDV----------VFDSGSTYTYFTAQPYQATVYAIKGG 185
Query: 452 SSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYL 511
S + SDP+LP+CW+ + S+ DVK+ FK+L L+FG+ + I PE YL
Sbjct: 186 LSSTSLEQVSDPSLPLCWKGQKAFESVFDVKKEFKSLQLNFGN-----NAVMEIPPENYL 240
Query: 512 VISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
++++ GN+CLGIL G ++ I+GDI+++ Q+V+YDN +++GW + C
Sbjct: 241 IVTEYGNVCLGILHGCRLNFN---IIGDITMQDQMVIYDNEREQLGWIRGSC 289
>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
Length = 410
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 147/387 (37%), Positives = 216/387 (55%), Gaps = 21/387 (5%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
S+ + L GN+YP G +F M +G+P +PY+LD+DTGS LTW+QCD PC +C K + LY
Sbjct: 22 SAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLY 81
Query: 246 KPRMGNILPYKDSLCMEIQRN-HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIE 304
KP + + + C ++ + KP C QC Y I+Y SS+GVL D L
Sbjct: 82 KPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYVG-GSSIGVLIVDSFSLPAS 140
Query: 305 NGSLTKP-NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII-KNVVGHC 362
NG T P ++ FGC Y+Q N +GILGL R KV+L SQL SQG+I K+V+GHC
Sbjct: 141 NG--TNPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHC 198
Query: 363 LTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV 422
+++ G G++F G VP+ G+ W PM L+ N S P++
Sbjct: 199 ISSK--GKGFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLQFNSNSKPISAAPME--- 253
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLV----LDASDPTLPVCWRAKFPIRSI 478
+FD+G++YTYF Q Y ++ +K S + D L VCW+ K IR+I
Sbjct: 254 --VIFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTI 311
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHN--GSTII 536
+VK+ F++L+L F + + I PE YL+IS++G++CLGILDGS+ H T +
Sbjct: 312 DEVKKCFRSLSLKFADGDKKAT--LEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNL 369
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G I++ Q+V+YD+ +GW C
Sbjct: 370 IGGITMLDQMVIYDSERSLLGWVNYQC 396
>gi|37542275|gb|AAK81698.1| aspartyl proteinase [Oryza sativa]
Length = 410
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 146/387 (37%), Positives = 214/387 (55%), Gaps = 21/387 (5%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
S+ + L GN+YP G +F M + +P +PY+LD+DTGS LTW+QCD PC +C K + LY
Sbjct: 22 SAVVLELHGNVYPIGHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLY 81
Query: 246 KPRMGNILPYKDSLCMEIQRN-HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIE 304
KP + + + C ++ + KP C QC Y I+Y SS+GVL D L
Sbjct: 82 KPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYVG-GSSIGVLIVDSFSLPAS 140
Query: 305 NGSLTKP-NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII-KNVVGHC 362
NG T P ++ FGC Y+Q N +GILGL R KV+L SQL SQG+I K+V+GHC
Sbjct: 141 NG--TNPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHC 198
Query: 363 LTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV 422
+++ G G++F G VP+ G+ W PM L N S P++
Sbjct: 199 ISSK--GKGFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLHFNSNSKPISAAPME--- 253
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLV----LDASDPTLPVCWRAKFPIRSI 478
+FD+G++YTYF Q Y ++ +K S + D L VCW+ K IR+I
Sbjct: 254 --VIFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTI 311
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHN--GSTII 536
+VK+ F++L+L F + + I PE YL+IS++G++CLGILDGS+ H T +
Sbjct: 312 DEVKKCFRSLSLKFADGDKKAT--LEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNL 369
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G I++ Q+V+YD+ +GW C
Sbjct: 370 IGGITMLDQMVIYDSERSLLGWVNYQC 396
>gi|218185383|gb|EEC67810.1| hypothetical protein OsI_35379 [Oryza sativa Indica Group]
Length = 423
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 152/401 (37%), Positives = 221/401 (55%), Gaps = 36/401 (8%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
S+ + L GN+YP G +F M +G+P +PY+LD+DTGS LTW+QCD PC +C K A+ L+
Sbjct: 22 SAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNK-AHSLF 80
Query: 246 KPRM-GNILP---YKDSL----------CMEIQRN-HKPGYCETCQQCDYEIEYADHSSS 290
PR+ G+ +P YK L C ++ + KP C QC Y I+Y SS
Sbjct: 81 YPRLIGSFVPHGLYKPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYVG-GSS 139
Query: 291 MGVLARDELHLTIENGSLTKP-NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQL 349
+GVL D L NG T P ++ FGC Y+Q N +GILGL R KV+L SQL
Sbjct: 140 IGVLIVDSFSLPASNG--TNPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQL 197
Query: 350 ASQGII-KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINY 408
SQG+I K+V+GHC+++ G G++F G VP+ G+ W PM L+ N
Sbjct: 198 KSQGVITKHVLGHCISSK--GKGFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLQFNS 255
Query: 409 GSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLV----LDASDPT 464
S P++ +FD+G++YTYF Q Y ++ +K S + D
Sbjct: 256 NSKPISAAPME-----VIFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRA 310
Query: 465 LPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGIL 524
L VCW+ K IR+I +VK+ F++L+L F + I PE YL+IS++G++CLGIL
Sbjct: 311 LTVCWKGKDKIRTIDEVKKCFRSLSLKFADGDK--KATLEIPPEHYLIISQEGHVCLGIL 368
Query: 525 DGSEVHN--GSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
DGS+ H T ++G I++ Q+V+YD+ +GW C
Sbjct: 369 DGSKEHPSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQC 409
>gi|37542277|gb|AAK81699.1| aspartyl proteinase [Oryza sativa]
Length = 411
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 148/392 (37%), Positives = 216/392 (55%), Gaps = 30/392 (7%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
S+ + L GN+YP G +F M + +P +PY+LD+DTGS LTW+QCD PC +C K + LY
Sbjct: 22 SAVVLELHGNVYPIGHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLY 81
Query: 246 KPRMGNILPYKDSLCMEIQRN-HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIE 304
KP + + + C ++ + KP C QC Y I+Y SS+GVL D L
Sbjct: 82 KPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYVG-GSSIGVLIVDSFSLPAS 140
Query: 305 NGSLTKP-NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII-KNVVGHC 362
NG T P ++ FGC Y+Q N +GILGL R KV+L SQL SQG+I K+V+GHC
Sbjct: 141 NG--TNPTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHC 198
Query: 363 LTTNAGGGGYMFLGHDLVPSWGMAWVPMLD-----SPFMELYHTEILKINYGSSPLNLGA 417
+++ G G++F G VP+ G+ W PM SP H K SP++
Sbjct: 199 ISSK--GKGFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLHFNSNK----QSPISAAP 252
Query: 418 RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLV----LDASDPTLPVCWRAKF 473
+FD+G++YTYF Q Y ++ +K S + D L VCW+ K
Sbjct: 253 ME-----VIFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKD 307
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHN-- 531
IR+I +VK+ F++L+L F + + I PE YL+IS++G++CLGILDGS+ H
Sbjct: 308 KIRTIDEVKKCFRSLSLKFADGDKKAT--LEIPPEHYLIISQEGHVCLGILDGSKEHPSL 365
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
T ++G I++ Q+V+YD+ +GW C
Sbjct: 366 AGTNLIGGITMLDQMVIYDSERSLLGWVNYQC 397
>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
Length = 354
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 138/378 (36%), Positives = 200/378 (52%), Gaps = 67/378 (17%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
SS + PL GN++P G Y + +G PP+ + D+DTGSDLTW+QCDAPC+ C Y
Sbjct: 38 SSVVLPLSGNVFPLGYYSVLLQIGTPPKAFEFDIDTGSDLTWVQCDAPCTGCTLPPIRQY 97
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN 305
KP+ GN +P D +C+ + +KP +QCDYE+ YAD SSMG L D+ L + N
Sbjct: 98 KPK-GNTVPCLDPICLALHFPNKPQCPNPKEQCDYEVNYADQGSSMGALVIDQFPLKLLN 156
Query: 306 GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
GS +P + FGC YDQ + T G+LGL R K+ + QL + G+ +NVVGHCL++
Sbjct: 157 GSAMQPRLAFGCGYDQILPKAHPPPATAGVLGLGRGKIGVLPQLVAAGLTRNVVGHCLSS 216
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
GGGY+F G L +P L + L E
Sbjct: 217 K--GGGYLFFGDTL--------IPTLGVAWTPLLSPE----------------------- 243
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
YT+F + D L D + +S+++ K FF
Sbjct: 244 -------YTFFF------------HICRDRLQRDYTF------------FKSVLEFKNFF 272
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQ 545
KT+T++F + +I T+ I PE YL+ISK GN CLG+L+GSEV ++ ++GDIS++G
Sbjct: 273 KTITINFTNARRI--TQLQIPPESYLIISKTGNACLGLLNGSEVGLQNSNVIGDISMQGL 330
Query: 546 LVVYDNVNKRIGWAKSHC 563
+V+YDN +++GW S+C
Sbjct: 331 MVIYDNEKQQLGWVSSNC 348
>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
Length = 408
Score = 241 bits (615), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 147/390 (37%), Positives = 215/390 (55%), Gaps = 36/390 (9%)
Query: 189 IFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQC---DAPCSSCAKGANPLY 245
+F L G++YP G ++ M +G P PY+LD+DTGS TW++C D PC +C K +PLY
Sbjct: 26 VFKLDGSVYPVGHFYVTMNIGEPAEPYFLDIDTGSSFTWLECHAKDGPCKTCNKVPHPLY 85
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCETC-----QQCDYEIEYADHSSSMGVLARDELH 300
+ ++P D LC + ++ G + C QCDY+++Y D SS+GVL D+
Sbjct: 86 RLTRKKLVPCADPLCDALHKDL--GTTKKCTDVRKNQCDYKVKYQDGLSSLGVLLLDKFS 143
Query: 301 LTIENGSLTKPNVVFGCAYDQQGLLLNTL---VKTDGILGLSRAKVSLPSQLASQGII-K 356
L N+ FGC YDQ V DGILGL R V L SQL G + K
Sbjct: 144 LPTGGAR----NIAFGCGYDQMKGSKKKAPEKVPVDGILGLGRGSVDLASQLKHSGAVSK 199
Query: 357 NVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG 416
NV+GHCL++ GGGY+F+G + VPS + WVPM + E H + G + L+L
Sbjct: 200 NVIGHCLSSK--GGGYLFIGEENVPSSHVTWVPMAPTTPGEPNH-----YSPGQATLHLD 252
Query: 417 AR--NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFP 474
+ ++ A+FD+GS+YTY + +++L+++LK S + SDP LP+CW+ P
Sbjct: 253 SNPIGTKPLKAIFDSGSTYTYLPENLHAQLVSALKASLSKSSLKQVSDPALPLCWKGPKP 312
Query: 475 IRSIVDVKQFFKTL-TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGS 533
+++ D + FK+L TL F + I PE YL+I+ GN C GILD +
Sbjct: 313 FKTVHDTPKEFKSLVTLKFD-----LGVTMIIPPENYLIITGHGNACFGILDMPGLDQ-- 365
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+GDI+++ QLV+YDN R+ W S C
Sbjct: 366 -YIIGDITMQEQLVIYDNEKGRLAWMPSPC 394
>gi|357152725|ref|XP_003576216.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like,
partial [Brachypodium distachyon]
Length = 354
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 145/385 (37%), Positives = 202/385 (52%), Gaps = 58/385 (15%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
SS +F L G++YP G + M +G +PY+LD+DTGS LTW++
Sbjct: 20 SSMVFELHGDVYPTGHIYVTMSIGEQEKPYFLDIDTGSTLTWLE---------------- 63
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYC-ETCQQCDYEIEYADHSSSMGVLARDELHLTIE 304
+++ H C E QCDY++ YA SS+GVL D+ L
Sbjct: 64 ----------------DVRFKHD---CKENPNQCDYDVRYAGGESSLGVLIADKFSLP-- 102
Query: 305 NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII-KNVVGHCL 363
G +P + FGC YDQ+G + DG+LG+ R L SQL QG I +NV+GHCL
Sbjct: 103 -GRDARPTLTFGCGYDQEGG--KAEMPVDGVLGIGRGTRDLASQLKQQGAIAENVIGHCL 159
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG 423
GGGY+F GH+ VPS + WVPM+ P Y + +++ NLG S
Sbjct: 160 RIQ--GGGYLFFGHEKVPSSVVTWVPMV--PNNHYYSPGLAALHFNG---NLGNPISVAP 212
Query: 424 W-ALFDTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
+ D+GS+YTY + Y L+ + +S L L DP LPVCW K P + I DV
Sbjct: 213 MEVVIDSGSTYTYMPTETYRRLVFVVIASLSKSSLTL-VRDPALPVCWAGKEPFKXIGDV 271
Query: 482 KQFFKTLTLHF--GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
K FK L L F G+ I+ I PE YL+IS +GN+C+GILDG++ ++GD
Sbjct: 272 KDKFKPLELAFIQGTSQAIM----EIPPENYLIISGEGNVCMGILDGTQAGLRKLNVIGD 327
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHCM 564
IS++ QLV+YDN RIGW ++ C+
Sbjct: 328 ISMQNQLVIYDNERARIGWVRAPCV 352
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 229 bits (585), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 144/392 (36%), Positives = 223/392 (56%), Gaps = 42/392 (10%)
Query: 189 IFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDA---PCSSCAKGANPLY 245
+F L G+++P G ++ M +G P +PY+LD+DTGS+LTWI+C A PC +C K +PLY
Sbjct: 27 VFKLGGDVHPTGHFYVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTCNKVPHPLY 86
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCETCQ----QCDYEIEYADHSSSMGVLARDELHL 301
+P+ ++P D LC + ++ G + C+ QC Y+I YAD ++S+GVL D+ L
Sbjct: 87 RPK--KLVPCADPLCDALHKDL--GTTKDCREEPDQCHYQINYADGTTSLGVLLLDKFSL 142
Query: 302 TIENGSLTKPNVVFGCAYDQ-QGLLLNT--LVKTDGILGLSRAKVSLPSQLASQGII-KN 357
+ N+ FGC YDQ QG V DGILGL R V L SQL G + KN
Sbjct: 143 PTGSAR----NIAFGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVSKN 198
Query: 358 VVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA 417
V+GHCL++ GGGY+F+G + VPS + + ++ E + G + L+LG
Sbjct: 199 VIGHCLSSK--GGGYLFIGEENVPSSHLHII------YIYCISREPNHYSPGQATLHLG- 249
Query: 418 RN---SQVGWALFDTGSSYTYFTKQAYSELIASLKE--VSSDGLVLDASDPTLPVCWRAK 472
RN ++ A+FD+GS+YTY + +++L+++LK + S ++ +D L +CW+
Sbjct: 250 RNPIGTKPFKAIFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDTRLHLCWKGP 309
Query: 473 FPIRSIVDVKQFFKTL-TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHN 531
P +++ D+ + FK+L TL F + I PE YL+I+ GN C GIL E+
Sbjct: 310 KPFKTVHDLPKEFKSLVTLKFDHGVTMT-----IPPENYLIITGHGNACFGIL---ELPG 361
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
++G IS++ QLV++DN R+ W S C
Sbjct: 362 YDLFVIGGISMQEQLVIHDNEKGRLAWMPSPC 393
>gi|356507650|ref|XP_003522577.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 326
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/342 (38%), Positives = 185/342 (54%), Gaps = 39/342 (11%)
Query: 215 YYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCET 274
Y LD+DTGSDLTW Q DAPC C + L KP ++ D LC I H +
Sbjct: 12 YELDIDTGSDLTWFQWDAPCQGCTLPRDKLNKPHC-KLVKCGDRLCAAI---HSEPCADP 67
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDG 334
+QCDYE+EYAD SS+GVL D + L +GSL +P L D
Sbjct: 68 DEQCDYEVEYADQGSSLGVLVLDNIALKFTSGSLARP----------------ILAAPD- 110
Query: 335 ILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSP 394
+GL+ K S+ SQL S G+I+NVVGHCL+ GGG++F G L+P G+ W P+L +
Sbjct: 111 -MGLATGKTSILSQLHSLGLIRNVVGHCLSRR--GGGFLFFGDQLIPQSGVVWTPLLQNS 167
Query: 395 FMELYHTEILKINYGSSPLNL---GARNSQVGWAL-FDTGSSYTYFTKQAYSELIASL-K 449
+ +Y + P ++ G S G L FD+GSSYT F A+ L+ +
Sbjct: 168 -----SVTYTRPHYKTGPADMFFNGKATSVKGLELTFDSGSSYTXFNSHAHKALVGLITN 222
Query: 450 EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEG 509
++ DP+LP+CW+ +S+ DV +FK + L F ++ + PE
Sbjct: 223 DIKGKSFSRATEDPSLPICWKNPKTFKSLHDVTNYFKPIALSFTKS---KNSLLQLPPEA 279
Query: 510 YLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
YL+ K GN+CLGILDG+E+ G+T I+GDISL+ ++V+YDN
Sbjct: 280 YLI--KYGNVCLGILDGTEIGLGNTNIIGDISLQDKMVIYDN 319
>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 216 bits (550), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 131/387 (33%), Positives = 208/387 (53%), Gaps = 29/387 (7%)
Query: 190 FPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCA----KGANPLY 245
FPL GN+YP G ++ + +G P +PY+LD+DTGS+LTW++C P C + +P Y
Sbjct: 26 FPLEGNVYPVGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYY 85
Query: 246 KPRMGNILPYKDS-LCMEIQRNHKPGYCETCQ----QCDYEIEYADHSSSMGVLARDELH 300
P GN+ S LC+ ++R+ PG E + +C YEI+Y S G LA D +
Sbjct: 86 TPADGNLKVVCGSPLCVAVRRD-VPGIPECSRNDPHRCHYEIQYVT-GKSEGDLATDIIS 143
Query: 301 LTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVV 359
+ NG K + FGC Y Q+ + DGILGL K L +QL +IK NV+
Sbjct: 144 V---NGR-DKKRIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVI 199
Query: 360 GHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARN 419
GHCL++ G G +++G P+ G+ W PM +S F Y + ++ P+ R
Sbjct: 200 GHCLSSK--GKGVLYVGDFNPPTRGVTWAPMRESLFY--YSPGLAEVFIDKQPI----RG 251
Query: 420 SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV 479
+ A+FD+GS+YT+ Q Y+E+++ ++ S+ + + LP+CW+ K P S+
Sbjct: 252 NPTFEAVFDSGSTYTHVPAQIYNEIVSKVRVTLSESSLEEVKGRALPLCWKGKKPFGSVN 311
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS---EVHNGSTII 536
DVK FK L+L ++ I P+ YL + + G CL ILD S + + I+
Sbjct: 312 DVKNQFKALSLKITHARG--TSNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFIL 369
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G ++++ V+YDN K++GW ++ C
Sbjct: 370 IGAVTMQDLFVIYDNEKKQLGWVRAQC 396
>gi|2290202|gb|AAB96882.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|2290204|gb|AAB96883.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|45357050|gb|AAS58479.1| nucellin [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 214 bits (546), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 130/387 (33%), Positives = 206/387 (53%), Gaps = 29/387 (7%)
Query: 190 FPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCA----KGANPLY 245
FPL GN+YP G ++ + +G P +PY+LD+DTGS+LTW++C P C + +P Y
Sbjct: 26 FPLEGNVYPVGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPHPYY 85
Query: 246 KPRMGNILPYKDS-LCMEIQRNHKPGYCETCQ----QCDYEIEYADHSSSMGVLARDELH 300
P GN+ S LC+ ++R+ PG E + +C YEI+Y S G LA D +
Sbjct: 86 TPADGNLKVVCGSPLCVAVRRD-VPGIPECSRNDPHRCHYEIQYVT-GKSEGDLATDIIS 143
Query: 301 LTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVV 359
+ NG K + FGC Y Q+ + DGILGL K +QL +IK NV+
Sbjct: 144 V---NGR-DKKRIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVI 199
Query: 360 GHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARN 419
GHCL++ G G +++G P+ G+ W PM +S F Y + ++ P+ R
Sbjct: 200 GHCLSSK--GKGVLYVGDFNPPTRGVTWAPMRESLFY--YSPGLAEVFIDKQPI----RG 251
Query: 420 SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV 479
+ A+FD+GS+YT+ Q Y+E+++ ++ S+ + + LP+CW+ K P S+
Sbjct: 252 NPTFEAVFDSGSTYTHVPAQIYNEIVSKVRGTLSESSLEEVKGRALPLCWKGKKPFGSVN 311
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS---EVHNGSTII 536
DVK FK L+L + I P+ YL + + G CL ILD S + + I+
Sbjct: 312 DVKNQFKALSLKITHARG--TNNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFIL 369
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G ++++ V+YDN K++GW ++ C
Sbjct: 370 IGAVTMQDLFVIYDNEKKQLGWVRAQC 396
>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 211 bits (537), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 128/387 (33%), Positives = 205/387 (52%), Gaps = 29/387 (7%)
Query: 190 FPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCA----KGANPLY 245
FPL GN+YP G ++ + +G P +PY+LD+DTGS+LTW++C P C + +P Y
Sbjct: 26 FPLEGNVYPVGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHPPVHGCKGCHPRPPHPYY 85
Query: 246 KPRMGNILPYKDS-LCMEIQRNHKPGYCETCQ----QCDYEIEYADHSSSMGVLARDELH 300
P G + S LC+ ++R+ PG E + +C YEI+Y S G LA D +
Sbjct: 86 TPADGKLKVVCGSPLCVAVRRD-VPGIPECSRNDPHRCHYEIQYVT-GKSEGDLATDIIS 143
Query: 301 LTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVV 359
+ NG K + FGC Y Q+ + +GILGL K +QL +IK NV+
Sbjct: 144 V---NGR-DKKRIAFGCGYKQEEPPDSPPSPVNGILGLGMGKAGFAAQLKGLKMIKENVI 199
Query: 360 GHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARN 419
GHCL++ G G +++G P+ G+ W PM +S F Y + ++ P+ R
Sbjct: 200 GHCLSSK--GKGVLYVGDFNPPTRGVTWAPMRESLFY--YSPGLAEVFIDKQPI----RG 251
Query: 420 SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV 479
+ A+FD+GS+YT+ Q Y+E+++ ++ S+ + + LP+CW+ K P S+
Sbjct: 252 NPTFEAVFDSGSTYTHVPAQIYNEIVSKVRGTFSESSLEEVKGRALPLCWKGKKPFGSVN 311
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS---EVHNGSTII 536
DVK FK L+L + I P+ YL + + G CL ILD S + + I+
Sbjct: 312 DVKNQFKALSLKITHARG--TNNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFIL 369
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G ++++ V+YDN K++GW ++ C
Sbjct: 370 IGAVTMQDLFVIYDNEKKQLGWVRAQC 396
>gi|308080924|ref|NP_001183009.1| uncharacterized protein LOC100501329 [Zea mays]
gi|238008766|gb|ACR35418.1| unknown [Zea mays]
Length = 205
Score = 202 bits (515), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 98/200 (49%), Positives = 134/200 (67%), Gaps = 19/200 (9%)
Query: 121 ESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVAS--VNDGIIRPHKSKINKKLVS 178
SF+ PLY K + GR + G+ +A+ V+DG K++ ++
Sbjct: 22 RSFLLPLYPKA------------RQGRALREFGDVKLAARRVDDG---GRKARNRMEVAK 66
Query: 179 SNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCA 238
+ +S+++ P++GN++PDG Y+T + +GNPPRPY+LD+DTGSDLTWIQCDAPC++CA
Sbjct: 67 AATARTNSTALLPIKGNVFPDGQYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCA 126
Query: 239 KGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDE 298
KG +PLYKP I+P +D LC E+Q N YCETC+QCDYEIEYAD SSSMGVLARD+
Sbjct: 127 KGPHPLYKPAKEKIVPPRDLLCQELQGNQ--NYCETCKQCDYEIEYADQSSSMGVLARDD 184
Query: 299 LHLTIENGSLTKPNVVFGCA 318
+H+ NG K + VFGCA
Sbjct: 185 MHMIATNGGREKLDFVFGCA 204
>gi|388518245|gb|AFK47184.1| unknown [Lotus japonicus]
Length = 245
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 107/234 (45%), Positives = 148/234 (63%), Gaps = 15/234 (6%)
Query: 333 DGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLD 392
DG+LGL R K SL SQL SQG+++NVVGHCL+ A GGGY+F G D+ S + W PM
Sbjct: 13 DGMLGLGRGKSSLVSQLNSQGLVRNVVGHCLS--AQGGGYIFFG-DVYDSSRLTWTPMSS 69
Query: 393 SPFMELYHTEILKINYGSSPLNLGARNSQVGWAL--FDTGSSYTYFTKQAYSELIASLK- 449
+L H G++ L G + + +G L FDTGSSYTYF AY +I+ LK
Sbjct: 70 R---DLKHYVA-----GAAELIFGGKKTGIGGLLPVFDTGSSYTYFNSNAYQAVISWLKK 121
Query: 450 EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEG 509
E++ L D TLP+CW K P RS+ +V+++FK++ L F S + +T+F I PE
Sbjct: 122 ELAGKPLKEAPDDQTLPLCWHGKRPFRSVYEVRKYFKSMALSFTSSGR-TNTQFEIPPEA 180
Query: 510 YLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
YL++S GN+CLGILDGSEV G ++GDIS+ +++V+DN + IGWA + C
Sbjct: 181 YLIVSNMGNVCLGILDGSEVGMGDLNLIGDISMLDKVMVFDNEKRLIGWAPADC 234
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 119/398 (29%), Positives = 191/398 (47%), Gaps = 58/398 (14%)
Query: 190 FPLRGN--IYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
FP+ G+ + GLY+T + +G PP+ +Y+ +DTGSD+ W+ C PC++C + +N
Sbjct: 34 FPISGDDDTFTTGLYYTRIYLGTPPQQFYVHVDTGSDVAWVNC-VPCTNCKRASNVALPI 92
Query: 243 PLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
++ P + D C + N K + C Y Y D SS+ G L D L
Sbjct: 93 SIFDPEKSTSKTSISCTDEECY-LASNSKCSF--NSMSCPYSTLYGDGSSTAGYLINDVL 149
Query: 300 HLT-IENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI 354
+ +G+ T + + FGC +Q G L TDG++G +A+VSLPSQL+ Q +
Sbjct: 150 SFNQVPSGNSTATSGTARLTFGCGSNQTGTWL-----TDGLVGFGQAEVSLPSQLSKQNV 204
Query: 355 IKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYG----S 410
N+ HCL + G G + +GH P G+ + P++ P Y+ E+L I +
Sbjct: 205 SVNIFAHCLQGDNKGSGTLVIGHIREP--GLVYTPIV--PKQSHYNVELLNIGVSGTNVT 260
Query: 411 SPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWR 470
+P NS G + D+G++ TY + AY + A +++ G+ LPV ++
Sbjct: 261 TPTAFDLSNS--GGVIMDSGTTLTYLVQPAYDQFQAKVRDCMRSGV--------LPVAFQ 310
Query: 471 AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYL----VISKKGNICLGILDG 526
++ +F +TL+F ++ +SP YL + + C L+
Sbjct: 311 F------FCTIEGYFPNVTLYFAGGAAML-----LSPSSYLYKEMLTTGLSAYCFSWLES 359
Query: 527 SEVHNG-STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ V+ S I GD L+ QLVVYDNVN RIGW C
Sbjct: 360 TSVYGYLSYTIFGDNVLKDQLVVYDNVNNRIGWKNFDC 397
>gi|62954897|gb|AAY23266.1| Similar to nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|77548966|gb|ABA91763.1| Aspartic proteinase Asp1 precursor, putative [Oryza sativa Japonica
Group]
Length = 307
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 110/302 (36%), Positives = 164/302 (54%), Gaps = 50/302 (16%)
Query: 277 QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGIL 336
QCDYEI+YAD +S++G L D+ L T+PN+ FGC Y+Q GI
Sbjct: 28 QCDYEIKYADGASTIGALIVDQFSLPR---IATRPNLPFGCGYNQ------------GI- 71
Query: 337 GLSRAKVSLPSQLASQGII-KNVVGHCLTTNAGGGGYMFLGHD------LVPSWGMAWVP 389
S L GII K+VVGHCL++ GGGG +F+G L S G
Sbjct: 72 ---GENFQQTSPLKMLGIITKHVVGHCLSS--GGGGLLFVGDGDGNLVLLHASLGSLCPI 126
Query: 390 MLDSPFMELYHTEILKINY---GSSPL-----NLGARNSQVGWALFDTGSSYTYFTKQAY 441
+ +P + E + +NY GS+ L +LG V +FD+GS+YTYFT Q Y
Sbjct: 127 AISTPSS---YNEPMLMNYYSPGSATLYFDRHSLGMNPMDV---VFDSGSTYTYFTAQPY 180
Query: 442 SELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVST 501
+ ++K S + SDP+LP+CW+ + S+ DVK+ FK+L L+FG+ +
Sbjct: 181 QATVYAIKGGLSSTSLEQVSDPSLPLCWKGQKAFESVFDVKKEFKSLQLNFGN-----NA 235
Query: 502 KFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKS 561
I PE YL++++ GN+CLGIL G ++ I+GDI+++ Q+V+YDN +++GW +
Sbjct: 236 VMEIPPENYLIVTEYGNVCLGILHGCRLNFN---IIGDITMQDQMVIYDNEREQLGWIRG 292
Query: 562 HC 563
C
Sbjct: 293 SC 294
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 159 bits (401), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 145/506 (28%), Positives = 223/506 (44%), Gaps = 91/506 (17%)
Query: 81 LPRKLFLFLAISIFALILYGSVFSYTLQDRYKSNNDDENKESFVFPLYHKFGIREVSQRD 140
L RKL + +A+ + + + + S N FVF + HKF +E
Sbjct: 3 LRRKLCIVVAV-------------FVIVNEFASGN-------FVFKVQHKFAGKEK---- 38
Query: 141 AEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD- 199
KL F + H ++ + ++++S +D PL G+ D
Sbjct: 39 ---KLEHF-----------------KSHDTRRHSRMLAS----ID----LPLGGDSRVDS 70
Query: 200 -GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI------ 252
GLYFT + +G+PP+ Y++ +DTGSD+ W+ C PC C N + + ++
Sbjct: 71 VGLYFTKIKLGSPPKEYHVQVDTGSDILWVNC-KPCPECPSKTNLNFHLSLFDVNASSTS 129
Query: 253 --LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSL-T 309
+ D C I ++ C+ C Y I YAD S+S G RD+L L G L T
Sbjct: 130 KKVGCDDDFCSFISQSDS---CQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDLQT 186
Query: 310 KP---NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
P VVFGC DQ G L + DG++G ++ S+ SQLA+ G K V HCL N
Sbjct: 187 GPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL-DN 245
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWAL 426
GGG +G +V S + PM+ P Y+ ++ ++ + L+L + G +
Sbjct: 246 VKGGGIFAVG--VVDSPKVKTTPMV--PNQMHYNVMLMGMDVDGTALDLPPSIMRNGGTI 301
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
D+G++ YF K Y LI ++ ++ + L + T F VDV F
Sbjct: 302 VDSGTTLAYFPKVLYDSLIETI--LARQPVKLHIVEDTFQC-----FSFSENVDVA--FP 352
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGST--IILGDISLRG 544
++ F S K + P YL +K C G G T I+LGD+ L
Sbjct: 353 PVSFEFED-----SVKLTVYPHDYLFTLEKELYCFGWQAGGLTTGERTEVILLGDLVLSN 407
Query: 545 QLVVYDNVNKRIGWAKSHCMNPGRFK 570
+LVVYD N+ IGWA +C + + K
Sbjct: 408 KLVVYDLENEVIGWADHNCSSSIKIK 433
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 125/389 (32%), Positives = 193/389 (49%), Gaps = 34/389 (8%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
L ++ +G Y T + +G PP+ + L +DTGS +T++ C + C C K +P ++P + +
Sbjct: 70 LYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPC-STCKQCGKHQDPKFQPELSS 128
Query: 252 ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP 311
YK C N + + C YE YA+ SSS GVL+ D + E+ LT
Sbjct: 129 --SYKALKC-----NPDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNES-QLTPQ 180
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG 371
VFGC + G L + + DGI+GL R K+S+ QL +G+I++V C GGG
Sbjct: 181 RAVFGCENVETGDLFSQ--RADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGG 238
Query: 372 YMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK---INYGSSPLNLGARNSQVGWALFD 428
M LG + P GM + PF Y+ LK + S LN N + G L D
Sbjct: 239 AMVLGK-ISPPAGMVFSH--SDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVL-D 294
Query: 429 TGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSIVDVKQFFK 486
+G++Y YF K+A+ + A +KE+ S + DP VC+ R + ++ FF
Sbjct: 295 SGTTYAYFPKEAFIAIKDAIIKEIPSLKRI-HGPDPNYDDVCFSGAG--RDVAEIHNFFP 351
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILGDISLRG 544
+ + FG+ +++ +SPE YL K G CLGI + ST +LG I +R
Sbjct: 352 EIDMEFGNGQKLI-----LSPENYLFRHTKVRGAYCLGIFPDRD----STTLLGGIVVRN 402
Query: 545 QLVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
LV YD N ++G+ K++C + R + P
Sbjct: 403 TLVTYDRENDKLGFLKTNCSDLWRRLAAP 431
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 123/389 (31%), Positives = 193/389 (49%), Gaps = 34/389 (8%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
L ++ +G Y T + +G PP+ + L +DTGS +T++ C + C C K +P ++P +
Sbjct: 66 LYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPC-STCKQCGKHQDPKFQPELST 124
Query: 252 ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP 311
Y+ C N + + C YE YA+ SSS GVL+ D + E+ L+
Sbjct: 125 --SYQALKC-----NPDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNES-QLSPQ 176
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG 371
VFGC ++ G L + + DGI+GL R K+S+ QL +G+I++V C GGG
Sbjct: 177 RAVFGCENEETGDLFSQ--RADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGG 234
Query: 372 YMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK---INYGSSPLNLGARNSQVGWALFD 428
M LG + P GM + PF Y+ LK + S LN N + G L D
Sbjct: 235 AMVLGK-ISPPPGMVF--SHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVL-D 290
Query: 429 TGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSIVDVKQFFK 486
+G++Y YF K+A+ + A +KE+ S + DP VC+ R + ++ FF
Sbjct: 291 SGTTYAYFPKEAFIAIKDAVIKEIPSLKRI-HGPDPNYDDVCFSGAG--RDVAEIHNFFP 347
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILGDISLRG 544
+ + FG+ +++ +SPE YL K G CLGI + ST +LG I +R
Sbjct: 348 EIAMEFGNGQKLI-----LSPENYLFRHTKVRGAYCLGIFPDRD----STTLLGGIVVRN 398
Query: 545 QLVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
LV YD N ++G+ K++C + R + P
Sbjct: 399 TLVTYDRENDKLGFLKTNCSDIWRRLAAP 427
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 123/389 (31%), Positives = 192/389 (49%), Gaps = 34/389 (8%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
L ++ +G Y T + +G PP+ + L +DTGS +T++ C C C K +P ++P +
Sbjct: 66 LYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST-CKQCGKHQDPKFQPELST 124
Query: 252 ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP 311
Y+ C N + + C YE YA+ SSS GVL+ D + E+ L+
Sbjct: 125 --SYQALKC-----NPDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNES-QLSPQ 176
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG 371
VFGC ++ G L + + DGI+GL R K+S+ QL +G+I++V C GGG
Sbjct: 177 RAVFGCENEETGDLFSQ--RADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGG 234
Query: 372 YMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK---INYGSSPLNLGARNSQVGWALFD 428
M LG + P GM + PF Y+ LK + S LN N + G L D
Sbjct: 235 AMVLGK-ISPPPGMVF--SHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVL-D 290
Query: 429 TGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSIVDVKQFFK 486
+G++Y YF K+A+ + A +KE+ S + DP VC+ R + ++ FF
Sbjct: 291 SGTTYAYFPKEAFIAIKDAVIKEIPSLKRI-HGPDPNYDDVCFSGAG--RDVAEIHNFFP 347
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILGDISLRG 544
+ + FG+ +++ +SPE YL K G CLGI + ST +LG I +R
Sbjct: 348 EIAMEFGNGQKLI-----LSPENYLFRHTKVRGAYCLGIFPDRD----STTLLGGIVVRN 398
Query: 545 QLVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
LV YD N ++G+ K++C + R + P
Sbjct: 399 TLVTYDRENDKLGFLKTNCSDIWRRLAAP 427
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 156 bits (394), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 124/398 (31%), Positives = 182/398 (45%), Gaps = 41/398 (10%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP 247
PL G+ D GLYFT + +G+PP+ Y++ +DTGSD+ WI C PC C N ++
Sbjct: 60 LPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINC-KPCPKCPTKTNLNFRL 118
Query: 248 RMGNI--------LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
+ ++ + D C I ++ C+ C Y I YAD S+S G RD L
Sbjct: 119 SLFDMNASSTSKKVGCDDDFCSFISQSDS---CQPALGCSYHIVYADESTSDGKFIRDML 175
Query: 300 HLTIENGSL-TKP---NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII 355
L G L T P VVFGC DQ G L N DG++G ++ S+ SQLA+ G
Sbjct: 176 TLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDA 235
Query: 356 KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL 415
K V HCL N GGG +G +V S + PM+ P Y+ ++ ++ + L+L
Sbjct: 236 KRVFSHCL-DNVKGGGIFAVG--VVDSPKVKTTPMV--PNQMHYNVMLMGMDVDGTSLDL 290
Query: 416 GARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPI 475
+ G + D+G++ YF K Y LI ++ A P F
Sbjct: 291 PRSIVRNGGTIVDSGTTLAYFPKVLYDSLIETIL----------ARQPVKLHIVEETFQC 340
Query: 476 RSI-VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDG--SEVHNG 532
S +V + F ++ F S K + P YL ++ C G G +
Sbjct: 341 FSFSTNVDEAFPPVSFEFED-----SVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERS 395
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFK 570
I+LGD+ L +LVVYD N+ IGWA +C + + K
Sbjct: 396 EVILLGDLVLSNKLVVYDLDNEVIGWADHNCSSSIKIK 433
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 122/425 (28%), Positives = 197/425 (46%), Gaps = 48/425 (11%)
Query: 158 ASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRPY 215
AS N +R H + + +L+++ + PL G P GLYFT + +G PP+ Y
Sbjct: 46 ASANISALRVHDGRRHGRLLAA--------ADLPLGGLGLPTDTGLYFTEIKLGTPPKRY 97
Query: 216 YLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPRM---GNILPYKDSLCMEIQRNH 267
Y+ +DTGSD+ W+ C C C + + Y P+ G+ + C
Sbjct: 98 YVQVDTGSDILWVNC-ISCEKCPRKSGLGLDLTFYDPKASSSGSTVSCDQGFCAATYGGK 156
Query: 268 KPGYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGSLTKP---NVVFGCAYDQQG 323
PG C C+Y + Y D SS+ G D L + T+P V FGC Q G
Sbjct: 157 LPG-CTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQTQPGNATVTFGCGAQQGG 215
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSW 383
L ++ DGILG +A S+ SQLA+ G +K + HCL T GGG + +G+ + P
Sbjct: 216 DLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLDTIKGGGIFA-IGNVVQPK- 273
Query: 384 GMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG---WALFDTGSSYTYFTKQA 440
+ P++ M Y+ + I+ G + L L A + G + D+G++ TY +
Sbjct: 274 -VKTTPLVAD--MPHYNVNLKSIDVGGTTLQLPAHVFETGERKGTIIDSGTTLTYLPELV 330
Query: 441 YSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVS 500
+ E++A++ D + + D +C++ +P V F T+T HF +
Sbjct: 331 FKEVMAAIFNKHQDIVFHNVQD---FMCFQ--YP----GSVDDGFPTITFHFEDDLAL-- 379
Query: 501 TKFHISPEGYLVISKKGNICLGILDGS-EVHNGSTIIL-GDISLRGQLVVYDNVNKRIGW 558
H+ P Y + C+G +G+ + +G I+L GD+ L +LV+YD N+ IGW
Sbjct: 380 ---HVYPHEYFFPNGNDMYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVIYDLENQVIGW 436
Query: 559 AKSHC 563
+C
Sbjct: 437 TDYNC 441
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 155 bits (392), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 116/397 (29%), Positives = 180/397 (45%), Gaps = 46/397 (11%)
Query: 190 FPLRGNIYP--DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
FP+ G P GLY+T + +G PPR +Y+ +DTGSD+ W+ C A C+ C + +
Sbjct: 67 FPVDGTFDPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSC-ASCNGCPQTSGLQIQL 125
Query: 243 PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
+ P + + D C ++ G C Y +Y D S + G D L
Sbjct: 126 NFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVL 185
Query: 300 HLTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII 355
+ GS PN VVFGC+ Q G L+ + DGI G + +S+ SQLASQGI
Sbjct: 186 QFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIA 245
Query: 356 KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKI--NYGSSPL 413
V HCL GGGG + LG + P+ M + P++ P Y+ +L I N + P+
Sbjct: 246 PRVFSHCLKGENGGGGILVLGEIVEPN--MVFTPLV--PSQPHYNVNLLSISVNGQALPI 301
Query: 414 NLGARNSQVGWA-LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
N ++ G + DTG++ Y ++ AY + ++ S PV +
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNA--------VSQSVRPVVSKGN 353
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI------CLGILDG 526
V F ++L+F ++P+ YL+ ++ N+ C+G
Sbjct: 354 QCYVITTSVGDIFPPVSLNFAG-----GASMFLNPQDYLI--QQNNVGGTAVWCIGF--- 403
Query: 527 SEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ N ILGD+ L+ ++ VYD V +RIGWA C
Sbjct: 404 QRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 116/397 (29%), Positives = 180/397 (45%), Gaps = 46/397 (11%)
Query: 190 FPLRGNIYP--DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
FP+ G P GLY+T + +G PPR +Y+ +DTGSD+ W+ C A C+ C + +
Sbjct: 67 FPVDGTFDPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSC-ASCNGCPQTSGLQIQL 125
Query: 243 PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
+ P + + D C ++ G C Y +Y D S + G D L
Sbjct: 126 NFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVL 185
Query: 300 HLTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII 355
+ GS PN VVFGC+ Q G L+ + DGI G + +S+ SQLASQGI
Sbjct: 186 QFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIA 245
Query: 356 KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKI--NYGSSPL 413
V HCL GGGG + LG + P+ M + P++ P Y+ +L I N + P+
Sbjct: 246 PRVFSHCLKGENGGGGILVLGEIVEPN--MVFTPLV--PSQPHYNVNLLSISVNGQALPI 301
Query: 414 NLGARNSQVGWA-LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
N ++ G + DTG++ Y ++ AY + ++ S PV +
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNA--------VSQSVRPVVSKGN 353
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI------CLGILDG 526
V F ++L+F ++P+ YL+ ++ N+ C+G
Sbjct: 354 QCYVITTSVGDIFPPVSLNFAG-----GASMFLNPQDYLI--QQNNVGGTAVWCIGF--- 403
Query: 527 SEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ N ILGD+ L+ ++ VYD V +RIGWA C
Sbjct: 404 QRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 115/397 (28%), Positives = 180/397 (45%), Gaps = 46/397 (11%)
Query: 190 FPLRGNIYP--DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
FP+ G P GLY+T + +G+PPR +Y+ +DTGSD+ W+ C A C+ C + +
Sbjct: 67 FPVDGTFDPFVVGLYYTKIRLGSPPRDFYVQVDTGSDVLWVSC-ASCNGCPQTSGLQIQL 125
Query: 243 PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
+ P + D C ++ G C Y +Y D S + G D L
Sbjct: 126 NFFDPGSSVTATPVSCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVL 185
Query: 300 HLTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII 355
+ GS PN VVFGC+ Q G L+ + DGI G + +S+ SQLASQG+
Sbjct: 186 QFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLA 245
Query: 356 KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKI--NYGSSPL 413
V HCL GGGG + LG + P+ M + P++ P Y+ +L I N + P+
Sbjct: 246 PRVFSHCLKGENGGGGILVLGEIVEPN--MVFTPLV--PSQPHYNVNLLSISVNGQALPI 301
Query: 414 NLGARNSQVGWA-LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
N ++ G + DTG++ Y ++ AY + ++ S PV +
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNA--------VSQSVRPVVSKGN 353
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI------CLGILDG 526
V F ++L+F ++P+ YL+ ++ N+ C+G
Sbjct: 354 QCYVIATSVADIFPPVSLNFAG-----GASMFLNPQDYLI--QQNNVGGTAVWCIGF--- 403
Query: 527 SEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ N ILGD+ L+ ++ VYD V +RIGWA C
Sbjct: 404 QRIQNQGITILGDLVLKDKIFVYDLVGQRIGWANYDC 440
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 125/426 (29%), Positives = 186/426 (43%), Gaps = 63/426 (14%)
Query: 163 GIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMD 220
G +R H + +L+S A+D PL G+ P+ GLYF + +G P R +++ +D
Sbjct: 52 GALRAHDVHRHSRLLS----AID----IPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVD 103
Query: 221 TGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPY-------------KDSLCMEIQRNH 267
TGSD+ W+ C A C C + K + + PY D+ C + +
Sbjct: 104 TGSDILWVNC-AGCIRCPR------KSDLVELTPYDVDASSTAKSVSCSDNFCSYVNQRS 156
Query: 268 KPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSL----TKPNVVFGCAYDQQG 323
+ C + C Y I Y D SS+ G L +D +HL + G+ T ++FGC Q G
Sbjct: 157 E---CHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSG 213
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSW 383
L + DGI+G ++ S SQLASQG +K HCL N GGG + +G + P
Sbjct: 214 QLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFA-IGEVVSPK- 271
Query: 384 GMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG---WALFDTGSSYTYFTKQA 440
+ PML Y + I G+S L L + G + D+G++ Y
Sbjct: 272 -VKTTPMLSKSAH--YSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAV 328
Query: 441 YSELIASLKEVSSDGLVLDASDPTLPV-CWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIV 499
Y+ L+ + AS P L + + F D F T+T F
Sbjct: 329 YNPLLNEIL----------ASHPELTLHTVQESFTCFHYTDKLDRFPTVTFQFDK----- 373
Query: 500 STKFHISPEGYLVISKKGNICLGILDGSEVHNG--STIILGDISLRGQLVVYDNVNKRIG 557
S + P YL ++ C G +G G S ILGD++L +LVVYD N+ IG
Sbjct: 374 SVSLAVYPREYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIG 433
Query: 558 WAKSHC 563
W +C
Sbjct: 434 WTNHNC 439
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 124/429 (28%), Positives = 190/429 (44%), Gaps = 69/429 (16%)
Query: 163 GIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMD 220
G +R H + +L+S A+D PL G+ P+ GLYF + +G P R +++ +D
Sbjct: 52 GALRAHDVHRHSRLLS----AID----LPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVD 103
Query: 221 TGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPY-------------KDSLCMEIQRNH 267
TGSD+ W+ C A C C + K + + PY D+ C + +
Sbjct: 104 TGSDILWVNC-AGCIRCPR------KSDLVELTPYDADASSTAKSVSCSDNFCSYVNQRS 156
Query: 268 KPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSL----TKPNVVFGCAYDQQG 323
+ C + C Y I Y D SS+ G L RD +HL + G+ T ++FGC Q G
Sbjct: 157 E---CHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSG 213
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSW 383
L + DGI+G ++ S SQLASQG +K HCL N GGG + +G + P
Sbjct: 214 QLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFA-IGEVVSPK- 271
Query: 384 GMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG---WALFDTGSSYTYFTKQA 440
+ PML Y + I G+S L L + G + D+G++ Y
Sbjct: 272 -VKTTPMLSKS--AHYSVNLNAIEVGNSVLQLSSDAFDSGDDKGVIIDSGTTLVYLPDAV 328
Query: 441 Y----SELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKW 496
Y ++++AS +E++ + + F +D F T+T F
Sbjct: 329 YNPLMNQILASHQELNLHTV-------------QDSFTCFHYIDRLDRFPTVTFQFDK-- 373
Query: 497 QIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNG--STIILGDISLRGQLVVYDNVNK 554
S + P+ YL ++ C G +G G S ILGD++L +LVVYD N+
Sbjct: 374 ---SVSLAVYPQEYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQ 430
Query: 555 RIGWAKSHC 563
IGW +C
Sbjct: 431 VIGWTNHNC 439
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 152 bits (385), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 110/391 (28%), Positives = 173/391 (44%), Gaps = 38/391 (9%)
Query: 190 FPLRGNIYP-DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN------ 242
F ++G+ P GLYFT + +GNP R + + +DTGSD+ W+ C +PC C +
Sbjct: 71 FSVKGSSNPFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTC-SPCDGCPDSSGLGIELN 129
Query: 243 --PLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELH 300
K +LP D +C + +T C Y Y D S + G D +H
Sbjct: 130 LFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQT-DHCSYSFHYRDRSGTSGFYVTDSMH 188
Query: 301 LTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK 356
I G T N +VFGC+ Q G L DGI G + + S+ SQL+S+GI
Sbjct: 189 FDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITP 248
Query: 357 NVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG 416
V HCL GGG + LG L PS + + P++ S + + ++ P
Sbjct: 249 KVFSHCLKGGENGGGILVLGEILEPS--IVYSPLIPSQPHYTLKLQSIALSGQLFPNPTM 306
Query: 417 ARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIR 476
S G + D+G++ Y ++ Y +++ + S ++ PT+ R R
Sbjct: 307 FPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVS-----QSATPTIS---RGSQCFR 358
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYL----VISKKGNICLGILDGSEVHNG 532
+ V F L +F +V ++PE YL ++ + C+G + N
Sbjct: 359 VSMSVADIFPVLRFNFEGIASMV-----VTPEEYLQFDSIVREPALWCIGFQKAEDGLN- 412
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
ILGD+ L+ +++VYD +RIGWA C
Sbjct: 413 ---ILGDLVLKDKIIVYDLARQRIGWANYDC 440
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 130/404 (32%), Positives = 194/404 (48%), Gaps = 50/404 (12%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP 247
PL G+ D GLYFT + +G+PP+ YY+ +DTGSD+ W+ C APC C P+ K
Sbjct: 63 LPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNC-APCPKC-----PV-KT 115
Query: 248 RMGNILPYKDSLCMEIQRN--HKPGYC------ETC---QQCDYEIEYADHSSSMGVLAR 296
+G L DS +N + +C ETC + C Y + Y D S+S G +
Sbjct: 116 DLGIPLSLYDSKASSTSKNVGCEDAFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVK 175
Query: 297 DELHLTIENGSL-TKP---NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQ 352
D + L G+L T P VVFGC +Q G L T DGI+G ++ S+ SQLA+
Sbjct: 176 DNITLDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAG 235
Query: 353 GIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSS 411
G +K + HCL N GGG +G V S + P++ + ++++ ILK ++
Sbjct: 236 GSVKRIFSHCL-DNMNGGGIFAIGE--VESPVVKTTPLVPN---QVHYNVILKGMDVDGE 289
Query: 412 PLNLG---ARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVC 468
P++L A + G + D+G++ Y + Y+ LI K + + L T
Sbjct: 290 PIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIE--KITAKQQVKLHMVQETF--- 344
Query: 469 WRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS- 527
A F S D + F + LHF S K + P YL ++ C G G
Sbjct: 345 --ACFSFTSNTD--KAFPVVNLHFED-----SLKLSVYPHDYLFSLREDMYCFGWQSGGM 395
Query: 528 EVHNGSTII-LGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFK 570
+G+ +I LGD+ L +LVVYD N+ IGWA +C + + K
Sbjct: 396 TTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSSSIKVK 439
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 119/401 (29%), Positives = 185/401 (46%), Gaps = 54/401 (13%)
Query: 190 FPLRGNIYP--DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
FPL G+ P GLY+T + +G PP YY+ +DTGSD+TW+ C APC+SC
Sbjct: 23 FPLTGDDDPFVTGLYYTKIYLGTPPVGYYVQVDTGSDVTWLNC-APCTSCVTETQLPSIK 81
Query: 243 -PLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARD- 297
Y P + L +DS C +++ C + C Y Y D SS+ G +D
Sbjct: 82 LTTYDPSRSSTDGALSCRDSNCGAALGSNEVS-CTSAGYCAYSTTYGDGSSTQGYFIQDV 140
Query: 298 ----ELHLTIE-NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQ 352
E+H + NG+ +V FGC Q G LL + DG++G +A VS+PSQLAS
Sbjct: 141 MTFQEIHNNTQVNGT---ASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASM 197
Query: 353 GIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEI-LKINYGSS 411
G + N HCL + GGG + +G P+ +++ P++ + I + ++
Sbjct: 198 GKVGNRFAHCLQGDNQGGGTIVIGSVSEPN--ISYTPIVSRNHYAVGMQNIAVNGRNVTT 255
Query: 412 PLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCW-- 469
P + ++ G + D+G++ Y AY++ + ++ S + + L + W
Sbjct: 256 PASFDTTSTSAGGVIMDSGTTLAYLVDPAYTQFVNAVSTFESS--MFSSHSQCLQLAWCS 313
Query: 470 -RAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYL----VISKKGNICLGIL 524
+A FP VK FF +++P YL + + + C+G
Sbjct: 314 LQADFPT-----VKLFFD------------AGAVMNLTPRNYLYSQPLQNGQAAYCMG-W 355
Query: 525 DGSEVHNG--STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
S G S ILGDI L+ LVVYDN N+ +GW C
Sbjct: 356 QKSTTKAGYLSYSILGDIVLKDHLVVYDNDNRVVGWKSFDC 396
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 151 bits (382), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 125/412 (30%), Positives = 189/412 (45%), Gaps = 49/412 (11%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTG 222
+ H ++ + ++++S +D PL G+ D GLYFT + +G+PP+ Y++ +DTG
Sbjct: 43 FKSHDTRRHSRMLAS----ID----LPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTG 94
Query: 223 SDLTWIQCDAPCSSCAKGANPLYKPRMGNI--------LPYKDSLCMEIQRNHKPGYCET 274
SD+ WI C PC C N ++ + ++ + D C I ++ C+
Sbjct: 95 SDILWINC-KPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDS---CQP 150
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENGSL-TKP---NVVFGCAYDQQGLLLNTLV 330
C Y I YAD S+S G RD L L G L T P VVFGC DQ G L N
Sbjct: 151 ALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDS 210
Query: 331 KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM 390
DG++G ++ S+ SQLA+ G K V HCL N GGG +G +V S + PM
Sbjct: 211 AVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL-DNVKGGGIFAVG--VVDSPKVKTTPM 267
Query: 391 LDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKE 450
+ P Y+ ++ ++ + L+L + G + D+G++ YF K Y LI ++
Sbjct: 268 V--PNQMHYNVMLMGMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFPKVLYDSLIETIL- 324
Query: 451 VSSDGLVLDASDPTLPVCWRAKFPIRSI-VDVKQFFKTLTLHFGSKWQIVSTKFHISPEG 509
A P F S +V + F ++ F S K + P
Sbjct: 325 ---------ARQPVKLHIVEETFQCFSFSTNVDEAFPPVSFEFED-----SVKLTVYPHD 370
Query: 510 YLVISKKGNICLGILDG--SEVHNGSTIILGDISLRGQLVVYDNVNKRIGWA 559
YL ++ C G G + I+LGD+ L +LVVYD N+ IGWA
Sbjct: 371 YLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWA 422
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 151 bits (382), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 126/425 (29%), Positives = 201/425 (47%), Gaps = 48/425 (11%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTG 222
+R H ++ + +++S AVD PL GN +P GLYF + +G P + YY+ +DTG
Sbjct: 124 LRAHDTRRHGRILS----AVD----LPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTG 175
Query: 223 SDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCET 274
SD+ W+ C A C C ++ LY + + + D+ C + PG C+
Sbjct: 176 SDILWVNC-AGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFC-SLYDGPLPG-CKP 232
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENGSL----TKPNVVFGCAYDQQGLLLNTLV 330
QC Y + Y D SS+ G +D + +G+ T VVFGC Q G L ++
Sbjct: 233 GLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSE 292
Query: 331 KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM 390
DGILG +A S+ SQLAS G +K V HCL N GGG +G + P + P+
Sbjct: 293 ALDGILGFGQANSSMLSQLASSGKVKKVFSHCL-DNVDGGGIFAIGEVVEPKVNIT--PL 349
Query: 391 LDSPFMELYHTEILKINYGSSPLNLGARNSQVG---WALFDTGSSYTYFTKQAYSELIAS 447
+ + Y+ + +I G PL++ + + G + D+G++ YF ++ Y LI
Sbjct: 350 VQN--QAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLI-- 405
Query: 448 LKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISP 507
+ ++ D L +A +V F T+TLHF S + P
Sbjct: 406 ------EKILSQQPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHFDK-----SISLTVYP 454
Query: 508 EGYLVISKKGNICLGILD-GSEVHNGSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
YL K+ C+G + G++ +G + +LGD+ L +LVVYD + IGW + +C +
Sbjct: 455 HEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNCSS 514
Query: 566 PGRFK 570
+ K
Sbjct: 515 SIKVK 519
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 120/406 (29%), Positives = 182/406 (44%), Gaps = 60/406 (14%)
Query: 182 VAVDSSSI-FPLRG--NIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCA 238
V + SS++ P+ G + Y GLYFT + +G PPR Y L +DTGSDL W+ C PC C
Sbjct: 13 VKLKSSAVSLPVEGVADPYIAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCH-PCIGC- 70
Query: 239 KGANPLYKPRMGNILPY-------------KDSLCMEIQRNHKPGYCETCQQCDYEIEYA 285
P + I+PY D C I + + G C QC Y +Y
Sbjct: 71 ----PAFSDLKIPIVPYDVKASASSSKVPCSDPSCTLITQISESG-CNDQNQCGYSFQYG 125
Query: 286 DHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSL 345
D S ++G L D LH + N + T V+FGC + Q G L + DGI+G + +S
Sbjct: 126 DGSGTLGYLVEDVLHYMV-NATAT---VIFGCGFKQSGDLSTSERALDGIIGFGASDLSF 181
Query: 346 PSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK 405
SQLA QG NV HCL GGG + LG+ + P + + P++ P+M Y+ +
Sbjct: 182 NSQLAKQGKTPNVFAHCLDGGERGGGILVLGNVIEPD--IQYTPLV--PYMSHYNVVLQS 237
Query: 406 INYGSSPLNLGAR---NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASD 462
I+ ++ L + + N + +FD+G++ Y +AY ++ V + L+ D
Sbjct: 238 ISVNNANLTIDPKLFSNDVMQGTIFDSGTTLAYLPDEAYQAFTQAVSLVVAPFLLCDTR- 296
Query: 463 PTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN---- 518
++F + + F + L+F ++P YL+
Sbjct: 297 -------LSRF-------IYKLFPNVVLYFE------GASMTLTPAEYLIRQASAANAPI 336
Query: 519 ICLGILD-GSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
C+G GS I GD+ L+ +LVVYD RIGW C
Sbjct: 337 WCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 120/406 (29%), Positives = 182/406 (44%), Gaps = 60/406 (14%)
Query: 182 VAVDSSSI-FPLRG--NIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCA 238
V + SS++ P+ G + Y GLYFT + +G PPR Y L +DTGSDL W+ C PC C
Sbjct: 13 VKLKSSAVSLPVEGVADPYIAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCH-PCIGC- 70
Query: 239 KGANPLYKPRMGNILPY-------------KDSLCMEIQRNHKPGYCETCQQCDYEIEYA 285
P + I+PY D C I + + G C QC Y +Y
Sbjct: 71 ----PAFSDLKIPIVPYDVKASASSSKVPCSDPSCTLITQISESG-CNDQNQCGYSFQYG 125
Query: 286 DHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSL 345
D S ++G L D LH + N + T V+FGC + Q G L + DGI+G + +S
Sbjct: 126 DGSGTLGYLVEDVLHYMV-NATAT---VIFGCGFKQSGDLSTSERALDGIIGFGASDLSF 181
Query: 346 PSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK 405
SQLA QG NV HCL GGG + LG+ + P + + P++ P+M Y+ +
Sbjct: 182 NSQLAKQGKTPNVFAHCLDGGERGGGILVLGNVIEPD--IQYTPLV--PYMYHYNVVLQS 237
Query: 406 INYGSSPLNLGAR---NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASD 462
I+ ++ L + + N + +FD+G++ Y +AY ++ V + L+ D
Sbjct: 238 ISVNNANLTIDPKLFSNDVMQGTIFDSGTTLAYLPDEAYQAFTQAVSLVVAPFLLCDTR- 296
Query: 463 PTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN---- 518
++F + + F + L+F ++P YL+
Sbjct: 297 -------LSRF-------IYKLFPNVVLYFE------GASMTLTPAEYLIRQASAANAPI 336
Query: 519 ICLGILD-GSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
C+G GS I GD+ L+ +LVVYD RIGW C
Sbjct: 337 WCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 126/425 (29%), Positives = 201/425 (47%), Gaps = 48/425 (11%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTG 222
+R H ++ + +++S AVD PL GN +P GLYF + +G P + YY+ +DTG
Sbjct: 43 LRAHDTRRHGRILS----AVD----LPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTG 94
Query: 223 SDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCET 274
SD+ W+ C A C C ++ LY + + + D+ C + PG C+
Sbjct: 95 SDILWVNC-AGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFC-SLYDGPLPG-CKP 151
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENGSL----TKPNVVFGCAYDQQGLLLNTLV 330
QC Y + Y D SS+ G +D + +G+ T VVFGC Q G L ++
Sbjct: 152 GLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSE 211
Query: 331 KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM 390
DGILG +A S+ SQLAS G +K V HCL N GGG +G + P + P+
Sbjct: 212 ALDGILGFGQANSSMLSQLASSGKVKKVFSHCL-DNVDGGGIFAIGEVVEPKVNIT--PL 268
Query: 391 LDSPFMELYHTEILKINYGSSPLNLGARNSQVG---WALFDTGSSYTYFTKQAYSELIAS 447
+ + Y+ + +I G PL++ + + G + D+G++ YF ++ Y LI
Sbjct: 269 VQN--QAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLI-- 324
Query: 448 LKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISP 507
+ ++ D L +A +V F T+TLHF S + P
Sbjct: 325 ------EKILSQQPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHFDK-----SISLTVYP 373
Query: 508 EGYLVISKKGNICLGILD-GSEVHNGSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
YL K+ C+G + G++ +G + +LGD+ L +LVVYD + IGW + +C +
Sbjct: 374 HEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNCSS 433
Query: 566 PGRFK 570
+ K
Sbjct: 434 SIKVK 438
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 149 bits (376), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 125/401 (31%), Positives = 190/401 (47%), Gaps = 44/401 (10%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
PL G+ D GLYFT + +G+PP+ YY+ +DTGSD+ W+ C APC C +
Sbjct: 60 LPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNC-APCPKCPVKTDLGIPL 118
Query: 243 PLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
LY + + + +D C I ++ C + C Y + Y D S+S G +D +
Sbjct: 119 SLYDSKTSSTSKNVGCEDDFCSFIMQSET---CGAKKPCSYHVVYGDGSTSDGDFIKDNI 175
Query: 300 HLTIENGSL-TKP---NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII 355
L G+L T P VVFGC +Q G L T DGI+G ++ S+ SQLA+ G
Sbjct: 176 TLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGST 235
Query: 356 KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLN 414
K + HCL N GGG +G V S + P++ + ++++ ILK ++ P++
Sbjct: 236 KRIFSHCL-DNMNGGGIFAVGE--VESPVVKTTPIVPN---QVHYNVILKGMDVDGDPID 289
Query: 415 LG---ARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRA 471
L A + G + D+G++ Y + Y+ LI K + + L T A
Sbjct: 290 LPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIE--KITAKQQVKLHMVQETF-----A 342
Query: 472 KFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS-EVH 530
F S D + F + LHF S K + P YL ++ C G G
Sbjct: 343 CFSFTSNTD--KAFPVVNLHFED-----SLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQ 395
Query: 531 NGSTII-LGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFK 570
+G+ +I LGD+ L +LVVYD N+ IGWA +C + + K
Sbjct: 396 DGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSSSIKVK 436
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 149 bits (376), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 125/401 (31%), Positives = 190/401 (47%), Gaps = 44/401 (10%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
PL G+ D GLYFT + +G+PP+ YY+ +DTGSD+ W+ C APC C +
Sbjct: 64 LPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNC-APCPKCPVKTDLGIPL 122
Query: 243 PLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
LY + + + +D C I ++ C + C Y + Y D S+S G +D +
Sbjct: 123 SLYDSKTSSTSKNVGCEDDFCSFIMQSET---CGAKKPCSYHVVYGDGSTSDGDFIKDNI 179
Query: 300 HLTIENGSL-TKP---NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII 355
L G+L T P VVFGC +Q G L T DGI+G ++ S+ SQLA+ G
Sbjct: 180 TLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGST 239
Query: 356 KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLN 414
K + HCL N GGG +G V S + P++ + ++++ ILK ++ P++
Sbjct: 240 KRIFSHCL-DNMNGGGIFAVGE--VESPVVKTTPIVPN---QVHYNVILKGMDVDGDPID 293
Query: 415 LG---ARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRA 471
L A + G + D+G++ Y + Y+ LI K + + L T A
Sbjct: 294 LPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIE--KITAKQQVKLHMVQETF-----A 346
Query: 472 KFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS-EVH 530
F S D + F + LHF S K + P YL ++ C G G
Sbjct: 347 CFSFTSNTD--KAFPVVNLHFED-----SLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQ 399
Query: 531 NGSTII-LGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFK 570
+G+ +I LGD+ L +LVVYD N+ IGWA +C + + K
Sbjct: 400 DGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSSSIKVK 440
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 149 bits (375), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 112/394 (28%), Positives = 176/394 (44%), Gaps = 41/394 (10%)
Query: 190 FPLRGNIYP-DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN------ 242
F ++G+ P GLYFT + +GNP R + + +DTGSD+ W+ C +PC C +
Sbjct: 71 FSVKGSSNPFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTC-SPCDGCPDSSGLGIELN 129
Query: 243 --PLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELH 300
K +LP D +C + +T C Y Y D S + G D +H
Sbjct: 130 LFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQT-DHCSYSFHYRDRSGTSGFYVTDSMH 188
Query: 301 LTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK 356
I G T N +VFGC+ Q G L DGI G + + S+ SQL+S+GI
Sbjct: 189 FDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITP 248
Query: 357 NVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG 416
V HCL GGG + LG L PS + + P++ S + + ++ P
Sbjct: 249 KVFSHCLKGGENGGGILVLGEILEPS--IVYSPLIPSQPHYTLKLQSIALSGQLFPNPTM 306
Query: 417 ARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIR 476
S G + D+G++ Y ++ Y +++ + S ++ PT+ R R
Sbjct: 307 FPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVS-----QSATPTIS---RGSQCFR 358
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYL----VIS--KKGNI-CLGILDGSEV 529
+ V F L +F +V ++PE YL ++S K ++ C+G +
Sbjct: 359 VSMSVADIFPVLRFNFEGIASMV-----VTPEEYLQFDSIVSCYKFASLWCIGFQKAEDG 413
Query: 530 HNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
N ILGD+ L+ +++VYD +RIGWA C
Sbjct: 414 LN----ILGDLVLKDKIIVYDLAQQRIGWANYDC 443
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 118/402 (29%), Positives = 188/402 (46%), Gaps = 31/402 (7%)
Query: 166 RPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDL 225
R H+ + + LV ++ S++ L ++ +G Y T + +G+PP+ + L +DTGS +
Sbjct: 57 RDHRLRHLQNLVKPHS----SNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTV 112
Query: 226 TWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYA 285
T++ C + C C +P ++P + + Y+ C N E QC YE YA
Sbjct: 113 TYVPC-SNCVQCGNHQDPRFQPELSST--YQPVKC-----NADCNCDENGVQCTYERRYA 164
Query: 286 DHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSL 345
+ S+S GVLA D + E+ L VFGC + G L + DGI+GL R +S+
Sbjct: 165 EMSTSSGVLAEDVMSFGKES-ELVPQRAVFGCETMESGDLYTQ--RADGIMGLGRGTLSV 221
Query: 346 PSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK 405
QL +G++ N C GGG M LG P GM + D Y+ E+ +
Sbjct: 222 MDQLVGKGVVSNSFSLCYGGMDVGGGAMVLGGISSPP-GMVF-SHSDPSRSPYYNIELKE 279
Query: 406 INYGSSPLNLGARNSQVGW-ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT 464
I+ PL L R + A+ D+G++Y YF ++AY ++ + S + DP
Sbjct: 280 IHVAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPN 339
Query: 465 LP-VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICL 521
+C+ R + ++ + F + + F + K +SPE YL K G CL
Sbjct: 340 FKDICFSGAG--RDVTELPKVFPEVDMVFAN-----GQKISLSPENYLFRHTKVSGAYCL 392
Query: 522 GILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
GI N T +LG I +R LV Y+ N IG+ K++C
Sbjct: 393 GIFKNG---NDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNC 431
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 118/402 (29%), Positives = 188/402 (46%), Gaps = 31/402 (7%)
Query: 166 RPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDL 225
R H+ + + LV ++ S++ L ++ +G Y T + +G+PP+ + L +DTGS +
Sbjct: 57 RDHRLRHLQNLVKPHS----SNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTV 112
Query: 226 TWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYA 285
T++ C + C C +P ++P + + Y+ C N E QC YE YA
Sbjct: 113 TYVPC-SNCVQCGNHQDPRFQPELSST--YQPVKC-----NADCNCDENGVQCTYERRYA 164
Query: 286 DHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSL 345
+ S+S GVLA D + E+ L VFGC + G L + DGI+GL R +S+
Sbjct: 165 EMSTSSGVLAEDVMSFGKES-ELVPQRAVFGCETMESGDLYTQ--RADGIMGLGRGTLSV 221
Query: 346 PSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK 405
QL +G++ N C GGG M LG P GM + D Y+ E+ +
Sbjct: 222 MDQLVGKGVVSNSFSLCYGGMDVGGGAMVLGGISSPP-GMVF-SHSDPSRSPYYNIELKE 279
Query: 406 INYGSSPLNLGARNSQVGW-ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT 464
I+ PL L R + A+ D+G++Y YF ++AY ++ + S + DP
Sbjct: 280 IHVAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPN 339
Query: 465 LP-VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICL 521
+C+ R + ++ + F + + F + K +SPE YL K G CL
Sbjct: 340 FKDICFSGAG--RDVTELPKVFPEVDMVFAN-----GQKISLSPENYLFRHTKVSGAYCL 392
Query: 522 GILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
GI N T +LG I +R LV Y+ N IG+ K++C
Sbjct: 393 GIFKNG---NDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNC 431
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 111/399 (27%), Positives = 179/399 (44%), Gaps = 47/399 (11%)
Query: 190 FPLRGNI--YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC--AKG---AN 242
F L+G Y GLY+T + +G PPRP+Y+ +DTGSD+ W+ C PC++C G A
Sbjct: 27 FTLQGTADPYVAGLYYTRIELGTPPRPFYVQIDTGSDILWVNC-KPCNACPLTSGLGVAL 85
Query: 243 PLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
+ PR + L DS C+ + + C T + C Y EY D S ++G DE
Sbjct: 86 NFFDPRGSSTASPLSCIDSKCVSSNQISE-SVCTTDRYCGYSFEYGDGSGTLGYYVSDEF 144
Query: 300 HLT------IENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQG 353
+ N + K + FGC+Y+Q G L DGI G + +S+ SQL SQG
Sbjct: 145 DYNQYVNQYVTNNASAK--ITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQG 202
Query: 354 IIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPL 413
+ + HCL GGG + LG P GM + P++ P Y+ + I L
Sbjct: 203 LAPKIFSHCLEGADPGGGILVLGEITEP--GMVYTPIV--PSQPHYNLNLQGIAVNGQQL 258
Query: 414 NLGAR---NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWR 470
++ + + + D G++ Y ++AY + + ++ S T P +
Sbjct: 259 SIDPQVFATTNTRGTIIDCGTTLAYLAEEAYEPFVNT--------IIAAVSQSTQPFMLK 310
Query: 471 AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI----CLGILDG 526
++ + + F ++TL+F + P+ YL+ + C+G
Sbjct: 311 GNPCFLTVHSIDEIFPSVTLYFE------GAPMDLKPKDYLIQQLSPDSSPVWCIGWQKS 364
Query: 527 SEVHNGST--IILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ S+ ILGD+ L+ ++ VYD N+RIGW C
Sbjct: 365 GQQATDSSKMTILGDLVLKDKVFVYDLENQRIGWTSFDC 403
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 115/379 (30%), Positives = 184/379 (48%), Gaps = 33/379 (8%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
L ++ +G Y T + +G PP+ + L +DTGS +T++ C + C C K +P ++P + +
Sbjct: 67 LFDDLLSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSS-CEQCGKHQDPRFQPDLSS 125
Query: 252 ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP 311
Y+ C N + +QC YE YA+ SSS GV+A D + + N S KP
Sbjct: 126 T--YRPVKC-----NPSCNCDDEGKQCTYERRYAEMSSSSGVIAEDVV--SFGNESELKP 176
Query: 312 N-VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
VFGC + G L + + DGI+GL R ++S+ QL +G+I + C GG
Sbjct: 177 QRAVFGCENVETGDLYSQ--RADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGG 234
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGAR--NSQVGWALF 427
G M LG + P M + +P+ Y+ E+ +++ PL L + + + G L
Sbjct: 235 GAMVLGQ-ISPPPNMVF--SHSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKHGTVL- 290
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSIVDVKQFFK 486
D+G++Y YF + A+ L ++ + + DP +C+ R + + + F
Sbjct: 291 DSGTTYAYFPEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAG--REVSHLSKVFP 348
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILGDISLRG 544
+ + FGS K +SPE YL K G CLGI N T +LG I +R
Sbjct: 349 EVNMVFGS-----GQKLSLSPENYLFRHTKVSGAYCLGIFQNG---NDLTTLLGGIVVRN 400
Query: 545 QLVVYDNVNKRIGWAKSHC 563
LV YD N +IG+ K++C
Sbjct: 401 TLVTYDRENDKIGFWKTNC 419
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 116/412 (28%), Positives = 194/412 (47%), Gaps = 49/412 (11%)
Query: 167 PHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLT 226
P+ S++ L V ++ L ++ +G Y T + +G PP+ + L +D+GS +T
Sbjct: 53 PNASRLAASLRRGLGDGVHPNARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVT 112
Query: 227 WIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCE---TC----QQCD 279
++ C + C C +P ++P ++ ++ P C TC +QC
Sbjct: 113 YVPCSS-CEQCGNHQDPRFQP--------------DLSSSYSPVKCNVDCTCDSDKKQCT 157
Query: 280 YEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLS 339
YE +YA+ SSS GVL D + E+ L + +FGC + G L + DGI+GL
Sbjct: 158 YERQYAEMSSSSGVLGEDIVSFGRES-ELKPQHAIFGCENSETGDLFSQ--HADGIMGLG 214
Query: 340 RAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLV-PSWGMAWVPMLDSPFMEL 398
R ++S+ QL +G+I + C GGG M LG L P + L SP+
Sbjct: 215 RGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGMLAPPDMIFSNSDPLRSPY--- 271
Query: 399 YHTEILKINYGSSPLNLGAR--NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGL 456
Y+ E+ +I+ L + +R NS+ G L D+G++Y Y +QA+ ++
Sbjct: 272 YNIELKEIHVAGKALRVESRIFNSKHGTVL-DSGTTYAYLPEQAFVAFKEAVTSKVHSLK 330
Query: 457 VLDASDPTLP-VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISK 515
+ DP+ +C+ R++ + + F + + FG+ K ++PE YL
Sbjct: 331 KIRGPDPSYKDICFAGAG--RNVSKLHEVFPDVDMVFGN-----GQKLSLTPENYLFRHS 383
Query: 516 K--GNICLGILDGSEVHNGS--TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
K G CLG+ NG T +LG I +R LV YD N++IG+ K++C
Sbjct: 384 KVDGAYCLGVF-----QNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNC 430
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 125/425 (29%), Positives = 200/425 (47%), Gaps = 49/425 (11%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTG 222
+R H ++ + +++S AVD PL GN +P GLYF + +G P + YY+ +DTG
Sbjct: 124 LRAHDTRRHGRILS----AVD----LPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTG 175
Query: 223 SDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCET 274
SD+ W+ C A C C ++ LY + + + D+ C + PG C+
Sbjct: 176 SDILWVNC-AGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFC-SLYDGPLPG-CKP 232
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENGSL----TKPNVVFGCAYDQQGLLLNTLV 330
QC Y + Y D SS+ G +D + +G+ T VVFGC Q G L ++
Sbjct: 233 GLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSE 292
Query: 331 KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM 390
DGILG +A S+ SQLAS G +K V HCL N GGG +G + P + P+
Sbjct: 293 ALDGILGFGQANSSMLSQLASSGKVKKVFSHCL-DNVDGGGIFAIGEVVEPKVNIT--PL 349
Query: 391 LDSPFMELYHTEILKINYGSSPLNLGARNSQVG---WALFDTGSSYTYFTKQAYSELIAS 447
+ + Y+ + +I G PL++ + + G + D+G++ YF ++ Y LI
Sbjct: 350 VQN--QAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLI-- 405
Query: 448 LKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISP 507
+ ++ D L +A +V F T+TLHF S + P
Sbjct: 406 ------EKILSQQPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHFDK-----SISLTVYP 454
Query: 508 EGYLVISKKGNICLGILD-GSEVHNGSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
YL + C+G + G++ +G + +LGD+ L +LVVYD + IGW + +C +
Sbjct: 455 HEYL-FQHEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNCSS 513
Query: 566 PGRFK 570
+ K
Sbjct: 514 SIKVK 518
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 145 bits (366), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 112/395 (28%), Positives = 174/395 (44%), Gaps = 44/395 (11%)
Query: 190 FPLRGNI--YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC----AKGANP 243
FP++G+ Y GLYFT + +G+PP + + +DTGSD+ W+ C + CS+C G +
Sbjct: 86 FPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPHSSGLGIDL 144
Query: 244 LYKPRMGNI----LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
+ G++ + D +C + + C QC Y Y D S + G D
Sbjct: 145 HFFDAPGSLTAGSVTCSDPICSSVFQT-TAAQCSENNQCGYSFRYGDGSGTSGYYMTDTF 203
Query: 300 HLTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII 355
+ G N +VFGC+ Q G L + DGI G + K+S+ SQL+S+GI
Sbjct: 204 YFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGIT 263
Query: 356 KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL 415
V HCL + GGG LG LVP GM + P++ P Y+ +L I L L
Sbjct: 264 PPVFSHCLKGDGSGGGVFVLGEILVP--GMVYSPLV--PSQPHYNLNLLSIGVNGQMLPL 319
Query: 416 GA---RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
A S + DTG++ TY K+AY + ++ S P+ +
Sbjct: 320 DAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISN--------SVSQLVTPIISNGE 371
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYL----VISKKGNICLGILDGSE 528
+ F +++L+F ++ + P+ YL + C+G E
Sbjct: 372 QCYLVSTSISDMFPSVSLNFAGGASMM-----LRPQDYLFHYGIYDGASMWCIGFQKAPE 426
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
ILGD+ L+ ++ VYD +RIGWA C
Sbjct: 427 EQT----ILGDLVLKDKVFVYDLARQRIGWASYDC 457
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 145 bits (366), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 112/395 (28%), Positives = 174/395 (44%), Gaps = 44/395 (11%)
Query: 190 FPLRGNI--YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC----AKGANP 243
FP++G+ Y GLYFT + +G+PP + + +DTGSD+ W+ C + CS+C G +
Sbjct: 86 FPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPHSSGLGIDL 144
Query: 244 LYKPRMGNI----LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
+ G++ + D +C + + C QC Y Y D S + G D
Sbjct: 145 HFFDAPGSLTAGSVTCSDPICSSVFQT-TAAQCSENNQCGYSFRYGDGSGTSGYYMTDTF 203
Query: 300 HLTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII 355
+ G N +VFGC+ Q G L + DGI G + K+S+ SQL+S+GI
Sbjct: 204 YFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGIT 263
Query: 356 KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL 415
V HCL + GGG LG LVP GM + P++ P Y+ +L I L L
Sbjct: 264 PPVFSHCLKGDGSGGGVFVLGEILVP--GMVYSPLV--PSQPHYNLNLLSIGVNGQMLPL 319
Query: 416 GA---RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
A S + DTG++ TY K+AY + ++ S P+ +
Sbjct: 320 DAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISN--------SVSQLVTPIISNGE 371
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYL----VISKKGNICLGILDGSE 528
+ F +++L+F ++ + P+ YL + C+G E
Sbjct: 372 QCYLVSTSISDMFPSVSLNFAGGASMM-----LRPQDYLFHYGIYDGASMWCIGFQKAPE 426
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
ILGD+ L+ ++ VYD +RIGWA C
Sbjct: 427 EQT----ILGDLVLKDKVFVYDLARQRIGWASYDC 457
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 123/445 (27%), Positives = 194/445 (43%), Gaps = 62/445 (13%)
Query: 160 VNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRPYYL 217
N +R H + + +L+++ + PL G P GLYFT + +G PP+ YY+
Sbjct: 51 ANISALRAHDGRRHGRLLAA--------ADLPLGGLGLPTDTGLYFTEIKLGTPPKRYYV 102
Query: 218 DMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPRM---GNILPYKDSLCMEIQRNHKP 269
+DTGSD+ W+ C CS C + + Y P+ G+ + C P
Sbjct: 103 QVDTGSDILWVNC-ISCSKCPRKSGLGLDLTFYDPKASSSGSTVSCDQGFCAATYGGKLP 161
Query: 270 GYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENG-SLTKP---NVVFGCAYDQQGLL 325
G C C+Y + Y D SS+ G D L G T+P + FGC Q G L
Sbjct: 162 G-CTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQTQPGNATITFGCGAQQGGDL 220
Query: 326 LNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY------------- 372
N+ DGILG +A S+ SQLA+ G K + HCL T GGG +
Sbjct: 221 GNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDTIKGGGIFAIGNVVQPKCYFV 280
Query: 373 MFLGHDL--VPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG---WALF 427
F H L +P + + + +L P Y+ + I+ G + L L A + G +
Sbjct: 281 FFFAHGLLNIPLFLLVMI-LLSRPH---YNVNLKSIDVGGTTLQLPAHVFETGEKKGTII 336
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKT 487
D+G++ TY + + +++ + D + D +C++ V F T
Sbjct: 337 DSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQD---FLCFQYS------GSVDDGFPT 387
Query: 488 LTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS-EVHNGSTIIL-GDISLRGQ 545
+T HF + H+ P Y + C+G +G+ + +G I+L GD+ L +
Sbjct: 388 ITFHFEDDLAL-----HVYPHEYFFPNGNDIYCVGFQNGALQSKDGKDIVLMGDLVLSNK 442
Query: 546 LVVYDNVNKRIGWAKSHCMNPGRFK 570
LVVYD N+ IGW +C + + K
Sbjct: 443 LVVYDLENQVIGWTDYNCSSSIKIK 467
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 121/429 (28%), Positives = 191/429 (44%), Gaps = 48/429 (11%)
Query: 161 NDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRPYYLD 218
N +R H + +L+++ + PL G P GLY+T + +G PP+ +Y+
Sbjct: 53 NISALRAHDGTRHGRLLAT--------ADLPLGGLGLPTDTGLYYTEVRLGTPPKRFYVQ 104
Query: 219 MDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNILPYKDSLCMEIQRNHKPG 270
+DTGSD+ W+ C C C + LY P+ G+ + C + P
Sbjct: 105 VDTGSDILWVNC-ITCDQCPHKSGLGLDLTLYDPKASSTGSTVMCDQGFCADTFGGRLP- 162
Query: 271 YCETCQQCDYEIEYADHSSSMGVLARDELHLTIENG-SLTKP---NVVFGCAYDQQGLLL 326
C C+Y + Y D SS++G D L G T+P +V+FGC Q G L
Sbjct: 163 KCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGDGQTQPANASVIFGCGAQQGGDLG 222
Query: 327 NTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMA 386
++ DGILG A S+ SQLA+ G +K + HCL T GGG +F D+V
Sbjct: 223 SSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLDTIKGGG--IFAIGDVVQPKVKT 280
Query: 387 WVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG---WALFDTGSSYTYFTKQAYSE 443
+ D P Y+ + I+ G + L L A + G + D+G++ TY + + +
Sbjct: 281 TPLVADKPH---YNVNLKTIDVGGTTLELPADIFKPGEKRGTIIDSGTTLTYLPELVFKK 337
Query: 444 LIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKF 503
++ ++ D D D +C+ V F TLT HF +
Sbjct: 338 VMLAVFNKHQDITFHDVQD---FLCFEYSGS------VDDGFPTLTFHFEDDLAL----- 383
Query: 504 HISPEGYLVISKKGNICLGILDGS-EVHNGSTIIL-GDISLRGQLVVYDNVNKRIGWAKS 561
H+ P Y + C+G +G+ + +G I+L GD+ L +LVVYD N+ IGW
Sbjct: 384 HVYPHEYFFPNGNDVYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVVYDLENRVIGWTDY 443
Query: 562 HCMNPGRFK 570
+C + + K
Sbjct: 444 NCSSSIKIK 452
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 119/390 (30%), Positives = 187/390 (47%), Gaps = 36/390 (9%)
Query: 185 DSSSIFPLRGNIYPD----GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKG 240
DS S+ R +Y D G Y T + +G PP+ + L +D+GS +T++ C + C C K
Sbjct: 72 DSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKH 130
Query: 241 ANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELH 300
+P ++P M + Y+ C N + +QC YE EYA+HSSS GVL D +
Sbjct: 131 QDPKFQPEMSST--YQPVKC-----NMDCNCDDDREQCVYEREYAEHSSSKGVLGEDLIS 183
Query: 301 LTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVG 360
E+ LT VFGC + G L + + DGI+GL + +SL QL +G+I N G
Sbjct: 184 FGNES-QLTPQRAVFGCETVETGDLYSQ--RADGIIGLGQGDLSLVDQLVDKGLISNSFG 240
Query: 361 HCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR-- 418
C GGG M LG PS M + D Y+ ++ I L+L +R
Sbjct: 241 LCYGGMDVGGGSMILGGFDYPS-DMVFTDS-DPDRSPYYNIDLTGIRVAGKQLSLHSRVF 298
Query: 419 NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP-VCWRAKFPIRS 477
+ + G A+ D+G++Y Y A++ ++ S +D DP C++
Sbjct: 299 DGEHG-AVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAAS-NY 356
Query: 478 IVDVKQFFKTLTLHF--GSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGS 533
+ ++ + F ++ + F G W +SPE Y+ K G CLG+ + H
Sbjct: 357 VSELSKIFPSVEMVFKSGQSWL-------LSPENYMFRHSKVHGAYCLGVFPNGKDH--- 406
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
T +LG I +R LVVYD N ++G+ +++C
Sbjct: 407 TTLLGGIVVRNTLVVYDRENSKVGFWRTNC 436
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 119/432 (27%), Positives = 200/432 (46%), Gaps = 51/432 (11%)
Query: 161 NDGI----IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRP 214
NDG+ +R S +++++ S VD FP++G P GLY+T + +G PPR
Sbjct: 34 NDGVELSELRARDSLRHRRMLQSTNYVVD----FPVKGTFDPSQVGLYYTKVKLGTPPRE 89
Query: 215 YYLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPRMGNILPYK---DSLCMEIQRN 266
+Y+ +DTGSD+ W+ C + C+ C + + + PR + D C +
Sbjct: 90 FYVQIDTGSDVLWVSCGS-CNGCPQTSGLQIQLNYFDPRSSSTSSLISCSDRRCRSGVQT 148
Query: 267 HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLT-IENGSLT---KPNVVFGCAYDQQ 322
QC Y +Y D S + G D +H I G+LT +VVFGC+ Q
Sbjct: 149 SDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGTLTTNSSASVVFGCSILQT 208
Query: 323 GLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPS 382
G L + DGI G + +S+ SQL+ QGI V HCL + GGG + LG + P+
Sbjct: 209 GDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCLKGDNSGGGVLVLGEIVEPN 268
Query: 383 WGMAWVPMLDSPFMELYHTEILKINYGSSPL--NLGARNSQVGWALFDTGSSYTYFTKQA 440
+ + P++ S + + + +N P+ + A ++ G + D+G++ Y ++A
Sbjct: 269 --IVYSPLVQSQPHYNLNLQSISVNGQIVPIAPAVFATSNNRG-TIVDSGTTLAYLAEEA 325
Query: 441 YSELIASLKEVSSDGL--VLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQI 498
Y+ + ++ + + VL + C+ S VD+ F ++L+F +
Sbjct: 326 YNPFVNAITALVPQSVRSVLSRGNQ----CY--LITTSSNVDI---FPQVSLNFAGGASL 376
Query: 499 VSTKFHISPEGYLV----ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNK 554
V + P+ YL+ I + C+G + S ILGD+ L+ ++ VYD +
Sbjct: 377 V-----LRPQDYLMQQNYIGEGSVWCIGF---QRIPGQSITILGDLVLKDKIFVYDLAGQ 428
Query: 555 RIGWAKSHCMNP 566
RIGWA C P
Sbjct: 429 RIGWANYDCSLP 440
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 114/395 (28%), Positives = 174/395 (44%), Gaps = 44/395 (11%)
Query: 190 FPLRGNI--YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC----AKGANP 243
FP++G+ Y GLYFT + +G+PP + + +DTGSD+ W+ C + CS+C G +
Sbjct: 86 FPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPHSSGLGIDL 144
Query: 244 LYKPRMGNI----LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
+ G+ + D +C + + C QC Y Y D S + G D
Sbjct: 145 HFFDAPGSFTAGSVTCSDPICSSVFQT-TAAQCSENNQCGYSFRYGDGSGTSGYYMTDTF 203
Query: 300 HLTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII 355
+ G N +VFGC+ Q G L + DGI G + K+S+ SQL+S+GI
Sbjct: 204 YFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGIT 263
Query: 356 KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL 415
V HCL + GGG LG LVP GM + P+L P Y+ +L I L +
Sbjct: 264 PPVFSHCLKGDGSGGGVFVLGEILVP--GMVYSPLL--PSQPHYNLNLLSIGVNGQILPI 319
Query: 416 GA---RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
A S + DTG++ TY K+AY + ++ S + L S+ C+
Sbjct: 320 DAAVFEASNTRGTIVDTGTTLTYLVKEAYDPFLNAISNSVSQLVTLIISNGEQ--CYLVS 377
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYL----VISKKGNICLGILDGSE 528
+ F ++L+F ++ + P+ YL C+G E
Sbjct: 378 ------TSISDMFPPVSLNFAGGASMM-----LRPQDYLFHYGFYDGASMWCIGFQKAPE 426
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
ILGD+ L+ ++ VYD +RIGWA C
Sbjct: 427 EQT----ILGDLVLKDKVFVYDLARQRIGWANYDC 457
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 124/439 (28%), Positives = 208/439 (47%), Gaps = 65/439 (14%)
Query: 161 NDGI----IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRP 214
NDG+ +R S +++++ S VD FP++G P GLY+T + +G PPR
Sbjct: 34 NDGVELSELRARDSLRHRRMLQSTNYVVD----FPVKGTFDPSQVGLYYTKVKLGTPPRE 89
Query: 215 YYLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPRMGNILPYKDSLCMEIQ-RNHK 268
Y+ +DTGSD+ W+ C + C+ C + + + P G+ C++ + R+
Sbjct: 90 LYVQIDTGSDVLWVSCGS-CNGCPQTSGLQIQLNYFDP--GSSSTSSLISCLDRRCRSGV 146
Query: 269 PGYCETC----QQCDYEIEYADHSSSMGVLARDELHL-TIENGSLT---KPNVVFGCAYD 320
+C QC Y +Y D S + G D +H +I G+LT +VVFGC+
Sbjct: 147 QTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTTNSSASVVFGCSIL 206
Query: 321 QQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLV 380
Q G L + DGI G + +S+ SQL+SQGI V HCL + GGG + LG +
Sbjct: 207 QTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKGDNSGGGVLVLGEIVE 266
Query: 381 PSWGMAWVPMLDS-PFMEL------YHTEILKINYGSSPLNLGARNSQVGWALFDTGSSY 433
P+ + + P++ S P L + +I++I +P N++ + D+G++
Sbjct: 267 PN--IVYSPLVPSQPHYNLNLQSISVNGQIVRI----APSVFATSNNR--GTIVDSGTTL 318
Query: 434 TYFTKQAYSELIASLKEVSSDGL--VLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLH 491
Y ++AY+ + ++ V + VL + C+ S VD+ F ++L+
Sbjct: 319 AYLAEEAYNPFVIAIAAVIPQSVRSVLSRGNQ----CYL--ITTSSNVDI---FPQVSLN 369
Query: 492 FGSKWQIVSTKFHISPEGYLV----ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLV 547
F +V + P+ YL+ I + C+G ++ S ILGD+ L+ ++
Sbjct: 370 FAGGASLV-----LRPQDYLMQQNFIGEGSVWCIGF---QKISGQSITILGDLVLKDKIF 421
Query: 548 VYDNVNKRIGWAKSHCMNP 566
VYD +RIGWA C P
Sbjct: 422 VYDLAGQRIGWANYDCSLP 440
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 115/401 (28%), Positives = 177/401 (44%), Gaps = 40/401 (9%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
PL GN P GLYFT + +G P + YY+ +DTGSD+ W+ C C SC + +
Sbjct: 75 LPLGGNGIPTDTGLYFTQIGIGTPSKGYYVQVDTGSDILWVNC-ISCDSCPRKSGLGIDL 133
Query: 243 PLYKPRMG---NILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
LY P + C P C C Y I Y D SS+ G D L
Sbjct: 134 TLYDPTASASSKTVTCGQEFCATATNGGVPPSCAANSPCQYSITYGDGSSTTGFFVADFL 193
Query: 300 HLTIENG----SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII 355
+G +L +V FGC G L ++ V DGILG +A S+ SQL S G +
Sbjct: 194 QYDQVSGDGQTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKV 253
Query: 356 KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL 415
+ HCL T GGG +G+ + P + P++ P M Y+ + I+ G S L L
Sbjct: 254 TKIFSHCLDT-VNGGGIFAIGNVVQPK--VKTTPLV--PGMPHYNVVLKTIDVGGSTLQL 308
Query: 416 GARNSQVGWA----LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRA 471
+G + D+G++ Y + Y +++++ D + + D +C++
Sbjct: 309 PTNIFDIGGGSRGTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQD---FLCFQY 365
Query: 472 KFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS-EVH 530
V F +T HF +V + P YL + + C+G G +
Sbjct: 366 SG------SVDNGFPEVTFHFDGDLPLV-----VYPHDYLFQNTEDVYCVGFQSGGVQSK 414
Query: 531 NG-STIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFK 570
+G ++LGD++L +LVVYD N+ IGW +C + + K
Sbjct: 415 DGKDMVLLGDLALSNKLVVYDLENQVIGWTNYNCSSSIKIK 455
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 120/429 (27%), Positives = 194/429 (45%), Gaps = 48/429 (11%)
Query: 161 NDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRPYYLD 218
N +R H + +L+++ + PL G P GLY+T + +G PP+ YY+
Sbjct: 51 NISALRAHDGTRHGRLLAA--------ADLPLGGLGLPTDTGLYYTEIKLGTPPKHYYVQ 102
Query: 219 MDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNILPYKDSLCMEIQRNHKPG 270
+DTGSD+ W+ C C C + LY P+ G+++ + C P
Sbjct: 103 VDTGSDILWVNC-ITCEQCPHKSGLGLDLTLYDPKASSTGSMVMCDQAFCAATFGGKLP- 160
Query: 271 YCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGSLTKP---NVVFGCAYDQQGLLL 326
C C+Y + Y D SS++G D L + T+P +V+FGC Q G L
Sbjct: 161 KCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQTQPANASVIFGCGAQQGGDLG 220
Query: 327 NTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMA 386
++ DGILG A S+ SQL + G +K + HCL T GGG +F D+V
Sbjct: 221 SSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDTIKGGG--IFSIGDVVQPKVKT 278
Query: 387 WVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG---WALFDTGSSYTYFTKQAYSE 443
+ D P Y+ + I+ G + L L A + G + D+G++ TY + + E
Sbjct: 279 TPLVADKPH---YNVNLKTIDVGGTTLQLPAHIFEPGEKKGTIIDSGTTLTYLPELVFKE 335
Query: 444 LIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKF 503
++ ++ D + D +C++ +P V F T+T HF +
Sbjct: 336 VMLAVFNKHQD---ITFHDVQGFLCFQ--YP----GSVDDGFPTITFHFEDDLAL----- 381
Query: 504 HISPEGYLVISKKGNICLGILDG-SEVHNGSTIIL-GDISLRGQLVVYDNVNKRIGWAKS 561
H+ P Y + C+G +G S+ +G I+L GD+ L +LV+YD N+ IGW
Sbjct: 382 HVYPHEYFFANGNDVYCVGFQNGASQSKDGKDIVLMGDLVLSNKLVIYDLENRVIGWTDY 441
Query: 562 HCMNPGRFK 570
+C + + K
Sbjct: 442 NCSSSIKIK 450
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 114/387 (29%), Positives = 183/387 (47%), Gaps = 49/387 (12%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
L ++ +G Y T + +G PP+ + L +D+GS +T++ C A C C +P ++P
Sbjct: 79 LHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQP---- 133
Query: 252 ILPYKDSLCMEIQRNHKPGYCE---TC----QQCDYEIEYADHSSSMGVLARDELHLTIE 304
++ ++ P C TC +QC YE +YA+ SSS GVL D + E
Sbjct: 134 ----------DLSSSYSPVKCNVDCTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRE 183
Query: 305 NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
+ L VFGC + G L + DGI+GL R ++S+ QL +G+I + C
Sbjct: 184 S-ELKPQRAVFGCENSETGDLFSQ--HADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYG 240
Query: 365 TNAGGGGYMFLGHDLVPS-WGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR--NSQ 421
GGG M LG PS + L SP+ Y+ E+ +I+ L + +R NS+
Sbjct: 241 GMDIGGGAMVLGGVPAPSDMVFSHSDPLRSPY---YNIELKEIHVAGKALRVDSRVFNSK 297
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP-VCWRAKFPIRSIVD 480
G L D+G++Y Y +QA+ ++ + DP +C+ R++
Sbjct: 298 HGTVL-DSGTTYAYLPEQAFVAFKDAVTSKVHSLKKIRGPDPNYKDICFAGAG--RNVSK 354
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGS--TII 536
+ + F + + FG+ K ++PE YL K G CLG+ NG T +
Sbjct: 355 LHEVFPDVDMVFGN-----GQKLSLTPENYLFRHSKVDGAYCLGVF-----QNGKDPTTL 404
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
LG I +R LV YD N++IG+ K++C
Sbjct: 405 LGGIIVRNTLVTYDRHNEKIGFWKTNC 431
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 117/389 (30%), Positives = 184/389 (47%), Gaps = 34/389 (8%)
Query: 185 DSSSIFPLRGNIYPD----GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKG 240
DS S+ R +Y D G Y T + +G PP+ + L +D+GS +T++ C + C C K
Sbjct: 73 DSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKH 131
Query: 241 ANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELH 300
+P ++P + + Y+ C N + +QC YE EYA+HSSS GVL D +
Sbjct: 132 QDPKFQPELSST--YQPVKC-----NMDCNCDDDKEQCVYEREYAEHSSSKGVLGEDLIS 184
Query: 301 LTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVG 360
E+ LT VFGC + G L + + DGI+GL + +SL QL +G+I N G
Sbjct: 185 FGNES-QLTPQRAVFGCETVETGDLYSQ--RADGIIGLGQGDLSLVDQLVDKGLISNSFG 241
Query: 361 HCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR-- 418
C GGG M LG PS M + D Y+ ++ I L+L +R
Sbjct: 242 LCYGGMDVGGGSMILGGFDYPS-DMIFTDS-DPDRSPYYNIDLTGIRVAGKKLSLNSRVF 299
Query: 419 NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSI 478
+ + G A+ D+G++Y Y A++ ++ S +D DP +
Sbjct: 300 DGEHG-AVLDSGTTYAYLPDAAFAAFEEAVMREVSPLKQIDGPDPNFKDTCFLVAASNDV 358
Query: 479 VDVKQFFKTLTLHF--GSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGST 534
++ + F ++ + F G W +SPE Y+ K G CLG+ + H T
Sbjct: 359 SELSKIFPSVEMIFKSGQSWL-------LSPENYMFRHSKVHGAYCLGVFPNGKDH---T 408
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+LG I +R LVVYD N ++G+ +++C
Sbjct: 409 TLLGGIVVRNTLVVYDRENSKVGFWRTNC 437
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 121/420 (28%), Positives = 195/420 (46%), Gaps = 52/420 (12%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGN--IYPDGLYFTYMIVGNPPRPYYLDMDTG 222
+R H + + +L++ A+D PL G+ GLYFT + +G P + YY+ +DTG
Sbjct: 59 LREHDGRRHGRLLA----AID----LPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTG 110
Query: 223 SDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCET 274
SD+ W+ C C C + +N +Y PR G ++ C+ P C +
Sbjct: 111 SDILWVNC-VSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCVANYGGVLPS-CTS 168
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENG-SLTKP---NVVFGCAYDQQGLLLNTLV 330
C+Y I Y D SS+ G D L +G T P +V FGC G L ++ +
Sbjct: 169 TSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNL 228
Query: 331 KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM 390
DGILG ++ S+ SQLA+ G ++ + HCL T GGG +G+ + P + P+
Sbjct: 229 ALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDT-VNGGGIFAIGNVVQPK--VKTTPL 285
Query: 391 LDSPFMELYHTEILKINYGSSPLNL-----GARNSQVGWALFDTGSSYTYFTKQAYSELI 445
+ P M Y+ + I+ G + L L + NS+ + D+G++ Y + Y L
Sbjct: 286 V--PDMPHYNVILKGIDVGGTALGLPTNIFDSGNSK--GTIIDSGTTLAYVPEGVYKALF 341
Query: 446 ASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHI 505
A + + D V D + C++ V F +T HF ++ +
Sbjct: 342 AMVFDKHQDISVQTLQDFS---CFQYSGS------VDDGFPEVTFHFEGDVSLI-----V 387
Query: 506 SPEGYLVISKKGNICLGILDGS-EVHNG-STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
SP YL + K C+G +G + +G ++LGD+ L +LV+YD N+ IGWA +C
Sbjct: 388 SPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYDLENQAIGWADYNC 447
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 107/420 (25%), Positives = 195/420 (46%), Gaps = 46/420 (10%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYP--DGLYFTYMIVGNPPRPYYLDMDTG 222
+R +++++ S++ VD F ++G P GLY+T + +G PP + + +DTG
Sbjct: 43 LRARDELRHRRMLQSSSGVVD----FSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTG 98
Query: 223 SDLTWIQCDAPCSSCAKGAN-----PLYKP---RMGNILPYKDSLCMEIQRNHKPGYCET 274
SD+ W+ C++ C+ C + + + P +++ D C +++
Sbjct: 99 SDVLWVSCNS-CNGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGKQSSDATCSSQ 157
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHL-TIENGSLTKPN---VVFGCAYDQQGLLLNTLV 330
QC Y +Y D S + G D +HL TI GS+T + VVFGC+ Q G L +
Sbjct: 158 NNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTNSTAPVVFGCSNQQTGDLTKSDR 217
Query: 331 KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM 390
DGI G + ++S+ SQL+SQGI + HCL ++ GGG + LG + P+ + + +
Sbjct: 218 AVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGDSSGGGILVLGEIVEPN--IVYTSL 275
Query: 391 LDSPFMELYHTEILKINYGSSPLNLGAR---NSQVGWALFDTGSSYTYFTKQAYSELIAS 447
+ P Y+ + I+ L + + S + D+G++ Y ++AY +++
Sbjct: 276 V--PAQPHYNLNLQSISVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSA 333
Query: 448 LKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISP 507
+ + + + + S+ DV F ++L+F ++ + P
Sbjct: 334 ITAA-----IPQSVRTVVSRGNQCYLITSSVTDV---FPQVSLNFAGGASMI-----LRP 380
Query: 508 EGYLV----ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ YL+ I C+G ++ ILGD+ L+ ++VVYD +RIGWA C
Sbjct: 381 QDYLIQQNSIGGAAVWCIGF---QKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDC 437
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 112/422 (26%), Positives = 192/422 (45%), Gaps = 50/422 (11%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYP--DGLYFTYMIVGNPPRPYYLDMDTG 222
+R + +++++ S+ VD F ++G P GLY+T + +G PP + + +DTG
Sbjct: 40 LRARDALRHRRMLQSSNGVVD----FSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTG 95
Query: 223 SDLTWIQCDAPCSSCAKGAN-----PLYKP---RMGNILPYKDSLCMEIQRNHKPGYCET 274
SD+ W+ C++ CS C + + + P +++ D C ++
Sbjct: 96 SDVLWVSCNS-CSGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQ 154
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHL-TIENGSLTKPN---VVFGCAYDQQGLLLNTLV 330
QC Y +Y D S + G D +HL TI GS+T + VVFGC+ Q G L +
Sbjct: 155 NNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNSTAPVVFGCSNQQTGDLTKSDR 214
Query: 331 KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM 390
DGI G + ++S+ SQL+SQGI V HCL ++ GGG + LG + P+ + + +
Sbjct: 215 AVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGGGILVLGEIVEPN--IVYTSL 272
Query: 391 LDSPFMELYHTEILKINYGSSPLNLGAR---NSQVGWALFDTGSSYTYFTKQAYSELIAS 447
+ P Y+ + I L + + S + D+G++ Y ++AY +++
Sbjct: 273 V--PAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSA 330
Query: 448 LKEVSSDGLVLDASDP--TLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHI 505
+ AS P V R V + F ++L+F ++ +
Sbjct: 331 IT----------ASIPQSVHTVVSRGNQCYLITSSVTEVFPQVSLNFAGGASMI-----L 375
Query: 506 SPEGYLV----ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKS 561
P+ YL+ I C+G ++ ILGD+ L+ ++VVYD +RIGWA
Sbjct: 376 RPQDYLIQQNSIGGAAVWCIGF---QKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANY 432
Query: 562 HC 563
C
Sbjct: 433 DC 434
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 119/407 (29%), Positives = 186/407 (45%), Gaps = 52/407 (12%)
Query: 175 KLVSSNAVAVDS---SSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCD 231
+L SS V D S+ L ++ +G Y T + +G PP+ + L +D+GS +T++ C
Sbjct: 55 RLASSRRVLGDGGRPSARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPC- 113
Query: 232 APCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCE---TCQ----QCDYEIEY 284
A C C +P ++P ++ + P C TC QC YE +Y
Sbjct: 114 ASCEQCGNHQDPRFQP--------------DLSSTYSPVKCSADCTCDSDKSQCTYERQY 159
Query: 285 ADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVS 344
A+ SSS GVL D + E+ L VFGC + G L + DGI+GL R ++S
Sbjct: 160 AEMSSSSGVLGEDIVSFGTES-ELKPQRAVFGCENSETGDLFSQ--HADGIMGLGRGQLS 216
Query: 345 LPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGH-DLVPSWGMAWVPMLDSPFMELYHTEI 403
+ QL +G+I + C GGG M LG P + + SP+ Y+ E+
Sbjct: 217 IMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAPPDMVFSRSDPVRSPY---YNIEL 273
Query: 404 LKINYGSSPLNLGAR--NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDAS 461
+I+ L L R +S+ G + D+G++Y Y +QA+ ++ +
Sbjct: 274 KEIHVAGKALRLDPRIFDSKHG-TVLDSGTTYAYLPEQAFVAFKDAVTSKVRPLKKIRGP 332
Query: 462 DPTLP-VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GN 518
DP +C+ R++ + Q F + + FG K +SPE YL K G
Sbjct: 333 DPNYKDICFAGAG--RNVSQLSQAFPDVDMVFGD-----GQKLSLSPENYLFRHSKVEGA 385
Query: 519 ICLGILDGSEVHNGS--TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
CLG+ NG T +LG I +R LV YD N++IG+ K++C
Sbjct: 386 YCLGVF-----QNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 427
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 120/431 (27%), Positives = 197/431 (45%), Gaps = 53/431 (12%)
Query: 163 GIIRPHKSKINK----------KLVSSNAVAVDSSSIFPLRGNIYP--DGLYFTYMIVGN 210
GI +K K++K +++ S+ V V FP++G P GLY+T + +G
Sbjct: 4 GITANYKLKLSKLKERDRVRHGRMLQSSGVGVVD---FPVQGTFDPFLVGLYYTRLQLGT 60
Query: 211 PPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN---PL--YKPR---MGNILPYKDSLCME 262
PPR +Y+ +DTGSD+ W+ C + C+ C + PL + P +++ D C
Sbjct: 61 PPRDFYVQIDTGSDVLWVSCGS-CNGCPVNSGLHIPLNFFDPGSSPTASLISCSDQRCSL 119
Query: 263 IQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGSLTKPN---VVFGCA 318
++ C Y +Y D S + G D LH T+ GS+ + +VFGC+
Sbjct: 120 GLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMNNSSAPIVFGCS 179
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
Q G L + DGI G + +S+ SQLASQGI HCL + GGG + LG
Sbjct: 180 ALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDDSGGGILVLGEI 239
Query: 379 LVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL-----GARNSQVGWALFDTGSSY 433
+ P+ + + P++ P Y+ + I+ L + G +SQ + D+G++
Sbjct: 240 VEPN--IVYTPLV--PSQPHYNLNMQSISVNGQTLAIDPSVFGTSSSQ--GTIIDSGTTL 293
Query: 434 TYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFG 493
Y + AY I+++ + S + P L SI D+ F ++L+F
Sbjct: 294 AYLAEAAYDPFISAITSIVSPSV-----RPYLSKGNHCYLISSSINDI---FPQVSLNFA 345
Query: 494 SKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNV 552
++ + P+ YL+ S G L + ++ ILGD+ L+ ++ VYD
Sbjct: 346 GGASMI-----LIPQDYLIQQSSIGGAALWCIGFQKIQGQGITILGDLVLKDKIFVYDIA 400
Query: 553 NKRIGWAKSHC 563
N+RIGWA C
Sbjct: 401 NQRIGWANYDC 411
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 123/419 (29%), Positives = 189/419 (45%), Gaps = 49/419 (11%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTG 222
++ H ++ +++++S AVD PL GN +P GLYF + +GNPP+ YY+ +DTG
Sbjct: 51 LKQHDARRHRRILS----AVD----LPLGGNGHPAEAGLYFAKIGLGNPPKDYYVQVDTG 102
Query: 223 SDLTWIQCDAPCSSCAKGAN-----PLYKPRM---GNILPYKDSLCMEIQRNHKPGYCET 274
SD+ W+ C A C C ++ LY P+ + D C G C
Sbjct: 103 SDILWVNC-ANCDKCPTKSDLGVKLTLYDPQSSTSATRIYCDDDFCAATYNGVLQG-CTK 160
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENGSL----TKPNVVFGCAYDQQGLLLNTLV 330
C Y + Y D SS+ G +D L G+L +V+FGC Q G L +
Sbjct: 161 DLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTSSANGSVIFGCGAKQSGELGTSSE 220
Query: 331 KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM 390
DGILG +A S+ SQLA+ G +K V HCL N GGG +G + P + PM
Sbjct: 221 ALDGILGFGQANSSMISQLAAAGKVKRVFAHCL-DNVKGGGIFAIGEVVSPK--VNTTPM 277
Query: 391 LDSPFMELYHTEILKINYGSSPLNLGARNSQVG---WALFDTGSSYTYFTKQAYSELIAS 447
+ P Y+ + +I G + L L G + D+G++ Y + Y ++
Sbjct: 278 V--PNQPHYNVVMKEIEVGGNVLELPTDIFDTGDRRGTIIDSGTTLAYLPEVVYESMMT- 334
Query: 448 LKEVSSD-GLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHIS 506
K VS GL L + C++ +V + F + HF S ++
Sbjct: 335 -KIVSEQPGLKLHTVEEQF-TCFQYTG------NVNEGFPVVKFHFNG-----SLSLTVN 381
Query: 507 PEGYLVISKKGNICLGILD-GSEVHNGSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
P YL + C G + G + +G + +LGD+ L +LV+YD N+ IGW +C
Sbjct: 382 PHDYLFQIHEEVWCFGWQNSGMQSKDGRDMTLLGDLVLSNKLVLYDLENQAIGWTDYNC 440
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 142 bits (357), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 114/394 (28%), Positives = 179/394 (45%), Gaps = 39/394 (9%)
Query: 190 FPLRGNIYP--DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC--AKGAN--- 242
FP+ G+ P GLYFT + +G+PP+ Y++ +DTGSD+ W+ C +PC+ C + G N
Sbjct: 77 FPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQL 135
Query: 243 PLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARD 297
+ P + +P D C + + C+T C Y Y D S + G D
Sbjct: 136 EFFNPDTSSTSSKIPCSDDRCTAALQTSE-AVCQTSDNSPCGYTFTYGDGSGTSGYYVSD 194
Query: 298 ELHLTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQG 353
++ G+ N +VFGC+ Q G L T DGI G + ++S+ SQL S G
Sbjct: 195 TMYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLG 254
Query: 354 IIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPL 413
+ V HCL + GGG + LG + P G+ + P++ S + E + +N P+
Sbjct: 255 VSPKVFSHCLKGSDNGGGILVLGEIVEP--GLVYTPLVPSQPHYNLNLESIVVNGQKLPI 312
Query: 414 NLGA-RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
+ S + D+G++ Y AY + ++ S S +L
Sbjct: 313 DSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVS------PSVRSLVSKGNQC 366
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV--ISKKGNICLGILDGSEVH 530
F S VD F T++L+F + + PE YL+ S N+ I G + +
Sbjct: 367 FVTSSSVDSS--FPTVSLYF-----MGGVAMTVKPENYLLQQASIDNNVLWCI--GWQRN 417
Query: 531 NGSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G I ILGD+ L+ ++ VYD N R+GW C
Sbjct: 418 QGQQITILGDLVLKDKIFVYDLANMRMGWTDYDC 451
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 141 bits (356), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 122/425 (28%), Positives = 192/425 (45%), Gaps = 56/425 (13%)
Query: 169 KSKINKKLVSSNAVAVDSSSIFPLRGNIYP----------DGLYFTYMIVGNPPRPYYLD 218
+ + ++ L SS VD FP++G P LY+T + +G+PPR +Y+
Sbjct: 51 RVRHSRMLQSSGGGVVD----FPVQGTFDPFLVGFYFGSFCRLYYTRLQLGSPPRDFYVQ 106
Query: 219 MDTGSDLTWIQCDAPCSSCAKGAN---PL--YKPR---MGNILPYKDSLCMEIQRNHKPG 270
+DTGSD+ W+ C + C+ C + PL + P +++ D C ++
Sbjct: 107 IDTGSDVLWVSCSS-CNGCPVSSGLHIPLNFFDPGSSPTASLISCSDQRCSLGLQSSDSV 165
Query: 271 YCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGSLTKPN---VVFGCAYDQQGLLL 326
QC Y +Y D S + G D LH TI GS+ K + +VFGC+ Q G L
Sbjct: 166 CAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNSSAPIVFGCSTLQTGDLT 225
Query: 327 NTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMA 386
DGI G + +S+ SQLASQGI V HCL + GGG + LG + P+ +
Sbjct: 226 KPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDDSGGGILVLGEIVEPN--IV 283
Query: 387 WVPMLDSPFMELYHTEILKINYGSSPLNLG----ARNSQVGWALFDTGSSYTYFTKQAYS 442
+ P++ P Y+ + I L + A +S G + D+G++ Y T+ AY
Sbjct: 284 YTPLV--PSQPHYNLNLQSIYVNGQTLAIDPSVFATSSNQG-TIIDSGTTLAYLTEAAYD 340
Query: 443 ELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTK 502
I+++ S + P L + SI DV F ++L+F T
Sbjct: 341 PFISAITSTVSPSV-----SPYLSKGNQCYLTSSSINDV---FPQVSLNFAG-----GTS 387
Query: 503 FHISPEGYLV----ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGW 558
+ P+ YL+ I+ C+G ++ ILGD+ L+ ++ VYD +RIGW
Sbjct: 388 MILIPQDYLIQQSSINGAALWCVGF---QKIQGQEITILGDLVLKDKIFVYDIAGQRIGW 444
Query: 559 AKSHC 563
A C
Sbjct: 445 ANYDC 449
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 141 bits (356), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 107/391 (27%), Positives = 170/391 (43%), Gaps = 52/391 (13%)
Query: 197 YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYK 256
Y GLYFT + +G+PPR + + +DTGSD+ W+ C++ C++C + + +G L +
Sbjct: 61 YLVGLYFTKVKLGSPPREFNVQIDTGSDVLWVCCNS-CNNCPRTSG------LGIQLNFF 113
Query: 257 DS--------------LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLT 302
DS +C + QC Y +Y D S + G D L+
Sbjct: 114 DSSSSSTAGQVRCSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFD 173
Query: 303 IENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNV 358
G N +VFGC+ Q G L T DGI G + ++S+ SQL+++GI V
Sbjct: 174 AILGQSLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRV 233
Query: 359 VGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSS-----PL 413
HCL + GGG + LG L P G+ + P++ P Y+ +L I P
Sbjct: 234 FSHCLKGDGSGGGILVLGEILEP--GIVYSPLV--PSQPHYNLNLLSIAVNGQLLPIDPA 289
Query: 414 NLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKF 473
NSQ + D+G++ Y +AY ++++ + S P+ +
Sbjct: 290 AFATSNSQ--GTIVDSGTTLAYLVAEAYDPFVSAVNAI--------VSPSVTPITSKGNQ 339
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNG 532
V Q F + +F +V + PE YL+ G + + +V
Sbjct: 340 CYLVSTSVSQMFPLASFNFAGGASMV-----LKPEDYLIPFGSSGGSAMWCIGFQKVQG- 393
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
ILGD+ L+ ++ VYD V +RIGWA C
Sbjct: 394 -VTILGDLVLKDKIFVYDLVRQRIGWANYDC 423
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 108/388 (27%), Positives = 178/388 (45%), Gaps = 38/388 (9%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGN 251
GLY+T + +G PP+ Y++ +DTGSD+ W+ C C+ C + ++ LY P+ G+
Sbjct: 81 GLYYTEIEIGTPPKQYHVQVDTGSDILWVNC-ISCNKCPRKSDLGIDLRLYDPKGSSSGS 139
Query: 252 ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS---- 307
+ C PG C C+Y + Y D SS+ G D L +G
Sbjct: 140 TVSCDQKFCAATYGGKLPG-CAKNIPCEYSVMYGDGSSTTGYFVSDSLQYNQVSGDGQTR 198
Query: 308 LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
+V+FGC Q G L +T DGI+G ++ S+ SQLA+ G +K + HCL T
Sbjct: 199 HANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCLDTIK 258
Query: 368 GGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG---W 424
GGG +F D+V + P++ P M Y+ + IN G + L L + + G
Sbjct: 259 GGG--IFAIGDVVQP-KVKSTPLV--PDMPHYNVNLESINVGGTTLQLPSHMFETGEKKG 313
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+ D+G++ TY + Y +++A++ D D +C I+ V
Sbjct: 314 TIIDSGTTLTYLPELVYKDVLAAVFAKHPDTTFHSVQD---FLC------IQYFQSVDDG 364
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS-EVHNG-STIILGDISL 542
F +T HF + ++ P Y + C G +G + +G ++LGD+ L
Sbjct: 365 FPKITFHFEDDLGL-----NVYPHDYFFQNGDNLYCFGFQNGGLQSKDGKDMVLLGDLVL 419
Query: 543 RGQLVVYDNVNKRIGWAKSHCMNPGRFK 570
++VVYD N+ +GW +C + + K
Sbjct: 420 SNKVVVYDLENQVVGWTDYNCSSSIKIK 447
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 115/396 (29%), Positives = 181/396 (45%), Gaps = 43/396 (10%)
Query: 190 FPLRGNIYP--DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC--AKGAN--- 242
FP+ G+ P GLYFT + +G+PP+ Y++ +DTGSD+ W+ C +PC+ C + G N
Sbjct: 77 FPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQL 135
Query: 243 PLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARD 297
+ P + +P D C + + C+T C Y Y D S + G D
Sbjct: 136 EFFNPDTSSTSSKIPCSDDRCTAALQTSE-AVCQTSDNSPCGYTFTYGDGSGTSGYYVSD 194
Query: 298 ELHLTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQG 353
++ G+ N +VFGC+ Q G L T DGI G + ++S+ SQL S G
Sbjct: 195 TMYFDSVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLG 254
Query: 354 IIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPL 413
+ V HCL + GGG + LG + P G+ + P++ S + E + +N P+
Sbjct: 255 VSPKVFSHCLKGSDNGGGILVLGEIVEP--GLVYTPLVPSQPHYNLNLESIVVNGQKLPI 312
Query: 414 N---LGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWR 470
+ N+Q + D+G++ Y AY + ++ S S +L
Sbjct: 313 DSSLFTTSNTQ--GTIVDSGTTLAYLADGAYDPFVNAITAAVS------PSVRSLVSKGN 364
Query: 471 AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV--ISKKGNICLGILDGSE 528
F S VD F T++L+F + + PE YL+ S N+ I G +
Sbjct: 365 QCFVTSSSVDSS--FPTVSLYF-----MGGVAMTVKPENYLLQQASIDNNVLWCI--GWQ 415
Query: 529 VHNGSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ G I ILGD+ L+ ++ VYD N R+GW C
Sbjct: 416 RNQGQQITILGDLVLKDKIFVYDLANMRMGWTDYDC 451
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 120/420 (28%), Positives = 194/420 (46%), Gaps = 52/420 (12%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGN--IYPDGLYFTYMIVGNPPRPYYLDMDTG 222
+R H + + +L++ A+D PL G+ GLYFT + +G P + YY+ +DTG
Sbjct: 59 LREHDGRRHGRLLA----AID----LPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTG 110
Query: 223 SDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCET 274
SD+ W+ C C C + +N +Y PR G ++ C+ P C +
Sbjct: 111 SDILWVNC-VSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCVANYGGVLPS-CTS 168
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENG-SLTKP---NVVFGCAYDQQGLLLNTLV 330
C+Y I Y D SS+ G D L +G T P +V FGC G L ++ +
Sbjct: 169 TSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNL 228
Query: 331 KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM 390
DGILG ++ S+ SQLA+ G ++ + HCL T GGG +G+ + P + P+
Sbjct: 229 ALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDT-VNGGGIFAIGNVVQPK--VKTTPL 285
Query: 391 LDSPFMELYHTEILKINYGSSPLNL-----GARNSQVGWALFDTGSSYTYFTKQAYSELI 445
+ M Y+ + I+ G + L L + NS+ + D+G++ Y + Y L
Sbjct: 286 VSD--MPHYNVILKGIDVGGTALGLPTNIFDSGNSK--GTIIDSGTTLAYVPEGVYKALF 341
Query: 446 ASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHI 505
A + + D V D + C++ V F +T HF ++ +
Sbjct: 342 AMVFDKHQDISVQTLQDFS---CFQYSGS------VDDGFPEVTFHFEGDVSLI-----V 387
Query: 506 SPEGYLVISKKGNICLGILDGS-EVHNG-STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
SP YL + K C+G +G + +G ++LGD+ L +LV+YD N+ IGWA +C
Sbjct: 388 SPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYDLENQAIGWADYNC 447
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 128/429 (29%), Positives = 200/429 (46%), Gaps = 54/429 (12%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTG 222
+R H + +++S AVD L GN P GLYFT + +G+PPR YY+ +DTG
Sbjct: 39 VRAHDVRRRGRILS----AVD----LNLGGNGLPTETGLYFTKLGLGSPPRDYYVQVDTG 90
Query: 223 SDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCET 274
SD+ W+ C CS C + ++ LY P+ +++ C PG C++
Sbjct: 91 SDILWVNC-VECSRCPRKSDLGIDLTLYDPKGSETSDVVSCDQDFCSATFDGPIPG-CKS 148
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENGSL-TKP---NVVFGCAYDQQGLLLNTLV 330
C Y I Y D S++ G +D L NG+L T P +++FGC Q G L ++
Sbjct: 149 EIPCPYSITYGDGSATTGYYVQDYLTYNRINGNLRTSPQNSSIIFGCGAVQSGTLGSSSE 208
Query: 331 KT-DGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVP 389
+ DGI+G +A S+ SQLA+ G +K + HCL N GGG +G + P ++ P
Sbjct: 209 EALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL-DNVRGGGIFAIGEVVEPK--VSTTP 265
Query: 390 MLDSPFMELYHTEILKINYGSSPLNLGAR--NSQVG-WALFDTGSSYTYFTKQAYSELIA 446
++ P M Y+ + I + L L + +S G + D+G++ Y Y ELI
Sbjct: 266 LV--PRMAHYNVVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPDIVYDELIQ 323
Query: 447 SLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV---DVKQFFKTLTLHFGSKWQIVSTKF 503
+ A P L + + + R + +V + F + LHF S
Sbjct: 324 KVL----------ARQPGLKL-YLVEQQFRCFLYTGNVDRGFPVVKLHFKD-----SLSL 367
Query: 504 HISPEGYLVISKKGNICLGILDG-SEVHNGSTI-ILGDISLRGQLVVYDNVNKRIGWAKS 561
+ P YL K G C+G ++ NG + +LGD+ L +LV+YD N IGW
Sbjct: 368 TVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLENMVIGWTDY 427
Query: 562 HCMNPGRFK 570
+C + + K
Sbjct: 428 NCSSSIKVK 436
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 111/400 (27%), Positives = 173/400 (43%), Gaps = 49/400 (12%)
Query: 190 FPLRGNIYP-------DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC----A 238
FP++G+ P LYFT + +G+PP + + +DTGSD+ W+ C + CS+C
Sbjct: 86 FPVQGSSDPYLVGSKMTMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPHSSG 144
Query: 239 KGANPLYKPRMGNI----LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVL 294
G + + G++ + D +C + + C QC Y Y D S + G
Sbjct: 145 LGIDLHFFDAPGSLTAGSVTCSDPICSSVFQT-TAAQCSENNQCGYSFRYGDGSGTSGYY 203
Query: 295 ARDELHLTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLA 350
D + G N +VFGC+ Q G L + DGI G + K+S+ SQL+
Sbjct: 204 MTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLS 263
Query: 351 SQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGS 410
S+GI V HCL + GGG LG LVP GM + P++ P Y+ +L I
Sbjct: 264 SRGITPPVFSHCLKGDGSGGGVFVLGEILVP--GMVYSPLV--PSQPHYNLNLLSIGVNG 319
Query: 411 SPLNLGA---RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPV 467
L L A S + DTG++ TY K+AY + ++ S P+
Sbjct: 320 QMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISN--------SVSQLVTPI 371
Query: 468 CWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYL----VISKKGNICLGI 523
+ + F +++L+F ++ + P+ YL + C+G
Sbjct: 372 ISNGEQCYLVSTSISDMFPSVSLNFAGGASMM-----LRPQDYLFHYGIYDGASMWCIGF 426
Query: 524 LDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
E ILGD+ L+ ++ VYD +RIGWA C
Sbjct: 427 QKAPEEQT----ILGDLVLKDKVFVYDLARQRIGWASYDC 462
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 118/391 (30%), Positives = 173/391 (44%), Gaps = 39/391 (9%)
Query: 192 LRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN------- 242
L GN +P GLYF + +G P + YY+ +DTGSD+ W+ C A C++C K ++
Sbjct: 62 LGGNGHPSESGLYFAKIGLGTPVQDYYVQVDTGSDILWVNC-AGCTNCPKKSDLGIELSL 120
Query: 243 -PLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL 301
N + C PG C C+Y + Y D SS+ G RD + L
Sbjct: 121 YSPSSSSTSNRVTCNQDFCTSTYDGPIPG-CTPELLCEYRVAYGDGSSTAGYFVRDHVVL 179
Query: 302 TIENGSL----TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKN 357
G+ T ++VFGC Q G L T DGILG +A S+ SQLAS G +K
Sbjct: 180 DRVTGNFQTTSTNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKR 239
Query: 358 VVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA 417
V HCL N GGG +G + P + P++ P Y+ + I + LNL
Sbjct: 240 VFAHCL-DNINGGGIFAIGEVVQPK--VRTTPLV--PQQAHYNVFMKAIEVDNEVLNLPT 294
Query: 418 RNSQVGW---ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFP 474
+ D+G++ YF Y LI+ + S L L + C+
Sbjct: 295 DVFDTDLRKGTIIDSGTTLAYFPDVIYEPLISKIFARQST-LKLHTVEEQF-TCFEYD-- 350
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILD-GSEVHNG- 532
+V F T+T HF S + P YL C+G + G++ +G
Sbjct: 351 ----GNVDDGFPTVTFHFED-----SLSLTVYPHEYLFDIDSNKWCVGWQNSGAQSRDGK 401
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+LGD+ L+ +LV+YD N+ IGW + +C
Sbjct: 402 DMILLGDLVLQNRLVMYDLENQTIGWTEYNC 432
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 139 bits (349), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 117/383 (30%), Positives = 184/383 (48%), Gaps = 41/383 (10%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
L ++ +G Y T + +G PP+ + L +DTGS +T++ C C C K +P ++P +
Sbjct: 78 LYDDLLSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCST-CEQCGKHQDPRFQPESSS 136
Query: 252 ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP 311
YK C N + +QC YE YA+ SSS G+LA D L E+ LT
Sbjct: 137 T--YKPMQC-----NPSCNCDDEGKQCTYERRYAEMSSSSGLLAEDVLSFGNES-ELTPQ 188
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG-- 369
+FGC + G L + + DGI+GL R +S+ QL +IK VVG+ + GG
Sbjct: 189 RAIFGCETVETGELFSQ--RADGIMGLGRGPLSVVDQL----VIKEVVGNSFSLCYGGMD 242
Query: 370 --GGYMFLGHDLVPSWGMAWVPMLDSPFMELYHT-EILKINYGSSPLNLGAR--NSQVGW 424
GG M LG ++ P M + P+ Y+ E+ +++ L L R + + G
Sbjct: 243 VVGGAMVLG-NIPPPPDMVFA--HSDPYRSAYYNIELKELHVAGKRLKLNPRVFDGKHGT 299
Query: 425 ALFDTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSIVDVK 482
L D+G++Y Y ++A+ A +KE+ + DP+ +C+ R + +
Sbjct: 300 VL-DSGTTYAYLPEEAFVAFKDAIIKEIKFLKQI-HGPDPSYNDICFSGAG--RDVSQLS 355
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILGDI 540
+ F + + FG+ K +SPE YL K G CLGI + T +LG I
Sbjct: 356 KIFPEVNMVFGN-----GQKLSLSPENYLFRHTKVSGAYCLGIFQNGK---DPTTLLGGI 407
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+R LV YD N +IG+ K++C
Sbjct: 408 VVRNTLVTYDRDNDKIGFWKTNC 430
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 120/400 (30%), Positives = 188/400 (47%), Gaps = 45/400 (11%)
Query: 190 FPLRGNIYP--DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
F L G P GLYFT + +GNP + Y + +DTGSD+ W+ C PCS C + +
Sbjct: 15 FSLGGTADPLSGGLYFTQVGLGNPVKHYIVQVDTGSDVLWVNC-RPCSGCPRKSALNIPL 73
Query: 243 PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
+Y PR +++ D LC+ +R + +T C+Y Y D S+S G RD +
Sbjct: 74 TMYDPRESSTTSLVSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAM 133
Query: 300 HLTI--ENG-SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK 356
+ NG + T V+FGC+ Q G L + DGI+G + ++S+P+QLA+Q I
Sbjct: 134 QYNVISSNGLANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIP 193
Query: 357 NVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG 416
V HCL GGG + +G P GM + P++ P Y+ + I+ S+ L +
Sbjct: 194 RVFSHCLEGEKRGGGILVIGGIAEP--GMTYTPLV--PDSVHYNVVLRGISVNSNRLPID 249
Query: 417 ARN---SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKF 473
A + + + D+G++ YF AY+ + +++E +S V T +
Sbjct: 250 AEDFSSTNDTGVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGR- 308
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV---ISKKGNI---CLGILDGS 527
+ F +TL+F + P+ YL+ + G C+G S
Sbjct: 309 -------LSDLFPNVTLNFEGG------AMELQPDNYLMWGGTAPTGTTDVWCIGWQSSS 355
Query: 528 EV---HNGSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+GS + ILGDI L+ +LVVYD N RIGW +C
Sbjct: 356 SSAGPKDGSQLTILGDIVLKDKLVVYDLDNSRIGWMSYNC 395
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 138 bits (348), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 111/388 (28%), Positives = 176/388 (45%), Gaps = 44/388 (11%)
Query: 197 YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC----AKGANPLYKPRMGN- 251
Y GLYFT + +G+P + +Y+ +DTGSD+ WI C CS+C G + G+
Sbjct: 78 YFVGLYFTKVKLGSPAKDFYVQIDTGSDILWINC-ITCSNCPHSSGLGIELDFFDTAGSS 136
Query: 252 ---ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGS 307
++ D +C + G QC Y +Y D S + G D ++ T+ G
Sbjct: 137 TAALVSCADPICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQ 196
Query: 308 LTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
N +VFGC+ Q G L T DGI G +S+ SQL+S+G+ V HCL
Sbjct: 197 SMVANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCL 256
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPL----NLGARN 419
GGG + LG L PS + + P++ P + Y+ + I L N+ A
Sbjct: 257 KGGENGGGVLVLGEILEPS--IVYSPLV--PSLPHYNLNLQSIAVNGQLLPIDSNVFATT 312
Query: 420 SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV 479
+ G + D+G++ Y ++AY+ + D + S + P+ +
Sbjct: 313 NNQG-TIVDSGTTLAYLVQEAYNPFV--------DAITAAVSQFSKPIISKGNQCYLVSN 363
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV----ISKKGNICLGILDGSEVHNGSTI 535
V F ++L+F +V ++PE YL+ + C+G +V G T
Sbjct: 364 SVGDIFPQVSLNFMGGASMV-----LNPEHYLMHYGFLDSAAMWCIGF---QKVERGFT- 414
Query: 536 ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
ILGD+ L+ ++ VYD N+RIGWA +C
Sbjct: 415 ILGDLVLKDKIFVYDLANQRIGWADYNC 442
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 138 bits (348), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 112/387 (28%), Positives = 179/387 (46%), Gaps = 49/387 (12%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
L ++ +G Y T + +G P + + L +D+GS +T++ C A C C +P ++P
Sbjct: 81 LHDDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPC-ATCEQCGNHQDPRFQP---- 135
Query: 252 ILPYKDSLCMEIQRNHKPGYCE---TCQ----QCDYEIEYADHSSSMGVLARDELHLTIE 304
++ + P C TC QC YE +YA+ SSS GVL D + E
Sbjct: 136 ----------DLSSTYSPVKCNVDCTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKE 185
Query: 305 NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
+ L VFGC + G L + DGI+GL R ++S+ QL +G+I + C
Sbjct: 186 S-ELKPQRAVFGCENTETGDLFSQ--HADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYG 242
Query: 365 TNAGGGGYMFL-GHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR--NSQ 421
GGG M L G P + + SP+ Y+ E+ +I+ L L + NS+
Sbjct: 243 GMDVGGGTMVLGGMPAPPDMVFSHSNPVRSPY---YNIELKEIHVAGKALRLDPKIFNSK 299
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP-VCWRAKFPIRSIVD 480
G L D+G++Y Y +QA+ ++ + + DP +C+ R++
Sbjct: 300 HGTVL-DSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAG--RNVSQ 356
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGS--TII 536
+ + F + + FG+ K +SPE YL K G CLG+ NG T +
Sbjct: 357 LSEVFPDVDMVFGN-----GQKLSLSPENYLFRHSKVEGAYCLGVF-----QNGKDPTTL 406
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
LG I +R LV YD N++IG+ K++C
Sbjct: 407 LGGIVVRNTLVTYDRHNEKIGFWKTNC 433
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 138 bits (348), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 116/413 (28%), Positives = 179/413 (43%), Gaps = 47/413 (11%)
Query: 174 KKLVSSNAVAVDSSSIFPLRG--NIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCD 231
++L+ A VD FP+ G N Y GLYFT + +GNP + +++ +DTGSD+ W+ C
Sbjct: 63 RRLLGGVAGVVD----FPVEGSANPYMVGLYFTRVKLGNPAKEFFVQIDTGSDILWVTC- 117
Query: 232 APCSSCAKGA---------NPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQ----C 278
+PC+ C + NP I D Q C+T C
Sbjct: 118 SPCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEA--ICQTSNSQSSPC 175
Query: 279 DYEIEYADHSSSMGVLARDELHLTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDG 334
Y Y D S + G D + G+ N +VFGC+ Q G L DG
Sbjct: 176 GYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAVDG 235
Query: 335 ILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSP 394
I G + ++S+ SQL S G+ V HCL + GGG + LG + P G+ + P++ S
Sbjct: 236 IFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEP--GLVYTPLVPSQ 293
Query: 395 FMELYHTEILKINYGSSPLN---LGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEV 451
+ E + +N P++ N+Q + D+G++ Y AY ++++
Sbjct: 294 PHYNLNLESIAVNGQKLPIDSSLFTTSNTQ--GTIVDSGTTLAYLADGAYDPFVSAIAAA 351
Query: 452 SSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYL 511
S S +L F S VD F T+TL+F + + PE YL
Sbjct: 352 VS------PSVRSLVSKGSQCFITSSSVDSS--FPTVTLYF-----MGGVAMSVKPENYL 398
Query: 512 VISKKGNICLGILDGSEVHNGSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + + G + + G I ILGD+ L+ ++ VYD N R+GWA C
Sbjct: 399 LQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADYDC 451
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 138 bits (348), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 116/413 (28%), Positives = 179/413 (43%), Gaps = 47/413 (11%)
Query: 174 KKLVSSNAVAVDSSSIFPLRG--NIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCD 231
++L+ A VD FP+ G N Y GLYFT + +GNP + +++ +DTGSD+ W+ C
Sbjct: 65 RRLLGGVAGVVD----FPVEGSANPYMVGLYFTRVKLGNPAKEFFVQIDTGSDILWVTC- 119
Query: 232 APCSSCAKGA---------NPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQ----C 278
+PC+ C + NP I D Q C+T C
Sbjct: 120 SPCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEA--ICQTSNSQSSPC 177
Query: 279 DYEIEYADHSSSMGVLARDELHLTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDG 334
Y Y D S + G D + G+ N +VFGC+ Q G L DG
Sbjct: 178 GYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAVDG 237
Query: 335 ILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSP 394
I G + ++S+ SQL S G+ V HCL + GGG + LG + P G+ + P++ S
Sbjct: 238 IFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEP--GLVYTPLVPSQ 295
Query: 395 FMELYHTEILKINYGSSPLN---LGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEV 451
+ E + +N P++ N+Q + D+G++ Y AY ++++
Sbjct: 296 PHYNLNLESIAVNGQKLPIDSSLFTTSNTQ--GTIVDSGTTLAYLADGAYDPFVSAIAAA 353
Query: 452 SSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYL 511
S S +L F S VD F T+TL+F + + PE YL
Sbjct: 354 VS------PSVRSLVSKGSQCFITSSSVDSS--FPTVTLYF-----MGGVAMSVKPENYL 400
Query: 512 VISKKGNICLGILDGSEVHNGSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + + G + + G I ILGD+ L+ ++ VYD N R+GWA C
Sbjct: 401 LQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADYDC 453
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 138 bits (347), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 112/393 (28%), Positives = 182/393 (46%), Gaps = 51/393 (12%)
Query: 187 SSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYK 246
S+ L ++ +G Y T + +G PP+ + L +D+GS +T++ C A C C +P ++
Sbjct: 73 SARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQ 131
Query: 247 PRMGNILPYKDSLCMEIQRNHKPGYCE---TC----QQCDYEIEYADHSSSMGVLARDEL 299
P ++ + P C TC QC YE +YA+ SSS GVL D +
Sbjct: 132 P--------------DLSSTYSPVKCNVDCTCDSDKNQCTYERQYAEMSSSSGVLGEDIV 177
Query: 300 HLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVV 359
E+ L VFGC + G L + DGI+GL R ++S+ QL +G+I +
Sbjct: 178 SFGTES-ELKPQRAVFGCENSETGDLFSQ--HADGIMGLGRGQLSIMDQLVDKGVIGDSF 234
Query: 360 GHCLTTNAGGGGYMFLGHDLVPSWGMAWV--PMLDSPFMELYHTEILKINYGSSPLNLGA 417
C GGG M LG P GM + + SP+ Y+ E+ +++ L +
Sbjct: 235 SMCYGGMDIGGGAMVLGAMPAPP-GMIYTHSNAVRSPY---YNIELKEMHVAGKALRVDP 290
Query: 418 R--NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP-VCWRAKFP 474
R + + G + D+G++Y Y +QA+ ++ + DP +C+
Sbjct: 291 RIFDGKHG-TVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAG- 348
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNG 532
R++ + + F + + FG+ K +SPE YL K G CLG+ NG
Sbjct: 349 -RNVSQLSEVFPKVDMVFGN-----GQKLSLSPENYLFRHSKVEGAYCLGVF-----QNG 397
Query: 533 S--TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
T +LG I +R LV YD N++IG+ K++C
Sbjct: 398 KDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 430
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 112/384 (29%), Positives = 184/384 (47%), Gaps = 41/384 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK----GANPLYKPRMGN---- 251
GLY T + +G PPR + + +DTGSD+ WI C+ CS+C K G + +G+
Sbjct: 82 GLYTTKVKMGTPPREFTVQIDTGSDILWINCNT-CSNCPKSSGLGIELNFFDTVGSSTAA 140
Query: 252 ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP 311
++P D +C + QC Y +Y D S + GV D ++ + G T
Sbjct: 141 LVPCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPA 200
Query: 312 NV------VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
NV VFGC+ Q G L T DGILG ++S+ SQL+S+GI V HCL
Sbjct: 201 NVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCLKG 260
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG----ARNSQ 421
+ GGG + LG L PS + + P++ P Y+ + I L++ A + +
Sbjct: 261 DGNGGGILVLGEILEPS--IVYSPLV--PSQPHYNLNLQSIAVNGQVLSINPAVFATSDK 316
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
G + D+G++ +Y ++AY L+ ++ D V + + + + SI D
Sbjct: 317 RG-TIIDSGTTLSYLVQEAYDPLVNAV-----DTAVSQFATSFISKGSQCYLVLTSIDDS 370
Query: 482 KQFFKTLTLHF--GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
F T++ +F G+ + +++ ++ G+ +K C+G +V G T ILGD
Sbjct: 371 ---FPTVSFNFEGGASMDLKPSQYLLN-RGFQDGAKM--WCIGF---QKVQEGVT-ILGD 420
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
+ L+ ++VVYD ++IGW C
Sbjct: 421 LVLKDKIVVYDLARQQIGWTNYDC 444
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 121/393 (30%), Positives = 182/393 (46%), Gaps = 41/393 (10%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
PL G P+ GLY+ + +G P R YY+ +DTGSD+ W+ C C+ C K ++
Sbjct: 84 LPLGGTGRPEAVGLYYAKIGIGTPARDYYVQVDTGSDIMWVNC-IQCNECPKKSSLGMEL 142
Query: 243 PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
LY + G ++ C I P YC C Y YAD SSS G RD +
Sbjct: 143 TLYDIKESLTGKLVSCDQDFCYAINGG-PPSYCIANMSCSYTEIYADGSSSFGYFVRDIV 201
Query: 300 HLTIENGSL--TKPN--VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII 355
+G L T N V+FGC+ Q G L++ DGILG ++ S+ SQLAS G +
Sbjct: 202 QYDQVSGDLETTSANGSVIFGCSATQSG-DLSSEEALDGILGFGKSNTSMISQLASSGKV 260
Query: 356 KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL 415
+ + HCL GGG +GH + P + P++ P Y+ + + G LNL
Sbjct: 261 RKMFAHCL-DGLNGGGIFAIGHIVQPK--VNTTPLV--PNQTHYNVNMKAVEVGGYFLNL 315
Query: 416 GARNSQVG---WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
VG + D+G++ Y + Y +L++ + SD V D C++
Sbjct: 316 PTDVFDVGDKKGTIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQF--TCFQYS 373
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILD-GSEVHN 531
S+ D F +T HF + S + P YL S G C+G + G + +
Sbjct: 374 ---ESLDDG---FPAVTFHFEN-----SLYLKVHPHEYL-FSYDGLWCIGWQNSGMQSRD 421
Query: 532 GSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I +LGD++L +LV+YD N+ IGW + +C
Sbjct: 422 RRNITLLGDLALSNKLVLYDLENQVIGWTEYNC 454
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 131/448 (29%), Positives = 204/448 (45%), Gaps = 50/448 (11%)
Query: 148 FVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSI-FPLRGNIYPD--GLYFT 204
FVD ++V V PH+S K ++I PL GN P GLY+T
Sbjct: 15 FVDASNANLVFPVQRKFNGPHRSLDAIKAHDDRRRGRFLAAIDVPLGGNGLPSSTGLYYT 74
Query: 205 YMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKP---RMGNILPYK 256
+ +G+P + +Y+ +DTGSD+ W+ C A C++C K + LY P + N +P
Sbjct: 75 KVGLGSPAKEFYVQVDTGSDILWVNC-AGCTACPKKSGLGMDLTLYDPNGSKTSNAVPCG 133
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSL-TKPN--- 312
D C + G C+ C Y I Y D S++ G D L +G+L TKP+
Sbjct: 134 DGFCTDTYSGPISG-CKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSS 192
Query: 313 VVFGCAYDQQG-LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG 371
V+FGC Q G L N+ DGI+G +A S+ SQLA+ G +K + HCL ++ GGG
Sbjct: 193 VIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDSHHGGGI 252
Query: 372 YMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSP--LNLGARNSQVGWA-LFD 428
+ +G + P + P++ P M Y+ + ++ P L L +S G + D
Sbjct: 253 FS-IGQVMEPKFNTT--PLV--PRMAHYNVILKDMDVDGEPILLPLYLFDSGSGRGTIID 307
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK----FPIRSIVDVKQF 484
+G++ Y Y++L+ + ++ D + K FP+ VK
Sbjct: 308 SGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQFTCFHYSDKLDEGFPV-----VKFH 362
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS-EVHNGSTIIL-GDISL 542
F+ L+L + P YL + K+ C+G S + G +IL GD+ L
Sbjct: 363 FEGLSL-------------TVHPHDYLFLYKEDIYCIGWQKSSTQTKEGRDLILIGDLVL 409
Query: 543 RGQLVVYDNVNKRIGWAKSHCMNPGRFK 570
+LVVYD N IGW +C + + K
Sbjct: 410 SNKLVVYDLENMVIGWTNFNCSSSIKVK 437
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 122/398 (30%), Positives = 182/398 (45%), Gaps = 51/398 (12%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
PL G P+ GLY+ + +G P R YY+ +DTGSD+ W+ C C+ C K ++
Sbjct: 84 LPLGGTGRPEAVGLYYAKIGIGTPARDYYVQVDTGSDIMWVNC-IQCNECPKKSSLGMEL 142
Query: 243 PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARD-- 297
LY + G ++ C I P YC C Y YAD SSS G RD
Sbjct: 143 TLYDIKESLTGKLVSCDQDFCYAINGG-PPSYCIANMSCSYTEIYADGSSSFGYFVRDIV 201
Query: 298 -------ELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLA 350
+L T NGS V+FGC+ Q G L++ DGILG ++ S+ SQLA
Sbjct: 202 QYDQVSGDLETTSANGS-----VIFGCSATQSG-DLSSEEALDGILGFGKSNTSMISQLA 255
Query: 351 SQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGS 410
S G ++ + HCL GGG +GH + P + P++ P Y+ + + G
Sbjct: 256 SSGKVRKMFAHCL-DGLNGGGIFAIGHIVQPK--VNTTPLV--PNQTHYNVNMKAVEVGG 310
Query: 411 SPLNLGARNSQVG---WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPV 467
LNL VG + D+G++ Y + Y +L++ + SD V D
Sbjct: 311 YFLNLPTDVFDVGDKKGTIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQF--T 368
Query: 468 CWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILD-G 526
C++ S+ D F +T HF + S + P YL S G C+G + G
Sbjct: 369 CFQYS---ESLDD---GFPAVTFHFEN-----SLYLKVHPHEYL-FSYDGLWCIGWQNSG 416
Query: 527 SEVHNGSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + I +LGD++L +LV+YD N+ IGW + +C
Sbjct: 417 MQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTEYNC 454
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 110/404 (27%), Positives = 186/404 (46%), Gaps = 40/404 (9%)
Query: 186 SSSIFPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN- 242
+++ PL G P GLY+T + +G P + YY+ +DTGSD+ W+ C C C + +
Sbjct: 71 AAADIPLGGLGLPTDTGLYYTEIGIGTPTKRYYVQVDTGSDILWVNC-ISCDRCPRKSGL 129
Query: 243 ----PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLA 295
LY P+ G+ + C PG C T C+Y + Y D SS+ G
Sbjct: 130 GLELTLYDPKDSSTGSKVSCDQGFCAATYGGLLPG-CTTSLPCEYSVTYGDGSSTTGYFV 188
Query: 296 RDELHL-TIENGSLTKP---NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLAS 351
D L + T+P V FGC Q G L ++ DGI+G ++ S+ SQL++
Sbjct: 189 SDLLQFDQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSA 248
Query: 352 QGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSS 411
G +K + HCL T GGG +G+ + P + P++ P M Y+ + I+ G +
Sbjct: 249 AGKVKKIFAHCLDT-INGGGIFAIGNVVQPK--VKTTPLV--PNMPHYNVNLKSIDVGGT 303
Query: 412 PLNLGARNSQVG---WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVC 468
L L + G + D+G++ TY + Y E++ ++ D + + +C
Sbjct: 304 ALKLPSHMFDTGEKKGTIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHNVQE---FLC 360
Query: 469 WRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS- 527
++ + V F +T HF + + ++ P Y + C+G +G
Sbjct: 361 FQY------VGRVDDDFPKITFHFENDLPL-----NVYPHDYFFENGDNLYCVGFQNGGL 409
Query: 528 EVHNG-STIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFK 570
+ +G ++LGD+ L +LVVYD N+ IGW + +C + + K
Sbjct: 410 QSKDGKGMVLLGDLVLSNKLVVYDLENQVIGWTEYNCSSSIKIK 453
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 118/406 (29%), Positives = 183/406 (45%), Gaps = 45/406 (11%)
Query: 181 AVAVDSSSIFPLRG--NIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC- 237
A AV FP+ G N Y GLYFT + +GNP + Y++ +DTGSD+ W+ C +PC+ C
Sbjct: 66 APAVAGVVDFPVEGSANPYMVGLYFTRVKLGNPAKEYFVQIDTGSDILWVAC-SPCTGCP 124
Query: 238 -AKGAN---PLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQ----CDYEIEYAD 286
+ G N + P + +P D C + + C++ C Y Y D
Sbjct: 125 TSSGLNIQLEFFNPDSSSTSSRIPCSDDRCTAALQTGE-AVCQSSDSPSSPCGYTFTYGD 183
Query: 287 HSSSMGVLARDELHLTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAK 342
S + G D ++ G+ N VVFGC+ Q G L+ T DGI G + +
Sbjct: 184 GSGTSGFYVSDTMYFDTVMGNEQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQ 243
Query: 343 VSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTE 402
+S+ SQL S G+ HCL + GGG + LG + P G+ + P++ S + E
Sbjct: 244 LSVVSQLYSLGVSPKTFSHCLKGSDNGGGILVLGEIVEP--GLVFTPLVPSQPHYNLNLE 301
Query: 403 ILKINYGSSPLN---LGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLD 459
+ ++ P++ N+Q + D+G++ Y AY I ++ S +
Sbjct: 302 SIAVSGQKLPIDSSLFATSNTQ--GTIVDSGTTLVYLVDGAYDPFINAIAAAVSPSVRSV 359
Query: 460 ASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI 519
S F S VD F T TL+F + PE YL+ ++G++
Sbjct: 360 VSKGI------QCFVTTSSVDSS--FPTATLYFKG-----GVSMTVKPENYLL--QQGSV 404
Query: 520 CLGIL--DGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+L G + G T ILGD+ L+ ++ VYD N R+GWA C
Sbjct: 405 DNNVLWCIGWQRSQGIT-ILGDLVLKDKIFVYDLANMRMGWADYDC 449
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 172/384 (44%), Gaps = 37/384 (9%)
Query: 197 YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN--------PLYKPR 248
Y GLYFT + +G PPR + + +DTGSD+ W+ C + CS+C + +
Sbjct: 76 YLVGLYFTRVKLGTPPREFNVQIDTGSDVLWVTCSS-CSNCPQTSGLGIQLNYFDTTSSS 134
Query: 249 MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSL 308
++P +C + QC Y +Y D S + G D + G
Sbjct: 135 TARLVPCSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGES 194
Query: 309 TKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
N +VFGC+ Q G L T DGI G + ++S+ SQL+S GI V HCL
Sbjct: 195 LIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLK 254
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG----ARNS 420
GGG + LG L P G+ + P++ P Y+ ++ I L + A +S
Sbjct: 255 GEDSGGGILVLGEILEP--GIVYSPLV--PSQPHYNLDLQSIAVSGQLLPIDPAAFATSS 310
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
G + DTG++ Y ++AY ++++ S + PT+ + S+ +
Sbjct: 311 NRG-TIIDTGTTLAYLVEEAYDPFVSAITAAVS-----QLATPTINKGNQCYLVSNSVSE 364
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGD 539
V F ++ +F ++ + PE YL+ ++ L + ++ G T ILGD
Sbjct: 365 V---FPPVSFNFAGGATML-----LKPEEYLMYLTNYAGAALWCIGFQKIQGGIT-ILGD 415
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
+ L+ ++ VYD ++RIGWA C
Sbjct: 416 LVLKDKIFVYDLAHQRIGWANYDC 439
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 108/390 (27%), Positives = 183/390 (46%), Gaps = 41/390 (10%)
Query: 190 FPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRM 249
FPL+GN GLY+T + +GNP + + +DTGSD+ W++C +PC SC + + +
Sbjct: 71 FPLKGNYSDLGLYYTEIGLGNPVQKLKVIVDTGSDILWVKC-SPCRSCLSKQDIIPPLSI 129
Query: 250 GNILPYKDSLCMEIQRNHKPGYCETCQQ------CDYEIEYADHSSSMGVLARDELHLTI 303
N+ S G C + C Y I Y D S+S+G +D++H +
Sbjct: 130 YNLSASSTSSVSSCSDPLCTGEQAVCSRSGSNSACAYGISYQDKSTSIGAYVKDDMHYVL 189
Query: 304 ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
+ G+ T ++ FGCA + G DGI+G + ++P+Q+A+Q + V HCL
Sbjct: 190 QGGNATTSHIFFGCAINITGSW-----PADGIMGFGQISKTVPNQIATQRNMSRVFSHCL 244
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR----- 418
GGG + G + + M + P+L+ Y+ ++L I+ S L + ++
Sbjct: 245 GGEKHGGGILEFGEE-PNTTEMVFTPLLN--VTTHYNVDLLSISVNSKVLPIDSKEFSYV 301
Query: 419 --NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIR 476
++ + D+G+S+ +A L + +K +++ L P L F ++
Sbjct: 302 SNSTNETGVIIDSGTSFALLATKANRILFSEIKNLTTAKL-----GPKLE--GLQCFYLK 354
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV---ISKKGNICLGILDGSEVHNGS 533
S + V+ F +TL F + + P+ YLV + KK N G +G
Sbjct: 355 SGLTVETSFPNVTLTFSG-----GSTMKLKPDNYLVMVELKKKRN---GYCYAWSSADGL 406
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
T I G+I L+ +LV YD N+RIGW +C
Sbjct: 407 T-IFGEIVLKDKLVFYDVENRRIGWKGQNC 435
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 108/393 (27%), Positives = 175/393 (44%), Gaps = 50/393 (12%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNIL 253
Y+T + +G PP+P+++ +DTGSD+ W+ C C C + LY P+ G+ +
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVNC-VSCDKCPTKSGLGIDLALYDPKGSSSGSAV 145
Query: 254 PYKDSLCMEIQRNHK--PGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSL--- 308
+ C + + PG C + C+Y EY D SS+ G D L +G+
Sbjct: 146 SCDNKFCAATYGSGEKLPG-CTAGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTR 204
Query: 309 -TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
K NV+FGC Q G L +T DGI+G ++ S SQLAS G +K + HCL T
Sbjct: 205 HAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDTIK 264
Query: 368 GGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA---RNSQVGW 424
GGG + +G + P + P+L P M Y+ + I+ + L L S+
Sbjct: 265 GGGIFA-IGEVVQPK--VKSTPLL--PNMSHYNVNLQSIDVAGNALQLPPHIFETSEKRG 319
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRA-----KFPIRSIV 479
+ D+G++ TY + Y +++A++ + D + +R F V
Sbjct: 320 TIIDSGTTLTYLPELVYKDILAAVFQKHQD------------ITFRTIQGFLCFEYSESV 367
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS--EVHNGSTIIL 537
D F +T HF + ++ P Y + CLG +G ++L
Sbjct: 368 D--DGFPKITFHFEDDLGL-----NVYPHDYFFQNGDNLYCLGFQNGGFQPKDAKDMVLL 420
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFK 570
GD+ L ++VVYD + IGW +C + + K
Sbjct: 421 GDLVLSNKVVVYDLEKQVIGWTDYNCSSSIKIK 453
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 123/428 (28%), Positives = 196/428 (45%), Gaps = 52/428 (12%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTG 222
++ H ++ +++S AVD L GN P GLYFT + +G+PP+ YY+ +DTG
Sbjct: 39 VKAHDARRRGRILS----AVD----LNLGGNGLPTETGLYFTKLGLGSPPKDYYVQVDTG 90
Query: 223 SDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCET 274
SD+ W+ C CS C + ++ LY P+ ++ C PG C++
Sbjct: 91 SDILWVNC-VKCSRCPRKSDLGIDLTLYDPKGSETSELISCDQEFCSATYDGPIPG-CKS 148
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENGSL-TKP---NVVFGCAYDQQGLLLNTLV 330
C Y I Y D S++ G +D L N +L T P +++FGC Q G L ++
Sbjct: 149 EIPCPYSITYGDGSATTGYYVQDYLTYNHVNDNLRTAPQNSSIIFGCGAVQSGTLSSSSE 208
Query: 331 KT-DGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVP 389
+ DGI+G ++ S+ SQLA+ G +K + HCL N GGG +G + P ++ P
Sbjct: 209 EALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCL-DNIRGGGIFAIGEVVEPK--VSTTP 265
Query: 390 MLDSPFMELYHTEILKINYGSSPLNLGARNSQVG---WALFDTGSSYTYFTKQAYSELIA 446
++ P M Y+ + I + L L + G + D+G++ Y Y ELI
Sbjct: 266 LV--PRMAHYNVVLKSIEVDTDILQLPSDIFDSGNGKGTIIDSGTTLAYLPAIVYDELIP 323
Query: 447 SLKEVSSDGLVLDASDPTLPV-CWRAKFP-IRSIVDVKQFFKTLTLHFGSKWQIVSTKFH 504
+ A P L + +F + +V + F + LHF S
Sbjct: 324 KVM----------ARQPRLKLYLVEQQFSCFQYTGNVDRGFPVVKLHFED-----SLSLT 368
Query: 505 ISPEGYLVISKKGNICLGILDG-SEVHNGSTI-ILGDISLRGQLVVYDNVNKRIGWAKSH 562
+ P YL K G C+G ++ NG + +LGD+ L +LV+YD N IGW +
Sbjct: 369 VYPHDYLFQFKDGIWCIGWQKSVAQTKNGKDMTLLGDLVLSNKLVIYDLENMAIGWTDYN 428
Query: 563 CMNPGRFK 570
C + + K
Sbjct: 429 CSSSIKVK 436
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 101/383 (26%), Positives = 180/383 (46%), Gaps = 40/383 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN---PL--YKP---RMGN 251
GLY+T + +GNPP+ +Y+ +DTGSD+ W+ C++ C+ C + PL + P +
Sbjct: 81 GLYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNS-CNGCPATSGLQIPLNFFDPGSSTTAS 139
Query: 252 ILPYKDSLC-MEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTI----ENG 306
++ D +C + +Q + + ++ QC Y +Y D S + G D +HL +
Sbjct: 140 LVSCSDQICALGVQSSDSACFGQS-NQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVT 198
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
S + +VVFGC+ Q G L + DGI G + +S+ SQL+S+GI V HCL +
Sbjct: 199 SNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGD 258
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGS-----SPLNLGARNSQ 421
GGG + LG + P+ + + P++ P Y+ + I+ SP +SQ
Sbjct: 259 DSGGGILVLGEIVEPN--VVYTPLV--PSQPHYNLNLQSISVNGQVLPISPAVFATSSSQ 314
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
+ D+G++ Y ++AY+ + ++ + S T V + + V
Sbjct: 315 --GTIIDSGTTLAYLAEEAYNAFVVAVTNI--------VSQSTQSVVLKGNRCYVTSSSV 364
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK-GNICLGILDGSEVHNGSTIILGDI 540
F ++L+F +V + + YL+ G + + ++ ILGD+
Sbjct: 365 SDIFPQVSLNFAGGASLV-----LGAQDYLIQQNSVGGTTVWCIGFQKIPGQGITILGDL 419
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
L+ ++ +YD N+RIGW C
Sbjct: 420 VLKDKIFIYDLANQRIGWTNYDC 442
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 111/400 (27%), Positives = 185/400 (46%), Gaps = 47/400 (11%)
Query: 190 FPLRGNIYP--DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN---PL 244
FP++G P GLYFT + +G+PP+ +Y+ +DTGSD+ W+ C + C+ C + PL
Sbjct: 70 FPVQGTFNPFLVGLYFTRVQLGSPPKDFYVQIDTGSDVLWVSCSS-CNGCPVTSGLQIPL 128
Query: 245 --YKP---RMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
+ P ++ D C ++ QC Y +Y D S + G D +
Sbjct: 129 TFFDPGSSTTAALVSCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLM 188
Query: 300 HLT---IENGSLTK------PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLA 350
HL + +G L++ +V F C+ Q G L + DGI G + ++S+ SQLA
Sbjct: 189 HLDTLLLSSGELSQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLA 248
Query: 351 SQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDS-PFMELY--HTEILKIN 407
SQGI V HCL + GGG + LG + P+ + + P++ S P LY +
Sbjct: 249 SQGITPRVFSHCLKGDDSGGGVLVLGEIVEPN--IVYTPLVPSQPHYNLYLQSISVAGQT 306
Query: 408 YGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPV 467
P GA ++Q + D+G++ Y + AY ++++ V S L+A L
Sbjct: 307 LAIDPSVFGASSNQ--GTIVDSGTTLAYLAEGAYDPFVSAITSVVS----LNAR-TYLSK 359
Query: 468 CWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV----ISKKGNICLGI 523
+ S+ DV F ++L+F ++ ++P+ YL+ + C+G
Sbjct: 360 GNQCYLVTSSVNDV---FPQVSLNFAGGASLI-----LNPQDYLLQQNSVGGAAVWCVGF 411
Query: 524 LDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ ILGD+ L+ ++ VYD N+R+GW C
Sbjct: 412 ---QKTPGQQITILGDLVLKDKIFVYDIANQRVGWTNYDC 448
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 113/388 (29%), Positives = 183/388 (47%), Gaps = 41/388 (10%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
L ++ +G Y T + +G P + + L +D+GS +T++ PC++C + N ++ N
Sbjct: 81 LHDDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYV----PCATCEQCGN--HQSESPN 134
Query: 252 ILPYKDSLCM-EIQRNHKPGYCE---TCQ----QCDYEIEYADHSSSMGVLARDELHLTI 303
I+ D ++ + P C TC QC YE +YA+ SSS GVL D +
Sbjct: 135 IIEAHDPRFQPDLSSTYSPVKCNVDCTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGK 194
Query: 304 ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
E+ L VFGC + G L + DGI+GL R ++S+ QL +G+I + C
Sbjct: 195 ES-ELKPQRAVFGCENTETGDLFSQ--HADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY 251
Query: 364 TTNAGGGGYMFL-GHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR--NS 420
GGG M L G P + + SP+ Y+ E+ +I+ L L + NS
Sbjct: 252 GGMDVGGGTMVLGGMPAPPDMVFSHSNPVRSPY---YNIELKEIHVAGKALRLDPKIFNS 308
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP-VCWRAKFPIRSIV 479
+ G L D+G++Y Y +QA+ ++ + + DP +C+ R++
Sbjct: 309 KHGTVL-DSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAG--RNVS 365
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGS--TI 535
+ + F + + FG+ K +SPE YL K G CLG+ NG T
Sbjct: 366 QLSEVFPDVDMVFGN-----GQKLSLSPENYLFRHSKVEGAYCLGVF-----QNGKDPTT 415
Query: 536 ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+LG I +R LV YD N++IG+ K++C
Sbjct: 416 LLGGIVVRNTLVTYDRHNEKIGFWKTNC 443
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 113/388 (29%), Positives = 183/388 (47%), Gaps = 41/388 (10%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
L ++ +G Y T + +G P + + L +D+GS +T++ PC++C + N ++ N
Sbjct: 82 LHDDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYV----PCATCEQCGN--HQSESPN 135
Query: 252 ILPYKDSLCM-EIQRNHKPGYCE---TCQ----QCDYEIEYADHSSSMGVLARDELHLTI 303
I+ D ++ + P C TC QC YE +YA+ SSS GVL D +
Sbjct: 136 IIEAHDPRFQPDLSSTYSPVKCNVDCTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGK 195
Query: 304 ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
E+ L VFGC + G L + DGI+GL R ++S+ QL +G+I + C
Sbjct: 196 ES-ELKPQRAVFGCENTETGDLFSQ--HADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY 252
Query: 364 TTNAGGGGYMFL-GHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR--NS 420
GGG M L G P + + SP+ Y+ E+ +I+ L L + NS
Sbjct: 253 GGMDVGGGTMVLGGMPAPPDMVFSHSNPVRSPY---YNIELKEIHVAGKALRLDPKIFNS 309
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP-VCWRAKFPIRSIV 479
+ G L D+G++Y Y +QA+ ++ + + DP +C+ R++
Sbjct: 310 KHGTVL-DSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAG--RNVS 366
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGS--TI 535
+ + F + + FG+ K +SPE YL K G CLG+ NG T
Sbjct: 367 QLSEVFPDVDMVFGN-----GQKLSLSPENYLFRHSKVEGAYCLGVF-----QNGKDPTT 416
Query: 536 ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+LG I +R LV YD N++IG+ K++C
Sbjct: 417 LLGGIVVRNTLVTYDRHNEKIGFWKTNC 444
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 115/400 (28%), Positives = 185/400 (46%), Gaps = 65/400 (16%)
Query: 187 SSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYK 246
S+ L ++ +G Y T + +G PP+ + L +D+GS +T++ C A C C +P ++
Sbjct: 73 SARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQ 131
Query: 247 PRMGNILPYKDSLCMEIQRNHKPGYCE---TC----QQCDYEIEYADHSSSMGVLARDEL 299
P ++ + P C TC QC YE +YA+ SSS GVL D +
Sbjct: 132 P--------------DLSSTYSPVKCNVDCTCDSDKNQCTYERQYAEMSSSSGVLGEDIV 177
Query: 300 HLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVV 359
E+ L VFGC + G L + DGI+GL R ++S+ QL +G+I +
Sbjct: 178 SFGTES-ELKPQRAVFGCENSETGDLFSQ--HADGIMGLGRGQLSIMDQLVDKGVIGDSF 234
Query: 360 GHCLTTNAGGGGYMFLGHDLVPSWGMAWV--PMLDSPFMELYHTEILKINYGSSPLNLGA 417
C GGG M LG P GM + + SP+ Y+ E+ +++ L +
Sbjct: 235 SMCYGGMDIGGGAMVLGAMPAPP-GMIYTHSNAVRSPY---YNIELKEMHVAGKALRVDP 290
Query: 418 R--NSQVGWALFDTGSSYTYFTKQAY-------SELIASLKEVSS-DGLVLDASDPTLPV 467
R + + G + D+G++Y Y +QA+ S + LK++ D D +
Sbjct: 291 RIFDGKHG-TVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDSNYKD-------I 342
Query: 468 CWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILD 525
C+ R++ + + F + + FG+ K +SPE YL K G CLG+
Sbjct: 343 CFAGAG--RNVSQLSEVFPKVDMVFGN-----GQKLSLSPENYLFRHSKVEGAYCLGVF- 394
Query: 526 GSEVHNGS--TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
NG T +LG I +R LV YD N++IG+ K++C
Sbjct: 395 ----QNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 430
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 135 bits (339), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 108/380 (28%), Positives = 171/380 (45%), Gaps = 37/380 (9%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC--AKGAN---PLYKPRMGNI---L 253
YFT + +G+PP+ Y++ +DTGSD+ W+ C +PC+ C + G N + P + +
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 175
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTKP 311
P D C + + C+T C Y Y D S + G D ++ G+
Sbjct: 176 PCSDDRCTAALQTSE-AVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTA 234
Query: 312 N----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
N +VFGC+ Q G L T DGI G + ++S+ SQL S G+ V HCL +
Sbjct: 235 NSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSD 294
Query: 368 GGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA-RNSQVGWAL 426
GGG + LG + P G+ + P++ S + E + +N P++ S +
Sbjct: 295 NGGGILVLGEIVEP--GLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTI 352
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
D+G++ Y AY + ++ S S +L F S VD F
Sbjct: 353 VDSGTTLAYLADGAYDPFVNAITAAVS------PSVRSLVSKGNQCFVTSSSVDSS--FP 404
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLV--ISKKGNICLGILDGSEVHNGSTI-ILGDISLR 543
T++L+F + + PE YL+ S N+ I G + + G I ILGD+ L+
Sbjct: 405 TVSLYF-----MGGVAMTVKPENYLLQQASIDNNVLWCI--GWQRNQGQQITILGDLVLK 457
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
++ VYD N R+GW C
Sbjct: 458 DKIFVYDLANMRMGWTDYDC 477
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 119/397 (29%), Positives = 176/397 (44%), Gaps = 40/397 (10%)
Query: 190 FPLRG-NI-YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPL--- 244
PL G NI Y GLY+T + +G P YY+ +DTGS W+ C C ++ L
Sbjct: 69 LPLGGFNIPYGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQCPHESDILRKL 127
Query: 245 --YKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL- 301
Y PR + + K+ C + +P C +C Y YAD +MG+L D LH
Sbjct: 128 TFYDPR--SSVSSKEVKCDDTICTSRPP-CNMTLRCPYITGYADGGLTMGILFTDLLHYH 184
Query: 302 -TIENGSL--TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNV 358
NG T +V FGC Q G L N+ V DGI+G + + SQLA+ G K +
Sbjct: 185 QLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKI 244
Query: 359 VGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGA 417
HCL + GGG + +G + P + P++ + E+YH LK IN + L L A
Sbjct: 245 FSHCLDSTNGGGIFA-IGEVVEPK--VKTTPIVKNN--EVYHLVNLKSINVAGTTLQLPA 299
Query: 418 R---NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFP 474
++ D+GS+ Y + YSELI L + A P + + F
Sbjct: 300 NIFGTTKTKGTFIDSGSTLVYLPEIIYSELI----------LAVFAKHPDITMGAMYNFQ 349
Query: 475 IRSIV-DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGS 533
+ V F +T HF + + + P YL+ + C G D
Sbjct: 350 CFHFLGSVDDKFPKITFHFENDLTL-----DVYPYDYLLEYEGNQYCFGFQDAGIHGYKD 404
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFK 570
IILGD+ + ++VVYD + IGW + +C + + K
Sbjct: 405 MIILGDMVISNKVVVYDMEKQAIGWTEHNCSSSVKIK 441
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 105/387 (27%), Positives = 178/387 (45%), Gaps = 38/387 (9%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNI 252
LY+T + +G P + YY+ +DTGSD+ W+ C C C + + LY P+ G+
Sbjct: 3 LYYTEIGIGTPTKRYYVQVDTGSDILWVNC-ISCDRCPRKSGLGLELTLYDPKDSSTGSK 61
Query: 253 LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGSLTKP 311
+ C PG C T C+Y + Y D SS+ G D L + T+P
Sbjct: 62 VSCDQGFCAATYGGLLPG-CTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRP 120
Query: 312 ---NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
V FGC Q G L ++ DGI+G ++ S+ SQL++ G +K + HCL T
Sbjct: 121 ANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDT-IN 179
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG---WA 425
GGG +G+ + P + P++ P M Y+ + I+ G + L L + G
Sbjct: 180 GGGIFAIGNVVQPK--VKTTPLV--PNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGT 235
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
+ D+G++ TY + Y E++ ++ D + + +C++ + V F
Sbjct: 236 IIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHNVQE---FLCFQY------VGRVDDDF 286
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS-EVHNG-STIILGDISLR 543
+T HF + + ++ P Y + C+G +G + +G ++LGD+ L
Sbjct: 287 PKITFHFENDLPL-----NVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLS 341
Query: 544 GQLVVYDNVNKRIGWAKSHCMNPGRFK 570
+LVVYD N+ IGW + +C + + K
Sbjct: 342 NKLVVYDLENQVIGWTEYNCSSSIKIK 368
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 178/386 (46%), Gaps = 40/386 (10%)
Query: 197 YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC----AKGANPLYKPRMGN- 251
Y GLYFT + +G+P + +Y+ +DTGSD+ WI C CS+C G + G+
Sbjct: 78 YFVGLYFTKVKLGSPAKEFYVQIDTGSDILWINC-ITCSNCPHSSGLGIELDFFDTAGSS 136
Query: 252 ---ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGS 307
++ D +C + QC Y +Y D S + G D ++ T+ G
Sbjct: 137 TAALVSCGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQ 196
Query: 308 LTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
N ++FGC+ Q G L T DGI G +S+ SQL+S+G+ V HCL
Sbjct: 197 SVVANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCL 256
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPL--NLGARNSQ 421
GGG + LG L PS + + P++ S + + + +N P+ N+ A +
Sbjct: 257 KGGENGGGVLVLGEILEPS--IVYSPLVPSQPHYNLNLQSIAVNGQLLPIDSNVFATTNN 314
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
G + D+G++ Y ++AY+ + ++ S S P + + S+ D+
Sbjct: 315 QG-TIVDSGTTLAYLVQEAYNPFVKAITAAVS-----QFSKPIISKGNQCYLVSNSVGDI 368
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLV----ISKKGNICLGILDGSEVHNGSTIIL 537
F ++L+F +V ++PE YL+ + C+G +V G T IL
Sbjct: 369 ---FPQVSLNFMGGASMV-----LNPEHYLMHYGFLDGAAMWCIGF---QKVEQGFT-IL 416
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHC 563
GD+ L+ ++ VYD N+RIGWA C
Sbjct: 417 GDLVLKDKIFVYDLANQRIGWADYDC 442
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 110/391 (28%), Positives = 177/391 (45%), Gaps = 45/391 (11%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
S++ L ++ +G Y T + +G PP+ + L +DTGS +T++ C + C C + +P +
Sbjct: 67 SNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPC-STCEQCGRHQDPKF 125
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCET---CQ----QCDYEIEYADHSSSMGVLARDE 298
P E +KP C C QC YE +YA+ S+S GVL D
Sbjct: 126 DP--------------ESSSTYKPIKCNIDCICDSDGVQCVYERQYAEMSTSSGVLGEDV 171
Query: 299 LHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNV 358
+ L VFGC + G L + + DGI+GL +SL QL +G I +
Sbjct: 172 ISFG-NQSELIPQRAVFGCENMETGDLFSQ--RADGIMGLGTGDLSLVDQLVEKGAINDS 228
Query: 359 VGHCLTTNAGGGGYMFLGHDLVPS-WGMAWVPMLDSPFMELYHTEILKINYGSS--PLNL 415
C GGG M LG PS + + SP+ Y+ ++ +I+ PL+
Sbjct: 229 FSLCYGGMDIGGGAMVLGGISPPSDMIFTYSDPVRSPY---YNVDLKEIHVAGKKLPLSS 285
Query: 416 GARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP-VCWRAKFP 474
G + + G A+ D+G++Y Y +A+S ++ + +D DP +C+
Sbjct: 286 GIFDGRYG-AVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAG- 343
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNG 532
++ F T+ + F + K ++PE Y K G CLGI + N
Sbjct: 344 -SDAAELSNKFPTVDMVFEN-----GQKLSLTPENYFFRHSKVHGAYCLGIFENG---ND 394
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
T +LG I +R LV+YD N +IG+ K++C
Sbjct: 395 QTTLLGGIVVRNTLVMYDRANSKIGFWKTNC 425
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 110/391 (28%), Positives = 177/391 (45%), Gaps = 45/391 (11%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
S++ L ++ +G Y T + +G PP+ + L +DTGS +T++ C + C C + +P +
Sbjct: 67 SNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPC-STCEQCGRHQDPKF 125
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYCET---CQ----QCDYEIEYADHSSSMGVLARDE 298
P E +KP C C QC YE +YA+ S+S GVL D
Sbjct: 126 DP--------------ESSSTYKPIKCNIDCICDSDGVQCVYERQYAEMSTSSGVLGEDV 171
Query: 299 LHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNV 358
+ L VFGC + G L + + DGI+GL +SL QL +G I +
Sbjct: 172 ISFG-NQSELIPQRAVFGCENMETGDLFSQ--RADGIMGLGTGDLSLVDQLVEKGAINDS 228
Query: 359 VGHCLTTNAGGGGYMFLGHDLVPS-WGMAWVPMLDSPFMELYHTEILKINYGSS--PLNL 415
C GGG M LG PS + + SP+ Y+ ++ +I+ PL+
Sbjct: 229 FSLCYGGMDIGGGAMVLGGISPPSDMIFTYSDPVRSPY---YNVDLKEIHVAGKKLPLSS 285
Query: 416 GARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP-VCWRAKFP 474
G + + G A+ D+G++Y Y +A+S ++ + +D DP +C+
Sbjct: 286 GIFDGRYG-AVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAG- 343
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNG 532
++ F T+ + F + K ++PE Y K G CLGI + N
Sbjct: 344 -SDAAELSNKFPTVDMVFEN-----GQKLSLTPENYFFRHSKVHGAYCLGIFENG---ND 394
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
T +LG I +R LV+YD N +IG+ K++C
Sbjct: 395 QTTLLGGIVVRNTLVMYDRANSKIGFWKTNC 425
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 106/395 (26%), Positives = 176/395 (44%), Gaps = 65/395 (16%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS- 258
GLYFT + +G PP + + +DTGSD+ W+ C++ C+ C + + +G L + D+
Sbjct: 77 GLYFTKVKLGTPPMEFTVQIDTGSDILWVNCNS-CNGCPRSSG------LGIQLNFFDAS 129
Query: 259 -------------LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN 305
+C + QC Y +Y D S + G + ++ +
Sbjct: 130 SSSSSSLVSCSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVM 189
Query: 306 GSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
G N VVFGC+ Q G L + DGI G +S+ SQL+++GI V H
Sbjct: 190 GQSMIANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSH 249
Query: 362 CLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDS-PFMELYHTEILKINYGSSPLNLGARNS 420
CL GGG + LG L P G+ + P++ S P LY I +N + P++ +
Sbjct: 250 CLKGEGNGGGILVLGEVLEP--GIVYSPLVPSQPHYNLYLQSI-SVNGQTLPIDPSVFAT 306
Query: 421 QVGWA-LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP---VCWRAKFPIR 476
+ + D+G++ Y ++AY+ ++++ S + PT+ C+
Sbjct: 307 SINRGTIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVT-----PTISKGNQCYLVS---- 357
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS--------E 528
V + F ++L+F +V + PE YL + LG DG+ +
Sbjct: 358 --TSVGEIFPLVSLNFAGSASMV-----LKPEEYL-------MHLGFYDGAALWCIGFQK 403
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
V G T ILGD+ ++ ++ VYD +RIGWA C
Sbjct: 404 VQEGVT-ILGDLVMKDKIFVYDLARQRIGWASYDC 437
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 114/387 (29%), Positives = 182/387 (47%), Gaps = 43/387 (11%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNI 252
LYFT + +GNP + Y + +DTGSD+ W+ C PCS C + + +Y PR ++
Sbjct: 1 LYFTQVGLGNPVKHYIVQVDTGSDVLWVNC-RPCSGCPRKSALNIPLTMYDPRESSTTSL 59
Query: 253 LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTI--ENG-SLT 309
+ D LC+ +R + + C+Y Y D S+S G RD + + NG + T
Sbjct: 60 VSCSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANT 119
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG 369
V+FGC+ Q G L + DGI+G + ++S+P+QLA+Q I V HCL G
Sbjct: 120 TSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRG 179
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARN---SQVGWAL 426
GG + +G P GM + P++ P Y+ + I+ S+ L + A + + +
Sbjct: 180 GGILVIGGIAEP--GMTYTPLV--PDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVI 235
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
D+G++ YF AY+ + +++E +S V T + + F
Sbjct: 236 MDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGR--------LSDLFP 287
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLV---ISKKGNI---CLGILDGSEV---HNGSTI-I 536
+TL+F + P+ YL+ + G C+G S +GS + I
Sbjct: 288 NVTLNFEGG------AMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTI 341
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
LGDI L+ +LVVYD N RIGW +C
Sbjct: 342 LGDIVLKDKLVVYDLDNSRIGWMSYNC 368
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 180/383 (46%), Gaps = 32/383 (8%)
Query: 187 SSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYK 246
S+ L ++ G Y + + +G PP + L +DTGS +T++ C + C+ C +P +
Sbjct: 20 SARMDLHDDLLTKGYYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSS-CTHCGNHQDPRFS 78
Query: 247 PRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENG 306
P + + YK +E G+C+ ++ Y+ +YA+ S+S GVL +D + + +
Sbjct: 79 PALSS--SYKP---LECGSECSTGFCDGSRK--YQRQYAEKSTSSGVLGKDVIGFS-NSS 130
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
L +VFGC + G L + DGI+GL R +S+ QL + +++V C
Sbjct: 131 DLGGQRLVFGCETAETGDLYDQ--TADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGM 188
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGAR--NSQVG 423
GGG M LG P M + P Y+ +LK I G SPL L + + G
Sbjct: 189 DEGGGAMILG-GFQPPKDMVFTA--SDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGKYG 245
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKE-VSSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
L D+G++Y YF A+ +++KE V S V + +C+ ++ ++
Sbjct: 246 TVL-DSGTTYAYFPGAAFQAFKSAVKEQVGSLKEVPGPDEKFKDICYAGAG--TNVSNLS 302
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILGDI 540
QFF ++ FG + +SPE YL K G CLG+ + + T +LG I
Sbjct: 303 QFFPSVDFVFGDGQSVT-----LSPENYLFRHTKISGAYCLGVFENGD----PTTLLGGI 353
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+R LV Y+ IG+ K+ C
Sbjct: 354 IVRNMLVTYNRGKASIGFLKTKC 376
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 119/426 (27%), Positives = 190/426 (44%), Gaps = 50/426 (11%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTG 222
+R H ++ + + S A AVD PL GN P GLYFT + +G P + YY+ +DTG
Sbjct: 49 LRAHDARRHGR---SLAAAVD----LPLGGNGLPTETGLYFTQIGIGTPAKSYYVQVDTG 101
Query: 223 SDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCET 274
SD+ W+ C C +C + + LY P G + C+ P C
Sbjct: 102 SDILWVNC-VFCDTCPRKSGLGIELTLYDPSGSSSGTGVTCGQDFCVATHGGVIPS-CVP 159
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENG----SLTKPNVVFGCAYDQQGLLLNTLV 330
C Y I Y D SS+ G D L +G +L ++ FGC G L ++
Sbjct: 160 AAPCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQTTLANTSITFGCGAKIGGDLGSSSQ 219
Query: 331 KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM 390
DGILG ++ S+ SQLA+ G ++ V HCL T GGG +G + P ++ P+
Sbjct: 220 ALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCLDT-INGGGIFAIGDVVQPK--VSTTPL 276
Query: 391 LDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA---LFDTGSSYTYFTKQAYSELIAS 447
+ P M Y+ + I+ G L L +G + + D+G++ Y Y+ +++
Sbjct: 277 V--PGMPHYNVNLEAIDVGGVKLQLPTNIFDIGESKGTIIDSGTTLAYLPGVVYNAIMSK 334
Query: 448 LKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISP 507
+ D + + D C+R V F +T HF + +I P
Sbjct: 335 VFAQYGDMPLKNDQDFQ---CFRYS------GSVDDGFPIITFHFEGGLPL-----NIHP 380
Query: 508 EGYLVISKKGNI-CLGILDGS-EVHNG-STIILGDISLRGQLVVYDNVNKRIGWAKSHCM 564
YL + G + C+G G + +G ++LGD++ +LV+YD N+ IGW +C
Sbjct: 381 HDYLF--QNGELYCMGFQTGGLQTKDGKDMVLLGDLAFSNRLVLYDLENQVIGWTDYNCS 438
Query: 565 NPGRFK 570
+ + K
Sbjct: 439 SSIKIK 444
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 124/430 (28%), Positives = 193/430 (44%), Gaps = 55/430 (12%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTG 222
I+ H S +++S AVD F L GN P GLYFT + +G+P + YY+ +DTG
Sbjct: 38 IKAHDSSRRGRILS----AVD----FNLGGNGLPTVTGLYFTKIGLGSPSKDYYVQVDTG 89
Query: 223 SDLTWIQCDAPCSSCAKGAN-----PLYKPRMGNILPY---KDSLCMEIQRNHKPGYCET 274
SD+ W+ C C+ C + ++ LY P+ + + + C G C+
Sbjct: 90 SDILWVNC-VECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGRILG-CKA 147
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENG----SLTKPNVVFGCAYDQQGLLLNTLV 330
C Y I Y D S++ G +D L NG + +++FGC Q G ++
Sbjct: 148 ENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQNSSIIFGCGAAQSGTFASSSE 207
Query: 331 KT-DGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVP 389
+ DGI+G +A S+ SQLA+ G +K + HCL TN GGG + +G + P + P
Sbjct: 208 EALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFS-IGEVVEPK--VKTTP 264
Query: 390 MLDSPFMELYHTEILKINYGSSPLNL--GARNSQVG-WALFDTGSSYTYFTKQAYSELIA 446
++ P M Y+ + I L L +S+ G + D+G++ Y + Y +L++
Sbjct: 265 LV--PNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQLMS 322
Query: 447 SLKEVSSDGLVLDASDPTLPV--CWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFH 504
+ A P L V + +V F + LHF S
Sbjct: 323 KVL----------AKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKLHFED-----SLSLT 367
Query: 505 ISPEGYLVISKKGNI--CLGIL-DGSEVHNGSTI-ILGDISLRGQLVVYDNVNKRIGWAK 560
+ P YL + KG+ C+G SE NG + +LGD L +LVVYD N IGW
Sbjct: 368 VYPHDYL-FNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTD 426
Query: 561 SHCMNPGRFK 570
+C + + K
Sbjct: 427 YNCSSSIKVK 436
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 105/390 (26%), Positives = 172/390 (44%), Gaps = 48/390 (12%)
Query: 197 YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPRMGN 251
Y GLYFT + +G+PPR + + +DTGSD+ W+ C++ C+ C + + + P +
Sbjct: 81 YLVGLYFTKVKLGSPPREFNVQIDTGSDILWVTCNS-CNDCPRTSGLGIELSFFDPSSSS 139
Query: 252 ILPYKDS---LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGS 307
+C + + QC Y Y D S + G D L+ T+ S
Sbjct: 140 TTSLVSCSHPICTSLVQTTAAECSPQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTVLGDS 199
Query: 308 L---TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
L + ++VFGC+ Q G L DGI G + +S+ SQL+S GI V HCL
Sbjct: 200 LIANSSASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCLK 259
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN---LGARNSQ 421
GGG + LG L P+ + + P++ S + + + +N P++ N+Q
Sbjct: 260 GEGDGGGKLVLGEILEPN--IIYSPLVPSQSHYNLNLQSISVNGQLLPIDPAVFATSNNQ 317
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
+ D+G++ TY + AY ++++ S T PV + V
Sbjct: 318 --GTIVDSGTTLTYLVETAYDPFVSAITAT--------VSSSTTPVLSKGNQCYLVSTSV 367
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS--------EVHNGS 533
+ F ++L+F +V + P YL + LG DG+ +V
Sbjct: 368 DEIFPPVSLNFAGGASMV-----LKPGEYL-------MHLGFSDGAAMWCIGFQKVAEPG 415
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
ILGD+ L+ ++ VYD ++RIGWA C
Sbjct: 416 ITILGDLVLKDKIFVYDLAHQRIGWANYDC 445
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 118/391 (30%), Positives = 173/391 (44%), Gaps = 40/391 (10%)
Query: 190 FPLRG-NI-YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPL--- 244
PL G NI Y GLY+T + +G P YY+ +DTGS W+ C C ++ L
Sbjct: 45 LPLGGFNIPYGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQCPHESDILRKL 103
Query: 245 --YKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL- 301
Y PR + + K+ C + +P C +C Y YAD +MG+L D LH
Sbjct: 104 TFYDPR--SSVSSKEVKCDDTICTSRPP-CNMTLRCPYITGYADGGLTMGILFTDLLHYH 160
Query: 302 -TIENGSL--TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNV 358
NG T +V FGC Q G L N+ V DGI+G + + SQLA+ G K +
Sbjct: 161 QLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKI 220
Query: 359 VGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGA 417
HCL + GGG + +G + P + P++ + E+YH LK IN + L L A
Sbjct: 221 FSHCLDSTNGGGIFA-IGEVVEPK--VKTTPIVKNN--EVYHLVNLKSINVAGTTLQLPA 275
Query: 418 R---NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFP 474
++ D+GS+ Y + YSELI L + A P + + F
Sbjct: 276 NIFGTTKTKGTFIDSGSTLVYLPEIIYSELI----------LAVFAKHPDITMGAMYNFQ 325
Query: 475 IRSIV-DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGS 533
+ V F +T HF + + + P YL+ + C G D
Sbjct: 326 CFHFLGSVDDKFPKITFHFENDLTL-----DVYPYDYLLEYEGNQYCFGFQDAGIHGYKD 380
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHCM 564
IILGD+ + ++VVYD + IGW + + M
Sbjct: 381 MIILGDMVISNKVVVYDMEKQAIGWTEHNSM 411
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 111/385 (28%), Positives = 181/385 (47%), Gaps = 33/385 (8%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
+++ PL ++ P G Y T + +G PP+ + L +DTGS LT++ C C C K +P +
Sbjct: 76 ATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCST-CEQCGKHQDPNF 134
Query: 246 KPRMGNILPYKDSLC-MEIQRNHKPGYCET-CQQCDYEIEYADHSSSMGVLARDELHLTI 303
+P + Y+ C ME C++ C Y+ +YA+ SSS GVL D +
Sbjct: 135 QPDWSST--YQPLKCSMECT-------CDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFG- 184
Query: 304 ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
+ L VFGC + G + + + DGI+GL R +S+ QL +G+I N C
Sbjct: 185 KQSELKPQRTVFGCENVETGDIYSQ--RADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCY 242
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSS--PLNLGARNSQ 421
GGG M LG + P GM + D Y+ ++ +I+ P+N + +
Sbjct: 243 GGMDVGGGAMVLG-GISPPAGMVFTHS-DPARSAYYNIDLKEIHIAGKQLPINPMVFDGK 300
Query: 422 VGWALFDTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
G + D+G++Y Y + A+ A +KE++S L+ +C+ +
Sbjct: 301 YG-TILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVG--SDVSQ 357
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILG 538
+ + F + L F + + +SPE YL K G CLGI N T +LG
Sbjct: 358 LSKTFPAVDLVFSN-----GNRLSLSPENYLFQHSKAHGAYCLGIFQNE---NDQTTLLG 409
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHC 563
I +R LV+YD + +IG+ K++C
Sbjct: 410 GIIVRNTLVMYDREHLKIGFWKTNC 434
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 110/395 (27%), Positives = 183/395 (46%), Gaps = 43/395 (10%)
Query: 190 FPLRGNI--YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN---PL 244
FP+ G Y GLYFT +++G+PP+ +Y+ +DTGSD+ W+ C + C+ C + + PL
Sbjct: 69 FPVEGTYDPYRVGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGS-CNGCPQSSGLHIPL 127
Query: 245 --YKP---RMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
+ P +++ D C ++ G QC Y +Y D S + G D L
Sbjct: 128 NFFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLL 187
Query: 300 HL-TIENGSLT--KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK 356
+ I S+T ++VFGC+ Q G L + DGI G + +S+ SQ++SQGI
Sbjct: 188 NFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITP 247
Query: 357 NVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG 416
V HCL + GGGG + LG + + + P++ P Y+ + I+ L +
Sbjct: 248 KVFSHCLKGDGGGGGILVLGE--IVEEDIVYSPLV--PSQPHYNLNLQSISVNGKSLAID 303
Query: 417 ----ARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
A ++ G + D+G++ Y ++AY ++++ E S P+ +
Sbjct: 304 PEVFATSTNRG-TIVDSGTTLAYLAEEAYDPFVSAITEA--------VSQSVRPLLSKGT 354
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV----ISKKGNICLGILDGSE 528
VK F T++L+F ++ PE YL+ I C+G +
Sbjct: 355 QCYLITSSVKGIFPTVSLNFAG-----GVSMNLKPEDYLLQQNSIGDAAVWCIGF---QK 406
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ ILGD+ L+ ++ VYD +RIGWA C
Sbjct: 407 IQGQGITILGDLVLKDKIFVYDLAGQRIGWANYDC 441
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 111/385 (28%), Positives = 181/385 (47%), Gaps = 33/385 (8%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY 245
+++ PL ++ P G Y T + +G PP+ + L +DTGS LT++ C C C K +P +
Sbjct: 76 ATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCST-CEQCGKHQDPNF 134
Query: 246 KPRMGNILPYKDSLC-MEIQRNHKPGYCET-CQQCDYEIEYADHSSSMGVLARDELHLTI 303
+P + Y+ C ME C++ C Y+ +YA+ SSS GVL D +
Sbjct: 135 QPDWSST--YQPLKCSMECT-------CDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFG- 184
Query: 304 ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
+ L VFGC + G + + + DGI+GL R +S+ QL +G+I N C
Sbjct: 185 KQSELKPQRTVFGCENVETGDIYSQ--RADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCY 242
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSS--PLNLGARNSQ 421
GGG M LG + P GM + D Y+ ++ +I+ P+N + +
Sbjct: 243 GGMDVGGGAMVLG-GISPPAGMVFTHS-DPARSAYYNIDLKEIHIAGKQLPINPMVFDGK 300
Query: 422 VGWALFDTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
G + D+G++Y Y + A+ A +KE++S L+ +C+ +
Sbjct: 301 YG-TILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVG--SDVSQ 357
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILG 538
+ + F + L F + + +SPE YL K G CLGI N T +LG
Sbjct: 358 LSKTFPAVDLVFSN-----GNRLSLSPENYLFQHSKAHGAYCLGIFQNE---NDQTTLLG 409
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHC 563
I +R LV+YD + +IG+ K++C
Sbjct: 410 GIIVRNTLVMYDREHLKIGFWKTNC 434
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 110/395 (27%), Positives = 183/395 (46%), Gaps = 43/395 (10%)
Query: 190 FPLRGNI--YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN---PL 244
FP+ G Y GLYFT +++G+PP+ +Y+ +DTGSD+ W+ C + C+ C + + PL
Sbjct: 54 FPVEGTYDPYRVGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGS-CNGCPQSSGLHIPL 112
Query: 245 --YKP---RMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
+ P +++ D C ++ G QC Y +Y D S + G D L
Sbjct: 113 NFFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLL 172
Query: 300 HL-TIENGSLTK--PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK 356
+ I S+T ++VFGC+ Q G L + DGI G + +S+ SQ++SQGI
Sbjct: 173 NFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITP 232
Query: 357 NVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG 416
V HCL + GGGG + LG + + + P++ P Y+ + I+ L +
Sbjct: 233 KVFSHCLKGDGGGGGILVLGE--IVEEDIVYSPLV--PSQPHYNLNLQSISVNGKSLAID 288
Query: 417 ----ARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
A ++ G + D+G++ Y ++AY ++++ E S P+ +
Sbjct: 289 PEVFATSTNRG-TIVDSGTTLAYLAEEAYDPFVSAITEA--------VSQSVRPLLSKGT 339
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV----ISKKGNICLGILDGSE 528
VK F T++L+F ++ PE YL+ I C+G +
Sbjct: 340 QCYLITSSVKGIFPTVSLNFAG-----GVSMNLKPEDYLLQQNSIGDAAVWCIGF---QK 391
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ ILGD+ L+ ++ VYD +RIGWA C
Sbjct: 392 IQGQGITILGDLVLKDKIFVYDLAGQRIGWANYDC 426
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 115/394 (29%), Positives = 177/394 (44%), Gaps = 41/394 (10%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
PL G PD GLY+ + +G P + YY+ +DTGSD+ W+ C C C K ++
Sbjct: 64 LPLGGIGRPDILGLYYAKIGIGTPTKDYYVQVDTGSDIMWVNC-IQCRECPKTSSLGIDL 122
Query: 243 PLY---KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
LY + G ++P C EI PG C C Y Y D SS+ G +D +
Sbjct: 123 TLYNINESDTGKLVPCDQEFCYEINGGQLPG-CTANMSCPYLEIYGDGSSTAGYFVKDVV 181
Query: 300 HLTIENGSL--TKPN--VVFGCAYDQQGLLLNTLVKT-DGILGLSRAKVSLPSQLASQGI 354
+G L T N V+FGC Q G L ++ + DGILG ++ S+ SQLA G
Sbjct: 182 QYARVSGDLKTTAANGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGK 241
Query: 355 IKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN 414
+K + HCL GGG +GH + P M P++ P Y+ + + G L+
Sbjct: 242 VKKIFAHCL-DGTNGGGIFVIGHVVQPKVNMT--PLI--PNQPHYNVNMTAVQVGHEFLS 296
Query: 415 LGARNSQVG---WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRA 471
L + G A+ D+G++ Y + Y L++ + D V D C++
Sbjct: 297 LPTDVFEAGDRKGAIIDSGTTLAYLPEMVYKPLVSKIISQQPDLKVHTVRDEY--TCFQY 354
Query: 472 KFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILD-GSEVH 530
+ F +T HF + S + P YL +G C+G + G +
Sbjct: 355 SDSL------DDGFPNVTFHFEN-----SVILKVYPHEYL-FPFEGLWCIGWQNSGVQSR 402
Query: 531 NGSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + +LGD+ L +LV+YD N+ IGW + +C
Sbjct: 403 DRRNMTLLGDLVLSNKLVLYDLENQAIGWTEYNC 436
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 166/385 (43%), Gaps = 41/385 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGA---------NPLYKPRMG 250
GLYFT + +GNP + +++ +DTGSD+ W+ C +PC+ C + NP
Sbjct: 3 GLYFTRVKLGNPAKEFFVQIDTGSDILWVTC-SPCTGCPTSSGLNIQLESFNPDSSSTAS 61
Query: 251 NILPYKDSLCMEIQRNHKPGYCETCQQ----CDYEIEYADHSSSMGVLARDELHLTIENG 306
I D Q C+T C Y Y D S + G D + G
Sbjct: 62 RITCSDDRCTAGFQTGEA--ICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMG 119
Query: 307 SLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC 362
+ N +VFGC+ Q G L DGI G + ++S+ SQL S G+ V HC
Sbjct: 120 NEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHC 179
Query: 363 LTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN---LGARN 419
L + GGG + LG + P G+ + P++ S + E + +N P++ N
Sbjct: 180 LKGSDNGGGILVLGEIVEP--GLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSN 237
Query: 420 SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV 479
+Q + D+G++ Y AY ++++ S S +L F S V
Sbjct: 238 TQ--GTIVDSGTTLAYLADGAYDPFVSAIAAAVS------PSVRSLVSKGSQCFITSSSV 289
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI-ILG 538
D F T+TL+F + + PE YL+ + + G + + G I ILG
Sbjct: 290 DSS--FPTVTLYF-----MGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILG 342
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHC 563
D+ L+ ++ VYD N R+GWA C
Sbjct: 343 DLVLKDKIFVYDLANMRMGWADYDC 367
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 131 bits (330), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 107/387 (27%), Positives = 177/387 (45%), Gaps = 35/387 (9%)
Query: 190 FPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRM 249
FPL+GN GLY+T + +GNP + + +DTGSD+ W++C +PC SC + + +
Sbjct: 71 FPLKGNYSDLGLYYTEIGLGNPVQKLKVIVDTGSDILWVKC-SPCRSCLSKQDIIPPLSI 129
Query: 250 GNILPYKDSLCMEIQRNHKPGYCETCQQ------CDYEIEYADHSSSMGVLARDELHLTI 303
N+ S G C + C Y Y D S+S+G RD++H +
Sbjct: 130 YNLSASSTSSVSSCSDPLCTGEEVVCSRSGNNSACAYVSSYQDKSASVGAYVRDDMHYVL 189
Query: 304 ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
G+ T + FGCA + G DGI+G ++P+Q+A+Q + V HCL
Sbjct: 190 HGGNATTSRIFFGCATNITGSW-----PVDGIMGFGLISKTVPNQIATQRNMSRVFSHCL 244
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG------A 417
GGG + G + + M + P+L+ Y+ ++L I+ S L +
Sbjct: 245 GGEKHGGGILEFG-EAPNTTEMVFTPLLN--VTTHYNVDLLSISVNSKVLPIDPKEFSYV 301
Query: 418 RNSQVGWA-LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIR 476
RNS + D+G+++ T +A L +K +++ L P L F ++
Sbjct: 302 RNSTNNTGVIIDSGTTFVLLTTKANRMLFQEIKSLTTAKL-----GPKLE--GLECFYLK 354
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
S + ++ F +TL F + + P+ YLV+++ G +G T I
Sbjct: 355 SGLTMETSFPNVTLTFSG-----GSTMKLKPDNYLVMAEYKKKRNGYCYAWSSADGLT-I 408
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
G+I L+ +LV YD N+RIGW +C
Sbjct: 409 FGEIVLKDKLVFYDVENRRIGWKGQNC 435
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 171/387 (44%), Gaps = 40/387 (10%)
Query: 190 FPLRG-NI-YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPL--- 244
PL G NI Y GLY+T + +G P YY+ +DTGS W+ C C ++ L
Sbjct: 69 LPLGGFNIPYGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQCPHESDILRKL 127
Query: 245 --YKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL- 301
Y PR + + K+ C + +P C +C Y YAD +MG+L D LH
Sbjct: 128 TFYDPR--SSVSSKEVKCDDTICTSRPP-CNMTLRCPYITGYADGGLTMGILFTDLLHYH 184
Query: 302 -TIENGSL--TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNV 358
NG T +V FGC Q G L N+ V DGI+G + + SQLA+ G K +
Sbjct: 185 QLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKI 244
Query: 359 VGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGA 417
HCL + GGG + +G + P + P++ + E+YH LK IN + L L A
Sbjct: 245 FSHCLDSTNGGGIFA-IGEVVEPK--VKTTPIVKNN--EVYHLVNLKSINVAGTTLQLPA 299
Query: 418 R---NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFP 474
++ D+GS+ Y + YSELI L + A P + + F
Sbjct: 300 NIFGTTKTKGTFIDSGSTLVYLPEIIYSELI----------LAVFAKHPDITMGAMYNFQ 349
Query: 475 IRSIV-DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGS 533
+ V F +T HF + + + P YL+ + C G D
Sbjct: 350 CFHFLGSVDDKFPKITFHFENDLTL-----DVYPYDYLLEYEGNQYCFGFQDAGIHGYKD 404
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAK 560
IILGD+ + ++VVYD + IGW +
Sbjct: 405 MIILGDMVISNKVVVYDMEKQAIGWTE 431
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 171/387 (44%), Gaps = 40/387 (10%)
Query: 190 FPLRG-NI-YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPL--- 244
PL G NI Y GLY+T + +G P YY+ +DTGS W+ C C ++ L
Sbjct: 45 LPLGGFNIPYGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQCPHESDILRKL 103
Query: 245 --YKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL- 301
Y PR + + K+ C + +P C +C Y YAD +MG+L D LH
Sbjct: 104 TFYDPR--SSVSSKEVKCDDTICTSRPP-CNMTLRCPYITGYADGGLTMGILFTDLLHYH 160
Query: 302 -TIENGSL--TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNV 358
NG T +V FGC Q G L N+ V DGI+G + + SQLA+ G K +
Sbjct: 161 QLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKI 220
Query: 359 VGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGA 417
HCL + GGG + +G + P + P++ + E+YH LK IN + L L A
Sbjct: 221 FSHCLDSTNGGGIFA-IGEVVEPK--VKTTPIVKNN--EVYHLVNLKSINVAGTTLQLPA 275
Query: 418 R---NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFP 474
++ D+GS+ Y + YSELI L + A P + + F
Sbjct: 276 NIFGTTKTKGTFIDSGSTLVYLPEIIYSELI----------LAVFAKHPDITMGAMYNFQ 325
Query: 475 IRSIV-DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGS 533
+ V F +T HF + + + P YL+ + C G D
Sbjct: 326 CFHFLGSVDDKFPKITFHFENDLTL-----DVYPYDYLLEYEGNQYCFGFQDAGIHGYKD 380
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAK 560
IILGD+ + ++VVYD + IGW +
Sbjct: 381 MIILGDMVISNKVVVYDMEKQAIGWTE 407
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 110/398 (27%), Positives = 183/398 (45%), Gaps = 48/398 (12%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
F ++G P+ GLY+T + +G PP+ + + +DTGSD+ W+ C+ CS+C + +
Sbjct: 64 FSVQGTSDPNSVGLYYTKVKMGTPPKEFNVQIDTGSDILWVNCNT-CSNCPQSSQLGIEL 122
Query: 243 ---PLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
++P D +C + QC Y +Y D S + G D +
Sbjct: 123 NFFDTVGSSTAALIPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAM 182
Query: 300 HLTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII 355
+ ++ G N +VFGC+ Q G L T DGI G +S+ SQL+S+GI
Sbjct: 183 YFSLIMGQPPAVNSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGIT 242
Query: 356 KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN- 414
V HCL + GGG + LG L PS + + P++ S + + + +N P+N
Sbjct: 243 PKVFSHCLKGDGDGGGVLVLGEILEPS--IVYSPLVPSQPHYNLNLQSIAVNGQLLPINP 300
Query: 415 -LGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKF 473
+ + ++ G + D G++ Y ++AY L+ ++ S S C+
Sbjct: 301 AVFSISNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQ--CYLVST 358
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEV---- 529
I I F +++L+F +V + PE YL+ + G LDG+E+
Sbjct: 359 SIGDI------FPSVSLNFEGGASMV-----LKPEQYLMHN-------GYLDGAEMWCIG 400
Query: 530 ----HNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G++ ILGD+ L+ ++VVYD +RIGWA C
Sbjct: 401 FQKFQEGAS-ILGDLVLKDKIVVYDIAQQRIGWANYDC 437
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 109/380 (28%), Positives = 162/380 (42%), Gaps = 44/380 (11%)
Query: 193 RGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI 252
RG G Y + +G P + Y + DTGSDL+W+QC PC+ C + +PL+ P + +
Sbjct: 140 RGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCK-PCADCYEQQDPLFDPSLSST 198
Query: 253 LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPN 312
E Q G C + +C YE++Y D S + G L RD L L+ + T P
Sbjct: 199 YAAVACGAPECQELDASG-CSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD---TLPG 254
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY 372
VFGC GL + DG+ GL R KVSLPSQ A +CL +++ G GY
Sbjct: 255 FVFGCGDQNAGL----FGQVDGLFGLGREKVSLPSQGAPS--YGPGFTYCLPSSSSGRGY 308
Query: 373 MFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL-GARNSQVGWALFDTGS 431
+ LG P + + D Y+ +++ I G + + + G + D+G+
Sbjct: 309 LSLGG--APPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGT 366
Query: 432 SYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK----- 486
T +AY+ L A+ + K P SI+D F
Sbjct: 367 VITRLPPRAYAPLRAAFAR---------------SMAQYKKAPALSILDTCYDFTGHRTA 411
Query: 487 ---TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
T+ L F + VS F G L +SK CL ++ + S ILG+ +
Sbjct: 412 QIPTVELAF-AGGATVSLDF----TGVLYVSKVSQACLAFAPNAD--DSSIAILGNTQQK 464
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
V YD N+RIG+ C
Sbjct: 465 TFAVAYDVANQRIGFGAKGC 484
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 109/380 (28%), Positives = 162/380 (42%), Gaps = 44/380 (11%)
Query: 193 RGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI 252
RG G Y + +G P + Y + DTGSDL+W+QC PC+ C + +PL+ P + +
Sbjct: 140 RGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCK-PCADCYEQQDPLFDPSLSST 198
Query: 253 LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPN 312
E Q G C + +C YE++Y D S + G L RD L L+ + T P
Sbjct: 199 YAAVACGAPECQELDASG-CSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD---TLPG 254
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY 372
VFGC GL + DG+ GL R KVSLPSQ A +CL +++ G GY
Sbjct: 255 FVFGCGDQNAGL----FGQVDGLFGLGREKVSLPSQGAPS--YGPGFTYCLPSSSSGRGY 308
Query: 373 MFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL-GARNSQVGWALFDTGS 431
+ LG P + + D Y+ +++ I G + + + G + D+G+
Sbjct: 309 LSLGG--APPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGT 366
Query: 432 SYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK----- 486
T +AY+ L A+ + K P SI+D F
Sbjct: 367 VITRLPPRAYAPLRAAFAR---------------SMAQYKKAPALSILDTCYDFTGHRTA 411
Query: 487 ---TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
T+ L F + VS F G L +SK CL ++ + S ILG+ +
Sbjct: 412 QIPTVELAF-AGGATVSLDF----TGVLYVSKVSQACLAFAPNAD--DSSIAILGNTQQK 464
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
V YD N+RIG+ C
Sbjct: 465 TFAVTYDVANQRIGFGAKGC 484
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 119/422 (28%), Positives = 184/422 (43%), Gaps = 49/422 (11%)
Query: 164 IIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGS 223
+ R K+ I + ++ S+ L ++ G Y + + +G PP + L +DTGS
Sbjct: 2 LTRSKKNDIVDRRFERRGRKLEESARMTLHDDLLTKGYYTSRVFIGTPPNEFALIVDTGS 61
Query: 224 DLTWIQCDAPCSSCAKGA--------------NPLYKPRMGNILPYKDSLCMEIQRNHKP 269
+T++ PCSSC +P +KP N Y+ C +
Sbjct: 62 TVTYV----PCSSCTHCGHHQASFSTHRLFCRDPRFKPE--NSSSYQKIGCR--SSDCIT 113
Query: 270 GYCET-CQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNT 328
G C++ QC YE YA+ S+S GVL +D L + L + FGC + G L
Sbjct: 114 GLCDSNSHQCKYERMYAEMSTSKGVLGKDLLDFGPAS-RLQSQLLSFGCETAESGDLY-- 170
Query: 329 LVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWV 388
L DGI+GL R +S+ QL G I++ C GGG M LG PS GM +
Sbjct: 171 LQVADGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDEGGGSMVLGAIPAPS-GMVFA 229
Query: 389 PMLDSPFMELYHTEILKINYGSSPLNLGAR--NSQVGWALFDTGSSYTYFTKQAYSELIA 446
D Y+ E+ +I + L L + N + G + D+G++Y Y +A+
Sbjct: 230 KS-DPRRSNYYNLELTEIQVQGASLKLDSNVFNGKFG-TILDSGTTYAYLPDRAFEAFTD 287
Query: 447 SLKEVSSDGLVLDASDPTLP-VCWRAKFPIRSIVDVKQFFKTLTL--HFGSKWQIVSTKF 503
++ +D DP P +C+ + D K+ K L ++ Q VS
Sbjct: 288 AVVAQLGSLQAVDGPDPNYPDICYAG-----AGTDTKELGKHFPLVDFVFAENQKVS--- 339
Query: 504 HISPEGYLVISKK--GNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKS 561
++PE YL K G CLG + +T +LG I +R LV YD N +IG+ K+
Sbjct: 340 -LAPENYLFKHTKVPGAYCLGFFKNQD----ATTLLGGIIVRNMLVTYDRYNHQIGFLKT 394
Query: 562 HC 563
+C
Sbjct: 395 NC 396
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 124/435 (28%), Positives = 188/435 (43%), Gaps = 58/435 (13%)
Query: 161 NDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYP--DGLYFTYMIVGNPPRPYYLD 218
N I+ H + + +S VA L GN P +GLY+T + +G P+ YY+
Sbjct: 41 NLAAIKAHDAGRRGRFLSVVDVA--------LGGNGRPTSNGLYYTKIGLG--PKDYYVQ 90
Query: 219 MDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPRMGNI---LPYKDSLCMEIQRNHKPG 270
+DTGSD W+ C C++C K + LY P + +P D C G
Sbjct: 91 VDTGSDTLWVNC-VGCTACPKKSGLGMDLTLYDPNLSKTSKAVPCDDEFCTSTYDGQISG 149
Query: 271 YCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSL-TKPN---VVFGCAYDQQGLLL 326
C C Y I Y D S++ G +D+L G L T P+ V+FGC Q G L
Sbjct: 150 -CTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFGCGSKQSGTLS 208
Query: 327 NTL-VKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGM 385
+T DGI+G +A S+ SQLA+ G +K + HCL + +GGG + +G + P +
Sbjct: 209 STTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLDSISGGGIFA-IGEVVQPK--V 265
Query: 386 AWVPMLDSPFMELYHTEILKINYGSSPLNLGA---RNSQVGWALFDTGSSYTYFTKQAYS 442
P+L M Y+ + I P+ L + +S + D+G++ Y Y
Sbjct: 266 KTTPLLQG--MAHYNVVLKDIEVAGDPIQLPSDILDSSSGRGTIIDSGTTLAYLPVSIYD 323
Query: 443 ELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTK 502
+L+ + S G+ L + F V F T+ F + +
Sbjct: 324 QLLEKILAQRS-GMKLYLVEDQFTC-----FHYSDEESVDDLFPTVKFTFEEGLTLTT-- 375
Query: 503 FHISPEGYLVISKKGNICLG-------ILDGSEVHNGSTIILGDISLRGQLVVYDNVNKR 555
P YL + K+ C+G DG E+ I+LGD+ L +LVVYD N
Sbjct: 376 ---YPRDYLFLFKEDMWCVGWQKSMAQTKDGKEL-----ILLGDLVLANKLVVYDLDNMA 427
Query: 556 IGWAKSHCMNPGRFK 570
IGWA +C + + K
Sbjct: 428 IGWADYNCSSSIKVK 442
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 121/420 (28%), Positives = 194/420 (46%), Gaps = 52/420 (12%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGN--IYPDGLYFTYMIVGNPPRPYYLDMDTG 222
+R H + + +L++ A+D PL G+ GLYFT + +G P + YY+ +DTG
Sbjct: 59 LREHDGRRHGRLLA----AID----LPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTG 110
Query: 223 SDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCET 274
SD+ W+ C C C + +N +Y PR G ++ C+ P C +
Sbjct: 111 SDILWVNC-VSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCVANYGGVLPS-CTS 168
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENG-SLTKP---NVVFGCAYDQQGLLLNTLV 330
C+Y I Y D SS+ G D L +G T P +V FGC G L ++ +
Sbjct: 169 TSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNL 228
Query: 331 KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM 390
DGILG ++ S+ SQLA+ G ++ + HCL T GGG +G+ + P + P+
Sbjct: 229 ALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDT-VNGGGIFAIGNVVQPK--VKTTPL 285
Query: 391 LDSPFMELYHTEILKINYGSSPLNL-----GARNSQVGWALFDTGSSYTYFTKQAYSELI 445
+ P M Y+ + I+ G + L L + NS+ + D+G++ Y + Y L
Sbjct: 286 V--PDMPHYNVILKGIDVGGTALGLPTNIFDSGNSK--GTIIDSGTTLAYVPEGVYKALF 341
Query: 446 ASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHI 505
A + + D V D + C++ V F +T HF ++ +
Sbjct: 342 AMVFDKHQDISVQTLQDFS---CFQYSGS------VDDGFPEVTFHFEGDVSLI-----V 387
Query: 506 SPEGYLVISKKGNICLGILD-GSEVHNGS-TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
SP YL + K C+G + G + +G +LGD+ L +LV+YD N+ IGWA +C
Sbjct: 388 SPHDYLFQNGKNLYCMGFQNGGGKTKDGKDLGLLGDLVLSNKLVLYDLENQAIGWADYNC 447
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 129 bits (324), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 111/402 (27%), Positives = 186/402 (46%), Gaps = 39/402 (9%)
Query: 190 FPLRGNIYP--DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
L GN +P GLY+ + +G+PP +++ +DTGSD+ W+ C CS+C K ++
Sbjct: 59 LELGGNGHPAETGLYYARIGIGSPPNDFHVQVDTGSDILWVNC-VGCSNCPKKSDIGVDL 117
Query: 243 PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
LY P+ ++ C PG C+ C Y++ Y D S++ G D +
Sbjct: 118 QLYNPKSSSTSTLITCDQPFCSATYDAPIPG-CKPDLLCQYKVIYGDGSATAGYFVNDYI 176
Query: 300 HLTIENG----SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII 355
L G S T ++VFGC Q G L ++ DGILG +A S+ SQLA+ G +
Sbjct: 177 QLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKV 236
Query: 356 KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL 415
K + HCL + +GGG + +G + P + P++ P Y+ + + G + L+L
Sbjct: 237 KKIFAHCLDSISGGGIFA-IGEVVEPK--LXNTPVV--PNQAHYNVVLNGVKVGDTALDL 291
Query: 416 GARNSQVGW---ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
+ + A+ D+G++ Y + Y L+ + D L L D K
Sbjct: 292 PLGLFETSYKRGAIIDSGTTLAYLPESIYLPLMEKILGAQPD-LKLRTVDDQFTCFVFDK 350
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILD-GSEVHN 531
+V F T+T F S I P YL + C+G + G++ +
Sbjct: 351 -------NVDDGFPTVTFKFEE-----SLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKD 398
Query: 532 GSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSL 572
G+ + +LGD+ L+ +LV Y+ N+ IGW + +C + + K +
Sbjct: 399 GNEVTLLGDLVLQNKLVYYNLENQTIGWTEYNCSSGIKLKDV 440
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 129 bits (324), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 118/405 (29%), Positives = 182/405 (44%), Gaps = 49/405 (12%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
PL G PD GLY+ + +G P + YY+ +DTGSD+ W+ C C C + +
Sbjct: 66 LPLGGTGRPDIPGLYYAKIGIGTPAKSYYVQVDTGSDIMWVNC-IQCKQCPRRSTLGIEL 124
Query: 243 PLY---KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
LY + G ++ D C +I G C+ C Y Y D SS+ G +D +
Sbjct: 125 TLYNIDESDSGKLVSCDDDFCYQISGGPLSG-CKANMSCPYLEIYGDGSSTAGYFVKDVV 183
Query: 300 HLTIENGSL----TKPNVVFGCAYDQQGLLLNTLVKT-DGILGLSRAKVSLPSQLASQGI 354
G L +V+FGC Q G L ++ + DGILG +A S+ SQLAS G
Sbjct: 184 QYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGR 243
Query: 355 IKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN 414
+K + HCL GGG + +G + P M P++ P Y+ + + G LN
Sbjct: 244 VKKIFAHCLDGRNGGGIFA-IGRVVQPKVNMT--PLV--PNQPHYNVNMTAVQVGQEFLN 298
Query: 415 LGARNSQVG---WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRA 471
+ A Q G A+ D+G++ Y + Y L+ K+++S +P L V
Sbjct: 299 IPADLFQPGDRKGAIIDSGTTLAYLPEIIYEPLV---KKITS-------QEPALKVHIVD 348
Query: 472 K----FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS 527
K F VD + F +T HF + S + P YL +G C+G + +
Sbjct: 349 KDYKCFQYSGRVD--EGFPNVTFHFEN-----SVFLRVYPHDYL-FPYEGMWCIGWQNSA 400
Query: 528 --EVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFK 570
+ +LGD+ L +LV+YD N+ IGW + +C + + K
Sbjct: 401 MQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCSSSIKVK 445
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 110/383 (28%), Positives = 179/383 (46%), Gaps = 42/383 (10%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
L ++ +G Y T + +G PP+ + L +DTGS +T++ C + C C K +P ++P +
Sbjct: 78 LYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPC-SDCEHCGKHQDPRFQPDESS 136
Query: 252 ILPYKDSLC-MEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
Y C M+ +H C YE YA+ SSS GVL D ++ N S
Sbjct: 137 T--YHPVKCNMDCNCDHD------GVNCVYERRYAEMSSSSGVLGEDI--ISFGNQSEVV 186
Query: 311 PN-VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG 369
P VFGC + G L + + DGI+GL R ++S+ QL + +I + C G
Sbjct: 187 PQRAVFGCENVETGDLYSQ--RADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVG 244
Query: 370 GGYMFLGH-----DLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARN-SQVG 423
GG M LG D+V S + SP+ Y+ E+ +I+ PL L +
Sbjct: 245 GGAMVLGGIPPPPDMVFSRSDPY----RSPY---YNIELKEIHVAGKPLKLSPSTFDRKH 297
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSIVDVK 482
+ D+G++Y Y ++A+ ++ + S + + DP +C+ R + +
Sbjct: 298 GTVLDSGTTYAYLPEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAG--RDVSQLS 355
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILGDI 540
+ F + + F + K ++PE YL K G CLGI + ST +LG I
Sbjct: 356 KAFPEVDMVFSN-----GQKLSLTPENYLFQHTKVHGAYCLGIFRNGD----STTLLGGI 406
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+R LV YD N++IG+ K++C
Sbjct: 407 IVRNTLVTYDRENEKIGFWKTNC 429
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 111/402 (27%), Positives = 185/402 (46%), Gaps = 39/402 (9%)
Query: 190 FPLRGNIYP--DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
L GN +P GLY+ + +G+PP +++ +DTGSD+ W+ C CS+C K ++
Sbjct: 59 LELGGNGHPAETGLYYARIGIGSPPNDFHVQVDTGSDILWVNC-VGCSNCPKKSDIGVDL 117
Query: 243 PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
LY P+ ++ C PG C+ C Y++ Y D S++ G D +
Sbjct: 118 QLYNPKSSSTSTLITCDQPFCSATYDAPIPG-CKPDLLCQYKVIYGDGSATAGYFVNDYI 176
Query: 300 HLTIENG----SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII 355
L G S T ++VFGC Q G L ++ DGILG +A S+ SQLA+ G +
Sbjct: 177 QLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKV 236
Query: 356 KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL 415
K + HCL + +GGG + +G + P + P++ P Y+ + + G + L+L
Sbjct: 237 KKIFAHCLDSISGGGIFA-IGEVVEPK--LKTTPVV--PNQAHYNVVLNGVKVGDTALDL 291
Query: 416 GARNSQVGW---ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
+ + A+ D+G++ Y Y L+ + D L L D K
Sbjct: 292 PLGLFETSYKRGAIIDSGTTLAYLPDSIYLPLMEKILGAQPD-LKLRTVDDQFTCFVFDK 350
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILD-GSEVHN 531
+V F T+T F S I P YL + C+G + G++ +
Sbjct: 351 -------NVDDGFPTVTFKFEE-----SLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKD 398
Query: 532 GSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSL 572
G+ + +LGD+ L+ +LV Y+ N+ IGW + +C + + K +
Sbjct: 399 GNEVTLLGDLVLQNKLVYYNLENQTIGWTEYNCSSGIKLKDV 440
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 115/417 (27%), Positives = 185/417 (44%), Gaps = 46/417 (11%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTG 222
+R H + +L+ AVD PL G P GLY+T + +G+P + YY+ +DTG
Sbjct: 54 LRRHDVGRHGRLLG----AVD----LPLGGVGLPTATGLYYTQIEIGSPSKGYYVQVDTG 105
Query: 223 SDLTWIQCDAPCSSCAKGAN-----PLYKPR-MGNILPYKDSLCMEIQRNHKPGYC-ETC 275
SD+ W+ C C C + Y P G + C+ N P C T
Sbjct: 106 SDILWVNC-IRCDGCPTTSGLGIELTQYDPAGSGTTVGCDQEFCVANSPNGLPPACPSTS 164
Query: 276 QQCDYEIEYADHSSSMGVLARDELHL--TIENGSLTKPN--VVFGCAYDQQGLLLNTLVK 331
C + I Y D SS+ G D + NG T N + FGC G L ++
Sbjct: 165 SPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQTTPSNASITFGCGAQLGGDLGSSSQA 224
Query: 332 TDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPML 391
DGILG +A S+ SQLA+ ++ + HCL T GGG +G+ + P + P++
Sbjct: 225 LDGILGFGQADSSMLSQLAAARKVRKIFAHCLDT-VHGGGIFAIGNVVQPK--VKTTPLV 281
Query: 392 DSPFMELYHTEILKINYGSSPLNLGARNSQVG---WALFDTGSSYTYFTKQAYSELIASL 448
+ + Y+ + I+ G + L L + G + D+G++ Y ++ Y L+ ++
Sbjct: 282 QN--VTHYNVNLQGISVGGATLQLPSSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLTAV 339
Query: 449 KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPE 508
+ D + + D VC++ I F +T F + ++ P
Sbjct: 340 FDKYQDLALHNYQD---FVCFQFSGSI------DDGFPVVTFSFEGE-----ITLNVYPH 385
Query: 509 GYLVISKKGNICLGILDGS-EVHNG-STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
YL ++ C+G LDG + +G ++LGD+ L +LVVYD + IGWA +C
Sbjct: 386 DYLFQNENDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVVYDLEKQVIGWADYNC 442
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 104/380 (27%), Positives = 174/380 (45%), Gaps = 36/380 (9%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPR-MGNIL 253
GLY+T + +G+PP+ YY+ +DTGSD+ W+ C C C + Y P G +
Sbjct: 82 GLYYTRIEIGSPPKGYYVQVDTGSDILWVNC-IRCDGCPTRSGLGIELTQYDPAGSGTTV 140
Query: 254 PYKDSLCMEIQRNHKPGYC-ETCQQCDYEIEYADHSSSMGVLARD--ELHLTIENGSLTK 310
+ C+ P C T C + I Y D S++ G D + + NG T
Sbjct: 141 GCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQTTT 200
Query: 311 PN--VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
N + FGC G L ++ DGILG ++ S+ SQLA+ ++ + HCL T G
Sbjct: 201 SNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRG 260
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG---WA 425
GG + +G+ + P + P++ P + Y+ + I+ G + L L G
Sbjct: 261 GGIFA-IGNVVQPK--VKTTPLV--PNVTHYNVNLQGISVGGATLQLPTSTFDSGDSKGT 315
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
+ D+G++ Y ++ Y L+A++ + D + + D VC++ I F
Sbjct: 316 IIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQD---FVCFQFSGSI------DDGF 366
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS-EVHNG-STIILGDISLR 543
+T F + ++ P+ YL ++ C+G LDG + +G ++LGD+ L
Sbjct: 367 PVITFSFKGDLTL-----NVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLS 421
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
+LVVYD + IGW +C
Sbjct: 422 NKLVVYDLEKEVIGWTDYNC 441
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 117/405 (28%), Positives = 181/405 (44%), Gaps = 49/405 (12%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
PL G PD GLY+ + +G P + YY+ +DTGSD+ W+ C C C + +
Sbjct: 66 LPLGGTGRPDIPGLYYAKIGIGTPAKSYYVQVDTGSDIMWVNC-IQCKQCPRRSTLGIEL 124
Query: 243 PLY---KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
LY + G ++ D C +I G C+ C Y Y D SS+ G +D +
Sbjct: 125 TLYNIDESDSGKLVSCDDDFCYQISGGPLSG-CKANMSCPYLEIYGDGSSTAGYFVKDVV 183
Query: 300 HLTIENGSL----TKPNVVFGCAYDQQGLLLNTLVKT-DGILGLSRAKVSLPSQLASQGI 354
G L +V+FGC Q G L ++ + DGILG +A S+ SQLAS G
Sbjct: 184 QYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGR 243
Query: 355 IKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN 414
+K + HCL GGG + +G + P M P++ P Y+ + + G L
Sbjct: 244 VKKIFAHCLDGRNGGGIFA-IGRVVQPKVNMT--PLV--PNQPHYNVNMTAVQVGQEFLT 298
Query: 415 LGARNSQVG---WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRA 471
+ A Q G A+ D+G++ Y + Y L+ K+++S +P L V
Sbjct: 299 IPADLFQPGDRKGAIIDSGTTLAYLPEIIYEPLV---KKITS-------QEPALKVHIVD 348
Query: 472 K----FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS 527
K F VD + F +T HF + S + P YL +G C+G + +
Sbjct: 349 KDYKCFQYSGRVD--EGFPNVTFHFEN-----SVFLRVYPHDYL-FPHEGMWCIGWQNSA 400
Query: 528 --EVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFK 570
+ +LGD+ L +LV+YD N+ IGW + +C + + K
Sbjct: 401 MQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCSSSIKVK 445
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 104/380 (27%), Positives = 174/380 (45%), Gaps = 36/380 (9%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPR-MGNIL 253
GLY+T + +G+PP+ YY+ +DTGSD+ W+ C C C + Y P G +
Sbjct: 82 GLYYTRIEIGSPPKGYYVQVDTGSDILWVNC-IRCDGCPTRSGLGIELTQYDPAGSGTTV 140
Query: 254 PYKDSLCMEIQRNHKPGYC-ETCQQCDYEIEYADHSSSMGVLARD--ELHLTIENGSLTK 310
+ C+ P C T C + I Y D S++ G D + + NG T
Sbjct: 141 GCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQTTT 200
Query: 311 PN--VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
N + FGC G L ++ DGILG ++ S+ SQLA+ ++ + HCL T G
Sbjct: 201 SNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRG 260
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG---WA 425
GG + +G+ + P + P++ P + Y+ + I+ G + L L G
Sbjct: 261 GGIFA-IGNVVQPK--VKTTPLV--PNVTHYNVNLQGISVGGATLQLPTSTFDSGDSKGT 315
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
+ D+G++ Y ++ Y L+A++ + D + + D VC++ I F
Sbjct: 316 IIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQD---FVCFQFSGSI------DDGF 366
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS-EVHNG-STIILGDISLR 543
+T F + ++ P+ YL ++ C+G LDG + +G ++LGD+ L
Sbjct: 367 PVITFSFEGDLTL-----NVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLS 421
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
+LVVYD + IGW +C
Sbjct: 422 NKLVVYDLEKEVIGWTDYNC 441
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 111/394 (28%), Positives = 177/394 (44%), Gaps = 41/394 (10%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
PL G+ PD GLY+ + +G P + YY+ +DTGSD+ W+ C C C + ++
Sbjct: 72 LPLGGSGRPDTVGLYYAKVGIGTPSKDYYVQVDTGSDIMWVNC-IQCRECPRTSSLGMEL 130
Query: 243 PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
LY + G ++P + C E+ G C C Y Y D SS+ G +D +
Sbjct: 131 TLYNIKDSVSGKLVPCDEEFCYEVNGGPLSG-CTANMSCPYLEIYGDGSSTAGYFVKDVV 189
Query: 300 HLTIENGSL----TKPNVVFGCAYDQQGLLLNTLVKT-DGILGLSRAKVSLPSQLASQGI 354
+G L + +V+FGC Q G L T + DGILG ++ S+ SQLA+
Sbjct: 190 QYDRVSGDLQTTSSNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRK 249
Query: 355 IKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN 414
+K + HCL GGG +GH + P M P++ P Y+ + + G L+
Sbjct: 250 VKKIFAHCL-DGINGGGIFAIGHVVQPKVNMT--PLI--PNQPHYNVNMTAVQVGEDFLH 304
Query: 415 LGARNSQVG---WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRA 471
L + G A+ D+G++ Y + Y L++ + D V D C++
Sbjct: 305 LPTEEFEAGDRKGAIIDSGTTLAYLPEIVYEPLVSKIISQQPDLKVHIVRDEY--TCFQY 362
Query: 472 KFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILD-GSEVH 530
V F +T HF + S + P YL +G C+G + G +
Sbjct: 363 SGS------VDDGFPNVTFHFEN-----SVFLKVHPHEYL-FPFEGLWCIGWQNSGMQSR 410
Query: 531 NGSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + +LGD+ L +LV+YD N+ IGW + +C
Sbjct: 411 DRRNMTLLGDLVLSNKLVLYDLENQAIGWTEYNC 444
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 110/400 (27%), Positives = 176/400 (44%), Gaps = 58/400 (14%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK----GANP 243
F ++G P+ G+Y G + + +DTGSD+ W+ C+ CS+C + G
Sbjct: 60 FSVQGTSDPNSVGMY------GXXXXXFNVQIDTGSDILWVNCNT-CSNCPQSSQLGIEL 112
Query: 244 LYKPRMGN----ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
+ +G+ ++P D +C + QC Y +Y D S + G D +
Sbjct: 113 NFFDTVGSSTAALIPCSDLICTSGVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAM 172
Query: 300 HLTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII 355
+ + G N +VFGC+ Q G L T DGI G +S+ SQL+SQGI
Sbjct: 173 YFNLIMGQPPAVNSTATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGIT 232
Query: 356 KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL 415
V HCL + GGG + LG L PS + + P++ P Y+ + I PL +
Sbjct: 233 PKVFSHCLKGDGNGGGILVLGEILEPS--IVYSPLV--PSQPHYNLNLQSIAVNGQPLPI 288
Query: 416 G----ARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRA 471
+ ++ G + D G++ Y ++AY L+ ++ S S C+
Sbjct: 289 NPAVFSISNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQ--CYLV 346
Query: 472 KFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSE--- 528
I I F ++L+F +V + PE YL+ + G LDG+E
Sbjct: 347 STSIGDI------FPLVSLNFEGGASMV-----LKPEQYLMHN-------GYLDGAEMWC 388
Query: 529 -----VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ G++ ILGD+ L+ ++VVYD +RIGWA C
Sbjct: 389 VGFQKLQEGAS-ILGDLVLKDKIVVYDIAQQRIGWANYDC 427
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 114/395 (28%), Positives = 180/395 (45%), Gaps = 47/395 (11%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTG 222
+R H ++ + +++S AVD PL GN +P GLYF + +G P + YY+ +DTG
Sbjct: 47 LRAHDTRRHGRILS----AVD----LPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTG 98
Query: 223 SDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCET 274
SD+ W+ C A C C ++ LY + + + D+ C + PG C+
Sbjct: 99 SDILWVNC-AGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFC-SLYDGPLPG-CKP 155
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENGSL----TKPNVVFGCAYDQQGLLLNTLV 330
QC Y + Y D SS+ G +D + +G+ T VVFGC Q G L ++
Sbjct: 156 GLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSE 215
Query: 331 KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVP- 389
DGILG +A S+ SQLAS G +K V HCL N GGG +G + P +
Sbjct: 216 ALDGILGFGQANSSMLSQLASSGKVKKVFSHCL-DNVDGGGIFAIGEVVEPKVRFLLMNS 274
Query: 390 -MLDSPFMELYHTEIL--KINYGSSPLNLGARNSQVG---WALFDTGSSYTYFTKQAYSE 443
M+ F+ H ++ +I G PL++ + + G + D+G++ YF ++ Y
Sbjct: 275 VMIVVLFLSRAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTTLAYFPQEVYVP 334
Query: 444 LIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKF 503
LI + ++ D L +A +V F T+TLHF S
Sbjct: 335 LI--------EKILSQQPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHFDK-----SISL 381
Query: 504 HISPEGYLVISKKGNICLGILD-GSEVHNGSTIIL 537
+ P YL K+ C+G + G++ +G + L
Sbjct: 382 TVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTL 416
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 117/395 (29%), Positives = 174/395 (44%), Gaps = 43/395 (10%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
PL G+ PD GLY+ + +G PP+ YYL +DTGSD+ W+ C C C ++
Sbjct: 69 LPLGGSGRPDAVGLYYAKIGIGTPPKNYYLQVDTGSDIMWVNC-IQCKECPTRSSLGMDL 127
Query: 243 PLY---KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
LY + G ++P C EI G C C Y Y D SS+ G +D +
Sbjct: 128 TLYDIKESSSGKLVPCDQEFCKEINGGLLTG-CTANISCPYLEIYGDGSSTAGYFVKDIV 186
Query: 300 HLTIENGSL----TKPNVVFGCAYDQQGLLLNTLVKT-DGILGLSRAKVSLPSQLASQGI 354
+G L ++VFGC Q G L ++ + DGILG +A S+ SQLAS G
Sbjct: 187 LYDQVSGDLKTDSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGK 246
Query: 355 IKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPML-DSPFMELYHTEILKINYGSSPL 413
+K + HCL GGG +GH + P M P+L D P Y + + G + L
Sbjct: 247 VKKMFAHCL-NGVNGGGIFAIGHVVQPKVNMT--PLLPDQPH---YSVNMTAVQVGHTFL 300
Query: 414 NLGARNSQVG---WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWR 470
+L S G + D+G++ Y + Y L+ + D V D C++
Sbjct: 301 SLSTDTSAQGDRKGTIIDSGTTLAYLPEGIYEPLVYKMISQHPDLKVQTLHDEY--TCFQ 358
Query: 471 AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILD-GSEV 529
V F +T F + + P YL S C+G + G++
Sbjct: 359 YS------ESVDDGFPAVTFFFEN-----GLSLKVYPHDYLFPSVN-FWCIGWQNSGTQS 406
Query: 530 HNGSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + +LGD+ L +LV YD N+ IGWA+ +C
Sbjct: 407 RDSKNMTLLGDLVLSNKLVFYDLENQAIGWAEYNC 441
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 115/441 (26%), Positives = 183/441 (41%), Gaps = 37/441 (8%)
Query: 136 VSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGN 195
V DA KL R + + E + +R S + +L+ S V + FP+ G
Sbjct: 24 VCGSDAVLKLERLIPPNHELGLTE-----LRAFDSARHGRLLQSPVGGVVN---FPVDGA 75
Query: 196 IYP--DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPR 248
P GLY+T + +G PPR + + +DTGSD+ W+ C + C+ C K + + P
Sbjct: 76 SDPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTS-CNGCPKTSELQIQLSFFDPG 134
Query: 249 MGNILPYKDSLCMEIQRNHK-PGYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENG 306
+ + N + C C Y +Y D S + G D + T+
Sbjct: 135 VSSSASLVSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITS 194
Query: 307 SL---TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
+L + VFGC+ Q G L DGI GL + +S+ SQLA QG+ V HCL
Sbjct: 195 TLAINSSAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCL 254
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG 423
+ GGG M LG P + P++ S + + + +N P++ G
Sbjct: 255 KGDKSGGGIMVLGQIKRPD--TVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATG 312
Query: 424 -WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
+ DTG++ Y +AYS I ++ S P+ + +
Sbjct: 313 DGTIIDTGTTLAYLPDEAYSPFIQAIANA--------VSQYGRPITYESYQCFEITAGDV 364
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISL 542
F ++L F +V + P YL I + + + + ILGD+ L
Sbjct: 365 DVFPEVSLSFAGGASMV-----LRPHAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVL 419
Query: 543 RGQLVVYDNVNKRIGWAKSHC 563
+ ++VVYD V +RIGWA+ C
Sbjct: 420 KDKVVVYDLVRQRIGWAEYDC 440
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 97/340 (28%), Positives = 151/340 (44%), Gaps = 35/340 (10%)
Query: 190 FPLRGNIYP--DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
FP+ G P GLY+T + +G PPR +Y+ +DTGSD+ W+ C A C+ C + +
Sbjct: 67 FPVDGTFDPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSC-ASCNGCPQTSGLQIQL 125
Query: 243 PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
+ P + + D C ++ G C Y +Y D S + G D L
Sbjct: 126 NFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVL 185
Query: 300 HLTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII 355
+ GS PN VVFGC+ Q G L+ + DGI G + +S+ SQLASQGI
Sbjct: 186 QFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIA 245
Query: 356 KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKI--NYGSSPL 413
V HCL GGGG + LG + P+ M + P++ P Y+ +L I N + P+
Sbjct: 246 PRVFSHCLKGENGGGGILVLGEIVEPN--MVFTPLV--PSQPHYNVNLLSISVNGQALPI 301
Query: 414 NLGARNSQVGWA-LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
N ++ G + DTG++ Y ++ AY + ++ S PV +
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNA--------VSQSVRPVVSKGN 353
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV 512
V F ++L+F ++P+ YL+
Sbjct: 354 QCYVITTSVGDIFPPVSLNFAG-----GASMFLNPQDYLI 388
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 112/383 (29%), Positives = 181/383 (47%), Gaps = 41/383 (10%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
L ++ +G Y T + +G PP+ + L +DTGS +T++ C + C C + +P ++P +
Sbjct: 74 LHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPC-STCEQCGRHQDPKFQPESSS 132
Query: 252 ILPYKDSLCMEIQRNHKPGYCETCQ-QCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
Y+ C I N C++ + QC YE +YA+ S+S GVL D + L
Sbjct: 133 T--YQPVKCT-IDCN-----CDSDRMQCVYERQYAEMSTSSGVLGEDLISFG-NQSELAP 183
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
VFGC + G L + DGI+GL R +S+ QL + +I + C GG
Sbjct: 184 QRAVFGCENVETGDLYSQ--HADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVGG 241
Query: 371 GYMFLGHDLVPS-WGMAWVPMLDSPFMELYHTEILKINYGSS--PLNLGARNSQVGWALF 427
G M LG PS A+ + SP+ Y+ ++ +I+ PLN + + G +
Sbjct: 242 GAMVLGGISPPSDMAFAYSDPVRSPY---YNIDLKEIHVAGKRLPLNANVFDGKHG-TVL 297
Query: 428 DTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSIVDVKQF- 484
D+G++Y Y + A+ A +KE+ S + DP +C+ + +DV Q
Sbjct: 298 DSGTTYAYLPEAAFLAFKDAIVKELQSLKKI-SGPDPNYNDICFSG-----AGIDVSQLS 351
Query: 485 --FKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILGDI 540
F + + F + K+ +SPE Y+ K G CLG+ N T +LG I
Sbjct: 352 KSFPVVDMVFEN-----GQKYTLSPENYMFRHSKVRGAYCLGVFQNG---NDQTTLLGGI 403
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+R LVVYD +IG+ K++C
Sbjct: 404 IVRNTLVVYDREQTKIGFWKTNC 426
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 126 bits (316), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 105/403 (26%), Positives = 178/403 (44%), Gaps = 43/403 (10%)
Query: 187 SSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYK 246
++ PL G + G ++ + +G P R + + +DTGS +T++ PC+SC + P +K
Sbjct: 47 NATLPLHGAVKDYGYFYATLHLGTPARQFAVIVDTGSTITYV----PCASCGRNCGPHHK 102
Query: 247 PRMGNILPYKDSLCMEIQRN-----HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL 301
+ S + + P C ++C Y+ YA+ SSS G+L D+L L
Sbjct: 103 DAAFDPASSSSSAVIGCDSDKCICGRPPCGCSEKRECTYQRTYAEQSSSAGLLVSDQLQL 162
Query: 302 TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
+G++ VVFGC + G + N + DGILGL ++VSL +QLA G+I +V
Sbjct: 163 --RDGAV---EVVFGCETKETGEIYNQ--EADGILGLGNSEVSLVNQLAGSGVIDDVFAL 215
Query: 362 CLTTNAGGGGYMFLGHDLVP-SWGMAWVPMLDS-PFMELYHTEILKINYGSSPLNLGARN 419
C + G G M D + + +L S Y ++ + G L +
Sbjct: 216 CFGSVEGDGALMLGDVDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKPER 275
Query: 420 SQVGWA-LFDTGSSYTYFTKQAYSELIASLKEVSSD-GL-VLDASDPTLP-------VCW 469
+ G+ + D+G+++TY +A+ ++ + + GL + DP +C+
Sbjct: 276 YEEGYGTVLDSGTTFTYLPSEAFQLFKEAVSAYALEHGLNSVKGPDPKEKSFAQFHDICF 335
Query: 470 RAKFPIRSIVD---VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI--SKKGNICLGIL 524
P D +++ F L F + P YL + + G CLG+
Sbjct: 336 GGA-PHAGHADQSKLEKVFPVFELQFAD-----GVRLRTGPLNYLFMHTGEMGAYCLGVF 389
Query: 525 DGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPG 567
D + S +LG IS R LV YD N+R+G+ + C G
Sbjct: 390 D----NGASGTLLGGISFRNILVQYDRRNRRVGFGAASCQEIG 428
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 126 bits (316), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 114/437 (26%), Positives = 182/437 (41%), Gaps = 37/437 (8%)
Query: 140 DAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYP- 198
DA KL R + + E + +R S + +L+ S V + FP+ G P
Sbjct: 28 DAVLKLERLIPPNHELGLTE-----LRAFDSARHGRLLQSPVGGVVN---FPVDGASDPF 79
Query: 199 -DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPRMGNI 252
GLY+T + +G PPR + + +DTGSD+ W+ C + C+ C K + + P + +
Sbjct: 80 LVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTS-CNGCPKTSELQIQLSFFDPGVSSS 138
Query: 253 LPYKDSLCMEIQRNHK-PGYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGSL-- 308
N + C C Y +Y D S + G D + T+ +L
Sbjct: 139 ASLVSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAI 198
Query: 309 -TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
+ VFGC+ Q G L DGI GL + +S+ SQLA QG+ V HCL +
Sbjct: 199 NSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDK 258
Query: 368 GGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG-WAL 426
GGG M LG P + P++ S + + + +N P++ G +
Sbjct: 259 SGGGIMVLGQIKRPD--TVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTI 316
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
DTG++ Y +AYS I ++ S P+ + + F
Sbjct: 317 IDTGTTLAYLPDEAYSPFIQAVANA--------VSQYGRPITYESYQCFEITAGDVDVFP 368
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
++L F +V + P YL I + + + + ILGD+ L+ ++
Sbjct: 369 QVSLSFAGGASMV-----LGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKV 423
Query: 547 VVYDNVNKRIGWAKSHC 563
VVYD V +RIGWA+ C
Sbjct: 424 VVYDLVRQRIGWAEYDC 440
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 125 bits (313), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 119/435 (27%), Positives = 191/435 (43%), Gaps = 48/435 (11%)
Query: 151 LDGESVVASVNDGIIRP------HKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFT 204
L+G + ++ ++ P H ++++ V+ + + L ++ G Y +
Sbjct: 43 LNGVRIQSNCGSALVLPLVESKRHGHVVDRRFERRGRGLVEDARMV-LHDDLLTKGYYTS 101
Query: 205 YMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN--PLYKPRMGNILPYKDSLCME 262
+ +G P + + L +DTGS +T++ PCSSC + + PR P S
Sbjct: 102 RVFIGTPAQEFALIVDTGSTVTYV----PCSSCTHCGHHQACFDPRFK---PDNSSSYQT 154
Query: 263 IQRNHKPGYCETC----QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPN-VVFGC 317
+ N + C QC YE YA+ SSS GVL +D L NGS +P+ ++FGC
Sbjct: 155 VSCNSPDCITKMCDARVHQCKYERVYAEMSSSKGVLGKDLLGFG--NGSRLQPHPLLFGC 212
Query: 318 AYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGH 377
+ G L L DGI+GL R +S+ QL G +++ C GGG M LG
Sbjct: 213 ETAETGDLY--LQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGGGSMVLGA 270
Query: 378 DLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR--NSQVGWALFDTGSSYTY 435
+ P M + D Y+ E+ +I LN+ + N ++G L D+G++Y Y
Sbjct: 271 -IPPPPAMVFAKS-DPNRSNYYNLELSEIQVQGVSLNVPSEVFNGRLGTVL-DSGTTYAY 327
Query: 436 FTKQAYSELIASLKEVSSDGLVLDASDPTLP-VCWRAKFPIRSIVDVKQFFKTLTLHFGS 494
+A+ ++ + + DP+ P VC+ K L HF
Sbjct: 328 LPDKAFDAFKDAITQQLGSLQAVPGPDPSYPDVCFAG---------AGSDSKALGKHFPP 378
Query: 495 KWQIVS--TKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILGDISLRGQLVVYD 550
+ S K ++PE YL K G CLG + +T +LG I +R LV YD
Sbjct: 379 VDFVFSGNQKVFLAPENYLFKHTKVPGAYCLGFFKNQD----ATTLLGGIVVRNTLVTYD 434
Query: 551 NVNKRIGWAKSHCMN 565
N +IG+ K++C N
Sbjct: 435 RANHQIGFFKTNCTN 449
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 125 bits (313), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 106/404 (26%), Positives = 178/404 (44%), Gaps = 41/404 (10%)
Query: 187 SSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYK 246
+S PL G + G ++ + +G P + + + +DTGS +T++ PCSSC G P ++
Sbjct: 63 NSTMPLHGAVKDYGYFYATLYLGTPAKKFAVIVDTGSTMTYV----PCSSCGSGCGPNHQ 118
Query: 247 PRMGNILPYKDSLCMEIQRNHKPGYCE------TCQQCDYEIEYADHSSSMGVLARDELH 300
+ P S I C + QQC Y YA+ SSS G+L D L
Sbjct: 119 DAAFD--PEASSTASRISCTSPKCSCGSPRCGCSTQQCTYTRSYAEQSSSSGILLEDVLA 176
Query: 301 LTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVG 360
L + L ++FGC + G + + DG+ GL + S+ +QL G+I +V
Sbjct: 177 L---HDGLPGAPIIFGCETRETGEIFRQ--RADGLFGLGNSDASVVNQLVKAGVIDDVFS 231
Query: 361 HCLTTNAGGGGYMFLGHDLVP-SWGMAWVPMLDS---PFMELYHTEILKINYGSSPLNLG 416
C G G + LG VP S + + P+L S PF Y+ ++L + L +
Sbjct: 232 LCFGMVEGDGA-LLLGDAEVPGSISLQYTPLLTSTTHPF--YYNVKMLSLAVEGQLLPVS 288
Query: 417 ARNSQVGWA-LFDTGSSYTYFTKQAYSELIASLKEVS-SDGLV-LDASDPTL-PVCWRAK 472
G+ + D+G+++TY + ++++ + S GL + DP +C+
Sbjct: 289 QSLFDQGYGTVLDSGTTFTYMPSPVFKAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQA 348
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVIS--KKGNICLGILDGSEVH 530
+ + F ++ + F +V + P YL + G CLG+ D
Sbjct: 349 PSHDDLEALSSVFPSMEVQFDQGTSLV-----LGPLNYLFVHTFNSGKYCLGVFD----- 398
Query: 531 NG-STIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
NG + +LG I+ R LV YD N+R+G+ + C G + P
Sbjct: 399 NGRAGTLLGGITFRNVLVRYDRANQRVGFGPALCKELGEMQRPP 442
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 125 bits (313), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 110/383 (28%), Positives = 183/383 (47%), Gaps = 41/383 (10%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
L ++ +G Y T + +G PP+ + L +DTGS +T++ C + C C + +P ++P + +
Sbjct: 71 LHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPC-STCEQCGRHQDPKFQPDLSS 129
Query: 252 IL-PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
P K +L + QC YE +YA+ S+S GVL D + ++ L
Sbjct: 130 TYQPVKCTLDCNCDNDR--------MQCVYERQYAEMSTSSGVLGEDVVSFGNQS-ELAP 180
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
VFGC + G L + DGI+GL R +S+ QL + ++ + C GG
Sbjct: 181 QRAVFGCENVETGDLYSQ--HADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGG 238
Query: 371 GYMFLGHDLVPS-WGMAWVPMLDSPFMELYHTEILKINYGSS--PLNLGARNSQVGWALF 427
G M LG PS A + SP+ Y+ ++ +I+ PLN + + G ++
Sbjct: 239 GAMVLGGISPPSDMVFAQSDPVRSPY---YNIDLKEIHVAGKRLPLNPSVFDGKHG-SVL 294
Query: 428 DTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSIVDVKQFF 485
D+G++Y Y ++A+ A +KE+ S + DP +C+ + +DV Q
Sbjct: 295 DSGTTYAYLPEEAFLAFKEAIVKELQSFSQI-SGPDPNYNDLCFSG-----AGIDVSQLS 348
Query: 486 KT---LTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILGDI 540
KT + + FG+ K+ +SPE Y+ K G CLGI + T +LG I
Sbjct: 349 KTFPVVDMIFGN-----GHKYSLSPENYMFRHSKVRGAYCLGIFQNGK---DPTTLLGGI 400
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+R LV+YD +IG+ K++C
Sbjct: 401 VVRNTLVLYDREQTKIGFWKTNC 423
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 125 bits (313), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 117/396 (29%), Positives = 174/396 (43%), Gaps = 45/396 (11%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
PL G+ PD GLY+ + +G PP+ YYL +DTGSD+ W+ C C C +N
Sbjct: 71 LPLGGSGRPDAVGLYYAKIGIGTPPKNYYLQVDTGSDIMWVNC-IQCKECPTRSNLGMDL 129
Query: 243 PLY---KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
LY + G +P C EI G C C Y Y D SS+ G +D +
Sbjct: 130 TLYDIKESSSGKFVPCDQEFCKEINGGLLTG-CTANISCPYLEIYGDGSSTAGYFVKDIV 188
Query: 300 HLTIENGSL----TKPNVVFGCAYDQQGLLLNTLVKT-DGILGLSRAKVSLPSQLASQGI 354
+G L ++VFGC Q G L ++ + GILG +A S+ SQLAS G
Sbjct: 189 LYDQVSGDLKTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGK 248
Query: 355 IKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPML-DSPFMELYHTEILKINYGSSPL 413
+K + HCL GGG +GH + P M P+L D P Y + + G + L
Sbjct: 249 VKKMFAHCL-NGVNGGGIFAIGHVVQPKVNMT--PLLPDQPH---YSVNMTAVQVGHAFL 302
Query: 414 NLGARNSQVG---WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWR 470
+L S G + D+G++ Y + Y L+ + D V D C++
Sbjct: 303 SLSTDTSTQGDRKGTIIDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRTLHDEY--TCFQ 360
Query: 471 AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI-CLGILD-GSE 528
V F +T +F + + P YL S G+ C+G + G++
Sbjct: 361 YS------ESVDDGFPAVTFYFEN-----GLSLKVYPHDYLFPS--GDFWCIGWQNSGTQ 407
Query: 529 VHNGSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + +LGD+ L +LV YD N+ IGW + +C
Sbjct: 408 SRDSKNMTLLGDLVLSNKLVFYDLENQVIGWTEYNC 443
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 115/397 (28%), Positives = 177/397 (44%), Gaps = 45/397 (11%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
PL G PD GLY+ + +G P + YYL +DTG+D+ W+ C C C +N
Sbjct: 59 LPLGGTGRPDSVGLYYAKIGIGTPSKDYYLQVDTGTDMMWVNC-IQCKECPTRSNLGMDL 117
Query: 243 PLYKPR---MGNILPYKDSLCMEIQRNHKPG-YCETCQQCDYEIEYADHSSSMGVLARDE 298
LY + G ++P LC EI G +T C Y Y D SS+ G +D
Sbjct: 118 TLYNIKESSSGKLVPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDV 177
Query: 299 LHLTIENGSLTKP----NVVFGCAYDQQG-LLLNTLVKTDGILGLSRAKVSLPSQLASQG 353
+ +G L +V+FGC Q G L + DGILG +A S+ SQL+S G
Sbjct: 178 VLFDQVSGDLKTASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSG 237
Query: 354 IIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPML-DSPFMELYHTEILKINYGSSP 412
+K + HCL GGG +GH + P+ + P+L D P Y + I G +
Sbjct: 238 KVKKMFAHCL-NGVNGGGIFAIGHVVQPT--VNTTPLLPDQPH---YSVNMTAIQVGHTF 291
Query: 413 LNLGARNSQ---VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCW 469
LNL S+ + D+G++ Y Y L+ + + V D C+
Sbjct: 292 LNLSTDASEQRDSKGTIIDSGTTLAYLPDGIYQPLVYKILSQQPNLKVQTLHDEY--TCF 349
Query: 470 RAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI-CLGILD-GS 527
+ V F +T +F + + + P YL +S+ N+ C+G + G+
Sbjct: 350 QYSGS------VDDGFPNVTFYFENGLSL-----KVYPHDYLFLSE--NLWCIGWQNSGA 396
Query: 528 EVHNGSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + + +LGD+ L +LV YD N+ IGW + +C
Sbjct: 397 QSRDSKNMTLLGDLVLSNKLVFYDLENQVIGWTEYNC 433
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 107/387 (27%), Positives = 175/387 (45%), Gaps = 43/387 (11%)
Query: 197 YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYK--------PR 248
Y GLYFT + +G+PPR + + +DTGSD+ W+ C++ C++C + + +
Sbjct: 61 YLVGLYFTKVKLGSPPREFNVQIDTGSDVLWVCCNS-CNNCPRTSGLGIQLNFFDSSSSS 119
Query: 249 MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSL 308
++ D +C + QC Y +Y D S + G D L+ G
Sbjct: 120 TAGLVHCSDPICTSAVQTTVTQCSPQTNQCSYTFQYEDGSGTSGYYVSDTLYFDAILGES 179
Query: 309 TKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
N +VFGC+ Q G L T DGI G + ++S+ SQL++ GI V HCL
Sbjct: 180 LVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCLK 239
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN---LGARNSQ 421
GGG + LG L P GM + P++ S + + + +N P++ NSQ
Sbjct: 240 GEGIGGGILVLGEILEP--GMVYSPLVPSQPHYNLNLQSIAVNGKLLPIDPSVFATSNSQ 297
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
+ D+G++ Y +AY ++++ + S + P+ + V
Sbjct: 298 --GTIVDSGTTLAYLVAEAYDPFVSAVNVIVSPSVT--------PIISKGNQCYLVSTSV 347
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLV---ISKKGNI--CLGILDGSEVHNGSTII 536
Q F + +F +V + PE YL+ S+ G++ C+G + G T I
Sbjct: 348 SQMFPLASFNFAGGASMV-----LKPEDYLIPFGPSQGGSVMWCIGF----QKVQGVT-I 397
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
LGD+ L+ ++ VYD V +RIGWA C
Sbjct: 398 LGDLVLKDKIFVYDLVRQRIGWANYDC 424
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 174/386 (45%), Gaps = 47/386 (12%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
L ++ +G Y T + +G PP+ + L +DTGS +T++ C C C + +P ++P
Sbjct: 102 LHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCST-CEQCGRHQDPKFQP---- 156
Query: 252 ILPYKDSLCMEIQRNHKPGYCE---TCQ----QCDYEIEYADHSSSMGVLARDELHLTIE 304
E ++P C C QC YE +YA+ S+S GVL D +
Sbjct: 157 ----------ESSSTYQPVKCTIDCNCDGDRMQCVYERQYAEMSTSSGVLGEDVISFG-N 205
Query: 305 NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
L VFGC + G L + DGI+GL R +S+ QL + +I + C
Sbjct: 206 QSELAPQRAVFGCENVETGDLYSQ--HADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYG 263
Query: 365 TNAGGGGYMFLGHDLVPS-WGMAWVPMLDSPFMELYHTEILKINYGSS--PLNLGARNSQ 421
GGG M LG PS A+ SP+ Y+ ++ +++ PLN + +
Sbjct: 264 GMDVGGGAMVLGGISPPSDMTFAYSDPDRSPY---YNIDLKEMHVAGKRLPLNANVFDGK 320
Query: 422 VGWALFDTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSIV 479
G L D+G++Y Y + A+ A +KE+ S + DP +C+ +
Sbjct: 321 HGTVL-DSGTTYAYLPEAAFLAFKDAIVKELQSLKQI-SGPDPNYNDICFSGAG--NDVS 376
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIIL 537
+ + F + + FG+ K+ +SPE Y+ K G CLGI N T +L
Sbjct: 377 QLSKSFPVVDMVFGN-----GHKYSLSPENYMFRHSKVRGAYCLGIFQNG---NDQTTLL 428
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHC 563
G I +R LV+YD +IG+ K++C
Sbjct: 429 GGIIVRNTLVMYDREQTKIGFWKTNC 454
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 112/427 (26%), Positives = 178/427 (41%), Gaps = 74/427 (17%)
Query: 197 YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN--------PLYKPR 248
Y GLYFT + +G+P + +Y+ +DTGSD+ W+ C+ C++C K +
Sbjct: 66 YLVGLYFTKVKMGSPAKEFYVQIDTGSDILWLNCNT-CNNCPKSSGLGIDLNYFDTASSS 124
Query: 249 MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENG-- 306
++ D +C + QC Y +Y D S + G D ++ + G
Sbjct: 125 TAALVSCSDPVCSYAVQTATSQCSSQANQCSYTFQYGDGSGTSGYYVYDAMYFDVIMGQS 184
Query: 307 --SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
S + VVFGC+ Q G L T DGI G +S+ SQ++SQG+ V HCL
Sbjct: 185 VFSNSSSTVVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHCLK 244
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPL----NLGARNS 420
GGG + LG L P+ + + P++ P Y+ + I L ++ A +
Sbjct: 245 GQGSGGGILVLGEILEPN--IVYTPLV--PLQPHYNLNLQSIAVNGQILPIDQDVFATGN 300
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIAS---------LKEVSSDGLVLDASDPTL------ 465
G + D+G++ Y ++AY + + E +++ D ++
Sbjct: 301 NRG-TIVDSGTTLAYLVQEAYDPFLNAGSPCHFFTHFNEPTNNIKYEDGNNNHQSRVKRH 359
Query: 466 ---PVCWRAKFPIRSIV--DVKQFFKTLT----------LHFGSKWQIVSTKFH------ 504
V R +I+ V QF K + G + +VS F
Sbjct: 360 YYDEVTLRLVLKHSAIITTTVSQFSKPIISKGNQCYLVPTSLGDIFPLVSLNFMGGASMV 419
Query: 505 ISPEGYLVISKKGNICLGILDGS--------EVHNGSTIILGDISLRGQLVVYDNVNKRI 556
+ PE YL I G LDG+ +V G T ILGD+ L+ ++ VYD N+RI
Sbjct: 420 LKPEQYL-------IHYGFLDGAAMWCIGFQKVQKGYT-ILGDLVLKDKIFVYDLANQRI 471
Query: 557 GWAKSHC 563
GW C
Sbjct: 472 GWTDYDC 478
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 122 bits (305), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 119/402 (29%), Positives = 179/402 (44%), Gaps = 42/402 (10%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
L GN P GLY+T + +G P YY+ +DTGSD W+ C C++C K +
Sbjct: 63 LALGGNGRPTSTGLYYTKIGLG--PNDYYVQVDTGSDTLWVNC-VGCTTCPKKSGLGMEL 119
Query: 243 PLYKP---RMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
LY P + ++P D C G C+ C Y I Y D S++ G +D+L
Sbjct: 120 TLYDPNSSKTSKVVPCDDEFCTSTYDGPISG-CKKDMSCPYSITYGDGSTTSGSYIKDDL 178
Query: 300 HLTIENGSL-TKPN---VVFGCAYDQQGLLLNTL-VKTDGILGLSRAKVSLPSQLASQGI 354
G L T P+ V+FGC Q G L +T DGI+G +A S+ SQLA+ G
Sbjct: 179 TFDRVVGDLRTVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGK 238
Query: 355 IKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN 414
+K V HCL T GGG +G + P + P++ P M Y+ + I P+
Sbjct: 239 VKRVFSHCLDT-VNGGGIFAIGEVVQPK--VKTTPLV--PRMAHYNVVLKDIEVAGDPIQ 293
Query: 415 LGAR--NSQVGWA-LFDTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTLPVCWR 470
L +S G + D+G++ Y Y +L+ +L + S L L T C+
Sbjct: 294 LPTDIFDSTSGRGTIIDSGTTLAYLPVSIYDQLLEKTLAQRSGMELYLVEDQFT---CFH 350
Query: 471 AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS-EV 529
+S+ D F T+ F + + P YL K+ C+G + +
Sbjct: 351 YSDE-KSLDDA---FPTVKFTFEEGLTLTA-----YPHDYLFPFKEDMWCIGWQKSTAQT 401
Query: 530 HNGSTIIL-GDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFK 570
+G +IL GD+ L +L +YD N IGW +C + + K
Sbjct: 402 KDGKDLILLGDLVLTNKLFIYDLDNMSIGWTDYNCSSSIKLK 443
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 122 bits (305), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 100/371 (26%), Positives = 169/371 (45%), Gaps = 42/371 (11%)
Query: 167 PHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLT 226
P+ S++ L ++ L ++ +G Y T + +G PP+ + L +D+GS +T
Sbjct: 54 PNASRLAASLRRGLGDGAHPNARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVT 113
Query: 227 WIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCE---TC----QQCD 279
++ C A C C +P ++P ++ ++ P C TC +QC
Sbjct: 114 YVPC-ASCEQCGNHQDPRFQP--------------DLSSSYSPVKCNVDCTCDSDKKQCT 158
Query: 280 YEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLS 339
YE +YA+ SSS GVL D + E+ L VFGC + G L + DGI+GL
Sbjct: 159 YERQYAEMSSSSGVLGEDIVSFGRES-ELKAQRAVFGCENSETGDLFSQ--HADGIMGLG 215
Query: 340 RAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPS-WGMAWVPMLDSPFMEL 398
R ++S+ QL +G+I + C GGG M LG PS + L SP+
Sbjct: 216 RGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGVPTPSDMVFSRSDPLRSPY--- 272
Query: 399 YHTEILKINYGSSPLNLGAR--NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGL 456
Y+ E+ +I+ L + +R +S+ G L D+G++Y Y +QA+ ++
Sbjct: 273 YNIELKEIHVAGKALRVDSRIFDSKHGTVL-DSGTTYAYLPEQAFMAFKDAVTSKVHSLK 331
Query: 457 VLDASDPTLP-VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISK 515
+ DP+ +C+ R++ + + F + + FG+ K ++PE YL
Sbjct: 332 KIRGPDPSYKDICFAGA--RRNVSKLHEVFPDVDMVFGN-----GQKLSLTPENYLFRHS 384
Query: 516 K--GNICLGIL 524
K G CLG+
Sbjct: 385 KVDGAYCLGVF 395
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 104/390 (26%), Positives = 177/390 (45%), Gaps = 55/390 (14%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
L ++ +G Y T + +G PP+ + L +DTGS +T++ C + C C + +P ++P
Sbjct: 79 LYDDLLINGYYTTRLWIGTPPQRFALIVDTGSTVTYVPC-STCEHCGRHQDPKFQP---- 133
Query: 252 ILPYKDSLCMEIQRNHKPGYCE-------TCQQCDYEIEYADHSSSMGVLARDELHLTIE 304
++ ++P C QC Y+ +YA+ SSS GVL D +
Sbjct: 134 ----------DLSETYQPVKCTPDCNCDGDTNQCMYDRQYAEMSSSSGVLGED----VVS 179
Query: 305 NGSLTK---PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
G+L++ VFGC D+ G L + + DGI+GL R +S+ QL + +I +
Sbjct: 180 FGNLSELAPQRAVFGCENDETGDLYSQ--RADGIMGLGRGDLSIMDQLVDKKVISDSFSL 237
Query: 362 CLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR--N 419
C GGG M LG + P M + D Y+ + +++ L L + +
Sbjct: 238 CYGGMDVGGGAMILG-GISPPEDMVFTHS-DPDRSPYYNINLKEMHVAGKKLQLNPKVFD 295
Query: 420 SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP-VCWRAKFPIRSI 478
+ G L D+G++Y Y + A+ ++ + + ++ DP +C+ +
Sbjct: 296 GKHGTVL-DSGTTYAYLPETAFLAFKRAIMKERNSLKQINGPDPNYKDICFTG-----AG 349
Query: 479 VDVKQF---FKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGS 533
+DV Q F + + F + K +SPE YL K G CLG+
Sbjct: 350 IDVSQLAKSFPVVDMVFEN-----GHKLSLSPENYLFRHSKVRGAYCLGVFSNGR---DP 401
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
T +LG I +R LV+YD N +IG+ K++C
Sbjct: 402 TTLLGGIFVRNTLVMYDRENSKIGFWKTNC 431
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 108/392 (27%), Positives = 175/392 (44%), Gaps = 36/392 (9%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTW---IQCDAPCSSCAKGAN-P 243
PL G P GLY+T + +G+PP+ YY+ +DTGSD+ W I CD + G
Sbjct: 71 LPLGGVGLPTATGLYYTRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELT 130
Query: 244 LYKPR-MGNILPYKDSLCM-EIQRNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARD--E 298
Y P G + + C+ + P C + C + I Y D SS+ G D +
Sbjct: 131 QYDPAGSGTTVGCEQEFCVANSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQ 190
Query: 299 LHLTIENGSLTKPNV--VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK 356
+ NG T NV FGC G L ++ DGILG ++ S+ SQLA+ ++
Sbjct: 191 YNQVSGNGQTTPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVR 250
Query: 357 NVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG 416
+ HCL T GGG +F ++V + P++ P Y+ + I+ G + L L
Sbjct: 251 KIFAHCLDTVRGGG--IFAIGNVVQPPIVKTTPLV--PNATHYNVNLQGISVGGATLQLP 306
Query: 417 ARNSQVG---WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKF 473
G + D+G++ Y ++ Y L+ ++ + D V + D +C++
Sbjct: 307 TSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLTAVFDKHPDLAVRNYED---FICFQFSG 363
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS-EVHNG 532
+ + F +T F + ++ P YL + C+G LDG + +G
Sbjct: 364 SL------DEEFPVITFSFEGDLTL-----NVYPHDYLFQNGNDLYCMGFLDGGVQTKDG 412
Query: 533 -STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
++LGD+ L +LVVYD + IGW +C
Sbjct: 413 KDMVLLGDLVLSNKLVVYDLEKQVIGWTDYNC 444
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 109/407 (26%), Positives = 169/407 (41%), Gaps = 41/407 (10%)
Query: 164 IIRPHKSKINKKLVSSNAVAVDSSSIFPLR-GNIYPDGLYFTYMIVGNPPRPYYLDMDTG 222
++ H+ K + S A SSS PL G G Y T + +G P Y + +DTG
Sbjct: 96 LLHGHRKKKAGGVGGSQA----SSSSVPLTPGASVAVGNYVTRLGLGTPATSYVMVVDTG 151
Query: 223 SDLTWIQCDAPCS-SCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNH-KPGYCETCQQ 277
S LTW+QC +PCS SC + A P++ PR + S C E+Q P C
Sbjct: 152 SSLTWLQC-SPCSVSCHRQAGPVFDPRASGTYAAVQCSSSECGELQAATLNPSACSVSNV 210
Query: 278 CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILG 337
C Y+ Y D S S+G L++D T+ GS + P +GC D +GL ++ G++G
Sbjct: 211 CIYQASYGDSSYSVGYLSKD----TVSFGSGSFPGFYYGCGQDNEGL----FGRSAGLIG 262
Query: 338 LSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPF-M 396
L++ K+SL QLA + +CL T++ GY+ +G + ++ PM S
Sbjct: 263 LAKNKLSLLYQLAPS--LGYAFSYCLPTSSAAAGYLSIGSYNPGQY--SYTPMASSSLDA 318
Query: 397 ELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGL 456
LY + I+ +PL + + + D+G+ T Y+ L ++ +
Sbjct: 319 SLYFVTLSGISVAGAPLAVPPSEYRSLPTIIDSGTVITRLPPNVYTALSRAVAAAMASAA 378
Query: 457 VLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK 516
+ L C+R + V F +SP L+
Sbjct: 379 PRAPTYSILDTCFRGSAAGLRVPRVDMAFAG------------GATLALSPGNVLIDVDD 426
Query: 517 GNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
CL G T I+G+ + VVYD RIG+A C
Sbjct: 427 STTCLAF-----APTGGTAIIGNTQQQTFSVVYDVAQSRIGFAAGGC 468
>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 516
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 170/381 (44%), Gaps = 48/381 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPY----- 255
L+F + VG PP + + +DTGSDL W+ CD C SC G + R G IL +
Sbjct: 104 LHFANVSVGTPPLWFLVALDTGSDLFWLPCD--CISCVHGG---LRTRTGKILKFNTYDL 158
Query: 256 -KDSLCMEIQRNHKPGYCETCQQ-------CDYEIEY-ADHSSSMGVLARDELHLTIENG 306
K S E+ N+ +C QQ C Y+++Y ++ +SS G + D LHL ++
Sbjct: 159 DKSSTSNEVSCNNST-FCRQRQQCPSAGSTCRYQVDYLSNDTSSRGFVVEDVLHLITDDD 217
Query: 307 SLTKPN--VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
+ + FGC Q G+ LN +G+ GL +S+PS LA +G+I N C
Sbjct: 218 QTKDADTRIAFGCGQVQTGVFLNG-AAPNGLFGLGMDNISVPSILAREGLISNSFSMCFG 276
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGW 424
+++ G + G P P Y+ I KI S +L
Sbjct: 277 SDS--AGRITFGDTGSPD--QRKTPFNVRKLHPTYNITITKIIVEDSVADLEFH------ 326
Query: 425 ALFDTGSSYTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
A+FD+G+S+TY AY+ + +V + + D +P + I ++V
Sbjct: 327 AIFDSGTSFTYINDPAYTRIGEMYNSKVKAKRHSSQSPDSNIPFDYCYDISISQTIEVP- 385
Query: 484 FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN-ICLGILDGSEVHNGSTIILGDISL 542
F LT+ G + ++ +S E ++G+ +CLGI V+ I+G +
Sbjct: 386 -FLNLTMKGGDDYYVMDPIIQVSSE------EEGDLLCLGIQKSDSVN-----IIGQNFM 433
Query: 543 RGQLVVYDNVNKRIGWAKSHC 563
G +V+D N +GW +++C
Sbjct: 434 TGYKIVFDRDNMNLGWKETNC 454
>gi|224097210|ref|XP_002334633.1| predicted protein [Populus trichocarpa]
gi|222873871|gb|EEF11002.1| predicted protein [Populus trichocarpa]
Length = 143
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 58/133 (43%), Positives = 86/133 (64%), Gaps = 2/133 (1%)
Query: 432 SYTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTL 490
SYTY QAY LI+ +K E+S+ L D TLP+CW+ + P +S+ DVK++FKT L
Sbjct: 1 SYTYLNSQAYQGLISLIKRELSTKPLREALDDQTLPICWKGRKPFKSVHDVKKYFKTFAL 60
Query: 491 HFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYD 550
F + + T+ PE YL++S KGN CLG+L+G+EV ++GDIS++ ++V+YD
Sbjct: 61 SFANDGK-SKTQLEFPPEAYLIVSSKGNACLGVLNGTEVGLNDLNVIGDISMQDRVVIYD 119
Query: 551 NVNKRIGWAKSHC 563
N + IGWA +C
Sbjct: 120 NEKQLIGWAPGNC 132
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 118/408 (28%), Positives = 180/408 (44%), Gaps = 75/408 (18%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN--------- 242
L G+ D Y+ + VG+P + +DTGSD+ W +C C C+ N
Sbjct: 78 LNGSSTSDATYYAQIGVGHPVQFLNAIVDTGSDILWFKCKL-CQGCSSKKNVIVCSSIIM 136
Query: 243 ----PLYKPRM---GNILPYKDSLCMEIQRNHKPGYCE-TCQQCDYEIEYADHSSSMGVL 294
LY P + + D LC E G C C Y+I Y D SSS G+
Sbjct: 137 QGPITLYDPELSITASPATCSDPLCSE------GGSCRGNNNSCAYDISYEDTSSSTGIY 190
Query: 295 ARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI 354
RD +HL SL + GCA GL DGI+G R+KVS+P+QLA+Q
Sbjct: 191 FRDVVHLG-HKASL-NTTMFLGCATSISGLW-----PVDGIMGFGRSKVSVPNQLAAQAG 243
Query: 355 IKNVVGHCLTTNAGGGGYMFLG-HDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPL 413
N+ HCL+ GGG + LG +D P M + PML + +Y+ +++ ++ S L
Sbjct: 244 SYNIFYHCLSGEKEGGGILVLGKNDEFPE--MVYTPMLANDI--VYNVKLVSLSVNSKAL 299
Query: 414 NLGAR----NSQV--GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPV 467
+ A N+ V G + D+G+S F +A + + ++ + ++ + PT P+
Sbjct: 300 PIEASEFEYNATVGNGGTIIDSGTSSATFPSKALALFVKAVSKFTT-------AIPTAPL 352
Query: 468 CWRAKFPIRSIVD---VKQFFKTLTLHFGSKWQIVSTKFHISPEGYL--VISKKGN---- 518
SI D V+ F +TL F ++ YL V+S+K +
Sbjct: 353 ESSGSPCFISISDRNSVEVDFPNVTLKFDG-----GATMELTAHNYLEAVVSRKLSESTH 407
Query: 519 ------ICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAK 560
+C+ G++ ILGD L+ ++VVYD RIGW K
Sbjct: 408 FQGVRLVCI------SWSVGNSTILGDAILKDKVVVYDMEKSRIGWVK 449
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 105/367 (28%), Positives = 170/367 (46%), Gaps = 41/367 (11%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNH 267
+G PP+ + L +DTGS +T++ C++ C C +P ++P + + Y C +
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNS-CDQCGNHQDPKFQPDLSDT--YHPVKC------N 52
Query: 268 KPGYCET-CQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPN-VVFGCAYDQQGLL 325
C+T QC YE +YA+ SSS G+L D ++ N S KP VFGC + G L
Sbjct: 53 PDCTCDTENDQCTYERQYAEMSSSSGILGED--LVSFGNMSELKPQRAVFGCENAETGDL 110
Query: 326 LNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGM 385
+ DGI+GL R +S+ QL +G+I + C GGG M LG + P M
Sbjct: 111 FSQ--HADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQ-ISPPSDM 167
Query: 386 AWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV----GWALFDTGSSYTYFTKQAY 441
+ D Y+ E+ ++ L++ N QV + D+G++Y Y + A+
Sbjct: 168 VF-SHSDPDRSPYYNIELRGLHVAGKKLDI---NPQVFDGKHGTILDSGTTYAYLPEAAF 223
Query: 442 SELIASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVS 500
I ++ + DP VC+ I ++ + F ++ + F +
Sbjct: 224 LPFIQAITSELHGLKQIRGPDPNYNDVCFSGAG--SEIPELYKTFPSVDMVFDN-----G 276
Query: 501 TKFHISPEGYLVISKK--GNICLGILDGSEVHNGS--TIILGDISLRGQLVVYDNVNKRI 556
K+ +SPE YL K G CLG+ NG T +LG I +R LV YD + ++
Sbjct: 277 EKYSLSPENYLFKHSKVHGAYCLGVF-----QNGKDPTTLLGGIVVRNTLVTYDREHSKV 331
Query: 557 GWAKSHC 563
G+ K++C
Sbjct: 332 GFWKTNC 338
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 105/367 (28%), Positives = 170/367 (46%), Gaps = 41/367 (11%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNH 267
+G PP+ + L +DTGS +T++ C++ C C +P ++P + + Y C +
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNS-CDQCGNHQDPKFQPDLSDT--YHPVKC------N 52
Query: 268 KPGYCET-CQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPN-VVFGCAYDQQGLL 325
C+T QC YE +YA+ SSS G+L D ++ N S KP VFGC + G L
Sbjct: 53 PDCTCDTENDQCTYERQYAEMSSSSGILGED--LVSFGNMSELKPQRAVFGCENAETGDL 110
Query: 326 LNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGM 385
+ DGI+GL R +S+ QL +G+I + C GGG M LG + P M
Sbjct: 111 FSQ--HADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQ-ISPPSDM 167
Query: 386 AWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV----GWALFDTGSSYTYFTKQAY 441
+ D Y+ E+ ++ L++ N QV + D+G++Y Y + A+
Sbjct: 168 VFS-HSDPDRSPYYNIELRGLHVAGKKLDI---NPQVFDGKHGTILDSGTTYAYLPEAAF 223
Query: 442 SELIASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVS 500
I ++ + DP VC+ I ++ + F ++ + F +
Sbjct: 224 LPFIQAITSELHGLKQIRGPDPNYNDVCFSGAG--SEIPELYKTFPSVDMVFDN-----G 276
Query: 501 TKFHISPEGYLVISKK--GNICLGILDGSEVHNGS--TIILGDISLRGQLVVYDNVNKRI 556
K+ +SPE YL K G CLG+ NG T +LG I +R LV YD + ++
Sbjct: 277 EKYSLSPENYLFKHSKVHGAYCLGVF-----QNGKDPTTLLGGIVVRNTLVTYDREHSKV 331
Query: 557 GWAKSHC 563
G+ K++C
Sbjct: 332 GFWKTNC 338
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 103/382 (26%), Positives = 175/382 (45%), Gaps = 53/382 (13%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
+G Y T + +G PP+ + L +DTGS +T++ C C C +P ++P
Sbjct: 90 NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCST-CKHCGSHQDPKFRP----------- 137
Query: 259 LCMEIQRNHKPGYC-------ETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP 311
E ++P C + +QC YE YA+ S+S GVL D + L+
Sbjct: 138 ---EASETYQPVKCTWQCNCDDDRKQCTYERRYAEMSTSSGVLGEDVVSFG-NQSELSPQ 193
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG 371
+FGC D+ G + N + DGI+GL R +S+ QL + +I + C GGG
Sbjct: 194 RAIFGCENDETGDIYNQ--RADGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGG 251
Query: 372 YMFLGHDLVPSWGMAWVPM--LDSPFMELYHTEILKINYGSSPLNLGAR--NSQVGWALF 427
M LG + P M + + SP+ Y+ ++ +I+ L+L + + + G +
Sbjct: 252 AMVLG-GISPPADMVFTHSDPVRSPY---YNIDLKEIHVAGKRLHLNPKVFDGKHG-TVL 306
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSIVDVKQF-- 484
D+G++Y Y + A+ ++ + + + DP +C+ + ++V Q
Sbjct: 307 DSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPHYNDICFSG-----AEINVSQLSK 361
Query: 485 -FKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILGDIS 541
F + + FG+ K +SPE YL K G CLG+ N T +LG I
Sbjct: 362 SFPVVEMVFGN-----GHKLSLSPENYLFRHSKVRGAYCLGVFSNG---NDPTTLLGGIV 413
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+R LV+YD + +IG+ K++C
Sbjct: 414 VRNTLVMYDREHSKIGFWKTNC 435
>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 527
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 111/392 (28%), Positives = 171/392 (43%), Gaps = 52/392 (13%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG-NILPYKDSL 259
L+F + VG P Y + +DTGSDL W+ C+ C+ C G ++ NI K+S
Sbjct: 112 LHFANVSVGTPASSYLVALDTGSDLFWLPCN--CTKCVHGIQLSTGQKIAFNIYDNKES- 168
Query: 260 CMEIQRN--HKPGYCETCQQCD--------YEIEY-ADHSSSMGVLARDELHLTIENGSL 308
+N CE QC Y++EY ++++S+ G L D LHL +N
Sbjct: 169 --STSKNVACNSSLCEQKTQCSSSSGGTCPYQVEYLSENTSTTGFLVEDVLHLITDNDDQ 226
Query: 309 TK---PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
T+ P + FGC Q G L+ +G+ GL + VS+PS LA QG+ N C
Sbjct: 227 TQHANPLITFGCGQVQTGAFLDG-AAPNGLFGLGMSDVSVPSILAKQGLTSNSFSMCFA- 284
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
A G G + G D S P P Y+ + +I G + +L A
Sbjct: 285 -ADGLGRITFG-DNNSSLDQGKTPFNIRPSHSTYNITVTQIIVGGNSADLEFN------A 336
Query: 426 LFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+FDTG+S+TY AY ++ S ++ SD LP + ++V
Sbjct: 337 IFDTGTSFTYLNNPAYKQITQSFDSKIKLQRHSFSNSDD-LPFEYCYDLRTNQTIEVPNI 395
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN---ICLGILDGSEVHNGSTIILGDIS 541
LT+ G + F + P ++ S GN +CL +L + V+ I+G
Sbjct: 396 --NLTMKGGDNY------FVMDP---IITSGGGNNGVLCLAVLKSNNVN-----IIGQNF 439
Query: 542 LRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
+ G +V+D N +GW +S+C + SLP
Sbjct: 440 MTGYRIVFDRENMTLGWKESNCYDD-ELSSLP 470
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 107/391 (27%), Positives = 176/391 (45%), Gaps = 48/391 (12%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
G + G YF + VG P R YL +DTGSD+TW+QC APC++C K + L+ P +
Sbjct: 6 FSGLAFGTGEYFAVVGVGTPRRDMYLVVDTGSDITWLQC-APCTNCYKQKDALFNPSSSS 64
Query: 252 ---ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL--TIENG 306
+L SLC+ + C + +C Y+ +Y D S +MG L D + L G
Sbjct: 65 SFKVLDCSSSLCLNLDVMG----CLS-NKCLYQADYGDGSFTMGELVTDNVVLDDAFGPG 119
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL--- 363
+ N+ GC +D +G T GILGL R +S P+ L + +N+ +CL
Sbjct: 120 QVVLTNIPLGCGHDNEG----TFGTAAGILGLGRGPLSFPNNLDAS--TRNIFSYCLPDR 173
Query: 364 TTNAGGGGYMFLGHDLVP---SWGMAWVPMLDSP-FMELYHTEILKINYGSSPL-NLGAR 418
++ + G +P + + ++P L +P Y+ +I I+ G + L N+ A
Sbjct: 174 ESDPNHKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPAS 233
Query: 419 NSQV-----GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKF 473
Q+ G +FD+G++ T +AY+ + + + + L A C+ F
Sbjct: 234 VFQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAFRAATMH-LTSAADFKIFDTCY--DF 290
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI-CLGILDGSEVHNG 532
+ + V T+T HF + + P Y+V NI C G
Sbjct: 291 TGMNSISV----PTVTFHFQGDVDM-----RLPPSNYIVPVSNNNIFCFAF----AASMG 337
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
++I G++ + V+YDNV+K+IG C
Sbjct: 338 PSVI-GNVQQQSFRVIYDNVHKQIGLLPDQC 367
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 106/386 (27%), Positives = 174/386 (45%), Gaps = 40/386 (10%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
+ G + G YF + VG PP P L +DTGSD+ W+QC PC C + +PLY PR +
Sbjct: 89 ISGLPFASGEYFASVGVGTPPTPALLVIDTGSDVVWLQCK-PCVHCYRQLSPLYDPRGSS 147
Query: 252 ILPYKDSLCMEIQ-RNHKPGYCE-TCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLT 309
Y + C Q RN P C+ T C Y I Y D SS+ G LA D L + + +
Sbjct: 148 T--YAQTPCSPPQCRN--PQTCDGTTGGCGYRIVYGDASSTSGNLATDRL---VFSNDTS 200
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TTN 366
NV GC +D +GL + G+LG++R S +Q+A +CL T +
Sbjct: 201 VGNVTLGCGHDNEGLFGSAA----GLLGVARGNNSFATQVADS--YGRYFAYCLGDRTRS 254
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQV--- 422
Y+ G + P+ +P LY+ +++ + G P+ G N+ +
Sbjct: 255 GSSSSYLVFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVT-GFSNASLSLD 313
Query: 423 -----GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRS 477
G + D+G+S T F + AY L + ++ + + + V + A + +R
Sbjct: 314 PATGRGGVVVDSGTSITRFARDAYGALRDAFDARAAK-VGMRKVGRGISV-FDACYDLRG 371
Query: 478 IVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIIL 537
+ + LHF + + PE YLV + G L+ + H+G ++I
Sbjct: 372 VAVADA--PGVVLHFAGGADVA-----LPPENYLVPEESGRYHCFALEAAG-HDGLSVI- 422
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHC 563
G++ + VV+D N+R+G+ + C
Sbjct: 423 GNVLQQRFRVVFDVENERVGFEPNGC 448
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 119 bits (298), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 133/283 (46%), Gaps = 25/283 (8%)
Query: 190 FPLRGNIYP--DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC--AKGAN--- 242
FP+ G+ P GLYFT + +G+PP+ Y++ +DTGSD+ W+ C +PC+ C + G N
Sbjct: 77 FPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQL 135
Query: 243 PLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARD 297
+ P + +P D C + + C+T C Y Y D S + G D
Sbjct: 136 EFFNPDTSSTSSKIPCSDDRCTAALQTSE-AVCQTSDNSPCGYTFTYGDGSGTSGYYVSD 194
Query: 298 ELHLTIENGSLTKPN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQG 353
++ G+ N +VFGC+ Q G L T DGI G + ++S+ SQL S G
Sbjct: 195 TMYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLG 254
Query: 354 IIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPL 413
+ V HCL + GGG + LG + P G+ + P++ S + E + +N P+
Sbjct: 255 VSPKVFSHCLKGSDNGGGILVLGEIVEP--GLVYTPLVPSQPHYNLNLESIVVNGQKLPI 312
Query: 414 N---LGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSS 453
+ N+Q + D+G++ Y AY + ++ S
Sbjct: 313 DSSLFTTSNTQ--GTIVDSGTTLAYLADGAYDPFVNAITAAVS 353
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 119 bits (298), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 107/379 (28%), Positives = 176/379 (46%), Gaps = 33/379 (8%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
L ++ +G Y T + +G PP+ + L +DTGS +T++ C + C C + +P ++P + +
Sbjct: 3 LHDDLLINGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSS-CEQCGRHQDPKFQPDLSS 61
Query: 252 ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP 311
Y+ C N + QQC YE +YA+ S+S GVL D + +L
Sbjct: 62 T--YQSVKC-----NIDCNCDDEKQQCVYERQYAEMSTSSGVLGEDIISFG-NLSALAPQ 113
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG 371
VFGC + G L + DGI+G+ R +S+ L +G+I + C GGG
Sbjct: 114 RAVFGCENMETGDLYSQ--HADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGG 171
Query: 372 YMFLGHDLVPSWGMAWVPM--LDSPFMELYHTEILKINYGSSPLNLGAR--NSQVGWALF 427
M LG + P M + + SP+ Y+ ++ +I+ PL L + + G +
Sbjct: 172 AMVLG-GISPPSNMVFSQSDPVRSPY---YNIDLKEIHVAGKPLPLNPTVFDGKHG-TIL 226
Query: 428 DTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
D+G++Y Y + A+ A +KE+ S + +C+ I + F
Sbjct: 227 DSGTTYAYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAG--SDISQLSSSFP 284
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILGDISLRG 544
+ + FG+ K +SPE YL K G CLGI + T +LG I +R
Sbjct: 285 AVEMVFGN-----GQKLLLSPENYLFRHSKVHGAYCLGIFQNGK---DPTTLLGGIVVRN 336
Query: 545 QLVVYDNVNKRIGWAKSHC 563
LV+YD N +IG+ K++C
Sbjct: 337 TLVLYDRENSKIGFWKTNC 355
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 118 bits (296), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 111/397 (27%), Positives = 185/397 (46%), Gaps = 51/397 (12%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
+ G G YF +G P + ++L +DTGSDL ++QC APC C + PLY+P +
Sbjct: 24 VSGTTLGSGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQC-APCDLCYEQDGPLYQPSNSS 82
Query: 252 I---LPYKDSLCM----EIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLT 302
+P + C+ + Y E+ Q C YE Y D+SS++GV A + T
Sbjct: 83 TFTPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYE----T 138
Query: 303 IENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC 362
G + +V FGC QG + V G+LGL + +S SQ +N +C
Sbjct: 139 ATVGGIRVNHVAFGCGNRNQG----SFVSAGGVLGLGQGALSFTSQAGYA--FENKFAYC 192
Query: 363 LTTNAGGGGY---MFLGHDLVPS-WGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGA 417
LT+ + G D++ + + + P++ +P +Y+ +I++I +G L +
Sbjct: 193 LTSYLSPTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPD 252
Query: 418 RNSQV-----GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
++ G +FD+G++ TY++ QAY+ +IA+ E S S LP+C
Sbjct: 253 SAWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAF-EKSVPYPRAPPSPQGLPLCVNV- 310
Query: 473 FPIRSIVDVKQFFKTLTLHF--GSKWQIVSTKFHISPEGYLVISKKGNI-CLGILDGSEV 529
S +D + + T+ F G+ ++ +G I NI CL +L+ S
Sbjct: 311 ----SGID-HPIYPSFTIEFDQGATYR--------PNQGNYFIEVSPNIDCLAMLESSS- 356
Query: 530 HNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNP 566
+G +I G+I + LV YD RIG+A ++C P
Sbjct: 357 -DGFNVI-GNIIQQNYLVQYDREEHRIGFAHANCDAP 391
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 118 bits (296), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 156/370 (42%), Gaps = 43/370 (11%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDS 258
Y + +G P R + DTGSDL+W+QC PC++C K +PL+ P + +P
Sbjct: 188 YIVSVGLGTPRRDLLVVFDTGSDLSWVQCK-PCNNCYKQHDPLFDPSQSTTYSAVPCGAQ 246
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
C++ G C + +C YE+ Y D S + G LARD L L + L VFGC
Sbjct: 247 ECLD------SGTCSS-GKCRYEVVYGDMSQTDGNLARDTLTLGPSSDQLQ--GFVFGCG 297
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
D GL + DG+ GL R +VSL SQ A++ +CL ++ GY+ LG
Sbjct: 298 DDDTGL----FGRADGLFGLGRDRVSLASQAAAR--YGAGFSYCLPSSWRAEGYLSLGSA 351
Query: 379 LVPSWGM--AWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYF 436
P A V D+P Y+ +++ I + + + + D+G+ T
Sbjct: 352 AAPPHAQFTAMVTRSDTP--SFYYLDLVGIKVAGRTVRVAPAVFKAPGTVIDSGTVITRL 409
Query: 437 TKQAYSEL---IASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFG 493
+AYS L A L D R K I S+ + TL L FG
Sbjct: 410 PSRAYSALRSSFAGFMRRYKRAPALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLGFG 469
Query: 494 SKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVN 553
G L ++ + CL S + S ILG++ + VVYD N
Sbjct: 470 ---------------GVLYVANRSQACLAF--ASNGDDTSVGILGNMQQKTFAVVYDLAN 512
Query: 554 KRIGWAKSHC 563
++IG+ C
Sbjct: 513 QKIGFGAKGC 522
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 118 bits (296), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 104/372 (27%), Positives = 156/372 (41%), Gaps = 42/372 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYK 256
G Y T + +G P Y + +DTGS LTW+QC SC + PLY PR + +P
Sbjct: 132 GNYVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPCS 191
Query: 257 DSLCMEIQRNH-KPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
S C E+Q P C C Y+ Y D S S+G L+RD T+ GS + PN +
Sbjct: 192 ASQCDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRD----TVSFGSGSYPNFYY 247
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC D +GL ++ G++GL+R K+SL QLA + +CL T A GY+ +
Sbjct: 248 GCGQDNEGL----FGRSAGLIGLARNKLSLLYQLAPS--LGYSFSYCLPTPA-STGYLSI 300
Query: 376 GHDLVPSWGMAWVPMLDSPF-MELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYT 434
G S ++ PM S LY + ++ G SPL + + D+G+ T
Sbjct: 301 GP--YTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEYSSLPTIIDSGTVIT 358
Query: 435 YFTKQAYSELIASLKEVSSDGLVLDASDPT---LPVCWRAKFPIRSIVDVKQFFKTLTLH 491
Y+ +L + + +V S P L C++ + + V F
Sbjct: 359 RLPTAVYT----ALSKAVAAAMVGVQSAPAFSILDTCFQGQASQLRVPAVAMAFAG---- 410
Query: 492 FGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
++ + L+ CL ST I+G+ + VVYD
Sbjct: 411 --------GATLKLATQNVLIDVDDSTTCLAF-----APTDSTTIIGNTQQQTFSVVYDV 457
Query: 552 VNKRIGWAKSHC 563
RIG+A C
Sbjct: 458 AQSRIGFAAGGC 469
>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 498
Score = 118 bits (295), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 106/380 (27%), Positives = 175/380 (46%), Gaps = 50/380 (13%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWI--QCDA--PCSSCAKGANPLYKPRMGNI---L 253
L++ + VG P + + + +DTGSDL W+ QCD P ++ A G+ Y P M + +
Sbjct: 108 LHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSATFYIPGMSSTSKAV 167
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADH-SSSMGVLARDELHLTIENG--SLTK 310
P + C ++Q+ C T QC Y++ Y +SS G L D L+L+ EN + K
Sbjct: 168 PCNSNFC-DLQKE-----CSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILK 221
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
++ GC Q G L+ +G+ GL +VS+PS LA +G+ N C + G
Sbjct: 222 AQIMLGCGQTQTGSFLDA-AAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRD--GI 278
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTG 430
G + G S P+ + Y I I G+ P ++ +FDTG
Sbjct: 279 GRISFGDQ--ESSDQEETPLDINRQHPTYAITISGITVGNKPTDMDF------ITIFDTG 330
Query: 431 SSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCW---RAKFPIRSIVDVKQFFK 486
+S+TY AY+ + S +V ++ D+ P C+ A+FPI I+ +
Sbjct: 331 TSFTYLADPAYTYITQSFHAQVQANRHAADSRIP-FEYCYDLSEARFPIPDII-----LR 384
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
T+T GS + ++ IS + + + CL I+ +++ I+G + G
Sbjct: 385 TVT---GSMFPVIDPGQVISIQEHEYV-----YCLAIVKSMKLN-----IIGQNFMTGLR 431
Query: 547 VVYDNVNKRIGWAKSHCMNP 566
VV+D K +GW K +C +P
Sbjct: 432 VVFDRERKILGWKKFNCFSP 451
>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
gi|219887047|gb|ACL53898.1| unknown [Zea mays]
gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 416
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 106/384 (27%), Positives = 175/384 (45%), Gaps = 52/384 (13%)
Query: 198 PDGLYFTYMIVGNPPRPYYLDMDTGSDLTWI--QCDA--PCSSCAKGANPLYKPRMGNI- 252
P L++ + VG P + + + +DTGSDL W+ QCD P ++ A G+ Y P M +
Sbjct: 3 PSSLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSATFYIPGMSSTS 62
Query: 253 --LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADH-SSSMGVLARDELHLTIENG--S 307
+P + C ++Q+ C T QC Y++ Y +SS G L D L+L+ EN
Sbjct: 63 KAVPCNSNFC-DLQKE-----CSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQ 116
Query: 308 LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
+ K ++ GC Q G L+ +G+ GL +VS+PS LA +G+ N C +
Sbjct: 117 ILKAQIMLGCGQTQTGSFLDAAAP-NGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRD- 174
Query: 368 GGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALF 427
G G + G S P+ + Y I I G+ P ++ +F
Sbjct: 175 -GIGRISFGDQ--ESSDQEETPLDINRQHPTYAITISGITVGNKPTDMDF------ITIF 225
Query: 428 DTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCW-----RAKFPIRSIVDV 481
DTG+S+TY AY+ + S +V ++ D+ P C+ A+FPI I+
Sbjct: 226 DTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIP-FEYCYDLSSSEARFPIPDII-- 282
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
+T+T GS + ++ IS + + + CL I+ +++ I+G
Sbjct: 283 ---LRTVT---GSMFPVIDPGQVISIQEHEYV-----YCLAIVKSMKLN-----IIGQNF 326
Query: 542 LRGQLVVYDNVNKRIGWAKSHCMN 565
+ G VV+D K +GW K +C +
Sbjct: 327 MTGLRVVFDRERKILGWKKFNCYD 350
>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 500
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 106/382 (27%), Positives = 175/382 (45%), Gaps = 52/382 (13%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWI--QCDA--PCSSCAKGANPLYKPRMGNI---L 253
L++ + VG P + + + +DTGSDL W+ QCD P ++ A G+ Y P M + +
Sbjct: 108 LHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSATFYIPGMSSTSKAV 167
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADH-SSSMGVLARDELHLTIENG--SLTK 310
P + C ++Q+ C T QC Y++ Y +SS G L D L+L+ EN + K
Sbjct: 168 PCNSNFC-DLQKE-----CSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILK 221
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
++ GC Q G L+ +G+ GL +VS+PS LA +G+ N C + G
Sbjct: 222 AQIMLGCGQTQTGSFLDA-AAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRD--GI 278
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTG 430
G + G S P+ + Y I I G+ P ++ +FDTG
Sbjct: 279 GRISFGDQ--ESSDQEETPLDINRQHPTYAITISGITVGNKPTDMDF------ITIFDTG 330
Query: 431 SSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCW-----RAKFPIRSIVDVKQF 484
+S+TY AY+ + S +V ++ D+ P C+ A+FPI I+
Sbjct: 331 TSFTYLADPAYTYITQSFHAQVQANRHAADSRIP-FEYCYDLSSSEARFPIPDII----- 384
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
+T+T GS + ++ IS + + + CL I+ +++ I+G + G
Sbjct: 385 LRTVT---GSMFPVIDPGQVISIQEHEYV-----YCLAIVKSMKLN-----IIGQNFMTG 431
Query: 545 QLVVYDNVNKRIGWAKSHCMNP 566
VV+D K +GW K +C +P
Sbjct: 432 LRVVFDRERKILGWKKFNCFSP 453
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 104/389 (26%), Positives = 177/389 (45%), Gaps = 53/389 (13%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
L ++ +G Y + +G PP+ + L +DTGS +T++ C C C +P ++P
Sbjct: 83 LYDDLLRNGYYTARLWIGTPPQRFALIVDTGSTVTYVPCST-CRHCGSHQDPKFRP---- 137
Query: 252 ILPYKDSLCMEIQRNHKPGYCE---TC----QQCDYEIEYADHSSSMGVLARDELHLTIE 304
E ++P C C +QC YE YA+ S+S G L D + +
Sbjct: 138 ----------EDSETYQPVKCTWQCNCDNDRKQCTYERRYAEMSTSSGALGEDVVSFGNQ 187
Query: 305 NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
L+ +FGC D+ G + N + DGI+GL R +S+ QL + +I + C
Sbjct: 188 T-ELSPQRAIFGCENDETGDIYNQ--RADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYG 244
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPM--LDSPFMELYHTEILKINYGSSPLNLGAR--NS 420
GGG M LG + P M + + SP+ Y+ ++ +I+ L+L + +
Sbjct: 245 GMGVGGGAMVLG-GISPPADMVFTRSDPVRSPY---YNIDLKEIHVAGKRLHLNPKVFDG 300
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSIV 479
+ G + D+G++Y Y + A+ ++ + + + DP +C+ + +
Sbjct: 301 KHG-TVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRYNDICFSG-----AEI 354
Query: 480 DVKQF---FKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGST 534
DV Q F + + FG+ K +SPE YL K G CLG+ N T
Sbjct: 355 DVSQISKSFPVVEMVFGN-----GHKLSLSPENYLFRHSKVRGAYCLGVFSNG---NDPT 406
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+LG I +R LV+YD + +IG+ K++C
Sbjct: 407 TLLGGIVVRNTLVMYDREHTKIGFWKTNC 435
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 111/399 (27%), Positives = 176/399 (44%), Gaps = 51/399 (12%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
PL G+ PD GLY+ + +G P + YY+ +DTGSD+ W+ C C C + ++
Sbjct: 73 IPLGGSGRPDAVGLYYAKIGIGTPSKDYYVQVDTGSDIVWVNC-IQCRECPRTSSLGMEL 131
Query: 243 ---PLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARD-- 297
L + G ++ + C+E+ G C T C Y Y D SS+ G +D
Sbjct: 132 TPYDLEESTTGKLVSCDEQFCLEVNGGPLSG-CTTNMSCPYLQIYGDGSSTAGYFVKDYV 190
Query: 298 -------ELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKT-DGILGLSRAKVSLPSQL 349
+L T NGS+ FGC Q G L ++ + DGILG ++ S+ SQL
Sbjct: 191 QYNRVSGDLETTAANGSIK-----FGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQL 245
Query: 350 ASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYG 409
AS +K + HCL GGG +GH + P M P++ P Y+ + + G
Sbjct: 246 ASTRKVKKMFAHCL-DGTNGGGIFAMGHVVQPKVNMT--PLV--PNQPHYNVNMTGVQVG 300
Query: 410 SSPLNLGARNSQVG---WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP 466
LN+ A + G + D+G++ Y + Y L+A + + V T+
Sbjct: 301 HIILNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEV-----QTIH 355
Query: 467 VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILD- 525
++ F VD F + HF + S + P YL + C+G +
Sbjct: 356 GEYKC-FQYSERVD--DGFPPVIFHFEN-----SLLLKVYPHEYL-FQYENLWCIGWQNS 406
Query: 526 GSEVHNGSTIIL-GDISLRGQLVVYDNVNKRIGWAKSHC 563
G + + + L GD+ L +LV+YD N+ IGW + +C
Sbjct: 407 GMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNC 445
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 106/425 (24%), Positives = 185/425 (43%), Gaps = 57/425 (13%)
Query: 168 HKSKINKKLVSSNAVAVDSSSIFP---LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSD 224
H + ++ S +++A D + G + G YF + VG+PP + +DTGSD
Sbjct: 51 HAAPFTAQVASFHSIAADDDDRLRSPVMSGVPFDSGEYFAVINVGDPPTRALVVIDTGSD 110
Query: 225 LTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYE 281
L W+QC PC C + PLY PR + +P C ++ R PG C Y
Sbjct: 111 LIWLQC-VPCRHCYRQVTPLYDPRSSSTHRRIPCASPRCRDVLR--YPGCDARTGGCVYM 167
Query: 282 IEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRA 341
+ Y D S+S G LA D L + NV GC +D GLL + G+LG+ R
Sbjct: 168 VVYGDGSASSGDLATDRLVFPDDT---HVHNVTLGCGHDNVGLLESAA----GLLGVGRG 220
Query: 342 KVSLPSQLASQ--GIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFM-EL 398
++S P+QLA + +G L+ G Y+ G P A+ P+ +P L
Sbjct: 221 QLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYLVFGRTPEPP-STAFTPLRTNPRRPSL 279
Query: 399 YHTEILKINYGSSPLNLGARNSQV--------GWALFDTGSSYTYFTKQAYSEL------ 444
Y+ +++ + G + G N+ + G + D+G++ + F + AY+ +
Sbjct: 280 YYVDMVGFSVGGERVT-GFSNASLALNPATGRGGIVVDSGTAISRFARDAYAAVRDAFDS 338
Query: 445 ----IASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF--GSKWQI 498
+++++++ V DA C+ + V+ ++ LHF G+ +
Sbjct: 339 HAAAAGTMRKLATKFSVFDA-------CYDLRGNGAPAAAVR--VPSIVLHFAGGADMAL 389
Query: 499 VSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGW 558
+ I +G ++ CLG+ + N +LG++ +G +V+D RIG+
Sbjct: 390 PQANYLIPVQGG---DRRTYFCLGLQAADDGLN----VLGNVQQQGFGLVFDVERGRIGF 442
Query: 559 AKSHC 563
+ C
Sbjct: 443 TPNGC 447
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 164/381 (43%), Gaps = 54/381 (14%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPY 255
DG Y + +G P +P+ MDTGSDL W QC PC+ C + P++ P+ + LP
Sbjct: 92 DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPC 150
Query: 256 KDSLCMEIQRNHKPGYCETC--QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
LC +Q TC C Y Y D S + G + + L GS++ PN+
Sbjct: 151 SSQLCQALQS-------PTCSNNSCQYTYGYGDGSETQGSMGTETLTF----GSVSIPNI 199
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-TNAGGGGY 372
FGC + QG G++G+ R +SLPSQL + K +C+T +
Sbjct: 200 TFGCGENNQGFGQG---NGAGLVGMGRGPLSLPSQLD---VTK--FSYCMTPIGSSNSST 251
Query: 373 MFLGH--DLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG------ARNSQVGW 424
+ LG + V + + S Y+ + ++ GS+PL + N+ G
Sbjct: 252 LLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGG 311
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGL-VLDASDPTLPVCWRAKFPIRSIVDVKQ 483
+ D+G++ TYF AY + + +S L V++ S +C++ ++
Sbjct: 312 IIIDSGTTLTYFVDNAYQAVRQAF--ISQMNLSVVNGSSSGFDLCFQMPSDQSNLQ---- 365
Query: 484 FFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISL 542
T +HF G + S + ISP G ICL + S+ + I G+I
Sbjct: 366 -IPTFVMHFDGGDLVLPSENYFISP-------SNGLICLAMGSSSQGMS----IFGNIQQ 413
Query: 543 RGQLVVYDNVNKRIGWAKSHC 563
+ LVVYD N + + + C
Sbjct: 414 QNLLVVYDTGNSVVSFLSAQC 434
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 101/375 (26%), Positives = 159/375 (42%), Gaps = 29/375 (7%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS-CAKGANPLYKPRMGNI 252
G+ G Y + +G P R DTGSDLTW QC+ PC+ C P++ P
Sbjct: 130 GSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCE-PCARYCYHQQEPIFNPSKSTS 188
Query: 253 ---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLT 309
+ C E++ + C Y I+Y D S S+G A+D+L LT +
Sbjct: 189 YTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTD---V 245
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG 369
N +FGC + +GL V G++GL R +SL SQ A + + +CL + +
Sbjct: 246 FNNFLFGCGQNNRGL----FVGVAGLIGLGRNALSLVSQTAQK--YGKLFSYCLPSTSSS 299
Query: 370 GGYMFLGHDLVPSWGMAWVP-MLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
GY+ G S + + P +++S Y ++ I+ G L+ A + D
Sbjct: 300 TGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFSTAGTIID 359
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
+G+ + AYS+L AS ++ S A L C+ F VDV + +
Sbjct: 360 SGTVISRLPPTAYSDLRASFQQQMSK-YPKAAPASILDTCY--DFSQYDTVDVPK----I 412
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
L+F ++ + P G I +CL S+ + ILG++ + VV
Sbjct: 413 NLYFSDGAEM-----DLDPSGIFYILNISQVCLAFAGNSDATD--IAILGNVQQKTFDVV 465
Query: 549 YDNVNKRIGWAKSHC 563
YD RIG+A C
Sbjct: 466 YDVAGGRIGFAPGGC 480
>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 533
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 98/391 (25%), Positives = 169/391 (43%), Gaps = 50/391 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN---------PLYKPRMGN 251
L++ + +G P Y + +DTGSDL W+ CD S C +G +Y+P +
Sbjct: 112 LHYANVSIGTPSLSYLVALDTGSDLFWLPCDCTNSGCVQGLQFPSGEQIDFNIYRPNASS 171
Query: 252 ---ILPYKDSLCMEIQRNHKPGYCETCQQ-CDYEIEY-ADHSSSMGVLARDELHLTIENG 306
+P ++LC R C + Q C Y+++Y ++ +SS GVL D LHLT ++
Sbjct: 172 TSQTIPCNNTLCSRQSR------CPSAQSTCPYQVQYLSNGTSSTGVLVEDLLHLTTDDA 225
Query: 307 S--LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
++FGC Q G L+ +G+ GL +S+PS LA +G N C
Sbjct: 226 QSRALDAKIIFGCGRVQTGSFLDG-AAPNGLFGLGMTNISVPSTLAREGYTSNSFSMCFG 284
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGW 424
+ G G + G S G P Y+ I KI N+G R++ + +
Sbjct: 285 RD--GIGRISFGD--TGSSGQGETPFNLRQLHPTYNVSITKI-------NVGGRDADLEF 333
Query: 425 -ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
A+FD+G+S+TY AY+ + S + + SD C+ + +++
Sbjct: 334 SAIFDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYEMS---SNQTNLEI 390
Query: 484 FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI-CLGILDGSEVHNGSTIILGDISL 542
L + GS++ + P +++ +I CL I+ +V+ I+G +
Sbjct: 391 PTVNLVMQGGSQFNVT------DPIVIVILQGGASIYCLAIVKSGDVN-----IIGQNFM 439
Query: 543 RGQLVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
G +V++ +GW S C + + P
Sbjct: 440 TGYRIVFNRERNVLGWKASDCYDDMDTTTFP 470
>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 525
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 112/455 (24%), Positives = 190/455 (41%), Gaps = 48/455 (10%)
Query: 127 LYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDS 186
L+HKF + + + + D E + ++R H + + A
Sbjct: 34 LFHKFSKQAIEAMRSRNGMDYAQDWPTEGTIEF--QTMLRDHDVARHTRTARRILAASSM 91
Query: 187 SSIFPLRGN----IYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN 242
++GN ++ GL+++Y+ +G P + + +DTGSDL WI C+ C SCA +
Sbjct: 92 DQYVLIQGNATEQLFGGGLHYSYIDIGTPNVQFLVVLDTGSDLLWIPCE--CESCAPLSA 149
Query: 243 PLYKPRMGNILPYKDSLCMEIQRNHKPGYC--------ETCQ----QCDYEIEYAD-HSS 289
PR + PY SL KP C TC QC YEI Y ++S
Sbjct: 150 ESKDPRTSQLNPYTPSL----SSTAKPVLCSDPLCEMSSTCMAPTDQCPYEINYVSANTS 205
Query: 290 SMGVLARDELHLTIEN-GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQ 348
+ G L D ++ E+ G+ K V GC Q G LL +G++GL +S+P++
Sbjct: 206 TSGALYEDYMYFMRESGGNPVKLPVYLGCGKVQTGSLLKG-AAPNGLMGLGTTDISVPNK 264
Query: 349 LASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINY 408
LAS G + + C++ GG G + G + + + ++ Y EI I
Sbjct: 265 LASTGQLADSFSLCIS--PGGSGTLTFGDEGPAAQRTTPIIPKSVSMLDTYIVEIDSITV 322
Query: 409 GSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVC 468
G++ L + + ALFDTG+S+TY +K Y + + + S D +C
Sbjct: 323 GNTNLLMASH------ALFDTGTSFTYLSKTVYPQFVQAYDAQMSLPKWNDPRFSKWDLC 376
Query: 469 WRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSE 528
++ S + + +L L G+ +VS I + +I+ +C+ ++D
Sbjct: 377 YQT-----SNTNFQVPVVSLALSGGNSLDVVSGLKSIVDDNNAMIA----VCVTVMD--- 424
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G + + Y+ IGW S C
Sbjct: 425 -SGAGLSIIGQNFMTNYSITYNRAKMTIGWTPSDC 458
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 140/290 (48%), Gaps = 27/290 (9%)
Query: 176 LVSSNAVAVDSSSIFPLRGNIYP--DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP 233
L SSN V VD F ++G P GLY+T + +G PP + + +DTGSD+ W+ C++
Sbjct: 2 LQSSNGV-VD----FSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNS- 55
Query: 234 CSSCAKGAN-----PLYKP---RMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYA 285
CS C + + + P +++ D C ++ QC Y +Y
Sbjct: 56 CSGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYG 115
Query: 286 DHSSSMGVLARDELHL-TIENGSLTKPN---VVFGCAYDQQGLLLNTLVKTDGILGLSRA 341
D S + G D +HL TI GS+T + VVFGC+ Q G L + DGI G +
Sbjct: 116 DGSGTSGYYVSDMMHLNTIFEGSVTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQ 175
Query: 342 KVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHT 401
++S+ SQL+SQGI V HCL ++ GGG + LG + P+ + + ++ P Y+
Sbjct: 176 EMSVISQLSSQGIAPRVFSHCLKGDSSGGGILVLGEIVEPN--IVYTSLV--PAQPHYNL 231
Query: 402 EILKINYGSSPLNLGAR---NSQVGWALFDTGSSYTYFTKQAYSELIASL 448
+ I L + + S + D+G++ Y ++AY ++++
Sbjct: 232 NLQSIAVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAI 281
>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 627
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 115/408 (28%), Positives = 179/408 (43%), Gaps = 57/408 (13%)
Query: 190 FPLRGNIYPDG-----LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN-- 242
F G I P G LY+T++ VG P + + +DTGSDL WI CD C CA +
Sbjct: 191 FSKDGGIIPTGNDFGWLYYTWVDVGTPNTSFMVALDTGSDLFWIPCD--CIECAPLSGYH 248
Query: 243 -------PLYKPRMGNI---LPYKDSLCM--EIQRNHKPGYCETCQQCDYEIEY-ADHSS 289
+YKP LP LC+ N K Q C Y +Y ++++
Sbjct: 249 GSLDRDLGIYKPAESTTSRHLPCSHELCLLGSDCTNQK-------QPCPYNTKYLQENTT 301
Query: 290 SMGVLARDELHL-TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQ 348
S G+L D LHL + E+ + K +V+ GC Q G L+ + DG+LGL A +S+PS
Sbjct: 302 SSGLLVEDILHLDSRESHAPVKASVIIGCGRKQSGSYLDGIAP-DGLLGLGMADISVPSF 360
Query: 349 LASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELY-HTEILKIN 407
LA G+++N C T ++ G +F G V + +PF+ LY + +N
Sbjct: 361 LARAGLVRNSFSMCFTKDS---GRIFFGDQGVST-------QQSTPFVPLYGKLQTYTVN 410
Query: 408 YGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLP 466
S + S A+ D+G+S+T Y + K+V++ L +A+ +
Sbjct: 411 VDKSCVGHKCFESTSFQAIVDSGTSFTALPLDIYKAVAIEFDKQVNASRLPQEAT--SFD 468
Query: 467 VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDG 526
C+ A + + DV TLT +Q V+ F + E V CL ++
Sbjct: 469 YCYSASPLV--MPDVPTV--TLTFAGNKSFQPVNPTFLLHDEEGAV----AGFCLAVVQS 520
Query: 527 SEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPF 574
E I+ L G VV+D N ++GW +S C + ++P
Sbjct: 521 PEPIG----IIAQNFLLGYHVVFDRENMKLGWYRSECHDLDNSTTVPL 564
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 111/404 (27%), Positives = 172/404 (42%), Gaps = 44/404 (10%)
Query: 172 INKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCD 231
+ +L + + V +SS+ G G Y T + +G P Y + +D+GS LTW+QC
Sbjct: 78 LASRLATKDKDWVAASSVPLASGASVGVGNYITRLGLGTPTTTYVMVVDSGSSLTWLQC- 136
Query: 232 APCS-SCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNH-KPGYCETCQQCDYEIEYAD 286
APC+ SC A PLY PR + +P C E+Q P C C Y+ Y D
Sbjct: 137 APCAVSCHPQAGPLYDPRASSTYAAVPCSAPQCAELQAATLNPSSCSGSGVCQYQASYGD 196
Query: 287 HSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLP 346
S S G L++D + L+ +GS P +GC D GL + G++GL+R K+SL
Sbjct: 197 GSFSFGYLSKDTVSLS-SSGSF--PGFYYGCGQDNVGL----FGRAAGLIGLARNKLSLL 249
Query: 347 SQLASQGIIKNVVGHCL-TTNAGGGGYMFLG--HDLVPSWGMAWVPMLDSPF-MELYHTE 402
SQLA + N +CL T+ A GY+ G D ++ M+ S LY
Sbjct: 250 SQLAPS--VGNSFAYCLPTSAAASAGYLSFGSNSDNKNPGKYSYTSMVSSSLDASLYFVS 307
Query: 403 ILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASD 462
+ ++ SPL + + + D+G+ T Y+ L ++ + S
Sbjct: 308 LAGMSVAGSPLAVPSSEYGSLPTIIDSGTVITRLPTPVYTALSKAVGAALAAPSAPAYS- 366
Query: 463 PTLPVCWR---AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI 519
L C++ AK P+ ++ + F + ++P LV +
Sbjct: 367 -ILQTCFKGQVAKLPVPAV----------NMAFAGGATL-----RLTPGNVLVDVNETTT 410
Query: 520 CLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
CL ST I+G+ + VVYD RIG+A C
Sbjct: 411 CLAF-----APTDSTAIIGNTQQQTFSVVYDVKGSRIGFAAGGC 449
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 101/387 (26%), Positives = 169/387 (43%), Gaps = 51/387 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YFT + VG P + +DTGSD+ W+QC APC C + + P++ PR + Y
Sbjct: 127 GEYFTKIGVGTPATQALMVLDTGSDVVWVQC-APCRRCYEQSGPVFDPRRSS--SYGAVG 183
Query: 260 C-MEIQRNHKPGYCETCQ-QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGC 317
C + R G C+ + C Y++ Y D S + G + L G V GC
Sbjct: 184 CGAALCRRLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFA---GGARVARVALGC 240
Query: 318 AYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--TNAGGGGY--- 372
+D +GL + L R +S P+Q++ + +CL T++G G
Sbjct: 241 GHDNEGLFVAAAGLLG----LGRGGLSFPTQISRR--YGRSFSYCLVDRTSSGAGAAPGS 294
Query: 373 -----MFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGS--------SPLNLGAR 418
+ G V + ++ PM+ +P ME Y+ +++ I+ G S L L
Sbjct: 295 HRSSTVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPS 354
Query: 419 NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRS 477
+ G + D+G+S T + +YS L + + ++ GL L +L C+ R
Sbjct: 355 TGR-GGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCY--DLGGRR 411
Query: 478 IVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTII 536
+V V T+++HF + + PE YL+ + +G C +G I
Sbjct: 412 VVKV----PTVSMHFAGGAEAA-----LPPENYLIPVDSRGTFCFAFAG----TDGGVSI 458
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G+I +G VV+D +R+G+A C
Sbjct: 459 IGNIQQQGFRVVFDGDGQRVGFAPKGC 485
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 100/389 (25%), Positives = 169/389 (43%), Gaps = 46/389 (11%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
Y+ + VG P L MDTGSD++WIQC PC C P + PR + LP S
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQC-VPCKDCVPALRPPFNPRHSSSFFKLPCASS 197
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP----NVV 314
C + + KP + + C + I+Y D S S G+LA + + N +P N+
Sbjct: 198 TCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNIT 257
Query: 315 FGCA-YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TTNAGGG 370
GCA D++GL G+LG+ R +S PSQL+S+ K HC +
Sbjct: 258 LGCADIDREGL----PTGASGLLGMDRRPISFPSQLSSRYARK--FSHCFPDKIAHLNSS 311
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPF-----MELYHTEILKINYGSSPLNLGARNSQV--- 422
G +F G + S + + P++ +P ++ Y+ ++ I+ S L L +N +
Sbjct: 312 GLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKV 371
Query: 423 ---GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV 479
G + D+G+++TY K A+ + +S +D + P C+ ++
Sbjct: 372 TGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTP-CYNITSGTAALE 430
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-----SKKGNICLGILDGSEVHNGST 534
++TLHF +V P+ ++I ++ +CL L ++
Sbjct: 431 ST--ILPSITLHFRGGLDVV------LPKNSILIPVSSSEEQTTLCLAFLMSGDIPFN-- 480
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G+ + V YD R+G A + C
Sbjct: 481 -IIGNYQQQNLWVEYDLEKLRLGIAPAQC 508
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 115 bits (288), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 164/381 (43%), Gaps = 54/381 (14%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPY 255
DG Y + +G P +P+ MDTGSDL W QC PC+ C + P++ P+ + LP
Sbjct: 92 DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPC 150
Query: 256 KDSLCMEIQRNHKPGYCETC--QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
LC +Q TC C Y Y D S + G + + L GS++ PN+
Sbjct: 151 SSQLCQALQS-------PTCSNNSCQYTYGYGDGSETQGSMGTETLTF----GSVSIPNI 199
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYM 373
FGC + QG G++G+ R +SLPSQL + K +C+T
Sbjct: 200 TFGCGENNQGFGQG---NGAGLVGMGRGPLSLPSQLD---VTK--FSYCMTPIGSSTSST 251
Query: 374 FLGHDLVPS--WGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLG------ARNSQVGW 424
L L S G +++S + Y+ + ++ GS+PL + N+ G
Sbjct: 252 LLLGSLANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGG 311
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGL-VLDASDPTLPVCWRAKFPIRSIVDVKQ 483
+ D+G++ TYF AY + + +S L V++ S +C++ ++
Sbjct: 312 IIIDSGTTLTYFADNAYQAVRQAF--ISQMNLSVVNGSSSGFDLCFQMPSDQSNLQ---- 365
Query: 484 FFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISL 542
T +HF G + S + ISP G ICL + S+ + I G+I
Sbjct: 366 -IPTFVMHFDGGDLVLPSENYFISP-------SNGLICLAMGSSSQGMS----IFGNIQQ 413
Query: 543 RGQLVVYDNVNKRIGWAKSHC 563
+ LVVYD N + + + C
Sbjct: 414 QNLLVVYDTGNSVVSFLFAQC 434
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 165/381 (43%), Gaps = 54/381 (14%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPY 255
DG Y + +G P +P+ MDTGSDL W QC PC+ C + P++ P+ + LP
Sbjct: 92 DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPC 150
Query: 256 KDSLCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
LC + TC C Y Y D S + G + + L GS++ PN+
Sbjct: 151 SSQLCQALSS-------PTCSNNFCQYTYGYGDGSETQGSMGTETLTF----GSVSIPNI 199
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-TNAGGGGY 372
FGC + QG G++G+ R +SLPSQL + K +C+T +
Sbjct: 200 TFGCGENNQGFGQG---NGAGLVGMGRGPLSLPSQLD---VTK--FSYCMTPIGSSTPSN 251
Query: 373 MFLGH--DLVPSWGMAWVPMLDSPFMELYHTEILKINYGSS--PLNLGA----RNSQVGW 424
+ LG + V + + S Y+ + ++ GS+ P++ A N+ G
Sbjct: 252 LLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGG 311
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGL-VLDASDPTLPVCWRAKFPIRSIVDVKQ 483
+ D+G++ TYF AY + +S L V++ S +C++ ++
Sbjct: 312 IIIDSGTTLTYFVNNAYQSVRQEF--ISQINLPVVNGSSSGFDLCFQTPSDPSNLQ---- 365
Query: 484 FFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISL 542
T +HF G ++ S + ISP G ICL + S+ + I G+I
Sbjct: 366 -IPTFVMHFDGGDLELPSENYFISP-------SNGLICLAMGSSSQGMS----IFGNIQQ 413
Query: 543 RGQLVVYDNVNKRIGWAKSHC 563
+ LVVYD N + +A + C
Sbjct: 414 QNMLVVYDTGNSVVSFASAQC 434
>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 107/386 (27%), Positives = 176/386 (45%), Gaps = 40/386 (10%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCA--KGANPLYKPRMGNILPYKDS 258
LY+T++ VG P + + +DTGSDL W+ CD C CA G +G P + +
Sbjct: 142 LYYTWVDVGTPNTSFMVALDTGSDLFWVPCD--CIECAPLAGYRETLDRDLGIYKPAEST 199
Query: 259 LCMEIQRNHK---PGY-CETCQQ-CDYEIEY-ADHSSSMGVLARDELHL-TIENGSLTKP 311
+ +H+ PG C + +Q C Y +Y ++++S G+L D LHL + E+ + K
Sbjct: 200 TSRHLPCSHELCPPGSGCSSPKQPCPYSTDYLQENTTSSGLLIEDILHLDSRESHAPVKA 259
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG 371
+VV GC Q G L+ + DG+LGL A +S+PS LA G+++N C ++ G
Sbjct: 260 SVVIGCGRKQSGSYLDGIAP-DGLLGLGMADISVPSFLARAGLVRNSFSMCFKEDS---G 315
Query: 372 YMFLGHDLVPSWGMAWVPMLDSPFMELY-HTEILKINYGSSPLNLGARNSQVGWALFDTG 430
+F G V +PF+ LY + +N S + + AL D+G
Sbjct: 316 RIFFGDQGVSI-------QQSTPFVPLYGKYQTYAVNVDKSCVGHKCFEATSFEALVDSG 368
Query: 431 SSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLT 489
+S+T Y + K+V + + + D + C+ A P++ + DV TLT
Sbjct: 369 TSFTALPLNVYKAVAVEFDKQVHAPRITQE--DASFEYCYSAS-PLK-MPDVPTV--TLT 422
Query: 490 LHFGSKWQIVSTKFHISP-EGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
+Q V+ + EG + CL + E I+G L G +V
Sbjct: 423 FAANKSFQAVNPTIVLKDGEGSVA-----GFCLALQKSPE----PIGIIGQNFLTGYHIV 473
Query: 549 YDNVNKRIGWAKSHCMNPGRFKSLPF 574
+D N ++GW +S C +P ++P
Sbjct: 474 FDKENMKLGWYRSECHDPDNSTTVPL 499
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 114/472 (24%), Positives = 199/472 (42%), Gaps = 63/472 (13%)
Query: 118 ENKESFVFPLYHK---------FGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPH 168
E K S V + H+ ++E+ Q + R ++ +A++ G+ +
Sbjct: 63 EEKNSIVLQVVHRDSLSSSSNTSLVKEILQERLKRDAARVDSINARVQLAAM--GVSKAE 120
Query: 169 KSKINKKLVSSNAVAVD-SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTW 227
+N + + A D SSSI + G G YFT + VG PPR Y+ +DTGSD+ W
Sbjct: 121 MKPLNGSSIDARFDAKDFSSSI--ISGLAQGSGEYFTRLGVGTPPRYTYMVLDTGSDIMW 178
Query: 228 IQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY 284
IQC PC+ C +PL+ P + +P LC ++ + C + C+Y++ Y
Sbjct: 179 IQC-LPCAKCYGQTDPLFNPAASSTYRKVPCATPLCKKLDISG----CRNKRYCEYQVSY 233
Query: 285 ADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVS 344
D S ++G + + L G + + V GC +D +GL + L R +S
Sbjct: 234 GDGSFTVGDFSTETLTF---RGQVIR-RVALGCGHDNEGLFIGAAGLLG----LGRGSLS 285
Query: 345 LPSQLASQGIIKNVVGHCLT--TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFME-LYHT 401
PSQ +Q +CL + +G + G +P + + P+L +P ++ Y+
Sbjct: 286 FPSQTGAQ--FSKRFSYCLVDRSASGTASSLIFGKAAIPKSAI-FTPLLSNPKLDTFYYV 342
Query: 402 EILKINYG--------SSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSS 453
E++ I+ G +S + A + G + D+G+S T AYS + + + V +
Sbjct: 343 ELVGISVGGRRLTSIPASVFRMDATGN--GGVIIDSGTSVTRLVDSAYSTMRDAFR-VGT 399
Query: 454 DGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV- 512
L C+ + + VK TL HF I + YL+
Sbjct: 400 GNLKSAGGFSLFDTCY----DLSGLKTVK--VPTLVFHFQGGAHI-----SLPATNYLIP 448
Query: 513 ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCM 564
+ C + G I+G+I +G VV+D++ R+G+ C+
Sbjct: 449 VDSSATFCFAFAG----NTGGLSIIGNIQQQGYRVVFDSLANRVGFKAGSCL 496
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 106/376 (28%), Positives = 164/376 (43%), Gaps = 38/376 (10%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS-CAKGANPLYKPRMGNILPYKD 257
G Y+ + +G PP+ Y + +DTGS L+W+QC PC+ C A+PLY P + YK
Sbjct: 122 SGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQ-PCAVYCHAQADPLYDPSVSKT--YKK 178
Query: 258 SLCMEIQRNHKPG------YCET-CQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
C ++ + CET C Y Y D S S+G L++D L LT S T
Sbjct: 179 LSCASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTLT---SSQTL 235
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGG 369
P +GC D QGL + GI+GL+R K+S+ +QL+++ + +CL T N+G
Sbjct: 236 PQFTYGCGQDNQGL----FGRAAGIIGLARDKLSMLAQLSTK--YGHAFSYCLPTANSGS 289
Query: 370 GGYMFLGHDLVPSWGMAWVPML-DSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G FL + + PML DS LY + I PL+L A +V L D
Sbjct: 290 SGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVP-TLID 348
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKF-PIRSIVDVKQFFKT 487
+G+ T Y+ L + ++ S + L C++ I ++ ++K F+
Sbjct: 349 SGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQG 408
Query: 488 LTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLV 547
+ L+ + KG CL S + I+G+ + +
Sbjct: 409 ------------GADLTLRAPSILIEADKGITCLAFAGSSGTNQ--IAIIGNRQQQTYNI 454
Query: 548 VYDNVNKRIGWAKSHC 563
YD RIG+A C
Sbjct: 455 AYDVSTSRIGFAPGSC 470
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 118/411 (28%), Positives = 171/411 (41%), Gaps = 42/411 (10%)
Query: 172 INKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCD 231
I++ + + AV S+ RG G Y + +G P R + DTGSDL+W+QC
Sbjct: 55 IHRMIANETAVVGQDVSLPAERGISVGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQC- 113
Query: 232 APCSS--CAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETC---QQCDYEIEYAD 286
PCSS C +PL+ P + + C E + C + +C YE+ Y D
Sbjct: 114 GPCSSGGCYHQQDPLFAPSSSST--FSAVRCGEPECPRARQSCSSSPGDDRCPYEVVYGD 171
Query: 287 HSSSMGVLARDELHLTI-------ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLS 339
S ++G L D L L EN S P VFGC + GL K DG+ GL
Sbjct: 172 KSRTVGHLGNDTLTLGTTPSTNASENNSNKLPGFVFGCGENNTGL----FGKADGLFGLG 227
Query: 340 RAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHDLVPSWGMAWVPMLD-SPFME 397
R KVSL SQ A G +CL ++++ GY+ LG + PML+ S
Sbjct: 228 RGKVSLSSQAA--GKYGEGFSYCLPSSSSNAHGYLSLGTPAPAPAHARFTPMLNRSNTPS 285
Query: 398 LYHTEILKINYGSSPLNLGARNSQVGWA---LFDTGSSYTYFTKQAYSEL-IASLKEVSS 453
Y+ +++ I + + +R + W + D+G+ T +AYS L A L +
Sbjct: 286 FYYVKLVGIRVAGRAIKVSSRPAL--WPAGLIVDSGTVITRLAPRAYSALRTAFLSAMGK 343
Query: 454 DGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI 513
G L C+ + V + + L F I S F G L +
Sbjct: 344 YGYKRAPRLSILDTCYDFTAHANATVSI----PAVALVFAGGATI-SVDF----SGVLYV 394
Query: 514 SKKGNICLGILDGSEVHNG-STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+K CL NG S ILG+ R VVYD ++IG+A C
Sbjct: 395 AKVAQACLAFAPNG---NGRSAGILGNTQQRTVAVVYDVGRQKIGFAAKGC 442
>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
Length = 499
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 105/381 (27%), Positives = 174/381 (45%), Gaps = 52/381 (13%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWI--QCDA--PCSSCAKGANPLYKPRMGNI---L 253
L++ + VG P + + + +DTGSDL W+ QCD P ++ A G+ Y P M + +
Sbjct: 107 LHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSATFYIPGMSSTSKAV 166
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADH-SSSMGVLARDELHLTIENG--SLTK 310
P + C ++Q+ C T QC Y++ Y +SS G L D L+L+ EN + K
Sbjct: 167 PCNSNFC-DLQKE-----CSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILK 220
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
++ GC Q G L+ +G+ GL +VS+PS LA +G+ N C + G
Sbjct: 221 AQIMLGCGQTQTGSFLDA-AAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRD--GI 277
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTG 430
G + G S P+ + Y I I G+ P +L +FDTG
Sbjct: 278 GRISFGDQ--GSSDQEETPLNINQQHPTYAITISGITIGNKPTDLDF------ITIFDTG 329
Query: 431 SSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCW-----RAKFPIRSIVDVKQF 484
+S+TY AY+ + S +V ++ D+ P C+ A+FPI I+
Sbjct: 330 TSFTYLADPAYTYITQSFHAQVQANRHAADSRIP-FEYCYDLSSSEARFPIPDII----- 383
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
+T++ GS + ++ IS + + + CL I+ +++ I+G + G
Sbjct: 384 LRTVS---GSLFPVIDPGQVISIQEHEYV-----YCLAIVKSRKLN-----IIGQNFMTG 430
Query: 545 QLVVYDNVNKRIGWAKSHCMN 565
VV+D K +GW K +C +
Sbjct: 431 LRVVFDRERKILGWKKFNCFS 451
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 107/376 (28%), Positives = 159/376 (42%), Gaps = 47/376 (12%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCM 261
Y + +G P R + DTGSDL+W+QC PC C + +PL+ P
Sbjct: 138 YIVSVGLGTPKRDLLVVFDTGSDLSWVQCK-PCDGCYQQHDPLFDPSQSTTYSAVPCGAQ 196
Query: 262 EIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV---VFGCA 318
E +R G C + +C YE+ Y D S + G LARD L L + S + + VFGC
Sbjct: 197 ECRR-LDSGSCSS-GKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFGCG 254
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
D GL K DG+ GL R +VSL SQ A++ +CL +++ GY+ LG
Sbjct: 255 DDDTGL----FGKADGLFGLGRDRVSLASQAAAK--YGAGFSYCLPSSSTAEGYLSLGSA 308
Query: 379 LVPSWGM-AWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFT 437
P+ A V D+P Y+ ++ I + + + + D+G+ T
Sbjct: 309 APPNARFTAMVTRSDTP--SFYYLNLVGIKVAGRTVRVSPAVFRTPGTVIDSGTVITRLP 366
Query: 438 KQAYSELIASLKEVSSDGLVLDASDPTLPV------CW----RAKFPIRSIVDVKQFFKT 487
+AY+ L +S GL+ S P C+ R K I S+ + T
Sbjct: 367 SRAYAALRSSFA-----GLMRRYSYKRAPALSILDTCYDFTGRNKVQIPSVALLFDGGAT 421
Query: 488 LTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLV 547
L L FG L ++ K CL S + S ILG++ + V
Sbjct: 422 LNLGFGE---------------VLYVANKSQACLAF--ASNGDDTSIAILGNMQQKTFAV 464
Query: 548 VYDNVNKRIGWAKSHC 563
VYD N++IG+ C
Sbjct: 465 VYDVANQKIGFGAKGC 480
>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
Length = 649
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 117/440 (26%), Positives = 202/440 (45%), Gaps = 68/440 (15%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNP-PRPYYLDMDTGS 223
+R H + ++++ S A + +S FPL G++ G Y+ + +G+P PR + + +DTGS
Sbjct: 76 LREHDAHRRRRILESPAES-PGASTFPLHGSVKEHGYYYANIALGDPSPRTFQVIVDTGS 134
Query: 224 DLTWIQCDAPCSSCAKGANPLYKPRM---GNILPYKDSLCMEIQRNHKPGYC-----ETC 275
LT++ PC++CAK R G L ++ C + PG C
Sbjct: 135 TLTYV----PCATCAKCGTHTGGTRFDPTGKWLTCQEKQC---KAAGGPGICAGGRGAAA 187
Query: 276 QQCDYEIEYADHSSSMGVLARDELHLTIE-----NGSLTKPNVVFGCAYDQQGLLLNTLV 330
+C Y YA+ S G L RD++H + NG+L +VVFGC + G + +
Sbjct: 188 NRCTYSRTYAEGSGVSGDLVRDKMHFGGDIAPATNGTL---DVVFGCTNAESGTIHDQ-- 242
Query: 331 KTDGILGLSRAK-VSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVP-SWGMAWV 388
+ DG++GL + S+P+QLA + V C + GGG F P + + +
Sbjct: 243 EADGLIGLGNNQFASIPNQLADTHGLPRVFSLCFGSFEGGGALSFGRLPATPHTPPLVYT 302
Query: 389 PMLDSPFMELYH---TEILKIN--YGSSPLNLGARNSQVGWA-LFDTGSSYTYFTKQAYS 442
M + Y+ T +KI ++P +L VG+ + D+G+++TY + +
Sbjct: 303 DMRVNEAHPAYYVVSTAAMKIGDVAVATPSDL-----AVGYGTVMDSGTTFTYVPTKVFH 357
Query: 443 ELIASLKEVSSDG-------LVLDASDPTLP--VCWRAKF-----PIRSIVDVKQFFKTL 488
A+L + + DP+ P VC++ + PI ++ ++ +++ L
Sbjct: 358 ATAAALDAAVTTNAKPEKKLAKVPGPDPSYPDDVCFQREGATEIEPIVTMANLGEYYPPL 417
Query: 489 TLHF-GSKWQIVSTKFHISPEGYLVI--SKKGNICLGILDGSEVHNGSTIILGDISLRGQ 545
T+ F G +V + P YL + K G CLG++D + ++G IS+R
Sbjct: 418 TIAFDGEGASLV-----LPPSNYLFVHGKKPGAFCLGVMDNKQ----QGTLIGGISVRDV 468
Query: 546 LVVYDNV--NKRIGWAKSHC 563
LV YD RIG+A + C
Sbjct: 469 LVEYDKTVGGGRIGFAATDC 488
>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 124/497 (24%), Positives = 205/497 (41%), Gaps = 81/497 (16%)
Query: 85 LFLFLAISIFALILYGSVFSYTLQDRYKSNNDDENKESFVFPLYHKFGI-REVSQRDAEF 143
FL + I +F+ G VF++ + R+ + + + + +P F ++ RD
Sbjct: 8 FFLLITIWVFSKTCKGRVFTFKMHHRFSDSFKNWSGLTRNWPEKGSFEYYAALAHRDQML 67
Query: 144 KLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYF 203
+ R D D + D +S F + + L++
Sbjct: 68 RGRRLSDADASLAFS--------------------------DGNSTFRISSLGF---LHY 98
Query: 204 TYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCA--KGAN-------PLYKPRMGNILP 254
T + +G P + + +DTGSDL W+ CD CS CA GA+ +Y PR +
Sbjct: 99 TTVELGTPGVKFMVALDTGSDLFWVPCD--CSRCAPTHGASYASDFELSIYNPRESST-- 154
Query: 255 YKDSLC---MEIQRNHKPGYCETCQQCDYEIEYAD-HSSSMGVLARDELHLTIENG--SL 308
K C M QRN G T C Y + Y +S+ G+L +D LHLT E+G
Sbjct: 155 SKKVTCNNDMCAQRNRCLG---TFSSCPYIVSYVSAQTSTSGILVKDVLHLTTEDGGREF 211
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
+ V FGC Q G L+ + +G+ GL K+S+PS L+ +G+I + C +
Sbjct: 212 VEAYVTFGCGQVQSGSFLD-IAAPNGLFGLGMEKISVPSVLSREGLIADSFSMCFGHD-- 268
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G G + G P P +P Y+ + + G+ +++ ALFD
Sbjct: 269 GIGRISFGDKGSPD--QEETPFNVNPAHPTYNVTVTQARVGTMLIDVEFT------ALFD 320
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPV--CWRAKFPIRSIVDVKQFFK 486
+G+S+TY AYS + ++ D DP +P C+ P + V
Sbjct: 321 SGTSFTYMVDPAYSRVSEKFHSLARDK--RRPPDPRIPFEYCYDMS-PDANASLVPSM-- 375
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
+LT+ G + + IS + +V CL ++ +E++ I+G + G
Sbjct: 376 SLTMKGGRHFTVYDPIIVISTQNEIV------YCLAVVKSTELN-----IIGQNFMTGYR 424
Query: 547 VVYDNVNKRIGWAKSHC 563
VV+D +GW K C
Sbjct: 425 VVFDREKLVLGWKKFDC 441
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 94/355 (26%), Positives = 149/355 (41%), Gaps = 28/355 (7%)
Query: 209 GNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHK 268
G P + + DTGS++ WIQC SC PL+ P + + Y++ C
Sbjct: 23 GTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSST--YRNISCTSAACTGL 80
Query: 269 PGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNT 328
+ C Y + Y D SS++G LA + L N N +FGC + QGL
Sbjct: 81 SSRGCSGSTCVYGVTYGDGSSTVGFLATETFTLAAGN---VFNNFIFGCGQNNQGL---- 133
Query: 329 LVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWV 388
G++GL R+ SL SQLA+ + N+ +CL + + GY+ +G+ L A +
Sbjct: 134 FTGAAGLIGLGRSPYSLNSQLATS--LGNIFSYCLPSTSSATGYLNIGNPLRTPGYTAML 191
Query: 389 PMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASL 448
+P LY +++ I+ G + L L + Q + D+G+ T AY L +
Sbjct: 192 TNSRAP--TLYFIDLIGISVGGTRLALSSTVFQSVGTIIDSGTVITRLPPTAYGALRTAF 249
Query: 449 KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPE 508
+ + A+ L C+ F + V F T+ LH+ I
Sbjct: 250 RAAMTQ-YTRAAAASILDTCY--DFSRTTTVT----FPTIKLHY------TGLDVTIPGA 296
Query: 509 GYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G + +CL S+ + I+G++ R V YDN KRIG+A C
Sbjct: 297 GVFYVISSSQVCLAFAGNSD--STQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 168/383 (43%), Gaps = 46/383 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YFT + VG P P + +DTGSD+ W+QC APC C + ++ PR + D
Sbjct: 145 GEYFTKIGVGTPVTPALMVLDTGSDVVWLQC-APCRRCYDQSGQMFDPRASHSYGAVD-C 202
Query: 260 CMEIQRNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
+ R G C+ ++ C Y++ Y D S + G A + LT +G+ P V GC
Sbjct: 203 AAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATET--LTFASGARV-PRVALGCG 259
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--------TNAGGG 370
+D +GL + L R +S PSQ++ + +CL +
Sbjct: 260 HDNEGLFVAAAGLLG----LGRGSLSFPSQISRR--FGRSFSYCLVDRTSSSASATSRSS 313
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGS--------SPLNLGARNSQ 421
F + PS ++ PM+ +P ME Y+ +++ I+ G S L L +
Sbjct: 314 TVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPSTGR 373
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
G + D+G+S T + AY+ L + + ++ GL L +L + + + + V
Sbjct: 374 -GGVIVDSGTSVTRLARPAYAALRDAFRAAAA-GLRLSPGGFSL---FDTCYDLSGLKVV 428
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDI 540
K T+++HF + + PE YL+ + +G C +G I+G+I
Sbjct: 429 K--VPTVSMHFAGGAEAA-----LPPENYLIPVDSRGTFCFAFAG----TDGGVSIIGNI 477
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+G VV+D +R+G+ C
Sbjct: 478 QQQGFRVVFDGDGQRLGFVPKGC 500
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 109/406 (26%), Positives = 169/406 (41%), Gaps = 44/406 (10%)
Query: 170 SKINKKLVSSNAVAVDSSSIFPLR-GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWI 228
SK++KKL ++N V+ S+ P + G+ G Y + +G P L DTGSDLTW
Sbjct: 101 SKLSKKL-TTNHVSQSQSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWT 159
Query: 229 QCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQR-NHKPGYCETCQQCDYEIEY 284
QC +C P++ P + + C + G C C Y I+Y
Sbjct: 160 QCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSA-SNCIYGIQY 218
Query: 285 ADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVS 344
D S S+G LA+D+ LT + V FGC + QGL G+LGL R K+S
Sbjct: 219 GDQSFSVGFLAKDKFTLTSSD---VFDGVYFGCGENNQGLFTGVA----GLLGLGRDKLS 271
Query: 345 LPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLD-SPFMELYHTEI 403
PSQ A+ + +CL ++A G++ G + S + + P+ + Y I
Sbjct: 272 FPSQTATA--YNKIFSYCLPSSASYTGHLTFGSAGI-SRSVKFTPISTITDGTSFYGLNI 328
Query: 404 LKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDP 463
+ I G L + + AL D+G+ T +AY+ L +S K S
Sbjct: 329 VAITVGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMS---------- 378
Query: 464 TLPVCWRAKFPIRSIVDVKQF------FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKG 517
K+P S V + FKT+T+ + + +G K
Sbjct: 379 --------KYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYAFKIS 430
Query: 518 NICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+CL S+ N + I G++ + VVYD R+G+A + C
Sbjct: 431 QVCLAFAGNSDDSNAA--IFGNVQQQTLEVVYDGAGGRVGFAPNGC 474
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 112/379 (29%), Positives = 170/379 (44%), Gaps = 44/379 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF + VG PPR YL MDTGSD+ W+QC APC SC + ++ P + Y
Sbjct: 35 GEYFIRVSVGTPPRGMYLVMDTGSDILWLQC-APCVSCYHQCDEVFDPYKSST--YSTLG 91
Query: 260 CMEIQ-RNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL--TIENGSLTKPNVVFG 316
C Q N G C +C Y+++Y D S S G A D + L T G + + G
Sbjct: 92 CNSRQCLNLDVGGC-VGNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIPLG 150
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGGYM 373
C +D +G V G+LGL + +S P+Q+ S+ +CLT T++ +
Sbjct: 151 CGHDNEGY----FVGAAGLLGLGKGPLSFPNQINSEN--GGRFSYCLTGRDTDSTERSSL 204
Query: 374 FLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV-----GWALF 427
G VP G+ + P + + Y+ ++ I+ G S L + Q+ G +
Sbjct: 205 IFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVII 264
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKT 487
D+G+S T AY+ L + + +SD LVL C+ S VDV T
Sbjct: 265 DSGTSVTRLQNAAYASLREAFRAGTSD-LVLTTEFSLFDTCY--NLSDLSSVDV----PT 317
Query: 488 LTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGST--IILGDISLRG 544
+TLHF + + YLV + CL G+T I+G+I +G
Sbjct: 318 VTLHFQGGADL-----KLPASNYLVPVDNSSTFCLAFA-------GTTGPSIIGNIQQQG 365
Query: 545 QLVVYDNVNKRIGWAKSHC 563
V+YDN++ ++G+ S C
Sbjct: 366 FRVIYDNLHNQVGFVPSQC 384
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 97/368 (26%), Positives = 166/368 (45%), Gaps = 31/368 (8%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
YFT + +G P +++DTGSD +WIQC PC C + L+ P + +
Sbjct: 134 YFTSLRLGTPATDLLVELDTGSDQSWIQCK-PCPDCYEQHEALFDPSKSSTYSDITCSSR 192
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
C E+ +HK C + ++C YEI YAD S ++G LARD L L+ + P VFGC
Sbjct: 193 ECQELGSSHKHN-CSSDKKCPYEITYADDSYTVGNLARDTLTLSPTDA---VPGFVFGCG 248
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYM-FLGH 377
++ G + + DG+LGL R K SL SQ+A++ +CL ++ GY+ F G
Sbjct: 249 HNNAG----SFGEIDGLLGLGRGKASLSSQVAAR--YGAGFSYCLPSSPSATGYLSFSGA 302
Query: 378 DLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL-GARNSQVGWALFDTGSSYTYF 436
+ M+ Y+ + I + + + + + D+G++++
Sbjct: 303 AAAAPTNAQFTEMVAGQHPSFYYLNLTGITVAGRAIKVPPSVFATAAGTIIDSGTAFSCL 362
Query: 437 TKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKW 496
AY+ L +S++ S+ G A T+ + + + V+ ++ L F
Sbjct: 363 PPSAYAALRSSVR--SAMGRYKRAPSSTI---FDTCYDLTGHETVR--IPSVALVFADGA 415
Query: 497 QIVSTKFHISPEGYLVI-SKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKR 555
+ H+ P G L S CL L + + S +LG+ R V+YD N++
Sbjct: 416 TV-----HLHPSGVLYTWSNVSQTCLAFLPNPD--DTSLGVLGNTQQRTLAVIYDVDNQK 468
Query: 556 IGWAKSHC 563
+G+ + C
Sbjct: 469 VGFGANGC 476
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 101/384 (26%), Positives = 171/384 (44%), Gaps = 49/384 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YFT + VG P P + +DTGSD+ W+QC APC C + ++ PR Y
Sbjct: 140 GEYFTKIGVGTPATPALMVLDTGSDVVWLQC-APCRRCYDQSGQVFDPRRSR--SYGAVG 196
Query: 260 CME-IQRNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGC 317
C + R G C+ ++ C Y++ Y D S + G A + L G + GC
Sbjct: 197 CSAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA---GGARVARIALGC 253
Query: 318 AYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TTNAGGGGY-- 372
+D +GL + L R +S P+Q++ + +CL T++A +
Sbjct: 254 GHDNEGLFVAAAGLLG----LGRGSLSFPAQISRR--YGRSFSYCLVDRTSSANPASHSS 307
Query: 373 -MFLGHDLVPSW-GMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV------- 422
+ G V S ++ PM+ +P ME Y+ +++ I+ G + ++ G +S +
Sbjct: 308 TVTFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVS-GVADSDLRLDPSSG 366
Query: 423 -GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSIVD 480
G + D+G+S T + AYS L + + ++ GL L +L C+ R +V
Sbjct: 367 RGGVIVDSGTSVTRLARPAYSALRDAFRAAAA-GLRLSPGGFSLFDTCY--DLSGRKVVK 423
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGD 539
V T+++HF + + PE YL+ + KG C +G I+G+
Sbjct: 424 V----PTVSMHFAGGAEAA-----LPPENYLIPVDSKGTFCFAFAG----TDGGVSIIGN 470
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
I +G VV+D +R+G+ C
Sbjct: 471 IQQQGFRVVFDGDGQRVGFVPKGC 494
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 106/386 (27%), Positives = 170/386 (44%), Gaps = 53/386 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYK 256
G YFT + VG P P + +DTGSD+ W+QC APC C + + ++ PR N +
Sbjct: 138 GEYFTKIGVGTPATPALMVLDTGSDVVWLQC-APCRRCYEQSGQVFDPRRSRSYNAVGCA 196
Query: 257 DSLCMEIQRNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
LC R G C+ + C Y++ Y D S + G A + L G V
Sbjct: 197 APLC----RRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFA---GGARVARVAL 249
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL------TTNAGG 369
GC +D +GL V G+LGL R +S P+Q++ + +CL A
Sbjct: 250 GCGHDNEGL----FVAAAGLLGLGRGSLSFPTQISRR--YGRSFSYCLVDRTSSANTASR 303
Query: 370 GGYMFLGHDLVPSW-GMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV----- 422
+ G V S ++ PM+ +P ME Y+ +++ I+ G + + G NS +
Sbjct: 304 SSTVTFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVP-GVANSDLRLDPS 362
Query: 423 ---GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSI 478
G + D+G+S T + AYS L + + ++ GL L +L C+ R +
Sbjct: 363 SGRGGVIVDSGTSVTRLARPAYSALRDAFRGAAA-GLRLSPGGFSLFDTCY--DLSGRKV 419
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIIL 537
V V T+++HF + + PE YL+ + KG C +G I+
Sbjct: 420 VKV----PTVSMHFAGGAEAA-----LPPENYLIPVDSKGTFCFAFAG----TDGGVSII 466
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHC 563
G+I +G VV+D +R+ + C
Sbjct: 467 GNIQQQGFRVVFDGDGQRVAFTPKGC 492
>gi|255637574|gb|ACU19113.1| unknown [Glycine max]
Length = 290
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 80/253 (31%), Positives = 127/253 (50%), Gaps = 29/253 (11%)
Query: 161 NDGI----IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD--GLYFTYMIVGNPPRP 214
NDG+ +R S +++++ S VD FP++G P GLY+T + +G PPR
Sbjct: 34 NDGVELSELRARDSLRHRRMLQSTNYVVD----FPVKGTFDPSQVGLYYTKVKLGTPPRE 89
Query: 215 YYLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPRMGNILPYKDSLCMEIQ-RNHK 268
Y+ +DTGSD+ W+ C + C+ C + + + P G+ C++ + R+
Sbjct: 90 LYVQIDTGSDVLWVSCGS-CNGCPQTSGLQIQLNYFDP--GSSSTSSLISCLDRRCRSGV 146
Query: 269 PGYCETC----QQCDYEIEYADHSSSMGVLARDELHL-TIENGSLT---KPNVVFGCAYD 320
+C QC Y +Y D S + G D +H +I G+LT +VVFGC+
Sbjct: 147 QTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTTNSSASVVFGCSIL 206
Query: 321 QQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLV 380
Q G L + DGI G + +S+ SQL+SQGI V HCL + GGG + LG +
Sbjct: 207 QTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKGDNSGGGVLVLGEIVE 266
Query: 381 PSWGMAWVPMLDS 393
P+ + + P++ S
Sbjct: 267 PN--IVYSPLVPS 277
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 101/384 (26%), Positives = 159/384 (41%), Gaps = 50/384 (13%)
Query: 193 RGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI 252
RG G Y M +G P R + DTGSDL+W+QC PCS C + +PL+ P +
Sbjct: 137 RGISLGTGNYVVSMGLGTPARDMTVVFDTGSDLSWVQC-TPCSDCYEQKDPLFDPARSST 195
Query: 253 ---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLT 309
+P C + C ++C YE+ Y D S + G LARD L LT +
Sbjct: 196 YSAVPCASPECQGLDSRS----CSRDKKCRYEVVYGDQSQTDGALARDTLTLTQSD---V 248
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG 369
P VFGC GL + DG++GL R KVSL SQ AS+ +CL ++
Sbjct: 249 LPGFVFGCGEQDTGL----FGRADGLVGLGREKVSLSSQAASK--YGAGFSYCLPSSPSA 302
Query: 370 GGYMFLGHDLVPSWGMAWVPML----DSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
GY+ LG P+ A + DSP Y+ ++ + + +
Sbjct: 303 AGYLSLGG---PAPANARFTAMETRHDSP--SFYYVRLVGVKVAGRTVRVSPIVFSAAGT 357
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV-KQF 484
+ D+G+ T + Y+ L ++ ++ + P SI+D F
Sbjct: 358 VIDSGTVITRLPPRVYAALRSAFAR-------------SMGRYGYKRAPALSILDTCYDF 404
Query: 485 FKTLTLHFGSKWQIVSTKFHISPE--GYLVISKKGNICLGIL---DGSEVHNGSTIILGD 539
T+ S + + + + G L ++K CL DG++ I+G+
Sbjct: 405 TGHTTVRIPSVALVFAGGAAVGLDFSGVLYVAKVSQACLAFAPNGDGADAG-----IIGN 459
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
+ VVYD ++IG+ + C
Sbjct: 460 TQQKTLAVVYDVARQKIGFGANGC 483
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 120/496 (24%), Positives = 207/496 (41%), Gaps = 73/496 (14%)
Query: 86 FLFLAISIFALILYGSVFSYTLQDRYKSNNDDENKESFVFPLYHKFGIREVSQRDAEFKL 145
F+F+ S+F + +G V+++T+ R+ + + GI ++
Sbjct: 4 FVFIIASLFLSLCHGHVYTFTMHHRHSEPVRKWSHST-------ASGIPAPPEKGTVEYY 56
Query: 146 GRFVDLDGESVVASVNDGIIRPHK-SKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFT 204
D D ++R K S+I+ L S D +S F + + L++T
Sbjct: 57 AELAD----------RDRLLRGRKLSQIDDGLAFS-----DGNSTFRISSLGF---LHYT 98
Query: 205 YMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP---------LYKPRMGNI--- 252
+ +G P + + +DTGSDL W+ CD C+ CA + +Y P +
Sbjct: 99 TVQIGTPGVKFMVALDTGSDLFWVPCD--CTRCAATDSSAFASDFDLNVYNPNGSSTSKK 156
Query: 253 LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYAD-HSSSMGVLARDELHLTIENG--SLT 309
+ +SLCM H+ T C Y + Y +S+ G+L D LHLT E+ L
Sbjct: 157 VTCNNSLCM-----HRSQCLGTLSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLV 211
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG 369
+ NV+FGC Q G L+ + +G+ GL K+S+PS L+ +G + C + G
Sbjct: 212 EANVIFGCGQIQSGSFLD-VAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRD--G 268
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDT 429
G + G S+ P +P Y+ + ++ G++ +++ ALFD+
Sbjct: 269 IGRISFGDK--GSFDQDETPFNLNPSHPTYNITVTQVRVGTTLIDVEFT------ALFDS 320
Query: 430 GSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLT 489
G+S+TY Y+ L S D S C+ P + + +LT
Sbjct: 321 GTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMS-PDANTSLIPSV--SLT 377
Query: 490 LHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVY 549
+ GS + + IS + LV CL ++ +E++ I+G + G VV+
Sbjct: 378 MGGGSHFAVYDPIIIISTQSELV------YCLAVVKTAELN-----IIGQNFMTGYRVVF 426
Query: 550 DNVNKRIGWAKSHCMN 565
D +GW K C +
Sbjct: 427 DREKLVLGWKKFDCYD 442
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 77/263 (29%), Positives = 125/263 (47%), Gaps = 22/263 (8%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNI 252
LY+T + +G P + YY+ +DTGSD+ W+ C C C + + LY P+ G+
Sbjct: 32 LYYTEIGIGTPTKRYYVQVDTGSDILWVNC-ISCDRCPRKSGLGLELTLYDPKDSSTGSK 90
Query: 253 LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGSLTKP 311
+ C PG C T C+Y + Y D SS+ G D L + T+P
Sbjct: 91 VSCDQGFCAATYGGLLPG-CTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRP 149
Query: 312 ---NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
V FGC Q G L ++ DGI+G ++ S+ SQL++ G +K + HCL T
Sbjct: 150 ANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDT-IN 208
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG---WA 425
GGG +G+ + P + P++ P M Y+ + I+ G + L L + G
Sbjct: 209 GGGIFAIGNVVQPK--VKTTPLV--PNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGT 264
Query: 426 LFDTGSSYTYFTKQAYSELIASL 448
+ D+G++ TY + Y E++ ++
Sbjct: 265 IIDSGTTLTYLPEIVYKEIMLAV 287
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 104/419 (24%), Positives = 179/419 (42%), Gaps = 45/419 (10%)
Query: 154 ESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPR 213
ES VAS+ +S++ K L + + +++ + G Y + +G+P R
Sbjct: 107 ESRVASI--------QSRLAKNLAGGSNLKASKATLPSKSASTLGSGNYVVTVGLGSPKR 158
Query: 214 PYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLC-------MEIQRN 266
DTGSDLTW QC+ C + ++ P L Y + C +E
Sbjct: 159 DLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTS--LSYSNVSCDSPSCEKLESATG 216
Query: 267 HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLL 326
+ PG C + C Y I Y D S S+G AR++L LT + N FGC + +GL
Sbjct: 217 NSPG-CSS-STCLYGIRYGDGSYSIGFFAREKLSLTSTD---VFNNFQFGCGQNNRGLFG 271
Query: 327 NTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMA 386
T G+LGL+R +SL SQ A + V +CL +++ GY+ G S +
Sbjct: 272 GTA----GLLGLARNPLSLVSQTAQK--YGKVFSYCLPSSSSSTGYLSFGSGDGDSKAVK 325
Query: 387 WVPM-LDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELI 445
+ P ++S + Y +++ I+ G L + + D+G+ + YS +
Sbjct: 326 FTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFSTAGTIIDSGTVISRLPPTVYSSVQ 385
Query: 446 ASLKEVSSDGLVLDASDPTLPVCWR-AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFH 504
+E+ SD + L C+ +K+ + + +F +
Sbjct: 386 KVFRELMSDYPRVKGVS-ILDTCYDLSKYKTVKVPKIILYFSG------------GAEMD 432
Query: 505 ISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
++PEG + + K +CL S+ + I+G++ + VVYD+ R+G+A S C
Sbjct: 433 LAPEGIIYVLKVSQVCLAFAGNSD--DDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 489
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 108/408 (26%), Positives = 168/408 (41%), Gaps = 48/408 (11%)
Query: 170 SKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQ 229
SK++KKL + + S+ + G+ G Y + +G P L DTGSDLTW Q
Sbjct: 100 SKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQ 159
Query: 230 CDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQR-NHKPGYCETCQQCDYEIEYA 285
C +C P++ P + + C + G C + C Y I+Y
Sbjct: 160 CQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSC-SASNCIYGIQYG 218
Query: 286 DHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSL 345
D S S+G LA+++ LT S V FGC + QGL G+LGL R K+S
Sbjct: 219 DQSFSVGFLAKEKFTLT---NSDVFDGVYFGCGENNQGLFTGVA----GLLGLGRDKLSF 271
Query: 346 PSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLD-SPFMELYHTEIL 404
PSQ A+ + +CL ++A G++ G + S + + P+ + Y I+
Sbjct: 272 PSQTATA--YNKIFSYCLPSSASYTGHLTFGSAGI-SRSVKFTPISTITDGTSFYGLNIV 328
Query: 405 KINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT 464
I G L + + AL D+G+ T +AY+ L +S K S
Sbjct: 329 AITVGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMS----------- 377
Query: 465 LPVCWRAKFPIRSIVDVKQF------FKTLTL---HFGSKWQIVSTKFHISPEGYLVISK 515
K+P S V + FKT+T+ F V + +G + K
Sbjct: 378 -------KYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAV---VELGSKGIFYVFK 427
Query: 516 KGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+CL S+ N + I G++ + VVYD R+G+A + C
Sbjct: 428 ISQVCLAFAGNSDDSNAA--IFGNVQQQTLEVVYDGAGGRVGFAPNGC 473
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 118/456 (25%), Positives = 192/456 (42%), Gaps = 42/456 (9%)
Query: 123 FVFPLYHKFGIREVSQRDAEFKLGRF-VDL-DGESVVASVNDGIIRPHKSKINKKLVSSN 180
+VF + F + +S R+A L F VDL +S + + + P + IN L S +
Sbjct: 4 WVFMILALFSLSTLSSREAREGLRGFSVDLIHRDSPSSPFYNPSLTPSERIINAALRSMS 63
Query: 181 AVAVDSSSI----FPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236
+ S + P I G Y +G+PP +DTGS L W+QC +PC +
Sbjct: 64 RLQRVSHFLDENKLPESLLIPDKGEYLMRFYIGSPPVERLAMVDTGSSLIWLQC-SPCHN 122
Query: 237 CAKGANPLYKPRMGNILPYK--DSL-CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGV 293
C PL++P + Y DS C +Q + + C QC Y I Y D S S+G+
Sbjct: 123 CFPQETPLFEPLKSSTYKYATCDSQPCTLLQPSQRD--CGKLGQCIYGIMYGDKSFSVGI 180
Query: 294 LARDELHLTIENGSLTK--PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLAS 351
L + L G+ T PN +FGC D + T K GI GL +SL SQL +
Sbjct: 181 LGTETLSFGSTGGAQTVSFPNTIFGCGVDNN-FTIYTSNKVMGIAGLGAGPLSLVSQLGA 239
Query: 352 QGIIKNVVGHCLT--TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYH-TEILKINY 408
Q I + +CL + F ++ + G+ P++ P + Y+ + +
Sbjct: 240 Q--IGHKFSYCLLPYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTI 297
Query: 409 GSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVC 468
G ++ G + + + D+G+ TY Y+ +ASL+E L+ D LP
Sbjct: 298 GQKVVSTGQTDGNI---VIDSGTPLTYLENTFYNNFVASLQETLGVKLLQD-----LPSP 349
Query: 469 WRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI-CLGILDGS 527
+ FP R+ + + +Q + P+ L+ NI CL ++ S
Sbjct: 350 LKTCFPNRANLAIPDI----------AFQFTGASVALRPKNVLIPLTDSNILCLAVVPSS 399
Query: 528 EVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + G I+ V YD K++ +A + C
Sbjct: 400 GI---GISLFGSIAQYDFQVEYDLEGKKVSFAPTDC 432
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 112 bits (281), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 106/405 (26%), Positives = 167/405 (41%), Gaps = 42/405 (10%)
Query: 170 SKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQ 229
SK++KKL + + S+ + G+ G Y + +G P L DTGSDLTW Q
Sbjct: 72 SKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQ 131
Query: 230 CDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQR-NHKPGYCETCQQCDYEIEYA 285
C +C P++ P + + C + G C + C Y I+Y
Sbjct: 132 CQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSC-SASNCIYGIQYG 190
Query: 286 DHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSL 345
D S S+G LA+++ LT S V FGC + QGL G+LGL R K+S
Sbjct: 191 DQSFSVGFLAKEKFTLT---NSDVFDGVYFGCGENNQGLFTGVA----GLLGLGRDKLSF 243
Query: 346 PSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLD-SPFMELYHTEIL 404
PSQ A+ + +CL ++A G++ G + S + + P+ + Y I+
Sbjct: 244 PSQTATA--YNKIFSYCLPSSASYTGHLTFGSAGI-SRSVKFTPISTITDGTSFYGLNIV 300
Query: 405 KINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT 464
I G L + + AL D+G+ T +AY+ L +S K S
Sbjct: 301 AITVGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMS----------- 349
Query: 465 LPVCWRAKFPIRSIVDVKQF------FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN 518
K+P S V + FKT+T+ + + +G + K
Sbjct: 350 -------KYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQ 402
Query: 519 ICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+CL S+ N + I G++ + VVYD R+G+A + C
Sbjct: 403 VCLAFAGNSDDSNAA--IFGNVQQQTLEVVYDGAGGRVGFAPNGC 445
>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
Length = 520
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 111/392 (28%), Positives = 170/392 (43%), Gaps = 54/392 (13%)
Query: 194 GNIYPDG-----LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN------ 242
G+I+P G LY+T++ VG P + + +DTGSDL W+ CD C CA ++
Sbjct: 89 GSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVPCD--CIQCAPLSSYHGSLD 146
Query: 243 ---PLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLA 295
+YKP LP LC P Q C Y I+Y +++++S G+L
Sbjct: 147 RDLGIYKPSESTTSRHLPCSHELCSPASGCTNPK-----QPCPYNIDYFSENTTSSGLLI 201
Query: 296 RDELHLTIENG-SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI 354
D LHL G + +V+ GC Q G L + DG+LGL A +S+PS LA G+
Sbjct: 202 EDMLHLDSREGHAPVNASVIIGCGKKQSGSYLEG-IAPDGLLGLGMADISVPSFLARAGL 260
Query: 355 IKNVVGHCLTTNAGGGGYMFLGHDLVPS-WGMAWVPMLDSPFMELYHTEILKINYGSSPL 413
++N C + G +F G VP+ +VPM ++ Y + K G
Sbjct: 261 VRNSFSMCFKKD--DSGRIFFGDQGVPTQQSTPFVPMNGK--LQTYAVNVDKYCIGHKCT 316
Query: 414 NLGARNSQVGWALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAK 472
AL DTG+S+T AY + K++++ D D + C+
Sbjct: 317 EGAGFQ-----ALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSD--DYSFEYCYSTG 369
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISP-EGYLVISKKGNICLGILDGSEVHN 531
P+ + DV TLT +Q V+ + +G + CL +L E
Sbjct: 370 -PLE-MPDVPTI--TLTFAENKSFQAVNPILPFNDRQGEFAV-----FCLAVLPSPE--- 417
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G + G VV+D N ++GW +S C
Sbjct: 418 -PVGIIGQNFMVGYHVVFDRENMKLGWYRSEC 448
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 107/375 (28%), Positives = 153/375 (40%), Gaps = 42/375 (11%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL-PYKDSL 259
LY+ + VG P PY + +DTGSDL W+ CD C +C G N P NI P S
Sbjct: 129 LYYAEVTVGTPGVPYLVALDTGSDLFWLPCD--CVNCITGLNTTQGPVNFNIYSPNNSST 186
Query: 260 CMEIQRNHKPGYCETCQQCD-------YEIEY-ADHSSSMGVLARDELHLTIENGSLTKP 311
E+Q + C QC Y++ Y +D++SS G L D LHLT N +KP
Sbjct: 187 SKEVQCSSS--LCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLT-TNDVQSKP 243
Query: 312 ---NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
+ GC DQ G L++ +G+ GL VS+PS LA+ G+I N C
Sbjct: 244 VNARITLGCGKDQSGAFLSS-AAPNGLFGLGIENVSVPSILANAGLISNSFSLCF--GPA 300
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G + G P G P Y+ I +I G +L +FD
Sbjct: 301 RMGRIEFGDKGSP--GQNETPFNLGRRHPTYNVSITQIGVGGHISDLDVA------VIFD 352
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
+G+S+TY AYS + + SD C+ P ++ L
Sbjct: 353 SGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELS-PNQTTFTYP--LMNL 409
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
T+ G + I IS E K CL I ++ I+G + G +V
Sbjct: 410 TMKGGGHFVINHPIVLISTE------SKRLFCLAIARSDSIN-----IIGQNFMTGYHIV 458
Query: 549 YDNVNKRIGWAKSHC 563
+D +GW +S+C
Sbjct: 459 FDREKMVLGWKESNC 473
>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
gi|194693730|gb|ACF80949.1| unknown [Zea mays]
gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
Length = 519
Score = 112 bits (279), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 108/393 (27%), Positives = 172/393 (43%), Gaps = 53/393 (13%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN---------PLYKPRMGN 251
LY+ ++ VG P + + +DTGSDL W+ CD C CA ++ +YKP
Sbjct: 99 LYYAWVDVGTPTTSFLVALDTGSDLFWVPCD--CIQCAPLSSYRGNLDRDLGIYKPAEST 156
Query: 252 I---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTIENG- 306
LP LC P Q C Y I+Y +++++S G+L D LHL G
Sbjct: 157 TSRHLPCSHELCQPGSGCTNPK-----QPCTYNIDYFSENTTSSGLLIEDSLHLNSREGH 211
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
+ +V+ GC Q G L+ + DG+LGL A +S+PS LA G+++N C +
Sbjct: 212 APVNASVIIGCGRKQSGDYLDGIAP-DGLLGLGMADISVPSFLARAGLVRNSFSMCFKED 270
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELY-HTEILKINYGSSPLNLGARNSQVGWA 425
+ G +F G V S +PF+ LY + +N S + A
Sbjct: 271 S--SGRIFFGDQGVSS-------QQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGSSFQA 321
Query: 426 LFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
L D+G+S+T Y K++++ + + D T C+ A P+ + DV
Sbjct: 322 LVDSGTSFTSLPPDVYKAFTTEFDKQINASRVPYE--DSTWKYCYSAS-PLE-MPDV--- 374
Query: 485 FKTLTLHFGSK--WQIVSTKFHISPE-GYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
T+ L F + +Q V+ + E G L CL +L +E I+G
Sbjct: 375 -PTIILAFAANKSFQAVNPILPFNDEQGALA-----RFCLAVLPSTEPIG----IIGQNF 424
Query: 542 LRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPF 574
L G VV+D + ++GW +S C + ++P
Sbjct: 425 LVGYHVVFDRESMKLGWYRSECRDVDNSTTVPL 457
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 112 bits (279), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 107/375 (28%), Positives = 153/375 (40%), Gaps = 42/375 (11%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL-PYKDSL 259
LY+ + VG P PY + +DTGSDL W+ CD C +C G N P NI P S
Sbjct: 106 LYYAEVTVGTPGVPYLVALDTGSDLFWLPCD--CVNCITGLNTTQGPVNFNIYSPNNSST 163
Query: 260 CMEIQRNHKPGYCETCQQCD-------YEIEY-ADHSSSMGVLARDELHLTIENGSLTKP 311
E+Q + C QC Y++ Y +D++SS G L D LHLT N +KP
Sbjct: 164 SKEVQCSSS--LCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLT-TNDVQSKP 220
Query: 312 ---NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
+ GC DQ G L++ +G+ GL VS+PS LA+ G+I N C
Sbjct: 221 VNARITLGCGKDQSGAFLSS-AAPNGLFGLGIENVSVPSILANAGLISNSFSLCF--GPA 277
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G + G P G P Y+ I +I G +L +FD
Sbjct: 278 RMGRIEFGDKGSP--GQNETPFNLGRRHPTYNVSITQIGVGGHISDLDVA------VIFD 329
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
+G+S+TY AYS + + SD C+ P ++ L
Sbjct: 330 SGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELS-PNQTTFTYP--LMNL 386
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
T+ G + I IS E K CL I ++ I+G + G +V
Sbjct: 387 TMKGGGHFVINHPIVLISTE------SKRLFCLAIARSDSIN-----IIGQNFMTGYHIV 435
Query: 549 YDNVNKRIGWAKSHC 563
+D +GW +S+C
Sbjct: 436 FDREKMVLGWKESNC 450
>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 528
Score = 112 bits (279), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 100/388 (25%), Positives = 161/388 (41%), Gaps = 59/388 (15%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP----------LYKPRMG 250
LY+ + VG PP + + +DTGSDL W+ C+ ++C + LY P
Sbjct: 101 LYYANVSVGTPPSSFLVALDTGSDLFWLPCNCG-TTCIRDLEDIGVPQSVPLNLYTPNAS 159
Query: 251 NI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS 307
+ D C ++ P C Y+I Y++ + + G L +D LHL E+ +
Sbjct: 160 TTSSSIRCSDKRCFGSKKCSSPS-----SICPYQISYSNSTGTKGTLLQDVLHLATEDEN 214
Query: 308 LT--KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
LT K NV GC Q GL +G+LGL S+PS LA I N C
Sbjct: 215 LTPVKANVTLGCGQKQTGLFQRN-NSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGR 273
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMEL-----YHTEILKINYGSSPLNLGARNS 420
G G + G + ++PF+ + Y I ++ P+++
Sbjct: 274 VIGNVGRISFGD-------RGYTDQEETPFISVAPSTAYGVNISGVSVAGDPVDIRL--- 323
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
+A FDTGSS+T+ + AY L S E LV D P P + P D
Sbjct: 324 ---FAKFDTGSSFTHLREPAYGVLTKSFDE-----LVEDRRRPVDP-----ELPFEFCYD 370
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI--CLGILD--GSEVHNGSTII 536
+ T+ I +K ++ + +++GN+ CLG+L G +++ +
Sbjct: 371 LSPNATTIQFPLVEMTFIGGSKIILNNPFFTARTQEGNVMYCLGVLKSVGLKIN-----V 425
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHCM 564
+G + G +V+D +GW +S C
Sbjct: 426 IGQNFVAGYRIVFDRERMILGWKQSLCF 453
>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
Length = 520
Score = 112 bits (279), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 111/392 (28%), Positives = 170/392 (43%), Gaps = 54/392 (13%)
Query: 194 GNIYPDG-----LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN------ 242
G+I+P G LY+T++ VG P + + +DTGSDL W+ CD C CA ++
Sbjct: 89 GSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVPCD--CIQCAPLSSYHGSLD 146
Query: 243 ---PLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLA 295
+YKP LP LC P Q C Y I+Y +++++S G+L
Sbjct: 147 RDLGIYKPSESTTSRHLPCSHELCSPASGCTNPK-----QPCPYNIDYFSENTTSSGLLI 201
Query: 296 RDELHLTIENG-SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI 354
D LHL G + +V+ GC Q G L + DG+LGL A +S+PS LA G+
Sbjct: 202 EDMLHLDSREGHAPVNASVIIGCGKKQSGSYLEG-IAPDGLLGLGMADISVPSFLARAGL 260
Query: 355 IKNVVGHCLTTNAGGGGYMFLGHDLVPS-WGMAWVPMLDSPFMELYHTEILKINYGSSPL 413
++N C + G +F G VP+ +VPM ++ Y + K G
Sbjct: 261 VRNSFSMCFKKD--DSGRIFFGDQGVPTQQSTPFVPMNGK--LQTYAVNVDKYCIGHKCT 316
Query: 414 NLGARNSQVGWALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAK 472
AL DTG+S+T AY + K++++ D D + C+
Sbjct: 317 EGAGFQ-----ALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSD--DYSFEYCYSTG 369
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISP-EGYLVISKKGNICLGILDGSEVHN 531
P+ + DV TLT +Q V+ + +G + CL +L E
Sbjct: 370 -PLE-MPDVPTI--TLTFAENKSFQAVNPILPFNDRQGEFAV-----FCLAVLPSPE--- 417
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G + G VV+D N ++GW +S C
Sbjct: 418 -PVGIIGQNFMVGYHVVFDRENMKLGWYRSEC 448
>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
gi|194704920|gb|ACF86544.1| unknown [Zea mays]
gi|223949445|gb|ACN28806.1| unknown [Zea mays]
gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
Length = 515
Score = 112 bits (279), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 105/379 (27%), Positives = 162/379 (42%), Gaps = 47/379 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN---------PLYKPRMGN 251
LY+ ++ VG P + + +DTGSDL W+ CD C CA + +Y+P
Sbjct: 95 LYYAWVDVGTPATSFLVALDTGSDLFWVPCD--CIQCAPLSGYRGNLDRDLRIYRPAEST 152
Query: 252 I---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTI-ENG 306
LP LC + PG Q C Y I+Y +++++S G+L D LHL E+
Sbjct: 153 TSRHLPCSHELCQSV-----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDH 207
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
+V+ GC Q G L+ + DG+LGL A +S+PS LA G+++N C +
Sbjct: 208 VPVNASVIIGCGQKQSGDYLDG-IAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKED 266
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELY-HTEILKINYGSSPLNLGARNSQVGWA 425
+ G +F G VPS +PF+ LY + +N S + A
Sbjct: 267 S--SGRIFFGDQGVPS-------QQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKA 317
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
L D+G+S+T Y + + + D T C+ A P+ + DV
Sbjct: 318 LVDSGTSFTSLPFDVYKAFTMEFDK-QMNATRVPYEDTTWKYCYSAS-PLE-MPDVPTI- 373
Query: 486 KTLTLHFGSKWQIVSTKFHIS-PEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
TLT Q V+ + +G L CL +L +E I+ L G
Sbjct: 374 -TLTFAADKSLQAVNPILPFNDKQGAL-----AGFCLAVLPSTEPIG----IIAQNFLVG 423
Query: 545 QLVVYDNVNKRIGWAKSHC 563
VV+D + ++GW +S C
Sbjct: 424 YHVVFDRESMKLGWYRSEC 442
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 112 bits (279), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 112/417 (26%), Positives = 187/417 (44%), Gaps = 52/417 (12%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSD 224
+R +++I K+ N+ S PL I + L + + +G + + +DTGSD
Sbjct: 95 VRSMQNRIRAKVSGHNSSEQSSEIQIPLASGINLETLNYI-VTIGLGNQNMTVIIDTGSD 153
Query: 225 LTWIQCDAPCSSCAKGANPLY---KPRMGNILPYKDSLCMEIQRNHKPGYCETCQ----- 276
LTW+QCD PC SC P++ N L S C +Q G E C+
Sbjct: 154 LTWVQCD-PCMSCYSQQGPVFNPSNSSSYNSLLCNSSTCQNLQ--FTTGNTEACESNNPS 210
Query: 277 QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGIL 336
C++ + Y D S + G L + L G ++ N VFGC + +GL GI+
Sbjct: 211 SCNHTVSYGDGSFTDGELGVEHLSF----GGISVSNFVFGCGRNNKGL----FGGVSGIM 262
Query: 337 GLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHD------LVPSWGMAWVP 389
GL R+ +S+ SQ + V +CL TT++G G + +G++ L P +A+
Sbjct: 263 GLGRSNLSMISQ--TNTTFGGVFSYCLPTTDSGASGSLVIGNESSLFKNLTP---IAYTS 317
Query: 390 MLDSP-FMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIAS- 447
M+ +P Y + I+ G + + + G L D+G+ T Y+ L A
Sbjct: 318 MVSNPQLSNFYVLNLTGIDVGGVAIQDTSFGN--GGILIDSGTVITRLAPSLYNALKAEF 375
Query: 448 LKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISP 507
LK+ S G + + L C F + I +V TL++HF + + ++
Sbjct: 376 LKQFS--GYPIAPALSILDTC----FNLTGIEEVS--IPTLSMHFEN-----NVDLNVDA 422
Query: 508 EGYLVISKKGN-ICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G L + K G+ +CL + S+ ++ I+G+ R Q V+YD +IG+A+ C
Sbjct: 423 VGILYMPKDGSQVCLALASLSDEND--MAIIGNYQQRNQRVIYDAKQSKIGFAREDC 477
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 111 bits (278), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 101/375 (26%), Positives = 156/375 (41%), Gaps = 41/375 (10%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
G Y+ + +G+P R Y + +DTGS L+W+QC C A+PL+ P YK
Sbjct: 10 SGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKT--YKSL 67
Query: 259 LCMEIQRN-------HKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
C Q + + P CET C Y Y D S SMG L++D L L S T
Sbjct: 68 SCTSSQCSSLVDATLNNP-LCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAP---SQTL 123
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
P V+GC D +GL + GILGL R K+S+ Q++S+ +CL T GGG
Sbjct: 124 PGFVYGCGQDSEGL----FGRAAGILGLGRNKLSMLGQVSSK--FGYAFSYCLPTR-GGG 176
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVGWALFDT 429
G++ +G + + PM P LY + I G L + A +V + D+
Sbjct: 177 GFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRV-PTIIDS 235
Query: 430 GSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFP-IRSIVDVKQFFKTL 488
G+ T Y+ + ++ S L C++ ++S+ +V+ F+
Sbjct: 236 GTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDMQSVPEVRLIFQG- 294
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
++ P L+ +G CL N I+G+ + V
Sbjct: 295 -----------GADLNLRPVNVLLQVDEGLTCLAF-----AGNNGVAIIGNHQQQTFKVA 338
Query: 549 YDNVNKRIGWAKSHC 563
+D RIG+A C
Sbjct: 339 HDISTARIGFATGGC 353
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 111 bits (278), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 101/388 (26%), Positives = 169/388 (43%), Gaps = 44/388 (11%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
Y+ + +G P L MDTGSD++WIQC PC C P + PR + LP S
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQC-VPCKDCVPALRPPFNPRHSSSFFKLPCASS 196
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP----NVV 314
C + + KP + + C + I+Y D S S G+LA + + N +P N+
Sbjct: 197 TCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNIT 256
Query: 315 FGCA-YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TTNAGGG 370
GCA D++GL G+LG+ R +S PSQL+S+ K HC +
Sbjct: 257 LGCADIDREGL----PTGASGLLGMDRRPISFPSQLSSRYARK--FSHCFPDKIAHLNSS 310
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPF-----MELYHTEILKINYGSSPLNLGARNSQV--- 422
G +F G + S + + P++ +P ++ Y+ ++ I+ S L L +N +
Sbjct: 311 GLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKV 370
Query: 423 ---GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV 479
G + D+G+++TY K A+ + +S +D + P C+ ++
Sbjct: 371 TGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTP-CYNITSGTAALE 429
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVIS---KKGNICLGI-LDGSEVHNGSTI 535
++TLHF +V K I + +S ++ +CL + G N
Sbjct: 430 ST--ILPSITLHFRGGLDVVLPKNSI----LIPVSSSEEQTTLCLAFQMSGDIPFN---- 479
Query: 536 ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G+ + V YD R+G A + C
Sbjct: 480 IIGNYQQQNLWVEYDLEKLRLGIAPAQC 507
>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
Length = 485
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 106/390 (27%), Positives = 166/390 (42%), Gaps = 47/390 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN---------PLYKPRMGN 251
LY+ ++ VG P + + +DTGSDL W+ CD C CA + +Y+P
Sbjct: 65 LYYAWVDVGTPATSFLVALDTGSDLFWVPCD--CIQCAPLSGYRGNLDRDLRIYRPAEST 122
Query: 252 I---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTI-ENG 306
LP LC + PG Q C Y I+Y +++++S G+L D LHL E+
Sbjct: 123 TSRHLPCSHELCQSV-----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDH 177
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
+V+ GC Q G L+ + DG+LGL A +S+PS LA G+++N C +
Sbjct: 178 VPVNASVIIGCGQKQSGDYLDG-IAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKED 236
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELY-HTEILKINYGSSPLNLGARNSQVGWA 425
+ G +F G VPS +PF+ LY + +N S + A
Sbjct: 237 S--SGRIFFGDQGVPS-------QQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKA 287
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
L D+G+S+T Y + + + D T C+ A P+ + DV
Sbjct: 288 LVDSGTSFTSLPLDVYKAFTMEFDK-QMNATRVPYEDTTWKYCYSAS-PLE-MPDVPTI- 343
Query: 486 KTLTLHFGSKWQIVSTKFHIS-PEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
TLT Q V+ + +G L CL +L +E I+ L G
Sbjct: 344 -TLTFAADKSLQAVNPILPFNDKQGAL-----AGFCLAVLPSTEPIG----IIAQNFLVG 393
Query: 545 QLVVYDNVNKRIGWAKSHCMNPGRFKSLPF 574
VV+D + ++GW +S C + ++P
Sbjct: 394 YHVVFDRESMKLGWYRSECHDVEDSTTVPL 423
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 97/377 (25%), Positives = 160/377 (42%), Gaps = 35/377 (9%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP---RMGNILPYKDS 258
++T + +G P R + + +DTGS +T+I C CS C K + P L D
Sbjct: 13 FYTTLKLGTPERTFSVIIDTGSTITYIPC-KDCSHCGKHTAEWFDPDKSTTAKKLACGDP 71
Query: 259 LCMEIQRNHKPGYCETCQ--QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
LC N C TC +C Y YA+ SSS G + D + + +VFG
Sbjct: 72 LC-----NCGTPSC-TCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPV---RLVFG 122
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C + G + + DGI+G+ + SQL + +I++V C G + LG
Sbjct: 123 CENGETGEIYRQMA--DGIMGMGNNHNAFQSQLVQRKVIEDVFSLCF--GYPKDGILLLG 178
Query: 377 HDLVPSWG-MAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA-LFDTGSSYT 434
+P + P+L + Y+ ++ I L A G+ + D+G+++T
Sbjct: 179 DVTLPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRGYGTVLDSGTTFT 238
Query: 435 YFTKQAYSELIASLKE-VSSDGL-VLDASDPTL-PVCWRAKFPIRSIVDVKQFFKTLTLH 491
Y A+ + ++ + V GL +DP +CW+ D+ ++F
Sbjct: 239 YLPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKGA--PDQFKDLDKYFPPAEFV 296
Query: 492 FGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
FG K + P YL +SK CLGI D + S ++G +S+R +V YD
Sbjct: 297 FGG-----GAKLTLPPLRYLFLSKPAEYCLGIFD----NGNSGALVGGVSVRDVVVTYDR 347
Query: 552 VNKRIGWAKSHCMNPGR 568
N ++G+ C + R
Sbjct: 348 RNSKVGFTTMACADVAR 364
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 107/403 (26%), Positives = 175/403 (43%), Gaps = 57/403 (14%)
Query: 184 VDSSSIFPLRGNIYPDGL----YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK 239
+ SS+ P+ Y DG+ Y ++ +G PP+P L +DTGSDL W QC PC+ C
Sbjct: 69 LSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSDLVWTQCQ-PCAVCFN 127
Query: 240 GANPLYKPRMGNI--LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARD 297
+ P Y + LP DS ++ + +T Q C + Y D S+++G L +
Sbjct: 128 QSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAFSYSYGDKSATIGFLDVE 187
Query: 298 ELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKN 357
+ + P VVFGC + G+ + GI G R +SLPSQL + N
Sbjct: 188 TVSFV---AGASVPGVVFGCGLNNTGIFRS---NETGIAGFGRGPLSLPSQLK----VGN 237
Query: 358 VVGHCLTTNAGG--GGYMF-LGHDLVPS--WGMAWVPMLDSPFM-ELYHTEILKINYGSS 411
HC T +G +F L DL + + P++ +P Y+ + I GS+
Sbjct: 238 -FSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGST 296
Query: 412 ----PLNLGARNSQVGWALFDTGSSYTYFTKQAY----SELIASLKEVSSDGLVLDASDP 463
P + A + G + D+G+++T + Y E A +K L + S+
Sbjct: 297 RLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVK------LPVVPSNE 350
Query: 464 TLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKG---NIC 520
T P+ + P+ V + L LHF H+ E Y+ +K G +IC
Sbjct: 351 TGPLLCFSAPPLGKAPHVPK----LVLHFE------GATMHLPRENYVFEAKDGGNCSIC 400
Query: 521 LGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
L I++ G I+G+ + V+YD N ++ + ++ C
Sbjct: 401 LAIIE------GEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 165/384 (42%), Gaps = 47/384 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYK 256
G Y T + +G P + + + DTGSDL WIQC PC +C +P++ P + +
Sbjct: 38 GDYVTTISLGTPAKVFSVIADTGSDLIWIQCK-PCQACFNQKDPIFDPEGSSSYTTMSCG 96
Query: 257 DSLCMEIQRNHKPGYCETCQ-QCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVV 314
D+LC + R ++C CDY Y D S + G L+ + + LT G L N+
Sbjct: 97 DTLCDSLPR-------KSCSPDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIA 149
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGG 371
FGC + +G + G++GL R +S SQL + + +CL
Sbjct: 150 FGCGHLNRG----SFNDASGLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDAPSKTS 203
Query: 372 YMFLG-----HDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV--- 422
MF G H A+ PM+ +P ME Y+ ++ I+ L + A + +
Sbjct: 204 PMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPD 263
Query: 423 --GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
G +FD+G++ T Y ++ +L+ S + D S L +C+ S
Sbjct: 264 GSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKI-DGSSAGLDLCYDVS---GSKAS 319
Query: 481 VKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
K + HF G+ +Q+ + I+ I +CL ++ N I G+
Sbjct: 320 YKMKIPAMVFHFEGADYQLPVENYFIAANDAGTI-----VCLAMVS----SNMDIGIYGN 370
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
+ + V+YD + +IGWA S C
Sbjct: 371 MMQQNFRVMYDIGSSKIGWAPSQC 394
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 101/377 (26%), Positives = 171/377 (45%), Gaps = 45/377 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y + +G PP+ + +DTGSDL W+QC APC+ C + +PL+ P +
Sbjct: 6 GEYVLQISLGTPPQQFSAIVDTGSDLCWVQC-APCARCFEQPDPLFIPLASSSYSNASCT 64
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
DSLC + R C C Y Y D S++ G A + + L NGS T + FG
Sbjct: 65 DSLCDALPRPT----CSMRNTCTYSYSYGDGSNTRGDFAFETVTL---NGS-TLARIGFG 116
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY--MF 374
C ++Q+G T DG++GL + +SLPSQL S ++ +CL + G + +
Sbjct: 117 CGHNQEG----TFAGADGLIGLGQGPLSLPSQLNSS--FTHIFSYCLVDQSTTGTFSPIT 170
Query: 375 LGHDLVPSWGMAWVPML---DSPFMELYHTEILKINYG-----SSPLNLGARNSQVGWAL 426
G+ S ++ P+L D+P Y+ + I+ G + P + VG +
Sbjct: 171 FGNAAENSRA-SFTPLLQNEDNP--SYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVI 227
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
D+G++ TY+ A+ ++A L+ S +DPT P + I S+
Sbjct: 228 LDSGTTITYWRLAAFIPILAELRRQIS----YPEADPT-PYGLNLCYDISSVSASSLTLP 282
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
++T+H + V + +S LV + +C + + I+G++ + L
Sbjct: 283 SMTVHLTN----VDFEIPVSNLWVLVDNFGETVCTAMSTSDQFS-----IIGNVQQQNNL 333
Query: 547 VVYDNVNKRIGWAKSHC 563
+V D N R+G+ + C
Sbjct: 334 IVTDVANSRVGFLATDC 350
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 108/413 (26%), Positives = 177/413 (42%), Gaps = 53/413 (12%)
Query: 170 SKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQ 229
S++ + AVA P+ +G + + +G P Y +DTGSDL W Q
Sbjct: 71 SRLVARATGVKAVAGGGDLQVPVHAG---NGEFLMDVAIGTPALSYAAIVDTGSDLVWTQ 127
Query: 230 CDAPCSSCAKGANPLYKPRMGN---ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYAD 286
C PC C K + P++ P + +P +LC ++ + C + +C Y Y D
Sbjct: 128 CK-PCVDCFKQSTPVFDPSSSSTYATVPCSSALCSDLPTS----TCTSASKCGYTYTYGD 182
Query: 287 HSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLP 346
SS+ GVLA + L E L P V FGC +G + + G++GL R +SL
Sbjct: 183 ASSTQGVLASETFTLGKEKKKL--PGVAFGCGDTNEG---DGFTQGAGLVGLGRGPLSLV 237
Query: 347 SQLASQGIIKNVVGHCLTTNAGGGG---YMFLGHDLVPSWGMAWVPMLDSPFME------ 397
SQL G+ K +CLT+ G G + G S A P+ +P ++
Sbjct: 238 SQL---GLDK--FSYCLTSLDDGDGKSPLLLGGSAAAISESAATAPVQTTPLVKNPSQPS 292
Query: 398 LYHTEILKINYGSSPLNLGA-----RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVS 452
Y+ + + GS+ + L A ++ G + D+G+S TY Q Y L + V+
Sbjct: 293 FYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIVDSGTSITYLELQGYRALKKAF--VA 350
Query: 453 SDGL-VLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYL 511
L +D S+ L +C++ P + + +V+ L LHF + + E Y+
Sbjct: 351 QMALPTVDGSEIGLDLCFQG--PAKGVDEVQ--VPKLVLHFDGGADL-----DLPAENYM 401
Query: 512 VI-SKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
V+ S G +CL + + I+G+ + VYD + +A C
Sbjct: 402 VLDSASGALCLTVAPSRGLS-----IIGNFQQQNFQFVYDVAGDTLSFAPVQC 449
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 109/382 (28%), Positives = 171/382 (44%), Gaps = 47/382 (12%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS-SCAKGANPLYKPRMGNILPYKD 257
G Y+ M +G+P + Y + +DTGS +W+QC PC+ C +P++ P YK
Sbjct: 100 SGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQ-PCTIYCHIQEDPVFNPSASKT--YKT 156
Query: 258 SLCMEIQRN-------HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
C Q + ++P + C Y+ Y D S S+G L++D L LT S T
Sbjct: 157 VPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLT---PSQTL 213
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL-----TT 365
+ V+GC D QGL +TDGI+GL+ ++S+ SQL+ G N +CL T
Sbjct: 214 SSFVYGCGQDNQGL----FGRTDGIIGLANNELSMLSQLS--GKYGNAFSYCLPTSFSTP 267
Query: 366 NAGGGGYMFLG-HDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVG 423
N+ G++ +G L PS + P+L +P LY ++ I PL + A + +V
Sbjct: 268 NSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVP 327
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV--DV 481
+ D+G+ T Y+ L + + S L C++ S V D+
Sbjct: 328 -TIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDI 386
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
+ FK G+ Q+ K H S LV + G CL + S + I+G+
Sbjct: 387 RIIFKG-----GADLQL---KGHNS----LVELETGITCLAMAGSSSIA-----IIGNYQ 429
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+ V YD N R+G+A C
Sbjct: 430 QQTVKVAYDVGNSRVGFAPGGC 451
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 165/384 (42%), Gaps = 47/384 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYK 256
G Y T + +G P + + + DTGSDL WIQC PC +C +P++ P + +
Sbjct: 38 GDYVTTISLGTPAKVFSVIADTGSDLIWIQCK-PCQACFNQKDPIFDPEGSSSYTTMSCG 96
Query: 257 DSLCMEIQRNHKPGYCETCQ-QCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVV 314
D+LC + R ++C CDY Y D S + G L+ + + LT G L N+
Sbjct: 97 DTLCDSLPR-------KSCSPNCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIA 149
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGG 371
FGC + +G + G++GL R +S SQL + + +CL
Sbjct: 150 FGCGHLNRG----SFNDASGLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDAPSKTS 203
Query: 372 YMFLG-----HDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV--- 422
MF G H A+ PM+ +P ME Y+ ++ I+ L + A + +
Sbjct: 204 PMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPD 263
Query: 423 --GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
G +FD+G++ T Y ++ +L+ S + D S L +C+ S
Sbjct: 264 GSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEI-DGSSAGLDLCYDVS---GSKAS 319
Query: 481 VKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
K+ + HF G+ Q+ + I+ I +CL ++ N I G+
Sbjct: 320 YKKKIPAMVFHFEGADHQLPVENYFIAANDAGTI-----VCLAMVS----SNMDIGIYGN 370
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
+ + V+YD + +IGWA S C
Sbjct: 371 MMQQNFRVMYDIGSSKIGWAPSQC 394
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 109/380 (28%), Positives = 170/380 (44%), Gaps = 46/380 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF + VG PPR YL MDTGSD+ W+QC APC +C ++ ++ P + Y
Sbjct: 56 GEYFIRISVGTPPRRMYLVMDTGSDILWLQC-APCVNCYHQSDAIFDPYKSST--YSTLG 112
Query: 260 CMEIQ-RNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL--TIENGSLTKPNVVFG 316
C Q N G C+ +C Y+++Y D S + G D++ L T G + + G
Sbjct: 113 CSTRQCLNLDIGTCQA-NKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLG 171
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGGYM 373
C +D +G V G+LGL + +S P+Q+ Q +CLT T++ G +
Sbjct: 172 CGHDNEGY----FVGAAGLLGLGKGPLSFPNQVDPQN--GGRFSYCLTDRETDSTEGSSL 225
Query: 374 FLGHDLVPSWGMAWVPMLDSPFM--ELYHTEILKINYGSSPLNLGARNSQV-----GWAL 426
G VP G + P DS Y+ ++ I+ G + L + Q+ G +
Sbjct: 226 VFGEAAVPPAGARFTPQ-DSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVI 284
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
D+G+S T AY+ L + + +SD L A C+ + VDV
Sbjct: 285 IDSGTSVTRLQNAAYASLRDAFRAGTSD-LAPTAGFSLFDTCY--DLSGLASVDV----P 337
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGST--IILGDISLR 543
T+TLHF T + YL+ + CL G+T I+G+I +
Sbjct: 338 TVTLHFQG-----GTDLKLPASNYLIPVDNSNTFCLAFA-------GTTGPSIIGNIQQQ 385
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
G V+YDN++ ++G+ S C
Sbjct: 386 GFRVIYDNLHNQVGFVPSQC 405
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 109/382 (28%), Positives = 171/382 (44%), Gaps = 47/382 (12%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS-SCAKGANPLYKPRMGNILPYKD 257
G Y+ M +G+P + Y + +DTGS +W+QC PC+ C +P++ P YK
Sbjct: 100 SGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQ-PCTIYCHIQEDPVFNPSASKT--YKT 156
Query: 258 SLCMEIQRN-------HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
C Q + ++P + C Y+ Y D S S+G L++D L LT S T
Sbjct: 157 VPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLT---PSQTL 213
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL-----TT 365
+ V+GC D QGL +TDGI+GL+ ++S+ SQL+ G N +CL T
Sbjct: 214 SSFVYGCGQDNQGL----FGRTDGIIGLANNELSMLSQLS--GKYGNAFSYCLPTSFSTP 267
Query: 366 NAGGGGYMFLG-HDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVG 423
N+ G++ +G L PS + P+L +P LY ++ I PL + A + +V
Sbjct: 268 NSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVP 327
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV--DV 481
+ D+G+ T Y+ L + + S L C++ S V D+
Sbjct: 328 -TIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDI 386
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
+ FK G+ Q+ K H S LV + G CL + S + I+G+
Sbjct: 387 RIIFKG-----GADLQL---KGHNS----LVELETGITCLAMAGSSSI-----AIIGNYQ 429
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+ V YD N R+G+A C
Sbjct: 430 QQTVKVAYDVGNSRVGFAPGGC 451
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 111/424 (26%), Positives = 183/424 (43%), Gaps = 60/424 (14%)
Query: 167 PHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLT 226
PH +++ ++ A + S + + G + G YF + VG+PP + +DTGSDL
Sbjct: 59 PHTAQLESLHSATAAADLLRSPV--MSGVPFDSGEYFAVIGVGDPPTHALVVIDTGSDLI 116
Query: 227 WIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIE 283
W+QC PC C + PLY PR +P C + R PG C Y +
Sbjct: 117 WLQC-LPCRRCYRQVTPLYDPRNSKTHRRIPCASPQCRGVLR--YPGCDARTGGCVYMVV 173
Query: 284 YADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKV 343
Y D S+S G LA D L L + NV GC +D +GLL + G+LG R ++
Sbjct: 174 YGDGSASSGDLATDTLVLPDDT---RVHNVTLGCGHDNEGLLASAA----GLLGAGRGQL 226
Query: 344 SLPSQLASQ--GIIKNVVGHCLTTNAGGGGYMFLGHD-LVPSWGMAWVPMLDSPFM-ELY 399
S P+QLA + +G ++ Y+ G +PS A+ P+ +P LY
Sbjct: 227 SFPTQLAPAYGHVFSYCLGDRMSRARNSSSYLVFGRTPELPS--TAFTPLRTNPRRPSLY 284
Query: 400 HTEILKINYGSSPLNLGARNSQV--------GWALFDTGSSYTYFTKQAYSELIASLKEV 451
+ +++ + G + G N+ + G + D+G++ + FT+ AY+ +
Sbjct: 285 YVDMVGFSVGGERV-AGFSNASLALNPATGRGGVVVDSGTAISRFTRDAYAAV------- 336
Query: 452 SSDGLVLDASDPTLPVCWRAKFPI-RSIVDVKQ-------FFKTLTLHFGSKWQIVSTKF 503
D V A+ + R KF + + DV ++ LHF + +
Sbjct: 337 -RDAFVSHAAAAGMRR-LRNKFSVFDTCYDVHGNGPGTGVRVPSIVLHFAAAADMA---- 390
Query: 504 HISPEGYLVISKKGN----ICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWA 559
+ YL+ G+ CLG+ + N +LG++ +G VV+D RIG+
Sbjct: 391 -LPQANYLIPVVGGDRRTYFCLGLQAADDGLN----VLGNVQQQGFGVVFDVERGRIGFT 445
Query: 560 KSHC 563
+ C
Sbjct: 446 PNGC 449
>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
Length = 469
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 105/379 (27%), Positives = 162/379 (42%), Gaps = 47/379 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN---------PLYKPRMGN 251
LY+ ++ VG P + + +DTGSDL W+ CD C CA + +Y+P
Sbjct: 95 LYYAWVDVGTPATSFLVALDTGSDLFWVPCD--CIQCAPLSGYRGNLDRDLRIYRPAEST 152
Query: 252 I---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTI-ENG 306
LP LC + PG Q C Y I+Y +++++S G+L D LHL E+
Sbjct: 153 TSRHLPCSHELCQSV-----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDH 207
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
+V+ GC Q G L+ + DG+LGL A +S+PS LA G+++N C +
Sbjct: 208 VPVNASVIIGCGQKQSGDYLDG-IAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKED 266
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELY-HTEILKINYGSSPLNLGARNSQVGWA 425
+ G +F G VPS +PF+ LY + +N S + A
Sbjct: 267 S--SGRIFFGDQGVPS-------QQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKA 317
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
L D+G+S+T Y + + + D T C+ A P+ + DV
Sbjct: 318 LVDSGTSFTSLPFDVYKAFTMEFDK-QMNATRVPYEDTTWKYCYSAS-PLE-MPDVPTI- 373
Query: 486 KTLTLHFGSKWQIVSTKFHIS-PEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
TLT Q V+ + +G L CL +L +E I+ L G
Sbjct: 374 -TLTFAADKSLQAVNPILPFNDKQGAL-----AGFCLAVLPSTEPIG----IIAQNFLVG 423
Query: 545 QLVVYDNVNKRIGWAKSHC 563
VV+D + ++GW +S C
Sbjct: 424 YHVVFDRESMKLGWYRSEC 442
>gi|302774304|ref|XP_002970569.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
gi|300162085|gb|EFJ28699.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
Length = 490
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 104/383 (27%), Positives = 172/383 (44%), Gaps = 34/383 (8%)
Query: 187 SSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYK 246
S+ L ++ G Y + + +G PP + L +D S ++ S +P +
Sbjct: 20 SARMDLHDDLLTKGYYTSRVKIGTPPHEFSLIVDRSS---FVSPKTMFCSFFFLQDPRFS 76
Query: 247 PRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENG 306
P + + YK +E G+C+ ++ Y+ +YA+ S+S GVL +D + + +
Sbjct: 77 PALSS--SYKP---LECGNECSTGFCDGSRK--YQRQYAEKSTSSGVLGKDVISFS-NSS 128
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
L +VFGC + G L + DGI+GL R +S+ QL + +++V C
Sbjct: 129 DLGGQRLVFGCETAETGDLYDQ--TADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGM 186
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGAR--NSQVG 423
GGG M LG P M + P Y+ +LK I G SPL L + + G
Sbjct: 187 DEGGGAMILGG-FQPPKDMVFTS--SDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGKYG 243
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKE-VSSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
L D+G++Y YF A+ +++KE V S V + +C+ ++ ++
Sbjct: 244 TVL-DSGTTYAYFPGAAFQAFKSAVKEQVGSLKEVPGPDEKFKDICYAGA--GTNVSNLS 300
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILGDI 540
QFF ++ FG + +SPE YL K G CLG+ + + T +LG I
Sbjct: 301 QFFPSVDFVFGDGQSVT-----LSPENYLFRHTKISGAYCLGVFENGD----PTTLLGGI 351
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+R LV Y+ IG+ K+ C
Sbjct: 352 IVRNMLVTYNRGKASIGFLKTKC 374
>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 522
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 124/467 (26%), Positives = 194/467 (41%), Gaps = 60/467 (12%)
Query: 119 NKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVN-----DGIIRPHKSKIN 173
N F F ++H+F EV Q GRFV + N D +IR + +
Sbjct: 25 NGRIFTFEMHHRFS-DEVKQWSD--STGRFVKFPPKGSFEYFNALVLRDWLIRGRRLSDS 81
Query: 174 KKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP 233
+ S +S+S G L++T + +G P + + +DTGSDL W+ CD
Sbjct: 82 ESESSLTFSDGNSTSRISSLG-----FLHYTTVKLGTPGMRFMVALDTGSDLFWVPCD-- 134
Query: 234 CSSCA--KGAN-------PLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYE 281
C CA +GA +Y P++ + +SLC QRN G T C Y
Sbjct: 135 CGKCAPTEGATYASEFELSIYNPKISTTNKKVTCNNSLCA--QRNQCLG---TFSTCPYM 189
Query: 282 IEYAD-HSSSMGVLARDELHLTIE--NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGL 338
+ Y +S+ G+L D +HLT E N + V FGC Q G L+ + +G+ GL
Sbjct: 190 VSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLD-IAAPNGLFGL 248
Query: 339 SRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMEL 398
K+S+PS LA +G++ + C + G G + G S P +P
Sbjct: 249 GMEKISVPSVLAREGLVADSFSMCFGHD--GVGRISFGDK--GSSDQEETPFNLNPSHPN 304
Query: 399 YHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVL 458
Y+ + ++ G++ ++ ALFDTG+S+TY Y+ + S + D
Sbjct: 305 YNITVTRVRVGTTLID------DEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHS 358
Query: 459 DASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN 518
S C+ + + +LT+ S + I IS EG LV
Sbjct: 359 PDSRIPFEYCYDMSNDANASLIPSL---SLTMKGNSHFTINDPIIVISTEGELV------ 409
Query: 519 ICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
CL I+ SE++ I+G + G VV+D + W K C +
Sbjct: 410 YCLAIVKSSELN-----IIGQNYMTGYRVVFDREKLVLAWKKFDCYD 451
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 106/379 (27%), Positives = 157/379 (41%), Gaps = 54/379 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYK 256
G YF + +G+PP YL +D+GSD+ W+QC PC C A+PL+ P + +P
Sbjct: 125 GEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCK-PCLECYAQADPLFDPATSATFSAVPCG 183
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
++C R + C CDYE+ Y D S + G LA + L L G V G
Sbjct: 184 SAVC----RTLRTSGCGDSGGCDYEVSYGDGSYTKGALALETLTL----GGTAVEGVAIG 235
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C + +GL V G+LGL +SL QL +CL + G G + LG
Sbjct: 236 CGHRNRGL----FVGAAGLLGLGWGPMSLVGQLGGA--AGGAFSYCLASR--GAGSLVLG 287
Query: 377 HDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQV-----GWALFDTG 430
G WVP++ +P Y+ + I G L L Q+ G + DTG
Sbjct: 288 RSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTG 347
Query: 431 SSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT-LPVCWRAKFPIRSIVDVKQFF---- 485
++ T ++AY+ L + V++ G + A + L C+ V F+
Sbjct: 348 TAVTRLPQEAYAALRDAF--VAAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGA 405
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI-CLGILDGSEVHNGSTIILGDISLRG 544
TLTL P L++ G I CL S + ILG+I G
Sbjct: 406 ATLTL----------------PARNLLLEVDGGIYCLAFAPSSSGPS----ILGNIQQEG 445
Query: 545 QLVVYDNVNKRIGWAKSHC 563
+ D+ N IG+ + C
Sbjct: 446 IQITVDSANGYIGFGPTTC 464
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 168/384 (43%), Gaps = 48/384 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YFT + VG P P + +DTGSD+ W+QC APC C + P++ PR + D
Sbjct: 138 GEYFTKIGVGTPSTPALMVLDTGSDVVWLQC-APCRRCYDQSGPVFDPRRSSSYGAVD-C 195
Query: 260 CMEIQRNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
+ R G C+ ++ C Y++ Y D S + G A + L G V GC
Sbjct: 196 AAPLCRRLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFA---GGARVARVALGCG 252
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--TNAGGGGYMFLG 376
+D +GL + L R +S P+Q++ + +CL T++ G
Sbjct: 253 HDNEGLFVAAAGLLG----LGRGSLSFPTQISRR--YGKSFSYCLVDRTSSSSSGAASRS 306
Query: 377 HDLVPSWG------MAWVPMLDSPFME-LYHTEILKINYGS--------SPLNLGARNSQ 421
++G ++ PM+ +P ME Y+ +++ I+ G S L L +
Sbjct: 307 RSSTVTFGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGR 366
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSIVD 480
G + D+G+S T + +YS L + + ++ GL L +L C+ R +V
Sbjct: 367 -GGVIVDSGTSVTRLARPSYSALRDAFRAAAA-GLRLSPGGFSLFDTCY--DLGGRKVVK 422
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGD 539
V T+++HF + + PE YL+ + +G C +G I+G+
Sbjct: 423 V----PTVSMHFAGGAEAA-----LPPENYLIPVDSRGTFCFAFAG----TDGGVSIIGN 469
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
I +G VV+D +R+G+A C
Sbjct: 470 IQQQGFRVVFDGDGQRVGFAPKGC 493
>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 523
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 176/384 (45%), Gaps = 49/384 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLC 260
L++T++ +G P P+ + +D GSDL W+ CD C CA + Y ++ Y +L
Sbjct: 102 LHYTWIDLGTPSVPFLVALDVGSDLLWVPCD--CIQCAPLSANYYSVLDRDLSEYNPALS 159
Query: 261 MEIQR----NHKPGYCETCQQ----CDYEIE-YADHSSSMGVLARDELHLT--IENG--S 307
+ + + TC+ C Y+ + Y+D++S+ G + D+L LT ++G S
Sbjct: 160 STSKHLFCGHQLCAWSTTCKSANDPCTYKRDYYSDNTSTSGFMIEDKLQLTSFSKHGTHS 219
Query: 308 LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
L + +VVFGC Q G L+ DG++GL +S+P+ LA +G+++N C N
Sbjct: 220 LLQASVVFGCGRKQSGSYLDG-AAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNN- 277
Query: 368 GGGGYMFLGHD-LVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWAL 426
G G + G D ++P+ Y + GSS L AL
Sbjct: 278 -GSGRILFGDDGPATQQTTQFLPLFGE--FAAYFIGVESFCVGSSCLQRSGFQ-----AL 329
Query: 427 FDTGSSYTYFTKQAYSELIASLK---EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
D+GSS+TY + Y +++ +V++ +VL LP W + I ++V
Sbjct: 330 VDSGSSFTYLPAEVYKKIVFEFDKQVKVNATRIVLRE----LP--WNYCYNISTLVSFN- 382
Query: 484 FFKTLTLHFGSKWQIVSTKFHISP--EGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
++ L F + ++ P +GY V CL + + E + ++G
Sbjct: 383 -IPSMQLVFPLNQIFIHDPVYVLPANQGYKV------FCLTLEETDEDYG----VIGQNL 431
Query: 542 LRGQLVVYDNVNKRIGWAKSHCMN 565
+ G +V+D N ++GW+KS C++
Sbjct: 432 MVGYRMVFDRENLKLGWSKSKCLD 455
>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 520
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 105/383 (27%), Positives = 174/383 (45%), Gaps = 54/383 (14%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWI--QCDA--PCSSCAKGA--NPLYKPRMGNI-- 252
L++ + VG P + + + +DTGSDL W+ QCD P ++ A G+ Y P M +
Sbjct: 108 LHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSFQATFYIPGMSSTSK 167
Query: 253 -LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADH-SSSMGVLARDELHLTIENG--SL 308
+P + C ++Q+ C T QC Y++ Y +SS G L D L+L+ EN +
Sbjct: 168 AVPCNSNFC-DLQKE-----CSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQI 221
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
K ++ GC Q G L+ +G+ GL +VS+PS LA +G+ N C +
Sbjct: 222 LKAQIMLGCGQTQTGSFLDAAAP-NGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRD-- 278
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G G + G S P+ + Y I I G+ P ++ +FD
Sbjct: 279 GIGRISFGDQ--ESSDQEETPLDINRQHPTYAITISGITVGNKPTDMDF------ITIFD 330
Query: 429 TGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCW-----RAKFPIRSIVDVK 482
TG+S+TY AY+ + S +V ++ D+ P C+ A+FPI I+
Sbjct: 331 TGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIP-FEYCYDLSSSEARFPIPDII--- 386
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISL 542
+T+T GS + ++ IS + + + CL I+ +++ I+G +
Sbjct: 387 --LRTVT---GSMFPVIDPGQVISIQEHEYV-----YCLAIVKSMKLN-----IIGQNFM 431
Query: 543 RGQLVVYDNVNKRIGWAKSHCMN 565
G VV+D K +GW K +C +
Sbjct: 432 TGLRVVFDRERKILGWKKFNCYD 454
>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
Length = 829
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 99/377 (26%), Positives = 161/377 (42%), Gaps = 46/377 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS-- 258
L+F + VG PP + + +DTGSDL W+ C+ C+ C +G + NI K S
Sbjct: 101 LHFANVSVGTPPLSFLVALDTGSDLFWLPCN--CTKCVRGVESNGEKIAFNIYDLKGSST 158
Query: 259 ----LC----MEIQRNHKPGYCETCQQ-CDYEIEY-ADHSSSMGVLARDELHLTIENGSL 308
LC E+QR C + C YE+ Y ++ +S+ G L D LHL ++
Sbjct: 159 SQTVLCNSNLCELQRQ-----CPSSDSICPYEVNYLSNGTSTTGFLVEDVLHLITDDDET 213
Query: 309 TKPN--VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
+ + FGC Q G L+ +G+ GL S+PS LA +G+ N C ++
Sbjct: 214 KDADTRITFGCGQVQTGAFLDG-AAPNGLFGLGMGNESVPSILAKEGLTSNSFSMCFGSD 272
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWAL 426
G G + G + S P Y+ + +I G + +L A+
Sbjct: 273 --GLGRITFGDN--SSLVQGKTPFNLRALHPTYNITVTQIIVGGNAADLEFH------AI 322
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
FD+G+S+T+ AY ++ S +S LP + V++
Sbjct: 323 FDSGTSFTHLNDPAYKQITNSFNSAIKLQRYSSSSSDELPFEYCYDLSSNKTVELP---I 379
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
LT+ G + + IS EG + +CLG+L + V+ I+G + G
Sbjct: 380 NLTMKGGDNYLVTDPIVTISGEGVNL------LCLGVLKSNNVN-----IIGQNFMTGYR 428
Query: 547 VVYDNVNKRIGWAKSHC 563
+V+D N +GW +S+C
Sbjct: 429 IVFDRENMILGWRESNC 445
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 95/406 (23%), Positives = 177/406 (43%), Gaps = 30/406 (7%)
Query: 165 IRPHKSKINKKLVSSNAVA-VDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGS 223
++ +S+++K L N+V +DS+++ G++ YF + +G P R L DTGS
Sbjct: 98 VKYIQSRLSKNLGRENSVKELDSTTLPAKSGSLIGSANYFVVVGLGTPKRDLSLVFDTGS 157
Query: 224 DLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYC-ETCQQCD 279
DLTW QC+ SC K + ++ P + + SLC ++ C + C
Sbjct: 158 DLTWTQCEPCAGSCYKQQDAIFDPSKSSSYINITCTSSLCTQLTSAGIKSRCSSSTTACI 217
Query: 280 YEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLS 339
Y I+Y D S+S+G L+++ L +T + + +FGC D +GL + G++GL
Sbjct: 218 YGIQYGDKSTSVGFLSQERLTITATD---IVDDFLFGCGQDNEGLFSGSA----GLIGLG 270
Query: 340 RAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLD-SPFMEL 398
R +S Q +S I + +CL + + G++ G + + + P+ S
Sbjct: 271 RHPISFVQQTSS--IYNKIFSYCLPSTSSSLGHLTFGASAATNANLKYTPLSTISGDNTF 328
Query: 399 YHTEILKINYGSSPL-NLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLV 457
Y +I+ I+ G + L + + G ++ D+G+ T AY+ L ++ ++ V
Sbjct: 329 YGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPV 388
Query: 458 LDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKG 517
+ D C+ F + V + + F + + G L+
Sbjct: 389 AN-EDGLFDTCY--DFSGYKEISVPK----IDFEFAGGVTV-----ELPLVGILIGRSAQ 436
Query: 518 NICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+CL + ++ I G++ + VVYD RIG+ + C
Sbjct: 437 QVCLAF--AANGNDNDITIFGNVQQKTLEVVYDVEGGRIGFGAAGC 480
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 105/392 (26%), Positives = 169/392 (43%), Gaps = 33/392 (8%)
Query: 178 SSNAVAVDSSSIFPLR-GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236
S N + S PL+ G+ G Y G P + L +DTGSD+TWIQC PCS
Sbjct: 113 SKNNGTYSTMSNLPLQPGSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCK-PCSD 171
Query: 237 CAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLAR 296
C +P+++P+ + + L +C C YEI Y D S S G ++
Sbjct: 172 CYSQVDPIFEPQQSSSYKHLSCLSSACTELTTMNHCRL-GGCVYEINYGDGSRSQGDFSQ 230
Query: 297 DELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK 356
+ L L GS + P+ FGC + GL + G+LGL R +S PSQ S+
Sbjct: 231 ETLTL----GSDSFPSFAFGCGHTNTGLFKGSA----GLLGLGRTALSFPSQTKSK--YG 280
Query: 357 NVVGHCLT--TNAGGGGYMFLGHDLVPSWGMAWVPML-DSPFMELYHTEILKINYGSSPL 413
+CL ++ G +G +P+ +VP++ +S + Y + I+ G L
Sbjct: 281 GQFSYCLPDFVSSTSTGSFSVGQGSIPATA-TFVPLVSNSNYPSFYFVGLNGISVGGERL 339
Query: 414 NLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDP--TLPVCWRA 471
++ G + D+G+ T QAY L S + + + L ++ P L C
Sbjct: 340 SIPPAVLGRGGTIVDSGTVITRLVPQAYDALKTSFRSKTRN---LPSAKPFSILDTC--- 393
Query: 472 KFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHN 531
+ + S V+ T+T HF + + + I + + S +CL S+ +
Sbjct: 394 -YDLSSYSQVR--IPTITFHFQNNADVAVSAVGIL---FTIQSDGSQVCLAFASASQ--S 445
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
ST I+G+ + V +D RIG+A C
Sbjct: 446 ISTNIIGNFQQQRMRVAFDTGAGRIGFAPGSC 477
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 83/282 (29%), Positives = 129/282 (45%), Gaps = 35/282 (12%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
PL G+ PD GLY+ + +G P + YY+ +DTGSD+ W+ C C C + ++
Sbjct: 73 IPLGGSGRPDAVGLYYAKIGIGTPSKDYYVQVDTGSDIVWVNC-IQCRECPRTSSLGMEL 131
Query: 243 ---PLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARD-- 297
L + G ++ + C+E+ G C T C Y Y D SS+ G +D
Sbjct: 132 TPYDLEESTTGKLVSCDEQFCLEVNGGPLSG-CTTNMSCPYLQIYGDGSSTAGYFVKDYV 190
Query: 298 -------ELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKT-DGILGLSRAKVSLPSQL 349
+L T NGS+ FGC Q G L ++ + DGILG ++ S+ SQL
Sbjct: 191 QYNRVSGDLETTAANGSIK-----FGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQL 245
Query: 350 ASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYG 409
AS +K + HCL GGG +GH + P M P++ P Y+ + + G
Sbjct: 246 ASTRKVKKMFAHCL-DGTNGGGIFAMGHVVQPKVNM--TPLV--PNQPHYNVNMTGVQVG 300
Query: 410 SSPLNLGARNSQVG---WALFDTGSSYTYFTKQAYSELIASL 448
LN+ A + G + D+G++ Y + Y L+A +
Sbjct: 301 HIILNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKI 342
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 109 bits (273), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 98/380 (25%), Positives = 164/380 (43%), Gaps = 47/380 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP---------LYKPRMGN 251
L++T + +G P + + +DTGSDL W+ CD C+ CA + +Y P +
Sbjct: 99 LHYTTVQIGTPGVKFMVALDTGSDLFWVPCD--CTRCAASDSTAFASDFDLNVYNPNGSS 156
Query: 252 I---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYAD-HSSSMGVLARDELHLTIENG- 306
+ +SLC H+ T C Y + Y +S+ G+L D LHLT E+
Sbjct: 157 TSKKVTCNNSLC-----THRSQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNH 211
Query: 307 -SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
L + NV+FGC Q G L+ + +G+ GL K+S+PS L+ +G + C
Sbjct: 212 HDLVEANVIFGCGQIQSGSFLD-VAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGR 270
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
+ G G + G S+ P +P Y+ + ++ G++ +++ A
Sbjct: 271 D--GIGRISFGDK--GSFDQDETPFNLNPSHPTYNITVTQVRVGTTVIDVEFT------A 320
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
LFD+G+S+TY Y+ L S D S C+ P + +
Sbjct: 321 LFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMS-PDANTSLIPSV- 378
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQ 545
+LT+ GS + + IS + LV CL ++ +E++ I+G + G
Sbjct: 379 -SLTMGGGSHFAVYDPIIIISTQSELV------YCLAVVKSAELN-----IIGQNFMTGY 426
Query: 546 LVVYDNVNKRIGWAKSHCMN 565
VV+D +GW K C +
Sbjct: 427 RVVFDREKLVLGWKKFDCYD 446
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 109 bits (273), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 107/405 (26%), Positives = 172/405 (42%), Gaps = 57/405 (14%)
Query: 177 VSSNAVAVDSSSIFPLR-GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS 235
+SS+ V + + P++ G G Y + +G P + + L DTGSDLTW QC+
Sbjct: 107 LSSHGVFQEKQATLPVQSGASIGSGDYAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAK 166
Query: 236 SCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYC--------ETCQQ--CDYEIEYA 285
+C K P P YK+ C +C E+C C Y+++Y
Sbjct: 167 TCYKQKEPRLDPTKST--SYKNISC-------SSAFCKLLDTEGGESCSSPTCLYQVQYG 217
Query: 286 DHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSL 345
D S S+G A + L L+ N N +FGC GL G+LGL R K+SL
Sbjct: 218 DGSYSIGFFATETLTLSSSN---VFKNFLFGCGQQNSGLFRGAA----GLLGLGRTKLSL 270
Query: 346 PSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLD----SPFMELYHT 401
PSQ A + K + +CL ++ GY+ G + S + + P+ + +PF Y
Sbjct: 271 PSQTAQK--YKKLFSYCLPASSSSKGYLSFGGQV--SKTVKFTPLSEDFKSTPF---YGL 323
Query: 402 EILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDAS 461
+I +++ G + L++ A + D+G+ T AYS L ++ +++ +D D
Sbjct: 324 DITELSVGGNKLSIDASIFSTSGTVIDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGY 383
Query: 462 DPTLPVCWR-AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNI 519
C+ +K I V FK + I G L ++ +
Sbjct: 384 S-IFDTCYDFSKNETIKIPKVGVSFKG------------GVEMDIDVSGILYPVNGLKKV 430
Query: 520 CLGIL-DGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
CL +G +V I G+ + VVYD+ R+G+A S C
Sbjct: 431 CLAFAGNGDDVK---AAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472
>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
Length = 515
Score = 109 bits (273), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 104/379 (27%), Positives = 161/379 (42%), Gaps = 47/379 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN---------PLYKPRMGN 251
LY+ ++ VG P + + +DTGSDL W+ CD C CA + +Y+P
Sbjct: 95 LYYAWVDVGTPATSFLVALDTGSDLFWVPCD--CIQCAPLSGYRGNLDRDLRIYRPAEST 152
Query: 252 I---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTI-ENG 306
LP LC + PG Q C Y I+Y +++++S G+L D LHL E+
Sbjct: 153 TSRHLPCSHELCQSV-----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDH 207
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
+V+ GC Q G L+ + DG+L L A +S+PS LA G+++N C +
Sbjct: 208 VPVNASVIIGCGQKQSGDYLDG-IAPDGLLALGMADISVPSFLARAGLVQNSFSMCFKED 266
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELY-HTEILKINYGSSPLNLGARNSQVGWA 425
+ G +F G VPS +PF+ LY + +N S + A
Sbjct: 267 S--SGRIFFGDQGVPS-------QQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKA 317
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
L D+G+S+T Y + + + D T C+ A P+ + DV
Sbjct: 318 LVDSGTSFTSLPFDVYKAFTMEFDK-QMNATRVPYEDTTWKYCYSAS-PLE-MPDVPTI- 373
Query: 486 KTLTLHFGSKWQIVSTKFHIS-PEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
TLT Q V+ + +G L CL +L +E I+ L G
Sbjct: 374 -TLTFAADKSLQAVNPILPFNDKQGAL-----AGFCLAVLPSTEPIG----IIAQNFLVG 423
Query: 545 QLVVYDNVNKRIGWAKSHC 563
VV+D + ++GW +S C
Sbjct: 424 YHVVFDRESMKLGWYRSEC 442
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 108/377 (28%), Positives = 166/377 (44%), Gaps = 45/377 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYK 256
G YFT + VG PPR Y+ +DTGSD+ WIQC APC C ++P++ PR + +
Sbjct: 124 GEYFTRIGVGTPPRYVYMVLDTGSDIVWIQC-APCKRCYAQSDPVFDPRKSRSFASIACR 182
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
LC R PG Q C Y++ Y D S + G + + LT + + V G
Sbjct: 183 SPLC---HRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTET--LTFRRTRVAR--VALG 235
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG--GYMF 374
C +D +GL V G+LGL R ++S PSQ + + +CL + M
Sbjct: 236 CGHDNEGL----FVGAAGLLGLGRGRLSFPSQTGRR--FNHKFSYCLVDRSASSKPSSMV 289
Query: 375 LGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLN------LGARNSQVGWALF 427
G V S + P++ +P ++ Y+ E+L I+ G + + + G +
Sbjct: 290 FGDSAV-SRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVII 348
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKT 487
D+G+S T T+ AY + + +S+ L C F + +VK T
Sbjct: 349 DSGTSVTRLTRPAYIAFRDAFRAGASN-LKRAPQFSLFDTC----FDLSGKTEVK--VPT 401
Query: 488 LTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
+ LHF + VS + YL+ + GN CL G I+G+I +G
Sbjct: 402 VVLHF--RGADVS----LPASNYLIPVDTSGNFCLAFAG----TMGGLSIIGNIQQQGFR 451
Query: 547 VVYDNVNKRIGWAKSHC 563
VVYD R+G+A C
Sbjct: 452 VVYDLAGSRVGFAPHGC 468
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 107/403 (26%), Positives = 174/403 (43%), Gaps = 57/403 (14%)
Query: 184 VDSSSIFPLRGNIYPDGL----YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK 239
+ SS+ P+ Y DG+ Y ++ +G PP+P L +DTGS L W QC PC+ C
Sbjct: 13 LSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQ-PCAVCFN 71
Query: 240 GANPLYKPRMGNI--LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARD 297
+ P Y + LP DS ++ + +T Q C Y Y D S+++G L +
Sbjct: 72 QSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVE 131
Query: 298 ELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKN 357
+ + P VVFGC + G+ + GI G R +SLPSQL + N
Sbjct: 132 TVSFV---AGASVPGVVFGCGLNNTGIFRS---NETGIAGFGRGPLSLPSQLK----VGN 181
Query: 358 VVGHCLTTNAGG--GGYMF-LGHDLVPS--WGMAWVPMLDSPFM-ELYHTEILKINYGSS 411
HC T +G +F L DL + + P++ +P Y+ + I GS+
Sbjct: 182 -FSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGST 240
Query: 412 ----PLNLGARNSQVGWALFDTGSSYTYFTKQAY----SELIASLKEVSSDGLVLDASDP 463
P + A + G + D+G+++T + Y E A +K L + S+
Sbjct: 241 RLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVK------LPVVPSNE 294
Query: 464 TLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKG---NIC 520
T P+ + P+ V + L LHF H+ E Y+ +K G +IC
Sbjct: 295 TGPLLCFSAPPLGKAPHVPK----LVLHFE------GATMHLPRENYVFEAKDGGNCSIC 344
Query: 521 LGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
L I++ G I+G+ + V+YD N ++ + ++ C
Sbjct: 345 LAIIE------GEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 381
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 109 bits (272), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 107/403 (26%), Positives = 174/403 (43%), Gaps = 57/403 (14%)
Query: 184 VDSSSIFPLRGNIYPDGL----YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK 239
+ SS+ P+ Y DG+ Y ++ +G PP+P L +DTGS L W QC PC+ C
Sbjct: 69 LSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQ-PCAVCFN 127
Query: 240 GANPLYKPRMGNI--LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARD 297
+ P Y + LP DS ++ + +T Q C Y Y D S+++G L +
Sbjct: 128 QSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVE 187
Query: 298 ELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKN 357
+ + P VVFGC + G+ + GI G R +SLPSQL + N
Sbjct: 188 TVSFV---AGASVPGVVFGCGLNNTGIFRS---NETGIAGFGRGPLSLPSQLK----VGN 237
Query: 358 VVGHCLTTNAGG--GGYMF-LGHDLVPS--WGMAWVPMLDSPFM-ELYHTEILKINYGSS 411
HC T +G +F L DL + + P++ +P Y+ + I GS+
Sbjct: 238 -FSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGST 296
Query: 412 ----PLNLGARNSQVGWALFDTGSSYTYFTKQAY----SELIASLKEVSSDGLVLDASDP 463
P + A + G + D+G+++T + Y E A +K L + S+
Sbjct: 297 RLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVK------LPVVPSNE 350
Query: 464 TLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKG---NIC 520
T P+ + P+ V + L LHF H+ E Y+ +K G +IC
Sbjct: 351 TGPLLCFSAPPLGKAPHVPK----LVLHFE------GATMHLPRENYVFEAKDGGNCSIC 400
Query: 521 LGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
L I++ G I+G+ + V+YD N ++ + ++ C
Sbjct: 401 LAIIE------GEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 109 bits (272), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 104/374 (27%), Positives = 168/374 (44%), Gaps = 37/374 (9%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + +G PP Y DTGSDLTW C PC++C K NP++ P+ Y++
Sbjct: 70 GHYLMELSIGTPPFKIYGIADTGSDLTWTSC-VPCNNCYKQRNPMFDPQKSTT--YRNIS 126
Query: 260 CMEIQRNHK--PGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENG-SLTKPNVVFG 316
C + + HK G C ++C+Y YA + + GVLA++ + L+ G S+ +VFG
Sbjct: 127 C-DSKLCHKLDTGVCSPQKRCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLKGIVFG 185
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGGYM 373
C ++ G + + GI+GL VSL SQ+ S K CL T+ M
Sbjct: 186 CGHNNTGGFNDHEM---GIIGLGGGPVSLISQMGSSFGGKR-FSQCLVPFHTDVSVSSKM 241
Query: 374 FLGH-DLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPL--NLGARNSQVGWALFDTG 430
G V G+ P++ Y +L I+ ++ L N ++N + G D+G
Sbjct: 242 SFGKGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNVEKGNMFLDSG 301
Query: 431 SSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTL 490
+ T Q Y +++A ++ + V D D +C+R K +R V LT
Sbjct: 302 TPPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLCYRTKNNLRGPV--------LTA 353
Query: 491 HF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVY 549
HF G+ ++ T+ ISP K G CLG + S +G + G+ + L+ +
Sbjct: 354 HFEGADVKLSPTQTFISP-------KDGVFCLGFTNTSS--DGG--VYGNFAQSNYLIGF 402
Query: 550 DNVNKRIGWAKSHC 563
D + + + C
Sbjct: 403 DLDRQVVSFKPKDC 416
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 109 bits (272), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 125/467 (26%), Positives = 205/467 (43%), Gaps = 60/467 (12%)
Query: 112 KSNNDDENKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPH--K 169
S N + + + PL H++G SQ + D+ S ++ R + K
Sbjct: 44 SSVNLEPSSATLSVPLVHRYGPCAASQ---------YSDMPTPSFSETLRHSRARTNYIK 94
Query: 170 SKINKKLVSSNAVAVDSSSIFPLRGNIYPDGL-YFTYMIVGNPPRPYYLDMDTGSDLTWI 228
S+ + + S+ D++ P R + D L Y + G P P L MDTGSD++W+
Sbjct: 95 SRASTGMASTPD---DAAVTVPTRLGGFVDSLEYMVTLGFGTPSVPQVLLMDTGSDVSWV 151
Query: 229 QCDAPCSS--CAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIE 283
QC APC+S C +PL+ P + + C ++ +++ G QC Y +E
Sbjct: 152 QC-APCNSTECYPQKDPLFDPSKSSTYAPIACGADACNKLGDHYRNGCTSGGTQCGYRVE 210
Query: 284 YADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKV 343
Y D SS+ GV + + +T G +T + FGC +DQ+G K DG+LGL A
Sbjct: 211 YGDGSSTRGVYSNET--ITFAPG-ITVKDFHFGCGHDQRG----PSDKFDGLLGLGGAPE 263
Query: 344 SLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMEL----- 398
SL Q AS + +CL G++ LG + PS + +P L
Sbjct: 264 SLVVQTAS--VYGGAFSYCLPALNSEAGFLALG--VRPSAATNTSAFVFTPMWHLPMDAT 319
Query: 399 -YHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLV 457
Y + I+ G PL++ R++ G L D+G+ T + AY+ L A+L++ + +
Sbjct: 320 SYMVNMTGISVGGKPLDI-PRSAFRGGMLIDSGTIVTELPETAYNALNAALRKAFAAYPM 378
Query: 458 LDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKG 517
+ + D C+ F S V V + LT G+ + P G LV
Sbjct: 379 VASED--FDTCY--NFTGYSNVTVPRV--ALTFSGGATIDL------DVPNGILV----- 421
Query: 518 NICLGILD-GSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
CL + G +V G I+G+++ R V+YD + ++G+ C
Sbjct: 422 KDCLAFRESGPDVGLG---IIGNVNQRTLEVLYDAGHGKVGFRAGAC 465
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 109 bits (272), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 107/398 (26%), Positives = 167/398 (41%), Gaps = 54/398 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCD---APCSSCAKGA---NPLY---KPRMG 250
G Y M G PP+ L DTGSDL W+QC AP + C K A P + K
Sbjct: 52 GQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATL 111
Query: 251 NILPYKDSLCMEI--QRNHKPGYCETCQ--QCDYEIEYADHSSSMGVLARDELHLTIENG 306
+++P + C+ + R H P C C Y +YAD SS+ G LARD TI NG
Sbjct: 112 SVVPCSAAQCLLVPAPRGHGPS-CSPAAPVPCGYAYDYADGSSTTGFLARDT--ATISNG 168
Query: 307 S---LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
+ V FGC QG + T G++GL + ++S P+Q S + +CL
Sbjct: 169 TSGGAAVRGVAFGCGTRNQG---GSFSGTGGVIGLGQGQLSFPAQ--SGSLFAQTFSYCL 223
Query: 364 TTNAGG-----GGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGA 417
GG ++FLG + A+ P++ +P Y+ ++ I G+ L +
Sbjct: 224 LDLEGGRRGRSSSFLFLGRPERRA-AFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPG 282
Query: 418 RNSQV-----GWALFDTGSSYTYFTKQAYSELIAS------LKEVSSDGLVLDASDPTLP 466
+ G + D+GS+ TY AY L+++ L + S L
Sbjct: 283 SEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQG----LE 338
Query: 467 VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDG 526
+C+ S+ F LT+ F + + YLV CL I
Sbjct: 339 LCYNVS-SSSSLAPANGGFPRLTIDFAQGLSL-----ELPTGNYLVDVADDVKCLAIRP- 391
Query: 527 SEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCM 564
+ + +LG++ +G V +D + RIG+A++ C+
Sbjct: 392 -TLSPFAFNVLGNLMQQGYHVEFDRASARIGFARTECV 428
>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 417
Score = 109 bits (272), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 98/380 (25%), Positives = 164/380 (43%), Gaps = 47/380 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY------------KPR 248
L++T + +G P + + +DTGSDL W+ CD CS CA Y K
Sbjct: 3 LHYTTVQLGTPGTKFMVALDTGSDLFWVPCD--CSRCAPTEGSPYASDFELSVYSPKKSS 60
Query: 249 MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYAD-HSSSMGVLARDELHLTIEN-- 305
+P +SLC QR+ E C Y + Y +S+ G+L D LHL EN
Sbjct: 61 TSKTVPCNNSLCA--QRDQ---CTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTENKH 115
Query: 306 GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
+ + FGC Q G L+ + +G+ GL ++S+PS L+ +G++ N C +
Sbjct: 116 SEPIQAYITFGCGQVQSGSFLD-VAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSD 174
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
+ G G + G S P + Y+ + I G++ ++ A
Sbjct: 175 D--GVGRINFGDK--GSLEQEETPFNLNQLHPNYNITVTSIRVGTTLIDADIT------A 224
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
LFD+G+S++YFT YS+L AS + DG +P +P + + +
Sbjct: 225 LFDSGTSFSYFTDPIYSKLSASFHAQTRDG--RHPPNPRIPFEYCYNMSPDANASLTPGI 282
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQ 545
+LT+ G + + IS + L+ CL ++ +E++ I+G + G
Sbjct: 283 -SLTMKGGGPFPVYDPIIVISTQNELI------YCLAVVKSAELN-----IIGQNFMTGY 330
Query: 546 LVVYDNVNKRIGWAKSHCMN 565
+V+D +GW K C +
Sbjct: 331 RIVFDREKLVLGWKKFDCYD 350
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 107/374 (28%), Positives = 167/374 (44%), Gaps = 42/374 (11%)
Query: 206 MIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYKDSLCME 262
+ VG + L +DTGSDLTW+QC PC C PL+ P + LP C+
Sbjct: 147 VTVGIGGQNSTLIVDTGSDLTWVQC-LPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVA 205
Query: 263 IQ-RNHKPGYC--ETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
+Q G C + CDY+I+Y D S S G L ++L L G N +FGC
Sbjct: 206 LQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTL----GKTEIDNFIFGCGR 261
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHD 378
+ +GL G++GL+R+++SL SQ +S + +V +CL TT G G + LG
Sbjct: 262 NNKGL----FGGASGLMGLARSELSLVSQTSS--LFGSVFSYCLPTTGVGSSGSLTLGGA 315
Query: 379 LVPSWG----MAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVG-WALFDTGSS 432
++ +++ M+ +P M Y + I+ G LN+ +S G +L D+G+
Sbjct: 316 DFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTV 375
Query: 433 YTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWR-AKFPIRSIVDVKQFFKTLTLH 491
T + Y A ++ S G L C+ + +I VK F+
Sbjct: 376 ITRLSPSIYKAFKAEFEKQFS-GYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFE----- 429
Query: 492 FGSKWQIVSTKFHISPEG--YLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVY 549
G+ IV EG Y V S ICL S + T+I+G+ + Q V+Y
Sbjct: 430 -GNAEMIVDV------EGVFYFVKSDASQICLAF--ASLGYEDQTMIIGNYQQKNQRVIY 480
Query: 550 DNVNKRIGWAKSHC 563
++ ++G+A C
Sbjct: 481 NSKESKVGFAGEPC 494
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 108 bits (271), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 163/363 (44%), Gaps = 42/363 (11%)
Query: 217 LDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYKDSLCMEIQ-RNHKPGYC 272
L +DTGSDLTW+QC PC C PL+ P + LP C+ +Q G C
Sbjct: 79 LIVDTGSDLTWVQC-LPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLC 137
Query: 273 --ETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLV 330
+ CDY+I+Y D S S G L ++L L G N +FGC + +GL
Sbjct: 138 SNKNSTSCDYQIDYGDGSYSRGELGFEKLTL----GKTEIDNFIFGCGRNNKGL----FG 189
Query: 331 KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHDLVPSWG----M 385
G++GL+R+++SL SQ +S + +V +CL TT G G + LG ++ +
Sbjct: 190 GASGLMGLARSELSLVSQTSS--LFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPI 247
Query: 386 AWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVG-WALFDTGSSYTYFTKQAYSE 443
++ M+ +P M Y + I+ G LN+ +S G +L D+G+ T + Y
Sbjct: 248 SYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKA 307
Query: 444 LIASLKEVSSDGLVLDASDPTLPVCWR-AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTK 502
A ++ S G L C+ + +I VK F+ G+ IV
Sbjct: 308 FKAEFEKQFS-GYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFE------GNAEMIVDV- 359
Query: 503 FHISPEG--YLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAK 560
EG Y V S ICL S + T+I+G+ + Q V+Y++ ++G+A
Sbjct: 360 -----EGVFYFVKSDASQICLAF--ASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAG 412
Query: 561 SHC 563
C
Sbjct: 413 EPC 415
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 108 bits (271), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 108/383 (28%), Positives = 177/383 (46%), Gaps = 58/383 (15%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYK 256
G YFT + VG PP+ Y+ +DTGSD+ WIQC APC C +P++ P+ + + +
Sbjct: 145 GEYFTRLGVGTPPKYVYMVLDTGSDVVWIQC-APCRKCYSQTDPVFDPKKSGSFSSISCR 203
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
LC+ R PG C + Q C Y++ Y D S + G + + L G+ P V G
Sbjct: 204 SPLCL---RLDSPG-CNSRQSCLYQVAYGDGSFTFGEFSTETLTF---RGTRV-PKVALG 255
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG--GYMF 374
C +D +GL V G+LGL R ++S P+Q + +CL + +
Sbjct: 256 CGHDNEGL----FVGAAGLLGLGRGRLSFPTQTGLR--FGRKFSYCLVDRSASSKPSSVV 309
Query: 375 LGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQVGWALF------ 427
G V S + P++ +P ++ Y+ E+ I+ G GAR + + +LF
Sbjct: 310 FGQSAV-SRTAVFTPLITNPKLDTFYYLELTGISVG------GARVAGITASLFKLDTAG 362
Query: 428 ------DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
D+G+S T T++AY L + + ++D + A D +L + F + +V
Sbjct: 363 NGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAAD--LKRAPDYSL---FDTCFDLSGKTEV 417
Query: 482 KQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDI 540
K T+ +HF G+ + +T + I + G C + +G +II G+I
Sbjct: 418 K--VPTVVMHFRGADVSLPATNYLIP------VDTNGVFCFAF---AGTMSGLSII-GNI 465
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+G VV+D RIG+A C
Sbjct: 466 QQQGFRVVFDVAASRIGFAARGC 488
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 108 bits (271), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 151/340 (44%), Gaps = 34/340 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + +G PP Y DTGSDLTW C PC+ C K NP++ P+ Y++
Sbjct: 23 GHYLMEVSIGTPPFKIYGIADTGSDLTWTSC-VPCNKCYKQRNPIFDPQKST--SYRNIS 79
Query: 260 CMEIQRNHK--PGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENG-SLTKPNVVFG 316
C + + HK G C + C+Y YA + + GVLA++ + L+ G S+ +VFG
Sbjct: 80 C-DSKLCHKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIVFG 138
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGGYM 373
C ++ G + + GI+GL VS SQ+ S K CL T+ M
Sbjct: 139 CGHNNTGGFND---REMGIIGLGGGPVSFISQIGSSFGGKR-FSQCLVPFHTDVSVSSKM 194
Query: 374 FLGH-DLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV---GWALFDT 429
LG V G+ P++ Y +L I+ G++ L+ +SQ G D+
Sbjct: 195 SLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQSVEKGNVFLDS 254
Query: 430 GSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLT 489
G+ T Q Y L+A ++ + V + D +C+R K +R V LT
Sbjct: 255 GTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYRTKNNLRGPV--------LT 306
Query: 490 LHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSE 528
HF G +++ T+ +SP K G CLG + S
Sbjct: 307 AHFEGGDVKLLPTQTFVSP-------KDGVFCLGFTNTSS 339
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 108 bits (271), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 96/373 (25%), Positives = 158/373 (42%), Gaps = 42/373 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS-SCAKGANPLYKPRMGNILPYKDS 258
G Y T M +G P +PY + +DTGS LTW+QC +PC SC + + P++ P+ + Y
Sbjct: 115 GNYVTRMGLGTPAKPYIMVVDTGSSLTWLQC-SPCRVSCHRQSGPVFDPKTSS--SYAAV 171
Query: 259 LCMEIQRNH------KPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPN 312
C Q + P C C Y+ Y D S S+G L++D T+ G+ + PN
Sbjct: 172 SCSSPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKD----TVSFGANSVPN 227
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY 372
+GC D +GL ++ G++GL+R K+SL QLA + +CL + + GY
Sbjct: 228 FYYGCGQDNEGL----FGRSAGLMGLARNKLSLLYQLAPT--LGYSFSYCLPSTS-SSGY 280
Query: 373 MFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQVGWALFDTGS 431
+ +G G ++ PM+ + + LY + + PL + + + D+G+
Sbjct: 281 LSIGS--YNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPTIIDSGT 338
Query: 432 SYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFP-IRSIVDVKQFFKTLTL 490
T Y+ L ++ A+ L C+ + +R++ V F
Sbjct: 339 VITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKLRAVPAVSMAFSG--- 395
Query: 491 HFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYD 550
+S LV CL S I+G+ + VVYD
Sbjct: 396 ---------GATLKLSAGNLLVDVDGATTCLAFAPAR-----SAAIIGNTQQQTFSVVYD 441
Query: 551 NVNKRIGWAKSHC 563
+ RIG+A + C
Sbjct: 442 VKSNRIGFAAAGC 454
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 108 bits (271), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 100/419 (23%), Positives = 180/419 (42%), Gaps = 42/419 (10%)
Query: 150 DLDGESVVASVNDGIIRPHK-------SKINKKLVSSNAVA-VDSSSIFPLRGNIYPDGL 201
D DG++ + + I+ K S+++K L ++V +DS+++ G++ G
Sbjct: 86 DHDGKAKSTTPHSDILNQDKERVKYINSRLSKNLGQDSSVEELDSATLPAKSGSLIGSGN 145
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
YF + +G P R L DTGSDLTW QC+ SC K + ++ P + +
Sbjct: 146 YFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDVIFDPSKSTSYSNITCTSA 205
Query: 259 LCMEIQR--NHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
LC ++ + PG + + C Y I+Y D S S+G +R+ L +T + N +FG
Sbjct: 206 LCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRERLTVTATD---VVDNFLFG 262
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C + QGL + G++GL R +S Q A++ + + +CL + + G++ G
Sbjct: 263 CGQNNQGLFGGSA----GLIGLGRHPISFVQQTAAK--YRKIFSYCLPSTSSSTGHLSFG 316
Query: 377 HDLVPSWGMAWVPMLD-SPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTY 435
+ + + P S Y +I I G L + + G A+ D+G+ T
Sbjct: 317 PAATGRY-LKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFSTGGAIIDSGTVITR 375
Query: 436 FTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF--FKTLTLHFG 493
AY L ++ ++ G+ S L + + + D+ + F T+ F
Sbjct: 376 LPPTAYGALRSAFRQ----GMSKYPSAGELSI-------LDTCYDLSGYKVFSIPTIEFS 424
Query: 494 SKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNV 552
+ + P+G L ++ +CL + + I G++ R VVYD
Sbjct: 425 FAGGVT---VKLPPQGILFVASTKQVCLAFAANGD--DSDVTIYGNVQQRTIEVVYDVG 478
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 104/380 (27%), Positives = 169/380 (44%), Gaps = 53/380 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYK 256
G YF+ + VG P R + +DTGSD+TWIQC+ PCS C + ++P+Y P + + ++ +
Sbjct: 143 GEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCE-PCSDCYQQSDPIYNPALSSSYKLVGCQ 201
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
+LC ++ + C C Y++ Y D S + G A + L L G NV G
Sbjct: 202 ANLCQQLDVSG----CSRNGSCLYQVSYGDGSYTQGNFATETLTL----GGAPLQNVAIG 253
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-TNAGGGGYMFL 375
C +D +GL V G+LGL +S PSQL + + +CL ++ +
Sbjct: 254 CGHDNEGL----FVGAAGLLGLGGGSLSFPSQLTDEN--GKIFSYCLVDRDSESSSTLQF 307
Query: 376 GHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNL-----GARNSQVGWALFDT 429
G VP+ G PML + ++ Y+ + I+ G L++ G S G + D+
Sbjct: 308 GRAAVPN-GAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGVIVDS 366
Query: 430 GSSYTYFTKQAYSELIASLKEV-----SSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
G++ T AY L + + S+DG+ L C+ + VDV
Sbjct: 367 GTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVSL------FDTCY--DLSSKESVDV--- 415
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLR 543
T+ HF + + + YLV + G C S S I+G+I +
Sbjct: 416 -PTVVFHFSGGGSM-----SLPAKNYLVPVDSMGTFCFAFAPTSS----SLSIVGNIQQQ 465
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
G V +D N ++G+A + C
Sbjct: 466 GIRVSFDRANNQVGFAVNKC 485
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 108 bits (270), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 90/274 (32%), Positives = 130/274 (47%), Gaps = 24/274 (8%)
Query: 190 FPLRG-NI-YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPL--- 244
PL G NI Y GLY+T + +G P YY+ +DTGS W+ C C ++ L
Sbjct: 69 LPLGGFNIPYGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVN-GISCKQCPHESDILRKL 127
Query: 245 --YKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL- 301
Y PR + + K+ C + +P C +C Y YAD +MG+L D LH
Sbjct: 128 TFYDPR--SSVSSKEVKCDDTICTSRPP-CNMTLRCPYITGYADGGLTMGILFTDLLHYH 184
Query: 302 -TIENGSL--TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNV 358
NG T +V FGC Q G L N+ V DGI+G + + SQLA+ G K +
Sbjct: 185 QLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKI 244
Query: 359 VGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGA 417
HCL + GGG + +G + P + P++ + E+YH LK IN + L L A
Sbjct: 245 FSHCLDSTNGGGIFA-IGEVVEPK--VKTTPIVKNN--EVYHLVNLKSINVAGTTLQLPA 299
Query: 418 R---NSQVGWALFDTGSSYTYFTKQAYSELIASL 448
++ D+GS+ Y + YSELI ++
Sbjct: 300 NIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAV 333
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 108 bits (269), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 109/371 (29%), Positives = 159/371 (42%), Gaps = 46/371 (12%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS--SCAKGANPLYKPRMGN---ILPYK 256
Y +G P L++DTGSDL+W+QC PC+ SC + +PL+ P + +P
Sbjct: 137 YVVTASLGTPGMAQTLEVDTGSDLSWVQCK-PCAAPSCYRQKDPLFDPAQSSSYAAVPCG 195
Query: 257 DSLCMEIQRNHKPGYCETCQ--QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
S C + Y C QC Y + Y D S++ GV + D L L T +
Sbjct: 196 RSACAGLGI-----YASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLAANA---TVQGFL 247
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMF 374
FGC + Q G L + DG+LG R + SL Q A G V +CL T + GY+
Sbjct: 248 FGCGHAQSGGLFTGI---DGLLGFGREQPSLVQQTA--GAYGGVFSYCLPTKSSTTGYLT 302
Query: 375 LGHDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQVGWALFDTGSSY 433
LG + G + +L SP Y+ +L I+ G PL++ A G + DTG+
Sbjct: 303 LGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFAAG-TVVDTGTVI 361
Query: 434 TYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFG 493
T AY A+L+ G+ AS P+ P PI I+D F +G
Sbjct: 362 TRLPPAAY----AALRSAFRSGM---ASYPSAP-------PI-GILDTCYSFA----GYG 402
Query: 494 SKWQIVSTKFHISPEGYLVISKKGNICLGILD-GSEVHNGSTIILGDISLRGQLVVYDNV 552
+ + S S + + G + G L S +GS ILG++ R V D
Sbjct: 403 TV-NLTSVALTFSSGATMTLGADGIMSFGCLAFASSGSDGSMAILGNVQQRSFEVRID-- 459
Query: 553 NKRIGWAKSHC 563
+G+ S C
Sbjct: 460 GSSVGFRPSSC 470
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 108 bits (269), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 85/277 (30%), Positives = 129/277 (46%), Gaps = 39/277 (14%)
Query: 190 FPLRGN--IYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN---PL 244
FP+ G+ I+ GLY+T + +G PP+ +Y+D+DTGS++ W++C APC+ C + P+
Sbjct: 27 FPISGDNDIFAMGLYYTRISLGTPPQQFYVDVDTGSNVAWVKC-APCTGCEHSGDVPVPM 85
Query: 245 --YKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
+ PR + D+ C + N K C Y + Y D SS+ G D
Sbjct: 86 STFDPRKSTTKISISCTDAECGVL--NKKLQCSPERLSCPYSLLYGDGSSTAGYYLNDVF 143
Query: 300 HL--------TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLAS 351
T ++G+ +VFGC Q G DG+LG VSLP+QLA
Sbjct: 144 TFNQVPSDNSTAKSGT---ARLVFGCGGTQTGSW-----SVDGLLGFGPTTVSLPNQLAQ 195
Query: 352 QGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSS 411
Q I N+ HCL + G G + +G P + + PM+ F E H + +N G S
Sbjct: 196 QNISVNIFAHCLQGDVSGRGSLVIGTIREPD--LVYTPMV---FGE-DHYNVQLLNIGIS 249
Query: 412 PLNLGARNS----QVGWALFDTGSSYTYFTKQAYSEL 444
N+ S G + D+G++ TY + AY E
Sbjct: 250 GRNVTTPASFDLEYTGGVIIDSGTTLTYLVQPAYDEF 286
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 96/387 (24%), Positives = 170/387 (43%), Gaps = 46/387 (11%)
Query: 192 LRGNIYPD-GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG 250
++ I P G Y + +G PP P +DTGSDLTW QC PC+ C K PL+ P+
Sbjct: 81 IQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQC-RPCTHCYKQVVPLFDPK-- 137
Query: 251 NILPYKD-----SLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN 305
N Y+D S C+ + ++ C ++C + YAD S + G LA + L +
Sbjct: 138 NSSTYRDSSCGTSFCLALGKDRS---CSKEKKCTFRYSYADGSFTGGNLASETLTVDSTA 194
Query: 306 GS-LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL- 363
G ++ P FGC + G+ + + GI+GL ++SL SQL S I + +CL
Sbjct: 195 GKPVSFPGFAFGCGHSSGGIFDKS---SSGIVGLGGGELSLISQLKST--INGLFSYCLL 249
Query: 364 ---TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSS--PLNLGAR 418
T ++ F V +G P++ Y+ + I+ G P ++
Sbjct: 250 PVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKGYSK 309
Query: 419 NSQV--GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIR 476
++V G + D+G++YT+ ++ YS+L S+ S G + + +C+ I
Sbjct: 310 KTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVAN-SIKGKRVRDPNGIFSLCYNTTAEIN 368
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
+ + +T HF + P + ++ +C + S++ +
Sbjct: 369 API--------ITAHFK------DANVELQPLNTFMRMQEDLVCFTVAPTSDIG-----V 409
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
LG+++ LV +D KR+ + + C
Sbjct: 410 LGNLAQVNFLVGFDLRKKRVSFKAADC 436
>gi|6579210|gb|AAF18253.1|AC011438_15 T23G18.7 [Arabidopsis thaliana]
Length = 566
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 119/492 (24%), Positives = 198/492 (40%), Gaps = 93/492 (18%)
Query: 140 DAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYP- 198
DA KL R + + E + +R S + +L+ S V + FP+ G P
Sbjct: 76 DAVLKLERLIPPNHELGLTE-----LRAFDSARHGRLLQSPVGGVVN---FPVDGASDPF 127
Query: 199 -DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPRMGNI 252
GLY+T + +G PPR + + +DTGSD+ W+ C + C+ C K + + P + +
Sbjct: 128 LVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTS-CNGCPKTSELQIQLSFFDPGVSSS 186
Query: 253 LPYKDSLCMEIQRNHK-PGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP 311
N + C C Y +Y D S + G D + +++G L +P
Sbjct: 187 ASLVSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGYYISDFMCSNLQSGDLQRP 246
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG 371
DGI GL + +S+ SQLA QG+ V HCL + GGG
Sbjct: 247 RRA-----------------VDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGG 289
Query: 372 YMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG-WALFDTG 430
M LG P + P++ S + + + +N P++ G + DTG
Sbjct: 290 IMVLGQIKRPD--TVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTG 347
Query: 431 SSYTYFTKQAYSELIASLKEVS--SDGLVLDASDPTLP----------VC------WRAK 472
++ Y +AYS I ++ S + P +P +C W +
Sbjct: 348 TTLAYLPDEAYSPFIQAVSVFFFLSSPSAFSVTKPCIPYSVVFAIVESICPQMLHFWN-E 406
Query: 473 FPIRS----IVDV--KQFFKTLTLHFGSK---------------WQIVSTKFHISPEGYL 511
IR ++D+ K+ +KT L + ++I + + P+ L
Sbjct: 407 ITIRCRRYMLLDLTKKKIYKTFNLQVANAVSQYGRPITYESYQCFEITAGDVDVFPQVSL 466
Query: 512 VISKKGNICLGILDGSEVH--NGSTI--------------ILGDISLRGQLVVYDNVNKR 555
+ ++ LG ++ +GS+I ILGD+ L+ ++VVYD V +R
Sbjct: 467 SFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQR 526
Query: 556 IGWAKSHCMNPG 567
IGWA+ C G
Sbjct: 527 IGWAEYDCEFSG 538
>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 524
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 122/467 (26%), Positives = 193/467 (41%), Gaps = 58/467 (12%)
Query: 119 NKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVN-----DGIIRPHKSKIN 173
N F F ++H+F EV Q GRF + N D +IR + +
Sbjct: 25 NGRIFTFEMHHRFS-DEVKQWSD--STGRFAKFPPKGSFEYFNALVLRDWLIRGRRLSES 81
Query: 174 KKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP 233
+ S+ D +S + + L++T + +G P + + +DTGSDL W+ CD
Sbjct: 82 ESESESSLTFSDGNSTSRISSLGF---LHYTTVKLGTPGMRFMVALDTGSDLFWVPCD-- 136
Query: 234 CSSCA--KGAN-------PLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYE 281
C CA +GA +Y P++ + +SLC QRN G T C Y
Sbjct: 137 CGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCA--QRNQCLG---TFSTCPYM 191
Query: 282 IEYAD-HSSSMGVLARDELHLTIE--NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGL 338
+ Y +S+ G+L D +HLT E N + V FGC Q G L+ + +G+ GL
Sbjct: 192 VSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLD-IAAPNGLFGL 250
Query: 339 SRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMEL 398
K+S+PS LA +G++ + C G G + G S P +P
Sbjct: 251 GMEKISVPSVLAREGLVADSFSMCF--GHDGVGRISFGDK--GSSDQEETPFNLNPSHPN 306
Query: 399 YHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVL 458
Y+ + ++ G++ ++ ALFDTG+S+TY Y+ + S + D
Sbjct: 307 YNITVTRVRVGTTLID------DEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHS 360
Query: 459 DASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN 518
S C+ + + +LT+ S + I IS EG LV
Sbjct: 361 PDSRIPFEYCYDMSNDANASLIPSL---SLTMKGNSHFTINDPIIVISTEGELV------ 411
Query: 519 ICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
CL I+ SE++ I+G + G VV+D + W K C +
Sbjct: 412 YCLAIVKSSELN-----IIGQNYMTGYRVVFDREKLVLAWKKFDCYD 453
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 99/381 (25%), Positives = 167/381 (43%), Gaps = 47/381 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYK 256
G + + +G PP+ + +DTGSDLTWIQ + PC +C + A+P++ P N +
Sbjct: 23 GEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSE-PCRACFEQADPIFDPSKSSTYNKIACS 81
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
S C ++ C C Y Y D S + G +++ + T G K FG
Sbjct: 82 SSACADLLGTQT---CSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEEVK----FG 134
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG---GYM 373
+ G +T +GILGL + VS+PSQL S ++ N +CL G M
Sbjct: 135 ASVYNTGTFGDT--GGEGILGLGQGPVSMPSQLGS--VLGNKFSYCLVDWLSAGSETSTM 190
Query: 374 FLGHDLVPSWGMAWVPML---DSPFMELYHTEILKINYGSSPLNLGARNSQV-----GWA 425
+ G VPS + + P++ D P Y+ + I+ G S L++ ++ G
Sbjct: 191 YFGDAAVPSGEVQYTPIVPNADHP--TYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGT 248
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
+ D+G++ TY ++ ++ L+A+ S S L +C+ + + F
Sbjct: 249 IIDSGTTITYLQQEVFNALVAAY--TSQVRYPTTTSATGLDLCFNTRGTGSPV------F 300
Query: 486 KTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
+T+H G ++ + IS E + ICL + + I G+I +
Sbjct: 301 PAMTIHLDGVHLELPTANTFISLETNI-------ICLAF---ASALDFPIAIFGNIQQQN 350
Query: 545 QLVVYDNVNKRIGWAKSHCMN 565
+VYD N RIG+A + C +
Sbjct: 351 FDIVYDLDNMRIGFAPADCAS 371
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 98/375 (26%), Positives = 169/375 (45%), Gaps = 36/375 (9%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
+G Y + +G+PP+ + + +DTGSDL W+QC PC C + P + P ++ +
Sbjct: 36 NGEYLMTLTLGSPPQSFDVIVDTGSDLNWVQC-LPCRVCYQQPGPKFDPSKSR--SFRKA 92
Query: 259 LCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
C + N + C C Y+ Y D S++ G LA + + L G+ + PN FG
Sbjct: 93 ACTDNLCNVSALPLKACAANVCQYQYTYGDQSNTNGDLAFETISLNNGAGTQSVPNFAFG 152
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT-NAGGGGYMFL 375
C L T G++GL + +SL SQL+ N +CL + N+ +
Sbjct: 153 CGTQN----LGTFAGAAGLVGLGQGPLSLNSQLSH--TFANKFSYCLVSLNSLSASPLTF 206
Query: 376 GHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA------RNSQVGWALFDT 429
G + ++++ Y+ ++ I G PLNL +++ G + D+
Sbjct: 207 GSIAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQSTGRGGTIIDS 266
Query: 430 GSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWR-AKFPIRSIVDVKQFFKTL 488
G++ T T AYS ++ + E + LD S L +C+ A S+ D+ F+
Sbjct: 267 GTTITMLTLPAYSAVLRAY-ESFVNYPRLDGSAYGLDLCFNIAGVSNPSVPDMVFKFQ-- 323
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
G+ +Q+ + LV + +CL + GS+ + I+G+I + LVV
Sbjct: 324 ----GADFQMRGENLFV-----LVDTSATTLCLA-MGGSQGFS----IIGNIQQQNHLVV 369
Query: 549 YDNVNKRIGWAKSHC 563
YD K+IG+A + C
Sbjct: 370 YDLEAKKIGFATADC 384
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 98/387 (25%), Positives = 170/387 (43%), Gaps = 42/387 (10%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI- 252
G +Y G YF + VG P R ++ +DTGSDL W+QC PC SC K A+P++ PR +
Sbjct: 121 GLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQ-PCKSCYKQADPIFDPRNSSSF 179
Query: 253 --LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
+P LC ++ + G +C Y++ Y D S S+G + D L + +++
Sbjct: 180 QRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKAMS- 238
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQL---ASQGIIKNVVGHCLTTNA 367
V FGC +D +GL L K+S PSQ+ ++ N +CL +
Sbjct: 239 --VAFGCGFDNEGLFAGAAGLLG----LGAGKLSFPSQIFASSTNSSTANSFSYCLVDRS 292
Query: 368 G----GGGYMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV 422
+ G +PS A P+L +P ++ Y+ ++ ++ G + L + ++ Q+
Sbjct: 293 NPMTRSSSSLIFGAAAIPSTA-ALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQL 351
Query: 423 -----GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRS 477
G + D+G+S T F Y+ + + + +++ L C+ F ++
Sbjct: 352 SQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTN-LPSAPRYSLFDTCY--NFSGKA 408
Query: 478 IVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTII 536
VDV L LHF + + P YL+ I+ G+ CL S I
Sbjct: 409 SVDV----PALVLHFEN-----GADLQLPPTNYLIPINTAGSFCLAFAPTSMELG----I 455
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G+I + + +D + +A C
Sbjct: 456 IGNIQQQSFRIGFDLQKSHLAFAPQQC 482
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 95/392 (24%), Positives = 171/392 (43%), Gaps = 40/392 (10%)
Query: 170 SKINKKLVSSNAVA-VDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWI 228
S+I+K L ++V+ +DS ++ G++ G YF + +G P R L DTGSDLTW
Sbjct: 112 SRISKNLGQDSSVSELDSVTLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWT 171
Query: 229 QCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQ--RNHKPGYCETCQQCDYEIE 283
QC+ SC K + ++ P + +LC ++ ++PG + + C Y I+
Sbjct: 172 QCEPCARSCYKQQDAIFDPSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQ 231
Query: 284 YADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKV 343
Y D S S+G +R+ L +T + N +FGC + QGL + G++GL R +
Sbjct: 232 YGDSSFSVGYFSRERLSVTATD---IVDNFLFGCGQNNQGLFGGSA----GLIGLGRHPI 284
Query: 344 SLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLD-SPFMELYHTE 402
S Q A+ + + + +CL + G + G + + + P S Y +
Sbjct: 285 SFVQQTAA--VYRKIFSYCLPATSSSTGRLSFG--TTTTSYVKYTPFSTISRGSSFYGLD 340
Query: 403 ILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASD 462
I I+ G + L + + G A+ D+G+ T AY+ L ++ ++ G+ S
Sbjct: 341 ITGISVGGAKLPVSSSTFSTGGAIIDSGTVITRLPPTAYTALRSAFRQ----GMSKYPSA 396
Query: 463 PTLPV---CWR-AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN 518
L + C+ + + + SI + F + P+G L ++
Sbjct: 397 GELSILDTCYDLSGYEVFSIPKIDFSFAG------------GVTVQLPPQGILYVASAKQ 444
Query: 519 ICLGILDGSEVHNGSTIILGDISLRGQLVVYD 550
+CL + + I G++ + VVYD
Sbjct: 445 VCLAFAANGD--DSDVTIYGNVQQKTIEVVYD 474
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 103/372 (27%), Positives = 168/372 (45%), Gaps = 49/372 (13%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS--CAKGANPLYKPRMGNI---LPYK 256
Y + G P P + +DTGSD++W+QC PCSS C +PLY P + +P
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK-PCSSGQCFPQKDPLYDPSHSSTYSAVPCA 171
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
+C ++ + C + +QC + I YAD +S++G ++D+ LT+ G++ + N FG
Sbjct: 172 SDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDK--LTLAPGAIVQ-NFYFG 228
Query: 317 CAYDQ---QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYM 373
C + + +GL DG+LGL R + SL ++ V +CL + + G++
Sbjct: 229 CGHGKHAVRGLF-------DGVLGLGRLRESLGARYG------GVFSYCLPSVSSKPGFL 275
Query: 374 FLGHDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQV-GWALFDTGS 431
LG PS G + PM P + T L IN G L+L R S G + D+G+
Sbjct: 276 ALGAGKNPS-GFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDL--RPSAFSGGMIVDSGT 332
Query: 432 SYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLH 491
T AY L ++ ++ +L D L C+ +++V K LT
Sbjct: 333 VITGLQSTAYRALRSAFRKAMEAYRLLPNGD--LDTCYNLT-GYKNVVVPK---IALTFT 386
Query: 492 FGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
G+ + P G LV N CL + +GS +LG+++ R V++D
Sbjct: 387 GGATINL------DVPNGILV-----NGCLAFAESGP--DGSAGVLGNVNQRAFEVLFDT 433
Query: 552 VNKRIGWAKSHC 563
+ G+ C
Sbjct: 434 STSKFGFRAKAC 445
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 103/372 (27%), Positives = 168/372 (45%), Gaps = 49/372 (13%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS--CAKGANPLYKPRMGNI---LPYK 256
Y + G P P + +DTGSD++W+QC PCSS C +PLY P + +P
Sbjct: 79 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK-PCSSGQCFPQKDPLYDPSHSSTYSAVPCA 137
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
+C ++ + C + +QC + I YAD +S++G ++D+ LT+ G++ + N FG
Sbjct: 138 SDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDK--LTLAPGAIVQ-NFYFG 194
Query: 317 CAYDQ---QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYM 373
C + + +GL DG+LGL R + SL ++ V +CL + + G++
Sbjct: 195 CGHGKHAVRGLF-------DGVLGLGRLRESLGARYG------GVFSYCLPSVSSKPGFL 241
Query: 374 FLGHDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQV-GWALFDTGS 431
LG PS G + PM P + T L IN G L+L R S G + D+G+
Sbjct: 242 ALGAGKNPS-GFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDL--RPSAFSGGMIVDSGT 298
Query: 432 SYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLH 491
T AY L ++ ++ +L D L C+ +++V K LT
Sbjct: 299 VITGLQSTAYRALRSAFRKAMEAYRLLPNGD--LDTCYNLT-GYKNVVVPKI---ALTFT 352
Query: 492 FGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
G+ + P G LV N CL + +GS +LG+++ R V++D
Sbjct: 353 GGATINL------DVPNGILV-----NGCLAFAESGP--DGSAGVLGNVNQRAFEVLFDT 399
Query: 552 VNKRIGWAKSHC 563
+ G+ C
Sbjct: 400 STSKFGFRAKAC 411
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 125/497 (25%), Positives = 207/497 (41%), Gaps = 64/497 (12%)
Query: 84 KLFLFLAISIFALILYGSVFSYTLQDRYKSNNDDENKESFVFPLYHKFGIREVSQRDAEF 143
KL FL F ++L S F+ + YK + N+ S + LYH G + ++
Sbjct: 10 KLVCFLT---FMIVLATSSFAKL--EEYKLS---ANQSSILLNLYHVHGDASSLEPNSSS 61
Query: 144 KLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSS--IFPLRGNI----- 196
+ D E V + S++ KK V + + S + P NI
Sbjct: 62 SFCDILSRDEEHV---------KFLSSRLRKKDVQGASFSRHKSGHLLEPNSANIPLNPG 112
Query: 197 --YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI-- 252
G Y+ + +G+PP+ Y + +DTGS L+W+QC C +PL++P N
Sbjct: 113 LSIGSGNYYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYR 172
Query: 253 -LPYKDSLC--MEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLT 309
L S C ++ + P C C Y Y D S SMG L+RD L LT S T
Sbjct: 173 PLYCSSSECSLLKAATLNDP-LCTASGVCVYTASYGDASYSMGYLSRDLLTLT---PSQT 228
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAG 368
P+ +GC D +GL K GI+GL+R K+S+ +QL+ + +CL T+ +
Sbjct: 229 LPSFTYGCGQDNEGL----FGKAAGIVGLARDKLSMLAQLSPK--YGYAFSYCLPTSTSS 282
Query: 369 GGGYMFLGHDLVPSWGMAWVPML-DSPFMELYHTEILKINYGSSPLNLGARNSQVGWALF 427
GGG++ +G + PS + PM+ +S LY + I P+ + A QV +
Sbjct: 283 GGGFLSIG-KISPS-SYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQV-PTII 339
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV-DVKQFFK 486
D+G+ T Y+ L + ++ S + L C++ S +++ F+
Sbjct: 340 DSGTVVTRLPISIYAALREAFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQ 399
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
+ L+ + KG CL +++ I+G+ +
Sbjct: 400 G------------GADLSLRAPNILIEADKGIACLAFASSNQI-----AIIGNHQQQTYN 442
Query: 547 VVYDNVNKRIGWAKSHC 563
+ YD +IG+A C
Sbjct: 443 IAYDVSASKIGFAPGGC 459
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/378 (27%), Positives = 169/378 (44%), Gaps = 45/378 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP---RMGNILPYK 256
G YFT + VG P R Y+ +DTGSD+ W+QC APC C A+P++ P R +P
Sbjct: 127 GEYFTRIGVGTPARYVYMVLDTGSDVVWLQC-APCRKCYTQADPVFDPTKSRTYAGIPCG 185
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
LC +R PG + C Y++ Y D S + G + + LT +T+ V G
Sbjct: 186 APLC---RRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTET--LTFRRTRVTR--VALG 238
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG--GYMF 374
C +D +GL + L R ++S P Q + +CL + +
Sbjct: 239 CGHDNEGLFIGAAGLLG----LGRGRLSFPVQTGRR--FNQKFSYCLVDRSASAKPSSVV 292
Query: 375 LGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLN-LGARNSQV-----GWALF 427
G V S + P++ +P ++ Y+ E+L I+ G SP+ L A ++ G +
Sbjct: 293 FGDSAV-SRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVII 351
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKT 487
D+G+S T T+ AY L + + V + L A C F + + +VK T
Sbjct: 352 DSGTSVTRLTRPAYIALRDAFR-VGASHLKRAAEFSLFDTC----FDLSGLTEVK--VPT 404
Query: 488 LTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
+ LHF G+ + +T + I + G+ C + +G +II G+I +G
Sbjct: 405 VVLHFRGADVSLPATNYLIP------VDNSGSFCFAF---AGTMSGLSII-GNIQQQGFR 454
Query: 547 VVYDNVNKRIGWAKSHCM 564
V +D R+G+A C+
Sbjct: 455 VSFDLAGSRVGFAPRGCV 472
>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 568
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 105/395 (26%), Positives = 168/395 (42%), Gaps = 65/395 (16%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPL---------YKPR--- 248
LY+ + VG P + + +DTGSDL W+ C+ CSSC N Y P
Sbjct: 103 LYYANVSVGTPSLDFLVALDTGSDLFWLPCE--CSSCFTYLNTSNGGKFMLNHYSPNDST 160
Query: 249 MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTIENGS 307
+ +P SLC N C YE+ Y + ++SS+G L D LHL ++ S
Sbjct: 161 TSSTVPCTSSLCNRCTSNQ--------NVCPYEMRYLSANTSSIGYLVEDVLHLATDD-S 211
Query: 308 LTKP---NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
L KP + FGC Q G+ T +G++GL K+S+PS LA QG+ N C
Sbjct: 212 LLKPVEAKITFGCGTVQTGIFATT-AAPNGLIGLGMEKISVPSFLADQGLTSNSFSMCF- 269
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGW 424
A G G + G D P+ +PF + + + + + +N+G + V +
Sbjct: 270 -GADGYGRIDFG-DTGPA------DQKQTPFNTMLEYQSYNVTF--NVINVGGEPNDVPF 319
Query: 425 -ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV-- 481
A+FD+G+S+TY T+ AYS + + G+ L P FP ++
Sbjct: 320 TAIFDSGTSFTYLTEPAYSTITKQMDA----GMKLKRYSLFGP-----NFPFEYCYEIPP 370
Query: 482 -KQFFKTLTLHFGSKWQ--------IVSTKFHISPEGYLVISKKGNICLGILDGSEVHNG 532
+ F+ LTL+F K V +S + CL I +++
Sbjct: 371 GAKEFQYLTLNFTMKGGDEFTPTDIFVFLPVDVSTMNIIFEETTHVACLAIAKSTDID-- 428
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPG 567
++G + G + ++ +GW+ S C + G
Sbjct: 429 ---LIGQNFMTGYRITFNRDQMVLGWSSSDCYDNG 460
>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 529
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 105/393 (26%), Positives = 171/393 (43%), Gaps = 57/393 (14%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP--------LYKPRMGNI 252
L++ + +G P + + +DTGSDL W+ CD C CA ++P +Y PR +
Sbjct: 107 LHYAVVALGTPNVTFLVALDTGSDLFWVPCD--CLKCAPLSSPDYGNLKFDVYSPRKSST 164
Query: 253 ---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTIENG-- 306
+P ++C ++Q C Y+IEY +D++SS GVL D ++L E+G
Sbjct: 165 SRKVPCSSNMC-DLQTECS----AASNSCPYKIEYLSDNTSSKGVLVEDVMYLATESGHS 219
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
+T+ + FGC Q G L + +G+LGL S+PS LASQG+ N C +
Sbjct: 220 KITQAPITFGCGQVQTGSFLGS-AAPNGLLGLGMDSKSVPSLLASQGVAANSFSMCFGED 278
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWAL 426
GH + L++P H I+ + + G S A+
Sbjct: 279 ---------GHGRINFGDTGSADQLETPLNIYKHNPYYNISIVGA-MAGGKTFSTKFSAV 328
Query: 427 FDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
D+G+S+T + Y+E+ ++ K+V + +D +LP + + I S V
Sbjct: 329 VDSGTSFTALSDPMYTEITSAFDKQVKEK---RNPADSSLP--FEYCYTISSKGAVSPPN 383
Query: 486 KTLTLHFGSKWQ-----IVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDI 540
+LT GS + I T SP GY CL I+ V+ ++G+
Sbjct: 384 ISLTAKGGSVFPVKDPIITITDISSSPVGY---------CLAIMKSEGVN-----LIGEN 429
Query: 541 SLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
+ G VV+D +GW +C + LP
Sbjct: 430 FMSGLKVVFDRERLVLGWKSFNCYSVDHSTKLP 462
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 125/274 (45%), Gaps = 25/274 (9%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN----- 242
PL G PD GLY+ + +G P + YY+ +DTGSD+ W+ C C C + +
Sbjct: 66 LPLGGTGRPDIPGLYYAKIGIGTPAKSYYVQVDTGSDIMWVNC-IQCKQCPRRSTLGIEL 124
Query: 243 PLY---KPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL 299
LY + G ++ D C +I G C+ C Y Y D SS+ G +D +
Sbjct: 125 TLYNIDESDSGKLVSCDDDFCYQISGGPLSG-CKANMSCPYLEIYGDGSSTAGYFVKDVV 183
Query: 300 HLTIENGSL----TKPNVVFGCAYDQQGLLLNTLVKT-DGILGLSRAKVSLPSQLASQGI 354
G L +V+FGC Q G L ++ + DGILG +A S+ SQLAS G
Sbjct: 184 QYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGR 243
Query: 355 IKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN 414
+K + HCL GGG + +G + P M P++ P Y+ + + G L
Sbjct: 244 VKKIFAHCLDGRNGGGIFA-IGRVVQPKVNMT--PLV--PNQPHYNVNMTAVQVGQEFLT 298
Query: 415 LGARNSQVG---WALFDTGSSYTYFTKQAYSELI 445
+ A Q G A+ D+G++ Y + Y L+
Sbjct: 299 IPADLFQPGDRKGAIIDSGTTLAYLPEIIYEPLV 332
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 107/398 (26%), Positives = 166/398 (41%), Gaps = 54/398 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCD---APCSSCAKGA---NPLY---KPRMG 250
G Y M G PP+ L DTGSDL W+QC AP + C K A P + K
Sbjct: 51 GQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATL 110
Query: 251 NILPYKDSLCMEI--QRNHKPGYCETCQ--QCDYEIEYADHSSSMGVLARDELHLTIENG 306
+++P + C+ + R H P C C Y +YAD SS+ G LARD TI NG
Sbjct: 111 SVVPCSAAQCLLVPAPRGHGPA-CSPAAPVPCGYAYDYADGSSTTGFLARDT--ATISNG 167
Query: 307 S---LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
+ V FGC QG + T G++GL + ++S P+Q S + +CL
Sbjct: 168 TSGGAAVRGVAFGCGTRNQG---GSFSGTGGVIGLGQGQLSFPAQ--SGSLFAQTFSYCL 222
Query: 364 TTNAGG-----GGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGA 417
GG ++FLG + A+ P++ +P Y+ ++ I G+ L +
Sbjct: 223 LDLEGGRRGRSSSFLFLGRPERRA-AFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPG 281
Query: 418 RNSQV-----GWALFDTGSSYTYFTKQAYSELIAS------LKEVSSDGLVLDASDPTLP 466
+ G + D+GS+ TY AY L+++ L + S L
Sbjct: 282 SEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQG----LE 337
Query: 467 VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDG 526
+C+ S F LT+ F + + YLV CL I
Sbjct: 338 LCYNVSS-SSSSAPANGGFPRLTIDFAQGLSL-----ELPTGNYLVDVADDVKCLAIRP- 390
Query: 527 SEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCM 564
+ + +LG++ +G V +D + RIG+A++ C+
Sbjct: 391 -TLSPFAFNVLGNLMQQGYHVEFDRASARIGFARTECV 427
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 110/396 (27%), Positives = 174/396 (43%), Gaps = 56/396 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR----MGNILPY 255
G YF M VG PP+ +L +DTGSDL+WIQCD PC C + Y P+ NI Y
Sbjct: 169 GEYFLDMFVGTPPKHVWLILDTGSDLSWIQCD-PCYDCFEQNGSHYYPKDSSTYRNISCY 227
Query: 256 KDSLCMEIQRNHKPGYCET-CQQCDYEIEYADHSSSMGVLARD--ELHLTIENGSLTKPN 312
D C + + +C+ Q C Y +YAD S++ G A + ++LT NG
Sbjct: 228 -DPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQ 286
Query: 313 VV---FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TN 366
VV FGC + +G G+LGL R +S PSQ+ Q I + +CLT +N
Sbjct: 287 VVDVMFGCGHWNKGFFYG----ASGLLGLGRGPISFPSQI--QSIYGHSFSYCLTDLFSN 340
Query: 367 AGGGGYMFLGHD--LVPSWGMAWVPML---DSPFMELYHTEILKINYGSSPLNLGARNSQ 421
+ G D L+ + + + +L ++P Y+ +I I G L++ +
Sbjct: 341 TSVSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWH 400
Query: 422 VGWA------------LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCW 469
W+ + D+GS+ T+F AY ++I E + A D + C+
Sbjct: 401 --WSSEGAAADAGGGTIIDSGSTLTFFPDSAY-DIIKEAFEKKIKLQQIAADDFVMSPCY 457
Query: 470 RAKFPIRSIVDVKQFFKTLTLHF--GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS 527
+ V++ F +HF G W + + E V ICL I+ +
Sbjct: 458 NVSGAMMQ-VELPDF----GIHFADGGVWNFPAENYFYQYEPDEV------ICLAIMK-T 505
Query: 528 EVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
H+ TII G++ + ++YD R+G++ C
Sbjct: 506 PNHSHLTII-GNLLQQNFHILYDVKRSRLGYSPRRC 540
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 105/377 (27%), Positives = 171/377 (45%), Gaps = 45/377 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G YFT + VG PP+ Y+ +DTGSD+ W+QC PC+ C + ++ P +P
Sbjct: 128 GEYFTRLGVGTPPKYLYMVLDTGSDVVWLQCK-PCTKCYSQTDQIFDPSKSKSFAGIPCY 186
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
LC +R PG C Y++ Y D S + G + + LT ++ P V G
Sbjct: 187 SPLC---RRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTET--LTFRRAAV--PRVAIG 239
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--TNAGGGGYMF 374
C +D +GL V G+LGL R +S P+Q ++ N +CLT T + +
Sbjct: 240 CGHDNEGL----FVGAAGLLGLGRGGLSFPTQTGTR--FNNKFSYCLTDRTASAKPSSIV 293
Query: 375 LGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLN------LGARNSQVGWALF 427
G V S + P++ +P ++ Y+ E+L I+ G +P+ ++ G +
Sbjct: 294 FGDSAV-SRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVII 352
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKT 487
D+G+S T T+ AY L + + V + L C+ + + +VK T
Sbjct: 353 DSGTSVTRLTRPAYVSLRDAFR-VGASHLKRAPEFSLFDTCYD----LSGLSEVK--VPT 405
Query: 488 LTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
+ LHF + VS + YLV + G+ C + +G +II G+I +G
Sbjct: 406 VVLHF--RGADVS----LPAANYLVPVDNSGSFCFAF---AGTMSGLSII-GNIQQQGFR 455
Query: 547 VVYDNVNKRIGWAKSHC 563
VV+D R+G+A C
Sbjct: 456 VVFDLAGSRVGFAPRGC 472
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 118/424 (27%), Positives = 191/424 (45%), Gaps = 55/424 (12%)
Query: 160 VNDGI-IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGL-YFTYMIVGNPPRPYYL 217
V DG+ +R ++ I K+ SS+ +A S + PL I L Y M +G+ + +
Sbjct: 79 VLDGLHVRSIQNHIRKR-TSSSQIADSSETQVPLTSGIKFQTLNYIVTMGLGS--QNMSV 135
Query: 218 DMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQ-RNHKPGYC---- 272
+DTGSDLTW+QC+ PC SC PL+KP Y+ LC ++ + G C
Sbjct: 136 IVDTGSDLTWVQCE-PCRSCYNQNGPLFKPSTSP--SYQPILCNSTTCQSLELGACGSDP 192
Query: 273 ETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKT 332
T CDY + Y D S + G L ++L G ++ N VFGC + +GL
Sbjct: 193 STSATCDYVVNYGDGSYTSGELGIEKLGF----GGISVSNFVFGCGRNNKGLFGG----A 244
Query: 333 DGILGLSRAKVSLPSQLASQGIIKNVVGHCL--TTNAGGGGYMFLGH------DLVPSWG 384
G++GL R+++S+ SQ + V +CL T AG G + +G+ ++ P
Sbjct: 245 SGLMGLGRSELSMISQ--TNATFGGVFSYCLPSTDQAGASGSLVMGNQSGVFKNVTP--- 299
Query: 385 MAWVPMLDSPFMELYHTEILK---INYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAY 441
+A+ ML P ++L + IL I+ G L++ A + G + D+G+ + Y
Sbjct: 300 IAYTRML--PNLQLSNFYILNLTGIDVGGVSLHVQASSFGNGGVILDSGTVISRLAPSVY 357
Query: 442 SELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVST 501
L A E S G L C F + V T++++F ++
Sbjct: 358 KALKAKFLEQFS-GFPSAPGFSILDTC----FNLTGYDQVN--IPTISMYFEGNAEL--- 407
Query: 502 KFHISPEG--YLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWA 559
++ G YLV +CL + S+ + I+G+ R Q V+YD ++G+A
Sbjct: 408 --NVDATGIFYLVKEDASRVCLALASLSDEYEMG--IIGNYQQRNQRVLYDAKLSQVGFA 463
Query: 560 KSHC 563
K C
Sbjct: 464 KEPC 467
>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 531
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 107/389 (27%), Positives = 167/389 (42%), Gaps = 57/389 (14%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR-------MGNIL 253
LY+ + VG PP + + +DTGSDL W+ C+ ++C + + P+ N
Sbjct: 101 LYYANVSVGTPPSSFLVALDTGSDLFWLPCNCG-TTCIRDLEDIGVPQSVPLNLYTPNAS 159
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLT--K 310
S+ +R C + + C Y+I Y++ + + G L +D LHL E+ +LT K
Sbjct: 160 TTSSSIRCSDKRCFGSKKCSSPKSICPYQISYSNSTGTTGTLLQDVLHLATEDENLTPVK 219
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
NV GC Q GL +G+LGL S+PS LA I + C G
Sbjct: 220 TNVTLGCGQKQTGLFQRN-NSVNGVLGLGIKGYSVPSLLAKANITADSFSMCFGRVIGNV 278
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFMEL-----YHTEILKINYGSSPLNLGARNSQVGWA 425
G + G + ++PF+ + Y + ++ G P +G R +A
Sbjct: 279 GRISFGDK-------GYTDQEETPFISVAPSTAYGLNVTGVSVGGDP--VGTRL----FA 325
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDAS---DPTLP--VCWRAKFPIRSIVD 480
FDTGSS+T+ + AY L S D LV D DP LP C+ SI
Sbjct: 326 KFDTGSSFTHLMEPAYGVLTKSF-----DDLVEDKRRPVDPELPFEFCYDLSPNATSI-- 378
Query: 481 VKQF-FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI--CLGILD--GSEVHNGSTI 535
+F F +T GSK + + F + +GN+ CLG+L G +++
Sbjct: 379 --EFPFVEMTFVGGSKIILNNPFFTARTQAR---HGEGNVMYCLGVLKSVGLKIN----- 428
Query: 536 ILGDISLRGQLVVYDNVNKRIGWAKSHCM 564
++G + G +V+D +GW S C
Sbjct: 429 VIGQNFVAGYRIVFDRERMILGWKPSLCF 457
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 100/370 (27%), Positives = 158/370 (42%), Gaps = 37/370 (10%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS-SCAKGANPLYKPRMGNILPYKD 257
G Y + G P R + DTGSD+ W+QC PC+ C PL+ P + + Y++
Sbjct: 13 SGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCK-PCAVRCYAQQEPLFDPSLSST--YRN 69
Query: 258 SLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGC 317
C E + C Y + Y D SS++G LA D LT N +FGC
Sbjct: 70 VSCTEPACVGLSTRGCSSSTCLYGVFYGDGSSTIGFLAMDTFMLTPAQ---KFKNFIFGC 126
Query: 318 AYDQQGLLLNTLVKTDGILGLSRAKV-SLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
+ GL T G++GL R+ SL SQ+A + NV +CL + + GY+ +G
Sbjct: 127 GQNNTGLFQG----TAGLVGLGRSSTYSLNSQVAPS--LGNVFSYCLPSTSSATGYLNIG 180
Query: 377 HDLVPSWGMAWVPML-DSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTY 435
+ P + ML D+ LY +++ I+ G + L+L + Q + D+G+ T
Sbjct: 181 N---PQNTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSVGTIIDSGTVITR 237
Query: 436 FTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSK 495
AYS L +++ + L + L C+ S+V + + LHF
Sbjct: 238 LPPTAYSALKTAVRAAMTQ-YTLAPAVTILDTCYDFS-RTTSVV-----YPVIVLHFAGL 290
Query: 496 WQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI--ILGDISLRGQLVVYDNVN 553
I G + +CL ++ ST+ I+G++ V YDN
Sbjct: 291 ------DVRIPATGVFFVFNSSQVCLAFAGNTD----STMIGIIGNVQQLTMEVTYDNEL 340
Query: 554 KRIGWAKSHC 563
KRIG++ C
Sbjct: 341 KRIGFSAGAC 350
>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 320
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 88/290 (30%), Positives = 126/290 (43%), Gaps = 30/290 (10%)
Query: 284 YADHSSSMGVLARDELHLTIENGSL----TKPNVVFGCAYDQQGLLLNTLVKTDGILGLS 339
Y D SS+ G L +D +HL + G+ T ++FGC Q G L + DGI+G
Sbjct: 2 YGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFG 61
Query: 340 RAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELY 399
++ S SQLASQG +K HCL N GGG + +G + P + PML Y
Sbjct: 62 QSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFA-IGEVVSPK--VKTTPMLSKS--AHY 116
Query: 400 HTEILKINYGSSPLNLGARNSQVG---WALFDTGSSYTYFTKQAYSELIASLKEVSSDGL 456
+ I G+S L L + G + D+G++ Y Y+ L+ +
Sbjct: 117 SVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEIL------- 169
Query: 457 VLDASDPTLPV-CWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISK 515
AS P L + + F D F T+T F S + P YL +
Sbjct: 170 ---ASHPELTLHTVQESFTCFHYTDKLDRFPTVTFQFDK-----SVSLAVYPREYLFQVR 221
Query: 516 KGNICLGILDGSEVHNG--STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ C G +G G S ILGD++L +LVVYD N+ IGW +C
Sbjct: 222 EDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNC 271
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 100/382 (26%), Positives = 155/382 (40%), Gaps = 54/382 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G YF + VG+PP YL +D+GSD+ W+QC PC C +PL+ P + +
Sbjct: 128 GEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSCG 186
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
++C + +CDY + Y D S + G LA + L L G V G
Sbjct: 187 SAICRTLSGTGCG-GGGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIG 241
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT-NAGGGGYMFL 375
C + GL V G+LGL +SL QL G V +CL + AGG G + L
Sbjct: 242 CGHRNSGL----FVGAAGLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAGGAGSLVL 295
Query: 376 GHDLVPSWGMAWVPML-DSPFMELYHTEILKINYGSSPLNLGARNSQV-----GWALFDT 429
G G WVP++ ++ Y+ + I G L L Q+ G + DT
Sbjct: 296 GRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDT 355
Query: 430 GSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLT 489
G++ T ++AY+ L + D + LP + P S++D
Sbjct: 356 GTAVTRLPREAYAALRGA----------FDGAMGALP-----RSPAVSLLD-----TCYD 395
Query: 490 LHFGSKWQIVSTKFHIS-------PEGYLVISKKGNI-CLGILDGSEVHNGSTIILGDIS 541
L + ++ + F+ P L++ G + CL S ILG+I
Sbjct: 396 LSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFCLAFAPSSS----GISILGNIQ 451
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
G + D+ N +G+ + C
Sbjct: 452 QEGIQITVDSANGYVGFGPNTC 473
>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 544
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 111/399 (27%), Positives = 169/399 (42%), Gaps = 65/399 (16%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN------------PLYKPR 248
L+F + VG PP + + +DTGSDL W+ C+ C+SC +G L K
Sbjct: 112 LHFANVSVGTPPLWFLVALDTGSDLFWLPCN--CTSCVRGLKTQNGKVIDLNIYELDKSS 169
Query: 249 MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTIENGS 307
+P ++C + Q H G C YE+EY ++ +SS G L D LHL +N
Sbjct: 170 TRKNVPCNSNMCKQTQ-CHSSG-----SSCRYEVEYLSNDTSSSGFLVEDVLHLITDNDQ 223
Query: 308 LT--KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
+ GC Q G+ LN +G+ GL VS+PS LA +G+I + C +
Sbjct: 224 TKDIDTQITIGCGQVQTGVFLNG-AAPNGLFGLGMENVSVPSILAQKGLISDSFSMCFGS 282
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
+ G G + G G + +S Y+ I +I G G A
Sbjct: 283 D--GSGRITFGDTGSSDQGKTPFNLRES--HPTYNVTITQIIVG------GYAADHEFHA 332
Query: 426 LFDTGSSYTYFTKQAY---SELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
+FD+G+S+TY AY SE SL + + + SD C+ P ++I +
Sbjct: 333 IFDSGTSFTYLNDPAYTLISEKFNSLVKANRHSPLSPDSDLPFEYCYDMS-PDQTI---E 388
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN-ICLGI-------LDGSE------ 528
F LT+ G + + +S E +GN +CLGI + G E
Sbjct: 389 VPFLNLTMKGGDDYYVTDPIVPVSSE------VEGNLLCLGIQKSDNLNIIGREYTTEEE 442
Query: 529 -VHNGSTIILGDIS---LRGQLVVYDNVNKRIGWAKSHC 563
+H II I + G +V+D N +GW +S+C
Sbjct: 443 FLHLKHMIIKFFIQKNFMTGYRIVFDRENMNLGWKESNC 481
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 117/467 (25%), Positives = 193/467 (41%), Gaps = 57/467 (12%)
Query: 118 ENKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLV 177
+ +E LYH G+ + F + D E V R S++ K
Sbjct: 26 QKQEGMQLNLYHVKGLDSSQTSTSPFSFSDMITKDEERV---------RFLHSRLTNKES 76
Query: 178 SSNAVAVD-----SSSIFPLRGNI-YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCD 231
+SN+ D S PL+ + G Y+ + VG P + + + +DTGS L+W+QC
Sbjct: 77 ASNSATTDKLGGPSLVSTPLKSGLSIGSGNYYVKIGVGTPAKYFSMIVDTGSSLSWLQCQ 136
Query: 232 APCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRN-------HKPGYCETCQQCDYEIEY 284
C +P++ P + YK C Q + + PG C Y+ Y
Sbjct: 137 PCVIYCHVQVDPIFTPSVSKT--YKALSCSSSQCSSLKSSTLNAPGCSNATGACVYKASY 194
Query: 285 ADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVS 344
D S S+G L++D L LT + V+GC D QGL ++ GI+GL+ K+S
Sbjct: 195 GDTSFSIGYLSQDVLTLTPS--AAPSSGFVYGCGQDNQGL----FGRSAGIIGLANDKLS 248
Query: 345 LPSQLASQGIIKNVVGHCLTT------NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFM-E 397
+ QL+++ N +CL + N+ G++ +G + S + P++ +P +
Sbjct: 249 MLGQLSNK--YGNAFSYCLPSSFSAQPNSSVSGFLSIGASSLSSSPYKFTPLVKNPKIPS 306
Query: 398 LYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLV 457
LY + I PL + A + V + D+G+ T Y+ L S + S
Sbjct: 307 LYFLGLTTITVAGKPLGVSASSYNV-PTIIDSGTVITRLPVAIYNALKKSFVMIMSKKYA 365
Query: 458 LDASDPTLPVCWRAKFPIRSIV-DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK 516
L C++ S V +++ F+ G+ ++ K H S LV +K
Sbjct: 366 QAPGFSILDTCFKGSVKEMSTVPEIRIIFRG-----GAGLEL---KVHNS----LVEIEK 413
Query: 517 GNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G CL I S + I+G+ + V YD N +IG+A C
Sbjct: 414 GTTCLAIAASSNPIS----IIGNYQQQTFTVAYDVANSKIGFAPGGC 456
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 108/384 (28%), Positives = 155/384 (40%), Gaps = 33/384 (8%)
Query: 193 RGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS--CAKGANPLYKPRMG 250
RG G Y + +G P R + DTGSDL+W+QC PCSS C K +PL+ P
Sbjct: 145 RGISVGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQC-GPCSSGGCYKQQDPLFAPSDS 203
Query: 251 NILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTI---ENGS 307
+ E + G +C YE+ Y D S + G L D L L N S
Sbjct: 204 STFSAVRCGARECRARQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANAS 263
Query: 308 LTK----PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
P VFGC + GL + DG+ GL R KVSL SQ A G +CL
Sbjct: 264 AENDNKLPGFVFGCGENNTGL----FGQADGLFGLGRGKVSLSSQAA--GKFGEGFSYCL 317
Query: 364 TTNAGGG-GYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQ 421
+++ GY+ LG + + PML+ Y+ +++ I + + + +
Sbjct: 318 PSSSSSAPGYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRV--SSPR 375
Query: 422 VGWALF-DTGSSYTYFTKQAYSELIAS-LKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV 479
V L D+G+ T +AY L A+ L + G L C+ + V
Sbjct: 376 VALPLIVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATV 435
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
+ + L F I S F G L ++K CL + S ILG+
Sbjct: 436 SI----PAVALVFAGGATI-SVDF----SGVLYVAKVAQACLAFAPNGDGR--SAGILGN 484
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
R VVYD ++IG+A C
Sbjct: 485 TQQRTLAVVYDVARQKIGFAAKGC 508
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 99/390 (25%), Positives = 171/390 (43%), Gaps = 48/390 (12%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI- 252
G +Y G YF + +G P R ++ +DTGSDL W+QC PC SC K A+P++ PR +
Sbjct: 46 GLLYGSGEYFVRLGLGTPARSLFMVVDTGSDLPWLQCQ-PCKSCYKQADPIFDPRNSSSF 104
Query: 253 --LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
+P LC ++ + G +C Y++ Y D S S+G + D L + +++
Sbjct: 105 QRIPCLSPLCKALEVHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKAMS- 163
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQL---ASQGIIKNVVGHCLTTNA 367
V FGC +D +GL L K+S PSQ+ ++ N +CL +
Sbjct: 164 --VAFGCGFDNEGLFAGAAGLLG----LGAGKLSFPSQIFASSTNSSTANSFSYCLVDRS 217
Query: 368 G----GGGYMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV 422
+ G +PS A P+L +P ++ Y+ ++ ++ G + L + ++ Q+
Sbjct: 218 NPMTRSSSSLIFGVAAIPSTA-ALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQL 276
Query: 423 -----GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT---LPVCWRAKFP 474
G + D+G+S T F Y A++++ + + S P C+ F
Sbjct: 277 SQSGSGGVIIDSGTSVTRFPTSVY----ATIRDAFRNATINLPSAPRYSLFDTCY--NFS 330
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGS 533
++ VDV L LHF + + P YL+ I+ G+ CL S
Sbjct: 331 GKASVDV----PALVLHFEN-----GADLQLPPTNYLIPINTAGSFCLAFAPTSMELG-- 379
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G+I + + +D + +A C
Sbjct: 380 --IIGNIQQQSFRIGFDLQKSHLAFAPQQC 407
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 98/380 (25%), Positives = 164/380 (43%), Gaps = 47/380 (12%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
D Y + +G PP Y + DTGSDL W QC PC+ C K NP++ PR + Y +
Sbjct: 57 DCEYLMELSIGTPPIKIYAEADTGSDLVWFQC-IPCTKCYKQQNPMFDPRSSS--SYTNI 113
Query: 259 LC-MEIQRNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVVF 315
C E C T Q+ C+Y YAD+S + GVLA++ L LT G + ++F
Sbjct: 114 TCGTESCNKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIF 173
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQ-GIIKNVVGHCLT---TNAGGGG 371
GC ++ G + G++GL R +SL SQ+ S G N+ CL T+
Sbjct: 174 GCGHNNSGFNDREM----GLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPSITS 229
Query: 372 YMFLGH-DLVPSWGMAWVPMLDSP----FMELYHTEILKINYGSSPLNLGARNSQV--GW 424
M G V G P++ F L + IN P + G+ + G
Sbjct: 230 QMNFGKGSEVLGNGTVSTPLISKDGTGYFATLLGISVEDINL---PFSNGSSLGTITKGN 286
Query: 425 ALFDTGSSYTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
L D+G++ TY ++ Y LI ++ +V+ + +D + +C++ +
Sbjct: 287 ILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRIDGYE----LCYQTPTNLNG------ 336
Query: 484 FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
TLT+HF + ++P + + N C + D +E + + G+ +
Sbjct: 337 --PTLTIHFEGGDVL------LTPAQMFIPVQDDNFCFAVFDTNEEY----VTYGNYAQS 384
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
L+ +D + + + + C
Sbjct: 385 NYLIGFDLERQVVSFKATDC 404
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 100/382 (26%), Positives = 154/382 (40%), Gaps = 54/382 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G YF + VG+PP YL +D+GSD+ W+QC PC C +PL+ P + +
Sbjct: 128 GEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSCG 186
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
++C + +CDY + Y D S + G LA + L L G V G
Sbjct: 187 SAICRTLSGTGCG-GGGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIG 241
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT-NAGGGGYMFL 375
C + GL V G+LGL +SL QL G V +CL + AGG G + L
Sbjct: 242 CGHRNSGL----FVGAAGLLGLGWGAMSLIGQLG--GAAGGVFSYCLASRGAGGAGSLVL 295
Query: 376 GHDLVPSWGMAWVPML-DSPFMELYHTEILKINYGSS--PLNLG---ARNSQVGWALFDT 429
G G WVP++ ++ Y+ + I G PL G G + DT
Sbjct: 296 GRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDT 355
Query: 430 GSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLT 489
G++ T ++AY+ L + D + LP + P S++D
Sbjct: 356 GTAVTRLPREAYAALRGA----------FDGAMGALP-----RSPAVSLLD-----TCYD 395
Query: 490 LHFGSKWQIVSTKFHIS-------PEGYLVISKKGNI-CLGILDGSEVHNGSTIILGDIS 541
L + ++ + F+ P L++ G + CL S ILG+I
Sbjct: 396 LSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFCLAFAPSSS----GISILGNIQ 451
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
G + D+ N +G+ + C
Sbjct: 452 QEGIQITVDSANGYVGFGPNTC 473
>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
Length = 473
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 109/414 (26%), Positives = 178/414 (42%), Gaps = 55/414 (13%)
Query: 174 KKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP 233
+ V +A+ ++F ++ L++ + VG P + + +DTGSDL W+ CD
Sbjct: 27 NETVRVDALGFFKVNVFMETCELFMRDLHYANVTVGTPSDWFMVALDTGSDLFWLPCD-- 84
Query: 234 CSSCAK-----GANPL----YKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYE 281
C++C + G + L Y P + +P +LC R P C Y+
Sbjct: 85 CTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPE-----SDCPYQ 139
Query: 282 IEY-ADHSSSMGVLARDELHLTIENGSLTK---PNVVFGCAYDQQGLLLNTLVKTDGILG 337
I Y ++ +SS GVL D LHL + N +K V FGC Q G+ + +G+ G
Sbjct: 140 IRYLSNGTSSTGVLVEDVLHL-VSNDKSSKAIPARVTFGCGQVQTGVFHDG-AAPNGLFG 197
Query: 338 LSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFME 397
L +S+PS LA +GI N C + G G + G S P+
Sbjct: 198 LGLEDISVPSVLAKEGIAANSFSMCFGND--GAGRISFGDK--GSVDQRETPLNIRQPHP 253
Query: 398 LYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLV 457
Y+ + KI+ G + +L A+FD+G+S+TY T AY+ + S ++ D
Sbjct: 254 TYNITVTKISVGGNTGDLEFD------AVFDSGTSFTYLTDAAYTLISESFNSLALDKR- 306
Query: 458 LDASDPTLPV--CWRAKFPIRS-----IVDVKQF-FKTLTLHFGSKWQIVSTKFHISPEG 509
+D LP C+ + P+ S D Q+ LT+ GS + + +H P
Sbjct: 307 YQTTDSELPFEYCYALRLPLYSGHHHPNKDSFQYPAVNLTMKGGSSYPV----YH--PLV 360
Query: 510 YLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + CL I+ ++ I+G + G VV+D +GW +S C
Sbjct: 361 VIPMKDTDVYCLAIMKIEDIS-----IIGQNFMTGYRVVFDREKLILGWKESDC 409
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 106/383 (27%), Positives = 152/383 (39%), Gaps = 53/383 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF + +G+PP YL +D+GSD+ W+QC PC C A+PL+ P S
Sbjct: 123 GEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCK-PCLECYAQADPLFDPASSATFS-AVSC 180
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
I R + C C+YE+ Y D S + G LA + L L G V GC +
Sbjct: 181 GSAICRTLRTSGCGDSGGCEYEVSYGDGSYTKGTLALETLTL----GGTAVEGVAIGCGH 236
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG-------GY 372
+GL V G+LGL +SL QL +CL + G G G
Sbjct: 237 RNRGL----FVGAAGLLGLGWGPMSLVGQLGGA--AGGAFSYCLASRGGSGSGAADAAGS 290
Query: 373 MFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQV-----GWAL 426
+ LG G WVP++ +P Y+ + I G L L Q+ G +
Sbjct: 291 LVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGVV 350
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT-LPVCWRAKFPIRSIVDVKQFF 485
DTG++ T ++AY+ L + V + G + A + L C+ V F+
Sbjct: 351 MDTGTAVTRLPQEAYAALRDAF--VGAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFY 408
Query: 486 ----KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI-CLGILDGSEVHNGSTIILGDI 540
TLTL P L++ G I CL S ILG+I
Sbjct: 409 FDGAATLTL----------------PARNLLLEVDGGIYCLAFAPSSS----GLSILGNI 448
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
G + D+ N IG+ + C
Sbjct: 449 QQEGIQITVDSANGYIGFGPATC 471
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 112/381 (29%), Positives = 158/381 (41%), Gaps = 32/381 (8%)
Query: 190 FPLRGNIYPDGLYFTYMI-VGNPPRPYYLDMDTGSDLTWIQCDAPCSS---CAKGANPLY 245
P R Y D L F + +G P +P L DTGSDL+W+QC PC S C +PL+
Sbjct: 136 IPDRSGTYLDTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQ-PCGSSGHCHPQQDPLF 194
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYC-ETCQQCDYEIEYADHSSSMGVLARDELHLTIE 304
P + Y C E Q G C E C Y + Y D SS+ GVL+RD L LT
Sbjct: 195 DPSKSST--YAAVHCGEPQCAAAGGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTLALTSS 252
Query: 305 NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
P FGC L + DG+LGL R ++SLPSQ A+ V +CL
Sbjct: 253 RALAGFP---FGCGTRN----LGDFGRVDGLLGLGRGELSLPSQAAAS--FGAVFSYCLP 303
Query: 365 TNAGGGGYMFLGHDLVPSWGMA-WVPMLDSP-FMELYHTEILKINYGSSPLNLGARNSQV 422
++ GY+ +G G A + ML P F Y E++ I+ G L +
Sbjct: 304 SSNSTTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFTR 363
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
G L D+G+ TY QAY EL+ ++ + + L C+ IV
Sbjct: 364 GGTLLDSGTVLTYLPAQAY-ELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVPAV 422
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISL 542
F FG F + G ++ + CL + I+G+
Sbjct: 423 SF------RFGD-----GAVFELDFFGVMIFLDENVGCLA-FAAMDAGGLPLSIIGNTQQ 470
Query: 543 RGQLVVYDNVNKRIGWAKSHC 563
R V+YD ++IG+ + C
Sbjct: 471 RSAEVIYDVAAEKIGFVPASC 491
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 112/387 (28%), Positives = 175/387 (45%), Gaps = 65/387 (16%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP---RMGNILPYK 256
G YFT + VG P R Y+ +DTGSD+ WIQC APC C +P++ P R +P
Sbjct: 143 GEYFTRLGVGTPARYVYMVLDTGSDIVWIQC-APCIKCYSQTDPVFDPTKSRSFANIPCG 201
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
LC +R PG Q C Y++ Y D S ++G + + LT + + VV G
Sbjct: 202 SPLC---RRLDYPGCSTKKQICLYQVSYGDGSFTVGEFSTET--LTFRGTRVGR--VVLG 254
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG--GYMF 374
C +D +GL V G+LGL R ++S PSQ+ + + +CL + +
Sbjct: 255 CGHDNEGL----FVGAAGLLGLGRGRLSFPSQIGRR--FNSKFSYCLGDRSASSRPSSIV 308
Query: 375 LGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQVGWALF------ 427
G D S + P+L +P ++ Y+ E+L I+ G G R S + +LF
Sbjct: 309 FG-DSAISRTTRFTPLLSNPKLDTFYYVELLGISVG------GTRVSGISASLFKLDSTG 361
Query: 428 ------DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASD----PTLPVCWRAKFPIRS 477
D+G+S T T+ AY L D ++ AS+ P + + F +
Sbjct: 362 NGGVIIDSGTSVTRLTRAAYVAL--------RDAFLVGASNLKRAPEFSL-FDTCFDLSG 412
Query: 478 IVDVKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
+VK T+ LHF G+ + ++ + I + G+ C + +G +II
Sbjct: 413 KTEVK--VPTVVLHFRGADVPLPASNYLIP------VDNSGSFCFAF---AGTASGLSII 461
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
G+I +G VVYD R+G+A C
Sbjct: 462 -GNIQQQGFRVVYDLATSRVGFAPRGC 487
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 105 bits (262), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 101/367 (27%), Positives = 153/367 (41%), Gaps = 42/367 (11%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL-PYKDSLC 260
Y + +G+P + + +D+GSD++W+QC PC C +PL+ P + + P+ S
Sbjct: 131 YLITVRLGSPAKTQTVLIDSGSDVSWVQCK-PCLQCHSQVDPLFDPSLSSTYSPFSCSSA 189
Query: 261 MEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYD 320
Q C + QC Y + YAD SS+ G + D L L GS T N FGC++
Sbjct: 190 ACAQLGQDGNGCSSSSQCQYIVRYADGSSTTGTYSSDTLAL----GSNTISNFQFGCSHV 245
Query: 321 QQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLV 380
+ G N L TDG++GL SL SQ A G +CL G++ LG
Sbjct: 246 ESG--FNDL--TDGLMGLGGGAPSLASQTA--GTFGTAFSYCLPPTPSSSGFLTLGAG-- 297
Query: 381 PSWGMAWVPML-DSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQ 439
+ G PML SP Y + I G + L++ G + D+G+ T +
Sbjct: 298 -TSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFSAGMVM-DSGTIITRLPRT 355
Query: 440 AYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIV 499
AYS L ++ K +R P RSI+D F S ++
Sbjct: 356 AYSALSSAFKAGMKQ--------------YRPA-PPRSIMDTCFDFSGQ-----SSVRLP 395
Query: 500 STKFHISPEGYLVISKKGNI---CLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRI 556
S S + + G I CL S+ + S I+G++ R V+YD +
Sbjct: 396 SVALVFSGGAVVNLDANGIILGNCLAFAANSD--DSSPGIVGNVQQRTFEVLYDVGGGAV 453
Query: 557 GWAKSHC 563
G+ C
Sbjct: 454 GFKAGAC 460
>gi|357168101|ref|XP_003581483.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 510
Score = 105 bits (261), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 167/377 (44%), Gaps = 44/377 (11%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWI--QCDA--PCSSCAKGANPLYKPRMGNI---L 253
L++ + VG P + + +DTGSDL W+ QCD P +S A G+ Y P M + +
Sbjct: 101 LHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGCPPPASGASGSASFYIPSMSSTSQAV 160
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEY--ADHSSSMGVLARDELHLTIENG--SLT 309
P C +H+ C T C Y++ Y AD SSS G L D L+L+ E+ +
Sbjct: 161 PCNSDFC-----DHRKD-CSTTSSCPYKMVYVSADTSSS-GFLVEDVLYLSTEDNHPQIL 213
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG 369
K ++FGC Q G L+ +G+ GL +S+PS LA +G+ + C + G
Sbjct: 214 KAQIMFGCGQVQTGSFLDA-AAPNGLFGLGIDMISVPSILAHKGLTSDSFSMCFGRD--G 270
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDT 429
G + G S P+ + Y I I G+ P++L +FDT
Sbjct: 271 IGRISFGDQ--GSSDQEETPLDINQKHPTYAITITGITVGTEPMDLEFST------IFDT 322
Query: 430 GSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
G+++TY AY+ + S +V ++ D P C+ I F+T+
Sbjct: 323 GTTFTYLADPAYTYITQSFHTQVRANRHAADTRIP-FEYCYDLSSSEARIQTPGVSFRTV 381
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
GS + ++ IS + + + CL I+ ++++ I+G + G VV
Sbjct: 382 G---GSLFPVIDLGQVISIQQHEYV-----YCLAIVKSTKLN-----IIGQNFMTGVRVV 428
Query: 549 YDNVNKRIGWAKSHCMN 565
+D K +GW K +C +
Sbjct: 429 FDRERKILGWKKFNCYD 445
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 105 bits (261), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 119/498 (23%), Positives = 203/498 (40%), Gaps = 61/498 (12%)
Query: 91 ISIFALILYGS--VFSYTLQDRYKSNNDDENKESFVFPLYHKFGIREVSQRDAEFKLGRF 148
+S+F I++ + V + +L + ++N + +E LYH G+ + F
Sbjct: 1 MSLFWFIVFSAHLVLASSLVEFQDNDNPRQKQEGMQLNLYHVKGLDSSQTSTSPFSFSDM 60
Query: 149 VDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDS-------SSIFPLRGNI-YPDG 200
+ D E V R S++ K N+ D S PL+ + G
Sbjct: 61 ITKDEERV---------RFLHSRLTNKESVRNSATTDKLRGGPSLVSTTPLKSGLSIGSG 111
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLC 260
Y+ + +G P + + + +DTGS L+W+QC C +P++ P YK C
Sbjct: 112 NYYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKT--YKALPC 169
Query: 261 MEIQRN-------HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
Q + + PG C Y+ Y D S S+G L++D L LT
Sbjct: 170 SSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSEAP--SSGF 227
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG---- 369
V+GC D QGL ++ GI+GL+ K+S+ QL+ + N +CL ++
Sbjct: 228 VYGCGQDNQGL----FGRSSGIIGLANDKISMLGQLSKK--YGNAFSYCLPSSFSAPNSS 281
Query: 370 --GGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVGWAL 426
G++ +G + S + P++ + + LY ++ I PL + A + V +
Sbjct: 282 SLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVP-TI 340
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV-DVKQFF 485
D+G+ T Y+ L S + S L C++ S V +++ F
Sbjct: 341 IDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIF 400
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQ 545
+ G+ ++ K H S LV +KG CL I S + I+G+ +
Sbjct: 401 RG-----GAGLEL---KAHNS----LVEIEKGTTCLAIAASSNPIS----IIGNYQQQTF 444
Query: 546 LVVYDNVNKRIGWAKSHC 563
V YD N +IG+A C
Sbjct: 445 KVAYDVANFKIGFAPGGC 462
>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
Length = 530
Score = 105 bits (261), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 103/379 (27%), Positives = 171/379 (45%), Gaps = 48/379 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWI--QCDA--PCSSCAKGANPLYKPRMGNI---L 253
L++ + VG P + + + +DTGSDL W+ QCD P +S A G+ Y P M + +
Sbjct: 115 LHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPASAASGSASFYIPSMSSTSQAV 174
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEY--ADHSSSMGVLARDELHLTIENG--SLT 309
P C E+++ C T QC Y++ Y AD SSS G L D L+L+ E+ +
Sbjct: 175 PCNSQFC-ELRKE-----CSTTSQCPYKMVYVSADTSSS-GFLVEDVLYLSTEDAIPQIL 227
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG 369
K ++FGC Q G L+ +G+ GL +S+PS LA +G+ N C + + G
Sbjct: 228 KAQILFGCGQVQTGSFLDA-AAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRD--G 284
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDT 429
G + G S P+ +P Y I +I G+S +L +FDT
Sbjct: 285 IGRISFGDQ--GSSDQEETPLDVNPQHPTYTISISEITVGNSLTDLEFST------IFDT 336
Query: 430 GSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
G+S+TY AY+ + S +V ++ D ++ P D+ +
Sbjct: 337 GTSFTYLADPAYTYITQSFHAQVHANRHAAD-----------SRIPFEYCYDLSSSEDRI 385
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGN--ICLGILDGSEVHNGSTIILGDISLRGQL 546
S + + F + EG ++ ++ CL I+ ++++ I+G + G
Sbjct: 386 QTPSISLRTVGGSVFPVIDEGQVISIQQHEYVYCLAIVKSAKLN-----IIGQNFMTGLR 440
Query: 547 VVYDNVNKRIGWAKSHCMN 565
VV+D K +GW K +C +
Sbjct: 441 VVFDRERKILGWKKFNCYD 459
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 105 bits (261), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 109/379 (28%), Positives = 161/379 (42%), Gaps = 48/379 (12%)
Query: 204 TYMIVGNPPRPYYLDMDTGSDLTWIQCD-APCSSCAKGANPLYKPRMG---NILPYKDSL 259
T + G + + +DTGSDLTW+QC+ P SSC +PL+ P +P
Sbjct: 183 TIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPCGSPA 242
Query: 260 CMEIQRNH--KPGYCETC-----QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-P 311
C ++ PG C Q+C Y + Y D S S GVLA+D L L G+ TK
Sbjct: 243 CAASLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGL----GTTTKLD 298
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG 371
VFGC +GL T G++GL R +SL SQ A++ V +CL G
Sbjct: 299 GFVFGCGLSNRGLFGGTA----GLMGLGRTDLSLVSQTAAR--FGGVFSYCLPATTTSTG 352
Query: 372 YMFLGHDLVPSW-GMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTG 430
+ LG S+ MA+ M+ P ++ + L A G L D+G
Sbjct: 353 SLSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGGAALTAPGFGAGNVLVDSG 412
Query: 431 SSYTYFTKQAY----SELIASLKEVSSDGL-VLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
+ T Y +E + ++ G +LDA C+ R V+V
Sbjct: 413 TVITRLAPSVYKAVRAEFARRFEYPAAPGFSILDA-------CY--DLTGRDEVNVPLL- 462
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN-ICLGILDGSEVHNGSTIILGDISLRG 544
TLTL G++ + + V+ K G+ +CL + S + T I+G+ R
Sbjct: 463 -TLTLEGGAQVTVDAAGM------LFVVRKDGSQVCLAM--ASLPYEDQTPIIGNYQQRN 513
Query: 545 QLVVYDNVNKRIGWAKSHC 563
+ VVYD V R+G+A C
Sbjct: 514 KRVVYDTVGSRLGFADEDC 532
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 105 bits (261), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 105/376 (27%), Positives = 171/376 (45%), Gaps = 44/376 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YFT + VG PP+ Y+ +DTGSD+ W+QC APC +C +P++ P + L
Sbjct: 127 GEYFTRIGVGTPPKYVYMVLDTGSDIVWLQC-APCKNCYSQTDPVFNPVKSG--SFAKVL 183
Query: 260 CME--IQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGC 317
C +R PG C Q C Y++ Y D S + G + LT + + V GC
Sbjct: 184 CRTPLCRRLESPG-CNQRQTCLYQVSYGDGSYTTGEFVTET--LTFRRTKVEQ--VALGC 238
Query: 318 AYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG--GYMFL 375
+D +GL V G+LGL R +S PSQ +CL + +
Sbjct: 239 GHDNEGL----FVGAAGLLGLGRGGLSFPSQAGR--TFNQKFSYCLVDRSASSKPSSVVF 292
Query: 376 GHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLN-LGARNSQV-----GWALFD 428
G+ V S + P+L +P ++ Y+ E+L I+ G +P++ + A + ++ G + D
Sbjct: 293 GNSAV-SRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIID 351
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
G+S T K AY +L++ G S P + + + + VK T+
Sbjct: 352 CGTSVTRLNKPAY----IALRDAFRAGASSLKSAPEFSL-FDTCYDLSGKTTVK--VPTV 404
Query: 489 TLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLV 547
LHF G+ + ++ + I +G G C + +G +II G+I +G V
Sbjct: 405 VLHFRGADVSLPASNYLIPVDG------SGRFCFAF---AGTTSGLSII-GNIQQQGFRV 454
Query: 548 VYDNVNKRIGWAKSHC 563
VYD + R+G++ C
Sbjct: 455 VYDLASSRVGFSPRGC 470
>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
Length = 530
Score = 105 bits (261), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 103/379 (27%), Positives = 171/379 (45%), Gaps = 48/379 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWI--QCDA--PCSSCAKGANPLYKPRMGNI---L 253
L++ + VG P + + + +DTGSDL W+ QCD P +S A G+ Y P M + +
Sbjct: 115 LHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPASAASGSASFYIPSMSSTSQAV 174
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEY--ADHSSSMGVLARDELHLTIENG--SLT 309
P C E+++ C T QC Y++ Y AD SSS G L D L+L+ E+ +
Sbjct: 175 PCNSQFC-ELRKE-----CSTTSQCPYKMVYVSADTSSS-GFLVEDVLYLSTEDAIPQIL 227
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG 369
K ++FGC Q G L+ +G+ GL +S+PS LA +G+ N C + + G
Sbjct: 228 KAQILFGCGQVQTGSFLDA-AAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRD--G 284
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDT 429
G + G S P+ +P Y I +I G+S +L +FDT
Sbjct: 285 IGRISFGDQ--GSSDQEETPLDVNPQHPTYTISISEITVGNSLTDLEFST------IFDT 336
Query: 430 GSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
G+S+TY AY+ + S +V ++ D ++ P D+ +
Sbjct: 337 GTSFTYLADPAYTYITQSFHAQVHANRHAAD-----------SRIPFEYCYDLSSSEDRI 385
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGN--ICLGILDGSEVHNGSTIILGDISLRGQL 546
S + + F + EG ++ ++ CL I+ ++++ I+G + G
Sbjct: 386 QTPSISLRTVGGSVFPVIDEGQVISIQQHEYVYCLAIVKSAKLN-----IIGQNFMTGLR 440
Query: 547 VVYDNVNKRIGWAKSHCMN 565
VV+D K +GW K +C +
Sbjct: 441 VVFDRERKILGWKKFNCYD 459
>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
Length = 518
Score = 105 bits (261), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 165/378 (43%), Gaps = 47/378 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCA--KGAN-------PLYKPRMGN 251
L++T + +G P + + + +DTGSDL W+ CD CS CA +G +Y P+ +
Sbjct: 102 LHYTTVSLGTPGKKFLVALDTGSDLFWVPCD--CSRCAPTEGTTYASDFELSIYNPKGSS 159
Query: 252 I---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYAD-HSSSMGVLARDELHLTIENG- 306
+ +SLC RN G T C Y + Y +S+ G+L D LHLT E+
Sbjct: 160 TSRKVTCDNSLCA--HRNRCLG---TFSNCPYMVSYVSAETSTSGILVEDVLHLTTEDNR 214
Query: 307 -SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
+ V FGC Q G L+ + +G+ GL K+S+PS L+ +G + C
Sbjct: 215 QEFVEAYVTFGCGQVQTGSFLD-IAAPNGLFGLGLEKISVPSILSKEGFTADSFSMCFGP 273
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
+ G G + G P P + Y+ + ++ G++ ++L A
Sbjct: 274 D--GIGRISFGDKGSPD--QEETPFNLNALHPTYNITVTQVRVGTTLIDLDFT------A 323
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
LFD+G+S+TY Y+ ++ S + D S C+ P + +
Sbjct: 324 LFDSGTSFTYLVDPIYTNVLKSFHSQAQDSRRPPDSRIPFEFCYDMS-PGENTSLIPSM- 381
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQ 545
+LT+ GS++ + IS + L+ C+ ++ +E++ I+G + G
Sbjct: 382 -SLTMKGGSQFPVYDPIIIISSQSELI------YCMAVVRSAELN-----IIGQNFMTGY 429
Query: 546 LVVYDNVNKRIGWAKSHC 563
+++D +GW + C
Sbjct: 430 RIIFDREKLVLGWKEFEC 447
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 105 bits (261), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 155/376 (41%), Gaps = 49/376 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y T + +G P Y + +DTGS LTW+QC SC + PL+ PR + +
Sbjct: 132 GNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYTSVRCS 191
Query: 257 DSLCMEIQRNH-KPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
S C E+Q P C C Y+ Y D S S+G L+ D T+ GS + P+ +
Sbjct: 192 ASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTD----TVSFGSTSYPSFYY 247
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC D +GL ++ G++GL+R K+SL QLA + +CL T A GY+ +
Sbjct: 248 GCGQDNEGL----FGRSAGLIGLARNKLSLLYQLAPS--LGYSFSYCLPT-AASTGYLSI 300
Query: 376 GHDLVPSWGMAWVPMLDSPF-MELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYT 434
G + ++ PM S LY + ++ G SPL + + D+G+ T
Sbjct: 301 GPYNTGHY-YSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVIT 359
Query: 435 YFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV-------KQFFKT 487
++ L ++ + + + P SI+D + T
Sbjct: 360 RLPTAVHTALSKAVAQA---------------MAGAQRAPAFSILDTCFEGQASQLRVPT 404
Query: 488 LTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLV 547
+ + F + T ++ L+ CL ST I+G+ + V
Sbjct: 405 VVMAFAGGASMKLTTRNV-----LIDVDDSTTCLAF-----APTDSTAIIGNTQQQTFSV 454
Query: 548 VYDNVNKRIGWAKSHC 563
+YD RIG++ C
Sbjct: 455 IYDVAQSRIGFSAGGC 470
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/413 (23%), Positives = 177/413 (42%), Gaps = 51/413 (12%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPD---GLYFTYMIVGNPPRPYYLDMDT 221
I+ + ++ K ++S AV++ + + + PD G Y M +G P MDT
Sbjct: 5 IQRSQERLEKLQITS---AVNTHQMKDIETPVTPDIGSGEYLIQMAIGTPALSLSAIMDT 61
Query: 222 GSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGY--CETCQQCD 279
GSDL W +C+ PC+ C+ + Y LC + P C C+
Sbjct: 62 GSDLVWTKCN-PCTDCSTSSIYDPSSSS----TYSKVLC-QSSLCQPPSIFSCNNDGDCE 115
Query: 280 YEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLS 339
Y Y D SS+ G+L+ + ++ S + PN+ FGC +D QG K G++G
Sbjct: 116 YVYPYGDRSSTSGILSDETFSIS----SQSLPNITFGCGHDNQG-----FDKVGGLVGFG 166
Query: 340 RAKVSLPSQLASQGIIKNVVGHCLT--TNAGGGGYMFLGHDL-VPSWGMAWVPMLDSPFM 396
R +SL SQL + N +CL T++ +F+G+ + + + P++ S
Sbjct: 167 RGSLSLVSQLGPS--MGNKFSYCLVSRTDSSKTSPLFIGNTASLEATTVGSTPLVQSSST 224
Query: 397 ELYHTEILKINYGSSPL-----NLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEV 451
Y+ + I+ G L ++ G + D+G++ T+ + AY ++KE
Sbjct: 225 NHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYD----AVKEA 280
Query: 452 SSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYL 511
+ L +D L +C+ + F ++T HF + + E YL
Sbjct: 281 MVSSINLPQADGQLDLCFNQQG------SSNPGFPSMTFHFKGA------DYDVPKENYL 328
Query: 512 VISKKGNI-CLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+I CL ++ + + G+ I G++ + ++YDN N + +A + C
Sbjct: 329 FPDSTSDIVCLAMMP-TNSNLGNMAIFGNVQQQNYQILYDNENNVLSFAPTAC 380
>gi|242094226|ref|XP_002437603.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
gi|241915826|gb|EER88970.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
Length = 541
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 114/406 (28%), Positives = 176/406 (43%), Gaps = 62/406 (15%)
Query: 197 YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPY- 255
Y LY+ + VG P + + +DTGSDL W+ CD C CA AN +P + PY
Sbjct: 106 YIGSLYYAVVEVGTPNATFLVALDTGSDLFWVPCD--CKQCASIANVTGQPATA-LRPYS 162
Query: 256 --KDSLCMEIQRNH----KPGYCE--TCQQCDYEIEY-ADHSSSMGVLARDELHLTIEN- 305
+ S ++ ++ +P C T C YE++Y + ++S+ GVL +D LHLT E
Sbjct: 163 PRESSTSKQVTCDNALCDRPNGCSAATNGSCPYEVQYLSANTSTSGVLVQDVLHLTRERP 222
Query: 306 -------GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII-KN 357
+L P VVFGC Q G L+ DG++GL R VS+PS LAS G++ +
Sbjct: 223 GAAAEAGEALQAP-VVFGCGQVQTGTFLDG-AAFDGLMGLGRENVSVPSVLASSGLVASD 280
Query: 358 VVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFM---ELYHTEILKINYGSSPLN 414
C + G G + G G ++PF LY+ +N + +
Sbjct: 281 SFSMCFGDD--GVGRINFGDSGSSGQG-------ETPFTGRRTLYNVSFTAVNVETKSV- 330
Query: 415 LGARNSQVGWALFDTGSSYTYFTKQAYSELIA---SLKEVSSDGLVLDASDP-TLPVCWR 470
+ A+ D+G+S+TY Y+EL SL ++DP C+
Sbjct: 331 -----AAEFAAVIDSGTSFTYLADPEYTELATNFNSLVRERRTNFSSGSADPFPFEYCY- 384
Query: 471 AKFPIRS---IVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS 527
A P ++ I DV +LT G+++ + ++ G V+ CL I+
Sbjct: 385 ALGPNQTEALIPDV-----SLTTKGGARFPVTQPVIGVA-SGRTVV----GYCLAIMKND 434
Query: 528 EVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
N + I+G + G VV+D +GW K C R P
Sbjct: 435 LGVNFN--IIGQNFMTGLKVVFDREKSVLGWEKFDCYKNARVADAP 478
>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
Length = 284
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 78/220 (35%), Positives = 110/220 (50%), Gaps = 27/220 (12%)
Query: 185 DSSSIFPLRGNIYPD----GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKG 240
DS S+ R +Y D G Y T + +G PP+ + L +D+GS +T++ C + C C K
Sbjct: 72 DSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKH 130
Query: 241 ANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELH 300
+P ++P M + Y+ C N + +QC YE EYA+HSSS GVL D +
Sbjct: 131 QDPKFQPEMSST--YQPVKC-----NMDCNCDDDREQCVYEREYAEHSSSKGVLGEDLIS 183
Query: 301 LTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVG 360
E+ LT VFGC + G L + + DGI+GL + +SL QL +G+I N G
Sbjct: 184 FGNES-QLTPQRAVFGCETVETGDLYSQ--RADGIIGLGQGDLSLVDQLVDKGLISNSFG 240
Query: 361 HCLTTNAGGGGYMFLG-----HDLV-------PSWGMAWV 388
C GGG M LG D+V S+GMA V
Sbjct: 241 LCYGGMDVGGGSMILGGFDYPSDMVFTDSDPDRSFGMATV 280
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 155/379 (40%), Gaps = 44/379 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y + +G P + L DTGSDLTW QC SC P++ P +
Sbjct: 152 GNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKTYSNISCT 211
Query: 257 DSLCMEIQR--NHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
+ C ++ + PG C + C Y I+Y D S ++G A+D+L LT + +
Sbjct: 212 SAACSSLKSATGNSPG-CSS-SNCVYGIQYGDSSFTIGFFAKDKLTLTQND---VFDGFM 266
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMF 374
FGC + +GL KT G++GL R +S+ Q A + +CL T+ G G++
Sbjct: 267 FGCGQNNKGL----FGKTAGLIGLGRDPLSIVQQTAQK--FGKYFSYCLPTSRGSNGHLT 320
Query: 375 LGH------DLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G+ G+ + P S Y ++L I+ G L++ Q + D
Sbjct: 321 FGNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQNAGTIID 380
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF---- 484
+G+ T AY L ++ K+ S PT P A + + D+ +
Sbjct: 381 SGTVITRLPSTAYGSLKSAFKQFMSK-------YPTAP----ALSLLDTCYDLSNYTSIS 429
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
++ +F + + P G L+ + +CL + + S I G+I +
Sbjct: 430 IPKISFNFNGNANV-----ELDPNGILITNGASQVCLAFAGNGD--DDSIGIFGNIQQQT 482
Query: 545 QLVVYDNVNKRIGWAKSHC 563
VVYD ++G+ C
Sbjct: 483 LEVVYDVAGGQLGFGYKGC 501
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 95/369 (25%), Positives = 153/369 (41%), Gaps = 35/369 (9%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y T + +G P Y + +DTGS LTW+QC SC + PL+ PR + +
Sbjct: 132 GNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYASVRCS 191
Query: 257 DSLCMEIQRNH-KPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
S C E+Q P C C Y+ Y D S S+G L+ D T+ GS P+ +
Sbjct: 192 ASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTD----TVSFGSTRYPSFYY 247
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC D +GL ++ G++GL+R K+SL QLA + +CL T A GY+ +
Sbjct: 248 GCGQDNEGL----FGRSAGLIGLARNKLSLLYQLAPS--LGYSFSYCLPT-AASTGYLSI 300
Query: 376 GHDLVPSWGMAWVPMLDSPF-MELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYT 434
G + ++ PM S LY + ++ G SPL + + D+G+ T
Sbjct: 301 GPYNTGHY-YSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVIT 359
Query: 435 YFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGS 494
++ L ++ + + G + L C+ + + V F G+
Sbjct: 360 RLPTAVHTALSKAVAQAMA-GAQRAPAFSILDTCFEGQASQLRVPTVAMAFAG-----GA 413
Query: 495 KWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNK 554
++ + L+ CL ST I+G+ + V+YD
Sbjct: 414 SMKLTT-------RNVLIDVDDSTTCLAF-----APTDSTAIIGNTQQQTFSVIYDVAQS 461
Query: 555 RIGWAKSHC 563
RIG++ C
Sbjct: 462 RIGFSAGGC 470
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 105/387 (27%), Positives = 162/387 (41%), Gaps = 41/387 (10%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
+ G+ G YF +G PP+ + L +D+GSDL W+QC +PC C +PLY P +
Sbjct: 54 VSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQC-SPCRQCYAQDSPLYVPSNSS 112
Query: 252 I---LPYKDSLCMEIQRNHK-PGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS 307
+P S C+ I P C YE YAD SSS GV A + T++
Sbjct: 113 TFSPVPCLSSDCLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYES--ATVDGVR 170
Query: 308 LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN- 366
+ K V FGC D QG + G+LGL + +S SQ+ N +CL
Sbjct: 171 IDK--VAFGCGSDNQG----SFAAAGGVLGLGQGPLSFGSQVGYA--YGNKFAYCLVNYL 222
Query: 367 --AGGGGYMFLGHDLVPS-WGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQV 422
+ G +L+ + M + P++ +P LY+ +I K+ G L + ++
Sbjct: 223 DPTSVSSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEI 282
Query: 423 -----GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRS 477
G ++FD+G++ TY+ AYS ++A+ S S L +C
Sbjct: 283 DLLGNGGSIFDSGTTLTYWFPSAYSHILAAFD--SGVHYPRAESVQGLDLCVE------- 333
Query: 478 IVDVKQ-FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
+ V Q F + T+ F F E Y V CL + + G I
Sbjct: 334 LTGVDQPSFPSFTIEFDD-----GAVFQPEAENYFVDVAPNVRCLAMAGLASPLGGFNTI 388
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
G++ + V YD IG+A + C
Sbjct: 389 -GNLLQQNFFVQYDREENLIGFAPAKC 414
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 108/416 (25%), Positives = 170/416 (40%), Gaps = 46/416 (11%)
Query: 165 IRPHKSKINKKLVSSNAV-----AVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDM 219
+ P + IN L S + + +D ++ P I +G Y +G PP
Sbjct: 48 LTPSQRIINAALRSISRLNRVSNLLDQNNKLPQSVLILHNGEYLMRFYIGTPPVERLATA 107
Query: 220 DTGSDLTWIQCDAPCSSCAKGANPLYKP-RMGNILPY--KDSLCMEIQRNHKPGYCETCQ 276
DTGSDL W+QC +PC+SC + PL++P + +P + C + K C
Sbjct: 108 DTGSDLIWVQC-SPCASCFPQSTPLFQPLKSSTFMPTTCRSQPCTLLLPEQKG--CGKSG 164
Query: 277 QCDYEIEYAD-HSSSMGVLARDELHLTIENG--SLTKPNVVFGCAYDQQGLLLNTLV--- 330
+C Y +Y D +S S G+L+ + L + G ++ PN FGC GL N V
Sbjct: 165 ECIYTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSFFGC-----GLYNNITVFPS 219
Query: 331 -KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT--NAGGGGYMFLGHDLVPSWGMAW 387
K GI+GL +SL SQ+ Q I + +CL + F ++ G+
Sbjct: 220 YKLTGIMGLGAGPLSLVSQIGDQ--IGHKFSYCLLPLGSTSTSKLKFGNESIITGEGVVS 277
Query: 388 VPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIAS 447
PM+ P++ Y+ L + + S G + D+G+ TY + Y AS
Sbjct: 278 TPMIIKPWLPTYY--FLNLEAVTVAQKTVPTGSTDGNVIIDSGTLLTYLGESFYYNFAAS 335
Query: 448 LKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISP 507
L+E + LV D P LP C FP R + +Q + + P
Sbjct: 336 LQESLAVELVQDVLSP-LPFC----FPYRDNFVFPEI----------AFQFTGARVSLKP 380
Query: 508 EGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
V+++ N ++ S V S I G S V YD K++ + + C
Sbjct: 381 ANLFVMTEDRNTVCLMIAPSSVSGIS--IFGSFSQIDFQVEYDLEGKKVSFQPTDC 434
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/382 (25%), Positives = 163/382 (42%), Gaps = 41/382 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + +G P R + + +DTGSDLTW+QC +PC +C + L+ P +
Sbjct: 1 GEYLATVRLGTPERVFSVIVDTGSDLTWVQC-SPCGTCYSQNDSLFIPNTST--SFTKLA 57
Query: 260 CMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVVFG 316
C N P C Q C Y Y D S S G D + + NG + PN FG
Sbjct: 58 CGTELCNGLP--YPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFG 115
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGGYM 373
C +D +G + DGILGL + +S PSQL + + +CL +
Sbjct: 116 CGHDNEG----SFAGADGILGLGQGPLSFPSQLKT--VFNGKFSYCLVDWLAPPTQTSPL 169
Query: 374 FLGHDLVPSW-GMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARN---SQVGWA--L 426
G VP++ G+ ++ +L +P + Y+ ++ I+ G LN+ + VG A +
Sbjct: 170 LFGDAAVPTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTI 229
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
FD+G++ T + + E++A++ + D L +C F + V
Sbjct: 230 FDSGTTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSSGLDLCL-GGFAEGQLPTV----P 284
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRGQ 545
++T HF + P Y + + + C ++ +V I+G I +
Sbjct: 285 SMTFHFEGG------DMELPPSNYFIFLESSQSYCFSMVSSPDV-----TIIGSIQQQNF 333
Query: 546 LVVYDNVNKRIGWAKSHCMNPG 567
V YD V ++IG+ C+ G
Sbjct: 334 QVYYDTVGRKIGFVPKSCVGRG 355
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 109/414 (26%), Positives = 183/414 (44%), Gaps = 48/414 (11%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGL-YFTYMIVGNPPRPYYLDMDTGS 223
+R +++I +++VSS+ V + I PL I L Y M +G+ +D TGS
Sbjct: 29 VRSMQNRI-RRVVSSHNVEASQTQI-PLSSGINLQTLNYIVTMGLGSTNMTVIID--TGS 84
Query: 224 DLTWIQCDAPCSSCAKGANPLYKPRMGNILPY---KDSLCMEIQ-RNHKPGYCETC-QQC 278
DLTW+QC+ PC SC P++KP + S C +Q G C + C
Sbjct: 85 DLTWVQCE-PCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGACGSNPSTC 143
Query: 279 DYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGL 338
+Y + Y D S + G L ++L G ++ + VFGC + +GL G++GL
Sbjct: 144 NYVVNYGDGSYTNGELGVEQLSF----GGVSVSDFVFGCGRNNKGLFGG----VSGLMGL 195
Query: 339 SRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHD------LVPSWGMAWVPML 391
R+ +SL SQ + V +CL TT +G G + +G++ + P + + ML
Sbjct: 196 GRSYLSLVSQ--TNATFGGVFSYCLPTTESGASGSLVMGNESSVFKNVTP---ITYTRML 250
Query: 392 DSP-FMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIA-SLK 449
+P Y + I+ L + + + G L D+G+ T Y L A LK
Sbjct: 251 PNPQLSNFYILNLTGIDVDGVALQVPSFGN--GGVLIDSGTVITRLPSSVYKALKALFLK 308
Query: 450 EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEG 509
+ + G L C F + +V T+++HF ++ K +
Sbjct: 309 QFT--GFPSAPGFSILDTC----FNLTGYDEVS--IPTISMHFEGNAEL---KVDATGTF 357
Query: 510 YLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
Y+V +CL + S+ ++ T I+G+ R Q V+YD ++G+A+ C
Sbjct: 358 YVVKEDASQVCLALASLSDAYD--TAIIGNYQQRNQRVIYDTKQSKVGFAEESC 409
>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 530
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/385 (25%), Positives = 162/385 (42%), Gaps = 47/385 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLC 260
L++T++ +G P + + +DTGSD+ W+ CD C CA + Y ++ Y SL
Sbjct: 101 LHYTWIDIGTPNVSFLVALDTGSDMFWVPCD--CIECAPLSAAFYNALDRDLNQYSPSLS 158
Query: 261 MEIQRNHKPGYCETCQQ----------CDYEIEY-ADHSSSMGVLARDELHLTIENGSLT 309
+ H P + C Q C Y EY +D++SS G L D+LHL N +
Sbjct: 159 SSSR--HLPCGHQLCNQNSNCKGFKDRCPYIKEYTSDNTSSSGFLIEDKLHLASNNATKN 216
Query: 310 --KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
+ +V+ GC Q G L +G+LGL +S+P+ LA G+I+N + CL N
Sbjct: 217 SIQASVILGCGRKQSGYFLEG-AAPNGMLGLGPGSISVPALLAKAGLIRNSISICL--NE 273
Query: 368 GGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA---RNSQVGW 424
G G + G G A +PF+ L E+L G +G+ + ++
Sbjct: 274 KGSGRILFGDQ-----GHA-TQRRSTPFL-LDDGELLNYFVGVERFCVGSFCYKETEFK- 325
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
A DTG+S+TY K Y ++A ++ + C+ A
Sbjct: 326 AFIDTGTSFTYLPKGVYETVVAEFEKQVHATRITSQIQSDFNCCYNAS------SRESNN 379
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSE----VHNGSTIILGDI 540
F + F + IS + + ICL ++ + + TI +
Sbjct: 380 FPPMKFTFSKNQSFIIQNPFISMD-----QEDTTICLAVVQSDDELITIGRKYTIACQNF 434
Query: 541 SLRGQLVVYDNVNKRIGWAKSHCMN 565
L G +V+D N R GW +S+C +
Sbjct: 435 -LMGYDMVFDRENLRFGWFRSNCQD 458
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 112/423 (26%), Positives = 187/423 (44%), Gaps = 54/423 (12%)
Query: 160 VNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGL-YFTYMIVGNPPRPYYLD 218
++D +R +++I +++ S++ V + I PL I L Y M +G+ + +
Sbjct: 24 LDDLRVRSMQNRI-RRVASTHNVEASQTQI-PLSSGINLQTLNYIVTMGLGS--KNMTVI 79
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPY---KDSLCMEIQ-RNHKPGYCET 274
+DTGSDLTW+QC+ PC SC P++KP + S C +Q G C +
Sbjct: 80 IDTGSDLTWVQCE-PCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGACGS 138
Query: 275 CQ--QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKT 332
C+Y + Y D S + G L + L G ++ + VFGC + +GL
Sbjct: 139 SNPSTCNYVVNYGDGSYTNGELGVEALSF----GGVSVSDFVFGCGRNNKGLFGG----V 190
Query: 333 DGILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHD---LVPSWGMAWV 388
G++GL R+ +SL SQ + V +CL TT AG G + +G++ + + +
Sbjct: 191 SGLMGLGRSYLSLVSQ--TNATFGGVFSYCLPTTEAGSSGSLVMGNESSVFKNANPITYT 248
Query: 389 PMLDSP-FMELYHTEILKINYGS----SPLNLGARNSQVGWALFDTGSSYTYFTKQAYSE 443
ML +P Y + I+ G +PL+ G G L D+G+ T Y
Sbjct: 249 RMLSNPQLSNFYILNLTGIDVGGVALKAPLSFGN-----GGILIDSGTVITRLPSSVYKA 303
Query: 444 LIAS-LKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTK 502
L A LK+ + G L C F + +V T++L F Q+
Sbjct: 304 LKAEFLKKFT--GFPSAPGFSILDTC----FNLTGYDEVS--IPTISLRFEGNAQL---- 351
Query: 503 FHISPEG--YLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAK 560
++ G Y+V +CL + S+ ++ T I+G+ R Q V+YD ++G+A+
Sbjct: 352 -NVDATGTFYVVKEDASQVCLALASLSDAYD--TAIIGNYQQRNQRVIYDTKQSKVGFAE 408
Query: 561 SHC 563
C
Sbjct: 409 EPC 411
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 113/399 (28%), Positives = 166/399 (41%), Gaps = 69/399 (17%)
Query: 195 NIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY---KPRMGN 251
N P Y ++ +G PP+P L +DTGSDL W QC PC SC P + +
Sbjct: 28 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCK-PCVSCFDQPLPYFDTSRSSTNA 86
Query: 252 ILPYKDSLCMEIQRNHKPGYC----ETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS 307
+LP + + C + + C +T Q C Y Y D+S ++G+LA D+ + S
Sbjct: 87 LLPCESTQC---KLDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTF-VAGTS 142
Query: 308 LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
L P V FGC + G+ + GI G R +SLPSQL + N HC TT
Sbjct: 143 L--PGVTFGCGLNNTGVFNS---NETGIAGFGRGPLSLPSQLK----VGN-FSHCFTTIT 192
Query: 368 GG---GGYMFLGHDLVPSWGMAWV---PMLDSPFME----LYHTEILKINYGSS----PL 413
G + L DL S G V P++ E LY+ + I GS+ P
Sbjct: 193 GAIPSTVLLDLPADLF-SNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPE 251
Query: 414 NLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP------- 466
+ A + G + D+G+S T Q Y +V D P +P
Sbjct: 252 SAFALTNGTGGTIIDSGTSITSLPPQVY--------QVVRDEFAAQIKLPVVPGNATGHY 303
Query: 467 VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN--ICLGIL 524
C+ A P ++ DV + L LHF + + ++ + V GN ICL I
Sbjct: 304 TCFSA--PSQAKPDVPK----LVLHFEGATMDLPRENYV----FEVPDDAGNSIICLAIN 353
Query: 525 DGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G E T I+G+ + V+YD N + + + C
Sbjct: 354 KGDE-----TTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 387
>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 518
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 121/505 (23%), Positives = 209/505 (41%), Gaps = 67/505 (13%)
Query: 85 LFLFLAISIFALILYGSVFSYTLQDRYKSNNDDENKESFVFPLYHKFG--IREVSQRDAE 142
+F FL +F+L+L VF + R F F ++H+F ++++S
Sbjct: 1 MFSFLKFLVFSLLLSVWVFPQNCKGRI-----------FTFKMHHRFSDMLKDLSDSTTS 49
Query: 143 FKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSSNA--VAVDSSSIFPLRGNIYPDG 200
+ + +A D ++R +KL + A D +S F + +
Sbjct: 50 RNFPSKGSFEYYAELAH-RDQMLR------GRKLYNVEAPLAFSDGNSTFRISSLGF--- 99
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP--RMGNILPYKDS 258
L++T + +G P + + +DTGSDL W+ CD CS CA Y + P + S
Sbjct: 100 LHYTTVELGTPGMKFMVALDTGSDLFWVPCD--CSKCAPTQGVAYASDFELSIYDPKQSS 157
Query: 259 LCMEIQRN-----HKPGYCETCQQCDYEIEYAD-HSSSMGVLARDELHLTIE--NGSLTK 310
++ N H+ T C Y + Y +S+ G+L D LHLT E N K
Sbjct: 158 TSKKVTCNNNLCAHRNRCLGTFSSCPYMVSYVSAQTSTSGILVEDVLHLTSEDSNQESIK 217
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
V FGC Q G LNT +G+ GL ++S+PS L+ +G+ + C + G
Sbjct: 218 AYVTFGCGQVQSGSFLNT-AAPNGLFGLGMDQISVPSILSREGLTADSFSMCFGHD--GV 274
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTG 430
G + G P P +P Y+ + ++ G++ +++ ALFD+G
Sbjct: 275 GRISFGDKGSPD--QEETPFNSNPSHPSYNISVTQVRVGTTLVDVDFT------ALFDSG 326
Query: 431 SSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPV--CWRAKFPIRSIVDVKQFFKTL 488
+S+TY Y+ + + + D DP +P C+ S + +L
Sbjct: 327 TSFTYLINPIYAMVSENFHAQAQDK--RRPPDPRIPFEYCYDMSPGANSSLIPSM---SL 381
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
T+ + + I+ + LV CL I+ +E++ I+G + G VV
Sbjct: 382 TMKGRGHFTVFDPIIVITTQNELV------YCLAIVKSTELN-----IIGQNFMTGYRVV 430
Query: 549 YDNVNKRIGWAKSHCMNPGRFKSLP 573
+D +GW ++ C + + S P
Sbjct: 431 FDREKLVLGWKETDCYDQ-EYNSFP 454
>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 525
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 95/378 (25%), Positives = 162/378 (42%), Gaps = 47/378 (12%)
Query: 203 FTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY------------KPRMG 250
+T + +G P + + +DTGSDL W+ CD CS CA Y K
Sbjct: 113 YTTVQLGTPGTKFMVALDTGSDLFWVPCD--CSRCAPTEGSPYASDFELSVYSPKKSSTS 170
Query: 251 NILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYAD-HSSSMGVLARDELHLTIEN--GS 307
+P ++LC QR+ E C Y + Y +S+ G+L D LHL E+
Sbjct: 171 KTVPCNNNLCA--QRDQ---CTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTEHKHSE 225
Query: 308 LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
+ + FGC Q G L+ + +G+ GL ++S+PS L+ +G++ N C + +
Sbjct: 226 PIQAYITFGCGQVQSGSFLD-VAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSDD- 283
Query: 368 GGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALF 427
G G + G S P + Y+ + I G++ ++ ALF
Sbjct: 284 -GVGRINFGDK--GSLEQEETPFNLNQLHPNYNITVTSIRVGTTLIDADIT------ALF 334
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKT 487
D+G+S++YFT YS+L AS + DG +P +P + + + +
Sbjct: 335 DSGTSFSYFTDPIYSKLSASFHAQTRDG--RHPPNPRIPFEYCYNMSPDANASLTPGI-S 391
Query: 488 LTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLV 547
LT+ G + + IS + L+ CL ++ +E++ I+G + G +
Sbjct: 392 LTMKGGGPFPVYDPIIVISTQNELI------YCLAVVKSAELN-----IIGQNFMTGYRI 440
Query: 548 VYDNVNKRIGWAKSHCMN 565
V+D +GW K C +
Sbjct: 441 VFDREKLVLGWKKFDCYD 458
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 89/357 (24%), Positives = 153/357 (42%), Gaps = 43/357 (12%)
Query: 216 YLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYC 272
+L +DTGSD+TWIQCD PC C K + L++P LP ++C ++Q +
Sbjct: 2 FLLIDTGSDITWIQCD-PCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQ-----SFS 55
Query: 273 ETC--QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVVFGCAYDQQGLLLNTL 329
+C C+Y + Y D S++ G A + L L ++ L PN FGC + +GL
Sbjct: 56 HSCLNSSCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNGAA 115
Query: 330 VKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG--GGYMFLGHDLVPSWGMAW 387
G++GL ++ + P+Q + V +CL + + G + G + + + +
Sbjct: 116 ----GLMGLGKSSIGFPAQTSVA--FGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRF 169
Query: 388 VPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIA 446
P++DS Y + IN G L + A + D+G+ + F + AY L
Sbjct: 170 TPLVDSSSGPSQYFVSMTGINVGDELLPISAT------VMVDSGTVISRFEQSAYERLRD 223
Query: 447 SLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHIS 506
+ ++ GL S C+R + ++ D+ +TLHF ++ +S
Sbjct: 224 AFTQILP-GLQTAVSVAPFDTCFR----VSTVDDIN--IPLITLHFRDDAEL-----RLS 271
Query: 507 PEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
P L G +C S + +LG+ + VYD R+G + C
Sbjct: 272 PVHILYPVDDGVMCFAFAPSSSGRS----VLGNFQQQNLRFVYDIPKSRLGISAFEC 324
>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
Length = 530
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 102/379 (26%), Positives = 171/379 (45%), Gaps = 48/379 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWI--QCDA--PCSSCAKGANPLYKPRMGNI---L 253
L++ + VG P + + + +DTGSDL W+ QCD P +S A G+ Y P M + +
Sbjct: 115 LHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPASAASGSASFYIPSMSSTSQAV 174
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEY--ADHSSSMGVLARDELHLTIENG--SLT 309
P C E+++ C T QC Y++ Y AD SSS G L D L+L+ E+ +
Sbjct: 175 PCNSQFC-ELRKE-----CSTTSQCPYKMVYVSADTSSS-GFLVEDVLYLSTEDAIPQIL 227
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG 369
K ++FGC Q G L+ +G+ GL +S+PS LA +G+ N C + + G
Sbjct: 228 KAQILFGCGQVQTGSFLDA-AAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRD--G 284
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDT 429
G + G S P+ +P Y I ++ G+S +L +FDT
Sbjct: 285 IGRISFGDQ--GSSDQEETPLDVNPQHPTYTISISEMTVGNSLTDLEFST------IFDT 336
Query: 430 GSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
G+S+TY AY+ + S +V ++ D ++ P D+ +
Sbjct: 337 GTSFTYLADPAYTYITQSFHAQVHANRHAAD-----------SRIPFEYCYDLSSSEDRI 385
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGN--ICLGILDGSEVHNGSTIILGDISLRGQL 546
S + + F + EG ++ ++ CL I+ ++++ I+G + G
Sbjct: 386 QTPSISLRTVGGSVFPVIDEGQVISIQQHEYVYCLAIVKSAKLN-----IIGQNFMTGLR 440
Query: 547 VVYDNVNKRIGWAKSHCMN 565
VV+D K +GW K +C +
Sbjct: 441 VVFDRERKILGWKKFNCYD 459
>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 101/395 (25%), Positives = 171/395 (43%), Gaps = 48/395 (12%)
Query: 195 NIYPDGLY-----FTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP------ 243
++Y +GL+ + + VG P + + +DTGS+L W+ CD CSSC
Sbjct: 50 SLYSNGLFGYILHYANVSVGTPSVSFLVALDTGSNLLWLPCD--CSSCVHSLRSPSGTVD 107
Query: 244 --LYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARD 297
+Y P + +P +LC + QR+ P C Y++ Y ++ +S+ G + +D
Sbjct: 108 LNIYSPNTSSTSEKVPCNSTLCSQTQRDRCP---SDQSNCPYQVVYLSNGTSTTGYIVQD 164
Query: 298 ELHLTIENGSLTK---PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI 354
LHL I + S +K + FGC Q G L T +G+ GL + +S+PS LA G
Sbjct: 165 LLHL-ISDDSQSKAVDAKITFGCGKVQTGSFL-TGGAPNGLFGLGMSNISVPSTLAHNGY 222
Query: 355 IKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN 414
C + N G G + G G P LY+ I + + G
Sbjct: 223 TSGSFSMCFSPN--GIGRISFGDKGSTGQGETSFNQ-GQPRSSLYNISITQTSIG----- 274
Query: 415 LGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFP 474
G + V A+FD+G+S+TY AY+ + S + LV + + V + +
Sbjct: 275 -GQASDLVYSAIFDSGTSFTYLNDPAYTLIAESFNK-----LVKETRRSSTQVPFDYCYD 328
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGST 534
IRS + + + ++ I + +S Y ++ + + + DGS V+
Sbjct: 329 IRSFISAQILPFSCAYANQTEPTIPAVTLVMSGGDYFNVTDP-IVLVQLADGSAVYCLGM 387
Query: 535 IILGDISLRGQ------LVVYDNVNKRIGWAKSHC 563
I GD+++ GQ +V+D +GW S+C
Sbjct: 388 IKSGDVNIIGQNFMTGHRIVFDRERMILGWKPSNC 422
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 96/336 (28%), Positives = 155/336 (46%), Gaps = 48/336 (14%)
Query: 193 RGNIYPD----GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR 248
R +Y D G Y T + +G PP+ + L +DTGS +T++ C + C C + +P ++P
Sbjct: 77 RMRLYDDLLLNGYYTTRIWIGTPPQTFALIVDTGSTVTYVPC-STCEQCGRHQDPKFEP- 134
Query: 249 MGNILPYKDSLCMEIQRNHKPGYCE---TC----QQCDYEIEYADHSSSMGVLARDELHL 301
E+ ++P C TC +QC YE +YA+ SSS GVL D +
Sbjct: 135 -------------ELSSTYQPVSCNIDCTCDNERKQCVYERQYAEMSSSSGVLGEDIISF 181
Query: 302 TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
++ L +FGC + G L + + DGI+GL R +S+ QL +G+I +
Sbjct: 182 GNQS-ELVPQRAIFGCENQETGDLYSQ--RADGIMGLGRGDLSIVDQLVEKGVISDSFSL 238
Query: 362 CLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR--N 419
C GGG M LG + P GM + D + Y+ ++ I+ L+L +
Sbjct: 239 CYGGMDIGGGAMILG-GISPPSGMVFAES-DPVRSQYYNIDLKAIHVAGKQLHLDPSIFD 296
Query: 420 SQVGWALFDTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRS 477
+ G L D+G++Y Y + A++ A +KE++S + DP +C+ +
Sbjct: 297 GKHGTVL-DSGTTYAYLPEAAFTAFKDAMMKELTSLKQI-HGPDPNYNDICFSG-----A 349
Query: 478 IVDVKQFFKTLTLHFGSKWQIVST--KFHISPEGYL 511
DV Q T F + + S K +SPE YL
Sbjct: 350 ESDVSQLSNT----FPAVEMVFSNGQKLSLSPENYL 381
>gi|357168204|ref|XP_003581534.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Brachypodium distachyon]
Length = 436
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 105/392 (26%), Positives = 163/392 (41%), Gaps = 74/392 (18%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP--------LYKPR--- 248
GLY + +GNP R YYL TGSD+ W+ PCSSC P LY P+
Sbjct: 74 GLYCITVKLGNPSRHYYLAFHTGSDVMWV----PCSSCTDCPTPDDIGFSLDLYDPKNSS 129
Query: 249 MGNILPYKDSLCME-IQRNHKPGYCETCQ----QCDYEIEYADHS-SSMGVLARDELHLT 302
+ + D C + ++ H C T QC Y YAD ++ G D++H
Sbjct: 130 TSSEISCSDDRCADALKTGH--AICHTSHSSGDQCGYNQIYADGVLATTGYYVSDDIHFD 187
Query: 303 I----ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNV 358
I E+ + + +V+FGC+ + G L + DG++G + SL SQL SQG + +
Sbjct: 188 IFMGNESFASSSASVIFGCSKSRSGHL-----QADGVIGFGKDAPSLISQLNSQG-VSHA 241
Query: 359 VGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA- 417
CL + GGG + L D V G+ + ++ S + + + +N + P++
Sbjct: 242 FSRCLDDSDDGGGVLIL--DEVGEPGLEFTSLVASRPCYNLNMKSIAVNNQNVPIDSSLF 299
Query: 418 RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRS 477
S D+G+S YF Y DP + F RS
Sbjct: 300 TTSSTQGTFLDSGTSLAYFPDGVY--------------------DPVIRAILFIYFSTRS 339
Query: 478 IVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN------ICLGILDGSEVHN 531
F T+T +F + PE YL+ ++G+ +C+ SE
Sbjct: 340 FSS----FPTVTXYFEG-----GAAMKVGPENYLL--RRGSYDNDSYMCIA-FQRSEGDY 387
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
T ILGD+ L ++ VY+ +IGW +C
Sbjct: 388 KQTTILGDLILHDKIFVYNLKKMQIGWVNYNC 419
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 93/414 (22%), Positives = 175/414 (42%), Gaps = 45/414 (10%)
Query: 165 IRPHKSKINKKLVSSNAVA-VDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGS 223
++ +S+++K L N V +DS+++ G++ Y + +G P R L DTGS
Sbjct: 8 VKYIQSRLSKNLGRENTVKDLDSTTLPAESGSLIGSANYVVVVGLGTPKRDLSLVFDTGS 67
Query: 224 DLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYC--ETCQQC 278
DLTW QC+ SC K + ++ P + + SLC ++ + C T C
Sbjct: 68 DLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSLCTQLTSDGIKSECSSSTDASC 127
Query: 279 DYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGL 338
Y+ +Y D+S+S+G L+++ L +T + + +FGC D +GL + G++GL
Sbjct: 128 IYDAKYGDNSTSVGFLSQERLTITATD---IVDDFLFGCGQDNEGLFNGSA----GLMGL 180
Query: 339 SRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLD-SPFME 397
R +S+ Q +S + +CL + G++ G + + + P+ S
Sbjct: 181 GRHPISIVQQTSSN--YNKIFSYCLPATSSSLGHLTFGASAATNASLIYTPLSTISGDNS 238
Query: 398 LYHTEILKINYGSSPL-NLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGL 456
Y +I+ I+ G + L + + G ++ D+G+ T Y+ L ++ +
Sbjct: 239 FYGLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXM---- 294
Query: 457 VLDASDPTLPVCWRAKFPIRSIVDVKQFFKT----LTLHFGSKWQIVSTKFHISPEGYLV 512
PV A + + D+ + + + F + + G L
Sbjct: 295 ------EKYPVANEAGL-LDTCYDLSGYKEISVPRIDFEFSGGVTV-----ELXHRGILX 342
Query: 513 ISKKGNICLGILDGSEVHNGS---TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + +CL NGS + G++ + VVYD RIG+ + C
Sbjct: 343 VESEQQVCLAF-----AANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAAGC 391
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 106/378 (28%), Positives = 161/378 (42%), Gaps = 51/378 (13%)
Query: 209 GNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQR 265
G+P + +DTGSDLTW+QC PCS+C +PL+ P + S C + R
Sbjct: 155 GSPAANLTVIVDTGSDLTWVQCK-PCSACYAQRDPLFDPAGSATYAAVRCNASACADSLR 213
Query: 266 --NHKPGYCETC----QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
PG C + ++C Y + Y D S S GVLA D + L G + VFGC
Sbjct: 214 AATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVAL----GGASLGGFVFGCGL 269
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL--TTNAGGGGYMFL-- 375
+GL T G++GL R ++SL SQ AS+ V +CL T+ G + L
Sbjct: 270 SNRGLFGGTA----GLMGLGRTELSLVSQTASR--YGGVFSYCLPAATSGDASGSLSLGG 323
Query: 376 GHDLVPSWG----MAWVPMLDSPFM-ELYHTEILKINYGSSPL---NLGARNSQVGWALF 427
G D S+ +A+ M+ P Y + G + L LGA N L
Sbjct: 324 GDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASN-----VLI 378
Query: 428 DTGSSYTYFTKQAYSELIAS-LKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
D+G+ T Y + A +++ + G L C+ + +VK
Sbjct: 379 DSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTCY----DLTGHDEVKVPLL 434
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGN-ICLGILDGSEVHNGSTIILGDISLRGQ 545
TL L G+ + + V+ K G+ +CL + S + T I+G+ + +
Sbjct: 435 TLRLEGGADVTVDAAGM------LFVVRKDGSQVCLAM--ASLSYEDETPIIGNYQQKNK 486
Query: 546 LVVYDNVNKRIGWAKSHC 563
VVYD + R+G+A C
Sbjct: 487 RVVYDTLGSRLGFADEDC 504
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 94/368 (25%), Positives = 160/368 (43%), Gaps = 51/368 (13%)
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLC-MEIQRNHKPGYCETCQ- 276
+DTGSD+ W+QC APC C + + P++ PR + Y C + R G C+ +
Sbjct: 3 LDTGSDVVWVQC-APCRRCYEQSGPVFDPRRSS--SYGAVGCGAALCRRLDSGGCDLRRG 59
Query: 277 QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGIL 336
C Y++ Y D S + G + L G V GC +D +GL +
Sbjct: 60 ACMYQVAYGDGSVTAGDFVTETLTFA---GGARVARVALGCGHDNEGLFVAAAGLLG--- 113
Query: 337 GLSRAKVSLPSQLASQGIIKNVVGHCLT--TNAGGGGY--------MFLGHDLVPSWGMA 386
L R +S P+Q++ + +CL T++G G + G V + +
Sbjct: 114 -LGRGGLSFPTQISRR--YGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSAS 170
Query: 387 WVPMLDSPFME-LYHTEILKINYGS--------SPLNLGARNSQVGWALFDTGSSYTYFT 437
+ PM+ +P ME Y+ +++ I+ G S L L + G + D+G+S T
Sbjct: 171 FTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGR-GGVIVDSGTSVTRLA 229
Query: 438 KQAYSELIASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSIVDVKQFFKTLTLHFGSKW 496
+ +YS L + + ++ GL L +L C+ R +V V T+++HF
Sbjct: 230 RASYSALRDAFRAAAAGGLRLSPGGFSLFDTCY--DLGGRRVVKVP----TVSMHFAGGA 283
Query: 497 QIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKR 555
+ + PE YL+ + +G C +G I+G+I +G VV+D +R
Sbjct: 284 EAA-----LPPENYLIPVDSRGTFCFAFAG----TDGGVSIIGNIQQQGFRVVFDGDGQR 334
Query: 556 IGWAKSHC 563
+G+A C
Sbjct: 335 VGFAPKGC 342
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 117/463 (25%), Positives = 197/463 (42%), Gaps = 70/463 (15%)
Query: 119 NKESFVFPLYHKFGIR-EVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLV 177
++ + +P K G R + D++ L +F + GI R N +L
Sbjct: 29 SRRALSYPAQLKNGFRITLKHVDSDKNLTKF---------QRIQHGIKRA-----NHRLE 74
Query: 178 SSNAVAVDSSSIFPLRGNIYP-DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236
NA+ + +SS + + +G + + +G PP Y MDTGSDL W QC PC+
Sbjct: 75 RLNAMVLAASSNAEINSPVLSGNGEFLMNLAIGTPPETYSAIMDTGSDLIWTQCK-PCTQ 133
Query: 237 CAKGANPLYKP---RMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGV 293
C +P++ P + L LC + ++ C C+Y Y D+SS+ G
Sbjct: 134 CFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSS----CS--DSCEYLYTYGDYSSTQGT 187
Query: 294 LARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQG 353
+A + T G ++ PNV FGC D +G + + G++GL R +SL SQL
Sbjct: 188 MATE----TFTFGKVSIPNVGFGCGEDNEG---DGFTQGSGLVGLGRGPLSLVSQLK--- 237
Query: 354 IIKNVVGHCLTTNAGGGGYMFLGHDLVP----SWGMAWVPMLDSPFM-ELYHTEILKINY 408
+ +CLT+ L L S + P++ +P Y+ + I+
Sbjct: 238 --EAKFSYCLTSIDDTKTSTLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISV 295
Query: 409 GSSPLNLGARNSQV-----GWALFDTGSSYTYFTKQAYSELIASLKEVSSD-GLVLDASD 462
G + L + Q+ G + D+G++ TY + A+ +L+ KE +S GL +D S
Sbjct: 296 GGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAF-DLVK--KEFTSQMGLPVDNSG 352
Query: 463 PT-LPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNIC 520
T L +C+ + K L LHF + E Y++ S G IC
Sbjct: 353 ATGLELCYNLPSDTSELEVPK-----LVLHF------TGADLELPGENYMIADSSMGVIC 401
Query: 521 LGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
L + GS +G I G++ + V +D + + + ++C
Sbjct: 402 LAM--GS---SGGMSIFGNVQQQNMFVSHDLEKETLSFLPTNC 439
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 112/399 (28%), Positives = 173/399 (43%), Gaps = 43/399 (10%)
Query: 178 SSNAVAVDSSSIFPLR-GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236
S N+ + S PL+ G G Y G P + L +DTGSDLTWIQC PC+
Sbjct: 112 SKNSGPYTTMSNLPLQSGTTVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCK-PCAD 170
Query: 237 CAKGANPLYKPRMGN---ILPYKDSLCME-IQRNHKPGYCETCQQCDYEIEYADHSSSMG 292
C + +++P+ + LP + C E I P C C YEI Y D SSS G
Sbjct: 171 CYSQVDAIFEPKQSSSYKTLPCLSATCTELITSESNPTPC-LLGGCVYEINYGDGSSSQG 229
Query: 293 VLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQ 352
+++ L L GS + N FGC + GL + G+LGL + +S PSQ S+
Sbjct: 230 DFSQETLTL----GSDSFQNFAFGCGHTNTGLFKG----SSGLLGLGQNSLSFPSQSKSK 281
Query: 353 GIIKNVVGHCL--TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFM--ELYHTEILKINY 408
+CL ++ G +G +P+ + + P++ S FM Y + I+
Sbjct: 282 --YGGQFAYCLPDFGSSTSTGSFSVGKGSIPASAV-FTPLV-SNFMYPTFYFVGLNGISV 337
Query: 409 GSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDP--TLP 466
G L++ G + D+G+ T QAY+ L S + + D L ++ P L
Sbjct: 338 GGDRLSIPPAVLGRGSTIVDSGTVITRLLPQAYNALKTSFRSKTRD---LPSAKPFSILD 394
Query: 467 VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKG--NICLGIL 524
C+ S V + T+T HF + + +S G LV + G +CL
Sbjct: 395 TCY--DLSRHSQVRI----PTITFHFQNNADVA-----VSDVGILVPVQNGGSQVCLAFA 443
Query: 525 DGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
S++ + I+G+ + V +D RIG+A C
Sbjct: 444 SASQMDGFN--IIGNFQQQRMRVAFDTGAGRIGFASGSC 480
>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 530
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/388 (25%), Positives = 168/388 (43%), Gaps = 47/388 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP--------LYKPRMGNI 252
L++ + +G P + + +DTGSDL W+ CD C CA A+P +Y PR +
Sbjct: 98 LHYAVVALGTPNVTFLVALDTGSDLFWVPCD--CIKCAPLASPDYGDLKFDMYSPRKSST 155
Query: 253 ---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTIENG-- 306
+P SLC + + C Y I+Y ++++SS GVL D L+LT E+G
Sbjct: 156 SRKVPCSSSLC-----DPQADCSAASNSCPYSIQYLSENTSSKGVLVEDVLYLTTESGQS 210
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
+T+ + FGC Q G L + +G+LGL S+PS LAS+GI N C +
Sbjct: 211 KITQAPITFGCGQVQSGSFLGS-AAPNGLLGLGMDSKSVPSLLASKGIAANSFSMCFGED 269
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWAL 426
GH + L++P I+ + + + +++ A+
Sbjct: 270 ---------GHGRINFGDTGSSDQLETPLNIYKQNPYYNISITGAMVGGKSFDTKFS-AV 319
Query: 427 FDTGSSYTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
D+G+S+T + Y+E+ ++ +V LDAS P + + I + V
Sbjct: 320 VDSGTSFTALSDPMYTEITSTFNAQVKESRKHLDASMP-----FEYCYSISAQGAVNPPN 374
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQ 545
+LT GS + + I+ S+ CL I+ V+ ++G+ + G
Sbjct: 375 ISLTAKGGSIFPVNGPIITITDTS----SRPIAYCLAIMKSEGVN-----LIGENFMSGL 425
Query: 546 LVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
+V+D +GW +C N LP
Sbjct: 426 KIVFDRERLVLGWKTFNCYNFDNSSKLP 453
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 104/392 (26%), Positives = 163/392 (41%), Gaps = 44/392 (11%)
Query: 182 VAVDSS-SIFPLR-GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS-SCA 238
AVD S + PL G Y G Y T M +G P +PY + +DTGS LTW+QC +PC SC
Sbjct: 115 AAVDGSLASVPLTPGTSYGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQC-SPCRVSCH 173
Query: 239 KGANPLYKPRMGNILPYKDSLCMEIQRNH------KPGYCETCQQCDYEIEYADHSSSMG 292
+ + P++ P+ + Y C Q N P C + C Y+ Y D S S+G
Sbjct: 174 RQSGPVFDPKTSS--SYAAVSCSTPQCNDLSTATLNPAACSSSDVCIYQASYGDSSFSVG 231
Query: 293 VLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQ 352
L++D T+ GS + PN +GC D +GL ++ G++GL+R K+SL QLA
Sbjct: 232 YLSKD----TVSFGSNSVPNFYYGCGQDNEGL----FGRSAGLMGLARNKLSLLYQLAP- 282
Query: 353 GIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSS 411
+ +CL +++ G ++ ++ PM+ S + LY ++ +
Sbjct: 283 -TLGYSFSYCLPSSSSSGYLSIGSYN---PGQYSYTPMVSSTLDDSLYFIKLSGMTVAGK 338
Query: 412 PLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRA 471
PL + + + D+G+ T Y L ++ DA L C+
Sbjct: 339 PLAVSSSEYSSLPTIIDSGTVITRLPTTVYDALSKAVAGAMKGTKRADAYS-ILDTCFVG 397
Query: 472 KFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHN 531
+ + V F +S + LV CL
Sbjct: 398 QASSLRVPAVSMAFSG------------GAALKLSAQNLLVDVDSSTTCLAFAPAR---- 441
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
S I+G+ + VVYD + RIG+A C
Sbjct: 442 -SAAIIGNTQQQTFSVVYDVKSNRIGFAAGGC 472
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 107/414 (25%), Positives = 180/414 (43%), Gaps = 64/414 (15%)
Query: 168 HKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTW 227
+ K+ + +S+ + +SS P+ +G + + +G P Y MDTGSDL W
Sbjct: 66 KRGKLRLQRLSAKTASFESSVEAPVHAG---NGEFLMKLAIGTPAETYSAIMDTGSDLIW 122
Query: 228 IQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQ-CDYEIE 283
QC PC C P++ P+ + LP LC + + +C C+Y
Sbjct: 123 TQCK-PCKDCFDQPTPIFDPKKSSSFSKLPCSSDLCAALPIS-------SCSDGCEYLYS 174
Query: 284 YADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKV 343
Y D+SS+ GVLA + T G + + FGC D G + + G++GL R +
Sbjct: 175 YGDYSSTQGVLATE----TFAFGDASVSKIGFGCGEDNDG---SGFSQGAGLVGLGRGPL 227
Query: 344 SLPSQLASQGIIKNVVGHCLTT--NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYH 400
SL SQL + +CLT+ ++ G + +G + + P++ +P Y+
Sbjct: 228 SLISQLG-----EPKFSYCLTSMDDSKGISSLLVGSEATMKNAIT-TPLIQNPSQPSFYY 281
Query: 401 TEILKINYGSSPL-----NLGARNSQVGWALFDTGSSYTYFTKQAYS----ELIASLKEV 451
+ I+ G + L +N G + D+G++ TY A++ E I+ LK
Sbjct: 282 LSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQLK-- 339
Query: 452 SSDGLVLDASDPT-LPVCWRAKFPIRSIVDVKQFFKTLTLHF-GSKWQIVSTKFHISPEG 509
L +D S T L +C+ P S VDV Q L HF G+ ++ + + I+ G
Sbjct: 340 ----LDVDESGSTGLDLCFTLP-PDASTVDVPQ----LVFHFEGADLKLPAENYIIADSG 390
Query: 510 YLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G ICL + S + I G+ + +V++D + I +A + C
Sbjct: 391 L------GVICLTMGSSSGMS-----IFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 101/386 (26%), Positives = 160/386 (41%), Gaps = 56/386 (14%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPY 255
+G + M +G P Y +DTGSDL W QC PC C + P++ P + LP
Sbjct: 115 NGEFLMDMSIGTPALAYAAIVDTGSDLVWTQCK-PCVECFNQSTPVFDPSSSSTYSTLPC 173
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
SLC ++ + + C Y Y D SS+ GVLA + L P V F
Sbjct: 174 SSSLCSDLPTST---CTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKTK----LPGVAF 226
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG------ 369
GC +G + + G++GL R +SL SQL G+ K +CLT+
Sbjct: 227 GCGDTNEG---DGFTQGAGLVGLGRGPLSLVSQL---GLGK--FSYCLTSLDDTSKSPLL 278
Query: 370 -GGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNL-----GARNSQV 422
G + D + + P++ +P Y+ + + GS+ + L ++
Sbjct: 279 LGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGT 338
Query: 423 GWALFDTGSSYTYFTKQAYSEL----IASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSI 478
G + D+G+S TY Q Y L A +K +DG S L +C++A P +
Sbjct: 339 GGVIVDSGTSITYLELQGYRPLKKAFAAQMKLPVADG-----SAVGLDLCFKA--PASGV 391
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGILDGSEVHNGSTIIL 537
DV+ L LHF + + E Y+V+ S G +CL ++ + I+
Sbjct: 392 DDVE--VPKLVLHFDGGADL-----DLPAENYMVLDSASGALCLTVMGSRGLS-----II 439
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHC 563
G+ + VYD + +A C
Sbjct: 440 GNFQQQNIQFVYDVDKDTLSFAPVQC 465
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 112/412 (27%), Positives = 180/412 (43%), Gaps = 48/412 (11%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSD 224
+R H+ + ++S A SS + L G YFT + VG PPR Y+ +DTGSD
Sbjct: 76 LRLHRDTLRVHALNSRAAGFSSSVVSGLSQG---SGEYFTRLGVGTPPRYLYMVLDTGSD 132
Query: 225 LTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYE 281
+ W+QC +PC C ++P++ P +P LC +R G C Y+
Sbjct: 133 VVWLQC-SPCRKCYSQSDPIFNPYKSKSFAGIPCSSPLC---RRLDSSGCSTRRHTCLYQ 188
Query: 282 IEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRA 341
+ Y D S + G A + LT + K V GC + +GL V G+LGL R
Sbjct: 189 VSYGDGSFTTGDFATET--LTFRGNKIAK--VALGCGHHNEGL----FVGAAGLLGLGRG 240
Query: 342 KVSLPSQLASQGIIKNVVGHCLTTNAGGG--GYMFLGHDLVPSWGMAWVPMLDSPFME-L 398
++S PSQ + + +CL + M G D S + P++ +P ++
Sbjct: 241 RLSFPSQTGIR--FNHKFSYCLVDRSASSKPSSMVFG-DAAISRLARFTPLIRNPKLDTF 297
Query: 399 YHTEILKINYGS------SPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVS 452
Y+ ++ I+ G SP ++ G + D+G+S T T+ AY+ L + + V
Sbjct: 298 YYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFR-VG 356
Query: 453 SDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF-GSKWQIVSTKFHISPEGYL 511
+ L C+ +S V V T+ LHF G+ + +T + I
Sbjct: 357 ARHLKRGPEFSLFDTCY--DLSGQSSVKV----PTVVLHFRGADMALPATNYLIP----- 405
Query: 512 VISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + G+ C + +G +II G+I +G VVYD RIG+A C
Sbjct: 406 -VDENGSFCFAF---AGTISGLSII-GNIQQQGFRVVYDLAGSRIGFAPRGC 452
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 172/382 (45%), Gaps = 42/382 (10%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL 253
G + G YF + +G+P + YL MDTGSD+ WIQC +PC SC K + ++ PR +
Sbjct: 6 GLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQC-SPCKSCYKQNDAVFDPRASSSF 64
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
+ + T +C Y++ Y D S ++G LA D ++ G T P V
Sbjct: 65 RRLSCSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDS--FSVSRGR-TSP-V 120
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG---G 370
VFGC +D +GL V G+LGL K+S PSQL+S+ +CL + G
Sbjct: 121 VFGCGHDNEGL----FVGAAGLLGLGAGKLSFPSQLSSRKF-----SYCLVSRDNGVRAS 171
Query: 371 GYMFLGHDLVP-SWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV------ 422
+ G +P S A+ +L +P ++ Y+ + I+ G + L++ + ++
Sbjct: 172 SALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGR 231
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
G + D+G+S T AY+ + + + + + A+D +L + + ++ V
Sbjct: 232 GGVIIDSGTSVTRLPTYAYTVMRDAFRSATQK--LPRAADFSL---FDTCYDFSALTSVT 286
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDIS 541
T++ HF + + P YLV + G C S + I+G+I
Sbjct: 287 --IPTVSFHFEGGASV-----QLPPSNYLVPVDTSGTFCFAFSKTSLDLS----IIGNIQ 335
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+ V D + R+G+A C
Sbjct: 336 QQTMRVAIDLDSSRVGFAPRQC 357
>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 100/395 (25%), Positives = 170/395 (43%), Gaps = 51/395 (12%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYK----- 246
LRG + G + + + Y L +DTGS T++ C C+ C + A+ Y
Sbjct: 29 LRGGVLGTGTLVAEYALADG-QTYDLIVDTGSARTYVPCKG-CARCGEHAHGYYDYDRSM 86
Query: 247 --PRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIE 304
R+ +LC E + G C++ +C Y + YA+ SSS G + RD + L
Sbjct: 87 EFERLDCGEASDATLCEETMK----GTCQSDGRCSYVVSYAEGSSSRGYVVRDRVRLG-- 140
Query: 305 NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
G+L+ + FGC + + K DG+ G R ++ +QLAS G+I+NV C+
Sbjct: 141 EGTLSA-MLAFGCEEAETNAIYEQ--KADGLFGFGRGTATVHAQLASAGLIENVFSFCVE 197
Query: 365 TNAGGGGYMFLGH-DL-VPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR---- 418
GG + LG D + +A P++ P +H N +S LG
Sbjct: 198 GFGANGGVLTLGRFDFGADAPALARTPLVADPANPAFH------NVRTSSWKLGDSLIEH 251
Query: 419 -NSQVGWALFDTGSSYTYFTKQAYSELIASL-KEVSSDGL-VLDASDPTL-PVCWRAKFP 474
NS D+G+++T+ + + L + + GL ++ DP VC+
Sbjct: 252 LNSYT--TTLDSGTTFTFVPRSVWVSFKTRLDTQATQAGLEIVAGPDPQYDDVCYGVSAA 309
Query: 475 IRSIV----DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI--SKKGNICLGILDGSE 528
++ V ++F LT+ + + PE YL + C+GI
Sbjct: 310 AMNMTLSQSTVSEWFPPLTIAYEG-----GVSLTLGPENYLFAHETNSAAFCVGIF---- 360
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + I+LG I++R L+ +D N R+G A ++C
Sbjct: 361 ANPNNQILLGQITMRDTLMEFDVANSRVGMAPANC 395
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 122/466 (26%), Positives = 195/466 (41%), Gaps = 78/466 (16%)
Query: 117 DENKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKL 176
DE + PL H+ G S R ++ E ++ R +S+ K
Sbjct: 53 DEGSNTVSVPLVHRHGPCAPSTRSSD-----------EPSLSE------RLRRSRARSKY 95
Query: 177 VSSNAVAVDSSSIFPLRGNIYPDGLYFTYMI-VGNPPRPYYLDMDTGSDLTWIQCDAPCS 235
+ S A + S L G++ D L + + +G P L +DTGSDL+W+QC APC+
Sbjct: 96 IMSRASKSNVSIPTHLGGSV--DSLEYVVTVGLGTPAVSQVLLIDTGSDLSWVQC-APCN 152
Query: 236 S--CAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETC-------QQCDYEIE 283
S C +PL+ P + +P C ++ R+ GY C QC Y I
Sbjct: 153 STTCYPQKDPLFDPSRSSTYAPIPCNTDACRDLTRD---GYGSDCTSGSGGGAQCGYAIT 209
Query: 284 YADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKV 343
Y D S + GV + + L + +T + FGC +DQ G K DG+LGL A
Sbjct: 210 YGDGSQTTGVYSNETLTMAP---GVTVKDFHFGCGHDQDG----PNDKYDGLLGLGGAPE 262
Query: 344 SLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEI 403
SL Q +S + +CL G++ LG + + G + PM+ Y +
Sbjct: 263 SLVVQTSS--VYGGAFSYCLPAANDQAGFLALGAPVNDASGFVFTPMVREQ-QTFYVVNM 319
Query: 404 LKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDP 463
I G P+++ ++ G + D+G+ T AY+ L A+ ++ A+ P
Sbjct: 320 TGITVGGEPIDV-PPSAFSGGMIIDSGTVVTELQHTAYAALQAAFRKAM-------AAYP 371
Query: 464 TLP-----VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN 518
LP C+ F S V V + LT G+ + P+G L+ +
Sbjct: 372 LLPNGELDTCY--NFTGHSNVTVPRV--ALTFSGGATVDL------DVPDGILL-----D 416
Query: 519 ICLGILD-GSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
CL + G + G ILG+++ R V+YD + R+G+ C
Sbjct: 417 NCLAFQEAGPDNQPG---ILGNVNQRTLEVLYDVGHGRVGFGADAC 459
>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 543
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 126/486 (25%), Positives = 195/486 (40%), Gaps = 65/486 (13%)
Query: 119 NKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVS 178
+ SF F L+H+F V +R AE + G + + H + + L
Sbjct: 30 DASSFGFDLHHRF--SPVVRRWAEARGGPLAADQWPARGTPEYYSALSRHD-RARRALAG 86
Query: 179 SNAVAVDSSSIFPLRGNIYPDG-LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC 237
A D F + Y G LY+ + +G P + + +DTGSDL W+ CD C C
Sbjct: 87 G---ADDGLLTFAAGNDTYQSGTLYYAEVELGTPNATFLVALDTGSDLFWVPCD--CRQC 141
Query: 238 AK--GANPLYK--PRMGNILPYKDSLCMEI--------QRNHKPGYCETCQQCDYEIEY- 284
A AN + P + P + S ++ QRN T C YE++Y
Sbjct: 142 ATIPSANGTGQDAPSLRPYSPRRSSTSKQVACDNPLCGQRNGC--SAATNGSCPYEVQYV 199
Query: 285 ADHSSSMGVLARDELHLTIEN------GSLTKPNVVFGCAYDQQGLLLN-TLVKTDGILG 337
+ ++SS GVL +D LHLT E G + VVFGC Q G L+ DG++G
Sbjct: 200 SANTSSSGVLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQVQTGAFLDGGGGAVDGLMG 259
Query: 338 LSRAKVSLPSQLASQGII-KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFM 396
L KVS+PS LA+ G++ + C + G G + G S G A P
Sbjct: 260 LGMGKVSVPSALAASGLVASDSFSMCFGDD--GVGRVNFGD--AGSRGQAETPFTVRSLN 315
Query: 397 ELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGL 456
Y+ I GS + + A+ D+G+S+TY + Y++L S+
Sbjct: 316 PTYNVSFTSIGVGSESV------AAEFAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERR 369
Query: 457 VLDASDPTLPV----CWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPE---- 508
V +S P C+R P ++ V + +LT G+ + + +
Sbjct: 370 VNFSSGSADPFPFEYCYRLS-PNQTEVAMPDV--SLTAKGGALFPVTQPFIPVGDTTGRA 426
Query: 509 -GYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPG 567
GY + + ++ +GI I+G + G VV+D +GW K C
Sbjct: 427 VGYCLAIMRNDMAIGI-----------DIIGQNFMTGLKVVFDRERSVLGWEKFDCYRNA 475
Query: 568 RFKSLP 573
R P
Sbjct: 476 RVADAP 481
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 107/386 (27%), Positives = 164/386 (42%), Gaps = 47/386 (12%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPY 255
+G + + VG P PY +DTGSDL W QC PC C P++ P + LP
Sbjct: 113 NGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCK-PCVECFNQTTPVFDPAASSTYAALPC 171
Query: 256 KDSLCMEI---QRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPN 312
+LC ++ C Y Y D SS+ GVLA + L + P
Sbjct: 172 SSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLARQK----VPG 227
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT--NAGGG 370
V FGC +G + + G++GL R +SL SQL GI + +CLT+ +A G
Sbjct: 228 VAFGCGDTNEG---DGFTQGAGLVGLGRGPLSLVSQL---GIDR--FSYCLTSLDDAAGR 279
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFME------LYHTEILKINYGSSPLNL-----GARN 419
+ LG S A P +P ++ Y+ + + GS+ L L ++
Sbjct: 280 SPLLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQD 339
Query: 420 SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSI- 478
G + D+G+S TY +AY L + S V DAS+ L +C++ P ++
Sbjct: 340 DGTGGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTV-DASEIGLDLCFQG--PAGAVD 396
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGILDGSEVHNGSTIIL 537
DV+ L LHF + E Y+V+ S G +CL ++ G +II
Sbjct: 397 QDVQVQVPKLVLHFDG-----GADLDLPAENYMVLDSASGALCLTVM----ASRGLSII- 446
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHC 563
G+ + VYD + +A + C
Sbjct: 447 GNFQQQNFQFVYDVAGDTLSFAPAEC 472
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 100/382 (26%), Positives = 156/382 (40%), Gaps = 45/382 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + VG+PP YL +D+GSD+ W+QC PC C A+PL+ P S
Sbjct: 169 GEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCK-PCLECYVQADPLFDPATSATFS-GVSC 226
Query: 260 CMEIQRNHKPGYCETCQ--QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGC 317
I R C + C+YE+ YAD S + G LA + L L G VV GC
Sbjct: 227 GSAICRILPTSACGDGELGGCEYEVSYADGSYTKGALALETLTL----GGTAVEGVVIGC 282
Query: 318 AYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG------- 370
+ +GL V G++GL +SL QL G + +CL + G G
Sbjct: 283 GHRNRGL----FVGAAGLMGLGWGPMSLVGQLG--GEVGGAFSYCLASRGGYGSGAADDD 336
Query: 371 -GYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQV-----G 423
G++ LG G WVP++ +P Y+ + I G L L A Q+ G
Sbjct: 337 AGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAG 396
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGL--VLDASDPTLPVCWRAKFPIRSIVDV 481
+ DTG++ T ++AY+ L + + + S L C + + V
Sbjct: 397 DVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTC----YDLSGYASV 452
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
+ T++ F +++ ++ L+ G CL S I+G+
Sbjct: 453 R--VPTVSFCFDGDARLI-----LAARNVLLEVDMGIYCLAFAPSSS----GLSIMGNTQ 501
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
G + D+ N IG+ ++C
Sbjct: 502 QAGIQITVDSANGYIGFGPANC 523
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 79/271 (29%), Positives = 120/271 (44%), Gaps = 31/271 (11%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYK-- 256
G YF M +GNP R YYL++DTGSD+TWIQC APCSSC +P+Y P N Y+
Sbjct: 9 SGEYFARMGIGNPQRSYYLELDTGSDVTWIQC-APCSSCYSQVDPIYDPS--NSSSYRRV 65
Query: 257 ---DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
+LC + + G C Y + Y D S+S G L + +L N S N+
Sbjct: 66 YCGSALCQALDYSACQG-----MGCSYRVVYGDSSASSGDLGIESFYLG-PNSSTAMRNI 119
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN----AGG 369
FGC + GL + +S SQ+A+ I +CL
Sbjct: 120 AFGCGHSNSGLFRGEAGLLG----MGGGTLSFFSQIAAS--IGPAFSYCLVDRYSQLQSR 173
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPL-----NLGARNSQVG 423
+ G +P + + P+L +P + ++ +L I+ G +PL + G
Sbjct: 174 SSPLIFGRTAIP-FAARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTG 232
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSD 454
A+ D+G+S T AY+ L + + S +
Sbjct: 233 GAILDSGTSVTRVVPPAYAVLRDAYRAASRN 263
>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 545
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 113/418 (27%), Positives = 172/418 (41%), Gaps = 63/418 (15%)
Query: 189 IFPLRGNIYPDG-LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK-------- 239
F + Y G LY+ + +G P + + +DTGSDL W+ CD C CA
Sbjct: 96 TFAAGNDTYQSGTLYYAEVELGTPNATFLVALDTGSDLFWVPCD--CRQCATIPSANATG 153
Query: 240 -GANPL--YKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMG 292
A PL Y PR + + + LC +RN T C YE++Y + ++SS G
Sbjct: 154 PDAPPLRPYSPRRSSTSEQVACDNPLCG--RRNGC--SAATNGSCPYEVQYVSANTSSSG 209
Query: 293 VLARDELHLTIEN------GSLTKPNVVFGCAYDQQGLLLNT-LVKTDGILGLSRAKVSL 345
VL +D LHLT E G + VVFGC Q G L+ DG++GL KVS+
Sbjct: 210 VLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQVQTGAFLDDGGGAVDGLMGLGMGKVSV 269
Query: 346 PSQLASQGII-KNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEIL 404
PS LA+ G++ + C + G G + G S G A P Y+
Sbjct: 270 PSALAASGLVASDSFSMCFGDD--GVGRVNFGD--AGSRGQAETPFTVRSLNPTYNVSFT 325
Query: 405 KINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT 464
I GS + + A+ D+G+S+TY + Y++L S+ V +S
Sbjct: 326 SIGIGSESV------AAEFAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSA 379
Query: 465 LPV----CWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPE-----GYLVISK 515
P C+R P ++ V + +LT G+ + + + GY +
Sbjct: 380 DPFPFEYCYRLS-PNQTEVAMPDV--SLTAKGGALFPVTQPFIPVGDTTGRAIGYCLAIM 436
Query: 516 KGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
+ ++ +GI I+G + G VV+D +GW K C R P
Sbjct: 437 RNDMAIGI-----------DIIGQNFMTGLKVVFDRERSVLGWEKFDCYRNARVADAP 483
>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
Length = 297
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 71/220 (32%), Positives = 107/220 (48%), Gaps = 24/220 (10%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGN--IYPDGLYFTYMIVGNPPRPYYLDMDTG 222
+R H + + +L++ A+D PL G+ GLYFT + +G P + YY+ +DTG
Sbjct: 59 LREHDGRRHGRLLA----AID----LPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTG 110
Query: 223 SDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNILPYKDSLCMEIQRNHKPGYCET 274
SD+ W+ C C C + +N +Y PR G ++ C+ P C +
Sbjct: 111 SDILWVNC-VSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCVANYGGVLP-SCTS 168
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENG-SLTKP---NVVFGCAYDQQGLLLNTLV 330
C+Y I Y D SS+ G D L +G T P +V FGC G L ++ +
Sbjct: 169 TSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNL 228
Query: 331 KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
DGILG ++ S+ SQLA+ G ++ + HCL T GGG
Sbjct: 229 ALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGG 268
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 98/377 (25%), Positives = 164/377 (43%), Gaps = 45/377 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP---RMGNILPYK 256
G YFT + VG P R ++ +DTGSD+ WIQC APC C +P++ P R +P
Sbjct: 145 GEYFTRLGVGTPARYVFMVLDTGSDVVWIQC-APCKKCYSQTDPVFNPTKSRSFANIPCG 203
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
LC +R PG C Y++ Y D S + G + + LT + + V G
Sbjct: 204 SPLC---RRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTET--LTFRGTRVGR--VALG 256
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG--GYMF 374
C +D +GL + L R ++S PSQ+ + +CL + YM
Sbjct: 257 CGHDNEGLFIGAAGLLG----LGRGRLSFPSQIGRR--FSRKFSYCLVDRSASSKPSYMV 310
Query: 375 LGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLN------LGARNSQVGWALF 427
G + S + P++ +P ++ Y+ E+L ++ G + + ++ G +
Sbjct: 311 FGDSAI-SRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVII 369
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKT 487
D+G+S T T+ AY L + + V + L C F + +VK T
Sbjct: 370 DSGTSVTRLTRPAYVALRDAFR-VGASNLKRAPEFSLFDTC----FDLSGKTEVK--VPT 422
Query: 488 LTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
+ LHF G+ + ++ + I + G+ C + +G +I+ G+I +G
Sbjct: 423 VVLHFRGADVSLPASNYLIP------VDNSGSFCFAF---AGTMSGLSIV-GNIQQQGFR 472
Query: 547 VVYDNVNKRIGWAKSHC 563
VVYD R+G+A C
Sbjct: 473 VVYDLAASRVGFAPRGC 489
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 100/372 (26%), Positives = 160/372 (43%), Gaps = 41/372 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR-------MGNI 252
G Y T M +G P + Y + +DTGS LTW+QC SC + + P++ PR +
Sbjct: 119 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSCS 178
Query: 253 LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPN 312
P D+L P C T C Y+ Y D S S+G L++D T+ GS + PN
Sbjct: 179 APQCDAL---TTATLNPSTCSTSNVCIYQASYGDSSFSVGYLSKD----TVSFGSTSVPN 231
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY 372
+GC D +GL ++ G++GL+R K+SL QLA + +CL T++ GY
Sbjct: 232 FYYGCGQDNEGL----FGQSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSGY 285
Query: 373 MFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQVGWALFDTGS 431
+ +G + ++ PM S + LY ++ I PL++ A + D+G+
Sbjct: 286 LSIGSYNPGQY--SYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYSSLPTIIDSGT 343
Query: 432 SYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLH 491
T YS L ++ + G ++ L C++ + + V F
Sbjct: 344 VITRLPTDVYSALSKAVAG-AMKGTPRASAFSILDTCFQGQASRLRVPQVSMAFAG---- 398
Query: 492 FGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
G+ ++ +T LV CL S I+G+ + VVYD
Sbjct: 399 -GAALKLKATNL-------LVDVDSATTCLAFAPAR-----SAAIIGNTQQQTFSVVYDV 445
Query: 552 VNKRIGWAKSHC 563
N +IG+A C
Sbjct: 446 KNSKIGFAAGGC 457
>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 521
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 109/393 (27%), Positives = 178/393 (45%), Gaps = 51/393 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLC 260
L++T++ +G P + + +D GSDL WI CD C CA ++ Y ++ Y S
Sbjct: 96 LHYTWIDIGTPSTSFLVALDAGSDLLWIPCD--CVQCAPLSSSYYSNLDRDLNEYSPSRS 153
Query: 261 MEIQR---NH----KPGYCETC-QQCDYEIEY-ADHSSSMGVLARDELHL----TIENGS 307
+ + +H K C++ QQC Y + Y ++++SS G+L D LHL T+ N S
Sbjct: 154 LSSKHLSCSHRLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLLVEDILHLQSGGTLSNSS 213
Query: 308 LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
+ P VV GC Q G L+ V DG+LGL + S+PS LA G+I C N
Sbjct: 214 VQAP-VVLGCGMKQSGGYLDG-VAPDGLLGLGPGESSVPSFLAKSGLIHYSFSLCF--NE 269
Query: 368 GGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKIN---YGSSPLNLGARNSQVGW 424
G MF G D P+ + + P LY T I+ + G+S L + + +QV
Sbjct: 270 DDSGRMFFG-DQGPTSQQSTSFL---PLDGLYSTYIIGVESCCIGNSCLKMTSFKAQV-- 323
Query: 425 ALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
D+G+S+T+ Y + ++V+ + S W + + S D+ +
Sbjct: 324 ---DSGTSFTFLPGHVYGAITEEFDQQVNGSRSSFEGSP------WEYCY-VPSSQDLPK 373
Query: 484 FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI--CLGILDGSEVHNGSTIILGDIS 541
+ TL F + F + ++ +G I CL IL G +G
Sbjct: 374 -VPSFTLMFQR-----NNSFVVYDPVFVFYGNEGVIGFCLAILP----TEGDMGTIGQNF 423
Query: 542 LRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPF 574
+ G +V+D NK++ W++S+C + K +P
Sbjct: 424 MTGYRLVFDRGNKKLAWSRSNCQDLSLGKRMPL 456
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 104/420 (24%), Positives = 170/420 (40%), Gaps = 60/420 (14%)
Query: 162 DGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDT 221
DG+ R N V + + A+ + G G YF+ + +G+P R Y+ +DT
Sbjct: 129 DGVTRLDLRPANGSAVFAASAAIQGPVV---SGVGQGSGEYFSRVGIGSPARQLYMVLDT 185
Query: 222 GSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQ-RNHKPGYCETCQ-QCD 279
GSD+TW+QC PC+ C + ++P++ P + Y C + R+ C C
Sbjct: 186 GSDVTWVQCQ-PCADCYQQSDPVFDPSLSA--SYAAVSCDSQRCRDLDTAACRNATGACL 242
Query: 280 YEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLS 339
YE+ Y D S ++G A + L L S NV GC +D +GL V G+L L
Sbjct: 243 YEVAYGDGSYTVGDFATETLTL---GDSTPVGNVAIGCGHDNEGL----FVGAAGLLALG 295
Query: 340 RAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFME-L 398
+S PSQ+++ + +CL D G P++ SP
Sbjct: 296 GGPLSFPSQISA-----STFSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTF 350
Query: 399 YHTEILKINYGSSPLNLGAR------NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVS 452
Y+ + I+ G PL++ A S G + D+G++ T AY+ L
Sbjct: 351 YYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAAL-------- 402
Query: 453 SDGLVLDASDPTLP---------VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKF 503
D V A P+LP C+ R+ V+V +L G ++ + +
Sbjct: 403 RDAFVQGA--PSLPRTSGVSLFDTCY--DLSDRTSVEVPAV--SLRFEGGGALRLPAKNY 456
Query: 504 HISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I +G G CL N + I+G++ +G V +D +G+ + C
Sbjct: 457 LIPVDG------AGTYCLAFAP----TNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 506
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 100/382 (26%), Positives = 171/382 (44%), Gaps = 42/382 (10%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL 253
G + G YF + +G+P + YL MDTGSD+ WIQC +PC SC K + ++ PR +
Sbjct: 6 GLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQC-SPCKSCYKQNDAVFDPRASSSF 64
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
+ + T +C Y++ Y D S ++G LA D ++ T P V
Sbjct: 65 RRLSCSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFLVSRGR---TSP-V 120
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG---G 370
VFGC +D +GL V G+LGL K+S PSQL+S+ +CL + G
Sbjct: 121 VFGCGHDNEGL----FVGAAGLLGLGAGKLSFPSQLSSRKF-----SYCLVSRDNGVRAS 171
Query: 371 GYMFLGHDLVP-SWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV------ 422
+ G +P S A+ +L +P ++ Y+ + I+ G + L++ + ++
Sbjct: 172 SALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGR 231
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
G + D+G+S T AY+ + + + + + A+D +L + + ++ V
Sbjct: 232 GGVIIDSGTSVTRLPTYAYTVMRDAFRSATQK--LPRAADFSL---FDTCYDFSALTSVT 286
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDIS 541
T++ HF + + P YLV + G C S + I+G+I
Sbjct: 287 --IPTVSFHFEGGASV-----QLPPSNYLVPVDTSGTFCFAFSKTSLDLS----IIGNIQ 335
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+ V D + R+G+A C
Sbjct: 336 QQTMRVAIDLDSSRVGFAPRQC 357
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 113/386 (29%), Positives = 160/386 (41%), Gaps = 42/386 (10%)
Query: 190 FPLRGNIYPDGLYFTYMI-VGNPPRPYYLDMDTGSDLTWIQCDAPCSS---CAKGANPLY 245
P R Y D L F + +G P +P L DTGSDL+W+QC PC S C +PL+
Sbjct: 131 IPDRSGTYLDTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQ-PCGSSGHCHPQQDPLF 189
Query: 246 KPRMGNILPYKDSLCMEIQRNHKPGYC-ETCQQCDYEIEYADHSSSMGVLARDELHLTIE 304
P + Y C E Q C E C Y + Y D SS+ GVL+RD L LT
Sbjct: 190 DPSKSST--YAAVHCGEPQCAAAGDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSS 247
Query: 305 NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
P FGC L + DG+LGL R ++SLPSQ A+ V +CL
Sbjct: 248 RALTGFP---FGCGTRN----LGDFGRVDGLLGLGRGELSLPSQAAAS--FGAVFSYCLP 298
Query: 365 TNAGGGGYMFLGHDLVPSWGMA-WVPMLDSP-FMELYHTEILKINYGSSPLNLGARNSQV 422
++ GY+ +G G A + ML P F Y E++ I+ G L +
Sbjct: 299 SSNSTTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFTR 358
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP-----VCWRAKFPIRS 477
G L D+G+ TY QAY+ L + L ++ P P C+ F S
Sbjct: 359 GGTLLDSGTVLTYLPAQAYALLRDRFR------LTMERYTPAPPNDVLDACY--DFAGES 410
Query: 478 IVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIIL 537
V V ++ FG F + G ++ + CL + I+
Sbjct: 411 EVVVP----AVSFRFGD-----GAVFELDFFGVMIFLDENVGCLA-FAAMDTGGLPLSII 460
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHC 563
G+ R V+YD ++IG+ + C
Sbjct: 461 GNTQQRSAEVIYDVAAEKIGFVPASC 486
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 107/429 (24%), Positives = 170/429 (39%), Gaps = 57/429 (13%)
Query: 159 SVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLD 218
+ G+ R S ++ +S VA S G G Y + VG PPR + +
Sbjct: 112 AARSGVARMPASSSPRRALSERMVATVES------GVAVGSGEYLIDVYVGTPPRRFRMI 165
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETC 275
MDTGSDL W+QC APC C + P++ P + + D C + P C
Sbjct: 166 MDTGSDLNWLQC-APCLDCFEQRGPVFDPAASSSYRNVTCGDQRCGLVAPPEAPRACRRP 224
Query: 276 QQ--CDYEIEYADHSSSMGVLARDE--LHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVK 331
+ C Y Y D S++ G LA + ++LT S VVFGC + +GL
Sbjct: 225 AEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGL 284
Query: 332 TDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG--------YMFLGHDLVPSW 383
L R +S SQL + + + +CL + G Y+ L H P
Sbjct: 285 LG----LGRGPLSFASQL--RAVYGHTFSYCLVEHGSDAGSKVVFGEDYLVLAH---PQL 335
Query: 384 GMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV-----GWALFDTGSSYTYFTK 438
SP Y+ ++ + G LN+ + V G + D+G++ +YF +
Sbjct: 336 KYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVGKDGSGGTIIDSGTTLSYFVE 395
Query: 439 QAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF--GSKW 496
AY + + ++ S L P L C+ R V L+L F G+ W
Sbjct: 396 PAYQVIRQAFVDLMSRLYPLIPDFPVLNPCYNVSGVERPEV------PELSLLFADGAVW 449
Query: 497 QIVSTKF--HISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNK 554
+ + + P+G + ++ +G G+ I+G+ + VVYD N
Sbjct: 450 DFPAENYFVRLDPDGIMCLAVRGTPRTGM-----------SIIGNFQQQNFHVVYDLQNN 498
Query: 555 RIGWAKSHC 563
R+G+A C
Sbjct: 499 RLGFAPRRC 507
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 102 bits (254), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 106/377 (28%), Positives = 170/377 (45%), Gaps = 46/377 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YFT + VG PP+ Y+ +DTGSD+ W+QC APC +C +P++ P + L
Sbjct: 40 GEYFTRIGVGTPPKYVYMVLDTGSDIVWLQC-APCKNCYSQTDPVFNPVKSG--SFAKVL 96
Query: 260 CME--IQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVVFG 316
C +R PG C Q C Y++ Y D S + G + L TK V G
Sbjct: 97 CRTPLCRRLESPG-CNQRQTCLYQVSYGDGSYTTGEFVTETLTF-----RRTKVEQVALG 150
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG--GYMF 374
C +D +GL V G+LGL R +S PSQ +CL + +
Sbjct: 151 CGHDNEGL----FVGAAGLLGLGRGGLSFPSQAGR--TFNQKFSYCLVDRSASSKPSSVV 204
Query: 375 LGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLN-LGARNSQV-----GWALF 427
G+ V S + P+L +P ++ Y+ E+L I+ G +P++ + A + ++ G +
Sbjct: 205 FGNSAV-SRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVII 263
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKT 487
D G+S T K AY +L++ G S P + + + + VK T
Sbjct: 264 DCGTSVTRLNKPAY----IALRDAFRAGASSLKSAPEFSL-FDTCYDLSGKTTVK--VPT 316
Query: 488 LTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
+ LHF G+ + ++ + I +G G C + +G +II G+I +G
Sbjct: 317 VVLHFRGADVSLPASNYLIPVDG------SGRFCFAF---AGTTSGLSII-GNIQQQGFR 366
Query: 547 VVYDNVNKRIGWAKSHC 563
VVYD + R+G++ C
Sbjct: 367 VVYDLASSRVGFSPRGC 383
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 102/413 (24%), Positives = 171/413 (41%), Gaps = 48/413 (11%)
Query: 164 IIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGS 223
++ + IN+ +N + DS S P G Y VG PP Y +DTGS
Sbjct: 53 VVNAARRSINR----ANRLFKDSLSNTPESTVYVNGGEYLMTYSVGTPPFNVYGVVDTGS 108
Query: 224 DLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDY 280
D+ W+QC PC C K P++ P + +P +LC ++ C C+Y
Sbjct: 109 DIVWLQC-KPCEQCYKQTTPIFNPSKSSSYKNIPCSSNLCQSVRYTS----CNKQNSCEY 163
Query: 281 EIEYADHSSSMGVLARDELHLTIENG-SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLS 339
I ++D S S G L+ + L L G S++ P V GC ++ +G+ +T GI+GL
Sbjct: 164 TINFSDQSYSQGELSVETLTLDSTTGHSVSFPKTVIGCGHNNRGMFQG---ETSGIVGLG 220
Query: 340 RAKVSLPSQLASQGIIKNVVGHC---LTTNAGGGGYMFLGHDLVPSW-GMAWVPMLDSPF 395
VSL +QL S I +C L ++ + G V S G+ P +
Sbjct: 221 IGPVSLTTQLKSS--IGGKFSYCLLPLLVDSNKTSKLNFGDAAVVSGDGVVSTPFVKKDP 278
Query: 396 MELYHTEILKINYGSSPLNLGA-RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSD 454
Y+ + + G+ + +S+ G + D+G++ T Y+ L +++ ++
Sbjct: 279 QAFYYLTLEAFSVGNKRIEFEVLDDSEEGNIILDSGTTLTLLPSHVYTNLESAVAQLVKL 338
Query: 455 GLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGS---KWQIVSTKFHISPEGYL 511
V D + L +C+ SI + F +T HF K +ST H++
Sbjct: 339 DRV-DDPNQLLNLCY-------SITSDQYDFPIITAHFKGADIKLNPISTFAHVA----- 385
Query: 512 VISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCM 564
G +CL + I G+++ LV YD + + S C+
Sbjct: 386 ----DGVVCLAFTSSQ-----TGPIFGNLAQLNLLVGYDLQQNIVSFKPSDCI 429
>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 511
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 101/395 (25%), Positives = 176/395 (44%), Gaps = 54/395 (13%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP---RMGNILPYKD 257
L++T++ +G P + + +D GSDL WI CD C CA + Y + P
Sbjct: 80 LHYTWIDIGTPNISFLVALDAGSDLLWIPCD--CIQCAPLSASYYGSLDRDLNQYSPSGS 137
Query: 258 SLCMEIQRNHKPGYCETCQQCD-------YEIE-YADHSSSMGVLARDELHLT-----IE 304
S + +H+ CE+ CD Y I Y++++SS G+L D LHLT
Sbjct: 138 STSKHLSCSHQ--LCESSPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDAS 195
Query: 305 NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
N S+ P V+ GC Q G L+ V DG++GL ++S+PS L+ G++KN C
Sbjct: 196 NSSVRAP-VIIGCGMRQTGGYLDG-VAPDGLMGLGLGEISVPSFLSKAGLVKNSFSLCF- 252
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGW 424
N G +F G + + D + E Y + GSS + +
Sbjct: 253 -NDDDSGRIFFGDQGLATQQTTLFLPSDGKY-ETYIVGVEACCIGSSCIKQTSFR----- 305
Query: 425 ALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
AL D+G+S+T+ ++Y ++ K+V++ + C+++ K+
Sbjct: 306 ALVDSGASFTFLPDESYRNVVDEFDKQVNATRFSFEGY--PWEYCYKSS--------SKE 355
Query: 484 FFK--TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI--CLGILDGSEVHNGSTIILGD 539
K ++ L F ++ F + ++V +G + CL I + +G ILG
Sbjct: 356 LLKNPSVILKFA-----LNNSFVVHNPVFVVHGYQGVVGFCLAI----QPADGDIGILGQ 406
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPF 574
+ G +V+D N ++GW++S+C + + +P
Sbjct: 407 NFMTGYRMVFDRENLKLGWSRSNCQDLTDGERMPL 441
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 159/372 (42%), Gaps = 45/372 (12%)
Query: 209 GNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQR 265
G+P + +DTGSDLTW+QC PCS+C +PL+ P + S C +
Sbjct: 197 GSPAANLTVIVDTGSDLTWVQCK-PCSACYAQRDPLFDPAGSATYAAVRCNASACAASLK 255
Query: 266 --NHKPGYCETC-QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQ 322
PG C ++C Y + Y D S S GVLA D T+ G + VFGC +
Sbjct: 256 AATGTPGSCGGGNERCYYALAYGDGSFSRGVLATD----TVALGGASLDGFVFGCGLSNR 311
Query: 323 GLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL--TTNAGGGGYMFLGHDLV 380
GL T G++GL R ++SL SQ A + V +CL TT+ G + LG D
Sbjct: 312 GLFGGTA----GLMGLGRTELSLVSQTALR--YGGVFSYCLPATTSGDASGSLSLGGDAS 365
Query: 381 P---SWGMAWVPMLDSPFM-ELYHTEILKINYGSSPL---NLGARNSQVGWALFDTGSSY 433
+ +A+ M+ P Y + G + L LGA N L D+G+
Sbjct: 366 SYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASN-----VLIDSGTVI 420
Query: 434 TYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF 492
T Y + A ++ ++ G L C + + +VK TL L
Sbjct: 421 TRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSILDTC----YDLTGHDEVKVPLLTLRLEG 476
Query: 493 GSKWQIVSTKFHISPEGYLVISKKGN-ICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
G++ + + V+ K G+ +CL + S + T I+G+ + + VVYD
Sbjct: 477 GAEVTVDAAGM------LFVVRKDGSQVCLAM--ASLSYEDQTPIIGNYQQKNKRVVYDT 528
Query: 552 VNKRIGWAKSHC 563
V R+G+A C
Sbjct: 529 VGSRLGFADEDC 540
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 102/391 (26%), Positives = 167/391 (42%), Gaps = 60/391 (15%)
Query: 193 RGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN- 251
R PDG ++G P Y +DTGSDL W QC PC C K + P++ P +
Sbjct: 163 RERRVPDG-----RVIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSST 216
Query: 252 --ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLT 309
+P + C ++ + C + +C Y Y D SS+ GVLA + L
Sbjct: 217 YATVPCSSASCSDLPTSK----CTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK---- 268
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT----T 365
P VVFGC +G + + G++GL R +SL SQL G+ K +CLT T
Sbjct: 269 LPGVVFGCGDTNEG---DGFSQGAGLVGLGRGPLSLVSQL---GLDK--FSYCLTSLDDT 320
Query: 366 NAGG---GGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNL-----G 416
N G + + + P++ +P Y+ + I GS+ ++L
Sbjct: 321 NNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFA 380
Query: 417 ARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT---LPVCWRAKF 473
++ G + D+G+S TY Q Y +LK+ + + L A+D + L +C+RA
Sbjct: 381 VQDDGTGGVIVDSGTSITYLEVQGYR----ALKKAFAAQMALPAADGSGVGLDLCFRA-- 434
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGILDGSEVHNG 532
P + + V+ L HF + + E Y+V+ G +CL ++ +
Sbjct: 435 PAKGVDQVE--VPRLVFHFDGGADL-----DLPAENYMVLDGGSGALCLTVMGSRGLS-- 485
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G+ + VYD + + +A C
Sbjct: 486 ---IIGNFQQQNFQFVYDVGHDTLSFAPVQC 513
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 153/356 (42%), Gaps = 38/356 (10%)
Query: 219 MDTGSDLTWIQCDAPCSS-CAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPG------Y 271
+DTGS L+W+QC PC+ C A+PLY P + YK C ++ +
Sbjct: 3 LDTGSSLSWLQCQ-PCAVYCHAQADPLYDPSVSKT--YKKLSCASVECSRLKAATLNDPL 59
Query: 272 CET-CQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLV 330
CET C Y Y D S S+G L++D L LT S T P +GC D QGL
Sbjct: 60 CETDSNACLYTASYGDTSFSIGYLSQDLLTLT---SSQTLPQFTYGCGQDNQGL----FG 112
Query: 331 KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHDLVPSWGMAWVP 389
+ GI+GL+R K+S+ +QL+++ + +CL T N+G G FL + + P
Sbjct: 113 RAAGIIGLARDKLSMLAQLSTK--YGHAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTP 170
Query: 390 ML-DSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASL 448
ML DS LY + I PL+L A +V L D+G+ T Y+ L +
Sbjct: 171 MLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVP-TLIDSGTVITRLPMSMYAALRQAF 229
Query: 449 KEVSSDGLVLDASDPTLPVCWRAKF-PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISP 507
++ S + L C++ I ++ ++K F+ +
Sbjct: 230 VKIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQG------------GADLTLRA 277
Query: 508 EGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
L+ + KG CL S + I+G+ + + YD RIG+A C
Sbjct: 278 PSILIEADKGITCLAFAGSSGTNQ--IAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 99/377 (26%), Positives = 159/377 (42%), Gaps = 51/377 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YFT + VGNP + YY+ +DTGSD+ WIQC PCS C + ++P++ P + Y
Sbjct: 157 GEYFTRVGVGNPAKSYYMVLDTGSDINWIQCQ-PCSDCYQQSDPIFTPAASS--SYSPLT 213
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
C Q N QC Y++ Y D S + G + + GS T ++ GC +
Sbjct: 214 CDSQQCNSLQMSSCRNGQCRYQVNYGDGSFTFGDFVTETMSF---GGSGTVNSIALGCGH 270
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDL 379
D +GL V G+LGL +SL SQL + +CL N L +
Sbjct: 271 DNEGL----FVGAAGLLGLGGGPLSLTSQLKATSF-----SYCL-VNRDSAASSTLDFNS 320
Query: 380 VPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGAR-----NSQVGWALFDTGSSY 433
P P+L S ++ Y+ + ++ G L + +S G + D G++
Sbjct: 321 APVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVDCGTAI 380
Query: 434 TYFTKQAYSELIASLKEV-----SSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
T +AY+ L S + S+ G+ L C+ +S V V T+
Sbjct: 381 TRLQSEAYNSLRDSFVSMSRHLRSTSGVAL------FDTCY--DLSGQSSVKV----PTV 428
Query: 489 TLHF--GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
+ HF G W + + + I + G C + S I+G++ +G
Sbjct: 429 SFHFDGGKSWDLPAANYLIP------VDSAGTYCFAFAPTTS----SLSIIGNVQQQGTR 478
Query: 547 VVYDNVNKRIGWAKSHC 563
V +D N R+G++ + C
Sbjct: 479 VSFDLANNRVGFSTNKC 495
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 102 bits (253), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 98/380 (25%), Positives = 147/380 (38%), Gaps = 47/380 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYK 256
G YF + VG+PP YL +D+GSD+ WIQC PC+ C + A+PL+ P +P
Sbjct: 131 GEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCR-PCAECYQQADPLFDPAASASFTAVPCD 189
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
+C + C C Y++ Y D S + GVLA + L S V G
Sbjct: 190 SGVCRTLPGGSSG--CADSGACRYQVSYGDGSYTQGVLAMETLTF---GDSTPVQGVAIG 244
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA--GGGGYMF 374
C + +GL V G+LGL +SL QL +CL + G G +
Sbjct: 245 CGHRNRGL----FVGAAGLLGLGWGPMSLVGQLGGA--AGGAFSYCLASRGADAGAGSLV 298
Query: 375 LGHDLVPSWGMAWVPMLDSPFMELYH----------TEILKINYGSSPLNLGARNSQVGW 424
G D G WVP+L + ++ E L + G L G
Sbjct: 299 FGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLT----EDGGGG 354
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+ DTG++ T AY+ L + L L C + + V+
Sbjct: 355 VVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTC----YDLSGYASVR-- 408
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI-CLGILDGSEVHNGSTIILGDISLR 543
T+ L+FG ++ P L++ G + CL ILG+I +
Sbjct: 409 VPTVALYFGRDGAALTL-----PARNLLVEMGGGVYCLAF----AASASGLSILGNIQQQ 459
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
G + D+ N +G+ S C
Sbjct: 460 GIQITVDSANGYVGFGPSTC 479
>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 525
Score = 102 bits (253), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 104/390 (26%), Positives = 174/390 (44%), Gaps = 53/390 (13%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDA-PCSSCAKGANPL-------YKPRMGNI 252
L++T++ +G P + + +D GSD+ W+ CD C+S + G + Y+P + N
Sbjct: 104 LHYTWIDIGTPNVSFLVALDAGSDMLWVPCDCIECASLSAGNYNVLDRDLNQYRPSLSNT 163
Query: 253 ---LPYKDSLCMEIQRNHKPGYCETCQQ-CDYEIEYAD-HSSSMGVLARDELHLT----- 302
LP LC +C+ + C YE++YA ++SS G + D+LHLT
Sbjct: 164 SRHLPCGHKLC------DVHSFCKGSKDPCPYEVQYASANTSSSGYVFEDKLHLTSDGKH 217
Query: 303 IENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC 362
E S+ + +++ GC Q G L+ DG+LGL +S+PS LA G+I+N C
Sbjct: 218 AEQNSV-QASIILGCGRKQTGDYLHG-AGPDGVLGLGPGNISVPSLLAKAGLIQNSFSIC 275
Query: 363 LTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV 422
L N G + G V +PF+ + + ++ L L Q
Sbjct: 276 LDENE--SGRIIFGDQ-------GHVTQHSTPFLPIIAYMVGVESFCVGSLCLKETRFQ- 325
Query: 423 GWALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
AL D+GSS+T+ + Y +++ K+V++ +VL +S C+ A + +V++
Sbjct: 326 --ALIDSGSSFTFLPNEVYQKVVTEFDKQVNASRIVLQSS---WEYCYNAS--SQELVNI 378
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
L L F T +P Y S++ + L S + I G
Sbjct: 379 ----PPLKLAFSRN----QTFLIQNPIFYDPASQEQEYTIFCLPVSPSADDYAAI-GQNF 429
Query: 542 LRGQLVVYDNVNKRIGWAKSHCMNPGRFKS 571
L G +V+D N R GW++ +C + F S
Sbjct: 430 LMGYRLVFDRENLRFGWSRWNCQDRASFTS 459
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 109/418 (26%), Positives = 186/418 (44%), Gaps = 59/418 (14%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYP-DGLYFTYMIVGNPPRPYYLDMDTGS 223
IR + +L A+A+ +SS + + P +G + + +G PP Y +DTGS
Sbjct: 59 IRHGVKRGRNRLQRLQAMALVASSSSEIEAPVLPGNGEFLMKLAIGTPPETYSAILDTGS 118
Query: 224 DLTWIQCDAPCSSCAKGANPLYKP---RMGNILPYKDSLCMEIQRNHKPGYCETCQQ-CD 279
DL W QC PC+ C + P++ P + L LC + ++ +C C+
Sbjct: 119 DLIWTQCK-PCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCEALPQS-------SCNNGCE 170
Query: 280 YEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLS 339
Y Y D+SS+ G+LA + T+ G + PNV FGC D +G + + G++GL
Sbjct: 171 YLYSYGDYSSTQGILASE----TLTFGKASVPNVAFGCGADNEG---SGFSQGAGLVGLG 223
Query: 340 RAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDL----VPSWGMAWVPMLDSPF 395
R +SL SQL + +CLTT L L S + P++ SP
Sbjct: 224 RGPLSLVSQLK-----EPKFSYCLTTVDDTKTSTLLMGSLASVNASSSAIKTTPLIHSPA 278
Query: 396 M-ELYHTEILKINYGSSPL-----NLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLK 449
Y+ + I+ G + L ++ G + D+G++ TY + A++ L+A K
Sbjct: 279 HPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFN-LVA--K 335
Query: 450 EVSSD-GLVLDASDPT-LPVCWRAKFPIRSI-VDVKQFFKTLTLHF-GSKWQIVSTKFHI 505
E ++ L +D+S T L VC+ P S ++V + L HF G+ ++ + + I
Sbjct: 336 EFTAKINLPVDSSGSTGLDVCF--TLPSGSTNIEVPK----LVFHFDGADLELPAENYMI 389
Query: 506 SPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
S G CL + S + I G++ + LV++D + + + + C
Sbjct: 390 GD------SSMGVACLAMGSSSGMS-----IFGNVQQQNMLVLHDLEKETLSFLPTQC 436
>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
Length = 530
Score = 101 bits (252), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 101/395 (25%), Positives = 176/395 (44%), Gaps = 54/395 (13%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP---RMGNILPYKD 257
L++T++ +G P + + +D GSDL WI CD C CA + Y + P
Sbjct: 99 LHYTWIDIGTPNISFLVALDAGSDLLWIPCD--CIQCAPLSASYYGSLDRDLNQYSPSGS 156
Query: 258 SLCMEIQRNHKPGYCETCQQCD-------YEIE-YADHSSSMGVLARDELHLT-----IE 304
S + +H+ CE+ CD Y I Y++++SS G+L D LHLT
Sbjct: 157 STSKHLSCSHQ--LCESSPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDAS 214
Query: 305 NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
N S+ P V+ GC Q G L+ V DG++GL ++S+PS L+ G++KN C
Sbjct: 215 NSSVRAP-VIIGCGMRQTGGYLDG-VAPDGLMGLGLGEISVPSFLSKAGLVKNSFSLCF- 271
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGW 424
N G +F G + + D + E Y + GSS + +
Sbjct: 272 -NDDDSGRIFFGDQGLATQQTTLFLPSDGKY-ETYIVGVEACCIGSSCIKQTSFR----- 324
Query: 425 ALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
AL D+G+S+T+ ++Y ++ K+V++ + C+++ K+
Sbjct: 325 ALVDSGASFTFLPDESYRNVVDEFDKQVNATRFSFEGY--PWEYCYKSS--------SKE 374
Query: 484 FFK--TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI--CLGILDGSEVHNGSTIILGD 539
K ++ L F ++ F + ++V +G + CL I + +G ILG
Sbjct: 375 LLKNPSVILKFA-----LNNSFVVHNPVFVVHGYQGVVGFCLAI----QPADGDIGILGQ 425
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPF 574
+ G +V+D N ++GW++S+C + + +P
Sbjct: 426 NFMTGYRMVFDRENLKLGWSRSNCQDLTDGERMPL 460
>gi|356496606|ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 508
Score = 101 bits (252), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 99/378 (26%), Positives = 160/378 (42%), Gaps = 47/378 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG-NILPYKDS- 258
L+F + VG PP + + +DTGSDL W+ C+ C+ C G ++ NI K S
Sbjct: 100 LHFANVSVGTPPLSFLVALDTGSDLFWLPCN--CTKCVHGIGLSNGEKIAFNIYDLKGSS 157
Query: 259 -----LC----MEIQRNHKPGYCETCQQ-CDYEIEY-ADHSSSMGVLARDELHL--TIEN 305
LC E+QR C + C YE+ Y ++ +S+ G L D LHL +
Sbjct: 158 TSQPVLCNSSLCELQRQ-----CPSSDTICPYEVNYLSNGTSTTGFLVEDVLHLITDDDK 212
Query: 306 GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
+ FGC Q G L+ +G+ GL + S+PS LA +G+ N C +
Sbjct: 213 TKDADTRITFGCGQVQTGAFLDG-AAPNGLFGLGMSNESVPSILAKEGLTSNSFSMCFGS 271
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
+ G G + G + S P Y+ + +I G +L A
Sbjct: 272 D--GLGRITFGDN--SSLVQGKTPFNLRALHPTYNITVTQIIVGEKVDDLEFH------A 321
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
+FD+G+S+TY AY ++ S +S LP + + V++
Sbjct: 322 IFDSGTSFTYLNDPAYKQITNSFNSEIKLQRHSTSSSNELPFEYCYELSPNQTVELS--- 378
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQ 545
LT+ G + + +S EG + +CLG+L + V+ I+G + G
Sbjct: 379 INLTMKGGDNYLVTDPIVTVSGEGINL------LCLGVLKSNNVN-----IIGQNFMTGY 427
Query: 546 LVVYDNVNKRIGWAKSHC 563
+V+D N +GW +S+C
Sbjct: 428 RIVFDRENMILGWRESNC 445
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 101 bits (252), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 108/397 (27%), Positives = 164/397 (41%), Gaps = 61/397 (15%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG- 250
+ G+ G YF +G PP+ + L +D+GSDL W+QC APC C PLY P
Sbjct: 55 VSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQC-APCLQCYAQDTPLYAPSNSS 113
Query: 251 --NILPYKDSLCMEIQRN-------HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL 301
N +P C+ I H PG C YE YAD S S GV A +
Sbjct: 114 TFNPVPCLSPECLLIPATEGFPCDFHYPGACA------YEYRYADTSLSKGVFAYES--A 165
Query: 302 TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
T+++ + K V FGC D QG + G+LGL + +S SQ+ N +
Sbjct: 166 TVDDVRIDK--VAFGCGRDNQG----SFAAAGGVLGLGQGPLSFGSQVGYA--YGNKFAY 217
Query: 362 CLTTN---AGGGGYMFLGHDLVPS-WGMAWVPML-DSPFMELYHTEILKINYGSSPLNLG 416
CL ++ G +L+ + + + P++ +S LY+ +I K+ G L +
Sbjct: 218 CLVNYLDPTSVSSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPI- 276
Query: 417 ARNSQVGWAL---------FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPV 467
S W+L FD+G++ TY+ AY ++A+ + + AS L +
Sbjct: 277 ---SHSAWSLDFLGNGGSIFDSGTTVTYWLPPAYRNILAAFDK--NVRYPRAASVQGLDL 331
Query: 468 CWRAKFPIRSIVDVKQ-FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDG 526
C + V Q F + T+ G F Y V CL + G
Sbjct: 332 C-------VDVTGVDQPSFPSFTIVLGG-----GAVFQPQQGNYFVDVAPNVQCLA-MAG 378
Query: 527 SEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G +G++ + LV YD RIG+A + C
Sbjct: 379 LPSSVGGFNTIGNLLQQNFLVQYDREENRIGFAPAKC 415
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/386 (23%), Positives = 167/386 (43%), Gaps = 34/386 (8%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGA-NPLYKPRMGNILPYK-- 256
G YF + +G PP+ L DTGSDL W++C A C +C+ + ++ PR +
Sbjct: 82 GQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSA-CRNCSHHSPATVFFPRHSSTFSPAHC 140
Query: 257 -DSLCMEIQRNHKPGYCETCQ---QCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-P 311
D +C + + + C + C YE YAD S + G+ AR+ L +G +
Sbjct: 141 YDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLK 200
Query: 312 NVVFGCAYDQQGLLLN--TLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TTN 366
+V FGC + G ++ + +G++GL R +S SQL + N +CL T +
Sbjct: 201 SVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRR--FGNKFSYCLMDYTLS 258
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGAR-----NS 420
Y+ +G+ + + P+L +P Y+ ++ + + L + +S
Sbjct: 259 PPPTSYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDS 318
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
G + D+G++ + + AY +IA+++ + DA P +C + +
Sbjct: 319 GNGGTVVDSGTTLAFLAEPAYRSVIAAVRR-RVKLPIADALTPGFDLCVN----VSGVTK 373
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDI 540
++ L F V P Y + +++ CL I + G ++I G++
Sbjct: 374 PEKILPRLKFEFSGGAVFVP-----PPRNYFIETEEQIQCLAI-QSVDPKVGFSVI-GNL 426
Query: 541 SLRGQLVVYDNVNKRIGWAKSHCMNP 566
+G L +D R+G+++ C P
Sbjct: 427 MQQGFLFEFDRDRSRLGFSRRGCALP 452
>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 513
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 104/380 (27%), Positives = 165/380 (43%), Gaps = 50/380 (13%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC-----AKGANPL----YKPRMGN 251
L++ + VG P + + +DTGSDL W+ CD C++C A G + L Y P +
Sbjct: 103 LHYANVTVGTPSDWFMVALDTGSDLFWLPCD--CTNCVRELKAPGGSSLDLNIYSPNASS 160
Query: 252 I---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTIENGS 307
+P +LC R P C Y+I Y ++ +SS GVL D LHL + N
Sbjct: 161 TSTKVPCNSTLCTRGDRCASPE-----SDCPYQIRYLSNGTSSTGVLVEDVLHL-VSNDK 214
Query: 308 LTK---PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
+K V FGC Q G+ + +G+ GL +S+PS LA +GI N C
Sbjct: 215 SSKAIPARVTFGCGQVQTGVFHDG-AAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFG 273
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGW 424
+ G G + G S P+ Y+ + KI+ G + +L
Sbjct: 274 ND--GAGRISFGDK--GSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFD------ 323
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
A+FD+G+S+TY T AY+ + S ++ D +D LP + + + D Q+
Sbjct: 324 AVFDSGTSFTYLTDAAYTLISESFNSLALDKR-YQTTDSELP--FEYCYALSPNKDSFQY 380
Query: 485 -FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
LT+ GS + + +H P + + CL I+ ++ I+G +
Sbjct: 381 PAVNLTMKGGSSYPV----YH--PLVVIPMKDTDVYCLAIMKIEDIS-----IIGQNFMT 429
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
G VV+D +GW +S C
Sbjct: 430 GYRVVFDREKLILGWKESDC 449
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 117/417 (28%), Positives = 170/417 (40%), Gaps = 74/417 (17%)
Query: 169 KSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFT----YMIVGNPPRPYYLDMDTGSD 224
K++ L + + S+ P+ Y DG FT ++ G PP+ L +DTGSD
Sbjct: 51 KARATHLLSAQDQSGRGRSASAPVNPGAYDDGFPFTEYLVHLAAGTPPQEVQLTLDTGSD 110
Query: 225 LTWIQCD-APCSSCAKGANPLYKPRMGN---ILPYKDSLCMEIQRNHKPGYCETCQQCDY 280
+TW QC P S+C PL+ P + LP C E G T + C+Y
Sbjct: 111 ITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSSPAC-ETTPPCGGGNDATSRPCNY 169
Query: 281 EIEYADHSSSMGVLARDELHL---TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILG 337
I Y D S S G + R+ T E S P +VFGC + +G+ + GI G
Sbjct: 170 SISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLVFGCGHANRGVFTS---NETGIAG 226
Query: 338 LSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWV-PMLDSPFM 396
R +SLPSQL + N HC TT G L G+ V P SP
Sbjct: 227 FGRGSLSLPSQLK----VGN-FSHCFTTITGSKTSAVL-------LGLPGVAPPSASP-- 272
Query: 397 ELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAY----SELIASLKEVS 452
L GS R+S ++G+S T + Y E A +K
Sbjct: 273 -------LGRRRGSYRCRSTPRSS-------NSGTSITSLPPRTYRAVREEFAAQVKLPV 318
Query: 453 SDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV 512
G +A+DP C+ A P+R K T+ LHF + + ++ + V
Sbjct: 319 VPG---NATDPF--TCFSA--PLRG---PKPDVPTMALHFEGATMRLPQENYV----FEV 364
Query: 513 I--SKKGN----ICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ GN ICL +++G E IILG+I + V+YD N ++ + + C
Sbjct: 365 VDDDDAGNSSRIICLAVIEGGE------IILGNIQQQNMHVLYDLQNSKLSFVPAQC 415
>gi|449451627|ref|XP_004143563.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 532
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 101/395 (25%), Positives = 172/395 (43%), Gaps = 53/395 (13%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYK---- 256
L++T++ +G P + + +D GSDL W+ C+ C CA + Y ++ Y+
Sbjct: 102 LHYTWIDIGTPSVSFLVALDAGSDLLWVPCN--CIQCAPLSASYYGSLDKDLNEYRPSSS 159
Query: 257 ---------DSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLT--IE 304
+LC Q P Q C Y I+Y +++SS G+L +D LHL+ E
Sbjct: 160 STSKHISCSHNLCDSGQSCQSPK-----QSCPYVIDYITENTSSSGLLIQDVLHLSSGCE 214
Query: 305 NGS--LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC 362
N S + V+ GC Q G L+ V DG+ GL ++S+ S LA + +++N C
Sbjct: 215 NSSNCTIQAPVILGCGMKQSGGYLSG-VAPDGLFGLGLGEISVLSSLAKEELVQNSFSLC 273
Query: 363 LTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV 422
N G G +F G + S LD + E Y + +S L +
Sbjct: 274 F--NEDGSGRIFFGDEGPASQQTTSFVPLDGKY-ETYIVGVEACCIENSCLKQTSFK--- 327
Query: 423 GWALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
AL D+G+S+TY ++AY ++ K +++ V P W+ + I + D
Sbjct: 328 --ALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYP-----WKYCYKISA--DA 378
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKG--NICLGILDGSEVHNGSTIILGD 539
++TL F ++ F + + + +G C IL +G ILG
Sbjct: 379 MPKVPSVTLLFP-----LNNSFVVHDPVFPIYGDQGLAGFCFAILPA----DGDIGILGQ 429
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPF 574
+ G +V+D N ++GW+ ++C + K +P
Sbjct: 430 NYMTGYRMVFDRDNLKLGWSHANCQDLSNEKKMPL 464
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 96/402 (23%), Positives = 171/402 (42%), Gaps = 33/402 (8%)
Query: 169 KSKINKKLVSSNAVA-VDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTW 227
+S+++K L N V +DS+++ G + Y+ + +G P R L DTGS LTW
Sbjct: 106 QSRLSKNLGGENRVKELDSTTLPAKSGRLIGSADYYVVVGLGTPKRDLSLIFDTGSYLTW 165
Query: 228 IQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY 284
QC+ SC K +P++ P + + SLC + + T C Y+++Y
Sbjct: 166 TQCEPCAGSCYKQQDPIFDPSKSSSYTNIKCTSSLCTQFRSAGCSS--STDASCIYDVKY 223
Query: 285 ADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVS 344
D+S S G L+++ L +T + + +FGC D +GL T G++GLSR +S
Sbjct: 224 GDNSISRGFLSQERLTITATD---IVHDFLFGCGQDNEGLFRGTA----GLMGLSRHPIS 276
Query: 345 LPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLD-SPFMELYHTEI 403
Q +S I + +CL + G++ G + + + P S Y +I
Sbjct: 277 FVQQTSS--IYNKIFSYCLPSTPSSLGHLTFGASAATNANLKYTPFSTISGENSFYGLDI 334
Query: 404 LKINYGSSPL-NLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASD 462
+ I+ G + L + + G ++ D+G+ T AY+ L ++ ++
Sbjct: 335 VGISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLPPTAYAALRSAFRQFMMK-------- 386
Query: 463 PTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLG 522
PV + + + + D + + ++ K + G L +CL
Sbjct: 387 --YPVAYGTRL-LDTCYDFSGYKEISVPRIDFEFA-GGVKVELPLVGILYGESAQQLCLA 442
Query: 523 ILDGSEVHNGSTI-ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
NG+ I I G++ + VVYD RIG+ + C
Sbjct: 443 FAANG---NGNDITIFGNVQQKTLEVVYDVEGGRIGFGAAGC 481
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/383 (24%), Positives = 168/383 (43%), Gaps = 41/383 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + +G PPR + + MDTGSDL W+QC APC C + + P++ P + Y++
Sbjct: 147 GEYLVDVYLGTPPRRFRMIMDTGSDLNWLQC-APCLDCFEQSGPIFDPAAS--ISYRNVT 203
Query: 260 CMEIQ-------RNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTI-ENGSLT 309
C + + P C + C Y Y D S++ G LA + + + ++G+
Sbjct: 204 CGDDRCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRR 263
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII-KNVVGHCLTTN-A 367
V FGC + +GL L R +S SQL +G+ + +CL + +
Sbjct: 264 VDGVAFGCGHRNRGLFHGAAGLLG----LGRGPLSFASQL--RGVYGGHAFSYCLVEHGS 317
Query: 368 GGGGYMFLGHDLV----PSWG-MAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV 422
G + GHD P A+ P D+ Y+ ++ I G +N+ +
Sbjct: 318 AAGSKIIFGHDDALLAHPQLNYTAFAPTTDAD--TFYYLQLKSILVGGEAVNISSDTLSA 375
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
G + D+G++ +YF + AY + + + S L P L C+ + V+V
Sbjct: 376 GGTIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFPVLSPCYNVSGAEK--VEVP 433
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISL 542
+ +L G+ W+ + + ++ + +G +CL +L +G +II G+
Sbjct: 434 EL--SLVFADGAAWEFPAENY------FIRLEPEGIMCLAVL--GTPRSGMSII-GNYQQ 482
Query: 543 RGQLVVYDNVNKRIGWAKSHCMN 565
+ V+YD + R+G+A C +
Sbjct: 483 QNFHVLYDLEHNRLGFAPRRCAD 505
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 78/270 (28%), Positives = 120/270 (44%), Gaps = 31/270 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYK--- 256
G YF M +G+P R YYL++DTGSD+TWIQC APCSSC +P+Y P N Y+
Sbjct: 43 GEYFARMGIGSPQRSYYLELDTGSDVTWIQC-APCSSCYSQVDPIYDP--SNSSSYRRVY 99
Query: 257 --DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
+LC + + G C Y + Y D S+S G L + +L N S N+
Sbjct: 100 CGSALCQALDYSACQG-----MGCSYRVVYGDSSASSGDLGIESFYLG-PNSSTAMRNIA 153
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN----AGGG 370
FGC + GL + +S SQ+A+ I +CL
Sbjct: 154 FGCGHSNSGLFRGEAGLLG----MGGGTLSFFSQIAAS--IGPAFSYCLVDRYSQLQSRS 207
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSS-----PLNLGARNSQVGW 424
+ G +P + + P+L +P ++ ++ IL I+ G + P + G
Sbjct: 208 SPLIFGRTAIP-FAARFTPLLKNPRIDTFYYAILTGISVGGTALPIPPAQFALTGNGTGG 266
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSD 454
A+ D+G+S T AY+ L + + S +
Sbjct: 267 AILDSGTSVTRVVPAAYAVLRDAYRAASRN 296
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 105/395 (26%), Positives = 161/395 (40%), Gaps = 70/395 (17%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPY 255
+G Y M +G PPR Y +DTGSDL W QC APC C P + P LP
Sbjct: 86 EGEYLMSMGIGTPPRYYSAILDTGSDLIWTQC-APCMLCVDQPTPFFDPAQSPSYAKLPC 144
Query: 256 KDSLCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
+C + Y C + C Y+ Y D +++ GVL+ + + +T P +
Sbjct: 145 NSPMCNAL-------YYPLCYRNVCVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRI 197
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG---- 369
FGC G L N G++G R +SL SQL S +CLT+
Sbjct: 198 AFGCGNLNAGSLFN----GSGMVGFGRGPLSLVSQLGSPRF-----SYCLTSFMSPVPSR 248
Query: 370 ---GGYMFLGHDLVPSWGMAWVPMLDSPFM------ELYHTEILKINYGSSPL----NLG 416
G Y L + P+ +PF+ +Y+ + I+ G L ++
Sbjct: 249 LYFGAYATLNS----TSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVF 304
Query: 417 ARNSQ--VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT-----LPVCW 469
A N G + D+GS+ TY + AY + + +D + L ++ T L C+
Sbjct: 305 AINDADGTGGVIIDSGSTITYLARAAYDM----VHQAFADQVGLPLTNATSLADVLDTCF 360
Query: 470 RAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGILDGSE 528
P R IV + + L HF + E Y++I GN+CL I +
Sbjct: 361 VWPPPPRKIVTMPE----LAFHFEGA------NMELPLENYMLIDGDTGNLCLAI---AA 407
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+GS I+G + V+YDN N + + + C
Sbjct: 408 SDDGS--IIGSFQHQNFHVLYDNENSLLSFTPATC 440
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 102/384 (26%), Positives = 173/384 (45%), Gaps = 48/384 (12%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
D L++T++ +G P + + +D GSDL+W+ CD C CA + LYKP ++ Y+ S
Sbjct: 99 DWLHYTWIDIGTPNVSFLVALDAGSDLSWVPCD--CIQCAPLSASLYKPLDRDLSEYRPS 156
Query: 259 LCM---EIQRNHK----PGYCETCQQ-CDYEIEYAD-HSSSMGVLARDELHL-TIENGS- 307
L + NH+ +C+ + C Y +YAD ++SS G L D LHL ++ + S
Sbjct: 157 LSTTSRHLSCNHQLCELGSHCKNLKDPCPYIADYADPNTSSSGFLVEDILHLASVSDDSN 216
Query: 308 ----LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
+ +V+ GC Q G L+ DG++GL +S+PS LA G+I+ C
Sbjct: 217 STQKRVQASVILGCGRKQTGGYLDG-AAPDGVMGLGPGSISVPSLLAKAGLIRKSFSLCF 275
Query: 364 TTNAGGGGYMF--LGHDLVPSWGMAWVPMLDSP-FMELYHTEILKINYGSSPLNLGARNS 420
N G G +F GH S P+L + + Y E+ G+S L
Sbjct: 276 DVN-GSGTILFGDQGHTSQKS-----TPLLPTQGNYDAYLIEVESYCVGNSCLK------ 323
Query: 421 QVGW-ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV 479
Q G+ AL D+G+S+TY Y++++ + + + P W + S
Sbjct: 324 QSGFKALVDSGASFTYLPIDVYNKIVLEFDKQVNAQRISSQGGP-----WNYCYNTSSKQ 378
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
L+ I ++ +++ + CL L ++++ G I+G
Sbjct: 379 LDNVPAMRLSFLMNQSLLIHNSTYYVPQNQEFAV-----FCL-TLQPTDLNYG---IIGQ 429
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
+ G VV+D N ++GW+ S+C
Sbjct: 430 NYMTGYRVVFDMENLKLGWSSSNC 453
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/387 (22%), Positives = 168/387 (43%), Gaps = 50/387 (12%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL---PYKDSLCMEIQ 264
+G PPR L +DT S+LTW+Q C++C+ P + P + + P S+C+
Sbjct: 5 IGTPPREVLLLVDTASELTWVQ-GTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCLG-- 61
Query: 265 RNHKPGYCETCQQ----CDYEIEYADHSSSMGVLARDELHLTIENGSL-TKPNVVFGCAY 319
K G+ C + C +++ Y D S + GV+AR+ L +G+ T +V+FGCA
Sbjct: 62 -RSKLGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCAS 120
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI--IKNVVGHCLTTNA---GGGGYMF 374
L + + G LGL+R S P+Q+ S+ + + +C A G +
Sbjct: 121 KD---LQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVII 177
Query: 375 LGHDLVPSWGMAWVPMLDSP----FMELYHTEILKINYGSSPLNLGARNSQV-----GWA 425
G +P+ ++ + P ++ Y+ + I+ G L++ ++ G
Sbjct: 178 FGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGT 237
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCW-----RAKFPIRSIVD 480
FD+G++ ++ + A++ L+ + SD T +C+ A+ P +V
Sbjct: 238 YFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDARLPTAPLVT 297
Query: 481 VKQFFKT---LTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIIL 537
+ FK + L S W ++ + ICL ++ V G ++
Sbjct: 298 LH--FKNNVDMELREASVWVPLARTPQVV-----------TICLAFVNAGAVAQGGVNVI 344
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHCM 564
G+ + L+ +D RIG+A ++C+
Sbjct: 345 GNYQQQDYLIEHDLERSRIGFAPANCV 371
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 109/389 (28%), Positives = 170/389 (43%), Gaps = 42/389 (10%)
Query: 185 DSSSIFPLRGNIYPDGL-YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS--CAKGA 241
D++ P R + D L Y + G P P L MDTGSD++W+QC PC+S C
Sbjct: 113 DAAVTIPTRLGGFVDSLEYVVTLGFGTPSVPQVLLMDTGSDVSWVQC-TPCNSTKCYPQK 171
Query: 242 NPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDE 298
+PL+ P + + C ++ ++ G QC Y +EYAD S S GV + +
Sbjct: 172 DPLFDPSKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNET 231
Query: 299 LHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNV 358
L L +T + FGC DQ+G K DG+LGL A VSL Q +S +
Sbjct: 232 LTLAP---GITVEDFHFGCGRDQRG----PSDKYDGLLGLGGAPVSLVVQTSS--VYGGA 282
Query: 359 VGHCLTTNAGGGGYMFLGHDLVPSW---GMAWVPMLDSP-FMELYHTEILKINYGSSPLN 414
+CL G++ LG PS + PM P + Y + I+ G PL+
Sbjct: 283 FSYCLPALNSEAGFLVLGSP--PSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLH 340
Query: 415 LGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFP 474
+ +++ G + D+G+ T + AY+ L A+L++ ++ + D C+ F
Sbjct: 341 I-PQSAFRGGMIIDSGTVDTELPETAYNALEAALRKALKAYPLVPSDD--FDTCY--NFT 395
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGST 534
S + V + T G+ + P G LV N CL + S +G
Sbjct: 396 GYSNITVPRV--AFTFSGGATIDL------DVPNGILV-----NDCLAFQE-SGPDDGLG 441
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHC 563
II G+++ R V+YD +G+ C
Sbjct: 442 II-GNVNQRTLEVLYDAGRGNVGFRAGAC 469
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 167/376 (44%), Gaps = 47/376 (12%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
+G Y + G+PP+ + +DTGSDL W QC PC +C A+ ++ P + Y
Sbjct: 77 NGEYLIDISFGSPPQKASVIVDTGSDLIWTQC-LPCETCNAAASVIFDPVKSST--YDTV 133
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
C + P + C Y+ Y D SS+ G L+ + + + + T PNV FGC
Sbjct: 134 SCASNFCSSLP-FQSCTTSCKYDYMYGDGSSTSGALSTETVTVG----TGTIPNVAFGCG 188
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
+ L + GI+GL + +SL SQ +S I +CL L D
Sbjct: 189 HTN----LGSFAGAAGIVGLGQGPLSLISQASS--ITSKKFSYCLVPLGSTKTSPMLIGD 242
Query: 379 LVPSWGMAWVPML-DSPFMELYHTEILKINYGSSPL-----NLGARNSQVGWALFDTGSS 432
+ G+A+ +L ++ Y+ ++ I+ + S G + D+G++
Sbjct: 243 SAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTT 302
Query: 433 YTYFTKQAYSELIASLK-EV---SSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
TY A++ L+A+LK EV +DG + L C F + + + T+
Sbjct: 303 LTYLETGAFNALVAALKAEVPFPEADGSLYG-----LDYC----FSTAGVANPT--YPTM 351
Query: 489 TLHFGSKWQIVSTKFHISPEG-YLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLV 547
T HF + + PE ++ + G+ICL + + I+G+I + L+
Sbjct: 352 TFHFKGA------DYELPPENVFVALDTGGSICLAMAASTGFS-----IMGNIQQQNHLI 400
Query: 548 VYDNVNKRIGWAKSHC 563
V+D VN+R+G+ +++C
Sbjct: 401 VHDLVNQRVGFKEANC 416
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/383 (24%), Positives = 164/383 (42%), Gaps = 47/383 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y + VG P L +DT SDLTW+QC PC C + P++ PR + +
Sbjct: 136 GEYIAKIAVGTPGVEALLALDTASDLTWLQCQ-PCRRCYPQSGPVFDPRHSTSYREMSFN 194
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
+ C + R+ G C Y + Y D S+++G + L G + P + G
Sbjct: 195 AADCQALGRSG--GGDAKRGTCVYTVGYGDGSTTVGDFIEETLTFA---GGVRLPRISIG 249
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYM--- 373
C +D +GL GILGL R +S P+Q+ G +CL G G +
Sbjct: 250 CGHDNKGLF---GAPAAGILGLGRGLMSFPNQIDHNGTFS----YCLVDFLSGPGSLSST 302
Query: 374 --FLGHDLVPSWGMAWVP-MLDSPFMELYHTEILKINYGSSPL-NLGARNSQV------G 423
F + S +++ P +L+ Y+ + I+ G + + R+ Q+ G
Sbjct: 303 LTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDPYTGRG 362
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT--LPVCWRAKFPIRSIVDV 481
+ D+G++ T + AY+ + + V+ D + P+ C+ R + V
Sbjct: 363 GVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYTVGG--RGMKKV 420
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDI 540
T+++HF S + + P+ YL+ + G +C + + S I+G+I
Sbjct: 421 ----PTVSMHFAG-----SVEVKLQPKNYLIPVDSMGTVCFAF---AATGDHSVSIIGNI 468
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+G +VYD + R+G+A + C
Sbjct: 469 QQQGFRIVYD-IGGRVGFAPNSC 490
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/381 (24%), Positives = 153/381 (40%), Gaps = 48/381 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + +G P + L DTGSDLTW QC SC P++ P Y +
Sbjct: 152 GNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKT--YSNIS 209
Query: 260 C-------MEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPN 312
C ++ + PG C + C Y I+Y D S ++G A+D L LT +
Sbjct: 210 CTSTACSGLKSATGNSPG-CSS-SNCVYGIQYGDSSFTVGFFAKDTLTLTQND---VFDG 264
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY 372
+FGC + +GL KT G++GL R +S+ Q A + +CL T+ G G+
Sbjct: 265 FMFGCGQNNRGL----FGKTAGLIGLGRDPLSIVQQTAQK--FGKYFSYCLPTSRGSNGH 318
Query: 373 MFLGH------DLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWAL 426
+ G+ G+ + P S Y ++L I+ G L++ Q +
Sbjct: 319 LTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQNAGTI 378
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF-- 484
D+G+ T Y L ++ K+ S PT P A + + D+ +
Sbjct: 379 IDSGTVITRLPSTVYGSLKSTFKQFMSK-------YPTAP----ALSLLDTCYDLSNYTS 427
Query: 485 --FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISL 542
++ +F + + P G L+ + +CL + + + I G+I
Sbjct: 428 ISIPKISFNFNGNANV-----DLEPNGILITNGASQVCLAFAGNGD--DDTIGIFGNIQQ 480
Query: 543 RGQLVVYDNVNKRIGWAKSHC 563
+ VVYD ++G+ C
Sbjct: 481 QTLEVVYDVAGGQLGFGYKGC 501
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 101/389 (25%), Positives = 152/389 (39%), Gaps = 58/389 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYK 256
G Y + +G PP Y +DTGSDL W QC APC CA P ++P ++P +
Sbjct: 90 GEYLMDLAIGTPPLRYTAMVDTGSDLIWTQC-APCVLCADQPTPYFRPARSATYRLVPCR 148
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVVF 315
LC + C C Y+ Y D +S+ GVLA + N S + +V F
Sbjct: 149 SPLCAALPYPA----CFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAF 204
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT----------- 364
GC G L N + G++GL R +SL SQL + +CLT
Sbjct: 205 GCGNINSGQLAN----SSGMVGLGRGPLSLVSQLG-----PSRFSYCLTSFLSPEPSRLN 255
Query: 365 ---------TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL 415
TNA G LV + A +P L FM L + + PL
Sbjct: 256 FGVFATLNGTNASSSGSPVQSTPLVVN---AALPSLY--FMSLKGISLGQKRLPIDPLVF 310
Query: 416 GARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPI 475
+ G D+G+S T+ + AY + L V + ++ L C+ P
Sbjct: 311 AINDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPPP 370
Query: 476 RSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGILDGSEVHNGST 534
V V + LHF + PE Y++I G +CL + + +G
Sbjct: 371 SVAVTVPD----MELHFDG-----GANMTVPPENYMLIDGATGFLCLAM-----IRSGDA 416
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G+ + ++YD N + + + C
Sbjct: 417 TIIGNYQQQNMHILYDIANSLLSFVPAPC 445
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 88/299 (29%), Positives = 132/299 (44%), Gaps = 25/299 (8%)
Query: 154 ESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLR-GNIYPDGLYFTYMIVGNPP 212
++ V ++N + R ++ K +++ + S PL G G Y+ + G+P
Sbjct: 70 DARVKTLNSRLTR-KDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYVKVGFGSPA 128
Query: 213 RPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRN------ 266
R Y + +DTGS L+W+QC C A+PL+ P YK C Q +
Sbjct: 129 RYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKT--YKSLSCTSSQCSSLVDAT 186
Query: 267 -HKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGL 324
+ P CET C Y Y D S SMG L++D L L S T P V+GC D GL
Sbjct: 187 LNNP-LCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAP---SQTLPGFVYGCGQDSDGL 242
Query: 325 LLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWG 384
+ GILGL R K+S+ Q++S+ +CL T GGGG++ +G +
Sbjct: 243 ----FGRAAGILGLGRNKLSMLGQVSSK--FGYAFSYCLPTR-GGGGFLSIGKASLAGSA 295
Query: 385 MAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYS 442
+ PM P LY + I G L + A +V + D+G+ T Y+
Sbjct: 296 YKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVP-TIIDSGTVITRLPMSVYT 353
>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
Length = 515
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 95/381 (24%), Positives = 158/381 (41%), Gaps = 52/381 (13%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP--------LYKPRMGNI 252
L++ + VG P + + +DTGSDL W+ CD + + P +Y P +
Sbjct: 103 LHYANVTVGTPSDWFLVALDTGSDLFWLPCDCSTNCVRELKAPGGSSLDLNIYSPNASST 162
Query: 253 ---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTI--ENG 306
+P +LC + R P C Y+I Y ++ +SS GVL D LHL +N
Sbjct: 163 SSKVPCNSTLCTRVDRCASP-----LSDCPYQIRYLSNGTSSTGVLVEDVLHLVSMEKNS 217
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
+ + GC Q G+ + +G+ GL +S+PS LA +GI N C +
Sbjct: 218 KPIRARITLGCGLVQTGVFHDG-AAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGDD 276
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWAL 426
G G + G S P+ Y+ + +I+ G + +L A+
Sbjct: 277 --GAGRISFGDK--GSVDQRETPLNIRQPHPTYNVTVTQISVGGNTGDLEFD------AV 326
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
FDTG+S+TY T Y+ + S ++ D S+ C+ ++ K+ F+
Sbjct: 327 FDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEYCY-------AVSPNKKSFE 379
Query: 487 ----TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISL 542
LT+ GS + + + E +V CL I+ ++ I+G +
Sbjct: 380 YPDVNLTMKGGSSYPVYHPLIVVPIEDTVV------YCLAIMKSEDIS-----IIGQNFM 428
Query: 543 RGQLVVYDNVNKRIGWAKSHC 563
G VV+D +GW +S C
Sbjct: 429 TGYRVVFDREKLILGWKESDC 449
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/297 (29%), Positives = 133/297 (44%), Gaps = 27/297 (9%)
Query: 168 HKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFT-----YMI---VGNPPRPYYLDM 219
+++ ++ VS A A + + P L F+ Y++ +G P L++
Sbjct: 89 RRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEV 148
Query: 220 DTGSDLTWIQCDAPCSS--CAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQ- 276
DTGSD++W+QC PC S C +PL+ P + Y C + Y C
Sbjct: 149 DTGSDVSWVQCK-PCPSPPCYSQRDPLFDPTRSS--SYSAVPCAAASCSQLALYSNGCSG 205
Query: 277 -QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGI 335
QC Y + Y D S++ GV + D L LT GS +FGC + QQGL DG+
Sbjct: 206 GQCGYVVSYGDGSTTTGVYSSDTLTLT---GSNALKGFLFGCGHAQQGL----FAGVDGL 258
Query: 336 LGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPF 395
LGL R SL SQ +S V +CL GY+ LG + G + P+L +
Sbjct: 259 LGLGRQGQSLVSQASS--TYGGVFSYCLPPTQNSVGYISLGGP-SSTAGFSTTPLLTASN 315
Query: 396 MELYHTEILK-INYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEV 451
Y+ +L I+ G PL++ A G A+ DTG+ T AYS L ++ +
Sbjct: 316 DPTYYIVMLAGISVGGQPLSIDASVFASG-AVVDTGTVVTRLPPTAYSALRSAFRAA 371
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 75/252 (29%), Positives = 117/252 (46%), Gaps = 18/252 (7%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL-PY--KDS 258
Y + +G+P + MDTGSD++W+QC PCS C + L+ P + P+ +
Sbjct: 131 YVITVGIGSPAVTQTMSMDTGSDVSWVQCK-PCSQCHSEVDSLFDPSASSTYSPFSCSSA 189
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
C+++ ++ + C + QC Y + Y D SS+ G + D L L GS FGC+
Sbjct: 190 ACVQLSQSQQGNGCSS-SQCQYIVSYVDGSSTTGTYSSDTLTL----GSNAIKGFQFGCS 244
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
+ G + +TDG++GL SL SQ A G +CL G G++ LG
Sbjct: 245 QSESGGFSD---QTDGLMGLGGDAQSLVSQTA--GTFGKAFSYCLPPTPGSSGFLTLG-- 297
Query: 379 LVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQVGWALFDTGSSYTYFT 437
G PML S + Y+ +L+ I G LN+ G ++ D+G+ T
Sbjct: 298 AASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSAG-SVMDSGTVITRLP 356
Query: 438 KQAYSELIASLK 449
AYS L ++ K
Sbjct: 357 PTAYSALSSAFK 368
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 101/389 (25%), Positives = 152/389 (39%), Gaps = 58/389 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYK 256
G Y + +G PP Y +DTGSDL W QC APC CA P ++P ++P +
Sbjct: 90 GEYLMDLAIGTPPLRYTAMVDTGSDLIWTQC-APCVLCADQPTPYFRPARSATYRLVPCR 148
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVVF 315
LC + C C Y+ Y D +S+ GVLA + N S + +V F
Sbjct: 149 SPLCAALPYPA----CFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAF 204
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT----------- 364
GC G L N + G++GL R +SL SQL + +CLT
Sbjct: 205 GCGNINSGQLAN----SSGMVGLGRGPLSLVSQLG-----PSRFSYCLTSFLSPEPSRLN 255
Query: 365 ---------TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL 415
TNA G LV + A +P L FM L + + PL
Sbjct: 256 FGVFATLNGTNASSSGSPVQSTPLVVN---AALPSLY--FMSLKGISLGQKRLPIDPLVF 310
Query: 416 GARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPI 475
+ G D+G+S T+ + AY + L V + ++ L C+ P
Sbjct: 311 AINDDGTGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPPP 370
Query: 476 RSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGILDGSEVHNGST 534
V V + LHF + PE Y++I G +CL + + +G
Sbjct: 371 SVAVTVPD----MELHFDG-----GANMTVPPENYMLIDGATGFLCLAM-----IRSGDA 416
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G+ + ++YD N + + + C
Sbjct: 417 TIIGNYQQQNMHILYDIANSLLSFVPAPC 445
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/297 (29%), Positives = 133/297 (44%), Gaps = 27/297 (9%)
Query: 168 HKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFT-----YMI---VGNPPRPYYLDM 219
+++ ++ VS A A + + P L F+ Y++ +G P L++
Sbjct: 100 RRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEV 159
Query: 220 DTGSDLTWIQCDAPCSS--CAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQ- 276
DTGSD++W+QC PC S C +PL+ P + Y C + Y C
Sbjct: 160 DTGSDVSWVQCK-PCPSPPCYSQRDPLFDPTRSS--SYSAVPCAAASCSQLALYSNGCSG 216
Query: 277 -QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGI 335
QC Y + Y D S++ GV + D L LT GS +FGC + QQGL DG+
Sbjct: 217 GQCGYVVSYGDGSTTTGVYSSDTLTLT---GSNALKGFLFGCGHAQQGL----FAGVDGL 269
Query: 336 LGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPF 395
LGL R SL SQ +S V +CL GY+ LG + G + P+L +
Sbjct: 270 LGLGRQGQSLVSQASS--TYGGVFSYCLPPTQNSVGYISLGGP-SSTAGFSTTPLLTASN 326
Query: 396 MELYHTEILK-INYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEV 451
Y+ +L I+ G PL++ A G A+ DTG+ T AYS L ++ +
Sbjct: 327 DPTYYIVMLAGISVGGQPLSIDASVFASG-AVVDTGTVVTRLPPTAYSALRSAFRAA 382
>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
Length = 492
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 106/396 (26%), Positives = 178/396 (44%), Gaps = 56/396 (14%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP---RMGNILPYKD 257
L++T++ +G P + + +D+GSDL W+ CD C CA + Y + P +
Sbjct: 97 LHYTWIDIGTPHVSFMVALDSGSDLFWVPCD--CVQCAPLSASHYSSLDRDLSEYSPSQS 154
Query: 258 SLCMEIQRNHK-----PGYCETCQQCDYEIE-YADHSSSMGVLARDELHLT-----IENG 306
S ++ +H+ P Q C Y I Y + +SS G+L D +HL N
Sbjct: 155 STSKQLSCSHRLCDMGPNCKNPKQSCPYSINYYTESTSSSGLLVEDIIHLASGGDDTLNT 214
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
S+ P V+ GC Q G L+ V DG+LGL ++S+PS LA G+I+N C N
Sbjct: 215 SVKAP-VIIGCGMKQSGGYLDG-VAPDGLLGLGLQEISVPSFLAKAGLIQNSFSMCF--N 270
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMEL---YHTEILKIN---YGSSPLNLGARNS 420
G +F G D P+ + +PF++L Y T I+ + G+S L + +
Sbjct: 271 EDDSGRIFFG-DQGPATQQS------APFLKLNGNYTTYIVGVEVCCVGTSCLKQSSFS- 322
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
AL D+G+S+T+ + E+IA + + + C++ S D
Sbjct: 323 ----ALVDSGTSFTFLPDDVF-EMIAEEFDTQVNASRSSFEGYSWKYCYKT-----SSQD 372
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI--CLGILDGSEVHNGSTIILG 538
+ + +L L F + F + +++ +G I CL I + +G +G
Sbjct: 373 LPK-IPSLRLIFPQ-----NNSFMVQNPVFMIYGIQGVIGFCLAI----QPADGDIGTIG 422
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPF 574
+ G VV+D N ++GW++S+C G +LP
Sbjct: 423 QNFMMGYRVVFDRENLKLGWSRSNCEFSGISYTLPL 458
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 112/389 (28%), Positives = 173/389 (44%), Gaps = 48/389 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG----NILPY 255
G YF M VG PP+ +L +DTGSDL+WIQCD PC C + P Y P NI Y
Sbjct: 168 GEYFIDMFVGTPPKHVWLILDTGSDLSWIQCD-PCYDCFEQNGPHYNPNESSSYRNISCY 226
Query: 256 KDSLCMEIQRNHKPGYCET-CQQCDYEIEYADHSSSMGVLARD--ELHLTIENGSLTKPN 312
D C + +C+T Q C Y +YAD S++ G A + ++LT NG +
Sbjct: 227 -DPRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKH 285
Query: 313 VV---FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TN 366
VV FGC + +G GL R +S PSQL Q I + +CLT +N
Sbjct: 286 VVDVMFGCGHWNKGFFHGAGGLL----GLGRGPLSFPSQL--QSIYGHSFSYCLTDLFSN 339
Query: 367 AGGGGYMFLGHD--LVPSWGMAWVPML---DSPFMELYHTEILKINYGSSPLNLGAR--- 418
+ G D L+ + + +L ++P Y+ +I I G L++ +
Sbjct: 340 TSVSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWH 399
Query: 419 --NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIR 476
+ VG + D+GS+ T+F AY ++I E + A D + C+ ++
Sbjct: 400 WSSEGVGGTIIDSGSTLTFFPDSAY-DVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQ 458
Query: 477 SIVDVKQFFKTLTLHF--GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGST 534
V++ + +HF G+ W + + E V ICL IL H+ T
Sbjct: 459 --VELPDY----GIHFADGAVWNFPAENYFYQYEPDEV------ICLAILKTPN-HSHLT 505
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHC 563
II G++ + ++YD R+G++ C
Sbjct: 506 II-GNLLQQNFHILYDVKRSRLGYSPRRC 533
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 99/385 (25%), Positives = 165/385 (42%), Gaps = 55/385 (14%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPY 255
+G + + +G P Y +DTGSDL W QC PC C K + P++ P + +P
Sbjct: 92 NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPC 150
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
+ C ++ + C + +C Y Y D SS+ GVLA + L P VVF
Sbjct: 151 SSASCSDLPTSK----CTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----LPGVVF 202
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT----TNAGG-- 369
GC +G + + G++GL R +SL SQL G+ K +CLT TN
Sbjct: 203 GCGDTNEG---DGFSQGAGLVGLGRGPLSLVSQL---GLDK--FSYCLTSLDDTNNSPLL 254
Query: 370 -GGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNL-----GARNSQV 422
G + + + P++ +P Y+ + I GS+ ++L ++
Sbjct: 255 LGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGT 314
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT---LPVCWRAKFPIRSIV 479
G + D+G+S TY Q Y +LK+ + + L A+D + L +C+RA P + +
Sbjct: 315 GGVIVDSGTSITYLEVQGYR----ALKKAFAAQMALPAADGSGVGLDLCFRA--PAKGVD 368
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGILDGSEVHNGSTIILG 538
V+ L HF + E Y+V+ G +CL ++ + I+G
Sbjct: 369 QVE--VPRLVFHFDG-----GADLDLPAENYMVLDGGSGALCLTVMGSRGLS-----IIG 416
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHC 563
+ + VYD + + +A C
Sbjct: 417 NFQQQNFQFVYDVGHDTLSFAPVQC 441
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 102/391 (26%), Positives = 160/391 (40%), Gaps = 54/391 (13%)
Query: 193 RGNIYPD-GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
+ + PD G Y VG PP Y +DTGSD+ W+QC+ PC C P++ P +
Sbjct: 77 QSTVIPDIGEYLMTYSVGTPPFKLYGIVDTGSDIVWLQCE-PCQECYNQTTPMFNPSKSS 135
Query: 252 I---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENG-S 307
+P LC ++ C C+Y Y D+S S G L+ D L L NG +
Sbjct: 136 SYKNIPCPSKLCQSMEDTS----CNDKNYCEYSTYYGDNSHSGGDLSVDTLTLESTNGLT 191
Query: 308 LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--- 364
++ PN+V GC + +L+ + GI+G S +QL S K +CLT
Sbjct: 192 VSFPNIVIGCGTNN---ILSYEGASSGIVGFGSGPASFITQLGSSTGGK--FSYCLTPLF 246
Query: 365 --TNAGGGGYM---FLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA-- 417
TN F V G+ P+L Y+ + + G+ + +G
Sbjct: 247 SVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGVP 306
Query: 418 RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK----- 472
G + D+G++ T TK YS L +++ ++ V D + TL +C+ K
Sbjct: 307 NGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPTQ-TLNLCYSVKAEGYD 365
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNG 532
FPI +T+HF + P V G CL + S+ H
Sbjct: 366 FPI------------ITMHFK------GADVDLHPISTFVSVADGVFCLA-FESSQDH-- 404
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I G+++ + +V YD K + + S C
Sbjct: 405 --AIFGNLAQQNLMVGYDLQQKIVSFKPSDC 433
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 110/421 (26%), Positives = 178/421 (42%), Gaps = 53/421 (12%)
Query: 160 VNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYP-DGLYFTYMIVGNPPRPYYLD 218
V GI R KS++ K A + S L I+ +G Y + +G PP Y
Sbjct: 66 VQHGIKR-GKSRLQKLNAMVLAASSTPDSEDQLEAPIHAGNGEYLIELAIGTPPVSYPAV 124
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYK---DSLCMEIQRNHKPGYCETC 275
+DTGSDL W QC PC+ C K P++ P+ + SLC + + TC
Sbjct: 125 LDTGSDLIWTQC-KPCTRCYKQPTPIFDPKKSSSFSKVSCGSSLCSALPSS-------TC 176
Query: 276 QQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDG 334
C+Y Y D+S + GVLA + ++ N+ FGC D +G + + G
Sbjct: 177 SDGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEG---DGFEQASG 233
Query: 335 ILGLSRAKVSLPSQLASQGIIKNVVGHCLT-TNAGGGGYMFLGH--DLVPSWGMAWVPML 391
++GL R +SL SQL Q +CLT + + LG + + + P+L
Sbjct: 234 LVGLGRGPLSLVSQLKEQRF-----SYCLTPIDDTKESVLLLGSLGKVKDAKEVVTTPLL 288
Query: 392 DSPFM-ELYHTEILKINYGSSPLNLGARNSQV-----GWALFDTGSSYTYFTKQAYSELI 445
+P Y+ + I+ G + L++ +V G + D+G++ TY ++AY L
Sbjct: 289 KNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALK 348
Query: 446 ASLKEVSSDGLVLDASDPT-LPVCWRAKFPIRSI-VDVKQFFKTLTLHFGSKWQIVSTKF 503
+S L LD + T L +C+ P S V++ + L HF
Sbjct: 349 KEF--ISQTKLALDKTSSTGLDLCF--SLPSGSTQVEIPK----LVFHFKGG------DL 394
Query: 504 HISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSH 562
+ E Y++ S G CL + S + I G++ + LV +D + I + +
Sbjct: 395 ELPAENYMIGDSNLGVACLAMGASSGMS-----IFGNVQQQNILVNHDLEKETISFVPTS 449
Query: 563 C 563
C
Sbjct: 450 C 450
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 99/385 (25%), Positives = 165/385 (42%), Gaps = 55/385 (14%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPY 255
+G + + +G P Y +DTGSDL W QC PC C K + P++ P + +P
Sbjct: 71 NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPC 129
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
+ C ++ + C + +C Y Y D SS+ GVLA + L P VVF
Sbjct: 130 SSASCSDLPTSK----CTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----LPGVVF 181
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT----TNAGG-- 369
GC +G + + G++GL R +SL SQL G+ K +CLT TN
Sbjct: 182 GCGDTNEG---DGFSQGAGLVGLGRGPLSLVSQL---GLDK--FSYCLTSLDDTNNSPLL 233
Query: 370 -GGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNL-----GARNSQV 422
G + + + P++ +P Y+ + I GS+ ++L ++
Sbjct: 234 LGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGT 293
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT---LPVCWRAKFPIRSIV 479
G + D+G+S TY Q Y +LK+ + + L A+D + L +C+RA P + +
Sbjct: 294 GGVIVDSGTSITYLEVQGYR----ALKKAFAAQMALPAADGSGVGLDLCFRA--PAKGVD 347
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGILDGSEVHNGSTIILG 538
V+ L HF + E Y+V+ G +CL ++ + I+G
Sbjct: 348 QVE--VPRLVFHFDG-----GADLDLPAENYMVLDGGSGALCLTVMGSRGLS-----IIG 395
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHC 563
+ + VYD + + +A C
Sbjct: 396 NFQQQNFQFVYDVGHDTLSFAPVQC 420
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 96/377 (25%), Positives = 168/377 (44%), Gaps = 59/377 (15%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLC---M 261
VG PP P + +DTGSDL W+QC PC+ C + + P++ P + L Y +C
Sbjct: 97 VGRPPVPQLVGIDTGSDLLWVQC-RPCADCFRQSTPIFDPSKSSTYVDLSYDSPICPNSP 155
Query: 262 EIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGSLTKPNVVFGCAYD 320
+ + NH QC Y YAD S+S G LA +++ T + G++T +VVFGC +
Sbjct: 156 QKKYNH-------LNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHS 208
Query: 321 QQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGH--- 377
+G + GILGLS S+ S+L S+ +C+ G +F H
Sbjct: 209 NRGRFDG---QQSGILGLSAGDQSIVSRLGSR------FSYCI-------GDLFDPHYTH 252
Query: 378 -DLVPSWGMAWVPMLDSPFME---LYHTEILKINYGSSPLNLGAR-----NSQVGWALFD 428
LV G+ + +PF Y+ + I+ G + L++ S G + D
Sbjct: 253 NQLVLGDGVK-MEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 311
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP--VCWRAKFPIRSIVDVKQFFK 486
+G++ T+ K + L ++ + G T+P +C++ + + + + F
Sbjct: 312 SGTTATFLAKDGFDPLSNEIQRLVR-GHFQQVIYRTIPGWLCYKGR-----VNEDLRGFP 365
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
L HF +V + V + CL +L+ + + GS ++G ++ +
Sbjct: 366 ELAFHFAEGADLV-----LDANSLFVQKNQDVFCLAVLESNLKNIGS--VIGIMAQQHYN 418
Query: 547 VVYDNVNKRIGWAKSHC 563
V YD + KR+ + ++ C
Sbjct: 419 VAYDLIGKRVYFQRTDC 435
>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 520
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 107/393 (27%), Positives = 177/393 (45%), Gaps = 51/393 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS-- 258
L++T++ +G P + + +D GSDL WI CD C CA ++ Y ++ Y S
Sbjct: 95 LHYTWIDIGTPSTSFLVALDAGSDLLWIPCD--CVQCAPLSSSYYSNLDRDLNEYSPSRS 152
Query: 259 -----LCMEIQRNHKPGYCETC-QQCDYEIEY-ADHSSSMGVLARDELHL----TIENGS 307
L Q K C++ QQC Y + Y ++++SS G+L D LHL ++ N S
Sbjct: 153 LSSKHLSCSHQLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLLVEDILHLQSGGSLSNSS 212
Query: 308 LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
+ P VV GC Q G L+ V DG+LGL + S+PS LA G+I + C N
Sbjct: 213 VQAP-VVLGCGMKQSGGYLDG-VAPDGLLGLGPGESSVPSFLAKSGLIHDSFSLCF--NE 268
Query: 368 GGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKIN---YGSSPLNLGARNSQVGW 424
G +F G D P+ + + P LY T I+ + G+S L + + QV
Sbjct: 269 DDSGRIFFG-DQGPTIQQSTSFL---PLDGLYSTYIIGVESCCVGNSCLKMTSFKVQV-- 322
Query: 425 ALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
D+G+S+T+ Y + ++V+ + S W + + S ++ +
Sbjct: 323 ---DSGTSFTFLPGHVYGAIAEEFDQQVNGSRSSFEGSP------WEYCY-VPSSQELPK 372
Query: 484 FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI--CLGILDGSEVHNGSTIILGDIS 541
+LTL F + F + ++ +G I CL I + G +G
Sbjct: 373 -VPSLTLTFQQ-----NNSFVVYDPVFVFYGNEGVIGFCLAI----QPTEGDMGTIGQNF 422
Query: 542 LRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPF 574
+ G +V+D NK++ W++S+C + K +P
Sbjct: 423 MTGYRLVFDRGNKKLAWSRSNCQDLSLGKRMPL 455
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 99/385 (25%), Positives = 165/385 (42%), Gaps = 55/385 (14%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPY 255
+G + + +G P Y +DTGSDL W QC PC C K + P++ P + +P
Sbjct: 102 NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPC 160
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
+ C ++ + C + +C Y Y D SS+ GVLA + L P VVF
Sbjct: 161 SSASCSDLPTSK----CTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----LPGVVF 212
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT----TNAGG-- 369
GC +G + + G++GL R +SL SQL G+ K +CLT TN
Sbjct: 213 GCGDTNEG---DGFSQGAGLVGLGRGPLSLVSQL---GLDK--FSYCLTSLDDTNNSPLL 264
Query: 370 -GGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNL-----GARNSQV 422
G + + + P++ +P Y+ + I GS+ ++L ++
Sbjct: 265 LGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGT 324
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT---LPVCWRAKFPIRSIV 479
G + D+G+S TY Q Y +LK+ + + L A+D + L +C+RA P + +
Sbjct: 325 GGVIVDSGTSITYLEVQGYR----ALKKAFAAQMALPAADGSGVGLDLCFRA--PAKGVD 378
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGILDGSEVHNGSTIILG 538
V+ L HF + E Y+V+ G +CL ++ + I+G
Sbjct: 379 QVE--VPRLVFHFDG-----GADLDLPAENYMVLDGGSGALCLTVMGSRGLS-----IIG 426
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHC 563
+ + VYD + + +A C
Sbjct: 427 NFQQQNFQFVYDVGHDTLSFAPVQC 451
>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
Length = 632
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 94/386 (24%), Positives = 159/386 (41%), Gaps = 56/386 (14%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYK-------------- 246
L++T++ +G P + + +D+GSDL WI C+ C CA ++ Y
Sbjct: 96 LHYTWIDIGTPSVSFLVALDSGSDLLWIPCN--CVQCAPLSSAYYSSLATKDLNEFDPSA 153
Query: 247 PRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYA-DHSSSMGVLARDELHL--TI 303
+ P LC P +QC Y + YA +++SS G+L D LHL +
Sbjct: 154 STTSKVFPCSHKLCESAPACESPK-----EQCPYTVTYASENTSSSGLLVEDVLHLAYSA 208
Query: 304 ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
S K VV GC Q G L + DG++GL ++S+PS LA G+++N C
Sbjct: 209 NASSSVKARVVVGCGEKQSGEFLKG-IAPDGVMGLGPGEISVPSFLAKAGLMRNSFSMCF 267
Query: 364 TTNAGGGGYMFLGHDLVPSWGMA--WVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQ 421
G Y D+ PS + ++P + E+ + G+S L + +
Sbjct: 268 DEEDSGRIYF---GDVGPSTQQSTRFLPYKNEFVAYFVGVEVCCV--GNSCLKQSSFTT- 321
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
L D+G S+T+ ++ Y E+ + + + V C+ F +
Sbjct: 322 ----LIDSGQSFTFLPEEIYREVALEI-DSHINATVKKIEGGPWEYCYETSFEPK----- 371
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKG--NICLGILDGSEVHNGSTIILGD 539
+ L F S + F I +++ +G CL I S G+ ++G
Sbjct: 372 ---VPAIKLKFSS-----NNTFVIHKPLFVLQRSEGLVQFCLPI---SASEEGTGGVIGQ 420
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHCMN 565
+ G +V+D N ++GW+ S C
Sbjct: 421 NYMAGYRIVFDRENMKLGWSASKCQE 446
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 116/455 (25%), Positives = 190/455 (41%), Gaps = 65/455 (14%)
Query: 129 HKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKK-LVSSNAVAVDSS 187
H F +R + D L RF E + V G R H+ +N L ++NA D
Sbjct: 49 HGFRVR-LKHVDHVKNLTRF-----ERLRRGVARGKNRLHR--LNAMVLAAANATVGDQV 100
Query: 188 SIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP 247
+ GN G + + +G+PPR + MDTGSDL W QC PC C + P++ P
Sbjct: 101 KAPVVAGN----GEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCK-PCQQCFDQSTPIFDP 155
Query: 248 RMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL--TIEN 305
+ + YK S E+ C + C+Y Y D SS+ GVLA + + E+
Sbjct: 156 KQSSSF-YKISCSSELCGALPTSTCSS-DGCEYLYTYGDSSSTQGVLAFETFTFGDSTED 213
Query: 306 GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
++ P + FGC D G + + G++GL R +SL SQL Q +CLT
Sbjct: 214 -QISIPGLGFGCGNDNNG---DGFSQGAGLVGLGRGPLSLVSQLKEQKF-----AYCLTA 264
Query: 366 -NAGGGGYMFLGH--DLVPSWG---MAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGAR 418
+ + LG ++ P M P++ +P Y+ + I+ G + L++
Sbjct: 265 IDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKS 324
Query: 419 NSQV-----GWALFDTGSSYTYFTKQAYS----ELIASLKEVSSDGLVLDASDPTLPVCW 469
++ G + D+G++ TY A++ E IA + V D+ L +C+
Sbjct: 325 TFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLP-----VDDSGTGGLDLCF 379
Query: 470 RAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSE 528
+ K LT HF + E Y++ SK G +CL I
Sbjct: 380 NLPAGTNQVEVPK-----LTFHF------KGADLELPGENYMIGDSKAGLLCLAIGSSRG 428
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ I G++ + +VV+D + + + + C
Sbjct: 429 MS-----IFGNLQQQNFMVVHDLQEETLSFLPTQC 458
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 96/377 (25%), Positives = 168/377 (44%), Gaps = 59/377 (15%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLC---M 261
VG PP P + +DTGSDL W+QC PC+ C + + P++ P + L Y +C
Sbjct: 65 VGRPPVPQLVGIDTGSDLLWVQC-RPCADCFRQSTPIFDPSKSSTYVDLSYDSPICPNSP 123
Query: 262 EIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGSLTKPNVVFGCAYD 320
+ + NH QC Y YAD S+S G LA +++ T + G++T +VVFGC +
Sbjct: 124 QKKYNH-------LNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHS 176
Query: 321 QQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGH--- 377
+G + GILGLS S+ S+L S+ +C+ G +F H
Sbjct: 177 NRGRFDG---QQSGILGLSAGDQSIVSRLGSR------FSYCI-------GDLFDPHYTH 220
Query: 378 -DLVPSWGMAWVPMLDSPFME---LYHTEILKINYGSSPLNLGAR-----NSQVGWALFD 428
LV G+ + +PF Y+ + I+ G + L++ S G + D
Sbjct: 221 NQLVLGDGVK-MEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 279
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP--VCWRAKFPIRSIVDVKQFFK 486
+G++ T+ K + L ++ + G T+P +C++ + + + + F
Sbjct: 280 SGTTATFLAKDGFDPLSNEIQRLVR-GHFQQVIYRTIPGWLCYKGR-----VNEDLRGFP 333
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
L HF +V + V + CL +L+ + + GS ++G ++ +
Sbjct: 334 ELAFHFAEGADLV-----LDANSLFVQKNQDVFCLAVLESNLKNIGS--VIGIMAQQHYN 386
Query: 547 VVYDNVNKRIGWAKSHC 563
V YD + KR+ + ++ C
Sbjct: 387 VAYDLIGKRVYFQRTDC 403
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 103/379 (27%), Positives = 166/379 (43%), Gaps = 53/379 (13%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPY 255
+G + + +G P Y MDTGSDL W QC PC C P++ P + LP
Sbjct: 94 NGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCK-PCKVCFDQPTPIFDPEKSSSFSKLPC 152
Query: 256 KDSLCMEIQRNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
LC+ + +C C+Y Y DHSS+ GVLA + T + S++K +
Sbjct: 153 SSDLCVALP-------ISSCSDGCEYRYSYGDHSSTQGVLATET--FTFGDASVSK--IG 201
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT--NAGGGGY 372
FGC D +G + G++GL R +SL SQL G+ K +CLT+ ++ G
Sbjct: 202 FGCGEDNRG---RAYSQGAGLVGLGRGPLSLISQL---GVPK--FSYCLTSIDDSKGIST 253
Query: 373 MFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPL-----NLGARNSQVGWAL 426
+ +G + + P++ +P Y+ + I+ G + L ++ G +
Sbjct: 254 LLVGSEATVKSAIP-TPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLI 312
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT-LPVCWRAKFPIRSIVDVKQFF 485
D+G++ TY A++ L +S L +DAS T L +C+ P S VDV Q
Sbjct: 313 IDSGTTITYLKDSAFAALKKEF--ISQMKLDVDASGSTELELCFTLP-PDGSPVDVPQ-- 367
Query: 486 KTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
L HF G ++ + I V ICL + S + I G+ +
Sbjct: 368 --LVFHFEGVDLKLPKENYIIEDSALRV------ICLTMGSSSGMS-----IFGNFQQQN 414
Query: 545 QLVVYDNVNKRIGWAKSHC 563
+V++D + I +A + C
Sbjct: 415 IVVLHDLEKETISFAPAQC 433
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 96/377 (25%), Positives = 168/377 (44%), Gaps = 59/377 (15%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLC---M 261
VG PP P + +DTGSDL W+QC PC+ C + + P++ P + L Y +C
Sbjct: 65 VGRPPVPQLVGIDTGSDLLWVQC-RPCADCFRQSTPIFDPSKSSTYVDLSYDSPICPNSP 123
Query: 262 EIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGSLTKPNVVFGCAYD 320
+ + NH QC Y YAD S+S G LA +++ T + G++T +VVFGC +
Sbjct: 124 QKKYNH-------LNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHS 176
Query: 321 QQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGH--- 377
+G + GILGLS S+ S+L S+ +C+ G +F H
Sbjct: 177 NRGRFDG---QQSGILGLSAGDQSIVSRLGSR------FSYCI-------GDLFDPHYTH 220
Query: 378 -DLVPSWGMAWVPMLDSPFME---LYHTEILKINYGSSPLNLGAR-----NSQVGWALFD 428
LV G+ + +PF Y+ + I+ G + L++ S G + D
Sbjct: 221 NQLVLGDGVK-MEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 279
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP--VCWRAKFPIRSIVDVKQFFK 486
+G++ T+ K + L ++ + G T+P +C++ + + + + F
Sbjct: 280 SGTTATFLAKDGFDPLSNEIQRLVR-GHFQQVIYRTIPGWLCYKGR-----VNEDLRGFP 333
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
L HF +V + V + CL +L+ + + GS ++G ++ +
Sbjct: 334 ELAFHFAEGADLV-----LDANSLFVQKNQDVFCLAVLESNLKNIGS--VIGIMAQQHYN 386
Query: 547 VVYDNVNKRIGWAKSHC 563
V YD + KR+ + ++ C
Sbjct: 387 VAYDLIGKRVYFQRTDC 403
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 114/399 (28%), Positives = 163/399 (40%), Gaps = 68/399 (17%)
Query: 195 NIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILP 254
N P Y ++ +G PP+P L +DTGSDL W QC PC +C A P + P + L
Sbjct: 28 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLS 86
Query: 255 Y---KDSLC--MEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLT 309
+LC + + P + Q C Y Y D S + G L D+ S+
Sbjct: 87 LTSCDSTLCQGLPVASCGSPKFWPN-QTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASV- 144
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTD--GILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
P V FGC GL N + K++ GI G R +SLPSQL + N HC TT
Sbjct: 145 -PGVAFGC-----GLFNNGVFKSNETGIAGFGRGPLSLPSQLK----VGN-FSHCFTTIT 193
Query: 368 GG---GGYMFLGHDLVPSWGMAWV---PMLDSPFME----LYHTEILKINYGSS----PL 413
G + L DL S G V P++ E LY+ + I GS+ P
Sbjct: 194 GAIPSTVLLDLPADLF-SNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPE 252
Query: 414 NLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP------- 466
+ A + G + D+G+S T Q Y +V D P +P
Sbjct: 253 SAFALTNGTGGTIIDSGTSITSLPPQVY--------QVVRDEFAAQIKLPVVPGNATGHY 304
Query: 467 VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN--ICLGIL 524
C+ A P ++ DV + L LHF + + ++ + V GN ICL I
Sbjct: 305 TCFSA--PSQAKPDVPK----LVLHFEGATMDLPRENYV----FEVPDDAGNSIICLAIN 354
Query: 525 DGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G E T I+G+ + V+YD N + + + C
Sbjct: 355 KGDE-----TTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 388
>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
Length = 513
Score = 99.8 bits (247), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 106/398 (26%), Positives = 169/398 (42%), Gaps = 68/398 (17%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI-------- 252
L++ + +G P + + +DTGSDL W+ CD C CA PL P G++
Sbjct: 98 LHYAVVALGTPNVTFLVALDTGSDLFWVPCD--CLKCA----PLQSPNYGSLKFDVYSPA 151
Query: 253 -------LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTIE 304
+P +LC ++Q + C Y I+Y +D++SS GVL D L+LT +
Sbjct: 152 QSTTSRKVPCSSNLC-DLQNACR----SKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSD 206
Query: 305 NGS---LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
+ +T P ++FGC Q G L + +G+LGL S+PS LAS+G+ N
Sbjct: 207 SAQSKIVTAP-IMFGCGQVQTGSFLGS-AAPNGLLGLGMDSKSVPSLLASKGLAANSFSM 264
Query: 362 CLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQ 421
C G G+ + S P+ Y+ I I GS + S
Sbjct: 265 CF----GDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSI------ST 314
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
A+ D+G+S+T + Y+++ +S ++ S +LD+S P C+ S
Sbjct: 315 EFSAIVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMP-FEFCYSV-----SANG 368
Query: 481 VKQFFKTLTLHFGSKWQ-----IVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI 535
+ +LT GS + I T +P GY CL I+ V+
Sbjct: 369 IVHPNVSLTAKGGSIFPVNDPIITITDNAFNPVGY---------CLAIMKSEGVN----- 414
Query: 536 ILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
++G+ + G VV+D +GW +C N LP
Sbjct: 415 LIGENFMSGLKVVFDRERMVLGWKNFNCYNFDESSRLP 452
>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
Length = 490
Score = 99.8 bits (247), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 105/394 (26%), Positives = 168/394 (42%), Gaps = 60/394 (15%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP--------LYKPRMGNI 252
L++ + +G P + + +DTGSDL W+ CD C CA +P +Y P
Sbjct: 75 LHYAVVALGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQSTT 132
Query: 253 ---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTIENGS- 307
+P +LC ++Q + C Y I+Y +D++SS GVL D L+LT ++
Sbjct: 133 SRKVPCSSNLC-DLQNACR----SKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQS 187
Query: 308 --LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
+T P ++FGC Q G L + +G+LGL S+PS LAS+G+ N C
Sbjct: 188 KIVTAP-IMFGCGQVQTGSFLGS-AAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCF-- 243
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
G G+ + S P+ Y+ I I GS + S A
Sbjct: 244 --GDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSI------STEFSA 295
Query: 426 LFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+ D+G+S+T + Y+++ +S ++ S +LD+S P C+ S +
Sbjct: 296 IVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMP-FEFCYSV-----SANGIVHP 349
Query: 485 FKTLTLHFGSKWQ-----IVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
+LT GS + I T +P GY CL I+ V+ ++G+
Sbjct: 350 NVSLTAKGGSIFPVNDPIITITDNAFNPVGY---------CLAIMKSEGVN-----LIGE 395
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
+ G VV+D +GW +C N LP
Sbjct: 396 NFMSGLKVVFDRERMVLGWKNFNCYNFDESSRLP 429
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 99.8 bits (247), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 94/392 (23%), Positives = 167/392 (42%), Gaps = 46/392 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGA-NPLYKPRMGNILPYK-- 256
G YF + +G PP+ L DTGSDL W++C A C +C+ + ++ PR +
Sbjct: 81 GQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSA-CRNCSHHSPATVFFPRHSSTFSPAHC 139
Query: 257 -DSLCMEIQRNHKPGYCETCQQ------CDYEIEYADHSSSMGVLARDELHLTIENGSLT 309
D +C + KPG C C YE YAD S + G+ AR+ L +G
Sbjct: 140 YDPVCRLVP---KPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEA 196
Query: 310 K-PNVVFGCAYDQQGLLLN--TLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL--- 363
K +V FGC + G ++ + +G++GL R +S SQL + N +CL
Sbjct: 197 KLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRR--FGNKFSYCLMDY 254
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGAR---- 418
T + Y+ +G + + P+L +P Y+ ++ + + L +
Sbjct: 255 TLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEI 314
Query: 419 -NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASD---PTLPVCWRAKFP 474
+S G + D+G++ + AY +IA++K+ + L +D P +C
Sbjct: 315 DDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQR----IKLPNADELTPGFDLCVN---- 366
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGST 534
+ + ++ L F V P Y + +++ CL I + G +
Sbjct: 367 VSGVTKPEKILPRLKFEFSGGAVFVP-----PPRNYFIETEEQIQCLAI-QSVDPKVGFS 420
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHCMNP 566
+I G++ +G L +D R+G+++ C P
Sbjct: 421 VI-GNLMQQGFLFEFDRDRSRLGFSRRGCALP 451
>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 553
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 105/409 (25%), Positives = 173/409 (42%), Gaps = 75/409 (18%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN-------------PLYKP 247
L++T + +G P + + +DTGSDL W+ CD C+ C+ + +Y P
Sbjct: 100 LHYTTIELGTPGVKFMVALDTGSDLFWVPCD--CTRCSATRSSAFASALASDFDLSVYNP 157
Query: 248 RMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYAD-HSSSMGVLARDELHLTI 303
+ + +SLC RN G T C Y + Y +S+ G+L D LHLT
Sbjct: 158 NGSSTSKKVTCNNSLC--THRNQCLG---TFSNCPYMVSYVSAETSTSGILVEDVLHLTQ 212
Query: 304 --ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
+N L + NV+FGC Q G L+ + +G+ GL K+S+PS L+ +G +
Sbjct: 213 PDDNHDLVEANVIFGCGQVQSGSFLD-VAAPNGLFGLGMEKISVPSMLSREGFTADSFSM 271
Query: 362 CLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQ 421
C + G G + G S P +P Y+ I ++ G++ +++
Sbjct: 272 CFGRD--GIGRISFGDK--GSLDQDETPFNVNPSHPTYNITINQVRVGTTLIDVEFT--- 324
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKE--------------VSSDGLVL----DASDP 463
ALFD+G+S+TY YS L S+ + V+ + +L D
Sbjct: 325 ---ALFDSGTSFTYLVDPTYSRLSESVSDKICFHLARCYLKIKVTIEVFMLQFHSQVEDR 381
Query: 464 TLPVCWRAKFPIRSIVDVKQFFKT-------LTLHFGSKWQIVSTKFHISPEGYLVISKK 516
P ++ P D+ T LT+ GS++ + IS + LV
Sbjct: 382 RRPP--DSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSELV---- 435
Query: 517 GNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
CL ++ +E++ I+G + G VV+D +GW KS C +
Sbjct: 436 --YCLAVVKSAELN-----IIGQNFMTGYRVVFDREKLILGWKKSDCYD 477
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 92/369 (24%), Positives = 152/369 (41%), Gaps = 34/369 (9%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y T M +G P + Y + +DTGS LTW+QC SC + + P++ P+ +
Sbjct: 125 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCS 184
Query: 260 ---CMEIQRNH-KPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
C ++ P C T C Y+ Y D S S+G L++D T+ GS + PN +
Sbjct: 185 AQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKD----TVSFGSTSVPNFYY 240
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC D +GL ++ G++GL+R K+SL QLA + +CL T++
Sbjct: 241 GCGQDNEGL----FGQSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGYLS 294
Query: 376 GHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYT 434
P ++ PM S + LY ++ I PL++ + + D+G+ T
Sbjct: 295 IGSYNPGQ-YSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVIT 353
Query: 435 YFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGS 494
YS L ++ + G ++ L C++ + + +V F +
Sbjct: 354 RLPTGVYSALSKAVAG-AMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAALKLA 412
Query: 495 KWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNK 554
+ LV CL S I+G+ + VVYD N
Sbjct: 413 ARNL------------LVDVDSATTCLAFAPAR-----SAAIIGNTQQQTFSVVYDVKNS 455
Query: 555 RIGWAKSHC 563
+IG+A + C
Sbjct: 456 KIGFAAAGC 464
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 104/374 (27%), Positives = 155/374 (41%), Gaps = 37/374 (9%)
Query: 199 DGLYFTYMI-VGNPPRPYYLDMDTGSDLTWIQCDAPCSS-CAKGANPLYKPRMGNILPYK 256
D L F + G P + Y + DTGSD++WIQC PCS C K +P++ P Y
Sbjct: 131 DTLEFVVTVGFGTPAQTYTVIFDTGSDVSWIQC-LPCSGHCYKQHDPIFDPTKSAT--YS 187
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
C Q G + C Y++EY D SSS GVL+ + L LT + P FG
Sbjct: 188 VVPCGHPQCAAADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLT---STRALPGFAFG 244
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C L DG++GL R ++SL SQ A+ +CL ++ GY+ +G
Sbjct: 245 CGQTN----LGDFGDVDGLIGLGRGQLSLSSQAAAS--FGGTFSYCLPSDNTTHGYLTIG 298
Query: 377 HDLVPSWG----MAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSS 432
S A V D P Y E++ I+ G L + D+G+
Sbjct: 299 PTTPASNDDVQYTAMVQKQDYP--SFYFVELVSIDIGGYILPVPPTLFTDDGTFLDSGTI 356
Query: 433 YTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF 492
TY +AY+ L K + A DP C+ F +S + F ++ F
Sbjct: 357 LTYLPPEAYTALRDRFKFTMTQYKPAPAYDP-FDTCY--DFTGQSAI----FIPAVSFKF 409
Query: 493 GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI---ILGDISLRGQLVVY 549
+ F +S G L+ +G L V S + I+G++ R V+Y
Sbjct: 410 SD-----GSVFDLSFFGILIFPDDTAPAIGCL--GFVARPSAMPFTIVGNMQQRNTEVIY 462
Query: 550 DNVNKRIGWAKSHC 563
D ++IG+A + C
Sbjct: 463 DVAAEKIGFASASC 476
>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 529
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 102/393 (25%), Positives = 171/393 (43%), Gaps = 49/393 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLC 260
L++T++ +G P + + +D GSDL W+ CD C CA + Y ++ Y S
Sbjct: 99 LHYTWIDIGTPSTSFLVALDAGSDLLWVPCD--CIHCAPLSASFYSNLDRDLNEYSPSRS 156
Query: 261 MEIQR---NHK----PGYCETC--QQCDYEIEY-ADHSSSMGVLARDELHLTIENGSLTK 310
+ + +H+ C+T QQC Y I Y +D++SS G+L D HL +GS +
Sbjct: 157 LSSKHLSCSHRLCDMGSNCKTSKQQQCPYTINYLSDNTSSSGLLVEDIFHLQSGDGSTSN 216
Query: 311 PN----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
+ VV GC Q G L+ DG++GL + S+PS LA G+I++ C N
Sbjct: 217 SSVQAPVVVGCGMKQSGGYLDG-TAPDGLIGLGPGESSVPSFLAKSGLIRDSFSLCF--N 273
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWAL 426
G +F G ++D F Y + G+S + + N+Q
Sbjct: 274 EDDSGRLFFGDQGSTVQQSTPFLLVDGMF-STYIVGVETCCIGNSCPKVTSFNAQ----- 327
Query: 427 FDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
FD+G+S+T+ AY + K+V++ S W + + +Q
Sbjct: 328 FDSGTSFTFLPGHAYGAIAEEFDKQVNATRSTFQGSP------WEYCY----VPSSQQLP 377
Query: 486 K--TLTLHFGSKWQIVSTKFHISPEGYLVISKKG--NICLGILDGSEVHNGSTIILGDIS 541
K TLTL F + F + ++ +++G CL I + G +G
Sbjct: 378 KIPTLTLMFQQ-----NNSFVVYNPVFVSYNEQGVDGFCLAI----QPTEGGMGTIGQNF 428
Query: 542 LRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPF 574
+ G +V+D NK++ W+ S+C + K +P
Sbjct: 429 MTGYRLVFDRENKKLAWSHSNCQDLSLGKRMPL 461
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 100/411 (24%), Positives = 179/411 (43%), Gaps = 50/411 (12%)
Query: 171 KINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQC 230
++ ++ SS+AV++ SS G G YF + VG P + + L DTGSDLTW++C
Sbjct: 90 RVAAEVASSSAVSLPMSS-----GAYSGTGQYFVKLRVGTPVQEFTLVADTGSDLTWVKC 144
Query: 231 DAPCSSCAKGANP---LYKPRMGNI---LPYKDSLCMEIQRNHKPGYCET-CQQCDYEIE 283
GA+P +++P+ +P C ++ C + C Y+
Sbjct: 145 --------AGASPPGRVFRPKTSRSWAPIPCSSDTC-KLDVPFTLANCSSPASPCTYDYR 195
Query: 284 YADHSS-SMGVLARDELHLTIENGSLTK-PNVVFGCAYDQQGLLLNTLVKTDGILGLSRA 341
Y + S+ + G++ + + + G + + +VV GC+ G + DG+L L A
Sbjct: 196 YKEGSAGARGIVGTESATIALPGGKVAQLKDVVLGCSSSHDG---QSFRSADGVLSLGNA 252
Query: 342 KVSLPSQLASQ---GIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMEL 398
K+S +Q A++ +V H NA GY+ G VP + P M
Sbjct: 253 KISFATQAAARFGGSFSYCLVDHLAPRNA--TGYLAFGPGQVPRTPATQTKLFLDPEMPF 310
Query: 399 YHTEILKINYGSSPLNLGAR--NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGL 456
Y ++ I+ L++ A +++ G + D+G++ T AY ++A+L + DG
Sbjct: 311 YGVKVDAIHVAGKALDIPAEVWDAKSGGVILDSGNTLTVLAAPAYKAVVAALSK-HLDG- 368
Query: 457 VLDASDPTLPVC--WRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVIS 514
V S P C W A+ P + L + F S + + Y++
Sbjct: 369 VPKVSFPPFEHCYNWTARRP-----GAPEIIPKLAVQFAG-----SARLEPPAKSYVIDV 418
Query: 515 KKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
K G C+G+ +G G ++I G+I + L +D N ++ + +S+C
Sbjct: 419 KPGVKCIGVQEGE--WPGLSVI-GNIMQQEHLWEFDLKNMQVRFKQSNCTR 466
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 100/386 (25%), Positives = 163/386 (42%), Gaps = 55/386 (14%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYK----- 256
Y + +G PP+P +DTGSDL W QC APC+SC +PL+ P G Y+
Sbjct: 102 YVVDLAIGTPPQPVSALLDTGSDLIWTQC-APCASCLAQPDPLFAP--GESASYEPMRCA 158
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS--LTKPNVV 314
LC +I + CE C Y Y D + +MGV A + T G +T P +
Sbjct: 159 GQLCSDILHHG----CEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMTVP-LG 213
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMF 374
FGC G L N GI+G R +SL SQL+ + +CLT+ G
Sbjct: 214 FGCGSMNVGSLNN----GSGIVGFGRNPLSLVSQLSIRRF-----SYCLTSYGSGRKSTL 264
Query: 375 LGHDLVPS-WGMAWVPMLDSPFME------LYHTEILKINYGSSPLNL-----GARNSQV 422
L L +G A P+ +P ++ Y+ + + G+ L + R
Sbjct: 265 LFGSLSGGVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGS 324
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKE-----VSSDGLVLDASDPTLPVCWRAKFPIRS 477
G + D+G++ T +E++ + ++ ++ G D +P WR RS
Sbjct: 325 GGVIVDSGTALTLLPGAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWR-----RS 379
Query: 478 IVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIIL 537
+ + HF + + ++ L +KG +CL + D + +GSTI
Sbjct: 380 SSTSQVPVPRMVFHFQDADLDLPRRNYV-----LDDHRKGRLCLLLADSGD--DGSTI-- 430
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHC 563
G++ + V+YD + + +A + C
Sbjct: 431 GNLVQQDMRVLYDLEAETLSFAPAQC 456
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 99/378 (26%), Positives = 166/378 (43%), Gaps = 45/378 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP---RMGNILPYK 256
G YFT + VG P R Y+ +DTGSD+ W+QC APC C + ++ P R +P
Sbjct: 116 GEYFTRIGVGTPARYVYMVLDTGSDVVWLQC-APCRKCYTQTDHVFDPTKSRTYAGIPCG 174
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
LC +R PG + C Y++ Y D S + G + + LT +T+ V G
Sbjct: 175 APLC---RRLDSPGCSNKNKVCQYQVSYGDGSFTFGDFSTET--LTFRRNRVTR--VALG 227
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG--GYMF 374
C +D +GL L R ++S P Q + + +CL + +
Sbjct: 228 CGHDNEGLFTGAAGLLG----LGRGRLSFPVQTGRR--FNHKFSYCLVDRSASAKPSSVI 281
Query: 375 LGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLN-LGARNSQV-----GWALF 427
G V S + P++ +P ++ Y+ E+L I+ G +P+ L A ++ G +
Sbjct: 282 FGDSAV-SRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGGVII 340
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKT 487
D+G+S T T+ AY L + + + + L C F + + +VK T
Sbjct: 341 DSGTSVTRLTRPAYIALRDAFR-IGASHLKRAPEFSLFDTC----FDLSGLTEVK--VPT 393
Query: 488 LTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
+ LHF G+ + +T + I + G+ C + +G +II G+I +G
Sbjct: 394 VVLHFRGADVSLPATNYLIP------VDNSGSFCFAF---AGTMSGLSII-GNIQQQGFR 443
Query: 547 VVYDNVNKRIGWAKSHCM 564
+ YD R+G+A C+
Sbjct: 444 ISYDLTGSRVGFAPRGCV 461
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 91/369 (24%), Positives = 149/369 (40%), Gaps = 34/369 (9%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y T M +G P + Y + +DTGS LTW+QC SC + + P++ P+ +
Sbjct: 127 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCS 186
Query: 260 CMEIQ----RNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
+ P C T C Y+ Y D S S+G L++D T+ GS + PN +
Sbjct: 187 AQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKD----TVSFGSTSVPNFYY 242
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC D +GL ++ G++GL+R K+SL QLA + +CL T++
Sbjct: 243 GCGQDNEGL----FGQSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGYLS 296
Query: 376 GHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYT 434
P ++ PM S + LY ++ I PL++ + + D+G+ T
Sbjct: 297 IGSYNPGQ-YSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVIT 355
Query: 435 YFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGS 494
YS L ++ + G ++ L C++ + + +V F +
Sbjct: 356 RLPTGVYSALSKAVAG-AMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAALKLA 414
Query: 495 KWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNK 554
+ LV CL S I+G+ + VVYD N
Sbjct: 415 ARNL------------LVDVDSATTCLAFAPAR-----SAAIIGNTQQQTFSVVYDVKNS 457
Query: 555 RIGWAKSHC 563
+IG+A C
Sbjct: 458 KIGFAAGGC 466
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 108/427 (25%), Positives = 179/427 (41%), Gaps = 64/427 (14%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIF---PLRGNIYPDGLYFTYMIVGNPPRPYYLDMDT 221
I H + KL+ N+ V + I P+ + Y Y + +G PP Y +DT
Sbjct: 22 IEAHNGRFTVKLIPRNSSQVLFNRITAQTPVSVHHYD---YLMELSIGTPPVKTYAQVDT 78
Query: 222 GSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETC--- 275
GSDL W+QC PC++C K NP++ P+ + + Y C ++ Y +C
Sbjct: 79 GSDLIWLQC-IPCTNCYKQLNPMFDPQSSSTYSNIAYGSESCSKL-------YSTSCSPD 130
Query: 276 -QQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVVFGCAYDQQGLLLNTLVKTD 333
C+Y Y D S + GVLA++ L LT G + V+FGC ++ G+ + K
Sbjct: 131 QNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALKGVIFGCGHNNNGVFND---KEM 187
Query: 334 GILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAG-------GGGYMFLGHDLVPSW 383
GI+GL R +SL SQ+ S + CL TN G G LG+ +V +
Sbjct: 188 GIIGLGRGPLSLVSQIGSS-FGGKMFSQCLVPFHTNPSITSPMSFGKGSEVLGNGVVSTP 246
Query: 384 GMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV--GWALFDTGSSYTYFTKQAY 441
++ F+ L + IN P N G+ + G + D+G+ T + Y
Sbjct: 247 LVSKNTHQAFYFVTLLGISVEDINL---PFNDGSSLEPITKGNMVIDSGTPTTLLPEDFY 303
Query: 442 SELIASLK-EVSSDGLVLDASDPTL--PVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQI 498
L+ ++ +V+ D + + DPTL +C+R ++ TLT HF +
Sbjct: 304 HRLVEEVRNKVALDPIPI---DPTLGYQLCYRTPTNLKG--------TTLTAHFEGADVL 352
Query: 499 VSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGW 558
++P + + G C G I G+ + L+ +D + + +
Sbjct: 353 ------LTPTQIFIPVQDGIFCFAFTSTFSNEYG---IYGNHAQSNYLIGFDLEKQLVSF 403
Query: 559 AKSHCMN 565
+ C N
Sbjct: 404 KATDCTN 410
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 91/369 (24%), Positives = 149/369 (40%), Gaps = 34/369 (9%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y T M +G P + Y + +DTGS LTW+QC SC + + P++ P+ +
Sbjct: 127 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCS 186
Query: 260 CMEIQ----RNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
+ P C T C Y+ Y D S S+G L++D T+ GS + PN +
Sbjct: 187 AQQCSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLSKD----TVSFGSTSVPNFYY 242
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC D +GL ++ G++GL+R K+SL QLA + +CL T++
Sbjct: 243 GCGQDNEGL----FGQSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGYLS 296
Query: 376 GHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYT 434
P ++ PM S + LY ++ I PL++ + + D+G+ T
Sbjct: 297 IGSYNPGQ-YSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVIT 355
Query: 435 YFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGS 494
YS L ++ + G ++ L C++ + + +V F +
Sbjct: 356 RLPTGVYSALSKAVAG-AMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAALKLA 414
Query: 495 KWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNK 554
+ LV CL S I+G+ + VVYD N
Sbjct: 415 ARNL------------LVDVDSATTCLAFAPAR-----SAAIIGNTQQQTFSVVYDVKNS 457
Query: 555 RIGWAKSHC 563
+IG+A C
Sbjct: 458 KIGFAAGGC 466
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 99/391 (25%), Positives = 160/391 (40%), Gaps = 47/391 (12%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL 253
G I G YF + +G PP + DTGSDLTW+QC PC C K PL+ + +
Sbjct: 77 GLISNGGEYFMSISIGTPPSKFLAIADTGSDLTWVQC-KPCQQCYKQNTPLFDKKKSST- 134
Query: 254 PYKDSLCMEIQRN----HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-L 308
YK C I N H+ G E+ C Y Y D S + G +A + + + +GS +
Sbjct: 135 -YKTESCDSITCNALSEHEEGCDESRNACKYRYSYGDESFTKGEVATETISIDSSSGSPV 193
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TT 365
+ P FGC Y+ G T +GL +SL SQL S I +CL +
Sbjct: 194 SFPGTAFGCGYNNGGTFEETGSGI---IGLGGGPLSLVSQLGSS--IGKKFSYCLSHTSA 248
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFME-----LYHTEILKINYGSSPL------- 413
G + LG + + S +L +P ++ Y + I G + L
Sbjct: 249 TTNGTSVINLGTNSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKTKLPYTGGGG 308
Query: 414 -NLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
+L ++ + G + D+G++ T Y + A ++E + + L C+++
Sbjct: 309 YSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRVSDPQGILTHCFKSG 368
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNG 532
D + T+T+HF +SP V + +CL ++ +EV
Sbjct: 369 -------DKEIGLPTITMHF------TGADVKLSPINSFVKLSEDIVCLSMIPTTEV--- 412
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I G++ LV YD K + + + C
Sbjct: 413 --AIYGNMVQMDFLVGYDLETKTVSFQRMDC 441
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 105/397 (26%), Positives = 165/397 (41%), Gaps = 71/397 (17%)
Query: 197 YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---IL 253
+ +G Y + +G+PPR + +DTGSDL W QC APC C + P ++P L
Sbjct: 80 FSEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQC-APCLLCVEQPTPYFEPAKSTSYASL 138
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTKP 311
P ++C + Y C Q C Y+ Y D +SS GVLA + + + P
Sbjct: 139 PCSSAMCNAL-------YSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVP 191
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG-- 369
V FGC G L N G++G R +SL SQL S +CLT+
Sbjct: 192 RVSFGCGNMNAGTLFN----GSGMVGFGRGALSLVSQLGSPRF-----SYCLTSFMSPAT 242
Query: 370 -----GGYMFLGHDLVPSWGMAWVPMLDSPFM------ELYHTEILKINYGSSPL----N 414
G Y L S G P+ +PF+ +Y + I+ L +
Sbjct: 243 SRLYFGAYATLNSTNTSSSG----PVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPS 298
Query: 415 LGARNSQ--VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDP--TLPVCWR 470
+ A N G + D+G++ T+ + AY+ + + V+ GL + P T C++
Sbjct: 299 VFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAF--VAWVGLPRANATPSDTFDTCFK 356
Query: 471 AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGIL---DG 526
P R +V + + + LHF + E Y+V+ GN+CL +L DG
Sbjct: 357 WPPPPRRMVTLPE----MVLHFDGA------DMELPLENYMVMDGGTGNLCLAMLPSDDG 406
Query: 527 SEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
S I+G + ++YD N + + + C
Sbjct: 407 S--------IIGSFQHQNFHMLYDLENSLLSFVPAPC 435
>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
Group]
Length = 476
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 105/394 (26%), Positives = 168/394 (42%), Gaps = 60/394 (15%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP--------LYKPRMGNI 252
L++ + +G P + + +DTGSDL W+ CD C CA +P +Y P
Sbjct: 61 LHYAVVALGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQSTT 118
Query: 253 ---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTIENGS- 307
+P +LC ++Q + C Y I+Y +D++SS GVL D L+LT ++
Sbjct: 119 SRKVPCSSNLC-DLQNACR----SKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQS 173
Query: 308 --LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
+T P ++FGC Q G L + +G+LGL S+PS LAS+G+ N C
Sbjct: 174 KIVTAP-IMFGCGQVQTGSFLGS-AAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCF-- 229
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
G G+ + S P+ Y+ I I GS + S A
Sbjct: 230 --GDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSI------STEFSA 281
Query: 426 LFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+ D+G+S+T + Y+++ +S ++ S +LD+S P C+ S +
Sbjct: 282 IVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMP-FEFCYSV-----SANGIVHP 335
Query: 485 FKTLTLHFGSKWQ-----IVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
+LT GS + I T +P GY CL I+ V+ ++G+
Sbjct: 336 NVSLTAKGGSIFPVNDPIITITDNAFNPVGY---------CLAIMKSEGVN-----LIGE 381
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
+ G VV+D +GW +C N LP
Sbjct: 382 NFMSGLKVVFDRERMVLGWKNFNCYNFDESSRLP 415
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 105/397 (26%), Positives = 165/397 (41%), Gaps = 71/397 (17%)
Query: 197 YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---IL 253
+ +G Y + +G+PPR + +DTGSDL W QC APC C + P ++P L
Sbjct: 83 FSEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQC-APCLLCVEQPTPYFEPAKSTSYASL 141
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTKP 311
P ++C + Y C Q C Y+ Y D +SS GVLA + + + P
Sbjct: 142 PCSSAMCNAL-------YSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVP 194
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG-- 369
V FGC G L N G++G R +SL SQL S +CLT+
Sbjct: 195 RVSFGCGNMNAGTLFN----GSGMVGFGRGALSLVSQLGSPRF-----SYCLTSFMSPAT 245
Query: 370 -----GGYMFLGHDLVPSWGMAWVPMLDSPFM------ELYHTEILKINYGSSPL----N 414
G Y L S G P+ +PF+ +Y + I+ L +
Sbjct: 246 SRLYFGAYATLNSTNTSSSG----PVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPS 301
Query: 415 LGARNSQ--VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDP--TLPVCWR 470
+ A N G + D+G++ T+ + AY+ + + V+ GL + P T C++
Sbjct: 302 VFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAF--VAWVGLPRANATPSDTFDTCFK 359
Query: 471 AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGIL---DG 526
P R +V + + + LHF + E Y+V+ GN+CL +L DG
Sbjct: 360 WPPPPRRMVTLPE----MVLHFDGA------DMELPLENYMVMDGGTGNLCLAMLPSDDG 409
Query: 527 SEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
S I+G + ++YD N + + + C
Sbjct: 410 S--------IIGSFQHQNFHMLYDLENSLLSFVPAPC 438
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 92/369 (24%), Positives = 151/369 (40%), Gaps = 34/369 (9%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y T M +G P + Y + +DTGS LTW+QC SC + + P++ P+ +
Sbjct: 125 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCS 184
Query: 260 ---CMEIQRNH-KPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
C ++ P C T C Y+ Y D S S+G L++D T+ GS + PN +
Sbjct: 185 AQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKD----TVSFGSTSVPNFYY 240
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC D +GL ++ G++GL+R K+SL QLA + +CL T++
Sbjct: 241 GCGQDNEGL----FGQSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGYLS 294
Query: 376 GHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYT 434
P ++ PM S + LY ++ I PL++ + + D+G+ T
Sbjct: 295 IGSYNPGQ-YSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVIT 353
Query: 435 YFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGS 494
YS L ++ + G ++ L C++ + + +V F +
Sbjct: 354 RLPTGVYSALSKAVAG-AMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAALKLA 412
Query: 495 KWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNK 554
+ LV CL S I+G+ + VVYD N
Sbjct: 413 ARNL------------LVDVDSATTCLAFAPAR-----SAAIIGNTQQQTFSVVYDVKNS 455
Query: 555 RIGWAKSHC 563
+IG+A C
Sbjct: 456 KIGFAAGGC 464
>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 406
Score = 98.6 bits (244), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 101/359 (28%), Positives = 159/359 (44%), Gaps = 46/359 (12%)
Query: 234 CSSCAKGAN-----PLYKP---RMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYA 285
C++C K + LY P + N +P D C + G C+ C Y I Y
Sbjct: 33 CTACPKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISG-CKQDMSCPYSITYG 91
Query: 286 DHSSSMGVLARDELHLTIENGSL-TKPN---VVFGCAYDQQGLLL-NTLVKTDGILGLSR 340
D S++ G D L +G+L TKP+ V+FGC Q G L N+ DGI+G +
Sbjct: 92 DGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQ 151
Query: 341 AKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYH 400
A S+ SQLA+ G +K + HCL ++ GGG + +G + P + P++ P M Y+
Sbjct: 152 ANSSVLSQLAASGKVKRIFSHCLDSHHGGGIFS-IGQVMEPKFNTT--PLV--PRMAHYN 206
Query: 401 TEILKINYGSSP--LNLGARNSQVGWA-LFDTGSSYTYFTKQAYSELIASLKEVSSDGLV 457
+ ++ P L L +S G + D+G++ Y Y++L+ + +
Sbjct: 207 VILKDMDVDGEPILLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKL 266
Query: 458 LDASDPTLPVCWRAK----FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI 513
+ D + K FP+ VK F+ L+L + P YL +
Sbjct: 267 MIVEDQFTCFHYSDKLDEGFPV-----VKFHFEGLSL-------------TVHPHDYLFL 308
Query: 514 SKKGNICLGILDGS-EVHNGSTIIL-GDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFK 570
K+ C+G S + G +IL GD+ L +LVVYD N IGW +C + + K
Sbjct: 309 YKEDIYCIGWQKSSTQTKEGRDLILIGDLVLSNKLVVYDLENMVIGWTNFNCSSSIKVK 367
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 98.6 bits (244), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 118/455 (25%), Positives = 192/455 (42%), Gaps = 65/455 (14%)
Query: 129 HKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKK-LVSSNAVAVDSS 187
H F +R + D L RF E + V G R H+ +N L ++NA D
Sbjct: 304 HGFRVR-LKHVDHVKNLTRF-----ERLRRGVARGKNRLHR--LNAMVLAAANATVGDQV 355
Query: 188 SIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP 247
+ GN G + + +G+PPR + MDTGSDL W QC PC C + P++ P
Sbjct: 356 KAPVVAGN----GEFLMKLAIGSPPRSFSAIMDTGSDLIWTQC-KPCQQCFDQSTPIFDP 410
Query: 248 RMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL--TIEN 305
+ + YK S E+ C + C+Y Y D SS+ GVLA + + E+
Sbjct: 411 KQSSSF-YKISCSSELCGALPTSTCSS-DGCEYLYTYGDSSSTQGVLAFETFTFGDSTED 468
Query: 306 GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
++ P + FGC D G + + G++GL R +SL SQL Q +CLT
Sbjct: 469 -QISIPGLGFGCGNDNNG---DGFSQGAGLVGLGRGPLSLVSQLKEQKF-----AYCLTA 519
Query: 366 -NAGGGGYMFLGH--DLVPSWG---MAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGAR 418
+ + LG ++ P M P++ +P Y+ + I+ G + L++
Sbjct: 520 IDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKS 579
Query: 419 NSQV-----GWALFDTGSSYTYFTKQAYS----ELIASLKEVSSDGLVLDASDPTLPVCW 469
++ G + D+G++ TY A++ E IA + V D+ L +C+
Sbjct: 580 TFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLP-----VDDSGTGGLDLCF 634
Query: 470 RAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSE 528
+ K LT HF + E Y++ SK G +CL I GS
Sbjct: 635 NLPAGTNQVEVPK-----LTFHFK------GADLELPGENYMIGDSKAGLLCLAI--GS- 680
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ I G++ + +VV+D + + + + C
Sbjct: 681 --SRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQC 713
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 98.6 bits (244), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 102/379 (26%), Positives = 166/379 (43%), Gaps = 53/379 (13%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPY 255
+G + + +G P Y MDTGSDL W QC PC C P++ P + LP
Sbjct: 94 NGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCK-PCKVCFDQPTPIFDPEKSSSFSKLPC 152
Query: 256 KDSLCMEIQRNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
LC+ + +C C+Y Y DHSS+ GVLA + T + S++K +
Sbjct: 153 SSDLCVALP-------ISSCSDGCEYRYSYGDHSSTQGVLATET--FTFGDASVSK--IG 201
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT--NAGGGGY 372
FGC D +G + G++GL R +SL SQL G+ K +CLT+ ++ G
Sbjct: 202 FGCGEDNRG---RAYSQGAGLVGLGRGPLSLISQL---GVPK--FSYCLTSIDDSKGIST 253
Query: 373 MFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPL-----NLGARNSQVGWAL 426
+ +G + + P++ +P Y+ + I+ G + L ++ G +
Sbjct: 254 LLVGSEATVKSAIP-TPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLI 312
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT-LPVCWRAKFPIRSIVDVKQFF 485
D+G++ TY A++ L +S L +DAS T L +C+ P S V+V Q
Sbjct: 313 IDSGTTITYLKDNAFAALKKEF--ISQMKLDVDASGSTELELCFTLP-PDGSPVEVPQ-- 367
Query: 486 KTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
L HF G ++ + I V ICL + S + I G+ +
Sbjct: 368 --LVFHFEGVDLKLPKENYIIEDSALRV------ICLTMGSSSGMS-----IFGNFQQQN 414
Query: 545 QLVVYDNVNKRIGWAKSHC 563
+V++D + I +A + C
Sbjct: 415 IVVLHDLEKETISFAPAQC 433
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 98.6 bits (244), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 62/202 (30%), Positives = 95/202 (47%), Gaps = 15/202 (7%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF + VG+PPR Y+ +D+GSD+ W+QC+ PC+ C ++P++ P + Y
Sbjct: 132 GEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCE-PCTQCYHQSDPVFNP--ADSSSYAGVS 188
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
C +H +C YE+ Y D S + G LA + L G NV GC +
Sbjct: 189 CASTVCSHVDNAGCHEGRCRYEVSYGDGSYTKGTLALETLTF----GRTLIRNVAIGCGH 244
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA-GGGGYMFLGHD 378
QG+ V G+LGL +S QL Q +CL + G + G +
Sbjct: 245 HNQGM----FVGAAGLLGLGSGPMSFVGQLGGQA--GGTFSYCLVSRGIQSSGLLQFGRE 298
Query: 379 LVPSWGMAWVPMLDSPFMELYH 400
VP G AWVP++ +P + ++
Sbjct: 299 AVP-VGAAWVPLIHNPRAQSFY 319
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 98.6 bits (244), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 100/366 (27%), Positives = 156/366 (42%), Gaps = 36/366 (9%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCM 261
Y + +G+P + + +DTGSD++W+QC PCS C A+PL+ P +
Sbjct: 133 YLITVRLGSPGKSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCSSA 191
Query: 262 EIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQ 321
+ + G + QC Y + Y D SS+ G + D L L GS FGC+ +
Sbjct: 192 ACAQLGQEGNGCSSSQCQYTVTYGDGSSTTGTYSSDTLAL----GSNAVRKFQFGCSNVE 247
Query: 322 QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVP 381
G N +TDG++GL SL SQ A G +CL + G++ LG
Sbjct: 248 SG--FND--QTDGLMGLGGGAQSLVSQTA--GTFGAAFSYCLPATSSSSGFLTLG---AG 298
Query: 382 SWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQA 440
+ G PML S + Y I I G L++ G + D+G+ T A
Sbjct: 299 TSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFSAG-TIMDSGTVLTRLPPTA 357
Query: 441 YSELIASLKEVSSDGLVLDASDP---TLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQ 497
YS L ++ K G+ S P L C+ F +S V + T+ L F S
Sbjct: 358 YSALSSAFKA----GMKQYPSAPPSGILDTCF--DFSGQSSVSI----PTVALVF-SGGA 406
Query: 498 IVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIG 557
+V I+ +G ++ + +CL S+ + S I+G++ R V+YD +G
Sbjct: 407 VV----DIASDGIMLQTSNSILCLAFAANSD--DSSLGIIGNVQQRTFEVLYDVGGGAVG 460
Query: 558 WAKSHC 563
+ C
Sbjct: 461 FKAGAC 466
>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
Length = 523
Score = 98.6 bits (244), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 106/392 (27%), Positives = 165/392 (42%), Gaps = 55/392 (14%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP-RMGNILPYKDSL 259
L++ + +G P + + +DTGSDL W+ CD C +CA +P Y+ + P K S
Sbjct: 103 LHYAVVALGTPNVTFLVALDTGSDLFWVPCD--CINCAPLVSPNYRDLKFDTYSPQKSST 160
Query: 260 CMEIQRNHKPGYCETCQQCD-------------YEIEY-ADHSSSMGVLARDELHLTIEN 305
++ C + CD Y IEY +D++SS GVL D L+L E
Sbjct: 161 SRKVP-------CSS-NLCDLQSACRSASSSCPYSIEYLSDNTSSTGVLVEDVLYLITEY 212
Query: 306 GS---LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC 362
G +T P + FGC Q G L + +G+LGL +S+PS LAS+G+ N C
Sbjct: 213 GQPKIVTAP-ITFGCGRIQTGSFLGS-AAPNGLLGLGMDSISVPSLLASEGVAANSFSMC 270
Query: 363 LTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV 422
+ G G + G S P+ Y+ I GS N
Sbjct: 271 FGDD--GRGRINFGD--TGSSDQQETPLNIYKQNPYYNISITGAMVGSKSFNTNFN---- 322
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
A+ D+G+S+T + YSE+ +S +V LD+S P C+ I V
Sbjct: 323 --AIVDSGTSFTALSDPMYSEITSSFNSQVQDKPTQLDSSLP-FEFCYS----ISPKGSV 375
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
+L GS + + I+ + S CL ++ V+ ++G+
Sbjct: 376 NPPNISLMAKGGSIFPVNDPIITITDDA----SNPMAYCLAVMKSEGVN-----LIGENF 426
Query: 542 LRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
+ G VV+D K +GW K +C + +LP
Sbjct: 427 MSGLKVVFDRERKVLGWKKFNCYSVDNSSNLP 458
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 102/386 (26%), Positives = 160/386 (41%), Gaps = 56/386 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY---KPRMGNILPYK 256
G Y + +G PP Y MDTGSDL W QC APC CA P + K LP +
Sbjct: 87 GEYLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCADQPTPYFDVKKSATYRALPCR 145
Query: 257 DSLCMEIQRNHKPGYCETC--QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP-NV 313
S C + +C + C Y+ Y D +S+ GVLA + N + + N+
Sbjct: 146 SSRCASLSS-------PSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNI 198
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG-- 371
FGC G L N + G++G R +SL SQL + +CLT+
Sbjct: 199 AFGCGSLNAGDLAN----SSGMVGFGRGPLSLVSQLG-----PSRFSYCLTSYLSATPSR 249
Query: 372 -YMFLGHDLVPSWGMAWVPMLDSPFM------ELYHTEILKINYGSS-----PLNLGARN 419
Y + +L + + P+ +PF+ +Y + I+ G+ PL +
Sbjct: 250 LYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAIND 309
Query: 420 SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGL-VLDASDPTLPVCWRAKFPIRSI 478
G + D+G+S T+ + AY + L VS+ L ++ +D L C++ P
Sbjct: 310 DGTGGVIIDSGTSITWLQQDAYEAVRRGL--VSAIPLPAMNDTDIGLDTCFQWPPPPNVT 367
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGILDGSEVHNGSTIIL 537
V V L HF S + PE Y++I S G +CL + G I+
Sbjct: 368 VTVPD----LVFHFDSA------NMTLLPENYMLIASTTGYLCLVM-----APTGVGTII 412
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHC 563
G+ + ++YD N + + + C
Sbjct: 413 GNYQQQNLHLLYDIGNSFLSFVPAPC 438
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 102/416 (24%), Positives = 174/416 (41%), Gaps = 56/416 (13%)
Query: 164 IIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYP-DGLYFTYMIVGNPPRPYYLDMDTG 222
+I+ + +++ S NA+ SS I +Y DG Y + +G P + MDTG
Sbjct: 60 LIKRAIKRGERRMRSINAMLQSSSGI---ETPVYAGDGEYLMNVAIGTPDSSFSAIMDTG 116
Query: 223 SDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDSLCMEIQRNHKPGYCETC--QQ 277
SDL W QC+ PC+ C P++ P+ + LP + C ++ ETC +
Sbjct: 117 SDLIWTQCE-PCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPS-------ETCNNNE 168
Query: 278 CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILG 337
C Y Y D S++ G +A + T E S+ PN+ FGC D QG G++G
Sbjct: 169 CQYTYGYGDGSTTQGYMATET--FTFETSSV--PNIAFGCGEDNQGFGQG---NGAGLIG 221
Query: 338 LSRAKVSLPSQLASQGIIKNVVGHCLTT-NAGGGGYMFLGHDL--VPSWGMAWVPMLDSP 394
+ +SLPSQL +C+T+ + + LG VP + + S
Sbjct: 222 MGWGPLSLPSQLG-----VGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSL 276
Query: 395 FMELYHTEILKINYGSSPLNLGARNSQV-----GWALFDTGSSYTYFTKQAYSELIASLK 449
Y+ + I G L + + Q+ G + D+G++ TY + AY+ + +
Sbjct: 277 NPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFT 336
Query: 450 EVSSDGLVLDASDPTLPVCWR--AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISP 507
+ + +D S L C++ + + ++ F L+ G + ISP
Sbjct: 337 D-QINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGEQ------NILISP 389
Query: 508 EGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G ICL + S++ I G+I + V+YD N + + + C
Sbjct: 390 -------AEGVICLAMGSSSQL---GISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/387 (25%), Positives = 159/387 (41%), Gaps = 48/387 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + VG PPR + + MDTGSDL W+QC APC C + P++ P L Y++
Sbjct: 150 GEYLVDLYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDP--ATSLSYRNVT 206
Query: 260 CMEIQRN--HKPGYCETCQQ-----CDYEIEYADHSSSMGVLARDE--LHLTIENGSLTK 310
C + + P C++ C Y Y D S++ G LA + ++LT S
Sbjct: 207 CGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRV 266
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
+VVFGC + +GL L R +S SQL + + + +CL +
Sbjct: 267 DDVVFGCGHSNRGLFHGAAGLLG----LGRGALSFASQL--RAVYGHAFSYCLVDHGSSV 320
Query: 371 GY--------MFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQ 421
G LGH P + + Y+ ++ + G LN+
Sbjct: 321 GSKIVFGDDDALLGH---PRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWD 377
Query: 422 V-----GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIR 476
V G + D+G++ +YF + AY + + E L A P L C+ R
Sbjct: 378 VGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVER 437
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
V+V +F +L G+ W + + ++ + G +CL +L + I
Sbjct: 438 --VEVPEF--SLLFADGAVWDFPAENY------FVRLDPDGIMCLAVLG---TPRSAMSI 484
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G+ + V+YD N R+G+A C
Sbjct: 485 IGNFQQQNFHVLYDLQNNRLGFAPRRC 511
>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
Length = 575
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 102/390 (26%), Positives = 163/390 (41%), Gaps = 46/390 (11%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYKD 257
L++ + VG P + + +DTGSDL W+ C+ C CAK + +Y P + + +P
Sbjct: 120 LHYAEVEVGTPSSKFLVALDTGSDLFWLPCE--CKLCAKNGSTMYSPSLSSTSKTVPCGH 177
Query: 258 SLCMEIQRNHKPGYCETCQQ----CDYEIEY-ADHSSSMGVLARDELHLT----IENGSL 308
LC +P C T + C YE++Y + ++ S GVL D LHL G
Sbjct: 178 PLC------ERPDACATAGKSSSSCPYEVKYVSANTGSSGVLVEDVLHLVDGGGGGGGKA 231
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII-KNVVGHCLTTNA 367
+ +VFGC Q G L G++GL KVS+PS LAS G++ + C + +
Sbjct: 232 VQAPIVFGCGQVQTGAFLRG-AAAGGLMGLGLDKVSVPSALASSGLVASDSFSMCFSRD- 289
Query: 368 GGGGYMFLGHDLVPSWGMAWVPMLDSPFME--LYHTEILKINYGSSPLNLGARNSQVGWA 425
G G + G P A P++ + ++ Y+ + I S + + A
Sbjct: 290 -GVGRINFGDAGSPD--QAETPLIAAGSLQPSYYNISVGAITVDSKAMAVEFT------A 340
Query: 426 LFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+ D+G+S+TY AY+ L + VS + C+R S+ +
Sbjct: 341 VVDSGTSFTYLDDPAYTFLTTNFNSRVSEASETYGSGYEKFEFCYRLSPGQTSMKRLPAM 400
Query: 485 FKTLTLHFGS----KWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDI 540
+LT G+ W I+ + Y I CLGI+ S + I G
Sbjct: 401 --SLTTKGGAVFPITWPIIPVLASTNGGPYHPI----GYCLGIIKTSILSTEDATI-GQN 453
Query: 541 SLRGQLVVYDNVNKRIGWAKSHCMNPGRFK 570
+ G VV+D +GW K C + +
Sbjct: 454 FMTGLKVVFDRRKSVLGWEKFDCYKDAKMQ 483
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 91/381 (23%), Positives = 156/381 (40%), Gaps = 43/381 (11%)
Query: 197 YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYK 256
Y Y +G PP Y +DTGSD W QC PC C +P++ P + YK
Sbjct: 85 YAGSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCK-PCKPCLNQTSPIFNPSKSST--YK 141
Query: 257 DSLCME-IQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPN 312
+ C I + + C + ++ C+YEI Y D S S G +++D L L +GS ++ P
Sbjct: 142 NIRCSSPICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPISFPK 201
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGG 369
+V GC + L T GI+G R S+ SQL S I +CL + A
Sbjct: 202 IVIGCGHKNS---LTTEGLASGIIGFGRGNFSIVSQLGSS--IGGKFSYCLASLFSKANI 256
Query: 370 GGYMFLGH-DLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARN---SQVGWA 425
++ G +V G+ P++ S ++ Y T + + G + L + G A
Sbjct: 257 SSKLYFGDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDNEGNA 316
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
+ D+GS+ T YS+L ++ + V D + L +C++ + + F
Sbjct: 317 VIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQ-QLSLCYKTTLKKYEVPIITAHF 375
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGST---IILGDISL 542
+ + + + + +C N S ++ G+I+
Sbjct: 376 RGADVKLNAFNTFIQMNHEV-------------MCFAF-------NSSAFPWVVYGNIAQ 415
Query: 543 RGQLVVYDNVNKRIGWAKSHC 563
+ LV YD + I + ++C
Sbjct: 416 QNFLVGYDTLKNIISFKPTNC 436
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/387 (25%), Positives = 159/387 (41%), Gaps = 48/387 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + VG PPR + + MDTGSDL W+QC APC C + P++ P L Y++
Sbjct: 150 GEYLVDLYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAAS--LSYRNVT 206
Query: 260 CMEIQRN--HKPGYCETCQQ-----CDYEIEYADHSSSMGVLARDE--LHLTIENGSLTK 310
C + + P C++ C Y Y D S++ G LA + ++LT S
Sbjct: 207 CGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRV 266
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
+VVFGC + +GL L R +S SQL + + + +CL +
Sbjct: 267 DDVVFGCGHSNRGLFHGAAGLLG----LGRGALSFASQL--RAVYGHAFSYCLVDHGSSV 320
Query: 371 GY--------MFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQ 421
G LGH P + + Y+ ++ + G LN+
Sbjct: 321 GSKIVFGDDDALLGH---PRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWD 377
Query: 422 V-----GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIR 476
V G + D+G++ +YF + AY + + E L A P L C+ R
Sbjct: 378 VGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVER 437
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
V+V +F +L G+ W + + ++ + G +CL +L + I
Sbjct: 438 --VEVPEF--SLLFADGAVWDFPAENY------FVRLDPDGIMCLAVLG---TPRSAMSI 484
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G+ + V+YD N R+G+A C
Sbjct: 485 IGNFQQQNFHVLYDLQNNRLGFAPRRC 511
>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
sativa Japonica Group]
Length = 732
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 105/394 (26%), Positives = 168/394 (42%), Gaps = 60/394 (15%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP--------LYKPRMGNI 252
L++ + +G P + + +DTGSDL W+ CD C CA +P +Y P
Sbjct: 98 LHYAVVALGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQSTT 155
Query: 253 ---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTIENGS- 307
+P +LC ++Q + C Y I+Y +D++SS GVL D L+LT ++
Sbjct: 156 SRKVPCSSNLC-DLQNACR----SKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQS 210
Query: 308 --LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
+T P ++FGC Q G L + +G+LGL S+PS LAS+G+ N C
Sbjct: 211 KIVTAP-IMFGCGQVQTGSFLGSAAP-NGLLGLGMDSKSVPSLLASKGLAANSFSMCF-- 266
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
G G+ + S P+ Y+ I I GS + S A
Sbjct: 267 --GDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSI------STEFSA 318
Query: 426 LFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+ D+G+S+T + Y+++ +S ++ S +LD+S P C+ S +
Sbjct: 319 IVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMP-FEFCYSV-----SANGIVHP 372
Query: 485 FKTLTLHFGSKWQ-----IVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
+LT GS + I T +P GY CL I+ V+ ++G+
Sbjct: 373 NVSLTAKGGSIFPVNDPIITITDNAFNPVGY---------CLAIMKSEGVN-----LIGE 418
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLP 573
+ G VV+D +GW +C N LP
Sbjct: 419 NFMSGLKVVFDRERMVLGWKNFNCYNFDESSRLP 452
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 163/382 (42%), Gaps = 56/382 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYK 256
G YFT + +G P R Y+ +DTGSD+ WIQC+ PC C A+P++ P + +
Sbjct: 152 GEYFTRIGIGTPTREQYMVLDTGSDVVWIQCE-PCRECYSQADPIFNPSSSVSFSTVGCD 210
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
++C ++ N G C YE+ Y D S ++G A + L G+ + NV G
Sbjct: 211 SAVCSQLDANDCHG-----GGCLYEVSYGDGSYTVGSYATETLTF----GTTSIQNVAIG 261
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-TNAGGGGYMFL 375
C +D GL V G+LGL +S P+QL +Q +CL ++ G +
Sbjct: 262 CGHDNVGL----FVGAAGLLGLGAGSLSFPAQLGTQ--TGRAFSYCLVDRDSESSGTLEF 315
Query: 376 GHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGA-------RNSQVGWALF 427
G + VP G + P++ +PF+ Y+ ++ I+ G L+ + G +
Sbjct: 316 GPESVP-IGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIII 374
Query: 428 DTGSSYTYFTKQAYSEL----IASLKEV-SSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
D+G++ T AY L IA + + +DG+ + + L P
Sbjct: 375 DSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIP-------- 426
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDIS 541
+ HF + F + + L+ + G C + + I+G+I
Sbjct: 427 ----AVGFHFSN-----GAGFILPAKNCLIPMDSMGTFCFAFAPA----DSNLSIMGNIQ 473
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+G V +D+ N +G+A C
Sbjct: 474 QQGIRVSFDSANSLVGFAIDQC 495
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 165/373 (44%), Gaps = 41/373 (10%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
+G Y + GNPP+ +DTGSDL W+QC PC SC + + + P YK
Sbjct: 87 NGEYLIDISYGNPPQKSTAIVDTGSDLNWVQC-LPCKSCYETLSAKFDPSKS--ASYKTL 143
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
C P + C Y+ Y D SS+ G L+ D+ +TI G + PNV FGC
Sbjct: 144 GCGSNFCQDLP-FQSCAASCQYDYMYGDGSSTSGALSTDD--VTIGTGKI--PNVAFGCG 198
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-TNAGGGGYMFLGH 377
L T G++GL + +SL SQL G +CL + +++G
Sbjct: 199 NSN----LGTFAGAGGLVGLGKGPLSLVSQLG--GTATKKFSYCLVPLGSTKTSPLYIG- 251
Query: 378 DLVPSWGMAWVPML-DSPFMELYHTEILKINYGSSPLNLGARNSQV-----GWALFDTGS 431
D + G+A+ PML ++ + Y+ E+ I+ +N A + G + D+G+
Sbjct: 252 DSTLAGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGT 311
Query: 432 SYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLH 491
+ TY A++ ++A+LK L +D + + ++ + + T+ H
Sbjct: 312 TLTYLDVDAFNPMVAALKAA----LPYPEADGSF---YGLEYCFSTAGVANPTYPTVVFH 364
Query: 492 FGSKWQIVSTKFHISPEG-YLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYD 550
F ++P+ ++ + +G CL + + I G+I ++V+D
Sbjct: 365 FNGA------DVALAPDNTFIALDFEGTTCLAMASSTGFS-----IFGNIQQLNHVIVHD 413
Query: 551 NVNKRIGWAKSHC 563
VNKRIG+ ++C
Sbjct: 414 LVNKRIGFKSANC 426
>gi|297807039|ref|XP_002871403.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317240|gb|EFH47662.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 529
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 165/380 (43%), Gaps = 44/380 (11%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL----PYK 256
L++T++ +G P + + +DTGSDL WI C+ C CA + Y L P
Sbjct: 99 LHYTWIDIGTPSVSFLVALDTGSDLLWIPCN--CVQCAPLTSTYYSSLATKDLNEYNPSS 156
Query: 257 DSLCMEIQRNHK-PGYCETC----QQCDYEIEY-ADHSSSMGVLARDELHLT------IE 304
S +HK G C +QC Y ++Y + ++SS G+L D LHLT +
Sbjct: 157 SSSSKVFLCSHKLCGSASDCDSPKEQCTYTVKYLSGNTSSSGLLVEDILHLTYNTNNRLM 216
Query: 305 NGSLT-KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
NGS + K VV GC Q G L+ V DG++GL A++S+PS L+ G+++N C
Sbjct: 217 NGSSSVKARVVVGCGKKQSGDYLDG-VAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCF 275
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG 423
+ G ++ G D+ PS + +PF++L + + + +
Sbjct: 276 --DEEDSGRIYFG-DMGPSIQQS------APFLQLENNSGYIVGVEACCIGNSCLKQTSF 326
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
D+G S+TY ++ Y ++ + D + S V W ++ S V+ K
Sbjct: 327 TTFIDSGQSFTYLPEEIYRKVALEI-----DRHINATSKSFEGVSW--EYCYESSVEPKV 379
Query: 484 FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
L + + I F LV CL I + GS +G +R
Sbjct: 380 PAIKLKFSHNNTFVIHKPLFVFQQSQGLV-----QFCLPISPSEQEGIGS---IGQNYMR 431
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
G +V+D N ++GW+ S C
Sbjct: 432 GYRMVFDRENMKLGWSPSKC 451
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 98/416 (23%), Positives = 179/416 (43%), Gaps = 36/416 (8%)
Query: 163 GIIRPHKSK---INKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMI-VGNPPRPYYLD 218
GI+R ++ I+++L + D+++ P + L + I +G P R + +
Sbjct: 87 GILRRDHNRVRSIHRRLTGAG----DTAATIPASLGLAFHSLEYVVTIGIGTPARNFTVL 142
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETC--Q 276
DTGSDLTW+QC SC + PL+ P + Y D C Q G TC
Sbjct: 143 FDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSST--YVDVPCGTPQCKIGGGQDLTCGGT 200
Query: 277 QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLL--LNTLVKTDG 334
C+Y ++Y D S + G LA++ T+ + VVFGC+++ + + G
Sbjct: 201 TCEYSVKYGDQSVTRGNLAQEA--FTLSPSAPPAAGVVFGCSHEYSSGVKGAEEEMSVAG 258
Query: 335 ILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPML--D 392
+LGL R S+ SQ +G +V +CL GY+ +G P +++ P++ +
Sbjct: 259 LLGLGRGDSSILSQ-TRRGNSGDVFSYCLPPRGSSAGYLTIGAAAPPQSNLSFTPLVTDN 317
Query: 393 SPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKE-V 451
S +Y ++ I+ + L + A +G + D+G+ T+ AY L + +
Sbjct: 318 SQLSSVYVVNLVGISVSGAALPIDASAFYIG-TVIDSGTVITHMPAAAYYVLRDEFRRHM 376
Query: 452 SSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYL 511
++ + +L C+ +V + L FG +I + G L
Sbjct: 377 GGYTMLPEGHVESLDTCYDVTG--HDVVTAPP----VALEFGGGARI-----DVDASGIL 425
Query: 512 VI----SKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
++ + ++ L L + +I+G++ R VV+D +RIG+ + C
Sbjct: 426 LVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAYNVVFDVEGRRIGFGANGC 481
>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 542
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 107/400 (26%), Positives = 173/400 (43%), Gaps = 64/400 (16%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP---RMGNILPYKD 257
L++T++ +G P + + +D GSDL W+ CD C CA + Y + P
Sbjct: 112 LHYTWIDIGTPHVSFLVALDAGSDLLWVPCD--CLQCAPLSASYYSSLDRDLNEYSPSHS 169
Query: 258 SLCMEIQRNHKPGYCE---TC----QQCDYEIEY-ADHSSSMGVLARDELHLTIENG--- 306
S + +H+ CE C Q C Y ++Y +++SS G+L D LHL NG
Sbjct: 170 STSKHLSCSHQ--LCELGPNCNSPKQPCPYSMDYYTENTSSSGLLVEDILHLA-SNGDNA 226
Query: 307 ---SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
S+ P VV GC Q G L+ V DG++GL A++S+PS LA G+I+N C
Sbjct: 227 LSYSVRAP-VVIGCGMKQSGGYLDG-VAPDGLMGLGLAEISVPSFLAKAGLIRNSFSMCF 284
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMEL---YHTEILKIN---YGSSPLNLGA 417
+ G +F G D P+ + +PF+ L Y T ++ + GSS L +
Sbjct: 285 DED--DSGRIFFG-DQGPTTQQS------TPFLTLDGNYTTYVVGVEGFCVGSSCLKQTS 335
Query: 418 RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFP-IR 476
AL DTG+S+T+ Y E I + + + + C+++ +
Sbjct: 336 FR-----ALVDTGTSFTFLPNGVY-ERITEEFDRQVNATISSFNGYPWKYCYKSSSNHLT 389
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKG--NICLGILDGSEVHNGST 534
+ VK F ++ F I +++ +G CL I + G
Sbjct: 390 KVPSVKLIFP------------LNNSFVIHNPVFMIYGIQGITGFCLAI----QPTEGDI 433
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKSLPF 574
+G + G VV+D N ++GW+ S C + K +P
Sbjct: 434 GTIGQNFMAGYRVVFDRENMKLGWSHSSCEDRSNDKRMPL 473
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 98/377 (25%), Positives = 148/377 (39%), Gaps = 49/377 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y + +G P + + DTGSD TW+QC + C + PL+ P +
Sbjct: 94 GNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCS 153
Query: 257 DSLCMEIQRNHKPGYCETCQ--QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
S C ++ Y C C Y I+Y D S ++G A+D L L + T N
Sbjct: 154 SSYCSDL-------YVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYD----TIKNFR 202
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMF 374
FGC +GL + G+LGL R K SLP Q + V +CL + G G++
Sbjct: 203 FGCGEKNRGL----FGRAAGLLGLGRGKTSLPVQAYDK--YGGVFAYCLPATSAGTGFLD 256
Query: 375 LGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYT 434
LG P+ PML Y+ + I G L + L D+G+ T
Sbjct: 257 LGPG-APAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVIT 315
Query: 435 YFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGS 494
AY+ L ++ + + GL A+ P SI+D LT H G
Sbjct: 316 RLPPSAYAPLRSAFSK-AMQGLGYSAA------------PAFSILDT---CYDLTGHKGG 359
Query: 495 KWQI--VSTKFH------ISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
+ VS F + G L ++ CL ++ + I+G+ +
Sbjct: 360 SIALPAVSLVFQGGACLDVDASGILYVADVSQACLAFAPNAD--DTDVAIVGNTQQKTHG 417
Query: 547 VVYDNVNKRIGWAKSHC 563
V+YD K +G+A C
Sbjct: 418 VLYDIGKKIVGFAPGAC 434
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 161/380 (42%), Gaps = 51/380 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YFT + VG P R Y+ +DTGSD+ W+QC APC C ++P++ PR
Sbjct: 140 GEYFTRLGVGTPARYVYMVLDTGSDIVWLQC-APCRRCYSQSDPIFDPRKSKTYATIPCS 198
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
+R G + C Y++ Y D S ++G + + LT + V GC +
Sbjct: 199 SPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTET--LTFRRNRVK--GVALGCGH 254
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG--GYMFLGH 377
D +GL V G+LGL + K+S P Q + +CL + + G+
Sbjct: 255 DNEGL----FVGAAGLLGLGKGKLSFPGQTGHR--FNQKFSYCLVDRSASSKPSSVVFGN 308
Query: 378 DLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQVGWALF--------- 427
V S + P+L +P ++ Y+ E+L I+ G G R V +LF
Sbjct: 309 AAV-SRIARFTPLLSNPKLDTFYYVELLGISVG------GTRVPGVAASLFKLDQIGNGG 361
Query: 428 ---DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
D+G+S T + AY + + + V + L C F + ++ +VK
Sbjct: 362 VIIDSGTSVTRLIRPAYIAMRDAFR-VGAKALKRAPDFSLFDTC----FDLSNMNEVK-- 414
Query: 485 FKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
T+ LHF G+ + +T + I + G C G I+G+I +
Sbjct: 415 VPTVVLHFRGADVSLPATNYLIP------VDTNGKFCFAFAG----TMGGLSIIGNIQQQ 464
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
G VVYD + R+G+A C
Sbjct: 465 GFRVVYDLASSRVGFAPGGC 484
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 109/390 (27%), Positives = 173/390 (44%), Gaps = 52/390 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y+T + +G+P + L +DTGS+LTW++C PC CA + +Y + YK
Sbjct: 98 GEYYTSIKLGSPGQEAILIVDTGSELTWLKC-LPCKVCAPSVDTIYD--AARSVSYKPVT 154
Query: 260 CMEIQ--RNHKPG---YCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGS-LTKPN 312
C Q N G YC QC + Y D S S G L+ D L + T+ G +T +
Sbjct: 155 CNNSQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQD 214
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TTNAGG 369
FGCA QG L GILGL+ K++LP QL + K HC +++
Sbjct: 215 FAFGCA---QGDLELVPTGASGILGLNAGKMALPMQLGQRFGWK--FSHCFPDRSSHLNS 269
Query: 370 GGYMFLGHDLVPSWGMAW--VPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVGWAL 426
G +F G+ +P + + V + +S + YH + ++ S L L R S V +
Sbjct: 270 TGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPRGSVV---I 326
Query: 427 FDTGSSYTYFTKQAYSELIA--------SLKEVSSDGLVLDASDPTLPVCWRAKFPIRSI 478
D+GSS++ F + +S+L SLK + D S L C+ K I
Sbjct: 327 LDSGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGD------SFGDLGTCF--KVSNDDI 378
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGN---ICLGILDGSEVHNGST 534
++ + +L+L F I I G L+ +++ N +C DG
Sbjct: 379 DELHRTLPSLSLVFEDGVTI-----GIPSIGVLLPVARYQNHVKMCFAFEDGGP---NPV 430
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHCM 564
++G+ + V YD R+G+A++ C+
Sbjct: 431 NVIGNYQQQNLWVEYDIQRSRVGFARASCV 460
>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 508
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 160/379 (42%), Gaps = 52/379 (13%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPY----- 255
LY+ + +G P + + +DTGSDL W+ C+ C+ C P Y + N +
Sbjct: 103 LYYANVSIGTPGLYFLVALDTGSDLFWLPCE--CTKC-----PTYLTKRDNGKFWLNHYS 155
Query: 256 KDSLCMEIQRNHKPGYCETCQQCD-------YEIEY-ADHSSSMGVLARDELHLTIENGS 307
++ I+ CE QC Y+ Y +++SSS G L +D LH+ ++
Sbjct: 156 SNASSTSIRVPCSSSLCELANQCSSNKSSCPYQTHYLSENSSSAGYLVQDILHMATDDSQ 215
Query: 308 LTKP---NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
L KP V GC Q G N + +G++GL KVS+PS LASQG+ + C
Sbjct: 216 L-KPVDVKVTLGCGKVQTGKFSN-VTAPNGLIGLGMGKVSVPSFLASQGLTTDSFSMCF- 272
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGW 424
G GY + + G P +P Y+ IL+I + P N+
Sbjct: 273 ---GYYGYGRIDFGDIGPVGQRETPF--NPASLSYNVTILQIIVTNRPTNVHLT------ 321
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
A+ D+G+S+TY T YS + ++ + SD C+R +Q
Sbjct: 322 AIIDSGASFTYLTDPFYSIITENMDAAMELERIKSDSDFPFEYCYRLSLA----TIFQQP 377
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
T+ G K+ ++++ + + +CL I+ ++++ ++G G
Sbjct: 378 NLNFTMEGGRKFDVITSYVSVDTD------DGPALCLAIVKSTDIN-----VIGHNFFGG 426
Query: 545 QLVVYDNVNKRIGWAKSHC 563
VV++ +GW + C
Sbjct: 427 YRVVFNREKMTLGWKEVDC 445
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 102/400 (25%), Positives = 166/400 (41%), Gaps = 63/400 (15%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC-AKGANPLYKPRMGNI---LPYKD 257
Y ++ VG PPRP L +DTGSDL W QC APC +C +GA P+ P + +
Sbjct: 94 YLVHLSVGTPPRPVALTLDTGSDLVWTQC-APCLNCFDQGAIPVLDPAASSTHAAVRCDA 152
Query: 258 SLCMEI--QRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL----TIENGSLTKP 311
+C + + G + C Y Y D S ++G LA D + G +++
Sbjct: 153 PVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGVSER 212
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG 371
+ FGC + +G+ GI G R + SLPSQL +C T+
Sbjct: 213 RLTFGCGHFNKGIF---QANETGIAGFGRGRWSLPSQLGVTSF-----SYCFTS------ 258
Query: 372 YMFLGHDLVPSWGMA-----------WVPMLDSPFM-ELYHTEILKINYGSSPLNLGARN 419
MF + + G+A P+L P LY + I G++ + + R
Sbjct: 259 -MFESTSSLVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERR 317
Query: 420 SQV--GWALFDTGSSYTYFTKQAY----SELIA--SLKEVSSDGLVLD-------ASDPT 464
++ A+ D+G+S T + Y +E +A L + +G LD A+ P
Sbjct: 318 QRLREASAIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPK 377
Query: 465 LPVCWRAKFPIRSI-VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGI 523
WR + R++ V V + L G+ W++ + G V +CL +
Sbjct: 378 SAFGWRWRGRGRAMPVRVPRL--VFHLGGGADWELPRENYVFEDYGARV------MCL-V 428
Query: 524 LDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
LD + T+++G+ + VVYD N + +A + C
Sbjct: 429 LDAATGGGDQTVVIGNYQQQNTHVVYDLENDVLSFAPARC 468
>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 163/380 (42%), Gaps = 50/380 (13%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK-----GANPL----YKPRMGN 251
L++ + VG P + + +DTGSDL W+ CD C++C + G + L Y P +
Sbjct: 103 LHYANVTVGTPSDWFLVALDTGSDLFWLPCD--CTNCVRELKAPGGSSLDLNIYSPNASS 160
Query: 252 I---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTIENGS 307
+P +LC R P C Y+I Y ++ +SS GVL D LHL + N
Sbjct: 161 TSTKVPCNSTLCTRGDRCASPE-----SNCPYQIRYLSNGTSSTGVLVEDVLHL-VSNDK 214
Query: 308 LTKP---NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
+K V GC Q G+ + +G+ GL +S+PS LA +GI N C
Sbjct: 215 SSKAIPARVTLGCGQVQTGVFHDG-AAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFG 273
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGW 424
+ G G + G S P+ Y+ + KI+ + +L
Sbjct: 274 ND--GAGRISFGDK--GSVDQRETPLNIRQPHPTYNITVTKISVEGNTGDLEFD------ 323
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
A+FD+G+S+TY T AY+ + S ++ D +D LP + + + D Q+
Sbjct: 324 AVFDSGTSFTYLTDAAYTLISESFNSLALDKR-YQTTDSELP--FEYCYALSPNKDSFQY 380
Query: 485 -FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
LT+ GS + + +H P + + CL IL ++ I+G +
Sbjct: 381 PAVNLTMKGGSSYPV----YH--PLVVIPMKDTDVYCLAILKIEDIS-----IIGQNFMT 429
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
G VV+D +GW +S C
Sbjct: 430 GYRVVFDREKLILGWKESDC 449
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 119/494 (24%), Positives = 193/494 (39%), Gaps = 86/494 (17%)
Query: 88 FLAISIFALILYGSVFSYTLQDRYKSNNDDENKESFVFPLYHKFGIREVSQRDAEFKLGR 147
FL +S+F+L S FS+ L + F L H+ + + E K
Sbjct: 6 FLTLSLFSLCFIAS-FSHALSN------------GFSVELIHRDSPKSPYYKPTENKYQH 52
Query: 148 FVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMI 207
FVD S+ ++ N S+ + S++ P RG G TY
Sbjct: 53 FVDAARRSI-------------NRANHFFKDSDT-STPESTVIPDRG-----GYLMTYS- 92
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQ 264
VG PP Y DTGSD+ W+QC+ PC C P++ P + +P LC ++
Sbjct: 93 VGTPPTKIYGIADTGSDIVWLQCE-PCEQCYNQTTPIFNPSKSSSYKNIPCSSKLCHSVR 151
Query: 265 RNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVVFGCAYDQQG 323
C C Y+I Y D S S G L+ D L L +GS ++ P +V GC D G
Sbjct: 152 DTS----CSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKIVIGCGTDNAG 207
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT----TNAGGGGYMFLGHDL 379
+ GI+GL VSL +QL S I +CL + + G
Sbjct: 208 TFGGA---SSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSILSFGDAA 262
Query: 380 VPSW-GMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNS---QVGWALFDTGSSYTY 435
V S G+ P++ + Y + + G+ + G + G + D+G++ T
Sbjct: 263 VVSGDGVVSTPLIKKDPV-FYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGTTLTL 321
Query: 436 FTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK-----FPIRSIVDVKQFFKTLTL 490
Y+ L +++ ++ V D + +C+ K FPI +T+
Sbjct: 322 IPSDVYTNLESAVVDLVKLDRV-DDPNQQFSLCYSLKSNEYDFPI------------ITV 368
Query: 491 HF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVY 549
HF G+ ++ S V G +C ++ + I G+++ + LV Y
Sbjct: 369 HFKGADVELHSIS-------TFVPITDGIVCFAFQPSPQLGS----IFGNLAQQNLLVGY 417
Query: 550 DNVNKRIGWAKSHC 563
D K + + + C
Sbjct: 418 DLQQKTVSFKPTDC 431
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/384 (23%), Positives = 160/384 (41%), Gaps = 33/384 (8%)
Query: 197 YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---L 253
+ G YF + VG P L +DTGSDL W+QC +PC C ++ PR + +
Sbjct: 81 FESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQC-SPCRRCYAQRGQVFDPRRSSTYRRV 139
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
P C ++ C Y + Y D SSS G LA D+ L N + NV
Sbjct: 140 PCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDK--LAFANDTYVN-NV 196
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TTNAGGG 370
GC D +GL + G+LG++R K+S+ +Q+A +V +CL T+ +
Sbjct: 197 TLGCGRDNEGLFDSAA----GLLGVARGKISISTQVAPA--YGSVFEYCLGDRTSRSTRS 250
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQV------- 422
Y+ G P A+ +L +P LY+ ++ + G + G N+ +
Sbjct: 251 SYLVFGRTPEPP-STAFTALLSNPRRPSLYYVDMAGFSVGGERVT-GFSNASLALDTATG 308
Query: 423 -GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
G + D+G++ + F + AY+ L + + + + + A + +R
Sbjct: 309 RGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEH--SVFDACYDLRGRPAA 366
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
L G+ + + + +G + CLG E + ++G++
Sbjct: 367 SAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGF----EAADDGLSVIGNVQ 422
Query: 542 LRGQLVVYDNVNKRIGWAKSHCMN 565
+G VV+D +RIG+A C +
Sbjct: 423 QQGFRVVFDVEKERIGFAPKGCTS 446
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 97.1 bits (240), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 98/377 (25%), Positives = 148/377 (39%), Gaps = 49/377 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y + +G P + + DTGSD TW+QC + C + PL+ P +
Sbjct: 159 GNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCS 218
Query: 257 DSLCMEIQRNHKPGYCETCQ--QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
S C ++ Y C C Y I+Y D S ++G A+D L L + T N
Sbjct: 219 SSYCSDL-------YVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYD----TIKNFR 267
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMF 374
FGC +GL + G+LGL R K SLP Q + V +CL + G G++
Sbjct: 268 FGCGEKNRGL----FGRAAGLLGLGRGKTSLPVQAYDK--YGGVFAYCLPATSAGTGFLD 321
Query: 375 LGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYT 434
LG P+ PML Y+ + I G L + L D+G+ T
Sbjct: 322 LGPG-APAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVIT 380
Query: 435 YFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGS 494
AY+ L ++ + + GL A+ P SI+D LT H G
Sbjct: 381 RLPPSAYAPLRSAFSK-AMQGLGYSAA------------PAFSILDT---CYDLTGHKGG 424
Query: 495 KWQI--VSTKFH------ISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
+ VS F + G L ++ CL ++ + I+G+ +
Sbjct: 425 SIALPAVSLVFQGGACLDVDASGILYVADVSQACLAFAPNAD--DTDVAIVGNTQQKTHG 482
Query: 547 VVYDNVNKRIGWAKSHC 563
V+YD K +G+A C
Sbjct: 483 VLYDIGKKIVGFAPGAC 499
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 97.1 bits (240), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 107/384 (27%), Positives = 163/384 (42%), Gaps = 50/384 (13%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
DG Y M +G P R Y +DTGSDL W QC APC C P + P N Y+
Sbjct: 89 DGEYLMEMGIGTPARFYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDP--ANSSTYRSL 145
Query: 259 LCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
C N Y C Q C Y+ Y D +S+ GVLA + + +T P + FG
Sbjct: 146 GCSAPACNAL--YYPLCYQKTCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISFG 203
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG------- 369
C G L N G++G R +SL SQL S +CLT+
Sbjct: 204 CGNLNAGSLAN----GSGMVGFGRGSLSLVSQLGSPRF-----SYCLTSFLSPVRSRLYF 254
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSS-----PLNLGARNSQ-V 422
G Y L + + P + +P + +Y + I+ G + P L ++
Sbjct: 255 GAYATLNSTNAST--VQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGT 312
Query: 423 GWALFDTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPT-LPVCWRAKFPIRSIVD 480
G + D+G++ TY + AY + A + ++S +LD ++ + L C++ P R V
Sbjct: 313 GGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVT 372
Query: 481 VKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
+ Q L LHF G+ W++ + LV G +CL + S+ GS I+G
Sbjct: 373 LPQ----LVLHFDGADWELPLQNYM------LVDPSTGGLCLAMATSSD---GS--IIGS 417
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
+ V+YD N + + + C
Sbjct: 418 YQHQNFNVLYDLENSLLSFVPAPC 441
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 97.1 bits (240), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 93/375 (24%), Positives = 157/375 (41%), Gaps = 35/375 (9%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
+G Y + +G+PP Y +DTGSDL W QC PC C + +P+++P Y
Sbjct: 79 NGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQC-TPCGGCYRQKSPMFEPLRSKT--YSPI 135
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVVFGC 317
C Q + C + C Y YAD S + GVLAR+ + + +G + +++FGC
Sbjct: 136 PCESEQCSFFGYSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVGDIIFGC 195
Query: 318 AYDQQGLLLNTLVKTDGILGLSRAK-VSLPSQLASQGIIKNVVGHCLT---TNAGGGGYM 373
+ G T + D + +SL SQ+ + K CL T+A G +
Sbjct: 196 GHSNSG----TFNENDMGIIGMGGGPLSLVSQIGTLYGSKR-FSQCLVPFHTDAHTSGTI 250
Query: 374 FLGHDL-VPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNS-QVGWALFDTGS 431
G + V G+ P+ Y + I+ G + + + + G + D+G+
Sbjct: 251 NFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSSETLSKGNIMIDSGT 310
Query: 432 SYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLH 491
TY ++ Y L+ LK SS + D D +C+R++ + + LT H
Sbjct: 311 PATYIPQEFYERLVEELKVQSSLLPIEDDPDLGTQLCYRSETNLEGPI--------LTAH 362
Query: 492 F-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYD 550
F G+ Q++ + I P K G C + ++ I G+ + L+ +D
Sbjct: 363 FEGADVQLLPIQTFIPP-------KDGVFCFAMAGSTD----GDYIFGNFAQSNILMGFD 411
Query: 551 NVNKRIGWAKSHCMN 565
K I + + C N
Sbjct: 412 LDRKTISFKPTDCTN 426
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 105/388 (27%), Positives = 163/388 (42%), Gaps = 50/388 (12%)
Query: 195 NIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILP 254
N P Y ++ +G PP+P L +DTGSDL W QC PC +C A P + P + L
Sbjct: 75 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLS 133
Query: 255 Y---KDSLC--MEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLT 309
+LC + + P + Q C Y Y D S + G L D+ S+
Sbjct: 134 LTSCDSTLCQGLPVASCGSPKFWPN-QTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASV- 191
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTD--GILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
P V FGC GL N + K++ GI G R +SLPSQL + N HC T
Sbjct: 192 -PGVAFGC-----GLFNNGVFKSNETGIAGFGRGPLSLPSQLK----VGN-FSHCFTAVN 240
Query: 368 G---GGGYMFLGHDLVPS--WGMAWVPMLDSPFM-ELYHTEILKINYGSS----PLNLGA 417
G + L DL S + P++ +P Y+ + I GS+ P + A
Sbjct: 241 GLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFA 300
Query: 418 RNSQVGWALFDTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTLPVCWRAKFPIR 476
+ G + D+G++ T + Y + A +V + + +DP C A P+R
Sbjct: 301 LKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--CLSA--PLR 356
Query: 477 SIVDVKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI 535
+ K + L LHF G+ + + E + +CL I++G EV
Sbjct: 357 A----KPYVPKLVLHFEGATMDLPRENYVFEVED----AGSSILCLAIIEGGEVTT---- 404
Query: 536 ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G+ + V+YD N ++ + + C
Sbjct: 405 -IGNFQQQNMHVLYDLQNSKLSFVPAQC 431
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 163/382 (42%), Gaps = 56/382 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYK 256
G YFT + +G P R Y+ +DTGSD+ WIQC+ PC C A+P++ P + +
Sbjct: 6 GEYFTRIGIGTPTREQYMVLDTGSDVVWIQCE-PCRECYSQADPIFNPSSSVSFSTVGCD 64
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
++C ++ N G C YE+ Y D S ++G A + L G+ + NV G
Sbjct: 65 SAVCSQLDANDCHG-----GGCLYEVSYGDGSYTVGSYATETLTF----GTTSIQNVAIG 115
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-TNAGGGGYMFL 375
C +D GL V G+LGL +S P+QL +Q +CL ++ G +
Sbjct: 116 CGHDNVGL----FVGAAGLLGLGAGSLSFPAQLGTQ--TGRAFSYCLVDRDSESSGTLEF 169
Query: 376 GHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLG-------ARNSQVGWALF 427
G + VP G + P++ +PF+ Y+ ++ I+ G L+ + G +
Sbjct: 170 GPESVP-IGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIII 228
Query: 428 DTGSSYTYFTKQAYSEL----IASLKEV-SSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
D+G++ T AY L IA + + +DG+ + + L P
Sbjct: 229 DSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIP-------- 280
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDIS 541
+ HF + F + + L+ + G C + + I+G+I
Sbjct: 281 ----AVGFHFSN-----GAGFILPAKNCLIPMDSMGTFCFAFAPA----DSNLSIMGNIQ 327
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+G V +D+ N +G+A C
Sbjct: 328 QQGIRVSFDSANSLVGFAIDQC 349
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 98/384 (25%), Positives = 151/384 (39%), Gaps = 65/384 (16%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPR---MGNI 252
LYF + +GNP + YY+ +DTGSD+ W+ C C C ++ LY P
Sbjct: 26 LYFAKIGLGNPSKDYYVQVDTGSDILWVNC-IGCDKCPTKSDLGIKLTLYDPASSVSATR 84
Query: 253 LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS----L 308
+ D C P C+ C Y + Y D SS+ G D + G+ L
Sbjct: 85 VSCDDDFCTSTYNGLLPD-CKKELPCQYNVVYGDGSSTAGYFVSDAVQFERVTGNLQTGL 143
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
+ V FGC Q G L + DGILG HCL N
Sbjct: 144 SNGTVTFGCGAQQSGGLGTSGEALDGILG--------------------AFAHCL-DNVN 182
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL-------GARNSQ 421
GGG +G + P + PM+ P Y+ + +I G + L L G R
Sbjct: 183 GGGIFAIGELVSPK--VNTTPMV--PNQAHYNVYMKEIEVGGTVLELPTDVFDSGDRRGT 238
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
+ D+G++ Y + Y ++ ++ GL L + +C++ +V
Sbjct: 239 I----IDSGTTLAYLPEVVYDSMMNEIRS-QQPGLSLHTVEEQF-ICFKYS------GNV 286
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS-EVHNGSTI-ILGD 539
F + HF S + P YL + C G +G + +G + +LGD
Sbjct: 287 DDGFPDIKFHFKD-----SLTLTVYPHDYLFQISEDIWCFGWQNGGMQSKDGRDMTLLGD 341
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
+ L +LV+YD N+ IGW + +C
Sbjct: 342 LVLSNKLVLYDIENQAIGWTEYNC 365
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 99/401 (24%), Positives = 163/401 (40%), Gaps = 71/401 (17%)
Query: 198 PDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILP 254
PD Y +M +G PP+P L +DTGSDLTW QC APC SC + + P + P ++LP
Sbjct: 81 PDTEYLVHMAIGTPPQPVQLILDTGSDLTWTQC-APCVSCFRQSLPRFNPSRSMTFSVLP 139
Query: 255 YKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN---GSLTKP 311
+C ++ + C Y YADHS + G L D + G + P
Sbjct: 140 CDLRICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVP 199
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT------- 364
++ FGC G+ ++ GI G SR +S+P+QL +C T
Sbjct: 200 DLTFGCGLFNNGIFVS---NETGIAGFSRGALSMPAQLKVDNF-----SYCFTAITGSEP 251
Query: 365 ------------TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSP 412
++A GG GH +V S + S ++ Y+ + + G++
Sbjct: 252 SPVFLGVPPNLYSDAAGG-----GHGVVQSTALI---RYHSSQLKAYYISLKGVTVGTTR 303
Query: 413 LNL-----GARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP- 466
L + + G + D+G+ T + Y+ + + V+ L + S +L
Sbjct: 304 LPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAF--VAQTKLTVHNSTSSLSQ 361
Query: 467 VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNI---CLG 522
+C+ P + DV L LHF + E Y+ I + G I CL
Sbjct: 362 LCF--SVPPGAKPDV----PALVLHFEGA------TLDLPRENYMFEIEEAGGIRLTCLA 409
Query: 523 ILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I G ++ ++G+ + V+YD N + + + C
Sbjct: 410 INAGEDLS-----VIGNFQQQNMHVLYDLANDMLSFVPARC 445
>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
Length = 506
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 164/380 (43%), Gaps = 44/380 (11%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL----PYK 256
L++T++ +G P + + +DTGSDL WI C+ C CA + Y L P
Sbjct: 99 LHYTWIDIGTPSVSFLVALDTGSDLLWIPCN--CVQCAPLTSTYYSSLATKDLNEYNPSS 156
Query: 257 DSLCMEIQRNHK----PGYCET-CQQCDYEIEY-ADHSSSMGVLARDELHLT------IE 304
S +HK CE+ +QC Y + Y + ++SS G+L D LHLT +
Sbjct: 157 SSTSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLM 216
Query: 305 NGSLT-KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
NGS + K VV GC Q G L+ V DG++GL A++S+PS L+ G+++N C
Sbjct: 217 NGSSSVKARVVIGCGKKQSGDYLDG-VAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCF 275
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG 423
+ G ++ G D+ PS + +PF++L + + + +
Sbjct: 276 --DEEDSGRIYFG-DMGPSIQQS------TPFLQLENNSGYIVGVEACCIGNSCLKQTSF 326
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
D+G S+TY ++ Y ++ + D + S V W ++ S V+ K
Sbjct: 327 TTFIDSGQSFTYLPEEIYRKVALEI-----DRHINATSKSFEGVSW--EYCYESSVEPKV 379
Query: 484 FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
L + + I F LV CL I + GS +G +R
Sbjct: 380 PAIKLKFSHNNTFVIHKPLFVFQQSQGLV-----QFCLPISPSGQEGIGS---IGQNYMR 431
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
G +V+D N ++ W+ S C
Sbjct: 432 GYRMVFDRENMKLRWSASKC 451
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 97/392 (24%), Positives = 167/392 (42%), Gaps = 57/392 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y + VG P L +DT SDLTW+QC PC C + P++ PR + Y
Sbjct: 132 GEYMAKIAVGTPAVQALLALDTASDLTWLQCQ-PCRRCYPQSGPVFDPRHSTSYGEMNYD 190
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYAD----HSSSMGVLARDELHLTIENGSLTKPN 312
C + R+ G C Y ++Y D S+S+G L + L G + +
Sbjct: 191 APDCQALGRSG--GGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFA---GGVRQAY 245
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG- 371
+ GC +D +GL GILGL R ++S+P Q+A G + +CL G G
Sbjct: 246 LSIGCGHDNKGLF---GAPAAGILGLGRGQISIPHQIAFLGYNAS-FSYCLVDFISGPGS 301
Query: 372 ----YMFLGHDLVPSWGMAWVP-MLDSPFMELYHTEILKINYGSSPL-NLGARNSQV--- 422
F + S ++ P +L+ Y+ ++ ++ G + + R+ Q+
Sbjct: 302 PSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPY 361
Query: 423 ---GWALFDTGSSYTYFTKQAY-------SELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
G + D+G++ T + AY SL +VS+ G D V RA
Sbjct: 362 TGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPS-GLFDTCYTVGGRAG 420
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHN 531
+ ++ ++HF ++ + P+ YL+ + +G +C + +
Sbjct: 421 VKVPAV----------SMHFAGGVEV-----SLQPKNYLIPVDSRGTVCFAF---AGTGD 462
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
S ++G+I +G VVYD +R+G+A ++C
Sbjct: 463 RSVSVIGNILQQGFRVVYDLAGQRVGFAPNNC 494
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 94/392 (23%), Positives = 161/392 (41%), Gaps = 47/392 (11%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL 253
G I DG +F + +G PP + DTGSDLTW+QC PC C K P++ + +
Sbjct: 77 GLIGADGEFFMSITIGTPPMKVFAIADTGSDLTWVQC-KPCQQCYKENGPIFDKKKSSTY 135
Query: 254 ---PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LT 309
P C + + + G E+ C Y Y D S S G +A + + + +GS ++
Sbjct: 136 KSEPCDSRNCHALSSSER-GCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSPVS 194
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TTN 366
P VFGC Y+ G T +GL +SL SQL S I +CL +
Sbjct: 195 FPGTVFGCGYNNGGTFDETGSGI---IGLGGGHLSLISQLGSS--ISKKFSYCLSHKSAT 249
Query: 367 AGGGGYMFLGHDLVPS-----WGMAWVPMLDSPFMELYHTEILKINYGS----------S 411
G + LG + +PS G+ P++D Y+ + I+ G +
Sbjct: 250 TNGTSVINLGTNSIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIPYTGSSYN 309
Query: 412 PLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRA 471
P + G + G + D+G++ T + + A+++E+ + + L C+++
Sbjct: 310 PNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKRVSDPQGLLSHCFKS 369
Query: 472 KFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHN 531
+ +T+HF +SP V + +CL ++ +EV
Sbjct: 370 G-------SAEIGLPEITVHF------TGADVRLSPINAFVKVSEDMVCLSMVPTTEV-- 414
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I G+ + LV YD + + + + C
Sbjct: 415 ---AIYGNFAQMDFLVGYDLETRTVSFQRMDC 443
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 99/401 (24%), Positives = 163/401 (40%), Gaps = 71/401 (17%)
Query: 198 PDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILP 254
PD Y +M +G PP+P L +DTGSDLTW QC APC SC + + P + P ++LP
Sbjct: 107 PDTEYLVHMAIGTPPQPVQLILDTGSDLTWTQC-APCVSCFRQSLPRFNPSRSMTFSVLP 165
Query: 255 YKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN---GSLTKP 311
+C ++ + C Y YADHS + G L D + G + P
Sbjct: 166 CDLRICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVP 225
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT------- 364
++ FGC G+ ++ GI G SR +S+P+QL +C T
Sbjct: 226 DLTFGCGLFNNGIFVS---NETGIAGFSRGALSMPAQLKVDNF-----SYCFTAITGSEP 277
Query: 365 ------------TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSP 412
++A GG GH +V S + S ++ Y+ + + G++
Sbjct: 278 SPVFLGVPPNLYSDAAGG-----GHGVVQSTALI---RYHSSQLKAYYISLKGVTVGTTR 329
Query: 413 LNL-----GARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP- 466
L + + G + D+G+ T + Y+ + + V+ L + S +L
Sbjct: 330 LPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAF--VAQTKLTVHNSTSSLSQ 387
Query: 467 VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNI---CLG 522
+C+ P + DV L LHF + E Y+ I + G I CL
Sbjct: 388 LCF--SVPPGAKPDV----PALVLHFEGA------TLDLPRENYMFEIEEAGGIRLTCLA 435
Query: 523 ILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I G ++ ++G+ + V+YD N + + + C
Sbjct: 436 INAGEDLS-----VIGNFQQQNMHVLYDLANDMLSFVPARC 471
>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 531
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 166/382 (43%), Gaps = 45/382 (11%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL----PYK 256
L++T++ +G P + + +D GSDL W+ CD C CA + Y R+G L P
Sbjct: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWVPCD--CMQCAPLSASYYD-RLGRDLNEYSPSL 158
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIE--------YADHSSSMGVLARDELHLT--IENG 306
S + N + CE C + Y++++SS G+L D LHL E+
Sbjct: 159 SSTSKPLSCNDQ--LCELGSDCKSSKDPCPYLASYYSENTSSSGLLIEDRLHLAPFSEHA 216
Query: 307 SLTK--PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
S + +V+ GC Q G + DG++GL +S+PS LA G+++N C
Sbjct: 217 SRSSVWASVIIGCGRKQSGAFSDG-AAPDGLMGLGPGDLSVPSLLAKAGLVRNTFSICFD 275
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGW 424
N G +F LV ++VP L+ F+ Y E+ GSS L
Sbjct: 276 DNH-SGTILFGDQGLVTQKSTSFVP-LEGKFVT-YLIEVEGYLVGSSSLKTAGFQ----- 327
Query: 425 ALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
AL D+G+S+T+ + Y +++ K+V++ S W K+ S
Sbjct: 328 ALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSP------W--KYCYNSSSQELL 379
Query: 484 FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
T+TL F ++ F + +IS+ + L +H II G +
Sbjct: 380 NIPTVTLVFA-----MNQSFIVHNPVIKLISENEEFNVFCLPIQPIHEEFGII-GQNFMW 433
Query: 544 GQLVVYDNVNKRIGWAKSHCMN 565
G +V+D N ++GW+ S+C +
Sbjct: 434 GYRMVFDRENLKLGWSTSNCQD 455
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 93/371 (25%), Positives = 155/371 (41%), Gaps = 40/371 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y T M +G P Y + +DTGS LTW+QC SC + + P++ P+ + Y
Sbjct: 120 GNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSST--YASVG 177
Query: 260 CMEIQRNH------KPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
C Q + P C + C Y+ Y D S S+G L++D T+ GS + PN
Sbjct: 178 CSAQQCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKD----TVSFGSTSLPNF 233
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYM 373
+GC D +GL ++ G++GL+R K+SL QLA + +CL +++ G
Sbjct: 234 YYGCGQDNEGL----FGRSAGLIGLARNKLSLLYQLAPS--LGYSFTYCLPSSSSSGYLS 287
Query: 374 FLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQVGWALFDTGSS 432
++ ++ PM+ S + LY ++ + +PL++ + + D+G+
Sbjct: 288 LGSYN---PGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTV 344
Query: 433 YTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF 492
T YS L ++ + G ++ L C++ + S V F
Sbjct: 345 ITRLPTSVYSALSKAV-AAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAG----- 398
Query: 493 GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNV 552
+S + LV CL S I+G+ + VVYD
Sbjct: 399 -------GAALKLSAQNLLVDVDDSTTCLAFAPAR-----SAAIIGNTQQQTFSVVYDVK 446
Query: 553 NKRIGWAKSHC 563
+ RIG+A C
Sbjct: 447 SSRIGFAAGGC 457
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 99/401 (24%), Positives = 163/401 (40%), Gaps = 71/401 (17%)
Query: 198 PDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILP 254
PD Y +M +G PP+P L +DTGSDLTW QC APC SC + + P + P ++LP
Sbjct: 107 PDTEYLVHMAIGTPPQPVQLILDTGSDLTWTQC-APCVSCFRQSLPRFNPSRSMTFSVLP 165
Query: 255 YKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN---GSLTKP 311
+C ++ + C Y YADHS + G L D + G + P
Sbjct: 166 CDLRICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVP 225
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT------- 364
++ FGC G+ ++ GI G SR +S+P+QL +C T
Sbjct: 226 DLTFGCGLFNNGIFVS---NETGIAGFSRGALSMPAQLKVDNF-----SYCFTAITGSEP 277
Query: 365 ------------TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSP 412
++A GG GH +V S + S ++ Y+ + + G++
Sbjct: 278 SPVFLGVPPNLYSDAAGG-----GHGVVQSTALI---RYHSSQLKAYYISLKGVTVGTTR 329
Query: 413 LNL-----GARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP- 466
L + + G + D+G+ T + Y+ + + V+ L + S +L
Sbjct: 330 LPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAF--VAQTKLTVHNSTSSLSQ 387
Query: 467 VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNI---CLG 522
+C+ P + DV L LHF + E Y+ I + G I CL
Sbjct: 388 LCF--SVPPGAKPDV----PALVLHFEGA------TLDLPRENYMFEIEEAGGIRLTCLA 435
Query: 523 ILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I G ++ ++G+ + V+YD N + + + C
Sbjct: 436 INAGEDLS-----VIGNFQQQNMHVLYDLANDMLSFVPARC 471
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 92/384 (23%), Positives = 159/384 (41%), Gaps = 33/384 (8%)
Query: 197 YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---L 253
+ G YF + VG P L +DTGSDL W+QC +PC C ++ PR + +
Sbjct: 81 FESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQC-SPCRRCYAQRGQVFDPRRSSTYRRV 139
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
P C ++ C Y + Y D SSS G LA D+ L N + NV
Sbjct: 140 PCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDK--LAFANDTYVN-NV 196
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TTNAGGG 370
GC D +GL + G+LG+ R K+S+ +Q+A +V +CL T+ +
Sbjct: 197 TLGCGRDNEGLFDSAA----GLLGVGRGKISISTQVAPA--YGSVFEYCLGDRTSRSTRS 250
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQV------- 422
Y+ G P A+ +L +P LY+ ++ + G + G N+ +
Sbjct: 251 SYLVFGRTPEPP-STAFTALLSNPRRPSLYYVDMAGFSVGGERVT-GFSNASLALDTATG 308
Query: 423 -GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
G + D+G++ + F + AY+ L + + + + + A + +R
Sbjct: 309 RGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEH--SVFDACYDLRGRPAA 366
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
L G+ + + + +G + CLG E + ++G++
Sbjct: 367 SAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGF----EAADDGLSVIGNVQ 422
Query: 542 LRGQLVVYDNVNKRIGWAKSHCMN 565
+G VV+D +RIG+A C +
Sbjct: 423 QQGFRVVFDVEKERIGFAPKGCTS 446
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 103/384 (26%), Positives = 169/384 (44%), Gaps = 53/384 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYK 256
G YF + VG P Y+ +DTGSD+ W+QC +PC +C ++ ++ P+ +P
Sbjct: 136 GEYFMRLGVGTPATNVYMVLDTGSDVVWLQC-SPCKACYNQSDVIFDPKKSKTFATVPCG 194
Query: 257 DSLCMEIQRNHKPGYCET--CQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
LC + + + C T + C Y++ Y D S + G + + L +G+ +V
Sbjct: 195 SRLCRRLDDSSE---CVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF---HGARVD-HVP 247
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY-- 372
GC +D +GL V G+LGL R +S PSQ S+ +CL G
Sbjct: 248 LGCGHDNEGL----FVGAAGLLGLGRGGLSFPSQTKSR--YNGKFSYCLVDRTSSGSSSK 301
Query: 373 ----MFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV----- 422
+ G+D VP + + P+L +P ++ Y+ ++L I+ G S + G SQ
Sbjct: 302 PPSTIVFGNDAVPKTSV-FTPLLTNPKLDTFYYLQLLGISVGGSRVP-GVSESQFKLDAT 359
Query: 423 --GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
G + D+G+S T T+ AY L + + + + L S C F + +
Sbjct: 360 GNGGVIIDSGTSVTRLTQSAYVALRDAFR-LGATKLKRAPSYSLFDTC----FDLSGMTT 414
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGD 539
VK T+ HFG + + YL+ ++ +G C GS I+G+
Sbjct: 415 VK--VPTVVFHFGGG------EVSLPASNYLIPVNTEGRFCFAFAG----TMGSLSIIGN 462
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
I +G V YD V R+G+ C
Sbjct: 463 IQQQGFRVAYDLVGSRVGFLSRAC 486
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 96.3 bits (238), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 97/387 (25%), Positives = 157/387 (40%), Gaps = 61/387 (15%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPY 255
+G + M +G P Y +DTGSDL W QC PC C + P++ P + LP
Sbjct: 99 NGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCK-PCVECFNQSTPVFDPSSSSTYAALPC 157
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVV 314
+LC ++ P T +C Y Y D SS+ GVLA + L + TK P+V
Sbjct: 158 SSTLCSDL-----PSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTL-----AKTKLPDVA 207
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG----- 369
FGC +G + + G++GL R +SL SQL N +CLT+
Sbjct: 208 FGCGDTNEG---DGFTQGAGLVGLGRGPLSLVSQLG-----LNKFSYCLTSLDDTSKSPL 259
Query: 370 --GGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNL-----GARNSQ 421
G + + + P++ +P Y+ + + GS+ + L ++
Sbjct: 260 LLGSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDG 319
Query: 422 VGWALFDTGSSYTYFTKQAYSEL----IASLKEVSSDGLVLDASDPTLPVCWRAKFPIRS 477
G + D+G+S TY Q Y L A +K ++DG S L C+ A
Sbjct: 320 TGGVIVDSGTSITYLELQGYRALKKAFAAQMKLPAADG-----SGIGLDTCFEAPASGVD 374
Query: 478 IVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGILDGSEVHNGSTII 536
V+V + + + + E Y+V+ S G +CL ++ + I
Sbjct: 375 QVEVPKLV----------FHLDGADLDLPAENYMVLDSGSGALCLTVMGSRGLS-----I 419
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G+ + VYD + +A C
Sbjct: 420 IGNFQQQNIQFVYDVGENTLSFAPVQC 446
>gi|218185382|gb|EEC67809.1| hypothetical protein OsI_35378 [Oryza sativa Indica Group]
Length = 344
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 42/103 (40%), Positives = 70/103 (67%), Gaps = 8/103 (7%)
Query: 461 SDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNIC 520
SDP+LP+CW+ + S+ DVK+ FK+L L+FG+ + I PE +L++++ GN+C
Sbjct: 102 SDPSLPLCWKGQKAFESVSDVKKEFKSLQLNFGN-----NAVMEIPPENFLIVTEYGNVC 156
Query: 521 LGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
LGIL GS ++ I+GDI+++ Q+V+YDN +++GW + C
Sbjct: 157 LGILHGSRLNFN---IIGDITMQDQMVIYDNEREQLGWIRGSC 196
Score = 40.0 bits (92), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 42/81 (51%), Gaps = 13/81 (16%)
Query: 277 QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY---------DQQGLLLN 327
QCDYEI+YAD +S++G L D+ L T+PN+ FG Y D+ L +N
Sbjct: 28 QCDYEIKYADGASTIGALIVDQFSLP---RIATRPNLPFGNYYSPGSATLYFDRHSLGMN 84
Query: 328 TL-VKTDGILGLSRAKVSLPS 347
+ V G+ S +VS PS
Sbjct: 85 PMDVIKGGLSSTSLEQVSDPS 105
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 93/368 (25%), Positives = 154/368 (41%), Gaps = 30/368 (8%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS-SCAKGANPLYKPRMGNI---LPY 255
G Y + +G P + + L DTGSDLTW QC+ PCS C + + P L
Sbjct: 130 GGYAVTVGLGTPKKDFSLLFDTGSDLTWTQCE-PCSGGCFPQNDEKFDPTKSTSYKNLSC 188
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
C I + G C + C Y ++Y ++G LA + L +T + N V
Sbjct: 189 SSEPCKSIGKESAQG-CSSSNSCLYGVKYGT-GYTVGFLATETLTITPSD---VFENFVI 243
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC G T G+LGL R+ V+LPSQ +S KN+ +CL ++ G++
Sbjct: 244 GCGERNGGRFSGTA----GLLGLGRSPVALPSQTSS--TYKNLFSYCLPASSSSTGHLSF 297
Query: 376 GHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTY 435
G + S + P + S ELY ++ I+ G L + + + D+G++ TY
Sbjct: 298 GGGV--SQAAKFTP-ITSKIPELYGLDVSGISVGGRKLPIDPSVFRTAGTIIDSGTTLTY 354
Query: 436 FTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSK 495
A+S L ++ +E+ ++ L L C+ + + Q ++ G +
Sbjct: 355 LPSTAHSALSSAFQEMMTN-YTLTKGTSGLQPCYDFSKHANDNITIPQI--SIFFEGGVE 411
Query: 496 WQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKR 555
I + I+ G +CL D ++ I G++ + VVYD
Sbjct: 412 VDIDDSGIFIAANGLE------EVCLAFKDNG--NDTDVAIFGNVQQKTYEVVYDVAKGM 463
Query: 556 IGWAKSHC 563
+G+A C
Sbjct: 464 VGFAPGGC 471
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 98/401 (24%), Positives = 171/401 (42%), Gaps = 61/401 (15%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK---------------GANP 243
G YF + +G PP+ L DTGSDL W++C +PC +C+ A
Sbjct: 83 SGQYFVSLRIGTPPQTLLLVADTGSDLIWVKC-SPCRNCSHRSPGSAFFARHSTTYSAIH 141
Query: 244 LYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTI 303
Y P+ ++P+ R H P C Y+ YAD S++ G +++ L L
Sbjct: 142 CYSPQC-QLVPHPHPNPCNRTRLHSP--------CRYQYTYADSSTTTGFFSKEALTLNT 192
Query: 304 ENGSLTKPN-VVFGCAYDQQGLLLN--TLVKTDGILGLSRAKVSLPSQLASQGIIKNVVG 360
G + K N + FGC + G L + G++GL RA +S SQL + +
Sbjct: 193 STGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRR--FGSKFS 250
Query: 361 HCL---TTNAGGGGYMFLG---HDLVPSWG-MAWVPMLDSPFMELYHTEILK---INYGS 410
+CL T + ++ +G + V G M++ P+L +P ++ +K +N
Sbjct: 251 YCLMDYTLSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVK 310
Query: 411 SPLNLGA---RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT--L 465
P+N + G + D+G++ T+ T+ AY+E++ + K+ + ++PT
Sbjct: 311 LPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVK---LPSPAEPTPGF 367
Query: 466 PVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILD 525
+C R + F L GS F P Y + + CL +
Sbjct: 368 DLCMNVSGVTRPALPRMSF----NLAGGSV-------FSPPPRNYFIETGDQIKCLAVQP 416
Query: 526 GSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNP 566
S+ +G +LG++ +G L+ +D R+G+ + C P
Sbjct: 417 VSQ--DGGFSVLGNLMQQGFLLEFDRDKSRLGFTRRGCALP 455
>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 99/382 (25%), Positives = 170/382 (44%), Gaps = 45/382 (11%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL----PYK 256
L++T++ +G P + + +D GSDL W+ CD C CA + Y R+G L P
Sbjct: 92 LHYTWIDIGTPNVSFLVALDAGSDLLWVPCD--CMQCAPLSASYYD-RLGRDLNEYSPSL 148
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIE--------YADHSSSMGVLARDELHLT--IENG 306
S + N + CE C + Y++++SS G+L D LHL E+
Sbjct: 149 SSTSKPLSCNDQ--LCELGSDCKSSKDPCPYLASYYSENTSSSGLLIEDRLHLAPFSEHA 206
Query: 307 SLTK--PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
S + +V+ GC Q G + DG++GL +S+PS LA G+++N C
Sbjct: 207 SRSSVWASVIIGCGRKQSGAFSDG-AAPDGLMGLGPGDLSVPSLLAKAGLVRNTFSICFD 265
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGW 424
N G +F LV ++VP L+ F+ Y E+ GSS L
Sbjct: 266 DNH-SGTILFGDQGLVTQKSTSFVP-LEGKFVT-YLIEVEGYLVGSSSLKTAGFQ----- 317
Query: 425 ALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
AL D+G+S+T+ + Y +++ K+V++ S C+ + + ++++
Sbjct: 318 ALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGS--PWKYCYNSS--SQELLNI-- 371
Query: 484 FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
T+TL F ++ F + +IS+ + L +H II G +
Sbjct: 372 --PTVTLVFA-----MNQSFIVHNPVIKLISENEEFNVFCLPIQPIHEEFGII-GQNFMW 423
Query: 544 GQLVVYDNVNKRIGWAKSHCMN 565
G +V+D N ++GW+ S+C +
Sbjct: 424 GYRMVFDRENLKLGWSTSNCQD 445
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 93/339 (27%), Positives = 136/339 (40%), Gaps = 47/339 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + +G+PP+ + +DTGSDL WIQC PCS C ++P+Y P +
Sbjct: 2 GAYTMEIELGSPPKKFNAIVDTGSDLVWIQCK-PCSQCYSQSDPIYDPSASSTFAKTSCS 60
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVVFGCA 318
Q G + + C Y +Y D SS+ G A + L L GS PN FGC
Sbjct: 61 TSSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCG 120
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGGYMFL 375
G + GI+GL + K+SL +QL S I N +CL ++ +
Sbjct: 121 RLNSG----SFGGAAGIVGLGQGKISLSTQLGSA--INNKFSYCLVDFDDDSSKTSPLIF 174
Query: 376 GHDLVPSWGMAWVPML-DSPFMELYHTEILKINYGSSPLNLGAR---------------- 418
G G P++ +S Y + I+ G L+L R
Sbjct: 175 GSSASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVR 234
Query: 419 --NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGL-VLDASDPTLPVCWRAKFPI 475
G +FD+G++ T YS++ ++ SS L +DAS +C+
Sbjct: 235 ALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAF--ASSVSLPTVDASSSGFDLCY------ 286
Query: 476 RSIVDVKQF-FKTLTLHFGSKWQIVSTKFHISPEGYLVI 513
+ K F F LTL F TKF + Y VI
Sbjct: 287 -DVSKSKNFKFPALTLAF------KGTKFSPPQKNYFVI 318
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 98/379 (25%), Positives = 152/379 (40%), Gaps = 49/379 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + +G PP DTGSDL W QC PC C K +PL+ P+ Y+D
Sbjct: 93 GEYLMSLSLGTPPFKIMGIADTGSDLIWTQCK-PCERCYKQVDPLFDPKSSKT--YRDFS 149
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVVFGCA 318
C Q + + C Y+ Y D S +MG +A D + L GS ++ P V GC
Sbjct: 150 CDARQCSLLDQSTCSGNICQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVIGCG 209
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC---LTTNAGGGGYMFL 375
++ G + K GI+GL +SL SQ+ S + +C L++ AG +
Sbjct: 210 HENDGTFSD---KGSGIVGLGAGPLSLISQMGSS--VGGKFSYCLVPLSSRAGNSSKLNF 264
Query: 376 GHDLVPSW-GMAWVPMLDSPFMELYHTEIL--------KINYGSSPLNLGARNSQVGWAL 426
G + V S G+ P+L S M ++ L +I +G S L G N +
Sbjct: 265 GSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGN-----II 319
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT--LPVCWRAKFPIRSIVDVKQF 484
D+G++ T +S L + V + A DP+ L VC+ A ++
Sbjct: 320 IDSGTTLTIVPDDFFSNLSTA---VGNQVEGRRAEDPSGFLSVCYSATSDLK-------- 368
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
+T HF + P V +CL + I G+++
Sbjct: 369 VPAITAHF------TGADVKLKPINTFVQVSDDVVCLAFASTTS----GISIYGNVAQMN 418
Query: 545 QLVVYDNVNKRIGWAKSHC 563
LV Y+ K + + + C
Sbjct: 419 FLVEYNIQGKSLSFKPTDC 437
>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
Length = 528
Score = 95.9 bits (237), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 104/382 (27%), Positives = 168/382 (43%), Gaps = 46/382 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL----PYK 256
L++T++ +G P + + +DTGS+L WI C+ C CA + Y L P
Sbjct: 99 LHYTWIDIGTPSVSFLVALDTGSNLLWIPCN--CVQCAPLTSTYYSSLATKDLNEYNPSS 156
Query: 257 DSLCMEIQRNHK----PGYCET-CQQCDYEIEY-ADHSSSMGVLARDELHLT------IE 304
S +HK CE+ +QC Y + Y + ++SS G+L D LHLT +
Sbjct: 157 SSTSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLM 216
Query: 305 NGSLT-KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
NGS + K VV GC Q G L+ V DG++GL A++S+PS L+ G+++N C
Sbjct: 217 NGSSSVKARVVIGCGKKQSGDYLDG-VAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCF 275
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARN-SQV 422
+ G ++ G D+ PS + +PF++L + + G +G Q
Sbjct: 276 --DEEDSGRIYFG-DMGPSIQQS------TPFLQLDNNKYSGYIVGVEACCIGNSCLKQT 326
Query: 423 GWALF-DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
+ F D+G S+TY ++ Y ++ + D + S V W ++ S +
Sbjct: 327 SFTTFIDSGQSFTYLPEEIYRKVALEI-----DRHINATSKNFEGVSW--EYCYESSAEP 379
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
K L + + I F LV CL I + GS +G
Sbjct: 380 KVPAIKLKFSHNNTFVIHKPLFVFQQSQGLV-----QFCLPISPSGQEGIGS---IGQNY 431
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+RG +V+D N ++GW+ S C
Sbjct: 432 MRGYRMVFDRENMKLGWSPSKC 453
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 122/496 (24%), Positives = 214/496 (43%), Gaps = 79/496 (15%)
Query: 93 IFALILYGSVFSYTLQDRYKSNNDDENKESFVFPLYHKFGIREVSQRDAEFKLGRFVDL- 151
+ + L SVFS +Q R+ SN FP++ G+R E +G +DL
Sbjct: 16 LITMTLLLSVFSL-IQCRHVSN----------FPVHEVVGVR----LQEEPLIGLRIDLV 60
Query: 152 DGESVVASVNDG----------IIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGL 201
+S ++ + G I+ + ++ K +S + V + ++ GN G
Sbjct: 61 RTDSPLSPFSPGNISSTERFKRAIKRSQDRLEKLQMSVDEVKAVEAPVY--AGN----GE 114
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
+ M +G P + +DTGSDLTW QC PC+ C P+Y P + +P S
Sbjct: 115 FLMKMAIGTPSLSFSAILDTGSDLTWTQCK-PCTDCYPQPTPIYDPSQSSTYSKVPCSSS 173
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
+C + P Y + C+Y Y D SS+ G+L+ + LT S + P++ FGC
Sbjct: 174 MCQAL-----PMYSCSGANCEYLYSYGDQSSTQGILSYESFTLT----SQSLPHIAFGCG 224
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TTNAGGGGYMFL 375
+ +G + +G R +SL SQL + N +CL T + +F+
Sbjct: 225 QENEGGGFSQGGGL---VGFGRGPLSLISQLGQS--LGNKFSYCLVSITDSPSKTSPLFI 279
Query: 376 GHDL-VPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNL--GARNSQV---GWALFD 428
G + + ++ P++ S Y+ + I+ G L++ G + Q+ G + D
Sbjct: 280 GKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVIID 339
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGL-VLDASDPTLPVCWRAKFPIRSIVDVKQFFKT 487
+G++ TY + Y + ++ +SS L +D S+ L +C+ + + F T
Sbjct: 340 SGTTVTYLEQSGYDVVKKAV--ISSINLPQVDGSNIGLDLCFEPQSGSST-----SHFPT 392
Query: 488 LTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLV 547
+T HF F++ E Y+ G CL +L NG + I G+I + +
Sbjct: 393 ITFHFE------GADFNLPKENYIYTDSSGIACLAMLP----SNGMS-IFGNIQQQNYQI 441
Query: 548 VYDNVNKRIGWAKSHC 563
+YDN + +A + C
Sbjct: 442 LYDNERNVLSFAPTVC 457
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 95/413 (23%), Positives = 174/413 (42%), Gaps = 44/413 (10%)
Query: 166 RPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDL 225
R + ++ ++ SS+AV++ SS G G YF ++VG P + + L DTGS+L
Sbjct: 60 RGGRQRVAAEVASSSAVSLPMSS-----GAYAGTGQYFVKVLVGTPAQEFTLVADTGSEL 114
Query: 226 TWIQCDAPCSSCAKGANP---LYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCD 279
TW++ CA GA+P +++P +P C + C
Sbjct: 115 TWVK-------CAGGASPPGLVFRPEASKSWAPVPCSSDTCKLDVPFSLANCSSSASPCS 167
Query: 280 YEIEYADHSS-SMGVLARDELHLTIENGSLTK-PNVVFGCAYDQQGLLLNTLVKTDGILG 337
Y+ Y + S+ ++GV+ D + + G + + +VV GC+ G + DG+L
Sbjct: 168 YDYRYKEGSAGALGVVGTDSATIALPGGKVAQLQDVVLGCSSTHDG---QSFKSVDGVLS 224
Query: 338 LSRAKVSLPSQLASQ---GIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSP 394
L AK+S S+ A++ +V H NA GY+ G VP + P
Sbjct: 225 LGNAKISFASRAAARFGGSFSYCLVDHLAPRNA--TGYLAFGPGQVPRTPATQTKLFLDP 282
Query: 395 FMELYHTEILKINYGSSPLNLGAR--NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVS 452
M Y ++ ++ L++ A + + G + D+G++ T AY ++A+L ++
Sbjct: 283 AMPFYGVKVDAVHVAGQALDIPAEVWDPKSGGVILDSGTTLTVLATPAYKAVVAALTKLL 342
Query: 453 SDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV 512
+ V P C+ P ++ + L + F + + Y++
Sbjct: 343 AG--VPKVDFPPFEHCYNWTAPRPGAPEIPK----LAVQFTG-----CARLEPPAKSYVI 391
Query: 513 ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
K G C+G+ +G G ++I G+I + L +D N + + S C
Sbjct: 392 DVKPGVKCIGLQEGE--WPGVSVI-GNIMQQEHLWEFDLKNMEVRFMPSTCTR 441
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 95.5 bits (236), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 108/390 (27%), Positives = 171/390 (43%), Gaps = 52/390 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y+T + +G+P + L +DTGS+LTW+QC PC CA + +Y Y+
Sbjct: 98 GEYYTSIKLGSPGQEAILIVDTGSELTWLQC-LPCKVCAPSVDTIYD--AARSASYRPVT 154
Query: 260 CMEIQ--RNHKPG---YCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGS-LTKPN 312
C Q N G YC QC + Y D S S G L+ D L + T+ G +T +
Sbjct: 155 CNNSQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQD 214
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TTNAGG 369
FGCA QG L GILGL+ K++LP QL + K HC +++
Sbjct: 215 FAFGCA---QGDLELVPTGASGILGLNAGKMALPMQLGQRFGWK--FSHCFPDRSSHLNS 269
Query: 370 GGYMFLGHDLVPSWGMAW--VPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVGWAL 426
G +F G+ +P + + V + +S + YH + ++ S L R S V +
Sbjct: 270 TGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVFLPRGSVV---I 326
Query: 427 FDTGSSYTYFTKQAYSELIA--------SLKEVSSDGLVLDASDPTLPVCWRAKFPIRSI 478
D+GSS++ F + +S+L SLK + D S L C+ K I
Sbjct: 327 LDSGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGD------SFGDLGTCF--KVSNDDI 378
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGN---ICLGILDGSEVHNGST 534
++ + +L+L F I I G L+ +++ N +C DG
Sbjct: 379 DELHRTLPSLSLVFEDGVTI-----GIPSIGVLLPVARFQNHVKMCFAFEDGGP---NPV 430
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHCM 564
++G+ + V YD R+G+A++ C+
Sbjct: 431 NVIGNYQQQNLWVEYDIQRSRVGFARASCV 460
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 95.5 bits (236), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 107/418 (25%), Positives = 179/418 (42%), Gaps = 46/418 (11%)
Query: 160 VNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGL-YFTYMIVGNPPRPYYLD 218
++D +R +S+I +N A+DS PL + L Y + +G R +
Sbjct: 26 LDDFRVRSLQSRIKSIFSGNNIDALDSQ--IPLSSGVRLQTLNYIVTVEIGG--RNMTVI 81
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDSLCMEIQRNHKPGYCETC 275
+DTGSDLTW+QC PC C +PL+ P + S C +Q + G C
Sbjct: 82 VDTGSDLTWVQCQ-PCRLCYNQQDPLFNPSGSPSYQTILCNSSTCQSLQ--YATGNLGVC 138
Query: 276 QQ----CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVK 331
C+Y + Y D S + G L ++L+L G+ N +FGC + +GL
Sbjct: 139 GSNTPTCNYVVNYGDGSYTRGDLGMEQLNL----GTTHVSNFIFGCGRNNKGL----FGG 190
Query: 332 TDGILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHD---LVPSWGMAW 387
G++GL ++ +SL SQ ++ I + V +CL TT A G + LG + + +++
Sbjct: 191 ASGLMGLGKSDLSLVSQTSA--IFEGVFSYCLPTTAADASGSLILGGNSSVYKNTTPISY 248
Query: 388 VPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIA 446
M+ +P + Y + I+ G L A N + L D+G+ T Y +L A
Sbjct: 249 TRMIANPQLPTFYFLNLTGISIGGVALQ--APNYRQSGILIDSGTVITRLPPPVYRDLKA 306
Query: 447 S-LKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHI 505
LK+ S G L C F + +V T+ + F ++ +
Sbjct: 307 EFLKQFS--GFPSAPPFSILDTC----FNLNGYDEVD--IPTIRMQFEGNAELT---VDV 355
Query: 506 SPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ Y V + +CL + S + I+G+ R Q V+Y+ ++G+A C
Sbjct: 356 TGIFYFVKTDASQVCLAL--ASLSFDDEIPIIGNYQQRNQRVIYNTKESKLGFAAEAC 411
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 95.5 bits (236), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 97/379 (25%), Positives = 154/379 (40%), Gaps = 38/379 (10%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC-AKGANPLYKPRMGNI---LPYKD 257
Y ++ VG PPRP L +DTGSDL W QC APC C +GA P+ P + LP
Sbjct: 90 YLMHVSVGTPPRPVALTLDTGSDLVWTQC-APCLDCFEQGAAPVLDPAASSTHAALPCDA 148
Query: 258 SLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN--GSLTKPNVVF 315
LC + G + C Y Y D S ++G LA D ++ G L V F
Sbjct: 149 PLCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAARRVTF 208
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI------IKNVVGHCLTTNAGG 369
GC + +G+ GI G R + SLPSQL + + + T
Sbjct: 209 GCGHINKGIF---QANETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFDTKSSSVVTLGAA 265
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
+ H + + ++ +P LY + I+ G + + + + + D
Sbjct: 266 AAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRLRSS-TIID 324
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGL-VLDASDPTLPVCWRAKFPIRSIVDVKQFFKT 487
+G+S T + Y + A VS GL A L +C+ P+ ++ +
Sbjct: 325 SGASITTLPEDVYEAVKAEF--VSQVGLPAAAAGSAALDLCF--ALPVAALWR-RPAVPA 379
Query: 488 LTLHF--GSKWQIVSTKFHISPEG-YLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
LTLH G+ W++ P G Y+ + +LD + G +++G+ +
Sbjct: 380 LTLHLDGGADWEL--------PRGNYVFEDYAARVLCVVLDAAA---GEQVVIGNYQQQN 428
Query: 545 QLVVYDNVNKRIGWAKSHC 563
VVYD N + +A + C
Sbjct: 429 THVVYDLENDVLSFAPARC 447
>gi|6562286|emb|CAB62656.1| putative protein [Arabidopsis thaliana]
Length = 518
Score = 95.5 bits (236), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 100/396 (25%), Positives = 159/396 (40%), Gaps = 69/396 (17%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDA--PCSSCAKGAN-------PLYKPRMGN 251
L++ + +G P + + +DTGSDL W+ C+ C K A LY P N
Sbjct: 90 LHYANVSLGTPATWFLVALDTGSDLFWLPCNCGTTCIHDLKDARFSESVPLNLYTP---N 146
Query: 252 ILPYKDSLCMEIQRNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSL-- 308
S+ +R G C + + C Y+I + ++ + G L +D LHL E+ L
Sbjct: 147 ASTTSSSIRCSDKRCFGSGKCSSPESICPYQIALSSNTVTTGTLLQDVLHLVTEDEDLKP 206
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
NV GC +Q G T + +G+LGLS + S+PS LA I N C
Sbjct: 207 VNANVTLGCGQNQTG-AFQTDIAVNGVLGLSMKEYSVPSLLAKANITANSFSMCFGRIIS 265
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMEL-----YHTEILKINYGSSPLNLGARNSQVG 423
G + G + ++P + L Y + ++ G P+++
Sbjct: 266 VVGRISFGDK-------GYTDQEETPLVSLETSTAYGVNVTGVSVGGVPVDVPL------ 312
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
+ALFDTGSS+T + AY + D L+ D P P FP D+++
Sbjct: 313 FALFDTGSSFTLLLESAYGVFTKAF-----DDLMEDKRRPVDP-----DFPFEFCYDLRE 362
Query: 484 FF---KTLTLHFGSK----------WQIVSTKFHISPEGYLVISKKGN--ICLGILDGSE 528
H SK W+I + + + S +G CLGIL
Sbjct: 363 EHLNSDARPRHMQSKCYNPCRDDFRWRIQN-----DSQESVSYSNEGTKMYCLGILKSIN 417
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCM 564
++ I+G + G +V+D +GW +S+C
Sbjct: 418 LN-----IIGQNLMSGHRIVFDRERMILGWKQSNCF 448
>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
Length = 455
Score = 95.5 bits (236), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 95/352 (26%), Positives = 153/352 (43%), Gaps = 44/352 (12%)
Query: 119 NKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVN-----DGIIRPHKSKIN 173
N F F ++H+F EV Q GRF + N D +IR + +
Sbjct: 25 NGRIFTFEMHHRFS-DEVKQWSDS--TGRFAKFPPKGSFEYFNALVLRDWLIRGRRLSES 81
Query: 174 KKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP 233
+ S+ D +S + + L++T + +G P + + +DTGSDL W+ CD
Sbjct: 82 ESESESSLTFSDGNSTSRISSLGF---LHYTTVKLGTPGMRFMVALDTGSDLFWVPCD-- 136
Query: 234 CSSCA--KGAN-------PLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYE 281
C CA +GA +Y P++ + +SLC QRN G T C Y
Sbjct: 137 CGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCA--QRNQCLG---TFSTCPYM 191
Query: 282 IEYAD-HSSSMGVLARDELHLTIE--NGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGL 338
+ Y +S+ G+L D +HLT E N + V FGC Q G L+ + +G+ GL
Sbjct: 192 VSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLD-IAAPNGLFGL 250
Query: 339 SRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMEL 398
K+S+PS LA +G++ + C G G + G S P +P
Sbjct: 251 GMEKISVPSVLAREGLVADSFSMCF--GHDGVGRISFGDK--GSSDQEETPFNLNPSHPN 306
Query: 399 YHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKE 450
Y+ + ++ G++ ++ ALFDTG+S+TY Y+ + S ++
Sbjct: 307 YNITVTRVRVGTTLID------DEFTALFDTGTSFTYLVDPMYTTVSESAQD 352
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 95.5 bits (236), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 106/404 (26%), Positives = 170/404 (42%), Gaps = 52/404 (12%)
Query: 179 SNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS-- 236
S+AV +SI G+ Y + +G P + +DTGSDL+W+QC PC +
Sbjct: 95 SDAVGGGGTSIPTFLGDSVDSLEYVVTLGIGTPAVQQIVLIDTGSDLSWVQCK-PCGAGE 153
Query: 237 CAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCET--CQQCDYEIEYADHSSSM 291
C +PL+ P + +P C ++ C + C+Y IEY + +++
Sbjct: 154 CYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTT 213
Query: 292 GVLARDELHLTIENGSLTKPNVV-----FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLP 346
GV + + L L KP VV FGC Q G K DG+LGL A SL
Sbjct: 214 GVYSTETLTL--------KPGVVVADFGFGCGDHQHG----PYEKFDGLLGLGGAPESLV 261
Query: 347 SQLASQGIIKNVVGHCLTTNAGGGGYMFLGH-----DLVPSWGMAWVPMLDSPFMELYHT 401
SQ +SQ +CL +GG G++ LG + G + PM P + ++
Sbjct: 262 SQTSSQ--FGGPFSYCLPPTSGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYV 319
Query: 402 EILK-INYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDA 460
L I+ G +PL + G + D+G+ T AY+ L ++ + S+ +L
Sbjct: 320 VTLTGISVGGAPLAVPPSAFSSGM-VIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPP 378
Query: 461 SD-PTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI 519
S+ L C+ F + V V LT G+ + +P G LV +
Sbjct: 379 SNGAVLDTCY--DFTGHTNVTVPTI--ALTFSGGATIDLA------TPAGVLV-----DG 423
Query: 520 CLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
CL + + I+G+++ R V+YD+ +G+ C
Sbjct: 424 CLAFAGAGT--DDTIGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 465
>gi|186510920|ref|NP_190702.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645260|gb|AEE78781.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 530
Score = 95.5 bits (236), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 100/396 (25%), Positives = 159/396 (40%), Gaps = 69/396 (17%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDA--PCSSCAKGAN-------PLYKPRMGN 251
L++ + +G P + + +DTGSDL W+ C+ C K A LY P N
Sbjct: 102 LHYANVSLGTPATWFLVALDTGSDLFWLPCNCGTTCIHDLKDARFSESVPLNLYTP---N 158
Query: 252 ILPYKDSLCMEIQRNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSL-- 308
S+ +R G C + + C Y+I + ++ + G L +D LHL E+ L
Sbjct: 159 ASTTSSSIRCSDKRCFGSGKCSSPESICPYQIALSSNTVTTGTLLQDVLHLVTEDEDLKP 218
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
NV GC +Q G T + +G+LGLS + S+PS LA I N C
Sbjct: 219 VNANVTLGCGQNQTG-AFQTDIAVNGVLGLSMKEYSVPSLLAKANITANSFSMCFGRIIS 277
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMEL-----YHTEILKINYGSSPLNLGARNSQVG 423
G + G + ++P + L Y + ++ G P+++
Sbjct: 278 VVGRISFGDK-------GYTDQEETPLVSLETSTAYGVNVTGVSVGGVPVDVPL------ 324
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
+ALFDTGSS+T + AY + D L+ D P P FP D+++
Sbjct: 325 FALFDTGSSFTLLLESAYGVFTKAF-----DDLMEDKRRPVDP-----DFPFEFCYDLRE 374
Query: 484 FF---KTLTLHFGSK----------WQIVSTKFHISPEGYLVISKKGN--ICLGILDGSE 528
H SK W+I + + + S +G CLGIL
Sbjct: 375 EHLNSDARPRHMQSKCYNPCRDDFRWRIQN-----DSQESVSYSNEGTKMYCLGILKSIN 429
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCM 564
++ I+G + G +V+D +GW +S+C
Sbjct: 430 LN-----IIGQNLMSGHRIVFDRERMILGWKQSNCF 460
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 97/408 (23%), Positives = 167/408 (40%), Gaps = 46/408 (11%)
Query: 170 SKINKKLVSSNAVAVDSSSIFPLR-GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWI 228
SKI +L S + + ++ P + G G Y + +G P + L DTGSDLTW
Sbjct: 98 SKIAGELESVDRLRGSKATKIPAKSGATIGSGNYIVSVGLGTPKKYLSLIFDTGSDLTWT 157
Query: 229 QCDAPCSS-CAKGANPLYKPRMGNILPYKDSLC-------MEIQRNHKPGYCETCQQCDY 280
QC PC+ C +P++ P Y + C +E ++PG C + C Y
Sbjct: 158 QCQ-PCARYCYNQKDPVFVPSQSTT--YSNISCSSPDCSQLESGTGNQPG-CSAARACIY 213
Query: 281 EIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSR 340
I+Y D S S+G A++ L LT + N +FGC + +GL + G++GL +
Sbjct: 214 GIQYGDQSFSVGYFAKETLTLTSTD---VIENFLFGCGQNNRGLFGSAA----GLIGLGQ 266
Query: 341 AKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDS-PFMELY 399
K+S+ Q A + V +CL + GY+ + + P+ + Y
Sbjct: 267 DKISIVKQTAQK--YGQVFSYCLPKTSSSTGYLTF-GGGGGGGALKYTPITKAHGVANFY 323
Query: 400 HTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLD 459
+I+ + G + + + + A+ D+G+ T AYS +LK G+
Sbjct: 324 GVDIVGMKVGGTQIPISSSVFSTSGAIIDSGTVITRLPPDAYS----ALKSAFEKGMAKY 379
Query: 460 ASDPTLPV---CWR-AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISK 515
P L + C+ +K+ I V FK + + G + +
Sbjct: 380 PKAPELSILDTCYDLSKYSTIQIPKVGFVFKG------------GEELDLDGIGIMYGAS 427
Query: 516 KGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+CL + + I+G++ + VVYD +IG+ + C
Sbjct: 428 TSQVCLAFAGNQDPS--TVAIIGNVQQKTLQVVYDVGGGKIGFGYNGC 473
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 99/399 (24%), Positives = 175/399 (43%), Gaps = 57/399 (14%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCA---KGANPLYKPRMGNILPY 255
G YF + +G+PP+ L DTGSDLTW++C A ++C+ G+ L + P
Sbjct: 80 SGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLAR-HSTTFSPT 138
Query: 256 K--DSLCMEI-QRNHKP-GYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK- 310
SLC + Q N P + C YE Y+D S + G +++ L +G K
Sbjct: 139 HCFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKL 198
Query: 311 PNVVFGCAYDQQG--LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TT 365
++ FGC + G L+ ++ G++GL R +S SQL + +CL T
Sbjct: 199 KSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRR--FGRSFSYCLLDYTL 256
Query: 366 NAGGGGYMFLGHDLVPSWG-----MAWVPMLDSPFM-ELYHTEI-------LKINYGSSP 412
+ Y+ +G D+V + M++ P+L +P Y+ I +K++ S
Sbjct: 257 SPPPTSYLMIG-DVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDPSV 315
Query: 413 LNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
+L + G + D+G++ T+ T+ AY E++++ K + LP
Sbjct: 316 WSLDELGN--GGTVIDSGTTLTFLTEPAYREILSAFKR-----------EVKLPSPTPGG 362
Query: 473 FPIRSIVDV--------KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGIL 524
RS D+ + F L+L G + + + P Y + +G CL I
Sbjct: 363 ASTRSGFDLCVNVTGVSRPRFPRLSLELGGE-----SLYSPPPRNYFIDISEGIKCLAI- 416
Query: 525 DGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
E +G ++G++ +G L+ +D R+G+++ C
Sbjct: 417 QPVEAESGRFSVIGNLMQQGFLLEFDRGKSRLGFSRRGC 455
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 91/382 (23%), Positives = 151/382 (39%), Gaps = 40/382 (10%)
Query: 193 RGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI 252
R I G Y +G PP DT SDL W+QC +PC +C PL++
Sbjct: 81 RVRIPNHGEYLMRFYIGTPPVERLAIADTASDLIWVQC-SPCETCFPQDTPLFE------ 133
Query: 253 LPYKDSLCMEIQRNHKPG------YCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIEN 305
P+K S + + +P YC C Y Y D SS+ GVL + +H +
Sbjct: 134 -PHKSSTFANLSCDSQPCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTESIHFGSQ- 191
Query: 306 GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT- 364
++T P +FGC + + K GI+GL +SL SQL Q I + +CL
Sbjct: 192 -TVTFPKTIFGCGSNND-FMHQISNKVTGIVGLGAGPLSLVSQLGDQ--IGHKFSYCLLP 247
Query: 365 -TNAGGGGYMFLGHDLVPSWGMAWVPMLDSP-FMELYHTEILKINYGSSPLNLGARNSQV 422
T+ F + G+ P++ P + Y ++ I G L + +
Sbjct: 248 FTSTSTIKLKFGNDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTN 307
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
G + D G+ TY Y + L+E L + + +P + FP ++ +
Sbjct: 308 GNIIIDLGTVLTYLEVNFYHNFVTLLRE----ALGISETKDDIPYPFDFCFPNQANITFP 363
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEG-YLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
+ +Q K +SP+ + ICL +L + + + G+++
Sbjct: 364 KIV----------FQFTGAKVFLSPKNLFFRFDDLNMICLAVL--PDFYAKGFSVFGNLA 411
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
V YD K++ +A + C
Sbjct: 412 QVDFQVEYDRKGKKVSFAPADC 433
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 98/346 (28%), Positives = 143/346 (41%), Gaps = 48/346 (13%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
DG Y M +G P R Y +DTGSDL W QC APC C P + P Y+
Sbjct: 87 DGEYLMEMGIGTPTRYYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDPARSAT--YRSL 143
Query: 259 LCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
C N Y C Q C Y+ Y D +S+ GVLA + ++ P + FG
Sbjct: 144 GCASPACNAL--YYPLCYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFG 201
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG------- 369
C GLL N G++G R +SL SQL S +CLT+
Sbjct: 202 CGNLNAGLLAN----GSGMVGFGRGSLSLVSQLGSPRF-----SYCLTSFLSPVPSRLYF 252
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGS-----SPLNLGARNSQ-V 422
G Y L S + P + +P + +Y + I+ G P ++
Sbjct: 253 GVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGT 312
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGL--VLDASDPTLPVCWRAKFPIRSIVD 480
G + D+G++ TY + AY + A+ + L V DAS L C++ P R V
Sbjct: 313 GGTIIDSGTTITYLAEPAYDAVRAAFASQITLPLLNVTDAS--VLDTCFQWPPPPRQSVT 370
Query: 481 VKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVI--SKKGNICLGI 523
+ Q L LHF G+ W+ + + Y+++ S G +CL +
Sbjct: 371 LPQ----LVLHFDGADWE-------LPLQNYMLVDPSTGGGLCLAM 405
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 95/387 (24%), Positives = 165/387 (42%), Gaps = 71/387 (18%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF+ + +G+PP+ Y+ +DTGSD+ W+QC APC+ C + A+P+++P +
Sbjct: 153 GEYFSRVGIGSPPKHVYMVVDTGSDVNWVQC-APCADCYQQADPIFEPSFSS-------- 203
Query: 260 CMEIQRNHKPGYCETCQ------------QCDYEIEYADHSSSMGVLARDELHLTIENGS 307
++ P CET Q C YE+ Y D S ++G A + + L +GS
Sbjct: 204 ------SYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITL---DGS 254
Query: 308 LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
+ NV GC +D +GL V G+LGL +S PSQ+ + +CL
Sbjct: 255 ASLNNVAIGCGHDNEGL----FVGAAGLLGLGGGSLSFPSQINASSF-----SYCLVNRD 305
Query: 368 GGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV----- 422
+ +PS + + ++ Y+ + I G L++ + +V
Sbjct: 306 TDSASTLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGN 365
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEV-----SSDGLVLDASDPTLPVCWRAKFPIRS 477
G + D+G++ T Y+ L S S+ G+ L C+ RS
Sbjct: 366 GGIIVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVAL------FDTCY--DLSSRS 417
Query: 478 IVDVKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
V+V T++ HF K+ + K ++ P + G C + + I
Sbjct: 418 SVEV----PTVSFHFPDGKYLALPAKNYLIP-----VDSAGTFCFAFAPTTSALS----I 464
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G++ +G V YD N +G++ + C
Sbjct: 465 IGNVQQQGTRVSYDLSNSLVGFSPNGC 491
>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Brachypodium distachyon]
Length = 509
Score = 95.1 bits (235), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 106/395 (26%), Positives = 166/395 (42%), Gaps = 66/395 (16%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN--PLYKPRMGNILPYKDS 258
L++ + +G P + + +DTGSDL W+ CD C CA AN L KP P + S
Sbjct: 82 LHYAKVALGTPNATFVVALDTGSDLFWVPCD--CKRCAPIANTSELLKP----YSPRQSS 135
Query: 259 LCMEIQRNH----KPGYCETCQ-QCDYEIEY-ADHSSSMGVLARDELHLTIEN------- 305
+ +H +P C C Y ++Y + ++SS GVL D L++T ++
Sbjct: 136 TSKPVTCSHSLCDRPNACGNGNGSCPYTVKYVSANTSSSGVLVEDVLYMTRQSSSSRSGN 195
Query: 306 ----GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII-KNVVG 360
G VVFGC +Q G L+ +G+LGL +VS+PS LA+ G++ +
Sbjct: 196 GGNVGEAVGARVVFGCGQEQTGAFLDG-AAMEGLLGLGMDRVSVPSLLAAAGLVGSDSFS 254
Query: 361 HCLTTNAGGGGYMFLGHDLVPSWGMAW--VPMLDSPFMELYHTEILKINYGSSPLNLGAR 418
C + + G G + G PS A P + S Y+ + +N GA
Sbjct: 255 MCFSPD--GNGRINFGE---PSDAGAQNETPFIVSKTRPTYNISVTAVNVKGK----GAM 305
Query: 419 NSQVGWALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRS 477
++ A+ D+G+S+TY AYS L S +V L AS P C+ R
Sbjct: 306 AAEFA-AVVDSGTSFTYLNDPAYSLLATSFNSQVREKRANLSASIP-FEYCYALS---RG 360
Query: 478 IVDVKQFFKTLTLHFGSKWQIV---------STKFHISPEGYLVISKKGNICLGILDGSE 528
+V +LT G+ + + +T + GY + K +I +
Sbjct: 361 QTEVLMPEVSLTTRGGAVFPVTRPFVIVAGETTDGQVHAVGYCLAVFKSDIPID------ 414
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G + G VV+D +GW K C
Sbjct: 415 -------IIGQNFMTGLKVVFDRQRSVLGWTKFDC 442
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 95.1 bits (235), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 100/401 (24%), Positives = 163/401 (40%), Gaps = 38/401 (9%)
Query: 172 INKKLVSSNAVAVDSSSIFPLR-GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQC 230
I+ +L S ++ P++ G G Y + +G P + + L DTGSD+TW QC
Sbjct: 40 IHARLSSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQC 99
Query: 231 DAPCSSCAKGANPLYKPRMGNILPYKD-----SLCMEIQRNHKPGYCETCQQCDYEIEYA 285
+ +C K P P YK+ +LC + K + C Y+++Y
Sbjct: 100 EPCVKTCYKQKEPRLNPSTST--SYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYG 157
Query: 286 DHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSL 345
D S S+G A + L L+ N N +FGC GL L R K++L
Sbjct: 158 DGSYSIGFFATETLTLSSSN---VFKNFLFGCGQQNNGLFGGAAGLLG----LGRTKLAL 210
Query: 346 PSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM-LDSPFMELYHTEIL 404
PSQ A K + +CL ++ GY+ LG + S + + P+ D Y +I
Sbjct: 211 PSQTAK--TYKKLFSYCLPASSSSKGYLSLGGQVSKS--VKFTPLSADFDSTPFYGLDIT 266
Query: 405 KINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT 464
++ G L++ G + D+G+ T + AYSEL ++ + + +D +
Sbjct: 267 GLSVGGRQLSIDESAFSAG-TVIDSGTVITRLSPTAYSELSSAFQNLMTD-YPSTSGYSI 324
Query: 465 LPVCWR-AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLG 522
C+ +K+ I V FK + I G L ++ +CL
Sbjct: 325 FDTCYDFSKYDTVRIPKVGVTFKG------------GVEMDIDVSGILYPVNGLKKVCLA 372
Query: 523 ILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + T I G++ R VVYD R+G+A C
Sbjct: 373 FAGNDD--DSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 411
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 95.1 bits (235), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 94/372 (25%), Positives = 158/372 (42%), Gaps = 39/372 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF+ + VG P + YL +DTGSD+ WIQC+ PCS C + ++P++ P + YK
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCE-PCSDCYQQSDPVFNPTSSST--YKSLT 216
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPN-VVFGCA 318
C Q + +C Y++ Y D S ++G LA D T+ G+ K N V GC
Sbjct: 217 CSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATD----TVTFGNSGKINDVALGCG 272
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
+D +GL GL +S+ +Q+ + +CL G +
Sbjct: 273 HDNEGLFTGAAGLL----GLGGGALSITNQMKATSF-----SYCLVDRDSGKSSSLDFNS 323
Query: 379 LVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNL-----GARNSQVGWALFDTGSS 432
+ G A P+L + ++ Y+ + + G + + S G + D G++
Sbjct: 324 VQLGSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVILDCGTA 383
Query: 433 YTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF 492
T QAY+ L + +++++ +S C+ S+ VK T+ HF
Sbjct: 384 VTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCY----DFSSLSSVK--VPTVAFHF 437
Query: 493 -GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
G K + K ++ P + G C S S I+G++ +G + YD
Sbjct: 438 TGGKSLDLPAKNYLIP-----VDDNGTFCFAFAPTSS----SLSIIGNVQQQGTRITYDL 488
Query: 552 VNKRIGWAKSHC 563
NK IG + + C
Sbjct: 489 ANKIIGLSGNKC 500
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 95.1 bits (235), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 97/381 (25%), Positives = 163/381 (42%), Gaps = 45/381 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + +G P R + + +DTGSDLTW+QC +PC C + L+ P +
Sbjct: 11 GEYLATVRLGTPERVFSVIVDTGSDLTWVQC-SPCGKCYSQNDALFLPNTST--SFTKLA 67
Query: 260 CMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVVFG 316
C N P C Q C Y Y D S + G D + + NG + PN FG
Sbjct: 68 CGSALCNGLP--FPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPNFAFG 125
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGGYM 373
C +D +G + DGILGL + +S SQL S + +CL +
Sbjct: 126 CGHDNEG----SFAGADGILGLGQGPLSFHSQLKS--VYNGKFSYCLVDWLAPPTQTSPL 179
Query: 374 FLGHDLVPSW-GMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARN---SQVGWA--L 426
G VP + ++P+L +P + Y+ ++ I+ G + LN+ + VG A +
Sbjct: 180 LFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGTI 239
Query: 427 FDTGSSYTYFTKQAYSELIASLKE--VSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
FD+G++ T + AY E++A++ ++ + D S L +C FP + V
Sbjct: 240 FDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDIS--RLDLCLSG-FPKDQLPTV--- 293
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLR 543
+T HF + + P Y + + + C + +V+ I+G + +
Sbjct: 294 -PAMTFHFEGGDMV------LPPSNYFIYLESSQSYCFAMTSSPDVN-----IIGSVQQQ 341
Query: 544 GQLVVYDNVNKRIGWAKSHCM 564
V YD +++G+ C+
Sbjct: 342 NFQVYYDTAGRKLGFVPKDCV 362
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 95.1 bits (235), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 149/362 (41%), Gaps = 33/362 (9%)
Query: 209 GNPPRPYYLDMDTGSDLTWIQCDAPCSS-CAKGANPLYKPRMGNILPYKDSLCMEIQRNH 267
G P + Y L DTGSD++WIQC PCS C K +P++ P Y C Q
Sbjct: 127 GTPAQTYTLMFDTGSDVSWIQC-LPCSGHCYKQHDPIFDPTKSAT--YSAVPCGHPQCAA 183
Query: 268 KPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLN 327
G C + C Y+++Y D SS+ GVL+ + L LT + P FGC L
Sbjct: 184 AGGKCSSNGTCLYKVQYGDGSSTAGVLSHETLSLT---SARALPGFAFGCGETN----LG 236
Query: 328 TLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSW--GM 385
DG++GL R ++SL + +CL + GY+ +G S G+
Sbjct: 237 DFGDVDGLIGLGRGQLSL--SSQAAASFGAAFSYCLPSYNTSHGYLTIGTTTPASGSDGV 294
Query: 386 AWVPMLDSP-FMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSEL 444
+ M+ + Y +++ I G L + L D+G+ TY +AY+ L
Sbjct: 295 RYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDGTLLDSGTVLTYLPPEAYTAL 354
Query: 445 IASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFH 504
K + A DP C+ F ++ + F ++ F + F
Sbjct: 355 RDRFKFTMTQYKPAPAYDP-FDTCY--DFAGQNAI----FMPLVSFKFSD-----GSSFD 402
Query: 505 ISPEGYLVISKKGNICLGILDGSEVHNGSTI---ILGDISLRGQLVVYDNVNKRIGWAKS 561
+SP G L+ G L + V ST+ I+G+ R ++YD ++IG+
Sbjct: 403 LSPFGVLIFPDDTAPATGCL--AFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSG 460
Query: 562 HC 563
C
Sbjct: 461 SC 462
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 95.1 bits (235), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 104/388 (26%), Positives = 162/388 (41%), Gaps = 50/388 (12%)
Query: 195 NIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILP 254
N P Y ++ +G PP+P L +DTGSDL W QC PC +C A P + P + L
Sbjct: 75 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLS 133
Query: 255 Y---KDSLC--MEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLT 309
+LC + + P + Q C Y Y D S + G L D+ S+
Sbjct: 134 LTSCDSTLCQGLPVASCGSPKFWPN-QTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASV- 191
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTD--GILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
P V FGC GL N + K++ GI G R +SLPSQL + N HC T
Sbjct: 192 -PGVAFGC-----GLFNNGVFKSNETGIAGFGRGPLSLPSQLK----VGN-FSHCFTAVN 240
Query: 368 G---GGGYMFLGHDLVPS--WGMAWVPMLDSPFM-ELYHTEILKINYGSS----PLNLGA 417
G + L DL S + P++ +P Y+ + I GS+ P +
Sbjct: 241 GLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFT 300
Query: 418 RNSQVGWALFDTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTLPVCWRAKFPIR 476
+ G + D+G++ T + Y + A +V + + +DP C A P+R
Sbjct: 301 LKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--CLSA--PLR 356
Query: 477 SIVDVKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI 535
+ K + L LHF G+ + + E + +CL I++G EV
Sbjct: 357 A----KPYVPKLVLHFEGATMDLPRENYVFEVED----AGSSILCLAIIEGGEVTT---- 404
Query: 536 ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G+ + V+YD N ++ + + C
Sbjct: 405 -IGNFQQQNMHVLYDLQNSKLSFVPAQC 431
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 95.1 bits (235), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 88/375 (23%), Positives = 165/375 (44%), Gaps = 52/375 (13%)
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETC 275
+DTGS+ +QC + + P++ P +P LC+ +Q+ G + C
Sbjct: 16 IDTGSEAVLVQCGSR-------SRPVFDPAASQSYRQVPCISQLCLAVQQQTSNGSSQPC 68
Query: 276 ----QQCDYEIEYADHSSSMGVLARDELHLTIENGS---LTKPNVVFGCAYDQQGLLLNT 328
C Y + Y D +S G ++D + L N S + +V FGCA+ QG L++
Sbjct: 69 VNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVAFGCAHSPQGFLVD- 127
Query: 329 LVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT---NAGGGGYMFLGHDLVPSWGM 385
+ + GI+G +R +SLPSQL + + + +C + G +FLG + +
Sbjct: 128 -LGSLGIVGFNRGNLSLPSQLKDR-LGGSKFSYCFPSQPWQPRATGVIFLGDSGLSKSKV 185
Query: 386 AWVPMLDSPFM----ELYHTEILKINYGSSPLNLGARNSQV------GWALFDTGSSYTY 435
++ P+LD+P +LY+ + I+ L + ++ G + D+G+++T
Sbjct: 186 SYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTR 245
Query: 436 FTKQAYSELIASLKEVSSDGLVLD-ASDPTLPVCWR--AKFPIRSIVDVKQFFKTLTLHF 492
AY+ + + GL + C+ A + + +V+ L+L
Sbjct: 246 VVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEVR-----LSLQN 300
Query: 493 GSKWQIVSTKF-HISPEGYLVISKKGN---ICLGILDGSEVHNGSTIILGDISLRGQLVV 548
+ ++ +F H+ ++ +S GN +CL IL + G +LG+ LV
Sbjct: 301 NVRLEL---RFEHL----FVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVE 353
Query: 549 YDNVNKRIGWAKSHC 563
YDN R+G+ ++ C
Sbjct: 354 YDNERSRVGFERADC 368
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 103/384 (26%), Positives = 163/384 (42%), Gaps = 55/384 (14%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS--CAKGANPLYKPRMGNI---LPYK 256
Y + +G P + +DTGSDL+W+QC PC + C +PL+ P + +P
Sbjct: 91 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCGAGECYAQKDPLFDPSSSSSYASVPCD 149
Query: 257 DSLCMEIQRNHKPGYCE-----TCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP 311
C ++ C C+Y IEY + +++ GV + + L L KP
Sbjct: 150 SDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTL--------KP 201
Query: 312 NVV-----FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
VV FGC Q G K DG+LGL A SL SQ +SQ +CL
Sbjct: 202 GVVVADFGFGCGDHQHG----PYEKFDGLLGLGGAPESLVSQTSSQ--FGGPFSYCLPPT 255
Query: 367 AGGGGYMFLG-----HDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNS 420
+GG G++ LG + G+++ PM P + Y + I+ G +PL +
Sbjct: 256 SGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAF 315
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASD-PTLPVCWRAKFPIRSIV 479
G + D+G+ T AY+ L ++ + S+ +L S+ L C+ F + V
Sbjct: 316 SSGM-VIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCY--DFTGHANV 372
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
V +LT G+ + +P G LV + CL G+ N II G+
Sbjct: 373 TVPTI--SLTFSGGATIDLA------APAGVLV-----DGCL-AFAGAGTDNAIGII-GN 417
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
++ R V+YD+ +G+ C
Sbjct: 418 VNQRTFEVLYDSGKGTVGFRAGAC 441
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 99/382 (25%), Positives = 151/382 (39%), Gaps = 63/382 (16%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G YF + VG+PP YL +D+GSD+ W+QC PC C +PL+ P + +
Sbjct: 128 GEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSCG 186
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
++C + +CDY + Y D S + G LA + L L G V G
Sbjct: 187 SAICRTLSGTGCG-GGGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIG 241
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT-NAGGGGYMFL 375
C + GL V G+LGL +SL QL G V +CL + AGG G + L
Sbjct: 242 CGHRNSGL----FVGAAGLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAGGAGSLVL 295
Query: 376 GH-DLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV-----GWALFDT 429
G + VP A Y+ + I G L L Q+ G + DT
Sbjct: 296 GRTEAVPRGRRA---------SSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDT 346
Query: 430 GSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLT 489
G++ T ++AY+ L + D + LP + P S++D
Sbjct: 347 GTAVTRLPREAYAALRGA----------FDGAMGALP-----RSPAVSLLD-----TCYD 386
Query: 490 LHFGSKWQIVSTKFHIS-------PEGYLVISKKGNI-CLGILDGSEVHNGSTIILGDIS 541
L + ++ + F+ P L++ G + CL S ILG+I
Sbjct: 387 LSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFCLAFAPSSS----GISILGNIQ 442
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
G + D+ N +G+ + C
Sbjct: 443 QEGIQITVDSANGYVGFGPNTC 464
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 100/401 (24%), Positives = 163/401 (40%), Gaps = 38/401 (9%)
Query: 172 INKKLVSSNAVAVDSSSIFPLR-GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQC 230
I+ +L S ++ P++ G G Y + +G P + + L DTGSD+TW QC
Sbjct: 88 IHARLSSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQC 147
Query: 231 DAPCSSCAKGANPLYKPRMGNILPYKD-----SLCMEIQRNHKPGYCETCQQCDYEIEYA 285
+ +C K P P YK+ +LC + K + C Y+++Y
Sbjct: 148 EPCVKTCYKQKEPRLNPSTST--SYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYG 205
Query: 286 DHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSL 345
D S S+G A + L L+ N N +FGC GL L R K++L
Sbjct: 206 DGSYSIGFFATETLTLSSSN---VFKNFLFGCGQQNNGLFGGAAGLLG----LGRTKLAL 258
Query: 346 PSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM-LDSPFMELYHTEIL 404
PSQ A K + +CL ++ GY+ LG + S + + P+ D Y +I
Sbjct: 259 PSQTAK--TYKKLFSYCLPASSSSKGYLSLGGQV--SKSVKFTPLSADFDSTPFYGLDIT 314
Query: 405 KINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT 464
++ G L++ G + D+G+ T + AYSEL ++ + + +D +
Sbjct: 315 GLSVGGRKLSIDESAFSAG-TVIDSGTVITRLSPTAYSELSSAFQNLMTD-YPSTSGYSI 372
Query: 465 LPVCWR-AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLG 522
C+ +K+ I V FK + I G L ++ +CL
Sbjct: 373 FDTCYDFSKYDTVRIPKVGVTFKG------------GVEMDIDVSGILYPVNGLKKVCLA 420
Query: 523 ILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + T I G++ R VVYD R+G+A C
Sbjct: 421 FAGNDD--DSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 459
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 88/403 (21%), Positives = 167/403 (41%), Gaps = 70/403 (17%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
L+ + +G+ + +DTGS+ +QC + + P++ P +P
Sbjct: 98 ALFSMQLGIGSLQKNLSAIIDTGSEAVLVQCGSR-------SRPVFDPAASQSYRQVPCI 150
Query: 257 DSLCMEIQRNHKPGYCETC----QQCDYEIEYADHSSSMGVLARDELHLTIENGS---LT 309
LC+ +Q+ G + C C Y + Y D +S G ++D + L N S +
Sbjct: 151 SQLCLAVQQQTSNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQ 210
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT---N 366
+V FGCA+ QG L++ + + GI+G +R +SLPSQL + + + +C +
Sbjct: 211 FRDVAFGCAHSPQGFLVD--LGSLGIVGFNRGNLSLPSQLKDR-LGGSKFSYCFPSQPWQ 267
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFM----ELYHTEILKINYGSSPLNLGARNSQV 422
G +FLG + + + P+LD+P +LY+ + I+ L + ++
Sbjct: 268 PRATGVIFLGDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKL 327
Query: 423 ------GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLD-------------ASDP 463
G + D+G+++T AY+ + + GL ++
Sbjct: 328 DPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGS 387
Query: 464 TLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN---IC 520
+LP + +++ V ++ F+ L ++ +S GN +C
Sbjct: 388 SLPGVPEVRLSLQNNVRLELRFEHL---------------------FVPVSAAGNEVTVC 426
Query: 521 LGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
L IL + G +LG+ LV YDN R+G+ ++ C
Sbjct: 427 LAILSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADC 469
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 127/496 (25%), Positives = 206/496 (41%), Gaps = 83/496 (16%)
Query: 85 LFLFLAISIFALILYGSVFSYTLQDRYKSNNDDENKESFVFPLYHKFGIREVSQRDAEFK 144
L L +A++IFA VFS+ + + + F L H D+
Sbjct: 7 LSLVVALAIFAF-----VFSHAFSTSRRVLEHPKVQNGFRAKLKH---------VDSGKN 52
Query: 145 LGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFT 204
L +F E + V G R + K + SSN+ +D+ + P GN G +
Sbjct: 53 LTKF-----ERIQHGVKRGRHRLQRFKAMALVASSNS-EIDAP-VLP--GN----GEFLM 99
Query: 205 YMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP---RMGNILPYKDSLCM 261
+ +G PP Y MDTGSDL W QC PC+ C P++ P + L LC
Sbjct: 100 KLAIGTPPETYSAIMDTGSDLIWTQCK-PCTQCFDQPTPIFDPKKSSSFSKLSCSSKLCE 158
Query: 262 EIQRNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYD 320
+ ++ TC C+Y Y D+SS+ G+LA + T+ G ++ P V FGC D
Sbjct: 159 ALPQS-------TCSDGCEYLYGYGDYSSTQGMLASE----TLTFGKVSVPEVAFGCGED 207
Query: 321 QQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLV 380
+G + + G++GL R +SL SQL + +CLT+ L L
Sbjct: 208 NEG---SGFSQGSGLVGLGRGPLSLVSQLK-----EPKFSYCLTSVDDTKASTLLMGSLA 259
Query: 381 PSWG----MAWVPML-DSPFMELYHTEILKINYGSSPL-----NLGARNSQVGWALFDTG 430
+ P++ +S Y+ + I+ G + L + G + D+G
Sbjct: 260 SVKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSG 319
Query: 431 SSYTYFTKQAYSELIASLKEVSSD-GLVLDASDPT-LPVCWRAKFPIRSIVDVKQFFKTL 488
++ TY + A+ +L+A KE +S L +D S T L VC+ P S D++ L
Sbjct: 320 TTITYLEQSAF-DLVA--KEFTSQINLPVDNSGSTGLEVCF--TLPSGS-TDIE--VPKL 371
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGILDGSEVHNGSTIILGDISLRGQLV 547
HF + E Y++ + G CL + S + I G+I + LV
Sbjct: 372 VFHFD------GADLELPAENYMIADASMGVACLAMGSSSGMS-----IFGNIQQQNMLV 420
Query: 548 VYDNVNKRIGWAKSHC 563
++D + + + + C
Sbjct: 421 LHDLEKETLSFLPTQC 436
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 74/263 (28%), Positives = 122/263 (46%), Gaps = 31/263 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR-----MGNILP 254
G YF + +G+PPR Y+ +D+GSD+ W+QC PC+ C +PL+ P MG +
Sbjct: 41 GEYFVRIGLGSPPRSQYMVIDSGSDIVWVQCK-PCTQCYHQTDPLFDPADSASFMG--VS 97
Query: 255 YKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
++C ++ C + +C YE+ Y D S + G LA + L G NV
Sbjct: 98 CSSAVCDRVENAG----CNS-GRCRYEVSYGDGSYTKGTLALETLTF----GRTVVRNVA 148
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA-GGGGYM 373
GC + +G+ V G+LGL +S QL+ Q N +CL + G++
Sbjct: 149 IGCGHSNRGM----FVGAAGLLGLGGGSMSFMGQLSGQ--TGNAFSYCLVSRGTNTNGFL 202
Query: 374 FLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQV-----GWALF 427
G + +P G AW+P++ +P Y+ +L + G + + + Q+ G +
Sbjct: 203 EFGSEAMP-VGAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVM 261
Query: 428 DTGSSYTYFTKQAYSELIASLKE 450
DTG++ T F AY + E
Sbjct: 262 DTGTAVTRFPTVAYEAFRNAFIE 284
>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
Length = 551
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 99/386 (25%), Positives = 164/386 (42%), Gaps = 42/386 (10%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCA---------KGANPLYKPRMGN 251
L++ + VG P + + +DTGSDL W+ CD C CA G P + +
Sbjct: 104 LHYAEVAVGTPNTTFLVALDTGSDLFWVPCD--CKQCAPLGNLTAVDGGGGPELRQYSPS 161
Query: 252 ILPYKDSLCMEIQRNHKPGYCETC-QQCDYEIEYA-DHSSSMGVLARDELHLTIEN---- 305
++ +P C T C Y + YA ++SS G L D L+LT E
Sbjct: 162 KSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGELVEDVLYLTREKGAAA 221
Query: 306 ---GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGH 361
G+ + VVFGC Q G L+ DG++GL KVS+PS LAS G++K N
Sbjct: 222 AAAGAAVRTPVVFGCGQVQTGSFLDG-AAADGLMGLGMEKVSVPSILASTGVVKSNSFSM 280
Query: 362 CLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQ 421
C + + G G + G S + P + Y+ I ++ G L LG
Sbjct: 281 CFSKD--GLGRINFGD--TGSADQSETPFIVKSTHSYYNISITSMSVGDKNLPLGF---- 332
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKF---PIRS 477
+A+ D+G+S+TY AY+ + ++S S + P + + P ++
Sbjct: 333 --YAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFEYCYSLSPDQT 390
Query: 478 IVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIIL 537
V++ +LT + G+ + + S + I+ + + CL ++ + I+
Sbjct: 391 TVELP--IVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAVIKSDLPID----II 444
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHC 563
G + G VV++ +GW K C
Sbjct: 445 GQNFMTGLKVVFNREKSVLGWQKFDC 470
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 119/457 (26%), Positives = 196/457 (42%), Gaps = 47/457 (10%)
Query: 124 VFPLYHKFGIREVSQRDAEFKLGRF-VDL-DGESVVASVNDGIIRPHKSKINKKLVSS-- 179
VF L + +S R+A L F +DL +S ++ D + P + N SS
Sbjct: 8 VFMLLALYSPSSISTREAGEGLRGFSIDLIHRDSPLSPFYDPSLTPSERITNAAFRSSSR 67
Query: 180 -NAVA--VDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236
N V+ +D +++ P I +G Y + +G PP DTGSDL W+QC +PC +
Sbjct: 68 LNRVSHFLDENNL-PESLLIPENGEYLMTLYIGTPPVERLAIADTGSDLIWVQC-SPCQN 125
Query: 237 CAKGANPLYKPRMGNILPYKDSLCMEIQRNHKP---GYCETCQQCDYEIEYADHSSSMGV 293
C PL++P + +K + C P C QC Y Y D S ++GV
Sbjct: 126 CFPQDTPLFEPLKSST--FKAATCDSQPCTSVPPSQRQCGKVGQCIYSYSYGDKSFTVGV 183
Query: 294 LARDELHL--TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLAS 351
+ + L T + +++ P+ +FGC +T K G++GL +SL SQL
Sbjct: 184 VGTETLSFGSTGDAQTVSFPSSIFGCGV-YNNFTFHTSDKVTGLVGLGGGPLSLVSQLGP 242
Query: 352 QGIIKNVVGHCLT--TNAGGGGYMFLGHDLVPSWGMAWVPMLDSP-FMELYHTEILKINY 408
Q I +CL ++ F +V + G+ P++ P F Y + +
Sbjct: 243 Q--IGYKFSYCLLPFSSNSTSKLKFGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTI 300
Query: 409 GSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVC 468
G + G + G + D+G+ TY + Y+ +ASL+EV S V A D LP
Sbjct: 301 GQKVVPTGRTD---GNIIIDSGTVLTYLEQTFYNNFVASLQEVLS---VESAQD--LPFP 352
Query: 469 WRAKFPIRSI-VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDG 526
++ FP R + + V F Q + P+ L+ + + +CL ++
Sbjct: 353 FKFCFPYRDMTIPVIAF------------QFTGASVALQPKNLLIKLQDRNMLCLAVVPS 400
Query: 527 SEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
S I G+++ VVYD K++ +A + C
Sbjct: 401 SL---SGISIFGNVAQFDFQVVYDLEGKKVSFAPTDC 434
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 94/389 (24%), Positives = 166/389 (42%), Gaps = 64/389 (16%)
Query: 192 LRGNIYPD-GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG 250
++ I P G Y + +G PP P +DTGSDLTW QC PC+ C K PL+ P+
Sbjct: 81 IQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQC-RPCTHCYKQVVPLFDPK-- 137
Query: 251 NILPYKD-----SLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN 305
N Y+D S C+ + ++ C ++C + YAD S + G LA + L +
Sbjct: 138 NSSTYRDSSCGTSFCLALGKDRS---CSKEKKCTFRYSYADGSFTGGNLASETLTVDSTA 194
Query: 306 GS-LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL- 363
G ++ P FGC + G+ + + GI+GL ++SL SQL S I + +CL
Sbjct: 195 GKPVSFPGFAFGCGHSSGGIFDKS---SSGIVGLGGGELSLISQLKST--INGLFSYCLL 249
Query: 364 ---TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNS 420
T ++ F V +G P+ P ++ +
Sbjct: 250 PVSTDSSISSRINFGASGRVSGYGTVSTPL-------------------RLPYKGYSKKT 290
Query: 421 QV--GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSI 478
+V G + D+G++YT+ ++ YS+L S+ S G + + +C+ I +
Sbjct: 291 EVEEGNIIVDSGTTYTFLPQEFYSKLEKSVAN-SIKGKRVRDPNGIFSLCYNTTAEINAP 349
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILG 538
+ +T HF + P + ++ +C + S++ +LG
Sbjct: 350 I--------ITAHFK------DANVELQPLNTFMRMQEDLVCFTVAPTSDIG-----VLG 390
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHCMNPG 567
+++ LV +D + K+ G++K + G
Sbjct: 391 NLAQVNFLVGFD-LRKKRGFSKKAEVEEG 418
>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
Japonica Group]
gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
Length = 551
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 99/386 (25%), Positives = 164/386 (42%), Gaps = 42/386 (10%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCA---------KGANPLYKPRMGN 251
L++ + VG P + + +DTGSDL W+ CD C CA G P + +
Sbjct: 104 LHYAEVAVGTPNTTFLVALDTGSDLFWVPCD--CKQCAPLGNLTAVDGGGGPELRQYSPS 161
Query: 252 ILPYKDSLCMEIQRNHKPGYCETC-QQCDYEIEYA-DHSSSMGVLARDELHLTIEN---- 305
++ +P C T C Y + YA ++SS G L D L+LT E
Sbjct: 162 KSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGELVEDVLYLTREKGAAA 221
Query: 306 ---GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGH 361
G+ + VVFGC Q G L+ DG++GL KVS+PS LAS G++K N
Sbjct: 222 AAAGAAVRTPVVFGCGQVQTGSFLDG-AAADGLMGLGMEKVSVPSILASTGVVKSNSFSM 280
Query: 362 CLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQ 421
C + + G G + G S + P + Y+ I ++ G L LG
Sbjct: 281 CFSKD--GLGRINFGD--TGSADQSETPFIVKSTHSYYNISITSMSVGDKNLPLGF---- 332
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKF---PIRS 477
+A+ D+G+S+TY AY+ + ++S S + P + + P ++
Sbjct: 333 --YAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFEYCYSLSPDQT 390
Query: 478 IVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIIL 537
V++ +LT + G+ + + S + I+ + + CL ++ + I+
Sbjct: 391 TVELP--VVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAVIKSDLPID----II 444
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHC 563
G + G VV++ +GW K C
Sbjct: 445 GQNFMTGLKVVFNREKSVLGWQKFDC 470
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 99/403 (24%), Positives = 166/403 (41%), Gaps = 35/403 (8%)
Query: 172 INKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCD 231
I +K+ +S+ S+ G Y + +G P +++DTGSD +W+QC
Sbjct: 109 IRRKVTASSNKPKGGVSLLANWGKSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCK 168
Query: 232 APCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQ--RNHKPGYCETCQQCDYEIEYAD 286
PC+ C + +P++ P + +P C E+ + + + + C YE+ Y D
Sbjct: 169 -PCADCYEQRDPVFDPTASSTYSAVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDD 227
Query: 287 HSSSMGVLARDELHLTIENGSL---TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKV 343
S ++G LARD L L+ T P VFGC + G T + DG+LGL K
Sbjct: 228 DSHTVGDLARDTLTLSPSPSPSPADTVPGFVFGCGHSNAG----TFGEVDGLLGLGLGKA 283
Query: 344 SLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEI 403
SLPSQ+A++ +CL ++ GY+ G + + M+ Y+ +
Sbjct: 284 SLPSQVAAR--YGAAFSYCLPSSPSAAGYLSFGGAAARA-NAQFTEMVTGQDPTSYYLNL 340
Query: 404 LKINYGSSPLNLGARN-SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDA-S 461
I + + A + + D+G++++ AY+ L +S + A S
Sbjct: 341 TGIVVAGRAIKVPASAFATAAGTIIDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPS 400
Query: 462 DPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNIC 520
P C+ F V + + L F + H+ P G L + C
Sbjct: 401 SPIFDTCY--DFTGHETVRI----PAVELVFADGATV-----HLHPSGVLYTWNDVAQTC 449
Query: 521 LGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
L V N ILG+ R V+YD ++RIG+ + C
Sbjct: 450 LAF-----VPNHDLGILGNTQQRTLAVIYDVGSQRIGFGRKGC 487
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 93/391 (23%), Positives = 161/391 (41%), Gaps = 45/391 (11%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL 253
G I DG +F + +G PP + DTGSDLTW+QC PC C K P++ + +
Sbjct: 77 GLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQC-KPCQQCYKENGPIFDKKKSSTY 135
Query: 254 PYK--DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTK 310
+ DS + + + G E+ C Y Y D S S G +A + + + +GS ++
Sbjct: 136 KSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSF 195
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TTNA 367
P VFGC Y+ G T +GL +SL SQL S I +CL +
Sbjct: 196 PGTVFGCGYNNGGTFDETGSGI---IGLGGGHLSLISQLGSS--ISKKFSYCLSHKSATT 250
Query: 368 GGGGYMFLGHDLVPS-----WGMAWVPMLDSPFMELYHTEILKINYGS----------SP 412
G + LG + +PS G+ P++D + Y+ + I+ G +P
Sbjct: 251 NGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNP 310
Query: 413 LNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
+ G + G + D+G++ T + + ++++E + + L C+++
Sbjct: 311 NDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFKSG 370
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNG 532
+ +T+HF +SP V + +CL ++ +EV
Sbjct: 371 -------SAEIGLPEITVHF------TGADVRLSPINAFVKLSEDMVCLSMVPTTEV--- 414
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I G+ + LV YD + + + C
Sbjct: 415 --AIYGNFAQMDFLVGYDLETRTVSFQHMDC 443
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 103/384 (26%), Positives = 164/384 (42%), Gaps = 55/384 (14%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS--CAKGANPLYKPRMGNI---LPYK 256
Y + +G P + +DTGSDL+W+QC PC + C +PL+ P + +P
Sbjct: 171 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCGAGECYAQKDPLFDPSSSSSYASVPCD 229
Query: 257 DSLCMEIQRNHKPGYCE-----TCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP 311
C ++ C C+Y IEY + +++ GV + + L L KP
Sbjct: 230 SDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTL--------KP 281
Query: 312 NVV-----FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
VV FGC Q G K DG+LGL A SL SQ +SQ +CL
Sbjct: 282 GVVVADFGFGCGDHQHG----PYEKFDGLLGLGGAPESLVSQTSSQ--FGGPFSYCLPPT 335
Query: 367 AGGGGYMFLG-----HDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNS 420
+GG G++ LG + G+++ PM P + ++ L I+ G +PL +
Sbjct: 336 SGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAF 395
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASD-PTLPVCWRAKFPIRSIV 479
G + D+G+ T AY+ L ++ + S+ +L S+ L C+ F + V
Sbjct: 396 SSGM-VIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCY--DFTGHANV 452
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
V +LT G+ + +P G LV + CL G+ N II G+
Sbjct: 453 TVPTI--SLTFSGGATIDLA------APAGVLV-----DGCL-AFAGAGTDNAIGII-GN 497
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
++ R V+YD+ +G+ C
Sbjct: 498 VNQRTFEVLYDSGKGTVGFRAGAC 521
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 100/401 (24%), Positives = 163/401 (40%), Gaps = 38/401 (9%)
Query: 172 INKKLVSSNAVAVDSSSIFPLR-GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQC 230
I+ +L S ++ P++ G G Y + +G P + + L DTGSD+TW QC
Sbjct: 100 IHARLSSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQC 159
Query: 231 DAPCSSCAKGANPLYKPRMGNILPYKD-----SLCMEIQRNHKPGYCETCQQCDYEIEYA 285
+ +C K P P YK+ +LC + K + C Y+++Y
Sbjct: 160 EPCVKTCYKQKEPRLNPSTST--SYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYG 217
Query: 286 DHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSL 345
D S S+G A + L L+ N N +FGC GL L R K++L
Sbjct: 218 DGSYSIGFFATETLTLSSSN---VFKNFLFGCGQQNNGLFGGAAGLLG----LGRTKLAL 270
Query: 346 PSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM-LDSPFMELYHTEIL 404
PSQ A K + +CL ++ GY+ LG + S + + P+ D Y +I
Sbjct: 271 PSQTAK--TYKKLFSYCLPASSSSKGYLSLGGQV--SKSVKFTPLSADFDSTPFYGLDIT 326
Query: 405 KINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT 464
++ G L++ G + D+G+ T + AYSEL ++ + + +D +
Sbjct: 327 GLSVGGRKLSIDESAFSAG-TVIDSGTVITRLSPTAYSELSSAFQNLMTD-YPSTSGYSI 384
Query: 465 LPVCWR-AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLG 522
C+ +K+ I V FK + I G L ++ +CL
Sbjct: 385 FDTCYDFSKYDTVRIPKVGVTFKG------------GVEMDIDVSGILYPVNGLKKVCLA 432
Query: 523 ILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + T I G++ R VVYD R+G+A C
Sbjct: 433 FAGNDD--DSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 471
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 96/381 (25%), Positives = 163/381 (42%), Gaps = 51/381 (13%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYK-- 256
+G Y + +G PP Y +DTGSDL W QC PC+ C K P++ P+ +
Sbjct: 105 NGEYLMELAIGTPPVSYPAVLDTGSDLIWTQC-KPCTQCYKQPTPIFDPKKSSSFSKVSC 163
Query: 257 -DSLCMEIQRNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
SLC + + TC C+Y Y D+S + GVLA + ++ N+
Sbjct: 164 GSSLCSAVPSS-------TCSDGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIG 216
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-TNAGGGGYM 373
FGC D +G + + G++GL R +SL SQL + +CLT + +
Sbjct: 217 FGCGEDNEG---DGFEQASGLVGLGRGPLSLVSQLK-----EPRFSYCLTPMDDTKESIL 268
Query: 374 FLGH--DLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQV-----GWA 425
LG + + + P+L +P Y+ + I+ G + L++ +V G
Sbjct: 269 LLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGV 328
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT-LPVCWRAKFPIRSI-VDVKQ 483
+ D+G++ TY ++A+ L +S L LD + T L +C+ P S V++ +
Sbjct: 329 IIDSGTTITYIEQKAFEALKKEF--ISQTKLPLDKTSSTGLDLCF--SLPSGSTQVEIPK 384
Query: 484 FFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISL 542
+ HF + E Y++ S G CL + S + I G++
Sbjct: 385 ----IVFHFKGG------DLELPAENYMIGDSNLGVACLAMGASSGMS-----IFGNVQQ 429
Query: 543 RGQLVVYDNVNKRIGWAKSHC 563
+ LV +D + I + + C
Sbjct: 430 QNILVNHDLEKETISFVPTSC 450
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 119/492 (24%), Positives = 194/492 (39%), Gaps = 82/492 (16%)
Query: 88 FLAISIFALILYGSVFSYTLQDRYKSNNDDENKESFVFPLYHKFGIREVSQRDAEFKLGR 147
FL +S+F+L S FS+ L + F L H+ + + E K
Sbjct: 6 FLTLSLFSLCFIAS-FSHALSN------------GFSVELIHRDSPKSPYYKPTENKYQH 52
Query: 148 FVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMI 207
FVD S+ ++ N S+ + S++ P RG G TY
Sbjct: 53 FVDAARRSI-------------NRANHFFKDSD-TSTPESTVIPDRG-----GYLMTYS- 92
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCM-EIQRN 266
VG PP Y DTGSD+ W+QC+ PC C P++ P + YK+ C+ ++ +
Sbjct: 93 VGTPPTKIYGIADTGSDIVWLQCE-PCEQCYNQTTPIFNPSKSS--SYKNIPCLSKLCHS 149
Query: 267 HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVVFGCAYDQQGLL 325
+ C C Y+I Y D S S G L+ D L L +GS ++ P V GC D G
Sbjct: 150 VRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKTVIGCGTDNAGTF 209
Query: 326 LNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT----TNAGGGGYMFLGHDLVP 381
+ GI+GL VSL +QL S I +CL + + G V
Sbjct: 210 GGA---SSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSILSFGDAAVV 264
Query: 382 SW-GMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNS---QVGWALFDTGSSYTYFT 437
S G+ P++ + Y + + G+ + G + G + D+G++ T
Sbjct: 265 SGDGVVSTPLIKKDPV-FYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGTTLTLIP 323
Query: 438 KQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK-----FPIRSIVDVKQFFKTLTLHF 492
Y+ L +++ ++ V D + +C+ K FPI +T HF
Sbjct: 324 SDVYTNLESAVVDLVKLDRV-DDPNQQFSLCYSLKSNEYDFPI------------ITAHF 370
Query: 493 -GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
G+ ++ S V G +C ++ + I G+++ + LV YD
Sbjct: 371 KGADIELHSIS-------TFVPITDGIVCFAFQPSPQLGS----IFGNLAQQNLLVGYDL 419
Query: 552 VNKRIGWAKSHC 563
K + + + C
Sbjct: 420 QQKTVSFKPTDC 431
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 104/410 (25%), Positives = 175/410 (42%), Gaps = 45/410 (10%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSD 224
IR +++ +S + D+++ P+R G Y + G P + Y +DTGSD
Sbjct: 81 IRGDANRLRFLKRTSRSSKEDANANVPVRSG---SGEYIIQVDFGTPKQSMYTLIDTGSD 137
Query: 225 LTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY 284
+ WI C C C A P++ P + YK C G C +C +E+ Y
Sbjct: 138 VAWIPCKQ-CQGCHSTA-PIFDPAKSS--SYKPFACDSQPCQEISGNCGGNSKCQFEVLY 193
Query: 285 ADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVS 344
D + G LA D + L GS PN FGCA L + G++GL +S
Sbjct: 194 GDGTQVDGTLASDAITL----GSQYLPNFSFGCAES----LSEDTYSSPGLMGLGGGSLS 245
Query: 345 LPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD-LVPSWGMAWVPMLDSP-FMELYHTE 402
L +Q + + +CL +++ G + LG + V S + + ++ P F Y
Sbjct: 246 LLTQAPTAELFGGTFSYCLPSSSTSSGSLVLGKEAAVSSSSLKFTTLIKDPSFPTFYFVT 305
Query: 403 ILKINYGSSPLNLGARN-SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDAS 461
+ I+ G++ +++ A N + G + D+G++ TY AY +L + ++ L +
Sbjct: 306 LKAISVGNTRISVPATNIASGGGTIIDSGTTITYLVPSAYKDLRDAFRQ------QLSSL 359
Query: 462 DPT----LPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKG 517
PT + C+ S VDV T+TLH +V K E L+ + G
Sbjct: 360 QPTPVEDMDTCYDLS---SSSVDV----PTITLHLDRNVDLVLPK-----ENILITQESG 407
Query: 518 NICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPG 567
CL S I+G++ + +V+D N ++G+A+ C P
Sbjct: 408 LSCLAF-----SSTDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQCAAPA 452
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 97/385 (25%), Positives = 167/385 (43%), Gaps = 62/385 (16%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYK 256
G YFT + VG P R Y+ +DTGSD+ WIQC+ PCS C +P++ P + + L
Sbjct: 195 GEYFTRIGVGTPMREQYMVLDTGSDVVWIQCE-PCSKCYSQVDPIFNPSLSASFSTLGCN 253
Query: 257 DSLC--MEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
++C ++ H G C Y++ Y D S ++G A + L G+ + NV
Sbjct: 254 SAVCSYLDAYNCHGGG-------CLYKVSYGDGSYTIGSFATEMLTF----GTTSVRNVA 302
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN-AGGGGYM 373
GC +D GL V G+LGL +S PSQL +Q +CL + G +
Sbjct: 303 IGCGHDNAGL----FVGAAGLLGLGAGLLSFPSQLGTQ--TGRAFSYCLVDRFSESSGTL 356
Query: 374 FLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGA-------RNSQVGWA 425
G + VP G P+L +P + Y+ ++ I+ G + L+ S G
Sbjct: 357 EFGPESVP-LGSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRGGF 415
Query: 426 LFDTGSSYTYFTKQAYSEL----IASLKEV-SSDGLVLDASDPTLPVCWR-AKFPIRSIV 479
+ D+G++ T Y + +A +++ ++G+ + C+ + P+ ++
Sbjct: 416 IVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSI------FDTCYDLSGLPLVNV- 468
Query: 480 DVKQFFKTLTLHFGSKWQ-IVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILG 538
T+ HF + I+ K ++ P ++ G C + I+G
Sbjct: 469 ------PTVVFHFSNGASLILPAKNYMIPMDFM-----GTFCFAFAPATS----DLSIMG 513
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHC 563
+I +G V +D N +G+A C
Sbjct: 514 NIQQQGIRVSFDTANSLVGFALRQC 538
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 104/376 (27%), Positives = 157/376 (41%), Gaps = 38/376 (10%)
Query: 199 DGLYFTYMI-VGNPPRPYYLDMDTGSDLTWIQCDAPCSS-CAKGANPLYKPRMGNILPYK 256
D L F + G+P + Y L +DTGSD++WIQC PCS C K +P++ P Y
Sbjct: 157 DTLEFVVTVGFGSPAQNYTLSIDTGSDVSWIQC-LPCSGHCYKQHDPVFDPTKSAT--YS 213
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
C Q G C C Y++ Y D SS+ GVL+ + L L + + P FG
Sbjct: 214 AVPCGHPQCAAAGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSL---SSTRDLPGFAFG 270
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C L DG++GL R +SLPSQ A+ +CL + GY+ +G
Sbjct: 271 CGQTN----LGEFGGVDGLVGLGRGALSLPSQAAA--TFGATFSYCLPSYDTTHGYLTMG 324
Query: 377 HDLVPSWG-----MAWVPMLDSP-FMELYHTEILKINYGSSPLNLGARNSQVGWALFDTG 430
P+ + + M+ + LY E++ I+ G L + LFD+G
Sbjct: 325 -STTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFTRDGTLFDSG 383
Query: 431 SSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTL 490
+ TY +AY+ L K + A DP C+ F + + F +
Sbjct: 384 TILTYLPPEAYASLRDRFKFTMTQYKPAPAYDP-FDTCY--DFTGHNAI----FMPAVAF 436
Query: 491 HFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI---ILGDISLRGQLV 547
F F +SP L+ G L + V ST+ I+G+ RG V
Sbjct: 437 KFSD-----GAVFDLSPVAILIYPDDTAPATGCL--AFVPRPSTMPFNIIGNTQQRGTEV 489
Query: 548 VYDNVNKRIGWAKSHC 563
+YD ++IG+ + C
Sbjct: 490 IYDVAAEKIGFGQFTC 505
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 94/383 (24%), Positives = 158/383 (41%), Gaps = 42/383 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS-CAKGANPLYKPRMGN---ILPY 255
G Y + +G PP PY DTGSDL W QC APC+S C + PLY P +LP
Sbjct: 90 GEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFRQPTPLYNPSSSTTFAVLPC 148
Query: 256 KDSL--CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
SL C C C Y + Y +S+ + + G P +
Sbjct: 149 NSSLSVCAAALAGTGTAPPPGC-ACTYNVTYGSGWTSVFQGSETFTFGSTPAGHARVPGI 207
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT----TNAGG 369
FGC+ G G++GL R ++SL SQL G+ K +CLT TN+
Sbjct: 208 AFGCSTASSGF---NASSASGLVGLGRGRLSLVSQL---GVPK--FSYCLTPYQDTNSTS 259
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDS----PFMELYHTEILKINYGSSPLNL-----GARNS 420
+ L + G++ P + S P Y+ + I+ G++ L++
Sbjct: 260 TLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNAD 319
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
G + D+G++ T AY ++ A++ + + ++D L +C F + S
Sbjct: 320 GTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSADTGLDLC----FMLPSSTS 375
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDI 540
++TLHF + + + Y++ G CL + + ++ G ILG+
Sbjct: 376 APPAMPSMTLHFNGADMV------LPADSYMMSDDSGLWCLAMQNQTD---GEVNILGNY 426
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+ ++YD + + +A + C
Sbjct: 427 QQQNMHILYDIGQETLSFAPAKC 449
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 92/382 (24%), Positives = 155/382 (40%), Gaps = 44/382 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF + VG P + +DTGSD+ W+QC APC C + ++ PR D +
Sbjct: 126 GEYFAQVGVGTPATTALMVLDTGSDVVWLQC-APCRHCYAQSGRVFDPRRSRSYAAVDCV 184
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
+R G C Y++ Y D S + G A + LT G+ + V GC +
Sbjct: 185 APICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASET--LTFARGARVQ-RVAIGCGH 241
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--------TNAGGGG 371
D +GL + G+LGL R ++S PSQ+A +CL ++
Sbjct: 242 DNEGL----FIAASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSSVRPSSTRSST 295
Query: 372 YMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV-------- 422
F + + G ++ PM +P M Y+ +L + G + + G S +
Sbjct: 296 VTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVK-GVSQSDLRLNPTTGR 354
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
G + D+G+S T + Y + + + + V C+ R +V V
Sbjct: 355 GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCY--NLSGRRVVKV- 411
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDIS 541
T+++H + + PE YL+ + G C + +G I+G+I
Sbjct: 412 ---PTVSMHLAGGASVA-----LPPENYLIPVDTSGTFCFAMAG----TDGGVSIIGNIQ 459
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+G VV+D +R+G+ C
Sbjct: 460 QQGFRVVFDGDAQRVGFVPKSC 481
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 92/382 (24%), Positives = 155/382 (40%), Gaps = 44/382 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF + VG P + +DTGSD+ W+QC APC C + ++ PR D +
Sbjct: 120 GEYFAQVGVGTPATTALMVLDTGSDVVWLQC-APCRHCYAQSGRVFDPRRSRSYAAVDCV 178
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
+R G C Y++ Y D S + G A + LT G+ + V GC +
Sbjct: 179 APICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASET--LTFARGARVQ-RVAIGCGH 235
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--------TNAGGGG 371
D +GL + G+LGL R ++S PSQ+A +CL ++
Sbjct: 236 DNEGL----FIAASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSSVRPSSTRSST 289
Query: 372 YMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV-------- 422
F + + G ++ PM +P M Y+ +L + G + + G S +
Sbjct: 290 VTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVK-GVSQSDLRLNPTTGR 348
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
G + D+G+S T + Y + + + + V C+ R +V V
Sbjct: 349 GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCY--NLSGRRVVKV- 405
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDIS 541
T+++H + + PE YL+ + G C + +G I+G+I
Sbjct: 406 ---PTVSMHLAGGASVA-----LPPENYLIPVDTSGTFCFAMAG----TDGGVSIIGNIQ 453
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+G VV+D +R+G+ C
Sbjct: 454 QQGFRVVFDGDAQRVGFVPKSC 475
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 98/387 (25%), Positives = 164/387 (42%), Gaps = 50/387 (12%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL----P 254
D L++T++ +G P + + +D GSDL W+ CD C CA + Y + L P
Sbjct: 104 DWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCD--CIQCAPLSASYYNISLDRDLSEYSP 161
Query: 255 YKDSLCMEIQRNHKPGYCE---TCQQ----CDYEIEYAD--HSSSMGVLARDELHLTIEN 305
S + +H+ CE C+ C Y Y D +++S G L D+LHL
Sbjct: 162 SLSSTSRHLSCDHQ--LCEWGSNCKNPKDPCPYIFNYDDFENTTSAGFLVEDKLHLASVG 219
Query: 306 G----SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
+ + +VV GC Q G + DG++GL +S+PS LA G+I+N
Sbjct: 220 DHTARKMLQASVVLGCGRKQGGSFFDG-AAPDGVMGLGPGDISVPSLLAKAGLIQNCFSL 278
Query: 362 CLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQ 421
C N G +F ++P+ + Y + G+S L
Sbjct: 279 CFDEN-DSGRILFGDRGHASQQSTPFLPIQGT--YVAYFVGVESYCVGNSCLKRSGFK-- 333
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
AL D+GSS+TY + Y+EL++ K+V++ + D C+ A + + D
Sbjct: 334 ---ALVDSGSSFTYLPSEVYNELVSEFDKQVNAKRISF--QDGLWDYCYNAS--SQELHD 386
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKG--NICLGILDGSEVHNGSTIILG 538
+ + L F + F + Y + +G CL + + +GS I+G
Sbjct: 387 I----PAIQLKFPR-----NQNFVVHNPTYSIPHHQGFTMFCLSL----QPTDGSYGIIG 433
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHCMN 565
+ G +V+D N ++GW+ S C +
Sbjct: 434 QNFMIGYRMVFDIENLKLGWSNSSCQD 460
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 97/385 (25%), Positives = 155/385 (40%), Gaps = 54/385 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYK 256
G Y + +G PP Y MDTGSDL W QC APC CA P + + LP +
Sbjct: 87 GEYLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCAAQPTPYFDVKRSATYRALPCR 145
Query: 257 DSLCMEIQRNHKPGYCETC--QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNV 313
S C + +C + C Y+ Y D +S+ GVLA + + + + N+
Sbjct: 146 SSRCAALSS-------PSCFKKMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAANI 198
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG---GG 370
FGC G L N + G++G R +SL SQL + +CLT+
Sbjct: 199 SFGCGSLNAGELAN----SSGMVGFGRGPLSLVSQLG-----PSRFSYCLTSYLSPTPSR 249
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFM------ELYHTEILKINYGSS-----PLNLGARN 419
Y + +L + + P+ +PF+ +Y + I+ G+ PL +
Sbjct: 250 LYFGVFANLNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAIND 309
Query: 420 SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV 479
G + D+G+S T+ + AY + L + D +D L C++ P V
Sbjct: 310 DGTGGVIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMND-TDIGLDTCFQWPPPPNVTV 368
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGILDGSEVHNGSTIILG 538
V F HF + PE Y++I S G +CL + S I+G
Sbjct: 369 TVPDF----VFHFDGA------NMTLPPENYMLIASTTGYLCLAMAPTSV-----GTIIG 413
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHC 563
+ + ++YD N + + + C
Sbjct: 414 NYQQQNLHLLYDIANSFLSFVPAPC 438
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 96/380 (25%), Positives = 161/380 (42%), Gaps = 47/380 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKD-- 257
G Y + +G PP P +DTGSDLTW QC PC+ C K P + P+ N Y+D
Sbjct: 90 GEYIMNLSIGTPPVPVIAIVDTGSDLTWTQC-RPCTHCYKQVVPFFDPK--NSSTYRDSS 146
Query: 258 ---SLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNV 313
S C+ + + C ++C + YAD S + G LA + L + G ++ P
Sbjct: 147 CGTSFCLALGNDRS---CRNGKKCTFMYSYADGSFTGGNLAVETLTVASTAGKPVSFPGF 203
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL----TTNAGG 369
FGC + G+ + GI+GL A++S+ SQL S I +CL T ++
Sbjct: 204 AFGCVHRSGGIFDE---HSSGIVGLGVAELSMISQLKST--INGRFSYCLLPVFTDSSMS 258
Query: 370 GGYMFLGHDLVPSWGMAWVPM-LDSPFMELYHTEILKINYGSSPLNLG--ARNSQV--GW 424
F +V G P+ + P Y + + G L+ ++ ++V G
Sbjct: 259 SRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVEEGN 318
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+ D+G++YTY + Y +L S+ V D + + +C+ V Q
Sbjct: 319 IIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGIS-SLCYNTT--------VDQI 369
Query: 485 -FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
+T HF + P + ++ +C +L S++ ILG+++
Sbjct: 370 DAPIITAHFK------DANVELQPWNTFLRMQEDLVCFTVLPTSDIG-----ILGNLAQV 418
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
LV +D KR+ + + C
Sbjct: 419 NFLVGFDLRKKRVSFKAADC 438
>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 537
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 106/386 (27%), Positives = 164/386 (42%), Gaps = 49/386 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPY---KD 257
L++ + VG P + + +DTGSDL W+ CD C CA AN ++ PY K
Sbjct: 106 LHYAEVAVGTPNATFLVALDTGSDLFWVPCD--CKQCAPIANASDLRGGPDLRPYSPGKS 163
Query: 258 SLCMEIQRNH----KPGYC----ETCQQCDYEIEY-ADHSSSMGVLARDELHLTIENG-- 306
S + H +P C + C Y + Y + ++SS GVL D LHL+ E
Sbjct: 164 STSKAVTCEHALCERPNACAAAGNSSTSCPYTVRYVSANTSSSGVLVEDVLHLSREAAGG 223
Query: 307 ---SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII-KNVVGHC 362
++T P VV GC Q G L+ DG+LGL KVS+PS L + G++ + C
Sbjct: 224 ASTAVTAP-VVLGCGQVQTGAFLDG-AAVDGLLGLGMDKVSVPSVLHAAGLVASDSFSMC 281
Query: 363 LTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV 422
+ + G G + G G A P Y+ + ++ G +
Sbjct: 282 FSPD--GFGRINFGDS--GRRGQAETPFTVRNTHPTYNISVTAMSVS------GKEVAAE 331
Query: 423 GWALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
A+ D+G+S+TY AY+EL EV L AS P C+ R ++
Sbjct: 332 FAAIVDSGTSFTYLNDPAYTELATGFNSEVRERRANLSASIP-FEYCYELG---RGQTEL 387
Query: 482 KQFFKTLTLHFGSKWQI---VSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI-IL 537
+LT G+ + + + + + +G +V + CL +L N TI I+
Sbjct: 388 FVPEVSLTTRGGAVFPVTRPIVVIYGETSDGRIVAA---GYCLAVL-----KNDITIDII 439
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHC 563
G + G VV+D +GW + C
Sbjct: 440 GQNFMTGLKVVFDRERSVLGWHEFDC 465
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 94/383 (24%), Positives = 158/383 (41%), Gaps = 42/383 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS-CAKGANPLYKPRMGN---ILPY 255
G Y + +G PP PY DTGSDL W QC APC+S C + PLY P +LP
Sbjct: 30 GEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFRQPTPLYNPSSSTTFAVLPC 88
Query: 256 KDSL--CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
SL C C C Y + Y +S+ + + G P +
Sbjct: 89 NSSLSVCAAALAGTGTAPPPGC-ACTYNVTYGSGWTSVFQGSETFTFGSTPAGHARVPGI 147
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT----TNAGG 369
FGC+ G G++GL R ++SL SQL G+ K +CLT TN+
Sbjct: 148 AFGCSTASSGF---NASSASGLVGLGRGRLSLVSQL---GVPK--FSYCLTPYQDTNSTS 199
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDS----PFMELYHTEILKINYGSSPLNL-----GARNS 420
+ L + G++ P + S P Y+ + I+ G++ L++
Sbjct: 200 TLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNAD 259
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
G + D+G++ T AY ++ A++ + + ++D L +C F + S
Sbjct: 260 GTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSADTGLDLC----FMLPSSTS 315
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDI 540
++TLHF + + + Y++ G CL + + ++ G ILG+
Sbjct: 316 APPAMPSMTLHFNGADMV------LPADSYMMSDDSGLWCLAMQNQTD---GEVNILGNY 366
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+ ++YD + + +A + C
Sbjct: 367 QQQNMHILYDIGQETLSFAPAKC 389
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 93/383 (24%), Positives = 162/383 (42%), Gaps = 45/383 (11%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
+ G G YF+ + VG P + Y+ +DTGSD+ WIQC PCS C + ++P++ P +
Sbjct: 154 VSGTSQGSGEYFSRIGVGTPAKEMYVVLDTGSDVNWIQC-LPCSECYQQSDPIFDPTSSS 212
Query: 252 I---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSL 308
L D C + + C + +C Y++ Y D S ++G A D + E+G +
Sbjct: 213 TFKSLTCSDPKCASLDVSA----CRS-NKCLYQVSYGDGSFTVGNYATDTVTFG-ESGKV 266
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
+V GC +D +GL GL +S+ +Q+ ++ +CL
Sbjct: 267 N--DVALGCGHDNEGLFTGAAGLL----GLGGGALSMTNQIKAKSF-----SYCLVDRDS 315
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGAR-----NSQV 422
+ + G A P+L + M+ Y+ + + G +++ + S
Sbjct: 316 AKSSSLDFNSVQIGAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGA 375
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
G + D G++ T QAY+ L + ++++D + C+ F S V V
Sbjct: 376 GGVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCY--DFSSLSTVKV- 432
Query: 483 QFFKTLTLHF-GSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDI 540
T+T HF G K ++ + YL+ I G C S S I+G++
Sbjct: 433 ---PTVTFHFTGGK------SLNLPAKNYLIPIDDAGTFCFAFAPTSS----SLSIIGNV 479
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+G + YD N IG + + C
Sbjct: 480 QQQGTRITYDLANNLIGLSANKC 502
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 78/265 (29%), Positives = 112/265 (42%), Gaps = 30/265 (11%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
Y + +G PP P+ DTGSDLTW QC PC C P+Y P + LP +
Sbjct: 71 YLMELAIGKPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPLPCSSA 129
Query: 259 LCMEI-QRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGC 317
C+ I RN C C Y Y D + S G+L + L L + ++ V FGC
Sbjct: 130 TCLPIWSRN-----CTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAFGC 184
Query: 318 AYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT--NAGGGGYMFL 375
D G LN + G +GL R +SL +QL G+ K +CLT N+ L
Sbjct: 185 GTDNGGDSLN----STGTVGLGRGTLSLLAQL---GVGK--FSYCLTDFFNSALDSPFLL 235
Query: 376 G--HDLVPSWGMAW-VPMLDSPFM-ELYHTEILKINYGSSPL-----NLGARNSQVGWAL 426
G +L P P+L SP Y + I+ G L R G +
Sbjct: 236 GTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMI 295
Query: 427 FDTGSSYTYFTKQAYSELIASLKEV 451
D+G+++T + + E++ + V
Sbjct: 296 VDSGTTFTILAESGFREVVGRVARV 320
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 88/374 (23%), Positives = 159/374 (42%), Gaps = 44/374 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YFT + VGNP R +Y+ +DTGSD+ W+QC PC+ C + +P++ P + Y
Sbjct: 159 GEYFTRVGVGNPARQFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPTASST--YAPVT 215
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
C Q + QC Y++ Y D S + G A + + +GS+ NV GC +
Sbjct: 216 CQSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFG-NSGSV--KNVALGCGH 272
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDL 379
D +GL V G+LGL +SL +QL + +CL G +
Sbjct: 273 DNEGL----FVGAAGLLGLGGGPLSLTNQLKATSF-----SYCLVNRDSAGSSTLDFNSA 323
Query: 380 VPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV-----GWALFDTGSSY 433
P++ + ++ Y+ + ++ G +++ ++ G + D G++
Sbjct: 324 QLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAI 383
Query: 434 TYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCW--RAKFPIRSIVDVKQFFKTLTLH 491
T QAY+ L + ++ + L L ++ C+ + +R T++ H
Sbjct: 384 TRLQTQAYNPLRDAFVRMTQN-LKLTSAVALFDTCYDLSGQASVR--------VPTVSFH 434
Query: 492 F--GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVY 549
F G W + + + I + G C + S I+G++ +G V +
Sbjct: 435 FADGKSWNLPAANYLIP------VDSAGTYCFAFAPTTS----SLSIIGNVQQQGTRVTF 484
Query: 550 DNVNKRIGWAKSHC 563
D N R+G++ + C
Sbjct: 485 DLANNRMGFSPNKC 498
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 100/381 (26%), Positives = 166/381 (43%), Gaps = 44/381 (11%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
+ G G YFT + +GNP R Y+ +DTGSD+ W+QC PC+ C P+++P
Sbjct: 141 ISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQC-TPCADCYHQTEPIFEPSS-- 197
Query: 252 ILPYKDSLCMEIQRNH-KPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
Y+ C Q N + C C YE+ Y D S ++G A + L + GS
Sbjct: 198 SSSYEPLSCDTPQCNALEVSECRN-ATCLYEVSYGDGSYTVGDFATETLTI----GSTLV 252
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-TNAGG 369
NV GC + +GL V G+LGL ++LPSQL + +CL ++
Sbjct: 253 QNVAVGCGHSNEGL----FVGAAGLLGLGGGLLALPSQLNTTSF-----SYCLVDRDSDS 303
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV-----G 423
+ G L P +A P+L + ++ Y+ + I+ G L + + ++ G
Sbjct: 304 ASTVEFGTSLPPDAVVA--PLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSG 361
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
+ D+G++ T Y+ L S + +SD L A C+ ++ ++V
Sbjct: 362 GIIIDSGTAVTRLQTGIYNSLRDSFLKGTSD-LEKAAGVAMFDTCY--NLSAKTTIEV-- 416
Query: 484 FFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISL 542
T+ HF G K + K ++ P + G CL + S I+G++
Sbjct: 417 --PTVAFHFPGGKMLALPAKNYMIP-----VDSVGTFCLAFAPTAS----SLAIIGNVQQ 465
Query: 543 RGQLVVYDNVNKRIGWAKSHC 563
+G V +D N IG++ + C
Sbjct: 466 QGTRVTFDLANSLIGFSSNKC 486
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 155/383 (40%), Gaps = 57/383 (14%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
Y + +G PP P +DTGSDL W QCDAPC C PLY P + +
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 259 LCMEIQ----RNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNV 313
+C +Q R P C Y Y D +S+ GVLA + L GS T V
Sbjct: 152 MCQALQSPWSRCSPPD-----TGCAYYFSYGDGTSTDGVLATETFTL----GSDTAVRGV 202
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT-NAGGGGY 372
FGC + G N + G++G+ R +SL SQL G+ + +C T NA
Sbjct: 203 AFGCGTENLGSTDN----SSGLVGMGRGPLSLVSQL---GVTR--FSYCFTPFNATAASP 253
Query: 373 MFLGHDLVPSWGMAWVPMLDSP------FMELYHTEILKINYGSSPLNLGARNSQV---- 422
+FLG S P + SP Y+ + I G + L + ++
Sbjct: 254 LFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMG 313
Query: 423 -GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
G + D+G+++T ++A+ L +L L + L +C+ A P V+V
Sbjct: 314 DGGVIIDSGTTFTALEERAFVALARALASRVRLPLA-SGAHLGLSLCFAAASP--EAVEV 370
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI-CLGILDGSEVHNGSTIILGDI 540
+ L LHF + E Y+V + + CLG++ + +LG +
Sbjct: 371 PR----LVLHFDGA------DMELRRESYVVEDRSAGVACLGMVSARGMS-----VLGSM 415
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+ ++YD + + + C
Sbjct: 416 QQQNTHILYDLERGILSFEPAKC 438
>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 880
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 105/392 (26%), Positives = 173/392 (44%), Gaps = 55/392 (14%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDA-PCSSCAKGA-NPL------YKPRMGNI 252
L++T++ +G P + + +D GSD+ W+ CD C+S + G N L Y+P + N
Sbjct: 104 LHYTWIDIGTPNVSFLVALDAGSDMLWVPCDCIECASLSAGNYNVLDRDLNQYRPSLSNT 163
Query: 253 ---LPYKDSLCMEIQRNHKPGYCETCQQ-CDYEIEYAD-HSSSMGVLARDELHLTIENGS 307
LP LC C+ + C Y ++Y+ ++SS G + D+LHLT NG
Sbjct: 164 SRHLPCGHKLC------DVHSVCKGSKDPCPYAVQYSSANTSSSGYVFEDKLHLT-SNGK 216
Query: 308 LTKPN-----VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC 362
+ N ++ GC Q G L DG+LGL +S+PS LA G+I+N C
Sbjct: 217 HAEQNSVQASIILGCGRKQTGEYLRG-AGPDGVLGLGPGNISVPSLLAKAGLIQNSFSIC 275
Query: 363 LTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV 422
N G +F V ++P +D F Y + GS L L Q
Sbjct: 276 FEENE-SGRIIFGDQGHVTQHSTPFLP-IDGKF-NAYIVGVESFCVGS--LCLKETRFQ- 329
Query: 423 GWALFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
AL D+GSS+T+ + Y +++ K+V++ +VL S C+ A + ++ +
Sbjct: 330 --ALIDSGSSFTFLPNEVYQKVVIEFDKQVNATSIVLQNS---WEYCYNAS--SQELISI 382
Query: 482 KQFFKTLTLHFG-SKWQIVSTKFHISPEGYLVISKKGNI-CLGILDGSEVHNGSTIILGD 539
L L F ++ ++ I P S++ I CL + + + +G
Sbjct: 383 ----PPLNLAFSRNQTYLIQNPIFIDPA-----SQEYTIFCLPVSPSDDDYAA----IGQ 429
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHCMNPGRFKS 571
L G +V+D N R W++ +C + F S
Sbjct: 430 NFLMGYRMVFDRENLRFSWSRWNCQDRASFSS 461
>gi|356546446|ref|XP_003541637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 160
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 65/100 (65%), Gaps = 3/100 (3%)
Query: 464 TLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGI 523
+LP+CW+ +S+ DV FK + L F ++ + PE YL+++K G +CLGI
Sbjct: 58 SLPICWKDTKTFKSLHDVTSNFKPIALRFTKS---KNSLLQLQPESYLIVTKHGKVCLGI 114
Query: 524 LDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
LDG+E+ G+T I+GDIS + +LV+YDN +IGWA ++C
Sbjct: 115 LDGTEIGLGNTNIIGDISFQDKLVIYDNEKHQIGWASANC 154
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 104/413 (25%), Positives = 170/413 (41%), Gaps = 51/413 (12%)
Query: 164 IIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYP-DGLYFTYMIVGNPPRPYYLDMDTG 222
+I+ + +++ S NA+ SS I +Y G Y + +G P MDTG
Sbjct: 60 LIKRAIKRGERRMRSINAMLQSSSGI---ETPVYAGSGEYLMNVAIGTPASSLSAIMDTG 116
Query: 223 SDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDSLCMEIQRNHKPGYCETC-QQC 278
SDL W QC+ PC+ C P++ P+ + LP + C ++ E+C C
Sbjct: 117 SDLIWTQCE-PCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPS-------ESCYNDC 168
Query: 279 DYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGL 338
Y Y D SS+ G +A + T E S+ PN+ FGC D QG G++G+
Sbjct: 169 QYTYGYGDGSSTQGYMATET--FTFETSSV--PNIAFGCGEDNQGFGQG---NGAGLIGM 221
Query: 339 SRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY-MFLGHDL--VPSWGMAWVPMLDSPF 395
+SLPSQL +C+T++ + LG VP + + S
Sbjct: 222 GWGPLSLPSQLG-----VGQFSYCMTSSGSSSPSTLALGSAASGVPEGSPSTTLIHSSLN 276
Query: 396 MELYHTEILKINYGSSPLNLGARNSQV-----GWALFDTGSSYTYFTKQAYSELIASLKE 450
Y+ + I G L + + Q+ G + D+G++ TY + AY+ + + +
Sbjct: 277 PTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTD 336
Query: 451 VSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGY 510
+ V D S L C++ S V V + Q ++ E
Sbjct: 337 QINLSPV-DESSSGLSTCFQLPSD-GSTVQVPEI----------SMQFDGGVLNLGEENV 384
Query: 511 LVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
L+ +G ICL + GS G + I G+I + V+YD N + + + C
Sbjct: 385 LISPAEGVICLAM--GSSSQQGIS-IFGNIQQQETQVLYDLQNLAVSFVPTQC 434
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 91/382 (23%), Positives = 155/382 (40%), Gaps = 44/382 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF + VG P + +DTGSD+ W+QC APC C + ++ PR D +
Sbjct: 120 GEYFAQVGVGTPATTALMVLDTGSDVVWLQC-APCRHCYAQSGRVFDPRRSRSYAAVDCV 178
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
+R G C Y++ Y D S + G A + LT G+ + V GC +
Sbjct: 179 APICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASET--LTFARGARVQ-RVAIGCGH 235
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--------TNAGGGG 371
D +GL + G+LGL R ++S P+Q+A +CL ++
Sbjct: 236 DNEGL----FIAASGLLGLGRGRLSFPTQIARS--FGRSFSYCLVDRTSSVRPSSTRSST 289
Query: 372 YMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV-------- 422
F + + G ++ PM +P M Y+ +L + G + + G S +
Sbjct: 290 VTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVK-GVSQSDLRLNPTTGR 348
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
G + D+G+S T + Y + + + + V C+ R +V V
Sbjct: 349 GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCY--NLSGRRVVKV- 405
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDIS 541
T+++H + + PE YL+ + G C + +G I+G+I
Sbjct: 406 ---PTVSMHLAGGASVA-----LPPENYLIPVDTSGTFCFAMAG----TDGGVSIIGNIQ 453
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+G VV+D +R+G+ C
Sbjct: 454 QQGFRVVFDGDAQRVGFVPKSC 475
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 96/389 (24%), Positives = 159/389 (40%), Gaps = 43/389 (11%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL 253
G I G YF + +G PP + DTGSDLTW+QC PC C K +PL+ + +
Sbjct: 77 GLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQC-KPCQQCYKQNSPLFDKKKSSTY 135
Query: 254 PYK--DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK- 310
+ DS + H+ G E+ C Y Y D+S + G +A + + + +GS
Sbjct: 136 KTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSF 195
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG-- 368
P VFGC Y+ G T +GL +SL SQL S I +CL+ A
Sbjct: 196 PGTVFGCGYNNGGTFEETGSGI---IGLGGGPLSLVSQLGSS--IGKKFSYCLSHTAATT 250
Query: 369 -GGGYMFLGHDLVPSWGMAWVPMLDSPFME-----LYHTEILKINYGSSPLNL------- 415
G + LG + +PS L +P ++ Y + + G + L
Sbjct: 251 NGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYGL 310
Query: 416 -GARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFP 474
G + + G + D+G++ T Y + +++E + + L C+++
Sbjct: 311 NGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFKSG-- 368
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGST 534
D + +T+HF + +SP V + +CL ++ +EV
Sbjct: 369 -----DKEIGLPAITMHF------TNADVKLSPINAFVKLNEDTVCLSMIPTTEV----- 412
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I G++ LV YD K + + + C
Sbjct: 413 AIYGNMVQMDFLVGYDLETKTVSFQRMDC 441
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 98/382 (25%), Positives = 166/382 (43%), Gaps = 49/382 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP---RMGNILPYK 256
G YF + VG P Y+ +DTGSD+ W+QC +PC C ++P++ P + +P
Sbjct: 134 GEYFMRLGVGTPATNMYMVLDTGSDVVWLQC-SPCKVCYNQSDPVFNPAKSKTFATVPCG 192
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
LC + + + C Y++ Y D S ++G + + L +G+ +V G
Sbjct: 193 SRLCRRLD-DSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTF---HGARVD-HVALG 247
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY---- 372
C +D +GL V G+LGL R +S PSQ ++ +CL G
Sbjct: 248 CGHDNEGL----FVGAAGLLGLGRGGLSFPSQTKNR--YNGKFSYCLVDRTSSGSSSKPP 301
Query: 373 --MFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV------- 422
+ G+ VP + + P+L +P ++ Y+ ++L I+ G S + G SQ
Sbjct: 302 STIVFGNGAVPKTAV-FTPLLTNPKLDTFYYLQLLGISVGGSRVP-GVSESQFKLDATGN 359
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
G + D+G+S T T+ AY +L++ G P+ + + F + + VK
Sbjct: 360 GGVIIDSGTSVTRLTQSAY----VALRDAFRLGATRLKRAPSYSL-FDTCFDLSGMTTVK 414
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDIS 541
T+ HF + + YL+ ++ +G C GS I+G+I
Sbjct: 415 --VPTVVFHF------TGGEVSLPASNYLIPVNNQGRFCFAFAG----TMGSLSIIGNIQ 462
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+G V YD V R+G+ C
Sbjct: 463 QQGFRVAYDLVGSRVGFLSRAC 484
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 96/379 (25%), Positives = 146/379 (38%), Gaps = 43/379 (11%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI- 252
G G Y + +G P Y + DTGSD TW+QC C K PL+ P +
Sbjct: 155 GRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTY 214
Query: 253 --LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
+ DS C ++ N G C Y ++Y D S ++G A+D LTI + ++
Sbjct: 215 ANVSCTDSACADLDTNGCTG-----GHCLYAVQYGDGSYTVGFFAQDT--LTIAHDAIK- 266
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
FGC GL KT G++GL R K SL Q ++ +CL G
Sbjct: 267 -GFRFGCGEKNNGL----FGKTAGLMGLGRGKTSLTVQAYNK--YGGAFAYCLPALTTGT 319
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTG 430
GY+ G + PML Y+ + I G + + L D+G
Sbjct: 320 GYLDFGPGSAGNNAR-LTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSG 378
Query: 431 SSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTL 490
+ T AY+ L ++ +V L ++ K P SI+D F L+
Sbjct: 379 TVITRLPATAYTALSSAFDKV------------MLARGYK-KAPGYSILDTCYDFTGLS- 424
Query: 491 HFGSKWQIVSTKFH------ISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
+ VS F + G + + +CL S + S I+G+ +
Sbjct: 425 --DVELPTVSLVFQGGACLDVDVSGIVYAISEAQVCLAF--ASNGDDESVAIVGNTQQKT 480
Query: 545 QLVVYDNVNKRIGWAKSHC 563
V+YD K +G+A C
Sbjct: 481 YGVLYDLGKKTVGFAPGSC 499
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 96/379 (25%), Positives = 146/379 (38%), Gaps = 43/379 (11%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI- 252
G G Y + +G P Y + DTGSD TW+QC C K PL+ P +
Sbjct: 155 GRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTY 214
Query: 253 --LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
+ DS C ++ N G C Y ++Y D S ++G A+D LTI + ++
Sbjct: 215 ANVSCTDSACADLDTNGCTG-----GHCLYAVQYGDGSYTVGFFAQDT--LTIAHDAIK- 266
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
FGC GL KT G++GL R K SL Q ++ +CL G
Sbjct: 267 -GFRFGCGEKNNGL----FGKTAGLMGLGRGKTSLTVQAYNK--YGGAFAYCLPALTTGT 319
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTG 430
GY+ G + PML Y+ + I G + + L D+G
Sbjct: 320 GYLDFGPGSAGNNAR-LTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSG 378
Query: 431 SSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTL 490
+ T AY+ L ++ +V L ++ K P SI+D F L+
Sbjct: 379 TVITRLPATAYTALSSAFDKV------------MLARGYK-KAPGYSILDTCYDFTGLS- 424
Query: 491 HFGSKWQIVSTKFH------ISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
+ VS F + G + + +CL S + S I+G+ +
Sbjct: 425 --DVELPTVSLVFQGGACLDVDVSGIVYAISEAQVCLAF--ASNGDDESVAIVGNTQQKT 480
Query: 545 QLVVYDNVNKRIGWAKSHC 563
V+YD K +G+A C
Sbjct: 481 YGVLYDLGKKTVGFAPGSC 499
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 100/392 (25%), Positives = 158/392 (40%), Gaps = 59/392 (15%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
Y ++ VG PPRP L +DTGSDL W QC APC C PL P + LP
Sbjct: 92 YLVHLAVGTPPRPVALTLDTGSDLVWTQC-APCRDCFHQGLPLLDPAASSTYAALPCGAP 150
Query: 259 LCMEIQRNHKPG-----YCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS----LT 309
C + G + + C Y Y D S ++G +A D +NG L
Sbjct: 151 RCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSRLP 210
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT---- 365
+ FGC + +G+ + GI G R + SLPSQL +C T+
Sbjct: 211 TRRLTFGCGHFNKGVFQS---NETGIAGFGRGRWSLPSQLN-----VTTFSYCFTSMFES 262
Query: 366 -----NAGG--GGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGA 417
GG + H S + P+L +P LY + I+ G + L
Sbjct: 263 KSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKT--RLAV 320
Query: 418 RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGL----VLDASDPTLPVCWRAKF 473
+++ + D+G+S T + Y + A + GL V++ S L +C+
Sbjct: 321 PEAKLRSTIIDSGASITTLPEAVYEAVKAEF--AAQVGLPPTGVVEGS--ALDLCF--AL 374
Query: 474 PIRSIVDVKQFFKTLTLHF-GSKWQIVSTKFHISPEG-YLVISKKGNICLGILDGSEVHN 531
P+ ++ + +LTLH G+ W++ P G Y+ + +LD +
Sbjct: 375 PVTALWR-RPPVPSLTLHLDGADWEL--------PRGNYVFEDLAARVMCVVLDAAP--- 422
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G ++G+ + VVYD N + +A + C
Sbjct: 423 GDQTVIGNFQQQNTHVVYDLENDWLSFAPARC 454
>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
Length = 478
Score = 92.8 bits (229), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 160/365 (43%), Gaps = 40/365 (10%)
Query: 212 PRPYYLDMDTGSDLTWIQCDAPCSSC-AKGANPLYK-PRMGNILPYKDSLCMEIQRNHKP 269
+ + L +DTGS T++ C C+SC A A Y + + S C I
Sbjct: 44 AQTFELIVDTGSSRTYLPCKG-CASCGAHEAGRYYDYDASADFSRVECSACAGI-----G 97
Query: 270 GYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTL 329
G C T C Y++ Y + S S G L RD + L GS+ VVFGC + G +
Sbjct: 98 GKCGTSGVCRYDVHYLEGSGSEGYLVRDVVSL---GGSVGNATVVFGCEERELGSIKQQ- 153
Query: 330 VKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-----TNAGGGGYMFLGH-DL-VPS 382
DG+ G R +L +QLAS +I ++ C+ + GG + LG+ D +
Sbjct: 154 -SADGLFGFGRQAYALRAQLASASVIDDLFSMCVEGYEKLSGEHVGGLLTLGNFDFGADA 212
Query: 383 WGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYS 442
+ + PM+ S Y G+S + G+R + D+G+SYTY ++
Sbjct: 213 PALVYTPMVSSAM--YYQVTTTSWTLGNSVVE-GSRGV---LTIIDSGTSYTYVPGNMHA 266
Query: 443 ELIASLKEVSSD-GLVLDASDPTLP-VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVS 500
+ ++ + + GL A P +C+ + V ++F L + + S
Sbjct: 267 RFLQLAEDAARESGLEKVAPPEDYPDLCFGNSGGL-GWSTVSEYFPALKIEYHG-----S 320
Query: 501 TKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGW 558
+ +SPE YL +K C+GIL+ H+ + I+LG I++R +D ++G
Sbjct: 321 ARLTLSPETYLYWHQKNASAFCVGILE----HDDNRILLGQITMRNTFTEFDVARSQVGM 376
Query: 559 AKSHC 563
A ++C
Sbjct: 377 ASANC 381
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 92.8 bits (229), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 72/227 (31%), Positives = 114/227 (50%), Gaps = 22/227 (9%)
Query: 159 SVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGL-YFTYMIVGNPPRPYYL 217
+++D +R ++++ +K+VSS++V V I PL + L Y M +G + +
Sbjct: 103 TLDDLHVRSMQNRL-RKMVSSHSVEVSQIQI-PLASGVNFQTLNYIVTMELGG--QDMTV 158
Query: 218 DMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYKDSLCMEIQ-RNHKPGYCE 273
+DTGSDLTW+QC+ PC SC P++KP + +P S C +Q G CE
Sbjct: 159 IIDTGSDLTWVQCE-PCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACE 217
Query: 274 TC-QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKT 332
+ C Y + Y D S + G L + L G ++ N VFGC + +GL
Sbjct: 218 SNPSNCSYAVNYGDGSYTNGELGAEHLSF----GGISVSNFVFGCGKNNKGL----FGGV 269
Query: 333 DGILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHD 378
G++GL R+ +SL SQ S V +CL T+AG G + +G++
Sbjct: 270 SGLMGLGRSNLSLISQTNS--TFGGVFSYCLPPTDAGASGSLAMGNE 314
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 97/346 (28%), Positives = 142/346 (41%), Gaps = 48/346 (13%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
DG Y M +G P R Y +DTGSDL W QC APC C P + P Y+
Sbjct: 87 DGEYLMEMGIGTPTRYYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDPARSAT--YRSL 143
Query: 259 LCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
C N Y C Q C Y+ Y D +S+ GVLA + ++ P + FG
Sbjct: 144 GCASPACNAL--YYPLCYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFG 201
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG------- 369
C G L N G++G R +SL SQL S +CLT+
Sbjct: 202 CGNLNAGSLAN----GSGMVGFGRGSLSLVSQLGSPRF-----SYCLTSFLSPVPSRLYF 252
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGS-----SPLNLGARNSQ-V 422
G Y L S + P + +P + +Y + I+ G P ++
Sbjct: 253 GVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGT 312
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGL--VLDASDPTLPVCWRAKFPIRSIVD 480
G + D+G++ TY + AY + A+ + L V DAS L C++ P R V
Sbjct: 313 GGTIIDSGTTITYLAEPAYDAVRAAFASQITLPLLNVTDAS--VLDTCFQWPPPPRQSVT 370
Query: 481 VKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVI--SKKGNICLGI 523
+ Q L LHF G+ W+ + + Y+++ S G +CL +
Sbjct: 371 LPQ----LVLHFDGADWE-------LPLQNYMLVDPSTGGGLCLAM 405
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 98/386 (25%), Positives = 159/386 (41%), Gaps = 54/386 (13%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCM 261
Y + VG PP+P +DTGSDL W QC APC+SC +P++ P G Y+ C
Sbjct: 104 YLVDLAVGTPPQPVSALLDTGSDLIWTQC-APCASCLPQPDPIFSP--GASSSYEPMRCA 160
Query: 262 -EIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL-----TIENGSLTKPNVVF 315
E+ + C+ C Y Y D +++ GV A + E L+ P + F
Sbjct: 161 GELCNDILHHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAP-LGF 219
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC +G L N GI+G RA +SL SQLA + +CLT A G L
Sbjct: 220 GCGTMNKGSLNN----GSGIVGFGRAPLSLVSQLAIRRF-----SYCLTPYASGRKSTLL 270
Query: 376 GHDLVPSWGMAWVPMLDSPFM-------ELYHTEILKINYGSSPLNL-----GARNSQVG 423
L A + + + Y+ + G+ L + R G
Sbjct: 271 FGSLRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSG 330
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKE-----VSSDGLVLDASDPTLPVCWRAKFPIRSI 478
A+ D+G++ T F +E++ + + +++G +S P VC+ A S
Sbjct: 331 GAIVDSGTALTLFPAPVLAEVVRAFRSQLRLPFAANG----SSGPDDGVCFAAA---ASR 383
Query: 479 VDVKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIIL 537
V + H G+ + + + + +KGN+CL + D + +G+TI
Sbjct: 384 VPRPAVVPRMVFHLQGADLDLPRRNYVLDDQ------RKGNLCLLLADSGD--SGTTI-- 433
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHC 563
G+ + V+YD + +A + C
Sbjct: 434 GNFVQQDMRVLYDLEADTLSFAPAQC 459
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 92.4 bits (228), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 90/390 (23%), Positives = 162/390 (41%), Gaps = 53/390 (13%)
Query: 191 PLRGNIYP-DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRM 249
PLR + G YF + VG PPR + DTGSD+ W+QC PC SC +PL+ P
Sbjct: 69 PLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQC-LPCQSCYGQTDPLFNPSF 127
Query: 250 GNI---LPYKDSLCMEI-----QRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL 301
+ + SLC ++ +RN QC Y++ Y D S ++G + + L
Sbjct: 128 SSTFQSITCGSSLCQQLLIRGCRRN----------QCLYQVSYGDGSFTVGEFSTETLSF 177
Query: 302 TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
GS +V GC ++ QGL L + +S PSQ+ + +V +
Sbjct: 178 ----GSNAVNSVAIGCGHNNQGLFTGAAGLLG----LGKGLLSFPSQVGQ--LYGSVFSY 227
Query: 362 CLTTNAGGGGY-MFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGA-- 417
CL T G + G+ V S + +L +P ++ Y+ E++ I G + +N+ A
Sbjct: 228 CLPTRESTGSVPLIFGNQAVAS-NAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGS 286
Query: 418 ----RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKF 473
++ G + D+G++ T AY+ + + + G+ DA + + +
Sbjct: 287 LSLDSSTGNGGVILDSGTAVTRLVTSAYNPMRDAFRA----GMPSDAKMTSGFSLFDTCY 342
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGS 533
+ + + + G+ + + + + G CL SE +
Sbjct: 343 DLSGRSSIMLPAVSFVFNGGATMALPAQNIMVP------VDNSGTYCLAFAPNSENFS-- 394
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G+I + + +D+ R+G + C
Sbjct: 395 --IIGNIQQQSFRMSFDSTGNRVGIGANQC 422
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 92.4 bits (228), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 94/376 (25%), Positives = 155/376 (41%), Gaps = 42/376 (11%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPC--SSCAKGANPLYKPRMGN---IL 253
+G Y + +G P DTGSDLTW+QC +PC + C PLY P + +L
Sbjct: 93 NGNYLMRIYIGTPSVERLAIADTGSDLTWVQC-SPCDNTKCFAQNTPLYDPLNSSTFTLL 151
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
P C ++ + C C Y Y D+S S G L+ D + L + +
Sbjct: 152 PCDSQPCTQLPYSQY--VCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLLQLHYNS-KI 208
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--TNAGGGG 371
FGC + Q + KT GI+GL +SL SQL + I + +CL ++
Sbjct: 209 CFGCGF-QNKFTADKSGKTTGIVGLGAGPLSLVSQLGDE--IGHKFSYCLLPFSSNSNSK 265
Query: 372 YMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGS 431
F +V G+ P++ P + Y+ + I G+ + G + + + D+GS
Sbjct: 266 LKFGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVKTGQTDGNI---IIDSGS 322
Query: 432 SYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP----VCWRAKFPIRSIVDVKQFFKT 487
+ TY + Y+E ++ +KE V D +P C+ K + + DV
Sbjct: 323 TLTYLEESFYNEFVSLVKET-----VAVEEDQYIPYPFDFCFTYKEGMSTPPDV------ 371
Query: 488 LTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLV 547
HF + +V + P LV+ + IC ++ H I G++ V
Sbjct: 372 -VFHF-TGGDVV-----LKPMNTLVLIEDNLICSTVVPS---HFDGIAIFGNLGQIDFHV 421
Query: 548 VYDNVNKRIGWAKSHC 563
YD ++ +A + C
Sbjct: 422 GYDIQGGKVSFAPTDC 437
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 92.4 bits (228), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 87/285 (30%), Positives = 123/285 (43%), Gaps = 37/285 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKG--ANPLYKPRMGNIL---PY 255
L+F VG PP P + MDTGS L WIQC PC C+ +P++ P + +
Sbjct: 67 LFFVNFSVGQPPVPQFTIMDTGSSLLWIQCH-PCKHCSSNHMIHPVFNPALSSTFVECSC 125
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS--LTKPNV 313
D C R G+C + +C YE Y + S GVLA++ L T NG+ +T+P +
Sbjct: 126 DDRFC----RYAPNGHCSS-NKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQP-I 179
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYM 373
FGC ++ L + GILGL SL QL S+ +G N G +
Sbjct: 180 AFGCGHENGEQLESEFT---GILGLGAKPTSLAVQLGSK--FSYCIGDLANKNYGYNQ-L 233
Query: 374 FLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL-----GARNSQVGWALFD 428
LG D G P+ +Y+ + I+ G LN+ R S+ G + D
Sbjct: 234 VLGED-ADILGDP-TPIEFETENGIYYMNLEGISVGDKQLNIEPVVFKRRGSRTG-VILD 290
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKF 473
TG+ YT+ AY EL +K + DP L W F
Sbjct: 291 TGTLYTWLADIAYRELYNEIKSI---------LDPKLERFWFRDF 326
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 92.4 bits (228), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 101/405 (24%), Positives = 171/405 (42%), Gaps = 55/405 (13%)
Query: 189 IFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR 248
+ P+ G Y + VG P L +DT SDLTW+QC PC C + P++ PR
Sbjct: 128 VAPVVSRAPTSGDYIAKIAVGTPAVEALLALDTASDLTWLQCQ-PCRRCYPQSGPVFDPR 186
Query: 249 MGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADH------SSSMGVLARDEL 299
+ Y C + R+ G C Y + Y D S+S+G L + L
Sbjct: 187 HSTSYGEMNYDAPDCQALGRSG--GGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETL 244
Query: 300 HLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVV 359
G + + + GC +D +GL GILGLSR ++S+P Q+A G +
Sbjct: 245 TFA---GGVRQAYLSIGCGHDNKGLF---GAPAAGILGLSRGQISIPHQIAFLGYNAS-F 297
Query: 360 GHCLTTNAGGGG-----YMFLGHDLVPSWGMAWVP-MLDSPFMELYHTEILKINYGSSPL 413
+CL G G F + S ++ P +L+ Y+ ++ ++ G +
Sbjct: 298 SYCLVDFISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRV 357
Query: 414 -NLGARNSQV------GWALFDTGSSYTYFTKQAYS-------ELIASLKEVSSDGLVLD 459
+ R+ Q+ G + D+G++ T + AY+ L +VS+ G
Sbjct: 358 PGVTERDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPS-G 416
Query: 460 ASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGN 518
D V RA +R V V +++HF ++ + P+ YL+ + +G
Sbjct: 417 LFDTCYTVGGRAG--LRHCVKV----PAVSMHFAGGVEL-----SLQPKNYLITVDSRGT 465
Query: 519 ICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+C + + S ++G+I +G VVYD +R+G+A + C
Sbjct: 466 VCFAF---AGTGDRSVSVIGNILQQGFRVVYDIGGQRVGFAPNSC 507
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 92.4 bits (228), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 154/383 (40%), Gaps = 57/383 (14%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
Y + +G PP P +DTGSDL W QCDAPC C PLY P + +
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 259 LCMEIQ----RNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNV 313
+C +Q R P C Y Y D +S+ GVLA + L GS T V
Sbjct: 152 MCQALQSPWSRCSPPD-----TGCAYYFSYGDGTSTDGVLATETFTL----GSDTAVRGV 202
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT-NAGGGGY 372
FGC + G N + G++G+ R +SL SQL G+ + +C T NA
Sbjct: 203 AFGCGTENLGSTDN----SSGLVGMGRGPLSLVSQL---GVTR--FSYCFTPFNATAASP 253
Query: 373 MFLGHDLVPSWGMAWVPMLDSPF------MELYHTEILKINYGSSPLNLGARNSQV---- 422
+FLG S P + SP Y+ + I G + L + ++
Sbjct: 254 LFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMG 313
Query: 423 -GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
G + D+G+++T + A+ L +L L + L +C+ A P V+V
Sbjct: 314 DGGVIIDSGTTFTALEESAFVALARALASRVRLPLA-SGAHLGLSLCFAAASP--EAVEV 370
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI-CLGILDGSEVHNGSTIILGDI 540
+ L LHF + E Y+V + + CLG++ + +LG +
Sbjct: 371 PR----LVLHFDGA------DMELRRESYVVEDRSAGVACLGMVSARGMS-----VLGSM 415
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+ ++YD + + + C
Sbjct: 416 QQQNTHILYDLERGILSFEPAKC 438
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 92.4 bits (228), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 95/382 (24%), Positives = 158/382 (41%), Gaps = 40/382 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y + VG PPR + + MDTGSDL W+QC APC C P++ P +
Sbjct: 148 GEYLVEVYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFDQRGPVFDPMASTSYRNVTCG 206
Query: 257 DSLCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPN-V 313
D+ C + P C + + C Y Y D S++ G LA + + + S + + V
Sbjct: 207 DTRCGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVDGV 266
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG-GGY 372
V GC + +GL L R +S SQL + + + +CL + G
Sbjct: 267 VLGCGHRNRGLFHGAAGLLG----LGRGPLSFASQL--RAVYGHAFSYCLVDHGSAVGSK 320
Query: 373 MFLGHDLV----PSWG-MAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV----- 422
+ G D V P A+ P + Y+ ++ I G L++ + V
Sbjct: 321 IVFGDDNVLLSHPQLNYTAFAP--SAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDG 378
Query: 423 -GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
G + D+G++ +YF + AY + + + L A P L C+ R V+V
Sbjct: 379 SGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVER--VEV 436
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
+F +L G+ W + + I + +G +CL +L + I+G+
Sbjct: 437 PEF--SLLFADGAVWDFPAENYFIR------LDTEGIMCLAVLG---TPRSAMSIIGNYQ 485
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+ V+YD + R+G+A C
Sbjct: 486 QQNFHVLYDLHHNRLGFAPRRC 507
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 92.4 bits (228), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 92/380 (24%), Positives = 163/380 (42%), Gaps = 56/380 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YFT + VGNP R +Y+ +DTGSD+ W+QC PC+ C + +P++ P + Y
Sbjct: 18 GEYFTRVGVGNPARQFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPTASST--YAPVT 74
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
C Q + QC Y++ Y D S + G A + + +GS+ NV GC +
Sbjct: 75 CQSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFG-NSGSV--KNVALGCGH 131
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMF----- 374
D +GL V G+LGL +SL +QL + +CL G
Sbjct: 132 DNEGL----FVGAAGLLGLGGGPLSLTNQLKATSF-----SYCLVNRDSAGSSTLDFNSA 182
Query: 375 -LGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV-----GWALF 427
LG D V + P++ + ++ Y+ + ++ G +++ ++ G +
Sbjct: 183 QLGVDSVTA------PLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIV 236
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCW--RAKFPIRSIVDVKQFF 485
D G++ T QAY+ L + ++ + L L ++ C+ + +R
Sbjct: 237 DCGTAITRLQTQAYNPLRDAFVRMTQN-LKLTSAVALFDTCYDLSGQASVR--------V 287
Query: 486 KTLTLHF--GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
T++ HF G W + + + I + G C + S I+G++ +
Sbjct: 288 PTVSFHFADGKSWNLPAANYLIP------VDSAGTYCFAFAPTTS----SLSIIGNVQQQ 337
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
G V +D N R+G++ + C
Sbjct: 338 GTRVTFDLANNRMGFSPNKC 357
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 100/380 (26%), Positives = 160/380 (42%), Gaps = 51/380 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YFT + VG P R Y+ +DTGSD+ W+QC APC C ++P++ PR
Sbjct: 140 GEYFTRLGVGTPARYVYMVLDTGSDIVWLQC-APCRRCYSQSDPIFDPRKSKTYATIPCS 198
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
+R G + C Y++ Y D S ++G + + LT + V GC +
Sbjct: 199 SPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTET--LTFRRNRVK--GVALGCGH 254
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG--GYMFLGH 377
D +GL V G+LGL + K+S P Q + +CL + + G+
Sbjct: 255 DNEGL----FVGAAGLLGLGKGKLSFPGQTGHR--FNQKFSYCLVDRSASSKPSSVVFGN 308
Query: 378 DLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQVGWALF--------- 427
V S + P+L +P ++ Y+ +L I+ G G R V +LF
Sbjct: 309 AAV-SRIARFTPLLSNPKLDTFYYVGLLGISVG------GTRVPGVTASLFKLDQIGNGG 361
Query: 428 ---DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
D+G+S T + AY + + + V + L C F + ++ +VK
Sbjct: 362 VIIDSGTSVTRLIRPAYIAMRDAFR-VGAKTLKRAPDFSLFDTC----FDLSNMNEVK-- 414
Query: 485 FKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
T+ LHF G+ + +T + I + G C G I+G+I +
Sbjct: 415 VPTVVLHFRGADVSLPATNYLIP------VDTNGKFCFAFAG----TMGGLSIIGNIQQQ 464
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
G VVYD + R+G+A C
Sbjct: 465 GFRVVYDLASSRVGFAPGGC 484
>gi|147839328|emb|CAN63378.1| hypothetical protein VITISV_015700 [Vitis vinifera]
Length = 585
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 76/275 (27%), Positives = 126/275 (45%), Gaps = 33/275 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCA--KGAN-------PLYKPRMGN 251
L++T + +G P + + + +DTGSDL W+ CD CS CA +G +Y P+ +
Sbjct: 102 LHYTTVSLGTPGKKFLVALDTGSDLFWVPCD--CSRCAPTEGTTYASDFELSIYNPKGSS 159
Query: 252 I---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYAD-HSSSMGVLARDELHLTIENG- 306
+ +SLC RN G T C Y + Y +S+ G+L D LHLT E+
Sbjct: 160 TSRKVTCNNSLCA--HRNRCLG---TFSNCPYMVSYVSAETSTSGILVEDVLHLTTEDNR 214
Query: 307 -SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
+ V FGC Q G L+ + +G+ GL K+S+PS L+ +G + C
Sbjct: 215 QEFVEAYVTFGCGQVQTGSFLD-IAAPNGLFGLGLEKISVPSILSKEGFTADSFSMCFGP 273
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
+ G G + G P P + Y+ + ++ G++ ++L A
Sbjct: 274 D--GIGRISFGDKGGPD--QEETPFNLNALHPTYNITVTQVRVGTTLIDLDFT------A 323
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDA 460
LFD+G+S+TY Y+ ++ S + + +V A
Sbjct: 324 LFDSGTSFTYLVDPIYTNVLKSSELIYCMAVVRSA 358
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 92/376 (24%), Positives = 141/376 (37%), Gaps = 34/376 (9%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL 253
G G Y + +G P Y + DTGSD TW+QC C K L+ P +
Sbjct: 174 GRALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSST- 232
Query: 254 PYKDSLCMEIQRNHKPGYCETCQ--QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP 311
Y + C + Y C C Y ++Y D S S+G A D L L+ +
Sbjct: 233 -YANVSCAAPACSDL--YTRGCSGGHCLYSVQYGDGSYSIGFFAMDTLTLSSYD---AVK 286
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG 371
FGC +GL + G+LGL R K SLP Q + V HCL + G G
Sbjct: 287 GFRFGCGERNEGL----FGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSSGTG 340
Query: 372 YMFLGHDLVPSWGMAW-VPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTG 430
Y+ G + G PML Y+ + I G L++ + D+G
Sbjct: 341 YLDFGPGSPAAVGARQTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFSTAGTIVDSG 400
Query: 431 SSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTL 490
+ T AYS L ++ + K P S++D F ++
Sbjct: 401 TVITRLPPAAYSSLRSAFASA-------------MAARGYKKAPALSLLDTCYDFTGMSE 447
Query: 491 HFGSKWQIV---STKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLV 547
K ++ ++ G + + +CLG + + I+G+ L+ V
Sbjct: 448 VAIPKVSLLFQGGAYLDVNASGIMYAASLSQVCLGFAANED--DDDVGIVGNTQLKTFGV 505
Query: 548 VYDNVNKRIGWAKSHC 563
VYD K +G++ C
Sbjct: 506 VYDIGKKTVGFSPGAC 521
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 70/257 (27%), Positives = 116/257 (45%), Gaps = 31/257 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR-----MGNILP 254
G YF + VG+PPR Y+ +D+GSD+ W+QC PC+ C +PL+ P MG +
Sbjct: 41 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCK-PCTQCYHQTDPLFDPADSASFMG--VS 97
Query: 255 YKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
++C ++ C + +C YE+ Y D SS+ G LA + L L G NV
Sbjct: 98 CSSAVCDQVDNAG----CNS-GRCRYEVSYGDGSSTKGTLALETLTL----GRTVVQNVA 148
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA-GGGGYM 373
GC + QG+ + L +S QL+ + N +CL + G++
Sbjct: 149 IGCGHMNQGMFVGAAGLLG----LGGGSMSFVGQLSRE--RGNAFSYCLVSRVTNSNGFL 202
Query: 374 FLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQV-----GWALF 427
G + +P G AW+P++ +P Y+ + + G + + ++ G +
Sbjct: 203 EFGSEAMP-VGAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGNGGVVM 261
Query: 428 DTGSSYTYFTKQAYSEL 444
DTG++ T F AY
Sbjct: 262 DTGTAVTRFPTVAYEAF 278
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 82/290 (28%), Positives = 129/290 (44%), Gaps = 28/290 (9%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPY 255
DG Y VG PP Y +DTGSD+ W+QC PC C ++ P N ILP+
Sbjct: 83 DGEYLISYSVGIPPFQLYGIIDTGSDMIWLQCK-PCEKCYNQTTRIFDPSKSNTYKILPF 141
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVV 314
+ C ++ + + C+Y I Y D S S G L+ + L L NGS K V
Sbjct: 142 SSTTCQSVEDTSCSS--DNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTV 199
Query: 315 FGCAYDQQGLLLNTLV---KTDGILGLSRAKVSLPSQLASQ-GIIKNVVGHCLTTNAGGG 370
GC + NT+ K+ GI+GL VSL +QL + I +CL + +
Sbjct: 200 IGCGRN------NTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSNIS 253
Query: 371 GYMFLGHDLVPSW-GMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA---RNSQVGWAL 426
+ G V S G P++ Y+ + + G++ + + R + G +
Sbjct: 254 SKLNFGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGEKGNII 313
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLD-ASDP--TLPVCWRAKF 473
D+G++ T YS+L +++ +D + LD DP L +C+R+ F
Sbjct: 314 IDSGTTLTLLPNDIYSKLESAV----ADLVELDRVKDPLKQLSLCYRSTF 359
>gi|297820902|ref|XP_002878334.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
gi|297324172|gb|EFH54593.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
Length = 362
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 72/219 (32%), Positives = 104/219 (47%), Gaps = 31/219 (14%)
Query: 185 DSSSIFPLRGNIYPD----GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKG 240
DS S+ R +Y D G Y T + +G PP+ + L +D+GS +T++ C + C C K
Sbjct: 71 DSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKH 129
Query: 241 ANPLYKPR-------MGNILPYKDSLCM---------EIQRNHKPGYCE---TC----QQ 277
L P+ + +K S + E+ ++P C C +Q
Sbjct: 130 QVMLSSPKDQILCLVSCKVQIFKISYGLFDEDPKFQPELSSTYQPVKCNMDCNCDDDKEQ 189
Query: 278 CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILG 337
C YE EYA+HSSS GVL D + E+ LT VFGC + G L + + DGI+G
Sbjct: 190 CVYEREYAEHSSSKGVLGEDLISFGNES-HLTPQRAVFGCKTVETGDLYSQ--RADGIIG 246
Query: 338 LSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
L + +SL QL +G+I N G C GGG M +G
Sbjct: 247 LGQGDLSLVGQLVDKGLISNSFGLCYGGLDVGGGSMIVG 285
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 92.0 bits (227), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 78/261 (29%), Positives = 117/261 (44%), Gaps = 32/261 (12%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS--SCAKGANPLYKPRMGN---ILPYK 256
Y + +G P L++DTGSDL+W+QC PC+ +C +PL+ P + +P
Sbjct: 140 YVVTVSLGTPGVAQTLEVDTGSDLSWVQCT-PCAAPACYSQKDPLFDPAQSSSYAAVPCG 198
Query: 257 DSLCMEIQRNHKPGYCETCQ--QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
+C + Y +C QC Y + Y D S + GV + D L L+ +
Sbjct: 199 GPVCGGLGI-----YASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTLSPND---AVRGFF 250
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMF 374
FGC + Q G N DG+LGL R + SL Q A G V +CL T GY+
Sbjct: 251 FGCGHAQSGFTGN-----DGLLGLGREEASLVEQTA--GTYGGVFSYCLPTRPSTTGYLT 303
Query: 375 LGHDLVPSW----GMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQVGWALFDT 429
LG PS G + +L SP Y+ +L I+ G L++ + G + DT
Sbjct: 304 LGG---PSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSV-PSSVFAGGTVVDT 359
Query: 430 GSSYTYFTKQAYSELIASLKE 450
G+ T AY+ L ++ +
Sbjct: 360 GTVITRLPPTAYAALRSAFRS 380
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 92.0 bits (227), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 106/408 (25%), Positives = 163/408 (39%), Gaps = 51/408 (12%)
Query: 168 HKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTW 227
H + I++K S NA +D +S G Y + +G PP P DTGSDL W
Sbjct: 69 HFTDISQKDASDNAPQIDLTS---------NSGEYLMNISLGTPPFPIMAIADTGSDLLW 119
Query: 228 IQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRN--HKPGYCETCQQ-CDYEIEY 284
QC PC C +PL+ P+ + YKD C Q C T C Y Y
Sbjct: 120 TQC-KPCDDCYTQVDPLFDPKASST--YKDVSCSSSQCTALENQASCSTEDNTCSYSTSY 176
Query: 285 ADHSSSMGVLARDELHL-TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKV 343
D S + G +A D L L + + + N++ GC ++ G K GI+GL V
Sbjct: 177 GDRSYTKGNIAVDTLTLGSTDTRPVQLKNIIIGCGHNNAGTFNK---KGSGIVGLGGGAV 233
Query: 344 SLPSQLASQGIIKNVVGHCL----TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELY 399
SL +QL I +CL + N F + +V G+ P++ Y
Sbjct: 234 SLITQLGDS--IDGKFSYCLVPLTSENDRTSKINFGTNAVVSGTGVVSTPLIAKSQETFY 291
Query: 400 HTEILKINYGSSPLNLGARNSQVGWA--LFDTGSSYTYFTKQAYSELIASLKEVSSDGLV 457
+ + I+ GS + +S G + D+G++ T + YSEL V+S
Sbjct: 292 YLTLKSISVGSKEVQYPGSDSGSGEGNIIIDSGTTLTLLPTEFYSEL---EDAVASSIDA 348
Query: 458 LDASDPT--LPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISK 515
DP L +C+ A ++ +T+HF ++ P V
Sbjct: 349 EKKQDPQTGLSLCYSATGDLK--------VPAITMHFD------GADVNLKPSNCFVQIS 394
Query: 516 KGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ +C GS S I G+++ LV YD V+K + + + C
Sbjct: 395 EDLVCFA-FRGSP----SFSIYGNVAQMNFLVGYDTVSKTVSFKPTDC 437
>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 91.7 bits (226), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 157/387 (40%), Gaps = 63/387 (16%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNH 267
+G PP+P L +DTGS L+WIQC K PL KP+ + P S + NH
Sbjct: 72 IGTPPQPTDLVLDTGSQLSWIQCHD--KKIKKRLPPLPKPKTTSFDPSLSSSFSLLPCNH 129
Query: 268 K-----------PGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
P C+ + C Y YAD + + G L R++ + SL+ P V+ G
Sbjct: 130 PICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFS---KSLSTPPVILG 186
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG--GYMF 374
CA + GILG++R ++S SQ +C+ + G G +
Sbjct: 187 CA--------QASTENRGILGMNRGRLSFISQAKISKF-----SYCVPSRTGSNPTGLFY 233
Query: 375 LGHDLVPSWGMAWVPMLDSPFME--------LYHTEILKINYGSSPLNLGARNSQ----- 421
LG D S +V ML P + Y + I LN+ +
Sbjct: 234 LG-DNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNVPPAAFKPDAGG 292
Query: 422 VGWALFDTGSSYTYFTKQAY----SELIASLKEVSSDGLV-LDASDPTLPVCWRAKFPIR 476
G + D+GS TY +AY E++ + + G V D +D +C+ A
Sbjct: 293 SGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVAD----MCFDAGV--- 345
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
+V + ++ F + +I F EG L +KG C+GI + GS II
Sbjct: 346 -TAEVGRRIGGISFEFDNGVEI----FVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNII 400
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
G + + V YD NKR+G+ + C
Sbjct: 401 -GTVHQQNMWVEYDLANKRVGFGGAEC 426
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 91.7 bits (226), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 92/317 (29%), Positives = 133/317 (41%), Gaps = 44/317 (13%)
Query: 169 KSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWI 228
++ I+K+L SSN +I L+ VG PP P MDTGS L WI
Sbjct: 71 QNSIDKELGSSNFQVDVEQAI--------KTSLFLVNFSVGQPPVPQLTIMDTGSSLLWI 122
Query: 229 QCDAPCSSCAKG--ANPLYKPRMGNIL---PYKDSLCMEIQRNHKPGYCETCQQCDYEIE 283
QC PC C+ +P++ P + + D C R G+C + +C YE
Sbjct: 123 QCQ-PCKHCSSDHMIHPVFNPALSSTFVECSCDDRFC----RYAPNGHCGSSNKCVYEQV 177
Query: 284 YADHSSSMGVLARDELHLTIENGS--LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRA 341
Y + S GVLA++ L T NG+ +T+P + FGC Y+ L + GILGL
Sbjct: 178 YISGTGSKGVLAKERLTFTTPNGNTVVTQP-IAFGCGYENGEQLESHFT---GILGLGAK 233
Query: 342 KVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHT 401
SL QL S+ +G N G + LG D G P+ +Y+
Sbjct: 234 PTSLAVQLGSK--FSYCIGDLANKNYGYNQ-LVLGED-ADILGDP-TPIEFETENSIYYM 288
Query: 402 EILKINYGSSPLNL-----GARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGL 456
+ I+ G + LN+ R + G + D+G+ YT+ AY EL +K +
Sbjct: 289 NLEGISVGDTQLNIEPVVFKRRGPRTG-VILDSGTLYTWLADIAYRELYNEIKSI----- 342
Query: 457 VLDASDPTLPVCWRAKF 473
DP L W F
Sbjct: 343 ----LDPKLERFWFRDF 355
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 109/412 (26%), Positives = 171/412 (41%), Gaps = 55/412 (13%)
Query: 174 KKLVSSNAVAVDSSSIFPLRGNIYPDGL-YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDA 232
K + SS S + PL I + L Y + +G + L +DTGSDLTW+QC
Sbjct: 109 KAMTSSTTEQSVSETQIPLTSGIKLETLNYIVTVELGG--KNMSLIVDTGSDLTWVQCQ- 165
Query: 233 PCSSCAKGANPLYKPRMGNILPYKDSLC--------MEIQRNHKP-----GYCETCQQCD 279
PC SC PLY P + + YK C + N P G +T C+
Sbjct: 166 PCRSCYNQQGPLYDPSVSS--SYKTVFCNSSTCQDLVAATGNSGPCGGFNGVVKT--TCE 221
Query: 280 YEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLS 339
Y + Y D S + G LA + + L G N+VFGC + +GL G++GL
Sbjct: 222 YVVSYGDGSYTRGDLASESIVL----GDTKLENLVFGCGRNNKGLFGG----ASGLMGLG 273
Query: 340 RAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHDLV---PSWGMAWVPMLDSPF 395
R+ VSL SQ V +CL + G G + G+D S + + P++ +P
Sbjct: 274 RSSVSLVSQTLK--TFNGVFSYCLPSLEDGASGTLSFGNDFSVYKNSTSVFYTPLVQNPQ 331
Query: 396 MELYHTEILKINYGSSPLNLGARNSQVGWA-LFDTGSSYTYFTKQAYSELIAS-LKEVSS 453
+ ++ IL + G+S + + G L D+G+ T Y + LK+ S
Sbjct: 332 LRSFY--ILNLT-GASIGGVELKTLSFGRGILIDSGTVITRLPPSIYKAVKTEFLKQFS- 387
Query: 454 DGLVLDASDPTLPVCWR-AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV 512
G L C+ + SI +K F+ + + + + + P+ LV
Sbjct: 388 -GFPSAPGYSILDTCFNLTSYEDISIPTIKMIFEG---NAELEVDVTGVFYFVKPDASLV 443
Query: 513 ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCM 564
CL + S + I+G+ + Q V+YD +R+G A +CM
Sbjct: 444 -------CLAL--ASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIAGENCM 486
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 86/371 (23%), Positives = 156/371 (42%), Gaps = 38/371 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKD-- 257
G Y + +G P + + L DTGSDLTW QC+ C P + P YK+
Sbjct: 138 GAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDPTTST--SYKNVS 195
Query: 258 ---SLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
C I + P C Y I+Y ++G LA + L + S N +
Sbjct: 196 CSSEFCKLIAEGNYPAQDCISNTCLYGIQYGS-GYTIGFLATETLAIA---SSDVFKNFL 251
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMF 374
FGC+ + +G T T G+LGL R+ ++LPSQ ++ KN+ +CL + G++
Sbjct: 252 FGCSEESRG----TFNGTTGLLGLGRSPIALPSQTTNK--YKNLFSYCLPASPSSTGHLS 305
Query: 375 LGHDLVPSWGMAWVPMLDSP-FMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSY 433
G ++ S P+ SP +LY + I+ L + N + + D+G+++
Sbjct: 306 FGVEV--SQAAKSTPI--SPKLKQLYGLNTVGISVRGRELPI---NGSISRTIIDSGTTF 358
Query: 434 TYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFG 493
T+ YS L ++ +E+ ++ + + + P + +I + +++ F
Sbjct: 359 TFLPSPTYSALGSAFREMMANYTLTNGTSSFQPC-----YDFSNIGNGTLTIPGISIFFE 413
Query: 494 SKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNV 552
++ I G ++ ++ +CL D + I G+ + V+YD
Sbjct: 414 GGVEV-----EIDVSGIMIPVNGLKEVCLAFADTG--SDSDFAIFGNYQQKTYEVIYDVA 466
Query: 553 NKRIGWAKSHC 563
+G+A C
Sbjct: 467 KGMVGFAPKGC 477
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 101/384 (26%), Positives = 167/384 (43%), Gaps = 53/384 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYK 256
G YF + VG P Y+ +DTGSD+ W+QC +PC +C + ++ P+ +P
Sbjct: 133 GEYFMRLGVGTPATNVYMVLDTGSDVVWLQC-SPCKACYNQTDAIFDPKKSKTFATVPCG 191
Query: 257 DSLCMEIQRNHKPGYCET--CQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
LC + + + C T + C Y++ Y D S + G + + L +G+ +V
Sbjct: 192 SRLCRRLDDSSE---CVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF---HGARVD-HVP 244
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY-- 372
GC +D +GL V G+LGL R +S PSQ ++ +CL G
Sbjct: 245 LGCGHDNEGL----FVGAAGLLGLGRGGLSFPSQTKNR--YNGKFSYCLVDRTSSGSSSK 298
Query: 373 ----MFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV----- 422
+ G+ VP + + P+L +P ++ Y+ ++L I+ G S + G SQ
Sbjct: 299 PPSTIVFGNAAVPKTSV-FTPLLTNPKLDTFYYLQLLGISVGGSRVP-GVSESQFKLDAT 356
Query: 423 --GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
G + D+G+S T T+ AY L + + + + L S C F + +
Sbjct: 357 GNGGVIIDSGTSVTRLTQPAYVALRDAFR-LGATKLKRAPSYSLFDTC----FDLSGMTT 411
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGD 539
VK T+ HFG + + YL+ ++ +G C GS I+G+
Sbjct: 412 VK--VPTVVFHFGGG------EVSLPASNYLIPVNTEGRFCFAFAG----TMGSLSIIGN 459
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
I +G V YD V R+G+ C
Sbjct: 460 IQQQGFRVAYDLVGSRVGFLSRAC 483
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 78/252 (30%), Positives = 118/252 (46%), Gaps = 19/252 (7%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPY---KDS 258
Y + +G+P + MDTGSD++W+QC PCS C + L+ P + +
Sbjct: 122 YVITVGIGSPAVTQTMSMDTGSDVSWVQCK-PCSQCHSEVDSLFDPSSSSTYSPFSCSSA 180
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
C ++ ++ + C + QC Y + Y D SS+ G + D L L GS + FGC+
Sbjct: 181 PCAQLSQSQEGNGCMS-SQCQYIVNYGDSSSTTGTYSSDTLTL----GSSAMTDFQFGCS 235
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
+ G + +TDG++GL SL SQ A G +CL +G G++ LG
Sbjct: 236 QSESGGFND---QTDGLMGLGGGAQSLASQTA--GTFGTAFSYCLPPTSGSSGFLTLGTG 290
Query: 379 LVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQVGWALFDTGSSYTYFT 437
S G PML S + Y+ +L+ I GS LNL G +L D+G+ T
Sbjct: 291 ---SSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFSAG-SLMDSGTIITRLP 346
Query: 438 KQAYSELIASLK 449
AYS L ++ K
Sbjct: 347 PTAYSALSSAFK 358
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 103/423 (24%), Positives = 176/423 (41%), Gaps = 64/423 (15%)
Query: 162 DGIIRPHKSKINKKLV--SSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDM 219
DG+ R N+ V +S A A+ + G G YF+ + +G+P R Y+ +
Sbjct: 130 DGVTRQDLRPANESAVFGASLAAAIQGPVV---SGVGQGSGEYFSRVGIGSPARELYMVL 186
Query: 220 DTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQ-RNHKPGYCETCQ-Q 277
DTGSD+TW+QC PC+ C + ++P++ P + Y C + R+ C
Sbjct: 187 DTGSDVTWVQCQ-PCADCYQQSDPVFDPSLS--ASYAAVSCDSPRCRDLDTAACRNATGA 243
Query: 278 CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILG 337
C YE+ Y D S ++G A + L L S NV GC +D +GL V G+L
Sbjct: 244 CLYEVAYGDGSYTVGDFATETLTL---GDSTPVTNVAIGCGHDNEGL----FVGAAGLLA 296
Query: 338 LSRAKVSLPSQLASQGIIKNVVGHCLT-TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFM 396
L +S PSQ+++ + +CL ++ + G D + P++ SP
Sbjct: 297 LGGGPLSFPSQISA-----STFSYCLVDRDSPAASTLQFGADGAEA-DTVTAPLVRSPRT 350
Query: 397 -ELYHTEILKINYGSSPLNLGAR------NSQVGWALFDTGSSYTYFTKQAYSELIASLK 449
Y+ + I+ G L++ + S G + D+G++ T AY A+L+
Sbjct: 351 GTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVDSGTAVTRLQSSAY----AALR 406
Query: 450 EVSSDGLVLDASDPTLP---------VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVS 500
+ G P+LP C+ R+ V+V +L G ++ +
Sbjct: 407 DAFVRGT------PSLPRTSGVSLFDTCY--DLSDRTSVEVPAV--SLRFEGGGALRLPA 456
Query: 501 TKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAK 560
+ I +G G CL N + I+G++ +G V +D +G+
Sbjct: 457 KNYLIPVDG------AGTYCLAFAP----TNAAVSIIGNVQQQGTRVSFDTAKGVVGFTP 506
Query: 561 SHC 563
+ C
Sbjct: 507 NKC 509
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 91/389 (23%), Positives = 162/389 (41%), Gaps = 64/389 (16%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG--- 250
G G YF+ + VG P +P+Y+ +DTGSD+ W+QC PCS C + ++P++ P
Sbjct: 149 GTAQGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCK-PCSDCYQQSDPIFDPTASSSY 207
Query: 251 NILPYKDSLCMEIQ----RNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENG 306
N L C +++ RN K C Y++ Y D S ++G + T+ G
Sbjct: 208 NPLTCDAQQCQDLEMSACRNGK---------CLYQVSYGDGSFTVGEYVTE----TVSFG 254
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
+ + V GC +D +GL V + G+LGL +SL SQ+ + +CL
Sbjct: 255 AGSVNRVAIGCGHDNEGL----FVGSAGLLGLGGGPLSLTSQIKATSF-----SYCLVDR 305
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSS-----PLNLGARNSQ 421
G + P + + + Y+ E+ ++ G P S
Sbjct: 306 DSGKSSTLEFNSPRPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSG 365
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKEVSS-----DGLVLDASDPTLPVCWRAKFPIR 476
G + D+G++ T QAY+ + + K +S +G+ L + + +
Sbjct: 366 AGGVIVDSGTAITRLRTQAYNSVRDAFKRKTSNLRPAEGVAL----------FDTCYDLS 415
Query: 477 SIVDVKQFFKTLTLHFGS--KWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGST 534
S+ V+ T++ HF W + + + I +G G C + S
Sbjct: 416 SLQSVR--VPTVSFHFSGDRAWALPAKNYLIPVDG------AGTYCFAFAPTTS----SM 463
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G++ +G V +D N +G++ + C
Sbjct: 464 SIIGNVQQQGTRVSFDLANSLVGFSPNKC 492
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 96/384 (25%), Positives = 168/384 (43%), Gaps = 41/384 (10%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPC--SSCAKGANPLYKPRMGNILPYKDSL 259
Y + +G PPR + + DTGSDLTW+QC PC SSC PL+ P + Y D
Sbjct: 122 YVVTIGIGTPPRNFTVLFDTGSDLTWVQC-LPCPDSSCYPQQEPLFDPSKSST--YVDVP 178
Query: 260 CMEIQRNHKPGYCET---CQQCDYEIEYADHSSSMGVLARDELHLTIENG-SLTKPNVVF 315
C H G +T C+Y ++Y D S + G LA + L+ + + VVF
Sbjct: 179 C-SAPECHIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAATGVVF 237
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQ----LASQGIIKNVVGHCLTTNAGGGG 371
GC+++ + +T + G+LGL R S+ SQ + S G V +CL G
Sbjct: 238 GCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGG---GVFSYCLPPRGSSTG 294
Query: 372 YMFLGHDLVPSW----GMAWVPMLD--SPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
Y+ +G +++ P++ S Y + ++ + +++ A +G A
Sbjct: 295 YLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSLG-A 353
Query: 426 LFDTGSSYTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+ D+G+ T+ AY L + + S ++ + S L C+ + +V +
Sbjct: 354 VIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDTCYDVTG--QDVVTAPR- 410
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVI--SKKG---NICLGILDGSEVHNGSTIILGD 539
+ L FG +I + G L++ ++ G ++ L L ++ +I+G+
Sbjct: 411 ---VALEFGGGARI-----DVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLVIVGN 462
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
+ R VV+D RIG+ + C
Sbjct: 463 MQQRAYNVVFDVDGGRIGFGPNGC 486
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 120/259 (46%), Gaps = 17/259 (6%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
Y +G PP + DTGSDL W+QC APC C PL+ PR + +P
Sbjct: 92 YLMRFYIGTPPVERFAIADTGSDLIWVQC-APCEKCVPQNAPLFDPRKSSTFKTVPCDSQ 150
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
C + + + ++ QC Y+ Y DH+ G+L + ++ +N ++ P + FGC
Sbjct: 151 PCTLLPPSQRACVGKS-GQCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKLTFGCT 209
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC---LTTNAGGGGYMFL 375
+ ++ + G++GL +SL SQL Q I +C L++N+ M
Sbjct: 210 FSNND-TVDESKRNMGLVGLGVGPLSLISQLGYQ--IGRKFSYCFPPLSSNSTSK--MRF 264
Query: 376 GHDLVPSW--GMAWVPM-LDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSS 432
G+D + G+ P+ + S Y+ + ++ G+ + + G L D+G+S
Sbjct: 265 GNDAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTSESQTD-GNILIDSGTS 323
Query: 433 YTYFTKQAYSELIASLKEV 451
+T + Y++ +A +KEV
Sbjct: 324 FTILKQSFYNKFVALVKEV 342
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 93/383 (24%), Positives = 156/383 (40%), Gaps = 42/383 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS-CAKGANPLYKPRMGN---ILPY 255
G Y + +G PP PY DTGSDL W QC APC+S C + PLY P +LP
Sbjct: 88 GEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFRQPTPLYNPSSSTTFAVLPC 146
Query: 256 KDSL--CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
SL C C C Y + Y +S+ + + G P +
Sbjct: 147 NSSLSVCAAALAGTGTAPPPGC-ACTYNVTYGSGWTSVFQGSETFTFGSTPAGQSRVPGI 205
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT----TNAGG 369
FGC+ G G++GL R ++SL SQL G+ K +CLT TN+
Sbjct: 206 AFGCSTASSGF---NASSASGLVGLGRGRLSLVSQL---GVPK--FSYCLTPYQDTNSTS 257
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDS----PFMELYHTEILKINYGSSPLNLGA-----RNS 420
+ L + G++ P + S P Y+ + I+ G++ L++
Sbjct: 258 TLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLLNAD 317
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
G + D+G++ T AY ++ A++ + + L +D + F + S
Sbjct: 318 GTGGLIIDSGTTITLLGNTAYQQVRAAVVSL----VTLPTTDGSAATGLDLCFMLPSSTS 373
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDI 540
++TLHF + + + Y++ G CL + + ++ G ILG+
Sbjct: 374 APPAMPSMTLHFNGADMV------LPADSYMMSDDSGLWCLAMQNQTD---GEVNILGNY 424
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+ ++YD + + +A + C
Sbjct: 425 QQQNMHILYDIGQETLSFAPAKC 447
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 105/405 (25%), Positives = 172/405 (42%), Gaps = 55/405 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR----MGNILPY 255
G YF + +G+PP+ + L +DTGSDL WIQC PC C + P Y P+ NI
Sbjct: 194 GEYFIDVFIGSPPKHFSLILDTGSDLNWIQC-VPCFDCFEQNGPYYDPKDSISFRNIT-C 251
Query: 256 KDSLCMEIQRNHKPGYCE-TCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK---- 310
D C + P C+ Q C Y Y D S++ G A + + + + + K
Sbjct: 252 NDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFR 311
Query: 311 --PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---T 365
NV+FGC + +GL GL R +S SQL Q + + +CL +
Sbjct: 312 RVENVMFGCGHWNRGLFHGAAGLL----GLGRGPLSFSSQL--QSLYGHSFSYCLVDRDS 365
Query: 366 NAGGGGYMFLGH--DLVPSWGMAWVPML---DSPFMELYHTEILKINYGSSPLNLGARNS 420
+ + G DL+ + + ++ ++P Y+ +I I G L + N
Sbjct: 366 DTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENW 425
Query: 421 QV-----GWALFDTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTLPVCWRAKFP 474
+ G + D+G++ +YF+ AY + A L++V LV D P L C+
Sbjct: 426 NLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDF--PILHPCYNV--- 480
Query: 475 IRSIVDVKQFFKTLTLHF--GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNG 532
S D F + L + F G+ W + ++ I + +CL +L +
Sbjct: 481 --SGTDELNFPEFL-IQFADGAVWNFPVENY------FIRIQQLDIVCLAMLGTPK---S 528
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHCMN---PGRFKSLPF 574
+ I+G+ + ++YD N R+G+A C P F+S F
Sbjct: 529 ALSIIGNYQQQNFHILYDTKNSRLGYAPMRCAEIEAPISFRSSSF 573
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 90/366 (24%), Positives = 158/366 (43%), Gaps = 45/366 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y VG PP Y MDTGS++ W+QC PC++C +P++ P + +P
Sbjct: 87 GEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQ-PCNTCFNQTSPIFNPSKSSSYKNIPCT 145
Query: 257 DSLCMEIQRNHKPGYCETCQQ----CDYEIEYADHSSSMGVLARDELHLTIENG-SLTKP 311
S C + H +C C+Y I Y + S G L+ D L L +G S+ P
Sbjct: 146 SSTCKDTNDTHI-----SCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFP 200
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TTNAG 368
N+V GC + +L ++ G++G+ R +SL Q+ S + + +CL +++
Sbjct: 201 NIVIGCGHIN---VLQDNSQSSGVVGMGRGPMSLIKQVGSSS-VGSKFSYCLIPYNSDSN 256
Query: 369 GGGYMFLGHDLVPSWGMAW-VPMLDSPFMELYHTEILK-INYGSSPLNLGAR-NSQVGWA 425
+ G D+V S + PM+ E Y+ L+ + G++ + G R N+
Sbjct: 257 SSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYGERSNASTQNI 316
Query: 426 LFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
L D+G+ T S+L++ + +EV ++ D L +C+ ++ D+
Sbjct: 317 LIDSGTPLTMLPNLFLSKLVSYVAQEVKLPR--IEPPDHHLSLCYNTTGKQLNVPDI--- 371
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
T HF ++ G + G +C G + + + I G+I+
Sbjct: 372 ----TAHFN------GADVKLNSNGTFFPFEDGIMCFGFISSNGLE-----IFGNIAQNN 416
Query: 545 QLVVYD 550
L+ YD
Sbjct: 417 LLIDYD 422
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 122/504 (24%), Positives = 209/504 (41%), Gaps = 63/504 (12%)
Query: 74 LPMLFPGLPRKLFLFLAISIF---ALILYGSVFSYTLQDRYKSNNDDENKESFVFPLYHK 130
+ M F + L L LAI+ A I G + D+ + + S FPL H
Sbjct: 2 MKMEFTAIGSSLILSLAITFMCGVAEIAPG--LNCRSSDKILNRKVGKRSHSVSFPLIHI 59
Query: 131 FGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIF 190
+ +E R + ES+++ IR +++ +S + D+++
Sbjct: 60 Y---------SECSPFRPPNRTWESLMSEK----IRGDANRLRFLKRTSRSSKQDANANV 106
Query: 191 PLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG 250
P+R G Y + G P + Y +DTGSD+ WI C C C A P++ P
Sbjct: 107 PVRSG---SGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQ-CQGCHSTA-PIFDPAKS 161
Query: 251 NILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
+ YK C G C +C +E+ Y D + G LA D + L GS
Sbjct: 162 S--SYKPFACDSQPCQEISGNCGGNSKCQFEVSYGDGTQVDGTLASDAITL----GSQYL 215
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
PN FGCA L + G++GL +SL +Q + + +CL +++
Sbjct: 216 PNFSFGCAES----LSEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSS 271
Query: 371 GYMFLGHD-LVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNL-GARNSQVGWALF 427
G + LG + V S + + ++ P + ++ LK I+ G++ +++ G + G +
Sbjct: 272 GSLVLGKEAAVSSSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVPGTNIASGGGTII 331
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT----LPVCWRAKFPIRSIVDVKQ 483
D+G++ T+ AY+ L + ++ L + PT + C+ S VDV
Sbjct: 332 DSGTTITHLVPSAYTALRDAFRQ------QLSSLQPTPVEDMDTCYDLS---SSSVDV-- 380
Query: 484 FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
T+TLH +V K E L+ + G CL S I+G++ +
Sbjct: 381 --PTITLHLDRNVDLVLPK-----ENILITQESGLACLAF-----SSTDSRSIIGNVQQQ 428
Query: 544 GQLVVYDNVNKRIGWAKSHCMNPG 567
+V+D N ++G+A+ C P
Sbjct: 429 NWRIVFDVPNSQVGFAQEQCAAPA 452
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 105/405 (25%), Positives = 172/405 (42%), Gaps = 55/405 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR----MGNILPY 255
G YF + +G+PP+ + L +DTGSDL WIQC PC C + P Y P+ NI
Sbjct: 194 GEYFIDVFIGSPPKHFSLILDTGSDLNWIQC-VPCFDCFEQNGPYYDPKDSISFRNIT-C 251
Query: 256 KDSLCMEIQRNHKPGYCE-TCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK---- 310
D C + P C+ Q C Y Y D S++ G A + + + + + K
Sbjct: 252 NDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFR 311
Query: 311 --PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---T 365
NV+FGC + +GL GL R +S SQL Q + + +CL +
Sbjct: 312 RVENVMFGCGHWNRGLFHGAAGLL----GLGRGPLSFSSQL--QSLYGHSFSYCLVDRDS 365
Query: 366 NAGGGGYMFLGH--DLVPSWGMAWVPML---DSPFMELYHTEILKINYGSSPLNLGARNS 420
+ + G DL+ + + ++ ++P Y+ +I I G L + N
Sbjct: 366 DTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENW 425
Query: 421 QV-----GWALFDTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTLPVCWRAKFP 474
+ G + D+G++ +YF+ AY + A L++V LV D P L C+
Sbjct: 426 NLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDF--PILHPCYNV--- 480
Query: 475 IRSIVDVKQFFKTLTLHF--GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNG 532
S D F + L + F G+ W + ++ I + +CL +L +
Sbjct: 481 --SGTDELNFPEFL-IQFADGAVWNFPVENY------FIRIQQLDIVCLAMLGTPK---S 528
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHCMN---PGRFKSLPF 574
+ I+G+ + ++YD N R+G+A C P F+S F
Sbjct: 529 ALSIIGNYQQQNFHILYDTKNSRLGYAPMRCAEIEAPISFRSSSF 573
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 98/375 (26%), Positives = 150/375 (40%), Gaps = 36/375 (9%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
+G Y + +G PP Y DTGSDL W QC PC SC K NP++ P +K+
Sbjct: 88 NGEYLMKISIGTPPFDVYGIYDTGSDLMWTQC-LPCLSCYKQKNPMFDPSKST--SFKEV 144
Query: 259 LCMEIQ-RNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKP-NVVF 315
C Q R C Q+ CD+ Y D S + GV+A + L L +G T N+VF
Sbjct: 145 SCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPTSILNIVF 204
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGGY 372
GC ++ G + G+ G +SL SQ+ S CL T+
Sbjct: 205 GCGHNNSGTFNENEM---GLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSK 261
Query: 373 MFLGHDL-VPSWGMAWVPMLDSPFMELYHTEILKINYGSS--PLNLGARNSQVGWALFDT 429
+ G + V + P++ Y + I+ G P + + + G D
Sbjct: 262 IIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDA 321
Query: 430 GSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLT 489
G+ T + Y+ L+ +KE V D D +C+R+ I + LT
Sbjct: 322 GTPPTLLPRDFYNRLVQGVKEAIPMEPVQDP-DLQPQLCYRSATLIDGPI--------LT 372
Query: 490 LHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
HF G+ Q+ ISP K+G C + + +G T I G+ L+
Sbjct: 373 AHFDGADVQLKPLNTFISP-------KEGVYCFAM----QPIDGDTGIFGNFVQMNFLIG 421
Query: 549 YDNVNKRIGWAKSHC 563
+D K++ + C
Sbjct: 422 FDLDGKKVSFKAVDC 436
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 161/380 (42%), Gaps = 51/380 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YFT + VG P R Y+ +DTGSD+ W+QC APC C ++P++ PR
Sbjct: 140 GEYFTRLGVGTPARYVYMVLDTGSDIVWLQC-APCRRCYSQSDPIFDPRKSKTYATIPCS 198
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
+R G + C Y++ Y D S ++G + + LT + V GC +
Sbjct: 199 SPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTET--LTFRRNRVK--GVALGCGH 254
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG--GYMFLGH 377
D +GL V G+LGL + K+S P Q + +CL + + G+
Sbjct: 255 DNEGL----FVGAAGLLGLGKGKLSFPGQTGHR--FNQKFSYCLVDRSASSKPSSVVFGN 308
Query: 378 DLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQVGWALF--------- 427
V S + P+L +P ++ Y+ +L I+ G G R V +LF
Sbjct: 309 AAV-SRIARFTPLLSNPKLDTFYYVGLLGISVG------GTRVPGVTASLFKLDQIGNGG 361
Query: 428 ---DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
D+G+S T + AY + + + V + L + C F + ++ +VK
Sbjct: 362 VIIDSGTSVTRLIRPAYIAMRDAFR-VGAKTLKRAPNFSLFDTC----FDLSNMNEVK-- 414
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLR 543
T+ LHF + VS + YL+ + G C G I+G+I +
Sbjct: 415 VPTVVLHF--RRADVS----LPATNYLIPVDTNGKFCFAFAG----TMGGLSIIGNIQQQ 464
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
G VVYD + R+G+A C
Sbjct: 465 GFRVVYDLASSRVGFAPGGC 484
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/369 (26%), Positives = 153/369 (41%), Gaps = 42/369 (11%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS--SCAKGANPLYKPRMGNI---LPYK 256
Y + +G P +++DTGSD++W+QC PCS +C + L+ P + +P
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCK-PCSAPACNSQRDQLFDPAKSSTYSAVPCG 201
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
C E+ R ++ G C QC Y + Y D S++ GV D L L N T +FG
Sbjct: 202 ADACSEL-RIYEAG-CSG-SQCGYVVSYGDGSNTTGVYGSDTLALAPGN---TVGTFLFG 255
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C + Q G+ DG+L L R +SL SQ A G V +CL + GY+ LG
Sbjct: 256 CGHAQAGMFAG----IDGLLALGRQSMSLKSQAA--GAYGGVFSYCLPSKQSAAGYLTLG 309
Query: 377 HDLVPSWGMAWVPMLDS-PFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTY 435
S G A +L + Y + I+ G + + A ++ G + DTG+ T
Sbjct: 310 GPTSAS-GFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPA-SAFAGGTVVDTGTVITR 367
Query: 436 FTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGS 494
AY+ L ++ + ++ G ++ L C+ F +V + T+ L F
Sbjct: 368 LPPTAYAALRSAFRGAIAPYGYPSAPANGILDTCY--DFSRYGVVTL----PTVALTFSG 421
Query: 495 KWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNK 554
+ I G L + G +G ILG++ R V +D
Sbjct: 422 GATLALEAPGILSSGCLAFAPNGG------------DGDAAILGNVQQRSFAVRFD--GS 467
Query: 555 RIGWAKSHC 563
+G+ C
Sbjct: 468 TVGFMPGAC 476
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/372 (25%), Positives = 152/372 (40%), Gaps = 48/372 (12%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS--SCAKGANPLYKPRMGNI---LPYK 256
Y + +G P +++DTGSD++W+QC PCS +C + L+ P + +P
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCK-PCSAPACNSQRDQLFDPAKSSTYSAVPCG 201
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
C E+ R ++ G C QC Y + Y D S++ GV D L L N T +FG
Sbjct: 202 ADACSEL-RIYEAG-CSG-SQCGYVVSYGDGSNTTGVYGSDTLALAPGN---TVGTFLFG 255
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C + Q G+ DG+L L R +SL SQ A G V +CL + GY+ LG
Sbjct: 256 CGHAQAGMFAG----IDGLLALGRQSMSLKSQAA--GAYGGVFSYCLPSKQSAAGYLTLG 309
Query: 377 HDLVPSWGMAWVPMLDS-PFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTY 435
S G A +L + Y + I+ G + + A ++ G + DTG+ T
Sbjct: 310 GPSSAS-GFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPA-SAFAGGTVVDTGTVITR 367
Query: 436 FTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF----FKTLTLH 491
AY+ L ++ + G + P+ P A + + D ++ T+ L
Sbjct: 368 LPPTAYAALRSAFR-----GAIAPCGYPSAP----ANGILDTCYDFSRYGVVTLPTVALT 418
Query: 492 FGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
F + I G L + G +G ILG++ R V +D
Sbjct: 419 FSGGATLALEAPGILSSGCLAFAPNGG------------DGDAAILGNVQQRSFAVRFD- 465
Query: 552 VNKRIGWAKSHC 563
+G+ C
Sbjct: 466 -GSTVGFMPGAC 476
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/371 (24%), Positives = 145/371 (39%), Gaps = 36/371 (9%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y + +G P Y + DTGSD TW+QC+ C + L+ P + +
Sbjct: 184 GNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANISCA 243
Query: 257 DSLCMEIQRNHKPGYCETCQ--QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
C ++ Y + C C Y ++Y D S S+G A D L L+ +
Sbjct: 244 APACSDL-------YTKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD---AIKGFR 293
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMF 374
FGC +GL + G+LGL R K SLP Q + V HC + G GY+
Sbjct: 294 FGCGERNEGL----FGEAAGLLGLGRGKTSLPVQAYDK--YGGVFAHCFPARSSGTGYLD 347
Query: 375 LGHDLVPSWGMAW-VPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSY 433
G P+ PML + Y+ + I G L++ + D+G+
Sbjct: 348 FGPGSSPAVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTTAGTIVDSGTVI 407
Query: 434 TYFTKQAYSELIASLKE-VSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF 492
T AYS L ++ +++ G + L C+ F S V + T++L F
Sbjct: 408 TRLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCY--DFTGMSQVAI----PTVSLLF 461
Query: 493 GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNV 552
+ + G + + CLG E + I+G+ L+ VVYD
Sbjct: 462 QGGASL-----DVDASGIIYAASVSQACLGFAANEE--DDDVGIVGNTQLKTFGVVYDIG 514
Query: 553 NKRIGWAKSHC 563
K +G++ C
Sbjct: 515 KKVVGFSPGAC 525
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/372 (24%), Positives = 155/372 (41%), Gaps = 39/372 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF+ + VG P + YL +DTGSD+ WIQC+ PC+ C + ++P++ P + YK
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCE-PCADCYQQSDPVFNPTSSST--YKSLT 216
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVVFGCA 318
C Q + +C Y++ Y D S ++G LA D T+ G+ K NV GC
Sbjct: 217 CSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATD----TVTFGNSGKINNVALGCG 272
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
+D +GL G+ +S+ +Q+ + +CL G +
Sbjct: 273 HDNEGLFTGAAGLLGLGGGV----LSITNQMKATSF-----SYCLVDRDSGKSSSLDFNS 323
Query: 379 LVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNL-----GARNSQVGWALFDTGSS 432
+ G A P+L + ++ Y+ + + G + L S G + D G++
Sbjct: 324 VQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTA 383
Query: 433 YTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF 492
T QAY+ L + +++ + +S C+ F S V V T+ HF
Sbjct: 384 VTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCY--DFSSLSTVKV----PTVAFHF 437
Query: 493 -GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
G K + K ++ P + G C S S I+G++ +G + YD
Sbjct: 438 TGGKSLDLPAKNYLIP-----VDDSGTFCFAFAPTSS----SLSIIGNVQQQGTRITYDL 488
Query: 552 VNKRIGWAKSHC 563
IG + + C
Sbjct: 489 SKNVIGLSGNKC 500
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 102/398 (25%), Positives = 164/398 (41%), Gaps = 58/398 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + VG PPR + + MDTGSDL W+QC APC C + P++ P + Y++
Sbjct: 149 GEYLMDVYVGTPPRRFRMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASS--SYRNVT 205
Query: 260 CMEIQRNH-------KPGYCETCQQ-----CDYEIEYADHSSSMGVLARDE--LHLTIEN 305
C + + H + TC++ C Y Y D S++ G LA + ++LT
Sbjct: 206 CGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTAPG 265
Query: 306 GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
S VVFGC + +GL L R +S SQL + + + +CL
Sbjct: 266 ASRRVDGVVFGCGHRNRGLFHGAAGLLG----LGRGPLSFASQL--RAVYGHTFSYCLVD 319
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLD-SPFM----------ELYHTEILKINYGSSPLN 414
+ G + + + +A P L + F Y+ ++ + G LN
Sbjct: 320 HGSDVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGELLN 379
Query: 415 LGARNSQV-----GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCW 469
+ + V G + D+G++ +YF + AY + + + S L P L C+
Sbjct: 380 ISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLVPEFPVLSPCY 439
Query: 470 RAKFPIRSIVDVKQFFKTLTLHF--GSKWQIVSTKFHI--SPEGYLVISKKGNICLGILD 525
R V L+L F G+ W + + I P+G + +CL +L
Sbjct: 440 NVSGVERPEV------PELSLLFADGAVWDFPAENYFIRLDPDGGSI------MCLAVL- 486
Query: 526 GSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G +II G+ + VVYD N R+G+A C
Sbjct: 487 -GTPRTGMSII-GNFQQQNFHVVYDLQNNRLGFAPRRC 522
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 116/457 (25%), Positives = 193/457 (42%), Gaps = 54/457 (11%)
Query: 122 SFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIR-----PHKSKINKKL 176
SF L+ + +R D +K L+ ++ A V I R + SK + K
Sbjct: 66 SFSLQLHSRVSVRGTEHSD--YKSLTLARLNRDT--ARVKSLITRLDLAINNISKADLKP 121
Query: 177 VSSNAVAVDSSSIFPL-RGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS 235
+S+ + PL G G YFT + +G P R Y+ +DTGSD+ W+QC PC+
Sbjct: 122 ISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQC-TPCA 180
Query: 236 SCAKGANPLYKPRMGNILPYKDSLCMEIQRNH-KPGYCETCQQCDYEIEYADHSSSMGVL 294
C P+++P Y+ C Q N + C C YE+ Y D S ++G
Sbjct: 181 DCYHQTEPIFEPSS--SSSYEPLSCDTPQCNALEVSECRNA-TCLYEVSYGDGSYTVGDF 237
Query: 295 ARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI 354
A + L + GS NV GC + +GL V G+LGL ++LPSQL +
Sbjct: 238 ATETLTI----GSTLVQNVAVGCGHSNEGL----FVGAAGLLGLGGGLLALPSQLNTTSF 289
Query: 355 IKNVVGHCLT-TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSP 412
+CL ++ + G L P +A P+L + ++ Y+ + I+ G
Sbjct: 290 -----SYCLVDRDSDSASTVDFGTSLSPDAVVA--PLLRNHQLDTFYYLGLTGISVGGEL 342
Query: 413 LNLGARNSQV-----GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPV 467
L + + ++ G + D+G++ T + Y+ L S + + D L A
Sbjct: 343 LQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLD-LEKAAGVAMFDT 401
Query: 468 CWRAKFPIRSIVDVKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDG 526
C+ ++ V+V T+ HF G K + K ++ P + G CL
Sbjct: 402 CY--NLSAKTTVEV----PTVAFHFPGGKMLALPAKNYMIP-----VDSVGTFCLAFAPT 450
Query: 527 SEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ S I+G++ +G V +D N IG++ + C
Sbjct: 451 AS----SLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/372 (24%), Positives = 143/372 (38%), Gaps = 26/372 (6%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL 253
G G Y + +G P Y + DTGSD TW+QC C + L+ P +
Sbjct: 172 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST- 230
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
Y + C + + C Y ++Y D S S+G A D L L+ +
Sbjct: 231 -YANISCAAPACSDLDTRGCSGGNCLYGVQYGDGSYSIGFFAMDTLTLSSYD---AVKGF 286
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYM 373
FGC +GL + G+LGL R K SLP Q + V HCL + G GY+
Sbjct: 287 RFGCGERNEGL----FGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSSGTGYL 340
Query: 374 FLGHDLVPSWGMAW-VPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSS 432
G + G PML Y+ + I G L++ + D+G+
Sbjct: 341 DFGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFTTAGTIVDSGTV 400
Query: 433 YTYFTKQAYSELIASLKE-VSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLH 491
T AYS L ++ +++ G + L C+ F S V + T++L
Sbjct: 401 ITRLPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCY--DFTGMSQVAI----PTVSLL 454
Query: 492 FGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
F + + G + + +CLG + G I+G+ L+ V YD
Sbjct: 455 FQG-----GARLDVDASGIMYAASVSQVCLGFAANED--GGDVGIVGNTQLKTFGVAYDI 507
Query: 552 VNKRIGWAKSHC 563
K +G++ C
Sbjct: 508 GKKVVGFSPGAC 519
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/372 (24%), Positives = 155/372 (41%), Gaps = 39/372 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF+ + VG P + YL +DTGSD+ WIQC+ PC+ C + ++P++ P + YK
Sbjct: 160 GEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCE-PCADCYQQSDPVFNPTSSST--YKSLT 216
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVVFGCA 318
C Q + +C Y++ Y D S ++G LA D T+ G+ K NV GC
Sbjct: 217 CSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATD----TVTFGNSGKINNVALGCG 272
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
+D +GL G+ +S+ +Q+ + +CL G +
Sbjct: 273 HDNEGLFTGAAGLLGLGGGV----LSITNQMKATSF-----SYCLVDRDSGKSSSLDFNS 323
Query: 379 LVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNL-----GARNSQVGWALFDTGSS 432
+ G A P+L + ++ Y+ + + G + L S G + D G++
Sbjct: 324 VQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTA 383
Query: 433 YTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF 492
T QAY+ L + +++ + +S C+ F S V V T+ HF
Sbjct: 384 VTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCY--DFSSLSTVKV----PTVAFHF 437
Query: 493 -GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
G K + K ++ P + G C S S I+G++ +G + YD
Sbjct: 438 TGGKSLDLPAKNYLIP-----VDDSGTFCFAFAPTSS----SLSIIGNVQQQGTRITYDL 488
Query: 552 VNKRIGWAKSHC 563
IG + + C
Sbjct: 489 SKNVIGLSGNKC 500
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 153/380 (40%), Gaps = 50/380 (13%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPC--SSCAKGANPLYKPRMGN---ILPYK 256
Y + +G P + +DTGSDL+W+QC PC S C +PL+ P + +P
Sbjct: 125 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCNASDCYPQKDPLFDPSKSSTFATIPCA 183
Query: 257 DSLCMEIQRNHKPGYCETCQ--------QCDYEIEYADHSSSMGVLARDELHLTIENGSL 308
C ++ + GY C QC Y IEY + + + GV + + L L S
Sbjct: 184 SDACKQLPVD---GYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLAL---GSSA 237
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
+ FGC DQ G K DG+LGL A SL SQ AS + +CL
Sbjct: 238 VVKSFRFGCGSDQHG----PYDKFDGLLGLGGAPESLVSQTAS--VYGGAFSYCLPPLNS 291
Query: 369 GGGYMFLGHDLV---PSWGMAWVPM--LDSPFMELYHTEILKINYGSSPLNLGARNSQVG 423
G G++ LG + G + PM Y + I+ G L++ G
Sbjct: 292 GAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVFAKG 351
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
+ D+G+ T AY L + + ++ +L +D L C+ F V V +
Sbjct: 352 -NIVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCY--NFTGHGTVTVPK 408
Query: 484 FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
LT G+ + P G LV CL D + GS I+G+++ R
Sbjct: 409 V--ALTFVGGATVDL------DVPSGVLV-----EDCLAFADAGD---GSFGIIGNVNTR 452
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
V+YD+ +G+ C
Sbjct: 453 TIEVLYDSGKGHLGFRAGAC 472
>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 102/387 (26%), Positives = 156/387 (40%), Gaps = 63/387 (16%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNH 267
+G PP+P L +DTGS L+WIQC K PL KP+ + P S + NH
Sbjct: 72 IGTPPQPTDLVLDTGSQLSWIQCHD--KKVKKRLPPLPKPKTASFDPSLSSSFSLLPCNH 129
Query: 268 K-----------PGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
P C+ + C Y YAD + + G L R++ + SL+ P V+ G
Sbjct: 130 PICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFS---KSLSTPPVILG 186
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG--GYMF 374
CA + GILG++ ++S SQ +C+ + G G +
Sbjct: 187 CA--------QASTENRGILGMNHGRLSFISQAKISKF-----SYCVPSRTGSNPTGLFY 233
Query: 375 LGHDLVPSWGMAWVPMLDSPFME--------LYHTEILKINYGSSPLNLGARNSQ----- 421
LG D S +V ML P + Y + I LN+ +
Sbjct: 234 LG-DNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGG 292
Query: 422 VGWALFDTGSSYTYFTKQAY----SELIASLKEVSSDGLV-LDASDPTLPVCWRAKFPIR 476
G + D+GS TY +AY E++ + + G V D +D +C+ A
Sbjct: 293 SGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVAD----MCFDAGV--- 345
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
+V + ++ F + +I F EG L +KG C+GI + GS II
Sbjct: 346 -TAEVGRRIGGISFEFDNGVEI----FVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNII 400
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
G + + V YD NKR+G+ + C
Sbjct: 401 -GTVHQQNMWVEYDLANKRVGFGGAEC 426
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 119/267 (44%), Gaps = 30/267 (11%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
Y + +G PP P+ DTGSDLTW QC PC C P+Y P + +P +
Sbjct: 66 YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPVPCSSA 124
Query: 259 LCMEIQRNHKPGYCET-CQQCDYEIEYADHSSSMGVLARDELHL--TIENGSLTKPNVVF 315
C+ R+ C C Y Y+D + S+G+L + L + ++ +++ +V F
Sbjct: 125 TCLPTWRSRN---CSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAF 181
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT--NAGGGGYM 373
GC D G LN + G +GL R +SL +QL G+ K +CLT N+
Sbjct: 182 GCGTDNGGDSLN----STGTVGLGRGTLSLLAQL---GVGK--FSYCLTDFFNSTMDSPF 232
Query: 374 FLG--HDLVPSWGMAW-VPMLDSPFM-ELYHTEILKINYGSSPL-----NLGARNSQVGW 424
FLG +L P G P+L SP Y + I+ G L R G
Sbjct: 233 FLGTLAELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGG 292
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEV 451
+ D+G+++T K + E++ + ++
Sbjct: 293 MMVDSGTTFTILAKSGFREVVDRVAQL 319
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 152/374 (40%), Gaps = 37/374 (9%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + +G PP DTGSDL W QC PC C K PL+ P+ Y+D
Sbjct: 91 GEYLMSLSLGTPPFEILAIADTGSDLIWTQC-TPCDKCYKQIAPLFDPKSSKT--YRDLS 147
Query: 260 C--MEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVVFG 316
C + Q + C + Q C Y Y D S + G LA D + L NG + P V G
Sbjct: 148 CDTRQCQNLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIG 207
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL----TTNAGGGGY 372
C G K GI+GL +SL SQ+ S + +CL + +AG
Sbjct: 208 CGRRNNGTFDK---KDSGIIGLGGGPMSLISQMGSS--VGGKFSYCLVPFSSESAGNSSK 262
Query: 373 MFLGHDLVPSW-GMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA--LFDT 429
+ G + V S G+ P++ Y+ + ++ G + G + + D+
Sbjct: 263 LHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIEFGGSSFGGSEGNIIIDS 322
Query: 430 GSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLT 489
G+S T F ++E +++ +G + L C+R ++ V +T
Sbjct: 323 GTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPTPDLKVPV--------IT 374
Query: 490 LHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVY 549
HF ++ T +++IS +CL + +G+ I G+++ L+ Y
Sbjct: 375 AHFNGADVVLQTL-----NTFILISDD-VLCLAF---NSTQSGA--IFGNVAQMNFLIGY 423
Query: 550 DNVNKRIGWAKSHC 563
D K + + + C
Sbjct: 424 DIQGKSVSFKPTDC 437
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 153/365 (41%), Gaps = 40/365 (10%)
Query: 206 MIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQR 265
M +G P Y + +DTGS LTW+QC SC + + P++ P+ + Y C Q
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSST--YASVGCSAQQC 58
Query: 266 NH------KPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
+ P C + C Y+ Y D S S+G L++D T+ GS + PN +GC
Sbjct: 59 SDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKD----TVSFGSTSLPNFYYGCGQ 114
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDL 379
D +GL ++ G++GL+R K+SL QLA + +CL +++ G ++
Sbjct: 115 DNEGL----FGRSAGLIGLARNKLSLLYQLAPS--LGYSFTYCLPSSSSSGYLSLGSYN- 167
Query: 380 VPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTK 438
P ++ PM+ S + LY ++ + +PL++ + + D+G+ T
Sbjct: 168 -PGQ-YSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPT 225
Query: 439 QAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQI 498
YS L ++ + G ++ L C++ + S V F
Sbjct: 226 SVYSALSKAV-AAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAG----------- 273
Query: 499 VSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGW 558
+S + LV CL S I+G+ + VVYD + RIG+
Sbjct: 274 -GAALKLSAQNLLVDVDDSTTCLAFAPAR-----SAAIIGNTQQQTFSVVYDVKSSRIGF 327
Query: 559 AKSHC 563
A C
Sbjct: 328 AAGGC 332
>gi|413936885|gb|AFW71436.1| hypothetical protein ZEAMMB73_738128, partial [Zea mays]
Length = 320
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/182 (31%), Positives = 86/182 (47%), Gaps = 12/182 (6%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN-----PLYKPR-MGNIL 253
GLY+T + +G+PP+ YY+ +DTGSD+ W+ C C C + Y P G +
Sbjct: 82 GLYYTRIEIGSPPKGYYVQVDTGSDILWVNC-IRCDGCPTRSGLGIELTQYDPAGSGTTV 140
Query: 254 PYKDSLCMEIQRNHKPGYC-ETCQQCDYEIEYADHSSSMGVLARD--ELHLTIENGSLTK 310
+ C+ P C T C + I Y D S++ G D + + NG T
Sbjct: 141 GCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQTTT 200
Query: 311 PN--VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
N + FGC G L ++ DGILG ++ S+ SQLA+ ++ + HCL T G
Sbjct: 201 SNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRG 260
Query: 369 GG 370
GG
Sbjct: 261 GG 262
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/400 (23%), Positives = 177/400 (44%), Gaps = 51/400 (12%)
Query: 172 INKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCD 231
I+++ +S+ V+ S P ++ + +Y + VG PP +DTGS++TW QC
Sbjct: 35 IHRRSNASSRVSNTQSGSSPYANTVFDNSVYLMKLQVGTPPFEIQAIIDTGSEITWTQC- 93
Query: 232 APCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSM 291
PC C + P++ P + +K+ C C YE++Y DH+ +M
Sbjct: 94 LPCVHCYEQNAPIFDPSKSST--FKEKRC-------------DGHSCPYEVDYFDHTYTM 138
Query: 292 GVLARDELHLTIENGS-LTKPNVVFGCAYDQQGLLLNTLVK--TDGILGLSRAKVSLPSQ 348
G LA + + L +G P + GC ++ N+ K G++GL+ SL +Q
Sbjct: 139 GTLATETITLHSTSGEPFVMPETIIGCGHN------NSWFKPSFSGMVGLNWGPSSLITQ 192
Query: 349 LASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM-LDSPFMELYHTEILKIN 407
+ G ++ +C + G F + +V G+ M + + Y+ + ++
Sbjct: 193 MG--GEYPGLMSYCF-SGQGTSKINFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVS 249
Query: 408 YGSSPL-NLGAR-NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTL 465
G++ + +G ++ G + D+G++ TYF +Y L+ + V + A+DPT
Sbjct: 250 VGNTRIETMGTTFHALEGNIVIDSGTTLTYF-PVSYCNLVR--QAVEHVVTAVRAADPTG 306
Query: 466 P--VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGI 523
+C+ + D F +T+HF +V K+++ Y+ + G CL I
Sbjct: 307 NDMLCYNS--------DTIDIFPVITMHFSGGVDLVLDKYNM----YMESNNGGVFCLAI 354
Query: 524 LDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ S I G+ + LV YD+ + + ++ ++C
Sbjct: 355 ICNSPTQEA---IFGNRAQNNFLVGYDSSSLLVSFSPTNC 391
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 89/390 (22%), Positives = 162/390 (41%), Gaps = 53/390 (13%)
Query: 191 PLRGNIYP-DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRM 249
PLR + G YF + VG PPR + DTGSD+ W+QC PC SC +PL+ P
Sbjct: 69 PLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQC-LPCQSCYGQTDPLFNPSF 127
Query: 250 GNI---LPYKDSLCMEI-----QRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL 301
+ + SLC ++ +RN QC Y++ Y D S ++G + + L
Sbjct: 128 SSTFQSITCGSSLCQQLLIRGCRRN----------QCLYQVSYGDGSFTVGEFSTETLSF 177
Query: 302 TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
GS +V GC ++ QGL L + +S PSQ+ + +V +
Sbjct: 178 ----GSNAVNSVAIGCGHNNQGLFTGAAGLLG----LGKGLLSFPSQVGQ--LYGSVFSY 227
Query: 362 CLTTNAGGGGY-MFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGA-- 417
CL T G + G+ V S + +L +P ++ Y+ E++ I G + +++ A
Sbjct: 228 CLPTRESTGSVPLIFGNQAVAS-NAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGS 286
Query: 418 ----RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKF 473
++ G + D+G++ T AY+ + + + G+ DA + + +
Sbjct: 287 LSLDSSTGNGGVILDSGTAVTRLVTSAYNPMRDAFRA----GMPSDAKMTSGFSLFDTCY 342
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGS 533
+ + + + G+ + + + + G CL SE +
Sbjct: 343 DLSGRSSIMLPAVSFVFNGGATMALPAQNIMVP------VDNSGTYCLAFAPNSENFS-- 394
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G+I + + +D+ R+G + C
Sbjct: 395 --IIGNIQQQSFRMSFDSTGNRVGIGANQC 422
>gi|302853254|ref|XP_002958143.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
gi|300256504|gb|EFJ40768.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
Length = 475
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 130/283 (45%), Gaps = 27/283 (9%)
Query: 276 QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGI 335
++C Y YA+ SSS G + D + + +VFGC + G + L DGI
Sbjct: 5 EKCYYSRTYAERSSSEGWMVEDAFGFPDDQPPV---RMVFGCENGETGEIYRQLA--DGI 59
Query: 336 LGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWG-MAWVPMLDSP 394
+G+ + SQL ++G+I++V C G + LG +P + P+L++
Sbjct: 60 MGMGNNHNAFQSQLVARGVIEDVFSLCFGYPK--DGILLLGDVPMPKGANTVYTPLLNNL 117
Query: 395 FMELYHTEILKINYGSSPLNLGARNSQVGWA-LFDTGSSYTYFTKQAYSELIASLKEVS- 452
+ Y+ + I L+L AR G+ + D+G+++TY +A++ + A++ +
Sbjct: 118 HLHYYNVRMDGIAVNGVELSLNARIFTRGYGVVLDSGTTFTYLPTEAFNAMAAAIGSYAL 177
Query: 453 SDGL-VLDASDPTL-PVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIV--STKFHISPE 508
S GL +DP +CW+ F+ L HF S + + + + P
Sbjct: 178 SHGLQSTPGADPQYNDICWKG---------APDNFQGLENHFPSAEFVFGDNARLSLPPL 228
Query: 509 GYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
YL +S+ G CLG+ D + GS ++G +S+R +V N
Sbjct: 229 RYLFVSRPGEYCLGVFD----NGGSGTLIGGVSVRDVVVTMFN 267
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 74/232 (31%), Positives = 109/232 (46%), Gaps = 23/232 (9%)
Query: 154 ESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLR-GNIYPDGLYFTYMIVGNPP 212
++ V ++N + R ++ K +++ + S PL G G Y+ + G+P
Sbjct: 70 DARVKTLNSRLTR-KDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYVKVGFGSPA 128
Query: 213 RPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRN------ 266
R Y + +DTGS L+W+QC C A+PL+ P YK C Q +
Sbjct: 129 RYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKT--YKSLSCTSSQCSSLVDAT 186
Query: 267 -HKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGL 324
+ P CET C Y Y D S SMG L++D L L S T P V+GC D GL
Sbjct: 187 LNNP-LCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAP---SQTLPGFVYGCGQDSDGL 242
Query: 325 LLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
+ GILGL R K+S+ Q++S+ +CL T GGGG++ +G
Sbjct: 243 ----FGRAAGILGLGRNKLSMLGQVSSK--FGYAFSYCLPTR-GGGGFLSIG 287
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 90/334 (26%), Positives = 143/334 (42%), Gaps = 53/334 (15%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCM------ 261
+GNPP P L +DTGSDLTWI C PC C P + P + Y+++ C+
Sbjct: 84 IGNPPVPQLLLIDTGSDLTWIHC-LPC-KCYPQTIPFFHPSRSST--YRNASCVSAPHAM 139
Query: 262 -EIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGSLTKPNVVFGCAY 319
+I R+ K G C Y + Y D S++ G+LA ++L T ++G ++K N+VFGC
Sbjct: 140 PQIFRDEKTG------NCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQ 193
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDL 379
D G K G+LGL S I+ G + G H++
Sbjct: 194 DNSG-----FTKYSGVLGLGPGTFS---------IVTRNFGSKFSYCFGSLTNPTYPHNI 239
Query: 380 VPSWGMAWVPMLDSP---FMELYHTEILKINYGSSPLNLG----ARNSQVGWALFDTGSS 432
+ A + +P F + Y+ ++ I++G L++ R G + DTG S
Sbjct: 240 LILGNGAKIEGDPTPLQIFQDRYYLDLQAISFGEKLLDIEPGTFQRYRSQGGTVIDTGCS 299
Query: 433 YTYFTKQAYSELIASLKEVSSDGL--VLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTL 490
T ++AY L + + + L V D T P C+ + D+ F +T
Sbjct: 300 PTILAREAYETLSEEIDFLLGEVLRRVKDWDQYTTP-CYEGNLKL----DLYG-FPVVTF 353
Query: 491 HFGSKWQIVSTKFHISPEGYLVISKKGN-ICLGI 523
HF ++ + E V S+ G+ CL +
Sbjct: 354 HFAGGAELA-----LDVESLFVSSESGDSFCLAM 382
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 97/380 (25%), Positives = 163/380 (42%), Gaps = 47/380 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + +G PP +DTGSDL W+QC PC C NP++ P + Y +
Sbjct: 62 GQYLMELYIGTPPIKISGTVDTGSDLIWVQC-VPCLGCYNQINPMFDPLKSST--YTNIS 118
Query: 260 CMEIQRNHKP--GYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVVFG 316
C + +KP G C ++CDY YAD S + GVLA++ + LT G ++ ++FG
Sbjct: 119 C-DSPLCYKPYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPISLQGILFG 177
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLA--------SQGIIKNVVGHCLTTNAG 368
C ++ G N G++GL SL SQ+ SQ ++ + +++
Sbjct: 178 CGHNNTG---NFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITISSQMS 234
Query: 369 -GGGYMFLGHDLVPSWGMAWVPMLDSPF-MELYHTEILKINYGSSPLNLGARNSQVGWAL 426
G G LG G+ P++ M Y+ +L I+ + L + + + G L
Sbjct: 235 FGKGSEVLGE------GVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTIEK-GNML 287
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTL--PVCWRAKFPIRSIVDVKQF 484
D+G+ +Q Y + +K + D DP+L +C+R + ++
Sbjct: 288 VDSGTPPNILPQQLYDRVYVEVKNKVPLEPITD--DPSLGPQLCYRTQTNLKG------- 338
Query: 485 FKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLR 543
TLT HF G+ + + I P KG CL I + + G I G+ +
Sbjct: 339 -PTLTYHFEGANLLLTPIQTFIPP----TPETKGVFCLAITNCANSDPG---IYGNFAQT 390
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
L+ +D + + + + C
Sbjct: 391 NYLIGFDLDRQIVSFKPTDC 410
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 75/256 (29%), Positives = 114/256 (44%), Gaps = 35/256 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR-----MGNILP 254
G YF + VG+PPR Y+ +D+GSD+ W+QC PC+ C +P++ P MG +P
Sbjct: 140 GEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQ-PCTQCYHQTDPVFDPADSASFMG--VP 196
Query: 255 YKDSLCMEIQRN--HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPN 312
S+C I+ H G C YE+ Y D S + G LA + L G N
Sbjct: 197 CSSSVCERIENAGCHAGG-------CRYEVMYGDGSYTKGTLALETLTF----GRTVVRN 245
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA-GGGG 371
V GC + +G+ V G+LGL +SL QL Q +CL + G
Sbjct: 246 VAIGCGHRNRGM----FVGAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTDSAG 299
Query: 372 YMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQV-----GWA 425
+ G +P G AW+P++ +P Y+ + + G + + Q+ G
Sbjct: 300 SLEFGRGAMPV-GAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGV 358
Query: 426 LFDTGSSYTYFTKQAY 441
+ DTG++ T AY
Sbjct: 359 VMDTGTAVTRIPTVAY 374
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 102/417 (24%), Positives = 168/417 (40%), Gaps = 56/417 (13%)
Query: 162 DGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYP-----DGLYFTYMIVGNPPRPYY 216
DGI R L +NA V +S ++G + G YF+ + VG P R Y
Sbjct: 125 DGISR-------ADLRPANATPVFEASAAEIQGPVVSGVGQGSGEYFSRVGVGRPARQLY 177
Query: 217 LDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQ-RNHKPGYCE-T 274
+ +DTGSD+TW+QC PC+ C ++P+Y P + Y C + R+ C +
Sbjct: 178 MVLDTGSDVTWLQCQ-PCADCYAQSDPVYDPSVST--SYATVGCDSPRCRDLDAAACRNS 234
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDG 334
C YE+ Y D S ++G A + L L S NV GC +D +GL V G
Sbjct: 235 TGSCLYEVAYGDGSYTVGDFATETLTL---GDSAPVSNVAIGCGHDNEGL----FVGAAG 287
Query: 335 ILGLSRAKVSLPSQLASQGIIKNVVGHCLT-TNAGGGGYMFLGHDLVPSWGMAWVPMLDS 393
+L L +S PSQ+++ +CL ++ + G P+ P++ S
Sbjct: 288 LLALGGGPLSFPSQISA-----TTFSYCLVDRDSPSSSTLQFGDSEQPA---VTAPLIRS 339
Query: 394 PFME-LYHTEILKINYGSSPLNL-----GARNSQVGWALFDTGSSYTYFTKQAYSELIAS 447
P Y+ + I+ G L++ ++ G + D+G++ T AY L +
Sbjct: 340 PRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSGTAVTRLQSGAYGALREA 399
Query: 448 LKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISP 507
+ + L + C+ RS V V + W + +
Sbjct: 400 FVQ-GTQSLPRASGVSLFDTCY--DLAGRSSVQVPAV---------ALWFEGGGELKLPA 447
Query: 508 EGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ YL+ + G CL S G I+G++ +G V +D +G+ C
Sbjct: 448 KNYLIPVDAAGTYCLAFAGTS----GPVSIIGNVQQQGVRVSFDTAKNTVGFTADKC 500
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 101/393 (25%), Positives = 165/393 (41%), Gaps = 53/393 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYK--- 256
G YF + VG PP+ + L +DTGSDL WIQC PC C + P Y P G Y+
Sbjct: 179 GEYFIDVFVGTPPKHFSLILDTGSDLNWIQC-VPCYECFEQNGPHYDP--GQSSSYRNIG 235
Query: 257 --DSLCMEIQRNHKPGYCET-CQQCDYEIEYADHSSSMGVLARD--ELHLTIENGSLTKP 311
DS C + P C+ Q C Y Y D S++ G A + ++LT+ +G KP
Sbjct: 236 CHDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSG---KP 292
Query: 312 ------NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT- 364
NV+FGC + +GL L R +S SQL Q + + +CL
Sbjct: 293 ELRRVENVMFGCGHWNRGLFHGAAGLLG----LGRGPLSFSSQL--QSLYGHSFSYCLVD 346
Query: 365 --TNAGGGGYMFLGH--DLVPSWGMAWVPML---DSPFMELYHTEILKINYGSSPLNLGA 417
++A + G DL+ + + ++ ++P Y+ +I I G +N+
Sbjct: 347 RNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPE 406
Query: 418 RNSQV-----GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
Q+ G + D+G++ +YF + AY ++I G + P L C+
Sbjct: 407 EKWQIATDGSGGTIIDSGTTLSYFAEPAY-QVIKEAFMAKVKGYPVVKDFPVLEPCYNVT 465
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNG 532
+ D+ F + G+ W + I I + +CL IL
Sbjct: 466 GVEQP--DLPDF--GIVFSDGAVWNFPVENYFIE------IEPREVVCLAILGTPP---S 512
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
+ I+G+ + ++YD R+G+A + C +
Sbjct: 513 ALSIIGNYQQQNFHILYDTKKSRLGFAPTKCAD 545
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 99/382 (25%), Positives = 162/382 (42%), Gaps = 48/382 (12%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
+G Y + +G PP Y +DTGSDL W QC PC C + +P+++P N Y
Sbjct: 47 NGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQC-TPCQGCYRQKSPMFEPLRSNT--YTPI 103
Query: 259 LCMEIQRNHKPGY-CETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVVFG 316
C + N G+ C + C Y YAD S + GVLAR+ + + +G + ++VFG
Sbjct: 104 PCDSEECNSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVGDIVFG 163
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVS-----LPSQLASQGIIK-NVVGHCLTTNAGGG 370
C + G + G+ G + VS S+ SQ ++ + H L T + G
Sbjct: 164 CGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPHTLGTISFGD 223
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNS-QVGWALFDT 429
G G+A P++ Y + I+ G + ++ + G + D+
Sbjct: 224 ASDVSGE------GVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSSEMLSKGNIMIDS 277
Query: 430 GSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTL--PVCWRAKFPIRSIVDVKQFFKT 487
G+ TY ++ Y L+ LK V S+ L +D DP L +C+R++ + +
Sbjct: 278 GTPATYLPQEFYDRLVKELK-VQSNMLPID-DDPDLGTQLCYRSETNLEGPI-------- 327
Query: 488 LTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICL---GILDGSEVHNGSTIILGDISLR 543
L HF G+ Q++ + I P K G C G DG I G+ +
Sbjct: 328 LIAHFEGADVQLMPIQTFIPP-------KDGVFCFAMAGTTDGE-------YIFGNFAQS 373
Query: 544 GQLVVYDNVNKRIGWAKSHCMN 565
L+ +D K + + + C N
Sbjct: 374 NVLIGFDLDRKTVSFKATDCSN 395
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 104/424 (24%), Positives = 185/424 (43%), Gaps = 61/424 (14%)
Query: 158 ASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGL-------YFTYMIVGN 210
AS + I+R K +++ + + ++ + +SS+ ++ ++ GL Y + +G
Sbjct: 82 ASSFNEILRRDKLRVDSIIQARRSMNL-TSSVEHMKSSVPFYGLSKITASDYIVNVGIGT 140
Query: 211 PPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNH 267
P + L DTGS L W QC PC +C P++ P LP LC I++
Sbjct: 141 PKKEMPLIFDTGSGLIWTQCK-PCKACYPKV-PVFDPTKSASFKGLPCSSKLCQSIRQG- 197
Query: 268 KPGYCETCQQCDYEIEYADHSSSMGVLARDEL---HLTIENGSLTKPNVVFGCAYDQQGL 324
C + +C Y Y D+SSS G LA + + HL + N++ GC+ G
Sbjct: 198 ----CSS-PKCTYLTAYVDNSSSTGTLATETISFSHLKYDF-----KNILIGCSDQVSGE 247
Query: 325 LLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWG 384
L GI+GL+R+ +SL SQ A+ I + +C+ + G G++ G VP+
Sbjct: 248 SLG----ESGIMGLNRSPISLASQTAN--IYDKLFSYCIPSTPGSTGHLTFGGK-VPN-D 299
Query: 385 MAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSEL 444
+ + P+ + Y ++ I+ G L + A ++ + D+G+ T +AYS L
Sbjct: 300 VRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIASTI-DSGAVLTRLPPKAYSAL 358
Query: 445 IASLKEVSSDGLVLDASDPTLPVCWR-AKFPIRSIVDVKQFFK---TLTLHF-GSKWQIV 499
+ +E+ +LD D L C+ + + +I + FF+ + + G WQ+
Sbjct: 359 RSVFREMMKGYPLLDQDD-FLDTCYDFSNYSTVAIPSISVFFEGGVEMDIDVSGIMWQVP 417
Query: 500 STKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWA 559
+K + CL + + I G+ + VV+D +RIG+A
Sbjct: 418 GSKVY---------------CLAFAE----LDDEVSIFGNFQQKTYTVVFDGAKERIGFA 458
Query: 560 KSHC 563
C
Sbjct: 459 PGGC 462
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 95/385 (24%), Positives = 170/385 (44%), Gaps = 52/385 (13%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
+ G G YF+ + +G PP YL +DTGSD+ W+QC APC+ C + A+P+++P +
Sbjct: 139 ISGTSQGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQC-APCADCYQQADPIFEP--AS 195
Query: 252 ILPYKDSLCMEIQ-RNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
+ C Q R+ C C YE+ Y D S ++G D + TI GS
Sbjct: 196 SASFSTLSCNTRQCRSLDVSECRN-DTCLYEVSYGDGSYTVG----DFVTETITLGSAPV 250
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
NV GC ++ +GL V G+LGL +S PSQ+ + +CL
Sbjct: 251 DNVAIGCGHNNEGL----FVGAAGLLGLGGGSLSFPSQINATSF-----SYCLVDRDSES 301
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV-----GW 424
+ +P ++ P+L + ++ Y+ + ++ G +++ Q+ G
Sbjct: 302 ASTLEFNSTLPPNAVS-APLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGG 360
Query: 425 ALFDTGSSYTYFTKQAYSEL----IASLKEV-SSDGLVLDASDPTLPVCWRAKFPIRSIV 479
+ D+G++ T Y+ L + +++ S++G+ L C+ + V
Sbjct: 361 VIVDSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIAL------FDTCY--DLSSKGNV 412
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILG 538
+V T++ HF ++ + + YLV + +G C + S I+G
Sbjct: 413 EV----PTVSFHFPDGKEL-----PLPAKNYLVPLDSEGTFCFAFAPTAS----SLSIIG 459
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHC 563
++ +G VVYD VN +G+ + C
Sbjct: 460 NVQQQGTRVVYDLVNHLVGFVPNKC 484
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 87/375 (23%), Positives = 156/375 (41%), Gaps = 39/375 (10%)
Query: 198 PD-GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---L 253
PD G Y +G P DTGSDL+W+QC PC +C PL+ P + +
Sbjct: 83 PDHGEYLMRFSLGTPSVERLAIFDTGSDLSWLQC-TPCKTCYPQEAPLFDPTQSSTYVDV 141
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLT---IENGSLTK 310
P + C +N + C + +QC Y +Y S ++G L D + + + G T
Sbjct: 142 PCESQPCTLFPQNQRE--CGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATF 199
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT-NAGG 369
P VFGCA+ K +G +GL +SL SQL Q I + +C+ ++
Sbjct: 200 PKSVFGCAF-YSNFTFKISTKANGFVGLGPGPLSLASQLGDQ--IGHKFSYCMVPFSSTS 256
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQVGWALFD 428
G + G + P+ + P + +P Y+ L+ I G + G + + D
Sbjct: 257 TGKLKFG-SMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKVLTGQIGGNI---IID 312
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
+ T+ + Y++ I+S+KE + + DA P ++ +R+ ++ F
Sbjct: 313 SVPILTHLEQGIYTDFISSVKEAINVEVAEDAPTP-------FEYCVRNPTNLN--FPEF 363
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
HF + + P+ + +C+ ++ + I G+ + V
Sbjct: 364 VFHFTGADVV------LGPKNMFIALDNNLVCMTVVPSKGIS-----IFGNWAQVNFQVE 412
Query: 549 YDNVNKRIGWAKSHC 563
YD K++ +A ++C
Sbjct: 413 YDLGEKKVSFAPTNC 427
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 97/375 (25%), Positives = 150/375 (40%), Gaps = 36/375 (9%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
+G Y + +G PP Y DTGSDL W QC PC SC K NP++ P +K+
Sbjct: 88 NGEYLMKISIGTPPFDVYGIYDTGSDLMWTQC-LPCLSCYKQKNPMFDPSKST--SFKEV 144
Query: 259 LCMEIQ-RNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVVF 315
C Q R C Q+ CD+ Y D S + GV+A + L L +G + N+VF
Sbjct: 145 SCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPXSIXNIVF 204
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGGY 372
GC ++ G + G+ G +SL SQ+ S CL T+
Sbjct: 205 GCGHNNSGTFNENEM---GLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSK 261
Query: 373 MFLGHDL-VPSWGMAWVPMLDSPFMELYHTEILKINYGSS--PLNLGARNSQVGWALFDT 429
+ G + V + P++ Y + I+ G P + + + G D
Sbjct: 262 IIFGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDA 321
Query: 430 GSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLT 489
G+ T + Y+ L+ +KE V D D +C+R+ I + LT
Sbjct: 322 GTPPTLLPRDFYNRLVQGVKEAIPMEPVQDP-DLQPQLCYRSATLIDGPI--------LT 372
Query: 490 LHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
HF G+ Q+ ISP K+G C + + +G T I G+ L+
Sbjct: 373 AHFDGADVQLKPLNTFISP-------KEGVYCFAM----QPIDGDTGIFGNFVQMNFLIG 421
Query: 549 YDNVNKRIGWAKSHC 563
+D K++ + C
Sbjct: 422 FDLDGKKVSFKAVDC 436
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 95/390 (24%), Positives = 165/390 (42%), Gaps = 42/390 (10%)
Query: 197 YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS--CA-KGANPLYKPRM--GN 251
Y G YF VG P + + L DTGSDLTW+ C C S C+ + A + R+ N
Sbjct: 78 YGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHAN 137
Query: 252 I------LPYKDSLC-MEIQRNHKPGYCET-CQQCDYEIEYADHSSSMGVLARDELHLTI 303
+ +P +C +E+ C T C Y+ Y+D S+++G A + + + +
Sbjct: 138 LSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVEL 197
Query: 304 ENGSLTK-PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC 362
+ G K NV+ GC+ QG + DG++GL +K S + A + +C
Sbjct: 198 KEGRKMKLHNVLIGCSESFQG---QSFQAADGVMGLGYSKYSFAIKAAEK--FGGKFSYC 252
Query: 363 LT---TNAGGGGYMFLGHDLVPSW---GMAWVPMLDSPFMELYHTEILKINYGSSPLNLG 416
L ++ Y+ G M + ++ Y ++ I+ G + L +
Sbjct: 253 LVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIP 312
Query: 417 ARNSQV---GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKF 473
+ V G + D+GSS T+ T+ AY ++A+L+ ++ L C+ +
Sbjct: 313 SEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTG 372
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGS 533
S+V L HF +F + Y++ + G CLG + V
Sbjct: 373 FEESLVP------RLVFHFAD-----GAEFEPPVKSYVISAADGVRCLGFVS---VAWPG 418
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
T ++G+I + L +D K++G+A S C
Sbjct: 419 TSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 103/394 (26%), Positives = 168/394 (42%), Gaps = 55/394 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G YF + +G PP+ Y L +DTGSDL WIQC PC C + P Y P+ + +
Sbjct: 88 GEYFMDVFIGTPPKHYSLILDTGSDLNWIQC-VPCHDCFEQNGPYYDPKESSSFRNIGCH 146
Query: 257 DSLCMEIQRNHKPGYCET-CQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP---- 311
D C + P C+ Q C Y Y D S++ G A + + +LT P
Sbjct: 147 DPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTV-----NLTSPTGKS 201
Query: 312 ------NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT- 364
NV+FGC + +GL G+LGL R +S SQL Q + + +CL
Sbjct: 202 EFKRVENVMFGCGHWNRGLFHG----ASGLLGLGRGPLSFSSQL--QSLYGHSFSYCLVD 255
Query: 365 ----TNAGGGGYMFLGHDLVPSWGMAWVPML---DSPFMELYHTEILKINYGSSPLNL-- 415
TN DL+ + + ++ ++P Y+ +I I G LN+
Sbjct: 256 RNSDTNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPE 315
Query: 416 ---GARNSQVGWALFDTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTLPVCWRA 471
+ VG + D+G++ +YFT+ AY + A +K+V +V D P L C+
Sbjct: 316 STWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQDF--PILDPCYNV 373
Query: 472 KFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHN 531
+ +D+ F + G+ W + ++ + + +CL IL
Sbjct: 374 SGVEK--IDLPDF--GILFADGAVWNFPVENY------FIRLDPEEVVCLAILG---TPR 420
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
+ I+G+ + V+YD R+G+A +C +
Sbjct: 421 SALSIIGNYQQQNFHVLYDTKKSRLGYAPMNCAD 454
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 99/392 (25%), Positives = 160/392 (40%), Gaps = 49/392 (12%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC-AKGANPL--YKPRMG 250
N PD Y ++ +G PP+P L +DTGSDL W QC PC C ++ PL
Sbjct: 407 ANGVPDTEYLVHLAIGTPPQPVQLILDTGSDLVWTQCR-PCPVCFSRALGPLDPSNSSTF 465
Query: 251 NILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS--L 308
++LP +C + + + Q C Y YAD S + G L + +G+
Sbjct: 466 DVLPCSSPVCDNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQA 525
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
T P++ FGC G+ + GI G R +SLPSQL HC T G
Sbjct: 526 TVPDLAFGCGLFNNGIFTS---NETGIAGFGRGALSLPSQLKVDNF-----SHCFTAITG 577
Query: 369 GG-GYMFLGHDLVPS--WGMAWVPMLDSPFME------LYHTEILKINYGSSPL-----N 414
+ LG +P+ + A + +P ++ Y+ + I GS+ L
Sbjct: 578 SEPSSVLLG---LPANLYSDADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRLPIPEST 634
Query: 415 LGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP-VCWRAKF 473
+ G + D+G+ T + AY +L+ V +A+ +L +C+
Sbjct: 635 FALKQDGTGGTIIDSGTGMTTLPQDAY-KLVHDAFTAQVRLPVDNATSSSLSRLCFSFSV 693
Query: 474 PIRSIVDVKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNI-CLGILDGSEVHN 531
P R+ DV + L LHF G+ + + E G++ CL I G ++
Sbjct: 694 PRRAKPDVPK----LVLHFEGATLDLPRENYMFEFE-----DAGGSVTCLAINAGDDL-- 742
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G+ + V+YD V + + + C
Sbjct: 743 ---TIIGNYQQQNLHVLYDLVRNMLSFVPAQC 771
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 96/400 (24%), Positives = 160/400 (40%), Gaps = 82/400 (20%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
+ G G YF+ + +G P P Y+ +DTGSD+ WIQC APC+ C A+P+++P
Sbjct: 134 ISGTSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQC-APCADCYHQADPIFEPASST 192
Query: 252 ILPYKDSLCMEIQRNHKPGYCET--CQQ----------CDYEIEYADHSSSMGVLARDEL 299
++ P C+T CQ C YE+ Y D S ++G D +
Sbjct: 193 --------------SYSPLSCDTKQCQSLDVSECRNNTCLYEVSYGDGSYTVG----DFV 234
Query: 300 HLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVV 359
TI GS + NV GC ++ +GL + GL K+S PSQ+ +
Sbjct: 235 TETITLGSASVDNVAIGCGHNNEGLFIGAAGLL----GLGGGKLSFPSQINASSF----- 285
Query: 360 GHCLT-TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNL-- 415
+CL ++ + L+P A P+L + ++ Y+ + ++ G L++
Sbjct: 286 SYCLVDRDSDSASTLEFNSALLPHAITA--PLLRNRELDTFYYVGMTGLSVGGELLSIPE 343
Query: 416 ---GARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
S G + D+G++ T AY+ L D V D LPV
Sbjct: 344 SMFEMDESGNGGIIIDSGTAVTRLQTAAYNAL--------RDAFVKGTKD--LPV----- 388
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHIS--------PEGYLV-ISKKGNICLGI 523
+V F L + ++ + FH++ YL+ + G C
Sbjct: 389 -----TSEVALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPLPATNYLIPVDSDGTFCFAF 443
Query: 524 LDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
S + I+G++ +G V +D N +G+ C
Sbjct: 444 APTSSALS----IIGNVQQQGTRVGFDLANSLVGFEPRQC 479
>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 260
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 68/224 (30%), Positives = 109/224 (48%), Gaps = 13/224 (5%)
Query: 191 PLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG 250
PL G+I G Y T + +G PP+ + L +DTGS++T++ C C K +P ++
Sbjct: 39 PLYGDILSYGYYATKLYIGTPPQEFTLVVDTGSNMTFVPCCGSEEYCGKHEDPAFQTESS 98
Query: 251 NILPYKDSLCMEIQRNHKPGYCETCQ-QCDYEIEYADHSSSMGVLARDELHLTIENGSLT 309
+ Y+ C H C+ + QC Y++ Y D S S GVLA D + E+
Sbjct: 99 ST--YQPVNC------HPSCDCDYLRSQCSYKMHYGDGSYSRGVLAEDIISFGNES-EFA 149
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG 369
+VFGC D G L + ++ DGI+GL R + ++ QL +G+I + C GG
Sbjct: 150 PQRLVFGCELDAIGSLYS--LRADGIIGLGRGRSTIVDQLVDKGVISDSFSLCYGGMEGG 207
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPL 413
GG++ LG P M + + + Y+ E+++I PL
Sbjct: 208 GGHIILGSFSPPPSDM-FFTYSNPGRSQYYNVELMEIQVAGKPL 250
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 154/379 (40%), Gaps = 35/379 (9%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYK 256
G Y + +G+PP +L DTGSD+ W+QC +PCS C +PL+ P + +P
Sbjct: 121 GEYLVRVGIGSPPLEQHLVADTGSDVIWVQC-SPCSDCYAQGDPLFDPANSASFSPVPCN 179
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
+C R +C+Y++ Y D S + GVLA + L L +G V G
Sbjct: 180 SGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTL---DGGTEVQGVAMG 236
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT----TNAGGGGY 372
C ++ +GL + G+LGL +SL QL +CL G G
Sbjct: 237 CGHENRGL----FAEAAGLLGLGWGPMSLVGQLGGA--AGGAFSYCLAGYYSGEGSGSGS 290
Query: 373 MFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNL-----GARNSQVGWAL 426
+ LG + G WVP++ +P Y+ + + L L + G +
Sbjct: 291 LVLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDGGGGVV 350
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
DTG++ T +AY+ L + +G C + + V+
Sbjct: 351 MDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSLFDTC----YDLSGYASVR--VP 404
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLV--ISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
T+ L+FG Q P L+ + G CL + V +G + ILG+I +G
Sbjct: 405 TVALYFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAF---AAVASGPS-ILGNIQQQG 460
Query: 545 QLVVYDNVNKRIGWAKSHC 563
+ D+ + +G+ + C
Sbjct: 461 IEITVDSASGYVGFGPATC 479
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 114/484 (23%), Positives = 184/484 (38%), Gaps = 78/484 (16%)
Query: 136 VSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIF--PLR 193
S + A F+L R AS+ D + R + ++ A +++S F PL
Sbjct: 25 ASGKSARFELLRLAP------AASLAD-LARMDRERMAFISSRGRRRAAETASAFAMPLS 77
Query: 194 GNIYP-DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCD-----------------APCS 235
Y G YF VG P +P+ L DTGSDLTW++C AP
Sbjct: 78 SGAYTGTGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAP 137
Query: 236 SCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLA 295
+ + K R +P + C E C Y+ Y D S++ G +
Sbjct: 138 ASPRRTFRPDKSRTWAPIPCSSATCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVG 197
Query: 296 RDELHLTIENGSLTKPN---VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQ 352
D + + + K VV GC G + + +DG+L L + +S S+ AS+
Sbjct: 198 VDSATIALSGRAARKAKLRGVVLGCTTSYNG---QSFLASDGVLSLGYSNISFASRAASR 254
Query: 353 GIIKNVVGHCLTTNAG---GGGYMFLGHDLV-----PSWGMAWVP--------------- 389
+CL + Y+ G + PS G+A
Sbjct: 255 --FGGRFSYCLVDHLAPRNATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGA 312
Query: 390 -----MLDSPFMELYHTEILKINYGSSPLNLGAR---NSQVGWALFDTGSSYTYFTKQAY 441
+LD Y + ++ L + Q G A+ D+G+S T K AY
Sbjct: 313 RQTPLVLDHRTRPFYAVTVKGVSVAGELLKIPRAVWDVEQGGGAILDSGTSLTMLAKPAY 372
Query: 442 SELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVST 501
++A+L + + GL DP C+ P S DV L +HF S
Sbjct: 373 RAVVAALSKRLA-GLPRVTMDP-FDYCYNWTSP--SGSDVAAPLPMLAVHFAG-----SA 423
Query: 502 KFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKS 561
+ + Y++ + G C+G+ +G G ++I G+I + L YD N+R+ + +S
Sbjct: 424 RLEPPAKSYVIDAAPGVKCIGLQEGP--WPGLSVI-GNILQQEHLWEYDLKNRRLRFKRS 480
Query: 562 HCMN 565
CM+
Sbjct: 481 RCMH 484
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 104/398 (26%), Positives = 155/398 (38%), Gaps = 81/398 (20%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
Y + VG P RP L +DTGSDL W QC APC C P+ P + LP +
Sbjct: 84 YLVRLAVGTPRRPVALTLDTGSDLVWTQC-APCRDCFDQDLPVLDPAASSTYAALPCGAA 142
Query: 259 LC---------MEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENG--- 306
C + NH+ C Y Y D S ++G +A D G
Sbjct: 143 RCRALPFTSCGVRTLGNHR--------SCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGE 194
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVV--GHCLT 364
SL + FGC + +G+ + GI G R + SLPSQL NV +C T
Sbjct: 195 SLHTRRLTFGCGHLNKGVFQS---NETGIAGFGRGRWSLPSQL-------NVTSFSYCFT 244
Query: 365 T---------NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLN 414
+ GG H S + P+L +P LY + I+ G +
Sbjct: 245 SMFESKSSLVTLGGSPAALYSH--AHSGEVRTTPILKNPSQPSLYFLSLKGISVGKT--R 300
Query: 415 LGARNSQVGWALFDTGSSYTYFTKQAYSELIAS------LKEVSSDGLVLDASDPTLPVC 468
L ++ + D+G+S T ++ Y + A L +G LD LPV
Sbjct: 301 LPVPETKFRSTIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCF-ALPVT 359
Query: 469 --WRAKFPIRSIVDVKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILD 525
WR + +LTLH G+ W++ + + G V +C+ +LD
Sbjct: 360 ALWR-----------RPAVPSLTLHLEGADWELPRSNYVFEDLGARV------MCI-VLD 401
Query: 526 GSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ G ++G+ + VVYD N R+ +A + C
Sbjct: 402 AAP---GEQTVIGNFQQQNTHVVYDLENDRLSFAPARC 436
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 97/388 (25%), Positives = 157/388 (40%), Gaps = 57/388 (14%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYK----- 256
Y + +G PP+P +DTGSDL W QC APC+SC +PL+ P G Y+
Sbjct: 96 YVVDLAIGTPPQPVSALLDTGSDLIWTQC-APCASCLSQPDPLFAP--GQSASYEPMRCA 152
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV-- 314
+LC +I + CE C Y Y D + ++GV A + G V
Sbjct: 153 GTLCSDILHHS----CERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPL 208
Query: 315 -FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYM 373
FGC G L N GI+G R +SL SQL+ + +CLT+ A
Sbjct: 209 GFGCGSVNVGSLNN----GSGIVGFGRNPLSLVSQLSIRRF-----SYCLTSYASRRQST 259
Query: 374 FLGHDLV------PSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNL-----GARNSQ 421
L L + + P+L SP Y+ + G+ L + R
Sbjct: 260 LLFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDG 319
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKE-----VSSDGLVLDASDPTLPVCWRAKFPIR 476
G + D+G++ T +E++ + ++ ++ G D +P WR R
Sbjct: 320 SGGVIVDSGTALTLLPAAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWR-----R 374
Query: 477 SIVDVKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI 535
S + + LHF G+ + + + ++G +CL + D + +GSTI
Sbjct: 375 SSSTSQMPVPRMVLHFQGADLDLPRRNYVLDDH------RRGRLCLLLADSGD--DGSTI 426
Query: 536 ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G++ + V+YD + + A + C
Sbjct: 427 --GNLVQQDMRVLYDLEAETLSIAPARC 452
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 91/377 (24%), Positives = 153/377 (40%), Gaps = 43/377 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYK 256
G YF+ + +G+P R Y+ +DTGSD+TW+QC APC+ C ++PL+ P + + +P
Sbjct: 194 GEYFSRIGIGSPARQLYMVLDTGSDVTWLQC-APCADCYAQSDPLFDPALSSSYATVPCD 252
Query: 257 DSLCMEIQRNH-KPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
C + + C YE+ Y D S ++G A + L L +GS +V
Sbjct: 253 SPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLG-GDGSAAVHDVAI 311
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--TNAGGGGYM 373
GC +D +GL V G+L L +S PSQ+++ +CL +
Sbjct: 312 GCGHDNEGL----FVGAAGLLALGGGPLSFPSQISA-----TEFSYCLVDRDSPSASTLQ 362
Query: 374 FLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSS------PLNLGARNSQVGWAL 426
F D P++ SP Y+ + I+ G P G +
Sbjct: 363 FGASD----SSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVI 418
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
D+G++ T AYS L + + L + C+ RS V V
Sbjct: 419 VDSGTAVTRLQSSAYSALRDAFVR-GTQALPRASGVSLFDTCY--DLAGRSSVQVPAV-- 473
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
+L G + ++ + + I +G G CL G+ I+G++ +G
Sbjct: 474 SLRFEGGGELKLPAKNYLIPVDG------AGTYCLAF----AATGGAVSIVGNVQQQGIR 523
Query: 547 VVYDNVNKRIGWAKSHC 563
V +D +G++ + C
Sbjct: 524 VSFDTAKNTVGFSPNKC 540
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 90/377 (23%), Positives = 157/377 (41%), Gaps = 44/377 (11%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCA---KGANPLYKPRMGNI---LPY 255
Y Y+ VG PP DTGSDL W+ C + A G N +++P + L
Sbjct: 103 YLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSSTYSQLSC 162
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIEN--GSLTKPNV 313
+ + C + + C+ +C Y+ Y D S ++GVL+ + G + P V
Sbjct: 163 QSNACQALSQAS----CDADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQVRVPRV 218
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL--TTNAGGGG 371
FGC+ G ++DG++GL SL SQL + I + +CL + +A
Sbjct: 219 NFGCSTASAGTF-----RSDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDANSSS 273
Query: 372 YMFLGHDLVPSW-GMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTG 430
+ G V S G A P++ S Y + + G + +S++ + D+G
Sbjct: 274 TLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQ--EVATHDSRI---IVDSG 328
Query: 431 SSYTYFTKQAYSELIASLKEVSSDGLVLDASDP---TLPVCWRAKFPIRSIVDVKQF-FK 486
++ T+ L+ L+ + L P L +C+ ++ + F
Sbjct: 329 TTLTFLDPALLGPLVTELERR----IKLQRVQPPEQLLQLCY----DVQGKSETDNFGIP 380
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
+TL FG + + PE + ++G +CL ++ SE S ILG+I+ +
Sbjct: 381 DVTLRFGGGAAVT-----LRPENTFSLLQEGTLCLVLVPVSESQPVS--ILGNIAQQNFH 433
Query: 547 VVYDNVNKRIGWAKSHC 563
V YD + + +A + C
Sbjct: 434 VGYDLDARTVTFAAADC 450
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 113/472 (23%), Positives = 189/472 (40%), Gaps = 57/472 (12%)
Query: 106 TLQDRYKSNNDDENKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGII 165
TL D ++ DE+ + L H+ V+ R+ +L + D + V A I+
Sbjct: 42 TLPDFNNTHFSDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSA-----IL 96
Query: 166 RPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDL 225
R K+ S V S I + G G YF + VG+PPR Y+ +D+GSD+
Sbjct: 97 RRISGKVIPSSDSRYEVNDFGSDI--VSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDM 154
Query: 226 TWIQCDAPCSSCAKGANPLYKP-RMGNI--LPYKDSLCMEIQRN--HKPGYCETCQQCDY 280
W+QC PC C K ++P++ P + G+ + S+C I+ + H G C Y
Sbjct: 155 VWVQCQ-PCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGG-------CRY 206
Query: 281 EIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSR 340
E+ Y D S + G LA + L NV GC + +G+ + +
Sbjct: 207 EVMYGDGSYTKGTLALETLTFA----KTVVRNVAMGCGHRNRGMFIGAAGLLG----IGG 258
Query: 341 AKVSLPSQLASQGIIKNVVGHCLTTNA-GGGGYMFLGHDLVPSWGMAWVPMLDSPFMELY 399
+S QL+ Q G+CL + G + G + +P G +WVP++ +P +
Sbjct: 259 GSMSFVGQLSGQ--TGGAFGYCLVSRGTDSTGSLVFGREALP-VGASWVPLVRNPRAPSF 315
Query: 400 HTEILKINYGSS---PLNLGA---RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSS 453
+ LK PL G + G + DTG++ T AY K ++
Sbjct: 316 YYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTA 375
Query: 454 DGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF--GSKWQIVSTKFHISPEGYL 511
+ L + C+ + V V+ T++ +F G + + F +
Sbjct: 376 N-LPRASGVSIFDTCY----DLSGFVSVR--VPTVSFYFTEGPVLTLPARNF------LM 422
Query: 512 VISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ G C + G +II G+I G V +D N +G+ + C
Sbjct: 423 PVDDSGTYCFAF---AASPTGLSII-GNIQQEGIQVSFDGANGFVGFGPNVC 470
>gi|424513106|emb|CCO66690.1| predicted protein [Bathycoccus prasinos]
Length = 802
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 98/407 (24%), Positives = 175/407 (42%), Gaps = 59/407 (14%)
Query: 181 AVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKG 240
A+ SS+ L G G ++ +++G P + + +DTGS T++ C PC+SC +
Sbjct: 117 ALKQSSSAGLELNGKARDTGYFYATVLIGTPGHQFEVIVDTGSTYTFVTC-YPCASCGQ- 174
Query: 241 ANPLYKPRMGNILPYKDSLCMEIQR-----NHKPGYCETCQQCDYEIEYADHSSSMGVLA 295
G+ PY + +R G C C+Y+ ++++ S G +
Sbjct: 175 --------HGSNAPYDAAKSSSYERVPCGSGCIFGACRASGLCEYDEKFSEDSQVGGHVV 226
Query: 296 RDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQ--- 352
D + + GSL P + FGC + +L K +G++ L RA+ L QL +
Sbjct: 227 SDVIDV---GGSLGTPRIHFGCNSLETNMLKTQ--KANGMIALGRAEAGLHRQLKKKAYP 281
Query: 353 -GIIKNVVGHCLTTNAGGGGYMFLG--------HDLVPSWGMAWVPMLDSPFMELYHTEI 403
G G CL + GGG + LG + + + V ++ + Y+ E+
Sbjct: 282 PGSYDGTFGLCLGSFE-GGGVLSLGKLPEQHYANFVTRKTHTSTVKLVKGSKSQYYNVEV 340
Query: 404 LKINYGSSPLN-------LGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKE--VSSD 454
++ ++ L + A + G L D+G++YTY + + I+ +++ V+
Sbjct: 341 HRMFVRNTELKKPSGAELMEAFRAGYGTVL-DSGTTYTYLHEDVFIPFISEIEDKVVNDH 399
Query: 455 G---LVLDASDPTLP--VCWRAKFPIRSIVD--VKQFFKTLTLHF-GSKWQIVSTKFHIS 506
G + DP P VCWR+ + + + V F T L F G + + +F
Sbjct: 400 GANFFRVRGGDPNYPNDVCWRSLNENKQLSESNVNYLFPTFNLTFIGVNEEELPIEF--L 457
Query: 507 PEGYLVI--SKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
PE YL + ++ C+G+ D + GS I+G I R L +D+
Sbjct: 458 PENYLFVHPNEPNAFCVGVFDNGQ--QGS--IIGGIFARNTLFEFDD 500
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 102/396 (25%), Positives = 159/396 (40%), Gaps = 63/396 (15%)
Query: 85 LFLFLAISIFALILYGSVFSYTLQDRYKSNNDDENKESFVFPLYHKFGIREVSQRDAEFK 144
L LF +IF L + FS + R D ++ F P +F QR A
Sbjct: 11 LVLFYLCNIFYLEAFNGGFSVEMIHR------DSSRSPFFSPTETQF------QRVAN-- 56
Query: 145 LGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFT 204
+V S+N + +N+ VS N+ P I G Y
Sbjct: 57 ----------AVHRSIN------RANHLNQSFVSPNS---------PETTVISALGEYLI 91
Query: 205 YMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY---KPRMGNILPYKDSLCM 261
VG P + +DTGSD+ W+QC PC C + P++ K + LP + C
Sbjct: 92 SYSVGTPSLQVFGILDTGSDIIWLQCQ-PCKKCYEQTTPIFDSSKSQTYKTLPCPSNTCQ 150
Query: 262 EIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVVFGCA-Y 319
+Q +C + + C Y I Y D S S+G L+ + L L NGS + P V GC Y
Sbjct: 151 SVQGT----FCSSRKHCLYSIHYVDGSQSLGDLSVETLTLGSTNGSPVQFPGTVIGCGRY 206
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--TNAGGGGYMFLGH 377
+ G+ K GI+GL R +SL +QL+ K +CL + F
Sbjct: 207 NAIGI----EEKNSGIVGLGRGPMSLITQLSPSTGGK--FSYCLVPGLSTASSKLNFGNA 260
Query: 378 DLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNS-QVGWALFDTGSSYTYF 436
+V G P+ + Y + + G + + G+ S G + D+G++ T
Sbjct: 261 AVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPGSGGKGNIIIDSGTTLTAL 320
Query: 437 TKQAYSELIASLKEVSSDGLVLDASDP--TLPVCWR 470
YS+L A+ V+ ++ DP L +C++
Sbjct: 321 PNGVYSKLEAA---VAKTVILQRVRDPNQVLGLCYK 353
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 95/377 (25%), Positives = 167/377 (44%), Gaps = 54/377 (14%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDSLCMEIQ 264
VG PP+P + +D GSDL W QC AK P++ ++LP LC
Sbjct: 113 VGTPPQPSKVILDLGSDLLWTQCSL-VGPTAKQLEPVFDAARSSSFSVLPCDSKLCEAGT 171
Query: 265 RNHKPGYCETC--QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQ 322
+K TC ++C YE +Y +++ GVLA + +G N+ FGC
Sbjct: 172 FTNK-----TCTDRKCAYENDYGIMTAT-GVLATETFTFGAHHG--VSANLTFGCGKLAN 223
Query: 323 GLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG-------GGYMFL 375
G T+ + GILGLS +S+ QLA I K +CLT A G L
Sbjct: 224 G----TIAEASGILGLSPGPLSMLKQLA---ITK--FSYCLTPFADRKTSPVMFGAMADL 274
Query: 376 GHDLVPSWGMAWVPMLDSPFMEL-YHTEILKINYGSSPLN-----LGARNSQVGWALFDT 429
G + + +P+L +P ++ Y+ ++ ++ GS L+ L + G + D+
Sbjct: 275 G-KYKTTGKVQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDS 333
Query: 430 GSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTL---PVCWRAKFPIRSIVDVKQFFK 486
++ Y + A++E LK+ +G+ L ++ ++ PVC+ + P ++ Q
Sbjct: 334 ATTLAYLVEPAFTE----LKKAVMEGIKLPVANRSVDDYPVCF--ELPRGMSMEGVQ-VP 386
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
L LHF ++ + + Y G +CL ++ G+ ++G++ +
Sbjct: 387 PLVLHFDGDAEM-----SLPRDNYFQEPSPGMMCLAVMQAP--FEGAPNVIGNVQQQNMH 439
Query: 547 VVYDNVNKRIGWAKSHC 563
V+YD N++ +A + C
Sbjct: 440 VLYDVGNRKFSYAPTKC 456
>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 488
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 90/385 (23%), Positives = 152/385 (39%), Gaps = 47/385 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP---------LYKPRMGN 251
L++ + +G P + + + +DTGSDL W+ C+ S+C + +Y P
Sbjct: 88 LHYANVTIGTPAQWFLVALDTGSDLFWLPCNCN-STCVRSMETDQGERIKLNIYNPSKSK 146
Query: 252 I---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTIENGS 307
+ +LC R P C Y I Y + S S GVL D +H++ E G
Sbjct: 147 SSSKVTCNSTLCALRNRCISP-----VSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEEGE 201
Query: 308 LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
+ FGC+ Q GL V +GI+GL+ A +++P+ L G+ + C N
Sbjct: 202 ARDARITFGCSESQLGLFKE--VAVNGIMGLAIADIAVPNMLVKAGVASDSFSMCFGPN- 258
Query: 368 GGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALF 427
G G + G S P+ + Y I K G ++ A F
Sbjct: 259 -GKGTISFGDK--GSSDQLETPLSGTISPMFYDVSITKFKVGKVTVDTEFT------ATF 309
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF-FK 486
D+G++ T+ + Y+ L + D + + D C+ I S D +
Sbjct: 310 DSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSPFEFCY----IITSTSDEDKLPSV 365
Query: 487 TLTLHFGSKWQIVS--TKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
+ + G+ + + S F S + V CL +L + N I+G +
Sbjct: 366 SFEMKGGAAYDVFSPILVFDTSDGSFQV------YCLAVL---KQVNADFSIIGQNFMTN 416
Query: 545 QLVVYDNVNKRIGWAKSHCMNPGRF 569
+V+D + +GW KS+C + F
Sbjct: 417 YRIVHDRERRILGWKKSNCNDTNGF 441
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 94/361 (26%), Positives = 146/361 (40%), Gaps = 43/361 (11%)
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDSLC--MEIQRNHKPGYCE 273
+DT S+LTW+QC+ PC +C PL+ P +P S C + + C+
Sbjct: 128 VDTASELTWVQCE-PCDACHDQQEPLFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACD 186
Query: 274 TC-QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKT 332
C Y + Y D S S GVLA D L L E+ VFGC QG T
Sbjct: 187 DQPAACSYTLSYRDGSYSRGVLAHDRLSLAGED----IQGFVFGCGTSNQG----PFGGT 238
Query: 333 DGILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHDLVP---SWGMAWV 388
G++GL R+++SL SQ Q V +CL +G G + LG D S + +
Sbjct: 239 SGLMGLGRSQLSLISQTMDQ--FGGVFSYCLPPKESGSSGSLVLGDDASVYRNSTPIVYT 296
Query: 389 PMLDSPFME-LYHTEILKINYGSSPLNL-GARNSQVGWALFDTGSSYTYFTKQAY----S 442
M+ P Y + I G + G G A+ D+G+ T Y +
Sbjct: 297 AMVSDPLQGPFYLANLTGITVGGEDVQSPGFSAGGGGKAIVDSGTIITSLVPSVYAAVRA 356
Query: 443 ELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTK 502
E ++ L E A L C F + + +V+ L G++ ++ S
Sbjct: 357 EFVSQLAEYPQ-----AAPFSILDTC----FDLTGLREVQVPSLKLVFDGGAEVEVDSKG 407
Query: 503 FHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSH 562
Y+V +CL + ++ T I+G+ + V++D V +IG+A+
Sbjct: 408 VL-----YVVTGDASQVCLALASLKSEYD--TPIIGNYQQKNLRVIFDTVGSQIGFAQET 460
Query: 563 C 563
C
Sbjct: 461 C 461
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 159/375 (42%), Gaps = 49/375 (13%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS---SCAKGANPLYKPRMGN---ILPY 255
Y +G P +++DTGSDL+W+QC PCS SC +PL+ P + +P
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCSAAPSCYSQKDPLFDPAQSSSYAAVPC 198
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
+C + + QC Y + Y D S++ GV + D L L+ + F
Sbjct: 199 GGPVCAGLGIYAA--SACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS---AVQGFFF 253
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC + Q GL DG+LGL R + SL Q A G V +CL T GY+ L
Sbjct: 254 GCGHAQSGLFNG----VDGLLGLGREQPSLVEQTA--GTYGGVFSYCLPTKPSTAGYLTL 307
Query: 376 G----HDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQVGWALFDTG 430
G P G + +L SP Y+ +L I+ G L++ A ++ G + DTG
Sbjct: 308 GLGGPSGAAP--GFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPA-SAFAGGTVVDTG 364
Query: 431 SSYTYFTKQAYSELIASLKE-VSSDGLVLDASDPTLPVCWR-AKFPIRSIVDVKQFFKTL 488
+ T AY+ L ++ + ++S G S+ L C+ A + ++ +V
Sbjct: 365 TVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVA------ 418
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
L FGS ++ I G L + G+ +G ILG++ R V
Sbjct: 419 -LTFGSGATVMLGADGILSFGCLAFAPSGS------------DGGMAILGNVQQRSFEVR 465
Query: 549 YDNVNKRIGWAKSHC 563
D + +G+ S C
Sbjct: 466 IDGTS--VGFKPSSC 478
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 89.0 bits (219), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 73/260 (28%), Positives = 113/260 (43%), Gaps = 55/260 (21%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCM------ 261
+G+PP P L +DTGSDLTWIQC PC C P + P + Y+++ C
Sbjct: 94 IGDPPVPQLLLIDTGSDLTWIQC-LPC-KCYPQTIPFFHPSRSST--YRNASCESAPHAM 149
Query: 262 -EIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGSLTKPNVVFGCAY 319
+I R+ K G C Y + Y D S++ G+LA+++L T + G ++KPN+VFGC
Sbjct: 150 PQIFRDEKTG------NCRYHLRYRDFSNTRGILAKEKLTFQTSDEGLISKPNIVFGCGQ 203
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDL 379
D G + G+LGL S+ +T N G G +
Sbjct: 204 DNSG-----FTQYSGVLGLGPGTFSI-----------------VTRNFGSKFSYCFGSLI 241
Query: 380 VPSWGMAWV-----------PMLDSPFMELYHTEILKINYGSSPLNLG----ARNSQVGW 424
P++ ++ P F + Y+ ++ I+ G L++ R G
Sbjct: 242 DPTYPHNFLILGNGARIEGDPTPLQIFQDRYYLDLQAISLGEKLLDIEPGIFQRYRSKGG 301
Query: 425 ALFDTGSSYTYFTKQAYSEL 444
+ DTG S T ++AY L
Sbjct: 302 TVIDTGCSPTILAREAYETL 321
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 102/416 (24%), Positives = 176/416 (42%), Gaps = 52/416 (12%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSS--SIFPLRGNIYPDGL-YFTYMIVGNPPRPYYLDMDT 221
+R +S+I ++S N +D S + PL I L Y + +G R + +DT
Sbjct: 29 LRSLQSRIKNIILSGN---IDDSVDTQIPLTSGIRLQSLNYIVTVELGG--RKMTVIVDT 83
Query: 222 GSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQ------RNHKPGYCETC 275
GSDL+W+QC PC+ C +P++ P Y+ LC + G C +
Sbjct: 84 GSDLSWVQCQ-PCNRCYNQQDPVFNPSKSP--SYRTVLCNSLTCRSLQLATGNSGVCGSN 140
Query: 276 -QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDG 334
C+Y + Y D S + G + + L+L G+ T N +FGC QGL G
Sbjct: 141 PPTCNYVVNYGDGSYTSGEVGMEHLNL----GNTTVNNFIFGCGRKNQGLFGG----ASG 192
Query: 335 ILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHD---LVPSWGMAWVPM 390
++GL R +SL SQ++ + V +CL TT A G + +G + + +++ M
Sbjct: 193 LVGLGRTDLSLISQISP--MFGGVFSYCLPTTEAEASGSLVMGGNSSVYKNTTPISYTRM 250
Query: 391 LDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKE 450
+ +P + Y + I G + + A + + D+G+ + Y L A +
Sbjct: 251 IHNPLLPFYFLNLTGITVGG--VEVQAPSFGKDRMIIDSGTVISRLPPSIYQALKAEFVK 308
Query: 451 VSSDGLVLDASDPTLPVCWR-AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEG 509
S G S L C+ + + I D+K +F+ S + ++ G
Sbjct: 309 QFS-GYPSAPSFMILDSCFNLSGYQEVKIPDIKMYFEG------------SAELNVDVTG 355
Query: 510 --YLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
Y V + +CL I S + I+G+ + Q ++YD +G+A+ C
Sbjct: 356 VFYSVKTDASQVCLAI--ASLPYEDEVGIIGNYQQKNQRIIYDTKGSMLGFAEEAC 409
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 90/374 (24%), Positives = 154/374 (41%), Gaps = 44/374 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF+ + VG+P R Y+ +DTGSD+TW+QC PC+ C + ++P++ P + Y
Sbjct: 161 GEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQ-PCADCYQQSDPVFDPSLST--SYASVA 217
Query: 260 CMEIQRNH--KPGYCE-TCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
C + R H C + C YE+ Y D S ++G A + L L S +V G
Sbjct: 218 C-DNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTL---GDSAPVSSVAIG 273
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C +D +GL V G+L L +S PSQ+++ +CL
Sbjct: 274 CGHDNEGL----FVGAAGLLALGGGPLSFPSQISA-----TTFSYCLVDRDSPSSSTLQF 324
Query: 377 HDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSS-----PLNLGARNSQVGWALFDTG 430
D + A P++ SP Y+ + I+ G P + G + D+G
Sbjct: 325 GDAADAEVTA--PLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIVDSG 382
Query: 431 SSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTL 490
++ T AY+ L + + L + C+ R+ V+V ++L
Sbjct: 383 TAVTRLQSSAYAALRDAFVR-GTQSLPRTSGVSLFDTCY--DLSDRTSVEV----PAVSL 435
Query: 491 HFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVY 549
F ++ + + YL+ + G CL N + I+G++ +G V +
Sbjct: 436 RFAGGGEL-----RLPAKNYLIPVDGAGTYCLAFAP----TNAAVSIIGNVQQQGTRVSF 486
Query: 550 DNVNKRIGWAKSHC 563
D +G+ + C
Sbjct: 487 DTAKSTVGFTSNKC 500
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 96/384 (25%), Positives = 159/384 (41%), Gaps = 62/384 (16%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQC--DAPCSSCAKGANPLYKPRMG---NILPYKDSLCM- 261
+G PP+ + +DTGS L+WIQC AP + + P + + LP +C
Sbjct: 103 IGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTAS---FDPSLSSTFSTLPCTHPVCKP 159
Query: 262 EIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQ 321
I P C+ + C Y YAD + + G L R++ + SL P ++ GCA +
Sbjct: 160 RIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFS---RSLFTPPLILGCATES 216
Query: 322 QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY-----MFLG 376
GILG++R ++S SQ I K +C+ T GY +LG
Sbjct: 217 --------TDPRGILGMNRGRLSFASQ---SKITK--FSYCVPTRVTRPGYTPTGSFYLG 263
Query: 377 HDLVPSWGMAWVPMLD------SPFME--LYHTEILKINYGSSPLNLG-----ARNSQVG 423
H+ S ++ ML P ++ Y + I G LN+ A G
Sbjct: 264 HN-PNSNTFRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSG 322
Query: 424 WALFDTGSSYTYFTKQAY----SELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV 479
+ D+GS +TY +AY +E++ ++ G V +C+ + +
Sbjct: 323 QTMLDSGSEFTYLVNEAYDKVRAEVVRAVGPRMKKGYVYGG---VADMCFDG-----NAI 374
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
++ + + F QIV K E L + G C+GI + ++ S II G+
Sbjct: 375 EIGRLIGDMVFEFEKGVQIVVPK-----ERVLATVEGGVHCIGIANSDKLGAASNII-GN 428
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
+ V +D VN+R+G+ + C
Sbjct: 429 FHQQNLWVEFDLVNRRMGFGTADC 452
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 88.6 bits (218), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 91/374 (24%), Positives = 159/374 (42%), Gaps = 45/374 (12%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNH 267
+G PP+ + +DTGS L+WIQC P + +PL ++LP SLC ++
Sbjct: 84 IGTPPQTQQMVLDTGSQLSWIQCKVPPKTPPTAFDPLLSSSF-SVLPCNHSLCKPRVPDY 142
Query: 268 K-PGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLL 326
P C+ + C Y YAD + + G L R++ + S T P ++ GCA D
Sbjct: 143 TLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFS---SSQTTPPLILGCATDSS---- 195
Query: 327 NTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMA 386
T GILG++ ++S S V + + G +LG + S G
Sbjct: 196 ----DTQGILGMNLGRLSFSSLAKISKFSYCVPPRRSQSGSSPTGSFYLGPN-PSSAGFK 250
Query: 387 WVPMLD------SPFME--LYHTEILKINYGSSPLNLG-----ARNSQVGWALFDTGSSY 433
+V ++ P ++ Y +L I LN+ A S G L D+G+ +
Sbjct: 251 YVNLMTYRQSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPSGAGQTLIDSGTWF 310
Query: 434 TYFTKQAYSELIASLKEVS----SDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLT 489
T+ +AYS++ + +++ G V S L +C+ + + + +
Sbjct: 311 TFLVDEAYSKVKEEIVKLAGPKLKKGYVYGGS---LDMCFDGDAMV-----IGRMIGNMA 362
Query: 490 LHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVY 549
F + +IV + E L G CLGI S++ ++ I+G+ + V +
Sbjct: 363 FEFENGVEIV-----VEREKMLADVGGGVQCLGI-GRSDLLGVASNIIGNFHQQDLWVEF 416
Query: 550 DNVNKRIGWAKSHC 563
D V +R+G+ ++ C
Sbjct: 417 DLVGRRVGFGRTDC 430
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 88.6 bits (218), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 74/266 (27%), Positives = 118/266 (44%), Gaps = 29/266 (10%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
Y + +G PP P+ DTGSDLTW QC PC C P+Y + + +P +
Sbjct: 93 YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPIYDTAVSSSFSPVPCASA 151
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
C+ I + + C Y Y D + S GVL + L G ++ + FGC
Sbjct: 152 TCLPIWSSRN--CTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPG-VSVGGIAFGCG 208
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGGYMF- 374
D GL N + G +GL R +SL +QL G+ K +CLT + G +F
Sbjct: 209 VDNGGLSYN----STGTVGLGRGSLSLVAQL---GVGK--FSYCLTDFFNTSLGSPVLFG 259
Query: 375 -LGHDLVPSWGMAW--VPMLDSPFM-ELYHTEILKINYGSSPL-----NLGARNSQVGWA 425
L PS G A P++ SP++ Y+ + I+ G + L R+ G
Sbjct: 260 ALAELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGM 319
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEV 451
+ D+G+++T+ + A+ ++ + V
Sbjct: 320 IVDSGTTFTFLVESAFRVVVDHVAGV 345
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 88.6 bits (218), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 92/381 (24%), Positives = 154/381 (40%), Gaps = 44/381 (11%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
Y + VG PP+P +DTGSDL W QCD C++C + +PL+ PRM + +
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDT-CTACLRQPDPLFSPRMSSSYEPMRCAGQ 156
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
LC +I + C C Y Y D ++++G A + +G + FGC
Sbjct: 157 LCGDILHHS----CVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCG 212
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG--GGYMFLG 376
G L N GI+G R +SL SQL+ + +CLT A F
Sbjct: 213 TMNVGSLNN----ASGIVGFGRDPLSLVSQLSIRRF-----SYCLTPYASSRKSTLQFGS 263
Query: 377 HDLVPSWGMAWVPMLDSPFME------LYHTEILKINYGSSPLNLGA-----RNSQVGWA 425
V + A P+ +P ++ Y+ + G+ L + A R G
Sbjct: 264 LADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGV 323
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
+ D+G++ T F +E++ + + + S P VC+ A + +
Sbjct: 324 IIDSGTALTLFPAAVLAEVVRAFRSQLRLPFA-NGSSPDDGVCFAAPAVAAGGGRMARQV 382
Query: 486 KT--LTLHFGSKWQIVSTKFHISPEGYLVIS-KKGNICLGILDGSEVHNGSTIILGDISL 542
+ HF + E Y++ ++G++C +L G +G+TI G+
Sbjct: 383 AVPRMVFHF------QGADLDLPRENYVLEDHRRGHLC--VLLGDSGDDGATI--GNFVQ 432
Query: 543 RGQLVVYDNVNKRIGWAKSHC 563
+ VVYD + + +A C
Sbjct: 433 QDMRVVYDLERETLSFAPVEC 453
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 88.6 bits (218), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 50/144 (34%), Positives = 73/144 (50%), Gaps = 16/144 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYK 256
G YFT + VG PP+ Y+ +DTGSD+ WIQC APC C +P++ P+ + + +
Sbjct: 172 GEYFTRLGVGTPPKYVYMVLDTGSDVVWIQC-APCRKCYSQTDPVFDPKKSGSFSSISCR 230
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
LC+ + PG C + Q C Y++ Y D S + G + + L P V G
Sbjct: 231 SPLCLRLD---SPG-CNSRQSCLYQVAYGDGSFTFGEFSTETLTFR----GTRVPKVALG 282
Query: 317 CAYDQQGLLLNTLVKTDGILGLSR 340
C +D +GL V G+LGL R
Sbjct: 283 CGHDNEGL----FVGAAGLLGLGR 302
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 88.6 bits (218), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 141/371 (38%), Gaps = 26/371 (7%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL 253
G G Y + +G P Y + DTGSD TW+QC C + L+ P +
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST- 229
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
Y + C + + C Y ++Y D S S+G A D L L+ +
Sbjct: 230 -YANVSCAAPACSDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD---AVKGF 285
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYM 373
FGC +GL + G+LGL R K SLP Q + V HCL + G GY+
Sbjct: 286 RFGCGERNEGL----FGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSTGTGYL 339
Query: 374 FLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSY 433
G P+ + PML Y+ + I G L + + D+G+
Sbjct: 340 DFGAG-SPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFATAGTIVDSGTVI 398
Query: 434 TYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF 492
T AYS L ++ +S+ G + L C+ F S V + T++L F
Sbjct: 399 TRLPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCY--DFAGMSQVAI----PTVSLLF 452
Query: 493 GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNV 552
+ + G + + +CL + G I+G+ L+ V YD
Sbjct: 453 QG-----GARLDVDASGIMYAASASQVCLAFAANED--GGDVGIVGNTQLKTFGVAYDIG 505
Query: 553 NKRIGWAKSHC 563
K + ++ C
Sbjct: 506 KKVVSFSPGAC 516
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 88.6 bits (218), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 92/375 (24%), Positives = 143/375 (38%), Gaps = 32/375 (8%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI- 252
G G Y + +G P Y + DTGSD TW+QC C + L+ P +
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSSTY 230
Query: 253 --LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
+ C ++ G C Y ++Y D S S+G A D L L+ +
Sbjct: 231 ANVSCAAPACFDLDTRGCSG-----GHCLYGVQYGDGSYSIGFFAMDTLTLSSYD---AV 282
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
FGC +GL + G+LGL R K SLP Q + V HCL + G
Sbjct: 283 KGFRFGCGERNEGL----FGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSSGT 336
Query: 371 GYMFLGHDLVPSWGMAW-VPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDT 429
GY+ G + G PML Y+ + I G L++ + D+
Sbjct: 337 GYLDFGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDS 396
Query: 430 GSSYTYFTKQAYSELIAS-LKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
G+ T AYS L ++ + +++ G + L C+ F S V + T+
Sbjct: 397 GTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCY--DFTGMSQVAI----PTV 450
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
+L F + G + + +CLG + G I+G+ L+ V
Sbjct: 451 SLLFQG-----GAILDVDASGIMYAASVSQVCLGFAANED--GGDVGIVGNTQLKTFGVA 503
Query: 549 YDNVNKRIGWAKSHC 563
YD K +G++ C
Sbjct: 504 YDIGKKVVGFSPGAC 518
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 88.6 bits (218), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 97/389 (24%), Positives = 161/389 (41%), Gaps = 43/389 (11%)
Query: 193 RGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS-CAKGANPLYKPRMG- 250
R ++ G Y + +G PP PY DTGSDL W QC APC + C + PLY P
Sbjct: 103 RKDLPNGGEYLMTLAIGTPPLPYAAVADTGSDLIWTQC-APCGTQCFEQPAPLYNPASST 161
Query: 251 --NILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSL 308
++LP SL M C C Y Y ++ GV +
Sbjct: 162 TFSVLPCNSSLSMCAGALAGAAPPPGC-ACMYNQTYGTGWTA-GVQGSETFTFGSSAADQ 219
Query: 309 TK-PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--T 365
+ P V FGC+ + + G++GL R +SL SQL + +CLT
Sbjct: 220 ARVPGVAFGCSNASS----SDWNGSAGLVGLGRGSLSLVSQLGA-----GRFSYCLTPFQ 270
Query: 366 NAGGGGYMFLG-HDLVPSWGMAWVPMLDS----PFMELYHTEILKINYGSS--PLNLGA- 417
+ + LG + G+ P + S P Y+ + I+ G+ P++ GA
Sbjct: 271 DTNSTSTLLLGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAF 330
Query: 418 --RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT-LPVCWRAKFP 474
+ G + D+G++ T AY ++ A++K + + +D SD T L +C+ P
Sbjct: 331 SLKPDGTGGLIIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAP 390
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGST 534
+ V ++TLHF ++ P +IS G CL + + ++ G+
Sbjct: 391 TSAPPAV---LPSMTLHFDGADMVL-------PADSYMISGSGVWCLAMRNQTD---GAM 437
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G+ + ++YD + + +A + C
Sbjct: 438 STFGNYQQQNMHILYDVREETLSFAPAKC 466
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 88.6 bits (218), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 89/374 (23%), Positives = 155/374 (41%), Gaps = 44/374 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF+ + VG+P R Y+ +DTGSD+TW+QC PC+ C + ++P++ P + Y
Sbjct: 165 GEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQ-PCADCYQQSDPVFDPSLST--SYASVA 221
Query: 260 CMEIQRNH--KPGYCE-TCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
C + R H C + C YE+ Y D S ++G A + L L S +V G
Sbjct: 222 C-DNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTL---GDSAPVSSVAIG 277
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C +D +GL V G+L L +S PSQ+++ +CL
Sbjct: 278 CGHDNEGL----FVGAAGLLALGGGPLSFPSQISA-----TTFSYCLVDRDSPSSSTLQF 328
Query: 377 HDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSS-----PLNLGARNSQVGWALFDTG 430
D + A P++ SP Y+ + ++ G P ++ G + D+G
Sbjct: 329 GDAADAEVTA--PLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSG 386
Query: 431 SSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTL 490
++ T AY+ L + + L + C+ R+ V+V ++L
Sbjct: 387 TAVTRLQSSAYAALRDAFVR-GTQSLPRTSGVSLFDTCY--DLSDRTSVEV----PAVSL 439
Query: 491 HFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVY 549
F ++ + + YL+ + G CL N + I+G++ +G V +
Sbjct: 440 RFAGGGEL-----RLPAKNYLIPVDGAGTYCLAFAP----TNAAVSIIGNVQQQGTRVSF 490
Query: 550 DNVNKRIGWAKSHC 563
D +G+ + C
Sbjct: 491 DTAKSTVGFTTNKC 504
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 101/371 (27%), Positives = 151/371 (40%), Gaps = 55/371 (14%)
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYKDSLCMEIQRNHK--PGYCE 273
+DTGSDLTW+QC PCS C +PL+ P +P S C + PG C
Sbjct: 180 VDTGSDLTWVQCK-PCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 238
Query: 274 TC---------QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGL 324
T ++C Y + Y D S S GVLA D T+ G + VFGC +GL
Sbjct: 239 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATD----TVALGGASVDGFVFGCGLSNRGL 294
Query: 325 LLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL--TTNAGGGGYMFLGHDLVP- 381
T G++GL R ++SL SQ A + V +CL T+ G + LG D
Sbjct: 295 FGGTA----GLMGLGRTELSLVSQTAPR--FGGVFSYCLPAATSGDAAGSLSLGGDTSSY 348
Query: 382 --SWGMAWVPMLDSP----FMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTY 435
+ +++ M+ P F + T + LGA N L D+G+ T
Sbjct: 349 RNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAAN-----VLLDSGTVITR 403
Query: 436 FTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGS 494
Y + A ++ ++ L C+ + +VK TL L G+
Sbjct: 404 LAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYN----LTGHDEVKVPLLTLRLEGGA 459
Query: 495 KWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNV 552
+ G L +++K +CL + S T I+G+ + + VVYD V
Sbjct: 460 D-------MTVDAAGMLFMARKDGSQVCLAM--ASLSFEDQTPIIGNYQQKNKRVVYDTV 510
Query: 553 NKRIGWAKSHC 563
R+G+A C
Sbjct: 511 GSRLGFADEDC 521
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 101/371 (27%), Positives = 151/371 (40%), Gaps = 55/371 (14%)
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYKDSLCMEIQRNHK--PGYCE 273
+DTGSDLTW+QC PCS C +PL+ P +P S C + PG C
Sbjct: 181 VDTGSDLTWVQCK-PCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 239
Query: 274 TC---------QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGL 324
T ++C Y + Y D S S GVLA D T+ G + VFGC +GL
Sbjct: 240 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATD----TVALGGASVDGFVFGCGLSNRGL 295
Query: 325 LLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL--TTNAGGGGYMFLGHDLVP- 381
T G++GL R ++SL SQ A + V +CL T+ G + LG D
Sbjct: 296 FGGTA----GLMGLGRTELSLVSQTAPR--FGGVFSYCLPAATSGDAAGSLSLGGDTSSY 349
Query: 382 --SWGMAWVPMLDSP----FMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTY 435
+ +++ M+ P F + T + LGA N L D+G+ T
Sbjct: 350 RNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAAN-----VLLDSGTVITR 404
Query: 436 FTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGS 494
Y + A ++ ++ L C+ + +VK TL L G+
Sbjct: 405 LAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYN----LTGHDEVKVPLLTLRLEGGA 460
Query: 495 KWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNV 552
+ G L +++K +CL + S T I+G+ + + VVYD V
Sbjct: 461 D-------MTVDAAGMLFMARKDGSQVCLAM--ASLSFEDQTPIIGNYQQKNKRVVYDTV 511
Query: 553 NKRIGWAKSHC 563
R+G+A C
Sbjct: 512 GSRLGFADEDC 522
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 98/401 (24%), Positives = 165/401 (41%), Gaps = 59/401 (14%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGA-NPLYKPRMGN------ 251
G YF + +G PP+ L DTGSDL W++C A C +C+ + + PR +
Sbjct: 85 SGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSA-CRNCSHHPPSSAFLPRHSSSFSPFH 143
Query: 252 -------ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIE 304
+LP+ R H P C + YAD S S G +++ L
Sbjct: 144 CFDPHCRLLPHAPHHLCNHTRLHSP--------CRFLYSYADGSLSSGFFSKETTTLKSL 195
Query: 305 NGS-LTKPNVVFGCAYDQQGLLLN--TLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
+GS + + FGC + G ++ G++GL R +S SQL + N +
Sbjct: 196 SGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRR--FGNKFSY 253
Query: 362 CL---TTNAGGGGYMFLGHDL-----VPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSP 412
CL T + ++ +G L + +++ P+ +P Y+ I I
Sbjct: 254 CLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVK 313
Query: 413 LNLGARNSQV-----GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASD--PTL 465
L + ++ G + D+G++ TY TK AY E+ LK V + +A++ P
Sbjct: 314 LPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEV---LKSVRRRVKLPNAAELTPGF 370
Query: 466 PVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILD 525
+C A R + L G F P Y + +++G +CL I
Sbjct: 371 DLCVNASGESR-----RPSLPRLRFRLGG-----GAVFAPPPRNYFLETEEGVMCLAI-R 419
Query: 526 GSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNP 566
E NG ++I G++ +G L+ +D R+G+ + C P
Sbjct: 420 AVESGNGFSVI-GNLMQQGFLLEFDKEESRLGFTRRGCGLP 459
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 76/252 (30%), Positives = 113/252 (44%), Gaps = 22/252 (8%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCM 261
+ + +GNPP Y+ +DTGSDL WIQC+ PC C K +P+Y + Y + LC
Sbjct: 106 FLANLSIGNPPTNVYVVLDTGSDLFWIQCE-PCDVCYKQKDPIYNRTKSD--SYTEMLCN 162
Query: 262 E--IQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVVFGCA 318
E + G C C Y+ YAD S + G+L+ +++ T K V FGC
Sbjct: 163 EPPCLSLGREGQCSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKTAQVGFGCG 222
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL--TTNAGGGGYMFLG 376
Q L T + G+LGL VSL SQL++ G + +C +N GG++ G
Sbjct: 223 L--QNLNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNLSNPNAGGFLVFG 280
Query: 377 HDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV-------GWALFDT 429
+ M PM+ E Y+ +L I G L +S G + D+
Sbjct: 281 DATYLNGDM--TPMV---IAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGGVIIDS 335
Query: 430 GSSYTYFTKQAY 441
GS+ + F + Y
Sbjct: 336 GSTLSIFPPEVY 347
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 121/471 (25%), Positives = 194/471 (41%), Gaps = 68/471 (14%)
Query: 115 NDDENKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGI-IRPHKSKIN 173
N + +ES + H+ E G+ +DL + A V D I ++ + KI
Sbjct: 10 NLGKGRESTTLEMKHR-----------ELCSGKTIDLGKKMRRALVLDNIRVQSLQLKI- 57
Query: 174 KKLVSSNAVAVDSSSIFPLRGNIYPDGL-YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDA 232
K + SS S + PL I + L Y + +G + L +DTGSDLTW+QC
Sbjct: 58 KAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG--KNMSLIVDTGSDLTWVQCQ- 114
Query: 233 PCSSCAKGANPLYKPRMGNILPYKDSLC--------MEIQRNHKP-----GYCETCQQCD 279
PC SC PLY P + + YK C + N P G +T C+
Sbjct: 115 PCRSCYNQQGPLYDPSVSS--SYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKT--PCE 170
Query: 280 YEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLS 339
Y + Y D S + G LA + + L G N VFGC + +GL + L
Sbjct: 171 YVVSYGDGSYTRGDLASESILL----GDTKLENFVFGCGRNNKGLFGGSSGLMG----LG 222
Query: 340 RAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHD---LVPSWGMAWVPMLDSPF 395
R+ VSL SQ + V +CL + G G + G+D S +++ P++ +P
Sbjct: 223 RSSVSLVSQ--TLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQ 280
Query: 396 MELYHTEILKINYGSSPLNLGARNSQVGWA-LFDTGSSYTYFTKQAYSEL-IASLKEVSS 453
+ ++ IL + G+S + ++S G L D+G+ T Y + I LK+ S
Sbjct: 281 LRSFY--ILNLT-GASIGGVELKSSSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFS- 336
Query: 454 DGLVLDASDPTLPVCWR-AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV 512
G L C+ + SI +K F+ + + + + + P+ LV
Sbjct: 337 -GFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQG---NAELEVDVTGVFYFVKPDASLV 392
Query: 513 ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
CL + S + I+G+ + Q V+YD +R+G +C
Sbjct: 393 -------CLAL--ASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 434
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 95/374 (25%), Positives = 145/374 (38%), Gaps = 30/374 (8%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL 253
G+ G Y + +G P Y + DTGSD TW+QC+ C K L+ P +
Sbjct: 153 GSALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPARSST- 211
Query: 254 PYKDSLCMEIQRNHKPGYCETCQ--QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP 311
Y + C + Y + C C Y ++Y D S S+G A D L L+ +
Sbjct: 212 -YANISCAAPACSDL--YIKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD---AIK 265
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG 371
FGC +GL + G+LGL R K SLP Q + V HC + G G
Sbjct: 266 GFRFGCGERNEGL----YGEAAGLLGLGRGKTSLPVQAYDK--YGGVFAHCFPARSSGTG 319
Query: 372 YMFLGHDLVPSWGMAW-VPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTG 430
Y+ G +P+ PML Y+ + I G L++ + D+G
Sbjct: 320 YLDFGPGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTSGTIVDSG 379
Query: 431 SSYTYFTKQAYSELIASLKEVSSD-GLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLT 489
+ T AYS L ++ ++ G + L C+ F S V + T++
Sbjct: 380 TVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCY--DFTGMSEVAI----PTVS 433
Query: 490 LHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVY 549
L F + + G + + CLG E + I+G+ L+ VVY
Sbjct: 434 LLFQGGASL-----DVHASGIIYAASVSQACLGFAGNKE--DDDVGIVGNTQLKTFGVVY 486
Query: 550 DNVNKRIGWAKSHC 563
D K +G+ C
Sbjct: 487 DIGKKVVGFCPGAC 500
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 95/380 (25%), Positives = 145/380 (38%), Gaps = 72/380 (18%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G YF + VG+PP YL +D+GSD+ W+QC PC C +PL+ P + +
Sbjct: 128 GEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSCG 186
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
++C + +CDY + Y D S + G LA + L L G V G
Sbjct: 187 SAICRTLSGTGCG-GGGDAGKCDYSVTYGDGSYTKGELALETLTL----GGTAVQGVAIG 241
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C + GL V G+LGL +SL QL G V +CL + GG
Sbjct: 242 CGHRNSGL----FVGAAGLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAGG------ 289
Query: 377 HDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV-----GWALFDTGS 431
L S F Y+ + I G L L Q+ G + DTG+
Sbjct: 290 -----------AGSLASSF---YYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGT 335
Query: 432 SYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLH 491
+ T ++AY+ L + D + LP + P S++D L
Sbjct: 336 AVTRLPREAYAALRGA----------FDGAMGALP-----RSPAVSLLD-----TCYDLS 375
Query: 492 FGSKWQIVSTKFHIS-------PEGYLVISKKGNI-CLGILDGSEVHNGSTIILGDISLR 543
+ ++ + F+ P L++ G + CL S ILG+I
Sbjct: 376 GYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFCLAFAPSSS----GISILGNIQQE 431
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
G + D+ N +G+ + C
Sbjct: 432 GIQITVDSANGYVGFGPNTC 451
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 90/386 (23%), Positives = 154/386 (39%), Gaps = 46/386 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y + VG P L MDTGSD+TW+QC PC C + P++ PR + Y
Sbjct: 132 GEYMAKIAVGTPAVEALLAMDTGSDITWLQCQ-PCRRCYPQSGPVFDPRHSTSYREMGYD 190
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADH-SSSMGVLARDELHLTIENGSLTKPNVVF 315
C + R+ G C Y + Y D S+++G + L G + P++
Sbjct: 191 APDCQALGRSG--GGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFA---GGVQVPHMSI 245
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC +D +GL GILGL R ++S PSQ+A+ G +CL +
Sbjct: 246 GCGHDNKGLF---AAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLSSPGRSV 302
Query: 376 GHDLVPSWGMA-------WVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV------ 422
L G A + P + + M ++ L G +
Sbjct: 303 SSTLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLKLDPYT 362
Query: 423 --GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT--LPVCWRAKFPIRSI 478
G + D+G++ T ++AY + + + D + P+ C+ ++
Sbjct: 363 GRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDTCY-------TM 415
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIIL 537
T+++HF ++ + P+ YL+ + G +C + + S I+
Sbjct: 416 GGRAMKVPTVSMHFAGGVELT-----LPPKNYLIPVDSMGTVCFAF---AGTGDRSVSII 467
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHC 563
G+I +G VVY+ R+G+A + C
Sbjct: 468 GNIQQQGFRVVYNIGGGRVGFAPNSC 493
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 96/367 (26%), Positives = 152/367 (41%), Gaps = 56/367 (15%)
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLY---KPRMGNILPYKDSLCMEIQRNHKPGYCETC 275
MDTGSDL W QC APC CA P + K LP + S C + +C
Sbjct: 1 MDTGSDLIWTQC-APCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSS-------PSC 52
Query: 276 --QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP-NVVFGCAYDQQGLLLNTLVKT 332
+ C Y+ Y D +S+ GVLA + N + + N+ FGC G L N +
Sbjct: 53 FKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLAN----S 108
Query: 333 DGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG---YMFLGHDLVPSWGMAWVP 389
G++G R +SL SQL + +CLT+ Y + +L + + P
Sbjct: 109 SGMVGFGRGPLSLVSQLG-----PSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSP 163
Query: 390 MLDSPFM------ELYHTEILKINYGSS-----PLNLGARNSQVGWALFDTGSSYTYFTK 438
+ +PF+ +Y + I+ G+ PL + G + D+G+S T+ +
Sbjct: 164 VQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQ 223
Query: 439 QAYSELIASLKEVSSDGL-VLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQ 497
AY + L VS+ L ++ +D L C++ P V V L HF S
Sbjct: 224 DAYEAVRRGL--VSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPD----LVFHFDSA-- 275
Query: 498 IVSTKFHISPEGYLVI-SKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRI 556
+ PE Y++I S G +CL + G I+G+ + ++YD N +
Sbjct: 276 ----NMTLLPENYMLIASTTGYLCLVM-----APTGVGTIIGNYQQQNLHLLYDIGNSFL 326
Query: 557 GWAKSHC 563
+ + C
Sbjct: 327 SFVPAPC 333
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/306 (27%), Positives = 131/306 (42%), Gaps = 40/306 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY---KPRMGNILPYK 256
G Y + +G PP Y MDTGSDL W QC APC CA P + K LP +
Sbjct: 87 GEYLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCADQPTPYFDVKKSATYRALPCR 145
Query: 257 DSLCMEIQRNHKPGYCETC--QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP-NV 313
S C + +C + C Y+ Y D +S+ GVLA + N + + N+
Sbjct: 146 SSRCASLSS-------PSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNI 198
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN-AGGGGY 372
FGC G L N + G++G R +SL SQL + +CLT+ +
Sbjct: 199 AFGCGSLNAGDLAN----SSGMVGFGRGPLSLVSQLG-----PSRFSYCLTSYLSATPSR 249
Query: 373 MFLG--HDLVPSWGMAWVPMLDSPFM------ELYHTEILKINYGSS-----PLNLGARN 419
++ G +L + + P+ +PF+ +Y + I+ G+ PL +
Sbjct: 250 LYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAIND 309
Query: 420 SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGL-VLDASDPTLPVCWRAKFPIRSI 478
G + D+G+S T+ + AY + L VS+ L ++ +D L C++ P
Sbjct: 310 DGTGGVIIDSGTSITWLQQDAYEAVRRGL--VSAIPLTAMNDTDIGLDTCFQWPPPPNVT 367
Query: 479 VDVKQF 484
V V F
Sbjct: 368 VTVPDF 373
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 62/202 (30%), Positives = 96/202 (47%), Gaps = 15/202 (7%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF + VG+PPR Y+ MD+GSD+ W+QC+ PC+ C ++P++ P + +
Sbjct: 134 GEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCE-PCTQCYHQSDPVFNP--ADSSSFSGVS 190
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
C +H +C YE+ Y D S + G LA + TI G NV GC +
Sbjct: 191 CASTVCSHVDNAACHEGRCRYEVSYGDGSYTKGTLALE----TITFGRTLIRNVAIGCGH 246
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA-GGGGYMFLGHD 378
QG+ V G+LGL +S QL Q +CL + G + G +
Sbjct: 247 HNQGM----FVGAAGLLGLGGGPMSFVGQLGGQ--TGGAFSYCLVSRGIESSGLLEFGRE 300
Query: 379 LVPSWGMAWVPMLDSPFMELYH 400
+P G AWVP++ +P + ++
Sbjct: 301 AMP-VGAAWVPLIHNPRAQSFY 321
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 92/381 (24%), Positives = 154/381 (40%), Gaps = 44/381 (11%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
Y + VG PP+P +DTGSDL W QCD C++C + +PL+ PRM + +
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDT-CTACLRQPDPLFSPRMSSSYEPMRCAGQ 156
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
LC +I + C C Y Y D ++++G A + +G + FGC
Sbjct: 157 LCGDILHHS----CVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCG 212
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG--GGYMFLG 376
G L N GI+G R +SL SQL+ + +CLT A F
Sbjct: 213 TMNVGSLNN----ASGIVGFGRDPLSLVSQLSIRRF-----SYCLTPYASSRKSTLQFGS 263
Query: 377 HDLVPSWGMAWVPMLDSPFME------LYHTEILKINYGSSPLNLGA-----RNSQVGWA 425
V + A P+ +P ++ Y+ + G+ L + A R G
Sbjct: 264 LADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGV 323
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
+ D+G++ T F +E++ + + + S P VC+ A + +
Sbjct: 324 IIDSGTALTLFPVAVLAEVVRAFRSQLRLPFA-NGSSPDDGVCFAAPAVAAGGGRMARQV 382
Query: 486 KT--LTLHFGSKWQIVSTKFHISPEGYLVIS-KKGNICLGILDGSEVHNGSTIILGDISL 542
+ HF + E Y++ ++G++C +L G +G+TI G+
Sbjct: 383 AVPRMVFHF------QGADLDLPRENYVLEDHRRGHLC--VLLGDSGDDGATI--GNFVQ 432
Query: 543 RGQLVVYDNVNKRIGWAKSHC 563
+ VVYD + + +A C
Sbjct: 433 QDMRVVYDLERETLSFAPVEC 453
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 112/475 (23%), Positives = 194/475 (40%), Gaps = 62/475 (13%)
Query: 106 TLQDRYKSNNDDENKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGII 165
TL D ++ D++ + L H+ V+ R+ +L + D + V A I+
Sbjct: 42 TLPDFNNTHFSDDSNSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSA-----IL 96
Query: 166 RPHKSKINKKLV---SSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTG 222
R +I+ K+V S + V+ + G G YF + VG+PPR Y+ +D+G
Sbjct: 97 R----RISGKVVVASSDSRYEVNDFGSDVVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSG 152
Query: 223 SDLTWIQCDAPCSSCAKGANPLYKP-RMGNI--LPYKDSLCMEIQRN--HKPGYCETCQQ 277
SD+ W+QC PC C K ++P++ P + G+ + S+C I+ + H G
Sbjct: 153 SDMVWVQCQ-PCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGG------- 204
Query: 278 CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILG 337
C YE+ Y D S + G LA + L NV GC + +G+ +
Sbjct: 205 CRYEVMYGDGSYTKGTLALETLTFA----KTVVRNVAMGCGHRNRGMFIGAAGLLG---- 256
Query: 338 LSRAKVSLPSQLASQGIIKNVVGHCLTTNA-GGGGYMFLGHDLVPSWGMAWVPMLDSPFM 396
+ +S QL+ Q G+CL + G + G + +P G +WVP++ +P
Sbjct: 257 IGGGSMSFVGQLSGQ--TGGAFGYCLVSRGTDSTGSLVFGREALP-VGASWVPLVRNPRA 313
Query: 397 ELYHTEILKINYGSS---PLNLGA---RNSQVGWALFDTGSSYTYFTKQAYSELIASLKE 450
++ LK PL G + G + DTG++ T AY+ K
Sbjct: 314 PSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTGAYAAFRDGFKS 373
Query: 451 VSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF--GSKWQIVSTKFHISPE 508
+++ L + C+ + V V+ T++ +F G + + F
Sbjct: 374 QTAN-LPRASGVSIFDTCY----DLSGFVSVR--VPTVSFYFTEGPVLTLPARNF----- 421
Query: 509 GYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + G C + G +II G+I G V +D N +G+ + C
Sbjct: 422 -LMPVDDSGTYCFAF---AASPTGLSII-GNIQQEGIQVSFDGANGFVGFGPNVC 471
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 98/386 (25%), Positives = 160/386 (41%), Gaps = 57/386 (14%)
Query: 202 YFTYMIVGNP-PRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLC 260
Y + +G P P+ L++DTGSD+ W QC PC C P + + + LC
Sbjct: 92 YLIHFGIGTPRPQQVALEVDTGSDVVWTQCR-PCFDCFTQPLPRFDTSASDTV--HGVLC 148
Query: 261 ME-IQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVVFGCA 318
+ I R +P C C Y++ Y D+S ++G LA+D + G +T P++VFGC
Sbjct: 149 TDPICRALRPHAC-FLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFGCG 207
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT-NAGGGGYMFLGH 377
G N GI G R +SLP QL +C TT +FLG
Sbjct: 208 QYNTG---NFHSNETGIAGFGRGPLSLPRQLGVSSF-----SYCFTTIFESKSTPVFLGG 259
Query: 378 DLVPSWGM---AWVPMLDSPFM----ELYHTEILKINYGSSPLNLG-----ARNSQVGWA 425
P+ G+ A P+L +PF+ E Y+ + I G + L + + G
Sbjct: 260 --APADGLRAHATGPILSTPFLPNHPEYYYLSLKGITVGKTRLAVPESAFVVKADGSGGT 317
Query: 426 LFDTGSSYTYFTKQAYSELIAS------LKEVSSDGLVLDASDPTLPVCWRAKFPIRSIV 479
+ D+G++ T F + + L + L S + D +PTL P S V
Sbjct: 318 IIDSGTAITAFPRAVFRSLWEAFVAQVPLPHTSYN----DTGEPTLQCFSTESVPDASKV 373
Query: 480 DVKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIIL 537
V + +TLH G+ W++ E Y+ +C+ +L G + ++
Sbjct: 374 PVPK----MTLHLEGADWELPR-------ENYMAEYPDSDQLCVVVLAGDD----DRTMI 418
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHC 563
G+ + +V+D ++ + C
Sbjct: 419 GNFQQQNMHIVHDLAGNKLVIEPAQC 444
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 115/460 (25%), Positives = 180/460 (39%), Gaps = 47/460 (10%)
Query: 113 SNNDDENKESFVFPLYHKFG-IREVSQRDAEFK-LGRFVDLDGESVVASVNDGIIRPHKS 170
SNND NK S + HK G ++SQ +A + L +S V S+ H
Sbjct: 68 SNND--NKASL--KVVHKHGPCSKLSQDEASAAPTHTEILLQDQSRVKSI-------HSR 116
Query: 171 KINKKLVSSNAVAVDSSSIFPLR-GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQ 229
N K V V S+ P + G+ G Y + +G P + L DTGSD+TW Q
Sbjct: 117 LSNSKTSGGKDVKVTDSTTIPAKDGSTVGSGNYIVTVGLGTPKKDLSLIFDTGSDITWTQ 176
Query: 230 CDAPCSSCAKGANPLYKPRMGN-----ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY 284
C SC K ++ P + + PG C + C Y I+Y
Sbjct: 177 CQPCARSCYKQKEQIFDPSQSTSYTNISCSSSICNSLTSATGNTPG-CAS-SACVYGIQY 234
Query: 285 ADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVS 344
D S S+G ++L LT + N+ FGC + QGL + L R K+S
Sbjct: 235 GDSSFSVGFFGTEKLTLTSTDA---FNNIYFGCGQNNQGLFGGSAGLLG----LGRDKLS 287
Query: 345 LPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLD-SPFMELYHTEI 403
+ SQ A + + +CL +++ G++ G S + P+ S Y +
Sbjct: 288 VVSQTAQK--YNKIFSYCLPSSSSSTGFLTFGGS--ASKNAKFTPLSTISAGPSFYGLDF 343
Query: 404 LKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDP 463
I+ G L + A A+ D+G+ T AYS L AS + + S + +
Sbjct: 344 TGISVGGKKLAISASVFSTAGAIIDSGTVITRLPPAAYSALRASFRNLMSK-YPMTKALS 402
Query: 464 TLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGI 523
L C+ F + + V + + F S ++ I G L S +CL
Sbjct: 403 ILDTCY--DFSSYTTISVPK----IGFSFSSGIEV-----DIDATGILYASSLSQVCLAF 451
Query: 524 LDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
S+ + I G++ + V YD ++G+A C
Sbjct: 452 AGNSDATD--VFIFGNVQQKTLEVFYDGSAGKVGFAPGGC 489
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/379 (24%), Positives = 155/379 (40%), Gaps = 51/379 (13%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDSLCM-EI 263
+G PP+ + +DTGS L+WIQC + P + ++LP LC I
Sbjct: 86 IGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHPLCKPRI 145
Query: 264 QRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQG 323
P C+ + C Y YAD + + G L R+++ + S + P ++ GCA
Sbjct: 146 PDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFS---SSQSTPPLILGCA----- 197
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSW 383
GILG++ + S SQ V G +LG++ P+
Sbjct: 198 ---EASTDEKGILGMNLGRRSFASQAKISKFSYCVPTRQARAGLSSTGSFYLGNN--PNS 252
Query: 384 G-------MAWVPMLDSPFME--LYHTEILKINYGSSPLNLGAR-----NSQVGWALFDT 429
G + + P SP ++ Y + I G++ LN+ A S G + D+
Sbjct: 253 GRFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDPSGAGQTIIDS 312
Query: 430 GSSYTYFTKQAYS----ELIASLKEVSSDGLVLDA-SDPTLPVCWRAKFPIRSIVDVKQF 484
GS +TY +AY+ E++ + G V SD +C+ + +++ +
Sbjct: 313 GSEFTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVSD----MCFDG-----NPMEIGRL 363
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
+ F +IV K+ + L G C+GI SE+ ++ I+G+ +
Sbjct: 364 IGNMVFEFEKGVEIVIDKWRV-----LADVGGGVHCIGI-GRSEMLGAASNIIGNFHQQN 417
Query: 545 QLVVYDNVNKRIGWAKSHC 563
V YD N+RIG K+ C
Sbjct: 418 LWVEYDLANRRIGLGKADC 436
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 106/454 (23%), Positives = 184/454 (40%), Gaps = 48/454 (10%)
Query: 118 ENKESFVFPLYHKFG----IREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKIN 173
ENK + HK G +R+ + +A++ L L +S V S++ SK++
Sbjct: 80 ENKA--FLKVVHKHGPCSDLRQGHKAEAQYIL-----LQDQSRVDSIH--------SKLS 124
Query: 174 KKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP 233
K S+ A ++++ G+I G YF + +G P + + L DTGSDLTW QC+
Sbjct: 125 KDSGLSDVKATAATTLPAKDGSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPC 184
Query: 234 CSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSS 290
SC ++ P + +LC + + C Y I+Y D S S
Sbjct: 185 VKSCYNQKEAIFNPSQSTSYANISCGSTLCDSLASATGNIFNCASSTCVYGIQYGDSSFS 244
Query: 291 MGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLA 350
+G +++L LT + + FGC + +GL L R K+SL SQ A
Sbjct: 245 IGFFGKEKLSLTATD---VFNDFYFGCGQNNKGLFGGAAGLLG----LGRDKLSLVSQTA 297
Query: 351 SQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLD-SPFMELYHTEILKINYG 409
+ + +CL +++ G++ G S ++ P+ S Y ++ I+ G
Sbjct: 298 QR--YNKIFSYCLPSSSSSTGFLTFGGSTSKS--ASFTPLATISGGSSFYGLDLTGISVG 353
Query: 410 SSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCW 469
L + + D+G+ T AYS L ++ +++ S A L C+
Sbjct: 354 GRKLAISPSVFSTAGTIIDSGTVITRLPPAAYSALSSTFRKLMSQYPAAPAL-SILDTCF 412
Query: 470 RAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEV 529
F + V + + L F + I G ++ +CL S+
Sbjct: 413 --DFSNHDTISVPK----IGLFFSGGVVV-----DIDKTGIFYVNDLTQVCLAFAGNSDA 461
Query: 530 HNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ I G++ + VVYD R+G+A + C
Sbjct: 462 SD--VAIFGNVQQKTLEVVYDGAAGRVGFAPAGC 493
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 76/252 (30%), Positives = 119/252 (47%), Gaps = 21/252 (8%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPY---KDS 258
Y + +G+P + +DTGSD++W+QC PCS C A+ L+ P + +
Sbjct: 127 YLITVGMGSPAVAQTMLIDTGSDVSWVQCK-PCSQCHSQADSLFDPSSSSTYSAFSCTSA 185
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
C ++++ C + QC Y ++Y D S+ G + D L L GS T N FGC+
Sbjct: 186 ACAQLRQRG----CSS-SQCQYTVKYGDGSTGSGTYSSDTLAL----GSSTVENFQFGCS 236
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
+ G LL +T G++GL SL +Q A G +CL G G++ LG
Sbjct: 237 QSESGNLLQD--QTAGLMGLGGGAESLATQTA--GTFGKAFSYCLPPTPGSSGFLTLGAS 292
Query: 379 LVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQVGWALFDTGSSYTYFT 437
S + PML S + Y+ +L+ I G LN+ A G ++ D+G+ T
Sbjct: 293 T--SGFVVKTPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASAFSAG-SIMDSGTIITRLP 349
Query: 438 KQAYSELIASLK 449
+ AYS L ++ K
Sbjct: 350 RTAYSALSSAFK 361
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 95/371 (25%), Positives = 142/371 (38%), Gaps = 27/371 (7%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL 253
G G Y + +G P Y + DTGSD TW+QC +C + L+ P +
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSST- 229
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
Y + C + + C Y ++Y D S S+G A D L L+ +
Sbjct: 230 -YANVSCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD---AVKGF 285
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYM 373
FGC GL + G+LGL R K SLP Q + G V HCL + G GY+
Sbjct: 286 RFGCGERNDGL----FGEAAGLLGLGRGKTSLPVQ--TYGKYGGVFAHCLPARSTGTGYL 339
Query: 374 FLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSY 433
G P+ PML Y+ + I G L + + D+G+
Sbjct: 340 DFGAGSPPA--TTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVI 397
Query: 434 TYFTKQAYSELIASLKEVSSD-GLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF 492
T AYS L ++ + G A+ L C+ F S V + T++L F
Sbjct: 398 TRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCY--DFTGMSQVAI----PTVSLLF 451
Query: 493 GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNV 552
+ + G + +CL G+E G I+G+ L+ V YD
Sbjct: 452 QGGAAL-----DVDASGIMYTVSASQVCL-AFAGNE-DGGDVGIVGNTQLKTFGVAYDIG 504
Query: 553 NKRIGWAKSHC 563
K +G++ C
Sbjct: 505 KKVVGFSPGAC 515
>gi|223946655|gb|ACN27411.1| unknown [Zea mays]
Length = 378
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 88/327 (26%), Positives = 137/327 (41%), Gaps = 36/327 (11%)
Query: 244 LYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDEL 299
+Y+P LP LC + PG Q C Y I+Y +++++S G+L D L
Sbjct: 8 IYRPAESTTSRHLPCSHELCQSV-----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 62
Query: 300 HLTI-ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNV 358
HL E+ +V+ GC Q G L+ + DG+LGL A +S+PS LA G+++N
Sbjct: 63 HLNYREDHVPVNASVIIGCGQKQSGDYLDGIAP-DGLLGLGMADISVPSFLARAGLVQNS 121
Query: 359 VGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELY-HTEILKINYGSSPLNLGA 417
C ++ G +F G VPS +PF+ LY + +N S +
Sbjct: 122 FSMCFKEDS--SGRIFFGDQGVPS-------QQSTPFVPLYGKLQTYAVNVDKSCIGHKC 172
Query: 418 RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRS 477
AL D+G+S+T Y + + + D T C+ A P+
Sbjct: 173 LEGTSFKALVDSGTSFTSLPFDVYKAFTMEFDK-QMNATRVPYEDTTWKYCYSAS-PLE- 229
Query: 478 IVDVKQFFKTLTLHFGSKWQIVSTKFHIS-PEGYLVISKKGNICLGILDGSEVHNGSTII 536
+ DV TLT Q V+ + +G L CL +L +E I
Sbjct: 230 MPDVPTI--TLTFAADKSLQAVNPILPFNDKQGAL-----AGFCLAVLPSTEPIG----I 278
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ L G VV+D + ++GW +S C
Sbjct: 279 IAQNFLVGYHVVFDRESMKLGWYRSEC 305
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 96/390 (24%), Positives = 169/390 (43%), Gaps = 62/390 (15%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G + + +GNP Y +DTGSDL W QC PC+ C P++ P + +
Sbjct: 105 GEFLMELSIGNPAVKYSAIVDTGSDLIWTQC-KPCTECFDQPTPIFDPEKSSSYSKVGCS 163
Query: 257 DSLCMEIQRNHKPGYC-ETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
LC + R++ C E C+Y Y D+SS+ G+LA + EN + + F
Sbjct: 164 SGLCNALPRSN----CNEDKDACEYLYTYGDYSSTRGLLATETFTFEDEN---SISGIGF 216
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT--NAGGGGYM 373
GC + +G + + G++GL R +SL SQL + +CLT+ ++ +
Sbjct: 217 GCGVENEG---DGFSQGSGLVGLGRGPLSLISQLK-----ETKFSYCLTSIEDSEASSSL 268
Query: 374 FLG---HDLVPSWGMAW-------VPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQV 422
F+G +V G + + +L +P Y+ E+ I G+ L++ ++
Sbjct: 269 FIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFEL 328
Query: 423 -----GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVL---DASDPTLPVCWRAKFP 474
G + D+G++ TY + A+ LKE + + L D+ L +C++
Sbjct: 329 AEDGTGGMIIDSGTTITYLEETAFK----VLKEEFTSRMSLPVDDSGSTGLDLCFKLPDA 384
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGILDGSEVHNGS 533
++I K F HF + E Y+V S G +CL + GS NG
Sbjct: 385 AKNIAVPKMIF-----HF------KGADLELPGENYMVADSSTGVLCLAM--GS--SNGM 429
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ I G++ + V++D + + + + C
Sbjct: 430 S-IFGNVQQQNFNVLHDLEKETVSFVPTEC 458
>gi|3036792|emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
Length = 335
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 73/247 (29%), Positives = 113/247 (45%), Gaps = 33/247 (13%)
Query: 219 MDTGSDLTWIQCDAPCSSCA--KGAN-------PLYKPRMGNI---LPYKDSLCMEIQRN 266
+DTGSDL W+ CD C CA +GA +Y P++ + +SLC QRN
Sbjct: 4 LDTGSDLFWVPCD--CGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCA--QRN 59
Query: 267 HKPGYCETCQQCDYEIEYAD-HSSSMGVLARDELHLTIE--NGSLTKPNVVFGCAYDQQG 323
G T C Y + Y +S+ G+L D +HLT E N + V FGC Q G
Sbjct: 60 QCLG---TFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSG 116
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSW 383
L+ + +G+ GL K+S+PS LA +G++ + C G G + G S
Sbjct: 117 SFLD-IAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCF--GHDGVGRISFGDK--GSS 171
Query: 384 GMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSE 443
P +P Y+ + ++ G++ ++ ALFDTG+S+TY Y+
Sbjct: 172 DQEETPFNLNPSHPNYNITVTRVRVGTTLID------DEFTALFDTGTSFTYLVDPMYTT 225
Query: 444 LIASLKE 450
+ S ++
Sbjct: 226 VSESAQD 232
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 95/374 (25%), Positives = 142/374 (37%), Gaps = 33/374 (8%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI- 252
G G Y + +G P Y + DTGSD TW+QC +C + L+ P +
Sbjct: 175 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 234
Query: 253 --LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
+ C ++ + G C Y ++Y D S S+G A D L L+ +
Sbjct: 235 ANVSCAAPACSDLDVSGCSG-----GHCLYGVQYGDGSYSIGFFAMDTLTLSSYD---AV 286
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
FGC GL + G+LGL R K SLP Q + G V HCL + G
Sbjct: 287 KGFRFGCGERNDGL----FGEAAGLLGLGRGKTSLPVQ--TYGKYGGVFAHCLPARSTGT 340
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTG 430
GY+ G P+ PML Y+ + I G L + + D+G
Sbjct: 341 GYLDFGAGSPPA--TTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSG 398
Query: 431 SSYTYFTKQAYSELIASLKEVSSD-GLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLT 489
+ T AYS L ++ + G A+ L C+ F S V + T++
Sbjct: 399 TVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCY--DFTGMSQVAI----PTVS 452
Query: 490 LHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVY 549
L F + G + +CL G+E G I+G+ L+ V Y
Sbjct: 453 LLFQG-----GAALDVDASGIMYTVSASQVCL-AFAGNE-DGGDVGIVGNTQLKTFGVAY 505
Query: 550 DNVNKRIGWAKSHC 563
D K +G++ C
Sbjct: 506 DIGKKVVGFSPGAC 519
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 103/395 (26%), Positives = 168/395 (42%), Gaps = 64/395 (16%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYK 256
G YF + VGNPPR + L +DTGSDLTW+QC PC +C + P++ P I+P
Sbjct: 85 GEYFMDVFVGNPPRHFLLIIDTGSDLTWLQC-KPCKACFDQSGPVFDPSQSTSFKIIPCN 143
Query: 257 DSLC-MEIQRNHKPGYCETC-QQCDYEIEYADHSSSMGVLARDELHLTIEN--GSLTKPN 312
+ C + + + +T + C Y Y D S + G LA + L +++ + SL +
Sbjct: 144 AAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRD 203
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TTNAGG 369
+V GC + +GL GL + +S PSQL S I ++ +CL T N
Sbjct: 204 MVIGCGHSNKGLFQGAGGLL----GLGQGALSFPSQLRSSPIGQS-FSYCLVDRTNNLSV 258
Query: 370 GGYMFLGHDLVPSW---GMAWVPML--DSPFMELYHTEILKINYGSSPLNLGARNSQV-- 422
+ G S M + P + ++ Y+ I I L + A +
Sbjct: 259 SSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIAT 318
Query: 423 ---GWALFDTGSSYTYFTKQAYSELIAS-LKEVSSDGLVLDASDP--TLPVCWRAK---- 472
G + D+G++ TY + AY + ++ L +S +DP L +C+ A
Sbjct: 319 NGSGGTIIDSGTTLTYLNRDAYRAVESAFLARIS-----YPRADPFDILGICYNATGRAA 373
Query: 473 --FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGIL--DGSE 528
FP SIV G++ + + I P+ ++ CL IL DG
Sbjct: 374 VPFPALSIV----------FQNGAELDLPQENYFIQPD-----PQEAKHCLAILPTDGMS 418
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G+ + +YD + R+G+A + C
Sbjct: 419 -------IIGNFQQQNIHFLYDVQHARLGFANTDC 446
>gi|413952262|gb|AFW84911.1| hypothetical protein ZEAMMB73_904583 [Zea mays]
Length = 312
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 74/264 (28%), Positives = 120/264 (45%), Gaps = 21/264 (7%)
Query: 304 ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
E + + ++VFGC+ Q G L DGI G + ++S+ SQL S G+ V HCL
Sbjct: 10 EQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL 69
Query: 364 TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN---LGARNS 420
+ GGG + LG + P G+ + P++ S + E + +N P++ N+
Sbjct: 70 KGSDNGGGILVLGEIVEP--GLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNT 127
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
Q + D+G++ Y AY ++++ S S +L F S VD
Sbjct: 128 Q--GTIVDSGTTLAYLADGAYDPFVSAIAAAVS------PSVRSLVSKGSQCFITSSSVD 179
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI-ILGD 539
F T+TL+F + + PE YL+ + + G + + G I ILGD
Sbjct: 180 SS--FPTVTLYF-----MGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGD 232
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
+ L+ ++ VYD N R+GWA C
Sbjct: 233 LVLKDKIFVYDLANMRMGWADYDC 256
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 117/447 (26%), Positives = 188/447 (42%), Gaps = 57/447 (12%)
Query: 139 RDAEFKLGRFVDLDGESVVASVNDGI-IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIY 197
+ E G+ +DL + A V D I ++ + KI K + SS S + PL I
Sbjct: 71 KHRELCSGKTIDLGKKMRRALVLDNIRVQSLQLKI-KAMTSSTTEQSVSETQIPLTSGIK 129
Query: 198 PDGL-YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYK 256
+ L Y + +G + L +DTGSDLTW+QC PC SC PLY P + + YK
Sbjct: 130 LESLNYIVTVELGG--KNMSLIVDTGSDLTWVQCQ-PCRSCYNQQGPLYDPSVSS--SYK 184
Query: 257 DSLC--------MEIQRNHKP-----GYCETCQQCDYEIEYADHSSSMGVLARDELHLTI 303
C + N P G +T C+Y + Y D S + G LA + + L
Sbjct: 185 TVFCNSSTCQDLVAATSNSGPCGGNNGVVKT--PCEYVVSYGDGSYTRGDLASESILL-- 240
Query: 304 ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
G N VFGC + +GL + L R+ VSL SQ + V +CL
Sbjct: 241 --GDTKLENFVFGCGRNNKGLFGGSSGLMG----LGRSSVSLVSQ--TLKTFNGVFSYCL 292
Query: 364 -TTNAGGGGYMFLGHD---LVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARN 419
+ G G + G+D S +++ P++ +P + ++ IL + G+S + ++
Sbjct: 293 PSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFY--ILNLT-GASIGGVELKS 349
Query: 420 SQVGWA-LFDTGSSYTYFTKQAYSEL-IASLKEVSSDGLVLDASDPTLPVCWR-AKFPIR 476
S G L D+G+ T Y + I LK+ S G L C+ +
Sbjct: 350 SSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFS--GFPTAPGYSILDTCFNLTSYEDI 407
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
SI +K F+ + + + + + P+ LV CL + S + I
Sbjct: 408 SIPIIKMIFQG---NAELEVDVTGVFYFVKPDASLV-------CLAL--ASLSYENEVGI 455
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G+ + Q V+YD+ +R+G +C
Sbjct: 456 IGNYQQKNQRVIYDSTQERLGIVGENC 482
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 117/447 (26%), Positives = 187/447 (41%), Gaps = 57/447 (12%)
Query: 139 RDAEFKLGRFVDLDGESVVASVNDGI-IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIY 197
+ E G+ +DL + A V D I ++ + KI K + SS S + PL I
Sbjct: 71 KHRELCSGKTIDLGKKMRRALVLDNIRVQSLQLKI-KAMTSSTTEQSVSETQIPLTSGIK 129
Query: 198 PDGL-YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYK 256
+ L Y + +G + L +DTGSDLTW+QC PC SC PLY P + + YK
Sbjct: 130 LESLNYIVTVELGG--KNMSLIVDTGSDLTWVQCQ-PCRSCYNQQGPLYDPSVSS--SYK 184
Query: 257 DSLC--------MEIQRNHKP-----GYCETCQQCDYEIEYADHSSSMGVLARDELHLTI 303
C + N P G +T C+Y + Y D S + G LA + + L
Sbjct: 185 TVFCNSSTCQDLVAATSNSGPCGGNNGVVKT--PCEYVVSYGDGSYTRGDLASESILL-- 240
Query: 304 ENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
G N VFGC + +GL + L R+ VSL SQ + V +CL
Sbjct: 241 --GDTKLENFVFGCGRNNKGLFGGSSGLMG----LGRSSVSLVSQ--TLKTFNGVFSYCL 292
Query: 364 -TTNAGGGGYMFLGHD---LVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARN 419
+ G G + G+D S +++ P++ +P + ++ IL + G+S + ++
Sbjct: 293 PSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFY--ILNLT-GASIGGVELKS 349
Query: 420 SQVGWA-LFDTGSSYTYFTKQAYSEL-IASLKEVSSDGLVLDASDPTLPVCWR-AKFPIR 476
S G L D+G+ T Y + I LK+ S G L C+ +
Sbjct: 350 SSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFS--GFPTAPGYSILDTCFNLTSYEDI 407
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
SI +K F+ + + + + + P+ LV CL + S + I
Sbjct: 408 SIPIIKMIFQG---NAELEVDVTGVFYFVKPDASLV-------CLAL--ASLSYENEVGI 455
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G+ + Q V+YD +R+G +C
Sbjct: 456 IGNYQQKNQRVIYDTTQERLGIVGENC 482
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 97/377 (25%), Positives = 147/377 (38%), Gaps = 39/377 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + VG PP DTGSD+ W QC PCS+C + P++ P YK+
Sbjct: 81 GEYLVEISVGTPPFSIVAVADTGSDVIWTQCK-PCSNCYQQNAPMFDPSKSTT--YKNVA 137
Query: 260 CME--IQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVVFG 316
C + C +C Y I Y D S S G LA D + + +G + P V G
Sbjct: 138 CSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVAFPRTVIG 197
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQL--ASQGIIKNVVGHCLTTNAGGGGYMF 374
C +D G GI+GL R SL +QL A+ G + T + +
Sbjct: 198 CGHDNAGTF---NANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTGSTNDSTKLN 254
Query: 375 LGHDL-VPSWGMAWVPMLDSP-FMELYHTEILKINYGSSPLNLGARNSQVGWA---LFDT 429
G + V G P+ S + Y ++ ++ G + N S++G + D+
Sbjct: 255 FGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLGGESNIIIDS 314
Query: 430 GSSYTYFTKQAYSELIASLKEVSSDGLVL-DASDPT--LPVCWRAKFPIRSIVDVKQFFK 486
G++ TY S L+ S S + L A DP+ L C+ + D +
Sbjct: 315 GTTLTYLP----SALLNSFGSAISQSMSLPHAQDPSEFLDYCF------ATTTDDYE-MP 363
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
+T+HF + E V ICL + + I G+I+ L
Sbjct: 364 PVTMHFEGA------DVPLQRENLFVRLSDDTICLAF---GSFPDDNIFIYGNIAQSNFL 414
Query: 547 VVYDNVNKRIGWAKSHC 563
V YD N + + +HC
Sbjct: 415 VGYDIKNLAVSFQPAHC 431
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 104/391 (26%), Positives = 172/391 (43%), Gaps = 56/391 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYK 256
G YF + VGNPPR + L +DTGSDLTW+QC PC +C + P++ P I+P
Sbjct: 169 GEYFMDVFVGNPPRHFLLIIDTGSDLTWLQC-KPCKACFDQSGPVFDPSQSTSFKIIPCN 227
Query: 257 DSLC-MEIQRNHKPGYCETC-QQCDYEIEYADHSSSMGVLARDELHLTIEN--GSLTKPN 312
+ C + + + +T + C Y Y D S + G LA + L +++ + SL +
Sbjct: 228 AAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRD 287
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TTNAGG 369
+V GC + +GL GL + +S PSQL S I ++ +CL T N
Sbjct: 288 MVIGCGHSNKGLFQGAGGLL----GLGQGALSFPSQLRSSPIGQS-FSYCLVDRTNNLSV 342
Query: 370 GGYMFLGHDLVPSW---GMAWVPML--DSPFMELYHTEILKINYGSSPLNLGARNSQV-- 422
+ G S M + P + ++ Y+ I I L + A +
Sbjct: 343 SSAISFGAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIAP 402
Query: 423 ---GWALFDTGSSYTYFTKQAYSELIAS-LKEVSSDGLVLDASDP--TLPVCWRAKFPIR 476
G + D+G++ TY + AY + ++ L +S +DP L +C+ A R
Sbjct: 403 NGSGGTIIDSGTTLTYLNRDAYRAVESAFLARIS-----YPRADPFDILGICYNATG--R 455
Query: 477 SIVDVKQFFKTLTLHF--GSKWQIVSTKFHISPEGYLVISKKGNICLGIL--DGSEVHNG 532
+ V F TL++ F G++ + + I P+ ++ CL IL DG
Sbjct: 456 TAVP----FPTLSIVFQNGAELDLPQENYFIQPD-----PQEAKHCLAILPTDGMS---- 502
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G+ + +YD + R+G+A + C
Sbjct: 503 ---IIGNFQQQNIHFLYDVQHARLGFANTDC 530
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 99/373 (26%), Positives = 151/373 (40%), Gaps = 61/373 (16%)
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDSLCMEIQRNHKPGY---- 271
+DT S+LTW+QC APC SC PL+ P +P C +Q+ G
Sbjct: 158 VDTASELTWVQC-APCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQQLATGAGAGA 216
Query: 272 --CETCQ--QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLN 327
C+ + C Y + Y D S S GVLA D L L E VFGC QG
Sbjct: 217 PPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSLAGE----VIDGFVFGCGTSNQGPPFG 272
Query: 328 TLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC--LTTNAGGGGYMFLGHD------L 379
T G++GL R+++SL SQ Q V +C L+ + G + LG D
Sbjct: 273 ---GTSGLMGLGRSQLSLVSQTVDQ--FGGVFSYCLPLSRESDASGSLVLGDDPSAYRNS 327
Query: 380 VPSWGMAWV----PMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTY 435
P + V P+L PF Y + I G + +++ A+ D+G+ T
Sbjct: 328 TPVVYTSMVSNSDPLLQGPF---YLVNLTGITVGGQEVESTGFSAR---AIVDSGTVITS 381
Query: 436 FTKQAY----SELIASLKEV-SSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTL 490
Y +E ++ L E + G + L C F + + +V+ TL
Sbjct: 382 LVPSVYNAVRAEFMSQLAEYPQAPGFSI------LDTC----FNMTGLKEVQVPSLTLVF 431
Query: 491 HFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYD 550
G++ ++ S Y V S +CL + S T I+G+ + VV+D
Sbjct: 432 DGGAEVEVDSGGVL-----YFVSSDSSQVCLAV--ASLKSEDETSIIGNYQQKNLRVVFD 484
Query: 551 NVNKRIGWAKSHC 563
++G+A+ C
Sbjct: 485 TSASQVGFAQETC 497
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 95/387 (24%), Positives = 165/387 (42%), Gaps = 55/387 (14%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
+ G G YF+ + VG P +P+Y+ +DTGSD+ W+QC PC+ C + +P++ PR +
Sbjct: 145 ISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPRSSS 203
Query: 252 ---ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSL 308
LP + C ++ + C +C Y++ Y D S ++G + LT N +
Sbjct: 204 SFASLPCESQQCQALETSG----CRA-SKCLYQVSYGDGSFTVGEFVIET--LTFGNSGM 256
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
NV GC +D +GL V + G+LGL +SL SQ+ + +CL
Sbjct: 257 IN-NVAVGCGHDNEGL----FVGSAGLLGLGGGSLSLTSQMKASSF-----SYCLVDRDS 306
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYG----SSPLNL-GARNSQV 422
+ PS + P+L S ++ Y+ + ++ G S P NL +S
Sbjct: 307 SSSSDLEFNSAAPSDSVN-APLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGY 365
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVS-----SDGLVLDASDPTLPVCWRAKFPIRS 477
G + D+G++ T QAY+ L + + ++G L C+ R
Sbjct: 366 GGIIVDSGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFAL------FDTCYDLSSQSRV 419
Query: 478 IVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTII 536
+ T++ F + + P+ YL+ + G C + S I
Sbjct: 420 TI------PTVSFEFAGGKSL-----QLPPKNYLIPVDSVGTFCFAFAPTTS----SLSI 464
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G++ +G V YD N +G++ C
Sbjct: 465 IGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 100/392 (25%), Positives = 163/392 (41%), Gaps = 51/392 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR----MGNILPY 255
G YF + VG PP+ + L +DTGSDL WIQC PC +C + P Y P+ NI +
Sbjct: 193 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQC-VPCYACFEQNGPYYDPKDSSSFKNITCH 251
Query: 256 KDSLCMEIQRNHKPGYCE-TCQQCDYEIEYADHSSSMGVLARD--ELHLTIENGSLTKP- 311
D C + P C+ Q C Y Y D S++ G A + ++LT G KP
Sbjct: 252 -DPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEG---KPE 307
Query: 312 -----NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-- 364
NV+FGC + +GL L R +S +QL Q + + +CL
Sbjct: 308 LKIVENVMFGCGHWNRGLFHGAAGLLG----LGRGPLSFATQL--QSLYGHSFSYCLVDR 361
Query: 365 -TNAGGGGYMFLGHDLV----PSWGM-AWVPMLDSPFMELYHTEILKINYGSSPLNLGAR 418
+N+ + G D P+ ++V ++P Y+ I I G L +
Sbjct: 362 NSNSSVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEE 421
Query: 419 NSQV-----GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKF 473
+ G + D+G++ TYF + AY E+I G L + P L C+
Sbjct: 422 TWHLSAQGGGGTIIDSGTTLTYFAEPAY-EIIKEAFMRKIKGFPLVETFPPLKPCYNVSG 480
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGS 533
+ +++ +F + G+ W + ++ I + +CL IL +
Sbjct: 481 VEK--MELPEF--AILFADGAMWDFPVENY------FIQIEPEDVVCLAILG---TPRSA 527
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
I+G+ + ++YD R+G+A C +
Sbjct: 528 LSIIGNYQQQNFHILYDLKKSRLGYAPMKCAD 559
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 101/383 (26%), Positives = 150/383 (39%), Gaps = 84/383 (21%)
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDSLC----MEIQRNHKPGY 271
+DT S+LTW+QC PC SC +PL+ P +P S C + + P
Sbjct: 135 VDTASELTWVQCQ-PCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALRVAMAAGTSPCA 193
Query: 272 CETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTL 329
+ QQ C Y + Y D S S GVLARD+L L ++ VFGC QG
Sbjct: 194 DDNEQQPACSYALSYRDGSYSRGVLARDKLRLAGQD----IEGFVFGCGTSNQGAPFG-- 247
Query: 330 VKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHDLVP---SWGM 385
T G++GL R+ VSL SQ Q V +CL +G G + LG D S +
Sbjct: 248 -GTSGLMGLGRSHVSLVSQTMDQ--FGGVFSYCLPMRESGSSGSLVLGDDSSAYRNSTPI 304
Query: 386 AWVPM------LDSPFMELYHTEILKINYGSSPLNLGARNSQVGW-----ALFDTGSSYT 434
+ M L PF L T I +G + + W + D+G+ T
Sbjct: 305 VYTAMVSDSGPLQGPFYFLNLTGI----------TVGGQEVESPWFSAGRVIIDSGTIIT 354
Query: 435 YFTKQAY----SELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTL 490
Y +E ++ L E + P SI+D L
Sbjct: 355 TLVPSVYNAVRAEFLSQLAEY-------------------PQAPAFSILDT-----CFNL 390
Query: 491 HFGSKWQIVSTKF--------HISPEG--YLVISKKGNICLGILDGSEVHNGSTIILGDI 540
+ Q+ S KF + +G Y V S +CL + ++ T I+G+
Sbjct: 391 TGLKEVQVPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYD--TSIIGNY 448
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+ V++D + +IG+A+ C
Sbjct: 449 QQKNLRVIFDTLGSQIGFAQETC 471
>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
Length = 671
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 79/279 (28%), Positives = 128/279 (45%), Gaps = 35/279 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP--------LYKPRMGNI 252
L++ + +G P + + +DTGSDL W+ CD C CA +P +Y P
Sbjct: 34 LHYAVVALGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQSTT 91
Query: 253 ---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTIENGS- 307
+P +LC ++Q + C Y I+Y +D++SS GVL D L+LT ++
Sbjct: 92 SRKVPCSSNLC-DLQNACR----SKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQS 146
Query: 308 --LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
+T P ++FGC Q G L + +G+LGL S+PS LAS+G+ N C
Sbjct: 147 KIVTAP-IMFGCGQVQTGSFLGSAAP-NGLLGLGMDSKSVPSLLASKGLAANSFSMCF-- 202
Query: 366 NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA 425
G G+ + S P+ Y+ I I GS + S A
Sbjct: 203 --GDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSI------STEFSA 254
Query: 426 LFDTGSSYTYFTKQAYSELIASL-KEVSSDGLVLDASDP 463
+ D+G+S+T + Y+++ +S ++ S +LD+S P
Sbjct: 255 IVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMP 293
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 99/388 (25%), Positives = 155/388 (39%), Gaps = 46/388 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y + VG PPR + + MDTGSDL W+QC APC C P++ P + +
Sbjct: 149 GEYLMDVYVGTPPRRFRMIMDTGSDLNWLQC-APCLDCFDQVGPVFDPAASSSYRNVTCG 207
Query: 257 DSLCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDE--LHLTIENGSLTKPN 312
D C + P C + C Y Y D S++ G LA + ++LT S +
Sbjct: 208 DQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDD 267
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY 372
VVFGC + +GL L R +S SQL + + + +CL +
Sbjct: 268 VVFGCGHWNRGLFHGAAGLLG----LGRGPLSFASQL--RAVYGHTFSYCLVDHGSDVAS 321
Query: 373 MFLGHDLVPSWGMAWVPMLD--------SPFMELYHTEILKINYGSSPLNLGARN----- 419
+ + A P L+ SP Y+ ++ + G LN+ +
Sbjct: 322 KVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWGVGE 381
Query: 420 --SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRS 477
G + D+G++ +YF + AY + + + L P L C+ S
Sbjct: 382 GEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDFPVLSPCYNV-----S 436
Query: 478 IVDVKQFFKTLTLHF--GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI 535
VD + L+L F G+ W + + I + G +CL +L G +I
Sbjct: 437 GVDRPE-VPELSLLFADGAVWDFPAENYFIR------LDPDGIMCLAVL--GTPRTGMSI 487
Query: 536 ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I G+ + VVYD N R+G+A C
Sbjct: 488 I-GNFQQQNFHVVYDLKNNRLGFAPRRC 514
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 98/394 (24%), Positives = 157/394 (39%), Gaps = 36/394 (9%)
Query: 172 INKKLVSSNAVAVDSSSIFPLR-GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQC 230
I+ KL + V ++ P + G G Y + +G+P + L DTGSDLTW +C
Sbjct: 103 IHAKLSDHSGVKETDAAKLPTKSGMSLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARC 162
Query: 231 DAPCSSCAKGANPLYKPRMGNILPYKDSLCME-IQRNHKPGYCETCQQCDYEIEYADHSS 289
A A+ +P N+ LC I P C C Y I+Y D S
Sbjct: 163 SA-----AETFDPTKSTSYANV-SCSTPLCSSVISATGNPSRCAA-STCVYGIQYGDGSY 215
Query: 290 SMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQL 349
S+G L ++ LTI + + N FGC D GL K G+LGL R K+S+ SQ
Sbjct: 216 SIGFLGKER--LTIGSTDIFN-NFYFGCGQDVDGL----FGKAAGLLGLGRDKLSVVSQT 268
Query: 350 ASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYG 409
A + + +CL +++ G FL S + P+ P Y+ ++ I G
Sbjct: 269 APK--YNQLFSYCLPSSSSTG---FLSFGSSQSKSAKFTPLSSGP-SSFYNLDLTGITVG 322
Query: 410 SSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCW 469
L + + D+G+ T AYS L ++ ++ + + L C+
Sbjct: 323 GQKLAIPLSVFSTAGTIIDSGTVVTRLPPAAYSALRSAFRKAMAS-YPMGKPLSILDTCY 381
Query: 470 RAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEV 529
F + V + + + F + + G V + +CL +
Sbjct: 382 --DFSKYKTIKVPK----IVISFSGGVDV-----DVDQAGIFVANGLKQVCLAFAGNTGA 430
Query: 530 HNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ T I G+ R VVYD ++G+A + C
Sbjct: 431 RD--TAIFGNTQQRNFEVVYDVSGGKVGFAPASC 462
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 95/371 (25%), Positives = 142/371 (38%), Gaps = 27/371 (7%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL 253
G G Y + +G P Y + DTGSD TW+QC +C + L+ P +
Sbjct: 172 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSST- 230
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
Y + C + + C Y ++Y D S S+G A D L L+ +
Sbjct: 231 -YANVSCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD---AVKGF 286
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYM 373
FGC GL + G+LGL R K SLP Q + G V HCL + G GY+
Sbjct: 287 RFGCGERNDGL----FGEAAGLLGLGRGKTSLPVQ--TYGKYGGVFAHCLPPRSTGTGYL 340
Query: 374 FLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSY 433
G P+ PML Y+ + I G L + + D+G+
Sbjct: 341 DFGAGSPPA--TTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVI 398
Query: 434 TYFTKQAYSELIASLKEVSSD-GLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF 492
T AYS L ++ + G A+ L C+ F S V + T++L F
Sbjct: 399 TRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCY--DFTGMSQVAI----PTVSLLF 452
Query: 493 GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNV 552
+ + G + +CL G+E G I+G+ L+ V YD
Sbjct: 453 QGGAAL-----DVDASGIMYTVSASQVCL-AFAGNE-DGGDVGIVGNTQLKTFGVAYDIG 505
Query: 553 NKRIGWAKSHC 563
K +G++ C
Sbjct: 506 KKVVGFSPGAC 516
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 103/439 (23%), Positives = 177/439 (40%), Gaps = 52/439 (11%)
Query: 156 VVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPL-----RGNIYPDGLYFTYMIVGN 210
V+A N + + K NK++V++ + L G G YF ++VG+
Sbjct: 104 VLAKKNQNTVSQKQKKKNKEVVTTPVASSVEEQAGQLVATLESGMTLGSGEYFMDVLVGS 163
Query: 211 PPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRN---- 266
PP+ + L +DTGSDL WIQC PC C + Y P+ YK+ C + + N
Sbjct: 164 PPKHFSLILDTGSDLNWIQC-LPCHDCFQQNGAFYDPKAS--ASYKNITCNDPRCNLVSP 220
Query: 267 -HKPGYCET-CQQCDYEIEYADHSSSMGVLARD--ELHLTIENGS---LTKPNVVFGCAY 319
P C++ Q C Y Y D S++ G A + ++LT GS N++FGC +
Sbjct: 221 PDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNVENMMFGCGH 280
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-----TNAGGGGYMF 374
+GL L R +S SQL Q + + +CL TN
Sbjct: 281 WNRGLFHGAAGLLG----LGRGPLSFSSQL--QSLYGHSFSYCLVDRNSDTNVSSKLIFG 334
Query: 375 LGHDLVPSWGM---AWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV-----GWAL 426
DL+ + ++V ++ Y+ +I I LN+ + G +
Sbjct: 335 EDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNISSDGAGGTI 394
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
D+G++ +YF + AY + + E + + P L C F + I ++
Sbjct: 395 IDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPC----FNVSGIDSIQLPEL 450
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
+ G+ W + E + + +CL IL + + I+G+ +
Sbjct: 451 GIAFADGAVWNFPT-------ENSFIWLNEDLVCLAILGTPK---SAFSIIGNYQQQNFH 500
Query: 547 VVYDNVNKRIGWAKSHCMN 565
++YD R+G+A + C +
Sbjct: 501 ILYDTKRSRLGYAPTKCAD 519
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 75/252 (29%), Positives = 113/252 (44%), Gaps = 22/252 (8%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCM 261
+ + +GNPP Y+ +DTGSDL WIQC+ PC C K +P+Y + Y + LC
Sbjct: 93 FLANLSIGNPPTNVYVVLDTGSDLFWIQCE-PCDVCYKQKDPIYNRTKSD--SYTEMLCN 149
Query: 262 E--IQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVVFGCA 318
E + G C C Y+ YAD + + G+L+ +++ T K V FGC
Sbjct: 150 EPPCVSLGREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTAQVGFGCG 209
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL--TTNAGGGGYMFLG 376
Q L T + G+LGL VSL SQL++ G + +C +N GG++ G
Sbjct: 210 L--QNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPNAGGFLVFG 267
Query: 377 HDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV-------GWALFDT 429
+ M PM+ E Y+ +L I G L +S G + D+
Sbjct: 268 DATYLNGDM--TPMV---IAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIIDS 322
Query: 430 GSSYTYFTKQAY 441
GS+ + F + Y
Sbjct: 323 GSTLSVFPPEVY 334
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 68/262 (25%), Positives = 111/262 (42%), Gaps = 21/262 (8%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF + VG+PPR Y+ +D+GSD+ W+QC PCS C + ++P++ P +
Sbjct: 141 GEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCK-PCSRCYQQSDPVFDPADSSSFAGVSCG 199
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
R G C +C YE+ Y D S + G LA + L + G + +V GC +
Sbjct: 200 SDVCDRLENTG-CNA-GRCRYEVSYGDGSYTKGTLALETLTV----GQVMIRDVAIGCGH 253
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA-GGGGYMFLGHD 378
QG+ + L +S QL Q +CL + G G + G
Sbjct: 254 TNQGMFIGAAGLLG----LGGGSMSFIGQLGGQ--TGGAFSYCLVSRGTGSTGALEFGRG 307
Query: 379 LVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQV-----GWALFDTGSS 432
+P G W+ ++ +P Y+ + I G +++ Q+ + DTG++
Sbjct: 308 ALP-VGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTGTA 366
Query: 433 YTYFTKQAYSELIASLKEVSSD 454
T F AY S +S+
Sbjct: 367 VTRFPTAAYVAFRDSFTAQTSN 388
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 98/390 (25%), Positives = 167/390 (42%), Gaps = 62/390 (15%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G + + +GNP Y +DTGSDL W QC PC+ C P++ P + +
Sbjct: 106 GEFLMELSIGNPAVKYAAIVDTGSDLIWTQC-KPCTECFDQPTPIFDPEKSSSYSKVGCS 164
Query: 257 DSLCMEIQRNHKPGYC-ETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
LC + R++ C E C+Y Y D+SS+ G+LA + EN + + F
Sbjct: 165 SGLCNALPRSN----CNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDEN---SISGIGF 217
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT--NAGGGGYM 373
GC + +G + + G++GL R +SL SQL + +CLT+ ++ +
Sbjct: 218 GCGVENEG---DGFSQGSGLVGLGRGPLSLISQLK-----ETKFSYCLTSIEDSEASSSL 269
Query: 374 FLG---HDLVPSWG----------MAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNS 420
F+G +V G M+ + D P Y+ E+ I G+ L++
Sbjct: 270 FIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQP--SFYYLELQGITVGAKRLSVEKSTF 327
Query: 421 QV-----GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT-LPVCWRAKFP 474
++ G + D+G++ TY + A+ L S L +D S T L +C++
Sbjct: 328 ELSEDGTGGMIIDSGTTITYLEETAFKVLKEEF--TSRMSLPVDDSGSTGLDLCFKLPNA 385
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGILDGSEVHNGS 533
++I K F HF + E Y+V S G +CL + GS NG
Sbjct: 386 AKNIAVPKLIF-----HF------KGADLELPGENYMVADSSTGVLCLAM--GS--SNGM 430
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ I G++ + V++D + + + + C
Sbjct: 431 S-IFGNVQQQNFNVLHDLEKETVTFVPTEC 459
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 94/382 (24%), Positives = 163/382 (42%), Gaps = 52/382 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYK 256
G + + +G PP +DTGSDL WIQC APC C K P++ P N +
Sbjct: 66 GQHLMEIYIGTPPIKITGLVDTGSDLIWIQC-APCLGCYKQIKPMFDPLKSSTYNNISCD 124
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVVF 315
LC ++ G C ++C+Y Y D+S + GVLA+D T G ++ +F
Sbjct: 125 SPLCHKLDT----GVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVSLSRFLF 180
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLA--------SQGIIKNVVGHCLTTNA 367
GC ++ G + + G++GL SL SQ+ SQ ++ + +++
Sbjct: 181 GCGHNNTGGFNDHEM---GLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRM 237
Query: 368 G-GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWA- 425
G G LG+ G+ P++ Y +L I+ + + NS +G A
Sbjct: 238 SFGKGSQVLGN------GVVTTPLVPREKDTSYFVTLLGISVEDTYFPM---NSTIGKAN 288
Query: 426 -LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTL--PVCWRAKFPIRSIVDVK 482
L D+G+ +Q Y ++ A ++ + + D DP+L +C+R + ++
Sbjct: 289 MLVDSGTPPILLPQQLYDKVFAEVRNKVALKPITD--DPSLGTQLCYRTQTNLKG----- 341
Query: 483 QFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
TLT HF G+ + + I P KG CL I + + G + G+ +
Sbjct: 342 ---PTLTFHFVGANVLLTPIQTFIPPTP----QTKGIFCLAIYNRTNSDPG---VYGNFA 391
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
L+ +D + + + + C
Sbjct: 392 QSNYLIGFDLDRQVVSFKPTDC 413
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 86.7 bits (213), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 94/387 (24%), Positives = 165/387 (42%), Gaps = 55/387 (14%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
+ G G YF+ + VG P +P+Y+ +DTGSD+ W+QC PC+ C + +P++ PR +
Sbjct: 145 ISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPRSSS 203
Query: 252 ---ILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSL 308
LP + C ++ + C +C Y++ Y D S ++G + LT N +
Sbjct: 204 SFASLPCESQQCQALETSG----CRA-SKCLYQVSYGDGSFTVGEFVTET--LTFGNSGM 256
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
+V GC +D +GL V + G+LGL +SL SQ+ + +CL
Sbjct: 257 IN-DVAVGCGHDNEGL----FVGSAGLLGLGGGPLSLTSQMKASSF-----SYCLVDRDS 306
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYG----SSPLNL-GARNSQV 422
+ PS + P+L S ++ Y+ + ++ G S P NL +S
Sbjct: 307 SSSSDLEFNSAAPSDSVN-APLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGY 365
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVS-----SDGLVLDASDPTLPVCWRAKFPIRS 477
G + D+G++ T QAY+ L + + ++G L C+ R
Sbjct: 366 GGIIVDSGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFAL------FDTCYDLSSQSRV 419
Query: 478 IVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTII 536
+ T++ F + + P+ YL+ + G C + S I
Sbjct: 420 TI------PTVSFEFAGGKSL-----QLPPKNYLIPVDSVGTFCFAFAPTTS----SLSI 464
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G++ +G V YD N +G++ C
Sbjct: 465 IGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 101/367 (27%), Positives = 147/367 (40%), Gaps = 60/367 (16%)
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDSLC--MEIQRNHKPGYCE 273
+DT S+LTW+QC APC+SC PL+ P +LP S C +++ G C
Sbjct: 142 VDTASELTWVQC-APCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 200
Query: 274 TCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVK 331
+Q C Y + Y D S S GVLA D+L L E VFGC QG
Sbjct: 201 GGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGE----VIDGFVFGCGTSNQG----PFGG 252
Query: 332 TDGILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHDLVP---SWGMAW 387
T G++GL R+++SL SQ Q V +CL + G + LG D S + +
Sbjct: 253 TSGLMGLGRSQLSLISQTMDQ--FGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPIVY 310
Query: 388 VPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIA 446
M+ P Y + I G + S G + D+G+ T Y+ + A
Sbjct: 311 TTMVSDPVQGPFYFVNLTGITIGGQEV-----ESSAGKVIVDSGTIITSLVPSVYNAVKA 365
Query: 447 SLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKF--- 503
L A P + P SI+D L + QI S KF
Sbjct: 366 EF-------LSQFAEYP--------QAPGFSILDT-----CFNLTGFREVQIPSLKFVFE 405
Query: 504 -----HISPEG--YLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRI 556
+ G Y V S +CL + + T I+G+ + V++D + +I
Sbjct: 406 GNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYE--TSIIGNYQQKNLRVIFDTLGSQI 463
Query: 557 GWAKSHC 563
G+A+ C
Sbjct: 464 GFAQETC 470
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 87/368 (23%), Positives = 141/368 (38%), Gaps = 31/368 (8%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y + +G P + + DTGSD TW+QC + C + PL+ P +
Sbjct: 163 GNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSATYANISCT 222
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
S C ++ G C Y ++Y D S ++G A+D L L G T + FG
Sbjct: 223 SSYCSDLDTRGCSG-----GHCLYAVQYGDGSYTVGFYAQDTLTL----GYDTVKDFRFG 273
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C +GL K G++GL R K S+P Q + V +C+ + G G++ G
Sbjct: 274 CGEKNRGL----FGKAAGLMGLGRGKTSVPVQAYDK--YSGVFAYCIPATSSGTGFLDFG 327
Query: 377 HDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYF 436
+ PML Y+ + I G L++ A AL D+G+ T
Sbjct: 328 PGAPAAANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITRL 387
Query: 437 TKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSK 495
AY L ++ K + G + L C+ D+ + ++ L S
Sbjct: 388 PPSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCY----------DLTGYQGSIALPAVSL 437
Query: 496 WQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKR 555
+ G L ++ CL + + I+G+ + V+YD K
Sbjct: 438 VFQGGACLDVDASGILYVADVSQACLAFAANDD--DTDMTIVGNTQQKTYSVLYDLGKKV 495
Query: 556 IGWAKSHC 563
+G+A C
Sbjct: 496 VGFAPGAC 503
>gi|213998848|gb|ACJ60790.1| nucellin [Psathyrostachys stoloniformis]
Length = 154
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 51/146 (34%), Positives = 79/146 (54%), Gaps = 9/146 (6%)
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII-KNVVGHCLTTNAG 368
K N+ FGC Y Q+ + DGILGL K +QL Q +I +NV+GHCL++
Sbjct: 6 KKNIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSK-- 63
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G G +++G P+ G+ WVPM +S F Y + + P+ R + A+FD
Sbjct: 64 GKGVLYVGDFNPPTRGVTWVPMRESLF--YYSPGLAALFIDKQPI----RGNPTFEAVFD 117
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSD 454
+GS+YTY Q Y+EL++ ++ S+
Sbjct: 118 SGSTYTYMPAQIYNELVSKIRGTLSE 143
>gi|449533544|ref|XP_004173734.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like, partial [Cucumis sativus]
Length = 408
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 74/263 (28%), Positives = 120/263 (45%), Gaps = 34/263 (12%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYK---- 256
L++T++ +G P + + +D GSDL W+ C+ C CA + Y ++ Y+
Sbjct: 102 LHYTWIDIGTPSVSFLVALDAGSDLLWVPCN--CIQCAPLSASYYGSLDKDLNEYRPSSS 159
Query: 257 ---------DSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLT--IE 304
+LC Q P Q C Y I+Y +++SS G+L +D LHL+ E
Sbjct: 160 STSKHISCSHNLCDSGQSCQSPK-----QSCPYVIDYITENTSSSGLLIQDVLHLSSGCE 214
Query: 305 NGS--LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC 362
N S + V+ GC Q G L+ V DG+ GL ++S+ S LA + +++N C
Sbjct: 215 NSSNCTIQAPVILGCGMKQSGGYLSG-VAPDGLFGLGLGEISVLSSLAKEELVQNSFSLC 273
Query: 363 LTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV 422
N G G +F G + S LD + E Y + +S L +
Sbjct: 274 F--NEDGSGRIFFGDEGPASQQTTSFVPLDGKY-ETYIVGVEACCIENSCLKQTSFK--- 327
Query: 423 GWALFDTGSSYTYFTKQAYSELI 445
AL D+G+S+TY ++AY ++
Sbjct: 328 --ALIDSGTSFTYLPEEAYENIV 348
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 67/211 (31%), Positives = 98/211 (46%), Gaps = 28/211 (13%)
Query: 165 IRPHKSKINKKLVSSNAVA--VDSSSI-FPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDT 221
I H KL+ N+ + ++I P+ N Y Y + +G PP Y DT
Sbjct: 22 IEAHNGGFTGKLIPRNSSKDFFNRNTIQSPVSANHYD---YLMELSIGTPPVKIYAQADT 78
Query: 222 GSDLTWIQCDAPCSSCAKGANPLYKPR----MGNILPYKDSLCMEIQRNHKPGYCETCQ- 276
GSDL W+QC PC++C K NP++ + NI +S C ++ Y +C
Sbjct: 79 GSDLIWLQC-IPCTNCYKQLNPMFDSQSSSTFSNIACGSES-CSKL-------YSTSCSP 129
Query: 277 ---QCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVVFGCAYDQQGLLLNTLVKT 332
C Y Y D S + GVLA++ L LT G + V+FGC ++ G + K
Sbjct: 130 DQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAFKGVIFGCGHNNNGAFND---KE 186
Query: 333 DGILGLSRAKVSLPSQLASQGIIKNVVGHCL 363
GI+GL R +SL SQ+ S + N+ CL
Sbjct: 187 MGIIGLGRGPLSLVSQIGSS-LGGNMFSQCL 216
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 86.3 bits (212), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 110/391 (28%), Positives = 169/391 (43%), Gaps = 86/391 (21%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS-SCAKGANPLYKPRMGNILPYKDS 258
G+Y++ + +G+PP+ + L MDTGSDLTW++CD PCS C+ + L YK
Sbjct: 1 GVYYSTITLGSPPKDFSLVMDTGSDLTWVRCD-PCSPDCSSTFDRLASNT------YKAL 53
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLT-IENGSLTK-PNVVFG 316
C + DY Y D S + G L+ D L + + L + P VFG
Sbjct: 54 TCAD----------------DYSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFG 97
Query: 317 CAYDQQGLLLNTLVKTD-GILGLSRAKVSLPSQLASQGIIKNVVGHCL----TTNAGGGG 371
C G LL L+ + GIL LS +S PSQ+ + N +CL N+
Sbjct: 98 C-----GSLLKGLISGEVGILALSPGSLSFPSQIGEK--YGNKFSYCLLRQTAQNSLKKS 150
Query: 372 YMFLGHDLV----PSWG----MAWVPMLDSPFMELYHTEILK-INYGSSPLNLGAR---N 419
M G V P G + + P+ +S +Y+T L I+ G+ L+L N
Sbjct: 151 PMVFGEAAVELKEPGSGKLQELQYTPIGES---SIYYTVRLDGISVGNQRLDLSPSAFLN 207
Query: 420 SQVGWALFDTGSSYTYF-------TKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
Q +FD+G++ T KQ+ + +++ + V+ G LDA C+R
Sbjct: 208 GQDKPTIFDSGTTLTMLPPGVCDSIKQSLASMVSGAEFVAIKG--LDA-------CFRV- 257
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNG 532
P S Q +T HF V+ P Y VI CL + +EV
Sbjct: 258 -PPSS----GQGLPDITFHFNGGADFVT-----RPSNY-VIDLGSLQCLIFVPTNEVS-- 304
Query: 533 STIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I G++ + V++D N+RIG+ ++ C
Sbjct: 305 ---IFGNLQQQDFFVLHDMDNRRIGFKETDC 332
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 86.3 bits (212), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 161/385 (41%), Gaps = 51/385 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + +G PP P DTGSDL W QC APC C +PL+ P+ + YKD
Sbjct: 88 GEYLMNVSIGTPPFPIMAIADTGSDLLWTQC-APCDDCYTQVDPLFDPKTSST--YKDVS 144
Query: 260 CMEIQRN--HKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKP----N 312
C Q C T C Y + Y D+S + G +A D L L S T+P N
Sbjct: 145 CSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTL---GSSDTRPMQLKN 201
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC---LTTNAGG 369
++ GC ++ G K GI+GL VSL QL I +C LT+
Sbjct: 202 IIIGCGHNNAGTFNK---KGSGIVGLGGGPVSLIKQLGDS--IDGKFSYCLVPLTSKKDQ 256
Query: 370 GGYMFLGHD-LVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQVGWA-- 425
+ G + +V G+ P++ E ++ LK I+ GS + +S+
Sbjct: 257 TSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNI 316
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT--LPVCWRAKFPIRSIVDVKQ 483
+ D+G++ T + YSEL V+S DP L +C+ A ++ V
Sbjct: 317 IIDSGTTLTLLPTEFYSEL---EDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPV---- 369
Query: 484 FFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISL 542
+T+HF G+ ++ S+ + LV C GS S I G+++
Sbjct: 370 ----ITMHFDGADVKLDSSNAFVQVSEDLV-------CFA-FRGSP----SFSIYGNVAQ 413
Query: 543 RGQLVVYDNVNKRIGWAKSHCMNPG 567
LV YD V+K + + + C G
Sbjct: 414 MNFLVGYDTVSKTVSFKPTDCAKMG 438
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 92/386 (23%), Positives = 156/386 (40%), Gaps = 48/386 (12%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDS 258
Y + +G PP P+ DTGSDLTW QC PC C P+Y + +P +
Sbjct: 95 YLMELAIGTPPVPFVALADTGSDLTWTQCK-PCKLCFPQDTPIYDTAASASFSPVPCASA 153
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP-----NV 313
C+ I R+ + T C Y Y D + S GVL + L + P V
Sbjct: 154 TCLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGGV 213
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT--NAGGGG 371
FGC D GL N + G +GL R +SL +QL G+ K +CLT N G
Sbjct: 214 AFGCGVDNGGLSYN----STGTVGLGRGSLSLVAQL---GVGK--FSYCLTDFFNTSLGS 264
Query: 372 YMFLGH-------DLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPL-----NLGAR 418
+ G + + P++ P+ Y+ + I+ G + L R
Sbjct: 265 PVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLR 324
Query: 419 NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSI 478
+ G + D+G+ +T + A+ ++ + V + V++AS P C+ A + +
Sbjct: 325 DDGSGGMIVDSGTIFTVLVESAFRVVVNHVAGVLNQ-PVVNASSLDSP-CFPATAGEQQL 382
Query: 479 VDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK-GNICLGILDGSEVHNGSTIIL 537
D+ LHF + + H + Y+ +++ + CL I + IL
Sbjct: 383 PDMPDML----LHFAGGADM---RLHR--DNYMSFNQESSSFCLNIAGAPSAYGS---IL 430
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHC 563
G+ + +++D ++ + + C
Sbjct: 431 GNFQQQNIQMLFDITVGQLSFVPTDC 456
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 95/379 (25%), Positives = 160/379 (42%), Gaps = 50/379 (13%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK---GANPLYKPRMGN---ILPYKDSLCM 261
+G PP+P L +DTGSDL W QC S+ G+ P+Y P + LP D LC
Sbjct: 97 IGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAFLPCSDRLCQ 156
Query: 262 EIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQ 321
E Q + K C + +C YE Y ++++GVLA + T + FGC
Sbjct: 157 EGQFSFK--NCTSKNRCVYEDVYGS-AAAVGVLASET--FTFGARRAVSLRLGFGCG--- 208
Query: 322 QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLV- 380
L +L+ GILGLS +SL +QL Q +CLT A L +
Sbjct: 209 -ALSAGSLIGATGILGLSPESLSLITQLKIQRF-----SYCLTPFADKKTSPLLFGAMAD 262
Query: 381 -----PSWGMAWVPMLDSPFMEL-YHTEILKINYGSSPL-----NLGARNSQVGWALFDT 429
+ + ++ +P + Y+ ++ I+ G L +L R G + D+
Sbjct: 263 LSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDS 322
Query: 430 GSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTL---PVCWRAKFPIRSIVDVKQFFK 486
GS+ Y + A+ ++KE D + L ++ T+ +C+ P R+ + +
Sbjct: 323 GSTVAYLVEAAFE----AVKEAVMDVVRLPVANRTVEDYELCF--VLPRRTAAAAMEAVQ 376
Query: 487 T--LTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
L LHF +V + + Y + G +CL + G I+G++ +
Sbjct: 377 VPPLVLHFDGGAAMV-----LPRDNYFQEPRAGLMCLAV--GKTTDGSGVSIIGNVQQQN 429
Query: 545 QLVVYDNVNKRIGWAKSHC 563
V++D + + +A + C
Sbjct: 430 MHVLFDVQHHKFSFAPTQC 448
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 94/390 (24%), Positives = 164/390 (42%), Gaps = 42/390 (10%)
Query: 197 YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS--CA-KGANPLYKPRM--GN 251
Y G Y VG P + + L DTGSDLTW+ C C S C+ + A + R+ N
Sbjct: 78 YGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHAN 137
Query: 252 I------LPYKDSLC-MEIQRNHKPGYCET-CQQCDYEIEYADHSSSMGVLARDELHLTI 303
+ +P +C +E+ C T C Y+ Y+D S+++G A + + + +
Sbjct: 138 LSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVEL 197
Query: 304 ENGSLTK-PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC 362
+ G K NV+ GC+ QG + DG++GL +K S + A + +C
Sbjct: 198 KEGRKMKLHNVLIGCSESFQG---QSFQAADGVMGLGYSKYSFAIKAAEK--FGGKFSYC 252
Query: 363 LT---TNAGGGGYMFLGHDLVPSW---GMAWVPMLDSPFMELYHTEILKINYGSSPLNLG 416
L ++ Y+ G M + ++ Y ++ I+ G + L +
Sbjct: 253 LVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIP 312
Query: 417 ARNSQV---GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKF 473
+ V G + D+GSS T+ T+ AY ++A+L+ ++ L C+ +
Sbjct: 313 SEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTG 372
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGS 533
S+V L HF +F + Y++ + G CLG + V
Sbjct: 373 FEESLVP------RLVFHFAD-----GAEFEPPVKSYVISAADGVRCLGFVS---VAWPG 418
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
T ++G+I + L +D K++G+A S C
Sbjct: 419 TSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|213998845|gb|ACJ60789.1| nucellin [Psathyrostachys fragilis subsp. fragilis]
Length = 150
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 51/146 (34%), Positives = 79/146 (54%), Gaps = 9/146 (6%)
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII-KNVVGHCLTTNAG 368
K N+ FGC Y Q+ + DGILGL K +QL Q +I +NV+GHCL++
Sbjct: 4 KKNIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSK-- 61
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G G +++G P+ G+ WVPM +S F Y + + P+ R + A+FD
Sbjct: 62 GKGVLYVGDFNPPTRGVTWVPMRESLF--YYSPGLAALFIDKQPI----RGNPTFEAVFD 115
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSD 454
+GS+YTY Q Y+EL++ ++ S+
Sbjct: 116 SGSTYTYVPAQIYNELVSKIRGTLSE 141
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 92/372 (24%), Positives = 141/372 (37%), Gaps = 26/372 (6%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL 253
G G Y + +G P Y + DTGSD TW+QC C + L+ P +
Sbjct: 172 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST- 230
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
Y + C + + + C Y ++Y D S S+G A D L L+ +
Sbjct: 231 -YANVSCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD---AVKGF 286
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYM 373
FGC +GL + G+LGL R K SLP Q + V HCL + G GY+
Sbjct: 287 RFGCGERNEGL----FGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSTGTGYL 340
Query: 374 -FLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSS 432
F L + PML Y+ + I G L++ + D+G+
Sbjct: 341 DFGAGSLAAARARLTTPMLTENGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTV 400
Query: 433 YTYFTKQAYSEL-IASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLH 491
T AYS L A +++ G + L C+ F S V + T++L
Sbjct: 401 ITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCY--DFTGMSQVAI----PTVSLL 454
Query: 492 FGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
F + + G + + +CL + G I+G+ L+ V YD
Sbjct: 455 FQG-----GARLDVDASGIMYAASASQVCLAFAANED--GGDVGIVGNTQLKTFGVAYDI 507
Query: 552 VNKRIGWAKSHC 563
K +G+ C
Sbjct: 508 GKKVVGFYPGAC 519
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 101/367 (27%), Positives = 147/367 (40%), Gaps = 60/367 (16%)
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDSLC--MEIQRNHKPGYCE 273
+DT S+LTW+QC APC+SC PL+ P +LP S C +++ G C
Sbjct: 141 VDTASELTWVQC-APCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 199
Query: 274 TCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVK 331
+Q C Y + Y D S S GVLA D+L L E VFGC QG
Sbjct: 200 GGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGE----VIDGFVFGCGTSNQG----PFGG 251
Query: 332 TDGILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHDLV---PSWGMAW 387
T G++GL R+++SL SQ Q V +CL + G + LG D S + +
Sbjct: 252 TSGLMGLGRSQLSLISQTMDQ--FGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPIVY 309
Query: 388 VPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIA 446
M+ P Y + I G + S G + D+G+ T Y+ + A
Sbjct: 310 TTMVSDPVQGPFYFVNLTGITIGGQEV-----ESSAGKVIVDSGTIITSLVPSVYNAVKA 364
Query: 447 SLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKF--- 503
L A P + P SI+D L + QI S KF
Sbjct: 365 EF-------LSQFAEYP--------QAPGFSILDT-----CFNLTGFREVQIPSLKFVFE 404
Query: 504 -----HISPEG--YLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRI 556
+ G Y V S +CL + + T I+G+ + V++D + +I
Sbjct: 405 GNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYE--TSIIGNYQQKNLRVIFDTLGSQI 462
Query: 557 GWAKSHC 563
G+A+ C
Sbjct: 463 GFAQETC 469
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 73/272 (26%), Positives = 118/272 (43%), Gaps = 23/272 (8%)
Query: 186 SSSIFPLRGNIYP-DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPL 244
+S L+ +I P G Y + +G PP Y DTGSDLTW QC PC C + P+
Sbjct: 75 TSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWAQC-LPCLKCYQQLRPI 133
Query: 245 YKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL 301
+ P +P C + H C CDY Y D + S G L ++ +
Sbjct: 134 FNPLKSTSFSHVPCNTQTCHAVDDGH----CGVQGVCDYSYTYGDRTYSKGDLGFEK--I 187
Query: 302 TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
TI + S+ V GC + G G++GL ++SL SQ++ I +
Sbjct: 188 TIGSSSVKS---VIGCGHASSG----GFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 240
Query: 362 CLTT--NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARN 419
CL T + G F + +V G+ P++ + Y+ + I+ G+ A+
Sbjct: 241 CLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQ 300
Query: 420 SQVGWALFDTGSSYTYFTKQAYSELIASLKEV 451
V + D+G++ T K+ Y +++SL +V
Sbjct: 301 GNV---IIDSGTTLTILPKELYDGVVSSLLKV 329
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 95/380 (25%), Positives = 164/380 (43%), Gaps = 46/380 (12%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
Y + +G PP P+ DTGSDLTW QC PC C P+Y P + +P +
Sbjct: 77 YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPVPCSSA 135
Query: 259 LCMEIQRNHKPGYCET-CQQCDYEIEYADHSSSMGVLARDELHL--TIENGSLTKPNVVF 315
C+ + R+ C T C Y Y+D + S G+L + L L ++ +++ +V F
Sbjct: 136 TCLPVLRSRN---CSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAF 192
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT--NAGGGGYM 373
GC D G LN + G +GL R +SL +QL G+ K +CLT N+
Sbjct: 193 GCGTDNGGDSLN----STGTVGLGRGTLSLLAQL---GVGK--FSYCLTDFFNSTLDSPF 243
Query: 374 FLG--HDLVPSWGMAW-VPMLDSPFM-ELYHTEILKINYGSSPLNLGARN-----SQVGW 424
LG +L P G P+L SP Y + I G L + + + G
Sbjct: 244 LLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLHANSTGG 303
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+ D+G++++ + + ++ + +V V +AS P C+ A R + F
Sbjct: 304 MVVDSGTTFSILPESGFRVVVDHVAQVLGQPPV-NASSLDSP-CFPAPAGERQL----PF 357
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKK-GNICLGILDGSEVHNGSTIILGDISLR 543
L LHF + + H + Y+ +++ + CL I+ + + +LG+ +
Sbjct: 358 MPDLVLHFAGGADM---RLH--RDNYMSYNQEDSSFCLNIVGTTSTWS----MLGNFQQQ 408
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
+++D ++ + + C
Sbjct: 409 NIQMLFDMTVGQLSFLPTDC 428
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 105/410 (25%), Positives = 172/410 (41%), Gaps = 59/410 (14%)
Query: 186 SSSIFPLRGNIYP-DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPL 244
S+ PL Y G YF VG P +P+ L DTGSDLTW++C +S + A+PL
Sbjct: 93 SAFAMPLTSGAYTGTGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRAS-SPDASPL 151
Query: 245 YKPRM---GNILPYKDSLC-MEIQRNHKPGYCETCQQ-------CDYEIEYADHSSSMGV 293
PR+ N + C + +++ P C C Y+ Y D SS+ GV
Sbjct: 152 ASPRVFRPANSKSWAPIPCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGV 211
Query: 294 LARDELHLTIE-NGSLTKP---NVVFGC--AYDQQGLLLNTLVKTDGILGLSRAKVSLPS 347
+ D + + +GS K VV GC +YD Q + +DG+L L + +S S
Sbjct: 212 VGTDAATIALSGSGSDRKAKLQEVVLGCTTSYDGQ-----SFQSSDGVLSLGNSNISFAS 266
Query: 348 QLASQGIIKNVVGHCLTTNAG---GGGYMFLGHDLVPSWGMAWVP-----MLDSPFMELY 399
+ A++ +CL + Y+ G G A P +LD+ Y
Sbjct: 267 RAAAR--FGGRFSYCLVDHLAPRNATSYLTFG-----PVGAAHSPSRTPLLLDAQVAPFY 319
Query: 400 HTEILKINYGSSPLNLGARNSQV---GWALFDTGSSYTYFTKQAYSELIASL-KEVSSDG 455
+ ++ LN+ A V G A+ D+G+S T AY ++A+L K+++
Sbjct: 320 AVTVDAVSVAGKALNIPAEVWDVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLARVP 379
Query: 456 LVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISK 515
V DP C+ R L + F S + + Y++ +
Sbjct: 380 RV--TMDP-FEYCYNWTATRRPPA-----VPRLEVRFAG-----SARLRPPTKSYVIDAA 426
Query: 516 KGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
G C+G+ +G V G ++I G+I + L +D N+ + + +S C +
Sbjct: 427 PGVKCIGLQEG--VWPGVSVI-GNILQQEHLWEFDLANRWLRFQESRCAH 473
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 73/253 (28%), Positives = 112/253 (44%), Gaps = 19/253 (7%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYK---DS 258
Y + +G+P + +DTGSD++W+QC PCS C A+PL+ P + +
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 186
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
C ++ + C + QC Y + Y D SS+ G + D L L GS + FGC+
Sbjct: 187 ACAQL--GQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVKSFQFGCS 240
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL-GH 377
+ G N +TDG++GL SL SQ A G + +CL G++ L
Sbjct: 241 NVESG--FND--QTDGLMGLGGGAQSLVSQTA--GTLGRAFSYCLPPTPSSSGFLTLGAA 294
Query: 378 DLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYF 436
+ G PML S + Y + I G L++ A G + D+G+ T
Sbjct: 295 GGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAG-TVMDSGTVITRL 353
Query: 437 TKQAYSELIASLK 449
AYS L ++ K
Sbjct: 354 PPTAYSALSSAFK 366
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 85.9 bits (211), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 153/382 (40%), Gaps = 48/382 (12%)
Query: 191 PLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG 250
P ++ +Y + +G PP ++DTGSDL W QC PC +C P++ P
Sbjct: 50 PYADTVFDYSIYLMRLQLGTPPFEIVAEIDTGSDLIWTQC-MPCPNCYTQFAPIFDPSKS 108
Query: 251 NILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LT 309
+ K R H C YEI YAD S S G+LA + + + +G
Sbjct: 109 STFKEK--------RCHG-------NSCPYEIIYADESYSTGILATETVTIQSTSGEPFV 153
Query: 310 KPNVVFGCAYDQQGLLL-NTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
GC + L+ + GI+GL+ SL SQ+ I ++ +C ++ G
Sbjct: 154 MAETSIGCGLNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDLP--IPGLISYCFSSQ-G 210
Query: 369 GGGYMFLGHDLVPSWGMAWVPML---DSPFMELYHTEILKINYGSSPL-NLGA-RNSQVG 423
F + +V G M D PF Y+ + ++ G + LG ++Q G
Sbjct: 211 TSKINFGTNAVVAGDGTVAADMFIKKDQPF---YYLNLDAVSVGDKRIETLGTPFHAQDG 267
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVL--DASDPTLPVCWRAKFPIRSIVDV 481
D+G++YTY +Y L+ S D S L +C+ D
Sbjct: 268 NIFIDSGTTYTYL-PTSYCNLVREAVAASVVAANQVPDPSSENL-LCYNW--------DT 317
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
+ F +TLHF +V K+++ Y+ G CL I V I G+ +
Sbjct: 318 MEIFPVITLHFAGGADLVLDKYNM----YVETITGGTFCLAI---GCVDPSMPAIFGNRA 370
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
LV YD+ I ++ ++C
Sbjct: 371 HNNLLVGYDSSTLVISFSPTNC 392
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 85.9 bits (211), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 97/404 (24%), Positives = 154/404 (38%), Gaps = 43/404 (10%)
Query: 174 KKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP 233
++ +S N V ++ P+ N G Y + VG PP P DTGSD+ W QC+ P
Sbjct: 60 RRSISHNTGLVTNTVEAPIYNN---RGEYLMKLSVGTPPFPIIAVADTGSDIIWTQCE-P 115
Query: 234 CSSCAKGANPLYKPRMGNILPYKDSLCME--IQRNHKPGYCETCQQCDYEIEYADHSSSM 291
C++C + P++ P Y+ C + C C Y I Y D+S S
Sbjct: 116 CTNCYQQDLPMFNPSKSTT--YRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQ 173
Query: 292 GVLARDELHLTIENGSLTK-PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLA 350
G A D L + +G + P GC +D G GI+GL SL Q+
Sbjct: 174 GDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSF---DANVSGIVGLGLGPASLIKQMG 230
Query: 351 SQGIIKNVVGHCLT---TNAGGGGYMFLGHDL-VPSWGMAWVPM-LDSPFMELYHTEILK 405
S + +CLT + GG + G + V G P+ + F Y ++
Sbjct: 231 SA--VGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKA 288
Query: 406 INYGSSPLNLGARNSQVGWA---LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASD 462
++ G + NS +G + D+G++ T Y K +S+ + D
Sbjct: 289 VSVGRNNTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNF---AKAISNSINLQRTDD 345
Query: 463 PT--LPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNIC 520
P L C+ + D K F + +HF + E L+ IC
Sbjct: 346 PNQFLEYCFET-----TTDDYKVPF--IAMHFEGA------NLRLQRENVLIRVSDNVIC 392
Query: 521 LGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCM 564
L + + I G+I+ LV YD N + + +C+
Sbjct: 393 LAF---AGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNCV 433
>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 90/391 (23%), Positives = 152/391 (38%), Gaps = 53/391 (13%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP---------------LY 245
L++ + +G P + + + +DTGSDL W+ C+ S+C + +Y
Sbjct: 110 LHYANVTIGTPAQWFLVALDTGSDLFWLPCNCN-STCVRSMETDQGETHMNAQRIRLNIY 168
Query: 246 KPRMG---NILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHL 301
P + + + +LC R P C Y I Y + S S GVL D +H+
Sbjct: 169 NPSISTSSSKVTCNSTLCALRNRCISP-----LSDCPYRIRYLSPGSKSTGVLVEDVIHM 223
Query: 302 TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
+ E G + FGC+ Q GL V +GI+GL+ A +++P+ L G+ +
Sbjct: 224 STEEGEARDARITFGCSETQLGLFQE--VAVNGIMGLAMADIAVPNMLVKAGVASDSFSM 281
Query: 362 CLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQ 421
C N G G + G S P+ + Y I K G +
Sbjct: 282 CFGPN--GKGTISFGDK--GSSDQHETPLGGTISPLFYDVSITKFKVGKVTVETKFS--- 334
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
A+FD+G++ T+ Y+ L + D + D T C + I S D
Sbjct: 335 ---AIFDSGTAVTWLLDPYYTALTTNFHLSVPDRRLPANVDSTFEFC----YIITSTSDE 387
Query: 482 KQF-FKTLTLHFGSKWQIVS--TKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILG 538
++ + + G+ + + S F S + V CL +L + I+G
Sbjct: 388 EKLPSISFEMKGGAAYDVFSPILVFDTSDGSFQV------YCLAVLKQDKADFN---IIG 438
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHCMNPGRF 569
+ +V+D +GW KS+C + F
Sbjct: 439 QNFMTNYRIVHDRERMILGWKKSNCNDTNGF 469
>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
Length = 416
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 101/409 (24%), Positives = 156/409 (38%), Gaps = 63/409 (15%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQC---DAPCSSCAKGANP---LYKPRMGNILPY 255
Y + +G PP+ + MDTGSDLTW+ C C C N + Y
Sbjct: 12 YLISLNIGTPPQVIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNSKLMSAFSPSHSSSSY 71
Query: 256 KDSL----CMEIQRNHKPGYCETCQQCD---------------YEIEYADHSSSMGVLAR 296
+DS C +I + T C + Y G L R
Sbjct: 72 RDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGTLTR 131
Query: 297 DELHLTIENGSLTK--PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI 354
D L + +TK P FGC + +T + GI G R +S PSQL G+
Sbjct: 132 DTLRVHEGPARVTKDIPKFCFGC-------VGSTYHEPIGIAGFVRGTLSFPSQL---GL 181
Query: 355 IKNVVGHCL-----TTNAGGGGYMFLGHDLVPSW-GMAWVPMLDSP-FMELYHTEILKIN 407
+K HC N + +G + S M + PML SP + Y+ + I
Sbjct: 182 LKKGFSHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPNYYYIGLEAIT 241
Query: 408 YG-----SSPLNLGARNSQ-VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDAS 461
G + PLNL +SQ G L D+G++YT+ + YS+L++ K + + +
Sbjct: 242 VGNVSATTVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFKAIITYPRATEVE 301
Query: 462 -DPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEG--YLVISKKGN 518
+C++ P + D F ++T HF + V P+G + +S N
Sbjct: 302 MRAGFDLCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFV------LPQGNHFYAMSAPSN 355
Query: 519 I----CLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
CL ++ G + G + +VYD +RIG+ C
Sbjct: 356 STVVKCLLFQSMADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQPMDC 404
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 118/445 (26%), Positives = 187/445 (42%), Gaps = 78/445 (17%)
Query: 155 SVVAS--VNDGIIRPHKSKINKKLV--SSNAVAVDS-SSIFPLRGNIYPDGLYFTYMIVG 209
SV AS V D + R ++L SSN V + + I P G Y + +G
Sbjct: 40 SVTASQFVRDALRRDMHRHNARQLAASSSNGTTVSAPTQISPTAGE------YLMTLAIG 93
Query: 210 NPPRPYYLDMDTGSDLTWIQCDAPCSS-CAKGANPLYKPRMGN---ILPYKDSLCM---E 262
PP Y DTGSDL W QC APCSS C + PLY P +LP SL M
Sbjct: 94 TPPVSYQAIADTGSDLIWTQC-APCSSQCFQQPTPLYNPSSSTTFAVLPCNSSLSMCAAA 152
Query: 263 IQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK------PNVVFG 316
+ P C C Y + Y +S V E T GS T P + FG
Sbjct: 153 LAGTTPPPGCT----CMYNMTYGSGWTS--VYQGSE---TFTFGSSTPANQTGVPGIAFG 203
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT----TNAGGGGY 372
C+ G NT G++GL R +SL SQL G+ K +CLT TN+
Sbjct: 204 CSNASGG--FNT-SSASGLVGLGRGSLSLVSQL---GVPK--FSYCLTPYQDTNSTSTLL 255
Query: 373 MFLGHDLVPSWGMAWVPML----DSPFMELYHTEILKINYGSSPLN-----LGARNSQVG 423
+ L + G++ P + D+P Y+ + I+ G++ L+ L + G
Sbjct: 256 LGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADGTG 315
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEV----SSDGLVLDASDPTLPVCWRAKFPIRSIV 479
+ D+G++ T AY ++ A++ + ++DG ++ L +C F + S
Sbjct: 316 GFIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDG---GSAATGLDLC----FELPSST 368
Query: 480 DVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI-CLGILDGSEVHNGSTIILG 538
++TLHF + + + Y+++ N+ CL + + ++ G ILG
Sbjct: 369 SAPPTMPSMTLHFDGADMV------LPADSYMMLDS--NLWCLAMQNQTD---GGVSILG 417
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHC 563
+ + ++YD + + +A + C
Sbjct: 418 NYQQQNMHILYDVGQETLTFAPAKC 442
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 105/394 (26%), Positives = 171/394 (43%), Gaps = 59/394 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y + +G PP + + DTGS L W QC APC+ CA P ++P + LP
Sbjct: 88 GAYNMNLSIGTPPVTFSVLADTGSSLIWTQC-APCTECAARPAPPFQPASSSTFSKLPCA 146
Query: 257 DSLCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
SLC + + TC C Y Y + G LA + LH+ G + P V
Sbjct: 147 SSLCQFLTSPYL-----TCNATGCVYYYPYG-MGFTAGYLATETLHV----GGASFPGVA 196
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMF 374
FGC+ + + GI+GL R+ +SL SQ+ G+ + +CL ++A G
Sbjct: 197 FGCSTEN-----GVGNSSSGIVGLGRSPLSLVSQV---GVGR--FSYCLRSDADAGDSPI 246
Query: 375 LGHDLVPSWG--MAWVPMLDSPFM---ELYHTEILKINYGSSPLNL---------GARNS 420
L L G + P+L++P M Y+ + I G++ L + GA
Sbjct: 247 LFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAG 306
Query: 421 QVGWALFDTGSSYTYFTKQAYSELI-ASLKEVSSDGL--VLDASDPTLPVCWRAKFP-IR 476
VG + D+G++ TY K+ Y+ + A L ++++ L ++ + +C+ A
Sbjct: 307 LVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGG 366
Query: 477 SIVDVKQFFKTLTLHF--GSKWQIVSTKFHISPEGYLVISKKGNI---CLGILDGSEVHN 531
S V V TL L F G+++ + + G + + +G CL +L SE
Sbjct: 367 SGVPV----PTLVLRFAGGAEYAVRRRSY----VGVVAVDSQGRAAVECLLVLPASE--K 416
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
S I+G++ V+YD +A + C N
Sbjct: 417 LSISIIGNVMQMDLHVLYDLDGGMFSFAPADCAN 450
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 85.5 bits (210), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 73/251 (29%), Positives = 111/251 (44%), Gaps = 15/251 (5%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCM 261
Y + +G+P + +DTGSD++W+QC PCS C A+PL+ P +
Sbjct: 198 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 256
Query: 262 EIQRNHKPGY-CETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYD 320
+ + + G C + QC Y + Y D SS+ G + D L L GS + FGC+
Sbjct: 257 DCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRSFQFGCSNV 312
Query: 321 QQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL-GHDL 379
+ G N +TDG++GL SL SQ A G + +CL G++ L
Sbjct: 313 ESG--FND--QTDGLMGLGGGAQSLVSQTA--GTLGRAFSYCLPPTPSSSGFLTLGAAGG 366
Query: 380 VPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTK 438
+ G PML S + Y + I G L++ A G + D+G+ T
Sbjct: 367 SGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAG-TVMDSGTVITRLPP 425
Query: 439 QAYSELIASLK 449
AYS L ++ K
Sbjct: 426 TAYSALSSAFK 436
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 85.5 bits (210), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 155/378 (41%), Gaps = 43/378 (11%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPY 255
DG Y ++G PP Y MDT +D W QC+ PC C +P++ P + +P
Sbjct: 87 DG-YIISFLIGTPPFQLYGVMDTANDNIWFQCN-PCKPCFNTTSPMFDPSKSSTYKTIPC 144
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVV 314
C ++ H + + C+Y Y + S G L+ D L L N + ++ N+V
Sbjct: 145 SSPKCKNVENTHCSS--DDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNNDTPISFKNIV 202
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGG 371
GC + +G L + G +GL R +S SQL S I +CL +N G G
Sbjct: 203 IGCGHRNKGPLEGYV---SGNIGLGRGPLSFISQLNSS--IGGKFSYCLVPLFSNEGISG 257
Query: 372 YMFLG-HDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPL---NLGARNSQVGWALF 427
+ G +V G P+ Y T + ++ G + N ++N +G +
Sbjct: 258 KLHFGDKSVVSGVGTVSTPITAGEIG--YSTTLNALSVGDHIIKFENSTSKNDNLGNTII 315
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT--LPVCWRAKFPIRSIVDVKQFF 485
D+G++ T + YS L + V+S + A P +C++A + + F
Sbjct: 316 DSGTTLTILPENVYSRLESI---VTSMVKLERAKSPNQQFKLCYKATLKNLDVPIITAHF 372
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQ 545
+H S ++T + I E +C + V N I+G+I+ +
Sbjct: 373 NGADVHLNS----LNTFYPIDHEV---------VCFAFV---SVGNFPGTIIGNIAQQNF 416
Query: 546 LVVYDNVNKRIGWAKSHC 563
LV +D I + + C
Sbjct: 417 LVGFDLQKNIISFKPTDC 434
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 85.5 bits (210), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 73/251 (29%), Positives = 111/251 (44%), Gaps = 15/251 (5%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCM 261
Y + +G+P + +DTGSD++W+QC PCS C A+PL+ P +
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 186
Query: 262 EIQRNHKPGY-CETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYD 320
+ + + G C + QC Y + Y D SS+ G + D L L GS + FGC+
Sbjct: 187 DCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRSFQFGCSNV 242
Query: 321 QQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL-GHDL 379
+ G N +TDG++GL SL SQ A G + +CL G++ L
Sbjct: 243 ESG--FND--QTDGLMGLGGGAQSLVSQTA--GTLGRAFSYCLPPTPSSSGFLTLGAAGG 296
Query: 380 VPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTK 438
+ G PML S + Y + I G L++ A G + D+G+ T
Sbjct: 297 SGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAG-TVMDSGTVITRLPP 355
Query: 439 QAYSELIASLK 449
AYS L ++ K
Sbjct: 356 TAYSALSSAFK 366
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 85.5 bits (210), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 95/382 (24%), Positives = 166/382 (43%), Gaps = 62/382 (16%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQ 264
+GNP Y +DTGSDL W QC PC+ C P++ P + + LC +
Sbjct: 5 IGNPAVKYSAIVDTGSDLIWTQC-KPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALP 63
Query: 265 RNHKPGYC-ETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQG 323
R++ C E C+Y Y D+SS+ G+LA + EN + + FGC + +G
Sbjct: 64 RSN----CNEDKDACEYLYTYGDYSSTRGLLATETFTFEDEN---SISGIGFGCGVENEG 116
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT--NAGGGGYMFLG---HD 378
+ + G++GL R +SL SQL + +CLT+ ++ +F+G
Sbjct: 117 ---DGFSQGSGLVGLGRGPLSLISQLK-----ETKFSYCLTSIEDSEASSSLFIGSLASG 168
Query: 379 LVPSWGMAW-------VPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQV-----GWA 425
+V G + + +L +P Y+ E+ I G+ L++ ++ G
Sbjct: 169 IVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGM 228
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVL---DASDPTLPVCWRAKFPIRSIVDVK 482
+ D+G++ TY + A+ LKE + + L D+ L +C++ ++I K
Sbjct: 229 IIDSGTTITYLEETAFK----VLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPK 284
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLVI-SKKGNICLGILDGSEVHNGSTIILGDIS 541
F HF + E Y+V S G +CL + GS NG + I G++
Sbjct: 285 MIF-----HF------KGADLELPGENYMVADSSTGVLCLAM--GS--SNGMS-IFGNVQ 328
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+ V++D + + + + C
Sbjct: 329 QQNFNVLHDLEKETVSFVPTEC 350
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 85.5 bits (210), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 67/250 (26%), Positives = 110/250 (44%), Gaps = 22/250 (8%)
Query: 207 IVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEI 263
I+G PP Y DTGSDLTW QC PC C + P++ P +P C +
Sbjct: 85 IIGTPPVDYLGIADTGSDLTWAQC-LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAV 143
Query: 264 QRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQG 323
H C CDY Y D + S G L ++ +TI + S+ V GC + G
Sbjct: 144 DDGH----CGVQGVCDYSYTYGDRTYSKGDLGFEK--ITIGSSSVKS---VIGCGHASSG 194
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT--NAGGGGYMFLGHDLVP 381
G++GL ++SL SQ++ I +CL T + G F + +V
Sbjct: 195 ----GFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVS 250
Query: 382 SWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAY 441
G+ P++ + Y+ + I+ G+ A+ V + D+G++ ++ K+ Y
Sbjct: 251 GPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQGNV---IIDSGTTLSFLPKELY 307
Query: 442 SELIASLKEV 451
+++SL +V
Sbjct: 308 DGVVSSLLKV 317
>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 153/374 (40%), Gaps = 37/374 (9%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLC 260
L++ + VG P + + +DTGSDL W+ C C C + +P S
Sbjct: 97 LHYALVTVGTPGHTFMVALDTGSDLFWLPCQ--CDGCTPPPSSAASAPASFYIPSLSSTS 154
Query: 261 MEIQRNHK----PGYCETCQQCDYEIEY--ADHSSSMGVLARDELHLTIENG--SLTKPN 312
+ N C C Y++ Y AD SSS G L D L+L+ E+ K
Sbjct: 155 QAVPCNSDFCGLRKECSKTSSCPYKMVYVSADTSSS-GFLVEDVLYLSTEDTHPQFLKAQ 213
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY 372
++FGC Q G L+ +G+ GL +S+PS LA +G+ N C + G G
Sbjct: 214 IMFGCGEVQTGSFLDA-AAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCFGRD--GIGR 270
Query: 373 MFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSS 432
+ G S P+ + Y I I G++ ++L +FDTG+S
Sbjct: 271 ISFGDQ--GSSDQEETPLDINQKHPTYAITITGIAVGNNLMDLEVST------IFDTGTS 322
Query: 433 YTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLH 491
+TY AY+ + +V ++ D+ P C+ I +T+
Sbjct: 323 FTYLADPAYTYITDGFHSQVQANRHAADSRIP-FEYCYDLSSSEARIQTPSISLRTVG-- 379
Query: 492 FGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
GS + + IS + + + CL I+ ++++ I+G + G VV+D
Sbjct: 380 -GSLFPAIDPGQVISIQQHEYV-----YCLAIVKSTKLN-----IIGQNFMTGVRVVFDR 428
Query: 552 VNKRIGWAKSHCMN 565
K +GW K +C +
Sbjct: 429 ERKILGWKKFNCYD 442
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 94/382 (24%), Positives = 157/382 (41%), Gaps = 50/382 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF + +G PP + DTGSDL W+QC PC C K +P++ P+ + Y+ L
Sbjct: 92 GEYFMRISIGTPPIEVLVIADTGSDLIWVQCQ-PCQECYKQKSPIFNPKQSS--TYRRVL 148
Query: 260 CMEIQRNHKPGYCETC------QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
C N C + C Y Y DHS +MG LA + + N S+ + +
Sbjct: 149 CETRYCNALNSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNNSIQE--L 206
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-----TNAG 368
FGC G N GI+GL +SL SQL ++ I N +CL +N
Sbjct: 207 AFGCGNSNGG---NFDEVGSGIVGLGGGSLSLISQLGTK--IDNKFSYCLVPILEKSNFS 261
Query: 369 GGGYMFLGHDLVP-SWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL-GARNS---QVG 423
G +F + + S P++ Y+ + I+ G+ L +RN + G
Sbjct: 262 LGKIVFGDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENSRNDGNVEKG 321
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT--LPVCWRAKFPIRSIVDV 481
+ D+G++ T+ + Y++L L++ V SDP +C+R K I
Sbjct: 322 NIIIDSGTTLTFLDSKLYNKLELVLEKAVEGERV---SDPNGIFSICFRDKIGIE----- 373
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
+T+HF + P +++ +C ++ + + I G+++
Sbjct: 374 ---LPIITVHF------TDADVELKPINTFAKAEEDLLCFTMIPSNGIA-----IFGNLA 419
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
LV YD + + + C
Sbjct: 420 QMNFLVGYDLDKNCVSFMPTDC 441
>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 153/374 (40%), Gaps = 37/374 (9%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLC 260
L++ + VG P + + +DTGSDL W+ C C C + +P S
Sbjct: 97 LHYALVTVGTPGHTFMVALDTGSDLFWLPCQ--CDGCTPPPSSAASAPASFYIPSLSSTS 154
Query: 261 MEIQRNHK----PGYCETCQQCDYEIEY--ADHSSSMGVLARDELHLTIENG--SLTKPN 312
+ N C C Y++ Y AD SSS G L D L+L+ E+ K
Sbjct: 155 QAVPCNSDFCGLRKECSKTSSCPYKMVYVSADTSSS-GFLVEDVLYLSTEDTHPQFLKAQ 213
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY 372
++FGC Q G L+ +G+ GL +S+PS LA +G+ N C + G G
Sbjct: 214 IMFGCGEVQTGSFLDA-AAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCFGRD--GIGR 270
Query: 373 MFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSS 432
+ G S P+ + Y I I G++ ++L +FDTG+S
Sbjct: 271 ISFGDQ--GSSDQEETPLDINQKHPTYAITITGIAVGNNLMDLEVST------IFDTGTS 322
Query: 433 YTYFTKQAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLH 491
+TY AY+ + +V ++ D+ P C+ I +T+
Sbjct: 323 FTYLADPAYTYITDGFHSQVQANRHAADSRIP-FEYCYDLSSSEARIQTPSISLRTVG-- 379
Query: 492 FGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
GS + + IS + + + CL I+ ++++ I+G + G VV+D
Sbjct: 380 -GSLFPAIDPGQVISIQQHEYV-----YCLAIVKSTKLN-----IIGQNFMTGVRVVFDR 428
Query: 552 VNKRIGWAKSHCMN 565
K +GW K +C +
Sbjct: 429 ERKILGWKKFNCYD 442
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 106/381 (27%), Positives = 160/381 (41%), Gaps = 51/381 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + +G PP P DTGSDL W QC APC C +PL+ P+ + YKD
Sbjct: 88 GEYLMNVSIGTPPFPIMAIADTGSDLLWTQC-APCDDCYTQVDPLFDPKTSST--YKDVS 144
Query: 260 CMEIQRN--HKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKP----N 312
C Q C T C Y + Y D+S + G +A D L L S T+P N
Sbjct: 145 CSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTL---GSSDTRPMQLKN 201
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC---LTTNAGG 369
++ GC ++ G K GI+GL VSL QL I +C LT+
Sbjct: 202 IIIGCGHNNAGTFNK---KGSGIVGLGGGPVSLIKQLGDS--IDGKFSYCLVPLTSKKDQ 256
Query: 370 GGYMFLGHD-LVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQVGWA-- 425
+ G + +V G+ P++ E ++ LK I+ GS + +S+
Sbjct: 257 TSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNI 316
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT--LPVCWRAKFPIRSIVDVKQ 483
+ D+G++ T + YSEL V+S DP L +C+ A ++ V
Sbjct: 317 IIDSGTTLTLLPTEFYSEL---EDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPV---- 369
Query: 484 FFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISL 542
+T+HF G+ ++ S+ + LV C GS S I G+++
Sbjct: 370 ----ITMHFDGADVKLDSSNAFVQVSEDLV-------CFA-FRGSP----SFSIYGNVAQ 413
Query: 543 RGQLVVYDNVNKRIGWAKSHC 563
LV YD V+K + + + C
Sbjct: 414 MNFLVGYDTVSKTVSFKPTDC 434
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 97/404 (24%), Positives = 153/404 (37%), Gaps = 43/404 (10%)
Query: 174 KKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP 233
++ +S N V ++ P+ N G Y + VG PP P DTGSD+ W QC P
Sbjct: 60 RRSISHNTGLVTNTVEAPIYNN---RGEYLMKLSVGTPPFPIIAVADTGSDIIWTQC-VP 115
Query: 234 CSSCAKGANPLYKPRMGNILPYKDSLCME--IQRNHKPGYCETCQQCDYEIEYADHSSSM 291
C++C + P++ P Y+ C + C C Y I Y D+S S
Sbjct: 116 CTNCYQQDLPMFNPSKSTT--YRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQ 173
Query: 292 GVLARDELHLTIENGSLTK-PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLA 350
G A D L + +G + P GC +D G GI+GL SL Q+
Sbjct: 174 GDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSF---DANVSGIVGLGLGPASLIKQMG 230
Query: 351 SQGIIKNVVGHCLT---TNAGGGGYMFLGHDL-VPSWGMAWVPM-LDSPFMELYHTEILK 405
S + +CLT + GG + G + V G P+ + F Y ++
Sbjct: 231 SA--VGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKA 288
Query: 406 INYGSSPLNLGARNSQVGWA---LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASD 462
++ G + NS +G + D+G++ T Y K +S+ + D
Sbjct: 289 VSVGRNNTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNF---AKAISNSINLQRTDD 345
Query: 463 PT--LPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNIC 520
P L C+ + D K F + +HF + E L+ IC
Sbjct: 346 PNQFLEYCFET-----TTDDYKVPF--IAMHFEGA------NLRLQRENVLIRVSDNVIC 392
Query: 521 LGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCM 564
L + + I G+I+ LV YD N + + +C+
Sbjct: 393 LAF---AGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNCV 433
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 85.1 bits (209), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 93/382 (24%), Positives = 156/382 (40%), Gaps = 59/382 (15%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDSLCM-EI 263
+G PP+ + +DTGS L+WIQC + P + + LP LC I
Sbjct: 78 IGTPPQAQQMVLDTGSQLSWIQCHR--KKLPPKPKTSFDPSLSSSFSTLPCSHPLCKPRI 135
Query: 264 QRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQG 323
P C++ + C Y YAD + + G L +++ +T N +T P ++ GCA +
Sbjct: 136 PDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEK--ITFSNTEITPP-LILGCATESS- 191
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY-----MFLGHD 378
GILG++R ++S SQ +C+ + G+ +LG D
Sbjct: 192 -------DDRGILGMNRGRLSFVSQAKISKF-----SYCIPPKSNRPGFTPTGSFYLG-D 238
Query: 379 LVPSWGMAWVPMLDSPFME--------LYHTEILKINYGSSPLNLGAR-----NSQVGWA 425
S G +V +L P + Y ++ I +G LN+ G
Sbjct: 239 NPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQT 298
Query: 426 LFDTGSSYTYFTKQAY----SELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
+ D+GS +T+ AY +E++ + G V T +C+ ++ +
Sbjct: 299 MVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGG---TADMCFDG-----NVAMI 350
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
+ L F +I+ K E LV G C+GI S + S II G++
Sbjct: 351 PRLIGDLVFVFTRGVEILVPK-----ERVLVNVGGGIHCVGIGRSSMLGAASNII-GNVH 404
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+ V +D N+R+G+AK+ C
Sbjct: 405 QQNLWVEFDVTNRRVGFAKADC 426
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 94/390 (24%), Positives = 164/390 (42%), Gaps = 42/390 (10%)
Query: 197 YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS--CA-KGANPLYKPRM--GN 251
Y G Y VG P + + L DTGSDLTW+ C C S C+ + A + R+ N
Sbjct: 7 YGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHAN 66
Query: 252 I------LPYKDSLC-MEIQRNHKPGYCET-CQQCDYEIEYADHSSSMGVLARDELHLTI 303
+ +P +C +E+ C T C Y+ Y+D S+++G A + + + +
Sbjct: 67 LSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVEL 126
Query: 304 ENGSLTK-PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC 362
+ G K NV+ GC+ QG + DG++GL +K S + A + +C
Sbjct: 127 KEGRKMKLHNVLIGCSESFQG---QSFQAADGVMGLGYSKYSFAIKAAEK--FGGKFSYC 181
Query: 363 LT---TNAGGGGYMFLGHDLVPSW---GMAWVPMLDSPFMELYHTEILKINYGSSPLNLG 416
L ++ Y+ G M + ++ Y ++ I+ G + L +
Sbjct: 182 LVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIP 241
Query: 417 ARNSQV---GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKF 473
+ V G + D+GSS T+ T+ AY ++A+L+ ++ L C+ +
Sbjct: 242 SEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTG 301
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGS 533
S+V L HF +F + Y++ + G CLG + V
Sbjct: 302 FEESLV------PRLVFHFAD-----GAEFEPPVKSYVISAADGVRCLGFVS---VAWPG 347
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
T ++G+I + L +D K++G+A S C
Sbjct: 348 TSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 377
>gi|159463556|ref|XP_001690008.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158283996|gb|EDP09746.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 547
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 68/233 (29%), Positives = 105/233 (45%), Gaps = 15/233 (6%)
Query: 191 PLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRM- 249
P+ GN+ G Y+TY+ +G P + +DTGS L C C+ C ++KP +
Sbjct: 70 PVYGNVPELGYYYTYLTIGTPGQTVSGILDTGSTLPAFPCSG-CTRCGPSKTGMFKPELS 128
Query: 250 --GNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS 307
+ D+ C + C +QC Y I Y + SS+ G LA D L + +G
Sbjct: 129 STSSTFGCSDARCFCGANSCS---CNN-EQCGYSIRYLEGSSTSGFLAED--MLAVGDGG 182
Query: 308 LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
N VFGCA + GLL + + DG+ G+ R SL QL QG+I + C A
Sbjct: 183 -PAANFVFGCAQSESGLLYSQIA--DGVFGMGRTPASLYGQLVQQGVIDDAFSMCF--GA 237
Query: 368 GGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNS 420
G + LG+ +P+ A V ++ +I +N+ L G R++
Sbjct: 238 PREGVLLLGNVALPADAPAPVVTPVVGNTNKFNIQIEGLNFNDQQLVSGQRHN 290
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 97/379 (25%), Positives = 157/379 (41%), Gaps = 71/379 (18%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYK 256
G YF + VG PP P L +DTGSD+ W+QC APC C + ++ PR +
Sbjct: 140 GEYFASVGVGTPPTPALLVLDTGSDVVWLQC-APCRQCYAQSGRVFDPRRSRSYAAVRCG 198
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
C + G C Y++ Y D S + G LA + L G+ P V G
Sbjct: 199 APPCRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWFA--RGARV-PRVAVG 255
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C +D +GL V G+LGL R ++SLP+Q A + G + Y F G
Sbjct: 256 CGHDNEGL----FVAAAGLLGLGRGRLSLPTQTARR------YGRRFS-------YCFQG 298
Query: 377 HDLVPSWGMAWVPMLDSPFMELYHTEILKINY----GSSPLNLGARNSQV------GWAL 426
D L H I++ + G+ +G R+ ++ G +
Sbjct: 299 SD-------------------LDHRTIIRTVHQHVGGARVRGVGERSLRLDPSTGRGGVI 339
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTL-PVCWRAKFPIRSIVDVKQFF 485
D+G+S T + Y + + + ++ GL L +L C+ + R +V V
Sbjct: 340 LDSGTSVTRLARPVYVAVREAFR-AAAGGLRLAPGGFSLFDTCYDLRG--RRVVKV---- 392
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
T+++H ++ + PE YL+ + +G CL + +G I+G+I +G
Sbjct: 393 PTVSVHLAGGAEVA-----LPPENYLIPVDTRGTFCLALAG----TDGGVSIVGNIQQQG 443
Query: 545 QLVVYDNVNKRIGWAKSHC 563
VV+D +R+ C
Sbjct: 444 FRVVFDGDRQRVALVPKSC 462
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 91/395 (23%), Positives = 166/395 (42%), Gaps = 51/395 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGA-NPLYKPRMGNILPYK-- 256
G YF + +G PP+ L DTGSDL W++C A C +C + + R
Sbjct: 87 GQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSA-CRNCTRHTPGSAFLARHSTTFSPNHC 145
Query: 257 -DSLC--MEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PN 312
DS C + + ++H+ + C YE Y D S + G +++ L +G K
Sbjct: 146 YDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKG 205
Query: 313 VVFGCAYDQQGLLLN--TLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNA 367
+ FGCA+ G ++ + G++GL R +SL SQL + N +CL +
Sbjct: 206 IAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHR--FGNKFSYCLMDHDISP 263
Query: 368 GGGGYMFLG---HDLVP-SWGMAWVPMLDSPFMELYH---TEILKINYGSSPLN---LGA 417
Y+ +G +D+ P M + P+ +P ++ E + ++ P+N
Sbjct: 264 SPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPINPSVWAL 323
Query: 418 RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT--LPVCWRAKFPI 475
G + D+G++ T+ + AY +++ +K + ++PT +C
Sbjct: 324 DELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVR---LPSPAEPTPGFDLC------- 373
Query: 476 RSIVDVKQF----FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHN 531
V+V + L+ G + F P Y V + + CL L +
Sbjct: 374 ---VNVSEIEHPRLPKLSFKLGGD-----SVFSPPPRNYFVDTDEDVKCLA-LQAVMTPS 424
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNP 566
G ++I G++ +G L+ +D R+G+++ C P
Sbjct: 425 GFSVI-GNLMQQGFLLEFDKDRTRLGFSRHGCALP 458
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 73/251 (29%), Positives = 110/251 (43%), Gaps = 15/251 (5%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCM 261
Y + +G+P + +DTGSD++W+QC PCS C A+PL+ P +
Sbjct: 52 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 110
Query: 262 EI-QRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYD 320
+ Q + C + QC Y + Y D SS+ G + D L L GS + FGC+
Sbjct: 111 DCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRSFQFGCSNV 166
Query: 321 QQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL-GHDL 379
+ G N +TDG++GL SL SQ A G + +CL G++ L
Sbjct: 167 ESG--FND--QTDGLMGLGGGAQSLVSQTA--GTLGRAFSYCLPPTPSSSGFLTLGAAGG 220
Query: 380 VPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTK 438
+ G PML S + Y + I G L++ A G + D+G+ T
Sbjct: 221 SGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAG-TVMDSGTVITRLPP 279
Query: 439 QAYSELIASLK 449
AYS L ++ K
Sbjct: 280 TAYSALSSAFK 290
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 100/393 (25%), Positives = 155/393 (39%), Gaps = 60/393 (15%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCM 261
Y + VG PPR + + MDTGSDL W+QC APC C + P++ P + Y++ C
Sbjct: 146 YLMDVYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASS--SYRNLTCG 202
Query: 262 EIQRNH-------------KPGYCETCQQCDYEIEYADHSSSMGVLARDE--LHLTIENG 306
+ + H +PG C Y Y D S+S G LA + ++LT
Sbjct: 203 DPRCGHVAPPEAPAPRACRRPGE----DPCPYYYWYGDQSNSTGDLALESFTVNLTAPGA 258
Query: 307 SLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN 366
S VVFGC + +GL L R +S SQL + + +CL +
Sbjct: 259 SSRVDGVVFGCGHRNRGLFHGAAGLLG----LGRGPLSFASQLRAV-YGGHTFSYCLVDH 313
Query: 367 AGG-GGYMFLGHDLVPSWGMAWVPML--------DSPFMELYHTEILKINYGSSPLNLG- 416
+ G D + +A P L SP Y+ + + G LN+
Sbjct: 314 GSDVASKVVFGED--DALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISS 371
Query: 417 ----ARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
A G + D+G++ +YF + AY + + + S P L C+
Sbjct: 372 DTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLSPCYNVS 431
Query: 473 FPIRSIVDVKQFFKTLTLHF--GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVH 530
R V L+L F G+ W + + ++ + G +CL +L
Sbjct: 432 GVERPEV------PELSLLFADGAVWDFPAENY------FIRLDPDGIMCLAVL--GTPR 477
Query: 531 NGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G +II G+ + V YD N R+G+A C
Sbjct: 478 TGMSII-GNFQQQNFHVAYDLHNNRLGFAPRRC 509
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 111/459 (24%), Positives = 182/459 (39%), Gaps = 59/459 (12%)
Query: 118 ENKESFVFPLYHKFG----IREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKIN 173
+K PL H+ G + + E LGR + A+++ + P S +
Sbjct: 54 SSKNGATLPLVHRHGPCSPVMSKEKPSHEETLGR-----DQLRAANIHAKLSSPRNS--S 106
Query: 174 KKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP 233
K + + V + +SS + L P+ Y + +G P + +DTGSD++W+QC AP
Sbjct: 107 AKELQQSGVTIPTSSGYSLG---TPE--YVITVSLGTPAVTQVMSIDTGSDVSWVQC-AP 160
Query: 234 CS--SCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETC--QQCDYEIEYADHSS 289
C+ SC+ + L+ P Y C Q G C C Y ++Y DHS+
Sbjct: 161 CAAQSCSSQKDKLFDP--AKSATYSAFSCSSAQCAQLGGEGNGCLNSHCQYIVKYVDHSN 218
Query: 290 SMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQL 349
+ G D L LT + N FGC++ G + + DG++GL SL SQ
Sbjct: 219 TTGTYGSDTLGLTTSDA---VKNFQFGCSHRANGF----VGQLDGLMGLGGDTESLVSQT 271
Query: 350 ASQGIIKNVVGHCL-TTNAGGGGYMFLGHDL--VPSWGMAWVPMLDSPFMELYHTEILKI 406
A+ +CL +++ GG++ LG S + P++ Y + I
Sbjct: 272 AA--TYGKAFSYCLPPSSSSAGGFLTLGAAAGGTSSSRYSRTPLVRFNVPTFYGVFLQAI 329
Query: 407 NYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP 466
+ LN+ A G ++ D+G+ T AY L + K+ + A P
Sbjct: 330 TVAGTKLNVPASVFS-GASVVDSGTVITQLPPTAYQALRTAFKK------EMKAYPSAAP 382
Query: 467 V-CWRAKFPIRSIVDVKQFFKTLTLHFGSKWQI-VSTKFHISPEGYLVISKKGNICLGIL 524
V F I V+ TLT G+ + VS F+ CL
Sbjct: 383 VGILDTCFDFSGIKTVRVPVVTLTFSRGAVMDLDVSGIFYAG-------------CLAFT 429
Query: 525 DGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ +G T ILG++ R +++D +G+ C
Sbjct: 430 --ATAQDGDTGILGNVQQRTFEMLFDVGGSTLGFRPGAC 466
>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 16/212 (7%)
Query: 190 FPL-RGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR 248
FP+ RG +Y+T + +G PPR + + +DTGSD+ W+ C + C C + P
Sbjct: 69 FPVERGTNPISRIYYTTLQIGTPPREFNVVIDTGSDVLWVSCIS-CVGCPLQNVTFFDPG 127
Query: 249 MGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIE 304
+ L D C + K G C +Y++EY+D S + G D + T+
Sbjct: 128 ASSSAVKLACSDKRCFS-DLHKKSG----CSPLEYKVEYSDGSFTSGYYISDLISFETVM 182
Query: 305 NGSLTKPN---VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
+ +LT + VFGC+ GL+ GI+GL + ++ + SQL+SQ + V
Sbjct: 183 SSNLTVKSSAPFVFGCSNLHAGLISLPETSIHGIVGLGKGRLLVVSQLSSQRLAPEVFSL 242
Query: 362 CLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDS 393
CL+ GGG + LG + +P+ + P++ S
Sbjct: 243 CLSGGQEGGGVIILGENRLPN--TVYTPLVRS 272
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 108/462 (23%), Positives = 181/462 (39%), Gaps = 59/462 (12%)
Query: 121 ESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSSN 180
+ F + H++ R++ F G D E + V IR H N + +S+
Sbjct: 26 DGFSLEIVHRYS------RESPFYPGNITDY--ERITRLVELSKIRAH----NLAITTSS 73
Query: 181 AVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKG 240
+ ++ F LR + D Y +I+G+P P YL DTGS L W QC+ PC+ +
Sbjct: 74 GFSPEA---FRLRIS-QDDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCE-PCTRRFRQ 128
Query: 241 ANPLYK---PRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARD 297
P++ R LP + C N C +C Y I YA S++ GV A+D
Sbjct: 129 LPPIFNSTASRTYRDLPCQHQFCTN---NQNVFQCRD-DKCVYRIAYAGGSATAGVAAQD 184
Query: 298 ELHLTIENGSLTKPNVVFGCAYDQQGL-LLNTLVKTDGILGLSRAKVSLPSQLASQGIIK 356
L + EN + FGC+ D Q + K GI+GL+ + VSL Q+ I K
Sbjct: 185 ILQ-SAENDRIP---FYFGCSRDNQNFSTFESSGKGGGIIGLNMSPVSLLQQM--NHITK 238
Query: 357 NVVGHC-----LTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSP------FMELYHTEILK 405
N +C L++ + + G+D+ S SP F+ L +
Sbjct: 239 NRFSYCLNLFDLSSPSHATSLLRFGNDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVSVAG 298
Query: 406 INYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKE-VSSDGLVLDASDPT 464
P + G + D+G++ TY ++ AY +I + K G
Sbjct: 299 NRMQIPPGTFALKPDGTGGTIIDSGTAVTYISQTAYFPVITAFKNYFDQHGF-------- 350
Query: 465 LPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPE-GYLVISKKGNICLGI 523
R + + KQ T + + F + PE YL + +G C+ +
Sbjct: 351 ----QRVNIQLSGYICYKQQGHTFHNYPSMAFHFQGADFFVEPEYVYLTVQDRGAFCVAL 406
Query: 524 LDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
+ I+G ++ +YD N+++ + +C +
Sbjct: 407 ---QPISPQQRTIIGALNQANTQFIYDAANRQLLFTPENCQD 445
>gi|449517142|ref|XP_004165605.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 430
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 84/331 (25%), Positives = 140/331 (42%), Gaps = 51/331 (15%)
Query: 253 LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTIENGSLTKP 311
+P SLC N C YE+ Y + ++SS+G L D LHL ++ SL KP
Sbjct: 17 VPCTSSLCNRCTSNQN--------VCPYEMRYLSANTSSIGYLVEDVLHLATDD-SLLKP 67
Query: 312 ---NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
+ FGC Q G+ T +G++GL K+S+PS LA QG+ N C A
Sbjct: 68 VEAKITFGCGTVQTGIFATTAAP-NGLIGLGMEKISVPSFLADQGLTSNSFSMCF--GAD 124
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGW-ALF 427
G G + G D P+ +PF + + + + + +N+G + V + A+F
Sbjct: 125 GYGRIDFG-DTGPA------DQKQTPFNTMLEYQSYNVTF--NVINVGGEPNDVPFTAIF 175
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV---KQF 484
D+G+S+TY T+ AYS + + G+ L + FP ++ +
Sbjct: 176 DSGTSFTYLTEPAYSTITKQMDA----GMKLKRYS-----LFGPNFPFEYCYEIPPGAKE 226
Query: 485 FKTLTLHFGSKWQ--------IVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
F+ LTL+F K V +S + CL I +++ +
Sbjct: 227 FQYLTLNFTMKGGDEFTPTDIFVFLPVDVSTMNIIFEETTHVACLAIAKSTDID-----L 281
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHCMNPG 567
+G + G + ++ +GW+ S C + G
Sbjct: 282 IGQNFMTGYRITFNRDQMVLGWSSSDCYDNG 312
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 113/446 (25%), Positives = 170/446 (38%), Gaps = 68/446 (15%)
Query: 147 RFVDLDGESVVASVNDGIIRPHKSKINKK--LVSSNAVAVDSSSIFPLRGNIYP----DG 200
R V D +V AS D + R + + + +++ A D P G + G
Sbjct: 69 RLVHRDSFAVNASAADLLARRLQRDMRRAAWIITKAATPAD-----PENGTVVTGAPTSG 123
Query: 201 LYFTYMIVGNPPR-----PYYLDMDTGSDLTWIQCDAPCSSCAKGANPLY---KPRMGNI 252
Y + VG P L D GSD+TW+QC PC C P+Y K +
Sbjct: 124 EYIAKITVGTPYENDSSFEALLSPDMGSDVTWLQC-MPCFRCYHQPGPVYNRLKSSSASD 182
Query: 253 LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPN 312
+ C + G + +C Y++EY D SSS G + LT G + P
Sbjct: 183 VGCYAPACRAL--GSSGGCVQFLNECQYKVEYGDGSSSAGDFGVET--LTFPPG-VRVPG 237
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG-- 370
V GC D QGL GILGL R +S PSQ+A G +CL GG
Sbjct: 238 VAIGCGSDNQGLF---PAPAAGILGLGRGSLSFPSQIA--GRYGRSFSYCLAGQGTGGRS 292
Query: 371 -----GYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGS--------SPLNLGA 417
G + + +S Y+ ++ I+ G S L L
Sbjct: 293 STLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTESDLRLDP 352
Query: 418 RNSQVGWALFDTGSSYTYFTKQAYSEL-----IASLKEVSSDGLVLDASDPTLPVCWRAK 472
G + D+G++ T + AY+ +A++KE L P P +
Sbjct: 353 STGH-GGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKE-------LGWPSPGGPFAFFDT 404
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYL--VISKKGNICLGILDGSEVH 530
V + +++HF ++ + P+ YL V S KG +C +
Sbjct: 405 CYSSVRGRVMKKVPAVSMHFAGGVEV-----KLPPQNYLIPVDSNKGTMCFAFAGSGD-- 457
Query: 531 NGSTIILGDISLRGQLVVYDNVNKRI 556
G +II G+I L+G VVYD +R+
Sbjct: 458 RGVSII-GNIQLQGFRVVYDVDGQRV 482
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 86/376 (22%), Positives = 154/376 (40%), Gaps = 46/376 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G YF + VG P R Y+ DTGSD++W+QC +PC C + +P++ P + + L
Sbjct: 79 GDYFARIGVGTPARSVYMVADTGSDVSWLQC-SPCRKCYRQQDPIFNPSLSSSFKPLACA 137
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
S+C +++ C +C Y++ Y D S ++G + + L G +V G
Sbjct: 138 SSICGKLKIKG----CSRKNECMYQVSYGDGSFTVGDFSTETLSF----GEHAVRSVAMG 189
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG-GGYMFL 375
C + QGL L R +S PSQ + +V +CL +
Sbjct: 190 CGRNNQGLFHGAAGLLG----LGRGPLSFPSQTGTS--YASVFSYCLPRRESAIAASLVF 243
Query: 376 GHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN-------LGARNSQVGWALFD 428
G VP + + Y+ + +I SP+N +G+R + G + D
Sbjct: 244 GPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGT--GGVIVD 301
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
+G++ + T AY+ L + + LV S P + + + + + S+ L
Sbjct: 302 SGTAISRLTTPAYTALRDAFRS-----LVTFPSAPGISL-FDTCYDLSSMKTATLPAVVL 355
Query: 489 TLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLV 547
G+ + + +G LV + +G CL E + I+G++ + +
Sbjct: 356 DFDGGASMPLPA-------DGILVNVDDEGTYCLAFAPEEEAFS----IIGNVQQQTFRI 404
Query: 548 VYDNVNKRIGWAKSHC 563
DN +++G A C
Sbjct: 405 SIDNQKEQMGIAPDQC 420
>gi|62954896|gb|AAY23265.1| Similar to probable aspartic proteinase (EC 3.4.23.-) - barley
[Oryza sativa Japonica Group]
gi|77548965|gb|ABA91762.1| Aspartic proteinase Asp1 precursor, putative [Oryza sativa Japonica
Group]
gi|125576451|gb|EAZ17673.1| hypothetical protein OsJ_33214 [Oryza sativa Japonica Group]
Length = 96
Score = 84.7 bits (208), Expect = 1e-13, Method: Composition-based stats.
Identities = 31/49 (63%), Positives = 40/49 (81%)
Query: 189 IFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC 237
+FPL GN+YP G +F M +G P +PY+LD+DTGSDLTW++CDAPC SC
Sbjct: 31 VFPLHGNVYPSGRFFVTMNIGVPEKPYFLDIDTGSDLTWVECDAPCQSC 79
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 91/372 (24%), Positives = 159/372 (42%), Gaps = 42/372 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF + +G PP Y+ +DTGSD++WIQC APCS C + ++P++ P N Y
Sbjct: 147 GEYFLRVGIGKPPSQAYVVLDTGSDVSWIQC-APCSECYQQSDPIFDPISSN--SYSPIR 203
Query: 260 CMEIQ-RNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
C E Q ++ C C YE+ Y D S ++G A + T+ GS NV GC
Sbjct: 204 CDEPQCKSLDLSECRN-GTCLYEVSYGDGSYTVGEFATE----TVTLGSAAVENVAIGCG 258
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
++ +GL V G+LGL K+S P+Q+ + +CL +
Sbjct: 259 HNNEGL----FVGAAGLLGLGGGKLSFPAQVNATSF-----SYCLVNRDSDAVSTLEFNS 309
Query: 379 LVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQVGW-----ALFDTGSS 432
+P A P++ +P ++ ++ LK I+ G L + + +V + D+G++
Sbjct: 310 PLP-RNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGIIIDSGTA 368
Query: 433 YTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF 492
T + Y L + + + G+ C+ R V++ T++ F
Sbjct: 369 VTRLRSEVYDALRDAFVK-GAKGIPKANGVSLFDTCY--DLSSRESVEI----PTVSFRF 421
Query: 493 GSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
++ + YL+ + G C + S I+G++ +G V +D
Sbjct: 422 PEGREL-----PLPARNYLIPVDSVGTFCFAFAPTT----SSLSIIGNVQQQGTRVGFDI 472
Query: 552 VNKRIGWAKSHC 563
N +G++ C
Sbjct: 473 ANSLVGFSVDSC 484
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 93/382 (24%), Positives = 155/382 (40%), Gaps = 59/382 (15%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYKDSLCM-EI 263
+G PP+ + +DTGS L+WIQC + P + + LP LC I
Sbjct: 78 IGTPPQAQQMVLDTGSQLSWIQCHR--KKLPPKPKTSFDPSLSSSFSTLPCSHPLCKPRI 135
Query: 264 QRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQG 323
P C++ + C Y YAD + + G L ++++ T N +T P ++ GCA +
Sbjct: 136 PDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKI--TFSNTEITPP-LILGCATESS- 191
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY-----MFLGHD 378
GILG++R ++S SQ +C+ + G+ +LG D
Sbjct: 192 -------DDRGILGMNRGRLSFVSQAKISKF-----SYCIPPKSNRPGFTPTGSFYLG-D 238
Query: 379 LVPSWGMAWVPMLDSPFME--------LYHTEILKINYGSSPLNLGAR-----NSQVGWA 425
S G +V +L P + Y ++ I +G LN+ G
Sbjct: 239 NPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQT 298
Query: 426 LFDTGSSYTYFTKQAY----SELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
+ D+GS +T+ AY +E++ + G V T +C+ ++ +
Sbjct: 299 MVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGG---TADMCFDG-----NVAMI 350
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
+ L F +I K E LV G C+GI S + S II G++
Sbjct: 351 PRLIGDLVFVFTRGVEIFVPK-----ERVLVNVGGGIHCVGIGRSSMLGAASNII-GNVH 404
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+ V +D N+R+G+AK+ C
Sbjct: 405 QQNLWVEFDVTNRRVGFAKADC 426
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 100/387 (25%), Positives = 157/387 (40%), Gaps = 46/387 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR----MGNILPY 255
G YF + +G PP+ Y L +DTGSDL WIQC PC +C + + P Y P+ NI +
Sbjct: 190 GEYFMDVFIGTPPKHYSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKESSSFENITCH 248
Query: 256 KDSLCMEIQRNHKPGYC-ETCQQCDYEIEYADHSSSMGVLARD--ELHLTIENGSLTK-- 310
D C + P C + Q C Y Y D S++ G A + ++LT NG +
Sbjct: 249 -DPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKH 307
Query: 311 -PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TN 366
NV+FGC + +GL L R +S SQL Q I + +CL ++
Sbjct: 308 VENVMFGCGHWNRGLFHGAAGLLG----LGRGPLSFASQL--QSIYGHSFSYCLVDRNSD 361
Query: 367 AGGGGYMFLGHD--LVPSWGMAWVPML---DSPFMELYHTEILKINYGSSPLNLGARNSQ 421
+ G D L+ + + + ++ Y+ I I L +
Sbjct: 362 TSVSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEETWH 421
Query: 422 V-----GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIR 476
+ G + D+G++ TYF + AY E+I G L P L C+ +
Sbjct: 422 LSKEGGGGTIIDSGTTLTYFAEPAY-EIIKEAFMKKIKGYELVEGFPPLKPCYN----VS 476
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
I ++ + G+ W + I E LV CL IL + + I
Sbjct: 477 GIEKMELPDFGILFSDGAMWDFPVENYFIQIEPDLV-------CLAILGTPK---SALSI 526
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G+ + ++YD R+G+A C
Sbjct: 527 IGNYQQQNFHILYDMKKSRLGYAPMKC 553
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 90/363 (24%), Positives = 146/363 (40%), Gaps = 57/363 (15%)
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLC-MEIQRNHKPGYCETCQ- 276
+DTGSD+TW+QC PC+ C + ++P++ P + Y C + R+ C
Sbjct: 3 LDTGSDVTWVQCQ-PCADCYQQSDPVFDPSLS--ASYAAVSCDSQRCRDLDTAACRNATG 59
Query: 277 QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGIL 336
C YE+ Y D S ++G A + L L S NV GC +D +GL V G+L
Sbjct: 60 ACLYEVAYGDGSYTVGDFATETLTL---GDSTPVGNVAIGCGHDNEGL----FVGAAGLL 112
Query: 337 GLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFM 396
L +S PSQ+++ + +CL D G P++ SP
Sbjct: 113 ALGGGPLSFPSQISA-----STFSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRT 167
Query: 397 E-LYHTEILKINYGSSPLNLGAR------NSQVGWALFDTGSSYTYFTKQAYSELIASLK 449
Y+ + I+ G PL++ A S G + D+G++ T AY+ L
Sbjct: 168 STFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAAL----- 222
Query: 450 EVSSDGLVLDASDPTLP---------VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVS 500
D V A P+LP C+ R+ V+V +L G ++ +
Sbjct: 223 ---RDAFVQGA--PSLPRTSGVSLFDTCY--DLSDRTSVEVPAV--SLRFEGGGALRLPA 273
Query: 501 TKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAK 560
+ I +G G CL N + I+G++ +G V +D +G+
Sbjct: 274 KNYLIPVDG------AGTYCLAFAP----TNAAVSIIGNVQQQGTRVSFDTARGAVGFTP 323
Query: 561 SHC 563
+ C
Sbjct: 324 NKC 326
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 90/369 (24%), Positives = 146/369 (39%), Gaps = 42/369 (11%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS--CAKGANPLYKPRMGNI---LPYK 256
Y + G P P + +DTGSDLTW+QC PCSS C+ +PL+ P + +P
Sbjct: 112 YVATVSFGTPAVPQVVVIDTGSDLTWLQCK-PCSSGQCSPQKDPLFDPSHSSTYSAVPCA 170
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
C ++ + C Q C + I Y D +S++GV +D+ LT+ G++ K + FG
Sbjct: 171 SGECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDK--LTLAPGAIVK-DFYFG 227
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C + + L L L +Q +CL G++ G
Sbjct: 228 CGHSKSSLPGLFDGLL--------GLGRLSESLGAQYGGGGGFSYCLPAVNSKPGFLAFG 279
Query: 377 HDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQV-GWALFDTGSSYT 434
PS G + PM P + T L I G L+L R S G + D+G+ T
Sbjct: 280 AGRNPS-GFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDL--RPSAFSGGMIVDSGTVVT 336
Query: 435 YFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGS 494
Y L A+ +E ++ L C+ D+ + +
Sbjct: 337 VLQSTVYRALRAAFREAMKAYRLVHGD---LDTCY----------DLTGYKNVVVPKIAL 383
Query: 495 KWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNK 554
+ +T P G LV N CL + + +G+ +LG+++ R V++D
Sbjct: 384 TFSGGATINLDVPNGILV-----NGCLAFAETGK--DGTAGVLGNVNQRTFEVLFDTSAS 436
Query: 555 RIGWAKSHC 563
+ G+ C
Sbjct: 437 KFGFRAKAC 445
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 111/450 (24%), Positives = 184/450 (40%), Gaps = 83/450 (18%)
Query: 169 KSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWI 228
+ +I K L S + V + PLR DG Y + +G PP+ + +DTGSDLTW+
Sbjct: 59 QERIKKPLSSVDVV------MEPLRE--VRDG-YLITLNIGTPPQAVQVYLDTGSDLTWV 109
Query: 229 QCDAPCSSCAK----GANPLYKPRMGNIL----PYKD----SLCMEIQRNHKP------G 270
C C + N L P + + L ++D S C+EI + P
Sbjct: 110 PCGNLSFDCIECYDLKNNDLKSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVA 169
Query: 271 YC-------ETC-QQC-DYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQ 321
C TC + C + Y + G+L RD L + P FGC
Sbjct: 170 GCSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDILKARTRD----VPRFSFGC---- 221
Query: 322 QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC-----LTTNAGGGGYMFLG 376
+ +T + GI G R +SLPSQL G ++ HC N + LG
Sbjct: 222 ---VTSTYREPIGIAGFGRGLLSLPSQL---GFLEKGFSHCFLPFKFVNNPNISSPLILG 275
Query: 377 HDLVP---SWGMAWVPMLDSP-FMELYHTEILKINYGSS------PLNLGARNSQ-VGWA 425
+ + + + PML++P + Y+ + I G++ PL L +SQ G
Sbjct: 276 ASALSINLTDSLQFTPMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGM 335
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT-LPVCWRAKFPIRSIV----D 480
L D+G++YT+ + YS+L+ +L+ + + T +C++ P ++ D
Sbjct: 336 LVDSGTTYTHLPEPFYSQLLTTLQSTITYPRATETESRTGFDLCYKVPCPNNNLTSLEND 395
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEG----YLVISKKGNI--CLGILDGSEVHNGST 534
V F ++T HF + + + P+G + G++ CL + + G
Sbjct: 396 VMMIFPSITFHFLNNATL------LLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGDYGPA 449
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHCM 564
+ G + VVYD +RIG+ C+
Sbjct: 450 GVFGSFQQQNVKVVYDLEKERIGFQAMDCV 479
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/378 (24%), Positives = 159/378 (42%), Gaps = 44/378 (11%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQC--DAPCSSCAKGANPLYKPRMG--NILPYKD 257
Y Y+ VG PP DTGSDL W+ C + + GA + R ++L +
Sbjct: 100 YLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRSTTYSLLSCQS 159
Query: 258 SLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS----LTKPNV 313
+ C + + C+ +C Y+ Y D S ++GVL+ + G + P V
Sbjct: 160 AACQALSQAS----CDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVRVPRV 215
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGG 370
FGC+ G ++DG++GL +SL SQL + I +CL A
Sbjct: 216 SFGCSTGSAGSF-----RSDGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAANSS 270
Query: 371 GYMFLG-HDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDT 429
+ G +V G A P++ S ++ Y+T L+ + A +S++ + D+
Sbjct: 271 STLSFGARAVVSDPGAASTPLVPSE-VDSYYTVALESVAVAGQDVASANSSRI---IVDS 326
Query: 430 GSSYTYFTKQAYSELIASLKEVSSDGLVLDASDP---TLPVCWRAKFPIRSIVDVKQF-F 485
G++ T+ L+A L+ + L + P L +C+ ++ + F
Sbjct: 327 GTTLTFLDPALLRPLVAELERR----IRLPRAQPPEQLLQLCYD----VQGKSQAEDFGI 378
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQ 545
+TL FG + + PE + ++G +CL ++ SE S ILG+I+ +
Sbjct: 379 PDVTLRFGGGASVT-----LRPENTFSLLEEGTLCLVLVPVSESQPVS--ILGNIAQQNF 431
Query: 546 LVVYDNVNKRIGWAKSHC 563
V YD + + +A C
Sbjct: 432 HVGYDLDARTVTFAAVDC 449
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 86/376 (22%), Positives = 154/376 (40%), Gaps = 46/376 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G YF + VG P R Y+ DTGSD++W+QC +PC C + +P++ P + + L
Sbjct: 12 GDYFARIGVGTPARSVYMVADTGSDVSWLQC-SPCRKCYRQQDPIFNPSLSSSFKPLACA 70
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
S+C +++ C +C Y++ Y D S ++G + + L G +V G
Sbjct: 71 SSICGKLKIKG----CSRKNKCMYQVSYGDGSFTVGDFSTETLSF----GEHAVRSVAMG 122
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG-GGYMFL 375
C + QGL L R +S PSQ + +V +CL +
Sbjct: 123 CGRNNQGLFHGAAGLLG----LGRGPLSFPSQTGTS--YASVFSYCLPRRESAIAASLVF 176
Query: 376 GHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN-------LGARNSQVGWALFD 428
G VP + + Y+ + +I SP+N +G+R + G + D
Sbjct: 177 GPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGT--GGVIVD 234
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
+G++ + T AY+ L + + LV S P + + + + + S+ L
Sbjct: 235 SGTAISRLTTPAYTALRDAFRS-----LVTFPSAPGISL-FDTCYDLSSMKTATLPAVVL 288
Query: 489 TLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLV 547
G+ + + +G LV + +G CL E + I+G++ + +
Sbjct: 289 DFDGGASMPLPA-------DGILVNVDDEGTYCLAFAPEEEAFS----IIGNVQQQTFRI 337
Query: 548 VYDNVNKRIGWAKSHC 563
DN +++G A C
Sbjct: 338 SIDNQKEQMGIAPDQC 353
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 90/316 (28%), Positives = 132/316 (41%), Gaps = 40/316 (12%)
Query: 195 NIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILP 254
N P Y ++ +G PP+P L +DTGSDL W QC PC +C A P + P + L
Sbjct: 75 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSSTLS 133
Query: 255 YK---DSLC--MEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLT 309
+LC + + P + Q C Y Y D S + G L D+ S+
Sbjct: 134 LTSCDSTLCQGLPVASCGSPKFWPN-QTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASV- 191
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTD--GILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367
P V FGC GL N + K++ GI G R +SLPSQL HC T
Sbjct: 192 -PGVAFGC-----GLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNF-----SHCFTAVN 240
Query: 368 G---GGGYMFLGHDLVPS--WGMAWVPMLDSPFM-ELYHTEILKINYGSS----PLNLGA 417
G + L DL S + P++ +P Y+ + I GS+ P + A
Sbjct: 241 GLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFA 300
Query: 418 RNSQVGWALFDTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTLPVCWRAKFPIR 476
+ G + D+G++ T + Y + A +V + + +DP C A P+R
Sbjct: 301 LKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--CLSA--PLR 356
Query: 477 SIVDVKQFFKTLTLHF 492
+ K + L LHF
Sbjct: 357 A----KPYVPKLVLHF 368
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 138/332 (41%), Gaps = 57/332 (17%)
Query: 162 DGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMI---VGNPPRPYYLD 218
D I + KI K+ S++ ++ N+ P Y +++ +G PP P
Sbjct: 61 DTIWDHYSHKILKQTFSNDYIS-----------NLVPSPRYVVFLMNFSIGEPPIPQLAV 109
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQ-Q 277
MDTGS LTW+ C PCSSC++ + P++ P + Y + C E + C+ +
Sbjct: 110 MDTGSSLTWVMCH-PCSSCSQQSVPIFDPSKSS--TYSNLSCSECNK------CDVVNGE 160
Query: 278 CDYEIEYADHSSSMGVLARDELHL-TIENGSLTKPNVVFGC---------AYDQQGLLLN 327
C Y +EY SS G+ AR++L L TI+ + P+++FGC Y QG+
Sbjct: 161 CPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRKFSISSNGYPYQGI--- 217
Query: 328 TLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAW 387
+G+ GL + SL + +G+ TN Y F L M
Sbjct: 218 -----NGVFGLGSGRFSLLPSFGKK--FSYCIGNLRNTN-----YKFNRLVLGDKANMQG 265
Query: 388 VPMLDSPFMELYHTEILKINYGSSPLNLG------ARNSQVGWALFDTGSSYTYFTKQAY 441
+ LY+ + I+ G L++ + + D+G+ +T+ TK +
Sbjct: 266 DSTTLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTWLTKYGF 325
Query: 442 SELIASLKEVSSDGLVLDASDPTLP--VCWRA 471
L ++ + LVL D P +C+
Sbjct: 326 EVLSFEVENLLEGVLVLAQQDKHNPYTLCYSG 357
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 95/388 (24%), Positives = 154/388 (39%), Gaps = 44/388 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G YF + VG PP+ + L +DTGSDL WIQC PC +C + + P Y P+ + +
Sbjct: 193 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKDSSSFRNISCH 251
Query: 257 DSLCMEIQRNHKPGYCET-CQQCDYEIEYADHSSSMGVLARD--ELHLTIENGSLT---K 310
D C + P C+ Q C Y Y D S++ G A + ++LT NG
Sbjct: 252 DPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHV 311
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNA 367
NV+FGC + +GL L + +S SQ+ Q + +CL +NA
Sbjct: 312 ENVMFGCGHWNRGLFHGAAGLLG----LGKGPLSFASQM--QSLYGQSFSYCLVDRNSNA 365
Query: 368 GGGGYMFLGHD--LVPSWGMAWVPM---LDSPFMELYHTEILKINYGSSPLNLGAR---- 418
+ G D L+ + + D Y+ +I + L +
Sbjct: 366 SVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWHL 425
Query: 419 -NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRS 477
+ G + D+G++ TYF + AY E+I G L P L C+ +
Sbjct: 426 SSEGAGGTIIDSGTTLTYFAEPAY-EIIKEAFVRKIKGYELVEGLPPLKPCYN----VSG 480
Query: 478 IVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIIL 537
I ++ + G+ W + E Y + +CL IL + I+
Sbjct: 481 IEKMELPDFGILFADGAVW-------NFPVENYFIQIDPDVVCLAILGNPR---SALSII 530
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHCMN 565
G+ + ++YD R+G+A C +
Sbjct: 531 GNYQQQNFHILYDMKKSRLGYAPMKCAD 558
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 104/443 (23%), Positives = 181/443 (40%), Gaps = 59/443 (13%)
Query: 156 VVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYP---------DGLYFTYM 206
V+ N + + K +K++V++ VA SS+ G + G YF +
Sbjct: 118 VLEKNNQNTVSQKQKKNDKEVVTTTPVA---SSVEEQAGQLVATLESGMTLGSGEYFMDV 174
Query: 207 IVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRN 266
+VG+PP+ + L +DTGSDL WIQC PC C + Y P+ YK+ C + + N
Sbjct: 175 LVGSPPKHFSLILDTGSDLNWIQC-LPCYDCFQQNGAFYDPKAS--ASYKNITCNDQRCN 231
Query: 267 -----HKPGYCET-CQQCDYEIEYADHSSSMGVLARD--ELHLTIENGS---LTKPNVVF 315
P C++ Q C Y Y D S++ G A + ++LT GS N++F
Sbjct: 232 LVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMF 291
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-----TNAGGG 370
GC + +GL L R +S SQL Q + + +CL TN
Sbjct: 292 GCGHWNRGLFHGAAGLLG----LGRGPLSFSSQL--QSLYGHSFSYCLVDRNSDTNVSSK 345
Query: 371 GYMFLGHDLVPSWGM---AWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV----- 422
DL+ + ++V ++ Y+ +I I LN+ +
Sbjct: 346 LIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGA 405
Query: 423 GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
G + D+G++ +YF + AY + + E + + P L C F + I +V+
Sbjct: 406 GGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPC----FNVSGIHNVQ 461
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISL 542
+ G+ W + E + + +CL +L + + I+G+
Sbjct: 462 LPELGIAFADGAVWNFPT-------ENSFIWLNEDLVCLAMLGTPK---SAFSIIGNYQQ 511
Query: 543 RGQLVVYDNVNKRIGWAKSHCMN 565
+ ++YD R+G+A + C +
Sbjct: 512 QNFHILYDTKRSRLGYAPTKCAD 534
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 97/418 (23%), Positives = 171/418 (40%), Gaps = 49/418 (11%)
Query: 164 IIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGL-YFTYMIVGNPPRPYYLDMDTG 222
++R +++ N L ++ + P + D L Y + G P P L +DTG
Sbjct: 83 MLRRDRARRNHILRKASGRRITLGVSIPTSLGAFVDSLQYVVTLGFGTPAVPQVLLIDTG 142
Query: 223 SDLTWIQCDAPC--SSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQ 277
SDL+W+QC PC S+C +P++ P + +P C ++ + C
Sbjct: 143 SDLSWVQCQ-PCNSSTCYPQKDPVFDPSASSTYAPVPCGSEACRDLDPDSYANGCTNSSS 201
Query: 278 ----CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTD 333
C Y I+Y + +++GV + + L L+ E ++ N FGC Q+G+
Sbjct: 202 GASLCQYGIQYGNGDTTVGVYSTETLTLSPEAATVVN-NFSFGCGLVQKGVFDLFDGLL- 259
Query: 334 GILGLSRAKVSLPSQLASQ--GIIKNVVGHCLTTNAGGGGYMFLGHDLVP---SWGMAWV 388
P L SQ G +CL G++ LG + G +
Sbjct: 260 -------GLGGAPESLVSQTTGTYGGAFSYCLPAGNSTAGFLALGAPATGGNNTAGFQFT 312
Query: 389 PM--LDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIA 446
P+ +++ F Y ++ I+ G L++ G + D+G+ T + AYS L
Sbjct: 313 PLQVVETTF---YLVKLTGISVGGKQLDI-EPTVFAGGMIIDSGTIVTGLPETAYSALRT 368
Query: 447 SLKE-VSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHI 505
+ + +S+ L+ D L C+ F + V V LT G V+ +
Sbjct: 369 AFRSAMSAYPLLPPNDDEDLDTCY--DFTGNTNVTVPTV--ALTFEGG-----VTIDLDV 419
Query: 506 SPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
P G L+ + CL + G+ +G T I+G+++ R V+YD+ +G+ C
Sbjct: 420 -PSGVLL-----DGCLAFVAGAS--DGDTGIIGNVNQRTFEVLYDSARGHVGFRAGAC 469
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 89/380 (23%), Positives = 167/380 (43%), Gaps = 58/380 (15%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF+ + +G PP P Y+ +DTGSD++W+QC APC+ C + +P+++P SL
Sbjct: 149 GEYFSRVGIGRPPSPVYMVLDTGSDVSWVQC-APCAECYEQTDPIFEPTSSASF---TSL 204
Query: 260 CMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGC 317
E ++ E C+ C YE+ Y D S ++G D + T+ GS + N+ GC
Sbjct: 205 SCETEQCKSLDVSE-CRNGTCLYEVSYGDGSYTVG----DFVTETVTLGSTSLGNIAIGC 259
Query: 318 AYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-TNAGGGGYMFLG 376
++ +GL + L +S PSQL + +CL ++ +
Sbjct: 260 GHNNEGLFIGAAGLLG----LGGGSLSFPSQLNASSF-----SYCLVDRDSDSTSTLDFN 310
Query: 377 HDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV-----GWALFDTG 430
+ P A P+ +P ++ ++ + ++ G + L + + Q+ G + D+G
Sbjct: 311 SPITPDAVTA--PLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSG 368
Query: 431 SSYTYFTKQAYSELIASLKEVSSD-----GLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
++ T Y+ L + + + D G+ L C+ +S V+V
Sbjct: 369 TAVTRLQTTVYNVLRDAFVKSTHDLQTARGVAL------FDTCY--DLSSKSRVEV---- 416
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTI-ILGDISLR 543
T++ HF + ++ + + YL+ + +G C ST+ ILG+ +
Sbjct: 417 PTVSFHFANGNEL-----PLPAKNYLIPVDSEGTFCFAF-----APTDSTLSILGNAQQQ 466
Query: 544 GQLVVYDNVNKRIGWAKSHC 563
G V +D N +G++ + C
Sbjct: 467 GTRVGFDLANSLVGFSPNKC 486
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 164/385 (42%), Gaps = 57/385 (14%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDA------PCSSCAKGANPLYKPRMGN---ILPYKDS 258
+G PP+P L +DTGSDL W QC +S ++ PLY+PR + LP D
Sbjct: 90 IGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFAYLPCSDR 149
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
LC E Q ++K C +C Y+ Y + GVLA + + N ++ P + FGC
Sbjct: 150 LCQEGQFSYK--NCARNNRCMYDELYGSAEAG-GVLASETFTFGV-NAKVSLP-LGFGCG 204
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG------GY 372
G LV G++GLS +SL SQL+ +CLT A G
Sbjct: 205 ALSAG----DLVGASGLMGLSPGIMSLVSQLSVPRF-----SYCLTPFAERKTSPLLFGA 255
Query: 373 MFLGHDLVPSWGMAWVPMLDSPFMEL--YHTEILKINYGSSPLNLGARNSQV------GW 424
M + + +L +P ME Y+ ++ ++ G+ L++ A + + G
Sbjct: 256 MADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSLGMIKPDGSGG 315
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEV----SSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
+ D+GS+ +Y + A+ + ++ E ++G D D L + ++
Sbjct: 316 TIVDSGSTMSYLEETAFRAVKKAVVEAVRLPVANGTDEDYDDYELCFALPTGVAMEAV-- 373
Query: 481 VKQFFKT--LTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILG 538
KT L LHF + + + Y + G +CL + G+ I+G
Sbjct: 374 -----KTPPLVLHFDGGAAMTLPR-----DNYFQEPRAGLMCLAV--GTSPDGFGVSIIG 421
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHC 563
++ + V++D N++ +A + C
Sbjct: 422 NVQQQNMHVLFDVRNQKFSFAPTKC 446
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 99/429 (23%), Positives = 172/429 (40%), Gaps = 48/429 (11%)
Query: 166 RPHKSKINKKLVSSNAVAVDSSSIFPLR-------GNIYPDGLYFTYMIVGNPPRPYYLD 218
+ K++ +K ++S+ V + + P + G G YF ++VG PP+ + L
Sbjct: 117 KKQKNEKVRKKITSDISLVGAPEVSPGKLIATLESGMTLGSGEYFMDVLVGTPPKHFSLI 176
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDSLCMEIQRNHKPGYCET- 274
+DTGSDL W+QC PC C Y P+ + D C I P CE+
Sbjct: 177 LDTGSDLNWLQC-LPCYDCFHQNGMFYDPKTSASFKNITCNDPRCSLISSPDPPVQCESD 235
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHL---TIENGSLTKP--NVVFGCAYDQQGLLLNTL 329
Q C Y Y D S++ G A + + T E GS N++FGC + +GL
Sbjct: 236 NQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGAS 295
Query: 330 VKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGGYMFLGH--DLVPSWG 384
GL R +S SQL Q + + +CL +N + G DL+
Sbjct: 296 GLL----GLGRGPLSFSSQL--QSLYGHSFSYCLVDRNSNTNVSSKLIFGEDKDLLNHTN 349
Query: 385 MAWVPML---DSPFMELYHTEILKINYGSSPLNLGARNSQV-----GWALFDTGSSYTYF 436
+ + + ++ Y+ +I I G L++ + G + D+G++ +YF
Sbjct: 350 LNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYF 409
Query: 437 TKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKW 496
+ AY + E + + P L C F + I + L + F
Sbjct: 410 AEPAYEIIKNKFAEKMKENYPIFRDFPVLDPC----FNVSGIEENNIHLPELGIAF---- 461
Query: 497 QIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRI 556
+ T ++ E + + +CL IL + + I+G+ + ++YD R+
Sbjct: 462 -VDGTVWNFPAENSFIWLSEDLVCLAILGTPK---STFSIIGNYQQQNFHILYDTKRSRL 517
Query: 557 GWAKSHCMN 565
G+ + C +
Sbjct: 518 GFTPTKCAD 526
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 88/384 (22%), Positives = 151/384 (39%), Gaps = 47/384 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YFT + VG P P + +DTGSD+ W+QC APC C + ++ PR + D
Sbjct: 145 GEYFTKIGVGTPVTPALMVLDTGSDVVWLQC-APCRRCYDQSGQMFDPRASHSYGAVD-C 202
Query: 260 CMEIQRNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
+ R G C+ ++ C Y++ Y D S + G A + LT +G+ P V GC
Sbjct: 203 AAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATET--LTFASGARV-PRVALGCG 259
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA----------- 367
+D +GL + L R +S PSQ++ + +CL
Sbjct: 260 HDNEGLFVAAAGLLG----LGRGSLSFPSQISRR--FGRSFSYCLVDRTSSSASATSRSS 313
Query: 368 ----GGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA---RNS 420
G G LG ++ G P + H + + ++
Sbjct: 314 TVTFGSGARGALGRRVLHPDGEE--PQDGDVLLRAAHGHQRRRRARPGRGRVRPPPDPST 371
Query: 421 QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
G + D+G + + + A+ ++ GL L +L + + + +
Sbjct: 372 GRGGVIVDSGRPSPAWARAGRTPPCATRSRAAAAGLRLSPGGFSL---FDTCYDLSGLKV 428
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGD 539
VK T+++HF + + PE YL+ + +G C +G I+G+
Sbjct: 429 VK--VPTVSMHFAGGAEAA-----LPPENYLIPVDSRGTFCFAFAG----TDGGVSIIGN 477
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
I +G VV+D +R+G+ C
Sbjct: 478 IQQQGFRVVFDGDGQRLGFVPKGC 501
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 83.6 bits (205), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 95/384 (24%), Positives = 168/384 (43%), Gaps = 62/384 (16%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDSLCM-EI 263
+G PP+ + +DTGS L+WIQC + ++ P + ++LP LC I
Sbjct: 83 IGTPPQSQQMILDTGSQLSWIQCHKKVPR-KPPPSTVFDPSLSSSFSVLPCNHPLCKPRI 141
Query: 264 QRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQG 323
P C+ + C Y YAD + + G L R+++ + S + P ++ GCA D
Sbjct: 142 PDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITFST---SQSTPPLILGCAEDAS- 197
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY-----MFLGHD 378
GILG++ ++S ASQ I +C+ T G+ +LG +
Sbjct: 198 -------DDKGILGMNLGRLS----FASQAKITK-FSYCVPTRQVRPGFTPTGSFYLGEN 245
Query: 379 LVPSWGMAWVPMLD------SPFME-LYHTEILK-INYGSSPLNL-----GARNSQVGWA 425
S G ++ +L P ++ L HT L+ I G+ LN+ A S G +
Sbjct: 246 -PNSAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQS 304
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVS----SDGLVLDA-SDPTLPVCWRAKFPIRSIVD 480
+ D+GS +TY AY+++ + ++ G V SD +C+ + ++
Sbjct: 305 MIDSGSEFTYLVDVAYNKVREEVVRLAGPRLKKGYVYSGVSD----MCFDG-----NAME 355
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI-CLGILDGSEVHNGSTIILGD 539
+ + + F +IV K G ++ G + C+GI SE+ ++ I+G+
Sbjct: 356 IGRLIGNMVFEFDKGVEIVIEK------GRVLADVGGGVHCVGI-GRSEMLGAASNIIGN 408
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
+ V +D N+R+G+ K+ C
Sbjct: 409 FHQQNLWVEFDIANRRVGFGKADC 432
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 83.6 bits (205), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 106/417 (25%), Positives = 168/417 (40%), Gaps = 51/417 (12%)
Query: 161 NDGIIRPHKSKINK---KLV--SSNAVAVDSSSIFPLRGNI-YPDGLYFTYMIVGNPPRP 214
+D IIR ++++ KL S+N V+ S+ P + I G Y + +G P
Sbjct: 85 HDEIIRRDQARVESIYSKLSKNSANEVSEAKSTELPAKSGITLGSGNYIVTIGIGTPKHD 144
Query: 215 YYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCET 274
L DTGSDLTW QC+ SC P + P + Y++ C E+
Sbjct: 145 LSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSST--YQNVSCSSPMCEDA----ES 198
Query: 275 CQ--QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKT 332
C C Y I Y D S + G LA+++ LT S +V FGC + QGL
Sbjct: 199 CSASNCVYSIVYGDKSFTQGFLAKEKFTLT---NSDVLEDVYFGCGENNQGLFDGVAGLL 255
Query: 333 DGILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHDLVPSWGMAWVPML 391
L K+SLP+Q + N+ +CL + + G++ G + S + + P+
Sbjct: 256 G----LGPGKLSLPAQTTTT--YNNIFSYCLPSFTSNSTGHLTFGSAGI-SESVKFTPIS 308
Query: 392 DSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEV 451
P Y +I+ I+ G L + + A+ D+G+ +T + Y+EL + KE
Sbjct: 309 SFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELRSVFKEK 368
Query: 452 -----SSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHIS 506
S+ G L C+ F V + T+ F ST +
Sbjct: 369 MSSYKSTSGYGL------FDTCY--DFTGLDTVT----YPTIAFSFAG-----STVVELD 411
Query: 507 PEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G + K +CL ++ I G++ VVYD R+G+A + C
Sbjct: 412 GSGISLPIKISQVCLAFAGNDDL----PAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 83.6 bits (205), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 98/390 (25%), Positives = 160/390 (41%), Gaps = 44/390 (11%)
Query: 193 RGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS-CAKGANPLYKPRMG- 250
R ++ G Y + +G PP PY DTGSDL W QC APC + C + PLY P
Sbjct: 105 RKDLPNGGEYLMTLAIGTPPLPYAAVADTGSDLIWTQC-APCGTQCFEQPAPLYNPASST 163
Query: 251 --NILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSL 308
++LP SL M C C Y Y ++ GV +
Sbjct: 164 TFSVLPCNSSLSMCAGALAGAAPPPGC-ACMYYQTYGTGWTA-GVQGSETFTFGSSAADQ 221
Query: 309 TK-PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--T 365
+ P V FGC+ + + G++GL R +SL SQL + +CLT
Sbjct: 222 ARVPGVAFGCSNASS----SDWNGSAGLVGLGRGSLSLVSQLGA-----GRFSYCLTPFQ 272
Query: 366 NAGGGGYMFLG-HDLVPSWGMAWVPMLDS----PFMELYHTEILKINYGSS--PLNLGA- 417
+ + LG + G+ P + S P Y+ + I+ G+ P++ GA
Sbjct: 273 DTNSTSTLLLGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAF 332
Query: 418 --RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGL-VLDASDPT-LPVCWRAKF 473
+ G + D+G++ T AY ++ A++K L +D SD T L +C+
Sbjct: 333 SLKPDGTGGLIIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPA 392
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGS 533
P + V ++TLHF ++ P +IS G CL + + ++ G+
Sbjct: 393 PTSAPPAV---LPSMTLHFDGADMVL-------PADSYMISGSGVWCLAMRNQTD---GA 439
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G+ + ++YD + + +A + C
Sbjct: 440 MSTFGNYQQQNMHILYDVREETLSFAPAKC 469
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 92/375 (24%), Positives = 157/375 (41%), Gaps = 39/375 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y +G P DTGSDL W QC PC C + PL+ P+ + Y+D
Sbjct: 90 GEYLMKFSLGTPAFDILAIADTGSDLIWTQC-KPCDQCYEQDAPLFDPKSSST--YRDIS 146
Query: 260 CMEIQRN--HKPGYC--ETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVV 314
C Q + + C E + C Y Y D S + G +A D + L +G + P +
Sbjct: 147 CSTKQCDLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLPKAI 206
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC---LTTNAGGGG 371
GC ++ G K GI+GL +SL SQL S I +C L++NA
Sbjct: 207 IGCGHNNGGSFTE---KGSGIVGLGGGPISLISQLGST--IDGKFSYCLVPLSSNATNSS 261
Query: 372 YMFLGHD-LVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL--GARNSQVGWALFD 428
+ G + +V G+ P++ Y + ++ GS + + + G + D
Sbjct: 262 KLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGTSEGNIIID 321
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
+G++ T F + +SEL +++++ + V D S L +C+ +D F ++
Sbjct: 322 SGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSG-ILSLCYS--------IDADLKFPSI 372
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
T HF ++P V +C + +++G+ I G+++ LV
Sbjct: 373 TAHFDGA------DVKLNPLNTFVQVSDTVLCFAF---NPINSGA--IFGNLAQMNFLVG 421
Query: 549 YDNVNKRIGWAKSHC 563
YD K + + + C
Sbjct: 422 YDLEGKTVSFKPTDC 436
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 96/388 (24%), Positives = 148/388 (38%), Gaps = 68/388 (17%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYK 256
G YFT + VG P R Y+ +DTGSD+ WIQC+ PC C A+P++ P + +
Sbjct: 155 GEYFTRIGVGTPTREQYMVLDTGSDVAWIQCE-PCRECYSQADPIFNPSYSASFSTVGCD 213
Query: 257 DSLCMEIQR--NHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
++C ++ H G C YE Y D S S G A + L G+ + NV
Sbjct: 214 SAVCSQLDAYDCHSGG-------CLYEASYGDGSYSTGSFATETLTF----GTTSVANVA 262
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-TNAGGGGYM 373
GC + GL + L +S P+Q+ +Q + +CL + G +
Sbjct: 263 IGCGHKNVGLFIGAAGLLG----LGAGALSFPNQIGTQ--TGHTFSYCLVDRESDSSGPL 316
Query: 374 FLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLN-------LGARNSQVGWA 425
G VP G + P+ +P + Y+ + I+ G + L+ S G
Sbjct: 317 QFGPKSVP-VGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSGHGGF 375
Query: 426 LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
+ D+G+ T AY + D V A LP D F
Sbjct: 376 IIDSGTVVTRLVTSAYDAV--------RDAFV--AGTGQLPR-----------TDAVSIF 414
Query: 486 KTLTLHFGSKWQIVST-KFHISPEGYLVISKK---------GNICLGILDGSEVHNGSTI 535
T G ++ V T FH S L++ K G C + S
Sbjct: 415 DTCYDLSGLQFVSVPTVGFHFSNGASLILPAKNYLIPMDTVGTFCFAFAPAAS----SVS 470
Query: 536 ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G+ + V +D+ N +G+A C
Sbjct: 471 IMGNTQQQHIRVSFDSANSLVGFAFDQC 498
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 119/498 (23%), Positives = 190/498 (38%), Gaps = 71/498 (14%)
Query: 117 DENKESF----------VFPLYHKFGIR--EVSQRDAEFKLG---RFVDLDGESVVASVN 161
DE E+F F L H+ G + E Q +F L R +L +
Sbjct: 85 DEESEAFPAQKPHQNLVKFHLKHRSGSKDAEPKQSVVDFTLSDLTRIQNLHRRVIEKKNQ 144
Query: 162 DGIIRPHKSKINKKLVSSN---AVAVDSSSIFPLRGNIYP---------DGLYFTYMIVG 209
+ I R KS+ + S A S + P+ G + G YF + VG
Sbjct: 145 NTISRLQKSQKEQPKQSYKPVVAAPAASRTTSPVSGQLVATLESGVSLGSGEYFMDVFVG 204
Query: 210 NPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRN 266
PP+ + L +DTGSDL WIQC PC +C + + P Y P+ + + D C +
Sbjct: 205 TPPKHFSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKDSSSFRNISCHDPRCQLVSAP 263
Query: 267 HKPGYCET-CQQCDYEIEYADHSSSMGVLARD--ELHLTIENGSLT---KPNVVFGCAYD 320
P C+ Q C Y Y D S++ G A + ++LT NG+ NV+FGC +
Sbjct: 264 DPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKHVENVMFGCGHW 323
Query: 321 QQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGGYMFLGH 377
+GL L + +S SQ+ Q + +CL +NA + G
Sbjct: 324 NRGLFHGAAGLLG----LGKGPLSFASQM--QSLYGQSFSYCLVDRNSNASVSSKLIFGE 377
Query: 378 D--LVPSWGMAWVPM---LDSPFMELYHTEILKINYGSSPLNLGAR-----NSQVGWALF 427
D L+ + + D Y+ +I + L + + G +
Sbjct: 378 DKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWHLSSEGAGGTII 437
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKT 487
D+G++ TYF + AY E+I G L P L C+ +V K
Sbjct: 438 DSGTTLTYFAEPAY-EIIKEAFVRKIKGYQLVEGLPPLKPCY----------NVSGIEKM 486
Query: 488 LTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLV 547
FG + + ++ E Y + +CL IL + I+G+ + +
Sbjct: 487 ELPDFGILFADEAV-WNFPVENYFIWIDPEVVCLAILGNPR---SALSIIGNYQQQNFHI 542
Query: 548 VYDNVNKRIGWAKSHCMN 565
+YD R+G+A C +
Sbjct: 543 LYDMKKSRLGYAPMKCAD 560
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 78/336 (23%), Positives = 134/336 (39%), Gaps = 32/336 (9%)
Query: 113 SNNDDENKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKI 172
S + +E E ++ + H+ + + D +L + D + V + +IR S
Sbjct: 62 SEDHEEGGEKWMMKVVHRDQLSFGNSDDHRHRLDGRLKRDAKRVAS-----LIRRLSSG- 115
Query: 173 NKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDA 232
+ VD + G G YF + VG+PPR Y+ +D+GSD+ W+QC
Sbjct: 116 -----GGGSYRVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ- 169
Query: 233 PCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMG 292
PC+ C ++P++ P R G C +C YE+ Y D S + G
Sbjct: 170 PCTQCYHQSDPVFDPADSASFTGVSCSSSVCDRLENAG-CHA-GRCRYEVSYGDGSYTKG 227
Query: 293 VLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQ 352
LA + L G +V GC + +G+ + L +S QL Q
Sbjct: 228 TLALETLTF----GRTMVRSVAIGCGHRNRGMFVGAAGLLG----LGGGSMSFVGQLGGQ 279
Query: 353 GIIKNVVGHCLTTNA-GGGGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGS 410
+CL + G + G + +P+ G AWVP++ +P Y+ + + G
Sbjct: 280 --TGGAFSYCLVSRGTDSSGSLVFGREALPA-GAAWVPLVRNPRAPSFYYIGLAGLGVGG 336
Query: 411 SPLNLGARNSQV-----GWALFDTGSSYTYFTKQAY 441
+ + ++ G + DTG++ T AY
Sbjct: 337 IRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAY 372
>gi|213998812|gb|ACJ60773.1| nucellin [Hordeum euclaston]
Length = 154
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 82/157 (52%), Gaps = 18/157 (11%)
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAG 368
K + FGC Y Q+ + DGILGL K +QL Q +I NV+GHCL++
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSK-- 63
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G G +++G PS G+ WVPM +S F Y + ++ + P+ R + A+FD
Sbjct: 64 GKGVLYVGDFNPPSRGVTWVPMKESLF--YYSAGLAELLIDNQPI----RGNPTFEAVFD 117
Query: 429 TGSSYTYFTKQAYSELI---------ASLKEVSSDGL 456
+GS+YT+ Q Y+E++ +SL+EV D L
Sbjct: 118 SGSTYTHVPAQIYNEIVSKVRGTLSESSLEEVKGDAL 154
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 83.2 bits (204), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 74/232 (31%), Positives = 104/232 (44%), Gaps = 32/232 (13%)
Query: 209 GNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQR 265
G+P + +DTGSDLTW+QC PCS+C +PL+ P + S C + R
Sbjct: 103 GSPAANLTVIVDTGSDLTWVQCK-PCSACYAQRDPLFDPAGSATYAAVRCNASACADSLR 161
Query: 266 --NHKPGYCETC----QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
PG C + ++C Y + Y D S S GVLA D + L G + VFGC
Sbjct: 162 AATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVAL----GGASLGGFVFGCGL 217
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL--TTNAGGGGYMFL-- 375
+GL T G++GL R ++SL SQ AS+ V +CL T+ G + L
Sbjct: 218 SNRGLFGGTA----GLMGLGRTELSLVSQTASR--YGGVFSYCLPAATSGDASGSLSLGG 271
Query: 376 GHDLVPSWG----MAWVPMLDSPFM-ELYHTEILKINYGSSPL---NLGARN 419
G D S+ +A+ M+ P Y + G + L LGA N
Sbjct: 272 GDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASN 323
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 83.2 bits (204), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 89/378 (23%), Positives = 147/378 (38%), Gaps = 53/378 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP-------RMGNI 252
G YF + +G P + +Y+ +DTGSD+ W+QC PC C + +P++ P R+G
Sbjct: 158 GEYFLRVGIGRPSKTFYMVIDTGSDVNWLQC-KPCDDCYQQVDPIFDPASSSSFSRLGCQ 216
Query: 253 LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPN 312
P +L + RN C Y++ Y D S ++G A + + +GS+ K
Sbjct: 217 TPQCRNLDVFACRN---------DSCLYQVSYGDGSYTVGDFATETVSFG-NSGSVDK-- 264
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY 372
V GC +D +GL + P L SQ I + +CL
Sbjct: 265 VAIGCGHDNEGLFVGAAGLI--------GLGGGPLSLTSQ-IKASSFSYCLVNRDSVDSS 315
Query: 373 MFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL-----GARNSQVGWALF 427
+ PS + +S Y+ I ++ G L + S G +
Sbjct: 316 TLEFNSAKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIV 375
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKT 487
D G++ T QAY+ L + +++ D L + C+ R+ V V T
Sbjct: 376 DCGTAVTRLQTQAYNALRDTFVKLTKD-LPSTSGFALFDTCY--NLSSRTSVRV----PT 428
Query: 488 LTLHF-GSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRGQ 545
+ F G K + P YL+ + G CL S I+G++ +G
Sbjct: 429 VAFLFDGGK------SLPLPPSNYLIPVDSAGTFCLAF----APTTASLSIIGNVQQQGT 478
Query: 546 LVVYDNVNKRIGWAKSHC 563
V YD N ++ ++ C
Sbjct: 479 RVTYDLANSQVSFSSRKC 496
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 83.2 bits (204), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 104/430 (24%), Positives = 177/430 (41%), Gaps = 64/430 (14%)
Query: 160 VNDGIIRP-HK-SKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYL 217
V D + R H+ ++ ++L SS D + P R ++ G Y + +G PP Y
Sbjct: 48 VRDALRRDMHRHARFTRELASSG----DRTVAAPTRKDLPNGGEYIMTLAIGTPPLSYPA 103
Query: 218 DMDTGSDLTWIQCDAPCSS-CAKGANPLYKPRMGN---ILPYKD--SLCMEIQRNHKPGY 271
DTGSDL W QC APC S C K A Y P +LP S+C + P
Sbjct: 104 IADTGSDLIWTQC-APCGSQCFKQAGQPYNPSSSTTFGVLPCNSSVSMCAALAGPSPPPG 162
Query: 272 CETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVVFGCAYDQQGLLLNTLV 330
C C Y Y ++ G+ + + T+ P + FGC+ +
Sbjct: 163 CS----CMYNQTYGTGWTA-GIQSVETFTFGSTPADQTRVPGIAFGCSNASS----DDWN 213
Query: 331 KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--TNAGGGGYMFLGHDLVPSWGMAWV 388
+ G++GL R +SL SQL + + +CLT +A + LG PS +
Sbjct: 214 GSAGLVGLGRGSMSLVSQLGA-----GMFSYCLTPFQDANSTSTLLLG----PSAALNGT 264
Query: 389 PMLDSPFME---------LYHTEILKINYGSSPLNL-----GARNSQVGWALFDTGSSYT 434
+L +PF+ Y+ + I+ G++ L++ R G + D+G++ T
Sbjct: 265 GVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAFALRTDGTGGLIIDSGTTIT 324
Query: 435 YFTKQAYSELIASLKEVSSDGLVLDASDPT-LPVCWRAKFPIRSIVDVKQFFKTLTLHFG 493
AY ++ A+++ + + V D SD T L +C F + S ++T HF
Sbjct: 325 SLVDAAYQQVRAAIESLVTLP-VADGSDSTGLDLC----FALTSETSTPPSMPSMTFHFD 379
Query: 494 SKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVN 553
++ P +I G CL + + + G+ G+ + ++YD
Sbjct: 380 GADMVL-------PVDNYMILGSGVWCLAMRNQTV---GAMSTFGNYQQQNVHLLYDIHE 429
Query: 554 KRIGWAKSHC 563
+ + +A + C
Sbjct: 430 ETLSFAPAKC 439
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 104/407 (25%), Positives = 170/407 (41%), Gaps = 47/407 (11%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGL-YFTYMIVGNPPRPYYLDMDTGS 223
+R ++ N SS A D S PL +PDG Y + VG P + + DTGS
Sbjct: 23 VRWMAARANSSSWSSMAGTTDVES--PL----HPDGGGYVMDISVGTPGKRFRAIADTGS 76
Query: 224 DLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKD---SLCMEIQRNHKPGYCE-TCQQCD 279
DL W+Q + PC+ C+ G ++ PR + D LC E+ PG CE C
Sbjct: 77 DLVWVQSE-PCTGCSGGT--IFDPRQSSTFREMDCSSQLCAEL-----PGSCEPGSSTCS 128
Query: 280 YEIEYADHSSSMGVLARDELHL-TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGL 338
Y EY + G ARD + L T +GS P+ GC G++ + DG++GL
Sbjct: 129 YSYEYGSGETE-GEFARDTISLGTTSDGSQKFPSFAVGC-----GMVNSGFDGVDGLVGL 182
Query: 339 SRAKVSLPSQLASQGIIKNVVGHCLT-TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFME 397
+ VSL SQL++ I + +CL N+ L G +P +
Sbjct: 183 GQGPVSLTSQLSAA--IDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSD 240
Query: 398 LYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLV 457
Y T L G + G G + D+G++ TY Y +++ ++ + + V
Sbjct: 241 TYPTYYLLTVNGIAV--AGQTMGSPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRV 298
Query: 458 LDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKG 517
D S L +C+ + F LT+ +T S +LV+ G
Sbjct: 299 -DGSSMGLDLCYDRS------SNRNYKFPALTIRLAG-----ATMTPPSSNYFLVVDDSG 346
Query: 518 N-ICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ +CL + S + I+G++ +G ++YD + + + ++ C
Sbjct: 347 DTVCLAMGSASGLP---VSIIGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 99/408 (24%), Positives = 163/408 (39%), Gaps = 49/408 (12%)
Query: 176 LVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS 235
+SS A + SS G P Y +G+P +P L +DT +D TW C +PC
Sbjct: 53 FLSSKAASTGVSSAPVASGQSPPS--YVVRAGLGSPAQPILLALDTSADATWAHC-SPCG 109
Query: 236 SCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGY-----CETCQQCDYEIEYADH 287
+C + L+ P LP ++C +Q P C + +AD
Sbjct: 110 TCPSSGS-LFAPANSTSYAPLPCSSTMCTVLQGQPCPAQDPYDSSAPLPMCAFTKPFAD- 167
Query: 288 SSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPS 347
+S LA D LHL G PN FGC G N + G+LGL R ++L S
Sbjct: 168 ASFQASLASDWLHL----GKDAIPNYAFGCVSAVSGPTAN--LPKQGLLGLGRGPMALLS 221
Query: 348 QLASQGIIKNVVGHCLTTNAG--GGGYMFLGHDLVPSWGMAWVPMLDSP-FMELYHTEIL 404
Q+ + + V +CL + G + LG P G+ + PML +P LY+ +
Sbjct: 222 QVGN--MYNGVFSYCLPSYKSYYFSGSLRLGAAGQPR-GVRYTPMLKNPNRSSLYYVNVT 278
Query: 405 KINYGSSPLNLGARN-----SQVGWALFDTGSSYTYFTKQAYSELIASLKE---VSSDGL 456
++ G +P+ + A + + + D+G+ T +T Y+ L + S
Sbjct: 279 GLSVGRAPVKVPAGSFAFDPATGAGTVVDSGTVITRWTPPVYAALREEFRRHVAAPSGYT 338
Query: 457 VLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK 516
L A D C+ + +T+H + + E L+ S
Sbjct: 339 SLGAFD----TCFNTDEVAAGVA------PAVTVHMDGGLDLA-----LPMENTLIHSSA 383
Query: 517 GNI-CLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ CL + + + N +L ++ + VV+D N R+G+A+ C
Sbjct: 384 TPLACLAMAEAPQNVNAVVNVLANLQQQNLRVVFDVANSRVGFARESC 431
>gi|213998834|gb|ACJ60784.1| nucellin [Hordeum bulbosum]
Length = 154
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 78/146 (53%), Gaps = 9/146 (6%)
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAG 368
K + FGC Y Q+ + DGILGL K +QL +IK NV+GHCL++
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLRGHKMIKENVIGHCLSSK-- 63
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G G +++G P+ G+ WVPM +S F Y + ++ P+ R + A+FD
Sbjct: 64 GKGVLYVGDFNPPTRGVTWVPMRESLF--YYSPGLAEVFIDKQPI----RGNPTFEAVFD 117
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSD 454
+GS+YT+ Q YSE+++ ++ S+
Sbjct: 118 SGSTYTHVPAQIYSEIVSKVRGTLSE 143
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 101/407 (24%), Positives = 168/407 (41%), Gaps = 76/407 (18%)
Query: 182 VAVDSSSIF-----PLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS 236
V DSSS+ P +Y +Y + VG PP ++DTGSD+ W QC PC +
Sbjct: 396 VGYDSSSLLLQGASPYADTLYDYSIYLMKLQVGTPPFEIVAEIDTGSDIIWTQC-MPCPN 454
Query: 237 CAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLAR 296
C P++ P K S E + N C YEI YAD + S G+LA
Sbjct: 455 CYSQFAPIFDPS-------KSSTFREQRCNGN--------SCHYEIIYADKTYSKGILAT 499
Query: 297 DELHLTIENGS-LTKPNVVFGCAYDQQGLLLNTLV-KTDGILGLSRAKVSLPSQLASQGI 354
+ + + +G GC D L + + GI+GL+ +SL SQ+
Sbjct: 500 ETVTIPSTSGEPFVMAETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLP-- 557
Query: 355 IKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPML---DSPFMELYHTEILKINYGSS 411
++ +C + G F + +V G M D+PF Y+ L ++ S
Sbjct: 558 YPGLISYCF-SGQGTSKINFGTNAIVAGDGTVAADMFIKKDNPF---YY---LNLDAVSV 610
Query: 412 PLNLGAR-----NSQVGWALFDTGSSYTYF-------TKQAYSELIASLK--EVSSDGLV 457
NL A +++ G D+G++ TYF ++A +++ ++K ++ SD L
Sbjct: 611 EDNLIATLGTPFHAEDGNIFIDSGTTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDNL- 669
Query: 458 LDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKG 517
+C+ + D F +T+HF +V K+++ YL G
Sbjct: 670 ---------LCYYS--------DTIDIFPVITMHFSGGADLVLDKYNM----YLETITGG 708
Query: 518 NICLGILDGSEVHNGST-IILGDISLRGQLVVYDNVNKRIGWAKSHC 563
CL I ++ S + G+ + LV YD + I ++ ++C
Sbjct: 709 IFCLAI----GCNDPSMPAVFGNRAQNNFLVGYDPSSNVISFSPTNC 751
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 91/370 (24%), Positives = 156/370 (42%), Gaps = 49/370 (13%)
Query: 191 PLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG 250
P ++ +Y + VG PP ++DTGSDL W QC PC C +P++ P
Sbjct: 71 PYADTLFDYNIYLMKLQVGTPPFEIAAEIDTGSDLIWTQC-MPCPDCYSQFDPIFDPS-- 127
Query: 251 NILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LT 309
K S E QR H + C YEI Y D++ S G+LA + + + +G
Sbjct: 128 -----KSSTFNE-QRCHG-------KSCHYEIIYEDNTYSKGILATETVTIHSTSGEPFV 174
Query: 310 KPNVVFGCAYDQQGLLLNTLV-KTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
GC L + + GI+GL+ SL SQ+ ++ +C + G
Sbjct: 175 MAETTIGCGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMDLP--YPGLISYCF-SGQG 231
Query: 369 GGGYMFLGHDLVPSWGMAWVPML---DSPF----MELYHTEILKINYGSSPLNLGARNSQ 421
F + +V G M D+PF ++ E +I +P +++
Sbjct: 232 TSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPF-----HAE 286
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
G + D+GS+ TYF + + ++++V + V D S + +C+ ++ +D+
Sbjct: 287 DGNIVIDSGSTVTYFPVSYCNLVRKAVEQVVTAVRVPDPSGNDM-LCYFSE-----TIDI 340
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
F +T+HF +V K+++ Y+ + G CL I+ S I G+ +
Sbjct: 341 ---FPVITMHFSGGADLVLDKYNM----YMESNSGGLFCLAIICNSPTQEA---IFGNRA 390
Query: 542 LRGQLVVYDN 551
LV YD+
Sbjct: 391 QNNFLVGYDS 400
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 71/245 (28%), Positives = 109/245 (44%), Gaps = 28/245 (11%)
Query: 154 ESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPR 213
ES VAS+ +S++ K L + + +++ + G Y + +G+P R
Sbjct: 49 ESRVASI--------QSRLAKNLAGGSNLKASKATLPSKSASTLGSGNYVVTVGLGSPKR 100
Query: 214 PYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLC-------MEIQRN 266
DTGSDLTW QC+ C + ++ P L Y + C +E
Sbjct: 101 DLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTS--LSYSNVSCDSPSCEKLESATG 158
Query: 267 HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLL 326
+ PG C + C Y I Y D S S+G AR++L LT + N FGC + +GL
Sbjct: 159 NSPG-CSS-STCLYGIRYGDGSYSIGFFAREKLSLTSTD---VFNNFQFGCGQNNRGLFG 213
Query: 327 NTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMA 386
T G+LGL+R +SL SQ A + V +CL +++ GY+ G S +
Sbjct: 214 GTA----GLLGLARNPLSLVSQTAQK--YGKVFSYCLPSSSSSTGYLSFGSGDGDSKAVK 267
Query: 387 WVPML 391
+ P L
Sbjct: 268 FTPRL 272
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 82.8 bits (203), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 87/379 (22%), Positives = 165/379 (43%), Gaps = 56/379 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF+ + +G PP P Y+ +DTGSD++W+QC APC+ C + +P ++P SL
Sbjct: 149 GEYFSRVGIGRPPSPVYMVLDTGSDVSWVQC-APCAECYEQTDPXFEPTSSASFT---SL 204
Query: 260 CMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGC 317
E ++ E C+ C YE+ Y D S ++G D + T+ GS + N+ GC
Sbjct: 205 SCETEQCKSLDVSE-CRNGTCLYEVSYGDGSYTVG----DFVTETVTLGSTSLGNIAIGC 259
Query: 318 AYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-TNAGGGGYMFLG 376
++ +GL + L +S PSQL + +CL ++ +
Sbjct: 260 GHNNEGLFIGAAGLLG----LGGGSLSFPSQLNASSF-----SYCLVDRDSDSTSTLDFN 310
Query: 377 HDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQV-----GWALFDTG 430
+ P A P+ +P ++ ++ + ++ G + L + + Q+ G + D+G
Sbjct: 311 SPITPDAVTA--PLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSG 368
Query: 431 SSYTYFTKQAYSELIASLKEVSSD-----GLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
++ T Y+ L + + + D G+ L C+ +S V+V
Sbjct: 369 TAVTRLQTTVYNVLRDAFVKSTHDLQTARGVAL------FDTCY--DLSSKSRVEV---- 416
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
T++ HF + ++ + + YL+ + +G C + + ILG+ +G
Sbjct: 417 PTVSFHFANGNEL-----PLPAKNYLIPVDSEGTFCFAFAP----TDSTLSILGNAQQQG 467
Query: 545 QLVVYDNVNKRIGWAKSHC 563
V +D N +G++ + C
Sbjct: 468 TRVGFDLANSLVGFSPNKC 486
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 82.8 bits (203), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 93/390 (23%), Positives = 160/390 (41%), Gaps = 64/390 (16%)
Query: 167 PHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTY-----MIVGNPPRPYYLDMDT 221
PH I+ SNA + S+ G+ Y D ++ TY + +G PP +DT
Sbjct: 27 PHGFTIDLIHRRSNASSSRVSNT--QAGSPYADTVFDTYEYLMKLQIGTPPFEVEAVLDT 84
Query: 222 GSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYE 281
GS+L W QC PC C P++ P + +K++ C + P + C Y+
Sbjct: 85 GSELIWTQC-LPCLHCYDQKAPIFDPSKSST--FKETRC------NTPDH-----SCPYK 130
Query: 282 IEYADHSSSMGVLARDELHLTIENGS-LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSR 340
+ Y D S + G LA + + + +G P + GC+ + G + GI+GLSR
Sbjct: 131 LVYDDKSYTQGTLATETVTIHSTSGVPFVMPETIIGCSRNNSGSGFRP--SSSGIVGLSR 188
Query: 341 AKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYH 400
+SL SQ+ VV + G +L D V S G + + +PF L
Sbjct: 189 GSLSLISQMGGAYPGDGVVSTTMFAKTAKRGQYYLNLDAV-SVGDTRIETVGTPFHALN- 246
Query: 401 TEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDA 460
G + D+G+ TYF + + +++ V + V+D
Sbjct: 247 ----------------------GNIVIDSGTPLTYFPVSYCNLVRKAVERVVTADRVVDP 284
Query: 461 SDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNIC 520
S + +C+ + + + F +T+HF +V K+++ Y+ +++ G C
Sbjct: 285 SRNDM-LCYYS--------NTIEIFPVITVHFSGGADLVLDKYNM----YMELNRGGVFC 331
Query: 521 LGILDGSEVHNGSTIILGDISLRGQLVVYD 550
L I+ + I G+ + LV YD
Sbjct: 332 LAIICNNPTQ---VAIFGNRAQNNFLVGYD 358
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 88/397 (22%), Positives = 167/397 (42%), Gaps = 47/397 (11%)
Query: 173 NKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDA 232
N LV ++ ++ P ++ + +Y + VG PP +DTGS++TW QC
Sbjct: 351 NNFLVGYDSSSLLQLGSSPYADTVFDNSVYLMKLQVGTPPFEIEAVIDTGSEITWTQC-L 409
Query: 233 PCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMG 292
PC C K P++ P + +K+ C + C YE++Y D + + G
Sbjct: 410 PCVHCYKQNAPIFDPSKSST--FKEKRCHD-------------HSCPYEVDYFDKTYTKG 454
Query: 293 VLARDELHLTIENGS-LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLAS 351
LA D + + +G + GC + +G +GL+ +SL +Q+
Sbjct: 455 TLATDTVTIHSTSGEPFVMAETIIGCGRNNSWF----RPSFEGFVGLNWGPLSLITQMG- 509
Query: 352 QGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGS 410
G ++ +C N G F + +V G+ M + Y+ + ++ G
Sbjct: 510 -GEYPGLMSYCFAGN-GTSKINFGTNAIVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGD 567
Query: 411 SPL-NLGA-RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP-- 466
+ + LG ++ G + D+G++ TYF ++Y L+ + V + A+DPT
Sbjct: 568 TRIETLGTPFHALEGNIVIDSGTTLTYF-PESYCNLVR--QAVEHVVPAVPAADPTGNDL 624
Query: 467 VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDG 526
+C+ + + + F +T+HF +V K+++ E Y G CL I+
Sbjct: 625 LCYYS--------NTTEIFPVITMHFSGGADLVLDKYNMFMESY----SGGLFCLAIICN 672
Query: 527 SEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ I G+ + LV YD+ + + + ++C
Sbjct: 673 NPTQEA---IFGNRAQNNFLVGYDSSSLLVSFKPTNC 706
>gi|213998798|gb|ACJ60766.1| nucellin [Hordeum brevisubulatum subsp. violaceum]
Length = 141
Score = 82.8 bits (203), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 79/143 (55%), Gaps = 9/143 (6%)
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAGGGG 371
+ FGC Y Q+ + DGILGL K +QL Q +IK NV+GHCL++ G G
Sbjct: 1 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMIKENVIGHCLSSK--GKG 58
Query: 372 YMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGS 431
+++G PS G+ WVPM +S F Y + ++ + P+ R + A+FD+GS
Sbjct: 59 VLYVGDFNPPSRGVTWVPMRESLF--YYSPGLAELLIDNQPI----RGNPTFEAVFDSGS 112
Query: 432 SYTYFTKQAYSELIASLKEVSSD 454
+YT+ Q Y+E+++ ++ S+
Sbjct: 113 TYTHVPAQIYNEIVSKVRGTLSE 135
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 82.8 bits (203), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 96/416 (23%), Positives = 167/416 (40%), Gaps = 42/416 (10%)
Query: 163 GIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTG 222
G + S I++K S+ V +D S G Y YFT + VG P + + + +DTG
Sbjct: 72 GADQKRHSLISRKRNSTVGVKMDLGS-----GIDYGTAQYFTEIRVGTPAKKFRVVVDTG 126
Query: 223 SDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCM------EIQRNHKPGYCET-C 275
S+LTW+ C A+G + R +K C+ ++ C T
Sbjct: 127 SELTWVNCRYR----ARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPS 182
Query: 276 QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVVFGCAYDQQGLLLNTLVKTDG 334
C Y+ YAD S++ GV A++ + + + NG + + P + GC+ G + DG
Sbjct: 183 TPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTG---QSFQGADG 239
Query: 335 ILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGGYMFLGHDLVPSWGMAWVPML 391
+LGL+ + S S S + +CL +N Y+ G L
Sbjct: 240 VLGLAFSDFSFTSTATS--LYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPL 297
Query: 392 D-SPFMELYHTEILKINYGSSPLNLGAR---NSQVGWALFDTGSSYTYFTKQAYSELIAS 447
D + Y ++ I+ G L++ ++ + G + D+G+S T AY +++
Sbjct: 298 DLTRIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTG 357
Query: 448 LKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISP 507
L + + + C F S +V + LT H +F
Sbjct: 358 LARYLVELKRVKPEGVPIEYC----FSFTSGFNVSK-LPQLTFHLKG-----GARFEPHR 407
Query: 508 EGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ YLV + G CLG + +T ++G+I + L +D + + +A S C
Sbjct: 408 KSYLVDAAPGVKCLGFVSAG---TPATNVIGNIMQQNYLWEFDLMASTLSFAPSAC 460
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 92/388 (23%), Positives = 158/388 (40%), Gaps = 55/388 (14%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRM-GNILPYKDS-- 258
Y + +G PP+P +DTGSDL W QC APC+SC +PL+ P + +P + S
Sbjct: 103 YLIDLAIGTPPQPVSALLDTGSDLIWTQC-APCASCLAQPDPLFAPAASSSYVPMRCSGQ 161
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
LC +I + C+ C Y Y D ++++GV A + +G + FGC
Sbjct: 162 LCNDILHHS----CQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVPLGFGCG 217
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--TNAGGGGYM--- 373
G L N GI+G R +SL SQL+ I+ +CLT T+ M
Sbjct: 218 TMNVGSLNN----GSGIVGFGRDPLSLVSQLS----IRR-FSYCLTPYTSTRKSTLMFGS 268
Query: 374 -----FLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL-----GARNSQVG 423
F G D + Y+ + G+ L + R G
Sbjct: 269 LSDGVFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSG 328
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKF-------PIR 476
+ D+G++ T F +E++ + + +S P VC+
Sbjct: 329 GVIVDSGTALTLFPAAVLTEVLRAFRAQLRLPFT-SSSSPDDGVCFATPMAAGGRRASAA 387
Query: 477 SIVDVKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI 535
++V V + + HF G+ ++ + L ++G++C+ + D + +G+TI
Sbjct: 388 TVVSVPR----MAFHFQGADLELPRRNY------VLDDPRRGSLCILLADSGD--SGATI 435
Query: 536 ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G+ + V+YD + + +A + C
Sbjct: 436 --GNFVQQDMRVLYDLEAETLSFAPAQC 461
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 113/488 (23%), Positives = 194/488 (39%), Gaps = 79/488 (16%)
Query: 90 AISIFALILYGSVFSYTLQDRYKSNNDDENKESFVFPLYHKFGIREVSQRDAEFKLGRFV 149
+++ L LY FS L K F + H+ R R E + R
Sbjct: 6 CLTLVLLCLYNICFSEAL------------KSGFSVEIIHRDSSRSPFYRATETQFQRVT 53
Query: 150 DLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVG 209
+ +V S+N + N+ V SNAV P+ + DG Y +G
Sbjct: 54 N----AVRRSMN------RANHFNQISVYSNAVES------PV--TLLDDGDYLMSYSLG 95
Query: 210 NPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRN 266
PP P Y +DT SD+ W+QC C +C +P++ P LP + C +Q
Sbjct: 96 TPPFPVYGIVDTASDIIWVQCQL-CETCYNDTSPMFDPSYSKTYKNLPCSSTTCKSVQGT 154
Query: 267 HKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVVFGCAYDQQG 323
C + ++ C++ + Y D S S G L + + L N P V GC
Sbjct: 155 S----CSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIGC------ 204
Query: 324 LLLNTLVKTD--GILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGH-DLV 380
+ NT V D GI+GL VSL QL+S I +CL + + G +V
Sbjct: 205 -IRNTNVSFDSIGIVGLGGGPVSLVPQLSSS--ISKKFSYCLAPISDRSSKLKFGDAAMV 261
Query: 381 PSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV---GWALFDTGSSYTYFT 437
G ++ + + Y+ + + G++ + + +S+ G + D+G+++T
Sbjct: 262 SGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSSSSRSSGKGNIIIDSGTTFTVLP 321
Query: 438 KQAYSELIASLKEVSSDGLVLDASDP--TLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSK 495
YS+L +++ +V + A DP +C+++ + VDV +T HF
Sbjct: 322 DDVYSKLESAVADVVK---LERAEDPLKQFSLCYKSTY---DKVDV----PVITAHFS-- 369
Query: 496 WQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKR 555
++ +++ +CL L S I G+++ + LV YD K
Sbjct: 370 ----GADVKLNALNTFIVASHRVVCLAFLSSQ-----SGAIFGNLAQQNFLVGYDLQRKI 420
Query: 556 IGWAKSHC 563
+ + + C
Sbjct: 421 VSFKPTDC 428
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 96/416 (23%), Positives = 167/416 (40%), Gaps = 42/416 (10%)
Query: 163 GIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTG 222
G + S I++K S+ V +D S G Y YFT + VG P + + + +DTG
Sbjct: 50 GADQKRHSLISRKRNSTVGVKMDLGS-----GIDYGTAQYFTEIRVGTPAKKFRVVVDTG 104
Query: 223 SDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCM------EIQRNHKPGYCET-C 275
S+LTW+ C A+G + R +K C+ ++ C T
Sbjct: 105 SELTWVNCRYR----ARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPS 160
Query: 276 QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVVFGCAYDQQGLLLNTLVKTDG 334
C Y+ YAD S++ GV A++ + + + NG + + P + GC+ G + DG
Sbjct: 161 TPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTG---QSFQGADG 217
Query: 335 ILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGGYMFLGHDLVPSWGMAWVPML 391
+LGL+ + S S S + +CL +N Y+ G L
Sbjct: 218 VLGLAFSDFSFTSTATS--LYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPL 275
Query: 392 D-SPFMELYHTEILKINYGSSPLNLGAR---NSQVGWALFDTGSSYTYFTKQAYSELIAS 447
D + Y ++ I+ G L++ ++ + G + D+G+S T AY +++
Sbjct: 276 DLTRIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTG 335
Query: 448 LKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISP 507
L + + + C F S +V + LT H +F
Sbjct: 336 LARYLVELKRVKPEGVPIEYC----FSFTSGFNVSK-LPQLTFHLKG-----GARFEPHR 385
Query: 508 EGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ YLV + G CLG + +T ++G+I + L +D + + +A S C
Sbjct: 386 KSYLVDAAPGVKCLGFVSAG---TPATNVIGNIMQQNYLWEFDLMASTLSFAPSAC 438
>gi|168025647|ref|XP_001765345.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683398|gb|EDQ69808.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 879
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 96/400 (24%), Positives = 159/400 (39%), Gaps = 66/400 (16%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQC------DAPCSSCAKGANPLYKPRMGN---- 251
+ M +G PP+ ++ MDTGS TW+ C D P G N ++PR +
Sbjct: 227 FHVEMKLGVPPKKFHFHMDTGSRDTWVYCQVSRNLDEP--PIELGPNGKFEPRDESSYIQ 284
Query: 252 ILPYKDSLCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLT 309
+ + SLC E Q ++P C + + C ++ YAD S+ GVL + L ++ + S
Sbjct: 285 CIGHTASLCSEYQ--YEPHLCNSVDKYHCVNDLNYADDSTYSGVLVNESLMVSTIDNSDM 342
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII-KNVVGHCLTTNAG 368
+F C + + TDGI+GL K +L Q + +I +NV+G CL G
Sbjct: 343 DAMGLFWCINEAS----HPFTGTDGIIGLGNCKKTLGDQWTTNKVISQNVLGVCLAKGPG 398
Query: 369 GGGYMFLGHDLVPSWGMA---WVPM--LDSPFMELYHTEILKINYG------SSPLNLGA 417
GY+ LG + + + W + + S Y + + I++ +S NLG
Sbjct: 399 PVGYISLGVNFKKKFEESTSVWSKLTPMSSAGECAYSSPLASISFHDKTFVFTSETNLG- 457
Query: 418 RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPV---------- 467
FDTGS Y Y L+ L ++ + D
Sbjct: 458 ---------FDTGSDMMYLEAVIYEPLLDMLDSYATSRGYVRVEDSVAQSYYVHQSEQRQ 508
Query: 468 CWRAKFPIRSIVDVK----QFFKTLTLHFG----SKWQIVSTKFHISPEGYLVISK-KGN 518
CW ++ + K F LT F + + P YL + +
Sbjct: 509 CWAPPAKMQRALLTKASPISHFHALTFTFKGIPRATGHSSDQNLIVEPASYLSWNAPERK 568
Query: 519 ICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGW 558
+C I+ + + LG I ++G L V+D N+++ W
Sbjct: 569 LCANIILSPKDSD-----LGAIGMKGHLFVFDVENQKVQW 603
>gi|213998826|gb|ACJ60780.1| nucellin [Hordeum intercedens]
Length = 148
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 79/146 (54%), Gaps = 9/146 (6%)
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAG 368
K V FGC Y Q+ + DGILGL K +QL Q +I NV+GHCL++
Sbjct: 6 KKKVAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSK-- 63
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G G +++G PS G+ WVPM +S F Y + ++ + P+ R + A+FD
Sbjct: 64 GKGVLYVGDFNPPSRGVTWVPMKESLF--YYSAGLAELLIDNQPI----RGNPTFEAVFD 117
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSD 454
+GS+YT+ Q Y+E+++ ++ S+
Sbjct: 118 SGSTYTHVPAQIYNEIVSKVRGTLSE 143
>gi|6562285|emb|CAB62655.1| putative protein [Arabidopsis thaliana]
Length = 519
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 95/383 (24%), Positives = 150/383 (39%), Gaps = 58/383 (15%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP----------LYKPR-- 248
L++ + VG P + + +DTGSDL W+ C+ S+C + LY P
Sbjct: 101 LHYANVSVGTPATWFLVALDTGSDLFWLPCNCG-STCIRDLKEVGLSQSRPLNLYSPNTS 159
Query: 249 -MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSS-SMGVLARDELHLTIENG 306
+ + D C R C Y+I+Y + + G L D LHL E+
Sbjct: 160 STSSSIRCSDDRCFGSSRCSS-----PASSCPYQIQYLSKDTFTTGTLFEDVLHLVTEDE 214
Query: 307 SL--TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
L K N+ GC +Q G L ++ +G+LGL S+PS LA I N C
Sbjct: 215 GLEPVKANITLGCGKNQTGFLQSS-AAVNGLLGLGLKDYSVPSILAKAKITANSFSMCFG 273
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVG- 423
G + G D + + T +L + +++G V
Sbjct: 274 NIIDVVGRISFG---------------DKGYTDQMETPLLPTEPSVTEVSVGGDAVGVQL 318
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP--VCWRAKFPIRSIVDV 481
ALFDTG+S+T+ + Y + + + +D DP LP C+ P ++ +
Sbjct: 319 LALFDTGTSFTHLLEPEYGLITKAFDDHVTDK--RRPIDPELPFEFCYDLS-PNKTTI-- 373
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
F + + F Q+ F +P L I CLGIL + I+G
Sbjct: 374 --LFPRVAMTFEGGSQM----FLRNP---LFIDNSAMYCLGILKSVDFKIN---IIGQNF 421
Query: 542 LRGQLVVYDNVNKRIGWAKSHCM 564
+ G +V+D +GW +S C
Sbjct: 422 MSGYRIVFDRERMILGWKRSDCF 444
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 91/398 (22%), Positives = 163/398 (40%), Gaps = 46/398 (11%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
+ G+ G YF + VG P + + L +DTGSDLTWIQC+ P ++ + P +
Sbjct: 49 VSGSSIGSGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSS 108
Query: 252 ILPYKDSLCMEIQRNHKPG-YCETC-----QQCDYEIEYADHSSSMGVLARDELHLTIEN 305
Y++ C + + P +C CDY Y+D S + G+LA + + +
Sbjct: 109 SSSYREIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRK 168
Query: 306 GSLTKP-----------NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI 354
S + NV GC+ + G + + G+LGL + +SL +Q
Sbjct: 169 RSGKRAGNHKTRRIRIKNVALGCSRESVGA---SFLGASGVLGLGQGPISLATQ-TRHTA 224
Query: 355 IKNVVGHCLTTN-AGGGGYMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSP 412
+ + +CL G FL +A P++ +P + Y+ + + P
Sbjct: 225 LGGIFSYCLVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKP 284
Query: 413 LNLGARNSQVG-------WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTL 465
++ G +S G +FD+G++ +Y + AYS+++ +L + L
Sbjct: 285 VD-GIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALN-----------ASIYL 332
Query: 466 PVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILD 525
P +V + K + G ++Q + + Y+V+ + C+ L
Sbjct: 333 PRAQEIPEGFELCYNVTRMEKGMP-KLGVEFQGGAV-MELPWNNYMVLVAENVQCVA-LQ 389
Query: 526 GSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
NGS ILG++ + + YD RIG+ S C
Sbjct: 390 KVTTTNGSN-ILGNLLQQDHHIEYDLAKARIGFKWSPC 426
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 118/504 (23%), Positives = 198/504 (39%), Gaps = 60/504 (11%)
Query: 99 YGSVFSYTLQD-RYKSNNDDENKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVV 157
+GSV S T D + S D KE + RE+ Q + + VDL + +
Sbjct: 48 FGSVSSSTSNDCGFSSKEHDPAKEHTRESVKLHLRRREIKQ-ETKRTTHSVVDLQIQDLT 106
Query: 158 ASVNDGIIRPHKSKIN-----KKLVSSNAVAVDSSSIFPLR-------GNIYPDGLYFTY 205
+ R KSK KK ++S+ V + + P + G G YF
Sbjct: 107 R-IQTLHARFKKSKKQRNEKVKKKITSDISLVGAPEVSPGKLIATLESGMTLGSGEYFMD 165
Query: 206 MIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDSLCME 262
++VG PP+ + L +DTGSDL W+QC PC C Y P+ + D C
Sbjct: 166 VLVGTPPKHFSLILDTGSDLNWLQC-LPCYDCFHQNEAFYDPKTSASFKNITCNDPRCSL 224
Query: 263 IQRNHKPGYCET-CQQCDYEIEYADHSSSMGVLARD--ELHLTIENGSLTK---PNVVFG 316
I P C++ Q C Y Y D S++ G A + ++LT G ++ N++FG
Sbjct: 225 ISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKVENMMFG 284
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-----TNAGGGG 371
C + +GL GL R +S SQL Q + + +CL TN
Sbjct: 285 CGHWNRGLFSGASGLL----GLGRGPLSFSSQL--QSLYGHSFSYCLVDRNSDTNVSSKL 338
Query: 372 YMFLGHDLVPSWGMAWVPML---DSPFMELYHTEILKINYGSSPLNLGARNSQV-----G 423
DL+ + + + ++ Y+ +I I G L++ + G
Sbjct: 339 IFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPDGAG 398
Query: 424 WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ 483
+ D+G++ +YF + AY + E + ++ P L C F + I +
Sbjct: 399 GTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPC----FNVSGIEENNI 454
Query: 484 FFKTLTLHF--GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
L + F G+ W + E + + +CL IL + + I+G+
Sbjct: 455 HLPELGIAFADGAVW-------NFPAENSFIWLSEDLVCLAILGTPK---STFSIIGNYQ 504
Query: 542 LRGQLVVYDNVNKRIGWAKSHCMN 565
+ ++YD R+G+ + C +
Sbjct: 505 QQNFHILYDTKMSRLGFTPTKCAD 528
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 94/421 (22%), Positives = 171/421 (40%), Gaps = 66/421 (15%)
Query: 189 IFPLRGNIYPDG--------LYFTY-------MIVGNPPRPYYLDMDTGSDLTWIQCDAP 233
I PL+ + P G L F + + VG+PP+ + +DTGS+L+W+ C
Sbjct: 28 ILPLKTQVLPSGSVPRPSSKLSFHHNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKK- 86
Query: 234 CSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHK-PGYCETCQQCDYEIEYADHSS 289
A + ++ P + +P C R+ P C+ + C I YAD SS
Sbjct: 87 ----APNLHSVFDPLRSSSYSPIPCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASS 142
Query: 290 SMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQL 349
G LA D H+ G+ P +FGC + KT G++G++R +S +Q+
Sbjct: 143 IEGNLASDTFHI----GNSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQM 198
Query: 350 ASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSW--GMAWVPMLDS----PFME--LYHT 401
Q +C++ G +F SW + + P++ P+ + Y
Sbjct: 199 GLQKF-----SYCISGQDSSGILLFGESSF--SWLKALKYTPLVQISTPLPYFDRVAYTV 251
Query: 402 EILKINYGSSPLNL-----GARNSQVGWALFDTGSSYTYFTKQAYSEL--------IASL 448
++ I +S L L ++ G + D+G+ +T+ Y+ L ASL
Sbjct: 252 QLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASL 311
Query: 449 KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF-GSKWQIVSTKFHISP 507
K + V + + +C+R R++ + T+TL F G++ + + +
Sbjct: 312 KVLEDPNFVFQGA---MDLCYRVPLTRRTLPPL----PTVTLMFRGAEMSVSAERLMYRV 364
Query: 508 EGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPG 567
G VI ++ SE+ + I+G + + +D R+G+A+ C G
Sbjct: 365 PG--VIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRCXLAG 422
Query: 568 R 568
+
Sbjct: 423 Q 423
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 82.0 bits (201), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 96/387 (24%), Positives = 153/387 (39%), Gaps = 72/387 (18%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI------- 252
G YF + +G PP Y+ +DTGSD++WIQC APCS C + ++P++ P N
Sbjct: 147 GEYFLRVGIGKPPSQAYVVLDTGSDVSWIQC-APCSECYQQSDPIFDPVSSNSYSPIRCD 205
Query: 253 LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPN 312
P SL + RN C YE+ Y D S ++G A + T+ G+ N
Sbjct: 206 APQCKSLDLSECRNGT---------CLYEVSYGDGSYTVGEFATE----TVTLGTAAVEN 252
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY 372
V GC ++ +GL V G+LGL K+S P+Q+ + +CL
Sbjct: 253 VAIGCGHNNEGL----FVGAAGLLGLGGGKLSFPAQVNATSF-----SYCLVNRDSDAVS 303
Query: 373 MFLGHDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQVGW-----AL 426
+ +P + P+ +P ++ ++ LK I+ G L + +V +
Sbjct: 304 TLEFNSPLP-RNVVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGII 362
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
D+G++ T + Y L D V A I V F
Sbjct: 363 IDSGTAVTRLRSEVYDAL--------RDAFVKGAKG------------IPKANGVSLFDT 402
Query: 487 TLTLHFGSKWQIVSTKFHISPEG---------YLV-ISKKGNICLGILDGSEVHNGSTII 536
L Q+ + FH PEG YL+ + G C + S I
Sbjct: 403 CYDLSSRESVQVPTVSFHF-PEGRELPLPARNYLIPVDSVGTFCFAFAPTTS----SLSI 457
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G++ +G V +D N +G++ C
Sbjct: 458 MGNVQQQGTRVGFDIANSLVGFSADSC 484
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 82.0 bits (201), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 67/257 (26%), Positives = 113/257 (43%), Gaps = 22/257 (8%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y + +G PP Y DTGSDL W QC PC C K + P++ P +P
Sbjct: 90 GEYLMSVSIGTPPVDYIGMADTGSDLMWAQC-LPCLKCYKQSRPIFDPLKSTSFSHVPCN 148
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
C I +H C CDY Y D + + G L ++ +TI + S+ V G
Sbjct: 149 SQNCKAIDDSH----CGAQGVCDYSYTYGDQTYTKGDLGFEK--ITIGSSSVKS---VIG 199
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT--NAGGGGYMF 374
C ++ G++GL ++SL SQ++ I +CL T + G F
Sbjct: 200 CGHESG----GGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINF 255
Query: 375 LGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYT 434
+ +V G+ P++ + Y+ + I+ G+ A+ V + D+G++ +
Sbjct: 256 GQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAKQGNV---IIDSGTTLS 312
Query: 435 YFTKQAYSELIASLKEV 451
+ K+ Y +++SL +V
Sbjct: 313 FLPKELYDGVVSSLLKV 329
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 82.0 bits (201), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 105/417 (25%), Positives = 167/417 (40%), Gaps = 51/417 (12%)
Query: 161 NDGIIRPHKSKINK---KLV--SSNAVAVDSSSIFPLRGNI-YPDGLYFTYMIVGNPPRP 214
+D IIR ++++ KL S+N V+ S+ P + I G Y + +G P
Sbjct: 85 HDEIIRRDQARVESIYSKLSKNSANEVSEAKSTELPAKSGITLGSGNYIVTIGIGTPKHD 144
Query: 215 YYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCET 274
L DTGSDLTW QC+ SC P + P + Y++ C E+
Sbjct: 145 LSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSST--YQNVSCSSPMCEDA----ES 198
Query: 275 CQ--QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKT 332
C C Y I Y D S + G LA+++ LT S +V FGC + QGL
Sbjct: 199 CSASNCVYSIGYGDKSFTQGFLAKEKFTLT---NSDVLEDVYFGCGENNQGLFDGVAGLL 255
Query: 333 DGILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHDLVPSWGMAWVPML 391
L K+SLP+Q + N+ +CL + + G++ G + S + + P+
Sbjct: 256 G----LGPGKLSLPAQTTTT--YNNIFSYCLPSFTSNSTGHLTFGSAGI-SESVKFTPIS 308
Query: 392 DSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEV 451
P Y +I+ I+ G L + + A+ D+G+ +T + Y+EL + KE
Sbjct: 309 SFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELRSVFKEK 368
Query: 452 -----SSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHIS 506
S+ G L C+ F V + T+ F T +
Sbjct: 369 MSSYKSTSGYGL------FDTCY--DFTGLDTVT----YPTIAFSFAG-----GTVVELD 411
Query: 507 PEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G + K +CL ++ I G++ VVYD R+G+A + C
Sbjct: 412 GSGISLPIKISQVCLAFAGNDDL----PAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 82.0 bits (201), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 95/371 (25%), Positives = 152/371 (40%), Gaps = 44/371 (11%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS--SCAKGANPLYKPRMGNILPYKDSL 259
Y + +G P + +DTGSD++W+QC APC+ SC+ + L+ P M Y
Sbjct: 129 YVITVTIGTPAVTQVMSIDTGSDVSWVQC-APCAAQSCSSQKDKLFDPAMSAT--YSAFS 185
Query: 260 CMEIQRNHKPGYCETC--QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGC 317
C Q C QC Y ++Y D S++ G D L LT + + FGC
Sbjct: 186 CGSAQCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSD---AVKSFQFGC 242
Query: 318 AYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLG 376
++ G + + DG++GL SL SQ A+ +CL ++ GGG++ LG
Sbjct: 243 SHRAAGF----VGELDGLMGLGGDTESLVSQTAA--TYGKAFSYCLPPPSSSGGGFLTLG 296
Query: 377 -HDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTY 435
S + PM+ Y + I + LN+ A G ++ D+G+ T
Sbjct: 297 AAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVFS-GASVVDSGTVITQ 355
Query: 436 FTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF--FKTLTLHFG 493
AY L + K+ + + P+ S+ F F T+T+
Sbjct: 356 LPPTAYQALRTAFKKEMK--------------AYPSAAPVGSLDTCFDFSGFNTITV--- 398
Query: 494 SKWQIVSTKFHISPEGYLVISKKGNICLGILD-GSEVHNGSTIILGDISLRGQLVVYDNV 552
V+ F L IS G + G L + H+G T ILG++ R +++D
Sbjct: 399 ---PTVTLTFSRGAAMDLDIS--GILYAGCLAFTATAHDGDTGILGNVQQRTFEMLFDVG 453
Query: 553 NKRIGWAKSHC 563
+ IG+ C
Sbjct: 454 GRTIGFRSGAC 464
>gi|213998816|gb|ACJ60775.1| nucellin [Hordeum patagonicum subsp. patagonicum]
Length = 152
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 79/146 (54%), Gaps = 9/146 (6%)
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAG 368
K + FGC Y Q+ + DGILGL K +QL Q +I NV+GHCL++
Sbjct: 4 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKVITGNVIGHCLSSK-- 61
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G G +++G PS G+ WVPM +S F Y + ++ + P+ R + A+FD
Sbjct: 62 GKGVLYVGDFNPPSRGVTWVPMKESLFY--YSPGLAELLIDNQPI----RGNPTFEAVFD 115
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSD 454
+GS+YT+ Q Y+E+++ ++ S+
Sbjct: 116 SGSTYTHVPAQIYNEIVSKVRGTLSE 141
>gi|115475303|ref|NP_001061248.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|45735815|dbj|BAD12851.1| unknown protein [Oryza sativa Japonica Group]
gi|113623217|dbj|BAF23162.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|125602549|gb|EAZ41874.1| hypothetical protein OsJ_26419 [Oryza sativa Japonica Group]
Length = 449
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 100/415 (24%), Positives = 166/415 (40%), Gaps = 69/415 (16%)
Query: 185 DSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPL 244
D + P R IY D +Y M +G + YL +DTGS L W QCD C C G P
Sbjct: 67 DEKFVTPFR--IYEDVVYLAEMEIGERQQKQYLLIDTGSSLVWTQCDE-CPHCHIGDVPP 123
Query: 245 YKPRMGNILPYKDSLCMEIQRNHK------------PGYCETC--QQCDYEIEY---ADH 287
Y +++ C + N K PGY C +C ++ Y
Sbjct: 124 YGRSQSRT--FQEVSCGDDDDNDKEEAIASYCPAKPPGYITLCVNGRCMFKALYNLTGQG 181
Query: 288 SSSMGVLARDELHLTIENGSL---TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVS 344
+ G ++ D H I++ K +VFGCA+ Q+ ++L + + GILGL S
Sbjct: 182 ETVQGYMSMDTFHF-IDDRRFDYQAKFRMVFGCAH-QENIVLTAVKECTGILGLGMGDAS 239
Query: 345 LPSQLASQGIIKNVVGHCLTTNAGGGGY------MFLGHD--------LVPSWGMAWVPM 390
L GI K +C+ G Y F H LV WG ++P+
Sbjct: 240 F---LRQTGITK--FSYCVPPRMPGYSYRRHSWLRFGSHAQISGKKVPLVMRWGKYYLPL 294
Query: 391 LDSPFMELYHTEILKINYGSSPLNLGARNSQVGW--ALFDTGSSYTYFTKQAYSELIASL 448
+ + E++ SP+ + A SQ + + DTG+S + +LI +
Sbjct: 295 TAITYT---YNELM------SPVPIIAYKSQEDYLHMMVDTGTSLLSLPTSLHDDLIKEM 345
Query: 449 KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPE 508
+ + +++ + C++ R++ +VK TL+ G ++ ++ I E
Sbjct: 346 EAIIKSENIMEGATRWPKHCYK-----RTMDEVKDITVTLSFDGGLDIELFTSALFIKTE 400
Query: 509 GYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+K +CL + + V + S ILG + V YD +++ I C
Sbjct: 401 ----TTKGPAVCLAV---NRVDDSSKAILGMFAQTNINVGYDLLSREIAMDPIRC 448
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 97/395 (24%), Positives = 160/395 (40%), Gaps = 57/395 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILP---YK 256
G YF + +G PPR + L +DTGSDL WIQC PC C P Y P+ +
Sbjct: 190 GEYFMDVFIGTPPRHFSLILDTGSDLNWIQC-VPCYDCFVQNGPYYDPKESSSFKNIGCH 248
Query: 257 DSLCMEIQRNHKPGYCET-CQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP---- 311
D C + P C+ Q C Y Y D S++ G A + + +LT P
Sbjct: 249 DPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTV-----NLTSPAGKS 303
Query: 312 ------NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT- 364
NV+FGC + +GL L R +S SQL Q + + +CL
Sbjct: 304 EFKRVENVMFGCGHWNRGLFHGAAGLLG----LGRGPLSFSSQL--QSLYGHSFSYCLVD 357
Query: 365 ----TNAGGGGYMFLGHDLVPSWGMAWVPML---DSPFMELYHTEILKINYGSSPLNLGA 417
TN DL+ + + ++ ++P Y+ +I I G L +
Sbjct: 358 RNSDTNVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPE 417
Query: 418 RNSQV-----GWALFDTGSSYTYFTKQAYSELI--ASLKEVSSDGLVLDASDPTLPVCWR 470
+ G + D+G++ +YF + +Y E+I A +K+V ++ D P L C+
Sbjct: 418 ETWHLSPEGAGGTIVDSGTTLSYFAEPSY-EIIKDAFVKKVKGYPVIKDF--PILDPCYN 474
Query: 471 AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVH 530
+ +++ +F + G+ W + I E + +CL IL
Sbjct: 475 VSGVEK--MELPEF--RILFEDGAVWNFPVENYFIKLEPEEI------VCLAILGTPR-- 522
Query: 531 NGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
+ I+G+ + ++YD R+G+A C +
Sbjct: 523 -SALSIIGNYQQQNFHILYDTKKSRLGYAPMKCAD 556
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 96/430 (22%), Positives = 166/430 (38%), Gaps = 52/430 (12%)
Query: 164 IIRPHKSKINKKLVSSNAVAVDSSSIFPLR--GNIYPDGLYFTYMIVGNP-PRPYYLDMD 220
++R ++ +L S + A D++ P+ G+ Y ++ +G P P+ L +D
Sbjct: 54 LLRRMVARSKARLASLRSSACDTALTAPVDHGGSDVGSSEYLIHLGIGTPRPQRVVLHLD 113
Query: 221 TGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQ 277
TGSDL W QC C+ C P+++ + + +P D LC G +
Sbjct: 114 TGSDLVWTQC--ACTVCFDQPVPVFRASVSHTFSRVPCSDPLCGHAVYLPLSGCAARDRS 171
Query: 278 CDYEIEYADHSSSMGVLARDELHLTIENGSLTK---PNVVFGCAYDQQGLLLNTLVKTDG 334
C Y Y DHS + G +A D + + T PN+ FGC GL G
Sbjct: 172 CFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYGLFTP---NQSG 228
Query: 335 ILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA---------GGGGYMFLGHDLVP---- 381
I G +SLPSQL + +C T GG H P
Sbjct: 229 IAGFGTGPLSLPSQLKVRRF-----SYCFTAMEESRVSPVILGGEPENIEAHATGPIQST 283
Query: 382 --SWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA-----RNSQVGWALFDTGSSYT 434
+ G A P+ PF Y + + G + L A + G D+G++ T
Sbjct: 284 PFAPGPAGAPVGSQPF---YFLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAIT 340
Query: 435 YFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF-G 493
+F + + L + + +DP +C+ ++ K L LH G
Sbjct: 341 FFPQAVFRSLREAFVAQVPLPVAKGYTDPDNLLCFSVPAKKKAPAVPK-----LILHLEG 395
Query: 494 SKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVN 553
+ W++ + + + + + +C+ IL N + I+G+ + +VYD +
Sbjct: 396 ADWELPRENYVLDNDDDGSGAGR-KLCVVILSAG---NSNGTIIGNFQQQNMHIVYDLES 451
Query: 554 KRIGWAKSHC 563
++ +A + C
Sbjct: 452 NKMVFAPARC 461
>gi|213998804|gb|ACJ60769.1| nucellin [Hordeum muticum]
gi|213998808|gb|ACJ60771.1| nucellin [Hordeum erectifolium]
gi|213998820|gb|ACJ60777.1| nucellin [Hordeum patagonicum subsp. mustersii]
gi|213998822|gb|ACJ60778.1| nucellin [Hordeum patagonicum subsp. santacrucense]
gi|333069937|gb|AEF13570.1| nucellin, partial [Hordeum pubiflorum]
Length = 154
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 79/146 (54%), Gaps = 9/146 (6%)
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAG 368
K + FGC Y Q+ + DGILGL K +QL Q +I NV+GHCL++
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSK-- 63
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G G +++G PS G+ WVPM +S F Y + ++ + P+ R + A+FD
Sbjct: 64 GKGVLYVGDFNPPSRGVTWVPMKESLF--YYSPGLAELLIDNQPI----RGNPTFEAVFD 117
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSD 454
+GS+YT+ Q Y+E+++ ++ S+
Sbjct: 118 SGSTYTHVPAQIYNEIVSKVRGTLSE 143
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 101/416 (24%), Positives = 167/416 (40%), Gaps = 68/416 (16%)
Query: 192 LRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP--CSSC-----AKGAN 242
L+ ++P G Y + G PP+ MDTGS L W C + CS C
Sbjct: 80 LKTPLFPRSYGGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGI 139
Query: 243 PLYKPRMG---NILPYKDSLCMEIQRNHKPGYCETCQQCD------------YEIEYADH 287
P + P+ N++ K+ C + P CQ+CD Y I+Y
Sbjct: 140 PTFIPKQSSSSNLIGCKNHKCSWL---FGPKVQSKCQECDPTTQNCTQSCPPYVIQYG-L 195
Query: 288 SSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPS 347
S+ G+L + L + T P + GC+ L ++ + +GI G R+ SLPS
Sbjct: 196 GSTAGLLLSETLDFPHKK---TIPGFLVGCS-------LFSIRQPEGIAGFGRSPESLPS 245
Query: 348 QLASQGIIKNVVGHCLTTNAGGGGYMF---LGHDLVPSWGMAWVPMLDSP---FMELYHT 401
QL + +V H + G D + G+++ P +P F + Y+
Sbjct: 246 QLGLKKFSYCLVSHAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYV 305
Query: 402 EILKINYGSSPLNLGAR-----NSQVGWALFDTGSSYTYFTKQAYSELIASL--KEVSSD 454
+ I G + + + + + G + D+G+++T+ K Y EL+A K+V+
Sbjct: 306 LLRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVY-ELVAKEFEKQVAHY 364
Query: 455 GLVLDASDPT-LPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI 513
+ + + T L C+ V V +F HF K + Y
Sbjct: 365 TVATEVQNQTGLRPCFNISG--EKSVSVPEFI----FHFKG-----GAKMALPLANYFSF 413
Query: 514 SKKGNICLGI----LDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
G ICL I + GS + G IILG+ R V +D N+R G+ + +C++
Sbjct: 414 VDSGVICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHVEFDLKNERFGFKQQNCVS 469
>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 529
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 91/387 (23%), Positives = 152/387 (39%), Gaps = 56/387 (14%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP----------LYKPR-- 248
L++ + VG P + + +DTGSDL W+ C+ S+C + LY P
Sbjct: 101 LHYANVSVGTPATWFLVALDTGSDLFWLPCNCG-STCIRDLKEVGLSQSRPLNLYSPNTS 159
Query: 249 -MGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSS-SMGVLARDELHLTIENG 306
+ + D C R C Y+I+Y + + G L D LHL E+
Sbjct: 160 STSSSIRCSDDRCFGSSRCSS-----PASSCPYQIQYLSKDTFTTGTLFEDVLHLVTEDE 214
Query: 307 SL--TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT 364
L K N+ GC +Q G L ++ +G+LGL S+PS LA I N C
Sbjct: 215 GLEPVKANITLGCGKNQTGFLQSS-AAVNGLLGLGLKDYSVPSILAKAKITANSFSMCFG 273
Query: 365 TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFM-----ELYHTEILKINYGSSPLNLGARN 419
G + G + +++P + Y + +++ G + +
Sbjct: 274 NIIDVVGRISFGDK-------GYTDQMETPLLPTEPSPTYAVSVTEVSVGGDAVGVQL-- 324
Query: 420 SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP--VCWRAKFPIRS 477
ALFDTG+S+T+ + Y + + + +D DP LP C+ P ++
Sbjct: 325 ----LALFDTGTSFTHLLEPEYGLITKAFDDHVTDK--RRPIDPELPFEFCYDLS-PNKT 377
Query: 478 IVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIIL 537
+ + +T GS+ + + F + E CLGIL + I+
Sbjct: 378 TILFPRV--AMTFEGGSQMFLRNPLFIVWNE-----DNSAMYCLGILKSVDFKIN---II 427
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHCM 564
G + G +V+D +GW +S C
Sbjct: 428 GQNFMSGYRIVFDRERMILGWKRSDCF 454
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 93/416 (22%), Positives = 169/416 (40%), Gaps = 66/416 (15%)
Query: 189 IFPLRGNIYPDG--------LYFTY-------MIVGNPPRPYYLDMDTGSDLTWIQCDAP 233
I PL+ + P G L F + + VG+PP+ + +DTGS+L+W+ C
Sbjct: 35 ILPLKTQVLPSGSVPRPSSKLSFHHNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKK- 93
Query: 234 CSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHK-PGYCETCQQCDYEIEYADHSS 289
A + ++ P + +P C R+ P C+ + C I YAD SS
Sbjct: 94 ----APNLHSVFDPLRSSSYSPIPCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASS 149
Query: 290 SMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQL 349
G LA D H+ G+ P +FGC + KT G++G++R +S +Q+
Sbjct: 150 IEGNLASDTFHI----GNSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQM 205
Query: 350 ASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSW--GMAWVPMLDS----PFME--LYHT 401
Q +C++ G +F SW + + P++ P+ + Y
Sbjct: 206 GLQKF-----SYCISGQDSSGILLFGESSF--SWLKALKYTPLVQISTPLPYFDRVAYTV 258
Query: 402 EILKINYGSSPLNL-----GARNSQVGWALFDTGSSYTYFTKQAYSEL--------IASL 448
++ I +S L L ++ G + D+G+ +T+ Y+ L ASL
Sbjct: 259 QLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASL 318
Query: 449 KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF-GSKWQIVSTKFHISP 507
K + V + + +C+R R++ + T+TL F G++ + + +
Sbjct: 319 KVLEDPNFVFQGA---MDLCYRVPLTRRTLPPL----PTVTLMFRGAEMSVSAERLMYRV 371
Query: 508 EGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G VI ++ SE+ + I+G + + +D R+G+A+ C
Sbjct: 372 PG--VIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 425
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 94/396 (23%), Positives = 160/396 (40%), Gaps = 59/396 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y + +G PP P DTGSDLTW+Q PC C P++ P LP
Sbjct: 78 GEYMMNLSIGTPPFPILAIADTGSDLTWLQ-SKPCDQCYPQKGPIFDPSNSTTFHKLPCT 136
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
+ C + + + C C Y Y DHS + G LA D +T+ N S+ NV FG
Sbjct: 137 TAPCNALDESARS--CTDPTTCGYTYSYGDHSYTTGYLASDT--VTVGNASVQIRNVAFG 192
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT----------TN 366
C G N + GI+GL +S SQL I +CL ++
Sbjct: 193 CGTRNGG---NFDEQGSGIVGLGGGNLSFVSQLGD--TIGKKFSYCLLPLENEISSQPSD 247
Query: 367 AGGGGYMFLGHDLVPSWG------MAWVPMLDSPFMELYHTEILKINYGSSPL------- 413
+ + G + V S A P+++ Y+ I I G L
Sbjct: 248 SPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSS 307
Query: 414 -----NLGARNS-QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPV 467
+ G+++S + G + D+G++ T+ ++ Y L A+L E V D + +
Sbjct: 308 KTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVKNSMFSL 367
Query: 468 CWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS 527
C+++ + V++ + +HF + + P V +++G +C +L +
Sbjct: 368 CFKSG---KEEVEL----PLMKVHFRGGADV-----ELKPVNTFVRAEEGLVCFTMLPTN 415
Query: 528 EVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+V I G+++ +V YD + + + + C
Sbjct: 416 DVG-----IYGNLAQMNFVVGYDLGKRTVSFLPADC 446
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 91/398 (22%), Positives = 163/398 (40%), Gaps = 46/398 (11%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
+ G+ G YF + VG P + + L +DTGSDLTWIQC+ P ++ + P +
Sbjct: 17 VSGSSIGSGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSS 76
Query: 252 ILPYKDSLCMEIQRNHKPG-YCETC-----QQCDYEIEYADHSSSMGVLARDELHLTIEN 305
Y++ C + + P +C CDY Y+D S + G+LA + + +
Sbjct: 77 SSSYREIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRK 136
Query: 306 GSLTKP-----------NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI 354
S + NV GC+ + G + + G+LGL + +SL +Q
Sbjct: 137 RSGKRAGNHKTRTIRIKNVALGCSRESVGA---SFLGASGVLGLGQGPISLATQ-TRHTA 192
Query: 355 IKNVVGHCLTTN-AGGGGYMFLGHDLVPSWGMAWVPMLDSPFME-LYHTEILKINYGSSP 412
+ + +CL G FL +A P++ +P + Y+ + + P
Sbjct: 193 LGGIFSYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKP 252
Query: 413 LNLGARNSQVG-------WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTL 465
++ G +S G +FD+G++ +Y + AYS+++ +L + L
Sbjct: 253 VD-GIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALN-----------ASIYL 300
Query: 466 PVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILD 525
P +V + K + G ++Q + + Y+V+ + C+ L
Sbjct: 301 PRAQEIPEGFELCYNVTRMEKGMP-KLGVEFQGGAV-MELPWNNYMVLVAENVQCVA-LQ 357
Query: 526 GSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
NGS ILG++ + + YD RIG+ S C
Sbjct: 358 KVTTTNGSN-ILGNLLQQDHHIEYDLAKARIGFKWSPC 394
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 96/411 (23%), Positives = 167/411 (40%), Gaps = 46/411 (11%)
Query: 169 KSKINKKLVSSNAVAVDSSSIFPLRGNIYPDG-LYFTYMIVGNPPRPYYLDMDTGSDLTW 227
++ ++ + N + I + ++ P+G YF M +G P + DTGSDLTW
Sbjct: 60 RNAFSRSISRVNVFKTKAVDINSFQNDLVPNGGEYFMKMSIGTPLVEVIVIADTGSDLTW 119
Query: 228 IQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQ----CDYEIE 283
+QC PC C + +PL+ P + Y+ LC N + C C+Y
Sbjct: 120 VQC-LPCDPCYRQKSPLFDPSRSS--SYRHMLCGSRFCNALDVSEQACTMDTNICEYHYS 176
Query: 284 YADHSSSMGVLARDELHL-TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAK 342
Y D S + G LA ++ + + + + +VFGC G +GL
Sbjct: 177 YGDKSYTNGNLATEKFTIGSTSSRPVHLSPIVFGCGTGNGGTFDELGSGI---VGLGGGA 233
Query: 343 VSLPSQLASQGIIKNVVGHC---LTTNAGGGGYMFLGHDLVPSW-GMAWVPMLDSPFMEL 398
+SL SQL+S IIK +C L+ + + G D V S + P++
Sbjct: 234 LSLVSQLSS--IIKGKFSYCLVPLSEQSNVTSKIKFGTDSVISGPQVVSTPLVSKQPDTY 291
Query: 399 YHTEILKINYGSS--PLNLGARNSQV--GWALFDTGSSYTYFTKQAYSELIASLKEVSSD 454
Y+ + I+ G+ P G N V G + D+G++ T+ + ++EL L+E
Sbjct: 292 YYVTLEAISVGNKRLPYTNGLLNGNVEKGNVIIDSGTTLTFLDSEFFTELERVLEETVKA 351
Query: 455 GLVLDASDPT--LPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLV 512
V SDP VC+R+ I + +HF + P V
Sbjct: 352 ERV---SDPRGLFSVCFRSAGDID--------LPVIAVHFN------DADVKLQPLNTFV 394
Query: 513 ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ + +C ++ +++ I G+++ LV YD + + + + C
Sbjct: 395 KADEDLLCFTMISSNQIG-----IFGNLAQMDFLVGYDLEKRTVSFKPTDC 440
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 155/384 (40%), Gaps = 68/384 (17%)
Query: 195 NIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILP 254
N P Y ++ +G PP+P L +DTGSDL W QC PC +C A P + P +
Sbjct: 82 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSST-- 138
Query: 255 YKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
+ CD + +S L R + T + P V
Sbjct: 139 ------------------LSLTSCDSTLCQGLPVAS---LPRSD-KFTFVGAGASVPGVA 176
Query: 315 FGCAYDQQGLLLNTLVKTD--GILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG--- 369
FGC GL N + K++ GI G R +SLPSQL + N HC TT G
Sbjct: 177 FGC-----GLFNNGVFKSNETGIAGFGRGPLSLPSQLK----VGN-FSHCFTTITGAIPS 226
Query: 370 GGYMFLGHDLVPSWGMAWV---PMLDSPFM-ELYHTEILKINYGSS----PLNLGARNSQ 421
+ L DL S G V P++ +P Y+ + I GS+ P + A +
Sbjct: 227 TVLLDLPADLF-SNGQGAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNG 285
Query: 422 VGWALFDTGSSYTYFTKQAYSELI-ASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
G + D+G++ T + Y + A +V + + +DP C A P+R+
Sbjct: 286 TGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--CLSA--PLRA--- 338
Query: 481 VKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGD 539
K + L LHF G+ + + E + +CL I++G EV +G+
Sbjct: 339 -KPYVPKLVLHFEGATMDLPRENYVFEVED----AGSSILCLAIIEGGEVTT-----IGN 388
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
+ V+YD N ++ + + C
Sbjct: 389 FQQQNMHVLYDLQNSKLSFVPAQC 412
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 72/261 (27%), Positives = 117/261 (44%), Gaps = 35/261 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC--AKGANPLYKPRMGNI---LP 254
G Y + +G PP + + +DTGS+L W QC APC+ C P+ +P + LP
Sbjct: 89 GAYNMNISLGTPPLDFPVIVDTGSNLIWAQC-APCTRCFPRPTPAPVLQPARSSTFSRLP 147
Query: 255 YKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
S C + + +P C C Y Y ++ G LA + LT+ +G+ P V
Sbjct: 148 CNGSFCQYLPTSSRPRTCNATAACAYNYTYGSGYTA-GYLATET--LTVGDGTF--PKVA 202
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY-- 372
FGC+ + N + + GI+GL R +SL SQLA +CL ++ GG
Sbjct: 203 FGCSTE------NGVDNSSGIVGLGRGPLSLVSQLA-----VGRFSYCLRSDMADGGASP 251
Query: 373 MFLGH--DLVPSWGMAWVPMLDSPFMEL---YHTEILKINYGSSPL-----NLGARNSQV 422
+ G L + P+L +P+++ Y+ + I S+ L G + +
Sbjct: 252 ILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGL 311
Query: 423 GWA-LFDTGSSYTYFTKQAYS 442
G + D+G++ TY K Y+
Sbjct: 312 GGGTIVDSGTTLTYLAKDGYA 332
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 92/376 (24%), Positives = 148/376 (39%), Gaps = 43/376 (11%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + +G PP P DTGSDL W QC+ PC C + +PL+ P+ + Y+
Sbjct: 84 GEYLMNISIGTPPVPILAIADTGSDLIWTQCN-PCEDCYQQTSPLFDPKESST--YRKVS 140
Query: 260 CMEIQ-RNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKP----NV 313
C Q R + C T + C Y I Y D+S + G +A D + + S +P N+
Sbjct: 141 CSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTM---GSSGRRPVSLRNM 197
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL---TTNAGGG 370
+ GC ++ G +GL SL SQL I +CL T+ G
Sbjct: 198 IIGCGHENTGTFDPAGSGI---IGLGGGSTSLVSQLRKS--INGKFSYCLVPFTSETGLT 252
Query: 371 GYMFLGHD-LVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR--NSQVGWALF 427
+ G + +V G+ M+ Y + I+ GS + + + G +
Sbjct: 253 SKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNIVI 312
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKT 487
D+G++ T Y EL + + V D D L +C+R + + D+ FK
Sbjct: 313 DSGTTLTLLPSNFYYELESVVASTIKAERVQDP-DGILSLCYRDSSSFK-VPDITVHFKG 370
Query: 488 LTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLV 547
+ G+ V+ +S C N I G+++ LV
Sbjct: 371 GDVKLGNLNTFVAVSEDVS-------------CFAF-----AANEQLTIFGNLAQMNFLV 412
Query: 548 VYDNVNKRIGWAKSHC 563
YD V+ + + K+ C
Sbjct: 413 GYDTVSGTVSFKKTDC 428
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 92/372 (24%), Positives = 141/372 (37%), Gaps = 26/372 (6%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL 253
G G Y + +G P Y + DTGSD TW+QC C + L+ P +
Sbjct: 172 GRALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST- 230
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
Y + C + + + C Y ++Y D S S+G A D L L+ +
Sbjct: 231 -YANVSCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD---AVKGF 286
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYM 373
FGC +GL + G+LGL R K SLP Q + V HCL + G GY+
Sbjct: 287 RFGCGERNEGL----FGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSTGTGYL 340
Query: 374 -FLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSS 432
F L + PML Y+ + I G L++ + D+G+
Sbjct: 341 DFGAGSLAAASARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTV 400
Query: 433 YTYFTKQAYSEL-IASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLH 491
T AYS L A +++ G + L C+ F S V + T++L
Sbjct: 401 ITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCY--DFTGMSQVAI----PTVSLL 454
Query: 492 FGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
F + + G + + +CL + G I+G+ L+ V YD
Sbjct: 455 FQG-----GARLDVDASGIMYAASASQVCLAFAANED--GGDVGIVGNTQLKTFGVAYDI 507
Query: 552 VNKRIGWAKSHC 563
K +G+ C
Sbjct: 508 GKKVVGFYPGAC 519
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 90/381 (23%), Positives = 158/381 (41%), Gaps = 56/381 (14%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDSLCM-EI 263
+G PP+ + +DTGS L+WIQC ++ ++ P + ++LP LC I
Sbjct: 88 IGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSS-VFDPSLSSSFSVLPCNHPLCKPRI 146
Query: 264 QRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQG 323
P C+ + C Y YAD + + G L R+++ + S + P ++ GCA +
Sbjct: 147 PDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFS---RSQSTPPLILGCAEESS- 202
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSW 383
GILG++ ++S SQ V + G +LG + S
Sbjct: 203 -------DAKGILGMNLGRLSFASQAKLTKFSYCVPTRQVRPGFTPTGSFYLGEN-PNSG 254
Query: 384 GMAWVPMLD------SPFME--LYHTEILKINYGSSPLNLGARN-----SQVGWALFDTG 430
G ++ +L P ++ Y + I G+ LN+ S G + D+G
Sbjct: 255 GFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAGQTMIDSG 314
Query: 431 SSYTYFTKQAYSELI--------ASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
S +TY +AY+++ A LK+ G V D +C+ + +++
Sbjct: 315 SEFTYLVDEAYNKVREEVVRLVGARLKKGYVYGGVSD-------MCFNG-----NAIEIG 362
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISL 542
+ + F +IV K E L G C+GI SE+ ++ I+G+
Sbjct: 363 RLIGNMVFEFDKGVEIVVEK-----ERVLADVGGGVHCVGI-GRSEMLGAASNIIGNFHQ 416
Query: 543 RGQLVVYDNVNKRIGWAKSHC 563
+ V +D N+R+G+ K+ C
Sbjct: 417 QNIWVEFDLANRRVGFGKADC 437
>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
Length = 256
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 73/139 (52%), Gaps = 30/139 (21%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G YF+ + +G+PP+ Y+ +DTGSD+ W+QC APC+ C + A+P+++P +
Sbjct: 51 GEYFSRVGIGSPPKHVYMVVDTGSDVNWVQC-APCADCYQQADPIFEPSFSS-------- 101
Query: 260 CMEIQRNHKPGYCETCQ------------QCDYEIEYADHSSSMGVLARDELHLTIENGS 307
++ P CET Q C YE+ Y D S ++G A + + L +GS
Sbjct: 102 ------SYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITL---DGS 152
Query: 308 LTKPNVVFGCAYDQQGLLL 326
+ NV GC +D +GL +
Sbjct: 153 ASLNNVAIGCGHDNEGLFV 171
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 103/407 (25%), Positives = 168/407 (41%), Gaps = 47/407 (11%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGL-YFTYMIVGNPPRPYYLDMDTGS 223
+R ++ N SS A D S PL +PDG Y + VG P + + DTGS
Sbjct: 23 VRWMAARANSSSWSSMAGTTDVES--PL----HPDGGGYVMDISVGTPGKRFRAIADTGS 76
Query: 224 DLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKD---SLCMEIQRNHKPGYCE-TCQQCD 279
DL W+Q + PC+ C+ G ++ PR + D LC E+ PG CE C
Sbjct: 77 DLVWVQSE-PCTGCSGGT--IFDPRQSSTFREMDCSSQLCTEL-----PGSCEPGSSACS 128
Query: 280 YEIEYADHSSSMGVLARDELHL-TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGL 338
Y EY + G ARD + L T GS P+ GC G++ + DG++GL
Sbjct: 129 YSYEYGSGETE-GEFARDTISLGTTSGGSQKFPSFAVGC-----GMVNSGFDGVDGLVGL 182
Query: 339 SRAKVSLPSQLASQGIIKNVVGHCLT-TNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFME 397
+ VSL SQL++ I + +CL N+ L G +P +
Sbjct: 183 GQGPVSLTSQLSAA--IDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSD 240
Query: 398 LYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLV 457
Y T L G + G G + D+G++ TY Y +++ ++ + + V
Sbjct: 241 TYPTYYLLTVNGIAV--AGQTMGSPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRV 298
Query: 458 LDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKG 517
D S L +C+ + F LT+ +T S +LV+ G
Sbjct: 299 -DGSSMGLDLCYDRS------SNRNYKFPALTIRLAG-----ATMTPPSSNYFLVVDDSG 346
Query: 518 N-ICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ +CL + + I+G++ +G ++YD + + + ++ C
Sbjct: 347 DTVCLAMGSAGGLP---VSIIGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|213998840|gb|ACJ60787.1| nucellin [Hordeum patagonicum subsp. magellanicum]
Length = 154
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 79/146 (54%), Gaps = 9/146 (6%)
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAG 368
K + FGC Y Q+ + DGILGL K +QL Q +I NV+GHCL++
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSK-- 63
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G G +++G PS G+ WVPM +S F Y + ++ + P+ R + A+FD
Sbjct: 64 GKGVLYVGDFNPPSRGVTWVPMKESLFY--YSPGLAELLIDNQPI----RGNPTFEAVFD 117
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSD 454
+GS+YT+ Q Y+E+++ ++ S+
Sbjct: 118 SGSTYTHVPAQIYNEILSKVRGTLSE 143
>gi|213998818|gb|ACJ60776.1| nucellin [Hordeum patagonicum subsp. setifolium]
Length = 149
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 79/146 (54%), Gaps = 9/146 (6%)
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAG 368
K + FGC Y Q+ + DGILGL K +QL Q +I NV+GHCL++
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSK-- 63
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G G +++G PS G+ WVPM +S F Y + ++ + P+ R + A+FD
Sbjct: 64 GKGVLYVGDFNPPSRGVTWVPMKESLF--YYSPGLAELLIDNQPI----RGNPTFEAVFD 117
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSD 454
+GS+YT+ Q Y+E+++ ++ S+
Sbjct: 118 SGSTYTHVPAQIYNEILSKVRGTLSE 143
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 108/411 (26%), Positives = 176/411 (42%), Gaps = 55/411 (13%)
Query: 170 SKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMI-VGNPPRPYYLDMDTGSDLTWI 228
S+++K ++ +A D S L G++ D L + + +G P L +DTGSDL+W+
Sbjct: 96 SRVSKGMMGDDA---DVSIPTHLGGSV--DSLEYVVTVGLGTPSVSQVLLIDTGSDLSWV 150
Query: 229 QCDAPCSS--CAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCET---CQQCDY 280
QC PC+S C +PL+ P + +P C ++ + G C + QC +
Sbjct: 151 QCQ-PCNSTTCYPQKDPLFDPSKSSTYAPIPCNTDACRDLTDDGYGGGCASGDGAAQCGF 209
Query: 281 EIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSR 340
I Y D S + GV + + L L + + FGC +DQ G K DG+LGL
Sbjct: 210 AITYGDGSQTRGVYSNETLALAP---GVAVKDFRFGCGHDQDG----ANDKYDGLLGLGG 262
Query: 341 AKVSLPSQLASQGIIKNVVGHCL------TTNAGGGGYMFLGHDLVPSWGMAWVPMLDSP 394
A SL Q AS + +CL GG +V + G + PM+
Sbjct: 263 APESLVVQTAS--VYGGAFSYCLPALNNQVGFLALGGGGAPSGGVVNTSGFVFTPMIREE 320
Query: 395 FMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASL-KEVSS 453
Y + I G P+++ ++ G + D+G+ T AY+ L A+ K +++
Sbjct: 321 -ETFYVVNMTGITVGGEPIDV-PPSAFSGGMIIDSGTVVTELQHTAYNALQAAFRKAMAA 378
Query: 454 DGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVI 513
LV + L C+ F S V + + LT G+ + P G L+
Sbjct: 379 YPLVRNGE---LDTCY--DFSGYSNVTLPKV--ALTFSGGATIDL------DVPNGILL- 424
Query: 514 SKKGNICLGILD-GSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ CL + G + G ILG+++ R V+YD R+G+ + C
Sbjct: 425 ----DDCLAFQESGPDDQPG---ILGNVNQRTLEVLYDAGRGRVGFRAAVC 468
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 72/261 (27%), Positives = 117/261 (44%), Gaps = 35/261 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC--AKGANPLYKPRMGNI---LP 254
G Y + +G PP + + +DTGS+L W QC APC+ C P+ +P + LP
Sbjct: 89 GAYNMNISLGTPPLDFPVIVDTGSNLIWAQC-APCTRCFPRPTPAPVLQPARSSTFSRLP 147
Query: 255 YKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
S C + + +P C C Y Y ++ G LA + LT+ +G+ P V
Sbjct: 148 CNGSFCQYLPTSSRPRTCNATAACAYNYTYGSGYTA-GYLATET--LTVGDGTF--PKVA 202
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY-- 372
FGC+ + N + + GI+GL R +SL SQLA +CL ++ GG
Sbjct: 203 FGCSTE------NGVDNSSGIVGLGRGPLSLVSQLA-----VGRFSYCLRSDMADGGASP 251
Query: 373 MFLGH--DLVPSWGMAWVPMLDSPFMEL---YHTEILKINYGSSPLNLGARN---SQVGW 424
+ G L + P+L +P+++ Y+ + I S+ L + +Q G
Sbjct: 252 ILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGL 311
Query: 425 A---LFDTGSSYTYFTKQAYS 442
+ D+G++ TY K Y+
Sbjct: 312 GGGTIVDSGTTLTYLAKDGYA 332
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 99/419 (23%), Positives = 168/419 (40%), Gaps = 47/419 (11%)
Query: 160 VNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGL-YFTYMIVGNPPRPYYLD 218
++D +R +S++ K ++S + + PL I L Y + +G R +
Sbjct: 93 MDDFQLRSLQSRM-KSIISGRNIDDSVDAPIPLTSGIRLQTLNYIVTVELGG--RKMTVI 149
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCME---IQRNHKPGYCETC 275
+DTGSDL+W+QC PC C +P++ P Y+ LC G C
Sbjct: 150 VDTGSDLSWVQCQ-PCKRCYNQQDPVFNPSTSP--SYRTVLCSSPTCQSLQSATGNLGVC 206
Query: 276 Q----QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVK 331
C+Y + Y D S + G L + L L S N +FGC + QGL
Sbjct: 207 GSNPPSCNYVVNYGDGSYTRGELGTEHLDL---GNSTAVNNFIFGCGRNNQGLFGG---- 259
Query: 332 TDGILGLSRAKVSLPSQLASQGIIKNVVGHCL--TTNAGGGGYMFLGHDLV--PSWGMAW 387
G++GL R+ +SL SQ ++ + V +CL T G + G+ V + +++
Sbjct: 260 ASGLVGLGRSSLSLISQTSA--MFGGVFSYCLPITETEASGSLVMGGNSSVYKNTTPISY 317
Query: 388 VPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIAS 447
M+ +P + Y + I GS + A + + D+G+ T Y L
Sbjct: 318 TRMIPNPQLPFYFLNLTGITVGSVAVQ--APSFGKDGMMIDSGTVITRLPPSIYQAL--- 372
Query: 448 LKEVSSDGLVLDASD-PTLP--VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFH 504
D V S P+ P + F + +V+ + +HF ++
Sbjct: 373 -----KDEFVKQFSGFPSAPAFMILDTCFNLSGYQEVE--IPNIKMHFEGNAEL---NVD 422
Query: 505 ISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
++ Y V + +CL I S + I+G+ + Q V+YD +G+A C
Sbjct: 423 VTGVFYFVKTDASQVCLAI--ASLSYENEVGIIGNYQQKNQRVIYDTKGSMLGFAAEAC 479
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 103/382 (26%), Positives = 151/382 (39%), Gaps = 72/382 (18%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP-RMGNILPYKDSLCMEIQRN 266
+G PP P L ++ G++L W + P C + A P ++P LP+ + N
Sbjct: 1 MGTPPNPVKLKLENGNELIWNHSN-PSPECFEQAFPYFEPLTFSRGLPFASCGSPKFWPN 59
Query: 267 HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLL 326
Q C Y Y D S + G L D+ S+ P V FGC GL
Sbjct: 60 ---------QTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASV--PGVAFGC-----GLFN 103
Query: 327 NTLVKTD--GILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG---GGYMFLGHDLVP 381
N + K++ GI G R +SLPSQL + N HC TT G + L DL
Sbjct: 104 NGVFKSNETGIAGFGRGPLSLPSQLK----VGN-FSHCFTTITGAIPSTVLLDLPADLF- 157
Query: 382 SWGMAWV---PMLDSPFME----LYHTEILKINYGSS----PLNLGARNSQVGWALFDTG 430
S G V P++ E LY+ + I GS+ P + A + G + D+G
Sbjct: 158 SNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTIIDSG 217
Query: 431 SSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP-------VCWRAKFPIRSIVDVKQ 483
+S T Q Y +V D P +P C+ A P ++ DV +
Sbjct: 218 TSITSLPPQVY--------QVVRDEFAAQIKLPVVPGNATGHYTCFSA--PSQAKPDVPK 267
Query: 484 FFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN--ICLGILDGSEVHNGSTIILGDIS 541
L LHF + + ++ + V GN ICL I G E T I+G+
Sbjct: 268 ----LVLHFEGATMDLPRENYV----FEVPDDAGNSIICLAINKGDE-----TTIIGNFQ 314
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+ V+YD N + + + C
Sbjct: 315 QQNMHVLYDLQNNMLSFVAAQC 336
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 87/332 (26%), Positives = 137/332 (41%), Gaps = 39/332 (11%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNH 267
+G+PP L MDT SDL WIQC PC +C + P++ P +++ C Q +
Sbjct: 91 IGSPPITQLLHMDTASDLLWIQC-LPCINCYAQSLPIFDPSRS--YTHRNETCRTSQYSM 147
Query: 268 KP-GYCETCQQCDYEIEYADHSSSMGVLARDELHLTI---ENGSLTKPNVVFGCAYDQQG 323
+ + C+Y + Y D + S G+LAR+ L E+ S +VVFGC +D G
Sbjct: 148 PSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALHDVVFGCGHDNYG 207
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC---LTTNAGGGGYMFLGHDLV 380
LV T GILGL + SL + + +C L + + LG D
Sbjct: 208 ---EPLVGT-GILGLGYGEFSLVHRFGKK------FSYCFGSLDDPSYPHNVLVLGDDGA 257
Query: 381 PSWGMAWVPMLDSPFMELYHTEILKINYGSSPL----NLGARNSQVGWA--LFDTGSSYT 434
G + + F Y+ I I+ L + RN Q G + DTG+S T
Sbjct: 258 NILGDTTPLEIHNGF---YYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNSLT 314
Query: 435 YFTKQAYSELIASLKEVSSDGLV---LDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLH 491
++AY L ++++ + D C+ F R +V+ F +T H
Sbjct: 315 SLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGNFE-RDLVESG--FPIVTFH 371
Query: 492 FGSKWQ----IVSTKFHISPEGYLVISKKGNI 519
F + + S +SP + + GN+
Sbjct: 372 FSEGAELSLDVKSLFMKLSPNVFCLAVTPGNL 403
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 87/373 (23%), Positives = 159/373 (42%), Gaps = 37/373 (9%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCM 261
Y + G PP+ +Y +DTGS++ WI C+ PCS C+ P ++P + Y
Sbjct: 124 YIIKLGFGTPPQSFYTVLDTGSNIAWIPCN-PCSGCSSKQQP-FEPSKSSTYNYLTCASQ 181
Query: 262 EIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQ 321
+ Q + C Y D S +L+ + L + GS N VFGC+
Sbjct: 182 QCQLLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSV----GSQQVENFVFGCSNAA 237
Query: 322 QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT--NAGGGGYMFLGHDL 379
+GL + +T ++G R +S SQ A+ + + +CL + ++ G + LG +
Sbjct: 238 RGL----IQRTPSLVGFGRNPLSFVSQTAT--LYDSTFSYCLPSLFSSAFTGSLLLGKEA 291
Query: 380 VPSWGMAWVPML-DSPFMELYHTEILKINYGSSPLNLGA-----RNSQVGWALFDTGSSY 433
+ + G+ + P+L +S + Y+ + I+ G +++ A S + D+G+
Sbjct: 292 LSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDSGTVI 351
Query: 434 TYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFG 493
T + AY+ + S + S+ L + + C+ R DV+ F +TLHF
Sbjct: 352 TRLVEPAYNAMRDSFRSQLSN-LTMASPTDLFDTCYN-----RPSGDVE--FPLITLHFD 403
Query: 494 SKWQIVSTKFHISPEGYLVISKKGNI-CLGILDGSEVHNGSTII--LGDISLRGQLVVYD 550
+ +I G + G++ CL G G ++ G+ + +V+D
Sbjct: 404 DNLDLTLPLDNILYPG----NDDGSVLCLAF--GLPPGGGDDVLSTFGNYQQQKLRIVHD 457
Query: 551 NVNKRIGWAKSHC 563
R+G A +C
Sbjct: 458 VAESRLGIASENC 470
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 88/379 (23%), Positives = 160/379 (42%), Gaps = 47/379 (12%)
Query: 191 PLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG 250
P ++ +Y + VG PP ++DTGSDL W QC PC++C P++ P
Sbjct: 50 PYADTLFDYNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQC-MPCTNCYSQYAPIFDPS-- 106
Query: 251 NILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LT 309
N +K+ C C Y+I YAD + S G LA + + + +G
Sbjct: 107 NSSTFKEKRCNG-------------NSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFV 153
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG 369
P GC ++ G++GLS SL +Q+ G ++ +C + G
Sbjct: 154 MPETTIGCGHNSSWF----KPTFSGMVGLSWGPSSLITQMG--GEYPGLMSYCFASQ-GT 206
Query: 370 GGYMFLGHDLVPSWGMAWVPM-LDSPFMELYHTEILKINYGSSPL-NLGAR-NSQVGWAL 426
F + +V G+ M L + LY+ + ++ G + + +G ++ G +
Sbjct: 207 SKINFGTNAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNII 266
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP--VCWRAKFPIRSIVDVKQF 484
D+G++ TYF +Y L+ + V + +DPT +C+ D
Sbjct: 267 IDSGTTLTYF-PVSYCNLVR--EAVDHYVTAVRTADPTGNDMLCYYT--------DTIDI 315
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
F +T+HF +V K+++ Y+ +G CL I+ + + I G+ +
Sbjct: 316 FPVITMHFSGGADLVLDKYNM----YIETITRGTFCLAIICNNPPQDA---IFGNRAQNN 368
Query: 545 QLVVYDNVNKRIGWAKSHC 563
LV YD+ + + ++ ++C
Sbjct: 369 FLVGYDSSSLLVSFSPTNC 387
>gi|168002493|ref|XP_001753948.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694924|gb|EDQ81270.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 602
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 96/445 (21%), Positives = 166/445 (37%), Gaps = 104/445 (23%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYK--DSLCMEIQR 265
+G + YY+ +DTGS ++W+ C +G + L+KP+ + + K + C Q
Sbjct: 162 LGKERQEYYMHIDTGSGISWVNCKGRGPITTEGPHGLFKPKADSYVNCKKQEEFCKGFQ- 220
Query: 266 NHKPGYCETCQ--QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA----- 318
+ + C+ +C ++ +Y D G + +L + +GS ++ +V FGCA
Sbjct: 221 DGEEHRCDKKHHFRCIFDTQYGDGLIIEGYIVMIDLIFDLSDGSESQADVAFGCASTCPK 280
Query: 319 ----------------------------YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLA 350
+ + L NT + TDG++GL S QL
Sbjct: 281 FQVVKNTPHLSVKIASSFSIMCADKVNDEETKKLGQNTAL-TDGLIGLGPHPGSWLHQLN 339
Query: 351 SQGIIKN-VVGHCLTTNAGGGGYMFLGHDLVPSWGM--------------AWVPMLDSP- 394
G I V+ C + G + +G +L G W + SP
Sbjct: 340 MLGYISEYVIAICFEPDLGKSRHAAIGPELPEPAGFLSFGNPYSAQAESTIWTANIPSPE 399
Query: 395 -----------------FMELYHTEILKINYGSSPLNLGA------RNSQVGWAL-FDTG 430
+ +Y ++ I Y + L R+ G + FDTG
Sbjct: 400 EYANPHPHEANSTNLQYYDAMYTGRLVSIRYRDIVIQLRGNEKKRKRDHPEGVQMGFDTG 459
Query: 431 SSYTYFTKQAYSELIASLKEVS----------SDGLVLDASDPTLPVCWRAKF-----PI 475
S TY T++ + + L E + +D V D CWR K +
Sbjct: 460 SDLTYLTRKTFDAFVTILDEEAKHLGYEITRDADEFVKDEQRK----CWRKKSGGEEPSV 515
Query: 476 RSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI 535
D+ F T +++ K++I+ EG ++ C +L +E G+
Sbjct: 516 EDFGDMILEFATFAEDDTKSELVINPKYYITSEGS---GRQHRTCFNMLKETEFDFGN-- 570
Query: 536 ILGDISLRGQLVVYDNVNKRIGWAK 560
LG +RG L+++DN RIGW +
Sbjct: 571 -LGAEVMRGHLLLFDNELNRIGWRR 594
>gi|297819828|ref|XP_002877797.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323635|gb|EFH54056.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 530
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 92/379 (24%), Positives = 150/379 (39%), Gaps = 40/379 (10%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP--LYKPRMGNIL-PYKD 257
L++ + VG P + + +DTGS+L W+ C+ S+C + L + R N+ P
Sbjct: 102 LHYANVSVGTPATWFLVALDTGSNLFWLPCNCG-STCIRDLKDIGLSQSRPLNLYSPNTS 160
Query: 258 SLCMEIQRNHKPGY-----CETCQQCDYEIEYADHSS-SMGVLARDELHLTIENGSL--T 309
S I+ N + C Y+I+Y + + G L D LHL E+ L
Sbjct: 161 STSSSIRCNDDRCFGSSQCSSPASSCPYQIQYLSKDTFTTGTLFEDVLHLVTEDVDLKPV 220
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG 369
K N+ GC +Q G L ++ +G+LGL S+PS LA I N C
Sbjct: 221 KANITLGCGRNQTGFLQSS-AAINGLLGLGMKDYSVPSILAKAKITANSFSMCFGNIIDV 279
Query: 370 GGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDT 429
G + G + +++P + + +N + Q+ ALFDT
Sbjct: 280 IGRISFGDK-------GYTDQMETPLLPTEPSPTYAVNVTEVSVGGDVVGVQL-LALFDT 331
Query: 430 GSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLT 489
G+S+T+ + Y + + D V D P P + P D+ T+
Sbjct: 332 GTSFTHLLEPEYGLITKAF-----DDHVTDKRRPIDP-----EIPFEFCYDLSPNSTTIL 381
Query: 490 L-HFGSKWQIVSTKFHISPEGYLVISKKGNI---CLGILDGSEVHNGSTIILGDISLRGQ 545
++ S F +P ++ + N CLGIL + I+G + G
Sbjct: 382 FPRVAMTFEGGSLMFLRNP--LFIVWNEDNTAMYCLGILKSVDFKIN---IIGQNFMSGY 436
Query: 546 LVVYDNVNKRIGWAKSHCM 564
VV+D +GW +S C
Sbjct: 437 RVVFDRERMILGWKRSDCF 455
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 99/417 (23%), Positives = 166/417 (39%), Gaps = 56/417 (13%)
Query: 169 KSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWI 228
+S ++L S+ V + + R ++ G Y + +G PP Y DTGSDL W
Sbjct: 63 RSLFGRELAESDGTTVSART----RKDLPNGGEYLMTLSIGTPPLSYPAIADTGSDLIWT 118
Query: 229 QCDAPCSS--CAKGANPLYKPRMGN---ILPYKDSLCM---EIQRNHKPGYCETCQQCDY 280
QC APCS C PLY P +LP SL M + P C C Y
Sbjct: 119 QC-APCSGDQCFAQPAPLYNPASSTTFGVLPCNSSLSMCAGVLAGKAPPPGCA----CMY 173
Query: 281 EIEYADHSSSMGVLARDELHLTIENGSLTK-PNVVFGCAYDQQGLLLNTLVKTDGILGLS 339
Y ++ GV + + P + FGC+ + + G++GL
Sbjct: 174 NQTYGTGWTA-GVQGSETFTFGSAAADQARVPGIAFGCSNASS----SDWNGSAGLVGLG 228
Query: 340 RAKVSLPSQLASQGIIKNVVGHCLT--TNAGGGGYMFLG-HDLVPSWGMAWVPMLDS--- 393
R +SL SQL + +CLT + + LG + G+ P + S
Sbjct: 229 RGSLSLVSQLGA-----GRFSYCLTPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASPAK 283
Query: 394 -PFMELYHTEILKINYGS-----SPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIAS 447
P Y+ + I+ G+ SP + G + D+G++ T AY ++ A+
Sbjct: 284 APMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSGTTITSLVNAAYQQVRAA 343
Query: 448 LKEVSSDGLVLDASDPT-LPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHIS 506
++ + + +D SD T L +C+ P ++TLHF ++
Sbjct: 344 VQSLVTL-PAIDGSDSTGLDLCYALPTP----TSAPPAMPSMTLHFDGADMVL------- 391
Query: 507 PEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
P +IS G CL + + ++ G+ G+ + ++YD N+ + +A + C
Sbjct: 392 PADSYMISGSGVWCLAMRNQTD---GAMSTFGNYQQQNMHILYDVRNEMLSFAPAKC 445
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 108/421 (25%), Positives = 177/421 (42%), Gaps = 72/421 (17%)
Query: 186 SSSIF--PLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP--CSSCA--- 238
S+S+F PL + Y G Y T + G P + +L DTGS L W C + CS C+
Sbjct: 65 SNSVFKSPLSPHSY--GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPK 122
Query: 239 --KGANPLYKPRMGN---ILPYKDSLCMEIQRNHKPGYCETC--------QQC-DYEIEY 284
P + P++ + ++ ++ C I C +C Q C Y ++Y
Sbjct: 123 IDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQY 182
Query: 285 ADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVS 344
S++ G+L L T++ PN V GC++ ++ + GI G R S
Sbjct: 183 GSGSTA-GLL----LSETLDFPDKXIPNFVVGCSFL-------SIHQPSGIAGFGRGSES 230
Query: 345 LPSQLASQGIIKNVVGHCLTT----NAGGGGYMFLGHDLVPSWGMAWVPMLDSP------ 394
LPSQ+ G+ K +CL + ++ G + L V S G+ + P +P
Sbjct: 231 LPSQM---GLKK--FAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNA 285
Query: 395 FMELYHTEILKINYGSSPLNLGAR-----NSQVGWALFDTGSSYTYFTKQAYSELIASLK 449
+ E Y+ I KI G+ + + + G ++ D+GS++T+ K E++A
Sbjct: 286 YKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVL-EVVAREF 344
Query: 450 EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF--GSKWQIVSTKFHISP 507
E A+D R F I VK F L F G+KW + +
Sbjct: 345 EKQLANWT-RATDVETLTGLRPCFDISKEKSVK--FPELIFQFKGGAKWALPLNNY---- 397
Query: 508 EGYLVISKKGNICLGIL-----DGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSH 562
+ ++S G CL ++ DG G ++ILG + V YD VN+R+G+ +
Sbjct: 398 --FALVSSSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQT 455
Query: 563 C 563
C
Sbjct: 456 C 456
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 99/424 (23%), Positives = 161/424 (37%), Gaps = 97/424 (22%)
Query: 187 SSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQC---------------- 230
SS + R NI + +G P + L +DTGS L+WIQC
Sbjct: 65 SSPYTFRSNIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSF 124
Query: 231 ---------DAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYE 281
D PCS +PL KPR+ + P C++ + C Y
Sbjct: 125 DPSLSSSFSDLPCS------HPLCKPRIPDFT--------------LPTSCDSNRLCHYS 164
Query: 282 IEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRA 341
YAD + + G L +++ + S T P ++ GCA + GILG++
Sbjct: 165 YFYADGTFAEGNLVKEKFTFS---NSQTTPPLILGCAKES--------TDEKGILGMNLG 213
Query: 342 KVSLPSQLASQGIIKNVVGHCLTTNA-----GGGGYMFLGHDLVPSWGMAWVPMLDSPFM 396
++S SQ +C+ T + G +LG D S G +V +L P
Sbjct: 214 RLSFISQAKISKF-----SYCIPTRSNRPGLASTGSFYLG-DNPNSRGFKYVSLLTFPQS 267
Query: 397 E--------LYHTEILKINYGSSPLNLGAR-----NSQVGWALFDTGSSYTYFTKQAY-- 441
+ Y + I G LN+ G + D+GS +T+ AY
Sbjct: 268 QRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDK 327
Query: 442 --SELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIV 499
E++ + G V + T +C+ + ++ + L FG +I+
Sbjct: 328 VKEEIVRLVGSRLKKGYVYGS---TADMCFDGNHSM----EIGRLIGDLVFEFGRGVEIL 380
Query: 500 STKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWA 559
K + LV G C+GI S + S II G++ + V +D N+R+G++
Sbjct: 381 VEK-----QSLLVNVGGGIHCVGIGRSSMLGAASNII-GNVHQQNLWVEFDVTNRRVGFS 434
Query: 560 KSHC 563
K+ C
Sbjct: 435 KAEC 438
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 107/421 (25%), Positives = 176/421 (41%), Gaps = 72/421 (17%)
Query: 186 SSSIF--PLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP--CSSCA--- 238
S+S+F PL + Y G Y T + G P + +L DTGS L W C + CS C+
Sbjct: 65 SNSVFKSPLSPHSY--GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPK 122
Query: 239 --KGANPLYKPRMGN---ILPYKDSLCMEIQRNHKPGYCETC--------QQC-DYEIEY 284
P + P++ + ++ ++ C I C +C Q C Y ++Y
Sbjct: 123 IDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQY 182
Query: 285 ADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVS 344
S++ G+L + L + PN V GC++ ++ + GI G R S
Sbjct: 183 GSGSTA-GLLLSETLDFPDKK----IPNFVVGCSFL-------SIHQPSGIAGFGRGSES 230
Query: 345 LPSQLASQGIIKNVVGHCLTT----NAGGGGYMFLGHDLVPSWGMAWVPMLDSP------ 394
LPSQ+ G+ K +CL + ++ G + L V S G+ + P +P
Sbjct: 231 LPSQM---GLKK--FAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNA 285
Query: 395 FMELYHTEILKINYGSSPLNLGAR-----NSQVGWALFDTGSSYTYFTKQAYSELIASLK 449
+ E Y+ I KI G+ + + + G ++ D+GS++T+ K E++A
Sbjct: 286 YKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVL-EVVAREF 344
Query: 450 EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHF--GSKWQIVSTKFHISP 507
E A+D R F I VK F L F G+KW + +
Sbjct: 345 EKQLANWT-RATDVETLTGLRPCFDISKEKSVK--FPELIFQFKGGAKWALPLNNY---- 397
Query: 508 EGYLVISKKGNICLGIL-----DGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSH 562
+ ++S G CL ++ DG G ++ILG + V YD VN+R+G+ +
Sbjct: 398 --FALVSSSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQT 455
Query: 563 C 563
C
Sbjct: 456 C 456
>gi|413952261|gb|AFW84910.1| hypothetical protein ZEAMMB73_904583 [Zea mays]
Length = 298
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 70/251 (27%), Positives = 112/251 (44%), Gaps = 21/251 (8%)
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C+ Q G L DGI G + ++S+ SQL S G+ V HCL + GGG + LG
Sbjct: 9 CSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLG 68
Query: 377 HDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN---LGARNSQVGWALFDTGSSY 433
+ P G+ + P++ S + E + +N P++ N+Q + D+G++
Sbjct: 69 EIVEP--GLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQ--GTIVDSGTTL 124
Query: 434 TYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFG 493
Y AY ++++ S S +L F S VD F T+TL+F
Sbjct: 125 AYLADGAYDPFVSAIAAAVS------PSVRSLVSKGSQCFITSSSVDSS--FPTVTLYF- 175
Query: 494 SKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI-ILGDISLRGQLVVYDNV 552
+ + PE YL+ + + G + + G I ILGD+ L+ ++ VYD
Sbjct: 176 ----MGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLA 231
Query: 553 NKRIGWAKSHC 563
N R+GWA C
Sbjct: 232 NMRMGWADYDC 242
>gi|213998838|gb|ACJ60786.1| nucellin [Hordeum vulgare subsp. vulgare]
Length = 154
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 77/146 (52%), Gaps = 9/146 (6%)
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAG 368
K + FGC Y Q+ + DGILGL K +QL +IK NV+GHCL++
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVIGHCLSSK-- 63
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G G +++G P+ G+ W PM +S F Y + ++ P+ R + A+FD
Sbjct: 64 GKGVLYVGDFNPPTRGVTWAPMRESLF--YYSPGLAEVFIDKQPI----RGNPTFEAVFD 117
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSD 454
+GS+YT+ Q Y+E+++ ++ S+
Sbjct: 118 SGSTYTHVPAQIYNEIVSKVRVTLSE 143
>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
Length = 475
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 82/191 (42%), Gaps = 22/191 (11%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANP----------LYKPRMG 250
LY+ + VG PP + + +DTGSDL W+ C+ ++C + LY P
Sbjct: 101 LYYANVSVGTPPSSFLVALDTGSDLFWLPCNCG-TTCIRDLEDIGVPQSVPLNLYTPNAS 159
Query: 251 NI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS 307
+ D C ++ P C Y+I Y++ + + G L +D LHL E+ +
Sbjct: 160 TTSSSIRCSDKRCFGSKKCSSPS-----SICPYQISYSNSTGTKGTLLQDVLHLATEDEN 214
Query: 308 LT--KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
LT K NV GC Q GL +G+LGL S+PS LA I N C
Sbjct: 215 LTPVKANVTLGCGQKQTGLFQRN-NSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGR 273
Query: 366 NAGGGGYMFLG 376
G G + G
Sbjct: 274 VIGNVGRISFG 284
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 89/199 (44%), Gaps = 21/199 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G YF + VG+PPR Y+ +D+GSD+ W+QC PCS C + ++P++ P +
Sbjct: 135 GEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQ-PCSECYQQSDPVFDPAGSATYAGISCD 193
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
S+C + C +C YE+ Y D S + G LA + L G + N+ G
Sbjct: 194 SSVCDRLDNAG----CND-GRCRYEVSYGDGSYTRGTLALETLTF----GRVLIRNIAIG 244
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG-GGGYMFL 375
C + +G+ + L +S QL Q +CL + G +
Sbjct: 245 CGHMNRGMFIGAAGLLG----LGGGAMSFVGQLGGQ--TGGAFSYCLVSRGTESTGTLEF 298
Query: 376 GHDLVPSWGMAWVPMLDSP 394
G +P G AWVP++ +P
Sbjct: 299 GRGAMPV-GAAWVPLIRNP 316
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 103/430 (23%), Positives = 168/430 (39%), Gaps = 54/430 (12%)
Query: 155 SVVAS--VNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPP 212
SV AS V + R +KL +S++ D + P+ P G + + +G PP
Sbjct: 40 SVTASQFVRAALHRDMHRHNARKLAASSS---DGTVSAPVSPTTVP-GEFLMTLAIGTPP 95
Query: 213 RPYYLDMDTGSDLTWIQCDAPCS-SCAKGANPLYKPRMG---NILPYKDSLCMEIQRNHK 268
P+ DTGSDL W QC APCS C + PLY P + LP SL
Sbjct: 96 LPFLAIADTGSDLIWTQC-APCSRQCFQQPTPLYNPSSSTTFSALPCNSSL--------- 145
Query: 269 PGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK------PNVVFGCAYDQQ 322
G C C Y + Y S V E T GS T P + FGC+
Sbjct: 146 -GLCAPACACMYNMTYG--SGWTYVFQGTE---TFTFGSSTPADQVRVPGIAFGCSNASS 199
Query: 323 GLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT----TNAGGGGYMFLGHD 378
G G++GL R +SL SQL + +CLT TN+ +
Sbjct: 200 GF---NASSASGLVGLGRGSLSLVSQLGAPKF-----SYCLTPYQDTNSTSTLLLGPSAS 251
Query: 379 LVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNL-----GARNSQVGWALFDTGSSY 433
L + ++ P + SP Y+ + I+ G++ L + + G + D+G++
Sbjct: 252 LNDTGVVSSTPFVASPSSIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLIIDSGTTI 311
Query: 434 TYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFG 493
T AY ++ A++ + + L +D + F + S ++TLHF
Sbjct: 312 TMLGNTAYQQVRAAVLSL----VTLPTTDGSAATGLDLCFELPSSTSAPPSMPSMTLHFD 367
Query: 494 SKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVN 553
++ ++ S CL + + ++ ILG+ + ++YD
Sbjct: 368 GADMVLPADNYMM-SLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHILYDVGK 426
Query: 554 KRIGWAKSHC 563
+ + +A + C
Sbjct: 427 ETLSFAPAKC 436
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 92/383 (24%), Positives = 152/383 (39%), Gaps = 64/383 (16%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDSLCM-EI 263
+G PP+ + +DTGS L+WIQC A + P + +ILP LC I
Sbjct: 81 IGTPPQTQPMVLDTGSQLSWIQCHKKQPPTAS-----FDPSLSSTFSILPCTHPLCKPRI 135
Query: 264 QRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQG 323
P C+ + C Y YAD + + G L R++ + S++ P ++ GCA +
Sbjct: 136 PDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFS---RSVSTPPLILGCATES-- 190
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGY-----MFLGHD 378
GILG++ ++S Q I K +C+ G+ +LG++
Sbjct: 191 ------TDPRGILGMNLGRLSFAKQ---SKITK--FSYCVPPRQTRPGFTPTGSFYLGNN 239
Query: 379 LVPSWGMAWVPMLDSPFMEL-------YHTEILKINYGSSPLNLG-----ARNSQVGWAL 426
S G +V M+ S + Y ++ I LN+ A G +
Sbjct: 240 -PSSKGFKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAVFRADAGGSGQTM 298
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCW------RAKFPIRSIVD 480
D+GS +TY +AY ++ A V+ A P L + F V+
Sbjct: 299 IDSGSEFTYLVSEAYDKVRAQ---------VVRAVGPRLKKGYVYGGVADMCFDSVKAVE 349
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDI 540
+ + + F ++V I E L G C+GI ++ S II G+
Sbjct: 350 IGRLIGEMVFEFERGVEVV-----IPKERVLADVGGGVHCVGIGSSDKLGAASNII-GNF 403
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+ V +D V +R+G+ K+ C
Sbjct: 404 HQQNLWVEFDLVRRRVGFGKADC 426
>gi|414888271|tpg|DAA64285.1| TPA: hypothetical protein ZEAMMB73_923514, partial [Zea mays]
Length = 335
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 75/261 (28%), Positives = 117/261 (44%), Gaps = 26/261 (9%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP-RMGNILPYKDSL 259
L++ + +G P + + +DTGSDL W+ CD C +CA +P Y+ + P K S
Sbjct: 87 LHYAVVALGTPNVTFLVALDTGSDLFWVPCD--CINCAPLVSPNYRDLKFDTYSPQKSST 144
Query: 260 -----CMEIQRNHKPGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTIENGS----LT 309
C + + C Y I+Y +D++SS GVL D L+L E G +T
Sbjct: 145 SRKVPCSSNLCDEQSACRSASSSCPYSIQYLSDNTSSTGVLVEDVLYLVTEYGRQPKIVT 204
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI-IKNVVGHCLTTNAG 368
P + FGC Q G L T +G+LGL +S+PS LASQG+ N C +
Sbjct: 205 AP-ITFGCGRTQTGSFLGTAAP-NGLLGLGMDTISVPSLLASQGVAAANSFSMCFAQD-- 260
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G G + G S P+ Y+ I GS ++ A+ D
Sbjct: 261 GHGRINFGD--TGSSDQQETPLNMYKQNPYYNISITGATVGSKSIHTKFN------AIVD 312
Query: 429 TGSSYTYFTKQAYSELIASLK 449
+G+S+T + Y+++ +S+
Sbjct: 313 SGTSFTALSDPMYTQITSSVS 333
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/265 (28%), Positives = 114/265 (43%), Gaps = 34/265 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPC-SSCAKGANPLYKPRMGNI---LPY 255
G Y + VG PP + +DTGSDLTW QC APC ++C PLY P + LP
Sbjct: 94 GAYHMILSVGTPPLAFPAIIDTGSDLTWTQC-APCTTACFAQPTPLYDPARSSTFSKLPC 152
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL----TIENGSLTKP 311
LC + + C C Y+ YA ++ G LA D L + + S +
Sbjct: 153 ASPLCQALPSAFRA--CNA-TGCVYDYRYAVGFTA-GYLAADTLAIGDGDGDGDASSSFA 208
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG 371
V FGC+ G + GI+GL R+ +SL SQ+ G+ + +CL ++A G
Sbjct: 209 GVAFGCSTANGG----DMDGASGIVGLGRSALSLLSQI---GVGR--FSYCLRSDADAGA 259
Query: 372 YMFL-------GHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPL-----NLGARN 419
L D V S + P+ Y+ + I GS+ L G
Sbjct: 260 SPILFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTA 319
Query: 420 SQVGWALFDTGSSYTYFTKQAYSEL 444
+ G + D+G+++TY + Y+ L
Sbjct: 320 AGAGGVIVDSGTTFTYLAEAGYTML 344
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 66/250 (26%), Positives = 109/250 (43%), Gaps = 28/250 (11%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
DG + + G PP+ + L +DTGS +TW QC PC C K + + P L Y
Sbjct: 159 DGNFLVDVAFGTPPQKFTLILDTGSSITWTQCK-PCVRCLKASRRHFDPSAS--LTYSLG 215
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
C+ + Y + Y D S+S+G D +T+E+ + P FGC
Sbjct: 216 SCIPSTVGNT-----------YNMTYGDKSTSVGNYGCDT--MTLEHSDVF-PKFQFGCG 261
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
+ +G + DG+LGL + ++S SQ AS+ K V +CL G +F
Sbjct: 262 RNNEGDFGS---GADGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEEDSIGSLLFGEKA 316
Query: 379 LVPSWGMAWVPMLDSPFME------LYHTEILKINYGSSPLNLGARNSQVGWALFDTGSS 432
S + + +++ P Y ++L I+ G+ LN+ + + D+G+
Sbjct: 317 TSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVFASPGTIIDSGTV 376
Query: 433 YTYFTKQAYS 442
T ++AYS
Sbjct: 377 ITRLPQRAYS 386
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 91/372 (24%), Positives = 140/372 (37%), Gaps = 26/372 (6%)
Query: 194 GNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNIL 253
G G Y + +G P Y + DTGSD TW+QC C + L+ P +
Sbjct: 170 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSST- 228
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
Y + C + + + C Y ++Y D S S+G A D L L+ +
Sbjct: 229 -YANVSCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD---AVKGF 284
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYM 373
FGC +GL + G+LGL R K SLP Q + V HCL + G GY+
Sbjct: 285 RFGCGERNEGL----FGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSTGTGYL 338
Query: 374 -FLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSS 432
F + PML Y+ + I G L++ + D+G+
Sbjct: 339 DFGAGSPAAASARLTTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVFATAGTIVDSGTV 398
Query: 433 YTYFTKQAYSEL-IASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLH 491
T AYS L A +++ G + L C+ F S V + T++L
Sbjct: 399 ITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCY--DFTGMSQVAI----PTVSLL 452
Query: 492 FGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDN 551
F + + G + + +CL + G I+G+ L+ V YD
Sbjct: 453 FQG-----GARLDVDASGIMYAASASQVCLAFAANED--GGDVGIVGNTQLKTFGVAYDI 505
Query: 552 VNKRIGWAKSHC 563
K +G+ C
Sbjct: 506 GKKVVGFYPGVC 517
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 94/366 (25%), Positives = 143/366 (39%), Gaps = 45/366 (12%)
Query: 219 MDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYKDSLC--MEIQRNHKPGYCE 273
+DT S+LTW+QC APC SC +PL+ P +P S C +++ G
Sbjct: 168 VDTASELTWVQC-APCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALQLATGGTSGGAA 226
Query: 274 TCQ-------QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLL 326
CQ C Y + Y D S S GVLA D L L E VFGC QG
Sbjct: 227 ACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSLAGE----VIDGFVFGCGTSNQGPPF 282
Query: 327 NTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHD---LVPS 382
T G++GL R+++SL SQ Q V +CL + G + +G D S
Sbjct: 283 G---GTSGLMGLGRSQLSLVSQTMDQ--FGGVFSYCLPLKESDSSGSLVIGDDSSVYRNS 337
Query: 383 WGMAWVPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQVGW--ALFDTGSSYTYFTKQ 439
+ + M+ P Y + I G + +S G A+ D+G+ T
Sbjct: 338 TPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSGGGGGKAIIDSGTVITSLVPS 397
Query: 440 AYSELIASLKEVSSDGLVLDASDPTLP--VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQ 497
Y+ + A L A P P F + + +V+ L G + +
Sbjct: 398 IYNAVKAEF-------LSQFAEYPQAPGFSILDTCFNMTGLREVQVPSLKLVFDGGVEVE 450
Query: 498 IVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIG 557
+ S Y V S +CL + + T I+G+ + V++D ++G
Sbjct: 451 VDSGGVL-----YFVSSDSSQVCLAMAPLKSEYE--TNIIGNYQQKNLRVIFDTSGSQVG 503
Query: 558 WAKSHC 563
+A+ C
Sbjct: 504 FAQETC 509
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 88/379 (23%), Positives = 160/379 (42%), Gaps = 47/379 (12%)
Query: 191 PLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG 250
P ++ +Y + VG PP ++DTGSDL W QC PC++C P++ P
Sbjct: 50 PYADTLFDYNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQC-MPCTNCYSQYAPIFDPS-- 106
Query: 251 NILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS-LT 309
N +K+ C C Y+I YAD + S G LA + + + +G
Sbjct: 107 NSSTFKEKRCNG-------------NSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFV 153
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGG 369
P GC ++ G++GLS SL +Q+ G ++ +C + G
Sbjct: 154 MPETTIGCGHNSSWF----KPTFSGMVGLSWGPSSLITQMG--GEYPGLMSYCFASQ-GT 206
Query: 370 GGYMFLGHDLVPSWGMAWVPM-LDSPFMELYHTEILKINYGSSPL-NLGAR-NSQVGWAL 426
F + +V G+ M L + LY+ + ++ G + + +G ++ G +
Sbjct: 207 SKINFGTNAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNII 266
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLP--VCWRAKFPIRSIVDVKQF 484
D+G++ TYF +Y L+ + V + +DPT +C+ D
Sbjct: 267 IDSGTTLTYF-PVSYCNLVR--EAVDHYVTAVRTADPTGNDMLCYYT--------DTIDI 315
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
F +T+HF +V K+++ Y+ +G CL I+ + + I G+ +
Sbjct: 316 FPVITMHFSGGADLVLDKYNM----YIETITRGTFCLAIICNNPPQDA---IFGNRAQNN 368
Query: 545 QLVVYDNVNKRIGWAKSHC 563
LV YD+ + + ++ ++C
Sbjct: 369 FLVGYDSSSLLVFFSPTNC 387
>gi|213998842|gb|ACJ60788.1| nucellin [Hordeum cordobense]
Length = 154
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 78/146 (53%), Gaps = 9/146 (6%)
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAG 368
K + FGC Y Q+ + DGILGL K +QL Q +I NV+GHCL++
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSK-- 63
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G G +++G PS G+ WVPM +S F Y + ++ + P+ R + +FD
Sbjct: 64 GKGVLYVGDFNPPSRGVTWVPMKESLF--YYSPGLAELLIDNQPI----RGNPTFEVVFD 117
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSD 454
+GS+YT+ Q Y+E+++ ++ S+
Sbjct: 118 SGSTYTHVPAQIYNEIVSKVRGTLSE 143
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 72/264 (27%), Positives = 106/264 (40%), Gaps = 37/264 (14%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
Y ++ VG PPRP L +DTGSDL W QC APC C PL P + LP
Sbjct: 86 YLVHLAVGTPPRPVALTLDTGSDLVWTQC-APCRDCFDQGIPLLDPAASSTYAALPCGAP 144
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL---TIENGSLTKP---N 312
C + G + C Y Y D S ++G +A D NG + P
Sbjct: 145 RCRALPFTSCGG-----RSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRR 199
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT------- 365
+ FGC + +G+ + GI G R + SLPSQL + +C T+
Sbjct: 200 LTFGCGHFNKGVFQS---NETGIAGFGRGRWSLPSQLNATSF-----SYCFTSMFDSKSS 251
Query: 366 --NAGGGGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQV 422
GG H S + P+ +P LY + I+ G + L + ++
Sbjct: 252 IVTLGGAPAALYSH--AHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPV--PETKF 307
Query: 423 GWALFDTGSSYTYFTKQAYSELIA 446
+ D+G+S T ++ Y + A
Sbjct: 308 RSTIIDSGASITTLPEEVYEAVKA 331
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 92/383 (24%), Positives = 151/383 (39%), Gaps = 63/383 (16%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYKDSLCM-EI 263
+G PP+P + +DTGS L+WIQC A + P + + +LP LC +
Sbjct: 94 IGTPPQPQQMVLDTGSQLSWIQCHNKTPPTAS-----FDPSLSSSFYVLPCTHPLCKPRV 148
Query: 264 QRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQG 323
P C+ + C Y YAD + + G L R++L + S T P ++ GC+ + +
Sbjct: 149 PDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFS---PSQTTPPLILGCSSESR- 204
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT------NAGGGGYMFLGH 377
GILG++ ++S P Q +C+ T N G +LG+
Sbjct: 205 -------DARGILGMNLGRLSFPFQAKVTKF-----SYCVPTRQPANNNNFPTGSFYLGN 252
Query: 378 DLVPSWGMAWVPMLDSPFME--------LYHTEILKINYGSSPLNL-----GARNSQVGW 424
+ S +V ML P + Y + I G LN+ G
Sbjct: 253 N-PNSARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSGQ 311
Query: 425 ALFDTGSSYTYFTKQAY----SELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
+ D+GS +T+ AY E+I L G V +C+ + ++
Sbjct: 312 TMVDSGSEFTFLVDVAYDRVREEIIRVLGPRVKKGYVYGG---VADMCFDG-----NAME 363
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDI 540
+ + + F +IV K E L G C+GI SE ++ I+G+
Sbjct: 364 IGRLLGDVAFEFEKGVEIVVPK-----ERVLADVGGGVHCVGI-GRSERLGAASNIIGNF 417
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+ V +D N+RIG+ + C
Sbjct: 418 HQQNLWVEFDLANRRIGFGVADC 440
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 69/263 (26%), Positives = 116/263 (44%), Gaps = 33/263 (12%)
Query: 206 MIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCME 262
+ VG PP+ + +DTGS+L+W+ C AP + K + ++PR + +P + C
Sbjct: 89 LAVGTPPQNVTMVLDTGSELSWLLC-APAGARNKFSAMSFRPRASSTFAAVPCASAQCRS 147
Query: 263 IQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGC---AY 319
P +C + YAD SSS G LA D + GS FGC A+
Sbjct: 148 RDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAV----GSGPPLRAAFGCMSSAF 203
Query: 320 DQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDL 379
D V + G+LG++R +S SQ +++ +C+ ++ G + LGH
Sbjct: 204 DSS----PDGVASAGLLGMNRGALSFVSQASTRRF-----SYCI-SDRDDAGVLLLGHSD 253
Query: 380 VPS-----WGMAWVPMLDSPFME--LYHTEILKINYGSSPLNLGAR-----NSQVGWALF 427
+P+ + + P L P+ + Y ++L I G L + A ++ G +
Sbjct: 254 LPTFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMV 313
Query: 428 DTGSSYTYFTKQAYSELIASLKE 450
D+G+ +T+ AYS L A
Sbjct: 314 DSGTQFTFLLGDAYSALKAEFTR 336
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 79.3 bits (194), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 99/392 (25%), Positives = 152/392 (38%), Gaps = 64/392 (16%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCM 261
Y +G PP +DTGSDL W QCDAPC C PLY P + Y + C
Sbjct: 100 YLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAP--ARSVTYANVSCG 157
Query: 262 EIQRNHKPGYCETCQQ-------------CDYEIEYADHSSSMGVLARDELHLTIENGSL 308
+ P + + C Y Y D SS+ GVLA +
Sbjct: 158 SRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTF---GAGT 214
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--TN 366
T ++ FGC D G N + G++G+ R +SL SQL G+ K +C T +
Sbjct: 215 TVHDLAFGCGTDNLGGTDN----SSGLVGMGRGPLSLVSQL---GVTK--FSYCFTPFND 265
Query: 367 AGGGGYMFLGHDLVPSWGMAWVPMLDSPF----MELYHTEILKINYGSS-----PLNLGA 417
+FLG S P + SP Y+ + I G + P
Sbjct: 266 TTTSSPLFLGSSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFRL 325
Query: 418 RNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRA---KFP 474
S G + D+G+++T ++A+ L ++ + L + L VC+ A + P
Sbjct: 326 TASGRGGLIIDSGTTFTALEERAFVVLARAVAARVALPLA-SGAHLGLSVCFAAPQGRGP 384
Query: 475 IRSIVDVKQFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVHN 531
VDV + L LHF G+ ++ P V+ + G CLGI+ +
Sbjct: 385 --EAVDVPR----LVLHFDGADMEL--------PRSSAVVEDRVAGVACLGIVSARGMS- 429
Query: 532 GSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+LG + + V YD + + ++C
Sbjct: 430 ----VLGSMQQQNMHVRYDVGRDVLSFEPANC 457
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 149/382 (39%), Gaps = 37/382 (9%)
Query: 192 LRGNIYPD-GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG 250
+R I PD G + + +G PP DTGSDLTW QC PC C + P++ PR
Sbjct: 79 IRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQC-LPCRECFNQSQPIFNPRRS 137
Query: 251 NILPYKDSLCM-EIQRNHKPGYC-ETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSL 308
+ Y+ C + R+ + +C Q C Y Y D S + G LA D++ + GS
Sbjct: 138 S--SYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITI----GSF 191
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT--- 365
P V GC + G G+ G S + V SQ+ + +K +CL T
Sbjct: 192 KLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLV---SQMRTIAGVKPRFSYCLPTFFS 248
Query: 366 NAGGGGYMFLGHDLVPS-WGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV-- 422
NA G + G V S + P++ Y + I+ G S +
Sbjct: 249 NANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTN 308
Query: 423 -GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDV 481
G + D+G++ T + Y + ++L V V D S L +C+ A + D+
Sbjct: 309 HGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSG-ILELCYSAG----QVDDL 363
Query: 482 KQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
+T HF + + P CL ++V I G+++
Sbjct: 364 N--IPIITAHFAGGADV-----KLLPVNTFAPVADNVTCLTFAPATQV-----AIFGNLA 411
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
V YD NKR+ + C
Sbjct: 412 QINFEVGYDLGNKRLSFEPKLC 433
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 79.0 bits (193), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 157/374 (41%), Gaps = 50/374 (13%)
Query: 213 RPYYLDMDTGSDLTWIQCDAPCSSCAK---GANPLYKPRMGN---ILPYKDSLCMEIQRN 266
+P L +DTGSDL W QC S+ A G+ P+Y P + LP D LC E Q +
Sbjct: 24 QPRKLIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQFS 83
Query: 267 HKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLL 326
K C + +C YE Y ++++GVLA + T + FGC L
Sbjct: 84 FK--NCTSKNRCVYEDVYGS-AAAVGVLASET--FTFGARRAVSLRLGFGCG----ALSA 134
Query: 327 NTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLV------ 380
+L+ GILGLS +SL +QL Q +CLT A L +
Sbjct: 135 GSLIGATGILGLSPESLSLITQLKIQRF-----SYCLTPFADKKTSPLLFGAMADLSRHK 189
Query: 381 PSWGMAWVPMLDSPFMEL-YHTEILKINYGSSPL-----NLGARNSQVGWALFDTGSSYT 434
+ + ++ +P + Y+ ++ I+ G L +L R G + D+GS+
Sbjct: 190 TTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVA 249
Query: 435 YFTKQAYSELIASLKEVSSDGLVLDASDPTL---PVCWRAKFPIRSIVDVKQFFKT--LT 489
Y + A+ ++KE D + L ++ T+ +C+ P R+ + + L
Sbjct: 250 YLVEAAFE----AVKEAVMDVVRLPVANRTVEDYELCF--VLPRRTAAAAMEAVQVPPLV 303
Query: 490 LHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVY 549
LHF +V + + Y + G +CL + G I+G++ + V++
Sbjct: 304 LHFDGGAAMV-----LPRDNYFQEPRAGLMCLAV--GKTTDGSGVSIIGNVQQQNMHVLF 356
Query: 550 DNVNKRIGWAKSHC 563
D + + +A + C
Sbjct: 357 DVQHHKFSFAPTQC 370
>gi|213998836|gb|ACJ60785.1| nucellin [Hordeum bogdanii]
Length = 154
Score = 79.0 bits (193), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 78/146 (53%), Gaps = 9/146 (6%)
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAG 368
K + FGC Y Q+ + DGILGL K +QL Q +I NV+GHCL++
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSK-- 63
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G G +++G PS G+ WVPM +S F Y + ++ + P+ + A+FD
Sbjct: 64 GKGVLYVGDFNPPSRGVTWVPMRESLF--YYSPGLAELLIDNQPIG----GNPTFEAVFD 117
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSD 454
+GS+YT+ Q Y+E+++ ++ S+
Sbjct: 118 SGSTYTHVPAQIYNEIVSKVRGTLSE 143
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 78.6 bits (192), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 71/259 (27%), Positives = 117/259 (45%), Gaps = 20/259 (7%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPC---SSCAKGANPLYKPRMGNILPY--- 255
Y + +G+P + +DTGSD++W+QC+ PC S C A L+ P +
Sbjct: 135 YVISVGLGSPAMTQRVVIDTGSDVSWVQCE-PCPAPSPCHAHAGALFDPAASSTYAAFNC 193
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
+ C ++ + + C+ +C Y ++Y D S++ G + D L L+ GS F
Sbjct: 194 SAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLS---GSDVVRGFQF 250
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC++ + G ++ KTDG++GL SL SQ A++ +CL G++ L
Sbjct: 251 GCSHAELGAGMDD--KTDGLIGLGGDAQSLVSQTAAR--YGKSFSYCLPATPASSGFLTL 306
Query: 376 GHDLVPSWG----MAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQVGWALFDTG 430
G G A PML S + Y+ L+ I G L L G +L D+G
Sbjct: 307 GAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAAG-SLVDSG 365
Query: 431 SSYTYFTKQAYSELIASLK 449
+ T AY+ L ++ +
Sbjct: 366 TVITRLPPAAYAALSSAFR 384
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 78.6 bits (192), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 115/264 (43%), Gaps = 34/264 (12%)
Query: 206 MIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPL-YKPRMG---NILPYKDSLCM 261
+ VG PP+ + +DTGS+L+W+ C G + L ++PR +P + C
Sbjct: 70 LAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCDSAQCR 129
Query: 262 EIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGC---A 318
P +QC + YAD SSS G LA + T+ G + FGC A
Sbjct: 130 SRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEV--FTVGQGPPLR--AAFGCMATA 185
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
+D V T G+LG++R +S SQ +++ +C+ ++ G + LGH
Sbjct: 186 FDTS----PDGVATAGLLGMNRGALSFVSQASTRRF-----SYCI-SDRDDAGVLLLGHS 235
Query: 379 LVPSWGMAWVPMLDSPFMEL-------YHTEILKINYGSSPLNLGAR-----NSQVGWAL 426
+P + + P+ P M L Y ++L I G PL + A ++ G +
Sbjct: 236 DLPFLPLNYTPLYQ-PAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTM 294
Query: 427 FDTGSSYTYFTKQAYSELIASLKE 450
D+G+ +T+ AYS L A
Sbjct: 295 VDSGTQFTFLLGDAYSALKAEFSR 318
>gi|213998806|gb|ACJ60770.1| nucellin [Hordeum flexuosum]
Length = 136
Score = 78.6 bits (192), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 48/138 (34%), Positives = 75/138 (54%), Gaps = 9/138 (6%)
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAG 368
K + FGC Y Q+ + DGILGL K +QL Q +I NV+GHCL++
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSLVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSK-- 63
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G G +++G PS G+ WVPM +S F Y + ++ + P+ R + A+FD
Sbjct: 64 GKGVLYVGDFNPPSRGVTWVPMKESLF--YYSPGLAELLIDNQPI----RGNPTFEAVFD 117
Query: 429 TGSSYTYFTKQAYSELIA 446
+GS+YT+ Q Y+E+++
Sbjct: 118 SGSTYTHVPAQIYNEIVS 135
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 78.6 bits (192), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 115/264 (43%), Gaps = 34/264 (12%)
Query: 206 MIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPL-YKPRMG---NILPYKDSLCM 261
+ VG PP+ + +DTGS+L+W+ C G + L ++PR +P + C
Sbjct: 69 LAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCGSAQCR 128
Query: 262 EIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGC---A 318
P +QC + YAD SSS G LA + T+ G + FGC A
Sbjct: 129 SRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEV--FTVGQGPPLR--AAFGCMATA 184
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
+D V T G+LG++R +S SQ +++ +C+ ++ G + LGH
Sbjct: 185 FDTS----PDGVATAGLLGMNRGALSFVSQASTRRF-----SYCI-SDRDDAGVLLLGHS 234
Query: 379 LVPSWGMAWVPMLDSPFMEL-------YHTEILKINYGSSPLNLGAR-----NSQVGWAL 426
+P + + P+ P M L Y ++L I G PL + A ++ G +
Sbjct: 235 DLPFLPLNYTPLYQ-PAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTM 293
Query: 427 FDTGSSYTYFTKQAYSELIASLKE 450
D+G+ +T+ AYS L A
Sbjct: 294 VDSGTQFTFLLGDAYSALKAEFSR 317
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 78.6 bits (192), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 98/397 (24%), Positives = 158/397 (39%), Gaps = 66/397 (16%)
Query: 200 GLYFTYMIVGNP-PRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
G Y + +G P P+ L MDTGSDL W QC PC C PL+ P + + ++
Sbjct: 85 GEYLIHFNIGTPRPQRVALTMDTGSDLVWTQC-TPCPVCFDQPFPLFDPSVSST--FRAV 141
Query: 259 LCME-IQRNHKPGYCETCQ----QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNV 313
C + I R C +C Y Y D S + G + +D NG P
Sbjct: 142 ACPDPICRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMSPNGEGAPPVA 201
Query: 314 V----FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT----T 365
V FGC G+ + GI G R +SLPSQL +CLT T
Sbjct: 202 VSGLAFGCGDYNTGVFAS---NESGIAGFGRGPLSLPSQLR-----VGRFSYCLTSHDET 253
Query: 366 NAGGGGYMFLGHDLVPSWGMAW--------VPMLDSP-FMELYHTEILKINYGSSPLNLG 416
+ +FLG P G+ P++ SP F Y+ + I G + L +
Sbjct: 254 ESNKTSAVFLG---TPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLPVD 310
Query: 417 A-----RNSQVGWALFDTGSSYTYFT----KQAYSELIASLKEVSSDGLVLDASDPTLPV 467
+ + G + D+G+ T F +Q +E +A L D + S+ +
Sbjct: 311 SSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLPLPRYD----NTSEVGNLL 366
Query: 468 CWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYL-VISKKGNICLGILDG 526
C++ + + K F + S + E Y+ + G +CL +++G
Sbjct: 367 CFQRPKGGKQVPVPKLIF-----------HLASADMDLPRENYIPEDTDSGVMCL-MING 414
Query: 527 SEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+EV +++G+ + +VYD N ++ +A + C
Sbjct: 415 AEV---DMVLIGNFQQQNMHIVYDVENSKLLFASAQC 448
>gi|308810200|ref|XP_003082409.1| Aspartyl protease (ISS) [Ostreococcus tauri]
gi|116060877|emb|CAL57355.1| Aspartyl protease (ISS) [Ostreococcus tauri]
Length = 455
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 103/392 (26%), Positives = 162/392 (41%), Gaps = 82/392 (20%)
Query: 215 YYLDMDTGSDLTWIQCDAPCSS-----CAKGANPLYKPRMGNILPY--------KDSLCM 261
+ L +DTGS LT++ C P S C +P Y R+ + + D+ C
Sbjct: 33 FDLFVDTGSPLTYLAC-WPASREFVDYCGVHEHPYYDARVSDDFRFLNATTNAEDDAFCR 91
Query: 262 EIQR----NHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGC 317
+ + G CE + I Y D+S+++GV+ D + + E L ++FGC
Sbjct: 92 RASSLFILDDESGACE------FGIPYMDNSTAIGVMVEDVMTVGDE---LAGAKMIFGC 142
Query: 318 AYDQQGLLLNT---LVKTDGILGLSRAKVSLPSQLASQGII-KNVVGHCLTTNAGGGGYM 373
G L+ + DG+ G R + + +QLA G+I +V G C + AG M
Sbjct: 143 -----GCLVEANGEADRYDGMAGFGRGETTFHTQLARTGVIDADVFGFC-SEGAGTNTAM 196
Query: 374 F------LGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR---NSQVGW 424
G DL P ++W ML + + + LGA+ S +
Sbjct: 197 LSLGRYDFGRDLSP---LSWTRMLGDDDLAVR----------TMSWKLGAKIIAGSTNVY 243
Query: 425 ALFDTGSS--------YTYFTKQAYSELIASLKEVSSDGLVL-DASDPTLPVCWRAKFPI 475
+ D+G++ Y F K+ ++ L SD V D S T C+ +K
Sbjct: 244 TVLDSGTTLVVLPPVMYGDFMKELLDRIV-DLNATYSDVHVFEDYSFSTF--CFYSKSGA 300
Query: 476 RSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVIS--KKGNICLGILDGSEVHNGS 533
+ ++ LT+ + +V + PE YL S C+GI+ G+E
Sbjct: 301 LTNDIIRDALPKLTITYDPDIALV-----LPPENYLFSSWIVPREHCIGIMKGAE----G 351
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
IILG +LR V YD N+RIG A +HC N
Sbjct: 352 QIILGQQTLRNTFVEYDLENERIGLAVTHCEN 383
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 95/417 (22%), Positives = 171/417 (41%), Gaps = 63/417 (15%)
Query: 189 IFPLRGNIYP------DGLYFTY-------MIVGNPPRPYYLDMDTGSDLTWIQCDA--- 232
+ PL+ I P D L+F + + VG PP+ + +DTGS+L+W++C+
Sbjct: 47 VLPLKTRITPTDHRPTDKLHFHHNVTLTVTLTVGTPPQNISMVIDTGSELSWLRCNRSSN 106
Query: 233 --PCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHK-PGYCETCQQCDYEIEYADHSS 289
P ++ + Y P +P C R+ P C++ + C + YAD SS
Sbjct: 107 PNPVNNFDPTRSSSYSP-----IPCSSPTCRTRTRDFLIPASCDSDKLCHATLSYADASS 161
Query: 290 SMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQL 349
S G LA + H S N++FGC G KT G+LG++R +S SQ+
Sbjct: 162 SEGNLAAEIFHF---GNSTNDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQM 218
Query: 350 ASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWV-PMLDSPFMEL---------- 398
G K +C++ G++ LG W+ P+ +P + +
Sbjct: 219 ---GFPK--FSYCISGTDDFPGFLLLGDS-----NFTWLTPLNYTPLIRISTPLPYFDRV 268
Query: 399 -YHTEI--LKINYGSSPLN---LGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVS 452
Y ++ +K+N P+ L ++ G + D+G+ +T+ Y+ L +
Sbjct: 269 AYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLN-R 327
Query: 453 SDGLVLDASDP------TLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHIS 506
++G++ DP T+ +C+R P+R + T++L F VS + +
Sbjct: 328 TNGILTVYEDPDFVFQGTMDLCYRIS-PVRIRSGILHRLPTVSLVFEGAEIAVSGQPLLY 386
Query: 507 PEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+L + C S++ ++G + + +D RIG A C
Sbjct: 387 RVPHLTVGNDSVYCF-TFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVEC 442
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 94/388 (24%), Positives = 160/388 (41%), Gaps = 51/388 (13%)
Query: 202 YFTYMIVGNP-PRPYYLDMDTGSDLTWIQCDAPCSSCAK-GANPLYKPRMGNILPYKDSL 259
YF + +G P P+ + L DTGSDLTW+ C+ C SC K +P R + ++
Sbjct: 119 YFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRANDSSSFRTIP 178
Query: 260 C------MEIQRNHKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKP- 311
C +E+Q C C ++ Y + ++GV A + + + + + +
Sbjct: 179 CSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTVGLNDHKKIRLF 238
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGG 371
+V+ GC T DG++GL K SL +LA I N +CL +
Sbjct: 239 DVLIGCTES----FNETNGFPDGVMGLGYRKHSLALRLAE--IFGNKFSYCLVDHLSSSN 292
Query: 372 YM-FLGHDLVPSW---GMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA---RNSQVGW 424
+ FL +P M +L Y + I+ G S L++ + + VG
Sbjct: 293 HKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSDIWNVTGVGG 352
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPV--------CWRAKFPIR 476
+ D+G+S T +AY +++ +LK + D +P+ C+ K R
Sbjct: 353 MIVDSGTSLTMLAGEAYDKVVDALKP------IFDKHKKVVPIELPELNNFCFEDKGFDR 406
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
+ V L +HF F + Y++ +G CLGI+ GS+ I
Sbjct: 407 AAV------PRLLIHFAD-----GAIFKPPVKSYIIDVAEGIKCLGIIKAD--FPGSS-I 452
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHCM 564
LG++ + L YD ++G+ S C+
Sbjct: 453 LGNVMQQNHLWEYDLGRGKLGFGPSSCI 480
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 97/375 (25%), Positives = 153/375 (40%), Gaps = 45/375 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + +G PP P DTGS+L W QC PC C +PL+ P+ + YKD
Sbjct: 92 GEYLMNLSLGTPPSPIMAVADTGSNLIWTQC-KPCDDCYTQVDPLFDPKASST--YKDVS 148
Query: 260 CMEIQRN--HKPGYCETCQQ-CDYEIEYADHSSSMGVLARDELHL-TIENGSLTKPNVVF 315
C Q C T + C Y + YAD S +MG A D L L + +N + N++
Sbjct: 149 CSSSQCTALENQASCSTEDKTCSYLVSYADGSYTMGKFAVDTLTLGSTDNRPVQLKNIII 208
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC + N K+ G++GL VSL QL I +CL +
Sbjct: 209 GCGQNNAVTFRN---KSSGVVGLGGGAVSLIKQLGDS--IDGKFSYCLVPENDQTSKINF 263
Query: 376 GHDLVPSW-GMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV-GWALFDTGSSY 433
G + V S G P++ Y+ + I+ GS N+ +S + G + D+G++
Sbjct: 264 GTNAVVSGPGTVSTPLVVKSRDTFYYLTLKSISVGSK--NMQTPDSNIKGNMVIDSGTTL 321
Query: 434 TYFTKQAYSEL---IASL--KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
T + Y E+ +ASL + S D + + +C+ A + +
Sbjct: 322 TLLPVKYYIEIENAVASLINADKSKDERIGSS------LCYNATADLN--------IPVI 367
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
T+HF + P + +CL S NG I G+++ + LV
Sbjct: 368 TMHFE------GADVKLYPYNSFFKVTEDLVCLA-FGMSFYRNG---IYGNVAQKNFLVG 417
Query: 549 YDNVNKRIGWAKSHC 563
YD +K + + + C
Sbjct: 418 YDTASKTMSFKPTDC 432
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 99/403 (24%), Positives = 160/403 (39%), Gaps = 97/403 (24%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQC-------------------------DAPCSSCAKGAN 242
+G P + L +DTGS L+WIQC D PCS +
Sbjct: 87 IGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCS------H 140
Query: 243 PLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLT 302
PL KPR+ + P C++ + C Y YAD + + G L +++ T
Sbjct: 141 PLCKPRIPDFT--------------LPTSCDSNRLCHYSYFYADGTFAEGNLVKEK--FT 184
Query: 303 IENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC 362
N S T P ++ GCA + +T VK GILG++ ++S SQ +C
Sbjct: 185 FSN-SQTTPPLILGCAKE------STDVK--GILGMNLGRLSFISQAKISKF-----SYC 230
Query: 363 LTTNA-----GGGGYMFLGHDLVPSWGMAWVPMLDSPFME--------LYHTEILKINYG 409
+ T + G +LG + S G +V +L P + Y +L I G
Sbjct: 231 IPTRSNRPGLASTGSFYLGEN-PNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIG 289
Query: 410 SSPLNLGAR-----NSQVGWALFDTGSSYTYFTKQAY----SELIASLKEVSSDGLVLDA 460
LN+ + G + D+GS +T+ AY E++ + G V +
Sbjct: 290 QKRLNIPSSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGS 349
Query: 461 SDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNIC 520
T +C+ + + + L FG +I+ K + LV G C
Sbjct: 350 ---TADMCFDGNHQMV----IGRLIGDLVFEFGRGVEILVEK-----QRLLVNVGGGIHC 397
Query: 521 LGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+GI S + S II G++ + V +D N+R+G++K+ C
Sbjct: 398 VGIGRSSMLGAASNII-GNVHQQNLWVEFDVANRRVGFSKAEC 439
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 107/389 (27%), Positives = 165/389 (42%), Gaps = 78/389 (20%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS-SCAKGANPLYKPRMGNILPYKDS 258
G+Y++ + +G+PP+ + L MDTGSDLTW++CD PCS C+ + L YK
Sbjct: 122 GVYYSSITLGSPPKDFSLVMDTGSDLTWVRCD-PCSPDCSSTFDRLASN------TYKAL 174
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
C + R P ++ + + M A DEL P VFGC
Sbjct: 175 TCADDLR--LPVLLRLWRRLFHSGRSLRDTLKMAGAASDELE--------EFPGFVFGC- 223
Query: 319 YDQQGLLLNTLVKTD-GILGLSRAKVSLPSQLASQGIIKNVVGHCL----TTNAGGGGYM 373
G LL L+ + GIL LS +S PSQ+ + N +CL N+ M
Sbjct: 224 ----GSLLKGLISGEVGILALSPGSLSFPSQIGEK--YGNKFSYCLLRQTAQNSLKKSPM 277
Query: 374 FLGHDLV----PSWG----MAWVPMLDSPFMELYHTEILK-INYGSSPLNLGAR---NSQ 421
G V P G + + P+ +S +Y+T L I+ G+ L+L N Q
Sbjct: 278 VFGEAAVELKEPGSGKPQELQYTPIGES---SIYYTVRLDGISVGNQRLDLSPSTFLNGQ 334
Query: 422 VGWALFDTGSSYTYF-------TKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFP 474
+FD+G++ T KQ+ + +++ + V+ GL DA C+R P
Sbjct: 335 DKPTIFDSGTTLTMLPSGVCDSIKQSLASMVSGAEFVAIKGL--DA-------CFRV--P 383
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGST 534
S Q +T HF V+ P Y VI CL + +EV
Sbjct: 384 PSS----GQGLPDITFHFNGGADFVT-----RPSNY-VIDLGSLQCLIFVPTNEVS---- 429
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I G++ + V++D N+RIG+ ++ C
Sbjct: 430 -IFGNLQQQDFFVLHDMDNRRIGFKETDC 457
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 125/314 (39%), Gaps = 58/314 (18%)
Query: 201 LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKD 257
L+ +G P P MDTGS++ W++C APC C + PL P + LP +
Sbjct: 98 LFLVNFSMGQPATPQLAIMDTGSNILWVRC-APCKRCTQQNGPLLDPSKSSTYASLPCTN 156
Query: 258 SLCMEIQRNHKP-GYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGSLTKPNVVF 315
++C ++ P YC QC Y + YA SS GVLA ++L + + G P+VVF
Sbjct: 157 TMC-----HYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVF 211
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC+++ + G+ GL + S +++ S+ +C L
Sbjct: 212 GCSHENGDYKDR---RFTGVFGLGKGITSFVTRMGSK------FSYC------------L 250
Query: 376 GHDLVPSWGMAWVPMLDSPFMELYHTEILKIN-----------YGSSPLNLGARNSQVGW 424
G+ P +G + + E Y T + +N G L++ + +
Sbjct: 251 GNIADPHYGYNQLVFGEKANFEGYSTPLKVVNGHYYVTLEGISVGEKRLDIDSTAFSMKG 310
Query: 425 ----ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
AL D+G++ T+ + A+ L D V D L WR F
Sbjct: 311 NEKSALIDSGTALTWLAESAFRAL---------DNEVRQLLDGVLMPFWRGSFACYKGTV 361
Query: 481 VKQF--FKTLTLHF 492
+ F +T HF
Sbjct: 362 SQDLIGFPVVTFHF 375
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 100/425 (23%), Positives = 164/425 (38%), Gaps = 68/425 (16%)
Query: 175 KLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPC 234
+ S+ V + I P G Y + +G PP + +DT SDL W QC PC
Sbjct: 68 EAASARKAVVAETPIMPAGGE------YLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PC 120
Query: 235 SSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSM 291
+ C +P++ PR+ + LP C E+ H+ G+ + + C Y Y+ ++++
Sbjct: 121 TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELD-VHRCGH-DDDESCQYTYTYSGNATTE 178
Query: 292 GVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLAS 351
G LA D+L + G V FGC+ G + G++GL R +SL SQL+
Sbjct: 179 GTLAVDKLVI----GEDAFRGVAFGCSTSSTGGAPPP--QASGVVGLGRGPLSLVSQLSV 232
Query: 352 QGIIKNVVGHCLTTNAGG-GGYMFLGHDLVPSWGMA---WVPMLDSP-FMELYHTEILKI 406
+ +CL A G + LG D + VPM P + Y+ + +
Sbjct: 233 RRF-----AYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGL 287
Query: 407 NYGSSPLNL-----------------------GARNSQVGWA-----LFDTGSSYTYFTK 438
G ++L A VG A + D S+ T+
Sbjct: 288 LIGDRAMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEA 347
Query: 439 QAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQI 498
Y EL+ L EV S L +C+ P D + + + L F +W
Sbjct: 348 SLYDELVNDL-EVEIRLPRGTGSSLGLDLCF--ILPDGVAFD-RVYVPAVALAFDGRWLR 403
Query: 499 VSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGW 558
+ + + + G +CL + GS ILG+ + V+Y+ R+ +
Sbjct: 404 LDKARLFAED-----RESGMMCLMV---GRAEAGSVSILGNFQQQNMQVLYNLRRGRVTF 455
Query: 559 AKSHC 563
+S C
Sbjct: 456 VQSPC 460
>gi|213998830|gb|ACJ60782.1| nucellin [Hordeum pusillum]
Length = 147
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 75/138 (54%), Gaps = 9/138 (6%)
Query: 312 NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAGGG 370
+ FGC Y Q+ + DGILGL K +QL Q +I NV+GHCL++ G
Sbjct: 1 KIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSK--GK 58
Query: 371 GYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTG 430
G +++G PS G+ WVPM +S F Y + ++ + P+ R + A+FD+G
Sbjct: 59 GVLYVGDFNPPSRGVTWVPMKESLFY--YSPGLAELLIDNQPI----RGNPTFEAVFDSG 112
Query: 431 SSYTYFTKQAYSELIASL 448
S+YT+ Q Y+E+++ +
Sbjct: 113 STYTHVPAQIYNEIVSKV 130
>gi|213998824|gb|ACJ60779.1| nucellin [Hordeum chilense]
Length = 140
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 76/143 (53%), Gaps = 9/143 (6%)
Query: 313 VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAGGGG 371
+ FGC Y Q+ + DGILGL K +QL Q +I NV+GHCL++ G G
Sbjct: 1 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSK--GKG 58
Query: 372 YMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGS 431
++ G PS G+ WVPM +S Y + ++ + P+ R + A+FD+GS
Sbjct: 59 VLYFGDFNPPSRGVTWVPMKES--XXYYSPGLAELLIDNQPI----RGNPTFEAVFDSGS 112
Query: 432 SYTYFTKQAYSELIASLKEVSSD 454
+YT+ Q Y+E+++ ++ S+
Sbjct: 113 TYTHVPAQIYNEIVSKVRGTLSE 135
>gi|66817422|ref|XP_642564.1| hypothetical protein DDB_G0277581 [Dictyostelium discoideum AX4]
gi|60470632|gb|EAL68608.1| hypothetical protein DDB_G0277581 [Dictyostelium discoideum AX4]
Length = 492
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 95/377 (25%), Positives = 162/377 (42%), Gaps = 73/377 (19%)
Query: 213 RPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYKDSLCM-------- 261
+ + L +DTGS LT I C+SC K P+Y P + + ++P C+
Sbjct: 107 QKFILQVDTGSTLTAIPLKG-CNSC-KDNRPVYDPALSSSSQLIPCSSDKCLGSGSASPS 164
Query: 262 -EIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYD 320
++ +N K CD+ I Y D S G + DE+ + S + FG +
Sbjct: 165 CKLHQNAK-------STCDFIILYGDGSKIKGKVFSDEITV-----SGVSSTIYFGANVE 212
Query: 321 QQGLLLNTLVKTDGILGLSRAKVS-------LPSQLASQGIIKNVVGHCLTTNAGGGGYM 373
+ G + DGI+GL R + S + S IKN+ G L + G GY+
Sbjct: 213 EVGAF--EYPRADGIMGLGRTSNNKNLVPTIFDSMVRSNSSIKNIFGIYLDYH--GQGYL 268
Query: 374 FLG----HDLVPSWGMAWVPMLDS-PFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
LG H + S + + P+ + PF + T +++ S P N +G + D
Sbjct: 269 SLGKINHHYYIGS--IQYTPIQPAGPFYAIKPTS-FRVDNTSFPAN------SMGQVIVD 319
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTL 488
+G+S T + Y LI ++ + + S P++ F R + ++ F T
Sbjct: 320 SGTSDLILTSRVYDHLIQYFRKHYCH-IDMVCSYPSI-------FSSRVCFEKEEDFATF 371
Query: 489 T-LHFGSKWQIVSTKFHISPEGYLVISKKGN-----ICLGILDGSEVHNGSTIILGDISL 542
LHFG + + + I P+ Y++ ++ C GI G ++ ILGD+ +
Sbjct: 372 PWLHFGFEGGV---RIAIPPKNYMIKTESNQQGVYGYCWGIDRGDDM-----TILGDVFM 423
Query: 543 RGQLVVYDNVNKRIGWA 559
RG ++DN+ R+G+A
Sbjct: 424 RGYYTIFDNIENRVGFA 440
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 100/425 (23%), Positives = 164/425 (38%), Gaps = 68/425 (16%)
Query: 175 KLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPC 234
+ S+ V + I P G Y + +G PP + +DT SDL W QC PC
Sbjct: 68 EAASARKAVVAETPIMPAGGE------YLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PC 120
Query: 235 SSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSM 291
+ C +P++ PR+ + LP C E+ H+ G+ + + C Y Y+ ++++
Sbjct: 121 TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELD-VHRCGH-DDDESCQYTYTYSGNATTE 178
Query: 292 GVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLAS 351
G LA D+L + G V FGC+ G + G++GL R +SL SQL+
Sbjct: 179 GTLAVDKLVI----GEDAFRGVAFGCSTSSTGGAPPP--QASGVVGLGRGPLSLVSQLSV 232
Query: 352 QGIIKNVVGHCLTTNAGG-GGYMFLGHDLVPSWGMA---WVPMLDSP-FMELYHTEILKI 406
+ +CL A G + LG D + VPM P + Y+ + +
Sbjct: 233 RRF-----AYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGL 287
Query: 407 NYGSSPLNL-----------------------GARNSQVGWA-----LFDTGSSYTYFTK 438
G ++L A VG A + D S+ T+
Sbjct: 288 LIGDRTMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEA 347
Query: 439 QAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQI 498
Y EL+ L EV S L +C+ P D + + + L F +W
Sbjct: 348 SLYDELVNDL-EVEIRLPRGTGSSLGLDLCF--ILPDGVAFD-RVYVPAVALAFDGRWLR 403
Query: 499 VSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGW 558
+ + + + G +CL + GS ILG+ + V+Y+ R+ +
Sbjct: 404 LDKARLFAED-----RESGMMCLMV---GRAEAGSVSILGNFQQQNMQVLYNLRRGRVTF 455
Query: 559 AKSHC 563
+S C
Sbjct: 456 VQSPC 460
>gi|213998810|gb|ACJ60772.1| nucellin [Hordeum comosum]
Length = 154
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 78/146 (53%), Gaps = 9/146 (6%)
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAG 368
K + FGC Y Q+ + DGILGL K +QL Q +I NV+GHCL++
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSLVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSK-- 63
Query: 369 GGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFD 428
G G +++G PS G+ WVPM +S F Y + ++ + P+ R + A+FD
Sbjct: 64 GKGVLYVGDFNPPSRGVTWVPMKESLF--YYSPGLAELLIDNQPI----RGNPTFEAVFD 117
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSD 454
+ S+YT+ Q Y+E+++ ++ S+
Sbjct: 118 SDSTYTHVPAQIYNEIVSKVRGTLSE 143
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 65/256 (25%), Positives = 117/256 (45%), Gaps = 35/256 (13%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNH 267
+G PP P + MDTGSD+ W+ C PC++C L+ P M + LC
Sbjct: 107 IGQPPIPQLVVMDTGSDILWVMC-TPCTNCDNHLGLLFDPSMSSTF---SPLC------K 156
Query: 268 KPGYCETCQQCD---YEIEYADHSSSMGVLARDELHL-TIENGSLTKPNVVFGCAYDQQG 323
P + C +CD + + YAD+S++ G+ RD + T + G+ P+V+FGC ++
Sbjct: 157 TPCDFKGCSRCDPIPFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLFGCGHN--- 213
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSW 383
+ +T +GILGL+ SL +++ + +C+ A + H L+
Sbjct: 214 IGQDTDPGHNGILGLNNGPDSLATKIGQK------FSYCIGDLADP---YYNYHQLILGE 264
Query: 384 GMAWVPMLDSPFM---ELYHTEILKINYGSSPLNLG-----ARNSQVGWALFDTGSSYTY 435
G A + +PF Y+ + I+ G L++ + ++ G + DTGS+ T+
Sbjct: 265 G-ADLEGYSTPFEVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTITF 323
Query: 436 FTKQAYSELIASLKEV 451
+ L ++ +
Sbjct: 324 LVDSVHRLLSKEVRNL 339
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 92/379 (24%), Positives = 150/379 (39%), Gaps = 42/379 (11%)
Query: 202 YFTYMIVGNPP-RPYYLDMDTGSDLTWIQCDAPC-SSCAKGANPLYKPRMGNIL-PY--K 256
Y + +G+PP + + +DTGSD++W++C PC C +PL+ P + + P+
Sbjct: 140 YVITVRLGSPPGKSQTMLIDTGSDISWVRCK-PCWQQCRPQVDPLFDPSLSSTYSPFSCS 198
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHS-SSMGVLARDELHLTIENGSLTKPNVVF 315
+ C ++ + C + QC Y Y D S + G + D L L + ++ F
Sbjct: 199 SAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGTYSSDTLALGSNSNTVVVSKFRF 258
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC++ + G+ T L SL SQ A +CL G++ L
Sbjct: 259 GCSHAETGITGLTAGLMG----LGGGAQSLVSQTAGT-FGTTAFSYCLPPTPSSSGFLTL 313
Query: 376 GHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYT 434
G S G PML S + Y + I G L++ G + D+G+ T
Sbjct: 314 GAAGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTVFSAGM-IMDSGTVVT 372
Query: 435 YFTKQAYSELIASLKE---------VSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFF 485
AYS L ++ K S+ G LD F + V
Sbjct: 373 RLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTC-----------FDMSGQSSVS--M 419
Query: 486 KTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNI-CLGILDGSEVHNGSTIILGDISLRG 544
T+ L F V ++ G L+ + +I CL + S+ +GST I+G++ R
Sbjct: 420 PTVALVFSGAGGAV---VNLDASGILLQMETSSIFCLAFVATSD--DGSTGIIGNVQQRT 474
Query: 545 QLVVYDNVNKRIGWAKSHC 563
V+YD +G+ C
Sbjct: 475 FQVLYDVAGGAVGFKAGAC 493
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/253 (26%), Positives = 108/253 (42%), Gaps = 18/253 (7%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP-RMGNILPYKDSLC 260
Y + +G P + +DTGSD++W+ C A G++ + P + P+ S
Sbjct: 125 YVITVSIGTPAMTQAVMIDTGSDVSWVHCHA---RAGAGSSLFFDPGKSSTYTPFSCSSA 181
Query: 261 MEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYD 320
+ + C C Y + Y D S++ G D L L N + N FGC+
Sbjct: 182 ACTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLAL---NSTEKVENFQFGCSET 238
Query: 321 Q---QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGH 377
+GL + +TDG++GL SL SQ A+ + +CL G++ LG
Sbjct: 239 SDPGEGLDED---QTDGLMGLGGGAPSLVSQTAA--TYGSAFSYCLPATTRSSGFLTLGA 293
Query: 378 DLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQVGWALFDTGSSYTYF 436
S G PM S ++ IL+ IN G P+ + G ++ D+G+ T
Sbjct: 294 STGTS-GFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAAG-SIMDSGTIITRL 351
Query: 437 TKQAYSELIASLK 449
+AYS L A+ +
Sbjct: 352 PPRAYSALSAAFR 364
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 107/441 (24%), Positives = 166/441 (37%), Gaps = 76/441 (17%)
Query: 185 DSSSIFPLRGNIYP-DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCD-----APCSSCA 238
D + PL Y G YF VG P RP+ L DTGSDLTW++C AP + A
Sbjct: 37 DEAFAMPLSSGAYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPA 96
Query: 239 KGANPLYKPRMGNILPYKDSLCMEIQRNHKPGY----------CETCQQ----------- 277
G N Y N + R +P +TC
Sbjct: 97 PGYNYGYGAPASNDSSSVSAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPT 156
Query: 278 ----CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPN-------VVFGCAYDQQGLLL 326
C YE Y D S++ G + D + + K VV GC G
Sbjct: 157 PGSPCAYEYRYKDGSAARGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTG--- 213
Query: 327 NTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG---GGGYMFLGHDLVPS- 382
+ + +DG+L L + VS S+ A++ +CL + Y+ G + S
Sbjct: 214 ESFLASDGVLSLGYSNVSFASRAAAR--FGGRFSYCLVDHLAPRNATSYLTFGPNPAVSS 271
Query: 383 --------WGMAWVP-------MLDSPFMELYHTEILKINYGSSPLNLGARNSQV---GW 424
G A P +LD Y + ++ L + V G
Sbjct: 272 ASASRTACAGSAAAPGARQTPLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKGGG 331
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
A+ D+G+S T AY ++A+L + GL A DP C+ P+ + D+
Sbjct: 332 AILDSGTSLTVLVSPAYRAVVAALGK-KLVGLPRVAMDP-FDYCYNWTSPL-TGEDLAVA 388
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
L +HF S + P+ Y++ + G C+G+ +G G ++I G+I +
Sbjct: 389 VPALAVHFAG-----SARLQPPPKSYVIDAAPGVKCIGLQEGD--WPGVSVI-GNILQQE 440
Query: 545 QLVVYDNVNKRIGWAKSHCMN 565
L +D N+R+ + +S CM
Sbjct: 441 HLWEFDLKNRRLRFKRSRCMQ 461
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 90/370 (24%), Positives = 153/370 (41%), Gaps = 45/370 (12%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNH 267
+G PP P Y MDTGS LTWIQC+ PC +C + PLY P + + R
Sbjct: 116 IGQPPVPQYAVMDTGSSLTWIQCE-PCINCHQQKGPLYNPSSSSTYVSC----SDFDRTD 170
Query: 268 KPGYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGSLTKPNVVFGCAYDQQGLLL 326
C+Y YAD +++ G AR++L T ++G +V+FGC ++ L
Sbjct: 171 TTFTATHGSDCNYSQTYADKTTTRGTYAREQLLFETPDDGITIMHDVIFGCGHNNTQLPG 230
Query: 327 NTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMF----LGHDL-VP 381
T + G+ GL + S+ S+L +C+ N G Y F LG+ L +
Sbjct: 231 PTGYAS-GVFGLGDSGSSIISKLGFG------FSYCI-GNIGDPLYGFHRLTLGNKLKIE 282
Query: 382 SWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG-------ARNSQVGWALFDTGSSYT 434
+ VP LY+ ++ I+ G L++ N + D+G++ +
Sbjct: 283 GYSTPLVP------RGLYYITLVGISIGQERLDIDPIVFQRVDLNGISSRIVIDSGATLS 336
Query: 435 YFTKQAYSELIASLKEVSSDGLV-LDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFG 493
Y +QAY+ + + + S L L +C+ K + Q F T H
Sbjct: 337 YIPRQAYNVVRDKVSSILSGFLSRYRYIARHLSLCYIGK-----LNQDLQGFPDATFHLA 391
Query: 494 SKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVN 553
+V F + EG +CL ++ + T ++G ++ + V YD
Sbjct: 392 DGADLV---FQV--EGLFFQYTDNVLCLALVPTES--DEETCLIGLLAQQYYNVAYDLKQ 444
Query: 554 KRIGWAKSHC 563
+++ + + C
Sbjct: 445 QKLYFQRIEC 454
>gi|145351657|ref|XP_001420185.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580418|gb|ABO98478.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 498
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 100/394 (25%), Positives = 155/394 (39%), Gaps = 52/394 (13%)
Query: 196 IYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDA-PCSSCAKGANPLYKPRMGNILP 254
+Y G F + + + L++DTGS LT+ C P C +P Y M
Sbjct: 60 LYSSGHEFFLTVELAGKQKFDLEVDTGSPLTYFPCKGCPLEVCGIHEHPYYDYDMSKT-- 117
Query: 255 YKDSLCMEIQR-----NHKPGY--CET----CQQCDYEIEYADHSSSMGVLARDELHLTI 303
++ C N +P C+T C + I Y D S G +A D L
Sbjct: 118 FRKLNCTTSTEDAAYCNAQPNVLLCDTNISYTNTCLFGIGYVDGSVGRGYMAEDTFTLGD 177
Query: 304 ENGSLTKPNVVFGCA--YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGII-KNVVG 360
E L + FGC Y G + ++ DG+ G SR + +QLA G+I +V G
Sbjct: 178 E---LAPAKITFGCGGMYYPDG----SNLRQDGMAGFSRGNTAFHTQLAKAGVIDAHVFG 230
Query: 361 HCLTTNAGGGGYMFLGH----DLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLG 416
C + LG VP +AW ML + + + G +
Sbjct: 231 FCSEGMETSTAMLTLGRYNFGRRVPE--LAWTRMLGEDDLAV---RTMSWKLGDKTI--- 282
Query: 417 ARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVS-SDGLVLDASDPTLPVCWRAKFPI 475
A +S V + + D+G++ T + + + L E + S GL + C+
Sbjct: 283 ASSSNV-YTVLDSGTTLTVLPSAMHHDFMTHLNETARSAGLSVVVRGTH---CFYENQRQ 338
Query: 476 RSIVD--VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKG--NICLGILDGSE--V 529
S+ + ++F +LT+ + +V + PE YL C GI+ S+ +
Sbjct: 339 SSLTQYTLTRWFPSLTITYDPDVTLV-----LRPENYLFADTVNLHAFCAGIMSASDAAL 393
Query: 530 HNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
NG IILG +LR V YD N R+G A C
Sbjct: 394 ANGEQIILGQQTLRNTFVEYDLENSRVGMATVQC 427
>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 252
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 64/227 (28%), Positives = 108/227 (47%), Gaps = 21/227 (9%)
Query: 160 VNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDM 219
++D +R +++I +++ S++ V + I PL I L + + +G + + +
Sbjct: 24 LDDLRVRSMQNRI-RRVASTHNVEASQTQI-PLSSGINLQTLNYI-VTMGLGSKNMTVII 80
Query: 220 DTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPY---KDSLCMEIQ-RNHKPGYCETC 275
DT SDLTW+QC+ PC SC P++KP + S C +Q G C +
Sbjct: 81 DTRSDLTWVQCE-PCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGACGSS 139
Query: 276 Q--QCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTD 333
C+Y + Y D S + G L + L G ++ + VFGC + +GL
Sbjct: 140 NPSTCNYVVNYGDGSYTNGDLGVEALSF----GGVSVSDFVFGCGRNNKGLFGG----VS 191
Query: 334 GILGLSRAKVSLPSQLASQGIIKNVVGHCL-TTNAGGGGYMFLGHDL 379
G++GL R+ +SL SQ + V +CL TT AG G + +G++
Sbjct: 192 GLMGLGRSYLSLVSQ--TNATFGGVFSYCLPTTEAGSSGSLVMGNEF 236
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 68/230 (29%), Positives = 103/230 (44%), Gaps = 31/230 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G Y + +G PP + + DTGS L W QC APC+ CA P ++P + LP
Sbjct: 88 GAYNMNLSIGTPPVTFSVLADTGSSLIWTQC-APCTECAARPAPPFQPASSSTFSKLPCA 146
Query: 257 DSLCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
SLC + ++ TC C Y Y + G LA + LH+ G + P V
Sbjct: 147 SSLCQFLTSPYR-----TCNATGCVYYYPYG-MGFTAGYLATETLHV----GGASFPGVT 196
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMF 374
FGC+ + + GI+GL R+ +SL SQ+ G+ + +CL +NA G
Sbjct: 197 FGCSTEN-----GVGNSSSGIVGLGRSPLSLVSQV---GVAR--FSYCLRSNADAGDSPI 246
Query: 375 LGHDLVPSWG--MAWVPMLDSPFM---ELYHTEILKINYGSSPLNLGARN 419
L L G + P+L++P M Y+ + I G++ L + N
Sbjct: 247 LFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPMAMAN 296
>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 442
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 91/383 (23%), Positives = 149/383 (38%), Gaps = 60/383 (15%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK-----GANPLYKPRMGNILPYKDSLCM- 261
+G PP+ + +DTGS L+WIQC + K + +LP LC
Sbjct: 88 IGTPPQLQQMVLDTGSQLSWIQCHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCNHPLCKP 147
Query: 262 EIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQ 321
+ P C+ C Y YAD + + G L R+++ + S T P ++ GCA
Sbjct: 148 RVPDFSLPTDCDANSLCHYSYFYADGTYAEGNLVREKIAFS---PSQTTPPIILGCATQS 204
Query: 322 QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG--GGGYMFLGHDL 379
GILG++ ++ PSQ I K +C+ T G +LG++
Sbjct: 205 D--------DARGILGMNLGRLGFPSQAK---ITK--FSYCVPTKQAQPASGSFYLGNNP 251
Query: 380 VPS---------WGMAW-VPMLDSPFMELYHTEILKINYGSSPLNL-----GARNSQVGW 424
S +G + +P LD P Y + I+ G LN+ G
Sbjct: 252 ASSSFRYVNLLTFGQSQRMPNLD-PLA--YTLPLQGISIGGKKLNIPPSVFKPNAGGSGQ 308
Query: 425 ALFDTGSSYTYFTKQAYS----ELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVD 480
+ D+GS +TY +AY+ EL+ + G + +C+ ++
Sbjct: 309 TMIDSGSEFTYLVDEAYNVIREELVKKVGPKIKKGYMYGG---VADICFDG-----DAIE 360
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDI 540
+ + + F QIV I E L G CLG+ + G II G+
Sbjct: 361 IGRLVGDMVFEFEKGVQIV-----IPKERVLATVDGGVHCLGMGRSERLGAGGNII-GNF 414
Query: 541 SLRGQLVVYDNVNKRIGWAKSHC 563
+ V +D N+R+G+ ++ C
Sbjct: 415 HQQNLWVEFDLANRRVGFGEADC 437
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/156 (34%), Positives = 70/156 (44%), Gaps = 21/156 (13%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDS 258
Y + +G PP P +DTGSDL W QCDAPC C PLY P + +
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 259 LCMEIQ----RNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNV 313
+C +Q R P C Y Y D +S+ GVLA + L GS T V
Sbjct: 152 MCQALQSPWSRCSPPD-----TGCAYYFSYGDGTSTDGVLATETFTL----GSDTAVRGV 202
Query: 314 VFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQL 349
FGC + G N + G++G+ R +SL SQL
Sbjct: 203 AFGCGTENLGSTDN----SSGLVGMGRGPLSLVSQL 234
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 91/388 (23%), Positives = 156/388 (40%), Gaps = 82/388 (21%)
Query: 206 MIVGNPPRPYYLDMDTGSDLTWIQCDAPCS---SCAKGANPLYKPRMGNILPYKDSLCME 262
M VG P +P + +DTGSD+TW+QC PC+ C + P++ P E
Sbjct: 1 MRVGQPQQPSFFVLDTGSDVTWLQC-LPCAGKNGCYEQITPIFDP--------------E 45
Query: 263 IQRNHKPGYC--ETCQ----------QCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
+ ++ P C E CQ C Y++EY D S ++G LA + L N +
Sbjct: 46 LSSSYNPVSCDSEQCQLLDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSN---SI 102
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
PN+ GC +D +GL V DG++GL +S+ SQL + +CL +
Sbjct: 103 PNISIGCGHDNEGL----FVGADGLIGLGGGAISISSQLKASSF-----SYCL-VDIDSP 152
Query: 371 GYMFLGHDLVPSWGMAWVPML-DSPFMELYHTEILKINYGSSPLNLGAR-----NSQVGW 424
+ L + P P++ + F + +++ ++ G PL + + S +G
Sbjct: 153 SFSTLDFNTDPPSDSLISPLVKNDRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGG 212
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+ D+G++ T Y L + L L + P P ++ F
Sbjct: 213 IIVDSGTTITQLPSDVYEVLREAF-------LGLTTNLPPAP-------------EISPF 252
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKK---------GNICLGILDGSEVHNGSTI 535
L S ++ + F + E L + K G CL + + +
Sbjct: 253 DTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFVSATFPLS---- 308
Query: 536 ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I+G+ +G V YD N +G++ + C
Sbjct: 309 IIGNFQQQGIRVSYDLTNSLVGFSTNKC 336
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 89/370 (24%), Positives = 156/370 (42%), Gaps = 48/370 (12%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDS 258
+G Y + +G PP Y +DT SDL W QC PC C K NP++ P L +S
Sbjct: 28 NGDYLMKLTLGTPPVDVYGLVDTDSDLVWAQC-TPCQGCYKQKNPMFDP-----LKECNS 81
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
+H C + CDY YAD S++ G+LA++ + +G +++FGC
Sbjct: 82 F-----FDHS---CSPEKACDYVYAYADDSATKGMLAKEIATFSSTDGKPIVESIIFGCG 133
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGGYMFL 375
++ G+ + G+ G + VS L CL + G + L
Sbjct: 134 HNNTGVFNENDMGLIGLGGGPLSLVSQMGNLYGS----KRFSQCLVPFHADPHTSGTISL 189
Query: 376 GH-DLVPSWGMAWVPMLDSPFMELYHTEILKINYGSS--PLNLGARNSQVGWALFDTGSS 432
G V G+ P++ Y + I+ G + P N S+ G + D+G+
Sbjct: 190 GEASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFNSSEMLSK-GNIMIDSGTP 248
Query: 433 YTYFTKQAYSELIASLK-EVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLH 491
TY ++ Y L+ LK +++ + +D D +C++++ + + LT H
Sbjct: 249 ETYLPQEFYDRLVEELKVQINLPPIHVDP-DLGTQLCYKSETNLEGPI--------LTAH 299
Query: 492 F-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYD 550
F G+ +++ + I P K G C + ++ I G+ + L+ +D
Sbjct: 300 FEGADVKLLPLQTFIPP-------KDGVFCFAMTGTTD----GLYIFGNFAQSNVLIGFD 348
Query: 551 NVNKRIGWAK 560
++KRI + K
Sbjct: 349 -LDKRIVFFK 357
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 89/387 (22%), Positives = 150/387 (38%), Gaps = 57/387 (14%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS-CAKGANPLYKPRMGNI---LPYKD 257
Y +VG+PP+ +DTGS L W QC A C + P + +P +D
Sbjct: 86 YIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAPVPCQD 145
Query: 258 SLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGC 317
C + +C C + + Y +G L D T ++G T + FGC
Sbjct: 146 KACA----GNYLHFCALDGTCTFRVTYG-AGGIIGFLGTDA--FTFQSGGAT---LAFGC 195
Query: 318 AYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGGYMF 374
+ + L G++GL R ++SL SQ ++ +CLT N G ++F
Sbjct: 196 VSFTRFAAPDVLHGASGLIGLGRGRLSLASQTGAKRF-----SYCLTPYFHNNGASSHLF 250
Query: 375 LGHDLVPSWG------MAWVPM-LDSPFMELYHTEILKINYGSSPLNLGARNSQV----- 422
+G S G MA+V D P+ Y+ ++ I G + L + + +
Sbjct: 251 VGAAASLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEE 310
Query: 423 ----GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLV--LDASDPTLPVCWRAKFPIR 476
G + D+GS +T + AY L+ L + LV D + +C
Sbjct: 311 GFWEGGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARG---- 366
Query: 477 SIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTII 536
D+ + TL LHF + + PE Y +K C+ I+ G I
Sbjct: 367 ---DLDRVVPTLVLHFSGGADMA-----LPPENYWAPLEKSTACMAIVRGYLQS-----I 413
Query: 537 LGDISLRGQLVVYDNVNKRIGWAKSHC 563
+G+ + +++D R+ + + C
Sbjct: 414 IGNFQQQNMHILFDVGGGRLSFQNADC 440
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 95/417 (22%), Positives = 169/417 (40%), Gaps = 63/417 (15%)
Query: 189 IFPLRGNIYP------DGLYFTY-------MIVGNPPRPYYLDMDTGSDLTWIQCDA--- 232
+ PL+ I P D L+F + + VG PP+ + +DTGS+L+W++C+
Sbjct: 47 VLPLKTRITPTDHQPTDKLHFHHNVTLTVTLTVGTPPQNISMVIDTGSELSWLRCNRSSN 106
Query: 233 --PCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHK-PGYCETCQQCDYEIEYADHSS 289
P ++ + Y P +P C R+ P C++ + C + YAD SS
Sbjct: 107 PNPVNNFDPTRSSSYSP-----IPCSSPTCRTRTRDFLIPASCDSDKLCHATLSYADASS 161
Query: 290 SMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQL 349
S G LA + H S N++FGC G KT G+LG++R +S SQ+
Sbjct: 162 SEGNLAAEIFHF---GNSTNDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQM 218
Query: 350 ASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWV-PMLDSPFMEL---------- 398
G K +C++ G++ LG W+ P+ +P + +
Sbjct: 219 ---GFPK--FSYCISGTDDFPGFLLLGDS-----NFTWLTPLNYTPLIRISTPLPYFDRV 268
Query: 399 -YHTEI--LKINYGSSPLN---LGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVS 452
Y ++ +K+N P+ L ++ G + D+G+ +T+ Y+ L +
Sbjct: 269 AYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTMVDSGTQFTFLLGPVYTALRSDFLN-Q 327
Query: 453 SDGLVLDASDP------TLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHIS 506
++G++ DP T+ +C+R P R + T++L F VS + +
Sbjct: 328 TNGILTVYEDPEFVFQGTMDLCYRIS-PFRIRTGILHRLPTVSLVFEGAEIAVSGQPLLY 386
Query: 507 PEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+L C S++ ++G + + +D RIG A C
Sbjct: 387 RVPHLTAGNDSVYCF-TFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVQC 442
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 103/405 (25%), Positives = 165/405 (40%), Gaps = 41/405 (10%)
Query: 172 INKKLVSSNAVAVDSSSIFPLRGNIYPDGL-YFTYMIVGNPPRPYYLDMDTGSDLTWIQC 230
I +K +S S P D L Y + +G P + +DTGSDL+W+QC
Sbjct: 96 ITRKAKASGRTTTLSDVSIPTSLGAAVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQC 155
Query: 231 DAPC--SSCAKGANPLYKPRMGNI---LPYKDSLCMEI---QRNHKPGYCETCQQCDYEI 282
PC SSC +PLY P + +P C ++ +H C Y I
Sbjct: 156 K-PCNSSSCYPQKDPLYDPTASSTYAPVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGI 214
Query: 283 EYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAK 342
EY + +++GV + + L L+ + ++ + FGC QQG T DG+LGL A
Sbjct: 215 EYGNRDTTVGVYSTETLTLSPQ---VSVKDFGFGCGLVQQG----TFDLFDGLLGLGGAP 267
Query: 343 VSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGH--DLVPSWGMAWVPMLDSPFME-LY 399
SL SQ A +CL G++ LG + + G + P+ P Y
Sbjct: 268 ESLVSQTAET--YGGAFSYCLPPGNSTTGFLALGAPTNNNDTAGFLFTPLHSLPEQATFY 325
Query: 400 HTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLK-EVSSDGLVL 458
+ ++ G PL++ G + D+G+ T AYS L + + +S+ L+
Sbjct: 326 LVNLTGVSVGGKPLDI-PPTVLSGGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLP 384
Query: 459 DASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN 518
+D L C+ I +V LT G+ + P G L+
Sbjct: 385 PNNDDVLDTCYN----FTGIANVTVPTVALTFDGGATIDL------DVPSGVLI-----Q 429
Query: 519 ICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
CL G+ +G I+G+++ R V+YD+ +G+ C
Sbjct: 430 DCLAFAGGAS--DGDVGIIGNVNQRTFEVLYDSGRGHVGFRPGAC 472
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 81/385 (21%), Positives = 162/385 (42%), Gaps = 51/385 (13%)
Query: 206 MIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCME 262
+ VG PP+ + +DTGS+L+W+ C + N ++ P + + +P +C
Sbjct: 74 LTVGTPPQSVTMVLDTGSELSWLHCKK-----QQNINSVFNPHLSSSYTPIPCMSPICKT 128
Query: 263 IQRNHK-PGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQ 321
R+ P C++ C + YAD +S G LA D ++ GS +P ++FG
Sbjct: 129 RTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAIS---GS-GQPGIIFGSMDSG 184
Query: 322 QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVP 381
N KT G++G++R +S +Q+ G K +C++ G +F D
Sbjct: 185 FSSNANEDSKTTGLMGMNRGSLSFVTQM---GFPK--FSYCISGKDASGVLLF--GDATF 237
Query: 382 SW--GMAWVPMLDS----PFME--LYHTEILKINYGSSPLNL-----GARNSQVGWALFD 428
W + + P++ P+ + Y ++ I GS PL + ++ G + D
Sbjct: 238 KWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQTMVD 297
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDP------TLPVCWRAKFP--IRSIVD 480
+G+ +T+ Y+ L + G++ DP + +C+R + + ++
Sbjct: 298 SGTRFTFLLGSVYTALRNEFV-AQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGVVPAVPA 356
Query: 481 VKQFFKTLTLHFGSKWQIVSTK--FHISPEGYLVISKKGNICLGILDGSEVHNGSTIILG 538
V F+ G++ + + + + +G + CL S++ ++G
Sbjct: 357 VTMVFE------GAEMSVSGERLLYRVGGDGDVAKGNGDVYCL-TFGNSDLLGIEAYVIG 409
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHC 563
+ + +D VN R+G+A + C
Sbjct: 410 HHHQQNVWMEFDLVNSRVGFADTKC 434
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 92/390 (23%), Positives = 161/390 (41%), Gaps = 62/390 (15%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC---AKGANPLYKPRMGNI--L 253
+G Y + +G PP+ +DTGSDL W++CD C C G + + L
Sbjct: 2 EGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDN-CDHCDLDHHGETIFFSDASSSYKKL 60
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL----HLTIENGSLT 309
P + C + CE + C Y+ EY D S + G + D + H E+
Sbjct: 61 PCNSTHCSGMSSAGIGPRCE--ETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSF 118
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA-- 367
+FGCA +G T G++GL + SL QL + + +CL +
Sbjct: 119 FDGFLFGCARKLKG----DWNFTQGLIGLGQKSHSLIQQLGDK--LGYKFSYCLVSYDSP 172
Query: 368 -GGGGYMFL-------GHDLVPSWGMAWVPMLDSPFME--LYHTEILKINYGSSPLNL-- 415
++FL GHD+V + P+L ++ LY+ ++ I G P+ +
Sbjct: 173 PSAKSFLFLGSSAALRGHDVVST------PILHGDHLDQTLYYVDLQSITIGGVPVVVYD 226
Query: 416 --GARNSQVG-----WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVC 468
N+ VG + D+G++YT T Y + S++E ++L PTL
Sbjct: 227 KESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQ----VIL----PTLGNS 278
Query: 469 WRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSE 528
S D F ++T +F ++ Q+V + E ++ + +CL + +
Sbjct: 279 AGLDLCFNSSGDTSYGFPSVTFYFANQVQLV-----LPFENIFQVTSRDVVCLSM----D 329
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGW 558
G I+G++ + ++YD V +I +
Sbjct: 330 SSGGDLSIIGNMQQQNFHILYDLVASQISF 359
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 90/389 (23%), Positives = 163/389 (41%), Gaps = 52/389 (13%)
Query: 186 SSSIFPLR-GNIYPDGLYFTY-----MIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK 239
SS +F + G+ Y D ++ TY + +G PP +DTGS+ W QC PC C
Sbjct: 43 SSRVFNTQLGSPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQC-LPCVHCYN 101
Query: 240 GANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETC-QQCDYEIEYADHSSSMGVLARDE 298
P++ P K S EI+ C+T C YE+ Y S + G L +
Sbjct: 102 QTAPIFDPS-------KSSTFKEIR-------CDTHDHSCPYELVYGGKSYTKGTLVTET 147
Query: 299 LHLTIENGS-LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKN 357
+ + +G P + GC + G G++GL R SL +Q+ G
Sbjct: 148 VTIHSTSGQPFVMPETIIGCGRNNSGFKPGFA----GVVGLDRGPKSLITQMG--GEYPG 201
Query: 358 VVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM-LDSPFMELYHTEILKINYGSSPLNLG 416
++ +C G F + +V G+ + + + Y+ + ++ G++ +
Sbjct: 202 LMSYCF-AGKGTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETV 260
Query: 417 AR--NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFP 474
++ G + D+GS+ TYF ++Y L+ E + SD +C+ +K
Sbjct: 261 GTPFHALKGNIVIDSGSTLTYF-PESYCNLVRKAVEQVVTAVRFPRSD---ILCYYSK-- 314
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGST 534
+D+ F +T+HF +V K+++ Y+ + G CL I+ S +
Sbjct: 315 ---TIDI---FPVITMHFSGGADLVLDKYNM----YVASNTGGVFCLAIICNSPIEEA-- 362
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I G+ + LV YD+ + + + ++C
Sbjct: 363 -IFGNRAQNNFLVGYDSSSLLVSFKPTNC 390
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 90/389 (23%), Positives = 163/389 (41%), Gaps = 52/389 (13%)
Query: 186 SSSIFPLR-GNIYPDGLYFTY-----MIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAK 239
SS +F + G+ Y D ++ TY + +G PP +DTGS+ W QC PC C
Sbjct: 37 SSRVFNTQLGSPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQC-LPCVHCYN 95
Query: 240 GANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETC-QQCDYEIEYADHSSSMGVLARDE 298
P++ P K S EI+ C+T C YE+ Y S + G L +
Sbjct: 96 QTAPIFDPS-------KSSTFKEIR-------CDTHDHSCPYELVYGGKSYTKGTLVTET 141
Query: 299 LHLTIENGS-LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKN 357
+ + +G P + GC + G G++GL R SL +Q+ G
Sbjct: 142 VTIHSTSGQPFVMPETIIGCGRNNSGFKPGFA----GVVGLDRGPKSLITQMG--GEYPG 195
Query: 358 VVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM-LDSPFMELYHTEILKINYGSSPLNLG 416
++ +C G F + +V G+ + + + Y+ + ++ G++ +
Sbjct: 196 LMSYCF-AGKGTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETV 254
Query: 417 AR--NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFP 474
++ G + D+GS+ TYF ++Y L+ E + SD +C+ +K
Sbjct: 255 GTPFHALKGNIVIDSGSTLTYF-PESYCNLVRKAVEQVVTAVRFPRSD---ILCYYSK-- 308
Query: 475 IRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGST 534
+D+ F +T+HF +V K+++ Y+ + G CL I+ S +
Sbjct: 309 ---TIDI---FPVITMHFSGGADLVLDKYNM----YVASNTGGVFCLAIICNSPIEEA-- 356
Query: 535 IILGDISLRGQLVVYDNVNKRIGWAKSHC 563
I G+ + LV YD+ + + + ++C
Sbjct: 357 -IFGNRAQNNFLVGYDSSSLLVSFKPTNC 384
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 95/379 (25%), Positives = 156/379 (41%), Gaps = 41/379 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN-ILPYKDS 258
G Y +GNP +DT + L W+QC S C L + + Y+
Sbjct: 73 GEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLSSKSFTYEME 132
Query: 259 LCMEIQRNHKPGYCETCQQ----CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
C N G+ +TC C Y + Y D+ ++ G+L+ D +G L +
Sbjct: 133 PCGSNFCNSLTGF-QTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSDGMLVDVGFL 191
Query: 315 -FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--TNAGGGG 371
FGC+ + L G +GL++ +SL SQL GI K +CL N G
Sbjct: 192 NFGCS---EAPLTGDEQSYTGNVGLNQTPLSLISQL---GIKK--FSYCLVPFNNLGSTS 243
Query: 372 YMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGA----RNSQVGWALF 427
M+ G V S G P+L P + Y+ ++L I+ G+ + + GW +
Sbjct: 244 KMYFGSLPVTSGGQ--TPLL-YPNSDAYYVKVLGISIGNDEPHFDGVFDVYEVRDGW-II 299
Query: 428 DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKT 487
DTG +Y+ A+ L+A + D +C F +++ D++ F
Sbjct: 300 DTGITYSSLETDAFDSLLAKFLTLKDFPQRKDDPKERFELC----FELQNANDLES-FPD 354
Query: 488 LTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI-ILGDISLRGQL 546
+T+HF I++ + ++ I G CL +L +GS + ILG+ L+
Sbjct: 355 VTVHFDGADLILNVE-----STFVKIEDDGIFCLALL-----RSGSPVSILGNFQLQNYH 404
Query: 547 VVYDNVNKRIGWAKSHCMN 565
V YD + I +A C +
Sbjct: 405 VGYDLEAQVISFAPVDCAD 423
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 76.3 bits (186), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 97/417 (23%), Positives = 160/417 (38%), Gaps = 64/417 (15%)
Query: 190 FPLRGNIYPD-GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPR 248
PL Y G YF VG P +P+ L DTGSDLTW++C P S+ + + P
Sbjct: 84 MPLTSGAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPG 143
Query: 249 MGNILPYKDSLCMEIQRNHKPGYC--ETCQQ---------------CDYEIEYADHSSSM 291
G +DS R P C +TC + C Y+ Y D S++
Sbjct: 144 PGRAFRPEDS------RTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAAR 197
Query: 292 GVLARDELHLTIENGSLTKPN---VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQ 348
G + + + + K +V GC+ G + +DG+L L + +S S
Sbjct: 198 GTVGTESATIALSGREERKAKLKGLVLGCSSSYTG---PSFEASDGVLSLGYSGISFASH 254
Query: 349 LASQGIIKNVVGHCLTTN---AGGGGYMFLGHDLVPSWGMAWVP--------------ML 391
AS+ +CL + Y+ G + S A +L
Sbjct: 255 AASR--FGGRFSYCLVDHLSPRNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLL 312
Query: 392 DSPFMELYHTEILKINYGSSPLNLGARNSQV---GWALFDTGSSYTYFTKQAYSELIASL 448
D Y + I+ L + V G + D+G+S T K AY ++A+L
Sbjct: 313 DRRMRPFYDVSLKAISVAGEFLKIPRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAAL 372
Query: 449 KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPE 508
+ + GL DP C+ P DV + +HF + + +
Sbjct: 373 SKGLA-GLPRVTMDP-FEYCYNWTSPSGKDADVA--VPKMAVHFAG-----AARLEPPGK 423
Query: 509 GYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
Y++ + G C+G+ +G G ++I G+I + L +D N+R+ + +S C +
Sbjct: 424 SYVIDAAPGVKCIGLQEGP--WPGISVI-GNILQQEHLWEFDIKNRRLKFQRSRCTH 477
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 76.3 bits (186), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 149/375 (39%), Gaps = 40/375 (10%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCS-SCAKGANPLYKPRMGNI---LPYKDSLCMEI 263
+G PP+P + S +W+ C + C+ +C + L++P + LP C
Sbjct: 5 LGTPPQPLNFTLAVDSGFSWVACSSSCAINCTTAS--LFQPGLSTSHTKLPCGSPSCSAF 62
Query: 264 QRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQG 323
C C Y Y + SS G L D + N+ GC D G
Sbjct: 63 SAVSTS--CGPSSSCSYNTSYGTNFSSAGDLVSDIATMDSVRNRKVAANLSLGCGRDSGG 120
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDL---V 380
LL L+ T G +G + VS QL++ G + +CL ++ G + + L
Sbjct: 121 LL--ELLDTSGFVGFDKGNVSFMGQLSALGYRSKFI-YCLPSDTFRGKLVIGNYKLRNAS 177
Query: 381 PSWGMAWVPMLDSP-FMELYHTEILKINYGSSPLNL---GARNSQVGWALFDTGSSYTYF 436
S MA+ PM+ +P ELY + I+ + + G ++ G + DT + +Y
Sbjct: 178 ISSSMAYTPMITNPQAAELYFINLSTISIDKNKFQVPIQGFLSNGTGGTVIDTTTFLSYL 237
Query: 437 TKQAYSELIASLKEVSSDGLVLDASDPT---LPVCW----RAKFPIRSIVDVKQFFKTLT 489
T Y++L+ ++K +++ + + +S + +C+ + FP + TLT
Sbjct: 238 TSDFYTQLVQAIKNYTTNLVEVSSSVADALGVELCYNISANSDFPPPA---------TLT 288
Query: 490 LHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
HF G VST F + S IC+ I SE + ++G V
Sbjct: 289 YHFLGGAGVEVSTWFLLDDSD----SVNNTICMAI-GRSESVGPNLNVIGTYQQLDLTVE 343
Query: 549 YDNVNKRIGWAKSHC 563
YD R G+ C
Sbjct: 344 YDLEQMRYGFGAQGC 358
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 86/371 (23%), Positives = 144/371 (38%), Gaps = 40/371 (10%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS---CAKGANPLYKPRMGNILPYKDS 258
Y + VG P + +YL DTGSD+TW+QC PC+S C K +P++ P+ +
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ-PCASENTCYKQFDPIFDPKSSSSYSPLSC 206
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
+ + K C + C Y++ Y D S + G LA + L N + PN+ GC
Sbjct: 207 NSQQCKLLDKAN-CNS-DTCIYQVHYGDGSFTTGELATETLSFGNSN---SIPNLPIGCG 261
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
+D +GL L +SL SQL + +CL +
Sbjct: 262 HDNEGLFAGGAGLIG----LGGGAISLSSQLKASSF-----SYCLVNLDSDSSSTLEFNS 312
Query: 379 LVPSWGMAWVPMLDSPFMELYHTEILKINYGS-----SPLNLGARNSQVGWALFDTGSSY 433
+PS + + + F + +++ I+ G SP S +G + D+G+
Sbjct: 313 YMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTII 372
Query: 434 TYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFG 493
+ Y L + +++S L C+ F +S V+V L+
Sbjct: 373 SRLPSDVYESLREAFVKLTS-SLSPAPGISVFDTCY--NFSGQSNVEVPTIAFVLS---- 425
Query: 494 SKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNV 552
T + YL+ + G CL + S I+G +G V YD
Sbjct: 426 -----EGTSLRLPARNYLIMLDTAGTYCLAFIK----TKSSLSIIGSFQQQGIRVSYDLT 476
Query: 553 NKRIGWAKSHC 563
N +G++ + C
Sbjct: 477 NSIVGFSTNKC 487
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 105/427 (24%), Positives = 169/427 (39%), Gaps = 54/427 (12%)
Query: 156 VVASVNDGIIRPHKSKINKKLVSSNAVAVDS--SSIFPLRGNIYPDGLYFTYMIVGNPPR 213
V ASV D ++ S ++ S+ VA S +S+ GN G Y +G PP+
Sbjct: 57 VSASVIDTVLHMASSDSHRFTYLSSLVAGKSKPTSVPVASGNQLHIGNYVVRARLGTPPQ 116
Query: 214 PYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCE 273
++ +DT +D W+ PCS C+ +N + Y C Q G
Sbjct: 117 LMFMVLDTSNDAVWL----PCSGCSGCSNASTSFNTNSSSTYSTVSCSTTQCTQARGL-- 170
Query: 274 TCQQ-------CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLL 326
TC C + Y SS L +D L L+ + PN FGC G
Sbjct: 171 TCPSSTPQPSICSFNQSYGGDSSFSANLVQDTLTLSPD----VIPNFSFGCINSASG--- 223
Query: 327 NTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG--GGGYMFLGHDLVPSWG 384
N+L G++GL R +SL SQ S + V +CL + G + LG P
Sbjct: 224 NSL-PPQGLMGLGRGPMSLVSQTTS--LYSGVFSYCLPSFRSFYFSGSLKLGLLGQPK-S 279
Query: 385 MAWVPMLDSPFM-ELYHTEILKINYGS-----SPLNLGARNSQVGWALFDTGSSYTYFTK 438
+ + P+L +P LY+ + ++ GS P+ L ++ + D+G+ T F +
Sbjct: 280 IRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNSGAGTIIDSGTVITRFAQ 339
Query: 439 QAYSELIASL-KEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQ 497
Y + K+V+ L A D C+ A D + +TLH
Sbjct: 340 PVYEAIRDEFRKQVNGSFSTLGAFD----TCFSA--------DNENVTPKITLH------ 381
Query: 498 IVSTKFHISPEGYLVISKKGNI-CLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRI 556
+ S + E L+ S G + CL + + N ++ ++ + +++D N RI
Sbjct: 382 MTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRI 441
Query: 557 GWAKSHC 563
G A C
Sbjct: 442 GIAPEPC 448
>gi|414887401|tpg|DAA63415.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 242
Score = 75.9 bits (185), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 111/243 (45%), Gaps = 20/243 (8%)
Query: 288 SSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPS 347
SSS GVL D + E+ L VFGC + G L + DGI+GL R ++S+
Sbjct: 2 SSSSGVLGEDIVSFGRES-ELKAQRAVFGCENSETGDLFSQ--HADGIMGLGRGQLSIMD 58
Query: 348 QLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPS-WGMAWVPMLDSPFMELYHTEILKI 406
QL +G+I + C GGG M LG PS + L SP+ Y+ E+ +I
Sbjct: 59 QLVEKGVINDSFSLCYGGMDIGGGAMVLGGVPTPSDMVFSRSDPLRSPY---YNIELKEI 115
Query: 407 NYGSSPLNLGAR--NSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT 464
+ L + +R +S+ G L D+G++Y Y +QA+ ++ + DP+
Sbjct: 116 HVAGKALRVDSRIFDSKHGTVL-DSGTTYAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPS 174
Query: 465 LP-VCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICL 521
+C+ R++ + + F + + FG+ K ++PE YL K G CL
Sbjct: 175 YKDICFAG--ARRNVSKLHEVFPDVDMVFGN-----GQKLSLTPENYLFRHSKVDGAYCL 227
Query: 522 GIL 524
G+
Sbjct: 228 GVF 230
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 75.9 bits (185), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 86/371 (23%), Positives = 144/371 (38%), Gaps = 40/371 (10%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS---CAKGANPLYKPRMGNILPYKDS 258
Y + VG P + +YL DTGSD+TW+QC PC+S C K +P++ P+ +
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ-PCASENTCYKQFDPIFDPKSSSSYSPLSC 206
Query: 259 LCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
+ + K C + C Y++ Y D S + G LA + L N + PN+ GC
Sbjct: 207 NSQQCKLLDKAN-CNS-DTCIYQVHYGDGSFTTGELATETLSFGNSN---SIPNLPIGCG 261
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
+D +GL L +SL SQL + +CL +
Sbjct: 262 HDNEGLFAGGAGLIG----LGGGAISLSSQLKASSF-----SYCLVNLDSDSSSTLEFNS 312
Query: 379 LVPSWGMAWVPMLDSPFMELYHTEILKINYGS-----SPLNLGARNSQVGWALFDTGSSY 433
+PS + + + F + +++ I+ G SP S +G + D+G+
Sbjct: 313 NMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTII 372
Query: 434 TYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFG 493
+ Y L + +++S L C+ F +S V+V L+
Sbjct: 373 SRLPSDVYESLREAFVKLTS-SLSPAPGISVFDTCY--NFSGQSNVEVPTIAFVLS---- 425
Query: 494 SKWQIVSTKFHISPEGYLV-ISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNV 552
T + YL+ + G CL + S I+G +G V YD
Sbjct: 426 -----EGTSLRLPARNYLIMLDTAGTYCLAFIK----TKSSLSIIGSFQQQGIRVSYDLT 476
Query: 553 NKRIGWAKSHC 563
N +G++ + C
Sbjct: 477 NSLVGFSTNKC 487
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 66/238 (27%), Positives = 99/238 (41%), Gaps = 16/238 (6%)
Query: 219 MDTGSDLTWIQC-DAPCSSCAKGANPLYKP---RMGNILPYKDSLCMEIQ--RNHKPGYC 272
+DT SD+ W+QC P S C + LY P R C ++ N
Sbjct: 186 LDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGPYANGCSSSS 245
Query: 273 ETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKT 332
+ QC Y + Y D S++ G L D+L L+ + P FGC++ +G + KT
Sbjct: 246 NSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTS---QVPKFEFGCSHAARGSFSRS--KT 300
Query: 333 DGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLD 392
GI+ L R SL SQ +++ V +C A G+ LG S A PML
Sbjct: 301 AGIMALGRGVQSLVSQTSTK--YGQVFSYCFPPTASHKGFFVLGVPRRSSSRYAVTPMLK 358
Query: 393 SPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKE 450
+P LY + I L++ G AL D+ + T AY L ++ ++
Sbjct: 359 TPM--LYQVRLEAIAVAGQRLDVPPTVFAAGAAL-DSRTVITRLPPTAYQALRSAFRD 413
>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 389
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 58/193 (30%), Positives = 90/193 (46%), Gaps = 13/193 (6%)
Query: 166 RPHKSKINKKLVSSNAVAVDSSSI-FPLRGNIYPDGLYFTYMI-VGNPPRPYYLDMDTGS 223
+P + ++ LV + A DS + PL GL F I G+P + +L MDTGS
Sbjct: 20 KPKRVTLHIPLVHNGANFYDSKVVSLPLSSPHSQRGLAFMAEIHFGSPQKKQFLHMDTGS 79
Query: 224 DLTWIQCDAPCSSC-AKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYC--ETCQQCDY 280
LTW QC PCS C A+ P Y+P + Y+D++C + P + + C Y
Sbjct: 80 SLTWTQC-FPCSDCYAQKIYPKYRPAAS--ITYRDAMCEDSHPKSNPHFAFDPLTRICTY 136
Query: 281 EIEYADHSSSMGVLARDELHLTIENGSLTKPN-VVFGCAYDQQGLLLNTLVKTDGILGLS 339
+ Y D ++ G LA++ + + +G + + V FGC G + GILGL
Sbjct: 137 QQHYLDETNIKGTLAQEMITVDTHDGGFKRVHGVYFGCNTLSDG----SYFTGTGILGLG 192
Query: 340 RAKVSLPSQLASQ 352
K S+ + S+
Sbjct: 193 VGKYSIIGEFGSK 205
>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
Length = 137
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 65/128 (50%), Gaps = 13/128 (10%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPY 255
+G + + +G P Y +DTGSDLTW QC PCS C K P+Y P + + +
Sbjct: 18 NGEFLMQLAIGKPSLAYSAILDTGSDLTWTQC-MPCSDCYKQPTPIYDPSLSSTYGTVSC 76
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
K SLC+ + P C+Y Y D+SS+ G+L+ + L+ S + P++ F
Sbjct: 77 KSSLCLAL-----PASACISATCEYLYTYGDYSSTQGILSYETFTLS----SQSIPHIAF 127
Query: 316 GCAYDQQG 323
GC D +G
Sbjct: 128 GCGQDNEG 135
>gi|213998832|gb|ACJ60783.1| nucellin [Hordeum vulgare subsp. spontaneum]
Length = 127
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 73/134 (54%), Gaps = 9/134 (6%)
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAGGGGYMFL 375
C Y Q+ + DGILGL K L +QL +IK NV+GHCL++ G G +++
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIGHCLSSK--GKGVLYV 58
Query: 376 GHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTY 435
G P+ G+ WVPM +S F Y + ++ P+ R + A+FD+GS+YT+
Sbjct: 59 GDFNPPTRGVTWVPMRESLF--YYSPGLAEVFIDKQPI----RGNPTFEAVFDSGSTYTH 112
Query: 436 FTKQAYSELIASLK 449
Q Y+E+++ ++
Sbjct: 113 VPAQIYNEIVSKVR 126
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 86/336 (25%), Positives = 141/336 (41%), Gaps = 47/336 (13%)
Query: 208 VGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNH 267
+G+PP L MDT SDL W+QC PC +C + P++ P +++ C Q +
Sbjct: 91 IGSPPVTQLLHMDTASDLLWLQC-RPCINCYAQSLPIFDPSRS--YTHRNESCRTSQYSM 147
Query: 268 KP-GYCETCQQCDYEIEYADHSSSMGVLARDELHLTI---ENGSLTKPNVVFGCAYDQQG 323
+ + C+Y + Y D + S G+LA++ L E+ S +VVFGC +D G
Sbjct: 148 PSLRFNAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALHDVVFGCGHDNYG 207
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHC---LTTNAGGGGYMFLGHDLV 380
LV T GILGL + SL + ++ +C L + + LG D
Sbjct: 208 ---EPLVGT-GILGLGYGEFSLVHRFGTK------FSYCFGSLDDPSYPHNVLVLGDDGA 257
Query: 381 PSWGMAWVPMLDSPFMELYH------TEILKINYGSSPLN--LGARNSQVGWA--LFDTG 430
G D+ +E+Y+ E + ++ P++ + RN Q G + DTG
Sbjct: 258 NILG-------DTTPLEIYNGFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTG 310
Query: 431 SSYTYFTKQAYSEL---IASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKT 487
+S T ++AY L I E ++ D C+ R +V+ F
Sbjct: 311 NSLTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLE-RDLVESG--FPI 367
Query: 488 LTLHFGSKWQ----IVSTKFHISPEGYLVISKKGNI 519
+T HF + + S +SP + + GN+
Sbjct: 368 VTFHFSDGAELSLDVKSVFMKLSPNVFCLAVTPGNM 403
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 89/384 (23%), Positives = 159/384 (41%), Gaps = 57/384 (14%)
Query: 219 MDTGSDLTWIQC--DAPCSSCAK--GANPLYKPRMG---NILPYKDSLCMEIQRNHKPGY 271
MDTGSDL W+ C + C +C + +N ++ PRM +++ DS C + N+
Sbjct: 1 MDTGSDLVWVPCTRNYSCINCPEDSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNTELL 60
Query: 272 CETC----QQCD-----YEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQ 322
C++C + C Y I+Y S+ G+L + L+L +ENG + F
Sbjct: 61 CQSCAGSLKNCSETCPPYGIQYG-RGSTAGLLLTETLNLPLENGEGARAITHFAV----- 114
Query: 323 GLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN----AGGGGYMFLGHD 378
G + + + GI G R +S+PSQL I K+ +CL ++ M LG
Sbjct: 115 GCSIVSSQQPSGIAGFGRGALSMPSQLGEH-IGKDRFAYCLQSHRFDEENKKSLMVLGDK 173
Query: 379 LVP-SWGMAWVPML-------DSPFMELYHTEILKINYGSSPLN------LGARNSQVGW 424
+P + + + P L S + Y+ + ++ G L L G
Sbjct: 174 ALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDTKGNGG 233
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPT-LPVCWRAKFPIRSIVDVKQ 483
+ D+G+++T F+ + + + A + D T + +C+ + +IV
Sbjct: 234 TIIDSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVEDKTGMGLCYDVT-GLENIV---- 288
Query: 484 FFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGS---EVHNGSTIILGD 539
HF G ++ + S S +ICL ++ EV +G +ILG+
Sbjct: 289 -LPEFAFHFKGGSDMVLPVANYFS-----YFSSFDSICLTMISSRGLLEVDSGPAVILGN 342
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHC 563
+ ++YD R+G+ + C
Sbjct: 343 DQQQDFYLLYDREKNRLGFTQQTC 366
>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
Length = 137
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 65/128 (50%), Gaps = 13/128 (10%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPY 255
+G + + +G P Y +DTGSDLTW QC PCS C K P+Y P + + +
Sbjct: 18 NGEFLMQLAIGKPSLAYSAILDTGSDLTWTQC-IPCSDCYKQPTPIYDPSLSSTYGTVSC 76
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
K SLC+ + P C+Y Y D+SS+ G+L+ + L+ S + P++ F
Sbjct: 77 KSSLCLAL-----PASACISATCEYLYTYGDYSSTQGILSYETFTLS----SQSIPHIAF 127
Query: 316 GCAYDQQG 323
GC D +G
Sbjct: 128 GCGQDNEG 135
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 98/417 (23%), Positives = 165/417 (39%), Gaps = 74/417 (17%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQC---DAPCSSCAK------GANPLYKPRMGNI 252
Y + +G PP+ + MDTGSDLTW+ C C C ++ ++ P +
Sbjct: 11 YLITLNIGTPPQAVQVYMDTGSDLTWVPCGNLSFDCIDCNDLKSNNLKSSSIFSPLHSSS 70
Query: 253 ---LPYKDSLCMEIQRNHKP------GYC-------ETC-QQC-DYEIEYADHSSSMGVL 294
S C EI + P C TC + C + Y + G+L
Sbjct: 71 SFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVSGIL 130
Query: 295 ARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI 354
RD L + P FGC + +T + GI G R +SLPSQL G
Sbjct: 131 TRDILKARTRD----VPRFSFGC-------VTSTYHEPIGIAGFGRGLLSLPSQL---GF 176
Query: 355 IKNVVGHC-----LTTNAGGGGYMFLGHDLVP---SWGMAWVPMLDSP-FMELYHTEILK 405
++ HC N + LG + + + + PML++P + Y+ +
Sbjct: 177 LEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYIGLES 236
Query: 406 INYGSS------PLNLGARNSQ-VGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVL 458
I G++ PL L +SQ G L D+G++YT+ YS+L+ L+ +
Sbjct: 237 ITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLTILQSTITYPRAT 296
Query: 459 DASDPT-LPVCWRAKFPIRSIV----DVKQFFKTLTLHFGSKWQIVSTKFHISPEG---- 509
+ T +C++ P ++ DV F ++T +F + + + P+G
Sbjct: 297 ETESRTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATL------LLPQGNSFY 350
Query: 510 YLVISKKGNI--CLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCM 564
+ G++ CL + + + G + G + VVYD +RIG+ C+
Sbjct: 351 AMSAPSDGSVVQCLLFQNMEDGNYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDCV 407
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 75.5 bits (184), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 91/207 (43%), Gaps = 25/207 (12%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG---NILPYK 256
G YFT + VG P R Y+ +DTGSD+ WIQC+ PC C A+P++ P + +
Sbjct: 155 GEYFTRIGVGTPTREQYMVLDTGSDVAWIQCE-PCRECYSQADPIFNPSYSASFSTVGCD 213
Query: 257 DSLCMEIQR--NHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
++C ++ H G C YE Y D S S G A + L G+ + NV
Sbjct: 214 SAVCSQLDAYDCHSGG-------CLYEASYGDGSYSTGSFATETLTF----GTTSVANVA 262
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT-TNAGGGGYM 373
GC + GL + L +S P+Q+ +Q + +CL + G +
Sbjct: 263 IGCGHKNVGLFIGAAGLLG----LGAGALSFPNQIGTQ--TGHTFSYCLVDRESDSSGPL 316
Query: 374 FLGHDLVPSWGMAWVPMLDSPFMELYH 400
G VP G + P+ +P + ++
Sbjct: 317 QFGPKSVPV-GSIFTPLEKNPHLPTFY 342
>gi|330794218|ref|XP_003285177.1| hypothetical protein DICPUDRAFT_96947 [Dictyostelium purpureum]
gi|325084898|gb|EGC38316.1| hypothetical protein DICPUDRAFT_96947 [Dictyostelium purpureum]
Length = 817
Score = 75.5 bits (184), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 99/411 (24%), Positives = 175/411 (42%), Gaps = 56/411 (13%)
Query: 186 SSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWI---QCDAPCSSCAKGAN 242
SSSI L G I YF ++VG PP+ + + +DTGS + C S K +
Sbjct: 191 SSSI--LYGGITSSFEYFIPILVGTPPQMFTVQVDTGSTSLAVPGSNCYLYKSQSIKTSC 248
Query: 243 PLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQ------CDYEIEYADHSSSMGVLAR 296
+ + ++S+ C TC+ C + ++Y D S G L
Sbjct: 249 SCSDGNLDGLYSLEESISSNQLNCSDTSNCNTCKNNKSNKPCPFVLKYGDGSFIAGSLVI 308
Query: 297 DELHLTIEN-------GSLTKPNVVFG---CAYDQQGLLLNTLVKTDGILGLSRAKVS-- 344
D H+TI + G++ K ++ F C Q+ + DGILGLS ++
Sbjct: 309 D--HVTIGDFTVPAKFGNIQKESLSFSQLTCPSTQRSQAVR-----DGILGLSFQQLDPD 361
Query: 345 ----LPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG--HDLVPSWGMAWVPMLDSPFMEL 398
+ S++ + I NV CL GG + +G +D + + P+ DS +
Sbjct: 362 NGDDIFSKIVAHYNIPNVFSMCL---GKDGGLLTIGGTNDHITQETPKYTPIFDSHY--- 415
Query: 399 YHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVL 458
Y + I G+ LNL + + ++ D+G++ YF+ + + ++ +L+E + L
Sbjct: 416 YSITVTNIYVGNDSLNLAPPD--LSTSIVDSGTTLLYFSDEIFYSIVRNLEEKHCE-LPG 472
Query: 459 DASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN 518
+DP W + ++ T+ L S K + P+ Y ++ G
Sbjct: 473 ICNDPF----WEGNCHHLEEKLISEY-PTIYLEMKGMNGEPSFKLEVPPDLYF-LNINGL 526
Query: 519 ICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSH-CMNPGR 568
C GI E+ ++++GD+ L+G V+Y+ N IG+A++H C G
Sbjct: 527 YCFGISHMKEI----SVLIGDVVLQGYNVIYNRENSSIGFARTHGCSTKGN 573
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 75.5 bits (184), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 80/324 (24%), Positives = 128/324 (39%), Gaps = 57/324 (17%)
Query: 190 FPLRGNIYPDG--------LYFTY-------MIVGNPPRPYYLDMDTGSDLTWIQCDAPC 234
FPLR P G L F + + VG PP+ + +DTGS+L+W+ C
Sbjct: 34 FPLRSRQVPVGALPRPPSKLRFHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAT-- 91
Query: 235 SSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSM 291
A A ++PR +P + C P ++C + YAD S+S
Sbjct: 92 GRAAAAAADSFRPRASATFAAVPCGSARCSSRDLPAPPSCDAASRRCRVSLSYADGSASD 151
Query: 292 GVLARDELHLTIENGSLTKPNVVFGC---AYDQQGLLLNTLVKTDGILGLSRAKVSLPSQ 348
G LA D + G FGC AYD V T G+LG++R +S +Q
Sbjct: 152 GALATDVFAV----GDAPPLRSAFGCMSAAYDSS----PDAVATAGLLGMNRGALSFVTQ 203
Query: 349 LASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDS----PFME--LYHTE 402
+++ +C+ ++ G + LGH +P + + P+ P+ + Y +
Sbjct: 204 ASTRRF-----SYCI-SDRDDAGVLLLGHSDLPFLPLNYTPLYQPTPPLPYFDRVAYSVQ 257
Query: 403 ILKINYGSSPLN-----LGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLV 457
+L I G PL L ++ G + D+G+ +T+ AYS + A
Sbjct: 258 LLGIRVGGKPLPIPPSVLAPDHTGAGQTMVDSGTQFTFLLGDAYSAVKAEF--------- 308
Query: 458 LDASDPTLPVCWRAKFPIRSIVDV 481
L + P LP F + D
Sbjct: 309 LKQTKPLLPALEDPSFAFQEAFDT 332
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 75.1 bits (183), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 87/382 (22%), Positives = 158/382 (41%), Gaps = 44/382 (11%)
Query: 206 MIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYKDSLCME 262
+ VG PP+ + +DTGS+L+W+ C+ ++ A P + P + + + C
Sbjct: 70 ITVGTPPQNMSMVIDTGSELSWLHCNT--NTTATIPYPFFNPNISSSYTPISCSSPTCTT 127
Query: 263 IQRNHK-PGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQ 321
R+ P C++ C + YAD SSS G LA D T GS P +VFGC
Sbjct: 128 RTRDFPIPASCDSNNLCHATLSYADASSSEGNLASD----TFGFGSSFNPGIVFGCMNSS 183
Query: 322 QGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVP 381
+ T G++G++ +SL SQL I K +C+ + + G + LG
Sbjct: 184 YSTNSESDSNTTGLMGMNLGSLSLVSQLK---IPK--FSYCI-SGSDFSGILLLGESNF- 236
Query: 382 SWG--MAWVPMLDS----PFME--LYHTEILKINYGSSPLNLGAR-----NSQVGWALFD 428
SWG + + P++ P+ + Y + I LN+ ++ G +FD
Sbjct: 237 SWGGSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHTGAGQTMFD 296
Query: 429 TGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDP------TLPVCWRAKFPIRSIVDVK 482
G+ ++Y Y+ L ++G + DP + +C+R + ++
Sbjct: 297 LGTQFSYLLGPVYNALRDEFLN-QTNGTLRALDDPNFVFQIAMDLCYRVPVNQSELPEL- 354
Query: 483 QFFKTLTLHF-GSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDIS 541
+++L F G++ ++ + G+ V C S++ I+G
Sbjct: 355 ---PSVSLVFEGAEMRVFGDQLLYRVPGF-VWGNDSVYCF-TFGNSDLLGVEAFIIGHHH 409
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+ + +D V R+G A + C
Sbjct: 410 QQSMWMEFDLVEHRVGLAHARC 431
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 83/377 (22%), Positives = 148/377 (39%), Gaps = 39/377 (10%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + +G PP P DTGSDL W QC PC +C + PL+ P+ D
Sbjct: 92 GAYLMNISLGTPPVPMLGIADTGSDLIWRQC-LPCPNCYEQVEPLFDPKESETYKTLDCD 150
Query: 260 CMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGSLTKPNVVFGCA 318
Q + G C+ C Y Y D S + G L+ D L + + E + P + FGC
Sbjct: 151 NEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDPASFPGIAFGCG 210
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCL----TTNAGGGGYMF 374
+D G K G++GL +SL QL+S+ + +CL + + F
Sbjct: 211 HDNGGTFNE---KDGGLIGLGGGPLSLVMQLSSE--VGGQFSYCLVPLSSDSTVSSKINF 265
Query: 375 LGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNS--------QVGWAL 426
+V G P++ Y+ + ++ GS + + + G +
Sbjct: 266 GKSGVVSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEEGNII 325
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFK 486
D+G++ T + Y+++ ++L D + +C+ + +
Sbjct: 326 IDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDP-NGIFSLCYSSVNNLE--------IP 376
Query: 487 TLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQL 546
T+T HF + P V ++ +C ++ S + I G+++ L
Sbjct: 377 TITAHF------TGADVQLPPLNTFVQVQEDLVCFSMIPSSNL-----AIFGNLAQINFL 425
Query: 547 VVYDNVNKRIGWAKSHC 563
V YD N ++ + ++ C
Sbjct: 426 VGYDLKNNKVSFKQTDC 442
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 101/430 (23%), Positives = 173/430 (40%), Gaps = 64/430 (14%)
Query: 168 HKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTW 227
H ++L S+A A + P + ++ G Y + +G PP Y DTGSDL W
Sbjct: 53 HARFAREQLAPSSAAAAGLTVGAPTQKDLRNGGEYIMTLSIGTPPLSYRAIADTGSDLIW 112
Query: 228 IQCDAPC--------SSCAKGANPLYKPRMGN---ILPYKD--SLCMEIQRNHKPGYCET 274
QC APC + C K + LY P +LP S+C + P C
Sbjct: 113 TQC-APCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGPSPPPGCA- 170
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENG--SLTKPNVVFGCAYDQQGLLLNTLVKT 332
C Y Y ++ GV + + + ++ PN+ FGC+ N +
Sbjct: 171 ---CMYNQTYGTGWTA-GVQSVETFTFGSSSTPPAVRVPNIAFGCSNASS----NDWNGS 222
Query: 333 DGILGLSRAKVSLPSQLASQGIIKNVVGHCLT--TNAGGGGYMFLGHDLVPSWGMAWV-- 388
G++GL R +SL SQL + +CLT +A + LG PS A
Sbjct: 223 AGLVGLGRGSMSLVSQLGA-----GAFSYCLTPFQDANSTSTLLLG----PSAAAALKGT 273
Query: 389 -PMLDSPFME---------LYHTEILKINYGSSPLNL-----GARNSQVGWALFDTGSSY 433
P+ +PF+ Y+ + I+ G + L + R G + D+G++
Sbjct: 274 GPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALAIPPDAFSLRADGTGGLIIDSGTTI 333
Query: 434 TYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFG 493
T AY ++ A+++ + L L A P F +++ ++TLHF
Sbjct: 334 TTLVDSAYQQVRAAVRSLLVTRLPL-AHGPDHSTGLDLCFALKASTP-PPAMPSMTLHFE 391
Query: 494 SKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVN 553
+V + E Y+++ G CL + + + G+ ++G+ + V+YD
Sbjct: 392 GGADMV-----LPVENYMILG-SGVWCLAMRNQTV---GAMSMVGNYQQQNIHVLYDVRK 442
Query: 554 KRIGWAKSHC 563
+ + +A + C
Sbjct: 443 ETLSFAPAVC 452
>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 96/414 (23%), Positives = 158/414 (38%), Gaps = 60/414 (14%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPY------ 255
Y + +G PP+ + MDTGSDLTW C C + N M + P
Sbjct: 80 YLISLSIGTPPQVIQVYMDTGSDLTWAPCGNISFDCIECDNYRNNRMMASFSPSHSSSSH 139
Query: 256 ----KDSLCMEIQRNHKPGYCETCQQCD---------------YEIEYADHSSSMGVLAR 296
C+++ + P T C + Y G L R
Sbjct: 140 RDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVVTGTLTR 199
Query: 297 DELHLTIENGSLTK--PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGI 354
D L + N +T+ P FGC + ++ + GI G R +SLPSQL G
Sbjct: 200 DTLRVHGRNLGVTQEIPRFCFGC-------VASSYREPIGIAGFGRGALSLPSQL---GF 249
Query: 355 IKNVVGHCL-----TTNAGGGGYMFLGH-DLVPSWGMAWVPMLDSP-FMELYHTEILKIN 407
++ HC N + +G L M + PML SP + Y+ + I
Sbjct: 250 LRKGFSHCFLAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPMYPNYYYVGLEAIT 309
Query: 408 YGS-----SPLNLGARNS-QVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDAS 461
G+ P +L +S G L D+G++YT+ + YS++++ L+ + + D
Sbjct: 310 VGNVSATEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVLQSIINYPRATDME 369
Query: 462 DPT-LPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTK----FHISPEGYLVISKK 516
T +C++ SI+ ++T HF + +V ++ + +S + K
Sbjct: 370 MRTGFDLCYKVPCQNNSIL-TGDLLPSITFHFLNNASLVLSRGSHFYAMSAPSNSTVVK- 427
Query: 517 GNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGRFK 570
CL + G +LG + VVYD +RIG+ C + F+
Sbjct: 428 ---CLLFQSMDDGDYGPAGVLGSFQQQDVEVVYDMEKERIGFRPMDCASAASFQ 478
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 72/242 (29%), Positives = 99/242 (40%), Gaps = 24/242 (9%)
Query: 219 MDTGSDLTWIQCDAPCSS--CAKGANPLYKPRMGN---ILPYKDSLCMEIQRNHKPGYCE 273
+DT SD+ W+QC APC + C + LY P + P C RN P Y
Sbjct: 160 IDTASDVPWVQC-APCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPAC----RNLGP-YAN 213
Query: 274 TC----QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYD--QQGLLLN 327
C QC Y ++Y D S+S G D L L + FGC++ Q G N
Sbjct: 214 GCTPAGDQCQYRVQYPDGSASAGTYISDVLTLNPAKPASAISEFRFGCSHALLQPGSFSN 273
Query: 328 TLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAW 387
KT GI+ L R SLP+Q ++ +V +CL G+ LG V + A
Sbjct: 274 ---KTSGIMALGRGAQSLPTQ--TKATYGDVFSYCLPPTPVHSGFFILGVPRVAASRYAV 328
Query: 388 VPMLDSPFME-LYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIA 446
PML S LY ++ I L + G A+ D+ + T AY L A
Sbjct: 329 TPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFAAG-AVMDSRTIVTRLPPTAYMALRA 387
Query: 447 SL 448
+
Sbjct: 388 AF 389
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 158/375 (42%), Gaps = 49/375 (13%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS---SCAKGANPLYKPRMGN---ILPY 255
Y +G P +++DTGSDL+W+QC PCS SC +PL+ P + +P
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCSAAPSCYSQKDPLFDPAQSSSYAAVPC 198
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
+C + + QC Y + Y D S++ GV + D L L+ + F
Sbjct: 199 GGPVCAGLGIYAA--SACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS---AVQGFFF 253
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC + Q GL DG+LGL R + SL Q A G V +CL T GY+ L
Sbjct: 254 GCGHAQSGLFNG----VDGLLGLGREQPSLVEQTA--GTYGGVFSYCLPTKPSTAGYLTL 307
Query: 376 G----HDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQVGWALFDTG 430
G P G + +L SP Y+ +L I+ G L++ A ++ G + DTG
Sbjct: 308 GVGGPSGAAP--GFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPA-SAFAGGTVVDTG 364
Query: 431 SSYTYFTKQAYSELIASLKE-VSSDGLVLDASDPTLPVCWR-AKFPIRSIVDVKQFFKTL 488
+ T AY+ L ++ + ++S G S+ L C+ A + ++ +V
Sbjct: 365 TVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVA------ 418
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
L FGS + I G L + G+ +G ILG++ R V
Sbjct: 419 -LTFGSGATVTLGADGILSFGCLAFAPSGS------------DGGMAILGNVQQRSFEVR 465
Query: 549 YDNVNKRIGWAKSHC 563
D + +G+ S C
Sbjct: 466 IDGTS--VGFKPSSC 478
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 80/310 (25%), Positives = 139/310 (44%), Gaps = 47/310 (15%)
Query: 156 VVASVNDGIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPY 215
+ +S+N ++ P K+++ + +V S P R NI + VG PP+
Sbjct: 36 LCSSLNPALVLPLKTQV----IPPESVR-RSPDKLPFRHNIS----LTVSLTVGTPPQNV 86
Query: 216 YLDMDTGSDLTWIQCDAPCSSCAKGA--NPLYKPRMGNILPYKDSLCMEIQRNH--KPGY 271
+ +DTGS+L+W+ C+ +S + + NP++ I P S C + R+ +P
Sbjct: 87 TMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSSSYSPI-PCSSSTCTDQTRDFPIRPS- 144
Query: 272 CETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVK 331
C++ Q C + YAD SSS G LA D ++ GS PNVVFGC K
Sbjct: 145 CDSNQFCHATLSYADASSSEGNLATDTFYI----GSSGIPNVVFGCMDSIFSSNSEEDSK 200
Query: 332 TDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWV-PM 390
G++G++R +S SQ+ G K +C++ Y F G L+ +W+ P+
Sbjct: 201 NTGLMGMNRGSLSFVSQM---GFPK--FSYCISE------YDFSGLLLLGDANFSWLAPL 249
Query: 391 LDSPFMEL-------------YHTEILKINYGSSPLN---LGARNSQVGWALFDTGSSYT 434
+P +E+ E +K+ + P+ ++ G + D+G+ +T
Sbjct: 250 NYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMVDSGTQFT 309
Query: 435 YFTKQAYSEL 444
+ AY+ L
Sbjct: 310 FLLGPAYTAL 319
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 91/390 (23%), Positives = 160/390 (41%), Gaps = 62/390 (15%)
Query: 199 DGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSC---AKGANPLYKPRMGNI--L 253
+G Y + +G PP+ +DTGSDL W++CD C C G + + L
Sbjct: 2 EGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDN-CDHCDLDHHGETIFFSDASSSYKKL 60
Query: 254 PYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDEL----HLTIENGSLT 309
P + C + CE + C Y+ EY D S + G + D + H E+
Sbjct: 61 PCNSTHCSGMSSAGIGPRCE--ETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSF 118
Query: 310 KPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA-- 367
+FGC +G T G++GL + SL QL + + +CL +
Sbjct: 119 FDGFLFGCGRKLKG----DWNFTQGLIGLGQKSHSLIQQLGDK--LGYKFSYCLVSYDSP 172
Query: 368 -GGGGYMFL-------GHDLVPSWGMAWVPMLDSPFME--LYHTEILKINYGSSPLNL-- 415
++FL GHD+V + P+L ++ LY+ ++ I G P+ +
Sbjct: 173 PSAKSFLFLGSSAALRGHDVVST------PILHGDHLDQTLYYVDLQSITVGGVPVVVYD 226
Query: 416 --GARNSQVG-----WALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVC 468
N+ VG + D+G++YT T Y + S++E ++L PTL
Sbjct: 227 KESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQ----VIL----PTLGNS 278
Query: 469 WRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSE 528
S D F ++T +F ++ Q+V + E ++ + +CL + +
Sbjct: 279 AGLDLCFNSSGDTSYGFPSVTFYFANQVQLV-----LPFENIFQVTSRDVVCLSM----D 329
Query: 529 VHNGSTIILGDISLRGQLVVYDNVNKRIGW 558
G I+G++ + ++YD V +I +
Sbjct: 330 SSGGDLSIIGNMQQQNFHILYDLVASQISF 359
>gi|147859621|emb|CAN83119.1| hypothetical protein VITISV_043393 [Vitis vinifera]
Length = 431
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/276 (27%), Positives = 114/276 (41%), Gaps = 37/276 (13%)
Query: 190 FPLRGNIYPD--GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP 247
PL G P+ GLY+ + +G P R YY+ M+ LT + +
Sbjct: 84 LPLGGTGRPEAVGLYYAKIGIGTPARDYYVQME----LTLYD--------------IKES 125
Query: 248 RMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS 307
G ++ C I P YC C Y YAD SSS G + + N S
Sbjct: 126 LTGKLVSCDQDFCYAINGG-PPSYCIANMSCSYTEIYADGSSSFGYFVKGYCTASKYN-S 183
Query: 308 LTKPN------VVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGH 361
+ N V C+ Q G L ++ DGILG ++ S+ SQLAS G ++ + H
Sbjct: 184 IPHLNNNPLLEVPLRCSATQSGDL-SSEEALDGILGFGKSNTSMISQLASSGKVRKMFAH 242
Query: 362 CLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQ 421
CL GGG +GH + P + P++ P Y+ + + G LNL
Sbjct: 243 CLDG-LNGGGIFAIGHIVQPK--VNTTPLV--PNQTHYNVNMKAVEVGGYFLNLPTDVFD 297
Query: 422 VG---WALFDTGSSYTYFTKQAYSELIASLKEVSSD 454
VG + D+G++ Y + Y +L++ + SD
Sbjct: 298 VGDKKGTIIDSGTTLAYLPEVVYDQLLSKIFSWQSD 333
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 99/414 (23%), Positives = 159/414 (38%), Gaps = 76/414 (18%)
Query: 197 YPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP------------------CSSCA 238
Y D Y + VG PP + DTGSDL W++C+
Sbjct: 77 YGDFEYLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPP 136
Query: 239 KGANPLYKP-------RMGNILPYKDSLCMEIQRNHKPGYCE-TCQQCDYEIEYADHSSS 290
A + P R+G P C+ + N C CD+ Y D +S+
Sbjct: 137 PEAVVYFNPFDSSSYSRVGCDGPS----CLALATNAS---CNGDSHACDFRYSYRDGASA 189
Query: 291 MGVLARDELHL--TIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQ 348
G+LA D I N + + ++ FGCA G + DG++GL +SL SQ
Sbjct: 190 TGLLAADTFTFGGNINNDTTSTASIDFGCATGTAGREF----QADGMVGLGAGPLSLASQ 245
Query: 349 LASQGIIKNVVGHCLTT---NAGGGGYMFLGHDLVPSWGMAWVPMLDSP--FMELYHTEI 403
L + CLT + F +V G A P++ S Y I
Sbjct: 246 LGRK------FSFCLTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISI 299
Query: 404 LKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDAS-- 461
+ P+ + V + DTG+ T+ + A L+A L E S V+D +
Sbjct: 300 DSLKVAGQPV---PGTTSVSKVIVDTGTVLTFLDRAA---LLAPLTE--SLARVMDGAGL 351
Query: 462 ------DPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISK 515
D TL +C+ + + DV +TL + ++ EG V+ K
Sbjct: 352 PRAPPPDETLELCYD----VSRVKDVDGVIPDVTLVL---GGGGGGEVRLTGEGTFVLVK 404
Query: 516 KGNICLGILDGS-EVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMNPGR 568
+G +CL ++ S E+ S +LG+++L+ V D + +A ++C + R
Sbjct: 405 EGVLCLAVVTTSPELQPLS--VLGNVALQDLHVGIDLDARTATFATANCDSSSR 456
>gi|213998800|gb|ACJ60767.1| nucellin [Hordeum marinum subsp. marinum]
Length = 142
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 46/139 (33%), Positives = 75/139 (53%), Gaps = 9/139 (6%)
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAGGGGYMFL 375
C Y Q+ + DGILGL K +QL Q +I NV+GHCL++ G G +++
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSK--GKGVLYV 58
Query: 376 GHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTY 435
G+ PS G+ WVPM +S F Y + ++ + P+ R + A+FD+GS+YT
Sbjct: 59 GNFNPPSRGVTWVPMRESSF--YYSPGLAELLIDNQPI----RGNPTFEAVFDSGSTYTL 112
Query: 436 FTKQAYSELIASLKEVSSD 454
Q Y+E+++ ++ S+
Sbjct: 113 VPSQIYNEIVSKVRGTLSE 131
>gi|328875414|gb|EGG23778.1| putative aspartyl protease [Dictyostelium fasciculatum]
Length = 507
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 105/392 (26%), Positives = 170/392 (43%), Gaps = 56/392 (14%)
Query: 191 PLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMG 250
P+ G+++ T +IVGN + + +DTGS L I + C++C + + P+Y P
Sbjct: 114 PMTGDLFQIN---TQIIVGNTT--FLVQVDTGSLLMAIPLEG-CNTCVE-SRPVYHPSST 166
Query: 251 NILPYKDSLCMEIQRNHKPGYCETC--QQCDYEIEYADHSSSMGVLARDELHLTIENGSL 308
+ S + + P T + CD++I Y D S G + D ++L G
Sbjct: 167 STKVACSSDQCKGSGSTPPSCSRTSSGESCDFQIRYGDGSHVSGYIYEDVVNLAGLQG-- 224
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVS-LPS---QLASQGIIKNVVGHCLT 364
K N FG ++ G + DGI+G R S +P+ L S +KN G L
Sbjct: 225 -KAN--FGANDEETGDF--EYPRADGIIGFGRTCSSCVPTVWDSLVSDLGLKNQFGMLL- 278
Query: 365 TNAGGGGYMFLGHDLVPSW--GMAWVPML--DSPFMELYHTEILKINYGSSPLNLGARNS 420
N GGG + LG + + + P++ ++PF + T I +IN + P G++
Sbjct: 279 -NYEGGGSLSLGEINTSYYTGDIRYTPLVQKNTPFYSVKSTGI-RINDYTIP---GSKLG 333
Query: 421 QVGWALFDTGSSYTYFTKQAYSEL-------IASLKEVSSDGLVLDASDPTLPVCWRAKF 473
Q + D+GS+ AY +L S++ V + + S +C+ +
Sbjct: 334 Q--EVIVDSGSTALSLASGAYDQLRNYFQTHYCSIQGVCENPNIFQGS-----ICYSSD- 385
Query: 474 PIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGS 533
DV F TL F Q+ I P+ YLV + N G E + +
Sbjct: 386 ------DVLSKFPTLYFTFDGGVQVA-----IPPKNYLVKAPLTNGKYGYCFMIERADST 434
Query: 534 TIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
ILGD+ +RG V+DNVN R+G+A M+
Sbjct: 435 MTILGDVFMRGYYTVFDNVNDRVGFAVGANMS 466
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 83/164 (50%), Gaps = 12/164 (7%)
Query: 195 NIYPDG---LYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKG-ANPLYKPRMG 250
N++P L+ +G PP P MDTGS L WIQC APC SC++ P++ P +
Sbjct: 92 NLHPSASEPLFLVNFSMGQPPVPQLAIMDTGSSLLWIQC-APCKSCSQQIIGPMFDPSIS 150
Query: 251 NILPYKDSLCMEIQRNHKP-GYCETCQQCDYEIEYADHSSSMGVLARDELHL-TIENGSL 308
+ Y C I + P G C++ QC Y Y + S+GV+A ++L + + G
Sbjct: 151 S--TYDSLSCKNIICRYAPSGECDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRN 208
Query: 309 TKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQ 352
NV+FGC++ + G + + G+ GL S+ +Q+ S+
Sbjct: 209 AVNNVLFGCSH-RNGNYKDR--RFTGVFGLGSGITSVVNQMGSK 249
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/259 (27%), Positives = 116/259 (44%), Gaps = 20/259 (7%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPC---SSCAKGANPLYKPRMGNILPY--- 255
Y + +G+P + +DTGSD++W+QC+ PC S C A L+ P +
Sbjct: 108 YVISVGLGSPAVTQRVVIDTGSDVSWVQCE-PCPAPSPCHAHAGALFDPAASSTYAAFNC 166
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
+ C ++ + + C+ +C Y ++Y D S++ G + D L L +GS F
Sbjct: 167 SAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTL---SGSDVVRGFQF 223
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC++ + G ++ KTDG++GL S SQ A++ +CL G++ L
Sbjct: 224 GCSHAELGAGMDD--KTDGLIGLGGDAQSPVSQTAAR--YGKSFFYCLPATPASSGFLTL 279
Query: 376 GHDLVPSWG----MAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQVGWALFDTG 430
G G A PML S + Y+ L+ I G L L G +L D+G
Sbjct: 280 GAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAAG-SLVDSG 338
Query: 431 SSYTYFTKQAYSELIASLK 449
+ T AY+ L ++ +
Sbjct: 339 TVITRLPPAAYAALSSAFR 357
>gi|213998802|gb|ACJ60768.1| nucellin [Hordeum murinum subsp. glaucum]
Length = 142
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/139 (33%), Positives = 74/139 (53%), Gaps = 9/139 (6%)
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAGGGGYMFL 375
C Y Q+ + DGILGL K QL Q +IK N++GHCL++ G G +++
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAVQLKGQKMIKENIIGHCLSSK--GKGVLYV 58
Query: 376 GHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTY 435
G PS G+ WVPM +S F Y + ++ + P+ R + A+FD+GS+YT+
Sbjct: 59 GDFNPPSRGVTWVPMRESLF--YYSPGLAELLIDNQPI----RGNPTFEAVFDSGSTYTH 112
Query: 436 FTKQAYSELIASLKEVSSD 454
YSE+++ ++ S+
Sbjct: 113 VPAHIYSEIVSKVRGTLSE 131
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 96/435 (22%), Positives = 163/435 (37%), Gaps = 60/435 (13%)
Query: 157 VASVNDGIIRPHKSKINKKLVSSNAVAVD-----SSSIFPL-----RGNIYPDGLYFTYM 206
++ V+DG + + + +V S A A + ++ P R N + Y ++
Sbjct: 37 LSHVDDGRGFTKRELLRRMVVRSRARAANLCPYSGATARPATAPVGRANTDVNSEYLIHL 96
Query: 207 IVGNP-PRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQR 265
+G P +P L +DTGSD+ W QC+ PC+ C P + N + + C +
Sbjct: 97 SIGAPRSQPVVLTLDTGSDVVWTQCE-PCAECFTQPLPRFDTAASNTV--RSVACSDPLC 153
Query: 266 NHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLT--IENGSLTKPNVVFGCAYDQQG 323
N + C Y Y D S S G RD G +T P++ FGC G
Sbjct: 154 NAHSEHGCFLHGCTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAG 213
Query: 324 LLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTN-AGGGGYMFLG--HDLV 380
L T GI G R +SLPSQL + +C TT +FLG DL
Sbjct: 214 RFLQT---ETGIAGFGRGPLSLPSQLKVRQF-----SYCFTTRFEAKSSPVFLGGAGDLK 265
Query: 381 PSWGMAWVPMLDSPFMEL---------YHTEILKINYGSSPLNL-GARNSQVGWALFDTG 430
A P+L +PF+ Y + G + L + + G D+G
Sbjct: 266 ---AHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRLPVPEIKADGSGATFIDSG 322
Query: 431 SSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTL 490
+ T F + +L ++ ++ + A + + W K L
Sbjct: 323 TDITTFPDAVFRQLKSAFIAQAALPVNKTADEDDICFSWDGK--------KTAAMPKLVF 374
Query: 491 HF-GSKWQIVSTKFHISPEGYLVISKK-GNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
H G+ W + E Y+ ++ G +C+ + ++ ++G+ + +V
Sbjct: 375 HLEGADWDLPR-------ENYVTEDRESGQVCVAVSTSGQMDR---TLIGNFQQQNTHIV 424
Query: 549 YDNVNKRIGWAKSHC 563
YD ++ + C
Sbjct: 425 YDLAAGKLLLVPAQC 439
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 90/388 (23%), Positives = 151/388 (38%), Gaps = 59/388 (15%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS--SCAKGANPLY---KPRMGNILPYK 256
Y ++G+PP+ +DTGS+L W QC C +CAK P Y + +P
Sbjct: 84 YIAEYLIGDPPQRAAALIDTGSNLIWTQCGTTCGLKACAKQDLPYYNLSRSSTFAAVPCA 143
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHS--SSMGVLARDELHLTIENGSLTKPNVV 314
DS ++ + C C + Y S S+G A T ++G+ +
Sbjct: 144 DS--AKLCAANGVHLCGLDGSCTFAASYGAGSVFGSLGTEA-----FTFQSGA---AKLG 193
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGG 371
FGC + + L G++GL R ++SL SQ + +CLT N G
Sbjct: 194 FGCVSLTR-ITKGALNGASGLIGLGRGRLSLVSQTGA-----TKFSYCLTPYLRNHGASS 247
Query: 372 YMFLGHDLVPSWGMAWVPML-------DSPFMELYHTEILKINYGSSPLNLGARNSQV-- 422
++F+G S G V + D P+ Y+ ++ I+ G + L + + ++
Sbjct: 248 HLFVGASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLVGISVGETKLPIPSAAFELRR 307
Query: 423 -------GWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPI 475
G + DTGS T + AYS L + + LV +D L +C +
Sbjct: 308 VAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLNRSLVQPPADTGLDLCVARQ--- 364
Query: 476 RSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI 535
DV + L HFG + +S Y K C+ I +G G
Sbjct: 365 ----DVDKVVPVLVFHFGGGADMA-----VSAGSYWGPVDKSTACMLIEEG-----GYET 410
Query: 536 ILGDISLRGQLVVYDNVNKRIGWAKSHC 563
++G+ + ++YD + + + C
Sbjct: 411 VIGNFQQQDVHLLYDIGKGELSFQTADC 438
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 93/395 (23%), Positives = 159/395 (40%), Gaps = 56/395 (14%)
Query: 206 MIVGNPPRPYYLDMDTGSDLTWIQC-----DAPCSSCAKGANPLYKPRMGNILPYKDSLC 260
+ VG PP+ + +DTGS+L+W+ C DAP + A + Y P +P C
Sbjct: 67 VAVGTPPQNVTMVLDTGSELSWLLCNGSRHDAPFDASASSS---YAP-----VPCSSPAC 118
Query: 261 MEIQRNH--KPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCA 318
+ R+ +P +C++ C + YAD SS+ G+LA D L GS P +FGC
Sbjct: 119 TWLGRDLPVRP-FCDS-SACRVSLSYADASSADGLLAADTFLL----GSSPMP-ALFGCI 171
Query: 319 YDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHD 378
+ G+LG++R +S +Q A++ +C+ G G + G+D
Sbjct: 172 TSYSSSTDPSETPPTGLLGMNRGGLSFVTQTATRRF-----AYCIAAGQGPGILLLGGND 226
Query: 379 LV------PSWGMAWVPMLDS----PFME--LYHTEILKINYGSSPLN-----LGARNSQ 421
P + + P+++ P+ + Y ++ I GS+ L L ++
Sbjct: 227 TETPLTSPPQQQLNYTPLVEISQPLPYFDRAAYTVQLEGIRVGSALLAIPKHLLTPDHTG 286
Query: 422 VGWALFDTGSSYTYFTKQAYSELIASLKE---VSSDGLVLDASDP------TLPVCWRAK 472
G + D+G+ +T+ AY+ L A S DG + +P C+R
Sbjct: 287 AGQTMVDSGTRFTFLLPDAYAALKAEFANQLTRSLDGGLAPLGEPGFVFQGAFDACFRGT 346
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIV--STKFHISPEGYLVISKKGNICLGILDGSEVH 530
S + L +V + K G +G CL S++
Sbjct: 347 EARVSAAAAGGLLPEVGLVLRGAEVVVAGAEKLLYRVPGERRGEGEGVWCL-TFGSSDMA 405
Query: 531 NGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
S ++G + V YD N R+G+A + C +
Sbjct: 406 GVSAYVIGHHHQQDVWVEYDLRNARLGFAAARCAD 440
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 96/393 (24%), Positives = 150/393 (38%), Gaps = 49/393 (12%)
Query: 185 DSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS--CAKGAN 242
D+ S+ G+ Y Y + +G P P L +DTGS LTW+QC PC+S C
Sbjct: 112 DAVSVPTQLGSSYDSQEYVATVGLGTPAVPQTLILDTGSSLTWVQCK-PCNSSQCYPQRL 170
Query: 243 PLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQ--CDYEIEYADHSSSMGVLARD 297
PL+ P + +P C + C + C YEI Y ++ G + D
Sbjct: 171 PLFDPNTSSSYSPVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTD 230
Query: 298 ELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK- 356
LT+ G++ K FGC + QQ DG+LGL R LP LA Q +
Sbjct: 231 A--LTLGPGAIVK-RFHFGCGHHQQ---RGKFDMADGVLGLGR----LPQSLAWQASARR 280
Query: 357 --NVVGHCLTTNAGGGGYMFLG--HDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSP 412
V HCL G++ LG HD + M D P+ Y I+
Sbjct: 281 GGGVFSHCLPPTGVSTGFLALGAPHDTSAFVFTPLLTMDDQPW--FYQLMPTAISVAGQL 338
Query: 413 LNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAK 472
L++ + G + D+G+ + + AY+ L + + A+
Sbjct: 339 LDIPPAVFREG-VITDSGTVLSALQETAYTALRTAFRSA------------------MAE 379
Query: 473 FPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDGSEVH 530
+P+ V T + VS F +L S + CL + +
Sbjct: 380 YPLAPPVGHLDTCFNFTGYDNVTVPTVSLTFRGGATVHLDASSGVLMDGCLAFWSSGDEY 439
Query: 531 NGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G ++G +S R V+YD +++G+ C
Sbjct: 440 TG---LIGSVSQRTIEVLYDMPGRKVGFRTGAC 469
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 87/386 (22%), Positives = 151/386 (39%), Gaps = 52/386 (13%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSL 259
G Y + +GNP DTGSDL W+QC PC C K +P++ PR + Y++ L
Sbjct: 91 GEYLMRISIGNPQVEILAIADTGSDLIWVQCQ-PCEMCYKQNSPIFDPRRSS--SYRNVL 147
Query: 260 CMEIQRNHKPGYCETC------QQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKP-- 311
C N G +C + C Y Y D S S G LA + + N + +
Sbjct: 148 CGNEFCNKLDGEARSCDARGFVKTCGYTYSYGDQSFSDGHLAIERFGIGSTNSNTSAAIA 207
Query: 312 ---NVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG 368
V FGC G +GL +SL SQL + + +CL +
Sbjct: 208 YFQEVAFGCGTKNGGTFDELGSGI---IGLGGGSMSLVSQLGPK--LSGKFSYCLVPTSE 262
Query: 369 GGGY---MFLGHDLVPS---WGMAWVPMLDSPFMELYHTEILKINYGSSPL---NLGARN 419
Y + G+D+ S + + P+L Y+ + I+ + L NL
Sbjct: 263 QSNYTSKINFGNDINISGSNYNVVSTPLLPKKPETYYYLTLEAISVENKRLPYTNLWNGE 322
Query: 420 SQVGWALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDP--TLPVCWRAKFPIRS 477
+ G + D+G++ T+ + ++ L ++++E V SDP +C++ + I
Sbjct: 323 VEKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKGERV---SDPHGLFNICFKDEKAIE- 378
Query: 478 IVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIIL 537
+T HF + P ++ +C ++ +++ I
Sbjct: 379 -------LPIITAHF------TGADVELQPVNTFAKVEEDLLCFTMIPSNDIA-----IF 420
Query: 538 GDISLRGQLVVYDNVNKRIGWAKSHC 563
G+++ LV YD K + + + C
Sbjct: 421 GNLAQMNFLVGYDLEKKAVSFLPTDC 446
>gi|330842955|ref|XP_003293432.1| hypothetical protein DICPUDRAFT_158270 [Dictyostelium purpureum]
gi|325076242|gb|EGC30045.1| hypothetical protein DICPUDRAFT_158270 [Dictyostelium purpureum]
Length = 484
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 100/419 (23%), Positives = 173/419 (41%), Gaps = 73/419 (17%)
Query: 165 IRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSD 224
I+P + +S N + + + N+Y G + + L +DTGS
Sbjct: 57 IKPSVGHYQRNPLSKNVDLEMQGNFYQINANVYIGG------------QKFILQVDTGST 104
Query: 225 LTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYKDSLCMEIQRNHKPGYCETCQQ---- 277
LT I C++C +G P+Y P + N ++P C+ C Q
Sbjct: 105 LTAIPL-KNCNNC-RGERPVYNPEISNSSILIPCSSDHCL--GSGSAAPSCRLHQSSKSS 160
Query: 278 CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILG 337
CD+ I Y D S G + DE+ + NG K FG ++ G + DGI+G
Sbjct: 161 CDFVILYGDGSKVRGKIYSDEITM---NG--VKSIGFFGANVEEVGTF--EYPRADGIMG 213
Query: 338 LSRAKVS-------LPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSW---GMAW 387
L R + S + + +KNV G + + G G++ LG + P++ + +
Sbjct: 214 LGRTGNNKNLVPTIFESMVRANSSMKNVFG--IYLDYQGQGHLSLGR-INPNFYVGEIEY 270
Query: 388 VPML-DSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIA 446
P++ + PF + T +I+ N S +G + D+G+S + + Y LIA
Sbjct: 271 TPVVQNGPFYSIKPTS-FRIS------NTSFLASSLGQVIVDSGTSDIILSGKIYDHLIA 323
Query: 447 SLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLT-LHFGSKWQIVSTKFHI 505
+ + DP RA F + ++ F++ LHFG + + I
Sbjct: 324 FFRRHYCH--IDMVCDPISIFTGRACF------EREEDFESFPWLHFGFSGGV---RIAI 372
Query: 506 SPEGYLVISKKGN-----ICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWA 559
P+ Y++ ++ C GI G ++ ILGD+ +RG ++DN R+G+A
Sbjct: 373 PPKNYMIKTQSTQPGVYGYCWGIDRGEDM-----TILGDVFMRGYYTIFDNEENRVGFA 426
>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
Length = 280
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 68/136 (50%), Gaps = 9/136 (6%)
Query: 192 LRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN 251
+ G G YF+ + +G PP Y+ +DTGSD++W+QC APC+ C + A+P+++P
Sbjct: 122 ISGTSQGSGEYFSRIGIGEPPSQAYMVLDTGSDISWVQC-APCADCYRQADPIFEPTAS- 179
Query: 252 ILPYKDSLCMEIQ-RNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
Y C Q R C C Y++ Y D S ++G D + T+ G
Sbjct: 180 -ASYAPLSCEAAQCRYLDQSQCRN-GNCLYQVSYGDGSYTVG----DFVTETVTIGVNKV 233
Query: 311 PNVVFGCAYDQQGLLL 326
NV GC ++ +GL +
Sbjct: 234 KNVALGCGHNNEGLFV 249
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 101/375 (26%), Positives = 158/375 (42%), Gaps = 49/375 (13%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS---SCAKGANPLYKPRMGN---ILPY 255
Y +G P +++DTGSDL+W+QC PC+ SC +PL+ P + +P
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCAAAPSCYSQKDPLFDPAQSSSYAAVPC 198
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
+C + + QC Y + Y D S++ GV + D L L+ + F
Sbjct: 199 GGPVCAGLGIYAA--SACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS---AVQGFFF 253
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC + Q GL DG+LGL R + SL Q A G V +CL T GY+ L
Sbjct: 254 GCGHAQSGLFNG----VDGLLGLGREQPSLVEQTA--GTYGGVFSYCLPTKPSTAGYLTL 307
Query: 376 G----HDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQVGWALFDTG 430
G P G + +L SP Y+ +L I+ G L++ A ++ G + DTG
Sbjct: 308 GVGGPSGAAP--GFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPA-SAFAGGTVVDTG 364
Query: 431 SSYTYFTKQAYSELIASLKE-VSSDGLVLDASDPTLPVCWR-AKFPIRSIVDVKQFFKTL 488
+ T AY+ L ++ + ++S G S+ L C+ A + ++ +V
Sbjct: 365 TVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVA------ 418
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
L FGS + I G L + G+ +G ILG++ R V
Sbjct: 419 -LTFGSGATVTLGADGILSFGCLAFAPSGS------------DGGMAILGNVQQRSFEVR 465
Query: 549 YDNVNKRIGWAKSHC 563
D + +G+ S C
Sbjct: 466 IDGTS--VGFKPSSC 478
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 99/412 (24%), Positives = 160/412 (38%), Gaps = 82/412 (19%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGN---ILPYK 256
G Y + +G P + +DT SDL W QC PC C K +P++ P ++P
Sbjct: 86 GEYLVKLGLGTPQHCFTAAIDTASDLIWTQCQ-PCVKCYKQLDPVFNPVASTSYAVVPCN 144
Query: 257 DSLCMEI--QRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVV 314
C E+ R + G + C Y Y ++++ G+LA D L + G VV
Sbjct: 145 SDTCDELDTHRCARDGDSDDEDACQYTYSYGGNATTRGILAVDRLAI----GDDVFRGVV 200
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAG-GGGYM 373
FGC+ G + G++GL R +SL SQL+ + + +CL G +
Sbjct: 201 FGCSSSSVG---GPPPQVSGVVGLGRGALSLVSQLSVRRFM-----YCLPPPVSRSAGRL 252
Query: 374 FLGHDLVPSWGMA----WVPM-LDSPFMELYHTEILKINYGSSPLNLGARN----SQVGW 424
LG D + A VPM S + Y+ + I+ G ++ +RN + G
Sbjct: 253 VLGADAAATVRNASERVVVPMSTGSRYPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGT 312
Query: 425 A--------------------------LFDTGSSYTYFTKQAYSELIASLKEVSSDGLVL 458
A + D S+ T+ + Y E++ L+E + L
Sbjct: 313 AAGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLYEEMVDDLEE----EIRL 368
Query: 459 ---DASDPTLPVCWRAKFPIRSIVDVKQFFK-TLTLHFGSKWQIVSTKFHISPEGYLVIS 514
SD L +C F + V + + + ++L F W + E V
Sbjct: 369 PRGSGSDLGLDLC----FILPEGVPMSRVYAPPVSLAFEGVW------LRLDKEQMFVED 418
Query: 515 K-KGNICL--GILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
+ G +CL G DG ILG+ + V+Y+ RI + K+ C
Sbjct: 419 RASGMMCLMVGKTDGVS-------ILGNYQQQNMQVMYNLRRGRITFIKTAC 463
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 90/382 (23%), Positives = 153/382 (40%), Gaps = 46/382 (12%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGAN-PLYKPRMGNILPYKDSLC 260
Y +G PP+ + +D +D W+ C A C CA GA+ P + P + Y+ C
Sbjct: 100 YVARARLGTPPQTLLVAIDPSNDAAWVPCSA-CLGCAPGASSPSFDPTQSST--YRPVRC 156
Query: 261 MEIQRNHKPGYCETC-----QQCDYEIEYADHSSSMGVLARDELHLTIENGS-LTKPNVV 314
Q P +C C + + YA S+ VL +D L L+ NG+ + +
Sbjct: 157 GAPQCAQVPPATPSCPAGPGASCAFNLSYAS-STLHAVLGQDALSLSDSNGAAVPDDHYT 215
Query: 315 FGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT--NAGGGGY 372
FGC G V G++G R +S SQ ++ ++ +CL + ++ G
Sbjct: 216 FGCLRVVTGS--GGSVPPQGLVGFGRGPLSFLSQ--TKATYGSIFSYCLPSYKSSNFSGT 271
Query: 373 MFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYG-------SSPLNLGARNSQVGW 424
+ LG P + P+L +P LY+ ++ + +S L L A + G
Sbjct: 272 LRLGPAGQPRR-IKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGR-GG 329
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+ D G+ +T + AY+ L + + S P P F V+ +
Sbjct: 330 TIVDAGTMFTRLSPPAYAALRNAFRR--------GVSAPAAPAL--GGFDTCYYVNGTKS 379
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKK--GNICLGILDG-SEVHNGSTIILGDIS 541
+ F ++ PE +VIS G CL + G S+ N +L +
Sbjct: 380 VPAVAFVFAGGARVTL------PEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQ 433
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+ VV+D N R+G+++ C
Sbjct: 434 QQNHRVVFDVGNGRVGFSRELC 455
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 99/379 (26%), Positives = 150/379 (39%), Gaps = 55/379 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPC-SSCAKGANPLYKPRMGNI---LPY 255
G Y +G PP+ DTGSDL W +C C +SC +P Y P + LP
Sbjct: 89 GAYDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPC 148
Query: 256 KDSLCMEIQRNHKPGYCETC-QQCDYEIEYA----DHSSSMGVLARDELHLTIENGSLTK 310
D LC + R+ +C +CDY Y DH + G LAR+ L G+
Sbjct: 149 SDRLC-SLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTL----GADAV 203
Query: 311 PNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGG 370
P+V FGC +G + L R +SL SQL + + +CLT++A
Sbjct: 204 PSVRFGCTTASEGGYGSGSGLVG----LGRGPLSLVSQLNASTFM-----YCLTSDASKA 254
Query: 371 GYMFLGHDLVPSWGMAWVP---MLDSPFMELYHTEILKINYGSSPLNLGARNSQVG---W 424
+ G + S A V +L S Y + I+ GS A VG
Sbjct: 255 SPLLFGS--LASLTGAQVQSTGLLAS--TTFYAVNLRSISIGS------ATTPGVGEPEG 304
Query: 425 ALFDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQF 484
+FD+G++ TY + AYSE A+ +S V D C++ R
Sbjct: 305 VVFDSGTTLTYLAEPAYSEAKAAFLSQTSLDQVEDTDG--FEACFQKPANGRL---SNAA 359
Query: 485 FKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRG 544
T+ LHF + Y+V + G +C + + S I+G+I
Sbjct: 360 VPTMVLHFDGA------DMALPVANYVVEVEDGVVCWIV-----QRSPSLSIIGNIMQVN 408
Query: 545 QLVVYDNVNKRIGWAKSHC 563
LV++D + + ++C
Sbjct: 409 YLVLHDVHRSVLSFQPANC 427
>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
Length = 367
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 54/190 (28%), Positives = 89/190 (46%), Gaps = 20/190 (10%)
Query: 175 KLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPC 234
+ S+ V + I P G Y + +G PP + +DT SDL W QC PC
Sbjct: 68 EAASARKAVVAETPIMPAGGE------YLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PC 120
Query: 235 SSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSM 291
+ C +P++ PR+ + LP C E+ H+ G+ + + C Y Y+ ++++
Sbjct: 121 TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELD-VHRCGH-DDDESCQYTYTYSGNATTE 178
Query: 292 GVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLAS 351
G LA D+L + G V FGC+ G + G++GL R +SL SQL+
Sbjct: 179 GTLAVDKLVI----GEDAFRGVAFGCSTSSTGGAPPP--QASGVVGLGRGPLSLVSQLSV 232
Query: 352 Q--GIIKNVV 359
+ G+I ++
Sbjct: 233 RRYGMIIDIA 242
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 90/382 (23%), Positives = 153/382 (40%), Gaps = 58/382 (15%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPC---SSCAKGANPLYKPRMGNILPYK 256
G YF + VG P + Y+ DTGSD++W+QC PC + C K P++ P+ +
Sbjct: 182 GEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ-PCDGENGCYKQIGPIFDPKSSSSY--S 238
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
C Q + C YE+EY D S ++G LA + N + PN+ G
Sbjct: 239 PLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSN---SIPNLPIG 295
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C +D +GL V G++GL +SL SQL + +CL
Sbjct: 296 CGHDNEGL----FVGAAGLIGLGGGAISLSSQLEATSF-----SYCLVDLDSESSSTLDF 346
Query: 377 HDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV-----GWALFDTGS 431
+ PS + + + F + +++ ++ G PL + + + ++ G + D+G+
Sbjct: 347 NADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGT 406
Query: 432 SYTYFTKQAYSELIASLKEVSSDGLV-LDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTL 490
+ T E+ + + +V D V L + P P V F L
Sbjct: 407 TIT--------EIPSDVYDVLRDAFVGLTKNLPPAP-------------GVSPFDTCYDL 445
Query: 491 HFGSKWQIVSTKFHISPEGYLVISKK---------GNICLGILDGSEVHNGSTIILGDIS 541
S ++ + F + E L + K G CL L + + I+G++
Sbjct: 446 SSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLS----IIGNVQ 501
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+G V YD N +G++ C
Sbjct: 502 QQGIRVSYDLANSLVGFSTDKC 523
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 84/385 (21%), Positives = 155/385 (40%), Gaps = 58/385 (15%)
Query: 206 MIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPL----YKPRMGNILPYKDSLCM 261
+ +G+PP+ + +DTGS+L+W+ C + NPL Y P P S+CM
Sbjct: 63 LTIGSPPQNVTMVLDTGSELSWLHCKK-LPNLNSTFNPLLSSSYTPT-----PCNSSVCM 116
Query: 262 EIQRNHK-PGYCETCQQ-CDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAY 319
R+ P C+ + C + YAD SS+ G LA + L +P +FGC
Sbjct: 117 TRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSL----AGAAQPGTLFGC-M 171
Query: 320 DQQGLL--LNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGH 377
D G +N KT G++G++R +SL +Q+ + +C+ + G + LG
Sbjct: 172 DSAGYTSDINEDAKTTGLMGMNRGSLSLVTQM-----VLPKFSYCI-SGEDAFGVLLLGD 225
Query: 378 DLVPSWGMAWVPML----DSPFME--LYHTEILKINYGSSPLNLGAR-----NSQVGWAL 426
+ + P++ SP+ + Y ++ I L L ++ G +
Sbjct: 226 GPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTM 285
Query: 427 FDTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDP------TLPVCWRAKFPIRSIVD 480
D+G+ +T+ Y+ L E + G++ DP + +C+ A + ++
Sbjct: 286 VDSGTQFTFLLGPVYNSLKDEFLE-QTKGVLTRIEDPNFVFEGAMDLCYHAPASLAAVPA 344
Query: 481 VKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGN--ICLGILDGSEVHNGSTIILG 538
V F + +S E L KG + S++ ++G
Sbjct: 345 VTLVFS-------------GAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYVIG 391
Query: 539 DISLRGQLVVYDNVNKRIGWAKSHC 563
+ + +D V R+G+ ++ C
Sbjct: 392 HHHQQNVWMEFDLVKSRVGFTETTC 416
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 83/179 (46%), Gaps = 18/179 (10%)
Query: 175 KLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPC 234
+ S+ V + I P G Y + +G PP + +DT SDL W QC PC
Sbjct: 68 EAASARKAVVAETPIMPAGGE------YLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PC 120
Query: 235 SSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSM 291
+ C +P++ PR+ + LP C E+ H+ G+ + + C Y Y+ ++++
Sbjct: 121 TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELD-VHRCGH-DDDESCQYTYTYSGNATTE 178
Query: 292 GVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLA 350
G LA D+L + G V FGC+ G + G++GL R +SL SQL+
Sbjct: 179 GTLAVDKLVI----GEDAFRGVAFGCSTSSTGGAPPP--QASGVVGLGRGPLSLVSQLS 231
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 73.2 bits (178), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 91/382 (23%), Positives = 154/382 (40%), Gaps = 58/382 (15%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPC---SSCAKGANPLYKPRMGNILPYK 256
G YF + VG P + Y+ DTGSD++W+QC PC + C K P++ P+ +
Sbjct: 182 GEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ-PCDGENGCYKQIGPIFDPKSSSSY--S 238
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
C Q + C YE+EY D S ++G LA + N + PN+ G
Sbjct: 239 PLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSN---SIPNLPIG 295
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C +D +GL V DG++GL +SL SQL + +CL
Sbjct: 296 CGHDNEGL----FVGADGLIGLGGGAISLSSQLEATSF-----SYCLVDLDSESSSTLDF 346
Query: 377 HDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQV-----GWALFDTGS 431
+ PS + + + F + +++ ++ G PL + + + ++ G + D+G+
Sbjct: 347 NADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGT 406
Query: 432 SYTYFTKQAYSELIASLKEVSSDGLV-LDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTL 490
+ T E+ + + +V D V L + P P V F L
Sbjct: 407 TIT--------EIPSDVYDVLRDAFVGLTKNLPPAP-------------GVSPFDTCYDL 445
Query: 491 HFGSKWQIVSTKFHISPEGYLVISKK---------GNICLGILDGSEVHNGSTIILGDIS 541
S ++ + F + E L + K G CL L + + I+G++
Sbjct: 446 SSQSNVEVPTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFLPSTFPLS----IIGNVQ 501
Query: 542 LRGQLVVYDNVNKRIGWAKSHC 563
+G V YD N +G++ C
Sbjct: 502 QQGIRVSYDLANSLVGFSTDKC 523
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 73.2 bits (178), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 65/254 (25%), Positives = 110/254 (43%), Gaps = 37/254 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNI---LPYK 256
G YF + +G+P Y+ +D+GSD+ WIQC+ PC C +P++ P +
Sbjct: 127 GEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCE-PCDQCYNQTDPIFNPATSASFIGVACS 185
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
++C ++ + C +C Y++ Y D S + G LA + TI G + G
Sbjct: 186 SNVCNQLDDDVA---CRK-GRCGYQVAYGDGSYTKGTLALE----TITIGRTVIQDTAIG 237
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C + +G+ V G+LGL +S QL +Q G+CL + A G M
Sbjct: 238 CGHWNEGM----FVGAAGLLGLGGGPMSFVGQLGAQ--TGGAFGYCLVSRAMPVGAM--- 288
Query: 377 HDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSSPLNLGARNSQV-----GWALFDTG 430
WVP++ +PF Y+ + + G + + + Q+ G + DTG
Sbjct: 289 ----------WVPLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGGVVMDTG 338
Query: 431 SSYTYFTKQAYSEL 444
++ T AY+
Sbjct: 339 TAITRLPTVAYNAF 352
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 75/296 (25%), Positives = 122/296 (41%), Gaps = 51/296 (17%)
Query: 190 FPLRGNIYPDG--------LYFTY-------MIVGNPPRPYYLDMDTGSDLTWIQC---- 230
FPLR P G L F + + VG PP+ + +DTGS+L+W+ C
Sbjct: 36 FPLRARQVPAGALPRPPSKLRFHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCATGR 95
Query: 231 -DAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYAD 286
+ + A ++PR +P + C P +QC + YAD
Sbjct: 96 QGSAAAGAAAAMGESFRPRASATFAAVPCGSTQCSSRDLPAPPSCDGASRQCHVSLSYAD 155
Query: 287 HSSSMGVLARDELHLTIENGSLTKPNVVFGC---AYDQQGLLLNTLVKTDGILGLSRAKV 343
S+S G LA D + G FGC AYD V T G+LG++R +
Sbjct: 156 GSASDGALATDVFAV----GEAPPLRSAFGCMSTAYDSS----PDGVATAGLLGMNRGTL 207
Query: 344 SLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPM----LDSPFME-- 397
S +Q +++ +C+ ++ G + LGH +P + + P+ L P+ +
Sbjct: 208 SFVTQASTRRF-----SYCI-SDRDDAGVLLLGHSDLPFLPLNYTPLYQPTLPLPYFDRV 261
Query: 398 LYHTEILKINYGSSPLNLGAR-----NSQVGWALFDTGSSYTYFTKQAYSELIASL 448
Y ++L I G L + A ++ G + D+G+ +T+ AYS L A
Sbjct: 262 AYSVQLLGIRVGGKALPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEF 317
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 80/353 (22%), Positives = 141/353 (39%), Gaps = 39/353 (11%)
Query: 219 MDTGSDLTWIQC-DAPCSSCAKGANPLYKPRMGNI---LPYKDSLCMEIQRNHKPGYCET 274
+DT SD+ W+QC P C +PLY P + +P C E+ ++ G T
Sbjct: 173 VDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSPT 232
Query: 275 CQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDG 334
+C Y + Y D ++ G D L ++ ++ + FGC++ +G N + G
Sbjct: 233 TDECKYIVNYGDGKATTGTYVTDTLTMSP---TIVVKDFRFGCSHAVRGSFSN---QNAG 286
Query: 335 ILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLD-- 392
IL L + SL Q A N +C+ G++ LG + S ++ P++
Sbjct: 287 ILALGGGRGSLLEQTADA--YGNAFSYCI-PKPSSAGFLSLGGPVEASLKFSYTPLIKNK 343
Query: 393 -SPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYFTKQAYSELIASLKEV 451
+P + H E + + L + G A+ D+G+ T Q Y+ L A+ +
Sbjct: 344 HAPTFYIVHLEAIIV--AGKQLAVPPTAFATG-AVMDSGAVVTQLPPQVYAALRAAFRSA 400
Query: 452 SSDGLVLDASDPTLPVCWR-AKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGY 510
+ L A L C+ +FP DVK +L G+ + P
Sbjct: 401 MAAYGPLAAPVRNLDTCYDFTRFP-----DVKVPKVSLVFAGGA-------TLDLEPASI 448
Query: 511 LVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
++ G + G E S +G++ + V+YD ++G+ + C
Sbjct: 449 IL---DGCLAFAATPGEE----SVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 72.8 bits (177), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 101/375 (26%), Positives = 158/375 (42%), Gaps = 49/375 (13%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCS---SCAKGANPLYKPRMGN---ILPY 255
Y +G P +++DTGSDL+W+QC PC+ SC +PL+ P + +P
Sbjct: 48 YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCAAAPSCYSQKDPLFDPAQSSSYAAVPC 106
Query: 256 KDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVF 315
+C + + QC Y + Y D S++ GV + D L L+ + F
Sbjct: 107 GGPVCAGLGIYAA--SACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS---AVQGFFF 161
Query: 316 GCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFL 375
GC + Q GL DG+LGL R + SL Q A G V +CL T GY+ L
Sbjct: 162 GCGHAQSGLFNG----VDGLLGLGREQPSLVEQTA--GTYGGVFSYCLPTKPSTAGYLTL 215
Query: 376 G----HDLVPSWGMAWVPMLDSPFMELYHTEILK-INYGSSPLNLGARNSQVGWALFDTG 430
G P G + +L SP Y+ +L I+ G L++ A ++ G + DTG
Sbjct: 216 GVGGPSGAAP--GFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPA-SAFAGGTVVDTG 272
Query: 431 SSYTYFTKQAYSELIASLKE-VSSDGLVLDASDPTLPVCWR-AKFPIRSIVDVKQFFKTL 488
+ T AY+ L ++ + ++S G S+ L C+ A + ++ +V
Sbjct: 273 TVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVA------ 326
Query: 489 TLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISLRGQLVV 548
L FGS + I G L + G+ +G ILG++ R V
Sbjct: 327 -LTFGSGATVTLGADGILSFGCLAFAPSGS------------DGGMAILGNVQQRSFEVR 373
Query: 549 YDNVNKRIGWAKSHC 563
D + +G+ S C
Sbjct: 374 IDGTS--VGFKPSSC 386
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 97/441 (21%), Positives = 168/441 (38%), Gaps = 86/441 (19%)
Query: 163 GIIRPHKSKINKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTG 222
G + S I++K V +D S G Y YFT + VG P + + + +DTG
Sbjct: 54 GADQKRHSLISRKRKFKGGVKMDLGS-----GIDYGTAQYFTEVRVGTPAKKFRVVVDTG 108
Query: 223 SDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYC--ETCQQ--- 277
S+LTW+ C Y+ R + + E ++ K C +TC+
Sbjct: 109 SELTWVNCR-------------YRGRGKGKVKNRRVFRAEESKSFKTVGCFTQTCKVDLM 155
Query: 278 --------------CDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVVFGCAYDQQ 322
C Y+ YAD S++ GV A++ + + + NG + ++ GC+
Sbjct: 156 NLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRKARLRGLLVGCSSSFS 215
Query: 323 GLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLT---TNAGGGGYMFLGH-- 377
+ DG+LGL+ + S S S + + +CL +N Y+ G+
Sbjct: 216 ---GQSFQGADGVLGLAFSDFSFTSTATS--LFGAKLSYCLVDHLSNKNISNYLIFGYSS 270
Query: 378 -----DLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGAR---NSQVGWALFDT 429
P L PF Y I+ I+ G L++ + + G + D+
Sbjct: 271 SSTSTKTAPGRTTPLDLTLIPPF---YAINIIGISIGDDMLDIPTQVWDATTGGGTILDS 327
Query: 430 GSSYTYFTKQAY-------SELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVK 482
G+S T + AY + + LK V +G+ ++ + +K P
Sbjct: 328 GTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGFNESKLP-------- 379
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTIILGDISL 542
LT H +F + YLV + G CLG + +T ++G+I
Sbjct: 380 ----QLTFHLKG-----GARFEPHRKSYLVDAAPGVKCLGFMSAG---TPATNVVGNIMQ 427
Query: 543 RGQLVVYDNVNKRIGWAKSHC 563
+ L +D + + +A S C
Sbjct: 428 QNYLWEFDLMASTLSFAPSTC 448
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 94/376 (25%), Positives = 153/376 (40%), Gaps = 53/376 (14%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSS--CAKGANPLYKPRMGNI---LPYK 256
Y + +G P + +DTGSD++W+QC+ PC + C L+ P + +
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCN-PCPNPPCHAQTGALFDPAKSSTYRAVSCA 185
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
+ C ++++ G T +C Y ++Y D S++ G +RD L T+ S FG
Sbjct: 186 AAECAQLEQQGN-GCGATNYECQYGVQYGDGSTTNGTYSRDTL--TLSGASDAVKGFQFG 242
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C++ + G +TDG++GL SL SQ A+ N +CL +G G++ LG
Sbjct: 243 CSHLESGFS----DQTDGLMGLGGGAQSLVSQTAA--AYGNSFSYCLPPTSGSSGFLTLG 296
Query: 377 HDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTYF 436
S + + Y + I G L L G ++ D+G+ T
Sbjct: 297 GGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSPSVFAAG-SVVDSGTIITRL 355
Query: 437 TKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKW 496
AYS L ++ K +R+ P RSI+D F +
Sbjct: 356 PPTAYSALSSAFKAGMKQ--------------YRSA-PARSILDT-------CFDFAGQT 393
Query: 497 QIVSTKFHISPEGYLVISKKGNICL---GILDG------SEVHNGSTIILGDISLRGQLV 547
QI P LV S I L GI+ G + +G+T I+G++ R V
Sbjct: 394 QISI------PTVALVFSGGAAIDLDPNGIMYGNCLAFAATGDDGTTGIIGNVQQRTFEV 447
Query: 548 VYDNVNKRIGWAKSHC 563
+YD + +G+ C
Sbjct: 448 LYDVGSSTLGFRSGAC 463
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 97/397 (24%), Positives = 152/397 (38%), Gaps = 59/397 (14%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAP-----CSSCA-KGANPLYKPRMG--- 250
G Y +G PP+ L +DTGS L W C P C +C G +P P
Sbjct: 72 GGYSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNK 131
Query: 251 ----NILPYKDSLCMEIQRNHKPGYCETCQQCD-YEIEYADHSSSMGVLARDELHLTIEN 305
LP + C + + C T ++C Y +EY S+ G L D L L+ N
Sbjct: 132 SSTVQSLPCRSPKCNWVFGSDL--NCSTTKRCPYYGLEYG-LGSTTGQLVSDVLGLSKLN 188
Query: 306 GSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTT 365
P+ +FGC+ L + + +GI G R S+P+QL +V H
Sbjct: 189 ---RIPDFLFGCS-------LVSNRQPEGIAGFGRGLASIPAQLGLTKFSYCLVSHRFDD 238
Query: 366 NAGGGGYMF---LGHDLVPSWGMAWVPMLD----SPFMELYHTEILKINYGSS-----PL 413
G + H + G+A+ P SP+ E Y+ + KI G P
Sbjct: 239 TPQSGDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPR 298
Query: 414 NLGARNSQVGWALFDTGSSYTYFTKQAYS----ELIASLKEVSSDGLVLDASDPTLPVCW 469
L G + D+GS++T+ + + EL + + + D+S L C+
Sbjct: 299 YLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSG--LGPCY 356
Query: 470 RAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGIL---DG 526
+S VDV + T + G+ + T Y + G +C+ +L D
Sbjct: 357 NITG--QSEVDVPKL--TFSFKGGANMDLPLTD-------YFSLVTDGVVCMTVLTDPDE 405
Query: 527 SEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHC 563
G IILG+ + + YD +R G+ C
Sbjct: 406 PGSTTGPAIILGNYQQQNFYIEYDLKKQRFGFKPQQC 442
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 101/379 (26%), Positives = 154/379 (40%), Gaps = 48/379 (12%)
Query: 199 DGLYFTYMI-VGNPPRPYYLDMDTGSDLTWIQCDAPCSS-CAKGANPLYKPRMGNILPYK 256
D L F ++ G P + + +DTGSDL+WIQC PCS C + +P + P + Y
Sbjct: 133 DTLEFVVVVGFGTPAQTAAIILDTGSDLSWIQCK-PCSGHCYRQHDPDFDPAKSSS--YA 189
Query: 257 DSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTKPNVVFG 316
C G C C Y ++Y D SS+ GVL+RD L N S FG
Sbjct: 190 AVPCGTPVCAAAGGMCNG-TTCLYGVQYGDGSSTTGVLSRDTLTF---NSSSKFTGFTFG 245
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLG 376
C G + DG+LGL R K+SLPSQ A V +CL + GY+ +G
Sbjct: 246 CGEKNIG----DFGEVDGLLGLGRGKLSLPSQAAPS--FGGVFSYCLPSYNTTPGYLNIG 299
Query: 377 H----DLVPSWGMAWVPMLDSP-FMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGS 431
VP + + M+ P + Y E++ IN G L + L D+G+
Sbjct: 300 ATKPTSTVP---VQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKTGTLLDSGT 356
Query: 432 SYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPIRSIVDVKQ----FFKT 487
TY AY+ L K + + P P P+ + D
Sbjct: 357 ILTYLPPPAYTSLRDRFK------FTMQGNKPAPPY-----EPLDTCYDFTGQGAIVIPA 405
Query: 488 LTLHFGSKWQIVSTKFHISPEGYLVISKKGNICLGILDGSEVHNGSTI---ILGDISLRG 544
++ +F S + F+ G ++ +G L + V + + I+G+ R
Sbjct: 406 VSFNF-SDGAVFDLDFY----GIMIFPDDAKPLIGCL--AFVSRPAAMPFSIVGNTQQRA 458
Query: 545 QLVVYDNVNKRIGWAKSHC 563
V+YD +++IG+ C
Sbjct: 459 AEVIYDVPSQKIGFIPISC 477
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 78/338 (23%), Positives = 130/338 (38%), Gaps = 49/338 (14%)
Query: 113 SNNDDENKESFVFPLYHKFGIREVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKI 172
S + +E E ++ + H+ + + D +L + D + V + +IR S
Sbjct: 123 SEDHEEGGEKWMMKVVHRDQLSFGNSDDHRHRLDGRLKRDAKRVAS-----LIRRLSSG- 176
Query: 173 NKKLVSSNAVAVDSSSIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDA 232
+ VD + G G YF + VG+PPR Y+ +D+GSD+ W+QC
Sbjct: 177 -----GGGSYRVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ- 230
Query: 233 PCSSCAKGANPLYKPRMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMG 292
PC+ C ++P++ P R G C +C YE+ Y D S + G
Sbjct: 231 PCTQCYHQSDPVFDPADSASFTGVSCSSSVCDRLENAG-CHA-GRCRYEVSYGDGSYTKG 288
Query: 293 VLARDELHLTIENGSLTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQ 352
LA + L G +V GC + +G+ V G+LGL +S QL Q
Sbjct: 289 TLALETLTF----GRTMVRSVAIGCGHRNRGM----FVGAAGLLGLGGGSMSFVGQLGGQ 340
Query: 353 GIIKNVVGHCLTTNAGGGGYMFLGHDLVPSWGMAWVPMLDSPFM-ELYHTEILKINYGSS 411
+CL + AWVP++ +P Y+ + + G
Sbjct: 341 --TGGAFSYCLVS-------------------AAWVPLVRNPRAPSFYYIGLAGLGVGGI 379
Query: 412 PLNLGARNSQV-----GWALFDTGSSYTYFTKQAYSEL 444
+ + ++ G + DTG++ T AY
Sbjct: 380 RVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAF 417
>gi|168021169|ref|XP_001763114.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685597|gb|EDQ71991.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 641
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 63/226 (27%), Positives = 92/226 (40%), Gaps = 37/226 (16%)
Query: 202 YFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPC--SSCAKGANPLYKPRMGNILPYKDSL 259
Y+ M VG + ++ +DTGS +W+ C P G N +Y P + +
Sbjct: 126 YYVKMRVGKSKKLFHFLIDTGSQPSWLHCKWPAIEKHPVAGPNGMYVPEKEVQVDCRSPE 185
Query: 260 CMEIQ---------RNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGSLTK 310
C+ +Q RN P +C Y+I Y D S G +D + L G
Sbjct: 186 CLSLQRIPSNFNNIRNLFPCNEPNDWRCTYDITYLDRSHLRGFYVQDVVSLATLEGEQLD 245
Query: 311 PNVVFG-------------CAY----DQQG--LLLNTLVKTDGILGLSRAKVSLPSQLAS 351
+ G C++ D+ G L + + TDG+LGL++ S SQL
Sbjct: 246 AKITLGYATPNHRAAPFGFCSWHASSDRYGEEELERSPLTTDGLLGLNKGTESFVSQLKR 305
Query: 352 QGII-KNVVGHCL-----TTNAGGGGYMFLGHD-LVPSWGMAWVPM 390
QG I +VVGHC T G+MF G L+ S + W PM
Sbjct: 306 QGAISSHVVGHCFRSLDTTDFETNSGFMFFGKSKLLDSLPITWSPM 351
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 102/409 (24%), Positives = 163/409 (39%), Gaps = 66/409 (16%)
Query: 200 GLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCD----APCSSCAKGANPLYKPRMGNILPY 255
G YF VG P +P+ L DTGSDLTW++C A S+ + + PR P
Sbjct: 93 GQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRA-FRPE 151
Query: 256 KDSLCMEIQRNHKPGYCETCQQ---------------CDYEIEYADHSSSMGVLARDELH 300
K I P +TC + C Y+ Y D S++ G + +
Sbjct: 152 KSKTWAPI-----PCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESAT 206
Query: 301 LTIENGSLTKPNVVFGCAYDQQGLLLN--------TLVKTDGILGLSRAKVSLPSQLASQ 352
+ + + S + N V QGL+L + +DG+L L + VS S AS+
Sbjct: 207 IALSSSSSSSKNKVKKAKL--QGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASHAASR 264
Query: 353 GIIKNVVGHCLTTN---AGGGGYMFLGHDLVPSW--------GMAWVPM-LDSPFMELYH 400
+CL + Y+ G + S G P+ LDS Y
Sbjct: 265 --FGGRFSYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYD 322
Query: 401 TEILKINYGSSPLNLGARNSQV---GWALFDTGSSYTYFTKQAYSELIASL-KEVSSDGL 456
I I+ L + +V G + D+G+S T K AY ++A+L K+++
Sbjct: 323 VSIKAISVDGELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFPR 382
Query: 457 VLDASDPTLPVCWRAKFPIRSIVDVKQFFKTLTLHFGSKWQIVSTKFHISPEGYLVISKK 516
V A DP C+ P R D L +HF S + + Y++ +
Sbjct: 383 V--AMDP-FEYCYNWTSPSRK--DEGDDLPKLAVHFAG-----SARLEPPSKSYVIDAAP 432
Query: 517 GNICLGILDGSEVHNGSTIILGDISLRGQLVVYDNVNKRIGWAKSHCMN 565
G C+G+ +G G ++I G+I + L +D N+R+ + +S C +
Sbjct: 433 GVKCIGVQEGP--WPGISVI-GNILQQEHLWEFDLKNRRLRFKRSRCTH 478
>gi|213998814|gb|ACJ60774.1| nucellin [Hordeum cf. pusillum GP-2003]
Length = 142
Score = 72.4 bits (176), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 72/133 (54%), Gaps = 9/133 (6%)
Query: 317 CAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIK-NVVGHCLTTNAGGGGYMFL 375
C Y Q+ + DGILGL K +QL Q +I NV+GHCL++ G G +++
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSK--GKGVLYV 58
Query: 376 GHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALFDTGSSYTY 435
G PS G+ WVPM +S F Y + ++ + P+ R + A+FD+GS+YT+
Sbjct: 59 GDFNPPSRGVTWVPMKESLFY--YSPGLAELLIDNQPI----RGNPTFEAVFDSGSTYTH 112
Query: 436 FTKQAYSELIASL 448
Q Y+E+++ +
Sbjct: 113 VPAQIYNEIVSKV 125
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 72.4 bits (176), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 88/388 (22%), Positives = 145/388 (37%), Gaps = 70/388 (18%)
Query: 213 RPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKPRMGNILPYKDSLCMEIQ-RNHKP-- 269
+ YY +DTG++L+WIQC+ C N + P+KD Q +++KP
Sbjct: 99 KTYYFQIDTGNELSWIQCEG----CQNKGNMCF--------PHKDPPYTSSQSKSYKPVS 146
Query: 270 ----GYCE--TCQQ--CDYEIEYADHSSSMGVLARDELHLTIENGSLTK-PNVVFGCAYD 320
+CE C++ C Y + Y S + G LA + +G T ++ FGC+ D
Sbjct: 147 CNQHSFCEPNQCKEGLCAYNVTYGPGSYTSGNLANETFTFYSNHGKHTALKSISFGCSTD 206
Query: 321 QQGLLLNTLVKTD---GILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNAGGGGYMFLGH 377
+ ++ L+ + G+LG+ S +QL S I +C+T N Y+ G
Sbjct: 207 SRNMIYAFLLDKNPVSGVLGMGWGPRSFLAQLGS--ISHGKFSYCITANNTHNTYLRFGK 264
Query: 378 DLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLN-----LGARNSQVGWALFDTGSS 432
+V S + ++ YH +L I+ LN L R + D G+
Sbjct: 265 HVVKSKNLQTTKIMQVKPSAAYHVNLLGISVNGVKLNITKTDLAVRKDGSRGCIIDAGTL 324
Query: 433 YTYFTKQAYSELIASL-KEVSSD---------GLVLDASDPTLPVCWRAKFPIRSIVDVK 482
T K + L +L +SS+ L D L R P+
Sbjct: 325 ATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLCYEQLSDAGRKNLPV------- 377
Query: 483 QFFKTLTLHFGSKWQIVSTKFHISPEGYLVISK---KGNICLGILDGSEVHNGSTIILGD 539
+T H + + + PE + + K CL +L + S I+G
Sbjct: 378 -----VTFH------LENADLEVKPEAIFLFREFEGKNVFCLSMLS-----DDSKTIIGA 421
Query: 540 ISLRGQLVVYDNVNKRIGWAKSHCMNPG 567
Q VYD + + + C G
Sbjct: 422 YQQMKQKFVYDTKARVLSFGPEDCEKNG 449
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.137 0.423
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,968,725,353
Number of Sequences: 23463169
Number of extensions: 447156825
Number of successful extensions: 1143925
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 604
Number of HSP's successfully gapped in prelim test: 2222
Number of HSP's that attempted gapping in prelim test: 1137406
Number of HSP's gapped (non-prelim): 4096
length of query: 577
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 429
effective length of database: 8,886,646,355
effective search space: 3812371286295
effective search space used: 3812371286295
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 80 (35.4 bits)