BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 011566
(483 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 584 bits (1505), Expect = e-164, Method: Compositional matrix adjust.
Identities = 292/462 (63%), Positives = 356/462 (77%), Gaps = 23/462 (4%)
Query: 28 SAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNY 87
S +T+T+PL+P TK SDP + L+ LA++S+SRA HLK+ PKT N+
Sbjct: 24 SPSTITIPLSPTITKR----PSSDPWEYLNHLATTSISRAHHLKS---PKT-------NF 69
Query: 88 SNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNV 147
S LIKTPL SYGGYS+SLS GTP Q + I DTGSSLVWFPCTSRY C CNFPN
Sbjct: 70 S--LIKTPLFSRSYGGYSMSLSLGTPSQ-TVKLIMDTGSSLVWFPCTSRYVCASCNFPNT 126
Query: 148 DPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG 207
D ++IP F+P+ SSSS+LIGC+NPKC+W+FG +V+S+C C+P+ + C ACP Y++QYG
Sbjct: 127 DITKIPKFMPRLSSSSKLIGCKNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYG 186
Query: 208 LGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYC 267
LG TAGLLLSET+ FP+KT+ +FLAGCS+LS RQP GIAGFGRS ESLP QLGLKKFSYC
Sbjct: 187 LGSTAGLLLSETINFPNKTISDFLAGCSLLSTRQPEGIAGFGRSQESLPLQLGLKKFSYC 246
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS-AFGEFYYVGLRQII 326
L+SR+FDD+PVSS+L+LD GP + DSKT GLSYTPF KN S+ AF E+YYV LR+II
Sbjct: 247 LVSRRFDDSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKII 306
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
VG HVK+PYS+LVPGSDGNGG IVDSGSTFTF+EG +FE +AKEF +QM NY+ A +V+
Sbjct: 307 VGKTHVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQ 366
Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA-- 444
K +GLRPCFDISG+KSV +P+L +FKGGAKM LP NYFA V V+CL + +DNAA
Sbjct: 367 KLTGLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAFVDMGVVCLTIVSDNAAAL 426
Query: 445 ---GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G GPAIILG+FQ QNFY+E+DL NDRFGF +Q CA
Sbjct: 427 GGDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSCA 468
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 563 bits (1450), Expect = e-158, Method: Compositional matrix adjust.
Identities = 281/447 (62%), Positives = 344/447 (76%), Gaps = 19/447 (4%)
Query: 38 PLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLS 97
P STK L S +P L+ LAS SLSRA H+K+ PKTK SL+KTPL
Sbjct: 40 PSSTK--LIVSSKNPWGALNHLASLSLSRAHHIKS---PKTK---------FSLLKTPLF 85
Query: 98 VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
SYGGYSISL+FGTPPQ +T F+ DTGSSLVWFPCTSRY C C+FPN++ + IP FIP
Sbjct: 86 PRSYGGYSISLNFGTPPQ-TTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIP 144
Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLS 217
K+SSSS LIGC+N KCSW+FGP V+S+C+ C P + C +CP Y++QYGLG TAGLLLS
Sbjct: 145 KQSSSSNLIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGSTAGLLLS 204
Query: 218 ETLRFP-SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
ETL FP KT+P FL GCS+ S RQP GIAGFGRS ESLPSQLGLKKFSYCL+S FDD
Sbjct: 205 ETLDFPHKKTIPGFLVGCSLFSIRQPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDT 264
Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
P SS+LVLDTG GS D+KTPGLSYTPF KNP ++AF ++YYV LR I++G HVK+PY
Sbjct: 265 PASSDLVLDTGSGSDDTKTPGLSYTPFQKNP---TAAFRDYYYVLLRNIVIGDTHVKVPY 321
Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
+LVPGSDGNGG IVDSG+TFTFME P++E VAKEF +Q+ +Y+ A +V+ ++GLRPCF+
Sbjct: 322 KFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRPCFN 381
Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIIL 456
ISG+KSV +PE I FKGGAKMALP NYF+ V + V+CL + +DN +G +G GPAIIL
Sbjct: 382 ISGEKSVSVPEFIFHFKGGAKMALPLANYFSFVDSGVICLTIVSDNMSGSGIGGGPAIIL 441
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
G++Q +NF++EFDL N+RFGF +Q C
Sbjct: 442 GNYQQRNFHVEFDLKNERFGFKQQNCV 468
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 554 bits (1428), Expect = e-155, Method: Compositional matrix adjust.
Identities = 280/478 (58%), Positives = 347/478 (72%), Gaps = 17/478 (3%)
Query: 7 SLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSR 66
S I LL+ L + A S+ T+T+PL+PL K H SDSDP L AS+SL+R
Sbjct: 8 SYIITVFLLLSLLSHIAFTSSNPNTITLPLSPLLIKP--HSSDSDPFHSLKFAASASLTR 65
Query: 67 ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGS 126
A HLK + +N S S+ TP SYGGYSI L+ GTPPQ S PF+ DTGS
Sbjct: 66 AHHLKHR-----------NNNSPSVATTPAYPKSYGGYSIDLNLGTPPQTS-PFVLDTGS 113
Query: 127 SLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCK 186
SLVWFPCTSRY C CNFPN+D ++IP FIPK SS+++L+GC+NPKC +IFG +V+ RC
Sbjct: 114 SLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSSTAKLLGCRNPKCGYIFGSDVQFRCP 173
Query: 187 GCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIA 246
C P ++ C L CP+Y++QYGLG TAG LL + L FP KTVP FL GCSILS RQP+GIA
Sbjct: 174 QCKPESQNCSLTCPAYIIQYGLGSTAGFLLLDNLNFPGKTVPQFLVGCSILSIRQPSGIA 233
Query: 247 GFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKN 306
GFGR ESLPSQ+ LK+FSYCL+S +FDD P SS+LVL +GD+KT GLSYTPF N
Sbjct: 234 GFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQSSDLVLQIS-STGDTKTNGLSYTPFRSN 292
Query: 307 PVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFE 366
P ++ AF E+YY+ LR++IVG K VKIPY++L PGSDGNGG IVDSGSTFTFME P++
Sbjct: 293 PSTNNPAFKEYYYLTLRKVIVGGKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYN 352
Query: 367 AVAKEFIRQM-GNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENY 425
VA+EF++Q+ NYSRA D E +SGL PCF+ISG K+V PEL KFKGGAKM P +NY
Sbjct: 353 LVAQEFVKQLEKNYSRAEDAETQSGLSPCFNISGVKTVTFPELTFKFKGGAKMTQPLQNY 412
Query: 426 FALVGN-EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
F+LVG+ EV+CL + +D AGP GPAIILG++Q QNFY+E+DL N+RFGF + C
Sbjct: 413 FSLVGDAEVVCLTVVSDGGAGPPKTTGPAIILGNYQQQNFYIEYDLENERFGFGPRSC 470
>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 469
Score = 552 bits (1422), Expect = e-154, Method: Compositional matrix adjust.
Identities = 276/459 (60%), Positives = 343/459 (74%), Gaps = 10/459 (2%)
Query: 29 AATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK--TKTKPKTKDSNIGSN 86
+ V +PL+P S + S DP L LA SS++RA LK T KP + + +
Sbjct: 16 VSAVKLPLSPFS---HSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEEALSSTAT 72
Query: 87 YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
S +++K+ LS SYGGYS+SLSFGTP Q + PF+FDTGSSLVWFPCTSRY C DCNF
Sbjct: 73 ASATVVKSHLSPKSYGGYSVSLSFGTPSQ-TIPFVFDTGSSLVWFPCTSRYLCSDCNFSG 131
Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
+DP++IP FIPK SSSS++IGCQNPKC ++FG NV+ C+GC P + C + CP Y+LQY
Sbjct: 132 LDPTQIPRFIPKNSSSSRVIGCQNPKCQFLFGANVQ--CRGCDPNTRNCTVPCPPYILQY 189
Query: 207 GLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSY 266
GLG TAG+L+SE L FP TVP+F+ GCS++S R PAGIAGFGR ESLPSQ+ LK FS+
Sbjct: 190 GLGSTAGILISEKLDFPDLTVPDFVVGCSVISTRTPAGIAGFGRGPESLPSQMKLKSFSH 249
Query: 267 CLLSRKFDDAPVSSNLVLDTGPG-SGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
CL+SR+FDD V+++L LDTG G SKTPGLSYTPF KNP S++AF E+YY+ LR+I
Sbjct: 250 CLVSRRFDDTNVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFLEYYYLNLRRI 309
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
VGSKHVKIPY +L PG++GNGG IVDSGSTFTFME P+FE VA+EF QM NY+R D+
Sbjct: 310 YVGSKHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDL 369
Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAA 444
EK SG+ PCF+ISGK V +PELI +FKGGAKM LP NYF+ VGN + +CL + +DN
Sbjct: 370 EKVSGIAPCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNADTVCLTVVSDNTV 429
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
P G GPAIILG FQ QN+ +E+DL NDRFGFAK+KC+
Sbjct: 430 NPGGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 469
Score = 551 bits (1420), Expect = e-154, Method: Compositional matrix adjust.
Identities = 281/474 (59%), Positives = 345/474 (72%), Gaps = 19/474 (4%)
Query: 10 CLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARH 69
C F+L LL ++ + AT+T+PLTPL TK+ SDP ++L L S+SL+RA H
Sbjct: 13 CGFTLFSLLLLANSSPDKNPATITLPLTPLFTKN----PSSDPWQLLSHLTSASLTRAHH 68
Query: 70 LKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
LK + + S + TPL HSYGGYS+SLSFGTP Q + F+ DTGSSLV
Sbjct: 69 LKHRK-------------NTSSVNTPLFAHSYGGYSVSLSFGTPSQ-TLSFVMDTGSSLV 114
Query: 130 WFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCS 189
WFPCTSRY C C+FPN+DP++IP FIPK SSS++++GC NPKC ++ V +RC GC
Sbjct: 115 WFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNPKCGFVMDSEVRTRCPGCD 174
Query: 190 PRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFG 249
+ C ACP+Y +QYGLG T GLLL E+L F +T P+F+ GCSILS RQP+GIAGFG
Sbjct: 175 QNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPDFVVGCSILSSRQPSGIAGFG 234
Query: 250 RSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVG 309
R SLP Q+GLKKFSYCLLS +FDD+P SS + L GP S D KT GLSYTPF KNPV
Sbjct: 235 RGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVS 294
Query: 310 SSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVA 369
S+SAF E+YYV LR IIVG K VK+PYS++V GSDGNGG IVDSGSTFTFME P+FEAVA
Sbjct: 295 SNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVA 354
Query: 370 KEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
EF RQM NY+RAADVE SGL+PCF++SG SV LP L+ +FKGGAKM LP NYF+LV
Sbjct: 355 TEFDRQMANYTRAADVEALSGLKPCFNLSGVGSVALPSLVFQFKGGAKMELPVANYFSLV 414
Query: 430 GN-EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G+ VLCL + ++ A G L GP+IILG++Q QNFY E+DL N+RFGF +Q+C
Sbjct: 415 GDLSVLCLTIVSNEAVGSTLSSGPSIILGNYQSQNFYTEYDLENERFGFRRQRC 468
>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
Length = 609
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 281/474 (59%), Positives = 344/474 (72%), Gaps = 19/474 (4%)
Query: 10 CLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARH 69
C F+L LL ++ + AT+T+PLTPL TK+ SDP ++L L S+SL+RA H
Sbjct: 13 CGFTLFSLLLLANSSPDKNPATITLPLTPLFTKN----PSSDPWQLLSHLTSASLTRAHH 68
Query: 70 LKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
LK + + S + TPL HSYGGYS+SLSFGTP Q + F+ DTGSSLV
Sbjct: 69 LKHRK-------------NTSSVNTPLFAHSYGGYSVSLSFGTPSQ-TLSFVMDTGSSLV 114
Query: 130 WFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCS 189
WFPCTSRY C C+FPN+DP++IP FIPK SSS++++GC NPKC ++ V +RC GC
Sbjct: 115 WFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNPKCGFVMDSEVRTRCPGCD 174
Query: 190 PRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFG 249
+ C ACP+Y +QYGLG T GLLL E+L F +T P+F+ GCSILS RQP+GIAGFG
Sbjct: 175 QNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPDFVVGCSILSSRQPSGIAGFG 234
Query: 250 RSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVG 309
R SLP Q+GLKKFSYCLLS +FDD+P SS + L GP S D KT GLSYTPF KNPV
Sbjct: 235 RGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVS 294
Query: 310 SSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVA 369
S+SAF E+YYV LR IIVG K VK PYS++V GSDGNGG IVDSGSTFTFME P+FEAVA
Sbjct: 295 SNSAFKEYYYVTLRHIIVGDKRVKXPYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVA 354
Query: 370 KEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
EF RQM NY+RAADVE SGL+PCF++SG SV LP L+ +FKGGAKM LP NYF+LV
Sbjct: 355 TEFDRQMANYTRAADVEALSGLKPCFNLSGVGSVALPSLVFQFKGGAKMELPVANYFSLV 414
Query: 430 GN-EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G+ VLCL + ++ A G L GP+IILG++Q QNFY E+DL N+RFGF +Q+C
Sbjct: 415 GDLSVLCLTIVSNEAVGSTLSSGPSIILGNYQSQNFYTEYDLENERFGFRRQRC 468
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 543 bits (1398), Expect = e-151, Method: Compositional matrix adjust.
Identities = 269/459 (58%), Positives = 340/459 (74%), Gaps = 10/459 (2%)
Query: 29 AATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK--TKTKPKTKDSNIGSN 86
+ V +PL+P S + S DP L LA SS++RA LK T KP + +
Sbjct: 16 VSAVKLPLSPFS---HSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTT 72
Query: 87 YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
S +++K+PLS SYGGYS+SLSFGTP Q + PF+FDTGSSLVW PCTSRY C C+F
Sbjct: 73 ASATVVKSPLSAKSYGGYSVSLSFGTPSQ-TIPFVFDTGSSLVWLPCTSRYLCSGCDFSG 131
Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
+DP+ IP FIPK SSSS++IGCQ+PKC +++GPNV+ C+GC P + C + CP Y+LQY
Sbjct: 132 LDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNVQ--CRGCDPNTRNCTVGCPPYILQY 189
Query: 207 GLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSY 266
GLG TAG+L++E L FP TVP+F+ GCSI+S RQPAGIAGFGR SLPSQ+ LK+FS+
Sbjct: 190 GLGSTAGVLITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNLKRFSH 249
Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGD-SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
CL+SR+FDD V+++L LDTG G SKTPGL+YTPF KNP S+ AF E+YY+ LR+I
Sbjct: 250 CLVSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRI 309
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
VG KHVKIPY YL PG++G+GG IVDSGSTFTFME P+FE VA+EF QM NY+R D+
Sbjct: 310 YVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDL 369
Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAA 444
EK++GL PCF+ISGK V +PELI +FKGGAK+ LP NYF VGN + +CL + +D
Sbjct: 370 EKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTV 429
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
P+ G GPAIILG FQ QN+ +E+DL NDRFGFAK+KC+
Sbjct: 430 NPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 541 bits (1394), Expect = e-151, Method: Compositional matrix adjust.
Identities = 280/458 (61%), Positives = 352/458 (76%), Gaps = 18/458 (3%)
Query: 27 SSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSN 86
S + T+T+PL+ S L S P L+ LAS SLSRA H+K+ PKT N
Sbjct: 19 SKSTTITIPLSAPSFNK-LIVSSKKPWGSLNHLASLSLSRAHHIKS---PKT-------N 67
Query: 87 YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
+S LIKTPL SYGGYSISL+FGTPPQ +T F+ DTGSSLVWFPCTSRY C +CNFPN
Sbjct: 68 FS--LIKTPLFPRSYGGYSISLNFGTPPQ-TTKFVMDTGSSLVWFPCTSRYLCSECNFPN 124
Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
+ + IP F+PK SSSS+LIGC+NP+CS IFGP ++S+C+ C + C CP Y++QY
Sbjct: 125 IKKTGIPTFLPKLSSSSKLIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQY 184
Query: 207 GLGFTAGLLLSETLRFPSK-TVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFS 265
G G TAGLLLSETL FP+K T+P+FL GCSI S +QP GIAGFGRS ESLPSQLGLKKFS
Sbjct: 185 GSGSTAGLLLSETLDFPNKKTIPDFLVGCSIFSIKQPEGIAGFGRSPESLPSQLGLKKFS 244
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
YCL+S FDD P SS+LVLDTG GSG +KT GLS+TPF KNP ++AF ++YYV LR I
Sbjct: 245 YCLVSHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKNP---TTAFRDYYYVLLRNI 301
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
++G HVK+PY +LVPG+DGNGG IVDSG+TFTFME P++E VAKEF +QM +Y+ A ++
Sbjct: 302 VIGDTHVKVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEI 361
Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
+ +GLRPC++ISG+KS+ +P+LI +FKGGAKMALP NYF++V + V+CL + +DN AG
Sbjct: 362 QNLTGLRPCYNISGEKSLSVPDLIFQFKGGAKMALPLSNYFSIVDSGVICLTIVSDNVAG 421
Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
P LG GPAIILG++Q +NFY+EFDL N++FGF +Q CA
Sbjct: 422 PGLGGGPAIILGNYQQRNFYVEFDLENEKFGFKQQSCA 459
>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 469
Score = 537 bits (1384), Expect = e-150, Method: Compositional matrix adjust.
Identities = 268/459 (58%), Positives = 339/459 (73%), Gaps = 10/459 (2%)
Query: 29 AATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK--TKTKPKTKDSNIGSN 86
+ V +PL+P S + S DP L LA SS++RA LK T KP + +
Sbjct: 16 VSAVKLPLSPFS---HSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTT 72
Query: 87 YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
S +++K+PLS SYGGYS+SLSFGTP Q + PF+FDTGSSLV PCTSRY C C+F
Sbjct: 73 ASATVVKSPLSAKSYGGYSVSLSFGTPSQ-TIPFVFDTGSSLVCLPCTSRYLCSGCDFSG 131
Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
+DP+ IP FIPK SSSS++IGCQ+PKC +++GPNV+ C+GC P + C + CP Y+LQY
Sbjct: 132 LDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNVQ--CRGCDPNTRNCTVGCPPYILQY 189
Query: 207 GLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSY 266
GLG TAG+L++E L FP TVP+F+ GCSI+S RQPAGIAGFGR SLPSQ+ LK+FS+
Sbjct: 190 GLGSTAGVLITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNLKRFSH 249
Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGD-SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
CL+SR+FDD V+++L LDTG G SKTPGL+YTPF KNP S+ AF E+YY+ LR+I
Sbjct: 250 CLVSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRI 309
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
VG KHVKIPY YL PG++G+GG IVDSGSTFTFME P+FE VA+EF QM NY+R D+
Sbjct: 310 YVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDL 369
Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAA 444
EK++GL PCF+ISGK V +PELI +FKGGAK+ LP NYF VGN + +CL + +D
Sbjct: 370 EKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTV 429
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
P+ G GPAIILG FQ QN+ +E+DL NDRFGFAK+KC+
Sbjct: 430 NPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 533 bits (1374), Expect = e-149, Method: Compositional matrix adjust.
Identities = 271/455 (59%), Positives = 334/455 (73%), Gaps = 19/455 (4%)
Query: 31 TVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNS 90
++T+PL+PL TK H SDSDP + ASSSL+RA HLK + +N S S
Sbjct: 28 SITLPLSPLLTKP--HSSDSDPFHSVKLAASSSLTRAHHLKHR-----------NNNSPS 74
Query: 91 LIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
+ TP SYGGYSI L+ GTPPQ S PF+ DTGSSLVWFPCTS Y C CNFPN+DP+
Sbjct: 75 VATTPAYPKSYGGYSIDLNLGTPPQTS-PFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPT 133
Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCS-PRNKTCPLACPSYLLQYGLG 209
+IP FIPK SS+++L+GC+NPKC ++FGP+VESRC C P ++ C L CPSY++QYGLG
Sbjct: 134 KIPTFIPKNSSTAKLLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLG 193
Query: 210 FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLL 269
TAG LL + L FP KTVP FL GCSILS RQP+GIAGFGR ESLPSQ+ LK+FSYCL+
Sbjct: 194 ATAGFLLLDNLNFPGKTVPQFLVGCSILSIRQPSGIAGFGRGQESLPSQMNLKRFSYCLV 253
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
S +FDD P SS+LVL +GD+KT GLSYTPF NP ++S F E+YYV LR++IVG
Sbjct: 254 SHRFDDTPQSSDLVLQIS-STGDTKTNGLSYTPFRSNP-SNNSVFREYYYVTLRKLIVGG 311
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-NYSRAADVEKK 388
VKIPY +L PGSDGNGG IVDSGSTFTFME P++ VA+EF+RQ+G YSR +VE +
Sbjct: 312 VDVKIPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQ 371
Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPA 447
SGL PCF+ISG K++ PE +FKGGAKM+ P NYF+ VG+ EVLC + +D AG
Sbjct: 372 SGLSPCFNISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQP 431
Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
GPAIILG++Q QNFY+E+DL N+RFGF + C
Sbjct: 432 KTAGPAIILGNYQQQNFYVEYDLENERFGFGPRNC 466
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 530 bits (1364), Expect = e-148, Method: Compositional matrix adjust.
Identities = 271/474 (57%), Positives = 340/474 (71%), Gaps = 25/474 (5%)
Query: 12 FSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK 71
S LL + A + + +T+PL + H S DPL+ L LASSS +RA +K
Sbjct: 7 LSFFYLLLFSSLSAIAHSNPITLPL-----NSFPHLSSPDPLQALTFLASSSQTRAHQIK 61
Query: 72 TKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF 131
T PK SNS+ K+PLS HSYG YS LSFGTP Q + IFDTGSSLVWF
Sbjct: 62 T---PK----------SNSVFKSPLSPHSYGAYSTPLSFGTP-QQTLHLIFDTGSSLVWF 107
Query: 132 PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPR 191
PCTSRY C +C+FP +DP+ IP F+PK SSSS+L+GCQNPKCSWIFGP+V+S+C+ C+P+
Sbjct: 108 PCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPK 167
Query: 192 NKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRS 251
+ C CP+Y++QYG G TAGLLLSETL FP K +PNF+ GCS LS QP+GIAGFGR
Sbjct: 168 TENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRG 227
Query: 252 SESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS 311
SESLPSQ+GLKKF+YCL SRKFDD+P S L+LD S K+ GL+YTPF +NP S+
Sbjct: 228 SESLPSQMGLKKFAYCLASRKFDDSPHSGQLILD----STGVKSSGLTYTPFRQNPSVSN 283
Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
+A+ E+YY+ +R+IIVG++ VK+PY +LVPG DGNGG I+DSGSTFTFM+ P+ E VA+E
Sbjct: 284 NAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVARE 343
Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
F +Q+ N++RA DVE +GLRPCFDIS +KSV PELI +FKGGAK ALP NYFALV +
Sbjct: 344 FEKQLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSS 403
Query: 432 E-VLCLILFTDNAAG-PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
V CL + T G GP++ILG FQ QNFY+E+DL N R GF +Q C+
Sbjct: 404 SGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 530 bits (1364), Expect = e-148, Method: Compositional matrix adjust.
Identities = 271/474 (57%), Positives = 340/474 (71%), Gaps = 25/474 (5%)
Query: 12 FSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK 71
S LL + A + + +T+PL + H S DPL+ L LASSS +RA +K
Sbjct: 7 LSFFYLLLFSSLSAIAHSNPITLPL-----NSFPHLSSPDPLQALTFLASSSQTRAHQIK 61
Query: 72 TKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF 131
T PK SNS+ K+PLS HSYG YS LSFGTP Q + IFDTGSSLVWF
Sbjct: 62 T---PK----------SNSVFKSPLSPHSYGAYSTPLSFGTP-QQTLHLIFDTGSSLVWF 107
Query: 132 PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPR 191
PCTSRY C +C+FP +DP+ IP F+PK SSSS+L+GCQNPKCSWIFGP+V+S+C+ C+P+
Sbjct: 108 PCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPK 167
Query: 192 NKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRS 251
+ C CP+Y++QYG G TAGLLLSETL FP K +PNF+ GCS LS QP+GIAGFGR
Sbjct: 168 TENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKXIPNFVVGCSFLSIHQPSGIAGFGRG 227
Query: 252 SESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS 311
SESLPSQ+GLKKF+YCL SRKFDD+P S L+LD S K+ GL+YTPF +NP S+
Sbjct: 228 SESLPSQMGLKKFAYCLASRKFDDSPHSGQLILD----STGVKSSGLTYTPFRQNPSVSN 283
Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
+A+ E+YY+ +R+IIVG++ VK+PY +LVPG DGNGG I+DSGSTFTFM+ P+ E VA+E
Sbjct: 284 NAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVARE 343
Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
F +Q+ N++RA DVE +GLRPCFDIS +KSV PELI +FKGGAK ALP NYFALV +
Sbjct: 344 FEKQLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSS 403
Query: 432 E-VLCLILFTDNAAG-PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
V CL + T G GP++ILG FQ QNFY+E+DL N R GF +Q C+
Sbjct: 404 SGVACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 260/474 (54%), Positives = 338/474 (71%), Gaps = 24/474 (5%)
Query: 11 LFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHL 70
LFS+ +LL SS++T +PLT + + +DP K ++ L S+SL+RA+HL
Sbjct: 59 LFSIFLLL-----PTSSSSSTTVLPLTTFPSVSF-----TDPFKTINLLLSASLNRAQHL 108
Query: 71 KTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVW 130
KT P++K + N S L SYG YS+SL+FGTPPQ + FIFDTGSSLVW
Sbjct: 109 KT---PQSKSNTSIQNVS-------LFPRSYGAYSVSLAFGTPPQ-NLSFIFDTGSSLVW 157
Query: 131 FPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSP 190
FPCT+ YRC C+FP VDP+ I F+PK SSS +++GC+NPKC+WIFGPN++SRC+ C+
Sbjct: 158 FPCTAGYRCSRCSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNS 217
Query: 191 RNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGR 250
+++ C +CP Y LQYG G TAG+LLSETL +K VP+FL GCS++S QPAGIAGFGR
Sbjct: 218 KSRKCSDSCPGYGLQYGSGATAGILLSETLDLENKRVPDFLVGCSVMSVHQPAGIAGFGR 277
Query: 251 SSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS 310
ESLPSQ+ LK+FS+CL+SR FDD+PVSS LVLD+G S +SKT Y PF +NP S
Sbjct: 278 GPESLPSQMRLKRFSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVS 337
Query: 311 SSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAK 370
++AF E+YY+ LR+I++G K VK PY YLVP S GNGG I+DSGSTFTF++ P+FEA+A
Sbjct: 338 NAAFREYYYLSLRRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIAD 397
Query: 371 EFIRQMGNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALV 429
E +Q+ Y RA DVE +SGLRPCF+I ++S P+++LKFKGG K++L ENY A+V
Sbjct: 398 ELEKQLVKYPRAKDVEAQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMV 457
Query: 430 GNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+E V+CL + TD A G GPAIILG FQ QN +E+DLA R GF KQKC
Sbjct: 458 TDEGVVCLTMMTDEAVV-GGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 510
>gi|297740190|emb|CBI30372.3| unnamed protein product [Vitis vinifera]
Length = 445
Score = 480 bits (1236), Expect = e-133, Method: Compositional matrix adjust.
Identities = 254/436 (58%), Positives = 310/436 (71%), Gaps = 29/436 (6%)
Query: 10 CLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARH 69
C F+L LL ++ + AT+T+PLTPL TK+ SDP ++L L S+SL+RA H
Sbjct: 29 CGFTLFSLLLLANSSPDKNPATITLPLTPLFTKN----PSSDPWQLLSHLTSASLTRAHH 84
Query: 70 LKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
LK + + S + TPL HSYGGYS+SLSFGTP Q + F+ DTGSSLV
Sbjct: 85 LKHRK-------------NTSSVNTPLFAHSYGGYSVSLSFGTPSQ-TLSFVMDTGSSLV 130
Query: 130 WFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCS 189
WFPCTSRY C C+FPN+DP++IP FIPK SSS++++GC NPKC ++ S
Sbjct: 131 WFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNPKCGFVMD----------S 180
Query: 190 PRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFG 249
+ C ACP+Y +QYGLG T GLLL E+L F +T P+F+ GCSILS RQP+GIAGFG
Sbjct: 181 ENSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPDFVVGCSILSSRQPSGIAGFG 240
Query: 250 RSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVG 309
R SLP Q+GLKKFSYCLLS +FDD+P SS + L GP S D KT GLSYTPF KNPV
Sbjct: 241 RGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVS 300
Query: 310 SSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVA 369
S+SAF E+YYV LR IIVG K VK+PYS++V GSDGNGG IVDSGSTFTFME P+FEAVA
Sbjct: 301 SNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVA 360
Query: 370 KEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
EF RQM NY+RAADVE SGL+PCF++SG SV LP L+ +FKGGAKM LP NYF+LV
Sbjct: 361 TEFDRQMANYTRAADVEALSGLKPCFNLSGVGSVALPSLVFQFKGGAKMELPVANYFSLV 420
Query: 430 GN-EVLCLILFTDNAA 444
G+ VLCL + ++ A
Sbjct: 421 GDLSVLCLTIVSNEAV 436
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 476 bits (1224), Expect = e-131, Method: Compositional matrix adjust.
Identities = 248/434 (57%), Positives = 301/434 (69%), Gaps = 19/434 (4%)
Query: 51 DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSF 110
DP + L L S+SL RARHLK T + L HSYG YSI LSF
Sbjct: 50 DPYRNLRHLVSASLIRARHLKNPKTTPTSTTP-------------LFTHSYGAYSIPLSF 96
Query: 111 GTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
GTPPQ + P I DTGS LVWFPCT RY C +C+F +PS FIPK SSSS+++GC N
Sbjct: 97 GTPPQ-TLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSS-NIFIPKSSSSSKVLGCVN 154
Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNF 230
PKC WI G V+SRC+ C P + C CP YL+ YG G T G++LSETL P K VPNF
Sbjct: 155 PKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITGGIMLSETLDLPGKGVPNF 214
Query: 231 LAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGS 290
+ GCS+LS QPAGI+GFGR SLPSQLGLKKFSYCLLSR++DD SS+LVLD S
Sbjct: 215 IVGCSVLSTSQPAGISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTESSSLVLDGESDS 274
Query: 291 GDSKTPGLSYTPFYKNP-VGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGV 349
G+ KT GLSYTPF +NP V AF +YY+GLR I VG KHVKIPY YL+PG+DG+GG
Sbjct: 275 GE-KTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYKYLIPGADGDGGT 333
Query: 350 IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELI 409
I+DSG+TFT+M+G +FE VA EF +Q+ + RA +VE +GLRPCF+ISG + PEL
Sbjct: 334 IIDSGTTFTYMKGEIFELVAAEFEKQVQS-KRATEVEGITGLRPCFNISGLNTPSFPELT 392
Query: 410 LKFKGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
LKF+GGA+M LP NY A + G++V+CL + TD AAG GPAIILG+FQ QNFY+E+
Sbjct: 393 LKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNFQQQNFYVEY 452
Query: 469 DLANDRFGFAKQKC 482
DL N+R GF +Q C
Sbjct: 453 DLRNERLGFRQQSC 466
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 459 bits (1182), Expect = e-126, Method: Compositional matrix adjust.
Identities = 246/482 (51%), Positives = 321/482 (66%), Gaps = 32/482 (6%)
Query: 6 FSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLS 65
FSL+ S++I F++ S+ T+T+ L+PL T H S S P L S+S++
Sbjct: 8 FSLLSFLSIIITTFSS-----STPNTITLHLSPLFTNH--PSSSSHPFHTLKLAVSTSIT 60
Query: 66 RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
RA HLK KP N ++TP+ +YGGYSI L FGTP Q + PF+ DTG
Sbjct: 61 RAHHLKNH-KP------------NKSLETPVHPKTYGGYSIDLEFGTPSQ-TFPFVLDTG 106
Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESR- 184
S+LVW PC+S Y C CN S P FIPK SSSS+ +GC NPKC+W+FGP+V+S
Sbjct: 107 STLVWLPCSSHYLCSKCN----SFSNTPKFIPKNSSSSKFVGCTNPKCAWVFGPDVKSHC 162
Query: 185 CKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAG 244
C+ C CP+Y +QYGLG TAG LLSE L FP+K +FL GCS++S QPAG
Sbjct: 163 CRQDKAAFNNCSQTCPAYTVQYGLGSTAGFLLSENLNFPTKKYSDFLLGCSVVSVYQPAG 222
Query: 245 IAGFGRSSESLPSQLGLKKFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGLSYTPF 303
IAGFGR ESLPSQ+ L +FSYCLLS +FDD A ++SNLVL+T S D KT G+SYTPF
Sbjct: 223 IAGFGRGEESLPSQMNLTRFSYCLLSHQFDDSATITSNLVLETA-SSRDGKTNGVSYTPF 281
Query: 304 YKNPVGSSS-AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEG 362
KNP + AFG +YY+ L++I+VG K V++P L P DG+GG IVDSGSTFTFME
Sbjct: 282 LKNPTTKKNPAFGAYYYITLKRIVVGEKRVRVPRRLLEPNVDGDGGFIVDSGSTFTFMER 341
Query: 363 PLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS-GKKSVYLPELILKFKGGAKMALP 421
P+F+ VA+EF +Q+ +Y+RA + EK+ GL PCF ++ G ++ PEL +F+GGAKM LP
Sbjct: 342 PIFDLVAQEFAKQV-SYTRAREAEKQFGLSPCFVLAGGAETASFPELRFEFRGGAKMRLP 400
Query: 422 PENYFALVGN-EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQ 480
NYF+LVG +V CL + +D+ AG GPA+ILG++Q QNFY+E+DL N+RFGF Q
Sbjct: 401 VANYFSLVGKGDVACLTIVSDDVAGSGGTVGPAVILGNYQQQNFYVEYDLENERFGFRSQ 460
Query: 481 KC 482
C
Sbjct: 461 SC 462
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 456 bits (1172), Expect = e-125, Method: Compositional matrix adjust.
Identities = 243/447 (54%), Positives = 295/447 (65%), Gaps = 31/447 (6%)
Query: 38 PLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLS 97
PLS + + D L+ L+ L S+SL+RA HLK P+T TP+
Sbjct: 29 PLSHSYTNQNPSQDHLQKLNYLVSTSLARAHHLK---NPQT---------------TPVF 70
Query: 98 VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
HSYGGYSISLSFGTPPQ + F+ DTGSS VWFPCT RY C +C+F SRI F+P
Sbjct: 71 SHSYGGYSISLSFGTPPQ-TLSFVMDTGSSFVWFPCTLRYLCNNCSFT----SRISPFLP 125
Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLS 217
K SSSS++IGC+NPKCSWI ++ RC C ++ C CP YL+ YG G T G+ LS
Sbjct: 126 KHSSSSKIIGCKNPKCSWIHQTDL--RCTDCDNNSRNCSQICPPYLILYGSGTTGGVALS 183
Query: 218 ETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
ETL VPNFL GCS+ S RQPAGIAGFGR SLPSQLGL KFSYCLLS KFDD
Sbjct: 184 ETLHLHGLIVPNFLVGCSVFSSRQPAGIAGFGRGPSSLPSQLGLTKFSYCLLSHKFDDTQ 243
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNP-VGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
SS+LVLD+ S D KT L YTP KNP V AF +YYV LR+I +G + VKIPY
Sbjct: 244 ESSSLVLDSQSDS-DKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPY 302
Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
YL P DGNGG I+DSG+TFT+M FE ++ EFI Q+ NY RA VE SGL+PCF+
Sbjct: 303 KYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGLKPCFN 362
Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAII 455
+SG K + LP+L L FKGGA + LP ENYFA +G+ EV C + TD A + GP +I
Sbjct: 363 VSGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTVVTDGAEKAS---GPGMI 419
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
LG+FQ+QNFY+E+DL N+R GF K+ C
Sbjct: 420 LGNFQMQNFYVEYDLQNERLGFKKESC 446
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 452 bits (1163), Expect = e-124, Method: Compositional matrix adjust.
Identities = 249/459 (54%), Positives = 306/459 (66%), Gaps = 24/459 (5%)
Query: 29 AATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYS 88
++++T+PL T D + L+ L ++SL+RARHLK P+T
Sbjct: 6 SSSITIPLQHPQTNQIPFQ---DQYQKLNHLVTTSLARARHLKN---PQTT--------P 51
Query: 89 NSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVD 148
+ PL HSYGGYS+SLSFGTPPQ + FI DTGS +VWFPCTS Y C C+F +
Sbjct: 52 ATTTTAPLFSHSYGGYSVSLSFGTPPQ-TLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSS 110
Query: 149 PS-RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV--ESRCKGCSPRNKTCPLACPSYLLQ 205
PS RI FIPK SSSS+L+GC+NPKCSWI N+ + C S N+TCP Y++
Sbjct: 111 PSSRIQPFIPKESSSSKLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCP----PYMIF 166
Query: 206 YGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFS 265
YG G T G+ LSETL S + PNFL GCS+ S QPAGIAGFGR SLPSQLGL KFS
Sbjct: 167 YGSGTTGGVALSETLHLHSLSKPNFLVGCSVFSSHQPAGIAGFGRGLSSLPSQLGLGKFS 226
Query: 266 YCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNP-VGSSSAFGEFYYVGLR 323
YCLLS +FDD SS+LVLD D KT L YTPF KNP V + S+F +YY+GLR
Sbjct: 227 YCLLSHRFDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLR 286
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
+I VG HVK+PY YL PG DGNGGVI+DSG+TFTFM FE ++ EFIRQ+ +Y R
Sbjct: 287 RITVGGHHVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVK 346
Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNA 443
++E GLRPCF++S K+V PEL L FKGGA +ALP ENYFA VG EV CL + TD
Sbjct: 347 EIEDAIGLRPCFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVGGEVACLTVVTDGV 406
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
AGP GP +ILG+FQ+QNFY+E+DL N+R GF ++KC
Sbjct: 407 AGPERVGGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445
>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 601
Score = 444 bits (1141), Expect = e-122, Method: Compositional matrix adjust.
Identities = 226/449 (50%), Positives = 295/449 (65%), Gaps = 28/449 (6%)
Query: 44 YLHH---SDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHS 100
+ HH S+S P L S+S++RA HLK P S +KT + +
Sbjct: 166 FTHHPSSSNSHPFHTLQLAVSTSITRAHHLKNHNNP-------------SSLKTLVHPKT 212
Query: 101 YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCN-FPNVDPSRIPAFIPKR 159
YGGYSI L FGTPPQ + PF+ DTGSSLVW PC S Y C CN F N + P FIPK
Sbjct: 213 YGGYSIDLKFGTPPQ-TFPFVLDTGSSLVWLPCYSHYLCSKCNSFSN---NNTPKFIPKD 268
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRC----KGCSPRNKTCPLACPSYLLQYGLGFTAGLL 215
S SS+ +GC+NPKC+W+FG +V S C K N C CP+Y +QYGLG TAG L
Sbjct: 269 SFSSKFVGCRNPKCAWVFGSDVTSHCCKLAKAAFSNNNNCSQTCPAYTVQYGLGSTAGFL 328
Query: 216 LSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD 275
LSE L FP+K V +FL GCS++S QP GIAGFGR ESLP+Q+ L +FSYCLLS +FD+
Sbjct: 329 LSENLNFPAKNVSDFLVGCSVVSVYQPGGIAGFGRGEESLPAQMNLTRFSYCLLSHQFDE 388
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
+P +S+LV++ KT G+SYT F KNP AFG +YY+ LR+I+VG K V++P
Sbjct: 389 SPENSDLVMEATNSGEGKKTNGVSYTAFLKNPSTKKPAFGAYYYITLRKIVVGEKRVRVP 448
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
L P +G+GG IVDSGST TFME P+F+ VA+EF++Q+ NY+RA ++EK+ GL PCF
Sbjct: 449 RRMLEPDVNGDGGFIVDSGSTLTFMERPIFDLVAEEFVKQV-NYTRARELEKQFGLSPCF 507
Query: 396 DIS-GKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPA 453
++ G ++ PE+ +F+GGAKM LP NYF+ VG +V CL + +D+ AG GPA
Sbjct: 508 VLAGGAETASFPEMRFEFRGGAKMRLPVANYFSRVGKGDVACLTIVSDDVAGQGGAVGPA 567
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ILG++Q QNFY+E DL N+RFGF Q C
Sbjct: 568 VILGNYQQQNFYVECDLENERFGFRSQSC 596
>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
Length = 454
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 229/435 (52%), Positives = 277/435 (63%), Gaps = 34/435 (7%)
Query: 51 DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSF 110
DP + L L S+SL RARHLK + TPL HSYG YSI LSF
Sbjct: 50 DPYRNLRHLVSASLIRARHLKNPK-------------TTPTSTTPLFTHSYGAYSIPLSF 96
Query: 111 GTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
GTPPQ + P I DTGS LVWFPCT RY C +C+F +PS FIPK SSSS+++GC N
Sbjct: 97 GTPPQ-TLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSS-NIFIPKSSSSSKVLGCVN 154
Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNF 230
PKC WI G V+SRC+ C P + C CP YL LRF F
Sbjct: 155 PKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYL--------------NFLRFWDHRRSQF 200
Query: 231 LAGCSI-LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPG 289
L I+GFGR SLPSQLGLKKFSYCLLSR++DD SS+LVLD
Sbjct: 201 HRRMLCPLHQSTRREISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTESSSLVLDGESD 260
Query: 290 SGDSKTPGLSYTPFYKNP-VGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGG 348
SG+ KT GLSYTPF +NP V AF +YY+GLR I VG KHVKIPY YL+PG+DG+GG
Sbjct: 261 SGE-KTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYKYLIPGADGDGG 319
Query: 349 VIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPEL 408
I+DSG+TFT+M+G +FE VA EF +Q+ + RA +VE +GLRPCF+ISG + PEL
Sbjct: 320 TIIDSGTTFTYMKGEIFELVAAEFEKQVQS-KRATEVEGITGLRPCFNISGLNTPSFPEL 378
Query: 409 ILKFKGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLE 467
LKF+GGA+M LP NY A + G++V+CL + TD AAG GPAIILG+FQ QNFY+E
Sbjct: 379 TLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNFQQQNFYVE 438
Query: 468 FDLANDRFGFAKQKC 482
+DL N+R GF +Q C
Sbjct: 439 YDLRNERLGFRQQSC 453
>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 207/461 (44%), Positives = 289/461 (62%), Gaps = 32/461 (6%)
Query: 27 SSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKT-KTKPKTKDSNIGS 85
++ AT+T+PLT + + + PL+ L LA++SLSRA HLK KT P T+ S
Sbjct: 27 NTPATITIPLT----STFTNSPSTKPLRFLQHLATASLSRAHHLKHGKTSPLTQIS---- 78
Query: 86 NYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP 145
LS HSYGG+SI LSFGTPPQ + F+ DTGS +VW PCT+ Y C +C+F
Sbjct: 79 ----------LSPHSYGGHSIPLSFGTPPQKLS-FLVDTGSHVVWAPCTTHYTCTNCSFS 127
Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
+ +P ++P F PK SSSS+++GC+NPKC P+V C C+ +K C ACP Y LQ
Sbjct: 128 DAEPKKVPIFNPKLSSSSKILGCRNPKCVNTSSPDVHLGCPPCNGNSKNCSHACPPYSLQ 187
Query: 206 YGLGFTAGLLLSETLRFPSKTVPNFLAGC--SILSDRQPAGIAGFGRSSESLPSQLGLKK 263
YG G ++G L E L FP KT+ FL GC S + + A +AGFGRS SLP Q+G+KK
Sbjct: 188 YGTGASSGDFLLENLNFPGKTIHEFLVGCTTSAVGEVTSAALAGFGRSMFSLPMQMGVKK 247
Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
F+YCL S +DD SS L+LD D +T GLSY PF KNP F +YY+G++
Sbjct: 248 FAYCLNSHDYDDTRNSSKLILDY----SDGETKGLSYAPFLKNP----PDFPIYYYLGVK 299
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
I +G+K ++IP YL PGSDG GG+++DSG + +M GP+F+ V E ++M Y R+
Sbjct: 300 DIKIGNKLLRIPSKYLAPGSDGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRMSKYRRSL 359
Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNA 443
+ E + G+ PC++ +G+KS+ +P+LI +F+GGA M +P +NYF L+ L T +A
Sbjct: 360 EAEAEIGVTPCYNFTGQKSIKIPDLIYQFRGGATMVVPGKNYFVLIPEISLACFPLTTDA 419
Query: 444 AGPAL--GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
L GP+IILG+ Q ++Y+EFDL N+R GF +Q C
Sbjct: 420 GTNTLEFTPGPSIILGNSQHVDYYVEFDLKNERLGFRQQTC 460
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 206/436 (47%), Positives = 278/436 (63%), Gaps = 30/436 (6%)
Query: 51 DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSF 110
D + ++ A SSLSRARHLK +P T + P SYGGYS+ S
Sbjct: 33 DKWESINLAALSSLSRARHLK---RPPTLTGKV---------TLPAYPRSYGGYSVIFSL 80
Query: 111 GTPPQASTPFIFDTGSSLVWFPCT---SRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG 167
GTPPQ + + DTGSSLVW PCT + Y C +C F VDP++IP + +SS+ Q +
Sbjct: 81 GTPPQKVS-LVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLP 139
Query: 168 CQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS-KT 226
C++PKC+W+FG ++ CS + CP Y L+YGLG T G L+S+ L
Sbjct: 140 CRSPKCNWVFGSDLN-----CSTTKR-----CPYYGLEYGLGSTTGQLVSDVLGLSKLNR 189
Query: 227 VPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDT 286
+P+FL GCS++S+RQP GIAGFGR S+P+QLGL KFSYCL+S +FDD P S +LVL
Sbjct: 190 IPDFLFGCSLVSNRQPEGIAGFGRGLASIPAQLGLTKFSYCLVSHRFDDTPQSGDLVLHR 249
Query: 287 GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGN 346
G D+ G++Y PF K+P + S + E+YY+ L +I+VG K V IP YLVP +G+
Sbjct: 250 GRRHADAAANGVAYAPFTKSP--ALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGD 307
Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP 406
GG+IVDSGSTFTFME +F+ VA+E + M Y RA ++E SGL PC++I+G+ V +P
Sbjct: 308 GGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNITGQSEVDVP 367
Query: 407 ELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYL 466
+L FKGGA M LP +YF+LV + V+C+ + TD P GPAIILG++Q QNFY+
Sbjct: 368 KLTFSFKGGANMDLPLTDYFSLVTDGVVCMTVLTDPDE-PGSTTGPAIILGNYQQQNFYI 426
Query: 467 EFDLANDRFGFAKQKC 482
E+DL RFGF Q+C
Sbjct: 427 EYDLKKQRFGFKPQQC 442
>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
Length = 458
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 207/481 (43%), Positives = 295/481 (61%), Gaps = 35/481 (7%)
Query: 6 FSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLS 65
FS+ LFS L+L + + AT+T+PLTP TK+ ++PL L LA++S+S
Sbjct: 9 FSVFTLFSRLVL---ASSSKNNIPATITIPLTPTFTKN----PSTEPLLFLQHLATASMS 61
Query: 66 RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
R+ HLK ++ LI+T L HS+GG++I LSFGTPPQ + F+ DTG
Sbjct: 62 RSHHLK-------------HGKASPLIQTSLFPHSHGGHTIPLSFGTPPQKLS-FLVDTG 107
Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
S +VW PCT+ Y C +C+F N P ++P F P+ SSS +++GC++PKC+ P+V C
Sbjct: 108 SHVVWAPCTTHYTCTNCSFSN--PKKVPIFNPELSSSDKILGCRDPKCANTSSPDVHLGC 165
Query: 186 KGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPA-- 243
C+ +K C ACP Y LQYG G +G L E L FP KT+ FL GC+ +DR+P+
Sbjct: 166 PRCNGNSKKCSHACPQYTLQYGTGAASGFFLLENLDFPGKTIHKFLVGCTTSADREPSSD 225
Query: 244 GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPF 303
+AGFGR+ SLP Q+G+KKF+YCL S +DD S L+LD D +T GLSY PF
Sbjct: 226 ALAGFGRTMFSLPMQMGVKKFAYCLNSHDYDDTRNSGKLILDYS----DGETQGLSYAPF 281
Query: 304 YKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGP 363
KNP + +YY+G++ + +G+K ++IP YL PGSD GGV++DSG + +M P
Sbjct: 282 LKNP----PDYPFYYYLGVKDMKIGNKLLRIPGKYLTPGSDSRGGVMIDSGFAYGYMTLP 337
Query: 364 LFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPE 423
+F+ V E +QM Y R+ + E +SGL PC++ +G KS+ +P+LI +F GGA M +P
Sbjct: 338 VFKIVTNELKKQMSKYRRSLEAETQSGLTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGM 397
Query: 424 NYFALVGNEVL-CLILFTDNAAGP-ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQK 481
NYF L L C + TD+ GP+IILG++Q + Y+EFDL N+R GF +Q
Sbjct: 398 NYFLLFSEASLGCFPVTTDSPTNNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQT 457
Query: 482 C 482
C
Sbjct: 458 C 458
>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
Length = 458
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 204/481 (42%), Positives = 297/481 (61%), Gaps = 35/481 (7%)
Query: 6 FSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLS 65
FS+ LFS L+L + + AT+T+PLTP+ TK+ ++PL L LA++S+S
Sbjct: 9 FSVFTLFSHLVL---ASSSKNNIPATITIPLTPIFTKN----PSTEPLLFLQHLATASMS 61
Query: 66 RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
R+ HLK ++ LI+T L HSYG ++I LSFGTPPQ + F+ DTG
Sbjct: 62 RSHHLK-------------HGKASPLIQTSLFPHSYGAHTIPLSFGTPPQKLS-FLMDTG 107
Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
S +VW PCT+ Y C +C+F N P ++P F P+ SSS +++GC++PKC+ PBV
Sbjct: 108 SHVVWAPCTTHYTCTNCSFSN--PKKVPIFNPELSSSDKILGCRDPKCADTSSPBVHLGX 165
Query: 186 KGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPA-- 243
C+ +K C ACP Y LQYG G +G L E L FP KT+ FL GC+ +DR+P+
Sbjct: 166 PRCNGNSKKCSHACPQYTLQYGTGAASGFFLLENLDFPGKTIHKFLVGCTTSADREPSSD 225
Query: 244 GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPF 303
+AGFGR+ SLP Q+G+KKF+YCL S +DD S L+LD D +T GLSY PF
Sbjct: 226 ALAGFGRTMFSLPMQMGVKKFAYCLNSHDYDDTRNSGKLILD----YSDGETQGLSYAPF 281
Query: 304 YKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGP 363
KNP + +YY+G++ + +G+K ++IP YL PGSD GGV++DSG +++M P
Sbjct: 282 XKNP----PDYPIYYYLGVKDMKIGNKVLRIPGKYLTPGSDSRGGVVIDSGFAYSYMTLP 337
Query: 364 LFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPE 423
+F+ V E +QM Y R+ ++E ++G+ PC++ +G KS+ +P+LI +F GGA M +P
Sbjct: 338 VFKIVTNELKKQMSKYRRSLELEAQTGVTPCYNFTGHKSIKIPDLIYQFTGGANMVVPGM 397
Query: 424 NYFALVGNEVL-CLILFTDN-AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQK 481
NYF L L C + TD+ + GP+IILG++Q + Y+EFDL N+R GF +Q
Sbjct: 398 NYFLLFSEASLGCFPVTTDSPTSNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQT 457
Query: 482 C 482
C
Sbjct: 458 C 458
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 214/477 (44%), Positives = 292/477 (61%), Gaps = 43/477 (9%)
Query: 11 LFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHL 70
+F L +LF + AT+T+PLT T S PL AS+SLSRA HL
Sbjct: 12 VFILFSILFLASCSKDNIPATITIPLTSTFT--------SKPL------ASASLSRAHHL 57
Query: 71 KT-KTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
K KT P +KT L HSYGG+SISLSFGTPPQ + F+ DTGS +V
Sbjct: 58 KHGKTNPP--------------VKTSLFPHSYGGHSISLSFGTPPQKLS-FLVDTGSDVV 102
Query: 130 WFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCS 189
W PCT+ Y C +C+F DP ++P F PK SSSS+++ C+NPKC + P V C C+
Sbjct: 103 WAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKILDCRNPKCVSTYFPYVHLGCPRCN 162
Query: 190 PRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPA--GIAG 247
+K C ACP Y QYG G ++G L E L+FP KT+ NFL GC+ + R+ + +AG
Sbjct: 163 GNSKHCSYACP-YSTQYGTGASSGYFLLENLKFPRKTIRNFLLGCTTSAARELSSDALAG 221
Query: 248 FGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNP 307
FGRS SLP Q+G+KKF+YCL S +DD S L+LD D KT GLSYTPF K+P
Sbjct: 222 FGRSMFSLPIQMGVKKFAYCLNSHDYDDTRNSGKLILDY----RDGKTKGLSYTPFLKSP 277
Query: 308 VGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG-STFTFMEGPLFE 366
+SAF +Y++G++ I +G+K ++IP YL PGSDG GVI+DSG +M GP+F+
Sbjct: 278 --PASAF--YYHLGVKDIKIGNKLLRIPSKYLAPGSDGRSGVIIDSGYGGAGYMTGPVFK 333
Query: 367 AVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF 426
V E +QM Y R+ + E ++GL PC++ +G KS+ +P LI +F+GGA M +P +NYF
Sbjct: 334 IVTNELKKQMSKYRRSLEAETQTGLTPCYNFTGHKSIKIPPLIYQFRGGANMVVPGKNYF 393
Query: 427 ALVGNEVL-CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ E L C ++ T+ + P+IILG+ Q ++Y+E+DL NDRFGF +Q C
Sbjct: 394 GISPQESLACFLMDTNGTNALEITPDPSIILGNSQHVDYYVEYDLKNDRFGFRRQTC 450
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 350 bits (897), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 201/440 (45%), Positives = 269/440 (61%), Gaps = 28/440 (6%)
Query: 61 SSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTP-LSVHSYGGYSISLSFGTPPQASTP 119
++SL+RA HLK + S GS S+ T L HSYGGY+ + S GTPPQ P
Sbjct: 25 AASLARALHLKRRDP--NHHSQKGSGGHPSVPATAALYPHSYGGYAFTASLGTPPQ-PLP 81
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIF-G 178
+ DTGS L W PCTS Y C +C+ P+ S +P F PK SSSS+L+GC+NP C W+
Sbjct: 82 VLLDTGSHLTWVPCTSSYECRNCSSPSA--SAVPVFHPKNSSSSRLVGCRNPSCQWVHSA 139
Query: 179 PNVESRCKG--CSPRNKTCPLA----CPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLA 232
N+ ++C+ CSP CP A CP Y + YG G TAGLL+++TLR P + VP F+
Sbjct: 140 ANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVL 199
Query: 233 GCSILSDRQP-AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD-APVSSNLVLDTGPGS 290
GCS++S QP +G+AGFGR + S+P+QLGL KFSYCLLSR+FDD A VS +LVL
Sbjct: 200 GCSLVSVHQPPSGLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDDNAAVSGSLVLGG---- 255
Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVI 350
G+ Y P K+ G +G +YY+ LR + VG K V++P + G+GG I
Sbjct: 256 -TGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTI 314
Query: 351 VDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPCFDI-SGKKSVYLPEL 408
VDSG+TFT+++ +F+ VA + + G Y R+ D E + GL PCF + G +S+ LPEL
Sbjct: 315 VDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPEL 374
Query: 409 ILKFKGGAKMALPPENYFALVGN---EVLCLILFTDNAAGPALGR---GPAIILGDFQLQ 462
F+GGA M LP ENYF + G E +CL + TD + G G GPAIILG FQ Q
Sbjct: 375 SFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQ 434
Query: 463 NFYLEFDLANDRFGFAKQKC 482
N+ +E+DL +R GF +Q C
Sbjct: 435 NYLVEYDLEKERLGFRRQSC 454
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 335 bits (859), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 194/428 (45%), Positives = 258/428 (60%), Gaps = 26/428 (6%)
Query: 73 KTKPKTKDSNIGSNYSNSLIKTP-LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF 131
K + S GS S+ T L HSYGGY+ + S GTPPQ P + DTGS L W
Sbjct: 67 KRRDPNHHSQKGSGGHPSVPATAALYPHSYGGYAFTASLGTPPQ-PLPVLLDTGSHLTWV 125
Query: 132 PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIF-GPNVESRCKG--C 188
PCTS Y C +C+ P+ S +P F PK SSSS+L+GC+NP C W+ N+ ++C+ C
Sbjct: 126 PCTSSYECRNCSSPSA--SAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPC 183
Query: 189 SPRNKTCPLA----CPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQP-A 243
SP CP A CP Y + YG G TAGLL+++TLR P + VP F+ GCS++S QP +
Sbjct: 184 SPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVLGCSLVSVHQPPS 243
Query: 244 GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGLSYTP 302
G+AGFGR + S+P+QLGL KFSYCLLSR+FDD A VS +LVL G+ Y P
Sbjct: 244 GLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDDNAAVSGSLVLGG-----TGGGEGMQYVP 298
Query: 303 FYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEG 362
K+ G +G +YY+ LR + VG K V++P + G+GG IVDSG+TFT+++
Sbjct: 299 LVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAGNAAGSGGTIVDSGTTFTYLDP 358
Query: 363 PLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMAL 420
+F+ VA + + G Y R+ D E GL PCF + G +S+ LPEL F+GGA M L
Sbjct: 359 TVFQPVADAVVAAVGGRYKRSKDAEDGLGLHPCFALPQGARSMALPELSFHFEGGAVMQL 418
Query: 421 PPENYFALVGN---EVLCLILFTD---NAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
P ENYF + G E +CL + TD + G GPAIILG FQ QN+ +E+DL +R
Sbjct: 419 PVENYFVVAGRGAVEAICLAVVTDFGGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKER 478
Query: 475 FGFAKQKC 482
GF +Q C
Sbjct: 479 LGFRRQSC 486
>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
Length = 497
Score = 316 bits (809), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 190/439 (43%), Positives = 257/439 (58%), Gaps = 34/439 (7%)
Query: 69 HLKTKTKPKTKDSNIGSNYSNSLIKTPLSV--HSYGGYSISLSFGTPPQASTPFIFDTGS 126
HLK + + S+ + I ++ HSYGGY+ + S GTPPQ P + DTGS
Sbjct: 66 HLKRRGRASHHSQKGSSSGGHKSIPATAALYPHSYGGYAFTASLGTPPQ-PLPVLLDTGS 124
Query: 127 SLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCK 186
L W PCTS Y C +C+ P + +P F PK SSSS+L+GC+NP C W+ ++C+
Sbjct: 125 QLTWVPCTSNYDCRNCSSPFA--AAVPVFHPKNSSSSRLVGCRNPSCLWVHSAEHVAKCR 182
Query: 187 GCSPRNKTCPLA---CPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQP- 242
R C A CP Y + YG G TAGLL+++TLR P + V F+ GCS++S QP
Sbjct: 183 APCSRGANCTPASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVSGFVLGCSLVSVHQPP 242
Query: 243 AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGLSYT 301
+G+AGFGR + S+P+QLGL KFSYCLLSR+FDD A VS +LVL GD+ G+ Y
Sbjct: 243 SGLAGFGRGAPSVPAQLGLSKFSYCLLSRRFDDNAAVSGSLVL-----GGDND--GMQYV 295
Query: 302 PFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFME 361
P K+ G + +YY+ L + VG K V++P + G+GG IVDSG+TFT+++
Sbjct: 296 PLVKSAAGDKQPYAVYYYLALSGVTVGGKAVRLPARAFAANAAGSGGAIVDSGTTFTYLD 355
Query: 362 GPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMA 419
+F+ VA + + G Y R+ DVE+ GL PCF + G KS+ LPEL L FKGGA M
Sbjct: 356 PTVFQPVADAVVAAVGGRYKRSKDVEEGLGLHPCFALPQGAKSMALPELSLHFKGGAVMQ 415
Query: 420 LPPENYFALVGNE-------------VLCLILFTD--NAAGPALGRGPAIILGDFQLQNF 464
LP ENYF + G +CL + TD + G GPAIILG FQ QN+
Sbjct: 416 LPLENYFVVAGRAPVPGAGAGAGAAEAICLAVVTDFGGSGAGDEGGGPAIILGSFQQQNY 475
Query: 465 YLEFDLANDRFGFAKQKCA 483
+E+DL +R GF +Q CA
Sbjct: 476 LVEYDLEKERLGFRRQPCA 494
>gi|194708432|gb|ACF88300.1| unknown [Zea mays]
Length = 452
Score = 315 bits (806), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 173/376 (46%), Positives = 234/376 (62%), Gaps = 24/376 (6%)
Query: 124 TGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIF-GPNVE 182
+GS L W PCTS Y C +C+ P+ S +P F PK SSSS+L+GC+NP C W+ N+
Sbjct: 79 SGSHLTWVPCTSSYECRNCSSPSA--SAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLA 136
Query: 183 SRCKG--CSPRNKTCPLA----CPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSI 236
++C+ CSP CP A CP Y + YG G TAGLL+++TLR P + VP F+ GCS+
Sbjct: 137 TKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGSTAGLLIADTLRAPGRAVPGFVLGCSL 196
Query: 237 LSDRQP-AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSK 294
+S QP +G+AGFGR + S+P+QLGL KFSYCLLSR+FDD A VS +LVL
Sbjct: 197 VSVHQPPSGLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDDNAAVSGSLVLGG-----TGG 251
Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
G+ Y P K+ G +G +YY+ LR + VG K V++P + G+GG IVDSG
Sbjct: 252 GEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSG 311
Query: 355 STFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKF 412
+TFT+++ +F+ VA + + G Y R+ D E + GL PCF + G +S+ LPEL F
Sbjct: 312 TTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSFHF 371
Query: 413 KGGAKMALPPENYFALVGN---EVLCLILFTDNAAGPALGR---GPAIILGDFQLQNFYL 466
+GGA M LP ENYF + G E +CL + TD + G G GPAIILG FQ QN+ +
Sbjct: 372 EGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQNYLV 431
Query: 467 EFDLANDRFGFAKQKC 482
E+DL +R GF +Q C
Sbjct: 432 EYDLEKERLGFRRQSC 447
>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
Length = 491
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 188/418 (44%), Positives = 250/418 (59%), Gaps = 32/418 (7%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
++ L HSYGGY+ ++S GTPPQ P + DTGS L W PCTS Y+C +C+ + S
Sbjct: 77 VRASLYPHSYGGYAFTVSLGTPPQP-LPVLLDTGSHLSWVPCTSSYQCRNCSSLSAA-SP 134
Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKG--------CSPRNKTCPLACPSYL 203
+ F PK SSSS+LIGC+NP C WI P+ S C+ C+PRN CP YL
Sbjct: 135 LHVFHPKNSSSSRLIGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYL 194
Query: 204 LQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQ-PAGIAGFGRSSESLPSQLGLK 262
+ YG G TAGLL+S+TLR P + V NF+ GCS+ S Q P+G+AGFGR + S+PSQLGL
Sbjct: 195 VVYGSGSTAGLLISDTLRTPGRAVRNFVIGCSLASVHQPPSGLAGFGRGAPSVPSQLGLT 254
Query: 263 KFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
KFSYCLLSR+FDD A VS L+L G G+ Y P ++ + + +YY+
Sbjct: 255 KFSYCLLSRRFDDNAAVSGELILGG--AGGKDGGVGMQYAPLARS-ASARPPYSVYYYLA 311
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYS 380
L I VG K V++P V GG IVDSG+TF++ + +FE VA + + G YS
Sbjct: 312 LTAITVGGKSVQLPERAFV-AGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYS 370
Query: 381 RAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVG--------- 430
R+ VE+ GL PCF + G K++ LPE+ L FKGG+ M LP ENYF + G
Sbjct: 371 RSKVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPA 430
Query: 431 -NEVLCLILFTD----NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
E +CL + +D + GPAIILG FQ QN+Y+E+DL +R GF +Q+CA
Sbjct: 431 MAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQCA 488
>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
Length = 648
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 188/418 (44%), Positives = 250/418 (59%), Gaps = 32/418 (7%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
++ L HSYGGY+ ++S GTPPQ P + DTGS L W PCTS Y+C +C+ + S
Sbjct: 77 VRASLYPHSYGGYAFTVSLGTPPQP-LPVLLDTGSHLSWVPCTSSYQCRNCSSLSAA-SP 134
Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKG--------CSPRNKTCPLACPSYL 203
+ F PK SSSS+LIGC+NP C WI P+ S C+ C+PRN CP YL
Sbjct: 135 LHVFHPKNSSSSRLIGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYL 194
Query: 204 LQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQ-PAGIAGFGRSSESLPSQLGLK 262
+ YG G TAGLL+S+TLR P + V NF+ GCS+ S Q P+G+AGFGR + S+PSQLGL
Sbjct: 195 VVYGSGSTAGLLISDTLRTPGRAVRNFVIGCSLASVHQPPSGLAGFGRGAPSVPSQLGLT 254
Query: 263 KFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
KFSYCLLSR+FDD A VS L+L G G+ Y P ++ + + +YY+
Sbjct: 255 KFSYCLLSRRFDDNAAVSGELILGG--AGGKDGGVGMQYAPLARS-ASARPPYSVYYYLA 311
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYS 380
L I VG K V++P V GG IVDSG+TF++ + +FE VA + + G YS
Sbjct: 312 LTAITVGGKSVQLPERAFV-AGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYS 370
Query: 381 RAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVG--------- 430
R+ VE+ GL PCF + G K++ LPE+ L FKGG+ M LP ENYF + G
Sbjct: 371 RSKVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPA 430
Query: 431 -NEVLCLILFTD----NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
E +CL + +D + GPAIILG FQ QN+Y+E+DL +R GF +Q+CA
Sbjct: 431 MAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQCA 488
>gi|32488713|emb|CAE03456.1| OSJNBa0088H09.14 [Oryza sativa Japonica Group]
Length = 490
Score = 298 bits (763), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 184/418 (44%), Positives = 244/418 (58%), Gaps = 33/418 (7%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
++ L HSYGGY+ ++S GTPPQ P + +TGS L W P TS Y + P
Sbjct: 77 VRASLYPHSYGGYAFTVSLGTPPQP-LPVLLETGSHLSWVPSTSSYSANCSSLSAASPLH 135
Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKG--------CSPRNKTCPLACPSYL 203
+ F PK SSSS+LIGC+NP C WI P+ S C+ C+PRN CP YL
Sbjct: 136 V--FHPKNSSSSRLIGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYL 193
Query: 204 LQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQ-PAGIAGFGRSSESLPSQLGLK 262
+ YG G TAGLL+S+TLR P + V NF+ GCS+ S Q P+G+AGFGR + S+PSQLGL
Sbjct: 194 VVYGSGSTAGLLISDTLRTPGRAVRNFVIGCSLASVHQPPSGLAGFGRGAPSVPSQLGLT 253
Query: 263 KFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
KFSYCLLSR+FDD A VS L+L G G+ Y P ++ + + +YY+
Sbjct: 254 KFSYCLLSRRFDDNAAVSGELILGG--AGGKDGGVGMQYAPLARS-ASARPPYSVYYYLA 310
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYS 380
L I VG K V++P V GG IVDSG+TF++ + +FE VA + + G YS
Sbjct: 311 LTAITVGGKSVQLPERAFV-AGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYS 369
Query: 381 RAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVG--------- 430
R+ VE+ GL PCF + G K++ LPE+ L FKGG+ M LP ENYF + G
Sbjct: 370 RSKVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPA 429
Query: 431 -NEVLCLILFTD----NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
E +CL + +D + GPAIILG FQ QN+Y+E+DL +R GF +Q+CA
Sbjct: 430 MAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQCA 487
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 287 bits (735), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 158/377 (41%), Positives = 223/377 (59%), Gaps = 27/377 (7%)
Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
DTGS LVW PCT Y C++C P D + F+P+ SSS L+ C + C ++G N
Sbjct: 1 MDTGSDLVWVPCTRNYSCINC--PE-DSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNT 57
Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP------SKTVPNFLAGCS 235
E C+ C+ K C CP Y +QYG G TAGLLL+ETL P ++ + +F GCS
Sbjct: 58 ELLCQSCAGSLKNCSETCPPYGIQYGRGSTAGLLLTETLNLPLENGEGARAITHFAVGCS 117
Query: 236 ILSDRQPAGIAGFGRSSESLPSQLGLK----KFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
I+S +QP+GIAGFGR + S+PSQLG +F+YCL S +FD+ S +VL G
Sbjct: 118 IVSSQQPSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSLMVL------G 171
Query: 292 DSKTPG---LSYTPFYKNPVG-SSSAFGEFYYVGLRQIIVGSKHVK-IPYSYLVPGSDGN 346
D P L+YTPF N SS +G +YY+GLR + +G K +K +P L + GN
Sbjct: 172 DKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDTKGN 231
Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP 406
GG I+DSG+TFT +F+ +A F Q+G Y RA +VE K+G+ C+D++G +++ LP
Sbjct: 232 GGTIIDSGTTFTVFSDEIFKHIAAGFASQIG-YRRAGEVEDKTGMGLCYDVTGLENIVLP 290
Query: 407 ELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFY 465
E FKGG+ M LP NYF+ + + +CL + + + GPA+ILG+ Q Q+FY
Sbjct: 291 EFAFHFKGGSDMVLPVANYFSYFSSFDSICLTMISSRGL-LEVDSGPAVILGNDQQQDFY 349
Query: 466 LEFDLANDRFGFAKQKC 482
L +D +R GF +Q C
Sbjct: 350 LLYDREKNRLGFTQQTC 366
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 284 bits (727), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 184/421 (43%), Positives = 241/421 (57%), Gaps = 29/421 (6%)
Query: 85 SNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
S+ + + ++T L HSYGGY+ S+S GTPPQ P + DTGS L W PCTS Y+C +C+
Sbjct: 72 SSQAPAAVRTALYPHSYGGYAFSVSLGTPPQP-LPVLLDTGSHLSWVPCTSSYQCRNCSS 130
Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
S + F PK SSSS+L+GC+NP C WI S C S N CP YL+
Sbjct: 131 SPSAMSAMAVFHPKNSSSSRLVGCRNPACRWIHS-KSPSTCG--STGNNGNGDVCPPYLV 187
Query: 205 QYGLGFTAGLLLSETLRFPSKTVP-------NFLAGCSILSDRQ-PAGIAGFGRSSESLP 256
YG G T+GLL+S+TLR + NF GCSI+S Q P+G+AGFGR + S+P
Sbjct: 188 VYGSGSTSGLLISDTLRLSPSSSSSAPAPFRNFAIGCSIVSVHQPPSGLAGFGRGAPSVP 247
Query: 257 SQLGLKKFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFG 315
SQL + KFSYCLLSR+FDD + VS LVL K + Y P N S +
Sbjct: 248 SQLKVPKFSYCLLSRRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNN-AASKPPYS 306
Query: 316 EFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
+YY+ L I VG K V +P VP S GG I+DSG+TFT+++ +F+ VA
Sbjct: 307 VYYYLALTGISVGGKPVNLPSRAFVPSS--GGGAIIDSGTTFTYLDPTVFKPVAAAMESA 364
Query: 376 M-GNYSRAADVEKKSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYF------ 426
+ G Y+R+ VE GLRPCF + ++ LP+L LKFKGGA M LP ENYF
Sbjct: 365 VGGRYNRSRPVEDALGLRPCFALPPGPGGAMELPDLELKFKGGAVMRLPVENYFVAAGPA 424
Query: 427 --ALVGNEVLCLILFTD--NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G +CL + +D + G GPAIILG FQ QN+++E+DL +R GF +Q C
Sbjct: 425 GGPAAGPVAICLAVVSDLPASGGDGAAAGPAIILGSFQQQNYHIEYDLGKERLGFRQQPC 484
Query: 483 A 483
A
Sbjct: 485 A 485
>gi|326529727|dbj|BAK04810.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 488
Score = 284 bits (727), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 201/475 (42%), Positives = 267/475 (56%), Gaps = 52/475 (10%)
Query: 35 PLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKT 94
PL P + +H+ L LA +SL+RA L+ + + ++S ++
Sbjct: 36 PLPPAAAQHH----------PLSRLARASLARASRLRGHHQGQA---------ASSPVRA 76
Query: 95 PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA 154
L HSYGGY+ SLS GTPPQ P + DTGS L W PCTS Y+C +C+ P
Sbjct: 77 ALYPHSYGGYAFSLSLGTPPQ-PLPVLLDTGSHLTWVPCTSNYQCQNCS---AAAGSFPV 132
Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKG----CSPRNKTCPL----ACPSYLLQY 206
F PK SSSS L+ C +P C WI + S C C P C CP YL+ Y
Sbjct: 133 FHPKSSSSSLLVSCSSPSCLWIHSKSHLSDCARDSAPCRPSTANCSATATNVCPPYLVVY 192
Query: 207 GLGFTAGLLLSETLRFPSKTVP--NFLAGCSILSDRQP-AGIAGFGRSSESLPSQLGLKK 263
G G TAGLL+S+TLR + NF GCS+ S QP +G+AGFGR + S+P+QLG+ K
Sbjct: 193 GSGSTAGLLVSDTLRLSPRGAASRNFAVGCSLASVHQPPSGLAGFGRGAPSVPAQLGVNK 252
Query: 264 FSYCLLSRKF-DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
FSYCLLSR+F DDA +S LVL G S + Y P KN G+ + +YY+ L
Sbjct: 253 FSYCLLSRRFDDDAAISGELVL--GASSAGKAKAMMQYAPLLKN-AGARPPYSVYYYLSL 309
Query: 323 RQIIVGSKHVKIPYSYLVPGS-DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYS 380
I VG K V +P L P S G GG I+DSG+TFT+++ +F+ VA + + G Y+
Sbjct: 310 TGIAVGGKSVALPARALAPVSGGGGGGAIIDSGTTFTYLDPTVFKPVAAAMVAAVGGRYN 369
Query: 381 RAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVG------NEV 433
R+ DVE GLRPCF + +G +++ LPEL L F GGA+M LP ENYF G E
Sbjct: 370 RSKDVEGALGLRPCFALPAGARTMDLPELSLHFSGGAEMRLPIENYFLAAGPASGVAPEA 429
Query: 434 LCLILFTD-----NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+CL + +D AG + G GPAIILG FQ QN+ +E+DL +R GF +Q C+
Sbjct: 430 ICLAVVSDVSSASGGAGVSGGGGPAIILGSFQQQNYQVEYDLEKNRLGFRQQPCS 484
>gi|296084856|emb|CBI28265.3| unnamed protein product [Vitis vinifera]
Length = 446
Score = 247 bits (631), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 130/235 (55%), Positives = 155/235 (65%), Gaps = 15/235 (6%)
Query: 51 DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSF 110
DP + L L S+SL RARHLK T + + HSYG YSI LSF
Sbjct: 50 DPYRNLRHLVSASLIRARHLKNPKTTPTSTTPL-------------FTHSYGAYSIPLSF 96
Query: 111 GTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
GTPPQ + P I DTGS LVWFPCT RY C +C+F +PS FIPK SSSS+++GC N
Sbjct: 97 GTPPQ-TLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSS-NIFIPKSSSSSKVLGCVN 154
Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNF 230
PKC WI G V+SRC+ C P + C CP YL+ YG G T G++LSETL P K VPNF
Sbjct: 155 PKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITGGIMLSETLDLPGKGVPNF 214
Query: 231 LAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLD 285
+ GCS+LS QPAGI+GFGR SLPSQLGLKKFSYCLLSR++DD SS+L+ +
Sbjct: 215 IVGCSVLSTSQPAGISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTESSSLIFE 269
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 66/120 (55%), Positives = 86/120 (71%), Gaps = 2/120 (1%)
Query: 364 LFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPE 423
+FE VA EF +Q+ + RA +VE +GLRPCF+ISG + PEL LKF+GGA+M LP
Sbjct: 267 IFELVAAEFEKQVQS-KRATEVEGITGLRPCFNISGLNTPSFPELTLKFRGGAEMELPLA 325
Query: 424 NYFALVG-NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
NY A +G ++V+CL + TD AAG GPAIILG+FQ QNFY+E+DL N+R GF +Q C
Sbjct: 326 NYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 385
>gi|115461432|ref|NP_001054316.1| Os04g0685200 [Oryza sativa Japonica Group]
gi|113565887|dbj|BAF16230.1| Os04g0685200, partial [Oryza sativa Japonica Group]
Length = 330
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 144/321 (44%), Positives = 191/321 (59%), Gaps = 24/321 (7%)
Query: 183 SRCKG--CSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDR 240
S C G C+PRN CP YL+ YG G TAGLL+S+TLR P + V NF+ GCS+ S
Sbjct: 11 SSCPGANCTPRNANANNVCPPYLVVYGSGSTAGLLISDTLRTPGRAVRNFVIGCSLASVH 70
Query: 241 Q-PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD-APVSSNLVLDTGPGSGDSKTPGL 298
Q P+G+AGFGR + S+PSQLGL KFSYCLLSR+FDD A VS L+L G G+
Sbjct: 71 QPPSGLAGFGRGAPSVPSQLGLTKFSYCLLSRRFDDNAAVSGELILGG--AGGKDGGVGM 128
Query: 299 SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFT 358
Y P ++ + + +YY+ L I VG K V++P V GG IVDSG+TF+
Sbjct: 129 QYAPLARS-ASARPPYSVYYYLALTAITVGGKSVQLPERAFV-AGGAGGGAIVDSGTTFS 186
Query: 359 FMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGA 416
+ + +FE VA + + G YSR+ VE+ GL PCF + G K++ LPE+ L FKGG+
Sbjct: 187 YFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGS 246
Query: 417 KMALPPENYFALVG----------NEVLCLILFTD----NAAGPALGRGPAIILGDFQLQ 462
M LP ENYF + G E +CL + +D + GPAIILG FQ Q
Sbjct: 247 VMNLPVENYFVVAGPAPSGGAPAMAEAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQ 306
Query: 463 NFYLEFDLANDRFGFAKQKCA 483
N+Y+E+DL +R GF +Q+CA
Sbjct: 307 NYYIEYDLEKERLGFRRQQCA 327
>gi|297740191|emb|CBI30373.3| unnamed protein product [Vitis vinifera]
Length = 218
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 101/226 (44%), Positives = 147/226 (65%), Gaps = 10/226 (4%)
Query: 259 LGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
+G+KKF+YCL S +DD S L+LD D KT GLSYTPF K+P +SAF +Y
Sbjct: 1 MGVKKFAYCLNSHDYDDTRNSGKLILDYR----DGKTKGLSYTPFLKSP--PASAF--YY 52
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG-STFTFMEGPLFEAVAKEFIRQMG 377
++G++ I +G+K ++IP YL PGSDG GVI+DSG +M GP+F+ V E +QM
Sbjct: 53 HLGVKDIKIGNKLLRIPSKYLAPGSDGRSGVIIDSGYGGAGYMTGPVFKIVTNELKKQMS 112
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CL 436
Y R+ + E ++GL PC++ +G KS+ +P LI +F+GGA M +P +NYF + E L C
Sbjct: 113 KYRRSLEAETQTGLTPCYNFTGHKSIKIPPLIYQFRGGANMVVPGKNYFGISPQESLACF 172
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++ T+ + P+IILG+ Q ++Y+E+DL NDRFGF +Q C
Sbjct: 173 LMDTNGTNALEITPDPSIILGNSQHVDYYVEYDLKNDRFGFRRQTC 218
>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 485
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 148/460 (32%), Positives = 213/460 (46%), Gaps = 39/460 (8%)
Query: 53 LKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGT 112
L + HSL+ S + HL T + S ++ + I PLS S Y++S + G+
Sbjct: 27 LPLTHSLSKSQFNSTPHLLKFTSAR---SATRFHHRHRQISLPLSPGS--DYTLSFNLGS 81
Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
P DTGS LVWFPC + C+ C D + P +SS + C++P
Sbjct: 82 HPPQPISLYMDTGSDLVWFPCAP-FECILCE-GKYDTAATGGLSPPNITSSASVSCKSPA 139
Query: 173 CSWIFGPNVES------RCKGCSPRNKTCP-LACPSYLLQYGLGFTAGLLLSETLRFPSK 225
CS S RC C +CP + YG G L ++L P+
Sbjct: 140 CSAAHTSLSSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSLVARLYRDSLSMPAS 199
Query: 226 T---VPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLSRKFDDA 276
+ + NF GC+ + +P G+AGFGR SLP+QL +FSYCL+S FD
Sbjct: 200 SPLVLHNFTFGCAHTALGEPVGVAGFGRGVLSLPAQLASFSPHLGNQFSYCLVSHSFDAD 259
Query: 277 PVSSNLVLDTGPGSGDS---KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
V L G S D K G F + + FY VGL I VG++ +
Sbjct: 260 RVRRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAMLDNPKHPYFYCVGLEGITVGNRKIP 319
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAADVEKKSGLR 392
+P GNGG++VDSG+TFT + L+E++ EF +MG Y RA +E+++GL
Sbjct: 320 VPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRMGRVYKRATQIEERTGLG 379
Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYF---------ALVGNEVLCLILFTDNA 443
PC+ S + +P + L F G + + LP NY+ +V CL+L N
Sbjct: 380 PCY-YSDDSAAKVPAVALHFVGNSTVILPRNNYYYEFFDGRDGQKKKRKVGCLMLM--NG 436
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A GPA LG++Q Q F + +DL R GFA++KCA
Sbjct: 437 GDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKCA 476
>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
max]
Length = 455
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 147/458 (32%), Positives = 217/458 (47%), Gaps = 45/458 (9%)
Query: 53 LKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGT 112
L + H+L+ + + HL T S + + + PLS S Y++S + G
Sbjct: 5 LPLTHTLSQTQFNNTHHLLKST------STLSAKRFRRQLSLPLSPGS--DYTLSFNLGP 56
Query: 113 PPQASTPFIF-DTGSSLVWFPCTSRYRCVDCN-FPNVDPSRIPAFIPKRSSSSQLIGCQN 170
QA ++ DTGS LVWFPC ++C+ C PN P P ++ S + C++
Sbjct: 57 RAQAQPITLYMDTGSDLVWFPCAP-FKCILCEGKPNASP-------PVNTTRSVAVSCKS 108
Query: 171 PKCSW---IFGPN---VESRCKGCSPRNKTCP-LACPSYLLQYGLGFTAGLLLSETLRFP 223
P CS + P+ +RC S C CP + YG G L +TL
Sbjct: 109 PACSAAHNLASPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSLIARLYRDTLSLS 168
Query: 224 SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLSRKFDDAP 277
S + NF GC+ + +P G+AGFGR SLP+QL +FSYCL+S FD
Sbjct: 169 SLFLRNFTFGCAYTTLAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSFDSER 228
Query: 278 VS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
V S L+L + + G F P+ + FY VGL I VG + V P
Sbjct: 229 VRKPSPLILGRYEEEEEEEKVGGGVAEFVYTPMLENPKHPYFYTVGLIGISVGKRIVPAP 288
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS-RAADVEKKSGLRPC 394
+ G+GGV+VDSG+TFT + + +V EF R +G + RA +E+K+GL PC
Sbjct: 289 EMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRGVGRVNERARKIEEKTGLAPC 348
Query: 395 FDISGKKSVYLPELILKFKGG-AKMALPPENYF--------ALVGNEVLCLILFTDNAAG 445
+ ++ V P L L+F GG + + LP +NYF A G + ++ +
Sbjct: 349 YYLNSVAEV--PVLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRRVGCLMLMNGGDE 406
Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
L GP LG++Q Q F +E+DL R GFA+++CA
Sbjct: 407 AELSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQCA 444
>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 417
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 143/407 (35%), Positives = 198/407 (48%), Gaps = 46/407 (11%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y++S + G+ P S DTGS LVWFPC + C+ C + A P + S
Sbjct: 19 YTLSFNLGSHPSQSITLYMDTGSDLVWFPCAP-FECILCE------GKFNATKPLNITRS 71
Query: 164 QLIGCQNPKCSWIFGPNVE------SRCKGCSPRNKTCPLA-CPSYLLQYGLGFTAGLLL 216
+ CQ+P CS +RC + C A CP + YG G L
Sbjct: 72 HRVSCQSPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGDGSFIAHLH 131
Query: 217 SETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLS 270
+TL + NF GC+ + +P G+AGFGR SLP+QL +FSYCL+S
Sbjct: 132 RDTLSMSQLFLKNFTFGCAHTALAEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCLVS 191
Query: 271 RKFDDAPVS--SNLVLDTGPGSGD---SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
FD V S L+L G D S+ YT +NP S FY VGL I
Sbjct: 192 HSFDKERVRKPSPLIL----GHYDDYSSERVEFVYTSMLRNPKHSY-----FYCVGLTGI 242
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAAD 384
VG + + P G+GGV+VDSG+TFT + L+ +V EF R++G + RA++
Sbjct: 243 SVGKRTILAPEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASE 302
Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKG-GAKMALPPENYFA--LVGNE-----VLCL 436
VE+K+GL PC+ + G V +P + F G + + LP NYF L G + V CL
Sbjct: 303 VEEKTGLGPCYFLEGL--VEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVGCL 360
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+L + L GP ILG++Q Q F + +DL N R GFAK++CA
Sbjct: 361 MLM-NGGDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQCA 406
>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 480
Score = 191 bits (485), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 153/467 (32%), Positives = 213/467 (45%), Gaps = 64/467 (13%)
Query: 55 ILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPP 114
+ H+L+ + + HL T S S + LS+ G +LSF P
Sbjct: 29 LTHTLSKAQFNSTHHLLKST----------STRSAKRFRRQLSLPLSPGSDYTLSFNLGP 78
Query: 115 QASTPFI---FDTGSSLVWFPCTSRYRCVDC----NFPNVDPSRIPAFIPKRSSSSQLIG 167
QA I DTGS LVWFPC ++C+ C N PN P P + S +
Sbjct: 79 QAQAQPITLYMDTGSDLVWFPCAP-FKCILCEGKPNEPNASP-------PTNITQSVAVS 130
Query: 168 CQNPKCSWIFG---PN---VESRCKGCSPRNKTCP-LACPSYLLQYGLGFTAGLLLSETL 220
C++P CS P+ +RC S C CP + YG G L +TL
Sbjct: 131 CKSPACSAAHNLAPPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSLIARLYRDTL 190
Query: 221 RFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLSRKFD 274
S + NF GC+ + +P G+AGFGR SLP+QL +FSYCL+S FD
Sbjct: 191 SLSSLFLRNFTFGCAHTTLAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSFD 250
Query: 275 DAPVS--SNLVLDTGPGSGDSKTPG----LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
V S L+L K G YT +NP FY V L I VG
Sbjct: 251 SERVRKPSPLILGRYEEKEKEKIGGGVAEFVYTSMLENP-----KHPYFYTVSLIGIAVG 305
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-NYSRAADVEK 387
+ + P + G+GGV+VDSG+TFT + + +V EF R++G + RA +E+
Sbjct: 306 KRTIPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRRVGRDNKRARKIEE 365
Query: 388 KSGLRPCFDISGKKSVYLPELILKFKGG--AKMALPPENYF---------ALVGNEVLCL 436
K+GL PC+ ++ V P L L+F GG + + LP +NYF A +V CL
Sbjct: 366 KTGLAPCYYLNSVADV--PALTLRFAGGKNSSVVLPRKNYFYEFSDGSDGAKGKRKVGCL 423
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+L G GP LG++Q Q F +E+DL R GFA+++CA
Sbjct: 424 MLMNGGDEADLSG-GPGATLGNYQQQGFEVEYDLEEKRVGFARRQCA 469
>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 481
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 152/463 (32%), Positives = 227/463 (49%), Gaps = 49/463 (10%)
Query: 53 LKILHSLASSSLSRARHLKTKTKPKTKDS-NIGSNYSNSLIKTPLSVHSYGGYSISLSFG 111
L + HS++ + + HL T ++K + + + + PL+ S Y++S + G
Sbjct: 25 LPLTHSISKTKFNSTHHLLKSTSTRSKARFHHQHHKHQTQVSLPLAPGS--DYTLSFNLG 82
Query: 112 T-PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
+ PPQ T ++ DTGS LVWFPC S + C+ C + PA I K++ S + CQ+
Sbjct: 83 SNPPQLITLYM-DTGSDLVWFPC-SPFECILCE--GKPQTTKPANITKQTHS---VSCQS 135
Query: 171 PKCSWIFGPNVE------SRCKGCSPRNKTCP-LACPSYLLQYGLGFTAGLLLSETLRFP 223
P CS SRC C +CP + YG G L +TL
Sbjct: 136 PACSAAHASMSSSNLCAISRCPLDYIETSDCSSFSCPPFYYAYGDGSFVANLYQQTLSLS 195
Query: 224 SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLSRKFDDAP 277
S + NF GC+ + +P G+AGFGR SLP+QL +FSYCL+S FD
Sbjct: 196 SLHLQNFTFGCAHTALAEPTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCLVSHSFDGDR 255
Query: 278 VS--SNLVL----DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
+ S L+L DT G+GD ++ YT NP +Y VGL I VG +
Sbjct: 256 LRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSNP-----KHPYYYCVGLAGISVGKRT 310
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY-SRAADVEKKSG 390
V P GNGG++VDSG+TFT + + AV EF +++ + RA+++E K+G
Sbjct: 311 VPAPEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDKRVNRFHKRASEIETKTG 370
Query: 391 LRPCFDISGKKSVYLPELILKFKG-GAKMALPPENYFALVGN---------EVLCLILFT 440
L PC+ ++G + P L L F G + + LP +NYF + +V C++L
Sbjct: 371 LGPCYYLNGLSQI--PVLKLHFVGNNSDVVLPRKNYFYEFMDGGDGIRRKGKVGCMMLM- 427
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ L GP LG++Q Q F + +DL +R GFAK++CA
Sbjct: 428 NGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKECA 470
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 141/412 (34%), Positives = 198/412 (48%), Gaps = 45/412 (10%)
Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCN---FPNVDPSRIPAFIPK 158
GY I+L+ GTPPQA ++ DTGS L W PC + + C+DCN N+ S I F P
Sbjct: 10 GYLITLNIGTPPQAVQVYM-DTGSDLTWVPCGNLSFDCIDCNDLKSNNLKSSSI--FSPL 66
Query: 159 RSSSSQLIGCQNPKCSWI------FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FT 211
SSSS C + C+ I F P + C TC CPS+ YG G
Sbjct: 67 HSSSSFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLV 126
Query: 212 AGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL--KKFSYCLL 269
+G+L + L+ ++ VP F GC + +P GIAGFGR SLPSQLG K FS+C L
Sbjct: 127 SGILTRDILKARTRDVPRFSFGCVTSTYHEPIGIAGFGRGLLSLPSQLGFLEKGFSHCFL 186
Query: 270 SRKFDDAP-VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
KF + P +SS L+L S + T L +TP PV +S YY+GL I +G
Sbjct: 187 PFKFVNNPNISSPLILGASALSIN-LTDSLQFTPMLNTPVYPNS-----YYIGLESITIG 240
Query: 329 SKH--VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
+ ++P + S GNGG++VDSG+T+T + P + + ++ Y RA + E
Sbjct: 241 TNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLT-ILQSTITYPRATETE 299
Query: 387 KKSGLRPCFDI----------SGKKSVYLPELILKFKGGAKMALPPENYFALV-----GN 431
++G C+ + + P + F A + LP N F + G+
Sbjct: 300 SRTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLLPQGNSFYAMSAPSDGS 359
Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
V CL LF + G GPA + G FQ QN + +DL +R GF C
Sbjct: 360 VVQCL-LFQNMEDG---NYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDCV 407
>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
Length = 434
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 143/412 (34%), Positives = 205/412 (49%), Gaps = 48/412 (11%)
Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPSRIPAFIPKRSS 161
GY ISL+ GTPP+ ++ DTGS L W PC + + C+DCN + +++ + S
Sbjct: 28 GYLISLNLGTPPKVIQVYM-DTGSDLTWVPCGNLSFDCMDCN--DYRNNKLMSTYSPSYS 84
Query: 162 SSQLIG-CQNPKCSWI------FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAG 213
SS L C +P CS + + P + C + TCP CPS+ YG G G
Sbjct: 85 SSSLRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIG 144
Query: 214 LLLSETLRFP------SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL--KKFS 265
L +TL ++ VPNF GC + R+P GIAGFGR SLPSQLG K FS
Sbjct: 145 TLTRDTLTTHGSSPSFTREVPNFCFGCVGSTYREPIGIAGFGRGVLSLPSQLGFLQKGFS 204
Query: 266 YCLLSRKFDDAP-VSSNLVL-DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
+C L KF + P +SS LV+ D S D L +T KNP+ + +YY+GL
Sbjct: 205 HCFLGFKFANNPNISSPLVIGDLAISSNDH----LQFTSLLKNPM-----YPNYYYIGLE 255
Query: 324 QIIVG-SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
I VG + +++P S S GNGG+I+DSG+T+T + GP + + ++ + Y RA
Sbjct: 256 AITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLL-SMLQSIITYPRA 314
Query: 383 ADVEKKSGLRPCFDISGKKSVY------LPELILKFKGGAKMALPPENYFALVG-----N 431
+ E ++G C+ I +V LP + F + LP N+F +G
Sbjct: 315 QEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNST 374
Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
V CL+L + + GPA + G FQ QN + +DL +R GF CA
Sbjct: 375 VVKCLLLQNMDDS----DSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDCA 422
>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 417
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 143/412 (34%), Positives = 205/412 (49%), Gaps = 48/412 (11%)
Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPSRIPAFIPKRSS 161
GY ISL+ GTPP+ ++ DTGS L W PC + + C+DCN + +++ + S
Sbjct: 11 GYLISLNLGTPPKVIQVYM-DTGSDLTWVPCGNLSFDCMDCN--DYRNNKLMSTYSPSYS 67
Query: 162 SSQLIG-CQNPKCSWI------FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAG 213
SS L C +P CS + + P + C + TCP CPS+ YG G G
Sbjct: 68 SSSLRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIG 127
Query: 214 LLLSETLRFP------SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL--KKFS 265
L +TL ++ VPNF GC + R+P GIAGFGR SLPSQLG K FS
Sbjct: 128 TLTRDTLTTHGSSPSFTREVPNFCFGCVGSTYREPIGIAGFGRGVLSLPSQLGFLQKGFS 187
Query: 266 YCLLSRKFDDAP-VSSNLVL-DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
+C L KF + P +SS LV+ D S D L +T KNP+ + +YY+GL
Sbjct: 188 HCFLGFKFANNPNISSPLVIGDLAISSNDH----LQFTSLLKNPM-----YPNYYYIGLE 238
Query: 324 QIIVG-SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
I VG + +++P S S GNGG+I+DSG+T+T + GP + + ++ + Y RA
Sbjct: 239 AITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLS-MLQSIITYPRA 297
Query: 383 ADVEKKSGLRPCFDISGKKSVY------LPELILKFKGGAKMALPPENYFALVG-----N 431
+ E ++G C+ I +V LP + F + LP N+F +G
Sbjct: 298 QEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNST 357
Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
V CL+L + + GPA + G FQ QN + +DL +R GF CA
Sbjct: 358 VVKCLLLQNMDDS----DSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDCA 405
>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
Length = 493
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 151/475 (31%), Positives = 221/475 (46%), Gaps = 62/475 (13%)
Query: 53 LKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSL-----IKTPLSVHSYGGYSIS 107
L + HSL+++ + HL T ++ + L + PLS S Y++S
Sbjct: 28 LPLTHSLSNTQFTSTHHLLKSTSSRSASRFQHQHQKRHLRNRHQVSLPLSPGS--DYTLS 85
Query: 108 LSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCN--FPNVDPSRIPAFIPKRSSSSQL 165
+ + P DTGS LVWFPC + C+ C N S P P+ SS+++
Sbjct: 86 FTLNSNPPQHVSLYLDTGSDLVWFPCKP-FECILCEGKAENTTASTPP---PRLSSTARS 141
Query: 166 IGCQNPKCSWIFG--PNVE----SRCKGCSPRNKTC-PLACPSYLLQYGLGFTAGLLLSE 218
+ C++ CS P + + C S C +CPS+ YG G L +
Sbjct: 142 VHCKSSACSAAHSNLPTSDLCAIADCPLESIETSDCHSFSCPSFYYAYGDGSLVARLYHD 201
Query: 219 TLRFP----SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCL 268
+++ P S ++ NF GC+ + +P G+AGFGR SLP+QL +FSYCL
Sbjct: 202 SIKLPLATPSLSLHNFTFGCAHTALAEPVGVAGFGRGVLSLPAQLASFAPQLGNRFSYCL 261
Query: 269 LSRKF--DDAPVSSNLVLDTGPGSGDSKTPGLS-------YTPFYKNPVGSSSAFGEFYY 319
+S F D + S L+L G D K ++ YT NP FY
Sbjct: 262 VSHSFNSDRLRLPSPLIL----GHSDDKEKRVNKDDVQFVYTSMLDNP-----KHPYFYC 312
Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN- 378
VGL I +G K + P +G+GGV+VDSG+TFT + L+ +V EF ++G
Sbjct: 313 VGLEGISIGKKKIPAPEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRV 372
Query: 379 YSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGG-AKMALPPENYFA--LVGNE--- 432
Y RA +VE K+GL PC+ V +P L+L F G + + LP +NYF L G +
Sbjct: 373 YERAKEVEDKTGLGPCYYY--DTVVNIPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVR 430
Query: 433 ----VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
V CL+L G GP LG++Q F + +DL R GFA++KCA
Sbjct: 431 RKRRVGCLMLMNGGEEAELTG-GPGATLGNYQQHGFEVVYDLEQRRVGFARRKCA 484
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 184 bits (467), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 137/410 (33%), Positives = 196/410 (47%), Gaps = 41/410 (10%)
Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDC-NFPNVDPSRIPAFIPKRS 160
GY I+L+ GTPPQA ++ DTGS L W PC + + C++C + N D F P S
Sbjct: 82 GYLITLNIGTPPQAVQVYL-DTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHS 140
Query: 161 SSSQLIGCQNPKCSWI------FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAG 213
S+S C + C I F P + C TC CPS+ YG G +G
Sbjct: 141 STSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISG 200
Query: 214 LLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL--KKFSYCLLSR 271
+L + L+ ++ VP F GC + R+P GIAGFGR SLPSQLG K FS+C L
Sbjct: 201 ILTRDILKARTRDVPRFSFGCVTSTYREPIGIAGFGRGLLSLPSQLGFLEKGFSHCFLPF 260
Query: 272 KFDDAP-VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
KF + P +SS L+L S + T L +TP P+ +S YY+GL I +G+
Sbjct: 261 KFVNNPNISSPLILGASALSIN-LTDSLQFTPMLNTPMYPNS-----YYIGLESITIGTN 314
Query: 331 --HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
++P + S GNGG++VDSG+T+T + P + + ++ Y RA + E +
Sbjct: 315 ITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTT-LQSTITYPRATETESR 373
Query: 389 SGLRPCFDI----------SGKKSVYLPELILKFKGGAKMALPPENYFALV-----GNEV 433
+G C+ + + P + F A + LP N F + G+ V
Sbjct: 374 TGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSDGSVV 433
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
CL LF + G GPA + G FQ QN + +DL +R GF C
Sbjct: 434 QCL-LFQNMEDG---DYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDCV 479
>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 482
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 150/465 (32%), Positives = 218/465 (46%), Gaps = 52/465 (11%)
Query: 53 LKILHSLASSSLSRARHL--KTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSF 110
L + HSL+ + HL T T ++ ++ N L PLS S Y++S +
Sbjct: 25 LPLTHSLSMIEFNTTHHLLKSTSTHSLSRFHRHKHHHHNQL-SLPLSPGS--DYTLSFNL 81
Query: 111 GTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP---NVDPSRIPAFIPKRSSSSQLIG 167
G Q T ++ DTGS LVWFPCT + C+ C DPS P S S I
Sbjct: 82 GPHSQPITLYM-DTGSDLVWFPCTP-FNCILCELKPKLTSDPSP-----PTNISHSTPIS 134
Query: 168 CQNPKCSWIFGPNVES------RCKGCSPRNKTC-PLACPSYLLQYGLGFTAGLLLSETL 220
C + CS S C S K C CP + YG G L +TL
Sbjct: 135 CNSHACSVAHSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAYGDGSLIASLYRDTL 194
Query: 221 RFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLSRKFD 274
+ + NF GC+ + +P G+AGFGR SLP+QL +FSYCL+S F
Sbjct: 195 SLSTLQLTNFTFGCAHTTFSEPTGVAGFGRGLLSLPAQLATHSPQLGNRFSYCLVSHSFR 254
Query: 275 DAPVS--SNLVL----DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
+ S L+L D +GD + YT +NP S FY VGL+ I VG
Sbjct: 255 SERIRKPSPLILGRYNDEKQSNGD-EVVEFVYTSMLENPKHS-----YFYTVGLKGISVG 308
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA-DVEK 387
K V P G+GGV+VDSG+TFT + + +V + F R+ +R A ++E+
Sbjct: 309 KKTVPAPKILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKSNRRAPEIEQ 368
Query: 388 KSGLRPCFDISGKKSVYLPELILKFKG-GAKMALPPENYF--------ALVGNEVLCLIL 438
K+GL PC+ ++ + +P + L+F G + + LP +NYF + E + ++
Sbjct: 369 KTGLSPCYYLN--TAAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGDGVRRKERVGCLM 426
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
F + + GP +LG++Q Q F +E+DL R GFA++KCA
Sbjct: 427 FMNGGDEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKCA 471
>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
Length = 499
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 153/485 (31%), Positives = 218/485 (44%), Gaps = 79/485 (16%)
Query: 53 LKILHSLASSSLSRARHL--KTKTKPKTKDSNIGSNYSNSL------------IKTPLSV 98
L + HSL+ + + HL T K + + + +SN L I PLS
Sbjct: 31 LPLSHSLSKTKFTSTHHLLKSTTIKSTARHHHHRTRHSNKLKNHHRHHQHQQQISLPLS- 89
Query: 99 HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
G +L+F Q + ++ DTGS +VWFPC S + C+ C +P + P
Sbjct: 90 ---PGTDYTLTFSINSQTLSVYM-DTGSDIVWFPC-SPFECILCE-GKFEPGTL---TPL 140
Query: 159 RSSSSQLIGCQNPKCSWIFG-PNVESRCKGCSPRNKTCPL-----------ACPSYLLQY 206
S S LI C++ CS P+ C CPL CPS+ Y
Sbjct: 141 NVSKSSLISCKSRACSTAHNSPSTSDLCAIAK-----CPLDEIETSDCSNYHCPSFYYAY 195
Query: 207 GLGFTAGLLLSETLRFPSKT-----VPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL 261
G G L L PS + + +F GC+ + +P G+AGFG S SLP+QL
Sbjct: 196 GDGSLIAKLHKHNLIMPSTSNKPFSLKDFTFGCAHSALGEPIGVAGFGFGSLSLPAQLAN 255
Query: 262 ------KKFSYCLLSRKFDDAPVS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSA 313
+FSYCL+S FD + S L+L + YTP NP
Sbjct: 256 LSPDLGNQFSYCLVSHSFDSTKLHHPSPLILGKVKERDFDEITQFVYTPMLDNP-----K 310
Query: 314 FGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
FY V + I VGS V+ P + + DGNGGV+VDSG+T+T + + +VA E
Sbjct: 311 HPYFYSVSMEAISVGSSRVRAPNALIRIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELD 370
Query: 374 RQMGN-YSRAADVEKKSGLRPCFDISG----KKSVYLPELILKFKGGAKMALPPENYF-- 426
R++G + RA++ E K+GL PC+ + G + + +P L F G + LP NYF
Sbjct: 371 RRVGRVFKRASETESKTGLSPCYYLEGNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYE 430
Query: 427 ------ALVGNEVLCLILFT--DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
G +V CL+L D + G GP LG++Q Q F + +DL R GFA
Sbjct: 431 FLDGEDEKKGRKVGCLMLMDGGDESEG-----GPGATLGNYQQQGFQVVYDLEERRVGFA 485
Query: 479 KQKCA 483
+KCA
Sbjct: 486 PRKCA 490
>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
Length = 439
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 144/433 (33%), Positives = 213/433 (49%), Gaps = 51/433 (11%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPC--TSRYRCVDCNFPNVDP 149
I P++ ++ GY +SL+ GTPPQ ++ DTGS L W PC +S Y+C+DC +V P
Sbjct: 14 IIEPVTAYT-DGYLLSLNLGTPPQVFQVYL-DTGSDLTWVPCGSSSSYQCLDCG-SSVKP 70
Query: 150 SRIPAFIPKRSSSSQLIGCQNPKCSWI------FGPNVESRCKGCSPRNKTCPLACPSYL 203
+ P F+P S+S+ C + C + F P + C + CP CP +
Sbjct: 71 T--PTFLPSESTSNTRDLCGSRFCVDVHSSDNRFDPCAAAGCAIPAFTGGQCPRPCPPFS 128
Query: 204 LQYGLG-FTAGLLLSETLRFPSKT-------------VPNFLAGCSILSDRQPAGIAGFG 249
YG G G L +++ T P F GC S R+P GIAGFG
Sbjct: 129 YTYGGGALVLGSLSRDSVTLHGSTHGSGAGAGPLPVAFPGFGFGCVGSSIREPLGIAGFG 188
Query: 250 RSSESLPSQLGL--KKFSYCLLSRKFDDAP-VSSNLVLDTGPGSGDSKTPGLSYTPFYKN 306
R + SLPSQLG K FS+C L +F P +S LV+ S S G +TP
Sbjct: 189 RGALSLPSQLGFLGKGFSHCFLGFRFARNPNFTSPLVMGDLALSSASTDGGFVFTPML-- 246
Query: 307 PVGSSSAFGEFYYVGLRQIIVGSKH----VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEG 362
+S+ + FYYVGL +++G + P S + GNGGV+VD+G+T+T +
Sbjct: 247 ---TSATYPNFYYVGLEGVVLGDDDGGSAMAAPPSLSGIDAQGNGGVLVDTGTTYTQLPD 303
Query: 363 PLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS----VYLPELILKFKGGAKM 418
P + +V I Y R+ D+E ++G CF + ++ LP + L GGA++
Sbjct: 304 PFYASVLASLISAAPPYERSRDLEARTGFDLCFKVPCARAPCADDELPPITLHLAGGARL 363
Query: 419 ALPP-ENYF---ALVGNEVLCLILF----TDNAAGPALGRGPAIILGDFQLQNFYLEFDL 470
ALP +Y+ A+ + V+ +LF ++ G GPA +LG FQ+QN + +DL
Sbjct: 364 ALPKLSSYYPVTAIRDSVVVKCLLFQRMEMEDDGDGTSGGGPAAVLGSFQMQNVEVVYDL 423
Query: 471 ANDRFGFAKQKCA 483
A R GF + CA
Sbjct: 424 AAGRVGFRPRDCA 436
>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 138/410 (33%), Positives = 203/410 (49%), Gaps = 45/410 (10%)
Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPSRIPAFIPKRSS 161
GY ISLS GTPPQ ++ DTGS L W PC + + C++C+ N +R+ A S
Sbjct: 79 GYLISLSIGTPPQVIQVYM-DTGSDLTWAPCGNISFDCIECD--NYRNNRMMASFSPSHS 135
Query: 162 SSQLIG-CQNPKCSWI------FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAG 213
SS C +P C + P + C + TC CP + YG G G
Sbjct: 136 SSSHRDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVVTG 195
Query: 214 LLLSETLRFP------SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL--KKFS 265
L +TLR ++ +P F GC S R+P GIAGFGR + SLPSQLG K FS
Sbjct: 196 TLTRDTLRVHGRNLGVTQEIPRFCFGCVASSYREPIGIAGFGRGALSLPSQLGFLRKGFS 255
Query: 266 YCLLSRKFDDAP-VSSNLVL-DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
+C L+ K+ + P +SS L++ D S D + +TP K+P+ + +YYVGL
Sbjct: 256 HCFLAFKYANNPNISSPLIIGDIALTSKDD----MQFTPMLKSPM-----YPNYYYVGLE 306
Query: 324 QIIVGS-KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
I VG+ ++P S S GNGG++VDSG+T+T + P + V ++ + NY RA
Sbjct: 307 AITVGNVSATEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVL-SVLQSIINYPRA 365
Query: 383 ADVEKKSGLRPCFDISGKKSV-----YLPELILKFKGGAKMALPPENYF----ALVGNEV 433
D+E ++G C+ + + + LP + F A + L ++F A + V
Sbjct: 366 TDMEMRTGFDLCYKVPCQNNSILTGDLLPSITFHFLNNASLVLSRGSHFYAMSAPSNSTV 425
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ +LF G GPA +LG FQ Q+ + +D+ +R GF CA
Sbjct: 426 VKCLLFQSMDDG---DYGPAGVLGSFQQQDVEVVYDMEKERIGFRPMDCA 472
>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 449
Score = 176 bits (445), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 141/422 (33%), Positives = 198/422 (46%), Gaps = 50/422 (11%)
Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNF--PNVDPSRIPAFIPKR 159
GY +SLS GTPPQ ++ DTGS L W PC + + C DC N+ R+ AF+P
Sbjct: 20 GYLMSLSIGTPPQVVQVYM-DTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTH 78
Query: 160 SSSSQLIGCQNPKCSWI------FGPNVESRCKGCSPRNKTCPLACPSYLLQYGL-GFTA 212
SS+S C + C I F P + C S TCP CPS+ YG G
Sbjct: 79 SSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVT 138
Query: 213 GLLLSETL---------RFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL-- 261
G L + L +K +P F GC + R+P GIAGFGR SLP QLG
Sbjct: 139 GSLTRDVLFTHGNYNNNNNNNKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPFQLGFSH 198
Query: 262 KKFSYCLLSRKFDDAP-VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
K FS+C L KF + P SS L+L G + SK L +TP K+P+ + +YY+
Sbjct: 199 KGFSHCFLPFKFSNNPNFSSPLIL--GNLAISSKDENLQFTPLLKSPM-----YPNYYYI 251
Query: 321 GLRQIIVGS--KHVKIPYSYLVPGSD--GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
GL I +G+ + + S+ + D GNGG+++DSG+T+T + PL+ + +
Sbjct: 252 GLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLELVI 311
Query: 377 GNYSRAADVEKKSGLRPCFDISGKKS-------VYLPELILKFKGGAKMALPP-ENYFAL 428
G Y RA VE +G C+ + K + LP + F + LP N++A+
Sbjct: 312 G-YPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYAM 370
Query: 429 VG----NEVLCLI---LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQK 481
V CL+ + + GPA I G FQ QN + +DL +R GF
Sbjct: 371 AAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPMD 430
Query: 482 CA 483
C
Sbjct: 431 CV 432
>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
Length = 416
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 139/410 (33%), Positives = 200/410 (48%), Gaps = 44/410 (10%)
Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPSRIPAFIPKRSS 161
GY ISL+ GTPPQ ++ DTGS L W PC + + C+DC+ + + AF P SS
Sbjct: 11 GYLISLNIGTPPQVIQVYM-DTGSDLTWVPCGNLSFDCMDCD-DYRNSKLMSAFSPSHSS 68
Query: 162 SSQLIGCQNPKCSWI------FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGL 214
SS C +P C+ I F P + C + TC CPS+ YG G G
Sbjct: 69 SSYRDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGT 128
Query: 215 LLSETLRFP------SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL--KKFSY 266
L +TLR +K +P F GC + +P GIAGF R + S PSQLGL K FS+
Sbjct: 129 LTRDTLRVHEGPARVTKDIPKFCFGCVGSTYHEPIGIAGFVRGTLSFPSQLGLLKKGFSH 188
Query: 267 CLLSRKFDDAP-VSSNLVL-DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
C L+ K+ + P +SS LV+ DT S D+ + +TP K+P+ + +YY+GL
Sbjct: 189 CFLAFKYANNPNISSPLVIGDTALSSKDN----MQFTPMLKSPM-----YPNYYYIGLEA 239
Query: 325 IIVGS-KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
I VG+ +P + S GNGG+++DSG+T+T + P + + F + + Y RA
Sbjct: 240 ITVGNVSATTVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIF-KAIITYPRAT 298
Query: 384 DVEKKSGLRPCFDIS------GKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEV 433
+VE ++G C+ + P + F LP N+F A + V
Sbjct: 299 EVEMRAGFDLCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFVLPQGNHFYAMSAPSNSTV 358
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ +LF A GPA + G FQ QN + +DL +R GF CA
Sbjct: 359 VKCLLFQSMADS---DYGPAGVFGSFQQQNVQIVYDLEKERIGFQPMDCA 405
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 165/494 (33%), Positives = 216/494 (43%), Gaps = 68/494 (13%)
Query: 6 FSLICLFSLLILLFTTDAGAGSSAAT------VTVPLTPLSTKHYLHHSDSDPLKILHSL 59
FS I L SLL++ A SS + VPLT H H + L++L
Sbjct: 25 FSWIVLVSLLLVSMAIVLAAASSHPAAGLLDGLRVPLT-----HVDAHGNYTKLQLLRRA 79
Query: 60 ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG-YSISLSFGTPPQAST 118
A S R L +T GS + + + VH+ G + + +S GTP A
Sbjct: 80 ARRSHHRMSRLVARTA-------TGSVKAAAAPDLQVPVHAGNGEFLMDMSIGTPALAYA 132
Query: 119 PFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFG 178
I DTGS LVW C CV+C + P F P SS+ + C + CS +
Sbjct: 133 A-IVDTGSDLVWTQCKP---CVECFNQST-----PVFDPSSSSTYSTLPCSSSLCSDLPT 183
Query: 179 PNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTVPNFLAGCSIL 237
S K C Y YG T G+L +ET +P GC
Sbjct: 184 STCTSAAKDCG------------YTYTYGDASSTQGVLAAETFTLAKTKLPGVAFGCGDT 231
Query: 238 SD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
++ Q AG+ G GR SL SQLGL KFSYCL S DD S L+ S D+
Sbjct: 232 NEGDGFTQGAGLVGLGRGPLSLVSQLGLGKFSYCLTS--LDDTSKSPLLLGSLAAISTDT 289
Query: 294 KTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVD 352
+ + TP KNP S FYYV L+ + VGS + +P S DG GGVIVD
Sbjct: 290 ASAAAIQTTPLIKNPSQPS-----FYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVIVD 344
Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD--ISGKKSVYLPELIL 410
SG++ T++E + + K F QM AD GL CF SG V +P+L+L
Sbjct: 345 SGTSITYLELQGYRPLKKAFAAQM--KLPVAD-GSAVGLDLCFKAPASGVDDVEVPKLVL 401
Query: 411 KFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFD 469
F GGA + LP ENY L + LCL + RG +II G+FQ QN +D
Sbjct: 402 HFDGGADLDLPAENYMVLDSASGALCLTVMGS--------RGLSII-GNFQQQNIQFVYD 452
Query: 470 LANDRFGFAKQKCA 483
+ D FA +CA
Sbjct: 453 VDKDTLSFAPVQCA 466
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 142/396 (35%), Positives = 184/396 (46%), Gaps = 53/396 (13%)
Query: 98 VHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
VH+ G + + +S GTP A + I DTGS LVW C CVDC P F
Sbjct: 88 VHAGNGEFLMDVSIGTPALAYSA-IVDTGSDLVWTQCKP---CVDCF-----KQSTPVFD 138
Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLL 215
P SS+ + C + CS + P +K + Y YG T G+L
Sbjct: 139 PSSSSTYATVPCSSASCSDL-------------PTSKCTSASKCGYTYTYGDSSSTQGVL 185
Query: 216 LSETLRFPSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
+ET +P + GC ++ Q AG+ G GR SL SQLGL KFSYCL S
Sbjct: 186 ATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTS- 244
Query: 272 KFDDAPVSSNLVLDT--GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
DD +S L+L + G + + TP KNP S FYYV L+ I VGS
Sbjct: 245 -LDDTN-NSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPS-----FYYVSLKAITVGS 297
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
+ +P S DG GGVIVDSG++ T++E + A+ K F QM AAD
Sbjct: 298 TRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA--LPAAD-GSGV 354
Query: 390 GLRPCFD--ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGP 446
GL CF G V +P L+ F GGA + LP ENY L G LCL +
Sbjct: 355 GLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGS----- 409
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
RG +II G+FQ QNF +D+ +D FA +C
Sbjct: 410 ---RGLSII-GNFQQQNFQFVYDVGHDTLSFAPVQC 441
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 142/396 (35%), Positives = 184/396 (46%), Gaps = 53/396 (13%)
Query: 98 VHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
VH+ G + + +S GTP A + I DTGS LVW C CVDC P F
Sbjct: 98 VHAGNGEFLMDVSIGTPALAYSA-IVDTGSDLVWTQCKP---CVDCF-----KQSTPVFD 148
Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLL 215
P SS+ + C + CS + P +K + Y YG T G+L
Sbjct: 149 PSSSSTYATVPCSSASCSDL-------------PTSKCTSASKCGYTYTYGDSSSTQGVL 195
Query: 216 LSETLRFPSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
+ET +P + GC ++ Q AG+ G GR SL SQLGL KFSYCL S
Sbjct: 196 ATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTS- 254
Query: 272 KFDDAPVSSNLVLDT--GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
DD +S L+L + G + + TP KNP S FYYV L+ I VGS
Sbjct: 255 -LDDTN-NSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPS-----FYYVSLKAITVGS 307
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
+ +P S DG GGVIVDSG++ T++E + A+ K F QM AAD
Sbjct: 308 TRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA--LPAAD-GSGV 364
Query: 390 GLRPCFD--ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGP 446
GL CF G V +P L+ F GGA + LP ENY L G LCL +
Sbjct: 365 GLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGS----- 419
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
RG +II G+FQ QNF +D+ +D FA +C
Sbjct: 420 ---RGLSII-GNFQQQNFQFVYDVGHDTLSFAPVQC 451
>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 143/476 (30%), Positives = 216/476 (45%), Gaps = 61/476 (12%)
Query: 53 LKILHSLASSSLSRARHL--KTKTKPKTK---DSNIGSNYSNSLIKTPLSVHSYGGYSIS 107
L ++HSL+ + + HL T T+ T+ + +++++ + PLS S Y++S
Sbjct: 28 LPLIHSLSKTQFTSTHHLLKSTSTRSTTRFHHHHHNKNSHNHRQVSLPLSPGS--DYTLS 85
Query: 108 LSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG 167
+ + P + DTGS LVWFPC + C+ C + S PK S ++ +
Sbjct: 86 FTINSQPIS---LYLDTGSDLVWFPCQP-FECILCEGKAENASLASTPPPKLSKTATPVS 141
Query: 168 CQNPKCSWIFGPNVESRCKGCSPRNKTCPL-----------ACPSYLLQYGLGFTAGLLL 216
C++ CS + N+ S C+ N CPL +CP + YG G L
Sbjct: 142 CKSSACSAVHS-NLPSS-DLCAISN--CPLESIEISDCRKHSCPQFYYAYGDGSLIARLY 197
Query: 217 SETLRFP-----SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFS 265
+++R P + NF GC+ + +P G+AGFGR SLP+QL +FS
Sbjct: 198 RDSIRLPLSNQTNLIFNNFTFGCAHTTLAEPIGVAGFGRGVLSLPAQLATLSPQLGNQFS 257
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKT--------PGLSYTPFYKNPVGSSSAFGEF 317
YCL+S FD V L G D K P YT NP F
Sbjct: 258 YCLVSHSFDSDRVRRPSPLILGRYDHDEKERRVNGVKKPSFVYTSMLDNP-----RHPYF 312
Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
Y VGL I +G K + P G+GGV+VDSG+TFT + L++ V EF ++G
Sbjct: 313 YCVGLEGISIGRKKIPAPDFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVVAEFENRVG 372
Query: 378 NYS-RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---------A 427
+ RA+ +E+ +GL PC+ ++ G+ + LP NYF
Sbjct: 373 RVNERASVIEENTGLSPCYYFDNNVVNVPRVVLHFVGNGSSVVLPRRNYFYEFLDGGHGK 432
Query: 428 LVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+V CL+L + L GP LG++Q Q F + +DL N R GFA+++CA
Sbjct: 433 GKKRKVGCLMLM-NGGDEAELSGGPGATLGNYQQQGFEVVYDLENRRVGFARRQCA 487
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 142/396 (35%), Positives = 184/396 (46%), Gaps = 53/396 (13%)
Query: 98 VHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
VH+ G + + +S GTP A + I DTGS LVW C CVDC P F
Sbjct: 67 VHAGNGEFLMDVSIGTPALAYSA-IVDTGSDLVWTQCKP---CVDCF-----KQSTPVFD 117
Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLL 215
P SS+ + C + CS + P +K + Y YG T G+L
Sbjct: 118 PSSSSTYATVPCSSASCSDL-------------PTSKCTSASKCGYTYTYGDSSSTQGVL 164
Query: 216 LSETLRFPSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
+ET +P + GC ++ Q AG+ G GR SL SQLGL KFSYCL S
Sbjct: 165 ATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTS- 223
Query: 272 KFDDAPVSSNLVLDT--GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
DD +S L+L + G + + TP KNP S FYYV L+ I VGS
Sbjct: 224 -LDDTN-NSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPS-----FYYVSLKAITVGS 276
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
+ +P S DG GGVIVDSG++ T++E + A+ K F QM AAD
Sbjct: 277 TRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA--LPAAD-GSGV 333
Query: 390 GLRPCFD--ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGP 446
GL CF G V +P L+ F GGA + LP ENY L G LCL +
Sbjct: 334 GLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGS----- 388
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
RG +II G+FQ QNF +D+ +D FA +C
Sbjct: 389 ---RGLSII-GNFQQQNFQFVYDVGHDTLSFAPVQC 420
>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 492
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 154/460 (33%), Positives = 215/460 (46%), Gaps = 50/460 (10%)
Query: 55 ILHSLASSSL-SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTP 113
I H + SSSL S ARH + +T ++ S+ + + PL+ S Y++SLS G P
Sbjct: 41 IHHLIRSSSLRSAARHGRHRTH------HLPSSRRHRQLSLPLAPGS--DYTLSLSVG-P 91
Query: 114 PQASTP--FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP-KRSSSSQLIGCQN 170
+ P DTGS LVWFPC + C+ C P + P + S+ I C +
Sbjct: 92 LSTANPVSLFLDTGSDLVWFPCAP-FTCMLCEGKPTPPGNNNSSNPLPPPTDSRRIPCAS 150
Query: 171 PKCSWIF--GPNVE----SRCKGCSPRNKTCPL--ACPSYLLQYGLG-FTAGLLLSETLR 221
P CS P + +RC +C ACP YG G A L
Sbjct: 151 PFCSAAHSSAPPADLCAAARCPLDDIETGSCAASHACPPLYYAYGDGSLVARLRRGRVGI 210
Query: 222 FPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK----KFSYCLLSRKFD-DA 276
S V NF C+ + +P G+AGFGR SLP+QL +FSYCL++ F D
Sbjct: 211 AASVAVENFTFACAHTALGEPVGVAGFGRGPLSLPAQLAPAALSGRFSYCLVAHSFRADR 270
Query: 277 PVS-SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
P+ S L+L PG + G+ YTP NP FY V L + VG +
Sbjct: 271 PIRPSPLILGRSPGEDPASETGIVYTPLLHNP-----KHPYFYSVALEAVSVGGTRIPAR 325
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM--GNYSRAADVEKKSGLRP 393
G G+GG++VDSG+TFT + + VA+EF R M + RA E ++GL P
Sbjct: 326 PELGRVGRAGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEAAEDQTGLAP 385
Query: 394 CF----DISGKK---SVYLPELILKFKGGAKMALPPENYFALVGNE----VLCLILFTDN 442
C+ D S + + +P L + F+G A + LP NYF +E V CL+L
Sbjct: 386 CYYYDHDASAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMGFRSEERRRVGCLMLMN-- 443
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G G GPA LG+FQ Q F + +D+ R GFA+++C
Sbjct: 444 -GGEDDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|297740193|emb|CBI30375.3| unnamed protein product [Vitis vinifera]
Length = 256
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 103/250 (41%), Positives = 139/250 (55%), Gaps = 30/250 (12%)
Query: 1 MAACPFSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLA 60
MA+ L +F+L +LF + + AT+T+PLT T ++P L LA
Sbjct: 29 MASSTSLLFPVFTLFSILFLASSSNDNIPATITIPLTSTFTSKL----STEPRVFLQHLA 84
Query: 61 SSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPF 120
S+SLSRA HLK T ++ L+K L HSYGG++I LSFGTPPQ + F
Sbjct: 85 SASLSRAHHLKHGT-------------TSPLVKASLFPHSYGGHTIPLSFGTPPQKLS-F 130
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
+ DTGS +VW PCT+ Y C +C+F N P ++P F PK SSS +++ C+NPKCS
Sbjct: 131 LVDTGSHVVWAPCTTHYTCTNCSFSN--PKKVPIFNPKLSSSYKILECRNPKCSL----- 183
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDR 240
C C+ +K C ACP Y LQYG G +G L E L FP KT+ FL GC+ +
Sbjct: 184 ---GCPRCNGNSKNCSHACPQYSLQYGTGSASGFFLLENLNFPGKTIHKFLVGCTTSAAH 240
Query: 241 QPA--GIAGF 248
+P +AGF
Sbjct: 241 EPTSDALAGF 250
>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
Length = 492
Score = 168 bits (425), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 141/402 (35%), Positives = 191/402 (47%), Gaps = 30/402 (7%)
Query: 104 YSISLSFGTPPQASTPFIF-DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSS 162
Y++SLS G P AS+ +F DTGS LVWFPC + C+ C + +P
Sbjct: 88 YTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAP-FTCMLCEGKATPGGNHSSPLPP-PID 145
Query: 163 SQLIGCQNPKCSWIF--GPNVE----SRCKGCSPRNKTCP-LACPSYLLQYGLG-FTAGL 214
S+ I C +P CS P + +RC + +C ACP YG G A L
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANL 205
Query: 215 LLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
S V NF C+ + +P G+AGFGR SLP+QL +FSYCL++
Sbjct: 206 RRGRVGLAASMAVENFTFACAHTALAEPVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVAH 265
Query: 272 KF--DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
F D SS L+L S D+ G S T F P+ + FY V L + VG
Sbjct: 266 SFRADRLIRSSPLILGR---STDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGG 322
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA--DVEK 387
K ++ DGNGG++VDSG+TFT + F VA EF R M E
Sbjct: 323 KRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEA 382
Query: 388 KSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEVLCLILFT--- 440
++GL PC+ S +P + L F+G A +ALP NYF + G V CL+L
Sbjct: 383 QTGLAPCYHYSPSDRA-VPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGG 441
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+N G G GPA LG+FQ Q F + +D+ R GFA+++C
Sbjct: 442 NNDDGED-GGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
Length = 519
Score = 168 bits (425), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 141/402 (35%), Positives = 191/402 (47%), Gaps = 30/402 (7%)
Query: 104 YSISLSFGTPPQASTPFIF-DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSS 162
Y++SLS G P AS+ +F DTGS LVWFPC + C+ C + +P
Sbjct: 88 YTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAP-FTCMLCEGKATPGGNHSSPLPP-PID 145
Query: 163 SQLIGCQNPKCSWIF--GPNVE----SRCKGCSPRNKTCP-LACPSYLLQYGLG-FTAGL 214
S+ I C +P CS P + +RC + +C ACP YG G A L
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANL 205
Query: 215 LLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
S V NF C+ + +P G+AGFGR SLP+QL +FSYCL++
Sbjct: 206 RRGRVGLAASMAVENFTFACAHTALAEPVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVAH 265
Query: 272 KF--DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
F D SS L+L S D+ G S T F P+ + FY V L + VG
Sbjct: 266 SFRADRLIRSSPLILGR---STDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGG 322
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD--VEK 387
K ++ DGNGG++VDSG+TFT + F VA EF R M E
Sbjct: 323 KRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEA 382
Query: 388 KSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEVLCLILFT--- 440
++GL PC+ S +P + L F+G A +ALP NYF + G V CL+L
Sbjct: 383 QTGLAPCYHYSPSDRA-VPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGG 441
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+N G G GPA LG+FQ Q F + +D+ R GFA+++C
Sbjct: 442 NNDDGED-GGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
Length = 486
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 140/410 (34%), Positives = 197/410 (48%), Gaps = 44/410 (10%)
Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPSRIPAFIPKRSS 161
GY ISL+ GTPPQ + DTGS L W PC + + C++C+ + + F P SS
Sbjct: 81 GYLISLNIGTPPQV-IQVLMDTGSDLTWVPCGNLSFDCMECD-DYRNNKLMATFSPSYSS 138
Query: 162 SSQLIGCQNPKCSWIF---GPNVESRCKGCSPRN---KTCPLACPSYLLQYGLG-FTAGL 214
SS C +P C I P GCS TC CPS+ YG G G+
Sbjct: 139 SSYRASCASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVVTGI 198
Query: 215 LLSETLRFP------SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL--KKFSY 266
L +TLR +K +P F GC + R+P GIAGFGR + S+ SQLG K FS+
Sbjct: 199 LTRDTLRVNGSSPGVAKEIPKFCFGCVGSAYREPIGIAGFGRGTLSMVSQLGFLQKGFSH 258
Query: 267 CLLSRKFDDAP-VSSNLVL-DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
C L+ K+ + P +SS LV+ D S D + +TP +P+ + FYYVGL
Sbjct: 259 CFLAFKYANNPNISSPLVVGDIALTSKDD----MQFTPMLNSPM-----YPNFYYVGLEA 309
Query: 325 IIVGS-KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
I VG+ ++P S S GNGG+ +DSG+T+T + P + V ++ NY R
Sbjct: 310 ITVGNVSATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVL-SILQSTINYPRDT 368
Query: 384 DVEKKSGLRPCFDI------SGKKSVYLPELILKFKGGAKMALPPENYFALV---GN-EV 433
+E ++G C+ + + LP + F + LP N+F V GN V
Sbjct: 369 GMEMQTGFDLCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLPQGNHFYPVSAPGNPAV 428
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ ++F G GPA + G FQ QN + +DL +R GF CA
Sbjct: 429 VKCLMFQSTDDG---DDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 475
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 143/489 (29%), Positives = 220/489 (44%), Gaps = 73/489 (14%)
Query: 24 GAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTK---- 79
GA + + + L+ LH +++ + +++SR + L+ K +PK
Sbjct: 113 GAEPKNSVIDSTVRDLTRIQNLHR------RVIENRNQNTISRLQRLQ-KEQPKQSFKPV 165
Query: 80 ---DSNIGSNYSNSLIKTPLSVHSYGG--YSISLSFGTPPQASTPFIFDTGSSLVWFPCT 134
++ S S L+ T S S G Y + + GTPP+ + I DTGS L W C
Sbjct: 166 FAPAASSTSPVSGQLVATLESGVSLGSGEYFMDVFVGTPPKHFS-LILDTGSDLNWIQCV 224
Query: 135 SRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKT 194
C + + P DP K SSS + I C +P+C + P+ + CK N++
Sbjct: 225 PCIACFEQSGPYYDP--------KDSSSFRNISCHDPRCQLVSSPDPPNPCKA---ENQS 273
Query: 195 CPLACPSYLLQYGLGF-TAGLLLSET----LRFPS-----KTVPNFLAGCSILS------ 238
CP Y YG G T G ET L P+ K V N + GC +
Sbjct: 274 CP-----YFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHVENVMFGCGHWNRGLFHG 328
Query: 239 ----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
G F +SL Q FSYCL+ R +A VSS L+ G
Sbjct: 329 AAGLLGLGKGPLSFASQMQSLYGQ----SFSYCLVDRN-SNASVSSKLIF--GEDKELLS 381
Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
P L++T F G + FYYV + ++V + +KIP S+G GG I+DSG
Sbjct: 382 HPNLNFTSFGG---GKDGSVDTFYYVQINSVMVDDEVLKIPEETWHLSSEGAGGTIIDSG 438
Query: 355 STFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
+T T+ P +E + + F+R++ Y VE L+PC+++SG + + LP+ + F
Sbjct: 439 TTLTYFAEPAYEIIKEAFVRKIKGYEL---VEGLPPLKPCYNVSGIEKMELPDFGILFAD 495
Query: 415 GAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
GA P ENYF + +V+CL + + R I+G++Q QNF++ +D+ R
Sbjct: 496 GAVWNFPVENYFIQIDPDVVCLAILGN-------PRSALSIIGNYQQQNFHILYDMKKSR 548
Query: 475 FGFAKQKCA 483
G+A KCA
Sbjct: 549 LGYAPMKCA 557
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 138/383 (36%), Positives = 176/383 (45%), Gaps = 52/383 (13%)
Query: 110 FGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169
GTP A + I DTGS LVW C CVDC P F P SS+ + C
Sbjct: 173 IGTPALAYSA-IVDTGSDLVWTQCKP---CVDCF-----KQSTPVFDPSSSSTYATVPCS 223
Query: 170 NPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTVP 228
+ CS + P +K + Y YG T G+L +ET +P
Sbjct: 224 SASCSDL-------------PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLP 270
Query: 229 NFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVL 284
+ GC ++ Q AG+ G GR SL SQLGL KFSYCL S DD +S L+L
Sbjct: 271 GVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTS--LDDTN-NSPLLL 327
Query: 285 DT--GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPG 342
+ G + + TP KNP S FYYV L+ I VGS + +P S
Sbjct: 328 GSLAGISEASAAASSVQTTPLIKNPSQPS-----FYYVSLKAITVGSTRISLPSSAFAVQ 382
Query: 343 SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD--ISGK 400
DG GGVIVDSG++ T++E + A+ K F QM AAD GL CF G
Sbjct: 383 DDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA--LPAAD-GSGVGLDLCFRAPAKGV 439
Query: 401 KSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRGPAIILGDF 459
V +P L+ F GGA + LP ENY L G LCL + RG +II G+F
Sbjct: 440 DQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGS--------RGLSII-GNF 490
Query: 460 QLQNFYLEFDLANDRFGFAKQKC 482
Q QNF +D+ +D FA +C
Sbjct: 491 QQQNFQFVYDVGHDTLSFAPVQC 513
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 126/402 (31%), Positives = 185/402 (46%), Gaps = 57/402 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + GTPP+ + I DTGS L W C C + + P DP K SS
Sbjct: 195 GEYFMDVFVGTPPKHFS-LILDTGSDLNWIQCVPCIACFEQSGPYYDP--------KDSS 245
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSET- 219
S + I C +P+C + P+ CK N++CP Y YG G T G ET
Sbjct: 246 SFRNISCHDPRCQLVSAPDPPKPCKA---ENQSCP-----YFYWYGDGSNTTGDFALETF 297
Query: 220 ---LRFPS-----KTVPNFLAGCSILS----------DRQPAGIAGFGRSSESLPSQLGL 261
L P+ K V N + GC + G F +SL Q
Sbjct: 298 TVNLTTPNGTSELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQ--- 354
Query: 262 KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
FSYCL+ R +A VSS L+ G P L++T F G + FYYV
Sbjct: 355 -SFSYCLVDRN-SNASVSSKLIF--GEDKELLSHPNLNFTSFGG---GKDGSVDTFYYVQ 407
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
++ ++V + +KIP S+G GG I+DSG+T T+ P +E + + F+R++ Y
Sbjct: 408 IKSVMVDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQL 467
Query: 382 AADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
VE L+PC+++SG + + LP+ + F A P ENYF + EV+CL + +
Sbjct: 468 ---VEGLPPLKPCYNVSGIEKMELPDFGILFADEAVWNFPVENYFIWIDPEVVCLAILGN 524
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
R I+G++Q QNF++ +D+ R G+A KCA
Sbjct: 525 -------PRSALSIIGNYQQQNFHILYDMKKSRLGYAPMKCA 559
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 165 bits (417), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 134/402 (33%), Positives = 191/402 (47%), Gaps = 56/402 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + GTPP+ + I DTGS L W C Y C + N P DP K SS
Sbjct: 193 GEYFMDVFVGTPPKHFS-LILDTGSDLNWIQCVPCYACFEQNGPYYDP--------KDSS 243
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSET- 219
S + I C +P+C + P+ CKG ++CP Y YG T G ET
Sbjct: 244 SFKNITCHDPRCQLVSSPDPPQPCKG---ETQSCP-----YFYWYGDSSNTTGDFALETF 295
Query: 220 ---LRFPS-----KTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFS 265
L P K V N + GC + AG+ G GR S +QL FS
Sbjct: 296 TVNLTTPEGKPELKIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSLYGHSFS 355
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY---KNPVGSSSAFGEFYYVGL 322
YCL+ R ++ VSS L+ G P L++T F +NPV + FYYV +
Sbjct: 356 YCLVDRN-SNSSVSSKLIF--GEDKELLSHPNLNFTSFVGGKENPVDT------FYYVLI 406
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
+ I+VG + +KIP + G GG I+DSG+T T+ P +E + + F+R++ +
Sbjct: 407 KSIMVGGEVLKIPEETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPL- 465
Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTD 441
VE L+PC+++SG + + LPE + F GA P ENYF + E V+CL +
Sbjct: 466 --VETFPPLKPCYNVSGVEKMELPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAIL-- 521
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
R I+G++Q QNF++ +DL R G+A KCA
Sbjct: 522 -----GTPRSALSIIGNYQQQNFHILYDLKKSRLGYAPMKCA 558
>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 137/472 (29%), Positives = 214/472 (45%), Gaps = 53/472 (11%)
Query: 53 LKILHSLASSSLSRARHLKTKTKPKT-----KDSNIGSNYSNSLIKTPLSVHSYGGYSIS 107
L + HSL+ + + HL T + + + + +++ + PLS S Y++S
Sbjct: 28 LPLTHSLSKTQFTSTHHLIKSTSTSSITRFRRHHHQKNTHNHRQVSLPLSPGS--DYTLS 85
Query: 108 LSFGTPPQASTPFIF-DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
+ + P F++ DTGS LVWFPC + C+ C + S PK S ++ +
Sbjct: 86 FTLDSQPI----FLYLDTGSDLVWFPCQP-FECILCEGKAENTSLASTPPPKLSKTATPV 140
Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPL-----------ACPSYLLQYGLGFTAGLL 215
C++ CS N+ S C+ N CPL +CP + YG G L
Sbjct: 141 SCKSSACSAAHS-NLPSS-DLCAISN--CPLESIETSDCQKHSCPQFYYAYGDGSLIARL 196
Query: 216 LSETLRFP-----SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKF 264
+++ P + V NF GC+ + +P G+AGFGR SLP+QL +F
Sbjct: 197 YRDSISLPLSNPTNLIVNNFTFGCAHTALAEPIGVAGFGRGVLSLPAQLATLSPQLGNQF 256
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSK---TPGLSYTPFYKNPVGSSSAFGEFYYVG 321
SYCL+S FD + L G D K G++ F + + FY VG
Sbjct: 257 SYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKPRFVYTSMLDNLEHPYFYCVG 316
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS- 380
L I +G K + P +G+GG++VDSG+TFT + L+ +V EF ++G +
Sbjct: 317 LEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNE 376
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---------ALVGN 431
RA +E+ +GL PC+ ++ G+ + LP NYF
Sbjct: 377 RARVIEEDTGLSPCYYFDNNVVNVPSVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKR 436
Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+V CL+L + L GP LG++Q Q F + +DL N R GFA+++CA
Sbjct: 437 KVGCLMLM-NGGEEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQCA 487
>gi|224101053|ref|XP_002334311.1| predicted protein [Populus trichocarpa]
gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 137/472 (29%), Positives = 214/472 (45%), Gaps = 53/472 (11%)
Query: 53 LKILHSLASSSLSRARHLKTKTKPKT-----KDSNIGSNYSNSLIKTPLSVHSYGGYSIS 107
L + HSL+ + + HL T + + + + +++ + PLS S Y++S
Sbjct: 28 LPLTHSLSKTQFTSTHHLIKSTSTSSITRFRRHHHQKNTHNHRQVSLPLSPGS--DYTLS 85
Query: 108 LSFGTPPQASTPFIF-DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
+ + P F++ DTGS LVWFPC + C+ C + S PK S ++ +
Sbjct: 86 FTLDSQPI----FLYLDTGSDLVWFPCQP-FECILCEGKAENTSLASTPPPKLSKTATPV 140
Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPL-----------ACPSYLLQYGLGFTAGLL 215
C++ CS N+ S C+ N CPL +CP + YG G L
Sbjct: 141 SCKSSACSAAHS-NLPSS-DLCAISN--CPLESIETSDCQKHSCPQFYYAYGDGSLIARL 196
Query: 216 LSETLRFP-----SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKF 264
+++ P + V NF GC+ + +P G+AGFGR SLP+QL +F
Sbjct: 197 YRDSISLPLSNPTNLIVNNFTFGCAHTALAEPIGVAGFGRGVLSLPAQLATLSPQLGNQF 256
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSK---TPGLSYTPFYKNPVGSSSAFGEFYYVG 321
SYCL+S FD + L G D K G++ F + + FY VG
Sbjct: 257 SYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKPRFVYTSMLDNLEHPYFYCVG 316
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS- 380
L I +G K + P +G+GG++VDSG+TFT + L+ +V EF ++G +
Sbjct: 317 LEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNE 376
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---------ALVGN 431
RA +E+ +GL PC+ ++ G+ + LP NYF
Sbjct: 377 RARVIEEDTGLSPCYYFDNNVVNVPSVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKR 436
Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+V CL+L + L GP LG++Q Q F + +DL N R GFA+++CA
Sbjct: 437 KVGCLMLM-NGGDEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQCA 487
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 135/416 (32%), Positives = 196/416 (47%), Gaps = 53/416 (12%)
Query: 87 YSNSLIKTPLSVHSYGG--YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
YS+ L+ T S S G Y + + GTPP+ + I DTGS L W C C + +
Sbjct: 173 YSSQLVATLESGVSLGSGEYFMDVFIGTPPKHYS-LILDTGSDLNWIQCVPCIACFEQSG 231
Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
P DP K SSS + I C +P+C + P+ CK N+TCP Y
Sbjct: 232 PYYDP--------KESSSFENITCHDPRCKLVSSPDPPKPCKD---ENQTCP-----YFY 275
Query: 205 QYG-LGFTAGLLLSETL---------RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRS 251
YG T G ET + K V N + GC + AG+ G GR
Sbjct: 276 WYGDSSNTTGDFALETFTVNLTTPNGKSEQKHVENVMFGCGHWNRGLFHGAAGLLGLGRG 335
Query: 252 SESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPV 308
S SQL FSYCL+ R D VSS L+ G P L++T F
Sbjct: 336 PLSFASQLQSIYGHSFSYCLVDRN-SDTSVSSKLIF--GEDKELLSHPNLNFTSFVG--- 389
Query: 309 GSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
G ++ FYYVG++ I+V + +KIP +G GG I+DSG+T T+ P +E +
Sbjct: 390 GEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEII 449
Query: 369 AKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL 428
+ F++++ Y VE L+PC+++SG + + LP+ + F GA P ENYF
Sbjct: 450 KEAFMKKIKGYEL---VEGFPPLKPCYNVSGIEKMELPDFGILFSDGAMWDFPVENYFIQ 506
Query: 429 VGNEVLCL-ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ +++CL IL T +A I+G++Q QNF++ +D+ R G+A KC
Sbjct: 507 IEPDLVCLAILGTPKSA--------LSIIGNYQQQNFHILYDMKKSRLGYAPMKCT 554
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 148/453 (32%), Positives = 199/453 (43%), Gaps = 62/453 (13%)
Query: 43 HYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYG 102
H H + L++L A S R L + + + + ++ P VH+
Sbjct: 46 HVDAHGNYSRLQLLQRAARRSHHRMSRLVARA------TGVKAVAGGGDLQVP--VHAGN 97
Query: 103 G-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G + + ++ GTP S I DTGS LVW C CVDC P F P SS
Sbjct: 98 GEFLMDVAIGTPA-LSYAAIVDTGSDLVWTQCKP---CVDCF-----KQSTPVFDPSSSS 148
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ + C + CS + P + + Y YG T G+L SET
Sbjct: 149 TYATVPCSSALCSDL-------------PTSTCTSASKCGYTYTYGDASSTQGVLASETF 195
Query: 221 RF--PSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
K +P GC ++ Q AG+ G GR SL SQLGL KFSYCL S D
Sbjct: 196 TLGKEKKKLPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLDKFSYCLTS--LD 253
Query: 275 DAPVSSNLVLDTGPGSGDSKTPG--LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
D S L+L + + TP KNP S FYYV L + VGS +
Sbjct: 254 DGDGKSPLLLGGSAAAISESAATAPVQTTPLVKNPSQPS-----FYYVSLTGLTVGSTRI 308
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
+P S DG GGVIVDSG++ T++E + A+ K F+ QM + + GL
Sbjct: 309 TLPASAFAIQDDGTGGVIVDSGTSITYLELQGYRALKKAFVAQMALPTVDG---SEIGLD 365
Query: 393 PCFD--ISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALG 449
CF G V +P+L+L F GGA + LP ENY L + LCL + A
Sbjct: 366 LCFQGPAKGVDEVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTV--------APS 417
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
RG +II G+FQ QNF +D+A D FA +C
Sbjct: 418 RGLSII-GNFQQQNFQFVYDVAGDTLSFAPVQC 449
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 150/481 (31%), Positives = 217/481 (45%), Gaps = 67/481 (13%)
Query: 27 SSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTK-------TKPKTK 79
S A+ T LT + T H +IL ++LSR + K P++
Sbjct: 118 SFVASTTRDLTRIQTLHK---------RILEKKNQNALSRLNKEEPKQPVVAPAASPESY 168
Query: 80 DSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC 139
+N S + +++ +S+ S G Y + + GTPP+ + I DTGS L W C Y C
Sbjct: 169 PANGLSGQLMATLESGVSLGS-GEYFMDVFIGTPPRHFS-LILDTGSDLNWIQCVPCYDC 226
Query: 140 VDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLAC 199
N P DP K SSS + IGC +P+C + P+ CK N+TCP
Sbjct: 227 FVQNGPYYDP--------KESSSFKNIGCHDPRCHLVSSPDPPQPCKA---ENQTCP--- 272
Query: 200 PSYLLQYG-LGFTAGLLLSET----LRFPS-----KTVPNFLAGCSILSD---RQPAGIA 246
Y YG T G ET L P+ K V N + GC + AG+
Sbjct: 273 --YFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRVENVMFGCGHWNRGLFHGAAGLL 330
Query: 247 GFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPF 303
G GR S SQL FSYCL+ R D VSS L+ G P +++T
Sbjct: 331 GLGRGPLSFSSQLQSLYGHSFSYCLVDRN-SDTNVSSKLIF--GEDKDLLNHPEVNFTSL 387
Query: 304 YKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGP 363
G + FYYV ++ I+VG + +KIP +G GG IVDSG+T ++ P
Sbjct: 388 V---AGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHLSPEGAGGTIVDSGTTLSYFAEP 444
Query: 364 LFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPE 423
+E + F++++ Y D L PC+++SG + + LPE + F+ GA P E
Sbjct: 445 SYEIIKDAFVKKVKGYPVIKDFPI---LDPCYNVSGVEKMELPEFRILFEDGAVWNFPVE 501
Query: 424 NYF-ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
NYF L E++CL + R I+G++Q QNF++ +D R G+A KC
Sbjct: 502 NYFIKLEPEEIVCLAIL-------GTPRSALSIIGNYQQQNFHILYDTKKSRLGYAPMKC 554
Query: 483 A 483
A
Sbjct: 555 A 555
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 149/499 (29%), Positives = 216/499 (43%), Gaps = 68/499 (13%)
Query: 1 MAACPFSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLA 60
M C S + ++L+ L T A ++ T+ LT H + L +A
Sbjct: 1 MKDCSMSELLAYALIFTLLFTAAATPTAGLTMRADLT-----HVDKGRGFTRWERLSRMA 55
Query: 61 SSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPF 120
S +RA L + G +Y + T +V S G Y I + GTP
Sbjct: 56 VRSRARAASLYQR----------GGHYGQPVTAT--AVPSSGEYLIHFNIGTPRPQRVAL 103
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
DTGS LVW CT C D FP DPS SS+ + + C +P C G +
Sbjct: 104 TMDTGSDLVWTQCTPCPVCFDQPFPLFDPSV--------SSTFRAVACPDPICRPSSGLS 155
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF--------PSKTVPNFL 231
V + C+ + C YL YG TAG + +T F P V
Sbjct: 156 VSA----CALKTFRC-----FYLCSYGDKSITAGYIFKDTFTFMSPNGEGAPPVAVSGLA 206
Query: 232 AGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTG 287
GC + +GIAGFGR SLPSQL + +FSYCL S ++ +S + L T
Sbjct: 207 FGCGDYNTGVFASNESGIAGFGRGPLSLPSQLRVGRFSYCLTSHDETESNKTSAVFLGTP 266
Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
P + + G PF P+ S +F FYY+ L I VG + + S DG+G
Sbjct: 267 PNGLRAHSSG----PFRSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSSVFALKKDGSG 322
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQ--MGNYSRAADVEKKSGLRPCFDI-SGKKSVY 404
G ++DSG+ T +FE + EF+ Q + Y ++V G CF G K V
Sbjct: 323 GTVIDSGTGVTTFPAAVFEQLKNEFVAQLPLPRYDNTSEV----GNLLCFQRPKGGKQVP 378
Query: 405 LPELILKFKGGAKMALPPENYF-ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
+P+LI A M LP ENY + V+CL++ N A + +++G+FQ QN
Sbjct: 379 VPKLIFHL-ASADMDLPRENYIPEDTDSGVMCLMI---NGAEVDM-----VLIGNFQQQN 429
Query: 464 FYLEFDLANDRFGFAKQKC 482
++ +D+ N + FA +C
Sbjct: 430 MHIVYDVENSKLLFASAQC 448
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 130/402 (32%), Positives = 188/402 (46%), Gaps = 56/402 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y I + GTPP+ + I DTGS L W C Y C + N P+ DP + SS
Sbjct: 179 GEYFIDVFVGTPPKHFS-LILDTGSDLNWIQCVPCYECFEQNGPHYDPGQ--------SS 229
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-----------F 210
S + IGC + +C + P+ CK N+TCP Y YG F
Sbjct: 230 SYRNIGCHDSRCHLVSSPDPPQPCKA---ENQTCP-----YYYWYGDSSNTTGDFALETF 281
Query: 211 TAGLLLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKF 264
T L +S + + + V N + GC + AG+ G GR S SQL F
Sbjct: 282 TVNLTMS-SGKPELRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSF 340
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
SYCL+ R DA VSS L+ G P L++T G + FYYV ++
Sbjct: 341 SYCLVDRN-SDANVSSKLIF--GEDKDLLSHPELNFTTLV---AGKENPVDTFYYVQIKS 394
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
I+VG + V IP +DG+GG I+DSG+T ++ P ++ + + F+ ++ Y D
Sbjct: 395 IVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKD 454
Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG-NEVLCLILFTDNA 443
L PC++++G + LP+ + F GA P ENYF + EV+CL +
Sbjct: 455 FPV---LEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAI----- 506
Query: 444 AGPALGRGPAI--ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
LG P+ I+G++Q QNF++ +D R GFA KCA
Sbjct: 507 ----LGTPPSALSIIGNYQQQNFHILYDTKKSRLGFAPTKCA 544
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 141/460 (30%), Positives = 207/460 (45%), Gaps = 66/460 (14%)
Query: 54 KILHSLASSSLSRARHLKTKTKPKTKD--------SNIGSNYSNSLIKTPLSVHSYGG-- 103
+I+ + +SR + K + + + K + G+ S L+ T S + G
Sbjct: 30 RIIEKKNQNDISRLKKDKERPEKQIKTVVATAASPESYGTGLSGQLMATLESGVTLGSGE 89
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + + GTPP+ + I DTGS L W C + C + N P DP K SSS
Sbjct: 90 YFMDVFIGTPPKHYS-LILDTGSDLNWIQCVPCHDCFEQNGPYYDP--------KESSSF 140
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSET--- 219
+ IGC +P+C + P+ CK N+TCP Y YG T G +ET
Sbjct: 141 RNIGCHDPRCHLVSSPDPPLPCKA---ENQTCP-----YFYWYGDSSNTTGDFATETFTV 192
Query: 220 -LRFPS-----KTVPNFLAGCSILSDRQPAGIAGFGRSSE---SLPSQLGL---KKFSYC 267
L P+ K V N + GC + G +G S SQL FSYC
Sbjct: 193 NLTSPTGKSEFKRVENVMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHSFSYC 252
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY---KNPVGSSSAFGEFYYVGLRQ 324
L+ R D VSS L+ G P L++T +NPV + FYYV ++
Sbjct: 253 LVDRN-SDTNVSSKLIF--GEDKDLLNHPELNFTTLVGGKENPVDT------FYYVQIKS 303
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
I+VG + + IP S SDG GG IVDSG+T ++ P ++ + F++++ Y D
Sbjct: 304 IMVGGEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQD 363
Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA-LVGNEVLCLILFTDNA 443
L PC+++SG + + LP+ + F GA P ENYF L EV+CL +
Sbjct: 364 FPI---LDPCYNVSGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAIL---- 416
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
R I+G++Q QNF++ +D R G+A CA
Sbjct: 417 ---GTPRSALSIIGNYQQQNFHVLYDTKKSRLGYAPMNCA 453
>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 480
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 133/411 (32%), Positives = 195/411 (47%), Gaps = 43/411 (10%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y++S + G+ + ++ DTGS LVWFPC S + C+ C S +P +S
Sbjct: 74 GDYTLSFNLGSESHKISLYM-DTGSDLVWFPC-SPFECILCEGKPKIQSPLPKIANNKSV 131
Query: 162 SSQLIGCQNPKCSWIFGPNV--ESRCKGCSPRNKTCP-LACPSYLLQYGLGFTAGLLLSE 218
S C + ++ SRC S C +CP + YG G L +
Sbjct: 132 SCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRD 191
Query: 219 TLRFPSK------TVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSY 266
+L P+ V NF GC+ + +P G+AGFGR S+PSQL +FSY
Sbjct: 192 SLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSY 251
Query: 267 CLLSRKF--DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
CL+S F D S L+L G + YT +NP FY VGL
Sbjct: 252 CLVSHSFAADRVRRPSPLIL----GRYYTGETEFIYTSLLENP-----KHPYFYSVGLAG 302
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS-RAA 383
I VG+ + P G+GGV+VDSG+TFT + L+E+V EF + G + RA
Sbjct: 303 ISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRAR 362
Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKG-GAKMALPPENYF--------ALVG--NE 432
+E+ +GL PC+ + SV +P ++L F G + + LP +NYF +VG +
Sbjct: 363 RIEENTGLSPCYYY--ENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRK 420
Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
V CL+L G GP LG++Q Q F + +DL +R GFA+++C+
Sbjct: 421 VGCLMLMNGGDEAELAG-GPGATLGNYQQQGFEVVYDLEKNRVGFARRQCS 470
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 129/445 (28%), Positives = 199/445 (44%), Gaps = 53/445 (11%)
Query: 59 LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG--YSISLSFGTPPQA 116
L S++ R + ++ + P + +S L+ T S S G Y I + G+PP+
Sbjct: 149 LKKSNVERKKPMEEVSSPAESPESYADYFSGQLMATLESGVSLGSGEYFIDVFIGSPPKH 208
Query: 117 STPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
+ I DTGS L W C + C + N P DP K S S + I C +P+C +
Sbjct: 209 FS-LILDTGSDLNWIQCVPCFDCFEQNGPYYDP--------KDSISFRNITCNDPRCQLV 259
Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYG-----------LGFTAGLLLSETLRFPSK 225
P+ CK ++CP Y YG FT L S T + +
Sbjct: 260 SSPDPPRPCKF---ETQSCP-----YFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFR 311
Query: 226 TVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVS 279
V N + GC + AG+ G GR S SQL FSYCL+ R D VS
Sbjct: 312 RVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRD-SDTSVS 370
Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
S L+ G P L++T G + FYY+ ++ I VG + ++IP
Sbjct: 371 SKLIF--GEDKDLLTHPELNFTSLI---AGKENPVDTFYYLQIKSIFVGGEKLQIPEENW 425
Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG 399
+DG GG I+DSG+T ++ P + + + F+R++ Y D L PC+++SG
Sbjct: 426 NLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDF---PILHPCYNVSG 482
Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAIILGD 458
+ PE +++F GA P ENYF + +++CL + + I+G+
Sbjct: 483 TDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAML-------GTPKSALSIIGN 535
Query: 459 FQLQNFYLEFDLANDRFGFAKQKCA 483
+Q QNF++ +D N R G+A +CA
Sbjct: 536 YQQQNFHILYDTKNSRLGYAPMRCA 560
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 129/445 (28%), Positives = 199/445 (44%), Gaps = 53/445 (11%)
Query: 59 LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG--YSISLSFGTPPQA 116
L S++ R + ++ + P + +S L+ T S S G Y I + G+PP+
Sbjct: 149 LKKSNVERKKPMEEVSSPAESPESYADYFSGQLMATLESGVSLGSGEYFIDVFIGSPPKH 208
Query: 117 STPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
+ I DTGS L W C + C + N P DP K S S + I C +P+C +
Sbjct: 209 FS-LILDTGSDLNWIQCVPCFDCFEQNGPYYDP--------KDSISFRNITCNDPRCQLV 259
Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYG-----------LGFTAGLLLSETLRFPSK 225
P+ CK ++CP Y YG FT L S T + +
Sbjct: 260 SSPDPPRPCKF---ETQSCP-----YFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFR 311
Query: 226 TVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVS 279
V N + GC + AG+ G GR S SQL FSYCL+ R D VS
Sbjct: 312 RVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRD-SDTSVS 370
Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
S L+ G P L++T G + FYY+ ++ I VG + ++IP
Sbjct: 371 SKLIF--GEDKDLLTHPELNFTSLI---AGKENPVDTFYYLQIKSIFVGGEKLQIPEENW 425
Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG 399
+DG GG I+DSG+T ++ P + + + F+R++ Y D L PC+++SG
Sbjct: 426 NLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDF---PILHPCYNVSG 482
Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAIILGD 458
+ PE +++F GA P ENYF + +++CL + + I+G+
Sbjct: 483 TDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAML-------GTPKSALSIIGN 535
Query: 459 FQLQNFYLEFDLANDRFGFAKQKCA 483
+Q QNF++ +D N R G+A +CA
Sbjct: 536 YQQQNFHILYDTKNSRLGYAPMRCA 560
>gi|118484651|gb|ABK94196.1| unknown [Populus trichocarpa]
Length = 125
Score = 160 bits (404), Expect = 2e-36, Method: Composition-based stats.
Identities = 70/123 (56%), Positives = 97/123 (78%)
Query: 360 MEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMA 419
ME P++E VAKEF +Q+ +Y+ A +V+ ++GLRPCF+ISG+KSV +PE I FKGGAKMA
Sbjct: 1 MEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRPCFNISGEKSVSVPEFIFHFKGGAKMA 60
Query: 420 LPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
LP NYF+ V + V+CL + +DN +G +G GPAIILG++Q +NF++EFDL N+RFGF +
Sbjct: 61 LPLANYFSFVDSGVICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHVEFDLKNERFGFKQ 120
Query: 480 QKC 482
Q C
Sbjct: 121 QNC 123
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 147/439 (33%), Positives = 203/439 (46%), Gaps = 67/439 (15%)
Query: 62 SSLSRARHLKTKTKPKTKDSNI----GSNYSNSLIKTPLSVHSYGG-YSISLSFGTPPQA 116
+ L R +H + K + + N S+ +S + +H+ G Y I L+ GTPP
Sbjct: 61 TKLERVQHGIKRGKSRLQKLNAMVLAASSTPDSEDQLEAPIHAGNGEYLIELAIGTPP-V 119
Query: 117 STPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
S P + DTGS L+W C RC P+ P F PK+SSS + C + CS +
Sbjct: 120 SYPAVLDTGSDLIWTQCKPCTRCYK------QPT--PIFDPKKSSSFSKVSCGSSLCSAL 171
Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK----TVPNFL 231
+ TC C Y+ YG T G+L +ET F +V N
Sbjct: 172 --------------PSSTCSDGC-EYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIG 216
Query: 232 AGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTG 287
GC ++ Q +G+ G GR SL SQL ++FSYCL DD S L+L +
Sbjct: 217 FGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEQRFSYCL--TPIDDTK-ESVLLLGSL 273
Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
D+K + TP KNP+ S FYY+ L I VG + I S G DGNG
Sbjct: 274 GKVKDAKE--VVTTPLLKNPLQPS-----FYYLSLEAISVGDTRLSIEKSTFEVGDDGNG 326
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI-SGKKSVYLP 406
GVI+DSG+T T+++ +EA+ KEFI Q A D +GL CF + SG V +P
Sbjct: 327 GVIIDSGTTITYVQQKAYEALKKEFISQT---KLALDKTSSTGLDLCFSLPSGSTQVEIP 383
Query: 407 ELILKFKGGAKMALPPENYFALVGNE---VLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
+L+ FKGG + LP ENY ++G+ V CL A G + G I G+ Q QN
Sbjct: 384 KLVFHFKGG-DLELPAENY--MIGDSNLGVACL------AMGASSGMS---IFGNVQQQN 431
Query: 464 FYLEFDLANDRFGFAKQKC 482
+ DL + F C
Sbjct: 432 ILVNHDLEKETISFVPTSC 450
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 135/405 (33%), Positives = 188/405 (46%), Gaps = 60/405 (14%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
+K P+ V G + + L+ G+PP++ + I DTGS L+W C +C D
Sbjct: 355 VKAPV-VAGNGEFLMKLAIGSPPRSFSA-IMDTGSDLIWTQCKPCQQCFD--------QS 404
Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGF 210
P F PK+SSS I C + C + P GC YL YG
Sbjct: 405 TPIFDPKQSSSFYKISCSSELCGAL--PTSTCSSDGCE------------YLYTYGDSSS 450
Query: 211 TAGLLLSETLRFPSKT-----VPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGL 261
T G+L ET F T +P GC ++ Q AG+ G GR SL SQL
Sbjct: 451 TQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKE 510
Query: 262 KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
+KF+YCL + DD+ SS L+ + + + TP KNP S FYY+
Sbjct: 511 QKFAYCLTA--IDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPS-----FYYLS 563
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
L+ I VG + IP S DG+GGVI+DSG+T T++E F ++ EFI QM +
Sbjct: 564 LQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQM---NL 620
Query: 382 AADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE---VLCLI 437
D GL CF++ +G V +P+L FK GA + LP ENY ++G+ +LCL
Sbjct: 621 PVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFK-GADLELPGENY--MIGDSKAGLLCL- 676
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A G + G I G+ Q QNF + DL + F +C
Sbjct: 677 -----AIGSSRGMS---IFGNLQQQNFMVVHDLQEETLSFLPTQC 713
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 137/396 (34%), Positives = 182/396 (45%), Gaps = 53/396 (13%)
Query: 98 VHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
VH+ G + + +S GTP A I DTGS LVW C CV+C + P F
Sbjct: 95 VHAGNGEFLMDMSIGTPAVAYAAII-DTGSDLVWTQCKP---CVECFNQST-----PVFD 145
Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLL 215
P SS+ + C + CS + P +K C Y YG T G+L
Sbjct: 146 PSSSSTYAALPCSSTLCSDL-------------PSSKCTSAKC-GYTYTYGDSSSTQGVL 191
Query: 216 LSETLRFPSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
+ET +P+ GC ++ Q AG+ G GR SL SQLGL KFSYCL S
Sbjct: 192 AAETFTLAKTKLPDVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLNKFSYCLTS- 250
Query: 272 KFDDAPVSSNLVLDTGP-GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
DD S L+ + + TP +NP S FYYV L+ + VGS
Sbjct: 251 -LDDTSKSPLLLGSLATISESAAAASSVQTTPLIRNPSQPS-----FYYVNLKGLTVGST 304
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
H+ +P S DG GGVIVDSG++ T++E + A+ K F QM AAD G
Sbjct: 305 HITLPSSAFAVQDDGTGGVIVDSGTSITYLELQGYRALKKAFAAQM--KLPAAD-GSGIG 361
Query: 391 LRPCFD--ISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPA 447
L CF+ SG V +P+L+ GA + LP ENY L G+ LCL +
Sbjct: 362 LDTCFEAPASGVDQVEVPKLVFHLD-GADLDLPAENYMVLDSGSGALCLTVMGS------ 414
Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
RG +II G+FQ QN +D+ + FA +CA
Sbjct: 415 --RGLSII-GNFQQQNIQFVYDVGENTLSFAPVQCA 447
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 135/405 (33%), Positives = 188/405 (46%), Gaps = 60/405 (14%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
+K P+ V G + + L+ G+PP++ + I DTGS L+W C +C D
Sbjct: 100 VKAPV-VAGNGEFLMKLAIGSPPRSFSA-IMDTGSDLIWTQCKPCQQCFD--------QS 149
Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGF 210
P F PK+SSS I C + C + P GC YL YG
Sbjct: 150 TPIFDPKQSSSFYKISCSSELCGAL--PTSTCSSDGCE------------YLYTYGDSSS 195
Query: 211 TAGLLLSETLRFPSKT-----VPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGL 261
T G+L ET F T +P GC ++ Q AG+ G GR SL SQL
Sbjct: 196 TQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKE 255
Query: 262 KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
+KF+YCL + DD+ SS L+ + + + TP KNP S FYY+
Sbjct: 256 QKFAYCLTA--IDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPS-----FYYLS 308
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
L+ I VG + IP S DG+GGVI+DSG+T T++E F ++ EFI QM +
Sbjct: 309 LQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQM---NL 365
Query: 382 AADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE---VLCLI 437
D GL CF++ +G V +P+L FK GA + LP ENY ++G+ +LCL
Sbjct: 366 PVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFK-GADLELPGENY--MIGDSKAGLLCL- 421
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A G + G I G+ Q QNF + DL + F +C
Sbjct: 422 -----AIGSSRGMS---IFGNLQQQNFMVVHDLQEETLSFLPTQC 458
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 136/396 (34%), Positives = 182/396 (45%), Gaps = 45/396 (11%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y++++S GTPP P I DTGS+L+W C RC FP P+ P P RSS
Sbjct: 89 GAYNMNISLGTPP-LDFPVIVDTGSNLIWAQCAPCTRC----FPR--PTPAPVLQPARSS 141
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
+ + C C ++ SR PR AC +Y YG G+TAG L +ETL
Sbjct: 142 TFSRLPCNGSFCQYL---PTSSR-----PRTCNATAAC-AYNYTYGSGYTAGYLATETLT 192
Query: 222 FPSKTVPNFLAGCSILSD-RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSS 280
T P GCS + +GI G GR SL SQL + +FSYCL S D +S
Sbjct: 193 VGDGTFPKVAFGCSTENGVDNSSGIVGLGRGPLSLVSQLAVGRFSYCLRSDMADGG--AS 250
Query: 281 NLVLDTGPGSGDSKTPG--LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSY 338
++ GS T G + TP KNP S YYV L I V S + + S
Sbjct: 251 PILF----GSLAKLTEGSVVQSTPLLKNPYLQRSTH---YYVNLTGIAVDSTELPVTGST 303
Query: 339 LVPGSDG-NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS-GLRPCFD 396
G GG IVDSG+T T++ + V + F QM N ++ L C+
Sbjct: 304 FGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYK 363
Query: 397 IS---GKKSVYLPELILKFKGGAKMALPPENYFALVGNE------VLCLILFTDNAAGPA 447
S G K+V +P L L+F GGAK +P +NYFA V + V CL++ PA
Sbjct: 364 PSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVL------PA 417
Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
P I+G+ + +L +D+ F FA CA
Sbjct: 418 TDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADCA 453
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 140/400 (35%), Positives = 185/400 (46%), Gaps = 51/400 (12%)
Query: 98 VHSYGG-YSISLSFGTP--PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA 154
VH+ G + + LS GTP P A+ I DTGS LVW C CV+C F P
Sbjct: 109 VHAGNGEFLMDLSVGTPALPYAA---IVDTGSDLVWTQCKP---CVEC-FNQT----TPV 157
Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAG 213
F P SS+ + C + C+ + S S + Y YG T G
Sbjct: 158 FDPAASSTYAALPCSSALCADLPTSTCASSSSSSSASSPC------GYTYTYGDASSTQG 211
Query: 214 LLLSETLRFPSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLL 269
+L +ET + VP GC ++ Q AG+ G GR SL SQLG+ +FSYCL
Sbjct: 212 VLATETFTLARQKVPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDRFSYCLT 271
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
S DDA S L+L + G S TP KNP S FYYV L + VG
Sbjct: 272 S--LDDAAGRSPLLLGSAAGISASAATAPAQTTPLVKNPSQPS-----FYYVSLTGLTVG 324
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
S + +P S DG GGVIVDSG++ T++E + A+ K F+ M + A +
Sbjct: 325 STRLALPSSAFAIQDDGTGGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTVDA---SE 381
Query: 389 SGLRPCFD-----ISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDN 442
GL CF + V +P+L+L F GGA + LP ENY L + LCL +
Sbjct: 382 IGLDLCFQGPAGAVDQDVQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMAS- 440
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
RG +II G+FQ QNF +D+A D FA +C
Sbjct: 441 -------RGLSII-GNFQQQNFQFVYDVAGDTLSFAPAEC 472
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 133/394 (33%), Positives = 182/394 (46%), Gaps = 41/394 (10%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y++++S GTPP P I DTGS+L+W C RC FP P+ P P RSS
Sbjct: 89 GAYNMNISLGTPP-LDFPVIVDTGSNLIWAQCAPCTRC----FPR--PTPAPVLQPARSS 141
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
+ + C C ++ SR PR AC +Y YG G+TAG L +ETL
Sbjct: 142 TFSRLPCNGSFCQYL---PTSSR-----PRTCNATAAC-AYNYTYGSGYTAGYLATETLT 192
Query: 222 FPSKTVPNFLAGCSILSD-RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSS 280
T P GCS + +GI G GR SL SQL + +FSYCL S D +S
Sbjct: 193 VGDGTFPKVAFGCSTENGVDNSSGIVGLGRGPLSLVSQLAVGRFSYCLRSDMADGG--AS 250
Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
++ G + ++ + TP KNP S YYV L I V S + + S
Sbjct: 251 PILF--GSLAKLTERSVVQSTPLLKNPYLQRSTH---YYVNLTGIAVDSTELPVTGSTFG 305
Query: 341 PGSDG-NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS-GLRPCFDIS 398
G GG IVDSG+T T++ + V + F QM N ++ L C+ S
Sbjct: 306 FTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPS 365
Query: 399 ---GKKSVYLPELILKFKGGAKMALPPENYFALVGNE------VLCLILFTDNAAGPALG 449
G K+V +P L L+F GGAK +P +NYFA V + V CL++ PA
Sbjct: 366 AGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVL------PATD 419
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
P I+G+ + +L +D+ F FA CA
Sbjct: 420 DLPISIIGNLMQMDMHLLYDIDGGMFSFAPADCA 453
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 146/441 (33%), Positives = 204/441 (46%), Gaps = 72/441 (16%)
Query: 62 SSLSRARHLKTKTKPKTKDSN---IGSNYSNSLIKTPLSVHSYGG-YSISLSFGTPPQAS 117
+ L R +H + K + + N + ++ +S + +H+ G Y + L+ GTPP S
Sbjct: 62 TKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQLEAPIHAGNGEYLMELAIGTPP-VS 120
Query: 118 TPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS 174
P + DTGS L+W PCT Y+ P+ P F PK+SSS + C + CS
Sbjct: 121 YPAVLDTGSDLIWTQCKPCTQCYK---------QPT--PIFDPKKSSSFSKVSCGSSLCS 169
Query: 175 WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK----TVPN 229
+ + TC C Y+ YG T G+L +ET F +V N
Sbjct: 170 AV--------------PSSTCSDGC-EYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHN 214
Query: 230 FLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLD 285
GC ++ Q +G+ G GR SL SQL +FSYCL DD S L+L
Sbjct: 215 IGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEPRFSYCL--TPMDDTK-ESILLLG 271
Query: 286 TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG 345
+ D+K + TP KNP+ S FYY+ L I VG + I S G DG
Sbjct: 272 SLGKVKDAKE--VVTTPLLKNPLQPS-----FYYLSLEGISVGDTRLSIEKSTFEVGDDG 324
Query: 346 NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI-SGKKSVY 404
NGGVI+DSG+T T++E FEA+ KEFI Q D +GL CF + SG V
Sbjct: 325 NGGVIIDSGTTITYIEQKAFEALKKEFISQT---KLPLDKTSSTGLDLCFSLPSGSTQVE 381
Query: 405 LPELILKFKGGAKMALPPENYFALVGNE---VLCLILFTDNAAGPALGRGPAIILGDFQL 461
+P+++ FKGG + LP ENY ++G+ V CL A G + G I G+ Q
Sbjct: 382 IPKIVFHFKGG-DLELPAENY--MIGDSNLGVACL------AMGASSGMS---IFGNVQQ 429
Query: 462 QNFYLEFDLANDRFGFAKQKC 482
QN + DL + F C
Sbjct: 430 QNILVNHDLEKETISFVPTSC 450
>gi|357128791|ref|XP_003566053.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 441
Score = 154 bits (390), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 140/437 (32%), Positives = 201/437 (45%), Gaps = 59/437 (13%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPC--TSRYRCVDCNFPNVDP 149
I P++ ++ GY +SL+ GTPPQ ++ DTGS L W PC + Y+C++C +
Sbjct: 14 IIEPIATYT-DGYLLSLNLGTPPQVFQVYL-DTGSDLTWVPCGTNTSYQCLECGNEHSIS 71
Query: 150 SRIPAFIPKRSSSSQLIGCQNPKCSWIFG-PNVESRCK--GCSP---RNKTCPLACPSYL 203
PAF +S SS C + C + N C GCS + C CP +
Sbjct: 72 KPTPAFSLSQSYSSTRDLCGSRFCVDVHSSDNSHDACAAAGCSIPVFMSGLCTRLCPPFA 131
Query: 204 LQYG-LGFTAGLLLSETLRFPSKT--------VPNFLAGCSILSDRQPAGIAGFGRSSES 254
YG G L +T+ P F GC S R+P GIAGFG+ S
Sbjct: 132 YTYGGRALVLGSLARDTIALHGSIYGISVPIEFPGFCFGCVGSSIREPIGIAGFGKGKLS 191
Query: 255 LPSQLGL--KKFSYCLLSRKFDDAP-VSSNLVLDTGPGSGD---SKTPGLSYTPFYKNPV 308
LPSQLG K FS+C L F P ++S +V+ GD S G +TP K
Sbjct: 192 LPSQLGFLDKGFSHCFLGFWFARNPNITSPMVI------GDLALSVKDGFLFTPMLK--- 242
Query: 309 GSSSAFGEFYYVGLRQIIVGSK-HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEA 367
S + FYY+GL + +G + P S S+GNGGVIVD+G+T+T + P F A
Sbjct: 243 --SLTYPNFYYIGLEGVTIGDNAAIPAPPSLSGIDSEGNGGVIVDTGTTYTHLSDP-FYA 299
Query: 368 VAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS----VYLPELILKFKGGAKMALPPE 423
+ Y+R+ ++E ++G C + + LP + + G +ALP E
Sbjct: 300 SVLSSLSSTVPYNRSYELEIRTGFDLCLKVPCMHAPCNDDELPPITVHLGGDVTLALPKE 359
Query: 424 N-YFALVG--NEVL--CL---------ILFTDNAAGPAL---GRGPAIILGDFQLQNFYL 466
+ Y+A+ N V+ CL + DN G GPA +LG FQ+QN +
Sbjct: 360 SCYYAVTAPRNSVVIKCLLFQRKDDDGVFSADNDDGEDASFSAGGPAAVLGSFQMQNVEV 419
Query: 467 EFDLANDRFGFAKQKCA 483
+DL + R GF + CA
Sbjct: 420 VYDLESGRVGFQPRDCA 436
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 154 bits (390), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 133/393 (33%), Positives = 186/393 (47%), Gaps = 57/393 (14%)
Query: 98 VHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
VH+ G + + L+ GTP + + I DTGS L+W C C D P+ P F
Sbjct: 90 VHAGNGEFLMKLAIGTPAETYSA-IMDTGSDLIWTQCKPCKDCFD------QPT--PIFD 140
Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLL 215
PK+SSS + C + C+ + + S GC YL YG T G+L
Sbjct: 141 PKKSSSFSKLPCSSDLCAAL---PISSCSDGCE------------YLYSYGDYSSTQGVL 185
Query: 216 LSETLRFPSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
+ET F +V GC +D Q AG+ G GR SL SQLG KFSYCL S
Sbjct: 186 ATETFAFGDASVSKIGFGCGEDNDGSGFSQGAGLVGLGRGPLSLISQLGEPKFSYCLTS- 244
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
DD+ S+L++ + ++ T TP +NP S FYY+ L I VG
Sbjct: 245 -MDDSKGISSLLVGSEATMKNAIT-----TPLIQNPSQPS-----FYYLSLEGISVGDTL 293
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
+ I S +DG+GG+I+DSG+T T++E F A+ KEFI Q+ D +GL
Sbjct: 294 LPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQL---KLDVDESGSTGL 350
Query: 392 RPCFDISGKKS-VYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFTDNAAGPALG 449
CF + S V +P+L+ F+ GA + LP ENY A G V+CL + + +
Sbjct: 351 DLCFTLPPDASTVDVPQLVFHFE-GADLKLPAENYIIADSGLGVICLTMGSSSGMS---- 405
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I G+FQ QN + DL + FA +C
Sbjct: 406 -----IFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 130/412 (31%), Positives = 193/412 (46%), Gaps = 73/412 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + L GTP I DTGS + W C C DC P+ P F P+ SSS
Sbjct: 139 YYVPLQVGTP-AVEVVLIMDTGSDVSWIQCVP---CKDCV-----PALRPPFNPRHSSSF 189
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
+ C + C+ ++ + CSP +TC + +QYG G ++GLL ET+
Sbjct: 190 FKLPCASSTCTNVY----QGVKPFCSPSGRTCLFS-----IQYGDGSLSSGLLAMETI-- 238
Query: 223 PSKTVPNF-----------LAGCSILSDRQ-----PAGIAGFGRSSESLPSQLG---LKK 263
+ PNF GC+ + DR+ +G+ G R S PSQL +K
Sbjct: 239 -AGNTPNFGDGEPVKLSNITLGCADI-DREGLPTGASGLLGMDRRPISFPSQLSSRYARK 296
Query: 264 FSYCLLSRKFDDAPV---SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
FS+C F D SS LV G D +P L YTP +NP S++ ++YYV
Sbjct: 297 FSHC-----FPDKIAHLNSSGLVFF---GESDIISPYLRYTPLVQNPAVPSASL-DYYYV 347
Query: 321 GLRQIIVGSKHVKIPY-SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
GL I V + + + ++ + G+GG I+DSG+ FT+++ P F+A+ +EF +
Sbjct: 348 GLVGISVDESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREF---LART 404
Query: 380 SRAADVEKKSGLRPCFDIS----GKKSVYLPELILKFKGGAKMALPPENYFALVGNE--- 432
S A V+ SG PC++I+ +S LP + L F+GG + LP + V +
Sbjct: 405 SHLAKVDDNSGFTPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQ 464
Query: 433 -VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
LCL G P I+G++Q QN ++E+DL R G A +CA
Sbjct: 465 TTLCLAFLMS-------GDIPFNIIGNYQQQNLWVEYDLEKLRLGIAPAQCA 509
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 119/410 (29%), Positives = 194/410 (47%), Gaps = 48/410 (11%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPS 150
+ P+ Y G PPQ + I DTGS+L+W C+ R C N P DPS
Sbjct: 59 VTAPIHWGGQSQYIAEYLIGDPPQRAEAII-DTGSNLIWTQCSRCRPTCFRQNLPYYDPS 117
Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF 210
R S +++ +GC + C+ G + C NKTC + + YG G
Sbjct: 118 R--------SRAARAVGCNDAACA--LGSETQ-----CLSDNKTC-----AVVTGYGAGN 157
Query: 211 TAGLLLSETLRFPSKTVPNFLAGCSILSDRQP------AGIAGFGRSSESLPSQLGLKKF 264
AG L +E L F S+TV + + GC +++ P +GI G GR SLPSQLG +F
Sbjct: 158 IAGTLATENLTFQSETV-SLVFGCIVVTKLSPGSLNGASGIIGLGRGKLSLPSQLGDTRF 216
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPG--SGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
SYC L+ F+D S++V+ G +G + + ++ PF ++P S F FYY+ L
Sbjct: 217 SYC-LTPYFEDTIEPSHMVVGASAGLINGSASSTPVTTVPFVRSP--SDDPFSTFYYLPL 273
Query: 323 RQIIVGSKHVKIPYS-----YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
I G + +P + + PG G +DSG+ T + ++A+ E RQ+G
Sbjct: 274 TGITAGKVKLAVPSAAFDLRQVAPGM--WTGTFIDSGAPLTSLVDVAYQALRAELARQLG 331
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGA----KMALPPENYFALVGNEV 433
+ + +G C + + + +P L+L F GG+ + +PP NY+A V +
Sbjct: 332 A-ALVQPLAGTTGFDLCVALKDAERL-VPPLVLHFGGGSGTGTDLVVPPANYWAPVDSAT 389
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
C+++F+ + +L ++G++ QN ++ +DLA F C+
Sbjct: 390 ACMVVFS-SVDRKSLPMNETTVIGNYMQQNMHVLYDLAGGVLSFQPADCS 438
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 130/396 (32%), Positives = 186/396 (46%), Gaps = 54/396 (13%)
Query: 99 HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
+ GGY++++S GTP + P + DTGS L+W C +C P+ P F P
Sbjct: 81 NGVGGYNMNISVGTP-LLTFPVVADTGSDLIWTQCAPCTKCFQ------QPA--PPFQPA 131
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
SS+ + C + C ++ PN +TC Y +YG G+TAG L +E
Sbjct: 132 SSSTFSKLPCTSSFCQFL--PN----------SIRTCNATGCVYNYKYGSGYTAGYLATE 179
Query: 219 TLRFPSKTVPNFLAGCSILS--DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
TL+ + P+ GCS + +GIAG GR + SL QLG+ +FSYCL R A
Sbjct: 180 TLKVGDASFPSVAFGCSTENGVGNSTSGIAGLGRGALSLIPQLGVGRFSYCL--RSGSAA 237
Query: 277 PVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
S L GS + T G + TPF NP S +YYV L I VG + +
Sbjct: 238 GASPILF-----GSLANLTDGNVQSTPFVNNPAVHPS----YYYVNLTGITVGETDLPVT 288
Query: 336 YSYLVPGSDG-NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
S +G GG IVDSG+T T++ +E V + F+ Q N + V GL C
Sbjct: 289 TSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTANVTT---VNGTRGLDLC 345
Query: 395 F-DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE------VLCLILFTDNAAGPA 447
F G + +P L+L+F GGA+ A+P YFA V + V CL++ PA
Sbjct: 346 FKSTGGGGGIAVPSLVLRFDGGAEYAVP--TYFAGVETDSQGSVTVACLMML------PA 397
Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G P ++G+ + +L +DL F F+ CA
Sbjct: 398 KGDQPMSVIGNVMQMDMHLLYDLDGGIFSFSPADCA 433
>gi|414586111|tpg|DAA36682.1| TPA: pepsin A [Zea mays]
Length = 503
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 135/415 (32%), Positives = 186/415 (44%), Gaps = 47/415 (11%)
Query: 104 YSISLSFGTPPQASTP--FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y++SLS G P A+ P DTGS LVWFPC + C+ C P+
Sbjct: 90 YTLSLSVG-PASAAAPVSLFLDTGSDLVWFPCAP-FTCMLCEG---KPTPGRLGPLPPPP 144
Query: 162 SSQLIGCQNPKCSWIFGPN------VESRCKGCSPRNKTC--PLACPSYLLQYGLGFTAG 213
S+ I C +P CS +RC +C ACP YG G
Sbjct: 145 DSRRIPCASPLCSAAHASAPPSDLCAVARCPLEDIETGSCGASHACPPLYYAYGDGSLVA 204
Query: 214 LLLSETLRFPSKT-------VPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK---K 263
L + + V NF C+ + +P G+AGFGR SLP QL + +
Sbjct: 205 HLRRGRVALGAGARASVAVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLSPQLSGR 264
Query: 264 FSYCLLSRKF--DDAPVSSNLVLDTGPGSGDS--KTPGLSYTPFYKNPVGSSSAFGEFYY 319
FSYCL+S F D S L+L P + +T G YTP NP FY
Sbjct: 265 FSYCLVSHSFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNP-----KHPYFYS 319
Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
V L + VG+ ++ GNGG++VDSG+TFT + ++ VA+ F R M
Sbjct: 320 VALEAVSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAA 379
Query: 380 SRAA--DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF-------ALVG 430
A E+++GL PC+ + +P L L F+G A +ALP NYF A G
Sbjct: 380 GFARAERAEEQTGLTPCYRYAASDR-GVPPLALHFRGNATVALPRRNYFMGFKSEDAGAG 438
Query: 431 ---NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++V CL+L A G GPA LG+FQ Q F + +D+ R GFA+++C
Sbjct: 439 TRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 493
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 128/402 (31%), Positives = 182/402 (45%), Gaps = 55/402 (13%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
L S G Y + L+ GTPP T + DTGS L+W C C D P F
Sbjct: 84 LVAASQGEYLMDLAIGTPPLRYTAMV-DTGSDLIWTQCAPCVLCAD--------QPTPYF 134
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGL 214
P RS++ +L+ C++P C+ + P R C Y YG TAG+
Sbjct: 135 RPARSATYRLVPCRSPLCAALPYPACFQR--------SVC-----VYQYYYGDEASTAGV 181
Query: 215 LLSETLRFPSKT-----VPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFSY 266
L SET F + V + GC ++ Q A G+ G GR SL SQLG +FSY
Sbjct: 182 LASETFTFGAANSSKVMVSDVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQLGPSRFSY 241
Query: 267 CLLSRKFDDAPVSSNL---VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
CL S +P S L V T G+ S S +P P+ ++A Y++ L+
Sbjct: 242 CLTSFL---SPEPSRLNFGVFATLNGTNASS----SGSPVQSTPLVVNAALPSLYFMSLK 294
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
I +G K + I DG GGV +DSG++ T+++ ++AV +E + +
Sbjct: 295 GISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTN 354
Query: 384 DVEKKSGLRPCFDISGKKS--VYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFT 440
D E GL CF S V +P++ L F GGA M +PPENY + G LCL +
Sbjct: 355 DTEI--GLETCFPWPPPPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMIR 412
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G A I+G++Q QN ++ +D+AN F C
Sbjct: 413 S---------GDATIIGNYQQQNMHILYDIANSLLSFVPAPC 445
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 143/447 (31%), Positives = 207/447 (46%), Gaps = 61/447 (13%)
Query: 45 LHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGY 104
L H DSD + RA H + + ++ + + SN+ I +P+ + G +
Sbjct: 47 LKHVDSDKNLTKFQRIQHGIKRANH-----RLERLNAMVLAASSNAEINSPV-LSGNGEF 100
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
++L+ GTPP+ + I DTGS L+W C +C D PS P F PK+SSS
Sbjct: 101 LMNLAIGTPPETYSA-IMDTGSDLIWTQCKPCTQCFD------QPS--PIFDPKKSSSFS 151
Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFP 223
+ C + C + P++ +C +C YL YG T G + +ET F
Sbjct: 152 KLSCSSQLCKAL-------------PQS-SCSDSC-EYLYTYGDYSSTQGTMATETFTFG 196
Query: 224 SKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
++PN GC ++ Q +G+ G GR SL SQL KFSYCL S DD S
Sbjct: 197 KVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLVSQLKEAKFSYCLTS--IDDTKTS 254
Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
+ L+ +G S + TP +NP+ S FYY+ L I VG + I S
Sbjct: 255 TLLMGSLASVNGTSA--AIRTTPLIQNPLQPS-----FYYLSLEGISVGGTRLPIKESTF 307
Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI-S 398
DG GG+I+DSG+T T++E F+ V KEF QMG D +GL C+++ S
Sbjct: 308 QLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMG---LPVDNSGATGLELCYNLPS 364
Query: 399 GKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLCLILFTDNAAGPALG-RGPAIIL 456
+ +P+L+L F GA + LP ENY A V+CL A+G G I
Sbjct: 365 DTSELEVPKLVLHFT-GADLELPGENYMIADSSMGVICL----------AMGSSGGMSIF 413
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
G+ Q QN ++ DL + F C
Sbjct: 414 GNVQQQNMFVSHDLEKETLSFLPTNCG 440
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 131/412 (31%), Positives = 194/412 (47%), Gaps = 73/412 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + L GTP I DTGS + W C C DC P+ P F P+ SSS
Sbjct: 138 YYVPLQLGTP-AVEVVLIMDTGSDVSWIQCVP---CKDCV-----PALRPPFNPRHSSSF 188
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
+ C + C+ ++ + CSP +TC + +QYG G ++GLL ET+
Sbjct: 189 FKLPCASSTCTNVY----QGVKPFCSPSGRTCLFS-----IQYGDGSLSSGLLAMETI-- 237
Query: 223 PSKTVPNF-----------LAGCSILSDRQ-----PAGIAGFGRSSESLPSQLG---LKK 263
+ PNF GC+ + DR+ +G+ G R S PSQL +K
Sbjct: 238 -AGNTPNFGDGEPVKLSNITLGCADI-DREGLPTGASGLLGMDRRPISFPSQLSSRYARK 295
Query: 264 FSYCLLSRKFDDAPV---SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
FS+C F D SS LV G D +P L YTP +NP S++ ++YYV
Sbjct: 296 FSHC-----FPDKIAHLNSSGLVFF---GESDIISPYLRYTPLVQNPAVPSASL-DYYYV 346
Query: 321 GLRQIIVGSKHVKIPY-SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
GL I V + + + ++ + G+GG I+DSG+ FT+++ P F+A+ +EF +
Sbjct: 347 GLVGISVDESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREF---LART 403
Query: 380 SRAADVEKKSGLRPCFDIS----GKKSVYLPELILKFKGGAKMALPPENYFALVGNE--- 432
S A V+ SG PC++I+ +S LP + L F+GG + LP + V +
Sbjct: 404 SHLAKVDDNSGFTPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQ 463
Query: 433 -VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
LCL A G P I+G++Q QN ++E+DL R G A +CA
Sbjct: 464 TTLCL-------AFQMSGDIPFNIIGNYQQQNLWVEYDLEKLRLGIAPAQCA 508
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 144/487 (29%), Positives = 212/487 (43%), Gaps = 52/487 (10%)
Query: 12 FSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK 71
FS+L++L T A +AA V V LT + +DP +L R H
Sbjct: 4 FSVLLILACTIL-ASDAAAAVRVGLTRI---------HADPEVTASEFVRGALRRDMHRH 53
Query: 72 TK-TKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVW 130
+ + + S+ + T + + G Y ++LS GTPP S I DTGS L+W
Sbjct: 54 ARFAREQLAPSSAAAAGLTVGAPTQKDLRNGGEYIMTLSIGTPPL-SYRAIADTGSDLIW 112
Query: 131 FPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNP--KCSWIFGPNVESRCKGC 188
C V + P S++ ++ C +P C+ + GP+ C
Sbjct: 113 TQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGPSPPPGC--- 169
Query: 189 SPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF------PSKTVPNFLAGCSILSDRQ- 241
AC Y YG G+TAG+ ET F P+ VPN GCS S
Sbjct: 170 ---------AC-MYNQTYGTGWTAGVQSVETFTFGSSSTPPAVRVPNIAFGCSNASSNDW 219
Query: 242 --PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLS 299
AG+ G GR S SL SQLG FSYCL F DA +S L+L + T +
Sbjct: 220 NGSAGLVGLGRGSMSLVSQLGAGAFSYCL--TPFQDANSTSTLLLGPSAAAALKGTGPVR 277
Query: 300 YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTF 359
TPF P S + +YY+ L I VG + IP +DG GG+I+DSG+T T
Sbjct: 278 STPFVAGP--SKAPMSTYYYLNLTGISVGETALAIPPDAFSLRADGTGGLIIDSGTTITT 335
Query: 360 MEGPLFEAVAKEFIRQM--GNYSRAADVEKKSGLRPCFDISGKK-SVYLPELILKFKGGA 416
+ ++ V + +R + A + +GL CF + +P + L F+GGA
Sbjct: 336 LVDSAYQQV-RAAVRSLLVTRLPLAHGPDHSTGLDLCFALKASTPPPAMPSMTLHFEGGA 394
Query: 417 KMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
M LP ENY ++G+ V CL + G ++G++Q QN ++ +D+ +
Sbjct: 395 DMVLPVENYM-ILGSGVWCLAMRNQTV-------GAMSMVGNYQQQNIHVLYDVRKETLS 446
Query: 477 FAKQKCA 483
FA C+
Sbjct: 447 FAPAVCS 453
>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
Length = 466
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 134/397 (33%), Positives = 181/397 (45%), Gaps = 46/397 (11%)
Query: 104 YSISLSFGTPPQASTPFIF-DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSS 162
Y++SLS G P AS+ +F DTGS LVWFPC + + C+ C + +P
Sbjct: 88 YTLSLSVGPPSTASSVSLFLDTGSDLVWFPC-APFTCMLCEGKATPGGNHSSPLPP-PID 145
Query: 163 SQLIGCQNPKCSWIF--GPNVE----SRCKGCSPRNKTCP-LACPSYLLQYGLG-FTAGL 214
S+ I C +P CS P + +RC + +C ACP YG G A L
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANL 205
Query: 215 LLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
S V NF C+ + +P G+AGFGR SLP+QL
Sbjct: 206 RRGRVGLAASMAVENFTFACAHTALAEPVGVAGFGRGPLSLPAQL--------------- 250
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
AP S GS D+ G S T F P+ + FY V L + VG K ++
Sbjct: 251 -APSLS--------GSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQA 301
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA--DVEKKSGLR 392
DGNGG++VDSG+TFT + F VA EF R M E ++GL
Sbjct: 302 QPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLA 361
Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEVLCLILFT---DNAAG 445
PC+ S +P + L F+G A +ALP NYF + G V CL+L +N G
Sbjct: 362 PCYHYSPSDRA-VPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDG 420
Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G GPA LG+FQ Q F + +D+ R GFA+++C
Sbjct: 421 ED-GGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 456
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 128/402 (31%), Positives = 181/402 (45%), Gaps = 55/402 (13%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
L S G Y + L+ GTPP T + DTGS L+W C C D P F
Sbjct: 84 LVAASQGEYLMDLAIGTPPLRYTAMV-DTGSDLIWTQCAPCVLCAD--------QPTPYF 134
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGL 214
P RS++ +L+ C++P C+ + P R C Y YG TAG+
Sbjct: 135 RPARSATYRLVPCRSPLCAALPYPACFQR--------SVC-----VYQYYYGDEASTAGV 181
Query: 215 LLSETLRFPSKT-----VPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFSY 266
L SET F + V + GC ++ Q A G+ G GR SL SQLG +FSY
Sbjct: 182 LASETFTFGAANSSKVMVSDVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQLGPSRFSY 241
Query: 267 CLLSRKFDDAPVSSNL---VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
CL S +P S L V T G+ S S +P P+ ++A Y++ L+
Sbjct: 242 CLTSFL---SPEPSRLNFGVFATLNGTNASS----SGSPVQSTPLVVNAALPSLYFMSLK 294
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
I +G K + I DG GGV +DSG++ T+++ ++AV E + +
Sbjct: 295 GISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTN 354
Query: 384 DVEKKSGLRPCFDISGKKS--VYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFT 440
D E GL CF S V +P++ L F GGA M +PPENY + G LCL +
Sbjct: 355 DTEI--GLETCFPWPPPPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMIR 412
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G A I+G++Q QN ++ +D+AN F C
Sbjct: 413 S---------GDATIIGNYQQQNMHILYDIANSLLSFVPAPC 445
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 128/402 (31%), Positives = 188/402 (46%), Gaps = 54/402 (13%)
Query: 99 HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
+S G Y+++LS GTPP + + DTGSSL+W C C +C P+ P F P
Sbjct: 85 NSAGAYNMNLSIGTPP-VTFSVLADTGSSLIWTQCAP---CTECA---ARPA--PPFQPA 135
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
SS+ + C + C ++ P + GC Y YG+GFTAG L +E
Sbjct: 136 SSSTFSKLPCASSLCQFLTSPYLTCNATGCV------------YYYPYGMGFTAGYLATE 183
Query: 219 TLRFPSKTVPNFLAGCSILS--DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
TL + P GCS + +GI G GRS SL SQ+G+ +FSYCL S DA
Sbjct: 184 TLHVGGASFPGVAFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCLRS----DA 239
Query: 277 PVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
+ +L GS T G + TP +NP SS+ +YYV L I VG+ + +
Sbjct: 240 DAGDSPILF---GSLAKVTGGNVQSTPLLENPEMPSSS---YYYVNLTGITVGATDLPVT 293
Query: 336 YSYL----VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE-KKSG 390
+ G+ GG IVDSG+T T++ + V + F+ QM + V + G
Sbjct: 294 STTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFG 353
Query: 391 LRPCFDIS---GKKSVYLPELILKFKGGAKMALPPENYFALVGNE------VLCLILFTD 441
CFD + G V +P L+L+F GGA+ A+ +Y +V + V CL++
Sbjct: 354 FDLCFDATAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVL-- 411
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
PA + I+G+ + ++ +DL F FA CA
Sbjct: 412 ----PASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 449
>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
Length = 504
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 136/416 (32%), Positives = 186/416 (44%), Gaps = 48/416 (11%)
Query: 104 YSISLSFGTPPQASTP--FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y++SLS G P A+ P DTGS LVWFPC + C+ C P R P
Sbjct: 90 YTLSLSVG-PASAAAPVSLFLDTGSDLVWFPCAP-FTCMLCEG-KPTPGRSGPLPPP--P 144
Query: 162 SSQLIGCQNPKCSWIFGPN------VESRCKGCSPRNKTC--PLACPSYLLQYGLGFTAG 213
S+ I C +P CS +RC +C ACP YG G
Sbjct: 145 DSRRIPCASPLCSAAHASAPPSDLCAAARCPLEDIETGSCGASHACPPLYYAYGDGSLVA 204
Query: 214 LLLSETLRFPSKT-------VPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK---K 263
L + + V NF C+ + +P G+AGFGR SLP QL + +
Sbjct: 205 HLRRGRVALGAGARASVAVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLSPQLSGR 264
Query: 264 FSYCLLSRKF--DDAPVSSNLVLDTGPGSGDS---KTPGLSYTPFYKNPVGSSSAFGEFY 318
FSYCL+S F D S L+L P D+ +T G YTP NP FY
Sbjct: 265 FSYCLVSHSFRADRLIRPSPLILGRSPDDADAAAAETDGFVYTPLLHNP-----KHPYFY 319
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
V L + VG+ ++ GNGG++VDSG+TFT + ++ VA+ F R M
Sbjct: 320 SVALEAVSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAA 379
Query: 379 YSRAA--DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---------- 426
A E+++GL PC+ + +P L L F+G A +ALP NYF
Sbjct: 380 AGFARAERAEEQTGLTPCYRYAASDR-GVPPLALHFRGNATVALPRRNYFMGFKSEDAGA 438
Query: 427 ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++V CL+L A G GPA LG+FQ Q F + +D+ R GFA+++C
Sbjct: 439 GTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 494
>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
Length = 508
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 135/414 (32%), Positives = 187/414 (45%), Gaps = 44/414 (10%)
Query: 104 YSISLSFGTPPQASTP--FIFDTGSSLVWFPCTSRYRCVDCN---FPNVDPSRIPAFIPK 158
Y++SLS G P A+ P DTGS LVWFPC + C+ C P+ S
Sbjct: 94 YTLSLSVG-PASAAAPVSLFLDTGSDLVWFPCAP-FTCMLCEGKPTPSGGHSSSAPLPLP 151
Query: 159 RSSSSQLIGCQNPKCSWIFG---PNVESRCKGCSPRN------KTCPLACPSYLLQYGLG 209
S+ + C +P CS P+ GC + + ACP YG G
Sbjct: 152 PPPDSRRVPCASPLCSAAHASAPPSDLCAAAGCPLEDIETGSCRGASHACPPLYYAYGDG 211
Query: 210 -FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK---KFS 265
A L S V NF C+ + +P G+AGFGR SLP QL + +FS
Sbjct: 212 SLVAHLRRGRVGLGASVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLAPQLSGRFS 271
Query: 266 YCLLSRKF--DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
YCL+S F D S L+L P + ++T G YTP NP FY V L
Sbjct: 272 YCLVSHSFRADRLIRPSPLILGRSPDAA-AETGGFVYTPLLHNP-----KHPYFYSVALE 325
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
+ VG+ ++ GNGG++VDSG+TFT + + VA+ F R M A
Sbjct: 326 AVSVGATRIQARPELARVDRAGNGGMVVDSGTTFTMLPNETYARVAEAFARAMAAAGFAR 385
Query: 384 --DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF-----------ALVG 430
E+++GL PC+ + +P L L F+G A +ALP NYF A
Sbjct: 386 AERAEEQTGLTPCYHYAASDR-GVPPLALHFRGNATVALPRRNYFMGFKSEEEAGGAGRK 444
Query: 431 NEVLCLILFT--DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++V CL+L D + GPA LG+FQ Q F + +D+ R GFA+++C
Sbjct: 445 DDVGCLMLMNGGDVSGEDGGDDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 498
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 151 bits (382), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 132/402 (32%), Positives = 182/402 (45%), Gaps = 59/402 (14%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
L + S G Y + + GTP + + I DTGS L+W C CVD P F
Sbjct: 84 LVLASDGEYLMEMGIGTPARFYSA-ILDTGSDLIWTQCAPCLLCVD--------QPTPYF 134
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG---FTA 212
P SS+ + +GC P C+ ++ P C KTC + QY G TA
Sbjct: 135 DPANSSTYRSLGCSAPACNALYYP----LC-----YQKTC-------VYQYFYGDSASTA 178
Query: 213 GLLLSETLRFPSK----TVPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFS 265
G+L +ET F + T+P GC L+ A G+ GFGR S SL SQLG +FS
Sbjct: 179 GVLANETFTFGTNDTRVTLPRISFGCGNLNAGSLANGSGMVGFGRGSLSLVSQLGSPRFS 238
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
YCL S +PV S L + + TPF NP A Y++ + I
Sbjct: 239 YCLTSFL---SPVRSRLYFGAYATLNSTNASTVQSTPFIINP-----ALPTMYFLNMTGI 290
Query: 326 IVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
VG + I P + +DG GG I+DSG+T T++ P + AV + F+ + + D
Sbjct: 291 SVGGNRLPIDPAVLAINDTDGTGGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLD 350
Query: 385 VEKKSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV--LCLILFT 440
V + S L CF ++SV LP+L+L F GA LP +NY LV LCL + T
Sbjct: 351 VTETSVLDTCFQWPPPPRQSVTLPQLVLHFD-GADWELPLQNYM-LVDPSTGGLCLAMAT 408
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ I+G +Q QNF + +DL N F C
Sbjct: 409 SSDGS---------IIGSYQHQNFNVLYDLENSLLSFVPAPC 441
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 151 bits (381), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 127/400 (31%), Positives = 184/400 (46%), Gaps = 51/400 (12%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y I + GTPP+ I DTGS L W C Y C + N P+ +P+ SS
Sbjct: 168 GEYFIDMFVGTPPK-HVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNE--------SS 218
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET- 219
S + I C +P+C + P+ CK N+TCP Y Y G T G ET
Sbjct: 219 SYRNISCYDPRCQLVSSPDPLQHCK---TENQTCP-----YFYDYADGSNTTGDFALETF 270
Query: 220 ---LRFPS-----KTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFS 265
L +P+ K V + + GC + G+ G GR S PSQL FS
Sbjct: 271 TVNLTWPNGKEKFKHVVDVMFGCGHWNKGFFHGAGGLLGLGRGPLSFPSQLQSIYGHSFS 330
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
YCL + F + VSS L+ D + F K G + FYY+ ++ I
Sbjct: 331 YCL-TDLFSNTSVSSKLIF-----GEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSI 384
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
+VG + + IP S+G GG I+DSGST TF ++ + + F +++ AAD
Sbjct: 385 VVGGEVLDIPEKTWHWSSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAAD- 443
Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG-NEVLCL-ILFTDNA 443
+ PC+++SG V LP+ + F GA P ENYF +EV+CL IL T N
Sbjct: 444 --DFIMSPCYNVSGAMQVELPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAILKTPNH 501
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ I+G+ QNF++ +D+ R G++ ++CA
Sbjct: 502 SH-------LTIIGNLLQQNFHILYDVKRSRLGYSPRRCA 534
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 143/453 (31%), Positives = 195/453 (43%), Gaps = 61/453 (13%)
Query: 45 LHHSD-----SDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVH 99
LHH D S P + + +R + + +G+ +S+S+I
Sbjct: 64 LHHVDALSFNSTPETLFTTRLQRDAARVEAISYLAETAGTGKRVGTGFSSSVISGL--AQ 121
Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
G Y + GTPP+ + DTGS +VW C RC + DP F P++
Sbjct: 122 GSGEYFTRIGVGTPPRY-VYMVLDTGSDIVWIQCAPCKRC----YAQSDP----VFDPRK 172
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
S S I C++P C + P GC+ + +TC Y + YG G FT G +E
Sbjct: 173 SRSFASIACRSPLCHRLDSP-------GCNTQKQTC-----MYQVSYGDGSFTFGDFSTE 220
Query: 219 TLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRK 272
TL F V GC ++ AG+ G GR S PSQ G + KFSYCL+ R
Sbjct: 221 TLTFRRTRVARVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRS 280
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
P S++V GDS + +TP NP FYYV L I VG
Sbjct: 281 ASSKP--SSMVF------GDSAVSRTARFTPLVSNP-----KLDTFYYVELLGISVGGTR 327
Query: 332 V-KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
V I S GNGGVI+DSG++ T + P + A F N RA S
Sbjct: 328 VPGITASLFKLDQTGNGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQF---SL 384
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
CFD+SGK V +P ++L F+ GA ++LP NY V + F G +
Sbjct: 385 FDTCFDLSGKTEVKVPTVVLHFR-GADVSLPASNYLIPVDTSGNFCLAFAGTMGGLS--- 440
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G+ Q Q F + +DLA R GFA CA
Sbjct: 441 ----IIGNIQQQGFRVVYDLAGSRVGFAPHGCA 469
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 123/403 (30%), Positives = 176/403 (43%), Gaps = 56/403 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + GTPP+ + I DTGS L W C Y C N DP K S+
Sbjct: 160 GEYFMDVLVGTPPKHFS-LILDTGSDLNWLQCLPCYDCFHQNEAFYDP--------KTSA 210
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-----------LGF 210
S + I C +P+CS I P +CK N++CP Y YG F
Sbjct: 211 SFKNITCNDPRCSLISSPEPPVQCKS---DNQSCP-----YFYWYGDRSNTTGDFAVETF 262
Query: 211 TAGLLLSETLRFPSKTVPNFLAGCSILS-------DRQPAGIAGFGRSSESLPSQLGLKK 263
T L +E R V N + GC + G S L S G
Sbjct: 263 TVNLTTTEG-RSSEYKVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYG-HS 320
Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
FSYCL+ R D VSS L+ G L++T F G ++ FYY+ ++
Sbjct: 321 FSYCLVDRN-SDTNVSSKLIF--GEDKDLLNHTNLNFTSFVN---GKENSVETFYYIQIK 374
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-NYSRA 382
I+VG + + IP DG GG I+DSG+T ++ P +E + +F +M NY
Sbjct: 375 SILVGGEALDIPEETWNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVF 434
Query: 383 ADVEKKSGLRPCFDISG--KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
D L PCF++SG + +++LPEL + F GA P EN F + +++CL +
Sbjct: 435 RDFPV---LDPCFNVSGIEENNIHLPELGIAFADGAVWNFPAENSFIWLSEDLVCLAIL- 490
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ I+G++Q QNF++ +D R GF KCA
Sbjct: 491 ------GTPKSTFSIIGNYQQQNFHILYDTKMSRLGFTPTKCA 527
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 130/407 (31%), Positives = 186/407 (45%), Gaps = 58/407 (14%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
IK P S G + + LS G P + I DTGS L+W C C D P+
Sbjct: 96 IKAPTHGGS-GEFLMELSIGNPAVKYSA-IVDTGSDLIWTQCKPCTECFD------QPT- 146
Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGF 210
P F P++SSS +GC + C+ + N C+ C YL YG
Sbjct: 147 -PIFDPEKSSSYSKVGCSSGLCNALPRSN-------CNEDKDAC-----EYLYTYGDYSS 193
Query: 211 TAGLLLSETLRFPSK-TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFS 265
T GLL +ET F + ++ GC + ++ Q +G+ G GR SL SQL KFS
Sbjct: 194 TRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFS 253
Query: 266 YCLLSRKFDDAP-------VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
YCL S + +A ++S +V TG T +S +NP S FY
Sbjct: 254 YCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMS---LLRNPDQPS-----FY 305
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
Y+ L+ I VG+K + + S DG GG+I+DSG+T T++E F+ + +EF +M
Sbjct: 306 YLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRM-- 363
Query: 379 YSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLCL 436
S D +GL CF + K++ +P++I FK GA + LP ENY A VLCL
Sbjct: 364 -SLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFK-GADLELPGENYMVADSSTGVLCL 421
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ + N I G+ Q QNF + DL + F +C
Sbjct: 422 AMGSSNGMS---------IFGNVQQQNFNVLHDLEKETVSFVPTECG 459
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 131/407 (32%), Positives = 187/407 (45%), Gaps = 58/407 (14%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
IK P S G + + LS G P I DTGS L+W C C D P+
Sbjct: 97 IKAPTHGGS-GEFLMELSIGNPA-VKYAAIVDTGSDLIWTQCKPCTECFD------QPT- 147
Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGF 210
P F P++SSS +GC + C+ + N C+ +C YL YG
Sbjct: 148 -PIFDPEKSSSYSKVGCSSGLCNALPRSN-------CNEDKDSC-----EYLYTYGDYSS 194
Query: 211 TAGLLLSETLRFPSK-TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFS 265
T GLL +ET F + ++ GC + ++ Q +G+ G GR SL SQL KFS
Sbjct: 195 TRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFS 254
Query: 266 YCLLSRKFDDAP-------VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
YCL S + +A ++S +V TG T +S +NP S FY
Sbjct: 255 YCLTSIEDSEASSSLFIGSLASGIVNKTGANLDGEVTKTMS---LLRNPDQPS-----FY 306
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
Y+ L+ I VG+K + + S DG GG+I+DSG+T T++E F+ + +EF +M
Sbjct: 307 YLELQGITVGAKRLSVEKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRM-- 364
Query: 379 YSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLCL 436
S D +GL CF + + K++ +P+LI FK GA + LP ENY A VLCL
Sbjct: 365 -SLPVDDSGSTGLDLCFKLPNAAKNIAVPKLIFHFK-GADLELPGENYMVADSSTGVLCL 422
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ + N I G+ Q QNF + DL + F +C
Sbjct: 423 AMGSSNGMS---------IFGNVQQQNFNVLHDLEKETVTFVPTECG 460
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 138/456 (30%), Positives = 202/456 (44%), Gaps = 71/456 (15%)
Query: 50 SDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVH-SYGGYSISL 108
+DP +L R H + SN + + P + + G Y ++L
Sbjct: 37 ADPSVTASQFVRDALRRDMHRHNARQLAASSSN------GTTVSAPTQISPTAGEYLMTL 90
Query: 109 SFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAF--IPKRSS-- 161
+ GTPP S I DTGS L+W PC+S+ C P +PS F +P SS
Sbjct: 91 AIGTPP-VSYQAIADTGSDLIWTQCAPCSSQ--CFQQPTPLYNPSSSTTFAVLPCNSSLS 147
Query: 162 --SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSET 219
++ L G P GC TC Y + YG G+T+ SET
Sbjct: 148 MCAAALAGTTPPP--------------GC-----TC-----MYNMTYGSGWTSVYQGSET 183
Query: 220 LRFPSKT------VPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLL 269
F S T VP GCS S +G+ G GR S SL SQLG+ KFSYCL
Sbjct: 184 FTFGSSTPANQTGVPGIAFGCSNASGGFNTSSASGLVGLGRGSLSLVSQLGVPKFSYCL- 242
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
+ D +S L+L GP + + T G+S TPF +P S + +YY+ L I +G+
Sbjct: 243 -TPYQDTNSTSTLLL--GPSASLNDTGGVSSTPFVASP--SDAPMSTYYYLNLTGISLGT 297
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
+ IP + L +DG GG I+DSG+T T + ++ V + + + +
Sbjct: 298 TALSIPTTALSLKADGTGGFIIDSGTTITLLGNTAYQQV-RAAVVSLVTLPTTDGGSAAT 356
Query: 390 GLRPCFDISGKKSV--YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
GL CF++ S +P + L F GA M LP ++Y L N + CL + G +
Sbjct: 357 GLDLCFELPSSTSAPPTMPSMTLHFD-GADMVLPADSYMMLDSN-LWCLAMQNQTDGGVS 414
Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
ILG++Q QN ++ +D+ + FA KC+
Sbjct: 415 -------ILGNYQQQNMHILYDVGQETLTFAPAKCS 443
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 132/400 (33%), Positives = 188/400 (47%), Gaps = 55/400 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + L GTPP+ I DTGS L W C C+DC R P F P S
Sbjct: 150 GEYLVDLYVGTPPR-RFQMIMDTGSDLNWLQCAP---CLDCF-----EQRGPVFDPATSL 200
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
S + + C +P+C + P C+ P + CP Y YG T G L E
Sbjct: 201 SYRNVTCGDPRCGLVAPPTAPRACR--RPHSDPCP-----YYYWYGDQSNTTGDLALEAF 253
Query: 221 RF------PSKTVPNFLAGCSILSDR----QPAGIAGFGRSSESLPSQLGL---KKFSYC 267
S+ V + + GC S+R AG+ G GR + S SQL FSYC
Sbjct: 254 TVNLTAPGASRRVDDVVFGCG-HSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYC 312
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDS--KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
L+ + V S +V G D+ P L+YT +++A FYYV L+ +
Sbjct: 313 LVDHG---SSVGSKIVF----GDDDALLGHPRLNYT---AFAPSAAAAADTFYYVQLKGV 362
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAAD 384
+VG + + I S G DG+GG I+DSG+T ++ P +E + + F+ +M Y AD
Sbjct: 363 LVGGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVAD 422
Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA-LVGNEVLCLILFTDNA 443
L PC+++SG + V +PE L F GA P ENYF L + ++CL +
Sbjct: 423 FPV---LSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVL---- 475
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
R I+G+FQ QNF++ +DL N+R GFA ++CA
Sbjct: 476 ---GTPRSAMSIIGNFQQQNFHVLYDLQNNRLGFAPRRCA 512
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 130/401 (32%), Positives = 187/401 (46%), Gaps = 59/401 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + GTPP+ I DTGS L W C C+DC R P F P S+
Sbjct: 148 GEYLVEVYVGTPPR-RFQMIMDTGSDLNWLQCAP---CLDCF-----DQRGPVFDPMAST 198
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
S + + C + +C + P C+ S R+ CP Y YG T G L E
Sbjct: 199 SYRNVTCGDTRCGLVSPPAAPRTCR--SSRSDPCP-----YYYWYGDQSNTTGDLALEAF 251
Query: 221 RF-----PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLL 269
S+ V + GC + AG+ G GR S SQL FSYCL+
Sbjct: 252 TVNLTASSSRRVDGVVLGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHAFSYCLV 311
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKT----PGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
+ V S +V GD P L+YT F S+A FYYV L+ I
Sbjct: 312 DHG---SAVGSKIVF------GDDNVLLSHPQLNYTAF-----APSAAENTFYYVQLKGI 357
Query: 326 IVGSKHVKIP-YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAA 383
+VG + + IP ++ V DG+GG I+DSG+T ++ P ++A+ + F+ +M Y A
Sbjct: 358 LVGGEMLDIPSNTWGVSKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIA 417
Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDN 442
D L PC+++SG + V +PE L F GA P ENYF + E ++CL +
Sbjct: 418 DFPV---LSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFIRLDTEGIMCLAVL--- 471
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
R I+G++Q QNF++ +DL ++R GFA ++CA
Sbjct: 472 ----GTPRSAMSIIGNYQQQNFHVLYDLHHNRLGFAPRRCA 508
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 132/400 (33%), Positives = 188/400 (47%), Gaps = 55/400 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + L GTPP+ I DTGS L W C C+DC R P F P S
Sbjct: 150 GEYLVDLYVGTPPR-RFQMIMDTGSDLNWLQCAP---CLDCF-----EQRGPVFDPAASL 200
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
S + + C +P+C + P C+ P + CP Y YG T G L E
Sbjct: 201 SYRNVTCGDPRCGLVAPPTAPRACR--RPHSDPCP-----YYYWYGDQSNTTGDLALEAF 253
Query: 221 RF------PSKTVPNFLAGCSILSDR----QPAGIAGFGRSSESLPSQLGL---KKFSYC 267
S+ V + + GC S+R AG+ G GR + S SQL FSYC
Sbjct: 254 TVNLTAPGASRRVDDVVFGCG-HSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYC 312
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDS--KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
L+ + V S +V G D+ P L+YT +++A FYYV L+ +
Sbjct: 313 LVDHG---SSVGSKIVF----GDDDALLGHPRLNYT---AFAPSAAAAADTFYYVQLKGV 362
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAAD 384
+VG + + I S G DG+GG I+DSG+T ++ P +E + + F+ +M Y AD
Sbjct: 363 LVGGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVAD 422
Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA-LVGNEVLCLILFTDNA 443
L PC+++SG + V +PE L F GA P ENYF L + ++CL +
Sbjct: 423 FPV---LSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVL---- 475
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
R I+G+FQ QNF++ +DL N+R GFA ++CA
Sbjct: 476 ---GTPRSAMSIIGNFQQQNFHVLYDLQNNRLGFAPRRCA 512
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 125/393 (31%), Positives = 180/393 (45%), Gaps = 57/393 (14%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
+ LS G P + I DTGS L+W C C D P+ P F P++SSS
Sbjct: 1 MELSIGNPAVKYSA-IVDTGSDLIWTQCKPCTECFD------QPT--PIFDPEKSSSYSK 51
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPS 224
+GC + C+ + N C+ C YL YG T GLL +ET F
Sbjct: 52 VGCSSGLCNALPRSN-------CNEDKDAC-----EYLYTYGDYSSTRGLLATETFTFED 99
Query: 225 K-TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP-- 277
+ ++ GC + ++ Q +G+ G GR SL SQL KFSYCL S + +A
Sbjct: 100 ENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSS 159
Query: 278 -----VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
++S +V TG T +S +NP S FYY+ L+ I VG+K +
Sbjct: 160 LFIGSLASGIVNKTGASLDGEVTKTMS---LLRNPDQPS-----FYYLELQGITVGAKRL 211
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
+ S DG GG+I+DSG+T T++E F+ + +EF +M S D +GL
Sbjct: 212 SVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRM---SLPVDDSGSTGLD 268
Query: 393 PCFDI-SGKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLCLILFTDNAAGPALGR 450
CF + K++ +P++I FK GA + LP ENY A VLCL + + N
Sbjct: 269 LCFKLPDAAKNIAVPKMIFHFK-GADLELPGENYMVADSSTGVLCLAMGSSNGMS----- 322
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+ Q QNF + DL + F +C
Sbjct: 323 ----IFGNVQQQNFNVLHDLEKETVSFVPTECG 351
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 148 bits (374), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 152/483 (31%), Positives = 212/483 (43%), Gaps = 59/483 (12%)
Query: 7 SLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSR 66
SL + +L I F +S + P + L H DS + R
Sbjct: 6 SLSLVVALAIFAFVFSHAFSTSRRVLEHPKVQNGFRAKLKHVDSGKNLTKFERIQHGVKR 65
Query: 67 ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGS 126
RH + K ++ SNS I P+ + G + + L+ GTPP+ + I DTGS
Sbjct: 66 GRHRLQRFKAMALVAS-----SNSEIDAPV-LPGNGEFLMKLAIGTPPETYSA-IMDTGS 118
Query: 127 SLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCK 186
L+W C +C D P+ P F PK+SSS + C + C +
Sbjct: 119 DLIWTQCKPCTQCFD------QPT--PIFDPKKSSSFSKLSCSSKLCEAL---------- 160
Query: 187 GCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTVPNFLAGCSILSD----RQ 241
P++ TC C YL YG T G+L SETL F +VP GC ++ Q
Sbjct: 161 ---PQS-TCSDGC-EYLYGYGDYSSTQGMLASETLTFGKVSVPEVAFGCGEDNEGSGFSQ 215
Query: 242 PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYT 301
+G+ G GR SL SQL KFSYCL S DD S+ L+ G + S +
Sbjct: 216 GSGLVGLGRGPLSLVSQLKEPKFSYCLTS--VDDTKASTLLM-------GSLASVKASDS 266
Query: 302 PFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFME 361
P+ +SA FYY+ L I VG + I S DG+GG+I+DSG+T T++E
Sbjct: 267 EIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLE 326
Query: 362 GPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMAL 420
F+ VAKEF Q+ + D +GL CF + SG + +P+L+ F GA + L
Sbjct: 327 QSAFDLVAKEFTSQI---NLPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFD-GADLEL 382
Query: 421 PPENYF-ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
P ENY A V CL A G + G I G+ Q QN + DL + F
Sbjct: 383 PAENYMIADASMGVACL------AMGSSSGMS---IFGNIQQQNMLVLHDLEKETLSFLP 433
Query: 480 QKC 482
+C
Sbjct: 434 TQC 436
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 122/402 (30%), Positives = 177/402 (44%), Gaps = 54/402 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + GTPP+ + I DTGS L W C Y C N DP K S+
Sbjct: 158 GEYFMDVLVGTPPKHFS-LILDTGSDLNWLQCLPCYDCFHQNGMFYDP--------KTSA 208
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
S + I C +P+CS I P+ +C+ N++CP Y YG T G ET
Sbjct: 209 SFKNITCNDPRCSLISSPDPPVQCES---DNQSCP-----YFYWYGDRSNTTGDFAVETF 260
Query: 221 RFPSKT---------VPNFLAGCSILS-------DRQPAGIAGFGRSSESLPSQLGLKKF 264
T V N + GC + G S L S G F
Sbjct: 261 TVNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYG-HSF 319
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
SYCL+ R + VSS L+ G L++T F G ++ FYY+ ++
Sbjct: 320 SYCLVDRN-SNTNVSSKLIF--GEDKDLLNHTNLNFTSFVN---GKENSVETFYYIQIKS 373
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-NYSRAA 383
I+VG K + IP SDG+GG I+DSG+T ++ P +E + +F +M NY
Sbjct: 374 ILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFR 433
Query: 384 DVEKKSGLRPCFDISG--KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
D L PCF++SG + +++LPEL + F G P EN F + +++CL +
Sbjct: 434 DFPV---LDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAIL-- 488
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ I+G++Q QNF++ +D R GF KCA
Sbjct: 489 -----GTPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKCA 525
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 129/394 (32%), Positives = 185/394 (46%), Gaps = 55/394 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
GGY++++S GTP + + DTGS L+W C +C P+ P F P SS
Sbjct: 84 GGYNMNISVGTP-LLTFSVVADTGSDLIWTQCAPCTKCFQ------QPA--PPFQPASSS 134
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
+ + C + C ++ PN +TC Y +YG G+TAG L +ETL+
Sbjct: 135 TFSKLPCTSSFCQFL--PN----------SIRTCNATGCVYNYKYGSGYTAGYLATETLK 182
Query: 222 FPSKTVPNFLAGCSILS--DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
+ P+ GCS + +GIAG GR + SL QLG+ +FSYCL R A S
Sbjct: 183 VGDASFPSVAFGCSTENGVGNSTSGIAGLGRGALSLIPQLGVGRFSYCL--RSGSAAGAS 240
Query: 280 SNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSY 338
L GS + T G + TPF NP S +YYV L I VG + + S
Sbjct: 241 PILF-----GSLANLTDGNVQSTPFVNNPAVHPS----YYYVNLTGITVGETDLPVTTST 291
Query: 339 LVPGSDG-NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
+G GG IVDSG+T T++ +E V + F+ Q + + V GL CF
Sbjct: 292 FGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTT---VNGTRGLDLCFKS 348
Query: 398 S--GKKSVYLPELILKFKGGAKMALPPENYFALVGNE------VLCLILFTDNAAGPALG 449
+ G + +P L+L+F GGA+ A+P YFA V + V CL++ PA G
Sbjct: 349 TGGGGGGIAVPSLVLRFDGGAEYAVP--TYFAGVETDSQGSVTVACLMML------PAKG 400
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
P ++G+ + +L +DL F FA CA
Sbjct: 401 DQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADCA 434
>gi|125552953|gb|EAY98662.1| hypothetical protein OsI_20585 [Oryza sativa Indica Group]
Length = 429
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 140/430 (32%), Positives = 201/430 (46%), Gaps = 55/430 (12%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPC--TSRYRCVDCNFPNVDP 149
I P++ ++ GY +SL+ G PPQ ++ DTGS L W PC S Y+C++C +
Sbjct: 14 IIEPVTTYT-DGYLLSLNLGMPPQVFQVYL-DTGSDLTWVPCGTNSSYQCLECGNEHSTS 71
Query: 150 SRIPAFIPKRSSSSQLIGCQNPKCSWIFG-PNVESRCK--GC---SPRNKTCPLACPSYL 203
IP+F P +SSS+ C + C I N C GC S + C CP +
Sbjct: 72 KPIPSFSPSQSSSNMKELCGSRFCVDIHSSDNSHDPCAAVGCAIPSFMSGLCTRPCPPFS 131
Query: 204 LQYGLG-FTAGLLLSETLRFPSKT--------VPNFLAGCSILSDRQPAGIAGFGRSSES 254
YG G G L + + VP F GC S R+P GIAGFG+ S
Sbjct: 132 YTYGGGALVLGSLAKDIVTLHGSIFGIAILLDVPGFCFGCVGSSIREPIGIAGFGKGILS 191
Query: 255 LPSQLGL--KKFSYCLLSRKFDDAP-VSSNLVLDTGPGSGD---SKTPGLSYTPFYK--- 305
LPSQLG K FS+C L +F P +S+L++ GD S +TP K
Sbjct: 192 LPSQLGFLDKGFSHCFLGFRFARNPNFTSSLIM------GDLALSAKDDFLFTPMLKSIT 245
Query: 306 NPVGSSSAFGEFYYVGLRQIIVGS-KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL 364
NP FYY+GL + +G + P S S+GNGG+IVD+G+T+T + P
Sbjct: 246 NP--------NFYYIGLEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPF 297
Query: 365 FEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS----VYLPELILKFKGGAKMAL 420
+ A+ + Y R+ D+E ++G CF I + LP + F G K+ L
Sbjct: 298 YTAILSSLASVI-LYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTL 356
Query: 421 PPEN-YFALVG--NEVL--CLIL--FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
P ++ Y+A+ N V+ CL+ D GP +LG FQ+QN + +D+
Sbjct: 357 PKDSCYYAVTAPKNSVVVKCLLFQRMDDEDDVGGANNGPGAVLGSFQMQNVEVVYDMEAG 416
Query: 474 RFGFAKQKCA 483
R GF + CA
Sbjct: 417 RIGFQPKDCA 426
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 122/402 (30%), Positives = 173/402 (43%), Gaps = 53/402 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + L G PPQ S I DTGS LVW C++ C +C+ + P+ + F P+ SS
Sbjct: 81 GQYFVDLRIGQPPQ-SLLLIADTGSDLVWVKCSA---CRNCS--HHSPATV--FFPRHSS 132
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ C +P C + P RC + TCP Y Y G T+GL ET
Sbjct: 133 TFSPAHCYDPVCRLVPKPGRAPRCNHTR-IHSTCP-----YEYGYADGSLTSGLFARETT 186
Query: 221 RFPSKT-----VPNFLAGCSILSDRQPA---------GIAGFGRSSESLPSQLGLK---K 263
+ + + + GC Q G+ G GR S SQLG + K
Sbjct: 187 SLKTSSGKEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNK 246
Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
FSYCL+ P S ++ G G L +TP NP+ + FYYV L+
Sbjct: 247 FSYCLMDYTLSPPPTSYLII-----GDGGDAVSKLFFTPLLTNPLSPT-----FYYVKLK 296
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
+ V ++I S GNGG ++DSG+T F+ P + V ++Q A
Sbjct: 297 SVFVNGAKLRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAA-VKQRIKLPNAD 355
Query: 384 DVEKKSGLRPCFDISG--KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
E G C ++SG K LP L +F GGA PP NYF ++ CL +
Sbjct: 356 --ELTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAI--- 410
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ P +G ++G+ Q F EFD R GF+++ CA
Sbjct: 411 QSVDPKVGFS---VIGNLMQQGFLFEFDRDRSRLGFSRRGCA 449
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 157/490 (32%), Positives = 220/490 (44%), Gaps = 70/490 (14%)
Query: 16 ILLFTTDAGAGSSAATVTV-------PLTPLSTKHYLHHSDSDPLKI-LHSLASSSLSRA 67
+LLF + A S T+T+ PL L S PL + LH L S SL++
Sbjct: 10 LLLFFFISTAASEFQTLTLRSLPTPSPLPLFPDSQSLQSSPDAPLTLDLHHLDSLSLNKT 69
Query: 68 ------RHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFI 121
L T ++ + +S+S++ + LS S G Y L GTPP+ +
Sbjct: 70 PTDLFNLRLHRDTLRVHALNSRAAGFSSSVV-SGLSQGS-GEYFTRLGVGTPPRY-LYMV 126
Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
DTGS +VW C+ +C + DP F P +S S I C +P C +
Sbjct: 127 LDTGSDVVWLQCSPCRKC----YSQSDP----IFNPYKSKSFAGIPCSSPLCRRL----- 173
Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSD- 239
GCS R TC Y + YG G FT G +ETL F + GC ++
Sbjct: 174 --DSSGCSTRRHTC-----LYQVSYGDGSFTTGDFATETLTFRGNKIAKVALGCGHHNEG 226
Query: 240 --RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
AG+ G GR S PSQ G++ KFSYCL+ R P S++V GD+
Sbjct: 227 LFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKP--SSMVF------GDAA 278
Query: 295 TPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK-IPYSYLVPGSDGNGGVIVD 352
L+ +TP +NP FYYVGL I VG V+ + S S GNGGVI+D
Sbjct: 279 ISRLARFTPLIRNP-----KLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIID 333
Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
SG++ T + P + A+ F + R + S C+D+SG+ SV +P ++L F
Sbjct: 334 SGTSVTRLTRPAYTALRDAFRVGARHLKRGPEF---SLFDTCYDLSGQSSVKVPTVVLHF 390
Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAN 472
+ GA MALP NY V F +G + I+G+ Q Q F + +DLA
Sbjct: 391 R-GADMALPATNYLIPVDENGSFCFAFAGTISGLS-------IIGNIQQQGFRVVYDLAG 442
Query: 473 DRFGFAKQKC 482
R GFA + C
Sbjct: 443 SRIGFAPRGC 452
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 130/406 (32%), Positives = 187/406 (46%), Gaps = 61/406 (15%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
L+ + G Y + LS GTPP A P I DTGS L W C C F P+ P +
Sbjct: 88 LAENGAGAYHMILSVGTPPLA-FPAIIDTGSDLTWTQCAP---CTTACF--AQPT--PLY 139
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCK--GCSPRNKTCPLACPSYLLQYGLGFTAG 213
P RSS+ + C +P C + P+ C GC Y +Y +GFTAG
Sbjct: 140 DPARSSTFSKLPCASPLCQAL--PSAFRACNATGCV------------YDYRYAVGFTAG 185
Query: 214 LLLSETLRF--------PSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLK 262
L ++TL S + GCS + +GI G GRS+ SL SQ+G+
Sbjct: 186 YLAADTLAIGDGDGDGDASSSFAGVAFGCSTANGGDMDGASGIVGLGRSALSLLSQIGVG 245
Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
+FSYCL R DA S L +GD + T +NPV + +YYV L
Sbjct: 246 RFSYCL--RSDADAGASPILFGALANVTGDK----VQSTALLRNPVAARRR-APYYYVNL 298
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
I VGS + + S + G GGVIVDSG+TFT++ + + + F+ Q A
Sbjct: 299 TGIAVGSTDLPVTSSTFGFTAAGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQT-----A 353
Query: 383 ADVEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENYFALV--GNEVLCLI 437
+ + SG + FD+ +G +P L+ +F GGA+ A+P ++YF V G V CL+
Sbjct: 354 GLLTRVSGAQFDFDLCFEAGAADTPVPRLVFRFAGGAEYAVPRQSYFDAVDEGGRVACLL 413
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ RG ++I G+ + ++ +DL F FA CA
Sbjct: 414 VLPT--------RGVSVI-GNVMQMDLHVLYDLDGATFSFAPADCA 450
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 129/394 (32%), Positives = 186/394 (47%), Gaps = 51/394 (12%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y ++L+ GTPP S P I DTGS L+W +C C + P + P S+
Sbjct: 86 GEYIMTLAIGTPP-LSYPAIADTGSDLIW------TQCAPCGSQCFKQAGQP-YNPSSST 137
Query: 162 SSQLIGCQNP--KCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSET 219
+ ++ C + C+ + GP S GCS C Y YG G+TAG+ ET
Sbjct: 138 TFGVLPCNSSVSMCAALAGP---SPPPGCS---------C-MYNQTYGTGWTAGIQSVET 184
Query: 220 LRFPSK-----TVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
F S VP GCS S AG+ G GR S SL SQLG FSYCL
Sbjct: 185 FTFGSTPADQTRVPGIAFGCSNASSDDWNGSAGLVGLGRGSMSLVSQLGAGMFSYCL--T 242
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
F DA +S L+L GP + + T G+ TPF +P S + +YY+ L I +G+
Sbjct: 243 PFQDANSTSTLLL--GPSAALNGT-GVLTTPFVASP--SKAPMSTYYYLNLTGISIGTTA 297
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
+ IP + +DG GG+I+DSG+T T + ++ V + I + AD +GL
Sbjct: 298 LSIPPNAFALRTDGTGGLIIDSGTTITSLVDAAYQQV-RAAIESLVTLP-VADGSDSTGL 355
Query: 392 RPCFDISGKKSV--YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
CF ++ + S +P + F GA M LP +NY ++G+ V CL +
Sbjct: 356 DLCFALTSETSTPPSMPSMTFHFD-GADMVLPVDNYM-ILGSGVWCLAMRNQTV------ 407
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G G++Q QN +L +D+ + FA KC+
Sbjct: 408 -GAMSTFGNYQQQNVHLLYDIHEETLSFAPAKCS 440
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 135/470 (28%), Positives = 207/470 (44%), Gaps = 64/470 (13%)
Query: 29 AATVTVPLTPLSTKHYLHHSDSDP---LKILHSLASSSLSRARHLKTKTKPKTKDSNIGS 85
+ + P + S LHH P L+++ S ++ ++ K K + + S
Sbjct: 15 VSAIVAPTSSTSRGTLLHHGQKRPQPGLRVVLEQVDSGMNLTKYELIKRAIKRGERRMRS 74
Query: 86 N----YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD 141
S+S I+TP+ S G Y ++++ GTP +S I DTGS L+W C +C
Sbjct: 75 INAMLQSSSGIETPVYAGS-GEYLMNVAIGTPA-SSLSAIMDTGSDLIWTQCEPCTQCFS 132
Query: 142 CNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS 201
P+ P F P+ SSS + C++ C + + + C+
Sbjct: 133 ------QPT--PIFNPQDSSSFSTLPCESQYCQDLPSESCYNDCQ--------------- 169
Query: 202 YLLQYGLGF-TAGLLLSETLRFPSKTVPNFLAGCSILSDRQP------AGIAGFGRSSES 254
Y YG G T G + +ET F + +VPN GC D Q AG+ G G S
Sbjct: 170 YTYGYGDGSSTQGYMATETFTFETSSVPNIAFGCG--EDNQGFGQGNGAGLIGMGWGPLS 227
Query: 255 LPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
LPSQLG+ +FSYC+ S + S L L + +P + NP
Sbjct: 228 LPSQLGVGQFSYCMTSSG---SSSPSTLALGSAASGVPEGSPSTTLIHSSLNPT------ 278
Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
+YY+ L+ I VG ++ IP S DG GG+I+DSG+T T++ + AVA+ F
Sbjct: 279 --YYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTD 336
Query: 375 QMGNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV 433
Q+ N S E SGL CF + S +V +PE+ ++F GG + L EN V
Sbjct: 337 QI-NLSPVD--ESSSGLSTCFQLPSDGSTVQVPEISMQFDGGV-LNLGEENVLISPAEGV 392
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+CL + + + G + I G+ Q Q + +DL N F +C
Sbjct: 393 ICLAMGSSSQQGIS-------IFGNIQQQETQVLYDLQNLAVSFVPTQCG 435
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 126/401 (31%), Positives = 183/401 (45%), Gaps = 58/401 (14%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDP 149
++TP+ G Y ++LS GTP Q PF I DTGS L+W C +C +
Sbjct: 84 VETPVYAGD-GEYLMNLSIGTPAQ---PFSAIMDTGSDLIWTQCQPCTQCFN-------- 131
Query: 150 SRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG 209
P F P+ SSS + C + C + P CS N +C Y YG G
Sbjct: 132 QSTPIFNPQGSSSFSTLPCSSQLCQALQSPT-------CS--NNSC-----QYTYGYGDG 177
Query: 210 F-TAGLLLSETLRFPSKTVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLKKF 264
T G + +ETL F S ++PN GC AG+ G GR SLPSQL + KF
Sbjct: 178 SETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKF 237
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
SYC+ + SS L+L + S + +P + SS FYY+ L
Sbjct: 238 SYCMTPIGSSN---SSTLLLGSLANSVTAGSPNTTLI--------QSSQIPTFYYITLNG 286
Query: 325 IIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
+ VGS + I P + + ++G GG+I+DSG+T T+ ++AV + FI QM N S
Sbjct: 287 LSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQAFISQM-NLSVVN 345
Query: 384 DVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
SG CF + S + ++ +P ++ F GG + LP ENYF N ++CL + + +
Sbjct: 346 G--SSSGFDLCFQMPSDQSNLQIPTFVMHFDGG-DLVLPSENYFISPSNGLICLAMGSSS 402
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+ Q QN + +D N F +C
Sbjct: 403 QG--------MSIFGNIQQQNLLVVYDTGNSVVSFLSAQCG 435
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 126/401 (31%), Positives = 183/401 (45%), Gaps = 58/401 (14%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDP 149
++TP+ G Y ++LS GTP Q PF I DTGS L+W C +C +
Sbjct: 84 VETPVYAGD-GEYLMNLSIGTPAQ---PFSAIMDTGSDLIWTQCQPCTQCFN-------- 131
Query: 150 SRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG 209
P F P+ SSS + C + C + P CS N +C Y YG G
Sbjct: 132 QSTPIFNPQGSSSFSTLPCSSQLCQALQSPT-------CS--NNSC-----QYTYGYGDG 177
Query: 210 F-TAGLLLSETLRFPSKTVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLKKF 264
T G + +ETL F S ++PN GC AG+ G GR SLPSQL + KF
Sbjct: 178 SETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKF 237
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
SYC+ + SS L+L + S + +P + SS FYY+ L
Sbjct: 238 SYCMTPIG---SSTSSTLLLGSLANSVTAGSPNTTLI--------ESSQIPTFYYITLNG 286
Query: 325 IIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
+ VGS + I P + + ++G GG+I+DSG+T T+ ++AV + FI QM N S
Sbjct: 287 LSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFADNAYQAVRQAFISQM-NLSVVN 345
Query: 384 DVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
SG CF + S + ++ +P ++ F GG + LP ENYF N ++CL + + +
Sbjct: 346 G--SSSGFDLCFQMPSDQSNLQIPTFVMHFDGG-DLVLPSENYFISPSNGLICLAMGSSS 402
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+ Q QN + +D N F +C
Sbjct: 403 QG--------MSIFGNIQQQNLLVVYDTGNSVVSFLFAQCG 435
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 144/484 (29%), Positives = 216/484 (44%), Gaps = 56/484 (11%)
Query: 11 LFSLLILLF-TTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARH 69
+ SL +L+F A S AA+V V LT +H SDP +L R H
Sbjct: 8 MASLAVLVFLVVCATLASGAASVRVGLT------RIH---SDPDITAPEFVRDALRRDMH 58
Query: 70 LKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
+ +++ S+ + +T + + G Y ++LS GTPP S P I DTGS L+
Sbjct: 59 -RQQSRSLFGRELAESDGTTVSARTRKDLPNGGEYLMTLSIGTPP-LSYPAIADTGSDLI 116
Query: 130 WFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCS 189
W +C C+ P + P S++ ++ C + S C G
Sbjct: 117 W------TQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCNSSL----------SMCAGVL 160
Query: 190 PRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT-----VPNFLAGCSILSDRQ--- 241
P Y YG G+TAG+ SET F S VP GCS S
Sbjct: 161 AGKAPPPGCACMYNQTYGTGWTAGVQGSETFTFGSAAADQARVPGIAFGCSNASSSDWNG 220
Query: 242 PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYT 301
AG+ G GR S SL SQLG +FSYCL F D +S L+L GP + + T G+ T
Sbjct: 221 SAGLVGLGRGSLSLVSQLGAGRFSYCL--TPFQDTNSTSTLLL--GPSAALNGT-GVRST 275
Query: 302 PFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFME 361
PF +P + + +YY+ L I +G+K + I +DG GG+I+DSG+T T +
Sbjct: 276 PFVASP--AKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSGTTITSLV 333
Query: 362 GPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV--YLPELILKFKGGAKMA 419
++ V + ++ + A D +GL C+ + S +P + L F GA M
Sbjct: 334 NAAYQQV-RAAVQSLVTLP-AIDGSDSTGLDLCYALPTPTSAPPAMPSMTLHFD-GADMV 390
Query: 420 LPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
LP ++Y + G+ V CL + G G++Q QN ++ +D+ N+ FA
Sbjct: 391 LPADSYM-ISGSGVWCLAMRNQT-------DGAMSTFGNYQQQNMHILYDVRNEMLSFAP 442
Query: 480 QKCA 483
KC+
Sbjct: 443 AKCS 446
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 130/404 (32%), Positives = 185/404 (45%), Gaps = 58/404 (14%)
Query: 88 SNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNV 147
S+S I+ P+ + G + + L+ GTPP+ + I DTGS L+W C +C + P
Sbjct: 82 SSSEIEAPV-LPGNGEFLMKLAIGTPPETYSA-ILDTGSDLIWTQCKPCTQCFHQSTPIF 139
Query: 148 DPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG 207
DP + +F SS C+ GC YL YG
Sbjct: 140 DPKKSSSFSKLSCSSQLCEALPQSSCN-----------NGCE------------YLYSYG 176
Query: 208 -LGFTAGLLLSETLRFPSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLK 262
T G+L SETL F +VPN GC ++ Q AG+ G GR SL SQL
Sbjct: 177 DYSSTQGILASETLTFGKASVPNVAFGCGADNEGSGFSQGAGLVGLGRGPLSLVSQLKEP 236
Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
KFSYCL + DD S+ +L S ++ + + TP +P A FYY+ L
Sbjct: 237 KFSYCLTT--VDDTKTST--LLMGSLASVNASSSAIKTTPLIHSP-----AHPSFYYLSL 287
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
I VG + I S DG+GG+I+DSG+T T++E F VAKEF ++ +
Sbjct: 288 EGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKI---NLP 344
Query: 383 ADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE---VLCLIL 438
D +GL CF + SG ++ +P+L+ F GA + LP ENY ++G+ V CL
Sbjct: 345 VDSSGSTGLDVCFTLPSGSTNIEVPKLVFHFD-GADLELPAENY--MIGDSSMGVACL-- 399
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A G + G I G+ Q QN + DL + F +C
Sbjct: 400 ----AMGSSSGMS---IFGNVQQQNMLVLHDLEKETLSFLPTQC 436
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 138/457 (30%), Positives = 201/457 (43%), Gaps = 62/457 (13%)
Query: 45 LHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGY 104
L +DP +L R H K S+ G+ S +P + G Y
Sbjct: 36 LTRVHADPSVTASQFVRGALRRDMHRHNARKLALAASS-GATVSAPTQNSPTA----GEY 90
Query: 105 SISLSFGTPPQASTPF--IFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAF--IP 157
++L+ GTPP P+ I DTGS L+W PCTS+ C P +PS F +P
Sbjct: 91 LMALAIGTPP---LPYQAIADTGSDLIWTQCAPCTSQ--CFRQPTPLYNPSSSTTFAVLP 145
Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLS 217
SS S C+ + GC AC +Y + YG G+T+ S
Sbjct: 146 CNSSLS--------VCAAALAGTGTAPPPGC---------AC-TYNVTYGSGWTSVFQGS 187
Query: 218 ETLRFPS-----KTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCL 268
ET F S VP GCS S +G+ G GR SL SQLG+ KFSYCL
Sbjct: 188 ETFTFGSTPAGQSRVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCL 247
Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
+ D +S L+L GP + + T G+S TPF +P S++ FYY+ L I +G
Sbjct: 248 --TPYQDTNSTSTLLL--GPSASLNGTAGVSSTPFVASP--STAPMNTFYYLNLTGISLG 301
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
+ + IP + +DG GG+I+DSG+T T + ++ V + + D
Sbjct: 302 TTALSIPPDAFLLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLV--TLPTTDGSAA 359
Query: 389 SGLRPCFDISGKKSV--YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
+GL CF + S +P + L F GA M LP ++Y + + CL +
Sbjct: 360 TGLDLCFMLPSSTSAPPAMPSMTLHFN-GADMVLPADSYMMSDDSGLWCLAMQNQT---- 414
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G ILG++Q QN ++ +D+ + FA KC+
Sbjct: 415 ---DGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 448
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 122/391 (31%), Positives = 177/391 (45%), Gaps = 57/391 (14%)
Query: 102 GGYSISLSFGTPPQASTPF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
G Y ++LS GTP Q PF I DTGS L+W C +C + P F P+
Sbjct: 93 GEYLMNLSIGTPAQ---PFSAIMDTGSDLIWTQCQPCTQCFN--------QSTPIFNPQG 141
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSE 218
SSS + C + C + P CS N C Y YG G T G + +E
Sbjct: 142 SSSFSTLPCSSQLCQALSSPT-------CS--NNFC-----QYTYGYGDGSETQGSMGTE 187
Query: 219 TLRFPSKTVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
TL F S ++PN GC AG+ G GR SLPSQL + KFSYC+
Sbjct: 188 TLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIG-- 245
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
+ SNL+L + S + +P + SS FYY+ L + VGS + I
Sbjct: 246 -SSTPSNLLLGSLANSVTAGSPNTTLI--------QSSQIPTFYYITLNGLSVGSTRLPI 296
Query: 335 -PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
P ++ + ++G GG+I+DSG+T T+ +++V +EFI Q+ + SG
Sbjct: 297 DPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQI---NLPVVNGSSSGFDL 353
Query: 394 CFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
CF S ++ +P ++ F GG + LP ENYF N ++CL + + +
Sbjct: 354 CFQTPSDPSNLQIPTFVMHFDGG-DLELPSENYFISPSNGLICLAMGSSSQG-------- 404
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+ Q QN + +D N FA +C
Sbjct: 405 MSIFGNIQQQNMLVVYDTGNSVVSFASAQCG 435
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 121/397 (30%), Positives = 182/397 (45%), Gaps = 62/397 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + L+ GTPPQ DTGS L+W C C D +P F RSS++
Sbjct: 35 YLVHLAIGTPPQP-VQLTLDTGSDLIWTQCKPCVSCFD--------QPLPYFDTSRSSTN 85
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
L+ C++ +C P V + C + +TC +Y YG T GLL ++ F
Sbjct: 86 ALLPCESTQCK--LDPTV-TVCVKLNQTVQTC-----AYYTSYGDNSVTIGLLAADKFTF 137
Query: 223 PSKT-VPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
+ T +P GC + + + GIAGFGR SLPSQL + FS+C +
Sbjct: 138 VAGTSLPGVTFGCGLNNTGVFNSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTIT---GA 194
Query: 278 VSSNLVLD-------TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
+ S ++LD G G+ + TP + Y NP YY+ L+ I VGS
Sbjct: 195 IPSTVLLDLPADLFSNGQGAVQT-TPLIQYAKNEANPT--------LYYLSLKGITVGST 245
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+ +P S ++G GG I+DSG++ T + +++ V EF Q+ +G
Sbjct: 246 RLPVPESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI---KLPVVPGNATG 301
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCLILFTDNAAGP 446
CF + +P+L+L F+ GA M LP ENY V GN ++CL
Sbjct: 302 HYTCFSAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICL---------- 350
Query: 447 ALGRG-PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A+ +G I+G+FQ QN ++ +DL N+ F +C
Sbjct: 351 AINKGDETTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 387
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 115/386 (29%), Positives = 174/386 (45%), Gaps = 36/386 (9%)
Query: 110 FGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169
G PPQ + I DTGS+L+W C++ C + + P RS +++ + C
Sbjct: 77 IGDPPQQAEAII-DTGSNLIWTQCST------CQPAGCFSQNLSFYDPSRSRTARPVACN 129
Query: 170 NPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPN 229
+ C+ E+RC + NK C + L YG G G+L +E F ++
Sbjct: 130 DTACAL----GSETRC---ARDNKAC-----AVLTAYGAGVIGGVLGTEAFTFQPQSENV 177
Query: 230 FLA-GCSILSDRQP------AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNL 282
LA GC + P +GI G GR + SL SQLG KFSYCL + F + +S L
Sbjct: 178 SLAFGCIAATRLTPGSLDGASGIIGLGRGNLSLVSQLGDNKFSYCL-TPYFSQSTNTSRL 236
Query: 283 VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPG 342
+ G P S PF KNP F FYY+ L I VG + +P +
Sbjct: 237 FVGASAGLSSGGAPATS-VPFLKNP--DVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLR 293
Query: 343 SDGNG---GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS- 398
G G ++DSGS FT + ++A+ E ++Q+G S GL C ++
Sbjct: 294 QVATGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLG-ASIVPPPAGAEGLDLCAAVAH 352
Query: 399 GKKSVYLPELILKF-KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILG 457
G +P L+L F GG +A+PPENY+ V + C+++F+ L I+G
Sbjct: 353 GDVGKLVPPLVLHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTIIG 412
Query: 458 DFQLQNFYLEFDLANDRFGFAKQKCA 483
++ Q+ +L +DL F C+
Sbjct: 413 NYMQQDMHLLYDLEKGMLSFQPADCS 438
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 138/457 (30%), Positives = 200/457 (43%), Gaps = 62/457 (13%)
Query: 45 LHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGY 104
L +DP +L R H K S+ G+ S +P + G Y
Sbjct: 38 LTRVHADPSVTASQFVRGALRRDMHRHNARKLALAASS-GATVSAPTQDSPTA----GEY 92
Query: 105 SISLSFGTPPQASTPF--IFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAF--IP 157
++L+ GTPP P+ I DTGS L+W PCTS+ C P +PS F +P
Sbjct: 93 LMALAIGTPP---LPYQAIADTGSDLIWTQCAPCTSQ--CFRQPTPLYNPSSSTTFAVLP 147
Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLS 217
SS S C+ + GC AC +Y + YG G+T+ S
Sbjct: 148 CNSSLS--------VCAAALAGTGTAPPPGC---------AC-TYNVTYGSGWTSVFQGS 189
Query: 218 ETLRFPSK-----TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCL 268
ET F S VP GCS S +G+ G GR SL SQLG+ KFSYCL
Sbjct: 190 ETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCL 249
Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
+ D +S L+L GP + + T G+S TPF +P S++ FYY+ L I +G
Sbjct: 250 --TPYQDTNSTSTLLL--GPSASLNGTAGVSSTPFVASP--STAPMNTFYYLNLTGISLG 303
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
+ + IP +DG GG+I+DSG+T T + ++ V + + D
Sbjct: 304 TTALSIPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLV--TLPTTDGSAD 361
Query: 389 SGLRPCFDISGKKSV--YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
+GL CF + S +P + L F GA M LP ++Y + + CL +
Sbjct: 362 TGLDLCFMLPSSTSAPPAMPSMTLHFN-GADMVLPADSYMMSDDSGLWCLAMQNQT---- 416
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G ILG++Q QN ++ +D+ + FA KC+
Sbjct: 417 ---DGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 450
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 124/404 (30%), Positives = 178/404 (44%), Gaps = 56/404 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIP--AFIPKR 159
G Y +SL GTPPQ + + DTGS L+W C+ C +C+ R P AF +
Sbjct: 84 GQYFVSLRIGTPPQ-TLLLVADTGSDLIWVKCSP---CRNCS------HRSPGSAFFARH 133
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
S++ I C +P+C + P+ P N+T + Y Y T G E
Sbjct: 134 STTYSAIHCYSPQCQLVPHPHPN-------PCNRTRLHSPCRYQYTYADSSTTTGFFSKE 186
Query: 219 TLRFPSKT-----VPNFLAGCSI---------LSDRQPAGIAGFGRSSESLPSQLGLK-- 262
L + T + GC S G+ G GR+ S SQLG +
Sbjct: 187 ALTLNTSTGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFG 246
Query: 263 -KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
KFSYCL+ P +S L + SK +S+TP NP+ + FYY+
Sbjct: 247 SKFSYCLMDYTLSPPP-TSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPT-----FYYIA 300
Query: 322 LRQIIVGSKHVKIPYSYLVPGSD--GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
++ + V VK+P + V D GNGG I+DSG+T TF+ P + + K F +++
Sbjct: 301 IKGVYVNG--VKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLP 358
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
S A E G C ++SG LP + GG+ + PP NYF G+++ CL
Sbjct: 359 SPA---EPTPGFDLCMNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCL--- 412
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A P G +LG+ Q F LEFD R GF ++ CA
Sbjct: 413 ---AVQPVSQDGGFSVLGNLMQQGFLLEFDRDKSRLGFTRRGCA 453
>gi|297724243|ref|NP_001174485.1| Os05g0511050 [Oryza sativa Japonica Group]
gi|222632192|gb|EEE64324.1| hypothetical protein OsJ_19161 [Oryza sativa Japonica Group]
gi|255676482|dbj|BAH93213.1| Os05g0511050 [Oryza sativa Japonica Group]
Length = 432
Score = 144 bits (363), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 142/434 (32%), Positives = 204/434 (47%), Gaps = 60/434 (13%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPC--TSRYRCVDCNFPNVDP 149
I P++ ++ GY +SL+ G PPQ ++ DTGS L W PC S Y+C++C +
Sbjct: 14 IIEPVTTYT-DGYLLSLNLGMPPQVFQVYL-DTGSDLTWVPCGTNSSYQCLECGNEHSTS 71
Query: 150 SRIPAFIPKRSSSSQLIGCQNPKCSWIFG-PNVESRCK--GC---SPRNKTCPLACPSYL 203
IP+F P +SSS+ C + C I N C GC S + C CP +
Sbjct: 72 KPIPSFSPSQSSSNMKELCGSRFCVDIHSSDNSHDPCAAVGCAIPSFMSDLCTRPCPPFS 131
Query: 204 LQYGLG-FTAGLLLSETLRFPSKT--------VPNFLAGCSILSDRQPAGIAGFGRSSES 254
YG G G L + + VP F GC S R+P GIAGFG+ S
Sbjct: 132 YTYGGGALVLGSLAKDIVTLHGSIFGIAILLDVPGFCFGCVGSSIREPIGIAGFGKGILS 191
Query: 255 LPSQLGL--KKFSYCLLSRKFDDAP-VSSNLVLDTGPGSGD---SKTPGLSYTPFYK--- 305
LPSQLG K FS+C L +F P +S+L++ GD S +TP K
Sbjct: 192 LPSQLGFLDKGFSHCFLGFRFARNPNFTSSLIM------GDLALSAKDDFLFTPMLKSIT 245
Query: 306 NPVGSSSAFGEFYYVGLRQIIVGS-KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL 364
NP FYY+GL + +G + P S S+GNGG+IVD+G+T+T + P
Sbjct: 246 NP--------NFYYIGLEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPF 297
Query: 365 FEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS----VYLPELILKFKGGAKMAL 420
+ A+ + Y R+ D+E ++G CF I + LP + F G K+ L
Sbjct: 298 YTAILSSLASVI-LYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTL 356
Query: 421 PPEN-YFALVG--NEVL--CLIL------FTDNAAGPALGRGPAIILGDFQLQNFYLEFD 469
P ++ Y+A+ N V+ CL+ D+ G A GP +LG FQ+QN + +D
Sbjct: 357 PKDSCYYAVTAPKNSVVVKCLLFQRMDNDDDDDDVGGA-NNGPGAVLGSFQMQNVEVVYD 415
Query: 470 LANDRFGFAKQKCA 483
+ R GF + CA
Sbjct: 416 MEAGRIGFQPKDCA 429
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 144 bits (363), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 138/409 (33%), Positives = 190/409 (46%), Gaps = 71/409 (17%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
L + S G Y +S+ GTPP+ + I DTGS L+W C CVD P DP++ P++
Sbjct: 81 LVLASEGEYLMSMGIGTPPRYYSA-ILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSY 139
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG---FTA 212
+ C +P C+ ++ P C RN C + QY G TA
Sbjct: 140 AK--------LPCNSPMCNALYYP----LCY----RN-VC-------VYQYFYGDSANTA 175
Query: 213 GLLLSETLRFPSK----TVPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGLKKFS 265
G+L +ET F + TVP GC L S +G+ GFGR SL SQLG +FS
Sbjct: 176 GVLSNETFTFGTNDTRVTVPRIAFGCGNLNAGSLFNGSGMVGFGRGPLSLVSQLGSPRFS 235
Query: 266 YCLLSRKFDDAPVSSNLVLD---TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
YCL S +PV S L T + S + TPF NP YY+ +
Sbjct: 236 YCLTSFM---SPVPSRLYFGAYATLNSTSASTGEPVQSTPFIVNP-----GLPTMYYLNM 287
Query: 323 RQIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG---- 377
I VG + + I P + + +DG GGVI+DSGST T++ ++ V + F Q+G
Sbjct: 288 TGISVGGELLPIDPSVFAINDADGTGGVIIDSGSTITYLARAAYDMVHQAFADQVGLPLT 347
Query: 378 NYSRAADVEKKSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VL 434
N + ADV L CF +K V +PEL F+ GA M LP ENY + G+ L
Sbjct: 348 NATSLADV-----LDTCFVWPPPPRKIVTMPELAFHFE-GANMELPLENYMLIDGDTGNL 401
Query: 435 CL-ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
CL I +D+ + I+G FQ QNF++ +D N F C
Sbjct: 402 CLAIAASDDGS----------IIGSFQHQNFHVLYDNENSLLSFTPATC 440
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 128/401 (31%), Positives = 183/401 (45%), Gaps = 59/401 (14%)
Query: 102 GGYSISLSFGTPP---QASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAF 155
G Y ++L+ GTPP QA I DTGS L+W PCTS+ C P +PS F
Sbjct: 30 GEYLMALAIGTPPLPYQA----IADTGSDLIWTQCAPCTSQ--CFRQPTPLYNPSSSTTF 83
Query: 156 --IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAG 213
+P SS S C+ + GC AC +Y + YG G+T+
Sbjct: 84 AVLPCNSSLS--------VCAAALAGTGTAPPPGC---------AC-TYNVTYGSGWTSV 125
Query: 214 LLLSETLRFPSK-----TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKF 264
SET F S VP GCS S +G+ G GR SL SQLG+ KF
Sbjct: 126 FQGSETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKF 185
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
SYCL + D +S L+L GP + + T G+S TPF +P S++ FYY+ L
Sbjct: 186 SYCL--TPYQDTNSTSTLLL--GPSASLNGTAGVSSTPFVASP--STAPMNTFYYLNLTG 239
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
I +G+ + IP +DG GG+I+DSG+T T + ++ V + + D
Sbjct: 240 ISLGTTALSIPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLV--TLPTTD 297
Query: 385 VEKKSGLRPCFDISGKKSV--YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
+GL CF + S +P + L F GA M LP ++Y + + CL +
Sbjct: 298 GSADTGLDLCFMLPSSTSAPPAMPSMTLHFN-GADMVLPADSYMMSDDSGLWCLAMQNQT 356
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G ILG++Q QN ++ +D+ + FA KC+
Sbjct: 357 -------DGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 390
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 148/448 (33%), Positives = 200/448 (44%), Gaps = 75/448 (16%)
Query: 49 DSDPLKILHSLASS----SLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGY 104
D+ +K L SLA++ +L+RAR G +S+S+I G Y
Sbjct: 103 DAARVKSLISLAATVGGTNLTRAR---------------GPGFSSSVISGL--AQGSGEY 145
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
L GTP + + DTGS +VW C C+ C + DP F P +S S
Sbjct: 146 FTRLGVGTPARY-VYMVLDTGSDIVWIQCAP---CIKC-YSQTDP----VFDPTKSRSFA 196
Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFP 223
I C +P C + P GCS + + C Y + YG G FT G +ETL F
Sbjct: 197 NIPCGSPLCRRLDYP-------GCSTKKQICL-----YQVSYGDGSFTVGEFSTETLTFR 244
Query: 224 SKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAP 277
V + GC ++ AG+ G GR S PSQ+G + KFSYCL R P
Sbjct: 245 GTRVGRVVLGCGHDNEGLFVGAAGLLGLGRGRLSFPSQIGRRFNSKFSYCLGDRSASSRP 304
Query: 278 VSSNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK-IP 335
S++V GDS + +TP NP FYYV L I VG V I
Sbjct: 305 --SSIVF------GDSAISRTTRFTPLLSNP-----KLDTFYYVELLGISVGGTRVSGIS 351
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
S S GNGGVI+DSG++ T + + A+ F+ N RA + S CF
Sbjct: 352 ASLFKLDSTGNGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEF---SLFDTCF 408
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
D+SGK V +P ++L F+ GA + LP NY V N F A+G + I
Sbjct: 409 DLSGKTEVKVPTVVLHFR-GADVPLPASNYLIPVDNSGSFCFAFAGTASGLS-------I 460
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+G+ Q Q F + +DLA R GFA + CA
Sbjct: 461 IGNIQQQGFRVVYDLATSRVGFAPRGCA 488
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 125/402 (31%), Positives = 178/402 (44%), Gaps = 53/402 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + L G PPQ S I DTGS LVW C++ C +C+ + P+ + F P+ SS
Sbjct: 82 GQYFVDLRIGQPPQ-SLLLIADTGSDLVWVKCSA---CRNCS--HHSPATV--FFPRHSS 133
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ C +P C + P+ C N T + Y Y G T+GL ET
Sbjct: 134 TFSPAHCYDPVCRLVPKPDRAPIC------NHTRIHSTCHYEYGYADGSLTSGLFARETT 187
Query: 221 RFPSKT-----VPNFLAGCSILSDRQPA---------GIAGFGRSSESLPSQLGLK---K 263
+ + + + GC Q G+ G GR S SQLG + K
Sbjct: 188 SLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNK 247
Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
FSYCL+ P +S L++ G G G SK L +TP NP+ + FYYV L+
Sbjct: 248 FSYCLMDYTLSPPP-TSYLIIGNG-GDGISK---LFFTPLLTNPLSPT-----FYYVKLK 297
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
+ V ++I S GNGG +VDSG+T F+ P + +V R++ A
Sbjct: 298 SVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVK--LPIA 355
Query: 384 DVEKKSGLRPCFDISG--KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
D G C ++SG K LP L +F GGA PP NYF ++ CL +
Sbjct: 356 DA-LTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAI--- 411
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ P +G ++G+ Q F EFD R GF+++ CA
Sbjct: 412 QSVDPKVGFS---VIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 144/458 (31%), Positives = 193/458 (42%), Gaps = 68/458 (14%)
Query: 45 LHH-----SDSDPLKILHSLASSSLSRARHLKT---KTKPKTKDSNIGSNYSNSLIKTPL 96
LHH SD P + +S + SR + L + + G +S+S+ T
Sbjct: 82 LHHLDALSSDETPQDLFNSRLARDASRVKSLTSLAAAVGSTNRTRARGPGFSSSV--TSG 139
Query: 97 SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
G Y L GTP + + DTGS +VW C +C + DP F
Sbjct: 140 LAQGSGEYFTRLGVGTPARY-VFMVLDTGSDVVWIQCAPCKKC----YSQTDP----VFN 190
Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLL 215
P +S S I C +P C + P GCS + C Y + YG G FT G
Sbjct: 191 PTKSRSFANIPCGSPLCRRLDSP-------GCSTKKHIC-----LYQVSYGDGSFTYGEF 238
Query: 216 LSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSE-----SLPSQLGLK---KFSYC 267
+ETL F V GC D + I G S PSQ+G + KFSYC
Sbjct: 239 STETLTFRGTRVGRVALGCG--HDNEGLFIGAAGLLGLGRGRLSFPSQIGRRFSRKFSYC 296
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQII 326
L+ R P S +V GDS + +TP NP FYYV L +
Sbjct: 297 LVDRSASSKP--SYMVF------GDSAISRTARFTPLVSNP-----KLDTFYYVELLGVS 343
Query: 327 VGSKHV-KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
VG V I S S GNGGVI+DSG++ T + P + A+ F N RA +
Sbjct: 344 VGGTRVPGITASLFKLDSTGNGGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEF 403
Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
S CFD+SGK V +P ++L F+ GA ++LP NY V N F +G
Sbjct: 404 ---SLFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPASNYLIPVDNSGSFCFAFAGTMSG 459
Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ I+G+ Q Q F + +DLA R GFA + CA
Sbjct: 460 LS-------IVGNIQQQGFRVVYDLAASRVGFAPRGCA 490
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 113/370 (30%), Positives = 168/370 (45%), Gaps = 54/370 (14%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
+ DTGS + W C C DC + DP + P S+S +GC +P+C +
Sbjct: 178 MVLDTGSDVTWLQCQP---CADC-YAQSDP----VYDPSVSTSYATVGCDSPRCRDL--- 226
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
C +C Y + YG G +T G +ETL S V N GC
Sbjct: 227 ----DAAACRNSTGSCL-----YEVAYGDGSYTVGDFATETLTLGDSAPVSNVAIGCG-- 275
Query: 238 SDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
D + + G + S PSQ+ FSYCL+ R D+P SS L GD
Sbjct: 276 HDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDR---DSPSSSTLQF------GD 326
Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVD 352
S+ P ++ P ++P ++ FYYV L I VG + + IP S G+GGVIVD
Sbjct: 327 SEQPAVT-APLIRSPRTNT-----FYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVD 380
Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
SG+ T ++ + A+ + F++ + RA+ V S C+D++G+ SV +P + L F
Sbjct: 381 SGTAVTRLQSGAYGALREAFVQGTQSLPRASGV---SLFDTCYDLAGRSSVQVPAVALWF 437
Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAN 472
+GG ++ LP +NY V + F + GP I+G+ Q Q + FD A
Sbjct: 438 EGGGELKLPAKNYLIPVDAAGTYCLAFAGTS-------GPVSIIGNVQQQGVRVSFDTAK 490
Query: 473 DRFGFAKQKC 482
+ GF KC
Sbjct: 491 NTVGFTADKC 500
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 124/399 (31%), Positives = 176/399 (44%), Gaps = 50/399 (12%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G+PP+ + I DTGS L W C Y C N DP K S+
Sbjct: 168 GEYFMDVLVGSPPKHFS-LILDTGSDLNWIQCLPCYDCFQQNGAFYDP--------KASA 218
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
S + I C + +C+ + P+ CK N++CP Y YG T G ET
Sbjct: 219 SYKNITCNDQRCNLVSSPDPPMPCKS---DNQSCP-----YYYWYGDSSNTTGDFAVETF 270
Query: 221 RFPSKT---------VPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFS 265
T V N + GC + AG+ G GR S SQL FS
Sbjct: 271 TVNLTTNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFS 330
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
YCL+ R D VSS L+ G P L++T F G + FYYV ++ I
Sbjct: 331 YCLVDRN-SDTNVSSKLIF--GEDKDLLSHPNLNFTSFV---AGKENLVDTFYYVQIKSI 384
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAAD 384
+V + + IP SDG GG I+DSG+T ++ P +E + + + G Y D
Sbjct: 385 LVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRD 444
Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA 444
L PCF++SG +V LPEL + F GA P EN F + +++CL +
Sbjct: 445 FPI---LDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAML----- 496
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ I+G++Q QNF++ +D R G+A KCA
Sbjct: 497 --GTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCA 533
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 158/469 (33%), Positives = 212/469 (45%), Gaps = 75/469 (15%)
Query: 35 PLTPLSTKHYLHHSDS-----DPLKILHSLASSSLSRAR---HLKTKTKPKTKDSNIGSN 86
P T LS LHH D+ P ++ H +R + HL T KT+ +N GS
Sbjct: 60 PTTSLS----LHHIDALSFNKTPSQLFHLRLERDAARVKTLTHLAAATN-KTRPANPGSG 114
Query: 87 YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCN 143
+S+S++ LS S G Y L GTPP+ + DTGS +VW PCT Y D
Sbjct: 115 FSSSVVSG-LSQGS-GEYFTRLGVGTPPKYLY-MVLDTGSDVVWLQCKPCTKCYSQTD-- 169
Query: 144 FPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYL 203
F P +S S I C +P C + P GCS +N C Y
Sbjct: 170 ---------QIFDPSKSKSFAGIPCYSPLCRRLDSP-------GCSLKNNLC-----QYQ 208
Query: 204 LQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQL 259
+ YG G FT G +ETL F VP GC ++ AG+ G GR S P+Q
Sbjct: 209 VSYGDGSFTFGDFSTETLTFRRAAVPRVAIGCGHDNEGLFVGAAGLLGLGRGGLSFPTQT 268
Query: 260 GLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFG 315
G + KFSYCL R P S++V GDS + +TP KNP
Sbjct: 269 GTRFNNKFSYCLTDRTASAKP--SSIVF------GDSAVSRTARFTPLVKNP-----KLD 315
Query: 316 EFYYVGLRQIIVGSKHVK-IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
FYYV L I VG V+ I S+ S GNGGVI+DSG++ T + P + ++ F
Sbjct: 316 TFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSGTSVTRLTRPAYVSLRDAFRV 375
Query: 375 QMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL 434
+ RA + S C+D+SG V +P ++L F+G A ++LP NY V N
Sbjct: 376 GASHLKRAPEF---SLFDTCYDLSGLSEVKVPTVVLHFRG-ADVSLPAANYLVPVDNSGS 431
Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
F +G + I+G+ Q Q F + FDLA R GFA + CA
Sbjct: 432 FCFAFAGTMSGLS-------IIGNIQQQGFRVVFDLAGSRVGFAPRGCA 473
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 129/406 (31%), Positives = 185/406 (45%), Gaps = 42/406 (10%)
Query: 90 SLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDP 149
S P++ S G+S+++ GTPPQ T I DTGS L+W C+ R +
Sbjct: 70 SAADVPVAPLSDQGHSLTVGIGTPPQPRT-LIVDTGSDLIWTQCSMLSRRTR-TAASASR 127
Query: 150 SRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG 209
R P + P+RSSS + C + C + K C+ RN C Y YG
Sbjct: 128 QREPLYEPRRSSSFAYLPCSDRLCQ-----EGQFSYKNCA-RNNRC-----MYDELYGSA 176
Query: 210 FTAGLLLSETLRF--PSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKKF 264
G+L SET F +K GC LS +G+ G SL SQL + +F
Sbjct: 177 EAGGVLASETFTFGVNAKVSLPLGFGCGALSAGDLVGASGLMGLSPGIMSLVSQLSVPRF 236
Query: 265 SYCLL---SRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYV 320
SYCL RK +S L+ +T G + T +NP ++ +YYV
Sbjct: 237 SYCLTPFAERK------TSPLLFGAMADLRRYRTTGTVQTTSILRNPAMETA----YYYV 286
Query: 321 GLRQIIVGSKHVKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
L + +G+K + +P + L + DG+GG IVDSGST +++E F AV K + +
Sbjct: 287 PLVGLSLGTKRLDVPATSLGMIKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLP 346
Query: 380 SRAADVEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL 436
E CF + ++V P L+L F GGA M LP +NYF ++CL
Sbjct: 347 VANGTDEDYDDYELCFALPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRAGLMCL 406
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ T G G +II G+ Q QN ++ FD+ N +F FA KC
Sbjct: 407 AVGTSPD-----GFGVSII-GNVQQQNMHVLFDVRNQKFSFAPTKC 446
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 128/400 (32%), Positives = 179/400 (44%), Gaps = 52/400 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G+PP+ + I DTGS L W C + C N DP K S+
Sbjct: 153 GEYFMDVLVGSPPKHFS-LILDTGSDLNWIQCLPCHDCFQQNGAFYDP--------KASA 203
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
S + I C +P+C+ + P+ CK N++CP Y YG T G ET
Sbjct: 204 SYKNITCNDPRCNLVSPPDPPKPCKS---DNQSCP-----YYYWYGDSSNTTGDFAVETF 255
Query: 221 RFPSKT---------VPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFS 265
T V N + GC + AG+ G GR S SQL FS
Sbjct: 256 TVNLTTSGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFS 315
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
YCL+ R D VSS L+ G P L++T F + FYYV ++ I
Sbjct: 316 YCLVDRN-SDTNVSSKLIF--GEDKDLLSHPNLNFTSFVAR---KENLVDTFYYVQIKSI 369
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAAD 384
IV + + IP SDG GG I+DSG+T ++ P +E + + + G Y D
Sbjct: 370 IVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRD 429
Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL-ILFTDNA 443
L PCF++SG S+ LPEL + F GA P EN F + +++CL IL T +
Sbjct: 430 FPI---LDPCFNVSGIDSIQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAILGTPKS 486
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A I+G++Q QNF++ +D R G+A KCA
Sbjct: 487 AFS--------IIGNYQQQNFHILYDTKRSRLGYAPTKCA 518
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 136/447 (30%), Positives = 191/447 (42%), Gaps = 53/447 (11%)
Query: 57 HSLASSSLSR---ARH--LKTKTKPKTKDSNIGSNYSN-----SLIKTPLSVHSYGGYSI 106
H A SSLSR RH +KT+ + + SN S LS S G+S+
Sbjct: 34 HPYAGSSLSRHDVVRHGARASKTRAAWLTAKLAGVLSNRRGGVSPADVRLSPLSDQGHSL 93
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
++ GTPPQ I DTGS L+W C + P + P SS+ +
Sbjct: 94 TVGIGTPPQPRK-LIVDTGSDLIWTQC----KLSSSTAVAARHGSPPVYDPGESSTFAFL 148
Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT 226
C + C + K C+ +N+ Y YG G+L SET F ++
Sbjct: 149 PCSDRLCQ-----EGQFSFKNCTSKNRCV------YEDVYGSAAAVGVLASETFTFGARR 197
Query: 227 VPNFLAG--CSILSDRQ---PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSN 281
+ G C LS GI G S SL +QL +++FSYCL F D S
Sbjct: 198 AVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYCLT--PFADKKTSPL 255
Query: 282 LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVP 341
L S T + T NPV + +YYV L I +G K + +P + L
Sbjct: 256 LFGAMADLSRHKTTRPIQTTAIVSNPVKTV-----YYYVPLVGISLGHKRLAVPAASLAM 310
Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI---- 397
DG GG IVDSGST ++ FEAV KE + + A + L CF +
Sbjct: 311 RPDGGGGTIVDSGSTVAYLVEAAFEAV-KEAVMDVVRLPVANRTVEDYEL--CFVLPRRT 367
Query: 398 --SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
+ ++V +P L+L F GGA M LP +NYF ++CL A G I
Sbjct: 368 AAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCL------AVGKTTDGSGVSI 421
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
+G+ Q QN ++ FD+ + +F FA +C
Sbjct: 422 IGNVQQQNMHVLFDVQHHKFSFAPTQC 448
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 136/457 (29%), Positives = 203/457 (44%), Gaps = 55/457 (12%)
Query: 42 KHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY 101
+ +L ++ D ++I ++ S + + P+ S + +++ ++V S
Sbjct: 92 ESFLDKAEKDAVRIETMHRRAARSGVARMPASSSPR----RALSERMVATVESGVAVGS- 146
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y I + GTPP+ I DTGS L W C C+DC R P F P SS
Sbjct: 147 GEYLIDVYVGTPPRRFR-MIMDTGSDLNWLQCAP---CLDCF-----EQRGPVFDPAASS 197
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
S + + C + +C + P C+ P +CP Y YG T G L E+
Sbjct: 198 SYRNVTCGDQRCGLVAPPEAPRACR--RPAEDSCP-----YYYWYGDQSNTTGDLALESF 250
Query: 221 RF------PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCL 268
S+ V + GC + AG+ G GR S SQL FSYCL
Sbjct: 251 TVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCL 310
Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
+ D S +V G P L YT F +SS FYYV L+ ++VG
Sbjct: 311 VEHGSD---AGSKVVF--GEDYLVLAHPQLKYTAF----APTSSPADTFYYVKLKGVLVG 361
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAADVEK 387
+ I G DG+GG I+DSG+T ++ P ++ + + F+ M Y D
Sbjct: 362 GDLLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPV 421
Query: 388 KSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA-LVGNEVLCLILFTDNAAGP 446
L PC+++SG + +PEL L F GA P ENYF L + ++CL + G
Sbjct: 422 ---LNPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGM 478
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ I+G+FQ QNF++ +DL N+R GFA ++CA
Sbjct: 479 S-------IIGNFQQQNFHVVYDLQNNRLGFAPRRCA 508
>gi|297800470|ref|XP_002868119.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313955|gb|EFH44378.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 499
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 129/413 (31%), Positives = 184/413 (44%), Gaps = 71/413 (17%)
Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS----------------SSQL 165
DTGS LVWFPC + C+ C + PS P ++ SS L
Sbjct: 98 LDTGSDLVWFPCRP-FTCILCESKPLPPSPPPTLSSSATTVSCSSPSCSAAHSSLPSSDL 156
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSK 225
N +I C+ T CP + YG G L S++L PS
Sbjct: 157 CAISNCPLDYI-------ETGDCN----TSSYPCPPFYYAYGDGSLVAKLFSDSLSLPSV 205
Query: 226 TVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLSRKFDDAPVS 279
+V NF GC+ + +P G+AGFGR SLP+QL + FSYCL+S FD V
Sbjct: 206 SVANFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLSVHSPHLGNSFSYCLVSHSFDSDRVR 265
Query: 280 --SNLVLDTGPGSGDSKTPG----------------LSYTPFYKNPVGSSSAFGEFYYVG 321
S L+L + + +T NP FY V
Sbjct: 266 RPSPLILGRFVDKKEKRVATTDDDDDGDETKKKKNEFVFTEMLVNP-----KHPYFYSVS 320
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YS 380
L+ I +G +++ P +G GGV+VDSG+TFT + + +V +EF ++G +
Sbjct: 321 LQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHE 380
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKG-GAKMALPPENYFALVGN-------- 431
RA VE SG+ PC+ ++ ++V +P L+L F G G+ + LP NYF +
Sbjct: 381 RADRVEPSSGMSPCYYLN--QTVKVPALVLHFAGNGSTVTLPRRNYFYEFMDGGDGKEEK 438
Query: 432 -EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+V CL+L G G ILG++Q Q F + +DL N R GFAK+KCA
Sbjct: 439 RKVGCLMLMNGGDESELRG-GTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCA 490
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 141 bits (355), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 134/391 (34%), Positives = 181/391 (46%), Gaps = 55/391 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y L GTPP+ + DTGS +VW C +C + D P F PK+S
Sbjct: 145 GEYFTRLGVGTPPKY-VYMVLDTGSDVVWIQCAPCRKC----YSQTD----PVFDPKKSG 195
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S I C++P C + P GC+ R ++C Y + YG G FT G +ETL
Sbjct: 196 SFSSISCRSPLCLRLDSP-------GCNSR-QSCL-----YQVAYGDGSFTFGEFSTETL 242
Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFD 274
F VP GC ++ AG+ G GR S P+Q GL +KFSYCL+ R
Sbjct: 243 TFRGTRVPKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLRFGRKFSYCLVDRSAS 302
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK- 333
P S++V S S+T +TP NP FYY+ L I VG V
Sbjct: 303 SKP--SSVVFGQ---SAVSRTA--VFTPLITNP-----KLDTFYYLELTGISVGGARVAG 350
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
I S + GNGGVI+DSG++ T + + ++ F + RA D S
Sbjct: 351 ITASLFKLDTAGNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDY---SLFDT 407
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG-NEVLCLILFTDNAAGPALGRGP 452
CFD+SGK V +P +++ F+ GA ++LP NY V N V C AG G
Sbjct: 408 CFDLSGKTEVKVPTVVMHFR-GADVSLPATNYLIPVDTNGVFCFAF-----AGTMSGLS- 460
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G+ Q Q F + FD+A R GFA + CA
Sbjct: 461 --IIGNIQQQGFRVVFDVAASRIGFAARGCA 489
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 141 bits (355), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 138/424 (32%), Positives = 185/424 (43%), Gaps = 73/424 (17%)
Query: 84 GSNYSNSLIKTPLS---VHSYGGYSISLSFGTPPQASTP--FIFDTGSSLVWFPCTSRYR 138
G+ + S + P+ G Y + GTP +TP + DTGS +VW C R
Sbjct: 119 GTRRTGSGVVAPVVSGLAQGSGEYFTKIGVGTP---ATPALMVLDTGSDVVWLQCAPCRR 175
Query: 139 CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA 198
C D + F P+RS S +GC P C + GC R K C
Sbjct: 176 CYDQSGQ--------VFDPRRSRSYGAVGCSAPLCRRL-------DSGGCDLRRKAC--- 217
Query: 199 CPSYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGCSILSDR---QPAGIAGFGRSSE 253
Y + YG G TAG +ETL F V GC ++ AG+ G GR S
Sbjct: 218 --LYQVAYGDGSVTAGDFATETLTFAGGARVARIALGCGHDNEGLFVAAAGLLGLGRGSL 275
Query: 254 SLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS 310
S P+Q+ + FSYCL+ R P S + + G G+ S T S+TP KNP
Sbjct: 276 SFPAQISRRYGRSFSYCLVDRTSSANPASHSSTVTFGSGAVGS-TVAASFTPMVKNP--- 331
Query: 311 SSAFGEFYYVGLRQIIVGSKHVK-IPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
FYYV L I VG V + S L + S G GGVIVDSG++ T + P + A+
Sbjct: 332 --RMETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSAL 389
Query: 369 AKEFIRQMGNYSRAADVEKKSGLR----------PCFDISGKKSVYLPELILKFKGGAKM 418
F RAA +GLR C+D+SG+K V +P + + F GGA+
Sbjct: 390 RDAF--------RAA----AAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEA 437
Query: 419 ALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
ALPPENY V ++ F G + I+G+ Q Q F + FD R GF
Sbjct: 438 ALPPENYLIPVDSKGTFCFAFAGTDGGVS-------IIGNIQQQGFRVVFDGDGQRVGFV 490
Query: 479 KQKC 482
+ C
Sbjct: 491 PKGC 494
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 130/393 (33%), Positives = 183/393 (46%), Gaps = 57/393 (14%)
Query: 98 VHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
VH+ G + ++L+ GTP + + I DTGS L+W C C D P+ P F
Sbjct: 90 VHAGNGEFLMNLAIGTPAETYSA-IMDTGSDLIWTQCKPCKVCFD------QPT--PIFD 140
Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLL 215
P++SSS + C + C + + S GC Y YG T G+L
Sbjct: 141 PEKSSSFSKLPCSSDLCVAL---PISSCSDGCE------------YRYSYGDHSSTQGVL 185
Query: 216 LSETLRFPSKTVPNFLAGCSILSDR-----QPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
+ET F +V GC +R Q AG+ G GR SL SQLG+ KFSYCL S
Sbjct: 186 ATETFTFGDASVSKIGFGCG-EDNRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCLTS 244
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
DD+ S L++ + + S P TP +NP S FYY+ L I VG
Sbjct: 245 --IDDSKGISTLLVGS-EATVKSAIP----TPLIQNPSRPS-----FYYLSLEGISVGDT 292
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+ I S DG+GG+I+DSG+T T+++ F A+ KEFI QM D +
Sbjct: 293 LLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDNAFAALKKEFISQM---KLDVDASGSTE 349
Query: 391 LRPCFDISGKKS-VYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
L CF + S V +P+L+ F+ G + LP ENY ++ + L +I T G + G
Sbjct: 350 LELCFTLPPDGSPVEVPQLVFHFE-GVDLKLPKENY--IIEDSALRVICLT---MGSSSG 403
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I G+FQ QN + DL + FA +C
Sbjct: 404 MS---IFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 130/393 (33%), Positives = 183/393 (46%), Gaps = 57/393 (14%)
Query: 98 VHS-YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
VH+ G + ++L+ GTP + + I DTGS L+W C C D P+ P F
Sbjct: 90 VHAGNGEFLMNLAIGTPAETYSA-IMDTGSDLIWTQCKPCKVCFD------QPT--PIFD 140
Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLL 215
P++SSS + C + C + + S GC Y YG T G+L
Sbjct: 141 PEKSSSFSKLPCSSDLCVAL---PISSCSDGCE------------YRYSYGDHSSTQGVL 185
Query: 216 LSETLRFPSKTVPNFLAGCSILSDR-----QPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
+ET F +V GC +R Q AG+ G GR SL SQLG+ KFSYCL S
Sbjct: 186 ATETFTFGDASVSKIGFGCG-EDNRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCLTS 244
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
DD+ S L++ + + S P TP +NP S FYY+ L I VG
Sbjct: 245 --IDDSKGISTLLVGS-EATVKSAIP----TPLIQNPSRPS-----FYYLSLEGISVGDT 292
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+ I S DG+GG+I+DSG+T T+++ F A+ KEFI QM D +
Sbjct: 293 LLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDSAFAALKKEFISQM---KLDVDASGSTE 349
Query: 391 LRPCFDISGKKS-VYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
L CF + S V +P+L+ F+ G + LP ENY ++ + L +I T G + G
Sbjct: 350 LELCFTLPPDGSPVDVPQLVFHFE-GVDLKLPKENY--IIEDSALRVICLT---MGSSSG 403
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I G+FQ QN + DL + FA +C
Sbjct: 404 MS---IFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 119/400 (29%), Positives = 176/400 (44%), Gaps = 43/400 (10%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPC-TSRYRCVDCNFPNVDPSRIPAFIPKRS 160
G Y +S++FGTPPQ I DTGS L+W C T+ C P SR PAF+ +S
Sbjct: 52 GQYLVSMAFGTPPQ-EVLLIADTGSDLIWLQCSTTAAPPAFC--PKKACSRRPAFVASKS 108
Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSET 219
++ ++ C +C + P CSP P+ C Y Y G T G L +T
Sbjct: 109 ATLSVVPCSAAQCLLV--PAPRGHGPSCSP---AAPVPC-GYAYDYADGSSTTGFLARDT 162
Query: 220 LRFPSKT-----VPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLG---LKKFSYC 267
+ T V GC + G+ G G+ S P+Q G + FSYC
Sbjct: 163 ATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYC 222
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV 327
LL + SS+ + P + +YTP NP+ + FYYVG+ I V
Sbjct: 223 LLDLEGGRRGRSSSFLFLGRP----ERRAAFAYTPLVSNPLAPT-----FYYVGVVAIRV 273
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
G++ + +P S GNGG ++DSGST T++ + + F + +
Sbjct: 274 GNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATF 333
Query: 388 KSGLRPCFDISGKKSVY-----LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
GL C+++S S+ P L + F G + LP NY V ++V CL
Sbjct: 334 FQGLELCYNVSSSSSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCL------ 387
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A P L +LG+ Q +++EFD A+ R GFA+ +C
Sbjct: 388 AIRPTLSPFAFNVLGNLMQQGYHVEFDRASARIGFARTEC 427
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 120/412 (29%), Positives = 172/412 (41%), Gaps = 82/412 (19%)
Query: 92 IKTPL---SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVD 148
I+ PL + G Y + G P + + DTGS + W CT C DC
Sbjct: 133 IEAPLISGTTQGSGEYFTRVGIGKPAR-EVYMVLDTGSDVNWLQCTP---CADCYHQTE- 187
Query: 149 PSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGL 208
P F P SSS + + C P+C+ + S C RN TC Y + YG
Sbjct: 188 ----PIFEPSSSSSYEPLSCDTPQCNAL----EVSEC-----RNATCL-----YEVSYGD 229
Query: 209 G-FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESL------------ 255
G +T G +ETL S V N GC G S+E L
Sbjct: 230 GSYTVGDFATETLTIGSTLVQNVAVGC--------------GHSNEGLFVGAAGLLGLGG 275
Query: 256 -----PSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS 310
PSQL FSYCL+ R D A G S +P P +N
Sbjct: 276 GLLALPSQLNTTSFSYCLVDRDSDSASTVD---------FGTSLSPDAVVAPLLRN---- 322
Query: 311 SSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAK 370
FYY+GL I VG + ++IP S G+GG+I+DSG+ T ++ ++ ++
Sbjct: 323 -HQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRD 381
Query: 371 EFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG 430
F++ + +AA V + C+++S K +V +P + F GG +ALP +NY V
Sbjct: 382 SFVKGTLDLEKAAGV---AMFDTCYNLSAKTTVEVPTVAFHFPGGKMLALPAKNYMIPVD 438
Query: 431 NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ + F A+ A I+G+ Q Q + FDLAN GF+ KC
Sbjct: 439 SVGTFCLAFAPTASSLA-------IIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 126/396 (31%), Positives = 179/396 (45%), Gaps = 47/396 (11%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + + GTPP+ I DTGS L W C C+DC R P F P SSS
Sbjct: 146 YLMDVYVGTPPR-RFQMIMDTGSDLNWLQCAP---CLDCF-----EQRGPVFDPAASSSY 196
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
+ + C +P+C + P + P CP Y YG + G L E+
Sbjct: 197 RNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCP-----YYYWYGDQSNSTGDLALESFTV 251
Query: 223 ------PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQL----GLKKFSYCLL 269
S V + GC + AG+ G GR S SQL G FSYCL+
Sbjct: 252 NLTAPGASSRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGGHTFSYCLV 311
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
D V+S +V + P L YT F +SS FYYV L ++VG
Sbjct: 312 DHGSD---VASKVVFGEDDALALAAHPRLKYTAFAP----ASSPADTFYYVRLTGVLVGG 364
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKK 388
+ + I G+GG I+DSG+T ++ P ++ + + FI +M G+Y D
Sbjct: 365 ELLNISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPV- 423
Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLCLILFTDNAAGPA 447
L PC+++SG + +PEL L F GA P ENYF L + ++CL + G +
Sbjct: 424 --LSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMS 481
Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G+FQ QNF++ +DL N+R GFA ++CA
Sbjct: 482 -------IIGNFQQQNFHVAYDLHNNRLGFAPRRCA 510
>gi|15450651|gb|AAK96597.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 110
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 64/109 (58%), Positives = 82/109 (75%), Gaps = 1/109 (0%)
Query: 376 MGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVL 434
M NY+R D+EK++GL PCF+ISGK V +PELI +FKGGAK+ LP NYF VGN + +
Sbjct: 1 MSNYTREKDLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTV 60
Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
CL + +D P+ G GPAIILG FQ QN+ +E+DL NDRFGFAK+KC+
Sbjct: 61 CLTVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 109
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 142/490 (28%), Positives = 210/490 (42%), Gaps = 66/490 (13%)
Query: 14 LLILLF-TTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKT 72
L +L+F A S AA+V V LT + SDP +L R H +
Sbjct: 27 LAVLVFLVVCATLASGAASVRVGLTRI---------HSDPDTTAPQFVRDALRRDMH-RQ 76
Query: 73 KTKPKTKDSNIGSNYSNSLIKTPLSVHSY------GGYSISLSFGTPPQASTPF--IFDT 124
+++ +D + S+ T +S + G Y ++L+ GTPP P+ + DT
Sbjct: 77 RSRSFGRDRDRELAESDGRTSTTVSARTRKDLPNGGEYLMTLAIGTPP---LPYAAVADT 133
Query: 125 GSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESR 184
GS L+W +C C + P + P S++ ++ C + S
Sbjct: 134 GSDLIW------TQCAPCGTQCFE-QPAPLYNPASSTTFSVLPCNSSL----------SM 176
Query: 185 CKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT-----VPNFLAGCSILSD 239
C G P Y YG G+TAG+ SET F S VP GCS S
Sbjct: 177 CAGALAGAAPPPGCACMYYQTYGTGWTAGVQGSETFTFGSSAADQARVPGVAFGCSNASS 236
Query: 240 RQ---PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP 296
AG+ G GR S SL SQLG +FSYCL F D +S L+L GP + + T
Sbjct: 237 SDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCL--TPFQDTNSTSTLLL--GPSAALNGT- 291
Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
G+ TPF +P + + +YY+ L I +G+K + I DG GG+I+DSG+T
Sbjct: 292 GVRSTPFVASP--ARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTT 349
Query: 357 FTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS---VYLPELILKFK 413
T + ++ V Q+ D +GL CF + S LP + L F
Sbjct: 350 ITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFD 409
Query: 414 GGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
GA M LP ++Y + G+ V CL + G G++Q QN ++ +D+ +
Sbjct: 410 -GADMVLPADSYM-ISGSGVWCLAMRNQT-------DGAMSTFGNYQQQNMHILYDVREE 460
Query: 474 RFGFAKQKCA 483
FA KC+
Sbjct: 461 TLSFAPAKCS 470
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 128/468 (27%), Positives = 203/468 (43%), Gaps = 76/468 (16%)
Query: 45 LHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG- 103
L H DS H L ++R+ K + +++ S+ ++ + P+ +GG
Sbjct: 40 LTHVDSGRGFTKHELLRRMVARS---------KARLASLRSSACDTALTAPVD---HGGS 87
Query: 104 ------YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
Y I L GTP DTGS LVW C C D +P F
Sbjct: 88 DVGSSEYLIHLGIGTPRPQRVVLHLDTGSDLVWTQCACTV-CFD--------QPVPVFRA 138
Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGL---GFTAGL 214
S + + C +P C G V GC+ R+++C YG T G
Sbjct: 139 SVSHTFSRVPCSDPLC----GHAVYLPLSGCAARDRSC-------FYAYGYMDHSITTGK 187
Query: 215 LLSETLRFPS-------KTVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKK 263
+ +T F + VPN GC +++ +GIAGFG SLPSQL +++
Sbjct: 188 MAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYGLFTPNQSGIAGFGTGPLSLPSQLKVRR 247
Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGL 322
FSYC + +++ VS ++L P + ++ G + TPF P G+ FY++ L
Sbjct: 248 FSYCFTA--MEESRVSP-VILGGEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSL 304
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
R + VG + S DG+GG +DSG+ TF +F ++ + F+ Q+
Sbjct: 305 RGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQVPLPVAK 364
Query: 383 ADVEKKSGLRPCFDISGKKSV-YLPELILKFKGGAKMALPPENYF-------ALVGNEVL 434
+ + L CF + KK +P+LIL + GA LP ENY + G ++
Sbjct: 365 GYTDPDNLL--CFSVPAKKKAPAVPKLILHLE-GADWELPRENYVLDNDDDGSGAGRKLC 421
Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+IL N+ G I+G+FQ QN ++ +DL +++ FA +C
Sbjct: 422 VVILSAGNSNG--------TIIGNFQQQNMHIVYDLESNKMVFAPARC 461
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 118/401 (29%), Positives = 184/401 (45%), Gaps = 57/401 (14%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
+S +++ G+S+++ GTPPQ S I D GS L+W C+ V ++P F
Sbjct: 99 ISPYAHQGHSLTVGVGTPPQPSK-VILDLGSDLLWTQCS----LVGPTAKQLEP----VF 149
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLL 215
RSSS ++ C + C+ + NKTC +Y YG+ G+L
Sbjct: 150 DAARSSSFSVLPCDS------------KLCEAGTFTNKTCTDRKCAYENDYGIMTATGVL 197
Query: 216 LSETLRFPSK--TVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLL- 269
+ET F + N GC L++ + +GI G S+ QL + KFSYCL
Sbjct: 198 ATETFTFGAHHGVSANLTFGCGKLANGTIAEASGILGLSPGPLSMLKQLAITKFSYCLTP 257
Query: 270 --SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYT-PFYKNPVGSSSAFGEFYYVGLRQII 326
RK +PV + D G KT G T P KNPV +YYV + +
Sbjct: 258 FADRK--TSPVMFGAMADLG----KYKTTGKVQTIPLLKNPVEDI-----YYYVPMVGMS 306
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR--QMGNYSRAAD 384
VGSK + +P L DG GG ++DS +T ++ P F + K + ++ +R+ D
Sbjct: 307 VGSKRLDVPQETLAIKPDGTGGTVLDSATTLAYLVEPAFTELKKAVMEGIKLPVANRSVD 366
Query: 385 VEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
CF++ + V +P L+L F G A+M+LP +NYF ++CL +
Sbjct: 367 -----DYPVCFELPRGMSMEGVQVPPLVLHFDGDAEMSLPRDNYFQEPSPGMMCLAVMQ- 420
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A G ++G+ Q QN ++ +D+ N +F +A KC
Sbjct: 421 -----APFEGAPNVIGNVQQQNMHVLYDVGNRKFSYAPTKC 456
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 131/429 (30%), Positives = 192/429 (44%), Gaps = 59/429 (13%)
Query: 64 LSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVH-SYGGYSISLSFGTPPQASTPFIF 122
+ RA + K + ++ + + I+TP++ G Y I ++ GTP S I
Sbjct: 1 MKRAIQRSQERLEKLQITSAVNTHQMKDIETPVTPDIGSGEYLIQMAIGTPA-LSLSAIM 59
Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVE 182
DTGS LVW C C DC+ ++ + K S L CQ P IF N +
Sbjct: 60 DTGSDLVWTKCNP---CTDCSTSSIYDPSSSSTYSKVLCQSSL--CQPPS---IFSCNND 111
Query: 183 SRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQ 241
C+ Y+ YG T+G+L ET S+++PN GC D Q
Sbjct: 112 GDCE---------------YVYPYGDRSSTSGILSDETFSISSQSLPNITFGCG--HDNQ 154
Query: 242 ----PAGIAGFGRSSESLPSQLG---LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
G+ GFGR S SL SQLG KFSYCL+SR D+ +S L +
Sbjct: 155 GFDKVGGLVGFGRGSLSLVSQLGPSMGNKFSYCLVSRT--DSSKTSPLFI--------GN 204
Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
T L T P+ SS+ YY+ L I VG + + IP SDG+GG+I+DSG
Sbjct: 205 TASLEATTVGSTPLVQSSSTNH-YYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSG 263
Query: 355 STFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
+T TF++ ++AV + + + N +A L CF+ G + P + FK
Sbjct: 264 TTLTFLQQTAYDAVKEAMVSSI-NLPQA-----DGQLDLCFNQQGSSNPGFPSMTFHFK- 316
Query: 415 GAKMALPPENY-FALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
GA +P ENY F ++++CL + N+ G I G+ Q QN+ + +D N+
Sbjct: 317 GADYDVPKENYLFPDSTSDIVCLAMMPTNS-----NLGNMAIFGNVQQQNYQILYDNENN 371
Query: 474 RFGFAKQKC 482
FA C
Sbjct: 372 VLSFAPTAC 380
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 131/396 (33%), Positives = 172/396 (43%), Gaps = 48/396 (12%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCT-SRYRCVDCNFPNVDPSRIPAF--IPK 158
G Y ++L+ GTPPQ S P I DTGS LVW C RC P +PS P F +P
Sbjct: 90 GEYIMTLAIGTPPQ-SYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLP- 147
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLGFTAGLLLS 217
SS L C E+R G +P P C Y YG G+T+GL S
Sbjct: 148 --CSSALNLCA-----------AEARLAGATP-----PPGCACRYNQTYGTGWTSGLQGS 189
Query: 218 ETLRFPSK-----TVPNFLAGCSILSDRQPAGIAGFGRSSESLP---SQLGLKKFSYCLL 269
ET F S VP GCS S G AG SQL FSYCL
Sbjct: 190 ETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL- 248
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
F D S L+L + G+ TPF +P S +YY+ L I VG+
Sbjct: 249 -TPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSP--SKPPMSTYYYLNLTGISVGA 305
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
+ IP +DG GG+I+DSG+T T + ++ V + +R + D +
Sbjct: 306 AALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRV-RAAVRSLVKLP-VTDGSNAT 363
Query: 390 GLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
GL CF + S LP + L F GGA M LP ENY L G + CL + +
Sbjct: 364 GLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMILDGG-MWCLAMRSQT----- 417
Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G LG++Q QN ++ +D+ + FA KC+
Sbjct: 418 --DGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 451
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 131/470 (27%), Positives = 200/470 (42%), Gaps = 63/470 (13%)
Query: 29 AATVTVPLTPLSTKHYLHHSDSDP---LKILHSLASSSLSRARHLKTKTKPKTKDSNIGS 85
+ + P + S LHH P L++ S + ++ K K + + S
Sbjct: 15 VSAIVAPTSSTSRGTLLHHGQKRPQPGLRVDLEQVDSGKNLTKYELIKRAIKRGERRMRS 74
Query: 86 N----YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD 141
S+S I+TP+ G Y ++++ GTP +S I DTGS L+W C +C
Sbjct: 75 INAMLQSSSGIETPVYAGD-GEYLMNVAIGTP-DSSFSAIMDTGSDLIWTQCEPCTQCFS 132
Query: 142 CNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS 201
P F P+ SSS + C++ C + ++TC
Sbjct: 133 --------QPTPIFNPQDSSSFSTLPCESQYCQDL--------------PSETCNNNECQ 170
Query: 202 YLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQP------AGIAGFGRSSES 254
Y YG G T G + +ET F + +VPN GC D Q AG+ G G S
Sbjct: 171 YTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCG--EDNQGFGQGNGAGLIGMGWGPLS 228
Query: 255 LPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
LPSQLG+ +FSYC+ S + S L L + +P + NP
Sbjct: 229 LPSQLGVGQFSYCMTSYG---SSSPSTLALGSAASGVPEGSPSTTLIHSSLNPT------ 279
Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
+YY+ L+ I VG ++ IP S DG GG+I+DSG+T T++ + AVA+ F
Sbjct: 280 --YYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTD 337
Query: 375 QMGNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV 433
Q+ + E SGL CF S +V +PE+ ++F GG + L +N V
Sbjct: 338 QI---NLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGV-LNLGEQNILISPAEGV 393
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+CL + + + G + I G+ Q Q + +DL N F +C
Sbjct: 394 ICLAMGSSSQLGIS-------IFGNIQQQETQVLYDLQNLAVSFVPTQCG 436
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 138 bits (347), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 119/412 (28%), Positives = 171/412 (41%), Gaps = 82/412 (19%)
Query: 92 IKTPL---SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVD 148
I+ PL + G Y + G P + + DTGS + W CT C DC
Sbjct: 136 IEAPLISGTTQGSGEYFTRVGIGNPAR-EVYMVLDTGSDVNWLQCTP---CADCYHQTE- 190
Query: 149 PSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGL 208
P F P SSS + + C P+C+ + S C RN TC Y + YG
Sbjct: 191 ----PIFEPSSSSSYEPLSCDTPQCNAL----EVSEC-----RNATCL-----YEVSYGD 232
Query: 209 G-FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESL------------ 255
G +T G +ETL S V N GC G S+E L
Sbjct: 233 GSYTVGDFATETLTIGSTLVQNVAVGC--------------GHSNEGLFVGAAGLLGLGG 278
Query: 256 -----PSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS 310
PSQL FSYCL+ R D A G S P P +N
Sbjct: 279 GLLALPSQLNTTSFSYCLVDRDSDSASTVE---------FGTSLPPDAVVAPLLRN---- 325
Query: 311 SSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAK 370
FYY+GL I VG + ++IP S G+GG+I+DSG+ T ++ ++ ++
Sbjct: 326 -HQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTGIYNSLRD 384
Query: 371 EFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG 430
F++ + +AA V + C+++S K ++ +P + F GG +ALP +NY V
Sbjct: 385 SFLKGTSDLEKAAGV---AMFDTCYNLSAKTTIEVPTVAFHFPGGKMLALPAKNYMIPVD 441
Query: 431 NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ + F A+ A I+G+ Q Q + FDLAN GF+ KC
Sbjct: 442 SVGTFCLAFAPTASSLA-------IIGNVQQQGTRVTFDLANSLIGFSSNKC 486
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 132/397 (33%), Positives = 188/397 (47%), Gaps = 53/397 (13%)
Query: 97 SVH-SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
SVH S Y + ++ GTPP T + DTGS L+W C + C C FP P+ P +
Sbjct: 84 SVHASTATYLVDIAIGTPPLPLTA-VLDTGSDLIWTQCDAP--CRRC-FPQ--PA--PLY 135
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGL 214
P RS++ + C++P C + P SRC SP + C +Y YG G T G+
Sbjct: 136 APARSATYANVSCRSPMCQALQSP--WSRC---SPPDTGC-----AYYFSYGDGTSTDGV 185
Query: 215 LLSETLRFPSKTVPNFLA-GC---SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
L +ET S T +A GC ++ S +G+ G GR SL SQLG+ +FSYC
Sbjct: 186 LATETFTLGSDTAVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCFTP 245
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
+A +S L L GS + TPF +P G + +YY+ L I VG
Sbjct: 246 F---NATAASPLFL----GSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDT 298
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+ I + G+GGVI+DSG+TFT +E F A+A+ ++ A+ G
Sbjct: 299 LLPIDPAVFRLTPMGDGGVIIDSGTTFTALEESAFVALARALASRV-RLPLASGAHL--G 355
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA-LG 449
L CF + ++V +P L+L F GA M L E+Y + D +AG A LG
Sbjct: 356 LSLCFAAASPEAVEVPRLVLHFD-GADMELRRESY------------VVEDRSAGVACLG 402
Query: 450 ----RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
RG + +LG Q QN ++ +DL F KC
Sbjct: 403 MVSARGMS-VLGSMQQQNTHILYDLERGILSFEPAKC 438
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 132/397 (33%), Positives = 188/397 (47%), Gaps = 53/397 (13%)
Query: 97 SVH-SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
SVH S Y + ++ GTPP T + DTGS L+W C + C C FP P+ P +
Sbjct: 84 SVHASTATYLVDIAIGTPPLPLTA-VLDTGSDLIWTQCDAP--CRRC-FPQ--PA--PLY 135
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGL 214
P RS++ + C++P C + P SRC SP + C +Y YG G T G+
Sbjct: 136 APARSATYANVSCRSPMCQALQSP--WSRC---SPPDTGC-----AYYFSYGDGTSTDGV 185
Query: 215 LLSETLRFPSKTVPNFLA-GC---SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
L +ET S T +A GC ++ S +G+ G GR SL SQLG+ +FSYC
Sbjct: 186 LATETFTLGSDTAVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCFTP 245
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
+A +S L L GS + TPF +P G + +YY+ L I VG
Sbjct: 246 F---NATAASPLFL----GSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDT 298
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+ I + G+GGVI+DSG+TFT +E F A+A+ ++ A+ G
Sbjct: 299 LLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERAFVALARALASRV-RLPLASGAHL--G 355
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA-LG 449
L CF + ++V +P L+L F GA M L E+Y + D +AG A LG
Sbjct: 356 LSLCFAAASPEAVEVPRLVLHFD-GADMELRRESY------------VVEDRSAGVACLG 402
Query: 450 ----RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
RG + +LG Q QN ++ +DL F KC
Sbjct: 403 MVSARGMS-VLGSMQQQNTHILYDLERGILSFEPAKC 438
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 117/395 (29%), Positives = 164/395 (41%), Gaps = 58/395 (14%)
Query: 101 YGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIP 157
YG + + + GTPPQ + I DTGS L W PC + + D P F P
Sbjct: 22 YGEFLVPIYLGTPPQKAV-VIIDTGSDLTWIQSEPCRACFEQAD-----------PIFDP 69
Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLL 216
+SS+ I C + C+ + G S C Y YG G T G
Sbjct: 70 SKSSTYNKIACSSSACADLLGTQTCSAAANCI------------YAYGYGDGSVTRGYFS 117
Query: 217 SETLRFPSKTVPNFLAGCSI-----LSDRQPAGIAGFGRSSESLPSQLGL---KKFSYCL 268
ET+ G S+ D GI G G+ S+PSQLG KFSYCL
Sbjct: 118 KETITATDTAGEEVKFGASVYNTGTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCL 177
Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
+ + S+ D SG+ + YTP N + +YY+ ++ I VG
Sbjct: 178 VDWLSAGSETSTMYFGDAAVPSGE-----VQYTPIVPN-----ADHPTYYYIAVQGISVG 227
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
+ I S S G+GG I+DSG+T T+++ +F A+ + Q+ R
Sbjct: 228 GSLLDIDQSVYEIDSGGSGGTIIDSGTTITYLQQEVFNALVAAYTSQV----RYPTTTSA 283
Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
+GL CF+ G S P + + G + LP N F + ++CL A AL
Sbjct: 284 TGLDLCFNTRGTGSPVFPAMTIHLD-GVHLELPTANTFISLETNIICL------AFASAL 336
Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
P I G+ Q QNF + +DL N R GFA CA
Sbjct: 337 DF-PIAIFGNIQQQNFDIVYDLDNMRIGFAPADCA 370
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 139/487 (28%), Positives = 214/487 (43%), Gaps = 63/487 (12%)
Query: 14 LLILLF-TTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARH-LK 71
L +L+F A S AA+V V LT + SDP +L R H +
Sbjct: 27 LAVLVFLVVCATLASGAASVRVGLTRI---------HSDPDTTAPQFVRDALRRDMHRQR 77
Query: 72 TKTKPKTKDSNIGSNYSNSLI--KTPLSVHSYGGYSISLSFGTPPQASTPF--IFDTGSS 127
+++ + +D + + + + +T + + G Y ++L+ GTPP P+ + DTGS
Sbjct: 78 SRSFGRDRDRELAESDGRTTVSARTRKDLPNGGEYLMTLAIGTPP---LPYAAVADTGSD 134
Query: 128 LVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKG 187
L+W +C C + P + P S++ ++ C + S C G
Sbjct: 135 LIW------TQCAPCGTQCFE-QPAPLYNPASSTTFSVLPCNSSL----------SMCAG 177
Query: 188 CSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT-----VPNFLAGCSILSDRQ- 241
P Y YG G+TAG+ SET F S VP GCS S
Sbjct: 178 ALAGAAPPPGCACMYNQTYGTGWTAGVQGSETFTFGSSAADQARVPGVAFGCSNASSSDW 237
Query: 242 --PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLS 299
AG+ G GR S SL SQLG +FSYCL F D +S L+L GP + + T G+
Sbjct: 238 NGSAGLVGLGRGSLSLVSQLGAGRFSYCL--TPFQDTNSTSTLLL--GPSAALNGT-GVR 292
Query: 300 YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTF 359
TPF +P + + +YY+ L I +G+K + I DG GG+I+DSG+T T
Sbjct: 293 STPFVASP--ARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTITS 350
Query: 360 MEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS---VYLPELILKFKGGA 416
+ ++ V + ++ + D +GL CF + S LP + L F GA
Sbjct: 351 LANAAYQQV-RAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFD-GA 408
Query: 417 KMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
M LP ++Y + G+ V CL + G G++Q QN ++ +D+ +
Sbjct: 409 DMVLPADSYM-ISGSGVWCLAMRNQT-------DGAMSTFGNYQQQNMHILYDVREETLS 460
Query: 477 FAKQKCA 483
FA KC+
Sbjct: 461 FAPAKCS 467
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 131/396 (33%), Positives = 171/396 (43%), Gaps = 48/396 (12%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCT-SRYRCVDCNFPNVDPSRIPAF--IPK 158
G Y ++L+ GTPPQ S P I DTGS LVW C RC P +PS P F +P
Sbjct: 90 GEYIMTLAIGTPPQ-SYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLP- 147
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLGFTAGLLLS 217
SS L C E+R G +P P C Y YG G+T+GL S
Sbjct: 148 --CSSALNLCA-----------AEARLAGATP-----PPGCACRYNQTYGTGWTSGLQGS 189
Query: 218 ETLRFPSK-----TVPNFLAGCSILSDRQPAGIAGFGRSSESLP---SQLGLKKFSYCLL 269
ET F S VP GCS S G AG SQL FSYCL
Sbjct: 190 ETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL- 248
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
F D S L+L + G+ TPF +P S +YY+ L I VG
Sbjct: 249 -TPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSP--SKPPMSTYYYLNLTGISVGP 305
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
+ IP +DG GG+I+DSG+T T + ++ V + +R + D +
Sbjct: 306 AALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRV-RAAVRSLVKLP-VTDGSNAT 363
Query: 390 GLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
GL CF + S LP + L F GGA M LP ENY L G + CL + +
Sbjct: 364 GLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMILDGG-MWCLAMRSQT----- 417
Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G LG++Q QN ++ +D+ + FA KC+
Sbjct: 418 --DGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 451
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 122/390 (31%), Positives = 172/390 (44%), Gaps = 56/390 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+P + + DTGS + W C C DC + DP F P S+
Sbjct: 167 GEYFSRVGIGSPAR-ELYMVLDTGSDVTWVQCQP---CADC-YQQSDP----VFDPSLSA 217
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C +P+C + C RN T AC Y + YG G +T G +ETL
Sbjct: 218 SYAAVSCDSPRCRDL-------DTAAC--RNATG--AC-LYEVAYGDGSYTVGDFATETL 265
Query: 221 RFPSKT-VPNFLAGCSILSDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFD 274
T V N GC D + + G + S PSQ+ FSYCL+ R
Sbjct: 266 TLGDSTPVTNVAIGCG--HDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDR--- 320
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
D+P +S L D+ T P ++P G FYYV L I VG + + I
Sbjct: 321 DSPAASTLQFGADGAEADTVT-----APLVRSP-----RTGTFYYVALSGISVGGQALSI 370
Query: 335 PYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
P S + + + G+GGVIVDSG+ T ++ + A+ F+R + R + V S
Sbjct: 371 PSSAFAMDATSGSGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGV---SLFDT 427
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGRGP 452
C+D+S + SV +P + L+F+GG + LP +NY V G CL NAA
Sbjct: 428 CYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAA-------- 479
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q Q + FD A GF KC
Sbjct: 480 VSIIGNVQQQGTRVSFDTAKGVVGFTPNKC 509
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 119/371 (32%), Positives = 166/371 (44%), Gaps = 56/371 (15%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
+ DTGS + W C C DC + DP F P S+S + C NP+C +
Sbjct: 182 MVLDTGSDVTWVQCQP---CADC-YQQSDP----VFDPSLSTSYASVACDNPRCHDL--- 230
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
+ RN T AC Y + YG G +T G +ETL S V + GC
Sbjct: 231 ------DAAACRNSTG--AC-LYEVAYGDGSYTVGDFATETLTLGDSAPVSSVAIGCG-- 279
Query: 238 SDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
D + + G + S PSQ+ FSYCL+ R D+P SS L GD
Sbjct: 280 HDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDR---DSPSSSTLQF------GD 330
Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVD 352
+ ++ P ++P S+ FYYVGL + VG + + IP S S G GGVIVD
Sbjct: 331 AADAEVT-APLIRSPRTST-----FYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVD 384
Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
SG+ T ++ + A+ F+R + R + V S C+D+S + SV +P + L+F
Sbjct: 385 SGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGV---SLFDTCYDLSDRTSVEVPAVSLRF 441
Query: 413 KGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
GG ++ LP +NY V G CL NAA I+G+ Q Q + FD A
Sbjct: 442 AGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAA--------VSIIGNVQQQGTRVSFDTA 493
Query: 472 NDRFGFAKQKC 482
GF KC
Sbjct: 494 KSTVGFTTNKC 504
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 131/396 (33%), Positives = 171/396 (43%), Gaps = 48/396 (12%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCT-SRYRCVDCNFPNVDPSRIPAF--IPK 158
G Y ++L+ GTPPQ S P I DTGS LVW C RC P +PS P F +P
Sbjct: 95 GEYIMTLAIGTPPQ-SYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLP- 152
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLGFTAGLLLS 217
SS L C E+R G +P P C Y YG G+T+GL S
Sbjct: 153 --CSSALNLCA-----------AEARLAGATP-----PPGCACRYNQTYGTGWTSGLQGS 194
Query: 218 ETLRFPSK-----TVPNFLAGCSILSDRQPAGIAGFGRSSESLP---SQLGLKKFSYCLL 269
ET F S VP GCS S G AG SQL FSYCL
Sbjct: 195 ETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL- 253
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
F D S L+L + G+ TPF +P S +YY+ L I VG
Sbjct: 254 -TPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSP--SKPPMSTYYYLNLTGISVGP 310
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
+ IP +DG GG+I+DSG+T T + ++ V + +R + D +
Sbjct: 311 AALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRV-RAAVRSLVKLP-VTDGSNAT 368
Query: 390 GLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
GL CF + S LP + L F GGA M LP ENY L G + CL + +
Sbjct: 369 GLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMILDGG-MWCLAMRSQT----- 422
Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G LG++Q QN ++ +D+ + FA KC+
Sbjct: 423 --DGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 456
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 126/410 (30%), Positives = 173/410 (42%), Gaps = 66/410 (16%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + LS GTPP+ DTGS LVW C C++C D IP P SS+
Sbjct: 94 YLVHLSVGTPPR-PVALTLDTGSDLVWTQCAP---CLNC----FDQGAIPVLDPAASSTH 145
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
+ C P C + P G S ++C Y+ YG T G L S+ F
Sbjct: 146 AAVRCDAPVCRAL--PFTSCGRGGSSWGERSC-----VYVYHYGDKSITVGKLASDRFTF 198
Query: 223 -PSKTVP-------NFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
P GC + GIAGFGR SLPSQLG+ FSYC S
Sbjct: 199 GPGDNADGGGVSERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTSFSYCFTS 258
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
F+ SS + L P T + TP ++P S Y++ L+ I VG+
Sbjct: 259 -MFES--TSSLVTLGVAPAE-LHLTGQVQSTPLLRDPSQPS-----LYFLSLKAITVGAT 309
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+ IP I+DSG++ T + ++EAV EF+ Q+G A + S
Sbjct: 310 RIPIPERRQ---RLREASAIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVE---GSA 363
Query: 391 LRPCFDISGKKS-----------------VYLPELILKFKGGAKMALPPENY-FALVGNE 432
L CF + + V +P L+ GGA LP ENY F G
Sbjct: 364 LDLCFALPSAAAPKSAFGWRWRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGAR 423
Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
V+CL+L D A G G +++G++Q QN ++ +DL ND FA +C
Sbjct: 424 VMCLVL--DAATG---GGDQTVVIGNYQQQNTHVVYDLENDVLSFAPARC 468
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 131/422 (31%), Positives = 192/422 (45%), Gaps = 61/422 (14%)
Query: 89 NSLIKTPL---SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP 145
N +K+PL + G Y + + GTPPQ S + DTGS LVW C++ C +C+
Sbjct: 70 NPTLKSPLISGASTGSGQYFVDIRLGTPPQ-SLLLVADTGSDLVWVKCSA---CRNCS-- 123
Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
+ PS AF+P+ SSS C +P C + P+ + + C +L
Sbjct: 124 HHPPSS--AFLPRHSSSFSPFHCFDPHCRLL--PHAPHHLCNHTRLHSPC-----RFLYS 174
Query: 206 YGLG-FTAGLLLSETLRFPSKT-----VPNFLAGCSI------LSDRQ---PAGIAGFGR 250
Y G ++G ET S + + GC +S Q G+ G GR
Sbjct: 175 YADGSLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGR 234
Query: 251 SSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLV---LDTGPGSGDSKTPGLSYTPFY 304
S S SQLG + KFSYCL+ P S ++ L + P + +K +SYTP
Sbjct: 235 GSISFSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATK---ISYTPLQ 291
Query: 305 KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSD--GNGGVIVDSGSTFTFMEG 362
NP+ + FYY+ + I + VK+P + V D GNGG +VDSG+T T++
Sbjct: 292 INPLSPT-----FYYITIHSITIDG--VKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTK 344
Query: 363 PLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGK-KSVYLPELILKFKGGAKMALP 421
+E V K +R+ AA++ G C + SG+ + LP L + GGA A P
Sbjct: 345 TAYEEVLKS-VRRRVKLPNAAELTP--GFDLCVNASGESRRPSLPRLRFRLGGGAVFAPP 401
Query: 422 PENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQK 481
P NYF V+CL + + G G ++I G+ Q F LEFD R GF ++
Sbjct: 402 PRNYFLETEEGVMCLAIRAVES-----GNGFSVI-GNLMQQGFLLEFDKEESRLGFTRRG 455
Query: 482 CA 483
C
Sbjct: 456 CG 457
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 126/402 (31%), Positives = 177/402 (44%), Gaps = 67/402 (16%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + +S GTPP+ DTGS LVW C C+DC + P P SS+
Sbjct: 90 YLMHVSVGTPPR-PVALTLDTGSDLVWTQCAP---CLDC----FEQGAAPVLDPAASSTH 141
Query: 164 QLIGCQNPKCSWI-FGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLR 221
+ C P C + F + C G S +++C Y+ YG T G L +++
Sbjct: 142 AALPCDAPLCRALPF-----TSCGGRSWGDRSC-----VYVYHYGDRSLTVGQLATDSFT 191
Query: 222 FPSKTVPNFLA------GCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
F LA GC ++ GIAGFGR SLPSQL + FSYC S
Sbjct: 192 FGGDDNAGGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTSFSYCFTS- 250
Query: 272 KFDDAPVSSNLVLDTGPGSGD-------SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
FD S+ V+ G + + + T + T KNP S Y+V LR
Sbjct: 251 MFD---TKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPS-----LYFVPLRG 302
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
I VG V +P S L I+DSG++ T + ++EAV EF+ Q+G A
Sbjct: 303 ISVGGARVAVPESRL------RSSTIIDSGASITTLPEDVYEAVKAEFVSQVG---LPAA 353
Query: 385 VEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFT 440
+ L CF + + + +P L L GGA LP NY F VLC++L
Sbjct: 354 AAGSAALDLCFALPVAALWRRPAVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVL-- 411
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
D AA G +++G++Q QN ++ +DL ND FA +C
Sbjct: 412 DAAA------GEQVVIGNYQQQNTHVVYDLENDVLSFAPARC 447
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 119/371 (32%), Positives = 165/371 (44%), Gaps = 56/371 (15%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
+ DTGS + W C C DC + DP F P S+S + C NP+C +
Sbjct: 178 MVLDTGSDVTWVQCQP---CADC-YQQSDP----VFDPSLSTSYASVACDNPRCHDL--- 226
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
+ RN T AC Y + YG G +T G +ETL S V + GC
Sbjct: 227 ------DAAACRNSTG--AC-LYEVAYGDGSYTVGDFATETLTLGDSAPVSSVAIGCG-- 275
Query: 238 SDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
D + + G + S PSQ+ FSYCL+ R D+P SS L GD
Sbjct: 276 HDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDR---DSPSSSTLQF------GD 326
Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVD 352
+ ++ P ++P S+ FYYVGL I VG + + IP S G GGVIVD
Sbjct: 327 AADAEVT-APLIRSPRTST-----FYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIVD 380
Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
SG+ T ++ + A+ F+R + R + V S C+D+S + SV +P + L+F
Sbjct: 381 SGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGV---SLFDTCYDLSDRTSVEVPAVSLRF 437
Query: 413 KGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
GG ++ LP +NY V G CL NAA I+G+ Q Q + FD A
Sbjct: 438 AGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAA--------VSIIGNVQQQGTRVSFDTA 489
Query: 472 NDRFGFAKQKC 482
GF KC
Sbjct: 490 KSTVGFTSNKC 500
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 119/400 (29%), Positives = 175/400 (43%), Gaps = 43/400 (10%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPC-TSRYRCVDCNFPNVDPSRIPAFIPKRS 160
G Y +S++FGTPPQ I DTGS L+W C T+ C P SR PAF+ +S
Sbjct: 51 GQYLVSMAFGTPPQ-EVLLIADTGSDLIWLQCSTTAAPPAFC--PKKACSRRPAFVASKS 107
Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSET 219
++ ++ C +C + P CSP P+ C Y Y G T G L +T
Sbjct: 108 ATLSVVPCSAAQCLLV--PAPRGHGPACSP---AAPVPC-GYAYDYADGSSTTGFLARDT 161
Query: 220 LRFPSKT-----VPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLG---LKKFSYC 267
+ T V GC + G+ G G+ S P+Q G + FSYC
Sbjct: 162 ATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYC 221
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV 327
LL + SS+ + P + +YTP NP+ + FYYVG+ I V
Sbjct: 222 LLDLEGGRRGRSSSFLFLGRP----ERRAAFAYTPLVSNPLAPT-----FYYVGVVAIRV 272
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
G++ + +P S GNGG ++DSGST T++ + + F + +
Sbjct: 273 GNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATF 332
Query: 388 KSGLRPCFDISGKKSVY-----LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
GL C+++S S P L + F G + LP NY V ++V CL
Sbjct: 333 FQGLELCYNVSSSSSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCL------ 386
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A P L +LG+ Q +++EFD A+ R GFA+ +C
Sbjct: 387 AIRPTLSPFAFNVLGNLMQQGYHVEFDRASARIGFARTEC 426
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 126/392 (32%), Positives = 171/392 (43%), Gaps = 58/392 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS +VW C +C + DP F P +S
Sbjct: 127 GEYFTRIGVGTPARY-VYMVLDTGSDVVWLQCAPCRKC----YTQADP----VFDPTKSR 177
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ I C P C + P GC+ +NK C Y + YG G FT G +ETL
Sbjct: 178 TYAGIPCGAPLCRRLDSP-------GCNNKNKVC-----QYQVSYGDGSFTFGDFSTETL 225
Query: 221 RFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSE-----SLPSQLGLK---KFSYCLLSRK 272
F V GC D + I G S P Q G + KFSYCL+ R
Sbjct: 226 TFRRTRVTRVALGCG--HDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRS 283
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
P S++V GDS + +TP KNP FYY+ L I VG
Sbjct: 284 ASAKP--SSVVF------GDSAVSRTARFTPLIKNP-----KLDTFYYLELLGISVGGSP 330
Query: 332 VK-IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
V+ + S + GNGGVI+DSG++ T + P + A+ F + RAA+ S
Sbjct: 331 VRGLSASLFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEF---SL 387
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
CFD+SG V +P ++L F+ GA ++LP NY V N F +G +
Sbjct: 388 FDTCFDLSGLTEVKVPTVVLHFR-GADVSLPATNYLIPVDNSGSFCFAFAGTMSGLS--- 443
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q Q F + FDLA R GFA + C
Sbjct: 444 ----IIGNIQQQGFRVSFDLAGSRVGFAPRGC 471
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 145/462 (31%), Positives = 202/462 (43%), Gaps = 74/462 (16%)
Query: 42 KHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY 101
K L H D+D L S +L R+ + T G + + I L + S
Sbjct: 32 KATLRHVDADAGYTEEQLLSRALRRSSA-RVATLQSLAALAPGDAITAARI---LVLASD 87
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + GTP + + I DTGS L+W C CVD P F P RS+
Sbjct: 88 GEYLMEMGIGTPTRYYSA-ILDTGSDLIWTQCAPCLLCVD--------QPTPYFDPARSA 138
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG---FTAGLLLSE 218
+ + +GC +P C+ ++ P C K C + QY G TAG+L +E
Sbjct: 139 TYRSLGCASPACNALYYP----LCY-----QKVC-------VYQYFYGDSASTAGVLANE 182
Query: 219 TLRFPSK----TVPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFSYCLLSR 271
T F + ++P GC L+ A G+ GFGR S SL SQLG +FSYCL S
Sbjct: 183 TFTFGTNETRVSLPGISFGCGNLNAGSLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTSF 242
Query: 272 KFDDAPVSSNL---VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
+PV S L V T + S P S TPF NP A Y++ + I VG
Sbjct: 243 L---SPVPSRLYFGVYATLNSTNASSEPVQS-TPFVVNP-----ALPTMYFLNMTGISVG 293
Query: 329 SKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
+ I P + + +DG GG I+DSG+T T++ P ++AV F Q+ +V
Sbjct: 294 GYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQIT--LPLLNVTD 351
Query: 388 KSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
S L CF ++SV LP+L+L F GA LP +NY + D + G
Sbjct: 352 ASVLDTCFQWPPPPRQSVTLPQLVLHFD-GADWELPLQNY------------MLVDPSTG 398
Query: 446 PALGRGPA-----IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
L A I+G +Q QNF + +DL N F C
Sbjct: 399 GGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 118/395 (29%), Positives = 172/395 (43%), Gaps = 47/395 (11%)
Query: 110 FGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169
G PPQ + I DTGS+L+W C++ C + + P RS +++ + C
Sbjct: 90 IGDPPQQAAAII-DTGSNLIWTQCST------CRANGCFGQDLTFYDPSRSRTAKPVACN 142
Query: 170 NPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF----PSK 225
+ C + G E+RC + K C + L YG G G L +E F S+
Sbjct: 143 DTAC--LLGS--ETRC---ARDGKAC-----AVLTAYGAGAIGGFLGTEVFTFGHGQSSE 190
Query: 226 TVPNFLAGCSILSDRQP------AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
+ GC S P +GI G GR SLPSQLG KFSYCL + F DA +
Sbjct: 191 NNVSLAFGCITASRLTPGSLDGASGIIGLGRGKLSLPSQLGDNKFSYCL-TPYFSDAANT 249
Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS-- 337
S L + G P S PF KNP F FYY+ L I VG+ + +P +
Sbjct: 250 STLFVGASAGLSGGGAPATS-VPFLKNP--DDDPFDSFYYLPLTGITVGTAKLDVPAAAF 306
Query: 338 ---YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
+ P GG ++DSGS FT + ++A+ E +RQ+G S GL C
Sbjct: 307 DLREVAPAK--WGGTLIDSGSPFTSLIDVAYQALRDELVRQLG-ASVVPPPAGAEGLDLC 363
Query: 395 FD--ISGKKSVYLPELILKFKGGA----KMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
G +P L+L F G + +PPENY+ V + C+++F+ L
Sbjct: 364 VGGVAPGDAGKLVPPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTL 423
Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G++ Q+ +L +DL F C+
Sbjct: 424 PLNETTIIGNYMQQDMHLLYDLGQGVLSFQPADCS 458
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 125/402 (31%), Positives = 181/402 (45%), Gaps = 59/402 (14%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
L S G Y + L+ GTPP T I DTGS L+W +C C P+ P F
Sbjct: 81 LVTASSGEYLVDLAIGTPPLYYTA-IMDTGSDLIW------TQCAPCLLCAAQPT--PYF 131
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGL 214
KRS++ + + C++ +C+ + P S K C Y YG TAG+
Sbjct: 132 DVKRSATYRALPCRSSRCAALSSP---------SCFKKMC-----VYQYYYGDTASTAGV 177
Query: 215 LLSETLRFPSKT-----VPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFSY 266
L +ET F + + N GC L+ + A G+ GFGR SL SQLG +FSY
Sbjct: 178 LANETFTFGAASSTKVRAANISFGCGSLNAGELANSSGMVGFGRGPLSLVSQLGPSRFSY 237
Query: 267 CLLSRKFDDAPVSSNL---VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
CL S +P S L V + S + TPF NP A Y++ ++
Sbjct: 238 CLTSYL---SPTPSRLYFGVFANLNSTNTSSGSPVQSTPFVINP-----ALPNMYFLSVK 289
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
I +G+K + I DG GGVI+DSG++ T+++ +EAV + + A
Sbjct: 290 GISLGTKRLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLASTI---PLPA 346
Query: 384 DVEKKSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFT 440
+ GL CF +V +P+ + F GA M LPPENY + LCL +
Sbjct: 347 MNDTDIGLDTCFQWPPPPNVTVTVPDFVFHFD-GANMTLPPENYMLIASTTGYLCLAM-- 403
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A ++G I+G++Q QN +L +D+AN F C
Sbjct: 404 ---APTSVG----TIIGNYQQQNLHLLYDIANSFLSFVPAPC 438
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 144/462 (31%), Positives = 202/462 (43%), Gaps = 74/462 (16%)
Query: 42 KHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY 101
K L H D+D L S +L R+ + T G + + I L + S
Sbjct: 32 KATLRHVDADAGYTEEQLLSRALRRSSA-RVATLQSLAALAPGDAITAARI---LVLASD 87
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + GTP + + I DTGS L+W C CVD P F P RS+
Sbjct: 88 GEYLMEMGIGTPTRYYSA-ILDTGSDLIWTQCAPCLLCVD--------QPTPYFDPARSA 138
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG---FTAGLLLSE 218
+ + +GC +P C+ ++ P C K C + QY G TAG+L +E
Sbjct: 139 TYRSLGCASPACNALYYP----LCY-----QKVC-------VYQYFYGDSASTAGVLANE 182
Query: 219 TLRFPSK----TVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
T F + ++P GC L+ +G+ GFGR S SL SQLG +FSYCL S
Sbjct: 183 TFTFGTNETRVSLPGISFGCGNLNAGLLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTSF 242
Query: 272 KFDDAPVSSNL---VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
+PV S L V T + S P S TPF NP A Y++ + I VG
Sbjct: 243 L---SPVPSRLYFGVYATLNSTNASSEPVQS-TPFVVNP-----ALPTMYFLNMTGISVG 293
Query: 329 SKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
+ I P + + +DG GG I+DSG+T T++ P ++AV F Q+ +V
Sbjct: 294 GYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQIT--LPLLNVTD 351
Query: 388 KSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
S L CF ++SV LP+L+L F GA LP +NY + D + G
Sbjct: 352 ASVLDTCFQWPPPPRQSVTLPQLVLHFD-GADWELPLQNY------------MLVDPSTG 398
Query: 446 PALGRGPA-----IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
L A I+G +Q QNF + +DL N F C
Sbjct: 399 GGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 125/392 (31%), Positives = 178/392 (45%), Gaps = 58/392 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + L+ GTPPQ DTGS LVW C C + + P D SR SS+
Sbjct: 35 YLLHLAIGTPPQP-VQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASR--------SSTF 85
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
L C + +C P+V + C + +TC +Y YG T G L ET+ F
Sbjct: 86 ALPSCDSTQCK--LDPSV-TMC--VNQTVQTC-----AYSYSYGDKSATIGFLDVETVSF 135
Query: 223 PS-KTVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS---RKFD 274
+ +VP + GC + + GIAGFGR SLPSQL + FS+C + RK
Sbjct: 136 VAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRK-- 193
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
S ++ D + + TP KNP A FYY+ L+ I VGS + +
Sbjct: 194 ----PSTVLFDLPADLYKNGRGTVQTTPLIKNP-----AHPTFYYLSLKGITVGSTRLPV 244
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
P S +G GG I+DSG+ FT + ++ V EF + ++G C
Sbjct: 245 PESAFAL-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHV---KLPVVPSNETGPLLC 300
Query: 395 FDISG-KKSVYLPELILKFKGGAKMALPPENYFALV---GNEVLCLILFTDNAAGPALGR 450
F K+ ++P+L+L F+ GA M LP ENY GN +CL A+
Sbjct: 301 FSAPPLGKAPHVPKLVLHFE-GATMHLPRENYVFEAKDGGNCSICL----------AIIE 349
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G I+G+FQ QN ++ +DL N + F + KC
Sbjct: 350 GEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 381
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 126/390 (32%), Positives = 169/390 (43%), Gaps = 52/390 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y L GTP + + DTGS +VW C RC + DP F P++S
Sbjct: 140 GEYFTRLGVGTPARY-VYMVLDTGSDIVWLQCAPCRRC----YSQSDP----IFDPRKSK 190
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ I C +P C + GC+ R KTC Y + YG G FT G +ETL
Sbjct: 191 TYATIPCSSPHCRRL-------DSAGCNTRRKTC-----LYQVSYGDGSFTVGDFSTETL 238
Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
F V GC ++ AG+ G G+ S P Q G + KFSYCL+ R
Sbjct: 239 TFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSAS 298
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-K 333
P S++V S ++ +TP NP FYYVGL I VG V
Sbjct: 299 SKP--SSVVFGNAAVSRIAR-----FTPLLSNP-----KLDTFYYVGLLGISVGGTRVPG 346
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
+ S GNGGVI+DSG++ T + P + A+ F RA D S
Sbjct: 347 VTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDF---SLFDT 403
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
CFD+S V +P ++L F+G A ++LP NY V F G +
Sbjct: 404 CFDLSNMNEVKVPTVVLHFRG-ADVSLPATNYLIPVDTNGKFCFAFAGTMGGLS------ 456
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G+ Q Q F + +DLA+ R GFA CA
Sbjct: 457 -IIGNIQQQGFRVVYDLASSRVGFAPGGCA 485
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 128/398 (32%), Positives = 181/398 (45%), Gaps = 70/398 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + L+ GTPPQ DTGS LVW C C + + P D SR SS+
Sbjct: 91 YLLHLAIGTPPQP-VQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASR--------SSTF 141
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
L C + +C P+V + C + +TC +Y YG T G L ET+ F
Sbjct: 142 ALPSCDSTQCK--LDPSV-TMC--VNQTVQTC-----AYSYSYGDKSATIGFLDVETVSF 191
Query: 223 PS-KTVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS---RKFD 274
+ +VP + GC + + GIAGFGR SLPSQL + FS+C + RK
Sbjct: 192 VAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRK-- 249
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
S ++ D + + TP KNP A FYY+ L+ I VGS + +
Sbjct: 250 ----PSTVLFDLPADLYKNGRGTVQTTPLIKNP-----AHPTFYYLSLKGITVGSTRLPV 300
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE------KK 388
P S +G GG I+DSG+ FT + ++ V EF AA V+ +
Sbjct: 301 PESAFAL-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEF---------AAHVKLPVVPSNE 350
Query: 389 SGLRPCFDISG-KKSVYLPELILKFKGGAKMALPPENYFALV---GNEVLCLILFTDNAA 444
+G CF K+ ++P+L+L F+ GA M LP ENY GN +CL
Sbjct: 351 TGPLLCFSAPPLGKAPHVPKLVLHFE-GATMHLPRENYVFEAKDGGNCSICL-------- 401
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A+ G I+G+FQ QN ++ +DL N + F + KC
Sbjct: 402 --AIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 123/401 (30%), Positives = 182/401 (45%), Gaps = 64/401 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y ++S GTP + + I DTGS L+W C C + + P F P+ SS
Sbjct: 38 GDYVTTISLGTPAKVFS-VIADTGSDLIWIQCKPCQACFN--------QKDPIFDPEGSS 88
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
S + C + C + PR K+C C Y YG G T G L SET+
Sbjct: 89 SYTTMSCGDTLCDSL-------------PR-KSCSPNC-DYSYGYGDGSGTRGTLSSETV 133
Query: 221 RFPSK-----TVPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGL---KKFSYCLL 269
S N GC L S +G+ G GR + S SQLG KFSYCL+
Sbjct: 134 TLTSTQGEKLAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLV 193
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSY--TPFYKNPVGSSSAFGEFYYVGLRQIIV 327
+ DAP ++ + S S L Y TP NP A FYYV L+ I +
Sbjct: 194 PWR--DAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNP-----AMESFYYVKLKDISI 246
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
+ ++IP DG+GG+I DSG+T T + ++ V +R + + +++
Sbjct: 247 AGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIV----LRALRSKVSFPEIDG 302
Query: 388 KS-GLRPCFDISGKKSVY---LPELILKFKGGAKMALPPENYFALVGNE--VLCLILFTD 441
S GL C+D+SG K+ Y +P ++ F+ GA LP ENYF + ++CL + +
Sbjct: 303 SSAGLDLCYDVSGSKASYKKKIPAMVFHFE-GADHQLPVENYFIAANDAGTIVCLAMVSS 361
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
N +G I G+ QNF + +D+ + + G+A +C
Sbjct: 362 NM---DIG-----IYGNMMQQNFRVMYDIGSSKIGWAPSQC 394
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 130/431 (30%), Positives = 187/431 (43%), Gaps = 76/431 (17%)
Query: 63 SLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG-YSISLSFGTPPQASTPFI 121
+L+RA H K+ + + + S S +TPL + S GG Y ++ S GTPPQ + +
Sbjct: 42 NLTRAAH-KSHQRLSMLAARLDDAASGS-AQTPLQLDSGGGAYDMTFSIGTPPQELSA-L 98
Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
DTGS L+W C + RCV P P++ P +SSS + C CS + P+
Sbjct: 99 ADTGSDLIWAKCGACTRCV--------PQGSPSYYPNKSSSFSKLPCSGSLCSDL--PSS 148
Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLG-----FTAGLLLSETLRFPSKTVPNFLAGCSI 236
+ CS C Y YGL +T G L SET S VP GC+
Sbjct: 149 Q-----CSAGGAEC-----DYKYSYGLASDPHHYTQGYLGSETFTLGSDAVPGIGFGCTT 198
Query: 237 L---SDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGP--GSG 291
+ +G+ G GR SL SQL + FSYCL S DA +S L+ +G G+G
Sbjct: 199 MSEGGYGSGSGLVGLGRGPLSLVSQLNVGAFSYCLTS----DAAKTSPLLFGSGALTGAG 254
Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
TP L + +Y Y V L I +G+ G+ G+I
Sbjct: 255 VQSTPLLRTSTYY-------------YTVNLESISIGAA---------TTAGTGSSGIIF 292
Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
DSG+T F+ P + + + Q N + A+ + G CF SG P ++L
Sbjct: 293 DSGTTVAFLAEPAYTLAKEAVLSQTTNLTMAS---GRDGYEVCFQTSG---AVFPSMVLH 346
Query: 412 FKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
F GG M LP ENYF V + V C I+ P+L I+G+ N+++ +D+
Sbjct: 347 FDGG-DMDLPTENYFGAVDDSVSCWIV----QKSPSLS-----IVGNIMQMNYHIRYDVE 396
Query: 472 NDRFGFAKQKC 482
F C
Sbjct: 397 KSMLSFQPANC 407
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 129/398 (32%), Positives = 180/398 (45%), Gaps = 70/398 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + L+ GTPPQ DTGS LVW C C + + P D SR SS+
Sbjct: 91 YLLHLAIGTPPQP-VQLTLDTGSDLVWTQCQPCAVCFNQSLPYYDASR--------SSTF 141
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
L C + +C P+V + C N+T SY YG T G L ET+ F
Sbjct: 142 ALPSCDSTQCK--LDPSV-TMCV-----NQTVQTCAFSY--SYGDKSATIGFLDVETVSF 191
Query: 223 PS-KTVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS---RKFD 274
+ +VP + GC + + GIAGFGR SLPSQL + FS+C + RK
Sbjct: 192 VAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRK-- 249
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
S ++ D + + TP KNP A FYY+ L+ I VGS + +
Sbjct: 250 ----PSTVLFDLPADLYKNGRGTVQTTPLIKNP-----AHPTFYYLSLKGITVGSTRLPV 300
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE------KK 388
P S +G GG I+DSG+ FT + ++ V EF AA V+ +
Sbjct: 301 PESAFAL-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEF---------AAHVKLPVVPSNE 350
Query: 389 SGLRPCFDISG-KKSVYLPELILKFKGGAKMALPPENYFALV---GNEVLCLILFTDNAA 444
+G CF K+ ++P+L+L F+ GA M LP ENY GN +CL
Sbjct: 351 TGPLLCFSAPPLGKAPHVPKLVLHFE-GATMHLPRENYVFEAKDGGNCSICL-------- 401
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A+ G I+G+FQ QN ++ +DL N + F + KC
Sbjct: 402 --AIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 122/399 (30%), Positives = 180/399 (45%), Gaps = 60/399 (15%)
Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
S G Y + + G+PP+ + I DTGS L+W C CV+ P+ P F P +
Sbjct: 84 SEGEYLMDVGIGSPPRYFSAMI-DTGSDLIWTQCAPCLLCVE------QPT--PYFEPAK 134
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
S+S + C + C+ ++ P C Y YG +AG+L +E
Sbjct: 135 STSYASLPCSSAMCNALYSP--------------LCFQNACVYQAFYGDSASSAGVLANE 180
Query: 219 TLRFPSKT----VPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
T F + + VP GC ++ +G+ GFGR + SL SQLG +FSYCL S
Sbjct: 181 TFTFGTNSTRVAVPRVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSF 240
Query: 272 KFDDAPVSSNLVLD---TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
+P +S L T + S + + TPF NP A Y++ + I V
Sbjct: 241 M---SPATSRLYFGAYATLNSTNTSSSGPVQSTPFIVNP-----ALPTMYFLNMTGISVA 292
Query: 329 SKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
+ I P + + +DG GGVI+DSG+T TF+ P + V F+ +G A+
Sbjct: 293 GDLLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVG--LPRANATP 350
Query: 388 KSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILF-TDNA 443
CF ++ V LPE++L F GA M LP ENY + G LCL + +D+
Sbjct: 351 SDTFDTCFKWPPPPRRMVTLPEMVLHFD-GADMELPLENYMVMDGGTGNLCLAMLPSDDG 409
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ I+G FQ QNF++ +DL N F C
Sbjct: 410 S----------IIGSFQHQNFHMLYDLENSLLSFVPAPC 438
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 130/400 (32%), Positives = 179/400 (44%), Gaps = 64/400 (16%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + L+ GTPPQ I DTGS LVW C C +DPS SS+
Sbjct: 415 YLVHLAIGTPPQ-PVQLILDTGSDLVWTQCRPCPVCFSRALGPLDPSN--------SSTF 465
Query: 164 QLIGCQNPKC---SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
++ C +P C +W S C + N+TC Y+ Y G T G L +ET
Sbjct: 466 DVLPCSSPVCDNLTW-------SSCGKHNWGNQTC-----VYVYAYADGSITTGHLDAET 513
Query: 220 LRFPSK------TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLL 269
F + TVP+ GC + ++ GIAGFGR + SLPSQL + FS+C
Sbjct: 514 FTFAAADGTGQATVPDLAFGCGLFNNGIFTSNETGIAGFGRGALSLPSQLKVDNFSHCFT 573
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
+ + P S L L P + S G + TP +N S A YY+ L+ I VG
Sbjct: 574 AITGSE-PSSVLLGL---PANLYSDADGAVQSTPLVQN-FSSLRA----YYLSLKGITVG 624
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
S + IP S DG GG I+DSG+ T + ++ V F Q+ D
Sbjct: 625 STRLPIPESTFALKQDGTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQV---RLPVDNATS 681
Query: 389 SGL-RPCFDISGKKSVY--LPELILKFKGGAKMALPPENY---FALVGNEVLCLILFTDN 442
S L R CF S + +P+L+L F+ GA + LP ENY F G V CL +
Sbjct: 682 SSLSRLCFSFSVPRRAKPDVPKLVLHFE-GATLDLPRENYMFEFEDAGGSVTCLAI---- 736
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
AG L I+G++Q QN ++ +DL + F +C
Sbjct: 737 NAGDDL-----TIIGNYQQQNLHVLYDLVRNMLSFVPAQC 771
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 129/407 (31%), Positives = 184/407 (45%), Gaps = 57/407 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + GTPP+ I DTGS L W C C+DC R P F P SS
Sbjct: 149 GEYLMDVYVGTPPRRFR-MIMDTGSDLNWLQCAP---CLDCF-----EQRGPVFDPAASS 199
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA----CPSYLLQYGLGFTAGLLLS 217
S + + C + +C +V + + +TC CP Y T G L
Sbjct: 200 SYRNVTCGDHRCG-----HVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLAL 254
Query: 218 ETLRF------PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFS 265
E+ S+ V + GC + AG+ G GR S SQL FS
Sbjct: 255 ESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFS 314
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGD-----SKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
YCL+ D V S +V G D + P L YT F SSS FYYV
Sbjct: 315 YCLVDHGSD---VGSKVVF----GEDDDALALAAHPQLKYTAFAPASS-SSSPADTFYYV 366
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L+ ++VG + + I G DG+GG I+DSG+T ++ P ++ + F+ +M S
Sbjct: 367 KLKGVLVGGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRM---S 423
Query: 381 RAAD-VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV---GNEVLCL 436
R+ V + L PC+++SG + +PEL L F GA P ENYF + G ++CL
Sbjct: 424 RSYPLVPEFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCL 483
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ G + I+G+FQ QNF++ +DL N+R GFA ++CA
Sbjct: 484 AVLGTPRTGMS-------IIGNFQQQNFHVVYDLQNNRLGFAPRRCA 523
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 121/399 (30%), Positives = 178/399 (44%), Gaps = 60/399 (15%)
Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
S G Y + + G+PP+ + I DTGS L+W C CV+ P F P +
Sbjct: 81 SEGEYLMDVGIGSPPRYFSAMI-DTGSDLIWTQCAPCLLCVE--------QPTPYFEPAK 131
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
S+S + C + C+ ++ P C Y YG +AG+L +E
Sbjct: 132 STSYASLPCSSAMCNALYSP--------------LCFQNACVYQAFYGDSASSAGVLANE 177
Query: 219 TLRFPSKT----VPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
T F + + VP GC ++ +G+ GFGR + SL SQLG +FSYCL S
Sbjct: 178 TFTFGTNSTRVAVPRVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSF 237
Query: 272 KFDDAPVSSNLVLD---TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
+P +S L T + S + + TPF NP A Y++ + I V
Sbjct: 238 M---SPATSRLYFGAYATLNSTNTSSSGPVQSTPFIVNP-----ALPTMYFLNMTGISVA 289
Query: 329 SKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
+ I P + + +DG GGVI+DSG+T TF+ P + V F+ +G A+
Sbjct: 290 GDLLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVG--LPRANATP 347
Query: 388 KSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILF-TDNA 443
CF ++ V LPE++L F GA M LP ENY + G LCL + +D+
Sbjct: 348 SDTFDTCFKWPPPPRRMVTLPEMVLHFD-GADMELPLENYMVMDGGTGNLCLAMLPSDDG 406
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ I+G FQ QNF++ +DL N F C
Sbjct: 407 S----------IIGSFQHQNFHMLYDLENSLLSFVPAPC 435
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 122/400 (30%), Positives = 181/400 (45%), Gaps = 62/400 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y ++S GTP + + I DTGS L+W C C + + P F P+ SS
Sbjct: 38 GDYVTTISLGTPAKVFS-VIADTGSDLIWIQCKPCQACFN--------QKDPIFDPEGSS 88
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
S + C + C + PR K+C C Y YG G T G L SET+
Sbjct: 89 SYTTMSCGDTLCDSL-------------PR-KSCSPDC-DYSYGYGDGSGTRGTLSSETV 133
Query: 221 RFPSK-----TVPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGL---KKFSYCLL 269
S N GC L S +G+ G GR + S SQLG KFSYCL+
Sbjct: 134 TLTSTQGEKLAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLV 193
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSY--TPFYKNPVGSSSAFGEFYYVGLRQIIV 327
+ DAP ++ + S S L Y TP NP A FYYV L+ I +
Sbjct: 194 PWR--DAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNP-----AMESFYYVKLKDISI 246
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
+ ++IP DG+GG+I DSG+T T + ++ V + +R ++ +
Sbjct: 247 AGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRA-LRSKISFPKIDG--S 303
Query: 388 KSGLRPCFDISGKKSVY---LPELILKFKGGAKMALPPENYFALVGNE--VLCLILFTDN 442
+GL C+D+SG K+ Y +P ++ F+ GA LP ENYF + ++CL + + N
Sbjct: 304 SAGLDLCYDVSGSKASYKMKIPAMVFHFE-GADYQLPVENYFIAANDAGTIVCLAMVSSN 362
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+G I G+ QNF + +D+ + + G+A +C
Sbjct: 363 M---DIG-----IYGNMMQQNFRVMYDIGSSKIGWAPSQC 394
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 142/495 (28%), Positives = 204/495 (41%), Gaps = 77/495 (15%)
Query: 14 LLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTK 73
LL++L T A A T+ +P+ +H P + SL R RH
Sbjct: 6 LLVVLVTFTADATHRPKTLHIPV--------VHRGAVFPSR--RGAPPGSLRRCRHAAPF 55
Query: 74 TKPKTKDSNIGSNYSNSLIKTPLSVHSY--GGYSISLSFGTPPQASTPFIFDTGSSLVWF 131
T +I ++ + L +S + G Y ++ G PP + + DTGS L+W
Sbjct: 56 TAQVASFHSIAADDDDRLRSPVMSGVPFDSGEYFAVINVGDPPTRAL-VVIDTGSDLIWL 114
Query: 132 ---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGC 188
PC YR V P + P+ SS+ + I C +P+C + R GC
Sbjct: 115 QCVPCRHCYRQV-----------TPLYDPRSSSTHRRIPCASPRCRDVL------RYPGC 157
Query: 189 SPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGC---SILSDRQPA 243
R C Y++ YG G ++G L ++ L FP T V N GC ++ A
Sbjct: 158 DARTGGC-----VYMVVYGDGSASSGDLATDRLVFPDDTHVHNVTLGCGHDNVGLLESAA 212
Query: 244 GIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSY 300
G+ G GR S P+QL FSYCL R SS LV P + P ++
Sbjct: 213 GLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYLVFGRTP-----EPPSTAF 267
Query: 301 TPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK--IPYSYLVPGSDGNGGVIVDSGSTFT 358
TP NP S YYV + VG + V S + + G GG++VDSG+ +
Sbjct: 268 TPLRTNPRRPS-----LYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIVVDSGTAIS 322
Query: 359 FMEGPLFEAVAKEFIRQMGNYSRAADVEKK-----SGLRPCFDISGK----KSVYLPELI 409
+ AV F +++ AA +K S C+D+ G +V +P ++
Sbjct: 323 RFARDAYAAVRDAF----DSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVRVPSIV 378
Query: 410 LKFKGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
L F GGA MALP NY V G + AA L +LG+ Q Q F L F
Sbjct: 379 LHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLN-----VLGNVQQQGFGLVF 433
Query: 469 DLANDRFGFAKQKCA 483
D+ R GF C+
Sbjct: 434 DVERGRIGFTPNGCS 448
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 118/392 (30%), Positives = 175/392 (44%), Gaps = 56/392 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+P + + DTGS + W C C DC + DP F P SS
Sbjct: 194 GEYFSRIGIGSPAR-QLYMVLDTGSDVTWLQCAP---CADC-YAQSDP----LFDPALSS 244
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C +P C + + + + N +C Y + YG G +T G +ETL
Sbjct: 245 SYATVPCDSPHCRAL---DASACHNNAANGNSSC-----VYEVAYGDGSYTVGDFATETL 296
Query: 221 RFP---SKTVPNFLAGCSILSDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRK 272
S V + GC D + + G + S PSQ+ +FSYCL+ R
Sbjct: 297 TLGGDGSAAVHDVAIGCG--HDNEGLFVGAAGLLALGGGPLSFPSQISATEFSYCLVDR- 353
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
D+P +S L G+ DS T P ++P ++ FYYV L I VG + +
Sbjct: 354 --DSPSASTLQF----GASDSST---VTAPLMRSPRSNT-----FYYVALNGISVGGETL 399
Query: 333 -KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
IP + G+GGVIVDSG+ T ++ + A+ F+R RA+ V S
Sbjct: 400 SDIPPAAFAMDEQGSGGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGV---SLF 456
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGR 450
C+D++G+ SV +P + L+F+GG ++ LP +NY V G CL A
Sbjct: 457 DTCYDLAGRSSVQVPAVSLRFEGGGELKLPAKNYLIPVDGAGTYCLAF--------AATG 508
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G I+G+ Q Q + FD A + GF+ KC
Sbjct: 509 GAVSIVGNVQQQGIRVSFDTAKNTVGFSPNKC 540
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 139/453 (30%), Positives = 188/453 (41%), Gaps = 73/453 (16%)
Query: 55 ILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLS---VHSYGGYSISLSFG 111
+ H L AR K +N G+ + P+ G Y + G
Sbjct: 89 LRHRLQRDKRRAARISKAAAGGGAGAAN-GTRSRGGAVAAPVVSGLAQGSGEYFTKIGVG 147
Query: 112 TPPQASTP--FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169
TP STP + DTGS +VW C RC D + P F P+RSSS + C
Sbjct: 148 TP---STPALMVLDTGSDVVWLQCAPCRRCYDQSGP--------VFDPRRSSSYGAVDCA 196
Query: 170 NPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-V 227
P C + GC R + C Y + YG G TAG +ETL F V
Sbjct: 197 APLCRRL-------DSGGCDLRRRAC-----LYQVAYGDGSVTAGDFATETLTFAGGARV 244
Query: 228 PNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSN 281
GC ++ AG+ G GR S S P+Q+ K FSYCL+ R + +++
Sbjct: 245 ARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAAS 304
Query: 282 LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-KIPYSYL- 339
+ G S+TP +NP FYYV L I VG V + S L
Sbjct: 305 RSRSSTVTFGPPSASAASFTPMVRNP-----RMETFYYVQLVGISVGGARVPGVAESDLR 359
Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR------- 392
+ S G GGVIVDSG++ T + P + A+ F RAA +GLR
Sbjct: 360 LDPSTGRGGVIVDSGTSVTRLARPSYSALRDAF--------RAA----AAGLRLSPGGFS 407
Query: 393 ---PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
C+D+ G+K V +P + + F GGA+ ALPPENY V + F G +
Sbjct: 408 LFDTCYDLGGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVS-- 465
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q Q F + FD R GFA + C
Sbjct: 466 -----IIGNIQQQGFRVVFDGDGQRVGFAPKGC 493
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 132/456 (28%), Positives = 195/456 (42%), Gaps = 82/456 (17%)
Query: 54 KILHSLASSSLSR-ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGT 112
++LH +A+ S +R AR L + D +Y++ + T VH ++ GT
Sbjct: 71 ELLHRMAARSKARSARLLSGRAASARVDPG---SYTDGVPDTEYLVH--------MAIGT 119
Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
PPQ I DTGS L W C CV C + +P F P RS + ++ C
Sbjct: 120 PPQ-PVQLILDTGSDLTWTQCAP---CVSCFRQS-----LPRFNPSRSMTFSVLPCDLRI 170
Query: 173 C---SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK--- 225
C +W S C S N C Y Y T G L S+T F S
Sbjct: 171 CRDLTW-------SSCGEQSWGNGIC-----VYAYAYADHSITTGHLDSDTFSFASADHA 218
Query: 226 ----TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
+VP+ GC + ++ GIAGF R + S+P+QL + FSYC + +
Sbjct: 219 IGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEPS 278
Query: 278 -----VSSNLVLDT-GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
V NL D G G G ++ L + S+ + YY+ L+ + VG+
Sbjct: 279 PVFLGVPPNLYSDAAGGGHGVVQSTAL---------IRYHSSQLKAYYISLKGVTVGTTR 329
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
+ IP S DG GG IVDSG+ T + ++ V F+ Q ++ S L
Sbjct: 330 LPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQ----TKLTVHNSTSSL 385
Query: 392 -RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCLILFTDNAAGP 446
+ CF + +P L+L F+ GA + LP ENY + G + CL + AG
Sbjct: 386 SQLCFSVPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAI----NAGE 440
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
L ++G+FQ QN ++ +DLAND F +C
Sbjct: 441 DLS-----VIGNFQQQNMHVLYDLANDMLSFVPARC 471
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 122/411 (29%), Positives = 176/411 (42%), Gaps = 50/411 (12%)
Query: 84 GSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCN 143
SNY +++ P+ +++++S GTPPQ T I DTGS L+W C
Sbjct: 70 ASNY-GTIVPMPIRPFGRLHHTLTVSIGTPPQPRT-LILDTGSDLIWTQCKL-------- 119
Query: 144 FPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYL 203
F P + P +SSS C C K CS RNK Y
Sbjct: 120 FDTRQHREKPLYDPAKSSSFAAAPCDGRLCE-----TGSFNTKNCS-RNKCI------YT 167
Query: 204 LQYGLGFTAGLLLSETLRFPS--KTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQ 258
YG T G L SET F + + GC L+ +GI G SL SQ
Sbjct: 168 YNYGSATTKGELASETFTFGEHRRVSVSLDFGCGKLTSGSLPGASGILGISPDRLSLVSQ 227
Query: 259 LGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEF 317
L + +FSYCL F D +S++ +T G + T NP GS+ +
Sbjct: 228 LQIPRFSYCLT--PFLDRNTTSHIFFGAMADLSKYRTTGPIQTTSLVTNPDGSN----YY 281
Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
YYV L I VG+K + +P S G DG+GG VDSG T + + EA+ KE + +
Sbjct: 282 YYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFVDSGDTTGMLPSVVMEAL-KEAMVEAV 340
Query: 378 NYSRAADVEKKSGLRPCFDI------SGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
+ CF + + + +V +P L+ F GGA M L ++Y V
Sbjct: 341 KLPVVNATDHGYEYELCFQLPRNGGGAVETAVQVPPLVYHFDGGAAMLLRRDSYMVEVSA 400
Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+CL++ + RG I+G++Q QN ++ FD+ N F FA +C
Sbjct: 401 GRMCLVISSG-------ARGA--IIGNYQQQNMHVLFDVENHEFSFAPTQC 442
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 139/459 (30%), Positives = 210/459 (45%), Gaps = 58/459 (12%)
Query: 45 LHHSDSDPLKI--LHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYG 102
L +D D ++I +H A+ R+ +T P + S + +++ ++V S G
Sbjct: 95 LDLADKDAVRIETMHRRAA----RSGGDRTPASPSSSPRRALSERMVATVESGVAVGS-G 149
Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSS 162
Y + + GTPP+ I DTGS L W C C+DC F V P F P SSS
Sbjct: 150 EYLMDVYVGTPPRRFR-MIMDTGSDLNWLQCAP---CLDC-FDQVGP----VFDPAASSS 200
Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLR 221
+ + C + +C + P C+ P +CP Y YG T G L E+
Sbjct: 201 YRNVTCGDQRCGLVAPPEPPRACR--RPGEDSCP-----YYYWYGDQSNTTGDLALESFT 253
Query: 222 F------PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLL 269
S+ V + + GC + AG+ G GR S SQL FSYCL+
Sbjct: 254 VNLTAPGASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLV 313
Query: 270 SRKFDDAPVSSNLVL-DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
D V+S +V + + + P L+YT F +SS FYYV L+ ++VG
Sbjct: 314 DHGSD---VASKVVFGEDDALALAAAHPQLNYTAF----APASSPADTFYYVKLKGVLVG 366
Query: 329 SKHVKIPYSYLVPGSDGNGG--VIVDSGSTFTFMEGPLFEAVAKEFIRQMG-NYSRAADV 385
+ + I G G I+DSG+T ++ P ++ + + FI +MG +Y D
Sbjct: 367 GELLNISSDTWGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDF 426
Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLCLILFTDNAA 444
L PC+++SG +PEL L F GA P ENYF L + ++CL +
Sbjct: 427 PV---LSPCYNVSGVDRPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRT 483
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G + I+G+FQ QNF++ +DL N+R GFA ++CA
Sbjct: 484 GMS-------IIGNFQQQNFHVVYDLKNNRLGFAPRRCA 515
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 140/455 (30%), Positives = 203/455 (44%), Gaps = 67/455 (14%)
Query: 37 TPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPL 96
TP H D+ +K L SL ++S R+L +KP + +S+S+I
Sbjct: 76 TPEELFHLRLQRDAIRVKKLSSLGATS----RNL---SKPGGT-----TGFSSSVISGL- 122
Query: 97 SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
G Y + GTPP+ + DTGS +VW C C +C + DP F
Sbjct: 123 -AQGSGEYFTRIGVGTPPKY-VYMVLDTGSDIVWLQCAP---CKNC-YSQTDP----VFN 172
Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLL 215
P +S S + C+ P C + P GC+ R +TC Y + YG G +T G
Sbjct: 173 PVKSGSFAKVLCRTPLCRRLESP-------GCNQR-QTCL-----YQVSYGDGSYTTGEF 219
Query: 216 LSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLL 269
++ETL F V GC ++ AG+ G GR S PSQ G +KFSYCL+
Sbjct: 220 VTETLTFRRTKVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLV 279
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
R P S++V S ++ +TP NP FYYV L I VG
Sbjct: 280 DRSASSKP--SSVVFGNSAVSRTAR-----FTPLLTNP-----RLDTFYYVELLGISVGG 327
Query: 330 KHVK-IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
V I S+ GNGGVI+D G++ T + P + A+ F + A +
Sbjct: 328 TPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEF--- 384
Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
S C+D+SGK +V +P ++L F+G A ++LP NY V F +G +
Sbjct: 385 SLFDTCYDLSGKTTVKVPTVVLHFRG-ADVSLPASNYLIPVDGSGRFCFAFAGTTSGLS- 442
Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G+ Q Q F + +DLA+ R GF+ + CA
Sbjct: 443 ------IIGNIQQQGFRVVYDLASSRVGFSPRGCA 471
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 124/434 (28%), Positives = 194/434 (44%), Gaps = 60/434 (13%)
Query: 60 ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPL-SVHSYGGYSISLSFGTPPQAST 118
+++ + R L+ K+ N + + +KT + + H GGY++++ GTP + +
Sbjct: 87 SAAEILRRDQLRVKSIRAKHSMNSSTTGVFNEMKTRVPTTHFGGGYAVTVGLGTPKKDFS 146
Query: 119 PFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFG 178
+FDTGS L W C C FP D F P +S+S + + C + C I
Sbjct: 147 -LLFDTGSDLTWTQCEP---CSGGCFPQNDEK----FDPTKSTSYKNLSCSSEPCKSIG- 197
Query: 179 PNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
+ +GCS N Y ++YG G+T G L +ETL PS NF+ GC
Sbjct: 198 ---KESAQGCSSSNSCL------YGVKYGTGYTVGFLATETLTITPSDVFENFVIGCGER 248
Query: 238 SDRQ---PAGIAGFGRSSESLPSQLG---LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
+ + AG+ G GRS +LPSQ FSYCL P SS+ G G
Sbjct: 249 NGGRFSGTAGLLGLGRSPVALPSQTSSTYKNLFSYCL--------PASSSSTGHLSFGGG 300
Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
S+ +TP +S E Y + + I VG + + I S G I+
Sbjct: 301 VSQAA--KFTPI-------TSKIPELYGLDVSGISVGGRKLPIDPSVFR-----TAGTII 346
Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS--GKKSVYLPELI 409
DSG+T T++ A++ F M NY+ + SGL+PC+D S ++ +P++
Sbjct: 347 DSGTTLTYLPSTAHSALSSAFQEMMTNYTL---TKGTSGLQPCYDFSKHANDNITIPQIS 403
Query: 410 LKFKGGAKMALPPENYF-ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
+ F+GG ++ + F A G E +CL F DN + I G+ Q + + + +
Sbjct: 404 IFFEGGVEVDIDDSGIFIAANGLEEVCLA-FKDNGNDTDVA-----IFGNVQQKTYEVVY 457
Query: 469 DLANDRFGFAKQKC 482
D+A GFA C
Sbjct: 458 DVAKGMVGFAPGGC 471
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 139/494 (28%), Positives = 200/494 (40%), Gaps = 91/494 (18%)
Query: 12 FSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK 71
F ++ LL ++AATV + LT H+D+ LA+ L + L+
Sbjct: 6 FVIVTLLAALAISRCNAAATVRMQLT---------HADAG-----RGLAARELMQRMALR 51
Query: 72 TKTKPKTK------DSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
+K + + Y N + T VH L+ GTPPQ DTG
Sbjct: 52 SKARAARRLSSSASAPVSPGTYDNGVPTTEYLVH--------LAIGTPPQ-PVQLTLDTG 102
Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG-----CQNPKCSWIFGPN 180
S L+W C C D P DPS S+ G C +PK F PN
Sbjct: 103 SDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLCQGLPVASCGSPK----FWPN 158
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF--PSKTVPNFLAGCSIL 237
+TC Y YG T G L + F +VP GC +
Sbjct: 159 ------------QTC-----VYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLF 201
Query: 238 SD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
++ GIAGFGR SLPSQL + FS+C + + S ++LD S
Sbjct: 202 NNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAV---NGLKPSTVLLDLPADLYKS 258
Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDS 353
+ TP +NP + FYY+ L+ I VGS + +P S +G GG I+DS
Sbjct: 259 GRGAVQSTPLIQNPANPT-----FYYLSLKGITVGSTRLPVPESEFAL-KNGTGGTIIDS 312
Query: 354 GSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG--KKSVYLPELILK 411
G+ T + ++ V F Q+ V + P F +S + Y+P+L+L
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQV-----KLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLH 367
Query: 412 FKGGAKMALPPENYFALV---GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
F+ GA M LP ENY V G+ +LCL + G +G+FQ QN ++ +
Sbjct: 368 FE-GATMDLPRENYVFEVEDAGSSILCLAIIEG---------GEVTTIGNFQQQNMHVLY 417
Query: 469 DLANDRFGFAKQKC 482
DL N + F +C
Sbjct: 418 DLQNSKLSFVPAQC 431
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 125/390 (32%), Positives = 168/390 (43%), Gaps = 52/390 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y L GTP + + DTGS +VW C RC + DP F P++S
Sbjct: 140 GEYFTRLGVGTPARY-VYMVLDTGSDIVWLQCAPCRRC----YSQSDP----IFDPRKSK 190
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ I C +P C + GC+ R KTC Y + YG G FT G +ETL
Sbjct: 191 TYATIPCSSPHCRRL-------DSAGCNTRRKTC-----LYQVSYGDGSFTVGDFSTETL 238
Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
F V GC ++ AG+ G G+ S P Q G + KFSYCL+ R
Sbjct: 239 TFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSAS 298
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-K 333
P S++V S ++ +TP NP FYYV L I VG V
Sbjct: 299 SKP--SSVVFGNAAVSRIAR-----FTPLLSNP-----KLDTFYYVELLGISVGGTRVPG 346
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
+ S GNGGVI+DSG++ T + P + A+ F RA D S
Sbjct: 347 VAASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDF---SLFDT 403
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
CFD+S V +P ++L F+G A ++LP NY V F G +
Sbjct: 404 CFDLSNMNEVKVPTVVLHFRG-ADVSLPATNYLIPVDTNGKFCFAFAGTMGGLS------ 456
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G+ Q Q F + +DLA+ R GFA CA
Sbjct: 457 -IIGNIQQQGFRVVYDLASSRVGFAPGGCA 485
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 139/494 (28%), Positives = 200/494 (40%), Gaps = 91/494 (18%)
Query: 12 FSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK 71
F ++ LL ++AATV + LT H+D+ LA+ L + L+
Sbjct: 6 FVIVTLLAALAISRCNAAATVRMQLT---------HADAG-----RGLAARELMQRMALR 51
Query: 72 TKTKPKTK------DSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
+K + + Y N + T VH L+ GTPPQ DTG
Sbjct: 52 SKARAARRLSSSASAPVSPGTYDNGVPTTEYLVH--------LAIGTPPQ-PVQLTLDTG 102
Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG-----CQNPKCSWIFGPN 180
S L+W C C D P DPS S+ G C +PK F PN
Sbjct: 103 SDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLCQGLPVASCGSPK----FWPN 158
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF--PSKTVPNFLAGCSIL 237
+TC Y YG T G L + F +VP GC +
Sbjct: 159 ------------QTC-----VYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLF 201
Query: 238 SD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
++ GIAGFGR SLPSQL + FS+C + + S ++LD S
Sbjct: 202 NNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAV---NGLKPSTVLLDLPADLYKS 258
Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDS 353
+ TP +NP + FYY+ L+ I VGS + +P S +G GG I+DS
Sbjct: 259 GRGAVQSTPLIQNPANPT-----FYYLSLKGITVGSTRLPVPESEFTL-KNGTGGTIIDS 312
Query: 354 GSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG--KKSVYLPELILK 411
G+ T + ++ V F Q+ V + P F +S + Y+P+L+L
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQV-----KLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLH 367
Query: 412 FKGGAKMALPPENYFALV---GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
F+ GA M LP ENY V G+ +LCL + G +G+FQ QN ++ +
Sbjct: 368 FE-GATMDLPRENYVFEVEDAGSSILCLAIIEG---------GEVTTIGNFQQQNMHVLY 417
Query: 469 DLANDRFGFAKQKC 482
DL N + F +C
Sbjct: 418 DLQNSKLSFVPAQC 431
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 130/458 (28%), Positives = 195/458 (42%), Gaps = 71/458 (15%)
Query: 45 LHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY-GG 103
L +DP ++L R H K + ++ S+ + P+S + G
Sbjct: 32 LTRVHADPSVTASQFVRAALHRDMHRHNARK-------LAASSSDGTVSAPVSPTTVPGE 84
Query: 104 YSISLSFGTPPQASTPF--IFDTGSSLVWFPCTSRYR-CVDCNFPNVDPSRIPAF--IPK 158
+ ++L+ GTPP PF I DTGS L+W C R C P +PS F +P
Sbjct: 85 FLMTLAIGTPP---LPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPC 141
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
SS +G P C+ + Y + YG G+T +E
Sbjct: 142 NSS----LGLCAPACACM-------------------------YNMTYGSGWTYVFQGTE 172
Query: 219 TLRFPSKT------VPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCL 268
T F S T VP GCS S +G+ G GR S SL SQLG KFSYCL
Sbjct: 173 TFTFGSSTPADQVRVPGIAFGCSNASSGFNASSASGLVGLGRGSLSLVSQLGAPKFSYCL 232
Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
+ D +S L+L GP + + T +S TPF +P +YY+ L I +G
Sbjct: 233 --TPYQDTNSTSTLLL--GPSASLNDTGVVSSTPFVASPSS------IYYYLNLTGISLG 282
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
+ + IP + +DG GG+I+DSG+T T + ++ V + + D
Sbjct: 283 TTALPIPPNAFSLKADGTGGLIIDSGTTITMLGNTAYQQVRAAVLSLV--TLPTTDGSAA 340
Query: 389 SGLRPCFDISGKKSV--YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
+GL CF++ S +P + L F GA M LP +NY + + L+
Sbjct: 341 TGLDLCFELPSSTSAPPSMPSMTLHFD-GADMVLPADNYMMSLSDPDSDSSLWCLAMQNQ 399
Query: 447 ALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G + ILG++Q QN ++ +D+ + FA KC+
Sbjct: 400 TDTDGVVVSILGNYQQQNMHILYDVGKETLSFAPAKCS 437
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 131/466 (28%), Positives = 208/466 (44%), Gaps = 69/466 (14%)
Query: 54 KILHSLASSSLSRARHLKTKTKPKTKDSNIGS----------NYSNSLIKTPLSVHSYG- 102
KI+ + S+SR + +K + +++ + +S +++ T S S G
Sbjct: 109 KIIEKKDTKSMSRKQEVKESITIQQQNNLANAFVASLESSKGEFSGNIMATLESGASLGT 168
Query: 103 -GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y + + GTPP+ I DTGS L W C Y C + N + + PK SS
Sbjct: 169 GEYFLDMFVGTPPK-HVWLILDTGSDLSWIQCDPCYDCFEQNGSH--------YYPKDSS 219
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET- 219
+ + I C +P+C + + CK N+TCP Y Y G T G SET
Sbjct: 220 TYRNISCYDPRCQLVSSSDPLQHCKA---ENQTCP-----YFYDYADGSNTTGDFASETF 271
Query: 220 ---LRFPS-----KTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFS 265
L +P+ K V + + GC + +G+ G GR S PSQ+ FS
Sbjct: 272 TVNLTWPNGKEKFKQVVDVMFGCGHWNKGFFYGASGLLGLGRGPISFPSQIQSIYGHSFS 331
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
YCL + F + VSS L+ G L++T G + FYY+ ++ I
Sbjct: 332 YCL-TDLFSNTSVSSKLIF--GEDKELLNNHNLNFTTLL---AGEETPDETFYYLQIKSI 385
Query: 326 IVGSKHVKIP-----YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
+VG + + I +S +D GG I+DSGST TF ++ + + F +++
Sbjct: 386 MVGGEVLDISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQ 445
Query: 381 RAADVEKKSGLRPCFDISGK-KSVYLPELILKFKGGAKMALPPENYFALVG-NEVLCL-I 437
AAD + PC+++SG V LP+ + F G P ENYF +EV+CL I
Sbjct: 446 IAAD---DFVMSPCYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAI 502
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ T N + I+G+ QNF++ +D+ R G++ ++CA
Sbjct: 503 MKTPNHSH-------LTIIGNLLQQNFHILYDVKRSRLGYSPRRCA 541
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 123/404 (30%), Positives = 172/404 (42%), Gaps = 72/404 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTS--RYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y G PPQ + I DTGSSL+W CT+ R CV + P + S +F P
Sbjct: 86 YIAEYMVGDPPQRAEALI-DTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAP---- 140
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
+ CQ+ C+ N C + TC ++ + YG G G L ++
Sbjct: 141 ----VPCQDKACA----GNYLHFCA----LDGTC-----TFRVTYGAGGIIGFLGTDAFT 183
Query: 222 FPSK--------------TVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYC 267
F S P+ L G S G+ G GR SL SQ G K+FSYC
Sbjct: 184 FQSGGATLAFGCVSFTRFAAPDVLHGAS--------GLIGLGRGRLSLASQTGAKRFSYC 235
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
L + F + SS+L + G+ S + G + F ++P + FYY+ L
Sbjct: 236 L-TPYFHNNGASSHLFV----GAAASLSGGGGAVMSMAFVESP--KDYPYSTFYYLPLVG 288
Query: 325 IIVGSKHVKIPYSYL----VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
I VG + IP + V GGVI+DSGS FT + +E + E RQ+
Sbjct: 289 ITVGETKLAIPSTAFDLQEVEEGFWEGGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSL 348
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
E G+ C G +P L+L F GGA MALPPENY+A + C+
Sbjct: 349 VPPPGEDDGGMALCV-ARGDLDRVVPTLVLHFSGGADMALPPENYWAPLEKSTACM---- 403
Query: 441 DNAAGPALGRG-PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A+ RG I+G+FQ QN ++ FD+ R F C+
Sbjct: 404 ------AIVRGYLQSIIGNFQQQNMHILFDVGGGRLSFQNADCS 441
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 137/465 (29%), Positives = 210/465 (45%), Gaps = 79/465 (16%)
Query: 44 YLHHSDSDPLKI--LHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY 101
+L ++ D ++I +H A+ S S A + + + + + +++ + V S
Sbjct: 94 FLDSAEKDAVRIDTMHRRAALSGSAAARRDSAPRRALSERVVAT------VESGVPVGS- 146
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + GTPP+ I DTGS L W C C+DC F P F P S
Sbjct: 147 GEYLVDVYLGTPPRRFR-MIMDTGSDLNWLQCAP---CLDC-FEQSGP----IFDPAASI 197
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCS-PRNKTCPLACPSYLLQYG-----------LG 209
S + + C + +C + P ES + C PR+ CP Y YG
Sbjct: 198 SYRNVTCGDDRCRLV-SPPAESAPRECRRPRSDPCP-----YYYWYGDQSNTTGDLALEA 251
Query: 210 FTAGLLLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQL----GLK 262
FT L S T R V GC + AG+ G GR S SQL G
Sbjct: 252 FTVNLTQSGTRR-----VDGVAFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRGVYGGH 306
Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS--KTPGLSYTPFYKNPVGSSSAFGEFYYV 320
FSYCL+ + S ++ G D+ P L+YT F ++ FYY+
Sbjct: 307 AFSYCLVEHG---SAAGSKIIF----GHDDALLAHPQLNYTAF-----APTTDADTFYYL 354
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-NY 379
L+ I+VG + V I L GG I+DSG+T ++ P ++A+ + FI +M +Y
Sbjct: 355 QLKSILVGGEAVNISSDTL-----SAGGTIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSY 409
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLIL 438
+ L PC+++SG + V +PEL L F GA P ENYF + E ++CL +
Sbjct: 410 PL---ILGFPVLSPCYNVSGAEKVEVPELSLVFADGAAWEFPAENYFIRLEPEGIMCLAV 466
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+G + I+G++Q QNF++ +DL ++R GFA ++CA
Sbjct: 467 LGTPRSGMS-------IIGNYQQQNFHVLYDLEHNRLGFAPRRCA 504
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 125/405 (30%), Positives = 176/405 (43%), Gaps = 65/405 (16%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
L S G Y + L+ GTPP T I DTGS L+W C C D P F
Sbjct: 81 LVTASSGEYLVDLAIGTPPLYYTA-IMDTGSDLIWTQCAPCLLCAD--------QPTPYF 131
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGL 214
K+S++ + + C++ +C+ + P S K C Y YG TAG+
Sbjct: 132 DVKKSATYRALPCRSSRCASLSSP---------SCFKKMC-----VYQYYYGDTASTAGV 177
Query: 215 LLSETLRFPSKT-----VPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFSY 266
L +ET F + N GC L+ A G+ GFGR SL SQLG +FSY
Sbjct: 178 LANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGPSRFSY 237
Query: 267 CLLSRKFDDAP------VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
CL S P V +NL T SG + TPF NP A Y++
Sbjct: 238 CLTSY-LSATPSRLYFGVYANLS-STNTSSGSP----VQSTPFVINP-----ALPNMYFL 286
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L+ I +G+K + I DG GGVI+DSG++ T+++ +EAV + + +
Sbjct: 287 SLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAI---P 343
Query: 381 RAADVEKKSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLI 437
A + GL CF +V +P+L+ F A M L PENY + LCL+
Sbjct: 344 LPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFD-SANMTLLPENYMLIASTTGYLCLV 402
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ P G I+G++Q QN +L +D+ N F C
Sbjct: 403 M------APT---GVGTIIGNYQQQNLHLLYDIGNSFLSFVPAPC 438
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 130/442 (29%), Positives = 190/442 (42%), Gaps = 83/442 (18%)
Query: 55 ILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPP 114
IL+ ++ S L + L+T+ +P+ + + S S G Y + G P
Sbjct: 123 ILNGVSKSDL---KPLQTEIQPQDLSTPVSSGTS----------QGSGEYFTRVGVGNPA 169
Query: 115 QASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS 174
+ S + DTGS + W C C DC + DP F P SSS + C + +C+
Sbjct: 170 K-SYYMVLDTGSDINWIQCQP---CSDC-YQQSDP----IFTPAASSSYSPLTCDSQQCN 220
Query: 175 WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFP-SKTVPNFLA 232
+ + S RN C Y + YG G FT G ++ET+ F S TV +
Sbjct: 221 SL---------QMSSCRNGQC-----RYQVNYGDGSFTFGDFVTETMSFGGSGTVNSIAL 266
Query: 233 GCSILSDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTG 287
GC D + + G SL SQL FSYCL++R D+ SS L ++
Sbjct: 267 GCG--HDNEGLFVGAAGLLGLGGGPLSLTSQLKATSFSYCLVNR---DSAASSTLDFNSA 321
Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
P GDS P+ SS FYYVGL + VG + ++IP G+G
Sbjct: 322 P-VGDSVIA----------PLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDG 370
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL---RPCFDISGKKSVY 404
GVIVD G+ T ++ + ++ F+ S + + SG+ C+D+SG+ SV
Sbjct: 371 GVIVDCGTAITRLQSEAYNSLRDSFV------SMSRHLRSTSGVALFDTCYDLSGQSSVK 424
Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI----ILGDFQ 460
+P + F GG LP NY V D+A P I+G+ Q
Sbjct: 425 VPTVSFHFDGGKSWDLPAANYLIPV-----------DSAGTYCFAFAPTTSSLSIIGNVQ 473
Query: 461 LQNFYLEFDLANDRFGFAKQKC 482
Q + FDLAN+R GF+ KC
Sbjct: 474 QQGTRVSFDLANNRVGFSTNKC 495
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 121/403 (30%), Positives = 172/403 (42%), Gaps = 73/403 (18%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + L+ GTPPQ DTGS L+W C C D P DPS S+
Sbjct: 35 YLVHLAIGTPPQP-VQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 93
Query: 164 QLIG-----CQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLS 217
G C +PK F PN +TC Y YG T G L
Sbjct: 94 LCQGLPVASCGSPK----FWPN------------QTC-----VYTYSYGDKSVTTGFLEV 132
Query: 218 ETLRF--PSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
+ F +VP GC + ++ GIAGFGR SLPSQL + FS+C +
Sbjct: 133 DKFTFVGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTI 192
Query: 272 KFDDAPVSSNLVLD-------TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
+ S ++LD G G+ + TP + Y NP YY+ L+
Sbjct: 193 T---GAIPSTVLLDLPADLFSNGQGAVQT-TPLIQYAKNEANPT--------LYYLSLKG 240
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
I VGS + +P S ++G GG I+DSG++ T + +++ V EF Q+
Sbjct: 241 ITVGSTRLPVPESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI---KLPVV 296
Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCLILFT 440
+G CF + +P+L+L F+ GA M LP ENY V GN ++CL
Sbjct: 297 PGNATGHYTCFSAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICL---- 351
Query: 441 DNAAGPALGRG-PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A+ +G I+G+FQ QN ++ +DL N+ F +C
Sbjct: 352 ------AINKGDETTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 388
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 124/431 (28%), Positives = 182/431 (42%), Gaps = 57/431 (13%)
Query: 59 LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQAST 118
L + +RA +L ++ P G + S S + + L S G Y + + G+PP
Sbjct: 84 LVARDNARAEYLASRLSPAAYQPT-GFSGSESKVVSGLDEGS-GEYFVRVGIGSPPTEQY 141
Query: 119 PFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFG 178
+ D+GS ++W C C++C + DP F P S++ + C + C +
Sbjct: 142 -LVVDSGSDVIWVQCKP---CLEC-YAQADP----LFDPATSATFSAVPCGSAVCRTL-- 190
Query: 179 PNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSIL 237
R GC Y + YG G +T G L ETL V GC
Sbjct: 191 -----RTSGCGDSGGC------DYEVSYGDGSYTKGALALETLTLGGTAVEGVAIGCGHR 239
Query: 238 SDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
+ AG+ G G SL QLG FSYCL SR + +LVL G
Sbjct: 240 NRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRG------AGSLVL----GRS 289
Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
++ G + P +NP S FYYVGL I VG + + + DG GGV++
Sbjct: 290 EAVPEGAVWVPLVRNPQAPS-----FYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVM 344
Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
D+G+ T + + A+ F+ +G RA V S L C+D+SG SV +P +
Sbjct: 345 DTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGV---SLLDTCYDLSGYTSVRVPTVSFY 401
Query: 412 FKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
F G A + LP N V + CL F +++GP+ ILG+ Q + + D A
Sbjct: 402 FDGAATLTLPARNLLLEVDGGIYCLA-FAPSSSGPS-------ILGNIQQEGIQITVDSA 453
Query: 472 NDRFGFAKQKC 482
N GF C
Sbjct: 454 NGYIGFGPTTC 464
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 124/390 (31%), Positives = 177/390 (45%), Gaps = 53/390 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTPP+ + DTGS +VW C C +C + DP F P +S
Sbjct: 40 GEYFTRIGVGTPPKY-VYMVLDTGSDIVWLQCAP---CKNC-YSQTDP----VFNPVKSG 90
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C+ P C + P GC+ R +TC Y + YG G +T G ++ETL
Sbjct: 91 SFAKVLCRTPLCRRLESP-------GCNQR-QTCL-----YQVSYGDGSYTTGEFVTETL 137
Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFD 274
F V GC ++ AG+ G GR S PSQ G +KFSYCL+ R
Sbjct: 138 TFRRTKVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSAS 197
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK- 333
P S++V S ++ +TP NP FYYV L I VG V
Sbjct: 198 SKP--SSVVFGNSAVSRTAR-----FTPLLTNP-----RLDTFYYVELLGISVGGTPVSG 245
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
I S+ GNGGVI+D G++ T + P + A+ F + A + S
Sbjct: 246 ITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEF---SLFDT 302
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
C+D+SGK +V +P ++L F+G A ++LP NY V F +G +
Sbjct: 303 CYDLSGKTTVKVPTVVLHFRG-ADVSLPASNYLIPVDGSGRFCFAFAGTTSGLS------ 355
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G+ Q Q F + +DLA+ R GF+ + CA
Sbjct: 356 -IIGNIQQQGFRVVYDLASSRVGFSPRGCA 384
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 116/372 (31%), Positives = 165/372 (44%), Gaps = 55/372 (14%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
+ DTGS + W C C DC + DP F P S+S + C + +C +
Sbjct: 181 MVLDTGSDVTWVQCQP---CADC-YQQSDP----VFDPSLSASYAAVSCDSQRCRDL--- 229
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGCSIL 237
C RN T AC Y + YG G +T G +ETL T V N GC
Sbjct: 230 ----DTAAC--RNATG--AC-LYEVAYGDGSYTVGDFATETLTLGDSTPVGNVAIGCG-- 278
Query: 238 SDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
D + + G + S PSQ+ FSYCL+ R D+P +S L G
Sbjct: 279 HDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDR---DSPAASTLQFGDGAAEAG 335
Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIV 351
+ T P ++P S+ FYYV L I VG + + IP S + + + G+GGVIV
Sbjct: 336 TVT-----APLVRSPRTST-----FYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIV 385
Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
DSG+ T ++ + A+ F++ + R + V S C+D+S + SV +P + L+
Sbjct: 386 DSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGV---SLFDTCYDLSDRTSVEVPAVSLR 442
Query: 412 FKGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDL 470
F+GG + LP +NY V G CL NAA I+G+ Q Q + FD
Sbjct: 443 FEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAA--------VSIIGNVQQQGTRVSFDT 494
Query: 471 ANDRFGFAKQKC 482
A GF KC
Sbjct: 495 ARGAVGFTPNKC 506
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 124/377 (32%), Positives = 168/377 (44%), Gaps = 53/377 (14%)
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
P+ + DTGS + W C C DC + DP PA SSS +L+GCQ C
Sbjct: 154 PRRDQLMVLDTGSDVTWIQCEP---CSDC-YQQSDPIYNPAL----SSSYKLVGCQANLC 205
Query: 174 SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLA 232
+ GCS RN +C Y + YG G +T G +ETL + N
Sbjct: 206 QQL-------DVSGCS-RNGSCL-----YQVSYGDGSYTQGNFATETLTLGGAPLQNVAI 252
Query: 233 GCSILSD---RQPAGIAGFGRSSESLPSQL---GLKKFSYCLLSRKFDDAPVSSNLVLDT 286
GC ++ AG+ G G S S PSQL K FSYCL+ R D+ SS L
Sbjct: 253 GCGHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSYCLVDR---DSESSSTLQF-- 307
Query: 287 GPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG 345
G + P G P KN S FYYV L I VG K + I S + G
Sbjct: 308 ----GRAAVPNGAVLAPMLKN-----SRLDTFYYVSLSGISVGGKMLSISDSVFGIDASG 358
Query: 346 NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYL 405
NGGVIVDSG+ T ++ ++++ F N V S C+D+S K+SV +
Sbjct: 359 NGGVIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGV---SLFDTCYDLSSKESVDV 415
Query: 406 PELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFY 465
P ++ F GG M+LP +NY V + F ++ + I+G+ Q Q
Sbjct: 416 PTVVFHFSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTSSSLS-------IVGNIQQQGIR 468
Query: 466 LEFDLANDRFGFAKQKC 482
+ FD AN++ GFA KC
Sbjct: 469 VSFDRANNQVGFAVNKC 485
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 122/393 (31%), Positives = 169/393 (43%), Gaps = 60/393 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y L GTPP+ T + DTGS ++W C C C + DP F P SS
Sbjct: 151 GEYFTRLGVGTPPRY-TYMVLDTGSDIMWIQCLP---CAKC-YGQTDP----LFNPAASS 201
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKT-CPLACPSYLLQYGLG-FTAGLLLSET 219
+ + + C P C + GC RNK C Y + YG G FT G +ET
Sbjct: 202 TYRKVPCATPLCKKL-------DISGC--RNKRYCE-----YQVSYGDGSFTVGDFSTET 247
Query: 220 LRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSES-----LPSQLGL---KKFSYCLLSR 271
L F + + GC D + I G PSQ G K+FSYCL+ R
Sbjct: 248 LTFRGQVIRRVALGCG--HDNEGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFSYCLVDR 305
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
+ +S+L+ G + P + +TP NP FYYV L I VG +
Sbjct: 306 S--ASGTASSLIF------GKAAIPKSAIFTPLLSNP-----KLDTFYYVELVGISVGGR 352
Query: 331 HV-KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
+ IP S + GNGGVI+DSG++ T + + + F GN A S
Sbjct: 353 RLTSIPASVFRMDATGNGGVIIDSGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGGF---S 409
Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
C+D+SG K+V +P L+ F+GGA ++LP NY V + F N G +
Sbjct: 410 LFDTCYDLSGLKTVKVPTLVFHFQGGAHISLPATNYLIPVDSSATFCFAFAGNTGGLS-- 467
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q Q + + FD +R GF C
Sbjct: 468 -----IIGNIQQQGYRVVFDSLANRVGFKAGSC 495
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 123/391 (31%), Positives = 171/391 (43%), Gaps = 51/391 (13%)
Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y + L+ GTPP PF+ DTGS L W C C P P + SS
Sbjct: 93 YLMELAIGTPP---VPFVALADTGSDLTWTQCQPCKLCF--------PQDTPIYDTAVSS 141
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C + C I+ S RN T + Y YG G ++AG+L +ETL
Sbjct: 142 SFSPVPCASATCLPIW-----------SSRNCTASSSPCRYRYAYGDGAYSAGVLGTETL 190
Query: 221 RFPSK---TVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
FP +V GC + + G G GR S SL +QLG+ KFSYCL + F+
Sbjct: 191 TFPGAPGVSVGGIAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCL-TDFFN 249
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
+ S L + S + TP ++P +YYV L I +G + I
Sbjct: 250 TSLGSPVLFGALAELAAPSTGAAVQSTPLVQSPY-----VPTWYYVSLEGISLGDARLPI 304
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
P DG+GG+IVDSG+TFTF L E+ + + + R V S PC
Sbjct: 305 PNGTFDLRDDGSGGMIVDSGTTFTF----LVESAFRVVVDHVAGVLRQPVVNASSLDSPC 360
Query: 395 F-DISGKKSV-YLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRG 451
F +G++ + +P+++L F GGA M L +NY + E CL N AG
Sbjct: 361 FPAATGEQQLPAMPDMVLHFAGGADMRLHRDNYMSFNQEESSFCL-----NIAGSP--SA 413
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
ILG+FQ QN + FD+ + F C
Sbjct: 414 DVSILGNFQQQNIQMLFDITVGQLSFMPTDC 444
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 128/397 (32%), Positives = 172/397 (43%), Gaps = 58/397 (14%)
Query: 102 GGYSISLSFGTPPQASTP--FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
G Y + GTP +TP + DTGS +VW C RC + + F P+R
Sbjct: 138 GEYFTKIGVGTP---ATPALMVLDTGSDVVWLQCAPCRRCYEQSGQ--------VFDPRR 186
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
S S +GC P C + GC R C Y + YG G TAG +E
Sbjct: 187 SRSYNAVGCAAPLCRRL-------DSGGCDLRRSAC-----LYQVAYGDGSVTAGDFATE 234
Query: 219 TLRFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
TL F V GC ++ AG+ G GR S S P+Q+ + FSYCL+ R
Sbjct: 235 TLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDR 294
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
S + + G G+ S T S+TP KNP FYYV L I VG
Sbjct: 295 TSSANTASRSSTVTFGSGAVGS-TVASSFTPMVKNP-----RMETFYYVQLIGISVGGAR 348
Query: 332 V-KIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
V + S L + S G GGVIVDSG++ T + P + A+ F AA +
Sbjct: 349 VPGVANSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAF------RGAAAGLRLSP 402
Query: 390 G----LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
G C+D+SG+K V +P + + F GGA+ ALPPENY V ++ F G
Sbjct: 403 GGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGG 462
Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ I+G+ Q Q F + FD R F + C
Sbjct: 463 VS-------IIGNIQQQGFRVVFDGDGQRVAFTPKGC 492
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 124/390 (31%), Positives = 168/390 (43%), Gaps = 52/390 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y L GTP + + DTGS +VW C RC + DP F P++S
Sbjct: 140 GEYFTRLGVGTPARY-VYMVLDTGSDIVWLQCAPCRRC----YSQSDP----IFDPRKSK 190
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ I C +P C + GC+ R KTC Y + YG G FT G +ETL
Sbjct: 191 TYATIPCSSPHCRRL-------DSAGCNTRRKTC-----LYQVSYGDGSFTVGDFSTETL 238
Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
F V GC ++ AG+ G G+ S P Q G + KFSYCL+ R
Sbjct: 239 TFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSAS 298
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-K 333
P S++V S ++ +TP NP FYYVGL I VG V
Sbjct: 299 SKP--SSVVFGNAAVSRIAR-----FTPLLSNP-----KLDTFYYVGLLGISVGGTRVPG 346
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
+ S GNGGVI+DSG++ T + P + A+ F RA + S
Sbjct: 347 VTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNF---SLFDT 403
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
CFD+S V +P ++L F+ A ++LP NY V F G +
Sbjct: 404 CFDLSNMNEVKVPTVVLHFRR-ADVSLPATNYLIPVDTNGKFCFAFAGTMGGLS------ 456
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G+ Q Q F + +DLA+ R GFA CA
Sbjct: 457 -IIGNIQQQGFRVVYDLASSRVGFAPGGCA 485
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 125/426 (29%), Positives = 184/426 (43%), Gaps = 74/426 (17%)
Query: 87 YSNSLIKT--PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
+SNS KT L H + SL+ GTPPQ T + DTGS L W C
Sbjct: 48 FSNSSSKTTGKLLFHHNVTLTASLTIGTPPQNIT-MVLDTGSELSWLRCKK--------- 97
Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLAC-PSYL 203
+P+ F P S + I C + C ++R + T P+ C P+ L
Sbjct: 98 ---EPNFTSIFNPLASKTYTKIPCSSQTC--------KTRTS-----DLTLPVTCDPAKL 141
Query: 204 LQYGLGF-----TAGLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRS 251
+ + + G L ET RF S T P + GC + D + G+ G R
Sbjct: 142 CHFIISYADASSVEGHLAFETFRFGSLTRPATVFGCMDSGSSSNTEEDAKTTGLMGMNRG 201
Query: 252 SESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS 311
S S +Q+G +KFSYC+ + + S L G P L+YTP V S
Sbjct: 202 SLSFVNQMGFRKFSYCI-------SGLDSTGFLLLGEARYSWLKP-LNYTPL----VQIS 249
Query: 312 SAFGEF----YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEA 367
+ F Y V L I V +K + +P S VP G G +VDSG+ FTF+ GP++ A
Sbjct: 250 TPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSA 309
Query: 368 VAKEFIRQMGNYSRAADVEK---KSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPP 422
+ KEF+ Q R + + + + C+ I S LP + L F+ GA+M++
Sbjct: 310 LRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSSTLPNLPVVKLMFR-GAEMSVSG 368
Query: 423 ENYFALVGNE------VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
+ V E V C + G + + ++G Q QN ++E+DL N R G
Sbjct: 369 QRLLYRVPGEVRGKDSVWCFTFGNSDELGIS-----SFLIGHHQQQNVWMEYDLENSRIG 423
Query: 477 FAKQKC 482
FA+ +C
Sbjct: 424 FAELRC 429
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 131/456 (28%), Positives = 194/456 (42%), Gaps = 82/456 (17%)
Query: 54 KILHSLASSSLSR-ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGT 112
++L +A+ S +R AR L + D +Y++ + T VH ++ GT
Sbjct: 45 ELLRRMAARSKARSARLLSGRAASARMDPG---SYTDGVPDTEYLVH--------MAIGT 93
Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
PPQ I DTGS L W C CV C + +P F P RS + ++ C
Sbjct: 94 PPQ-PVQLILDTGSDLTWTQCAP---CVSCFRQS-----LPRFNPSRSMTFSVLPCDLRI 144
Query: 173 C---SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK--- 225
C +W S C S N C Y Y T G L S+T F S
Sbjct: 145 CRDLTW-------SSCGEQSWGNGIC-----VYAYAYADHSITTGHLDSDTFSFASADHA 192
Query: 226 ----TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
+VP+ GC + ++ GIAGF R + S+P+QL + FSYC + +
Sbjct: 193 IGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEPS 252
Query: 278 -----VSSNLVLDT-GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
V NL D G G G ++ L + S+ + YY+ L+ + VG+
Sbjct: 253 PVFLGVPPNLYSDAAGGGHGVVQSTAL---------IRYHSSQLKAYYISLKGVTVGTTR 303
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
+ IP S DG GG IVDSG+ T + ++ V F+ Q ++ S L
Sbjct: 304 LPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQ----TKLTVHNSTSSL 359
Query: 392 -RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCLILFTDNAAGP 446
+ CF + +P L+L F+ GA + LP ENY + G + CL + AG
Sbjct: 360 SQLCFSVPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAI----NAGE 414
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
L ++G+FQ QN ++ +DLAND F +C
Sbjct: 415 DLS-----VIGNFQQQNMHVLYDLANDMLSFVPARC 445
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 115/372 (30%), Positives = 165/372 (44%), Gaps = 55/372 (14%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
+ DTGS + W C C DC + DP F P S+S + C + +C +
Sbjct: 1 MVLDTGSDVTWVQCQP---CADC-YQQSDP----VFDPSLSASYAAVSCDSQRCRDL--- 49
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGCSIL 237
+ RN T AC Y + YG G +T G +ETL T V N GC
Sbjct: 50 ------DTAACRNATG--AC-LYEVAYGDGSYTVGDFATETLTLGDSTPVGNVAIGCG-- 98
Query: 238 SDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
D + + G + S PSQ+ FSYCL+ R D+P +S L G
Sbjct: 99 HDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDR---DSPAASTLQFGDGAAEAG 155
Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIV 351
+ T P ++P S+ FYYV L I VG + + IP S + + + G+GGVIV
Sbjct: 156 TVT-----APLVRSPRTST-----FYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIV 205
Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
DSG+ T ++ + A+ F++ + R + V S C+D+S + SV +P + L+
Sbjct: 206 DSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGV---SLFDTCYDLSDRTSVEVPAVSLR 262
Query: 412 FKGGAKMALPPENYFALV-GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDL 470
F+GG + LP +NY V G CL NAA I+G+ Q Q + FD
Sbjct: 263 FEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAA--------VSIIGNVQQQGTRVSFDT 314
Query: 471 ANDRFGFAKQKC 482
A GF KC
Sbjct: 315 ARGAVGFTPNKC 326
>gi|224035171|gb|ACN36661.1| unknown [Zea mays]
Length = 378
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 103/311 (33%), Positives = 144/311 (46%), Gaps = 32/311 (10%)
Query: 198 ACPSYLLQYGLGFTAGLLLSETLRFPSKT-------VPNFLAGCSILSDRQPAGIAGFGR 250
ACP YG G L + + V NF C+ + +P G+AGFGR
Sbjct: 64 ACPPLYYAYGDGSLVAHLRRGRVALGAGARASVAVAVDNFTFACAHTALGEPVGVAGFGR 123
Query: 251 SSESLPSQLGLK---KFSYCLLSRKF--DDAPVSSNLVLDTGPGSGDS--KTPGLSYTPF 303
SLP QL + +FSYCL+S F D S L+L P + +T G YTP
Sbjct: 124 GPLSLPGQLSPQLSGRFSYCLVSHSFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPL 183
Query: 304 YKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGP 363
NP FY V L + VG+ ++ GNGG++VDSG+TFT +
Sbjct: 184 LHNP-----KHPYFYSVALEAVSVGAARIQARPELARVDRAGNGGMVVDSGTTFTMLPNE 238
Query: 364 LFEAVAKEFIRQMGNYSRAA--DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALP 421
++ VA+ F R M A E+++GL PC+ + +P L L F+G A +ALP
Sbjct: 239 MYARVAEAFARAMAAAGFARAERAEEQTGLTPCYRYAASDR-GVPPLALHFRGNATVALP 297
Query: 422 PENYF----------ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
NYF ++V CL+L A G GPA LG+FQ Q F + +D+
Sbjct: 298 RRNYFMGFKSEDAGAGTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVD 357
Query: 472 NDRFGFAKQKC 482
R GFA+++C
Sbjct: 358 AGRVGFARRRC 368
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 135/475 (28%), Positives = 193/475 (40%), Gaps = 90/475 (18%)
Query: 36 LTPLSTKHYLHHSDSDPLKI--LH---------------SLASSSLSRARHLKTKTKPKT 78
L P T + +HH D L + LH +L S S + L+T+ KP+
Sbjct: 86 LHPRETIYKIHHKDYKSLVLSRLHRDTVRFNSLTARLQLALEDISKSDLKPLETEIKPED 145
Query: 79 KDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYR 138
+ + S S G Y + G P + + DTGS + W C
Sbjct: 146 LSTPVTSGTS----------QGSGEYFTRVGVGNPARQFY-MVLDTGSDINWLQCQP--- 191
Query: 139 CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA 198
C DC + DP F P SS+ + CQ+ +CS + + S R+ C
Sbjct: 192 CTDC-YQQTDP----IFDPTASSTYAPVTCQSQQCSSL---------EMSSCRSGQCL-- 235
Query: 199 CPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSILSDRQPAGIAGFGRSSE--- 253
Y + YG G +T G +E++ F S +V N GC D + + G
Sbjct: 236 ---YQVNYGDGSYTFGDFATESVSFGNSGSVKNVALGCG--HDNEGLFVGAAGLLGLGGG 290
Query: 254 --SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS 311
SL +QL FSYCL++R D+ SS L ++ DS T P KN
Sbjct: 291 PLSLTNQLKATSFSYCLVNR---DSAGSSTLDFNSAQLGVDSVT-----APLMKN----- 337
Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
FYYVGL + VG + V IP S GNGG+IVD G+ T ++ + +
Sbjct: 338 RKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQAYNPLRDA 397
Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
F+R N + V + C+D+SG+ SV +P + F G LP NY V
Sbjct: 398 FVRMTQNLKLTSAV---ALFDTCYDLSGQASVRVPTVSFHFADGKSWNLPAANYLIPV-- 452
Query: 432 EVLCLILFTDNAAGPALGRGPAI----ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
D+A P I+G+ Q Q + FDLAN+R GF+ KC
Sbjct: 453 ---------DSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 122/392 (31%), Positives = 177/392 (45%), Gaps = 67/392 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
GGY++++S GTP + + DTGS L+W C +C P+ P F P SS
Sbjct: 84 GGYNMNISVGTP-LLTFSVVADTGSDLIWTQCAPCTKCFQ------QPA--PPFQPASSS 134
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
+ + C + C ++ PN +TC Y +YG G+TAG L +ETL+
Sbjct: 135 TFSKLPCTSSFCQFL--PN----------SIRTCNATGCVYNYKYGSGYTAGYLATETLK 182
Query: 222 FPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSN 281
+ P+ GCS + G G+ LG+ +FSYCL R A S
Sbjct: 183 VGDASFPSVAFGCSTEN--------GLGQ------LDLGVGRFSYCL--RSGSAAGASPI 226
Query: 282 LVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
L GS + T G + TPF NP S +YYV L I VG + + S
Sbjct: 227 LF-----GSLANLTDGNVQSTPFVNNPAVHPS----YYYVNLTGITVGETDLPVTTSTFG 277
Query: 341 PGSDG-NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS- 398
+G GG IVDSG+T T++ +E V + F+ Q + + V GL CF +
Sbjct: 278 FTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTT---VNGTRGLDLCFKSTG 334
Query: 399 -GKKSVYLPELILKFKGGAKMALPPENYFALVGNE------VLCLILFTDNAAGPALGRG 451
G + +P L+L+F GGA+ A+P YFA V + V CL++ PA G
Sbjct: 335 GGGGGIAVPSLVLRFDGGAEYAVP--TYFAGVETDSQGSVTVACLMML------PAKGDQ 386
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
P ++G+ + +L +DL F FA CA
Sbjct: 387 PMSVIGNVMQMDMHLLYDLDGGIFSFAPADCA 418
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 122/399 (30%), Positives = 170/399 (42%), Gaps = 69/399 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + ++ GTPP I DTGS LVW C+S + D F P RSS+
Sbjct: 103 YLMYVNVGTPP-TQLLAIADTGSDLVWVNCSSS----GGGLADADAGGNVVFQPTRSSTY 157
Query: 164 QLIGCQNPKCSWIFGP--NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ CQ+ C + + +S C+ Y YG G T G+L +ET
Sbjct: 158 SQLSCQSNACQALSQASCDADSECQ---------------YQYSYGDGSRTIGVLSTETF 202
Query: 221 RFPSK------TVPNFLAGCSILSDR--QPAGIAGFGRSSESLPSQLGL-----KKFSYC 267
F VP GCS S + G+ G G + SL SQLG +K SYC
Sbjct: 203 SFVDGGGKGQVRVPRVNFGCSTASAGTFRSDGLVGLGAGAFSLVSQLGATTHIDRKLSYC 262
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV 327
L+ DA SS L + PG + TP + V S +Y V L + V
Sbjct: 263 LIPSY--DANSSSTLNFGS---RAVVSEPGAASTPLVPSDVDS------YYTVALESVAV 311
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
G + V S +IVDSG+T TF++ L + E R++ R E+
Sbjct: 312 GGQEVATHDSR----------IIVDSGTTLTFLDPALLGPLVTELERRI-KLQRVQPPEQ 360
Query: 388 KSGLRPCFDISGKKSVY---LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA 444
L+ C+D+ GK +P++ L+F GGA + L PEN F+L+ LCL+L
Sbjct: 361 L--LQLCYDVQGKSETDNFGIPDVTLRFGGGAAVTLRPENTFSLLQEGTLCLVLV----- 413
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
P P ILG+ QNF++ +DL FA CA
Sbjct: 414 -PVSESQPVSILGNIAQQNFHVGYDLDARTVTFAAADCA 451
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 122/400 (30%), Positives = 174/400 (43%), Gaps = 71/400 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + ++ GTPP A I DTGS LVW C+S + V F P RS++
Sbjct: 100 YLMYVNVGTPP-AQMLAIADTGSDLVWVNCSSNGGGGGASDGAV------VFHPSRSTTY 152
Query: 164 QLIGCQNPKCSWIFGP--NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
L+ CQ+ C + + +S C+ Y YG G T G+L +ET
Sbjct: 153 SLLSCQSAACQALSQASCDADSECQ---------------YQYAYGDGSRTIGVLSTETF 197
Query: 221 RFPSKT--------VPNFLAGCSILS--DRQPAGIAGFGRSSESLPSQLGL-----KKFS 265
F + VP GCS S + G+ G G + SL SQLG ++FS
Sbjct: 198 SFAAAGGGGEGQVRVPRVSFGCSTGSAGSFRSDGLVGLGAGALSLVSQLGAAARIARRFS 257
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
YCL+ + A SS L D PG + TP + V S +Y V L +
Sbjct: 258 YCLVP-PYAAANSSSTLSFGARAVVSD---PGAASTPLVPSEVDS------YYTVALESV 307
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
V + V S + +IVDSG+T TF++ L + E R++ RA
Sbjct: 308 AVAGQDV---------ASANSSRIIVDSGTTLTFLDPALLRPLVAELERRI-RLPRAQPP 357
Query: 386 EKKSGLRPCFDISGKKSVY---LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
E+ L+ C+D+ GK +P++ L+F GGA + L PEN F+L+ LCL+L
Sbjct: 358 EQL--LQLCYDVQGKSQAEDFGIPDVTLRFGGGASVTLRPENTFSLLEEGTLCLVLV--- 412
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
P P ILG+ QNF++ +DL FA C
Sbjct: 413 ---PVSESQPVSILGNIAQQNFHVGYDLDARTVTFAAVDC 449
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 128/455 (28%), Positives = 192/455 (42%), Gaps = 80/455 (17%)
Query: 54 KILHSLASSSLSR-ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGT 112
++L +A+ S +R AR L + D +Y++ + T VH ++ GT
Sbjct: 71 ELLRRMAARSKARSARLLSGRAASARMDPG---SYTDGVPDTEYLVH--------MAIGT 119
Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
PPQ I DTGS L W C CV C + +P F P RS + ++ C
Sbjct: 120 PPQ-PVQLILDTGSDLTWTQCAP---CVSCFRQS-----LPRFNPSRSMTFSVLPCDLRI 170
Query: 173 C---SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK--- 225
C +W S C S N C Y Y T G L S+T F S
Sbjct: 171 CRDLTW-------SSCGEQSWGNGIC-----VYAYAYADHSITTGHLDSDTFSFASADHA 218
Query: 226 ----TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
+VP+ GC + ++ GIAGF R + S+P+QL + FSYC + +
Sbjct: 219 IGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEPS 278
Query: 278 -----VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
V NL D G G + ++ + + + YY+ L+ + VG+ +
Sbjct: 279 PVFLGVPPNLYSDAA-GGGHGVVQSTALIRYHSSQLKA-------YYISLKGVTVGTTRL 330
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL- 391
IP S DG GG IVDSG+ T + ++ V F+ Q ++ S L
Sbjct: 331 PIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQ----TKLTVHNSTSSLS 386
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCLILFTDNAAGPA 447
+ CF + +P L+L F+ GA + LP ENY + G + CL + AG
Sbjct: 387 QLCFSVPPGAKPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAI----NAGED 441
Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
L ++G+FQ QN ++ +DLAND F +C
Sbjct: 442 LS-----VIGNFQQQNMHVLYDLANDMLSFVPARC 471
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 126/390 (32%), Positives = 172/390 (44%), Gaps = 52/390 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS +VW C +C + D F P +S
Sbjct: 116 GEYFTRIGVGTPARY-VYMVLDTGSDVVWLQCAPCRKC----YTQTDH----VFDPTKSR 166
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ I C P C + P GCS +NK C Y + YG G FT G +ETL
Sbjct: 167 TYAGIPCGAPLCRRLDSP-------GCSNKNKVC-----QYQVSYGDGSFTFGDFSTETL 214
Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
F V GC ++ AG+ G GR S P Q G + KFSYCL+ R
Sbjct: 215 TFRRNRVTRVALGCGHDNEGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSYCLVDRSAS 274
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK- 333
P SS + D S S+T +TP KNP FYY+ L I VG V+
Sbjct: 275 AKP-SSVIFGD----SAVSRTA--HFTPLIKNP-----KLDTFYYLELLGISVGGAPVRG 322
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
+ S + GNGGVI+DSG++ T + P + A+ F + RA + S
Sbjct: 323 LSASLFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAFRIGASHLKRAPEF---SLFDT 379
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
CFD+SG V +P ++L F+ GA ++LP NY V N F +G +
Sbjct: 380 CFDLSGLTEVKVPTVVLHFR-GADVSLPATNYLIPVDNSGSFCFAFAGTMSGLS------ 432
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G+ Q Q F + +DL R GFA + C
Sbjct: 433 -IIGNIQQQGFRISYDLTGSRVGFAPRGCV 461
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 140/465 (30%), Positives = 199/465 (42%), Gaps = 88/465 (18%)
Query: 42 KHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSN--IGSNYSNSLIKTPLSVH 99
++ LH D L++L + SL A K+ K++N + ++ L ++ LS
Sbjct: 22 RNRLHR---DELRLLSISSRISLGVAGIPKSSLTNPLKNTNPFLQQDFETPL-RSGLSDG 77
Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFI 156
S G Y +SL GTPP+ + + DTGS ++W PC S Y D P F
Sbjct: 78 S-GEYFVSLGVGTPPR-TVNMVADTGSDVLWLQCLPCQSCYGQTD-----------PLFN 124
Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLL 215
P SS+ Q I C + C + +GC R C Y + YG G FT G
Sbjct: 125 PSFSSTFQSITCGSSLCQQLL-------IRGC--RRNQCL-----YQVSYGDGSFTVGEF 170
Query: 216 LSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLL 269
+ETL F S V + GC + AG+ G G+ S PSQ+G FSYCL
Sbjct: 171 STETLSFGSNAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLP 230
Query: 270 SRK--------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
+R+ F + V+SN +T NP FYYV
Sbjct: 231 TRESTGSVPLIFGNQAVASNA----------------QFTTLLTNP-----KLDTFYYVE 269
Query: 322 LRQIIVGSKHVKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
+ I VG V IP L + S GNGGVI+DSG+ T + + + F M
Sbjct: 270 MVGIKVGGTSVNIPAGSLSLDSSTGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGM---- 325
Query: 381 RAADVEKKSGLR---PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
+D + SG C+D+SG+ S+ LP + F GGA MALP +N V N +
Sbjct: 326 -PSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMALPAQNIMVPVDNSGTYCL 384
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
F N+ + I+G+ Q Q+F + FD +R G +C
Sbjct: 385 AFAPNSENFS-------IIGNIQQQSFRMSFDSTGNRVGIGANQC 422
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 117/396 (29%), Positives = 163/396 (41%), Gaps = 49/396 (12%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G PP + + DTGS L+W C RC + V P + P+ S
Sbjct: 90 GEYFAVIGVGDPPTHAL-VVIDTGSDLIWLQCLPCRRC----YRQV----TPLYDPRNSK 140
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ + I C +P+C + R GC R C Y++ YG G ++G L ++TL
Sbjct: 141 THRRIPCASPQCRGVL------RYPGCDARTGGC-----VYMVVYGDGSASSGDLATDTL 189
Query: 221 RFPSKT-VPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKF 273
P T V N GC ++ AG+ G GR S P+QL FSYCL R
Sbjct: 190 VLPDDTRVHNVTLGCGHDNEGLLASAAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMS 249
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
SS LV P + P ++TP NP S YYV + VG + V
Sbjct: 250 RARNSSSYLVFGRTP-----ELPSTAFTPLRTNPRRPS-----LYYVDMVGFSVGGERVA 299
Query: 334 --IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
S + + G GGV+VDSG+ + + AV F+ K S
Sbjct: 300 GFSNASLALNPATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVF 359
Query: 392 RPCFDISGK---KSVYLPELILKFKGGAKMALPPENYFA-LVGNEVLCLILFTDNAAGPA 447
C+D+ G V +P ++L F A MALP NY +VG + AA
Sbjct: 360 DTCYDVHGNGPGTGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDG 419
Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
L +LG+ Q Q F + FD+ R GF C+
Sbjct: 420 LN-----VLGNVQQQGFGVVFDVERGRIGFTPNGCS 450
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 134/457 (29%), Positives = 199/457 (43%), Gaps = 79/457 (17%)
Query: 50 SDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGS------NYSNSLIKTPLSVHSYGG 103
+D L + L S LS A+ L T K +T S+ YS +L+
Sbjct: 41 TDSLSLSFPLTSLPLSTAKPLNTNPKLRTLSSSSSYNIKSSFKYSMALV----------- 89
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
++L GTPPQ + DTGS L W C ++ P+ +F P SSS
Sbjct: 90 --VTLPIGTPPQPQQ-MVLDTGSQLSWIQCHNK----------TPPTA--SFDPSLSSSF 134
Query: 164 QLIGCQNPKCSWIFGPNV-ESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF 222
++ C +P C P V + +N+ C SY G + G L+ E L F
Sbjct: 135 YVLPCTHPLCK----PRVPDFTLPTTCDQNRLCHY---SYFYADGT-YAEGNLVREKLAF 186
Query: 223 -PSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSN 281
PS+T P + GCS S R GI G S P Q + KFSYC+ +R+ P ++N
Sbjct: 187 SPSQTTPPLILGCSSES-RDARGILGMNLGRLSFPFQAKVTKFSYCVPTRQ----PANNN 241
Query: 282 ------LVLDTGPGSGDSK-TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
L P S + L++ + P A Y V ++ I +G + + I
Sbjct: 242 NFPTGSFYLGNNPNSARFRYVSMLTFPQSQRMPNLDPLA----YTVPMQGIRIGGRKLNI 297
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-------YSRAADVEK 387
P S P + G+G +VDSGS FTF+ ++ V +E IR +G Y AD+
Sbjct: 298 PPSVFRPNAGGSGQTMVDSGSEFTFLVDVAYDRVREEIIRVLGPRVKKGYVYGGVADM-- 355
Query: 388 KSGLRPCFDISGKK-SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
CFD + + L ++ +F+ G ++ +P E A VG V C+ + G
Sbjct: 356 ------CFDGNAMEIGRLLGDVAFEFEKGVEIVVPKERVLADVGGGVHCVGIGRSERLGA 409
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A + I+G+F QN ++EFDLAN R GF C+
Sbjct: 410 A-----SNIIGNFHQQNLWVEFDLANRRIGFGVADCS 441
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 124/441 (28%), Positives = 185/441 (41%), Gaps = 55/441 (12%)
Query: 52 PLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFG 111
P + L S +RA +L ++ P + ++ S S + + L S G Y + + G
Sbjct: 76 PRHAVLDLVSRDNARAEYLASRLSPAYQPTDFFG--SESKVVSGLDEGS-GEYFVRVGIG 132
Query: 112 TPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNP 171
+PP + D+GS ++W C C++C + DP F P S++ + C +
Sbjct: 133 SPPTEQY-LVVDSGSDVIWVQCKP---CLEC-YAQADP----LFDPASSATFSAVSCGSA 183
Query: 172 KCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNF 230
C + R GC Y + YG G +T G L ETL V
Sbjct: 184 ICRTL-------RTSGCGDSGGC------EYEVSYGDGSYTKGTLALETLTLGGTAVEGV 230
Query: 231 LAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDD---APVSSN 281
GC + AG+ G G SL QLG FSYCL SR A + +
Sbjct: 231 AIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGS 290
Query: 282 LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVP 341
LVL G ++ G + P +NP S FYYVG+ I VG + + +
Sbjct: 291 LVL----GRSEAVPEGAVWVPLVRNPQAPS-----FYYVGVSGIGVGDERLPLQDGLFQL 341
Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKK 401
DG GGV++D+G+ T + + A+ F+ +G RA V S L C+D+SG
Sbjct: 342 TEDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGV---SLLDTCYDLSGYT 398
Query: 402 SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQL 461
SV +P + F G A + LP N V + CL F +++G + ILG+ Q
Sbjct: 399 SVRVPTVSFYFDGAATLTLPARNLLLEVDGGIYCLA-FAPSSSGLS-------ILGNIQQ 450
Query: 462 QNFYLEFDLANDRFGFAKQKC 482
+ + D AN GF C
Sbjct: 451 EGIQITVDSANGYIGFGPATC 471
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 139/466 (29%), Positives = 195/466 (41%), Gaps = 90/466 (19%)
Query: 42 KHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSV--- 98
++ LH D L++L + SL A K+ K++N + +TPL
Sbjct: 22 RNRLHR---DELRLLSISSRISLGVAGIPKSSLTNPLKNTNP---FLQQDFETPLRSGLS 75
Query: 99 HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAF 155
G Y +SL GTPP+ + + DTGS ++W PC S Y D P F
Sbjct: 76 DGSGEYFVSLGVGTPPR-TVNMVADTGSDVLWLQCLPCQSCYGQTD-----------PLF 123
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGL 214
P SS+ Q I C + C + +GC R C Y + YG G FT G
Sbjct: 124 NPSFSSTFQSITCGSSLCQQLL-------IRGC--RRNQCL-----YQVSYGDGSFTVGE 169
Query: 215 LLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCL 268
+ETL F S V + GC + AG+ G G+ S PSQ+G FSYCL
Sbjct: 170 FSTETLSFGSNAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCL 229
Query: 269 LSRK--------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
+R+ F + V+SN +T NP FYYV
Sbjct: 230 PTRESTGSVPLIFGNQAVASNA----------------QFTTLLTNP-----KLDTFYYV 268
Query: 321 GLRQIIVGSKHVKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
+ I VG V IP L + S GNGGVI+DSG+ T + + + F M
Sbjct: 269 EMVGIKVGGTSVSIPAGSLSLDSSTGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGM--- 325
Query: 380 SRAADVEKKSGLR---PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL 436
+D + SG C+D+SG+ S+ LP + F GGA MALP +N V N
Sbjct: 326 --PSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMALPAQNIMVPVDNSGTYC 383
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ F N+ + I+G+ Q Q+F + FD +R G +C
Sbjct: 384 LAFAPNSENFS-------IIGNIQQQSFRMSFDSTGNRVGIGANQC 422
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 123/429 (28%), Positives = 184/429 (42%), Gaps = 63/429 (14%)
Query: 63 SLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG-YSISLSFGTPPQASTPFI 121
+ +RA H +++ + + +G+ + S ++PL + S GG Y ++ S GTPPQ + +
Sbjct: 41 NFTRAAH-RSRERLSILATRLGAASAGS-AQSPLQMDSGGGAYDMTFSMGTPPQTLSA-L 97
Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
DTGS L+W C + RC P ++ P +SSS + C + C + ++
Sbjct: 98 ADTGSDLIWAKCGACKRCA--------PRGSASYYPTKSSSFSKLPCSSALCRTLESQSL 149
Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLG-----FTAGLLLSETLRFPSKTVPNFLAGCSI 236
+ C G R C SY YGL +T G + SET S V GC+
Sbjct: 150 AT-CGGTRARGAVC-----SYRYSYGLSSNPHHYTQGYMGSETFTLGSDAVQGIGFGCTT 203
Query: 237 LSDRQPAGIAGFG---RSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
+S+ +G R SL QL + FSYCL S D SS L+ G +G
Sbjct: 204 MSEGGYGSGSGLVGLGRGKLSLVRQLKVGAFSYCLTS----DPSTSSPLLFGAGALTG-- 257
Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDS 353
PG+ TP + FY V L I +G+ PG+ G G+I DS
Sbjct: 258 --PGVQSTPLVNLKTST------FYTVNLDSISIGAAK--------TPGT-GRHGIIFDS 300
Query: 354 GSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFK 413
G+T TF+ P + + Q N +R V G CF SG P ++L F
Sbjct: 301 GTTLTFLAEPAYTLAEAGLLSQTTNLTR---VPGTDGYEVCFQTSG--GAVFPSMVLHFD 355
Query: 414 GGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
GG MAL ENYF V + V C ++ P+ I+G+ ++++ +DL
Sbjct: 356 GG-DMALKTENYFGAVNDSVSCWLV----QKSPS----EMSIVGNIMQMDYHIRYDLDKS 406
Query: 474 RFGFAKQKC 482
F C
Sbjct: 407 VLSFQPTNC 415
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 118/415 (28%), Positives = 176/415 (42%), Gaps = 72/415 (17%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
LS H ++SL+ G+PPQ T + DTGS L W C PN+ F
Sbjct: 48 LSFHHNVSLTVSLTVGSPPQTVT-MVLDTGSELSWLHCKKA--------PNLHS----VF 94
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA--- 212
P RSSS I C +P C R+ + P++C L + + A
Sbjct: 95 DPLRSSSYSPIPCTSPTCR-------------TRTRDFSIPVSCDKKKLCHAIISYADAS 141
Query: 213 ---GLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLK 262
G L S+T + +P + GC + D + G+ G R S S +Q+GL+
Sbjct: 142 SIEGNLASDTFHIGNSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQ 201
Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF----Y 318
KFSYC+ + S+ +L G S S L YTP V S+ F Y
Sbjct: 202 KFSYCISGQD-------SSGILLFGESSF-SWLKALKYTPL----VQISTPLPYFDRVAY 249
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
V L I V + +++P S P G G +VDSG+ FTF+ GP++ A+ EF+RQ
Sbjct: 250 TVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKA 309
Query: 379 YSRAADVEK---KSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALV---- 429
+ + + + C+ + + LP + L F+ GA+M++ E V
Sbjct: 310 SLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFR-GAEMSVSAERLMYRVPGVI 368
Query: 430 --GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ V C G + I+G QN ++EFDLA R GFA+ +C
Sbjct: 369 RGSDSVYCFTFGNSELLGVE-----SYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 418
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 115/402 (28%), Positives = 171/402 (42%), Gaps = 60/402 (14%)
Query: 98 VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
V Y Y I GTP DTGS +VW C + C P D S
Sbjct: 86 VVGYTEYLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSA------ 139
Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLL 216
S + + C +P C + R C L +Y + YG T G L
Sbjct: 140 --SDTVHGVLCTDPICRAL--------------RPHACFLGGCTYQVNYGDNSVTIGQLA 183
Query: 217 SETLRFPSK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYC 267
++ F K TVP+ + GC + GIAGFGR SLP QLG+ FSYC
Sbjct: 184 KDSFTFDGKGGGKVTVPDLVFGCGQYNTGNFHSNETGIAGFGRGPLSLPRQLGVSSFSYC 243
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSG---DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
+ F+ S+ + L P G + P LS TPF N E+YY+ L+
Sbjct: 244 FTT-IFESK--STPVFLGGAPADGLRAHATGPILS-TPFLPN-------HPEYYYLSLKG 292
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
I VG + +P S V +DG+GG I+DSG+ T +F ++ + F+ Q+ + +
Sbjct: 293 ITVGKTRLAVPESAFVVKADGSGGTIIDSGTAITAFPRAVFRSLWEAFVAQVPLPHTSYN 352
Query: 385 VEKKSGLRPCF---DISGKKSVYLPELILKFKGGAKMALPPENYFALV-GNEVLCLILFT 440
+ L+ CF + V +P++ L + GA LP ENY A ++ LC+++
Sbjct: 353 DTGEPTLQ-CFSTESVPDASKVPVPKMTLHLE-GADWELPRENYMAEYPDSDQLCVVVLA 410
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G ++G+FQ QN ++ DLA ++ +C
Sbjct: 411 --------GDDDRTMIGNFQQQNMHIVHDLAGNKLVIEPAQC 444
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 121/400 (30%), Positives = 191/400 (47%), Gaps = 55/400 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G PP+ I DTGS L W C C D + P DPS+ S+
Sbjct: 85 GEYFMDVFVGNPPRHFL-LIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQ--------ST 135
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
S ++I C C + V C+ S +KT P C Y YG T+G L E+L
Sbjct: 136 SFKIIPCNAAACDLV----VHDECRDNS--SKTSPKTC-KYFYWYGDSSRTSGDLALESL 188
Query: 221 RFP------SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQLGL----KKFSYC 267
S + + + GC + + G+ G G+ + S PSQL + FSYC
Sbjct: 189 SVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYC 248
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKT-PGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
L+ R ++ VSS + G G S+ + +TPF V ++++ FYY+G++ I
Sbjct: 249 LVDRT-NNLSVSSAISF--GAGFALSRHFDQMKFTPF----VRTNNSVETFYYLGIQGIK 301
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
+ + + IP ++G+GG I+DSG+T T++ + AV F+ ++ +Y RA +
Sbjct: 302 IDQELLPIPAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFD 360
Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL--CL-ILFTDNA 443
L C++ +G+ +V P L + F+ GA++ LP ENYF + CL IL TD
Sbjct: 361 I---LGICYNATGRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGM 417
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ I+G+FQ QN + +D+ + R GFA C+
Sbjct: 418 S----------IIGNFQQQNIHFLYDVQHARLGFANTDCS 447
>gi|18414692|ref|NP_567506.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15809800|gb|AAL06828.1| AT4g16560/dl4305c [Arabidopsis thaliana]
gi|18377815|gb|AAL67094.1| AT4g16560/dl4305c [Arabidopsis thaliana]
gi|332658370|gb|AEE83770.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 126/410 (30%), Positives = 187/410 (45%), Gaps = 65/410 (15%)
Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
DTGS LVWFPC + C+ C + PS + ++ S + S + ++
Sbjct: 100 LDTGSDLVWFPCRP-FTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSAAHSSLPSSDL 158
Query: 182 ESRCKGCSPRNKTCPLA-------------CPSYLLQYGLGFTAGLLLSETLRFPSKTVP 228
C+ N CPL CP + YG G L S++L PS +V
Sbjct: 159 ------CAISN--CPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVS 210
Query: 229 NFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLSRKFDDAPVS--S 280
NF GC+ + +P G+AGFGR SLP+QL + FSYCL+S FD V S
Sbjct: 211 NFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDRVRRPS 270
Query: 281 NLVLDTGPGSGDSKT----------------PGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
L+L + + +T +NP FY V L+
Sbjct: 271 PLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENP-----KHPYFYSVSLQG 325
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAA 383
I +G +++ P +G GGV+VDSG+TFT + + +V +EF ++G + RA
Sbjct: 326 ISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHERAD 385
Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGG-AKMALPPENYFALVGN---------EV 433
VE SG+ PC+ ++ ++V +P L+L F G + + LP NYF + ++
Sbjct: 386 RVEPSSGMSPCYYLN--QTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKI 443
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
CL+L G G ILG++Q Q F + +DL N R GFAK+KCA
Sbjct: 444 GCLMLMNGGDESELRG-GTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCA 492
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 174/387 (44%), Gaps = 55/387 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G PP + + DTGS + W C C +C + DP F P S+
Sbjct: 149 GEYFSRVGIGRPP-SPVYMVLDTGSDVSWVQCAP---CAEC-YEQTDP----XFEPTSSA 199
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C+ +C + +V S C RN TC Y + YG G +T G ++ET+
Sbjct: 200 SFTSLSCETEQCKSL---DV-SEC-----RNGTCL-----YEVSYGDGSYTVGDFVTETV 245
Query: 221 RFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
S ++ N GC ++ AG+ G G S S PSQL FSYCL+ R D+
Sbjct: 246 TLGSTSLGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDR---DSD 302
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
+S L ++ TP P ++NP F+Y+GL + VG + IP +
Sbjct: 303 STSTLDFNS------PITPDAVTAPLHRNP-----NLDTFFYLGLTGMSVGGAVLPIPET 351
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
DGNGG+IVDSG+ T ++ ++ + F++ + A V + C+D+
Sbjct: 352 SFQMSEDGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGV---ALFDTCYDL 408
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF--TDNAAGPALGRGPAII 455
S K V +P + F G ++ LP +NY V +E F TD+ I
Sbjct: 409 SSKSRVEVPTVSFHFANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLS---------I 459
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
LG+ Q Q + FDLAN GF+ KC
Sbjct: 460 LGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 118/415 (28%), Positives = 176/415 (42%), Gaps = 72/415 (17%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
LS H ++SL+ G+PPQ T + DTGS L W C PN+ F
Sbjct: 55 LSFHHNVSLTVSLTVGSPPQTVT-MVLDTGSELSWLHCKKA--------PNLHS----VF 101
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA--- 212
P RSSS I C +P C R+ + P++C L + + A
Sbjct: 102 DPLRSSSYSPIPCTSPTCR-------------TRTRDFSIPVSCDKKKLCHAIISYADAS 148
Query: 213 ---GLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLK 262
G L S+T + +P + GC + D + G+ G R S S +Q+GL+
Sbjct: 149 SIEGNLASDTFHIGNSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQ 208
Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF----Y 318
KFSYC+ + S+ +L G S S L YTP V S+ F Y
Sbjct: 209 KFSYCISGQD-------SSGILLFGESSF-SWLKALKYTPL----VQISTPLPYFDRVAY 256
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
V L I V + +++P S P G G +VDSG+ FTF+ GP++ A+ EF+RQ
Sbjct: 257 TVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKA 316
Query: 379 YSRAADVEK---KSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALV---- 429
+ + + + C+ + + LP + L F+ GA+M++ E V
Sbjct: 317 SLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFR-GAEMSVSAERLMYRVPGVI 375
Query: 430 --GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ V C G + I+G QN ++EFDLA R GFA+ +C
Sbjct: 376 RGSDSVYCFTFGNSELLGVE-----SYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 425
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 115/391 (29%), Positives = 167/391 (42%), Gaps = 46/391 (11%)
Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSS 162
G+S+++ P + I DTGS L+W C + P + P SS+
Sbjct: 15 GHSLTVGIVQPRK----LIVDTGSDLIWTQC----KLSSSTAAAARHGSPPVYDPGESST 66
Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF 222
+ C + C + K C+ +N+ Y YG G+L SET F
Sbjct: 67 FAFLPCSDRLCQ-----EGQFSFKNCTSKNRCV------YEDVYGSAAAVGVLASETFTF 115
Query: 223 PSKTVPNFLAG--CSILSDRQ---PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
++ + G C LS GI G S SL +QL +++FSYCL F D
Sbjct: 116 GARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYCL--TPFADKK 173
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S L S T + T NPV + +YYV L I +G K + +P +
Sbjct: 174 TSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETV-----YYYVPLVGISLGHKRLAVPAA 228
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
L DG GG IVDSGST ++ FEAV KE + + A + L CF +
Sbjct: 229 SLAMRPDGGGGTIVDSGSTVAYLVEAAFEAV-KEAVMDVVRLPVANRTVEDYEL--CFVL 285
Query: 398 ------SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
+ ++V +P L+L F GGA M LP +NYF ++CL A G
Sbjct: 286 PRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCL------AVGKTTDGS 339
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q QN ++ FD+ + +F FA +C
Sbjct: 340 GVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 370
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 121/406 (29%), Positives = 170/406 (41%), Gaps = 60/406 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIP--AFIPKR 159
G Y + L GTPPQ + DTGS LVW C++ C +C P AF+ +
Sbjct: 87 GQYFVDLRLGTPPQKLL-LVADTGSDLVWVKCSA---CRNCT------RHTPGSAFLARH 136
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLGF-TAGLLLS 217
S++ C + C + P RC L P Y YG G T+G
Sbjct: 137 STTFSPNHCYDSACQLVPLPK-HHRCNHAR-------LHSPCRYEYSYGDGSKTSGFFSK 188
Query: 218 ETLRFPSKT-----VPNFLAGCSI---------LSDRQPAGIAGFGRSSESLPSQLGLK- 262
ET + + + GC+ S G+ G GR SL SQLG +
Sbjct: 189 ETTTLNTSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRF 248
Query: 263 --KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPFYKNPVGSSSAFGEF 317
KFSYCL+ +P S L+ T + PG + +TP + NP+ + F
Sbjct: 249 GNKFSYCLMDHDISPSPTSYLLIGSTQ----NDVAPGKRRMRFTPLHINPLSPT-----F 299
Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
YY+G+ + V + I S GNGG IVDSG+T TF+ P + + R++
Sbjct: 300 YYIGIESVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVR 359
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
S A E G C ++S + LP+L K G + + PP NYF +V CL
Sbjct: 360 LPSPA---EPTPGFDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLA 416
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
L A G ++G+ Q F LEFD R GF++ CA
Sbjct: 417 L---QAVMTPSGFS---VIGNLMQQGFLLEFDKDRTRLGFSRHGCA 456
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 174/387 (44%), Gaps = 55/387 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G PP + + DTGS + W C C +C + DP F P S+
Sbjct: 149 GEYFSRVGIGRPP-SPVYMVLDTGSDVSWVQCAP---CAEC-YEQTDP----IFEPTSSA 199
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C+ +C + +V S C RN TC Y + YG G +T G ++ET+
Sbjct: 200 SFTSLSCETEQCKSL---DV-SEC-----RNGTCL-----YEVSYGDGSYTVGDFVTETV 245
Query: 221 RFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
S ++ N GC ++ AG+ G G S S PSQL FSYCL+ R D+
Sbjct: 246 TLGSTSLGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDR---DSD 302
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
+S L ++ TP P ++NP F+Y+GL + VG + IP +
Sbjct: 303 STSTLDFNS------PITPDAVTAPLHRNP-----NLDTFFYLGLTGMSVGGAVLPIPET 351
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
DGNGG+IVDSG+ T ++ ++ + F++ + A V + C+D+
Sbjct: 352 SFQMSEDGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGV---ALFDTCYDL 408
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF--TDNAAGPALGRGPAII 455
S K V +P + F G ++ LP +NY V +E F TD+ I
Sbjct: 409 SSKSRVEVPTVSFHFANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLS---------I 459
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
LG+ Q Q + FDLAN GF+ KC
Sbjct: 460 LGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 115/391 (29%), Positives = 165/391 (42%), Gaps = 55/391 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G+P + + DTGS + W C+ C C N F P+ SS
Sbjct: 12 GEYFVRVGIGSPTKLQY-LVMDTGSDVPWIQCSP---CKSCYKQN-----DAVFDPRASS 62
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + + C P+C + K C+ + C Y + YG G FT G L S++
Sbjct: 63 SFRRLSCSTPQCKLL-------DVKACASTDNRCL-----YQVSYGDGSFTVGDLASDSF 110
Query: 221 RFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDD 275
+ GC D + + G S PSQL +KFSYCL+SR +
Sbjct: 111 SVSRGRTSPVVFGCG--HDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRD-NG 167
Query: 276 APVSSNLVLDTGPGSGDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
SS L+ GDS P +YT KNP FYY GL I +G +
Sbjct: 168 VRASSALLF------GDSALPTSASFAYTQLLKNP-----KLDTFYYAGLSGISIGGTLL 216
Query: 333 KIP-YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
IP ++ + S G GGVI+DSG++ T + + + F RAAD S
Sbjct: 217 SIPSTAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADF---SLF 273
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C+D S SV +P + F+GGA + LPP NY V F+ + +
Sbjct: 274 DTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLS---- 329
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q Q + DL + R GFA ++C
Sbjct: 330 ---IIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 121/390 (31%), Positives = 176/390 (45%), Gaps = 54/390 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +++ G+P + T FIFDTGS L W C CV + R F P S
Sbjct: 145 GNYVVTVGLGSPKRDLT-FIFDTGSDLTWTQCEP---CVGYCYQQ----REHIFDPSTSL 196
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C +P C + + GCS + TC Y ++YG G ++ G E L
Sbjct: 197 SYSNVSCDSPSCEKL--ESATGNSPGCS--SSTCL-----YGIRYGDGSYSIGFFAREKL 247
Query: 221 RFPSKTV-PNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKK---FSYCLLSRK 272
S V NF GC ++R AG+ G R+ SL SQ K FSYCL
Sbjct: 248 SLTSTDVFNNFQFGCG-QNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKVFSYCL---- 302
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
P SS+ GSGD + + +TP N S + FY++ + I VG + +
Sbjct: 303 ----PSSSSSTGYLSFGSGDGDSKAVKFTPSEVN-----SDYPSFYFLDMVGISVGERKL 353
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
IP S G I+DSG+ + + ++ +V K F M +Y R V S L
Sbjct: 354 PIPKSVF-----STAGTIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGV---SILD 405
Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
C+D+S K+V +P++IL F GGA+M L PE ++ +CL F N+ +
Sbjct: 406 TCYDLSKYKTVKVPKIILYFSGGAEMDLAPEGIIYVLKVSQVCLA-FAGNSDDDEVA--- 461
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + ++ +D A R GFA C
Sbjct: 462 --IIGNVQQKTIHVVYDDAEGRVGFAPSGC 489
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 122/392 (31%), Positives = 175/392 (44%), Gaps = 58/392 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCT--SRYRCVDCNFPNVDPSRIPAFIPKR 159
G Y +++ GTP + T FIFDTGS L W C +RY C + P F P +
Sbjct: 136 GNYVVTVGLGTPKRDLT-FIFDTGSDLTWTQCEPCARY-CYH--------QQEPIFNPSK 185
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
S+S I C +P C + K + + +C + Y +QYG ++ G +
Sbjct: 186 STSYTNISCSSPTCDEL---------KSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQD 236
Query: 219 TLRFPSKTV-PNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKK---FSYCLLS 270
L S V NFL GC ++R AG+ G GR++ SL SQ K FSYCL S
Sbjct: 237 KLALTSTDVFNNFLFGCG-QNNRGLFVGVAGLIGLGRNALSLVSQTAQKYGKLFSYCLPS 295
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
SS L G G G SK + +TP N G S FY++ L I VG +
Sbjct: 296 TS------SSTGYLTFGSGGGTSK--AVKFTPSLVNSQGPS-----FYFLNLIAISVGGR 342
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+ S G I+DSG+ + + + + F +QM Y +AA S
Sbjct: 343 KLSTSASVF-----STAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPA---SI 394
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
L C+D S +V +P++ L F GA+M L P F ++ +CL F N+ +
Sbjct: 395 LDTCYDFSQYDTVDVPKINLYFSDGAEMDLDPSGIFYILNISQVCLA-FAGNSDATDIA- 452
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
ILG+ Q + F + +D+A R GFA C
Sbjct: 453 ----ILGNVQQKTFDVVYDVAGGRIGFAPGGC 480
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 115/391 (29%), Positives = 165/391 (42%), Gaps = 55/391 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G+P + + DTGS + W C+ C C N F P+ SS
Sbjct: 12 GEYFVRVGIGSPTKLQY-LVMDTGSDVPWIQCSP---CKSCYKQN-----DAVFDPRASS 62
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + + C P+C + K C+ + C Y + YG G FT G L S++
Sbjct: 63 SFRRLSCSTPQCKLL-------DVKACASTDNRCL-----YQVSYGDGSFTVGDLASDSF 110
Query: 221 RFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDD 275
+ GC D + + G S PSQL +KFSYCL+SR +
Sbjct: 111 LVSRGRTSPVVFGCG--HDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRD-NG 167
Query: 276 APVSSNLVLDTGPGSGDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
SS L+ GDS P +YT KNP FYY GL I +G +
Sbjct: 168 VRASSALLF------GDSALPTSASFAYTQLLKNP-----KLDTFYYAGLSGISIGGTLL 216
Query: 333 KIP-YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
IP ++ + S G GGVI+DSG++ T + + + F RAAD S
Sbjct: 217 SIPSTAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADF---SLF 273
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C+D S SV +P + F+GGA + LPP NY V F+ + +
Sbjct: 274 DTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLS---- 329
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q Q + DL + R GFA ++C
Sbjct: 330 ---IIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 124/400 (31%), Positives = 168/400 (42%), Gaps = 63/400 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS +VW +C C R+ F P+RS
Sbjct: 126 GEYFAQVGVGTPATTAL-MVLDTGSDVVWL------QCAPCRHCYAQSGRV--FDPRRSR 176
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C P C + GC R +C Y + YG G TAG SETL
Sbjct: 177 SYAAVDCVAPICRRL-------DSAGCDRRRNSC-----LYQVAYGDGSVTAGDFASETL 224
Query: 221 RFPS-KTVPNFLAGCSILSDRQPAGIAG-----FGRSSESLPSQLGL---KKFSYCLLSR 271
F V GC D + IA GR S PSQ+ + FSYCL+ R
Sbjct: 225 TFARGARVQRVAIGCG--HDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDR 282
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
P S+ T + G S+TP +NP FYYV L VG
Sbjct: 283 TSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNP-----RMATFYYVHLLGFSVGGAR 337
Query: 332 VK-IPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK- 388
VK + S L + + G GGVI+DSG++ T + P++EAV F RAA V +
Sbjct: 338 VKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAF--------RAAAVGLRV 389
Query: 389 -----SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDN 442
S C+++SG++ V +P + + GGA +ALPPENY V C + +
Sbjct: 390 SPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD 449
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G I+G+ Q Q F + FD R GF + C
Sbjct: 450 --------GGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 481
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 124/400 (31%), Positives = 168/400 (42%), Gaps = 63/400 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS +VW +C C R+ F P+RS
Sbjct: 120 GEYFAQVGVGTPATTAL-MVLDTGSDVVWL------QCAPCRHCYAQSGRV--FDPRRSR 170
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C P C + GC R +C Y + YG G TAG SETL
Sbjct: 171 SYAAVDCVAPICRRL-------DSAGCDRRRNSC-----LYQVAYGDGSVTAGDFASETL 218
Query: 221 RFPS-KTVPNFLAGCSILSDRQPAGIAG-----FGRSSESLPSQLGL---KKFSYCLLSR 271
F V GC D + IA GR S PSQ+ + FSYCL+ R
Sbjct: 219 TFARGARVQRVAIGCG--HDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDR 276
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
P S+ T + G S+TP +NP FYYV L VG
Sbjct: 277 TSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNP-----RMATFYYVHLLGFSVGGAR 331
Query: 332 VK-IPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK- 388
VK + S L + + G GGVI+DSG++ T + P++EAV F RAA V +
Sbjct: 332 VKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAF--------RAAAVGLRV 383
Query: 389 -----SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDN 442
S C+++SG++ V +P + + GGA +ALPPENY V C + +
Sbjct: 384 SPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD 443
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G I+G+ Q Q F + FD R GF + C
Sbjct: 444 --------GGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 121/400 (30%), Positives = 190/400 (47%), Gaps = 55/400 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G PP+ I DTGS L W C C D + P DPS+ S+
Sbjct: 169 GEYFMDVFVGNPPRHFL-LIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQ--------ST 219
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
S ++I C C + V C+ S +KT P C Y YG T+G L E+L
Sbjct: 220 SFKIIPCNAAACDLV----VHDECRDNS--SKTSPKTC-KYFYWYGDSSRTSGDLALESL 272
Query: 221 RFP------SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQLGL----KKFSYC 267
S + + + GC + + G+ G G+ + S PSQL + FSYC
Sbjct: 273 SVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYC 332
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKT-PGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
L+ R ++ VSS + G G S+ + +TPF V ++++ FYY+G++ I
Sbjct: 333 LVDRT-NNLSVSSAISF--GAGFALSRHFDQMRFTPF----VRTNNSVETFYYLGIQGIK 385
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
+ + + IP +G+GG I+DSG+T T++ + AV F+ ++ +Y RA +
Sbjct: 386 IDQELLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFD 444
Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL--CL-ILFTDNA 443
L C++ +G+ +V P L + F+ GA++ LP ENYF + CL IL TD
Sbjct: 445 I---LGICYNATGRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGM 501
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ I+G+FQ QN + +D+ + R GFA C+
Sbjct: 502 S----------IIGNFQQQNIHFLYDVQHARLGFANTDCS 531
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 133/403 (33%), Positives = 171/403 (42%), Gaps = 69/403 (17%)
Query: 102 GGYSISLSFGTPPQASTP--FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
G Y + GTP TP + DTGS +VW C RC D + DP +
Sbjct: 145 GEYFTKIGVGTP---VTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDP--------RA 193
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
S S + C P C + GC R K C Y + YG G TAG +E
Sbjct: 194 SHSYGAVDCAAPLCRRL-------DSGGCDLRRKAC-----LYQVAYGDGSVTAGDFATE 241
Query: 219 TLRFPSKT-VPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
TL F S VP GC ++ AG+ G GR S S PSQ+ + FSYCL+ R
Sbjct: 242 TLTFASGARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDR 301
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
A +S T + S+TP KNP FYYV L I VG
Sbjct: 302 TSSSASATSRSSTVTFGSGAVGPSAAASFTPMVKNP-----RMETFYYVQLMGISVGGAR 356
Query: 332 V-KIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
V + S L + S G GGVIVDSG++ T + P + A+ F RAA +
Sbjct: 357 VPGVAVSDLRLDPSTGRGGVIVDSGTSVTRLARPAYAALRDAF--------RAA----AA 404
Query: 390 GLR----------PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
GLR C+D+SG K V +P + + F GGA+ ALPPENY V + F
Sbjct: 405 GLRLSPGGFSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAF 464
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G + I+G+ Q Q F + FD R GF + C
Sbjct: 465 AGTDGGVS-------IIGNIQQQGFRVVFDGDGQRLGFVPKGC 500
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 117/388 (30%), Positives = 172/388 (44%), Gaps = 57/388 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G PP + I DTGS + W C C DC + DP F P S+
Sbjct: 147 GEYFSRVGIGKPPSQAY-LILDTGSDVNWVQCAP---CADC-YQQADP----IFEPASSA 197
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C +C + +V S C RN TC Y + YG G +T G ++ET+
Sbjct: 198 SFSTLSCNTRQCRSL---DV-SEC-----RNDTC-----LYEVSYGDGSYTVGDFVTETI 243
Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
S V N GC ++ AG+ G G S S PSQ+ FSYCL+ R D+
Sbjct: 244 TLGSAPVDNVAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYCLVDR---DSE 300
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
+S L ++ + P P +N FYYVGL + VG + V IP S
Sbjct: 301 SASTLEFNS------TLPPNAVSAPLLRN-----HHLDTFYYVGLTGLSVGGELVSIPES 349
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL---RPC 394
GNGGVIVDSG+ T ++ ++ ++ F++ R D+ +G+ C
Sbjct: 350 AFQIDESGNGGVIVDSGTAITRLQTDVYNSLRDAFVK------RTRDLPSTNGIALFDTC 403
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
+D+S K +V +P + F G ++ LP +NY + +E F A+ +
Sbjct: 404 YDLSSKGNVEVPTVSFHFPDGKELPLPAKNYLVPLDSEGTFCFAFAPTASSLS------- 456
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q Q + +DL N GF KC
Sbjct: 457 IIGNVQQQGTRVVYDLVNHLVGFVPNKC 484
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 128/400 (32%), Positives = 177/400 (44%), Gaps = 59/400 (14%)
Query: 91 LIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
L TP++ + G Y I +SFG+PPQ ++ I DTGS L+W +C+ C N S
Sbjct: 68 LFSTPVASGN-GEYLIDISFGSPPQKAS-VIVDTGSDLIW------TQCLPCETCNAAAS 119
Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF 210
I F P +SS+ + C + CS + + + CK Y YG G
Sbjct: 120 VI--FDPVKSSTYDTVSCASNFCSSLPFQSCTTSCK---------------YDYMYGDGS 162
Query: 211 -TAGLLLSETLRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKK 263
T+G L +ET+ + T+PN GC ++ S AGI G G+ SL SQ + KK
Sbjct: 163 STSGALSTETVTVGTGTIPNVAFGCGHTNLGSFAGAAGIVGLGQGPLSLISQASSITSKK 222
Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
FSYCL+ S L+ D+ G + T L+ T NP FYY L
Sbjct: 223 FSYCLV--PLGSTKTSPMLIGDSAAAGGVAYTALLTNT---ANPT--------FYYADLT 269
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
I V K V P + G GG I+DSG+T T++E F A+ ++
Sbjct: 270 GISVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYLETGAFNALVAALKAEVPFPEADG 329
Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLCLILFTDN 442
+ GL CF +G + P + FK GA LPPEN F AL +CL +
Sbjct: 330 SLY---GLDYCFSTAGVANPTYPTMTFHFK-GADYELPPENVFVALDTGGSICLAM---- 381
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
AA I+G+ Q QN + DL N R GF + C
Sbjct: 382 AASTGFS-----IMGNIQQQNHLIVHDLVNQRVGFKEANC 416
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 115/375 (30%), Positives = 165/375 (44%), Gaps = 45/375 (12%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
I DT S L W C C D P DPS P++ + C +P C +
Sbjct: 156 VIVDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAA--------VPCDSPSCDALQQQ 207
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILS 238
G P + P AC SY L Y G ++ G+L + L + + F+ GC +
Sbjct: 208 LATGAGAGAPPCDAGRPAAC-SYALSYRDGSYSRGVLAHDRLSLAGEVIDGFVFGCGTSN 266
Query: 239 DRQP----AGIAGFGRSSESLPSQLGLK---KFSYCL-LSRKFDDAPVSSNLVLDTGPGS 290
P +G+ G GRS SL SQ + FSYCL LSR+ D S +LVL P +
Sbjct: 267 QGPPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESD---ASGSLVLGDDPSA 323
Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG-NGGV 349
+ TP + YT N G FY V L I VG + V+ S G +
Sbjct: 324 YRNSTP-VVYTSMVSNS--DPLLQGPFYLVNLTGITVGGQEVE---------STGFSARA 371
Query: 350 IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELI 409
IVDSG+ T + ++ AV EF+ Q+ Y +A S L CF+++G K V +P L
Sbjct: 372 IVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGF---SILDTCFNMTGLKEVQVPSLT 428
Query: 410 LKFKGGAKMALPPEN--YFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLE 467
L F GGA++ + YF + +CL A I+G++Q +N +
Sbjct: 429 LVFDGGAEVEVDSGGVLYFVSSDSSQVCL------AVASLKSEDETSIIGNYQQKNLRVV 482
Query: 468 FDLANDRFGFAKQKC 482
FD + + GFA++ C
Sbjct: 483 FDTSASQVGFAQETC 497
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 134/425 (31%), Positives = 182/425 (42%), Gaps = 80/425 (18%)
Query: 91 LIKTP---LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNV 147
LI P LS H ++SL+ G+PPQ T + DTGS L W C
Sbjct: 24 LISQPSNKLSFHHNVTLTVSLTVGSPPQQVT-MVLDTGSELSWLHCKK------------ 70
Query: 148 DPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKG------CSPRNKTCPLACPS 201
P+ F P SSS I C +P C +R + C P+ K C A S
Sbjct: 71 SPNLTSVFNPLSSSSYSPIPCSSPVC--------RTRTRDLPNPVTCDPK-KLCH-AIVS 120
Query: 202 YLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRSSES 254
Y L G L S+ R S +P L GC + D + G+ G R S S
Sbjct: 121 YADASSL---EGNLASDNFRIGSSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLS 177
Query: 255 LPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP---GLSYTPFYKNPVGSS 311
+QLGL KFSYC+ R SS ++L GDS L+YTP V S
Sbjct: 178 FVTQLGLPKFSYCISGRD------SSGVLL-----FGDSHLSWLGNLTYTPL----VQIS 222
Query: 312 SAFGEF----YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEA 367
+ F Y V L I VG+K + +P S P G G +VDSG+ FTF+ GP++ A
Sbjct: 223 TPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTA 282
Query: 368 VAKEFIRQM-GNYSRAAD--VEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPE 423
+ EF+ Q G + D + + C+ + +G K LP + L F+ GA+M + E
Sbjct: 283 LRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVPAGGKLPELPAVSLMFR-GAEMVVGGE 341
Query: 424 NYFALV-----GNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
V G E V CL + G A ++G QN ++EFDL R GF
Sbjct: 342 VLLYKVPGMMKGKEWVYCLTFGNSDLLGIE-----AFVIGHHHQQNVWMEFDLVKSRVGF 396
Query: 478 AKQKC 482
+ +C
Sbjct: 397 VETRC 401
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 120/415 (28%), Positives = 180/415 (43%), Gaps = 72/415 (17%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
L H ++SL+ GTP Q T + DTGS L W C +P+ F
Sbjct: 59 LLFHHNVTLTVSLTAGTPLQNIT-MVLDTGSELSWLHCKK------------EPNFNSIF 105
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLAC-PSYLLQYGLGF---- 210
P S + I C +P C E+R R+ P++C P+ L + + +
Sbjct: 106 NPLASKTYTKIPCSSPTC--------ETRT-----RDLPLPVSCDPAKLCHFIISYADAS 152
Query: 211 -TAGLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLK 262
G L ET R S T P + GC + D + G+ G R S S +Q+G +
Sbjct: 153 SVEGNLAFETFRVGSVTGPATVFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVNQMGFR 212
Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF----Y 318
KFSYC+ R S+ VL G S P L+YTP V S+ F Y
Sbjct: 213 KFSYCISDR-------DSSGVLLLGEASFSWLKP-LNYTPL----VEMSTPLPYFDRVAY 260
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
V L I V K + +P S VP G G +VDSG+ FTF+ GP++ A+ +EF+ Q
Sbjct: 261 SVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALKQEFLLQTKG 320
Query: 379 YSRAADVEK---KSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALVGNE- 432
R + + + + C+ I ++ LP + L F+ GA+M++ + V E
Sbjct: 321 VLRVLNEPRYVFQGAMDLCYLIEPTRAALPNLPVVNLMFR-GAEMSVSGQRLLYRVPGEV 379
Query: 433 -----VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
V C ++ G + ++G Q QN ++E+DL R GFA+ +C
Sbjct: 380 RGKDSVWCFTFGNSDSLGIE-----SFVIGHHQQQNVWMEYDLEKSRIGFAEVRC 429
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 123/400 (30%), Positives = 168/400 (42%), Gaps = 63/400 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS +VW +C C R+ F P+RS
Sbjct: 120 GEYFAQVGVGTPATTAL-MVLDTGSDVVWL------QCAPCRHCYAQSGRV--FDPRRSR 170
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C P C + GC R +C Y + YG G TAG SETL
Sbjct: 171 SYAAVDCVAPICRRL-------DSAGCDRRRNSC-----LYQVAYGDGSVTAGDFASETL 218
Query: 221 RFPS-KTVPNFLAGCSILSDRQPAGIAG-----FGRSSESLPSQLGL---KKFSYCLLSR 271
F V GC D + IA GR S P+Q+ + FSYCL+ R
Sbjct: 219 TFARGARVQRVAIGCG--HDNEGLFIAASGLLGLGRGRLSFPTQIARSFGRSFSYCLVDR 276
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
P S+ T + G S+TP +NP FYYV L VG
Sbjct: 277 TSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNP-----RMATFYYVHLLGFSVGGAR 331
Query: 332 VK-IPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK- 388
VK + S L + + G GGVI+DSG++ T + P++EAV F RAA V +
Sbjct: 332 VKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAF--------RAAAVGLRV 383
Query: 389 -----SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDN 442
S C+++SG++ V +P + + GGA +ALPPENY V C + +
Sbjct: 384 SPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD 443
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G I+G+ Q Q F + FD R GF + C
Sbjct: 444 --------GGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 107/347 (30%), Positives = 157/347 (45%), Gaps = 44/347 (12%)
Query: 153 PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA 212
P F P SS+ + C + C ++ P + GC Y YG+GFTA
Sbjct: 94 PPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCV------------YYYPYGMGFTA 141
Query: 213 GLLLSETLRFPSKTVPNFLAGCSILS--DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
G L +ETL + P GCS + +GI G GRS SL SQ+G+ +FSYCL S
Sbjct: 142 GYLATETLHVGGASFPGVAFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCLRS 201
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
DA + +L GS T G S +NP SS+ +YYV L I VG+
Sbjct: 202 ----DADAGDSPILF---GSLAKVTGGKSSPAILENPEMPSSS---YYYVNLTGITVGAT 251
Query: 331 HVKIPYSYL----VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
+ + + G+ GG IVDSG+T T++ + V + F+ QM + V
Sbjct: 252 DLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVN 311
Query: 387 -KKSGLRPCFDIS---GKKSVYLPELILKFKGGAKMALPPENYFALVGNE------VLCL 436
+ G CFD + G V +P L+L+F GGA+ A+ +Y +V + V CL
Sbjct: 312 GTRFGFDLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECL 371
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
++ PA + I+G+ + ++ +DL F FA CA
Sbjct: 372 LVL------PASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 412
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 138/483 (28%), Positives = 204/483 (42%), Gaps = 74/483 (15%)
Query: 20 TTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTK 79
TT S A++ ++ L P H H D L +L LA S +R + + TK +
Sbjct: 68 TTSFSPTSLASSFSLELHPRELLHGGSHKDYRAL-MLSRLARDS-ARVKAINTKLQLAVS 125
Query: 80 ----------DSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
D+ I S T + G Y + + G P + + + DTGS +
Sbjct: 126 GTDKSDLVPMDTEILHPQDFSTPVTSGTSQGSGEYFLRVGIGRPSK-TFYMVIDTGSDVN 184
Query: 130 WFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCS 189
W C C DC + VDP F P SSS +GCQ P+C + C
Sbjct: 185 WLQCKP---CDDC-YQQVDP----IFDPASSSSFSRLGCQTPQCRNL-------DVFAC- 228
Query: 190 PRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSILSDRQPAGIAG 247
RN +C Y + YG G +T G +ET+ F S +V GC D + +
Sbjct: 229 -RNDSCL-----YQVSYGDGSYTVGDFATETVSFGNSGSVDKVAIGCG--HDNEGLFVGA 280
Query: 248 FGRSSE-----SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTP 302
G SL SQ+ FSYCL++R D+ SS L ++ S DS T P
Sbjct: 281 AGLIGLGGGPLSLTSQIKASSFSYCLVNR---DSVDSSTLEFNSAKPS-DSVT-----AP 331
Query: 303 FYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEG 362
+KN S FYYVG+ + VG + + IP S G GG+IVD G+ T ++
Sbjct: 332 IFKN-----SKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAVTRLQT 386
Query: 363 PLFEAVAKEFIRQMGNYSRAADVEKKSG---LRPCFDISGKKSVYLPELILKFKGGAKMA 419
+ A+ F++ D+ SG C+++S + SV +P + F GG +
Sbjct: 387 QAYNALRDTFVK------LTKDLPSTSGFALFDTCYNLSSRTSVRVPTVAFLFDGGKSLP 440
Query: 420 LPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
LPP NY V + + F A + I+G+ Q Q + +DLAN + F+
Sbjct: 441 LPPSNYLIPVDSAGTFCLAFAPTTASLS-------IIGNVQQQGTRVTYDLANSQVSFSS 493
Query: 480 QKC 482
+KC
Sbjct: 494 RKC 496
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 117/400 (29%), Positives = 177/400 (44%), Gaps = 67/400 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + LS GTPPQ P + DTGS LVW +C +C+ ++D F SS
Sbjct: 3 GEYMMELSIGTPPQL-IPAMIDTGSDLVWL------KCDNCDHCDLDHHGETIFFSDASS 55
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + + C + CS + + RC+ +TC Y +YG G T+G + S+ +
Sbjct: 56 SYKKLPCNSTHCSGMSSAGIGPRCE------ETCK-----YKYEYGDGSRTSGDVGSDRI 104
Query: 221 RFPSKTV--------PNFLAGCS--ILSDRQ-PAGIAGFGRSSESLPSQLGLK---KFSY 266
F S FL GC+ + D G+ G G+ S SL QLG K KFSY
Sbjct: 105 SFRSHGAGEDHRSFFDGFLFGCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSY 164
Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE-FYYVGLRQI 325
CL+S D+P S+ L G + L P+ + YYV L+ I
Sbjct: 165 CLVSY---DSPPSAKSFLFLG------SSAALRGHDVVSTPILHGDHLDQTLYYVDLQSI 215
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGV--------IVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
+G +P S N V ++DSG+T+T + P++EA+ K Q+
Sbjct: 216 TIGG----VPVVVYDKESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQV- 270
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
+ +GL CF+ SG S P + F ++ LP EN F + +V+CL
Sbjct: 271 ---ILPTLGNSAGLDLCFNSSGDTSYGFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLS 327
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
+ +++G G I+G+ Q QNF++ +DL + F
Sbjct: 328 M---DSSG-----GDLSIIGNMQQQNFHILYDLVASQISF 359
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 125/393 (31%), Positives = 171/393 (43%), Gaps = 58/393 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + +S GTPPQ + I DTGS L W C RC F DP FIP SS
Sbjct: 6 GEYVLQISLGTPPQQFSA-IVDTGSDLCWVQCAPCARC----FEQPDP----LFIPLASS 56
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
S C + C + P CS RN TC +Y YG G T G ET+
Sbjct: 57 SYSNASCTDSLCDALPRPT-------CSMRN-TC-----TYSYSYGDGSNTRGDFAFETV 103
Query: 221 RFPSKTVPNFLAGCSILSDRQPAG---IAGFGRSSESLPSQLG---LKKFSYCLLSRKFD 274
T+ GC + AG + G G+ SLPSQL FSYCL+ D
Sbjct: 104 TLNGSTLARIGFGCGHNQEGTFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLV----D 159
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
+ + + G + +S+ S+TP +N S +YYVG+ I VG++ V
Sbjct: 160 QSTTGTFSPITFGNAAENSRA---SFTPLLQNEDNPS-----YYYVGVESISVGNRRVPT 211
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
P S ++G GGVI+DSG+T T+ F + E RQ+ +Y A GL C
Sbjct: 212 PPSAFRIDANGVGGVILDSGTTITYWRLAAFIPILAELRRQI-SYPEAD--PTPYGLNLC 268
Query: 395 FDIS--GKKSVYLPELILKFKGGAKMALPPENYFALVGN--EVLCLILFTDNAAGPALGR 450
+DIS S+ LP + + +P N + LV N E +C + T +
Sbjct: 269 YDISSVSASSLTLPSMTVHLT-NVDFEIPVSNLWVLVDNFGETVCTAMSTSDQFS----- 322
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G+ Q QN + D+AN R GF C+
Sbjct: 323 ----IIGNVQQQNNLIVTDVANSRVGFLATDCS 351
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 118/400 (29%), Positives = 176/400 (44%), Gaps = 67/400 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + LS GTPPQ P + DTGS LVW +C +C+ ++D F SS
Sbjct: 3 GEYMMELSIGTPPQL-IPAMIDTGSDLVWL------KCDNCDHCDLDHHGETIFFSDASS 55
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + + C + CS + + RC+ +TC Y +YG G T+G + S+ +
Sbjct: 56 SYKKLPCNSTHCSGMSSAGIGPRCE------ETCK-----YKYEYGDGSRTSGDVGSDRI 104
Query: 221 RFPSKTV--------PNFLAGC--SILSDRQ-PAGIAGFGRSSESLPSQLGLK---KFSY 266
F S FL GC + D G+ G G+ S SL QLG K KFSY
Sbjct: 105 SFRSHGAGEDHRSFFDGFLFGCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSY 164
Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE-FYYVGLRQI 325
CL+S D+P S+ L G + L P+ + YYV L+ I
Sbjct: 165 CLVSY---DSPPSAKSFLFLG------SSAALRGHDVVSTPILHGDHLDQTLYYVDLQSI 215
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGV--------IVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
VG +P S N V ++DSG+T+T + P++EA+ K Q+
Sbjct: 216 TVGG----VPVVVYDKESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQV- 270
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
+ +GL CF+ SG S P + F ++ LP EN F + +V+CL
Sbjct: 271 ---ILPTLGNSAGLDLCFNSSGDTSYGFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLS 327
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
+ +++G G I+G+ Q QNF++ +DL + F
Sbjct: 328 M---DSSG-----GDLSIIGNMQQQNFHILYDLVASQISF 359
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 124/415 (29%), Positives = 184/415 (44%), Gaps = 57/415 (13%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
+ TPL Y +S+ L G+ Q + I DTGS V C SR R
Sbjct: 90 VVTPL--EDYALFSMQLGIGSL-QKNLSAIIDTGSEAVLVQCGSRSR------------- 133
Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT 211
P F P S S + + C + C + + C + TC +Y L YG
Sbjct: 134 -PVFDPAASQSYRQVPCISQLCLAVQQQTSNGSSQPCVNSSATC-----TYSLSYGDSRN 187
Query: 212 AGLLLSETLRFPSKTVPNFLA--------GCS-----ILSDRQPAGIAGFGRSSESLPSQ 258
+ S+ + F + T + A GC+ L D GI GF R + SLPSQ
Sbjct: 188 STGDFSQDVIFLNSTNSSGQAVQFRDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQ 247
Query: 259 L----GLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
L G KFSYC S+ + P ++ ++ G SK + YTP NPV + A
Sbjct: 248 LKDRLGGSKFSYCFPSQPWQ--PRATGVIFLGDSGLSKSK---VGYTPLLDNPV--TPAR 300
Query: 315 GEFYYVGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
+ YYVGL I V K + IP S + + S G+GG ++DSG+TFT + + A F
Sbjct: 301 SQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFA 360
Query: 374 RQMGNYSRAADVEKKSGLRPCFDISGKKSV-YLPELILKFKGGAKMALPPENYFALV--- 429
+ R V +G C++IS S+ +PE+ L + ++ L E+ F V
Sbjct: 361 ASNRSGLRK-KVGAAAGFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAA 419
Query: 430 GNEV-LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
GNEV +CL + + +G G +LG++Q N+ +E+D R GF + C+
Sbjct: 420 GNEVTVCLAILSSQKSGF----GKINVLGNYQQSNYLVEYDNERSRVGFERADCS 470
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 126/403 (31%), Positives = 171/403 (42%), Gaps = 66/403 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS +VW C RC + + P F P+RSS
Sbjct: 127 GEYFTKIGVGTPATQAL-MVLDTGSDVVWVQCAPCRRCYEQSGP--------VFDPRRSS 177
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S +GC C + GC R C Y + YG G TAG ++ETL
Sbjct: 178 SYGAVGCGAALCRRL-------DSGGCDLRRGAC-----MYQVAYGDGSVTAGDFVTETL 225
Query: 221 RFPSKT-VPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKF 273
F V GC ++ AG+ G GR S P+Q+ + FSYCL+ R
Sbjct: 226 TFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTS 285
Query: 274 DDAPVS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
A + S+ G+G S+TP +NP FYYV L I VG
Sbjct: 286 SGAGAAPGSHRSSTVSFGAGSVGASSASFTPMVRNP-----RMETFYYVQLVGISVGGAR 340
Query: 332 V-KIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
V + S L + S G GGVIVDSG++ T + + A+ F RAA
Sbjct: 341 VPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAF--------RAA---AAG 389
Query: 390 GLR----------PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
GLR C+D+ G++ V +P + + F GGA+ ALPPENY V + F
Sbjct: 390 GLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAF 449
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G + I+G+ Q Q F + FD R GFA + C
Sbjct: 450 AGTDGGVS-------IIGNIQQQGFRVVFDGDGQRVGFAPKGC 485
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 133/454 (29%), Positives = 193/454 (42%), Gaps = 71/454 (15%)
Query: 47 HSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYS--NSLIKTPLSVHSYGGY 104
H PL+ ++S + + + + T S YS ++L P S G Y
Sbjct: 79 HGACSPLRPINSSSWIDMVSQSFDRDNDRLNTIWSKNNGTYSTMSNLPLQPGSKVGTGNY 138
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
++ FGTP + S I DTGS + W C C DC + VDP F P++SSS +
Sbjct: 139 IVTAGFGTPAKNSL-LIIDTGSDVTWIQCKP---CSDC-YSQVDP----IFEPQQSSSYK 189
Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFP 223
+ C + C+ + N R GC Y + YG G + G ETL
Sbjct: 190 HLSCLSSACTELTTMN-HCRLGGCV------------YEINYGDGSRSQGDFSQETLTLG 236
Query: 224 SKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAP 277
S + P+F GC + + AG+ G GR++ S PSQ K +FSYCL
Sbjct: 237 SDSFPSFAFGCGHTNTGLFKGSAGLLGLGRTALSFPSQTKSKYGGQFSYCL--------- 287
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
+ V T GS + T + P+ S+S + FY+VGL I VG + + IP
Sbjct: 288 --PDFVSSTSTGSFSVGQGSIPATATFV-PLVSNSNYPSFYFVGLNGISVGGERLSIP-- 342
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
P G GG IVDSG+ T + ++A+ F + N A + S L C+D+
Sbjct: 343 ---PAVLGRGGTIVDSGTVITRLVPQAYDALKTSFRSKTRNLPSA---KPFSILDTCYDL 396
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI--- 454
S V +P + F+ A +A+ + VG ILFT + G + A
Sbjct: 397 SSYSQVRIPTITFHFQNNADVAV------SAVG------ILFTIQSDGSQVCLAFASASQ 444
Query: 455 -----ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G+FQ Q + FD R GFA CA
Sbjct: 445 SISTNIIGNFQQQRMRVAFDTGAGRIGFAPGSCA 478
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 118/393 (30%), Positives = 164/393 (41%), Gaps = 58/393 (14%)
Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y + L+ G PP PF+ DTGS L W C C + P DPS F P
Sbjct: 71 YLMELAIGKPP---VPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSP---- 123
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ C + C I+ N C+P + C Y YG G ++AG+L +ETL
Sbjct: 124 ----LPCSSATCLPIWSRN-------CTP-SSLC-----RYRYAYGDGAYSAGILGTETL 166
Query: 221 RFPSKTVPNFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS--R 271
+ P + G + G G GR + SL +QLG+ KFSYCL
Sbjct: 167 TLGPSSAPVSVGGVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLTDFFN 226
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
D+P + + PG ++ TP ++P S Y+V L+ I +G
Sbjct: 227 SALDSPFLLGTLAELAPGPSTVQS-----TPLLQSPQNPSR-----YFVSLQGISLGDVR 276
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
+ IP DG GG+IVDSG+TFT + F V R +G V S
Sbjct: 277 LPIPNGTFDLRGDGTGGMIVDSGTTFTILAESGFREVVGRVARVLGQ----PPVNASSLD 332
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALGR 450
PCF + Y+P+L+L F GGA M L +NY + + CL N AG
Sbjct: 333 APCFPAPAGEPPYMPDLVLHFAGGADMRLYRDNYMSYNEEDSSFCL-----NIAGTT--P 385
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+LG+FQ QN + FD + F C+
Sbjct: 386 ESTSVLGNFQQQNIQMLFDTTVGQLSFLPTDCS 418
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 120/399 (30%), Positives = 173/399 (43%), Gaps = 68/399 (17%)
Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y + L+ GTPP PF+ DTGS L W C C + P DPS F P
Sbjct: 66 YLMELAIGTPP---VPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSP---- 118
Query: 162 SSQLIGCQNPKC--SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
+ C + C +W R + CS + C Y+ Y G ++ G+L +E
Sbjct: 119 ----VPCSSATCLPTW--------RSRNCSNPSSPC-----RYIYSYSDGAYSVGILGTE 161
Query: 219 TLRF----PSKTVP--NFLAGCSILS---DRQPAGIAGFGRSSESLPSQLGLKKFSYCLL 269
TL P +TV + GC + G G GR + SL +QLG+ KFSYCL
Sbjct: 162 TLTIGSSVPGQTVSVGSVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLT 221
Query: 270 S--RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV 327
D+P + + PG G ++ TP ++P+ S Y+V L+ I +
Sbjct: 222 DFFNSTMDSPFFLGTLAELAPGPGTVQS-----TPLLQSPLNPSR-----YFVNLQGISL 271
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
G + IP +DGNGG++VDSG+TFT + F V + +G V
Sbjct: 272 GDVRLPIPNGTFDLRADGNGGMMVDSGTTFTILAKSGFREVVDRVAQLLGQ----PPVNA 327
Query: 388 KSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGP 446
S PCF S ++P+L+L F GGA M L +NY + ++ CL N G
Sbjct: 328 SSLDSPCFP-SPDGEPFMPDLVLHFAGGADMRLHRDNYMSYNEDDSSFCL-----NIVG- 380
Query: 447 ALGRGPAII--LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
P+ LG+FQ QN + FD+ + F C+
Sbjct: 381 ----SPSTWSRLGNFQQQNIQMLFDMTVGQLSFLPTDCS 415
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 113/374 (30%), Positives = 158/374 (42%), Gaps = 62/374 (16%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
+ DTGS + W C C DC + DP F P SS+ + CQ+ +CS +
Sbjct: 35 MVLDTGSDINWLQCQP---CTDC-YQQTDP----IFDPTASSTYAPVTCQSQQCSSL--- 83
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
+ S R+ C Y + YG G +T G +E++ F S +V N GC
Sbjct: 84 ------EMSSCRSGQCL-----YQVNYGDGSYTFGDFATESVSFGNSGSVKNVALGCG-- 130
Query: 238 SDRQPAGIAGFGRSSE-----SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
D + + G SL +QL FSYCL++R D+ SS L ++ D
Sbjct: 131 HDNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCLVNR---DSAGSSTLDFNSAQLGVD 187
Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVD 352
S T P KN FYYVGL + VG + V IP S GNGG+IVD
Sbjct: 188 SVTA-----PLMKN-----RKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVD 237
Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
G+ T ++ + + F+R N + V C+D+SG+ SV +P + F
Sbjct: 238 CGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVAL---FDTCYDLSGQASVRVPTVSFHF 294
Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI----ILGDFQLQNFYLEF 468
G LP NY V D+A P I+G+ Q Q + F
Sbjct: 295 ADGKSWNLPAANYLIPV-----------DSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTF 343
Query: 469 DLANDRFGFAKQKC 482
DLAN+R GF+ KC
Sbjct: 344 DLANNRMGFSPNKC 357
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 120/408 (29%), Positives = 173/408 (42%), Gaps = 63/408 (15%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
LS H ++SL+ G+PPQ T + DTGS L W C PN++ + F
Sbjct: 52 LSFHHNVTLTVSLTVGSPPQNVT-MVLDTGSELSWLHCK--------KLPNLNST----F 98
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCK------GCSPRNKTCPLACPSYLLQYGLG 209
P SSS C + C+ +R + C P NK C + ++ Y
Sbjct: 99 NPLLSSSYTPTPCNSSICT--------TRTRDLTIPASCDPNNKLCHV-----IVSYADA 145
Query: 210 FTA-GLLLSETLRFPSKTVPNFLAGC--------SILSDRQPAGIAGFGRSSESLPSQLG 260
+A G L +ET P L GC I D + G+ G R S SL +Q+
Sbjct: 146 SSAEGTLAAETFSLAGAAQPGTLFGCMDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQMS 205
Query: 261 LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
L KFSYC+ +DA VL G G+ D+ +P L YTP S Y V
Sbjct: 206 LPKFSYCI---SGEDALG----VLLLGDGT-DAPSP-LQYTPLVTATTSSPYFNRVAYTV 256
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNY 379
L I V K +++P S VP G G +VDSG+ FTF+ G ++ ++ EF+ Q G
Sbjct: 257 QLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGTQFTFLLGSVYSSLKDEFLEQTKGVL 316
Query: 380 SRAAD--VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV---GNEVL 434
+R D + + C+ + +P + L F GA+M + E V + V
Sbjct: 317 TRIEDPNFVFEGAMDLCYHAPASFAA-VPAVTLVFS-GAEMRVSGERLLYRVSKGSDWVY 374
Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
C + G A ++G QN ++EFDL R GF + C
Sbjct: 375 CFTFGNSDLLGIE-----AYVIGHHHQQNVWMEFDLLKSRVGFTQTTC 417
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 125/445 (28%), Positives = 194/445 (43%), Gaps = 77/445 (17%)
Query: 52 PLKILHSLASSSL----SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSIS 107
P + SL S + +R R LK ++ +D+N P+ S G Y I
Sbjct: 69 PNRTWESLMSEKIRGDANRLRFLKRTSRSSKQDANA---------NVPVRSGS-GEYIIQ 118
Query: 108 LSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG 167
+ FGTP Q+ I DTGS + W PC +C C+ S P F P +SSS +
Sbjct: 119 VDFGTPKQSMYTLI-DTGSDVAWIPCK---QCQGCH------STAPIFDPAKSSSYKPFA 168
Query: 168 CQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFPSKT 226
C + C I G C G N C + + YG G G L S+ + S+
Sbjct: 169 CDSQPCQEISG-----NCGG----NSKC-----QFEVSYGDGTQVDGTLASDAITLGSQY 214
Query: 227 VPNFLAGC--SILSDRQPA------GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
+PNF GC S+ D P+ G +++ ++L FSYCL S
Sbjct: 215 LPNFSFGCAESLSEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSST----S 270
Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSY 338
S +LVL S L +T K+P + FY+V L+ I VG+ + +P +
Sbjct: 271 SGSLVLGKEAAVSSSS---LKFTTLIKDP-----SIPTFYFVTLKAISVGNTRISVPGTN 322
Query: 339 LVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS 398
+ G GG I+DSG+T T + + A+ F +Q+ + + VE + C+D+S
Sbjct: 323 IASG----GGTIIDSGTTITHLVPSAYTALRDAFRQQLSSL-QPTPVED---MDTCYDLS 374
Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
SV +P + L + LP EN + + CL + ++ I+G+
Sbjct: 375 -SSSVDVPTITLHLDRNVDLVLPKENILITQESGLACLAFSSTDSRS---------IIGN 424
Query: 459 FQLQNFYLEFDLANDRFGFAKQKCA 483
Q QN+ + FD+ N + GFA+++CA
Sbjct: 425 VQQQNWRIVFDVPNSQVGFAQEQCA 449
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 107/401 (26%), Positives = 168/401 (41%), Gaps = 44/401 (10%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +S+ G+PPQ + + DTGS L W C++ + P F+ + S+
Sbjct: 81 GQYFVSIRLGSPPQ-TLLLVADTGSDLTWVRCSACKTNCSIHPPGS------TFLARHST 133
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
+ C + C + PN +P N T + Y Y G T+G ET
Sbjct: 134 TFSPTHCFSSLCQLVPQPNP-------NPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETT 186
Query: 221 RFPSKT-----VPNFLAGCSILSD---------RQPAGIAGFGRSSESLPSQLGLK---K 263
+ + + + GC + +G+ G GR S SQLG +
Sbjct: 187 TLNTSSGREMKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRS 246
Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
FSYCLL P S ++ D D+K+ +S+TP NP + FYY+ ++
Sbjct: 247 FSYCLLDYTLSPPPTSYLMIGDVVSTKKDNKS-MMSFTPLLINPEAPT-----FYYISIK 300
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS-RA 382
+ V + I S GNGG ++DSG+T TF+ P + + F R++ S
Sbjct: 301 GVFVDGVKLHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTP 360
Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
+SG C +++G P L L+ G + + PP NYF + + CL +
Sbjct: 361 GGASTRSGFDLCVNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDISEGIKCLAIQPVE 420
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A G ++G+ Q F LEFD R GF+++ CA
Sbjct: 421 AES-----GRFSVIGNLMQQGFLLEFDRGKSRLGFSRRGCA 456
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 108/373 (28%), Positives = 164/373 (43%), Gaps = 50/373 (13%)
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
P + + DTGS + W C C DC + DP F P S+S + C +C
Sbjct: 153 PSSPVYMVLDTGSDVNWIQCAP---CADC-YHQADP----IFEPASSTSYSPLSCDTKQC 204
Query: 174 SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLA 232
+ +V S C RN TC Y + YG G +T G ++ET+ S +V N
Sbjct: 205 QSL---DV-SEC-----RNNTC-----LYEVSYGDGSYTVGDFVTETITLGSASVDNVAI 250
Query: 233 GCSILSDR---QPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPG 289
GC ++ AG+ G G S PSQ+ FSYCL+ R D A S L ++
Sbjct: 251 GCGHNNEGLFIGAAGLLGLGGGKLSFPSQINASSFSYCLVDRDSDSA---STLEFNS--- 304
Query: 290 SGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGV 349
+ P P +N FYYVG+ + VG + + IP S GNGG+
Sbjct: 305 ---ALLPHAITAPLLRN-----RELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGI 356
Query: 350 IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELI 409
I+DSG+ T ++ + A+ F++ + ++V + C+D+S K SV +P +
Sbjct: 357 IIDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEV---ALFDTCYDLSRKTSVEVPTVT 413
Query: 410 LKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFD 469
GG + LP NY V ++ F ++ + I+G+ Q Q + FD
Sbjct: 414 FHLAGGKVLPLPATNYLIPVDSDGTFCFAFAPTSSALS-------IIGNVQQQGTRVGFD 466
Query: 470 LANDRFGFAKQKC 482
LAN GF ++C
Sbjct: 467 LANSLVGFEPRQC 479
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 120/394 (30%), Positives = 172/394 (43%), Gaps = 63/394 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS +VW C +C + VDP F P S+
Sbjct: 195 GEYFTRIGVGTPMREQY-MVLDTGSDVVWIQCEPCSKC----YSQVDP----IFNPSLSA 245
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S +GC + CS++ N GC Y + YG G +T G +E L
Sbjct: 246 SFSTLGCNSAVCSYLDAYNCHG--GGCL------------YKVSYGDGSYTIGSFATEML 291
Query: 221 RFPSKTVPNFLAGCSILSDRQPAGI-------AGFGRSSESLPSQLGL---KKFSYCLLS 270
F + +V N GC AG+ G G S PSQLG + FSYCL+
Sbjct: 292 TFGTTSVRNVAIGCG----HDNAGLFVGAAGLLGLGAGLLSFPSQLGTQTGRAFSYCLVD 347
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
R F + S+ L+ GP +S G TP NP + FYYV L I VG
Sbjct: 348 R-FSE----SSGTLEFGP---ESVPLGSILTPLLTNP-----SLPTFYYVPLISISVGGA 394
Query: 331 HVKI--PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
+ P + + + G GG IVDSG+ T ++ P+++AV F+ +A E
Sbjct: 395 LLDSVPPDVFRIDETSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKA---EGV 451
Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
S C+D+SG V +P ++ F GA + LP +NY ++ + + F A L
Sbjct: 452 SIFDTCYDLSGLPLVNVPTVVFHFSNGASLILPAKNY--MIPMDFMGTFCFAFAPATSDL 509
Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q Q + FD AN GFA ++C
Sbjct: 510 S-----IMGNIQQQGIRVSFDTANSLVGFALRQC 538
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 119/393 (30%), Positives = 168/393 (42%), Gaps = 63/393 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + S + DTGS + W C+ +C + P F P SS
Sbjct: 12 GDYFARIGVGTPAR-SVYMVADTGSDVSWLQCSPCRKCYR--------QQDPIFNPSLSS 62
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + + C + C + + KGCS +NK Y + YG G FT G +ETL
Sbjct: 63 SFKPLACASSICGKL-------KIKGCSRKNKCM------YQVSYGDGSFTVGDFSTETL 109
Query: 221 RFPSKTVPNFLAGCSILSDRQPAGI-------AGFGRSSESLPSQLGLKK---FSYCLLS 270
F V + GC R G+ G GR S PSQ G FSYCL
Sbjct: 110 SFGEHAVRSVAMGCG----RNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPR 165
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
R ++ ++++LV GP + K +T N +YYVGL +I V
Sbjct: 166 R---ESAIAASLVF--GPSAVPEKA---RFTKLLPN-----RRLDTYYYVGLARIRVAGS 212
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
V IP GS G GGVIVDSG+ + + P + A+ F R + + A + S
Sbjct: 213 PVNIPPDAFAMGSRGTGGVIVDSGTAISRLTTPAYTALRDAF-RSLVTFPSAPGI---SL 268
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALG 449
C+D+S K+ LP ++L F GGA M LP + V +E CL + A
Sbjct: 269 FDTCYDLSSMKTATLPAVVLDFDGGASMPLPADGILVNVDDEGTYCLAFAPEEEAFS--- 325
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q Q F + D ++ G A +C
Sbjct: 326 -----IIGNVQQQTFRISIDNQKEQMGIAPDQC 353
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 127/408 (31%), Positives = 181/408 (44%), Gaps = 50/408 (12%)
Query: 81 SNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCV 140
SN G+ S +TPL S G Y++S GTP + DTGS L+W C + RC
Sbjct: 71 SNAGAAPGES-AQTPLKKGS-GDYAMSFGIGTPATGLSGEA-DTGSDLIWTKCGACARC- 126
Query: 141 DCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP 200
P P++ P SSS+ + C + C + P + S G + C
Sbjct: 127 -------SPRGSPSYYPTSSSSAAFVACGDRTCGELPRP-LCSNVAGGGSGSGNC----- 173
Query: 201 SYLLQYGLG-----FTAGLLLSETLRF--PSKTVPNFLAGCSILSDR---QPAGIAGFGR 250
SY YG +T G+L++ET F + P GC++ S+ +G+ G GR
Sbjct: 174 SYHYAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGR 233
Query: 251 SSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS 310
SL +QL ++ F Y L S +P+S + D G+GDS TP NPV
Sbjct: 234 GKLSLVTQLNVEAFGYRLSSDLSAPSPISFGSLADVTGGNGDS----FMSTPLLTNPVVQ 289
Query: 311 SSAFGEFYYVGLRQIIVGSKHVKIPY-SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVA 369
FYYVGL I VG K V+IP ++ S G GGVI DSG+T T + P + V
Sbjct: 290 DL---PFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVR 346
Query: 370 KEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
E + QMG + + L CF G + P ++L F GGA M L ENY +
Sbjct: 347 DELLSQMG-FQKPPPAANDDDLI-CF-TGGSSTTTFPSMVLHFDGGADMDLSTENYLPQM 403
Query: 430 ----GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
G C + + A I+G+ +F++ FDL+ +
Sbjct: 404 QGQNGETARCWSVVKSSQA--------LTIIGNIMQMDFHVVFDLSGN 443
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 117/393 (29%), Positives = 166/393 (42%), Gaps = 51/393 (12%)
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
+ISL+ G+PPQ T + DTGS L W C PN++ + F P SSS
Sbjct: 60 TISLTIGSPPQNVT-MVLDTGSELSWLHCK--------KLPNLNST----FNPLLSSSYT 106
Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFP 223
C + C + + C P NK C + ++ Y +A G L +ET
Sbjct: 107 PTPCNSSVC--MTRTRDLTIPASCDPNNKLCHV-----IVSYADASSAEGTLAAETFSLA 159
Query: 224 SKTVPNFLAGC--------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD 275
P L GC I D + G+ G R S SL +Q+ L KFSYC+ +D
Sbjct: 160 GAAQPGTLFGCMDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQMVLPKFSYCI---SGED 216
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
A L+L GP S L YTP S Y V L I V K +++P
Sbjct: 217 A--FGVLLLGDGP----SAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLP 270
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAAD--VEKKSGLR 392
S VP G G +VDSG+ FTF+ GP++ ++ EF+ Q G +R D + +
Sbjct: 271 KSVFVPDHTGAGQTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMD 330
Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG---NEVLCLILFTDNAAGPALG 449
C+ + +P + L F GA+M + E V + V C + G
Sbjct: 331 LCYHAPASLAA-VPAVTLVFS-GAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIE-- 386
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A ++G QN ++EFDL R GF + C
Sbjct: 387 ---AYVIGHHHQQNVWMEFDLVKSRVGFTETTC 416
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 114/386 (29%), Positives = 164/386 (42%), Gaps = 58/386 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + GTP Q + DT + W PC+ C F NV +S++
Sbjct: 96 YIVRAKIGTPAQ-TMLLAMDTSNDAAWIPCSGCVGCSSTVFNNV-----------KSTTF 143
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ +GC+ P+C + PN S+C G AC ++ + YG A L + +
Sbjct: 144 KTVGCEAPQCKQV--PN--SKCGGS---------AC-AFNMTYGSSSIAANLSQDVVTLA 189
Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
+ ++P++ GC + S P G+ G GR SL SQ L FSYCL S F
Sbjct: 190 TDSIPSYTFGCLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPS--FRSLN 247
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S +L L GP + + TP KNP SS YYV L I VG + V IP S
Sbjct: 248 FSGSLRL--GPVGQPKR---IKTTPLLKNPRRSS-----LYYVNLMAIRVGRRVVDIPPS 297
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
L G I DSG+ FT + P + AV F +++GN A V G C+
Sbjct: 298 ALAFNPTTGAGTIFDSGTVFTRLVAPAYTAVRDAFRKRVGN----ATVTSLGGFDTCY-- 351
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
+ P + F G + LPP+N + + CL + AA P ++
Sbjct: 352 --TSPIVAPTITFMFS-GMNVTLPPDNLLIHSTASSITCLAM----AAAPDNVNSVLNVI 404
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
+ Q QN + FD+ N R G A++ C
Sbjct: 405 ANMQQQNHRILFDVPNSRLGVAREPC 430
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 127/408 (31%), Positives = 181/408 (44%), Gaps = 50/408 (12%)
Query: 81 SNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCV 140
SN G+ S +TPL S G Y++S GTP + DTGS L+W C + RC
Sbjct: 71 SNAGAAPGES-AQTPLKKGS-GDYAMSFGIGTPATGLSGEA-DTGSDLIWTKCGACARC- 126
Query: 141 DCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP 200
P P++ P SSS+ + C + C + P + S G + C
Sbjct: 127 -------SPRGSPSYYPTSSSSAAFVACGDRTCGELPRP-LCSNVAGGGSGSGNC----- 173
Query: 201 SYLLQYGLG-----FTAGLLLSETLRF--PSKTVPNFLAGCSILSDR---QPAGIAGFGR 250
SY YG +T G+L++ET F + P GC++ S+ +G+ G GR
Sbjct: 174 SYHYAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGR 233
Query: 251 SSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS 310
SL +QL ++ F Y L S +P+S + D G+GDS TP NPV
Sbjct: 234 GKLSLVTQLNVEAFGYRLSSDLSAPSPISFGSLADVTGGNGDS----FMSTPLLTNPVVQ 289
Query: 311 SSAFGEFYYVGLRQIIVGSKHVKIPY-SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVA 369
FYYVGL I VG K V+IP ++ S G GGVI DSG+T T + P + V
Sbjct: 290 DL---PFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVR 346
Query: 370 KEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
E + QMG + + L CF G + P ++L F GGA M L ENY +
Sbjct: 347 DELLSQMG-FQKPPPAANDDDLI-CF-TGGSSTTTFPSMVLHFDGGADMDLSTENYLPQM 403
Query: 430 ----GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
G C + + A I+G+ +F++ FDL+ +
Sbjct: 404 QGQNGETARCWSVVKSSQA--------LTIIGNIMQMDFHVVFDLSGN 443
>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 442
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 112/389 (28%), Positives = 175/389 (44%), Gaps = 45/389 (11%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
++L GTPPQ + DTGS L W C ++ P +F P SSS +
Sbjct: 84 VTLPIGTPPQLQQ-MVLDTGSQLSWIQCHNKKTP-----QKKQPPTTSSFDPSLSSSFFV 137
Query: 166 IGCQNPKCS-WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF-P 223
+ C +P C + ++ + C S L SY G + G L+ E + F P
Sbjct: 138 LPCNHPLCKPRVPDFSLPTDCDANS-------LCHYSYFYADGT-YAEGNLVREKIAFSP 189
Query: 224 SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLV 283
S+T P + GC+ SD GI G PSQ + KFSYC+ +++ P S +
Sbjct: 190 SQTTPPIILGCATQSD-DARGILGMNLGRLGFPSQAKITKFSYCVPTKQAQ--PASGSFY 246
Query: 284 LDTGPGSGDSKTPGL-SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPG 342
L P S + L ++ + P A Y + L+ I +G K + IP S P
Sbjct: 247 LGNNPASSSFRYVNLLTFGQSQRMPNLDPLA----YTLPLQGISIGGKKLNIPPSVFKPN 302
Query: 343 SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-------NYSRAADVEKKSGLRPCF 395
+ G+G ++DSGS FT++ + + +E ++++G Y AD+ CF
Sbjct: 303 AGGSGQTMIDSGSEFTYLVDEAYNVIREELVKKVGPKIKKGYMYGGVADI--------CF 354
Query: 396 DISG-KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
D + + +++ +F+ G ++ +P E A V V CL + LG G I
Sbjct: 355 DGDAIEIGRLVGDMVFEFEKGVQIVIPKERVLATVDGGVHCLGM----GRSERLGAGGNI 410
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+F QN ++EFDLAN R GF + C+
Sbjct: 411 I-GNFHQQNLWVEFDLANRRVGFGEADCS 438
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 131/449 (29%), Positives = 188/449 (41%), Gaps = 74/449 (16%)
Query: 47 HSDSDPLKILHSLASSSLS-RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYS 105
+S P K S A + L +AR L + + S++ +++++P Y
Sbjct: 37 NSQCSPFKTSVSWADTLLQDKARFLYLSSLAGVRKSSVPIASGRAIVQSPT-------YI 89
Query: 106 ISLSFGTPPQASTPFI--FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
+ + GTP Q P + DT + W PC+ CV C S F P +SSSS
Sbjct: 90 VRANIGTPAQ---PMLVALDTSNDAAWIPCSG---CVGC-------SSSVLFDPSKSSSS 136
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ + C+ P+C P+ +K+C + + YG L +TL
Sbjct: 137 RTLQCEAPQCKQAPNPSCTV--------SKSC-----GFNMTYGGSTIEAYLTQDTLTLA 183
Query: 224 SKTVPNFLAGC--SILSDRQPA-GIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
S +PN+ GC PA G+ G GR SL SQ L FSYCL + K
Sbjct: 184 SDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSK----- 238
Query: 278 VSSNLV--LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
SSN L GP + + + TP KNP SS YYV L I VG+K V IP
Sbjct: 239 -SSNFSGSLRLGPKNQPIR---IKTTPLLKNPRRSS-----LYYVNLVGIRVGNKIVDIP 289
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
S L G I DSG+ +T + P + AV EF R++ N A+ G C+
Sbjct: 290 TSALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKN----ANATSLGGFDTCY 345
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYF--ALVGNEVLCLILFTDNAAGPALGRGPA 453
SV P + F G + LPP+N + GN + CL + AA P
Sbjct: 346 S----GSVVFPSVTFMF-AGMNVTLPPDNLLIHSSAGN-LSCLAM----AAAPVNVNSVL 395
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++ Q QN + D+ N R G +++ C
Sbjct: 396 NVIASMQQQNHRVLIDVPNSRLGISRETC 424
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 118/393 (30%), Positives = 168/393 (42%), Gaps = 63/393 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + S + DTGS + W C+ +C + P F P SS
Sbjct: 79 GDYFARIGVGTPAR-SVYMVADTGSDVSWLQCSPCRKCYR--------QQDPIFNPSLSS 129
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + + C + C + + KGCS +N+ Y + YG G FT G +ETL
Sbjct: 130 SFKPLACASSICGKL-------KIKGCSRKNECM------YQVSYGDGSFTVGDFSTETL 176
Query: 221 RFPSKTVPNFLAGCSILSDRQPAGI-------AGFGRSSESLPSQLGLK---KFSYCLLS 270
F V + GC R G+ G GR S PSQ G FSYCL
Sbjct: 177 SFGEHAVRSVAMGCG----RNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPR 232
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
R ++ ++++LV GP + K +T N +YYVGL +I V
Sbjct: 233 R---ESAIAASLVF--GPSAVPEKA---RFTKLLPN-----RRLDTYYYVGLARIRVAGS 279
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
V IP GS G GGVIVDSG+ + + P + A+ F R + + A + S
Sbjct: 280 PVNIPPDAFAMGSRGTGGVIVDSGTAISRLTTPAYTALRDAF-RSLVTFPSAPGI---SL 335
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALG 449
C+D+S K+ LP ++L F GGA M LP + V +E CL + A
Sbjct: 336 FDTCYDLSSMKTATLPAVVLDFDGGASMPLPADGILVNVDDEGTYCLAFAPEEEAFS--- 392
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q Q F + D ++ G A +C
Sbjct: 393 -----IIGNVQQQTFRISIDNQKEQMGIAPDQC 420
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 127/398 (31%), Positives = 170/398 (42%), Gaps = 62/398 (15%)
Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
S Y I L FGTPPQ S + DTGS++ W PC C C+ S+ F P +
Sbjct: 120 SSSNYIIKLGFGTPPQ-SFYTVLDTGSNIAWIPCNP---CSGCS------SKQQPFEPSK 169
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
SS+ + C + +C + R S + C L +YG +L SE
Sbjct: 170 SSTYNYLTCASQQCQLL-------RVCTKSDNSVNCSLT-----QRYGDQSEVDEILSSE 217
Query: 219 TLRFPSKTVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSR 271
TL S+ V NF+ GCS L R P+ + GFGR+ S SQ FSYCL S
Sbjct: 218 TLSVGSQQVENFVFGCSNAARGLIQRTPS-LVGFGRNPLSFVSQTATLYDSTFSYCLPSL 276
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
F A S L+ G GL +TP N S + FYYVGL I VG +
Sbjct: 277 -FSSAFTGSLLL-----GKEALSAQGLKFTPLLSN-----SRYPSFYYVGLNGISVGEEL 325
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA--DVEKKS 389
V IP L G I+DSG+ T + P + A+ F Q+ N + A+ D+
Sbjct: 326 VSIPAGTLSLDESTGRGTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTMASPTDLFDTC 385
Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE---VLCLILFTDNAAGP 446
RP D V P + L F + LP +N GN+ VLCL A G
Sbjct: 386 YNRPSGD------VEFPLITLHFDDNLDLTLPLDNIL-YPGNDDGSVLCL------AFGL 432
Query: 447 ALGRGPAII--LGDFQLQNFYLEFDLANDRFGFAKQKC 482
G G ++ G++Q Q + D+A R G A + C
Sbjct: 433 PPGGGDDVLSTFGNYQQQKLRIVHDVAESRLGIASENC 470
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 115/391 (29%), Positives = 168/391 (42%), Gaps = 46/391 (11%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTPPQ + I D+GS L+W C C+ C + P + P SS
Sbjct: 63 GQYFVDFFLGTPPQKFS-LIVDSGSDLLWVQCAP---CLQCYAQDT-----PLYAPSNSS 113
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ + C +P+C I P E P + P AC +Y +Y + G+ E+
Sbjct: 114 TFNPVPCLSPECLLI--PATEGF-----PCDFHYPGAC-AYEYRYADTSLSKGVFAYESA 165
Query: 221 RFPSKTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGL---KKFSYCLLSRK 272
+ GC D Q G+ G G+ S SQ+G KF+YCL++
Sbjct: 166 TVDDVRIDKVAFGCG--RDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNY- 222
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
D VSS L+ GD + F P+ S+S YYV + +++VG + +
Sbjct: 223 LDPTSVSSWLIF------GDELISTIHDLQF--TPIVSNSRNPTLYYVQIEKVMVGGESL 274
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
I +S GNGG I DSG+T T+ P + + F + + Y RAA V+ GL
Sbjct: 275 PISHSAWSLDFLGNGGSIFDSGTTVTYWLPPAYRNILAAFDKNV-RYPRAASVQ---GLD 330
Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
C D++G P + GGA NYF V V CL + AG G
Sbjct: 331 LCVDVTGVDQPSFPSFTIVLGGGAVFQPQQGNYFVDVAPNVQCLAM-----AGLPSSVGG 385
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+G+ QNF +++D +R GFA KC+
Sbjct: 386 FNTIGNLLQQNFLVQYDREENRIGFAPAKCS 416
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 130/444 (29%), Positives = 186/444 (41%), Gaps = 74/444 (16%)
Query: 52 PLKILHSLASSSLS-RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSF 110
P K S A + L +AR L + + S++ +++++P Y + +
Sbjct: 42 PFKTSVSWADTLLQDKARFLYLSSLAGVRKSSVPIASGRAIVQSPT-------YIVRANI 94
Query: 111 GTPPQASTPFI--FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGC 168
GTP Q P + DT + W PC+ CV C S F P +SSSS+ + C
Sbjct: 95 GTPAQ---PMLVALDTSNDAAWIPCSG---CVGC-------SSSVLFDPSKSSSSRTLQC 141
Query: 169 QNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVP 228
+ P+C P+ +K+C + + YG L +TL S +P
Sbjct: 142 EAPQCKQAPNPSCTV--------SKSC-----GFNMTYGGSTIEAYLTQDTLTLASDVIP 188
Query: 229 NFLAGC--SILSDRQPA-GIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAPVSSNL 282
N+ GC PA G+ G GR SL SQ L FSYCL + K SSN
Sbjct: 189 NYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSK------SSNF 242
Query: 283 V--LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
L GP + + + TP KNP SS YYV L I VG+K V IP S L
Sbjct: 243 SGSLRLGPKNQPIR---IKTTPLLKNPRRSS-----LYYVNLVGIRVGNKIVDIPTSALA 294
Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGK 400
G I DSG+ +T + P + AV EF R++ N A+ G C+
Sbjct: 295 FDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKN----ANATSLGGFDTCYS---- 346
Query: 401 KSVYLPELILKFKGGAKMALPPENYF--ALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
SV P + F G + LPP+N + GN + CL + AA P ++
Sbjct: 347 GSVVFPSVTFMF-AGMNVTLPPDNLLIHSSAGN-LSCLAM----AAAPVNVNSVLNVIAS 400
Query: 459 FQLQNFYLEFDLANDRFGFAKQKC 482
Q QN + D+ N R G +++ C
Sbjct: 401 MQQQNHRVLIDVPNSRLGISRETC 424
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 106/337 (31%), Positives = 157/337 (46%), Gaps = 40/337 (11%)
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
SS+ + + C +P C G +V + C+ N C YL YG TAG + +
Sbjct: 2 SSTFKAVACPDPICRPSSGVSVSA----CAMENFQC-----FYLCSYGDRSITAGHIFKD 52
Query: 219 TLRFPSKT-----VPNFLAGC----SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLL 269
T F S V GC + L +GIAGFGR +SLPSQL + +FSYCL
Sbjct: 53 TFTFMSPNGVPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLPSQLKVGRFSYCL- 111
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
SS ++L T P + + PF P+ + FYY+ L I VG
Sbjct: 112 --TLVTESKSSVVILGTPPDPDGLR--AHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGK 167
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ--MGNYSRAADVEK 387
+ S DG+GG ++DSG++ T + +FE + +E + Q + Y +V
Sbjct: 168 TRLPFDKSVFALKKDGSGGTVIDSGTSLTTLPEAVFELLQEELVAQFPLPRYDNTPEV-- 225
Query: 388 KSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAG 445
G R CF G K V +P+LIL GA M LP +NYF + V+CL + N A
Sbjct: 226 --GDRLCFRRPKGGKQVPVPKLILHL-AGADMDLPRDNYFVEEPDSGVMCLQI---NGAE 279
Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+++G+FQ QN ++ +D+ N++ FA +C
Sbjct: 280 DTT----MVLIGNFQQQNMHVVYDVENNKLLFAPAQC 312
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 118/399 (29%), Positives = 176/399 (44%), Gaps = 69/399 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y ++L+ G+PPQ S I DTGS L W +C+ C P P F P +S
Sbjct: 37 GEYLMTLTLGSPPQ-SFDVIVDTGSDLNWV------QCLPCRVCYQQPG--PKFDPSKSR 87
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPL-ACPSYLLQYGLGF-----TAGLL 215
S + C + C NV + PL AC + + QY + T G L
Sbjct: 88 SFRKAACTDNLC------NVSA-----------LPLKACAANVCQYQYTYGDQSNTNGDL 130
Query: 216 LSETLRFP----SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQLG---LKKFS 265
ET+ +++VPNF GC ++ + AG+ G G+ SL SQL KFS
Sbjct: 131 AFETISLNNGAGTQSVPNFAFGCGTQNLGTFAGAAGLVGLGQGPLSLNSQLSHTFANKFS 190
Query: 266 YCLLS-RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
YCL+S +P++ + + + YT N + +YYV L
Sbjct: 191 YCLVSLNSLSASPLTFGSI---------AAAANIQYTSIVVN-----ARHPTYYYVQLNS 236
Query: 325 IIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
I VG + + + P + + S G GG I+DSG+T T + P + AV + + NY R
Sbjct: 237 IEVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAY-ESFVNYPRLD 295
Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNA 443
GL CF+I+G + +P+++ KF+ GA + EN F LV L L +
Sbjct: 296 G--SAYGLDLCFNIAGVSNPSVPDMVFKFQ-GADFQMRGENLFVLVDTSATTLCLAMGGS 352
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G + I+G+ Q QN + +DL + GFA C
Sbjct: 353 QGFS-------IIGNIQQQNHLVVYDLEAKKIGFATADC 384
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 118/418 (28%), Positives = 174/418 (41%), Gaps = 60/418 (14%)
Query: 88 SNSLIKTPLSVHSYGGYS--ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP 145
S S ++P +H + S +SL+ GTPPQ + + DTGS L W C
Sbjct: 67 SGSFPRSPNKLHFHHNVSLTVSLTVGTPPQ-NVSMVLDTGSELSWLRC------------ 113
Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
N + F P RSSS + C + C+ R+ P +C S L
Sbjct: 114 NKTQTFQTTFDPNRSSSYSPVPCSSLTCT-------------DRTRDFPIPASCDSNQLC 160
Query: 206 YGL------GFTAGLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRSS 252
+ + + G L S+T + +P + GC + D + G+ G R S
Sbjct: 161 HAILSYADASSSEGNLASDTFYIGNSDMPGTIFGCMDSSFSTNTEEDSKNTGLMGMNRGS 220
Query: 253 ESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
S SQ+ KFSYC+ F VL G + P L+YTP +
Sbjct: 221 LSFVSQMDFPKFSYCISDSDFSG-------VLLLGDANFSWLMP-LNYTPLIQISTPLPY 272
Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
Y V L I V SK + +P S VP G G +VDSG+ FTF+ GP++ A+ EF
Sbjct: 273 FDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRNEF 332
Query: 373 IRQMGNYSRAADVEK---KSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFA 427
+ Q R + + G+ C+ + S +LP + L F+ GA+M + +
Sbjct: 333 LNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSLMFR-GAEMKVSGDRLLY 391
Query: 428 LVGNEVL---CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
V EV + FT L A ++G QN ++EFDL R GFA+ +C
Sbjct: 392 RVPGEVRGSDSVYCFT--FGNSDLLAVEAYVIGHHHQQNVWMEFDLEKSRIGFAQVQC 447
>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
Length = 360
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 100/316 (31%), Positives = 146/316 (46%), Gaps = 44/316 (13%)
Query: 188 CSPRNKTCPLACPSYLLQYG-----------LGFTAGLLLSETLRFPSKTVPNFLAGCSI 236
C N+TCP Y YG FT L +S + + V N + GC
Sbjct: 67 CKAENQTCP-----YYYWYGDSSNTTGDFALETFTVNLTMSSG-KPELRRVENVMFGCGH 120
Query: 237 LSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGS 290
+ AG+ G GR S SQL FSYCL+ R DA VSS L+ G
Sbjct: 121 WNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRN-SDANVSSKLIF--GEDK 177
Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVI 350
P L++T G + FYYV ++ I+VG + V IP +DG+GG I
Sbjct: 178 DLLSHPELNFTTLV---AGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGGTI 234
Query: 351 VDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELIL 410
+DSG+T ++ P ++ + + F+ ++ Y D L PC++++G + LP+ +
Sbjct: 235 IDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPV---LEPCYNVTGVEQPDLPDFGI 291
Query: 411 KFKGGAKMALPPENYFALVG-NEVLCLILFTDNAAGPALGRGPAI--ILGDFQLQNFYLE 467
F GA P ENYF + EV+CL + LG P+ I+G++Q QNF++
Sbjct: 292 VFSDGAVWNFPVENYFIEIEPREVVCLAI---------LGTPPSALSIIGNYQQQNFHIL 342
Query: 468 FDLANDRFGFAKQKCA 483
+D R GFA KCA
Sbjct: 343 YDTKKSRLGFAPTKCA 358
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 124/440 (28%), Positives = 187/440 (42%), Gaps = 77/440 (17%)
Query: 54 KILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTP 113
+ + + AS L +AR K T P + ++G+ G Y +S+ GTP
Sbjct: 112 RKIAAAASPVLDQARGKKGVTLPAQRGISLGT----------------GNYVVSMGLGTP 155
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
+ T +FDTGS L W CT C DC + P F P RSS+ + C +P+C
Sbjct: 156 ARDMT-VVFDTGSDLSWVQCTP---CSDCY-----EQKDPLFDPARSSTYSAVPCASPEC 206
Query: 174 SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF-PSKTVPNFL 231
++SR CS R+K C Y + YG T G L +TL S +P F+
Sbjct: 207 Q-----GLDSR--SCS-RDKKC-----RYEVVYGDQSQTDGALARDTLTLTQSDVLPGFV 253
Query: 232 AGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLD 285
GC + G+ G GR SL SQ K FSYCL S +P ++ +
Sbjct: 254 FGCGEQDTGLFGRADGLVGLGREKVSLSSQAASKYGAGFSYCLPS-----SPSAAGYLSL 308
Query: 286 TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG 345
GP +++ + + +P FYYV L + V + V++ P
Sbjct: 309 GGPAPANARFTAMETR--HDSP--------SFYYVRLVGVKVAGRTVRVS-----PIVFS 353
Query: 346 NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN--YSRAADVEKKSGLRPCFDISGKKSV 403
G ++DSG+ T + ++ A+ F R MG Y RA + S L C+D +G +V
Sbjct: 354 AAGTVIDSGTVITRLPPRVYAALRSAFARSMGRYGYKRAPAL---SILDTCYDFTGHTTV 410
Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
+P + L F GGA + L + CL F N G G I+G+ Q +
Sbjct: 411 RIPSVALVFAGGAAVGLDFSGVLYVAKVSQACLA-FAPNGDGADAG-----IIGNTQQKT 464
Query: 464 FYLEFDLANDRFGFAKQKCA 483
+ +D+A + GF C+
Sbjct: 465 LAVVYDVARQKIGFGANGCS 484
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 118/426 (27%), Positives = 175/426 (41%), Gaps = 52/426 (12%)
Query: 95 PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVD------ 148
PL Y S G PPQ + + DTGS LVW C++ C P
Sbjct: 69 PLRWSGKTQYIASYGIGDPPQPAEAVV-DTGSDLVWTQCST------CRLPAAAAAGGGG 121
Query: 149 --PSRIPAFIPKRSSSSQLIGCQNPKCSWI-FGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
P +P + S +++ + C + + P +G + C +A
Sbjct: 122 CFPQNLPYYNFSLSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAA-----S 176
Query: 206 YGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQP------AGIAGFGRSSESLPSQL 259
YG G G+L ++ FPS + GC + P +GI G GR + SL SQL
Sbjct: 177 YGAGVALGVLGTDAFTFPSSSSVTLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQL 236
Query: 260 GLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG--------LSYTPFYKNPVGSS 311
+FSYCL + F D S+L + G +G S G ++ PF KNP
Sbjct: 237 NATEFSYCL-TPYFRDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNP--KD 293
Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYL----VPGSDGNGGVIVDSGSTFTFMEGPLFEA 367
S F FYY+ L + G+ V +P GG ++DSGS FT + P A
Sbjct: 294 SPFSTFYYLPLVGLAAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRA 353
Query: 368 VAKEFIRQM-GNYSRAADVEKKSG-LRPCF----DISGKKSVYLPELILKFK----GGAK 417
+ KE RQ+ G+ S K G L C D + +P L+L+F GG +
Sbjct: 354 LTKELARQLRGSGSLVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRE 413
Query: 418 MALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
+ +P E Y+A V C+ + + + L I+G+F Q+ + +DLAN F
Sbjct: 414 LVIPAEKYWARVEASTWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSF 473
Query: 478 AKQKCA 483
C+
Sbjct: 474 QPANCS 479
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 124/445 (27%), Positives = 196/445 (44%), Gaps = 77/445 (17%)
Query: 52 PLKILHSLASSSL----SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSIS 107
P + SL S + +R R LK ++ +D+N P+ S G Y I
Sbjct: 69 PNRTWESLMSEKIRGDANRLRFLKRTSRSSKEDANA---------NVPVRSGS-GEYIIQ 118
Query: 108 LSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG 167
+ FGTP Q+ I DTGS + W PC +C C+ S P F P +SSS +
Sbjct: 119 VDFGTPKQSMYTLI-DTGSDVAWIPCK---QCQGCH------STAPIFDPAKSSSYKPFA 168
Query: 168 CQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFPSKT 226
C + C I G C G N C + + YG G G L S+ + S+
Sbjct: 169 CDSQPCQEISG-----NCGG----NSKC-----QFEVLYGDGTQVDGTLASDAITLGSQY 214
Query: 227 VPNFLAGC--SILSDRQPA------GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
+PNF GC S+ D + G +++ ++L FSYCL S +
Sbjct: 215 LPNFSFGCAESLSEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSS----STS 270
Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSY 338
S +LVL S L +T K+P +F FY+V L+ I VG+ + +P +
Sbjct: 271 SGSLVLGKEAAVSSS---SLKFTTLIKDP-----SFPTFYFVTLKAISVGNTRISVPATN 322
Query: 339 LVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS 398
+ G GG I+DSG+T T++ ++ + F +Q+ + + VE + C+D+S
Sbjct: 323 IASG----GGTIIDSGTTITYLVPSAYKDLRDAFRQQLSSL-QPTPVED---MDTCYDLS 374
Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
SV +P + L + LP EN + + CL + ++ I+G+
Sbjct: 375 SS-SVDVPTITLHLDRNVDLVLPKENILITQESGLSCLAFSSTDSRS---------IIGN 424
Query: 459 FQLQNFYLEFDLANDRFGFAKQKCA 483
Q QN+ + FD+ N + GFA+++CA
Sbjct: 425 VQQQNWRIVFDVPNSQVGFAQEQCA 449
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 113/391 (28%), Positives = 166/391 (42%), Gaps = 40/391 (10%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS LVW C+ RC R F P+RSS
Sbjct: 84 GEYFALVGVGTPSTKAM-LVIDTGSDLVWLQCSPCRRCY--------AQRGQVFDPRRSS 134
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETL 220
+ + + C +P+C + P C C Y++ YG G ++ G L ++ L
Sbjct: 135 TYRRVPCSSPQCRALRFPG----CDSGGAAGGGC-----RYMVAYGDGSSSTGDLATDKL 185
Query: 221 RFPSKT-VPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLG---LKKFSYCLLSRKF 273
F + T V N GC ++ AG+ G GR S+ +Q+ F YCL R
Sbjct: 186 AFANDTYVNNVTLGCGRDNEGLFDSAAGLLGVGRGKISISTQVAPAYGSVFEYCLGDRT- 244
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
+ SS LV P + P ++T NP S YYV + VG + V
Sbjct: 245 SRSTRSSYLVFGRTP-----EPPSTAFTALLSNPRRPS-----LYYVDMAGFSVGGERVT 294
Query: 334 --IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
S + + G GGV+VDSG+ + + A+ F + + S
Sbjct: 295 GFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVF 354
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C+D+ G+ + P ++L F GGA MALPPENYF V + A G
Sbjct: 355 DACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDG 414
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++I G+ Q Q F + FD+ +R GFA + C
Sbjct: 415 LSVI-GNVQQQGFRVVFDVEKERIGFAPKGC 444
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 125/404 (30%), Positives = 173/404 (42%), Gaps = 67/404 (16%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
+ PL+ S GY++++ GTPPQ T I DT S L W CN N +
Sbjct: 79 MSVPLARISDEGYTVTIGIGTPPQLHT-LIADTASDLTW---------TQCNLFNDTAKQ 128
Query: 152 I-PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF 210
+ P F P +SSS + C + C+ K CS NKTC Y+ Y
Sbjct: 129 VEPLFDPAKSSSFAFVTCSSKLCT-----EDNPGTKRCS--NKTC-----RYVYPYVSVE 176
Query: 211 TAGLLLSETLRFPSKT---VPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKKF 264
AG+L E+ +F GC L+D +GI G + S+ SQL + KF
Sbjct: 177 AAGVLAYESFTLSDNNQHICMSFGFGCGALTDGNLLGASGILGMSPAILSMVSQLAIPKF 236
Query: 265 SYCLL---SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
SYCL RK SS L G KT G P+ S F +YYV
Sbjct: 237 SYCLTPYTDRK------SSPLFFGAWADLGRYKTTG---------PIQKSLTF--YYYVP 279
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
L + +G++ + +P + + GG +VD G T + P F A+ KE + N
Sbjct: 280 LVGLSLGTRRLDVPAATF---ALKQGGTVVDLGCTVGQLAEPAFTAL-KEAVLHTLNLPL 335
Query: 382 AADVEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
K + CF + +V P L+L F GGA M LP +NYF ++CL L
Sbjct: 336 TNRTVKD--YKVCFALPSGVAMGAVQTPPLVLYFDGGADMVLPRDNYFQEPTAGLMCLAL 393
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G G +II G+ Q QNF+L FD+ + +F FA C
Sbjct: 394 VP--------GGGMSII-GNVQQQNFHLLFDVHDSKFLFAPTIC 428
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 117/391 (29%), Positives = 172/391 (43%), Gaps = 54/391 (13%)
Query: 115 QASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS 174
Q + I DTGS V C SR R P F P S S + + C + C
Sbjct: 9 QKNLSAIIDTGSEAVLVQCGSRSR--------------PVFDPAASQSYRQVPCISQLCL 54
Query: 175 WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLA-- 232
+ + C + C +Y L YG + S+ + F + T + A
Sbjct: 55 AVQQQTSNGSSQPCVNSSAAC-----TYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQ 109
Query: 233 ------GCS-----ILSDRQPAGIAGFGRSSESLPSQL----GLKKFSYCLLSRKFDDAP 277
GC+ L D GI GF R + SLPSQL G KFSYC S+ + P
Sbjct: 110 FRDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQ--P 167
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
++ ++ G SK +SYTP NPV + A + YYVGL I V K + IP S
Sbjct: 168 RATGVIFLGDSGLSKSK---VSYTPLLDNPV--TPARSQLYYVGLTSISVDGKTLAIPES 222
Query: 338 -YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
+ + S G+GG ++DSG+TFT + + A F + R V +G C++
Sbjct: 223 AFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKK-VGAAAGFDDCYN 281
Query: 397 ISGKKSV-YLPELILKFKGGAKMALPPENYFALV---GNEV-LCLILFTDNAAGPALGRG 451
IS S+ +PE+ L + ++ L E+ F V GNEV +CL + + +G G
Sbjct: 282 ISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSG----FG 337
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+LG++Q N+ +E+D R GF + C
Sbjct: 338 KINVLGNYQQSNYLVEYDNERSRVGFERADC 368
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 116/397 (29%), Positives = 171/397 (43%), Gaps = 62/397 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + L+ GTP + DTGS LVW C C D + P +DP+ SS+
Sbjct: 84 YLVRLAVGTP-RRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAA--------SSTY 134
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
+ C +C + + R G +++C Y YG T G + ++ F
Sbjct: 135 AALPCGAARCRALPFTSCGVRTLG---NHRSC-----IYAYHYGDKSLTVGEIATDRFTF 186
Query: 223 -------PSKTVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
S GC L+ GIAGFGR SLPSQL + FSYC S
Sbjct: 187 GDSGGSGESLHTRRLTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNVTSFSYCFTS- 245
Query: 272 KFDDAPVSSNLVLDTGPGS--GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
F+ SS + L P + + + + TP KNP S Y++ L+ I VG
Sbjct: 246 MFESK--SSLVTLGGSPAALYSHAHSGEVRTTPILKNPSQPS-----LYFLSLKGISVGK 298
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
+ +P + I+DSG++ T + ++EAV EF Q+G + S
Sbjct: 299 TRLPVPETKF-------RSTIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVE---GS 348
Query: 390 GLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFTDNAAG 445
L CF + + + +P L L + GA LP NY F +G V+C++L A
Sbjct: 349 ALDLCFALPVTALWRRPAVPSLTLHLE-GADWELPRSNYVFEDLGARVMCIVL----DAA 403
Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
P G ++G+FQ QN ++ +DL NDR FA +C
Sbjct: 404 P----GEQTVIGNFQQQNTHVVYDLENDRLSFAPARC 436
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 118/389 (30%), Positives = 164/389 (42%), Gaps = 62/389 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + + GTP QA DT + W PC+ CV C S F P +SSSS
Sbjct: 88 YIVRANIGTPAQAML-VALDTSNDAAWIPCSG---CVGC-------SSSVLFDPSKSSSS 136
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ + C+ P+C P+ +K+C + + YG L +TL
Sbjct: 137 RTLQCEAPQCKQAPNPSCTV--------SKSC-----GFNMTYGGSAIEAYLTQDTLTLA 183
Query: 224 SKTVPNFLAGC--SILSDRQPA-GIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
+ +PN+ GC PA G+ G GR SL SQ L FSYCL + K
Sbjct: 184 TDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSK----- 238
Query: 278 VSSNLV--LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
SSN L GP + + + TP KNP SS YYV L I VG+K V IP
Sbjct: 239 -SSNFSGSLRLGPKNQPIR---IKTTPLLKNPRRSS-----LYYVNLVGIRVGNKIVDIP 289
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
S L G I DSG+ +T + P + A+ EF R++ N A+ G C+
Sbjct: 290 TSALAFDPATGAGTIFDSGTVYTRLVEPAYVAMRNEFRRRVKN----ANATSLGGFDTCY 345
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYF--ALVGNEVLCLILFTDNAAGPALGRGPA 453
SV P + F G + LPP+N + GN + CL + AA P
Sbjct: 346 S----GSVVFPSVTFMF-AGMNVTLPPDNLLIHSSAGN-LSCLAM----AAAPTNVNSVL 395
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++ Q QN + D+ N R G +++ C
Sbjct: 396 NVIASMQQQNHRVLIDVPNSRLGISRETC 424
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 117/388 (30%), Positives = 162/388 (41%), Gaps = 49/388 (12%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G+PP + D+GS ++W C +C + DP F P SS
Sbjct: 128 GEYFVRVGVGSPP-TDQYLVVDSGSDVIWVQCRPCEQC----YAQTDP----LFDPAASS 178
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C + C + G C Y + YG G +T G L ETL
Sbjct: 179 SFSGVSCGSAICR-----TLSGTGCGGGGDAGKC-----DYSVTYGDGSYTKGELALETL 228
Query: 221 RFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFD 274
V GC + AG+ G G + SL QLG FSYCL SR
Sbjct: 229 TLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAG 288
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
A +LVL G ++ G + P +N SS FYYVGL I VG + + +
Sbjct: 289 GA---GSLVL----GRTEAVPVGAVWVPLVRNNQASS-----FYYVGLTGIGVGGERLPL 336
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
S DG GGV++D+G+ T + + A+ F MG R+ V S L C
Sbjct: 337 QDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAV---SLLDTC 393
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
+D+SG SV +P + F GA + LP N VG V CL F +++G +
Sbjct: 394 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFCLA-FAPSSSGIS------- 445
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
ILG+ Q + + D AN GF C
Sbjct: 446 ILGNIQQEGIQITVDSANGYVGFGPNTC 473
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 121/385 (31%), Positives = 163/385 (42%), Gaps = 65/385 (16%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
+ DTGS +VW C RC + P F P+RSSS +GC C +
Sbjct: 1 MVLDTGSDVVWVQCAPCRRCYE--------QSGPVFDPRRSSSYGAVGCGAALCRRL--- 49
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGCSIL 237
GC R C Y + YG G TAG ++ETL F V GC
Sbjct: 50 ----DSGGCDLRRGAC-----MYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHD 100
Query: 238 SDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVS--SNLVLDTGPG 289
++ AG+ G GR S P+Q+ + FSYCL+ R A + S+ G
Sbjct: 101 NEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFG 160
Query: 290 SGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-KIPYSYL-VPGSDGNG 347
+G S+TP +NP FYYV L I VG V + S L + S G G
Sbjct: 161 AGSVGASSASFTPMVRNP-----RMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRG 215
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR----------PCFDI 397
GVIVDSG++ T + + A+ F RAA GLR C+D+
Sbjct: 216 GVIVDSGTSVTRLARASYSALRDAF--------RAA---AAGGLRLSPGGFSLFDTCYDL 264
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILG 457
G++ V +P + + F GGA+ ALPPENY V + F G + I+G
Sbjct: 265 GGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVS-------IIG 317
Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
+ Q Q F + FD R GFA + C
Sbjct: 318 NIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 114/393 (29%), Positives = 165/393 (41%), Gaps = 50/393 (12%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTPPQ + I D+GS L+W C+ +C P ++P SS
Sbjct: 62 GQYFVDFFLGTPPQKFS-LIVDSGSDLLWVQCSPCRQCY--------AQDSPLYVPSNSS 112
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
+ + C + C I P E P + P AC L + G+ E+
Sbjct: 113 TFSPVPCLSSDCLLI--PATEGF-----PCDFRYPGACAYEYLYADTSSSKGVFAYESAT 165
Query: 222 FPSKTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKF 273
+ GC SD Q G+ G G+ S SQ+G KF+YCL++
Sbjct: 166 VDGVRIDKVAFGCG--SDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNY-L 222
Query: 274 DDAPVSSNLVLDTGPGSGD---SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
D VSS+L+ GD S + YTP NP + YYV + ++ VG K
Sbjct: 223 DPTSVSSSLIF------GDELISTIHDMQYTPIVSNPKSPT-----LYYVQIEKVTVGGK 271
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+ I S GNGG I DSG+T T+ + + F + +Y RA V+ G
Sbjct: 272 SLPISDSAWEIDLLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGV-HYPRAESVQ---G 327
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
L C +++G P ++F GA ENYF V V CL + AG A
Sbjct: 328 LDLCVELTGVDQPSFPSFTIEFDDGAVFQPEAENYFVDVAPNVRCLAM-----AGLASPL 382
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G +G+ QNF++++D + GFA KC+
Sbjct: 383 GGFNTIGNLLQQNFFVQYDREENLIGFAPAKCS 415
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 113/401 (28%), Positives = 171/401 (42%), Gaps = 69/401 (17%)
Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y + L+ GTPP PF+ DTGS L W C C + P DPS F P
Sbjct: 77 YLMELAIGTPP---VPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSP---- 129
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGL-----GFTAGLLL 216
+ C + C + R + CS PS L +YG ++AG+L
Sbjct: 130 ----VPCSSATCLPVL------RSRNCS---------TPSSLCRYGYSYSDGAYSAGILG 170
Query: 217 SETLRFPSK------TVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQLGLKKFSYC 267
+ETL S +V + GC + G G GR + SL +QLG+ KFSYC
Sbjct: 171 TETLTLGSSVPGQAVSVSDVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYC 230
Query: 268 LLS--RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
L D+P + + PG G ++ TP ++P+ S Y V L+ I
Sbjct: 231 LTDFFNSTLDSPFLLGTLAELAPGPGAVQS-----TPLLQSPLNPSR-----YVVSLQGI 280
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
+G + IP ++ GG++VDSG+TF+ + F V + +G V
Sbjct: 281 TLGDVRLPIPNKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVDHVAQVLGQ----PPV 336
Query: 386 EKKSGLRPCFDI-SGKKSV-YLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDN 442
S PCF +G++ + ++P+L+L F GGA M L +NY + + CL +
Sbjct: 337 NASSLDSPCFPAPAGERQLPFMPDLVLHFAGGADMRLHRDNYMSYNQEDSSFCLNIVGTT 396
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ +LG+FQ QN + FD+ + F C+
Sbjct: 397 STWS--------MLGNFQQQNIQMLFDMTVGQLSFLPTDCS 429
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 111/381 (29%), Positives = 172/381 (45%), Gaps = 52/381 (13%)
Query: 116 ASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF--IPKRSSSSQLIGCQNPKC 173
A + DT S L W C C D P DPS P++ +P SSS C +
Sbjct: 129 AEATVVVDTASELTWVQCQPCESCHDQQDPLFDPSSSPSYAAVPCNSSS-----CDALRV 183
Query: 174 SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLA 232
+ G + C+ N+ P AC SY L Y G ++ G+L + LR + + F+
Sbjct: 184 AMAAGTSP------CADDNEQQP-AC-SYALSYRDGSYSRGVLARDKLRLAGQDIEGFVF 235
Query: 233 GCSILSDRQP----AGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLD 285
GC + P +G+ G GRS SL SQ + FSYCL R ++ S +LVL
Sbjct: 236 GCGTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGGVFSYCLPMR---ESGSSGSLVL- 291
Query: 286 TGPGSGDSKTPGLSYTPFYKNPVGSSSA--FGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
GD + + TP + S S G FY++ L I VG + V+ P+
Sbjct: 292 -----GDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVESPWF------ 340
Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
G VI+DSG+ T + ++ AV EF+ Q+ Y +A S L CF+++G K V
Sbjct: 341 -SAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAF---SILDTCFNLTGLKEV 396
Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEV--LCLILFTDNAAGPALGRGPAIILGDFQL 461
+P L F+G ++ + + V ++ +CL L + + I+G++Q
Sbjct: 397 QVPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKS------EYDTSIIGNYQQ 450
Query: 462 QNFYLEFDLANDRFGFAKQKC 482
+N + FD + GFA++ C
Sbjct: 451 KNLRVIFDTLGSQIGFAQETC 471
>gi|383130038|gb|AFG45739.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 154
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 58/151 (38%), Positives = 88/151 (58%), Gaps = 4/151 (2%)
Query: 291 GDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
GD P L+YTPF N SSS + FYY+ LR + +G K + +P + GNG
Sbjct: 5 GDKALPTEMSLNYTPFLINTKASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNG 64
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
G I+DSG+TFT ++ + F Q+G + RA++VE ++G+R C+++SG V LP+
Sbjct: 65 GTIIDSGTTFTIFNEEFYKNITAAFASQIG-FRRASEVEARTGMRLCYNVSGVDHVLLPD 123
Query: 408 LILKFKGGAKMALPPENYFALVGNEVLCLIL 438
FKGG+ M LP NYF+ ++ +CL +
Sbjct: 124 FAFHFKGGSDMVLPVANYFSYFVSDSICLTM 154
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 121/404 (29%), Positives = 169/404 (41%), Gaps = 73/404 (18%)
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
+++L+ G PPQ + + DTGS L W C P+ F P SS+
Sbjct: 66 TVTLAVGDPPQ-NISMVLDTGSELSWLHCKK------------SPNLGSVFNPVSSSTYS 112
Query: 165 LIGCQNPKCSWIFGPNVESRCK------GCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
+ C +P C +R + C P+ C +A SY + G L E
Sbjct: 113 PVPCSSPIC--------RTRTRDLPIPASCDPKTHLCHVAI-SYADATSI---EGNLAHE 160
Query: 219 TLRFPSKTVPNFLAGC--SILS-----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
T S T P L GC S LS D + G+ G R S S +QLG KFSYC+
Sbjct: 161 TFVIGSVTRPGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI--- 217
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF----YYVGLRQIIV 327
+ S++ L G S P + YTP V S+ F Y V L I V
Sbjct: 218 ----SGSDSSVFLLLGDASYSWLGP-IQYTPL----VLQSTPLPYFDRVAYTVQLEGIRV 268
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
GSK + +P S VP G G +VDSG+ FTF+ GP++ A+ EFI Q + R D
Sbjct: 269 GSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPD 328
Query: 388 ---KSGLRPCFDISGKKS---VYLPELILKFKGGAKMALPPENYFALVG-------NEVL 434
+ + C+ + LP + L F+ GA+M++ + V EV
Sbjct: 329 FVFQGTMDLCYKVGSTTRPNFSGLPMVSLMFR-GAEMSVSGQKLLYRVNGAGSEGKEEVY 387
Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
C + G A ++G QN ++EFDLA R GFA
Sbjct: 388 CFTFGNSDLLGIE-----AFVIGHHHQQNVWMEFDLAKSRVGFA 426
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 111/374 (29%), Positives = 172/374 (45%), Gaps = 49/374 (13%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
I DT S L W C C D P DPS P++ + C + C +
Sbjct: 126 VIVDTASELTWVQCEPCDACHDQQEPLFDPSSSPSYAA--------VPCNSSSCDAL--- 174
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILS 238
V + G + ++ P AC SY L Y G ++ G+L + L + + F+ GC S
Sbjct: 175 RVATGMSGQACDDQ--PAAC-SYTLSYRDGSYSRGVLAHDRLSLAGEDIQGFVFGCGT-S 230
Query: 239 DRQP----AGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
++ P +G+ G GRS SL SQ + FSYCL + ++ S +LVL
Sbjct: 231 NQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPPK---ESGSSGSLVLGDDASVY 287
Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP-YSYLVPGSDGNGGVI 350
+ TP + YT +P+ G FY L I VG + V+ P +S + G G I
Sbjct: 288 RNSTP-IVYTAMVSDPLQ-----GPFYLANLTGITVGGEDVQSPGFS-----AGGGGKAI 336
Query: 351 VDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELIL 410
VDSG+ T + ++ AV EF+ Q+ Y +AA S L CFD++G + V +P L L
Sbjct: 337 VDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPF---SILDTCFDLTGLREVQVPSLKL 393
Query: 411 KFKGGAKMALPPENYFALVGNEV--LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
F GGA++ + + +V + +CL L + + I+G++Q +N + F
Sbjct: 394 VFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKS------EYDTPIIGNYQQKNLRVIF 447
Query: 469 DLANDRFGFAKQKC 482
D + GFA++ C
Sbjct: 448 DTVGSQIGFAQETC 461
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 122/435 (28%), Positives = 178/435 (40%), Gaps = 78/435 (17%)
Query: 70 LKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
LKT+ P++ + ++ +L +++L+ G+PPQ + + DTGS L
Sbjct: 40 LKTQKLPRSSSDKLSFRHNVTL-------------TVTLAVGSPPQ-NISMVLDTGSELS 85
Query: 130 WFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCK--- 186
W C P+ F P SS+ + C +P C +R +
Sbjct: 86 WLHCKK------------SPNLGSVFNPVSSSTYSPVPCSSPIC--------RTRTRDLP 125
Query: 187 ---GCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGC--SILS--- 238
C P+ C +A SY + G L +T S T P L GC S LS
Sbjct: 126 IPASCDPKTHFCHVAI-SYADATSI---EGNLAHDTFVIGSVTRPGTLFGCMDSGLSSDS 181
Query: 239 --DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP 296
D + G+ G R S S +QLG KFSYC+ + S+ +L G S P
Sbjct: 182 EEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI-------SGSDSSGILLLGDASYSWLGP 234
Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
+ YTP Y V L I VGSK + +P S VP G G +VDSG+
Sbjct: 235 -IQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQ 293
Query: 357 FTFMEGPLFEAVAKEFIRQMGNYSRAADVEK---KSGLRPCFDISGKKS---VYLPELIL 410
FTF+ GP++ A+ EFI Q + R D + + C+ + LP + L
Sbjct: 294 FTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMDLCYRVGSSTRPNFTGLPVISL 353
Query: 411 KFKGGAKMALPPENYFALVG-------NEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
F+ GA+M++ + V EV C + G A ++G QN
Sbjct: 354 MFR-GAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIE-----AFVIGHHHQQN 407
Query: 464 FYLEFDLANDRFGFA 478
++EFDLA R GFA
Sbjct: 408 VWMEFDLAKSRVGFA 422
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 126/396 (31%), Positives = 181/396 (45%), Gaps = 40/396 (10%)
Query: 97 SVH-SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
SVH S Y + + GTPP A + + DTGS L+W C + C C P P +
Sbjct: 92 SVHASTATYLVDFAIGTPPLALSA-VLDTGSDLIWTQCDAP--CRRCF-----PQPAPLY 143
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGL 214
P RS + + C + C + SRC + C +Y YG G T G+
Sbjct: 144 APARSVTYANVSCGSRLCDALPSLRPSSRCSASASAPAPERGGC-TYYYSYGDGSSTDGV 202
Query: 215 LLSETLRFPSKTVPNFLA-GC---SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
L +ET F + T + LA GC ++ +G+ G GR SL SQLG+ KFSYC
Sbjct: 203 LATETFTFGAGTTVHDLAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQLGVTKFSYCFT- 261
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
F+D SS L L GS S +P TPF +P G + +YY+ L I VG
Sbjct: 262 -PFNDTTTSSPLFL----GSSASLSPAAKSTPFVPSPSGPRRS--SYYYLSLEGITVGDT 314
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAADVEKKS 389
+ I + + G GG+I+DSG+TFT +E F +A+ ++ + A +
Sbjct: 315 LLPIDPAVFRLTASGRGGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHL---- 370
Query: 390 GLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
GL CF G ++V +P L+L F GA M LP + A+V + V + G
Sbjct: 371 GLSVCFAAPQGRGPEAVDVPRLVLHFD-GADMELPRSS--AVVEDRVAGVACL-----GI 422
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
RG + +LG Q QN ++ +D+ D F C
Sbjct: 423 VSARGMS-VLGSMQQQNMHVRYDVGRDVLSFEPANC 457
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 165/391 (42%), Gaps = 40/391 (10%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS LVW C+ RC R F P+RSS
Sbjct: 84 GEYFALVGVGTPSTKAM-LVIDTGSDLVWLQCSPCRRCY--------AQRGQVFDPRRSS 134
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETL 220
+ + + C +P+C + P C C Y++ YG G ++ G L ++ L
Sbjct: 135 TYRRVPCSSPQCRALRFPG----CDSGGAAGGGC-----RYMVAYGDGSSSTGELATDKL 185
Query: 221 RFPSKT-VPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLG---LKKFSYCLLSRKF 273
F + T V N GC ++ AG+ G R S+ +Q+ F YCL R
Sbjct: 186 AFANDTYVNNVTLGCGRDNEGLFDSAAGLLGVARGKISISTQVAPAYGSVFEYCLGDRT- 244
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
+ SS LV P + P ++T NP S YYV + VG + V
Sbjct: 245 SRSTRSSYLVFGRTP-----EPPSTAFTALLSNPRRPS-----LYYVDMAGFSVGGERVT 294
Query: 334 --IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
S + + G GGV+VDSG+ + + A+ F + + S
Sbjct: 295 GFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVF 354
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C+D+ G+ + P ++L F GGA MALPPENYF V + A G
Sbjct: 355 DACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDG 414
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++I G+ Q Q F + FD+ +R GFA + C
Sbjct: 415 LSVI-GNVQQQGFRVVFDVEKERIGFAPKGC 444
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 129/434 (29%), Positives = 189/434 (43%), Gaps = 58/434 (13%)
Query: 59 LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQAST 118
LA+ +R +L+ + P T + +GS + + + G Y + + G+PP
Sbjct: 94 LAARDGARVEYLQRRLSPTTMTTEVGSEVVSGISE------GSGEYFVRVGVGSPPTEQY 147
Query: 119 PFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFG 178
+ D+GS ++W C C +C + DP F P S+S + C + C + G
Sbjct: 148 -LVVDSGSDVIWIQCRP---CAEC-YQQADP----LFDPAASASFTAVPCDSGVCRTLPG 198
Query: 179 PNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGCSI 236
+ GC+ AC Y + YG G +T G+L ETL F T V GC
Sbjct: 199 GS-----SGCADSG-----AC-RYQVSYGDGSYTQGVLAMETLTFGDSTPVQGVAIGCGH 247
Query: 237 LSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGS 290
+ AG+ G G SL QLG FSYCL SR D + +LV G
Sbjct: 248 RNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGADAG--AGSLVF----GR 301
Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVI 350
D+ G + P +N S FYYVGL + VG + + + DG GGV+
Sbjct: 302 DDAMPVGAVWVPLLRNAQQPS-----FYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVV 356
Query: 351 VDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPCFDISGKKSVYLPELI 409
+D+G+ T + + A+ F + G+ RA V S L C+D+SG SV +P +
Sbjct: 357 MDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGV---SLLDTCYDLSGYASVRVPTVA 413
Query: 410 LKF-KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
L F + GA + LP N +G V CL F +A+G + ILG+ Q Q +
Sbjct: 414 LYFGRDGAALTLPARNLLVEMGGGVYCLA-FAASASGLS-------ILGNIQQQGIQITV 465
Query: 469 DLANDRFGFAKQKC 482
D AN GF C
Sbjct: 466 DSANGYVGFGPSTC 479
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 116/388 (29%), Positives = 161/388 (41%), Gaps = 49/388 (12%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G+PP + D+GS ++W C +C + DP F P SS
Sbjct: 128 GEYFVRVGVGSPP-TDQYLVVDSGSDVIWVQCRPCEQC----YAQTDP----LFDPAASS 178
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C + C + G C Y + YG G +T G L ETL
Sbjct: 179 SFSGVSCGSAICR-----TLSGTGCGGGGDAGKC-----DYSVTYGDGSYTKGELALETL 228
Query: 221 RFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFD 274
V GC + AG+ G G + SL QLG FSYCL SR
Sbjct: 229 TLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSYCLASRGAG 288
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
A +LVL G ++ G + P +N SS FYYVGL I VG + + +
Sbjct: 289 GA---GSLVL----GRTEAVPVGAVWVPLVRNNQASS-----FYYVGLTGIGVGGERLPL 336
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
DG GGV++D+G+ T + + A+ F MG R+ V S L C
Sbjct: 337 QDGLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAV---SLLDTC 393
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
+D+SG SV +P + F GA + LP N VG V CL F +++G +
Sbjct: 394 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFCLA-FAPSSSGIS------- 445
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
ILG+ Q + + D AN GF C
Sbjct: 446 ILGNIQQEGIQITVDSANGYVGFGPNTC 473
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 112/387 (28%), Positives = 163/387 (42%), Gaps = 56/387 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
+ + GTP Q + DT + W PC+ C+ C V F +SSS
Sbjct: 103 FVVRAKIGTPAQ-TLLLALDTSNDAAWIPCSG---CIGCPSTTV-------FSSDKSSSF 151
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ + CQ+P+C+ + P+ C G AC + L YG A L+ + L
Sbjct: 152 RPLPCQSPQCNQVPNPS----CSGS---------AC-GFNLTYGSSTVAADLVQDNLTLA 197
Query: 224 SKTVPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
+ +VP++ GC S + + G+ S L FSYCL S F
Sbjct: 198 TDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPS--FKSVN 255
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S +L L GP + + + YTP +NP SS YYV L I VG K V IP S
Sbjct: 256 FSGSLRL--GPVAQPIR---IKYTPLLRNPRRSS-----LYYVNLISIRVGRKIVDIPPS 305
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
L S G ++DSG+TFT + P + AV EF R++G R V G C+ +
Sbjct: 306 ALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVG---RNVTVSSLGGFDTCYTV 362
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
+ P + F G + LPP+N+ CL + AA P ++
Sbjct: 363 ----PIISPTITFMF-AGMNVTLPPDNFLIHSTAGSTTCLAM----AAAPDNVNSVLNVI 413
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
Q QN + FD+ N R G A++ C+
Sbjct: 414 ASMQQQNHRILFDIPNSRVGVARESCS 440
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 117/394 (29%), Positives = 170/394 (43%), Gaps = 55/394 (13%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSR---YRCVDCNFPNVDPSRIPAFIPKRS 160
Y S G+PPQ + I DTGS L+W C + C P + S+ F+P
Sbjct: 86 YIASYLIGSPPQRTEALI-DTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQSSTFVP--- 141
Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETL 220
+ C + + N C + +C +++ YG G G L +E+
Sbjct: 142 -----VPCADK--AGFCAANGVHLCG----LDGSC-----TFIASYGAGRVIGSLGTESF 185
Query: 221 RFPSKTVPNFLAGCSILSD------RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
F S T + GC L+ +G+ G GR SL SQ+G +FSYC L+ F
Sbjct: 186 AFESGTT-SLAFGCVSLTRITSGALNDASGLIGLGRGRLSLVSQIGATRFSYC-LTPYFH 243
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-- 332
+ SS+L + S G + PF K+P + FYY+ L I VG +
Sbjct: 244 SSGASSHLFVGASA----SLGGGGASMPFVKSP--KDYPYSTFYYLPLEGITVGKTRLPA 297
Query: 333 ----KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
L G GGVI+D+GS T + +EA+ +E Q+GN S E
Sbjct: 298 VNSTTFQLRQLFKGY-WAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPE-D 355
Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
SGL C G + V +P L+ F GGA MA+P +Y+A V C+++ L
Sbjct: 356 SGLELCVAREGFQKV-VPALVFHFGGGADMAVPAASYWAPVDKAAACMMI---------L 405
Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G I+G+FQ Q+ +L +DL RF F C
Sbjct: 406 EGGYDSIIGNFQQQDMHLLYDLRRGRFSFQTADC 439
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 132/427 (30%), Positives = 178/427 (41%), Gaps = 78/427 (18%)
Query: 86 NYSNSLIKTP---LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDC 142
N LI P LS H ++SL+ G+PPQ T + DTGS L W C
Sbjct: 979 NTQMGLISQPSNKLSFHHNVTLTVSLTVGSPPQQVT-MVLDTGSELSWLHCKK------- 1030
Query: 143 NFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKG------CSPRNKTCP 196
P+ F P SSS I C +P C +R + C P+ K C
Sbjct: 1031 -----SPNLTSVFNPLSSSSYSPIPCSSPIC--------RTRTRDLPNPVTCDPK-KLCH 1076
Query: 197 LACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFG 249
A SY L G L S+ R S +P L GC + D + G+ G
Sbjct: 1077 -AIVSYADASSL---EGNLASDNFRIGSSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMN 1132
Query: 250 RSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVG 309
R S S +QLGL KFSYC+ R + +L L S L+YTP V
Sbjct: 1133 RGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDLHL--------SWLGNLTYTPL----VQ 1180
Query: 310 SSSAFGEF----YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF 365
S+ F Y V L I VG+K + +P S P G G +VDSG+ FTF+ GP++
Sbjct: 1181 ISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVY 1240
Query: 366 EAVAKEFIRQM-GNYSRAAD--VEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALP 421
A+ EF+ Q G + D + + C+ + +G K LP + L F+ GA+M +
Sbjct: 1241 TALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAAGGKLPTLPSVSLMFR-GAEMVVG 1299
Query: 422 PENYFALV-----GNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRF 475
E V GNE V CL + G A ++G QN ++EFDL
Sbjct: 1300 GEVLLYRVPEMMKGNEWVYCLTFGNSDLLGIE-----AFVIGHHHQQNVWMEFDLV---- 1350
Query: 476 GFAKQKC 482
FA C
Sbjct: 1351 AFAADLC 1357
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 119/412 (28%), Positives = 172/412 (41%), Gaps = 82/412 (19%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + L+ GTPP+ DTGS LVW C C P +DP+ SS+
Sbjct: 92 YLVHLAVGTPPR-PVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAA--------SSTY 142
Query: 164 QLIGCQNPKCSWI-----FGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLS 217
+ C P+C + G S G N++C +Y+ YG T G + +
Sbjct: 143 AALPCGAPRCRALPFTSCGGGGRSSWGNG----NRSC-----AYIYHYGDKSVTVGEIAT 193
Query: 218 ETL-----------RFPSKTVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK 262
+ R P++ GC + GIAGFGR SLPSQL +
Sbjct: 194 DRFTFGGDNGDGDSRLPTR---RLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNVT 250
Query: 263 KFSYCLLSR--------KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
FSYC S AP ++ L SG+ +T TP KNP S
Sbjct: 251 TFSYCFTSMFESKSSLVTLGGAPAAALLYSHAAHISGEVRT-----TPLLKNPSQPS--- 302
Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
Y++ L+ I VG + +P + L I+DSG++ T + ++EAV EF
Sbjct: 303 --LYFLSLKGISVGKTRLAVPEAKL-------RSTIIDSGASITTLPEAVYEAVKAEFAA 353
Query: 375 QMGNYSRAADVEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENY-FALVG 430
Q+G V + S L CF + + + +P L L GA LP NY F +
Sbjct: 354 QVG--LPPTGVVEGSALDLCFALPVTALWRRPPVPSLTLHLD-GADWELPRGNYVFEDLA 410
Query: 431 NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
V+C++L A P G ++G+FQ QN ++ +DL ND FA +C
Sbjct: 411 ARVMCVVL----DAAP----GDQTVIGNFQQQNTHVVYDLENDWLSFAPARC 454
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 117/401 (29%), Positives = 168/401 (41%), Gaps = 64/401 (15%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
+SL+ GTPPQ + + DTGS L W C ++P F P RS+S Q
Sbjct: 33 VSLTVGTPPQ-NVSMVIDTGSELSWLHCNKTL-----SYPT-------TFDPTRSTSYQT 79
Query: 166 IGCQNPKCSWIFGPNVESRCK------GCSPRNKTCPLACPSYLLQYGLGFTAGLLLSET 219
I C +P C+ +R + C N C + L + G L S+
Sbjct: 80 IPCSSPTCT--------NRTQDFPIPASCDSNN-----LCHATLSYADASSSDGNLASDV 126
Query: 220 LRFPSKTVPNFLAGC--SILS-----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRK 272
S + + GC S+ S D + G+ G R S S SQLG KFSYC+
Sbjct: 127 FHIGSSDISGLVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGFPKFSYCISGTD 186
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
F S L+L G + P L+YTP + Y V L I V K +
Sbjct: 187 F------SGLLL-LGESNLTWSVP-LNYTPLIQISTPLPYFDRVAYTVQLEGIKVLDKLL 238
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA---DVEKKS 389
IP S P G G +VDSG+ FTF+ GP++ A+ F+ Q + R D +
Sbjct: 239 PIPKSTFEPDHTGAGQTMVDSGTQFTFLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQG 298
Query: 390 GLRPCFDISGKKSV--YLPELILKFKGGAKMALPPENYF-----ALVGNE-VLCLILFTD 441
+ C+ + + V LP + L F+ GA+M + + L GN+ V CL
Sbjct: 299 AMDLCYLVPLSQRVLPLLPTVTLVFR-GAEMTVSGDRVLYRVPGELRGNDSVHCLSFGNS 357
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ G A ++G QN ++EFDL R G A+ +C
Sbjct: 358 DLLGVE-----AYVIGHHHQQNVWMEFDLEKSRIGLAQVRC 393
>gi|383130042|gb|AFG45741.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 60/152 (39%), Positives = 88/152 (57%), Gaps = 5/152 (3%)
Query: 291 GDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
GD P L+YTPF N SSS + FYY+ LR + +G K + +P S GNG
Sbjct: 5 GDKALPTEMSLNYTPFLINTKASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDSKGNG 64
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
G I+DSG+TFT ++ + F Q+G + RA++VE ++G+R C+++SG V LP+
Sbjct: 65 GTIIDSGTTFTIFNEEFYKNITAAFASQIG-FRRASEVEARTGMRLCYNVSGVDHVLLPD 123
Query: 408 LILKFKGGAKMALPPENYFA-LVGNEVLCLIL 438
FKGG+ M LP NYF+ V + +CL +
Sbjct: 124 FAFHFKGGSDMVLPVANYFSYFVSFDSICLTM 155
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 114/379 (30%), Positives = 163/379 (43%), Gaps = 64/379 (16%)
Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
DTGS L+W C C D P F K+S++ + + C++ +C+ + P
Sbjct: 1 MDTGSDLIWTQCAPCLLCAD--------QPTPYFDVKKSATYRALPCRSSRCASLSSP-- 50
Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKT-----VPNFLAGCS 235
S K C Y YG TAG+L +ET F + N GC
Sbjct: 51 -------SCFKKMC-----VYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCG 98
Query: 236 ILSDRQPA---GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP------VSSNLVLDT 286
L+ A G+ GFGR SL SQLG +FSYCL S P V +NL T
Sbjct: 99 SLNAGDLANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSY-LSATPSRLYFGVYANLS-ST 156
Query: 287 GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGN 346
SG + TPF NP A Y++ L+ I +G+K + I DG
Sbjct: 157 NTSSGSP----VQSTPFVINP-----ALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGT 207
Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI--SGKKSVY 404
GGVI+DSG++ T+++ +EAV + + + A + GL CF +V
Sbjct: 208 GGVIIDSGTSITWLQQDAYEAVRRGLVSAI---PLPAMNDTDIGLDTCFQWPPPPNVTVT 264
Query: 405 LPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
+P+L+ F A M L PENY + LCL++ P G I+G++Q QN
Sbjct: 265 VPDLVFHFD-SANMTLLPENYMLIASTTGYLCLVM------APT---GVGTIIGNYQQQN 314
Query: 464 FYLEFDLANDRFGFAKQKC 482
+L +D+ N F C
Sbjct: 315 LHLLYDIGNSFLSFVPAPC 333
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 118 bits (296), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 113/394 (28%), Positives = 171/394 (43%), Gaps = 55/394 (13%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIP---AFIPKRSSS 162
+SL GTPPQ I DTGS L W C + P + P F P SSS
Sbjct: 84 VSLPIGTPPQTQQ-MILDTGSQLSWIQCHKKV-----------PRKPPPSSVFDPSLSSS 131
Query: 163 SQLIGCQNPKCS-WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
++ C +P C I + + C +N+ C SY G G L+ E +
Sbjct: 132 FSVLPCNHPLCKPRIPDFTLPTSCD----QNRLCHY---SYFYADGT-LAEGNLVREKIT 183
Query: 222 FP-SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA--PV 278
F S++ P + GC+ S GI G S SQ L KFSYC+ +R+ P
Sbjct: 184 FSRSQSTPPLILGCAEESS-DAKGILGMNLGRLSFASQAKLTKFSYCVPTRQVRPGFTPT 242
Query: 279 SSNLVLDTGPGSGDSKTPGL-SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S L P SG + L +++ + P A Y V ++ I +G++ + IP S
Sbjct: 243 GS-FYLGENPNSGGFRYINLLTFSQSQRMPNLDPLA----YTVAMQGIRIGNQKLNIPIS 297
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-------YSRAADVEKKSG 390
P G G ++DSGS FT++ + V +E +R +G Y +D+
Sbjct: 298 AFRPDPSGAGQTMIDSGSEFTYLVDEAYNKVREEVVRLVGARLKKGYVYGGVSDM----- 352
Query: 391 LRPCFDISG-KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
CF+ + + + ++ +F G ++ + E A VG V C+ + G A
Sbjct: 353 ---CFNGNAIEIGRLIGNMVFEFDKGVEIVVEKERVLADVGGGVHCVGIGRSEMLGAA-- 407
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ I+G+F QN ++EFDLAN R GF K C+
Sbjct: 408 ---SNIIGNFHQQNIWVEFDLANRRVGFGKADCS 438
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 115/399 (28%), Positives = 170/399 (42%), Gaps = 62/399 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDC-NFPNVDPSRIPAFIPKRSSS 162
Y + L+ GTPPQ + + DTGS L+W C C C + P+ P F P +S+S
Sbjct: 96 YVVDLAIGTPPQPVSALL-DTGSDLIWTQCAP---CASCLSQPD------PLFAPGQSAS 145
Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLR 221
+ + C CS I + E R TC +Y YG G T G+ +E
Sbjct: 146 YEPMRCAGTLCSDILHHSCE--------RPDTC-----TYRYNYGDGTMTVGVYATERFT 192
Query: 222 FPSKTVPNFLA-------GC---SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
F S GC ++ S +GI GFGR+ SL SQL +++FSYCL S
Sbjct: 193 FASSGGGGLTTTTVPLGFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTS- 251
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
+ S+ L G T + TP ++P + FYYV + VG++
Sbjct: 252 -YASRRQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQNPT-----FYYVHFTGLTVGARR 305
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
++IP S DG+GGVIVDSG+ T + + V + F RQ A + G+
Sbjct: 306 LRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAF-RQQLRLPFANGGNPEDGV 364
Query: 392 RPCFDI-------SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNA 443
CF + S + +P ++L F+ GA + LP NY LCL+L
Sbjct: 365 --CFLVPAAWRRSSSTSQMPVPRMVLHFQ-GADLDLPRRNYVLDDHRRGRLCLLLADSGD 421
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G +G+ Q+ + +DL + A +C
Sbjct: 422 DGST--------IGNLVQQDMRVLYDLEAETLSIAPARC 452
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 129/500 (25%), Positives = 205/500 (41%), Gaps = 85/500 (17%)
Query: 1 MAACPFSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDS----DPLKIL 56
M AC L LF L IL P+T + + +L H D ++L
Sbjct: 7 MKACSCMLPYLFFLAILF--------------AWPVTSATLRAHLSHVDDGRGFTKRELL 52
Query: 57 HSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQA 116
+ S +RA +L + + + +N+ + + Y I LS G P
Sbjct: 53 RRMVVRSRARAANLCPYSGATARPATAPVGRANTDVNSE--------YLIHLSIGAPRSQ 104
Query: 117 STPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
DTGS +VW C C +C +P F S++ + + C +P C
Sbjct: 105 PVVLTLDTGSDVVWTQCEP---CAECF-----TQPLPRFDTAASNTVRSVACSDPLC--- 153
Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSK------TVPN 229
N S C L +Y+ YG G + G L ++ F TVP+
Sbjct: 154 ---NAHS--------EHGCFLHGCTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVPD 202
Query: 230 FLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLD 285
GC + + + GIAGFGR SLPSQL +++FSYC +R +A S +
Sbjct: 203 IGFGCGMYNAGRFLQTETGIAGFGRGPLSLPSQLKVRQFSYCFTTRF--EAKSSPVFLGG 260
Query: 286 TGPGSGDSKTPGLSYTPFYKN-PVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSD 344
G + P LS TPF ++ P G+ ++ Y + + + VG + +P +D
Sbjct: 261 AGDLKAHATGPILS-TPFVRSLPPGTDNS---HYVLSFKGVTVGKTRLPVPEIK----AD 312
Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-NYSRAADVEKKSGLRPCFDISGKKSV 403
G+G +DSG+ T +F + FI Q ++ AD + CF GKK+
Sbjct: 313 GSGATFIDSGTDITTFPDAVFRQLKSAFIAQAALPVNKTADEDDI-----CFSWDGKKTA 367
Query: 404 YLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQ 462
+P+L+ + GA LP ENY +C+ + T G+ ++G+FQ Q
Sbjct: 368 AMPKLVFHLE-GADWDLPRENYVTEDRESGQVCVAVSTS-------GQMDRTLIGNFQQQ 419
Query: 463 NFYLEFDLANDRFGFAKQKC 482
N ++ +DLA + +C
Sbjct: 420 NTHIVYDLAAGKLLLVPAQC 439
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 127/434 (29%), Positives = 182/434 (41%), Gaps = 59/434 (13%)
Query: 66 RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
R R +++K K K + S+ + T ++ G Y + L GTP + S + DTG
Sbjct: 16 RVRWIESKAKLAGKKKDEASSTDLNGPVTSGLLYGSGEYFVRLGLGTPAR-SLFMVVDTG 74
Query: 126 SSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVE 182
S L W PC S Y+ D P F P+ SSS Q I C +P C + V
Sbjct: 75 SDLPWLQCQPCKSCYKQAD-----------PIFDPRNSSSFQRIPCLSPLCKAL---EVH 120
Query: 183 SRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLA-GCSILSDR 240
S C G C SY + YG G F+ G S+ + + +A GC ++
Sbjct: 121 S-CSGSRGATSRC-----SYQVAYGDGSFSVGDFSSDLFTLGTGSKAMSVAFGCGFDNEG 174
Query: 241 QPAGIAGFGRSSE---SLPSQL--------GLKKFSYCLLSRKFDDAPVSSNLVLDTGPG 289
AG AG S PSQ+ FSYCL+ R SS+L+
Sbjct: 175 LFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIF----- 229
Query: 290 SGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGG 348
G + P + +P KNP FYY + + VG + I L G+GG
Sbjct: 230 -GVAAIPSTAALSPLLKNP-----KLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGG 283
Query: 349 VIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPEL 408
VI+DSG++ T ++ + F N A + S C++ SGK SV +P L
Sbjct: 284 VIIDSGTSVTRFPTSVYATIRDAFRNATINLPSA---PRYSLFDTCYNFSGKASVDVPAL 340
Query: 409 ILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
+L F+ GA + LPP NY + + F + LG I+G+ Q Q+F + F
Sbjct: 341 VLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSM--ELG-----IIGNIQQQSFRIGF 393
Query: 469 DLANDRFGFAKQKC 482
DL FA Q+C
Sbjct: 394 DLQKSHLAFAPQQC 407
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 120/417 (28%), Positives = 177/417 (42%), Gaps = 83/417 (19%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR----IPAFIP 157
G Y + L GTP + P I DTGS L W + CN PN + P +
Sbjct: 25 GQYFVELRVGTPAK-KFPLIIDTGSDLTW---------IQCNPPNTTANSSSPPAPWYDK 74
Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-----TA 212
SSS + I C + +C +F P +P +C + PS Y G+ T
Sbjct: 75 SSSSSYREIPCTDDEC--LFLP---------APIGSSCSIKSPS-PCDYTYGYSDQSRTT 122
Query: 213 GLLLSETLRFPSKT---------------VPNFLAGCSILSDRQ----PAGIAGFGRSSE 253
G+L ET+ S+ + N GCS S +G+ G G+
Sbjct: 123 GILAYETISMKSRKRSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPI 182
Query: 254 SLPSQLGLKK----FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVG 309
SL +Q FSYCL+ + SS LV+ G ++ L++TP +NP
Sbjct: 183 SLATQTRHTALGGIFSYCLVDY-LRGSNASSFLVM------GRTRWRKLAHTPIVRNPAA 235
Query: 310 SSSAFGEFYYVGLRQIIVGSKHVK-IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
S FYYV + + V K V I S DGN G I DSG+T +++ P + V
Sbjct: 236 QS-----FYYVNVTGVAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKV 290
Query: 369 AKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL 428
+ RA ++ + G C++++ + +P+L ++F+GGA M LP NY L
Sbjct: 291 LGALNASI-YLPRAQEIPE--GFELCYNVT-RMEKGMPKLGVEFQGGAVMELPWNNYMVL 346
Query: 429 VGNEVLCLIL---FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
V V C+ L T N + ILG+ Q+ ++E+DLA R GF C
Sbjct: 347 VAENVQCVALQKVTTTNGSN---------ILGNLLQQDHHIEYDLAKARIGFKWSPC 394
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 138/445 (31%), Positives = 195/445 (43%), Gaps = 58/445 (13%)
Query: 47 HSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSI 106
DS +K + SLA+ S R TK P+T +S ++I + LS S G Y +
Sbjct: 88 QRDSLRVKSITSLAAVSTGRN---ATKRTPRT-----AGGFSGAVI-SGLSQGS-GEYFM 137
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
L GTP + + DTGS +VW C+ C C + D F PK+S + +
Sbjct: 138 RLGVGTPA-TNVYMVLDTGSDVVWLQCSP---CKAC-YNQTDA----IFDPKKSKTFATV 188
Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSK 225
C + C + + S C + R+KTC Y + YG G FT G +ETL F
Sbjct: 189 PCGSRLCRRL---DDSSEC--VTRRSKTCL-----YQVSYGDGSFTEGDFSTETLTFHGA 238
Query: 226 TVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVS 279
V + GC ++ AG+ G GR S PSQ + KFSYCL+ R +
Sbjct: 239 RVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSK 298
Query: 280 SNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-KIPYS 337
+ G ++ P S +TP NP FYY+ L I VG V + S
Sbjct: 299 PPSTIVFG----NAAVPKTSVFTPLLTNP-----KLDTFYYLQLLGISVGGSRVPGVSES 349
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
+ GNGGVI+DSG++ T + P + A+ F RA S CFD+
Sbjct: 350 QFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRA---PSYSLFDTCFDL 406
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILG 457
SG +V +P ++ F GG +++LP NY V E F AG G I+G
Sbjct: 407 SGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFAF----AGTM---GSLSIIG 458
Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
+ Q Q F + +DL R GF + C
Sbjct: 459 NIQQQGFRVAYDLVGSRVGFLSRAC 483
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 122/404 (30%), Positives = 168/404 (41%), Gaps = 73/404 (18%)
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
+++L+ G PPQ + + DTGS L W C P+ F P SS+
Sbjct: 66 TVTLAVGDPPQ-NISMVLDTGSELSWLHCKK------------SPNLGSVFNPVSSSTYS 112
Query: 165 LIGCQNPKCSWIFGPNVESRCK------GCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
+ C +P C +R + C P+ C +A SY + G L E
Sbjct: 113 PVPCSSPIC--------RTRTRDLPIPASCDPKTHLCHVAI-SYADATSI---EGNLAHE 160
Query: 219 TLRFPSKTVPNFLAGC--SILS-----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
T S T P L GC S LS D + G+ G R S S +QLG KFSYC+
Sbjct: 161 TFVIGSVTRPGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI--- 217
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF----YYVGLRQIIV 327
SS +L G S P + YTP V S+ F Y V L I V
Sbjct: 218 ---SGSDSSGFLL-LGDASYSWLGP-IQYTPL----VLQSTPLPYFDRVAYTVQLEGIRV 268
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
GSK + +P S VP G G +VDSG+ FTF+ GP++ A+ EFI Q + R D
Sbjct: 269 GSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPD 328
Query: 388 ---KSGLRPCFDISGKKS---VYLPELILKFKGGAKMALPPENYFALVG-------NEVL 434
+ + C+ + LP + L F+ GA+M++ + V EV
Sbjct: 329 FVFQGTMDLCYKVGSTTRPNFSGLPMVSLMFR-GAEMSVSGQKLLYRVNGAGSEGKEEVY 387
Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
C + G A ++G QN ++EFDLA R GFA
Sbjct: 388 CFTFGNSDLLGIE-----AFVIGHHHQQNVWMEFDLAKSRVGFA 426
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 126/448 (28%), Positives = 181/448 (40%), Gaps = 60/448 (13%)
Query: 45 LHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGY 104
+ S D H+ R L + P+ S+ + + + ++ S G Y
Sbjct: 84 FNKSSYDHSHNFHARIQRDKKRVATLIRRLSPRDATSSYSVEEFGAEVVSGMNQGS-GEY 142
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
I + G+PP+ + D+GS +VW PCT Y D P F P S+
Sbjct: 143 FIRIGVGSPPREQY-VVIDSGSDIVWVQCQPCTQCYHQTD-----------PVFDPADSA 190
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C + C I N C Y + YG G +T G L ETL
Sbjct: 191 SFMGVPCSSSVCERI--------------ENAGCHAGGCRYEVMYGDGSYTKGTLALETL 236
Query: 221 RFPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFD 274
F V N GC + AG+ G G S SL QLG + FSYCL+SR D
Sbjct: 237 TFGRTVVRNVAIGCGHRNRGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTD 296
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
A L+ G G+ G ++ P +NP S FYY+ L + VG V I
Sbjct: 297 SAGS-----LEFGRGA---MPVGAAWIPLIRNPRAPS-----FYYIRLSGVGVGGMKVPI 343
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
GNGGV++D+G+ T + + A FI Q GN RA+ V S C
Sbjct: 344 SEDVFQLNEMGNGGVVMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGV---SIFDTC 400
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
++++G SV +P + F GG + LP N+ V + F + +G +
Sbjct: 401 YNLNGFVSVRVPTVSFYFAGGPILTLPARNFLIPVDDVGTFCFAFAASPSGLS------- 453
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + + FD AN GF C
Sbjct: 454 IIGNIQQEGIQISFDGANGFVGFGPNVC 481
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 118/387 (30%), Positives = 168/387 (43%), Gaps = 56/387 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
+ + GTP Q + DT + W PC+ C+ C V F +SSS
Sbjct: 26 FVVRAKIGTPAQ-TLLLALDTSNDAAWIPCSG---CIGCPSTTV-------FSSDKSSSF 74
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ + CQ+P+C+ + P+ C G AC + L YG A L+ + L
Sbjct: 75 RPLPCQSPQCNQVPNPS----CSGS---------AC-GFNLTYGSSTVAADLVQDNLTLA 120
Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
+ +VP++ GC + S P G+ G GR SL Q L FSYCL S F
Sbjct: 121 TDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPS--FKSVN 178
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S +L L GP + + + YTP +NP SS YYV L I VG K V IP S
Sbjct: 179 FSGSLRL--GPVAQPIR---IKYTPLLRNPRRSS-----LYYVNLISIRVGRKIVDIPPS 228
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
L S G ++DSG+TFT + P + AV EF R++G R V G C+ +
Sbjct: 229 ALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVG---RNVTVSSLGGFDTCYTV 285
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
+ P + F G + LPP+N+ CL + AA P ++
Sbjct: 286 ----PIISPTITFMF-AGMNVTLPPDNFLIHSTSGSTTCLAM----AAAPDNVNSVLNVI 336
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
Q QN + FD+ N R G A++ C+
Sbjct: 337 ASMQQQNHRILFDIPNSRVGVARESCS 363
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 118 bits (295), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 122/415 (29%), Positives = 171/415 (41%), Gaps = 72/415 (17%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
L H ++SL+ G+PPQ T + DTGS L W C F N F
Sbjct: 61 LLFHHNVSLTVSLTVGSPPQNVT-MVLDTGSELSWL------HCKKTQFLN------SVF 107
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA--- 212
P S + + C +P C R+ T P++C + L + + A
Sbjct: 108 NPLSSKTYSKVPCLSPTCK-------------TRTRDLTIPVSCDATKLCHVIVSYADAT 154
Query: 213 ---GLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLK 262
G L ET R S T P + GC + D + G+ G R S S +Q+G
Sbjct: 155 SIEGNLAFETFRLGSLTKPATIFGCMDSGFSSNSEEDSKTTGLIGMNRGSLSFVNQMGYP 214
Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF----Y 318
KFSYC+ FD A V L G S P LSYTP V S+ F Y
Sbjct: 215 KFSYCI--SGFDSAGV-----LLLGNASFPWLKP-LSYTPL----VQISTPLPYFDRVAY 262
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
V L I V +K + +P S VP G G +VDSG+ FTF+ GP++ A+ EF+ Q
Sbjct: 263 TVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLSQTRG 322
Query: 379 YSRAADVEK---KSGLRPCF--DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE- 432
+ + + + + C+ D S LP + L F+ GA+M++ E V E
Sbjct: 323 ILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNLPVVSLMFQ-GAEMSVSGERLLYRVPGEV 381
Query: 433 -----VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
V C + G A ++G QN ++EFDL R G A +C
Sbjct: 382 RGRDSVWCFTFGNSDLLGVE-----AFVIGHHHQQNVWMEFDLEKSRIGLADVRC 431
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 118 bits (295), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 130/458 (28%), Positives = 189/458 (41%), Gaps = 63/458 (13%)
Query: 42 KHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY 101
K LH + L+ L R R +++K + K + S+ + T ++
Sbjct: 71 KEKLHTHEQLLLETLQR----DEQRVRWIESKAQLAGKKKDEASSTDLNGPVTSGLLYGS 126
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPK 158
G Y + L GTP + S + DTGS L W PC S Y+ D P F P+
Sbjct: 127 GEYFVRLGVGTPAR-SLFMVVDTGSDLPWLQCQPCKSCYKQAD-----------PIFDPR 174
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLS 217
SSS Q I C +P C + + S C G C SY + YG G F+ G S
Sbjct: 175 NSSSFQRIPCLSPLCKAL---EIHS-CSGSRGATSRC-----SYQVAYGDGSFSVGDFSS 225
Query: 218 ETLRFPSKTVPNFLA-GCSILSDRQPAGIAGFGRSSE---SLPSQL--------GLKKFS 265
+ + + +A GC ++ AG AG S PSQ+ FS
Sbjct: 226 DLFTLGTGSKAMSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFS 285
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQ 324
YCL+ R SS+L+ G + P + +P KNP FYY +
Sbjct: 286 YCLVDRSNPMTRSSSSLIF------GAAAIPSTAALSPLLKNP-----KLDTFYYAAMIG 334
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
+ VG + I L G+GGVI+DSG++ T ++ + F N A
Sbjct: 335 VSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAP- 393
Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA 444
+ S C++ SGK SV +P L+L F+ GA + LPP NY + + F +
Sbjct: 394 --RYSLFDTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSM 451
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
LG I+G+ Q Q+F + FDL FA Q+C
Sbjct: 452 --ELG-----IIGNIQQQSFRIGFDLQKSHLAFAPQQC 482
>gi|361067845|gb|AEW08234.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130032|gb|AFG45736.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130034|gb|AFG45737.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130036|gb|AFG45738.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130046|gb|AFG45743.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130048|gb|AFG45744.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130050|gb|AFG45745.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130054|gb|AFG45747.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130056|gb|AFG45748.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 59/152 (38%), Positives = 88/152 (57%), Gaps = 5/152 (3%)
Query: 291 GDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
GD P L+YTPF N SSS + FYY+ LR + +G K + +P + GNG
Sbjct: 5 GDKALPTEMSLNYTPFLINTKASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNG 64
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
G I+DSG+TFT ++ + F Q+G + RA++VE ++G+R C+++SG V LP+
Sbjct: 65 GTIIDSGTTFTIFNEEFYKNITAAFASQIG-FRRASEVEARTGMRLCYNVSGVDHVLLPD 123
Query: 408 LILKFKGGAKMALPPENYFA-LVGNEVLCLIL 438
FKGG+ M LP NYF+ V + +CL +
Sbjct: 124 FAFHFKGGSDMVLPVANYFSYFVSFDSICLTM 155
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 111/389 (28%), Positives = 160/389 (41%), Gaps = 66/389 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G+PP+ + I DTGS L W C C DC F D P + S
Sbjct: 168 GEYFMDVLVGSPPKHFS-LILDTGSDLNWIQCLP---CYDC-FQQNDNQSCPYYYWYGDS 222
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
S N T A ++ + + L +
Sbjct: 223 S-----------------------------NTTGDFAVETFTVNLTTNGGSSELYN---- 249
Query: 222 FPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDD 275
V N + GC + AG+ G GR S SQL FSYCL+ R D
Sbjct: 250 -----VENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRN-SD 303
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
VSS L+ G P L++T F G + FYYV ++ I+V + + IP
Sbjct: 304 TNVSSKLIF--GEDKDLLSHPNLNFTSFV---AGKENLVDTFYYVQIKSILVAGEVLNIP 358
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPC 394
SDG GG I+DSG+T ++ P +E + + + G Y D L PC
Sbjct: 359 EETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPI---LDPC 415
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
F++SG +V LPEL + F GA P EN F + +++CL + +
Sbjct: 416 FNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAML-------GTPKSAFS 468
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G++Q QNF++ +D R G+A KCA
Sbjct: 469 IIGNYQQQNFHILYDTKRSRLGYAPTKCA 497
>gi|383130040|gb|AFG45740.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 59/152 (38%), Positives = 88/152 (57%), Gaps = 5/152 (3%)
Query: 291 GDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
GD P L+YTPF N SSS + FYY+ LR + +G K + +P + GNG
Sbjct: 5 GDKALPTAMSLNYTPFLINTKASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNG 64
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
G I+DSG+TFT ++ + F Q+G + RA++VE ++G+R C+++SG V LP+
Sbjct: 65 GTIIDSGTTFTIFNEEFYKNITAAFASQIG-FRRASEVEARTGMRLCYNVSGVDHVLLPD 123
Query: 408 LILKFKGGAKMALPPENYFA-LVGNEVLCLIL 438
FKGG+ M LP NYF+ V + +CL +
Sbjct: 124 FAFHFKGGSDMVLPVANYFSYFVSFDSICLTM 155
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 132/471 (28%), Positives = 197/471 (41%), Gaps = 98/471 (20%)
Query: 35 PLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK----TKTKPKTKDSNIGSNYSNS 90
PL+P ++S+ L+ +++ S+SR H PK +S++ SN
Sbjct: 42 PLSPF------YNSEETDLQRINNALRRSISRVHHFDPIAAASVSPKAAESDVTSNR--- 92
Query: 91 LIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
G Y +SLS GTPP I DTGS L+W C RC + VD
Sbjct: 93 -----------GEYLMSLSLGTPP-FKIMGIADTGSDLIWTQCKPCERC----YKQVD-- 134
Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LG 209
P F PK S + + C +CS + +S C G C Y YG
Sbjct: 135 --PLFDPKSSKTYRDFSCDARQCSLL----DQSTCSG-----NIC-----QYQYSYGDRS 178
Query: 210 FTAGLLLSETLRFPSKT-----VPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLG 260
+T G + S+T+ S T P + GC +D + +GI G G SL SQ+G
Sbjct: 179 YTMGNVASDTITLDSTTGSPVSFPKTVIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMG 238
Query: 261 LK---KFSYCLL---SRKFDDAPVS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
KFSYCL+ SR + + ++ SN V+ +GPG P+ SS
Sbjct: 239 SSVGGKFSYCLVPLSSRAGNSSKLNFGSNAVV-SGPG-------------VQSTPLLSSE 284
Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
FY++ L + VG++ +K S L G G +I+DSG+T T + F ++
Sbjct: 285 TMSSFYFLTLEAMSVGNERIKFGDSSL---GTGEGNIIIDSGTTLTIVPDDFFSNLSTA- 340
Query: 373 IRQMGNYSRAADVEKKSG-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
+GN E SG L C+ S + +P + F GA + L P N F V +
Sbjct: 341 ---VGNQVEGRRAEDPSGFLSVCY--SATSDLKVPAITAHFT-GADVKLKPINTFVQVSD 394
Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+V+CL F +G + I G+ NF +E+++ F C
Sbjct: 395 DVVCLA-FASTTSGIS-------IYGNVAQMNFLVEYNIQGKSLSFKPTDC 437
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 120/417 (28%), Positives = 176/417 (42%), Gaps = 83/417 (19%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR----IPAFIP 157
G Y + L GTP + P I DTGS L W + CN PN + P +
Sbjct: 57 GQYFVELRVGTPAK-KFPLIVDTGSDLTW---------IQCNPPNTTANSSSPPAPWYDK 106
Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-----TA 212
SSS + I C + +C ++ P + S C + T P C Y G+ T
Sbjct: 107 SSSSSYREIPCTDDECQFLPAP-IGSSC------SITSPSPC-----DYTYGYSDQSRTT 154
Query: 213 GLLLSETLRFPSKT---------------VPNFLAGCSILSDRQ----PAGIAGFGRSSE 253
G+L ET+ S+ + N GCS S +G+ G G+
Sbjct: 155 GILAYETISMKSRKRSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPI 214
Query: 254 SLPSQLGLKK----FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVG 309
SL +Q FSYCL+ + SS LV+ G + L++TP +NP
Sbjct: 215 SLATQTRHTALGGIFSYCLVD-YLRGSNASSFLVM------GRTHWRKLAHTPIVRNPAA 267
Query: 310 SSSAFGEFYYVGLRQIIVGSKHVK-IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
S FYYV + + V K V I S DGN G I DSG+T +++ P + V
Sbjct: 268 QS-----FYYVNVTGVAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKV 322
Query: 369 AKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL 428
+ RA ++ + G C++++ + +P+L ++F+GGA M LP NY L
Sbjct: 323 LGALNASI-YLPRAQEIPE--GFELCYNVT-RMEKGMPKLGVEFQGGAVMELPWNNYMVL 378
Query: 429 VGNEVLCLIL---FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
V V C+ L T N + ILG+ Q+ ++E+DLA R GF C
Sbjct: 379 VAENVQCVALQKVTTTNGSN---------ILGNLLQQDHHIEYDLAKARIGFKWSPC 426
>gi|383130052|gb|AFG45746.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 59/152 (38%), Positives = 88/152 (57%), Gaps = 5/152 (3%)
Query: 291 GDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
GD P L+YTPF N SSS + FYY+ LR + +G K + +P + GNG
Sbjct: 5 GDKALPTEMSLNYTPFLINTKASSSGYHTFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNG 64
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
G I+DSG+TFT ++ + F Q+G + RA++VE ++G+R C+++SG V LP+
Sbjct: 65 GTIIDSGTTFTIFNEEFYKNITAAFSSQIG-FRRASEVEARTGMRLCYNVSGVDHVLLPD 123
Query: 408 LILKFKGGAKMALPPENYFA-LVGNEVLCLIL 438
FKGG+ M LP NYF+ V + +CL +
Sbjct: 124 FAFHFKGGSDMVLPVANYFSYFVSFDSICLTM 155
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 120/400 (30%), Positives = 169/400 (42%), Gaps = 66/400 (16%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + L+ GTPPQ + + DTGS L+W C C C DP F P S+S
Sbjct: 102 YVVDLAIGTPPQPVSALL-DTGSDLIWTQCAP---CASC-LAQPDP----LFAPGESASY 152
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
+ + C CS I GC P C +Y YG G T G+ +E F
Sbjct: 153 EPMRCAGQLCSDILH-------HGCE-----MPDTC-TYRYNYGDGTMTMGVYATERFTF 199
Query: 223 PSKTVPNFLA-----GC---SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS---- 270
S + GC ++ S +GI GFGR+ SL SQL +++FSYCL S
Sbjct: 200 TSSGGDRLMTVPLGFGCGSMNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTSYGSG 259
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
RK S L+ GS G + P P+ S FYYV L + VG++
Sbjct: 260 RK-------STLLF----GSLSGGVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGAR 308
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
++IP S DG+GGVIVDSG+ T + G + V + F RQ A + G
Sbjct: 309 RLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAF-RQQLRLPFANGGNPEDG 367
Query: 391 LRPCFDI-------SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDN 442
+ CF + S V +P ++ F+ A + LP NY + LCL+L
Sbjct: 368 V--CFLVPAAWRRSSSTSQVPVPRMVFHFQ-DADLDLPRRNYVLDDHRKGRLCLLLADSG 424
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G +G+ Q+ + +DL + FA +C
Sbjct: 425 DDGST--------IGNLVQQDMRVLYDLEAETLSFAPAQC 456
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 113/428 (26%), Positives = 195/428 (45%), Gaps = 53/428 (12%)
Query: 66 RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
R R ++ + + N+ ++ + + + +++ + Y +++ G+ + I DTG
Sbjct: 28 RVRSMQNRIRRVASTHNVEASQTQIPLSSGINLQTLN-YIVTMGLGSK---NMTVIIDTG 83
Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC-SWIFGPNVESR 184
S L W C C + + P F P SSS Q + C + C S F
Sbjct: 84 SDLTWVQCEPCMSCYN--------QQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGA 135
Query: 185 CKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR--- 240
C +P TC +Y++ YG G +T G L E L F +V +F+ GC +
Sbjct: 136 CGSSNP--STC-----NYVVNYGDGSYTNGELGVEALSFGGVSVSDFVFGCGRNNKGLFG 188
Query: 241 QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG 297
+G+ G GRS SL SQ FSYCL + +A S +LV+ + P
Sbjct: 189 GVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTT---EAGSSGSLVMGNESSVFKNANP- 244
Query: 298 LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTF 357
++YT NP FY + L I VG +K P S+ GNGG+++DSG+
Sbjct: 245 ITYTRMLSNP-----QLSNFYILNLTGIDVGGVALKAPLSF------GNGGILIDSGTVI 293
Query: 358 TFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAK 417
T + +++A+ EF+++ + A S L CF+++G V +P + L+F+G A+
Sbjct: 294 TRLPSSVYKALKAEFLKKFTGFPSAPGF---SILDTCFNLTGYDEVSIPTISLRFEGNAQ 350
Query: 418 MALPPENYFALVGNEV--LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRF 475
+ + F +V + +CL L + + A I+G++Q +N + +D +
Sbjct: 351 LNVDATGTFYVVKEDASQVCLALASLSDA------YDTAIIGNYQQRNQRVIYDTKQSKV 404
Query: 476 GFAKQKCA 483
GFA++ C+
Sbjct: 405 GFAEEPCS 412
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 110/390 (28%), Positives = 161/390 (41%), Gaps = 77/390 (19%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + L+ GTPPQ DTGS L+W C C D +P F P SS+
Sbjct: 89 YLVHLAIGTPPQ-PVQLTLDTGSDLIWTQCQPCPACFD--------QALPYFDPSTSSTL 139
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF- 222
L C + + C+G A L S+ F
Sbjct: 140 SLTSCDS------------TLCQGLP---------------------VASLPRSDKFTFV 166
Query: 223 -PSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
+VP GC + ++ GIAGFGR SLPSQL + FS+C +
Sbjct: 167 GAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTIT---GA 223
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
+ S ++LD + + TP +NP + FYY+ L+ I VGS + +P S
Sbjct: 224 IPSTVLLDLPADLFSNGQGAVQTTPLIQNPANPT-----FYYLSLKGITVGSTRLPVPES 278
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
+G GG I+DSG+ T + ++ V F Q+ V + P F +
Sbjct: 279 EFAL-KNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQV-----KLPVVSGNTTDPYFCL 332
Query: 398 SG--KKSVYLPELILKFKGGAKMALPPENYFALV---GNEVLCLILFTDNAAGPALGRGP 452
S + Y+P+L+L F+ GA M LP ENY V G+ +LCL + G
Sbjct: 333 SAPLRAKPYVPKLVLHFE-GATMDLPRENYVFEVEDAGSSILCLAIIEG---------GE 382
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+G+FQ QN ++ +DL N + F +C
Sbjct: 383 VTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 412
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 112/393 (28%), Positives = 167/393 (42%), Gaps = 53/393 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +++ GTP + IFDTGS L W C CV + + P F P S
Sbjct: 152 GNYIVNVGLGTPKK-DLSLIFDTGSDLTWTQCQP---CVKSCYAQ----QQPIFDPSASK 203
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ I C + CS + + GCS N Y +QYG FT G +TL
Sbjct: 204 TYSNISCTSTACSGL--KSATGNSPGCSSSNCV-------YGIQYGDSSFTVGFFAKDTL 254
Query: 221 RFPSKTV-PNFLAGCSILSDR----QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRK 272
V F+ GC ++R + AG+ G GR S+ Q K FSYCL + +
Sbjct: 255 TLTQNDVFDGFMFGCG-QNNRGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSR 313
Query: 273 FDDAPVSSNLVLDTGPGSGDSKT--PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
+ +L G G SK G+++TPF +SS FY++ + I VG K
Sbjct: 314 GSNG----HLTFGNGNGVKTSKAVKNGITFTPF------ASSQGATFYFIDVLGISVGGK 363
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+ I P N G I+DSG+ T + ++ ++ F + M Y A + S
Sbjct: 364 ALSIS-----PMLFQNAGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPAL---SL 415
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
L C+D+S S+ +P++ F G A + L P G +CL F N +G
Sbjct: 416 LDTCYDLSNYTSISIPKISFNFNGNANVDLEPNGILITNGASQVCLA-FAGNGDDDTIG- 473
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+ Q Q + +D+A + GF + C+
Sbjct: 474 ----IFGNIQQQTLEVVYDVAGGQLGFGYKGCS 502
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 123/404 (30%), Positives = 171/404 (42%), Gaps = 69/404 (17%)
Query: 94 TPLSVHSYGGYSISLSFGTPPQASTPFIF--DTGSSLVWFPCT-SRYRCVDCNFPNVDPS 150
TP + G Y + GTP + P+I DTGSSL W C+ R C
Sbjct: 107 TPGTSVGVGNYVTRMGLGTPAK---PYIMVVDTGSSLTWLQCSPCRVSC--------HRQ 155
Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LG 209
P F PK SSS + C +P+C + + CSP N Y YG
Sbjct: 156 SGPVFDPKTSSSYAAVSCSSPQCDGLSTATLNPAV--CSPSNVCI------YQASYGDSS 207
Query: 210 FTAGLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---K 263
F+ G L +T+ F + +VPNF GC ++ + AG+ G R+ SL QL
Sbjct: 208 FSVGYLSKDTVSFGANSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYS 267
Query: 264 FSYCLLSRKFDDAPVSSNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
FSYCL S SS+ L G PG G SYTP N + S Y++
Sbjct: 268 FSYCLPS-------TSSSGYLSIGSYNPG-------GYSYTPMVSNTLDDS-----LYFI 308
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNY 379
L + V K + + S + I+DSG+ T + ++ A++K M G+
Sbjct: 309 SLSGMTVAGKPLAVSSSEYT-----SLPTIIDSGTVITRLPTSVYTALSKAVAAAMKGST 363
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
RAA S L CF+ K +P + + F GGA + L N V CL
Sbjct: 364 KRAA---AYSILDTCFEGQASKLRAVPAVSMAFSGGATLKLSAGNLLVDVDGATTCL--- 417
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A PA R AII G+ Q Q F + +D+ ++R GFA C+
Sbjct: 418 ---AFAPA--RSAAII-GNTQQQTFSVVYDVKSNRIGFAAAGCS 455
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 124/412 (30%), Positives = 172/412 (41%), Gaps = 72/412 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPK 158
G Y ++LS GTPP I DTGS L W PC Y P + P F P
Sbjct: 78 GEYMMNLSIGTPPFPILA-IADTGSDLTWLQSKPCDQCY-----------PQKGPIFDPS 125
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLS 217
S++ + C C+ + S R+ T P C Y YG +T G L S
Sbjct: 126 NSTTFHKLPCTTAPCNAL----------DESARSCTDPTTC-GYTYSYGDHSYTTGYLAS 174
Query: 218 ETLRFPSKTVP--NFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCL 268
+T+ + +V N GC + D Q +GI G G + S SQLG KKFSYCL
Sbjct: 175 DTVTVGNASVQIRNVAFGCGTRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCL 234
Query: 269 L------SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSY--TPFY-KNPVGSSSAFGEFYY 319
L S + D+P +S +V P S T G+ + TP K P +YY
Sbjct: 235 LPLENEISSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEP-------STYYY 287
Query: 320 VGLRQIIVGSKHV--------KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
+ + I VG K + Y S G +I+DSG+T TF+E + A+
Sbjct: 288 LTIEAITVGRKKLLYSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAA 347
Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
+ ++ R DV K S CF SGK+ V LP + + F+GGA + L P N F
Sbjct: 348 LVEEI-KMERVNDV-KNSMFSLCFK-SGKEEVELPLMKVHFRGGADVELKPVNTFVRAEE 404
Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
++C + N G I G+ NF + +DL F C+
Sbjct: 405 GLVCFTMLPTNDVG---------IYGNLAQMNFVVGYDLGKRTVSFLPADCS 447
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 124/435 (28%), Positives = 182/435 (41%), Gaps = 74/435 (17%)
Query: 71 KTKTKPK--TKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSL 128
K K KPK ++ + + +++TP Y GTPPQ + D +
Sbjct: 72 KPKPKPKGHSRHTFVPIAAGRQILRTP-------SYVARARLGTPPQ-TLLVAIDPSNDA 123
Query: 129 VWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGC 188
W PC++ C+ C + P+F P +SS+ + + C P+C+ +
Sbjct: 124 AWVPCSA---CLGC----APGASSPSFDPTQSSTYRPVRCGAPQCAQV------------ 164
Query: 189 SPRNKTCPL---ACPSYLLQYGLGFTAGLLLSETLRFPSK---TVPN--FLAGCSIL--- 237
P +CP A ++ L Y +L + L VP+ + GC +
Sbjct: 165 PPATPSCPAGPGASCAFNLSYASSTLHAVLGQDALSLSDSNGAAVPDDHYTFGCLRVVTG 224
Query: 238 --SDRQPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNL--VLDTGPGS 290
P G+ GFGR S SQ FSYCL S K SSN L GP
Sbjct: 225 SGGSVPPQGLVGFGRGPLSFLSQTKATYGSIFSYCLPSYK------SSNFSGTLRLGPAG 278
Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL-VPGSDGNGGV 349
+ + TP NP S YYV + + V K V IP S L + + G GG
Sbjct: 279 QPRR---IKTTPLLSNPHRPS-----LYYVAMVGVRVNGKAVPIPASALALDAATGRGGT 330
Query: 350 IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELI 409
IVD+G+ FT + P + A+ F R + A G C+ ++G KSV P +
Sbjct: 331 IVDAGTMFTRLSPPAYAALRNAFRRGV----SAPAAPALGGFDTCYYVNGTKSV--PAVA 384
Query: 410 LKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLE 467
F GGA++ LP EN + V CL + AAGP+ G + +L Q QN +
Sbjct: 385 FVFAGGARVTLPEENVVISSTSGGVACLAM----AAGPSDGVNAGLNVLASMQQQNHRVV 440
Query: 468 FDLANDRFGFAKQKC 482
FD+ N R GF+++ C
Sbjct: 441 FDVGNGRVGFSRELC 455
>gi|383130044|gb|AFG45742.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/152 (38%), Positives = 87/152 (57%), Gaps = 5/152 (3%)
Query: 291 GDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
GD P L+YTPF N SSS + FYY+ LR + +G K + +P + GNG
Sbjct: 5 GDKALPTAMSLNYTPFLINTKASSSGYNTFYYIDLRGVSIGRKRLNLPSKLFSFDNKGNG 64
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
G I+DSG+TFT ++ + F Q+G + RA++VE ++G+R C++ SG V LP+
Sbjct: 65 GTIIDSGTTFTIFNEEFYKNITAAFASQIG-FRRASEVEARTGMRLCYNASGVDHVLLPD 123
Query: 408 LILKFKGGAKMALPPENYFA-LVGNEVLCLIL 438
FKGG+ M LP NYF+ V + +CL +
Sbjct: 124 FAFHFKGGSDMVLPVANYFSYFVSFDSICLTM 155
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 113/399 (28%), Positives = 160/399 (40%), Gaps = 54/399 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y ++ GTP + DT S L W C RC P P F P+ S+
Sbjct: 132 GEYMAKIAVGTPAVQAL-LALDTASDLTWLQCQPCRRCY--------PQSGPVFDPRHST 182
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-----TAGLLL 216
S + P C + R G + TC Y +QYG G + G L+
Sbjct: 183 SYGEMNYDAPDCQAL------GRSGGGDAKRGTC-----IYTVQYGDGHGSTSTSVGDLV 231
Query: 217 SETLRFPSKTVPNFLA-GCSI----LSDRQPAGIAGFGRSSESLPSQLGL----KKFSYC 267
ETL F +L+ GC L AGI G GR S+P Q+ FSYC
Sbjct: 232 EETLTFAGGVRQAYLSIGCGHDNKGLFGAPAAGILGLGRGQISIPHQIAFLGYNASFSYC 291
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV 327
L+ F P S + L G G+ D+ P S+TP N FYYV L + V
Sbjct: 292 LV--DFISGPGSPSSTLTFGAGAVDTSPPA-SFTPTVLN-----QNMPTFYYVRLIGVSV 343
Query: 328 GSKHVKIP----YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
G V++P + G GGVI+DSG+T T + P + A F + + +
Sbjct: 344 GG--VRVPGVTERDLQLDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVS 401
Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNA 443
C+ + G+ V +P + + F GG +++L P+NY V + F
Sbjct: 402 TGGPSGLFDTCYTVGGRAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGT- 460
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G ++G+ Q F + +DLA R GFA C
Sbjct: 461 -----GDRSVSVIGNILQQGFRVVYDLAGQRVGFAPNNC 494
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 135/483 (27%), Positives = 197/483 (40%), Gaps = 82/483 (16%)
Query: 37 TPLSTKHYLHHSDSD-PLKILHSLASSSLSRARHLKTKTKPKTK------------DSNI 83
TP T+ + D + + H A LSR R L + ++K SN
Sbjct: 22 TPAPTEGAFFFAGGDVRVDLTHVDAGKQLSR-RELVRRAVQRSKARAAALSVARLGGSNK 80
Query: 84 GSNYSNSLIKTP-LSVHSYGG--YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCV 140
G+ + + P L V G Y + L+ GTPPQ + + DTGS L+W C C
Sbjct: 81 GARQQDQNQQQPGLPVRPSGDLEYLVDLAVGTPPQPVSALL-DTGSDLIWTQCAP---CA 136
Query: 141 DCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP 200
C P DP F P SSS + + C C+ I + C+ R TC
Sbjct: 137 SC-LPQPDP----IFSPGASSSYEPMRCAGELCNDI----LHHSCQ----RPDTC----- 178
Query: 201 SYLLQYGLGFTA-GLLLSETLRF--------PSKTVPNFLAGCSILSD---RQPAGIAGF 248
+Y YG G T G+ +E F +K GC ++ +GI GF
Sbjct: 179 TYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFGCGTMNKGSLNNGSGIVGF 238
Query: 249 GRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYK--- 305
GR+ SL SQL +++FSYCL + S+ L G D+ T + T +
Sbjct: 239 GRAPLSLVSQLAIRRFSYCLT--PYASGRKSTLLFGSLRGGVYDAATATVQTTRLLRSRQ 296
Query: 306 NPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF 365
NP FYYV + VG++ ++IP S DG+GG IVDSG+ T P+
Sbjct: 297 NPT--------FYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVDSGTALTLFPAPVL 348
Query: 366 EAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS---VYLPELILKFKGGAKMALPP 422
V + F Q+ AA+ CF + + +P ++ + GA + LP
Sbjct: 349 AEVVRAFRSQL-RLPFAANGSSGPDDGVCFAAAASRVPRPAVVPRMVFHLQ-GADLDLPR 406
Query: 423 ENYF---ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
NY GN LCL+L +G +G+F Q+ + +DL D FA
Sbjct: 407 RNYVLDDQRKGN--LCLLLADSGDSG--------TTIGNFVQQDMRVLYDLEADTLSFAP 456
Query: 480 QKC 482
+C
Sbjct: 457 AQC 459
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 112/392 (28%), Positives = 165/392 (42%), Gaps = 75/392 (19%)
Query: 110 FGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169
GTPP + G+ L+W C + FP +P +P S C
Sbjct: 1 MGTPPNP-VKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTFSRGLPFAS-------CG 52
Query: 170 NPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF--PSKT 226
+PK F PN +TC Y YG T G L + F +
Sbjct: 53 SPK----FWPN------------QTC-----VYTYSYGDKSVTTGFLEVDKFTFVGAGAS 91
Query: 227 VPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNL 282
VP GC + ++ GIAGFGR SLPSQL + FS+C + + S +
Sbjct: 92 VPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTIT---GAIPSTV 148
Query: 283 VLD-------TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
+LD G G+ + TP + Y NP YY+ L+ I VGS + +P
Sbjct: 149 LLDLPADLFSNGQGAVQT-TPLIQYAKNEANPT--------LYYLSLKGITVGSTRLPVP 199
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
S ++G GG I+DSG++ T + +++ V EF Q+ +G CF
Sbjct: 200 ESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI---KLPVVPGNATGHYTCF 255
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCLILFTDNAAGPALGRG 451
+ +P+L+L F+ GA M LP ENY V GN ++CL A+ +G
Sbjct: 256 SAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICL----------AINKG 304
Query: 452 -PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+FQ QN ++ +DL N+ F +C
Sbjct: 305 DETTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 336
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 133/490 (27%), Positives = 200/490 (40%), Gaps = 80/490 (16%)
Query: 24 GAGSSAATVTV-----PLTPLSTKHYLHHSDSDPL--------KILHSLASSSLSRARHL 70
GA SS +T+ P +PL+ H S D L I H +++++ R
Sbjct: 79 GATSSGTRMTIVHRHGPCSPLADAHGKPPSHEDILAADQNRAESIQHRVSTTATGRGNPK 138
Query: 71 KTKTKPKTKDSNIGSNYSNSLIKTPLSVH--------SYGGYSISLSFGTPPQASTPFIF 122
+++ P + + + + + + G Y +++ GTP T +F
Sbjct: 139 RSRRAPSRRQQPSSAPAPAASLSSSTASLPASSGRALGTGNYVVTVGLGTPASRYT-VVF 197
Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVE 182
DTGS W C CV + R F P RSS+ I C P CS +++
Sbjct: 198 DTGSDTTWVQCQP---CVVVCYEQ----REKLFDPARSSTYANISCAAPACS-----DLD 245
Query: 183 SRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPS-KTVPNFLAGCSILSDR 240
+R GCS N Y +QYG G ++ G +TL S V F GC ++
Sbjct: 246 TR--GCSGGNCL-------YGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEG 296
Query: 241 ---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
+ AG+ G GR SLP Q K F++CL +R S LD GPGS +
Sbjct: 297 LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARS------SGTGYLDFGPGSPAAA 350
Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
L+ N FYYVG+ I VG + + IP S G IVDSG
Sbjct: 351 GARLTTPMLTDNGP-------TFYYVGMTGIRVGGQLLSIPQSVFT-----TAGTIVDSG 398
Query: 355 STFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
+ T + + ++ F M Y +A V S L C+D +G V +P + L F
Sbjct: 399 TVITRLPPAAYSSLRSAFASAMAARGYKKAPAV---SLLDTCYDFTGMSQVAIPTVSLLF 455
Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAN 472
+GGA++ + +CL F N G +G I+G+ QL+ F + +D+
Sbjct: 456 QGGARLDVDASGIMYAASVSQVCL-GFAANEDGGDVG-----IVGNTQLKTFGVAYDIGK 509
Query: 473 DRFGFAKQKC 482
GF+ C
Sbjct: 510 KVVGFSPGAC 519
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 108/387 (27%), Positives = 172/387 (44%), Gaps = 52/387 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS + W C C +C + DP F P SS
Sbjct: 162 GEYFSRIGVGTPAK-EMYVVLDTGSDVNWIQC---LPCSEC-YQQSDP----IFDPTSSS 212
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ + + C +PKC+ + C R+ C Y + YG G FT G ++T+
Sbjct: 213 TFKSLTCSDPKCASL-------DVSAC--RSNKCL-----YQVSYGDGSFTVGNYATDTV 258
Query: 221 RF-PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
F S V + GC ++ AG+ G G + S+ +Q+ K FSYCL+ R D+
Sbjct: 259 TFGESGKVNDVALGCGHDNEGLFTGAAGLLGLGGGALSMTNQIKAKSFSYCLVDR---DS 315
Query: 277 PVSSNLVLDTGP-GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
SS+L ++ G+GD+ P L +S FYYVGL VG + V IP
Sbjct: 316 AKSSSLDFNSVQIGAGDATAPLLR-----------NSKMDTFYYVGLSGFSVGGQQVSIP 364
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
S + G GGVI+D G+ T ++ + ++ F++ ++ + S C+
Sbjct: 365 SSLFEVDASGAGGVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFKKG--TSPISLFDTCY 422
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
D S +V +P + F GG + LP +NY + + F ++ + I
Sbjct: 423 DFSSLSTVKVPTVTFHFTGGKSLNLPAKNYLIPIDDAGTFCFAFAPTSSSLS-------I 475
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
+G+ Q Q + +DLAN+ G + KC
Sbjct: 476 IGNVQQQGTRITYDLANNLIGLSANKC 502
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 132/434 (30%), Positives = 179/434 (41%), Gaps = 70/434 (16%)
Query: 59 LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQAST 118
LA SSL+R+ T+T+ + + TP+S + G S Q S
Sbjct: 119 LALSSLNRSDLYPTETELLRPED----------LSTPVSSGTAQGSGEYFSRVGVGQPSK 168
Query: 119 PF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
PF + DTGS + W C C DC + DP F P SSS + C +C
Sbjct: 169 PFYMVLDTGSDVNWLQCKP---CSDC-YQQSDP----IFDPTASSSYNPLTCDAQQCQ-- 218
Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCS 235
+ C RN C Y + YG G FT G ++ET+ F + +V GC
Sbjct: 219 -----DLEMSAC--RNGKCL-----YQVSYGDGSFTVGEYVTETVSFGAGSVNRVAIGCG 266
Query: 236 ILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
++ AG+ G G SL SQ+ FSYCL+ R D+ SS L ++ P GD
Sbjct: 267 HDNEGLFVGSAGLLGLGGGPLSLTSQIKATSFSYCLVDR---DSGKSSTLEFNS-PRPGD 322
Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVD 352
S P KN FYYV L + VG + V +P G GGVIVD
Sbjct: 323 SVV-----APLLKN-----QKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVD 372
Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
SG+ T + + +V F R+ N A E + C+D+S +SV +P + F
Sbjct: 373 SGTAITRLRTQAYNSVRDAFKRKTSNLRPA---EGVALFDTCYDLSSLQSVRVPTVSFHF 429
Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI----ILGDFQLQNFYLEF 468
G ALP +NY V D A P I+G+ Q Q + F
Sbjct: 430 SGDRAWALPAKNYLIPV-----------DGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSF 478
Query: 469 DLANDRFGFAKQKC 482
DLAN GF+ KC
Sbjct: 479 DLANSLVGFSPNKC 492
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 114/418 (27%), Positives = 169/418 (40%), Gaps = 66/418 (15%)
Query: 75 KPKTKDSNIGSNYSNSLIKTPLSVHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPC 133
KP++ ++ SN N PL + GG Y + S GTPPQ T + DTGS L+W C
Sbjct: 72 KPQSSSASQLSN--NDTDTVPLRMDGGGGAYDMEFSIGTPPQKLTA-LADTGSDLIWTKC 128
Query: 134 TSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNK 193
+ ++ P SS+ + C + C+ + ++ C+
Sbjct: 129 DAG--------GGAAWGGSSSYHPNASSTFTRLPCSDRLCAALRSYSLAR----CAAGGA 176
Query: 194 TCPLACPSYLLQYGLG----FTAGLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIA 246
C Y YGLG FT G L SET VP GC+ + + AG+
Sbjct: 177 EC-----DYKYAYGLGDDPDFTQGFLGSETFTLGGDAVPGVGFGCTTALEGDYGEGAGLV 231
Query: 247 GFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS-SNLVLDTGPGSGDSKTPGLSYTPFYK 305
G GR SL SQL F YCL + +P+ L TG G+G T L+ T FY
Sbjct: 232 GLGRGPLSLVSQLDAGTFMYCLTADASKASPLLFGALATMTGAGAGVQSTGLLASTTFYA 291
Query: 306 NPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF 365
V LR I +GS V+ DSG+T T++ P +
Sbjct: 292 --------------VNLRSITIGSATTAGVGGPGG--------VVFDSGTTLTYLAEPAY 329
Query: 366 EAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENY 425
F+ Q + + VE + G C++ + +P ++L F GGA MALP NY
Sbjct: 330 TEAKAAFLSQTTSLT---PVEGRYGFEACYEKPDSARL-IPAMVLHFDGGADMALPVANY 385
Query: 426 FALVGNEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
V + V+C + + R P++ I+G+ N+ + D+ F C
Sbjct: 386 VVEVDDGVVCWV----------VQRSPSLSIIGNIMQMNYLVLHDVRKSVLSFQPANC 433
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 112/374 (29%), Positives = 156/374 (41%), Gaps = 54/374 (14%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
+ + G PPQ IFD + W C +C D P I F P +SSS L
Sbjct: 189 VQIGVGGPPQKFY-MIFDLQTDFTWLQCQPCIKCYD------QPDSI--FDPSQSSSYTL 239
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRFPS 224
+ C+ C+ + PN G N T Y G T G+L++ET+ F S
Sbjct: 240 LSCETKHCNLL--PNSSCSDDGYCRYNIT-----------YKDGTNTEGVLINETVSFES 286
Query: 225 K-TVPNFLAGCSILSDRQP----AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
V GCS ++ P G G GR S S PS++ SYCL+ K D S
Sbjct: 287 SGWVDRVSLGCSN-KNQGPFVGSDGTFGLGRGSLSFPSRINASSMSYCLVESK--DGYSS 343
Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
S L ++ P SG K L +NP + YYVGL+ I VG + + +P S
Sbjct: 344 STLEFNSPPCSGSVKAKLL------QNPKAEN-----LYYVGLKGIKVGGEKIDVPNSTF 392
Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG 399
GNGG+IV S S T +E + V F+ + + R + C+++S
Sbjct: 393 TIDPYGNGGMIVSSSSLITMLENDTYNVVRDAFVAKTQHLERLKAFLQ---FDTCYNLSS 449
Query: 400 KKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
+V LP L + G LP E+Y +A+ N C A P+ +G ILG
Sbjct: 450 NNTVELPILEFEVNDGKSWLLPKESYLYAVDKNGTFCF------AFAPS--KGSFSILGT 501
Query: 459 FQLQNFYLEFDLAN 472
Q + FDL N
Sbjct: 502 LQQYGTRVTFDLVN 515
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 157/387 (40%), Gaps = 58/387 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + GTPPQ + D W PC CV C+ F +S++
Sbjct: 35 YIVKAKVGTPPQ-TLLMALDNSYDAAWIPCKG---CVGCSST--------VFNTVKSTTF 82
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ +GC P+C + P C G TC ++ YG L +T+
Sbjct: 83 KTLGCGAPQCKQVPNP----ICGG-----STC-----TWNTTYGSSTILSNLTRDTIALS 128
Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
VP + GC + S P G+ GFGR S SQ L FSYCL S F
Sbjct: 129 MDPVPYYAFGCIQKATGSSVPPQGLLGFGRGPLSFLSQTQNLYKSTFSYCLPS--FRTLN 186
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S +L L GP + P + TP KNP SS YYV L I VG K V IP S
Sbjct: 187 FSGSLRL--GP---VGQPPRIKTTPLLKNPRRSS-----LYYVKLNGIRVGRKIVDIPRS 236
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
L G I DSG+ FT + P + AV EF +++GN A V G C+ +
Sbjct: 237 ALAFNPTTGAGTIFDSGTVFTRLVAPAYIAVRNEFRKRVGN----ATVSSLGGFDTCYSV 292
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGPAIIL 456
+P I G + +PPEN V CL + AA P ++
Sbjct: 293 P-----IVPPTITFMFSGMNVTMPPENLLIHSTAGVTSCLAM----AAAPDNVNSVLNVI 343
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
Q QN + FD+ N R G A+++C+
Sbjct: 344 ASMQQQNHRILFDVPNSRLGVAREQCS 370
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 113/386 (29%), Positives = 172/386 (44%), Gaps = 52/386 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+PP+ + DTGS + W C C DC + DP F P SS
Sbjct: 153 GEYFSRVGIGSPPK-HVYMVVDTGSDVNWVQCAP---CADC-YQQADP----IFEPSFSS 203
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C+ +C + +V S C RN +C Y + YG G +T G +ET+
Sbjct: 204 SYAPLTCETHQCKSL---DV-SEC-----RNDSCL-----YEVSYGDGSYTVGDFATETI 249
Query: 221 RFP-SKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
S ++ N GC ++ AG+ G G S S PSQ+ FSYCL++R D A
Sbjct: 250 TLDGSASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCLVNRDTDSA 309
Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
S L + S P S T P+ ++ FYY+G+ I VG + + IP
Sbjct: 310 ---STLEFN-------SPIPSHSVTA----PLLRNNQLDTFYYLGMTGIGVGGQMLSIPR 355
Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
S GNGG+IVDSG+ T ++ ++ ++ F+R + + V C+D
Sbjct: 356 SSFEVDESGNGGIIVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVAL---FDTCYD 412
Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIIL 456
+S + SV +P + F G +ALP +NY V + F + + I+
Sbjct: 413 LSSRSSVEVPTVSFHFPDGKYLALPAKNYLIPVDSAGTFCFAFAPTTSALS-------II 465
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
G+ Q Q + +DL+N GF+ C
Sbjct: 466 GNVQQQGTRVSYDLSNSLVGFSPNGC 491
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 115/406 (28%), Positives = 177/406 (43%), Gaps = 81/406 (19%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y I +S GTPP+ + DTGS ++W C CV C + D F P +SS
Sbjct: 35 GEYFIRVSVGTPPRG-MYLVMDTGSDILWLQCAP---CVSC-YHQCDE----VFDPYKSS 85
Query: 162 SSQLIGCQNPKC-SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
+ +GC + +C + G V ++C Y + YG G F+ G ++
Sbjct: 86 TYSTLGCNSRQCLNLDVGGCVGNKCL---------------YQVDYGDGSFSTGEFATDA 130
Query: 220 LRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSE-----------------SLPSQLGLK 262
+ S + G ++ ++ P G G +E S P+Q+ +
Sbjct: 131 VSLNSTS-----GGGQVVLNKIPLGC---GHDNEGYFVGAAGLLGLGKGPLSFPNQINSE 182
Query: 263 ---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP--GLSYTPFYKNPVGSSSAFGEF 317
+FSYCL R D SS + GD+ P G+ +TP N S+ F
Sbjct: 183 NGGRFSYCLTGRDTDSTERSSLIF-------GDAAVPPAGVRFTPQASNLRVST-----F 230
Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
YY+ + I VG + IP S S GNGGVI+DSG++ T ++ + ++ + F
Sbjct: 231 YYLKMTGISVGGSILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLREAF---RA 287
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCL 436
S + S C+++S SV +P + L F+GGA + LP NY V N CL
Sbjct: 288 GTSDLVLTTEFSLFDTCYNLSDLSSVDVPTVTLHFQGGADLKLPASNYLVPVDNSSTFCL 347
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
AG GP+II G+ Q Q F + +D +++ GF +C
Sbjct: 348 AF-----AGTT---GPSII-GNIQQQGFRVIYDNLHNQVGFVPSQC 384
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 116/398 (29%), Positives = 167/398 (41%), Gaps = 61/398 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDC-NFPNVDPSRIPAFIPKRSSS 162
Y + L+ GTPPQ T + DTGS L+W C + C C P+ P F P+ SSS
Sbjct: 98 YVLDLAVGTPPQPITALL-DTGSDLIWTQCDT---CTACLRQPD------PLFSPRMSSS 147
Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLR 221
+ + C C G + C R TC +Y YG G T G +E
Sbjct: 148 YEPMRCAGQLC----GDILHHSCV----RPDTC-----TYRYSYGDGTTTLGYYATERFT 194
Query: 222 FPS-----KTVPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKF 273
F S ++VP GC + S +GI GFGR SL SQL +++FSYCL
Sbjct: 195 FASSSGETQSVPLGF-GCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFSYCL----- 248
Query: 274 DDAPVSSNLVLDTGPGS-GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
P +S+ GS D + P P+ S+ FYYV + VG++ +
Sbjct: 249 --TPYASSRKSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRL 306
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
+IP S DG+GGVI+DSG+ T + V + F Q+ A G+
Sbjct: 307 RIPASAFALRPDGSGGVIIDSGTALTLFPAAVLAEVVRAFRSQL-RLPFANGSSPDDGV- 364
Query: 393 PCFDISG--------KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA 444
CF + V +P ++ F+ GA + LP ENY +L D+
Sbjct: 365 -CFAAPAVAAGGGRMARQVAVPRMVFHFQ-GADLDLPRENYVLEDHRRGHLCVLLGDSGD 422
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A +G+F Q+ + +DL + FA +C
Sbjct: 423 DGA-------TIGNFVQQDMRVVYDLERETLSFAPVEC 453
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 119/402 (29%), Positives = 167/402 (41%), Gaps = 61/402 (15%)
Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y + L+ GTPP PF+ DTGS L W C C + P D + +F P
Sbjct: 95 YLMELAIGTPP---VPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSP---- 147
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLG-FTAGLLLSET 219
+ C + C I+ S RN T P Y Y G ++AG+L +ET
Sbjct: 148 ----VPCASATCLPIWR----------SSRNCTATTTSPCRYRYAYDDGAYSAGVLGTET 193
Query: 220 LRF---------PSKTVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQLGLKKFSYC 267
L F P +V GC + + G G GR S SL +QLG+ KFSYC
Sbjct: 194 LTFAGSSPGAPGPGVSVGGVAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYC 253
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
L F + + S ++ + T G + TP + P S YYV L
Sbjct: 254 L--TDFFNTSLGSPVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSR-----YYVSLEG 306
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
I +G + IP DG+GG+IVDSG+ FT L E+ + + +
Sbjct: 307 ISLGDARLPIPNGTFDLRDDGSGGMIVDSGTIFTV----LVESAFRVVVNHVAGVLNQPV 362
Query: 385 VEKKSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTD 441
V S PCF + + +P+++L F GGA M L +NY + + CL
Sbjct: 363 VNASSLDSPCFPATAGEQQLPDMPDMLLHFAGGADMRLHRDNYMSFNQESSSFCL----- 417
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N AG G ILG+FQ QN + FD+ + F C+
Sbjct: 418 NIAGAPSAYGS--ILGNFQQQNIQMLFDITVGQLSFVPTDCS 457
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 112/392 (28%), Positives = 166/392 (42%), Gaps = 63/392 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +S+ GTP + IFDTGS L W C C DC + D P F P SS
Sbjct: 147 GNYVVSVGLGTPAK-QYAVIFDTGSDLSWVQCKP---CADC-YEQQD----PLFDPSLSS 197
Query: 162 SSQLIGCQNPKCSWI--FGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
+ + C P+C + G + +SRC+ Y +QYG T G L+ +
Sbjct: 198 TYAAVACGAPECQELDASGCSSDSRCR---------------YEVQYGDQSQTDGNLVRD 242
Query: 219 TLRF-PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
TL S T+P F+ GC + Q G+ G GR SLPSQ F+YCL S
Sbjct: 243 TLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSS 302
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
+S G + +T + A FYY+ L I VG +
Sbjct: 303 SSGRGYLS----------LGGAPPANAQFTAL------ADGATPSFYYIDLVGIKVGGRA 346
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
++IP + ++DSG+ T + + + F R M Y +A + S L
Sbjct: 347 IRIPATAFAAAGG----TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPAL---SIL 399
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C+D +G ++ +P + L F GGA ++L + CL F NA ++
Sbjct: 400 DTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLYVSKVSQACLA-FAPNADDSSIA-- 456
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
ILG+ Q + F + +D+AN R GF + C+
Sbjct: 457 ---ILGNTQQKTFAVAYDVANQRIGFGAKGCS 485
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 118/435 (27%), Positives = 184/435 (42%), Gaps = 58/435 (13%)
Query: 71 KTKTKPKTKDSNIG-SNYSNSLIKTP--------LSVHSYGGYSISLSFGTPPQASTPFI 121
T K ++S I +N +N+ +K+P LS + L GTPPQ P +
Sbjct: 55 NTALKMMLRNSLIANTNNNNTQLKSPPSSPYNYKLSFKYSMALIVDLPIGTPPQVQ-PMV 113
Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS-WIFGPN 180
DTGS L W C + P +F P SS+ + C +P C I
Sbjct: 114 LDTGSQLSWIQCHKK--------APAKPPPTASFDPSLSSTFSTLPCTHPVCKPRIPDFT 165
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP-SKTVPNFLAGCSILSD 239
+ + C +N+ C SY G + G L+ E F S P + GC+ S
Sbjct: 166 LPTSCD----QNRLCHY---SYFYADGT-YAEGNLVREKFTFSRSLFTPPLILGCATEST 217
Query: 240 RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR--KFDDAPVSSNLVLDTGPGSGDSK-TP 296
P GI G R S SQ + KFSYC+ +R + P S L P S +
Sbjct: 218 -DPRGILGMNRGRLSFASQSKITKFSYCVPTRVTRPGYTPTGS-FYLGHNPNSNTFRYIE 275
Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
L++ + P A Y V L+ I +G + + I + + G+G ++DSGS
Sbjct: 276 MLTFARSQRMPNLDPLA----YTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLDSGSE 331
Query: 357 FTFMEGPLFEAVAKEFIRQMG-------NYSRAADVEKKSGLRPCFDISG-KKSVYLPEL 408
FT++ ++ V E +R +G Y AD+ CFD + + + ++
Sbjct: 332 FTYLVNEAYDKVRAEVVRAVGPRMKKGYVYGGVADM--------CFDGNAIEIGRLIGDM 383
Query: 409 ILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
+ +F+ G ++ +P E A V V C+ + + G A + I+G+F QN ++EF
Sbjct: 384 VFEFEKGVQIVVPKERVLATVEGGVHCIGIANSDKLGAA-----SNIIGNFHQQNLWVEF 438
Query: 469 DLANDRFGFAKQKCA 483
DL N R GF C+
Sbjct: 439 DLVNRRMGFGTADCS 453
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 125/425 (29%), Positives = 180/425 (42%), Gaps = 71/425 (16%)
Query: 88 SNSLIKTP--LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP 145
SNS ++P L ++SL+ GTPPQ + + DTGS L W + CN
Sbjct: 13 SNSFPRSPNKLPFRHNISLTVSLTVGTPPQ-NVSMVIDTGSELSW---------LYCNKT 62
Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
S F RS S + I C + C+ R+ + P +C S L
Sbjct: 63 TTTTSYPTTFNQTRSISYRPIPCSSSTCT-------------NQTRDFSIPASCDSNSLC 109
Query: 206 YG-LGF-----TAGLLLSETLRFPSKTVPNFLAGC--SILS-----DRQPAGIAGFGRSS 252
+ L + + G L S+T + +P + GC S+ S D + G+ G R S
Sbjct: 110 HATLSYADASSSEGNLASDTFHMGASDIPGMVFGCMDSVFSSNSDEDSKNTGLMGMNRGS 169
Query: 253 ESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
S SQ+G KFSYC+ F S L+L G + P L+YTP V S+
Sbjct: 170 LSFVSQMGFPKFSYCISGTDF-----SGMLLL--GESNFTWAVP-LNYTPL----VQIST 217
Query: 313 AFGEF----YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
F Y V L I V + + IP S P G G +VDSG+ FTF+ GP + A+
Sbjct: 218 PLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGAGQTMVDSGTQFTFLLGPAYTAL 277
Query: 369 AKEFIRQMGNYSRAA---DVEKKSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPE 423
EF+ Q + R D + + C+ + + V LP + L F GA+M + E
Sbjct: 278 RSEFLNQTTGFLRVLEDPDFVFQGAMDLCYRVPISQRVLPRLPTVSLVFN-GAEMTVADE 336
Query: 424 NYFALV-----GNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
V GN+ V CL + G A ++G QN ++EFDL R G
Sbjct: 337 RVLYRVPGEIRGNDSVHCLSFGNSDLLGVE-----AYVIGHHHQQNVWMEFDLERSRIGL 391
Query: 478 AKQKC 482
A+ +C
Sbjct: 392 AQVRC 396
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 112/392 (28%), Positives = 166/392 (42%), Gaps = 63/392 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +S+ GTP + IFDTGS L W C C DC + D P F P SS
Sbjct: 147 GNYVVSVGLGTPAK-QYAVIFDTGSDLSWVQCKP---CADC-YEQQD----PLFDPSLSS 197
Query: 162 SSQLIGCQNPKCSWI--FGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
+ + C P+C + G + +SRC+ Y +QYG T G L+ +
Sbjct: 198 TYAAVACGAPECQELDASGCSSDSRCR---------------YEVQYGDQSQTDGNLVRD 242
Query: 219 TLRF-PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
TL S T+P F+ GC + Q G+ G GR SLPSQ F+YCL S
Sbjct: 243 TLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSS 302
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
+S G + +T + A FYY+ L I VG +
Sbjct: 303 SSGRGYLS----------LGGAPPANAQFTAL------ADGATPSFYYIDLVGIKVGGRA 346
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
++IP + ++DSG+ T + + + F R M Y +A + S L
Sbjct: 347 IRIPATAFAAAGG----TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPAL---SIL 399
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C+D +G ++ +P + L F GGA ++L + CL F NA ++
Sbjct: 400 DTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLYVSKVSQACLA-FAPNADDSSIA-- 456
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
ILG+ Q + F + +D+AN R GF + C+
Sbjct: 457 ---ILGNTQQKTFAVTYDVANQRIGFGAKGCS 485
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 122/482 (25%), Positives = 205/482 (42%), Gaps = 71/482 (14%)
Query: 14 LLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTK 73
L++L F+ + +++ +++ PLT LS + D+ + S S+ + + + K
Sbjct: 5 LVVLFFSINPSQQTNSLSLSFPLTSLSLSN-----DTTSKMLYTSQLFSTTKKPNNPQNK 59
Query: 74 TKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPC 133
T + + YS +LI I+L GTPPQ + P + DTGS L W C
Sbjct: 60 TP--SYNYKFSFKYSMALI-------------INLPIGTPPQ-TQPMVLDTGSQLSWIQC 103
Query: 134 TSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS-WIFGPNVESRCKGCSPRN 192
+ P+ +F P SS+ ++ C +P C I + + C +N
Sbjct: 104 HKKQ----------PPT--ASFDPSLSSTFSILPCTHPLCKPRIPDFTLPTSCD----QN 147
Query: 193 KTCPLACPSYLLQYGLGFTAGLLLSETLRFP-SKTVPNFLAGCSILSDRQPAGIAGFGRS 251
+ C SY G + G L+ E F S + P + GC+ S P GI G
Sbjct: 148 RLCHY---SYFYADGT-YAEGNLVREKFTFSRSVSTPPLILGCATEST-DPRGILGMNLG 202
Query: 252 SESLPSQLGLKKFSYCLLSRKFDDAPV-SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS 310
S Q + KFSYC+ R+ + + L P S K G+ + + P
Sbjct: 203 RLSFAKQSKITKFSYCVPPRQTRPGFTPTGSFYLGNNPSSKGFKYVGMMTSSRQRMPNFD 262
Query: 311 SSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAK 370
A Y + + I + K + I + + G+G ++DSGS FT++ ++ V
Sbjct: 263 PLA----YTIPMVGIRIAGKKLNISPAVFRADAGGSGQTMIDSGSEFTYLVSEAYDKVRA 318
Query: 371 EFIRQMG-------NYSRAADVEKKSGLRPCFDISGKKSV--YLPELILKFKGGAKMALP 421
+ +R +G Y AD+ CFD + + E++ +F+ G ++ +P
Sbjct: 319 QVVRAVGPRLKKGYVYGGVADM--------CFDSVKAVEIGRLIGEMVFEFERGVEVVIP 370
Query: 422 PENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQK 481
E A VG V C+ + + + G A + I+G+F QN ++EFDL R GF K
Sbjct: 371 KERVLADVGGGVHCVGIGSSDKLGAA-----SNIIGNFHQQNLWVEFDLVRRRVGFGKAD 425
Query: 482 CA 483
C+
Sbjct: 426 CS 427
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 118/403 (29%), Positives = 167/403 (41%), Gaps = 71/403 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPK 158
G Y + GTP + + DTGS + W PCT+ Y+ D F P
Sbjct: 14 GEYFAVVGVGTP-RRDMYLVVDTGSDITWLQCAPCTNCYKQKDA-----------LFNPS 61
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLS 217
SSS +++ C + C + GC NK Y YG G FT G L++
Sbjct: 62 SSSSFKVLDCSSSLC-------LNLDVMGCL-SNKCL------YQADYGDGSFTMGELVT 107
Query: 218 ETLRF-----PSKTV-PNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKK---FS 265
+ + P + V N GC ++ AGI G GR S P+ L FS
Sbjct: 108 DNVVLDDAFGPGQVVLTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFS 167
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP-----GLSYTPFYKNPVGSSSAFGEFYYV 320
YCL R+ D S LV GD+ P + + P +NP +YYV
Sbjct: 168 YCLPDRE-SDPNHKSTLVF------GDAAIPHTATGSVKFIPQLRNP-----RVATYYYV 215
Query: 321 GLRQIIVGSKHV-KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
+ I VG + IP S S GNGG I DSG+T T +E + AV F +
Sbjct: 216 QITGISVGGNLLTNIPASVFQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHL 275
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
+ AAD + C+D +G S+ +P + F+G M LPP NY V N + F
Sbjct: 276 TSAADFKI---FDTCYDFTGMNSISVPTVTFHFQGDVDMRLPPSNYIVPVSNNNIFCFAF 332
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A GP++I G+ Q Q+F + +D + + G +C
Sbjct: 333 -------AASMGPSVI-GNVQQQSFRVIYDNVHKQIGLLPDQC 367
>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
Length = 464
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 110/400 (27%), Positives = 166/400 (41%), Gaps = 51/400 (12%)
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVD--------PSRIPAFIPKRSSSSQLIGCQNPK 172
+ DTGS LVW C++ C P V P +P + S +++ + C +
Sbjct: 77 VVDTGSDLVWTQCST------CRLPAVAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDD 130
Query: 173 CSWI-FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFL 231
+ P +G + C +A YG G G+L ++ FPS +
Sbjct: 131 GALCGVAPETAGCARGGGSGDDACVVAA-----SYGAGVALGVLGTDAFTFPSSSSVTLA 185
Query: 232 AGCSILSDRQP------AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLD 285
GC + P +GI G GR + SL SQL +FSYCL + F D S+L +
Sbjct: 186 FGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCL-TPYFRDTVSPSHLFVG 244
Query: 286 TGPGSGDSKTPG--------LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
G +G G ++ PF KNP S F FYY+ L + G+ V +P
Sbjct: 245 DGELAGLRAAAGGGGGGGAPVTTVPFAKNP--KDSPFSTFYYLPLVGLAAGNATVALPAG 302
Query: 338 YL----VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSG-L 391
GG ++DSGS FT + P A+ KE RQ+ G+ S K G L
Sbjct: 303 AFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGAL 362
Query: 392 RPCF----DISGKKSVYLPELILKFK----GGAKMALPPENYFALVGNEVLCLILFTDNA 443
C D + +P L+L+F GG ++ +P E Y+A V C+ + + +
Sbjct: 363 ELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEASTWCMAVVSSAS 422
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
L I+G+F Q+ + +DLAN F C+
Sbjct: 423 GNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 462
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 118/428 (27%), Positives = 186/428 (43%), Gaps = 55/428 (12%)
Query: 66 RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
R ++ + +T S I + + T Y +++ G+ + I DTG
Sbjct: 84 HVRSIQNHIRKRTSSSQIADSSETQVPLTSGIKFQTLNYIVTMGLGSQ---NMSVIVDTG 140
Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
S L W C C + N P PS P++ P I C + C ++E
Sbjct: 141 SDLTWVQCEPCRSCYNQNGPLFKPSTSPSYQP--------ILCNSTTCQ-----SLELGA 187
Query: 186 KGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR---Q 241
G P A Y++ YG G +T+G L E L F +V NF+ GC +
Sbjct: 188 CGSDPSTS----ATCDYVVNYGDGSYTSGELGIEKLGFGGISVSNFVFGCGRNNKGLFGG 243
Query: 242 PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGL 298
+G+ G GRS S+ SQ FSYCL S D A S +LV+ G + TP +
Sbjct: 244 ASGLMGLGRSELSMISQTNATFGGVFSYCLPST--DQAGASGSLVMGNQSGVFKNVTP-I 300
Query: 299 SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFT 358
+YT N FY + L I VG + + S S GNGGVI+DSG+ +
Sbjct: 301 AYTRMLPNL-----QLSNFYILNLTGIDVGGVSLHVQAS-----SFGNGGVILDSGTVIS 350
Query: 359 FMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKM 418
+ +++A+ +F+ Q + A S L CF+++G V +P + + F+G A++
Sbjct: 351 RLAPSVYKALKAKFLEQFSGFPSAPGF---SILDTCFNLTGYDQVNIPTISMYFEGNAEL 407
Query: 419 ALPPENYFALVGNEV--LCLIL--FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
+ F LV + +CL L +D +G I+G++Q +N + +D +
Sbjct: 408 NVDATGIFYLVKEDASRVCLALASLSDEY---EMG-----IIGNYQQRNQRVLYDAKLSQ 459
Query: 475 FGFAKQKC 482
GFAK+ C
Sbjct: 460 VGFAKEPC 467
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 115/404 (28%), Positives = 169/404 (41%), Gaps = 66/404 (16%)
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
++SL+ GTPPQ+ T + DTGS L W C + N++ F P SSS
Sbjct: 71 TVSLTVGTPPQSVT-MVLDTGSELSWLHCKKQ--------QNINS----VFNPHLSSSYT 117
Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS----YLLQYGLGFTA--GLLLSE 218
I C +P C R+ P++C S ++ FT+ G L S+
Sbjct: 118 PIPCMSPICK-------------TRTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASD 164
Query: 219 TLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
T P + G + D + G+ G R S S +Q+G KFSYC+ +
Sbjct: 165 TFAISGSGQPGIIFGSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQMGFPKFSYCISGK 224
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
++ VL G + P L YTP K Y V L I VGSK
Sbjct: 225 D-------ASGVLLFGDATFKWLGP-LKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKP 276
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAAD--VEKK 388
+++P P G G +VDSG+ FTF+ G ++ A+ EF+ Q G + D +
Sbjct: 277 LQVPKEIFAPDHTGAGQTMVDSGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFE 336
Query: 389 SGLRPCFDISGKKSV-YLPELILKFKGGAKMALPPENYFALVG---------NEVLCLIL 438
+ CF + V +P + + F+ GA+M++ E VG +V CL
Sbjct: 337 GAMDLCFRVRRGGVVPAVPAVTMVFE-GAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTF 395
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ G A ++G QN ++EFDL N R GFA KC
Sbjct: 396 GNSDLLGIE-----AYVIGHHHQQNVWMEFDLVNSRVGFADTKC 434
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 138/469 (29%), Positives = 192/469 (40%), Gaps = 58/469 (12%)
Query: 35 PLTPLSTKHYLHHSDS-DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIK 93
P T L L HSDS L ILH L + R K K S+ S+ I+
Sbjct: 17 PKTQLQRLKELVHSDSVRQLMILHKLRGGQIPR-------RKAKEVLSSSSGRGSDDAIE 69
Query: 94 TPL---SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
P+ + + G YS++ GTP Q + DTGS L W C +Y C N N
Sbjct: 70 VPMHPAADYGIGQYSVAFKVGTPSQKFM-LVADTGSDLTWMSC--KYHCRSRNCSNRKAR 126
Query: 151 RIP---AFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG 207
RI F SSS + I C C +E S N PL Y +Y
Sbjct: 127 RIRHKRVFHANLSSSFKTIPCLTDMC------KIE-LMDLFSLTNCPTPLTPCGYDYRYS 179
Query: 208 LGFTA-GLLLSETLRFPSKT-----VPNFLAGCSI----LSDRQPAGIAGFGRSSESLPS 257
G TA G +ET+ K + N L GCS S + G+ G G S S
Sbjct: 180 DGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAI 239
Query: 258 QLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
+ K KFSYCL+ VS+ L GS SK L+ + + +G ++F
Sbjct: 240 KAAEKFGGKFSYCLVDH-LSHKNVSNYLTF----GSSRSKEALLNNMTYTELVLGMVNSF 294
Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
Y V + I +G +KIP V G GG I+DSGS+ TF+ P ++ V
Sbjct: 295 ---YAVNMMGISIGGAMLKIPSE--VWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRV 349
Query: 375 QMGNYSRAADVEKKSG-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV 433
+ + + VE G L CF+ +G + +P L+ F GA+ P ++Y + V
Sbjct: 350 SLLKFRK---VEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGV 406
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
CL + G + ++G+ QN EFDL + GFA C
Sbjct: 407 RCLGFVSVAWPGTS-------VVGNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 115/392 (29%), Positives = 165/392 (42%), Gaps = 70/392 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +S+ G+P + IFDTGS L W C S F P +S+
Sbjct: 132 GNYIVSIGLGSPKK-DLMLIFDTGSDLTWARC----------------SAAETFDPTKST 174
Query: 162 SSQLIGCQNPKCSWIFGPNVE-SRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
S + C P CS + SRC TC Y +QYG G ++ G L E
Sbjct: 175 SYANVSCSTPLCSSVISATGNPSRCAA-----STCV-----YGIQYGDGSYSIGFLGKER 224
Query: 220 LRFPSKTV-PNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRK 272
L S + NF GC D + AG+ G GR S+ SQ K FSYCL
Sbjct: 225 LTIGSTDIFNNFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCL---- 280
Query: 273 FDDAPVSSNLVLDTGPGS-GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
P SS+ TG S G S++ +TP P FY + L I VG +
Sbjct: 281 ----PSSSS----TGFLSFGSSQSKSAKFTPLSSGP-------SSFYNLDLTGITVGGQK 325
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
+ IP S G I+DSG+ T + + A+ F + M +Y + S L
Sbjct: 326 LAIPLSVF-----STAGTIIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPL---SIL 377
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C+D S K++ +P++++ F GG + + F G + +CL AG R
Sbjct: 378 DTCYDFSKYKTIKVPKIVISFSGGVDVDVDQAGIFVANGLKQVCLAF-----AGNTGARD 432
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A I G+ Q +NF + +D++ + GFA C+
Sbjct: 433 TA-IFGNTQQRNFEVVYDVSGGKVGFAPASCS 463
>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
Length = 334
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 149/337 (44%), Gaps = 39/337 (11%)
Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-- 209
+P P SSS+ + C + C + P + S G + C SY YG
Sbjct: 12 LPLLYPTSSSSAAFVACGDRTCGELPRP-LCSNVAGGGSGSGNC-----SYHYAYGNARD 65
Query: 210 ---FTAGLLLSETLRF--PSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGL 261
+T G+L++ET F + P GC++ S+ +G+ G GR SL +QL +
Sbjct: 66 THHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNV 125
Query: 262 KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
+ F Y L S +P+S + D G+GDS TP NPV FYYVG
Sbjct: 126 EAFGYRLSSDLSAPSPISFGSLADVTGGNGDS----FMSTPLLTNPVVQDL---PFYYVG 178
Query: 322 LRQIIVGSKHVKIPY-SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L I VG K V+IP ++ S G GGVI DSG+T T + P + V E + QMG +
Sbjct: 179 LTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMG-FQ 237
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCL 436
+ L CF G + P ++L F GGA M L ENY + G C
Sbjct: 238 KPPPAANDDDLI-CF-TGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCW 295
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
+ + A I+G+ +F++ FDL+ +
Sbjct: 296 SVVKSSQA--------LTIIGNIMQMDFHVVFDLSGN 324
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 121/439 (27%), Positives = 181/439 (41%), Gaps = 59/439 (13%)
Query: 59 LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQAST 118
L + +RA +L T+ P + G + S S + + L S G Y + +S G+PP
Sbjct: 129 LVARDNARAEYLATRLSPAYQPP--GFSGSESKVVSGLDEGS-GEYLVRVSVGSPPTEQY 185
Query: 119 PFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI-F 177
+ D+GS ++W C C++C + DP F P S++ + C + C +
Sbjct: 186 -LVVDSGSDVMWVQCKP---CLEC-YVQADP----LFDPATSATFSGVSCGSAICRILPT 236
Query: 178 GPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSI 236
+ GC Y + Y G +T G L ETL V + GC
Sbjct: 237 SACGDGELGGCE------------YEVSYADGSYTKGALALETLTLGGTAVEGVVIGCGH 284
Query: 237 LSDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR-------KFDDAPVSSNLV 283
+ AG+ G G SL QLG + FSYCL SR DDA LV
Sbjct: 285 RNRGLFVGAAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASRGGYGSGAADDDA---GWLV 341
Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
L G ++ G + P +NP S FYYVGL I VG + + +
Sbjct: 342 L----GRSEAVPEGAVWVPLVRNPRAPS-----FYYVGLSGIEVGDERLPLQAGLFQLTE 392
Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
DG G V++D+G+T T + + A+ F+ + A S L C+D+SG SV
Sbjct: 393 DGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSGYASV 452
Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
+P + F G A++ L N V + CL F +++G + I+G+ Q
Sbjct: 453 RVPTVSFCFDGDARLILAARNVLLEVDMGIYCLA-FAPSSSGLS-------IMGNTQQAG 504
Query: 464 FYLEFDLANDRFGFAKQKC 482
+ D AN GF C
Sbjct: 505 IQITVDSANGYIGFGPANC 523
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 170/387 (43%), Gaps = 52/387 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS + W C C DC + DP F P SS
Sbjct: 160 GEYFSRIGVGTPAK-EMYLVLDTGSDVNWIQCEP---CSDC-YQQSDP----VFNPTSSS 210
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ + + C P+CS + C R+ C Y + YG G FT G L ++T+
Sbjct: 211 TYKSLTCSAPQCSLL-------ETSAC--RSNKCL-----YQVSYGDGSFTVGELATDTV 256
Query: 221 RFPSKTVPNFLA-GCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
F + N +A GC ++ AG+ G G + S+ +Q+ FSYCL+ R D+
Sbjct: 257 TFGNSGKINDVALGCGHDNEGLFTGAAGLLGLGGGALSITNQMKATSFSYCLVDR---DS 313
Query: 277 PVSSNLVLDTGP-GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
SS+L ++ GSGD+ P L + FYYVGL VG + V +P
Sbjct: 314 GKSSSLDFNSVQLGSGDATAPLLR-----------NQKIDTFYYVGLSGFSVGGQKVMMP 362
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
+ + G+GGVI+D G+ T ++ + ++ F++ N + S C+
Sbjct: 363 DAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTTNLKKG--TSSISLFDTCY 420
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
D S SV +P + F GG + LP +NY V + F ++ + I
Sbjct: 421 DFSSLSSVKVPTVAFHFTGGKSLDLPAKNYLIPVDDNGTFCFAFAPTSSSLS-------I 473
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
+G+ Q Q + +DLAN G + KC
Sbjct: 474 IGNVQQQGTRITYDLANKIIGLSGNKC 500
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 118/390 (30%), Positives = 162/390 (41%), Gaps = 64/390 (16%)
Query: 104 YSISLSFGTPPQASTPFI--FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y + + GTP Q P + DT + W PC+ CV C + F P +SS
Sbjct: 91 YIVRANIGTPAQ---PMLVALDTSNDAAWVPCSG---CVGC-------ASSVLFDPSKSS 137
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
SS+ + C P+C P + K+C + + YG L +TL
Sbjct: 138 SSRNLQCDAPQCKQAPNPTCTA--------GKSC-----GFNMTYGGSTIEASLTQDTLT 184
Query: 222 FPSKTVPNFLAGC--SILSDRQPA-GIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDD 275
+ + ++ GC PA G+ G GR SL SQ L + FSYCL + K
Sbjct: 185 LANDVIKSYTFGCISKATGTSLPAQGLMGLGRGPLSLISQTQNLYMSTFSYCLPNSK--- 241
Query: 276 APVSSNLV--LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
SSN L GP + + TP KNP SS YYV L I VG+K V
Sbjct: 242 ---SSNFSGSLRLGPKYQPVR---IKTTPLLKNPRRSS-----LYYVNLVGIRVGNKIVD 290
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
IP S L + G I DSG+ FT + P + AV EF R++ N A+ G
Sbjct: 291 IPTSALAFDASTGAGTIFDSGTVFTRLVEPAYVAVRNEFRRRIKN----ANATSLGGFDT 346
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGP 452
C+ SG SV P + F G + LPP+N + CL + AA P
Sbjct: 347 CY--SG--SVVYPSVTFMF-AGMNVTLPPDNLLIHSSSGSTSCLAM----AAAPNNVNSV 397
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++ Q QN + DL N R G +++ C
Sbjct: 398 LNVIASMQQQNHRVLIDLPNSRLGISRETC 427
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 166/387 (42%), Gaps = 61/387 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y I++ G+P ++ T I DTGS + W C C C+ DP F P SS+
Sbjct: 133 YLITVRLGSPGKSQTMLI-DTGSDVSWVQCKP---CSQCH-SQADP----LFDPSSSSTY 183
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
C + C+ + GCS + C Y + YG G T G S+TL
Sbjct: 184 SPFSCSSAACAQL-----GQEGNGCS--SSQC-----QYTVTYGDGSSTTGTYSSDTLAL 231
Query: 223 PSKTVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDA 276
S V F GCS + + Q G+ G G ++SL SQ FSYCL A
Sbjct: 232 GSNAVRKFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSYCL------PA 285
Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
SS+ L G G+ + F K P+ SS FY V ++ I VG + + IP
Sbjct: 286 TSSSSGFLTLGAGT----------SGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPT 335
Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG-LRPCF 395
S + G I+DSG+ T + + A++ F M Y A SG L CF
Sbjct: 336 SVF------SAGTIMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSA----PPSGILDTCF 385
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
D SG+ SV +P + L F GGA + + + N +LCL F N+ +LG I
Sbjct: 386 DFSGQSSVSIPTVALVFSGGAVVDIASDGIMLQTSNSILCLA-FAANSDDSSLG-----I 439
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
+G+ Q + F + +D+ GF C
Sbjct: 440 IGNVQQRTFEVLYDVGGGAVGFKAGAC 466
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 110/378 (29%), Positives = 163/378 (43%), Gaps = 56/378 (14%)
Query: 114 PQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
PQ + F+ DTGS + W PC + C + P F P+ SSS + C +
Sbjct: 6 PQQPSFFVLDTGSDVTWLQCLPCAGKNGCYE--------QITPIFDPELSSSYNPVSCDS 57
Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVP 228
+C + C + Y ++YG G FT G L +ETL F S ++P
Sbjct: 58 EQCQLL--------------DEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIP 103
Query: 229 NFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLD 285
N GC ++ G+ G G + S+ SQL FSYCL+ D+P S L +
Sbjct: 104 NISIGCGHDNEGLFVGADGLIGLGGGAISISSQLKASSFSYCLVDI---DSPSFSTLDFN 160
Query: 286 TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG 345
T P S DS +P KN F F YV + + VG K + I S G
Sbjct: 161 TDPPS-DSLI-----SPLVKN-----DRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESG 209
Query: 346 NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYL 405
GG+IVDSG+T T + ++E + + F+ N A ++ S C+D+S + +V +
Sbjct: 210 LGGIIVDSGTTITQLPSDVYEVLREAFLGLTTNLPPAPEI---SPFDTCYDLSSQSNVEV 266
Query: 406 PELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
P + G + LP +N V + CL + P I+G+FQ Q
Sbjct: 267 PTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFVS--------ATFPLSIIGNFQQQGI 318
Query: 465 YLEFDLANDRFGFAKQKC 482
+ +DL N GF+ KC
Sbjct: 319 RVSYDLTNSLVGFSTNKC 336
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 107/390 (27%), Positives = 172/390 (44%), Gaps = 46/390 (11%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
+SL GTPPQ + DTGS L W C + P +F P SSS +
Sbjct: 82 VSLPIGTPPQTQQ-MVLDTGSQLSWIQCHKKSV-------PKKPPPTTSFDPSLSSSFSV 133
Query: 166 IGCQNPKCS-WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS 224
+ C +P C I + + C +N+ C SY G + G L+ E + F S
Sbjct: 134 LPCNHPLCKPRIPDFTLPTTCD----QNRLCHY---SYFYADGT-YAEGSLVREKITFSS 185
Query: 225 -KTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSS-NL 282
++ P + GC+ S + GI G S SQ + KFSYC+ +R+ S+ +
Sbjct: 186 SQSTPPLILGCAEASTDE-KGILGMNLGRRSFASQAKISKFSYCVPTRQARAGLSSTGSF 244
Query: 283 VLDTGPGSGDSKTPGL-SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVP 341
L P SG + L ++TP ++P A Y + ++ I +G+ + I + P
Sbjct: 245 YLGNNPNSGRFQYINLLTFTPSQRSPNLDPLA----YTIPMQGIRMGNARLNISATLFRP 300
Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-------NYSRAADVEKKSGLRPC 394
G G I+DSGS FT++ + V +E +R +G Y +D+ C
Sbjct: 301 DPSGAGQTIIDSGSEFTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVSDM--------C 352
Query: 395 FDISGKK-SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
FD + + + ++ +F+ G ++ + A VG V C+ + G A +
Sbjct: 353 FDGNPMEIGRLIGNMVFEFEKGVEIVIDKWRVLADVGGGVHCIGIGRSEMLGAA-----S 407
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G+F QN ++E+DLAN R G K C+
Sbjct: 408 NIIGNFHQQNLWVEYDLANRRIGLGKADCS 437
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 114/394 (28%), Positives = 170/394 (43%), Gaps = 63/394 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS + W C C +C + DP F P S+
Sbjct: 155 GEYFTRIGVGTPTREQY-MVLDTGSDVAWIQCEP---CREC-YSQADP----IFNPSYSA 205
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S +GC + CS + + S GC Y YG G ++ G +ETL
Sbjct: 206 SFSTVGCDSAVCSQLDAYDCHS--GGCL------------YEASYGDGSYSTGSFATETL 251
Query: 221 RFPSKTVPNFLAGCSILSDRQPAGI-------AGFGRSSESLPSQLGLKK---FSYCLLS 270
F + +V N GC + G+ G G + S P+Q+G + FSYCL+
Sbjct: 252 TFGTTSVANVAIGCG----HKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSYCLVD 307
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
R+ D S+ L GP S G +TP KNP FYY+ + I VG
Sbjct: 308 RESD-----SSGPLQFGP---KSVPVGSIFTPLEKNP-----HLPTFYYLSVTAISVGGA 354
Query: 331 HVKI--PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
+ P + + + G+GG I+DSG+ T + ++AV F+ G R V
Sbjct: 355 LLDSIPPEVFRIDETSGHGGFIIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAV--- 411
Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
S C+D+SG + V +P + F GA + LP +NY L+ + + F A ++
Sbjct: 412 SIFDTCYDLSGLQFVSVPTVGFHFSNGASLILPAKNY--LIPMDTVGTFCFAFAPAASSV 469
Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q Q+ + FD AN GFA +C
Sbjct: 470 S-----IMGNTQQQHIRVSFDSANSLVGFAFDQC 498
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 165/388 (42%), Gaps = 61/388 (15%)
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
+ + GTPPQA++ FI TG LVW C+ +C+ C +P F+P SS+ +
Sbjct: 57 NFTIGTPPQAASAFIDLTGE-LVWTQCS---QCIHCF-----KQDLPVFVPNASSTFKPE 107
Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSK 225
C C I P S C +Y GLG T G++ ++T +
Sbjct: 108 PCGTDVCKSIPTPKCAS---------DVC-----AYDGVTGLGGHTVGIVATDTFAIGTA 153
Query: 226 TVPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSN 281
+ GC + SD P+G G GR+ SL +Q+ L +FSYCL D +S
Sbjct: 154 APASLGFGCVVASDIDTMGGPSGFIGLGRTPWSLVAQMKLTRFSYCLAPH---DTGKNSR 210
Query: 282 LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVP 341
L L G+ G ++TPF K + ++Y + L +I G + +P
Sbjct: 211 LFL----GASAKLAGGGAWTPFVKT--SPNDGMSQYYPIELEEIKAGDATITMPR----- 259
Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG--LRPCFDISG 399
G V+V + + L ++V +EF + + AA G CF +G
Sbjct: 260 ---GRNTVLVQTAVVRVSL---LVDSVYQEFKKAVMASVGAAPTATPVGAPFEVCFPKAG 313
Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL----ILFTDNAAGPALGRGPAII 455
P+L+ F+ GA + +PP NY VGN+ +CL I + A L I
Sbjct: 314 VSGA--PDLVFTFQAGAALTVPPANYLFDVGNDTVCLSVMSIALLNITALDGLN-----I 366
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
LG FQ +N +L FDL D F C+
Sbjct: 367 LGSFQQENVHLLFDLDKDMLSFEPADCS 394
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 121/426 (28%), Positives = 177/426 (41%), Gaps = 68/426 (15%)
Query: 68 RHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSS 127
RHL KP + GS+ + + + G Y + + G+PP+ + + D+GS
Sbjct: 105 RHLAAG-KPTYAEEAFGSDVVSGMEQ------GSGEYFVRIGVGSPPR-NQYVVIDSGSD 156
Query: 128 LVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV-ES 183
++W PCT Y D P F P SSS + C + CS + E
Sbjct: 157 IIWVQCEPCTQCYHQSD-----------PVFNPADSSSYAGVSCASTVCSHVDNAGCHEG 205
Query: 184 RCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQ- 241
RC+ Y + YG G +T G L ETL F + N GC +
Sbjct: 206 RCR---------------YEVSYGDGSYTKGTLALETLTFGRTLIRNVAIGCGHHNQGMF 250
Query: 242 --PAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP 296
AG+ G G S QLG + FSYCL+SR + S+ +L G ++
Sbjct: 251 VGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRG-----IQSSGLLQFGR---EAVPV 302
Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
G ++ P NP S FYYVGL + VG V I G+GGV++D+G+
Sbjct: 303 GAAWVPLIHNPRAQS-----FYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVMDTGTA 357
Query: 357 FTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGA 416
T + +EA FI Q N RA+ V S C+D+ G SV +P + F GG
Sbjct: 358 VTRLPTAAYEAFRDAFIAQTTNLPRASGV---SIFDTCYDLFGFVSVRVPTVSFYFSGGP 414
Query: 417 KMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
+ LP N+ V + F +++G + I+G+ Q + + D AN G
Sbjct: 415 ILTLPARNFLIPVDDVGSFCFAFAPSSSGLS-------IIGNIQQEGIEISVDGANGFVG 467
Query: 477 FAKQKC 482
F C
Sbjct: 468 FGPNVC 473
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 132/443 (29%), Positives = 184/443 (41%), Gaps = 79/443 (17%)
Query: 66 RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
RAR + + P+ L H ++SL+ GTPPQ T + DTG
Sbjct: 61 RARQMPARALPRQPSK--------------LRFHHNVSLTVSLAVGTPPQNVT-MVLDTG 105
Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPA--FIPKRSSSSQLIGCQNPKCSWIFGPNVES 183
S L W C P ++ A F P+ SS+ + C + +C P+ +
Sbjct: 106 SELSWLLCA----------PAGARNKFSAMSFRPRASSTFAAVPCASAQCRSRDLPSPPA 155
Query: 184 RCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILS---DR 240
C G S R C S L Y G ++ L+ + F + P A +S D
Sbjct: 156 -CDGASSR-------C-SVSLSYADGSSSDGALATDV-FAVGSGPPLRAAFGCMSSAFDS 205
Query: 241 QPAGIA-----GFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKT 295
P G+A G R + S SQ ++FSYC+ R DDA V L+L G S
Sbjct: 206 SPDGVASAGLLGMNRGALSFVSQASTRRFSYCISDR--DDAGV---LLL------GHSDL 254
Query: 296 PG---LSYTPFYKNPVGSSSAFGEFYY-VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
P L+YTP Y+ P F Y V L I VG KH+ IP S L P G G +V
Sbjct: 255 PTFLPLNYTPMYQ-PALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMV 313
Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK---KSGLRPCFDISGKKS---VYL 405
DSG+ FTF+ G + A+ EF RQ A D + CF + +S L
Sbjct: 314 DSGTQFTFLLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARL 373
Query: 406 PELILKFKGGAKMALPPENYFALV------GNEVLCLILFTDNAAGPALGRGPAIILGDF 459
P + L F GA+MA+ + V G+ V CL F + P + A ++G
Sbjct: 374 PGVTLLFN-GAEMAVAGDRLLYKVPGERRGGDGVWCLT-FGNADMVPIM----AYVIGHH 427
Query: 460 QLQNFYLEFDLANDRFGFAKQKC 482
N ++E+DL R G A +C
Sbjct: 428 HQMNVWVEYDLERGRVGLAPVRC 450
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 116/398 (29%), Positives = 167/398 (41%), Gaps = 61/398 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDC-NFPNVDPSRIPAFIPKRSSS 162
Y + L+ GTPPQ T + DTGS L+W C + C C P+ P F P+ SSS
Sbjct: 98 YVLDLAVGTPPQPITALL-DTGSDLIWTQCDT---CTACLRQPD------PLFSPRMSSS 147
Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLR 221
+ + C C G + C R TC +Y YG G T G +E
Sbjct: 148 YEPMRCAGQLC----GDILHHSCV----RPDTC-----TYRYSYGDGTTTLGYYATERFT 194
Query: 222 FPS-----KTVPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKF 273
F S ++VP GC + S +GI GFGR SL SQL +++FSYCL
Sbjct: 195 FASSSGETQSVPLGF-GCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFSYCL----- 248
Query: 274 DDAPVSSNLVLDTGPGS-GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
P +S+ GS D + P P+ S+ FYYV + VG++ +
Sbjct: 249 --TPYASSRKSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRL 306
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
+IP S DG+GGVI+DSG+ T + V + F Q+ A G+
Sbjct: 307 RIPASAFALRPDGSGGVIIDSGTALTLFPVAVLAEVVRAFRSQL-RLPFANGSSPDDGV- 364
Query: 393 PCFDISG--------KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA 444
CF + V +P ++ F+ GA + LP ENY +L D+
Sbjct: 365 -CFAAPAVAAGGGRMARQVAVPRMVFHFQ-GADLDLPRENYVLEDHRRGHLCVLLGDSGD 422
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A +G+F Q+ + +DL + FA +C
Sbjct: 423 DGA-------TIGNFVQQDMRVVYDLERETLSFAPVEC 453
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 165/388 (42%), Gaps = 61/388 (15%)
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
+ + GTPPQA++ FI TG LVW C+ +C+ C +P F+P SS+ +
Sbjct: 27 NFTIGTPPQAASAFIDLTGE-LVWTQCS---QCIHCF-----KQDLPVFVPNASSTFKPE 77
Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSK 225
C C I P S C+ T GLG T G++ ++T +
Sbjct: 78 PCGTDVCKSIPTPKCASDV--CAFDGVT------------GLGGHTVGIVATDTFAIGTA 123
Query: 226 TVPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSN 281
+ GC + SD P+G G GR+ SL +Q+ L +FSYCL D +S
Sbjct: 124 APASLGFGCVVASDIDTMGGPSGFIGLGRTPWSLVAQMKLTRFSYCLAPH---DTGKNSR 180
Query: 282 LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVP 341
L L G+ G ++TPF K + ++Y + L +I G + +P
Sbjct: 181 LFL----GASAKLAGGGAWTPFVKT--SPNDGMSQYYPIELEEIKAGDATITMPR----- 229
Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG--LRPCFDISG 399
G V+V + + L ++V +EF + + AA G CF +G
Sbjct: 230 ---GRNTVLVQTAVVRVSL---LVDSVYQEFKKAVMASVGAAPTATPVGEPFEVCFPKAG 283
Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL----ILFTDNAAGPALGRGPAII 455
P+L+ F+ GA + +PP NY VGN+ +CL I + A L I
Sbjct: 284 VSGA--PDLVFTFQAGAALTVPPANYLFDVGNDTVCLSVMSIALLNITALDGLN-----I 336
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
LG FQ +N +L FDL D F C+
Sbjct: 337 LGSFQQENVHLLFDLDKDMLSFEPADCS 364
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 112/392 (28%), Positives = 161/392 (41%), Gaps = 73/392 (18%)
Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSS 162
G+S+++ P + I DTGS L+W C K SSS
Sbjct: 42 GHSLTVGIVQPRK----LIVDTGSDLIWTQC------------------------KLSSS 73
Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLR 221
+ + G P ++T P ++ A G+L SET
Sbjct: 74 T-----------------AAAARHGSPPLSRTAPARTGAFTRTCTASAAAVGVLASETFT 116
Query: 222 FPSKTVPNFLAG--CSILSDRQ---PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
F ++ + G C LS GI G S SL +QL +++FSYCL F D
Sbjct: 117 FGARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYCLT--PFADK 174
Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
S L S T + T NPV + +YYV L I +G K + +P
Sbjct: 175 KTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETV-----YYYVPLVGISLGHKRLAVPA 229
Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
+ L DG GG IVDSGST ++ FEAV KE + + A + L CF
Sbjct: 230 ASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAV-KEAVMDVVRLPVANRTVEDYEL--CFV 286
Query: 397 I------SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
+ + ++V +P L+L F GGA M LP +NYF ++CL A G
Sbjct: 287 LPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCL------AVGKTTDG 340
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q QN ++ FD+ + +F FA +C
Sbjct: 341 SGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 372
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 112/407 (27%), Positives = 171/407 (42%), Gaps = 68/407 (16%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y +++ GTPP+ T +FDTGS L W +C+ C + P + P F P +SS+
Sbjct: 122 YVVTIGIGTPPRNFT-VLFDTGSDLTWV------QCLPCPDSSCYPQQEPLFDPSKSSTY 174
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
+ C P+C G ++RC S C Y ++YG T G L ET
Sbjct: 175 VDVPCSAPECH--IGGVQQTRCGATS---------C-EYSVKYGDESETHGSLAEETFTL 222
Query: 223 --PSKTVP---NFLAGCS-----ILSD--RQPAGIAGFGRSSESLPSQL------GLKKF 264
PS P + GCS + +D AG+ G GR S+ SQ G F
Sbjct: 223 SPPSPLAPAATGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVF 282
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
SYCL R + L + G + + LS+TP + + S Y V L
Sbjct: 283 SYCLPPR----GSSTGYLTIGGGAAAPQQQYSNLSFTPL----ITTISQLRSAYVVNLAG 334
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
+ V V IP S + G ++DSG+ T M + + EF MG+Y +
Sbjct: 335 VSVNGAAVDIPASAF------SLGAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPE 388
Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE--------VLCL 436
K L C+D++G+ V P + L+F GGA++ + ++ E + CL
Sbjct: 389 GSMKL-LDTCYDVTGQDVVTAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACL 447
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N+AG +I+G+ Q + + + FD+ R GF C+
Sbjct: 448 AFLPTNSAG-------LVIVGNMQQRAYNVVFDVDGGRIGFGPNGCS 487
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 127/449 (28%), Positives = 191/449 (42%), Gaps = 79/449 (17%)
Query: 51 DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLS----VHSYGGYSI 106
D +I+ L S A L T+ KPK K N +N + P++ + S Y
Sbjct: 38 DTARIVSMLTSG----AGPLTTRAKPKPK------NRANPPV--PIAPGRQILSIPNYIA 85
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
GTP Q + D + W PC++ C C + P+F P +SS+ + +
Sbjct: 86 RAGLGTPAQ-TLLVAIDPSNDAAWVPCSA---CAGCA------ASSPSFSPTQSSTYRTV 135
Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS---YLLQYGLGFTAGLLLSETLRFP 223
C +P+C+ + P +CP S + L Y +L ++L
Sbjct: 136 PCGSPQCAQVPSP--------------SCPAGVGSSCGFNLTYAASTFQAVLGQDSLALE 181
Query: 224 SKTVPNFLAGC-SILSDRQ--PAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAP 277
+ V ++ GC ++S P G+ GFGR S SQ FSYCL + +
Sbjct: 182 NNVVVSYTFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYR----- 236
Query: 278 VSSNL--VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
SSN L GP G K + TP NP S YYV + I VGSK V++P
Sbjct: 237 -SSNFSGTLKLGP-IGQPKR--IKTTPLLYNPHRPS-----LYYVNMIGIRVGSKVVQVP 287
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
S L G I+D+G+ FT + P++ AV F ++ R G C+
Sbjct: 288 QSALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRV----RTPVAPPLGGFDTCY 343
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAI 454
++ +V +P + F G + LP EN + V CL + AAGP+ G A+
Sbjct: 344 NV----TVSVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAM----AAGPSDGVNAAL 395
Query: 455 -ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+L Q QN + FD+AN R GF+++ C
Sbjct: 396 NVLASMQQQNQRVLFDVANGRVGFSRELC 424
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 130/490 (26%), Positives = 201/490 (41%), Gaps = 80/490 (16%)
Query: 24 GAGSSAATVTV-----PLTPLSTKHYLHHSDSDPL--------KILHSLASSSLSRARHL 70
GA SS +T+ P +PL+ H S D L I H +++++ +R
Sbjct: 78 GATSSGTRMTIVHRHGPCSPLAAAHGKPPSHEDILAADQNRAESIQHRVSTTATARGNPK 137
Query: 71 KTKTKPKTKDSNIGSNYSNSLIKTPLSVH--------SYGGYSISLSFGTPPQASTPFIF 122
+++ P + + + + + + G Y +++ GTP T +F
Sbjct: 138 RSRRAPSRRQQPSSAPAPAASLSSSTASLPASSGRALGTGNYVVTVGLGTPASRYT-VVF 196
Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVE 182
DTGS W C CV + + F P RSS+ + C P C +
Sbjct: 197 DTGSDTTWVQCQP---CVVVCYEQQEK----LFDPARSSTYANVSCAAPAC-------FD 242
Query: 183 SRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPS-KTVPNFLAGCSILSDR 240
+GCS + Y +QYG G ++ G +TL S V F GC ++
Sbjct: 243 LDTRGCSGGHCL-------YGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEG 295
Query: 241 ---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
+ AG+ G GR SLP Q K F++CL +R S LD GPGS +
Sbjct: 296 LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARS------SGTGYLDFGPGSPAAA 349
Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
L+ TP + + FYYVG+ I VG + + IP S G IVDSG
Sbjct: 350 GARLT-TPMLTDNGPT------FYYVGMTGIRVGGQLLSIPQSVFA-----TAGTIVDSG 397
Query: 355 STFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
+ T + P + ++ F+ M Y +A V S L C+D +G V +P + L F
Sbjct: 398 TVITRLPPPAYSSLRSAFVSAMAARGYKKAPAV---SLLDTCYDFTGMSQVAIPTVSLLF 454
Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAN 472
+GGA + + +CL F N G +G I+G+ QL+ F + +D+
Sbjct: 455 QGGAILDVDASGIMYAASVSQVCL-GFAANEDGGDVG-----IVGNTQLKTFGVAYDIGK 508
Query: 473 DRFGFAKQKC 482
GF+ C
Sbjct: 509 KVVGFSPGAC 518
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 127/449 (28%), Positives = 191/449 (42%), Gaps = 79/449 (17%)
Query: 51 DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLS----VHSYGGYSI 106
D +I+ L S A L T+ KPK K N +N + P++ + S Y
Sbjct: 57 DTARIVSMLTSG----AGPLTTRAKPKPK------NRANPPV--PIAPGRQILSIPNYIA 104
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
GTP Q + D + W PC++ C C + P+F P +SS+ + +
Sbjct: 105 RAGLGTPAQ-TLLVAIDPSNDAAWVPCSA---CAGCA------ASSPSFSPTQSSTYRTV 154
Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS---YLLQYGLGFTAGLLLSETLRFP 223
C +P+C+ + P +CP S + L Y +L ++L
Sbjct: 155 PCGSPQCAQVPSP--------------SCPAGVGSSCGFNLTYAASTFQAVLGQDSLALE 200
Query: 224 SKTVPNFLAGC-SILSDRQ--PAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAP 277
+ V ++ GC ++S P G+ GFGR S SQ FSYCL + +
Sbjct: 201 NNVVVSYTFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYR----- 255
Query: 278 VSSNL--VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
SSN L GP G K + TP NP S YYV + I VGSK V++P
Sbjct: 256 -SSNFSGTLKLGP-IGQPKR--IKTTPLLYNPHRPS-----LYYVNMIGIRVGSKVVQVP 306
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
S L G I+D+G+ FT + P++ AV F ++ R G C+
Sbjct: 307 QSALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRV----RTPVAPPLGGFDTCY 362
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAI 454
++ +V +P + F G + LP EN + V CL + AAGP+ G A+
Sbjct: 363 NV----TVSVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAM----AAGPSDGVNAAL 414
Query: 455 -ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+L Q QN + FD+AN R GF+++ C
Sbjct: 415 NVLASMQQQNQRVLFDVANGRVGFSRELC 443
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 136/445 (30%), Positives = 193/445 (43%), Gaps = 58/445 (13%)
Query: 47 HSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSI 106
DS +K + SLA+ S R TK P++ +S ++I + LS S G Y +
Sbjct: 91 QRDSLRVKSITSLAAVSTGRN---ATKRTPRS-----AGGFSGAVI-SGLSQGS-GEYFM 140
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDC-NFPNVDPSRIPAFIPKRSSSSQL 165
L GTP + + DTGS +VW C+ C C N +V F PK+S +
Sbjct: 141 RLGVGTPA-TNVYMVLDTGSDVVWLQCSP---CKACYNQSDV------IFDPKKSKTFAT 190
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPS 224
+ C + C + + S C + R+KTC Y + YG G FT G +ETL F
Sbjct: 191 VPCGSRLCRRL---DDSSEC--VTRRSKTCL-----YQVSYGDGSFTEGDFSTETLTFHG 240
Query: 225 KTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPV 278
V + GC ++ AG+ G GR S PSQ + KFSYCL+ R +
Sbjct: 241 ARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKSRYNGKFSYCLVDRTSSGSSS 300
Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-KIPYS 337
+ G D+ +TP NP FYY+ L I VG V + S
Sbjct: 301 KPPSTIVFG---NDAVPKTSVFTPLLTNP-----KLDTFYYLQLLGISVGGSRVPGVSES 352
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
+ GNGGVI+DSG++ T + + A+ F RA S CFD+
Sbjct: 353 QFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATKLKRAPSY---SLFDTCFDL 409
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILG 457
SG +V +P ++ F GG +++LP NY V E F AG G I+G
Sbjct: 410 SGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFAF----AGTM---GSLSIIG 461
Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
+ Q Q F + +DL R GF + C
Sbjct: 462 NIQQQGFRVAYDLVGSRVGFLSRAC 486
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 133/453 (29%), Positives = 192/453 (42%), Gaps = 65/453 (14%)
Query: 47 HSDSDPLKILHS-----LASSSLSRAR-HLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHS 100
H PL+ ++S L S S R L T ++K+S + SN +++ +V +
Sbjct: 78 HGACSPLRPINSSSWIDLVSQSFERDNARLNTI---RSKNSGPYTTMSNLPLQSGTTVGT 134
Query: 101 YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
G Y ++ FGTP + S I DTGS L W C C DC + VD F PK+S
Sbjct: 135 -GNYIVTAGFGTPAKNSL-LIIDTGSDLTWIQCKP---CADC-YSQVDA----IFEPKQS 184
Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKT-CPLACPSYLLQYGLGFTA-GLLLSE 218
SS + + C + C+ + S N T C L Y + YG G ++ G E
Sbjct: 185 SSYKTLPCLSATCTELI----------TSESNPTPCLLGGCVYEINYGDGSSSQGDFSQE 234
Query: 219 TLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRK 272
TL S + NF GC + + +G+ G G++S S PSQ K +F+YCL
Sbjct: 235 TLTLGSDSFQNFAFGCGHTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAYCLP--- 291
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
D +S G GS + +TP N + + FY+VGL I VG +
Sbjct: 292 -DFGSSTSTGSFSVGKGSIPASA---VFTPLVSNFM-----YPTFYFVGLNGISVGGDRL 342
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
IP P G G IVDSG+ T + + A+ F + + A + S L
Sbjct: 343 SIP-----PAVLGRGSTIVDSGTVITRLLPQAYNALKTSFRSKTRDLPSA---KPFSILD 394
Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV--GNEVLCLILFTDNAAGPALGR 450
C+D+S V +P + F+ A +A+ V G +CL A A
Sbjct: 395 TCYDLSRHSQVRIPTITFHFQNNADVAVSDVGILVPVQNGGSQVCL------AFASASQM 448
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G+FQ Q + FD R GFA CA
Sbjct: 449 DGFNIIGNFQQQRMRVAFDTGAGRIGFASGSCA 481
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 115/392 (29%), Positives = 165/392 (42%), Gaps = 61/392 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPK 158
G Y + + G+PP+ S + D+GS +VW PCT Y D P F P
Sbjct: 41 GEYFVRIGVGSPPR-SQYMVIDSGSDIVWVQCKPCTQCYHQTD-----------PLFDPA 88
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVES-RCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLL 216
S+S + C + C + S RC+ Y + YG G T G L
Sbjct: 89 DSASFMGVSCSSAVCDQVDNAGCNSGRCR---------------YEVSYGDGSSTKGTLA 133
Query: 217 SETLRFPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKK---FSYCLLS 270
ETL V N GC ++ AG+ G G S S QL ++ FSYCL+S
Sbjct: 134 LETLTLGRTVVQNVAIGCGHMNQGMFVGAAGLLGLGGGSMSFVGQLSRERGNAFSYCLVS 193
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
R +SN L+ G ++ G ++ P +NP S +YY+GL + VG
Sbjct: 194 RV-----TNSNGFLEFGS---EAMPVGAAWIPLIRNPHSPS-----YYYIGLSGLGVGDM 240
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
V I GNGGV++D+G+ T +EA FI Q GN RA+ V S
Sbjct: 241 KVPISEDIFELTELGNGGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGNLPRASGV---SI 297
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
C+++ G SV +P + F GG + LP N+ V + F + +G +
Sbjct: 298 FDTCYNLFGFLSVRVPTVSFYFSGGPILTLPANNFLIPVDDAGTFCFAFAPSPSGLS--- 354
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
ILG+ Q + + D AN+ GF C
Sbjct: 355 ----ILGNIQQEGIQISVDGANEFVGFGPNVC 382
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 123/454 (27%), Positives = 180/454 (39%), Gaps = 57/454 (12%)
Query: 40 STKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPL-SV 98
S H +H S PL+ + +LA +R L +K S + P+ S
Sbjct: 24 SVYHNVHPPSSSPLESIIALAREDDARLLFLSSKAA------------STGVSSAPVASG 71
Query: 99 HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
S Y + G+P Q DT + W C+ C C PS F P
Sbjct: 72 QSPPSYVVRAGLGSPAQPIL-LALDTSADATWAHCSP---CGTC------PSSGSLFAPA 121
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
S+S + C + C+ + G C P + + PL ++ + L S+
Sbjct: 122 NSTSYAPLPCSSTMCTVLQG----QPCPAQDPYDSSAPLPMCAFTKPFADASFQASLASD 177
Query: 219 TLRFPSKTVPNFLAGC-SILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLS 270
L +PN+ GC S +S + G+ G GR +L SQ+G FSYCL S
Sbjct: 178 WLHLGKDAIPNYAFGCVSAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPS 237
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
K S +L L G + P G+ YTP KNP SS YYV + + VG
Sbjct: 238 YK--SYYFSGSLRL------GAAGQPRGVRYTPMLKNPNRSS-----LYYVNVTGLSVGR 284
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
VK+P G +VDSG+ T P++ A+ +EF R + S +
Sbjct: 285 APVKVPAGSFAFDPATGAGTVVDSGTVITRWTPPVYAALREEFRRHVAAPSGYTSL---G 341
Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPAL 448
CF+ + P + + GG +ALP EN + CL + A P
Sbjct: 342 AFDTCFNTDEVAAGVAPAVTVHMDGGLDLALPMENTLIHSSATPLACLAM----AEAPQN 397
Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+L + Q QN + FD+AN R GFA++ C
Sbjct: 398 VNAVVNVLANLQQQNLRVVFDVANSRVGFARESC 431
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 133/485 (27%), Positives = 203/485 (41%), Gaps = 77/485 (15%)
Query: 24 GAGSSAATVTV-----PLTPLSTKH--------YLHHSDSDPLKILHSLASSSLSRA--- 67
GA SS +T+ P +PL+ H L + I H +++++ SR
Sbjct: 83 GATSSTTRMTIVHRHGPCSPLAAAHSKPPSHDEILAADQNRAESIQHRVSTTATSRGQPK 142
Query: 68 RHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSS 127
R + + + S+ + SL +P G Y +++ GTP T +FDTGS
Sbjct: 143 RSRRQQPSSAPAPAASLSSSTASLPASPGRALGTGNYVVTVGLGTPASRYT-VVFDTGSD 201
Query: 128 LVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKG 187
W C CV + R F P RSS+ + C P CS ++++R G
Sbjct: 202 TTWVQCQP---CVVVCYEQ----REKLFDPARSSTYANVSCAAPACS-----DLDTR--G 247
Query: 188 CSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPS-KTVPNFLAGCSILSDR---QP 242
CS + Y +QYG G ++ G +TL S V F GC ++ +
Sbjct: 248 CSGGHCL-------YGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEA 300
Query: 243 AGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLS 299
AG+ G GR SLP Q K F++CL +R + LD G GS ++ L+
Sbjct: 301 AGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARS------TGTGYLDFGAGSPAAR---LT 351
Query: 300 YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTF 359
TP + + FYYVGL I VG + + IP S G IVDSG+ T
Sbjct: 352 TTPMLVDNGPT------FYYVGLTGIRVGGRLLYIPQSVFA-----TAGTIVDSGTVITR 400
Query: 360 MEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAK 417
+ + ++ F M Y +A V S L C+D +G V +P + L F+GGA+
Sbjct: 401 LPPAAYSSLRSAFAAAMSARGYKKAPAV---SLLDTCYDFAGMSQVAIPTVSLLFQGGAR 457
Query: 418 MALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
+ + +CL F N G +G I+G+ QL+ F + +D+ F
Sbjct: 458 LDVDASGIMYAASASQVCLA-FAANEDGGDVG-----IVGNTQLKTFGVAYDIGKKVVSF 511
Query: 478 AKQKC 482
+ C
Sbjct: 512 SPGAC 516
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 130/447 (29%), Positives = 189/447 (42%), Gaps = 62/447 (13%)
Query: 47 HSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSI 106
DS ++ L SLA+ S R TK P++ G ++ + LS S G Y +
Sbjct: 89 QRDSLRVESLTSLAAVSAGRN---VTKRPPRSAGGFSG------VVISGLSQGS-GEYFM 138
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
L GTP + + DTGS +VW C+ C + P F P +S + +
Sbjct: 139 RLGVGTPA-TNMYMVLDTGSDVVWLQCSPCKVCYN--------QSDPVFNPAKSKTFATV 189
Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSK 225
C + C + + S C S R+K C Y + YG G FT G +ETL F
Sbjct: 190 PCGSRLCRRL---DDSSEC--VSRRSKACL-----YQVSYGDGSFTVGDFSTETLTFHGA 239
Query: 226 TVPNFLAGCSILSDRQPAGI-----AGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAP 277
V + GC D + + G GR S PSQ + KFSYCL+ R +
Sbjct: 240 RVDHVALGCG--HDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSS 297
Query: 278 VSSNLVLDTGPGSGDSKTPGLS-YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-KIP 335
+ G G+ P + +TP NP FYY+ L I VG V +
Sbjct: 298 SKPPSTIVFGNGA----VPKTAVFTPLLTNP-----KLDTFYYLQLLGISVGGSRVPGVS 348
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
S + GNGGVI+DSG++ T + + A+ F ++G +R S CF
Sbjct: 349 ESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAF--RLGA-TRLKRAPSYSLFDTCF 405
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
D+SG +V +P ++ F GG +++LP NY V N+ F G I
Sbjct: 406 DLSGMTTVKVPTVVFHFTGG-EVSLPASNYLIPVNNQGRFCFAFAGTM-------GSLSI 457
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
+G+ Q Q F + +DL R GF + C
Sbjct: 458 IGNIQQQGFRVAYDLVGSRVGFLSRAC 484
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 121/430 (28%), Positives = 182/430 (42%), Gaps = 66/430 (15%)
Query: 68 RHLKTKTKPKTKDSNIGSNYSN--SLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
R LK K P N+ + S + + + S G Y + GTP + + DTG
Sbjct: 117 RKLKLKKDPAGSYENVAGVTAEFGSEVVSGMEQGS-GEYFTRIGIGTPTREQY-MVLDTG 174
Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
S +VW C C +C + DP F P S S +GC + CS + +
Sbjct: 175 SDVVWIQCEP---CREC-YSQADP----IFNPSSSVSFSTVGCDSAVCSQLDANDCHG-- 224
Query: 186 KGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAG 244
GC Y + YG G +T G +ETL F + ++ N GC G
Sbjct: 225 GGCL------------YEVSYGDGSYTVGSYATETLTFGTTSIQNVAIGCG----HDNVG 268
Query: 245 I-------AGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
+ G G S S P+QLG + FSYCL+ R + S+ L+ GP +S
Sbjct: 269 LFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDSE-----SSGTLEFGP---ESV 320
Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK-IP-YSYLVPGSDGNGGVIVD 352
G +TP NP FYY+ + I VG + +P ++ + + G GG+I+D
Sbjct: 321 PIGSIFTPLVANPF-----LPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIID 375
Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
SG+ T ++ ++A+ FI + RA + S C+D+S +SV +P + F
Sbjct: 376 SGTAVTRLQTSAYDALRDAFIAGTQHLPRADGI---SIFDTCYDLSALQSVSIPAVGFHF 432
Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAN 472
GA LP +N L+ + + F A L I+G+ Q Q + FD AN
Sbjct: 433 SNGAGFILPAKN--CLIPMDSMGTFCFAFAPADSNLS-----IMGNIQQQGIRVSFDSAN 485
Query: 473 DRFGFAKQKC 482
GFA +C
Sbjct: 486 SLVGFAIDQC 495
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 171/372 (45%), Gaps = 50/372 (13%)
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
I DTGS L W C RC + + P F P +S S + + C + C +
Sbjct: 80 IVDTGSDLSWVQCQPCNRCYN--------QQDPVFNPSKSPSYRTVLCNSLTCRSLQLAT 131
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSD 239
S G +P TC +Y++ YG G +T+G + E L + TV NF+ GC +
Sbjct: 132 GNSGVCGSNP--PTC-----NYVVNYGDGSYTSGEVGMEHLNLGNTTVNNFIFGCGRKNQ 184
Query: 240 ---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
+G+ G GR+ SL SQ+ FSYCL + +A S +LV+ +
Sbjct: 185 GLFGGASGLVGLGRTDLSLISQISPMFGGVFSYCLPTT---EAEASGSLVMGGNSSVYKN 241
Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDS 353
TP +SYT NP+ FY++ L I VG V+ P S G +I+DS
Sbjct: 242 TTP-ISYTRMIHNPL------LPFYFLNLTGITVGGVEVQAP-------SFGKDRMIIDS 287
Query: 354 GSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFK 413
G+ + + +++A+ EF++Q Y A L CF++SG + V +P++ + F+
Sbjct: 288 GTVISRLPPSIYQALKAEFVKQFSGYPSAPSFMI---LDSCFNLSGYQEVKIPDIKMYFE 344
Query: 414 GGAKMALPPENYFALVGNEV--LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
G A++ + F V + +CL + A+ P I+G++Q +N + +D
Sbjct: 345 GSAELNVDVTGVFYSVKTDASQVCLAI----ASLPY--EDEVGIIGNYQQKNQRIIYDTK 398
Query: 472 NDRFGFAKQKCA 483
GFA++ C+
Sbjct: 399 GSMLGFAEEACS 410
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 121/436 (27%), Positives = 190/436 (43%), Gaps = 82/436 (18%)
Query: 72 TKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF 131
T+++ + + + + S + + + LS+ S G Y I +S GTPP+ + DTGS ++W
Sbjct: 27 TRSRSRDRQTKVPSQDFQAPVVSGLSLGS-GEYFIRISVGTPPR-RMYLVMDTGSDILWL 84
Query: 132 PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC-SWIFGPNVESRCKGCSP 190
C CV+C + D F P +SS+ +GC +C + G ++C
Sbjct: 85 QCAP---CVNC-YHQSDA----IFDPYKSSTYSTLGCSTRQCLNLDIGTCQANKCL---- 132
Query: 191 RNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFG 249
Y + YG G FT G ++ + S + ++ ++ P G G
Sbjct: 133 -----------YQVDYGDGSFTTGEFGTDDVSLNSTS-----GVGQVVLNKIPLGC---G 173
Query: 250 RSSE-----------------SLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPG 289
+E S P+Q+ + +FSYCL R+ D SS LV
Sbjct: 174 HDNEGYFVGAAGLLGLGKGPLSFPNQVDPQNGGRFSYCLTDRETDSTEGSS-LVF----- 227
Query: 290 SGDSKTP--GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
G++ P G +TP S+ FYY+ + I VG + IP S S GNG
Sbjct: 228 -GEAAVPPAGARFTP-----QDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNG 281
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
GVI+DSG++ T ++ + ++ F S A S C+D+SG SV +P
Sbjct: 282 GVIIDSGTSVTRLQNAAYASLRDAF---RAGTSDLAPTAGFSLFDTCYDLSGLASVDVPT 338
Query: 408 LILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYL 466
+ L F+GG + LP NY V N CL AG GP+II G+ Q Q F +
Sbjct: 339 VTLHFQGGTDLKLPASNYLIPVDNSNTFCLAF-----AGTT---GPSII-GNIQQQGFRV 389
Query: 467 EFDLANDRFGFAKQKC 482
+D +++ GF +C
Sbjct: 390 IYDNLHNQVGFVPSQC 405
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 107/375 (28%), Positives = 161/375 (42%), Gaps = 43/375 (11%)
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF--IPKRSSSSQLIGCQNPKCSWIFG 178
I DT S L W C C D P DPS P++ +P SSS C + +
Sbjct: 167 IVDTASELTWVQCAPCESCHDQQDPLFDPSSSPSYAAVPCNSSS-----CDALQLATGGT 221
Query: 179 PNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSIL 237
+ C+G C SY L Y G ++ G+L + L + + F+ GC
Sbjct: 222 SGGAAACQGQDQSAAAC-----SYTLSYRDGSYSRGVLAHDRLSLAGEVIDGFVFGCGTS 276
Query: 238 SDRQP----AGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGS 290
+ P +G+ G GRS SL SQ + FSYCL ++ D S +LV+
Sbjct: 277 NQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCLPLKESDS---SGSLVI------ 327
Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVI 350
GD + + TP + S G FY+V L I VG + V+ G I
Sbjct: 328 GDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSGGGGGK---AI 384
Query: 351 VDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELIL 410
+DSG+ T + ++ AV EF+ Q Y +A S L CF+++G + V +P L L
Sbjct: 385 IDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAPGF---SILDTCFNMTGLREVQVPSLKL 441
Query: 411 KFKGGAKMALPPEN--YFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
F GG ++ + YF + +CL A P I+G++Q +N + F
Sbjct: 442 VFDGGVEVEVDSGGVLYFVSSDSSQVCL------AMAPLKSEYETNIIGNYQQKNLRVIF 495
Query: 469 DLANDRFGFAKQKCA 483
D + + GFA++ C
Sbjct: 496 DTSGSQVGFAQETCG 510
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 137/469 (29%), Positives = 191/469 (40%), Gaps = 58/469 (12%)
Query: 35 PLTPLSTKHYLHHSDS-DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIK 93
P T L L HSDS L ILH L + R K K S+ S+ I+
Sbjct: 17 PKTQLQRLKELVHSDSVRQLMILHKLRGGQIPR-------RKAKEVLSSSSGRGSDDAIE 69
Query: 94 TPL---SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
P+ + + G Y ++ GTP Q + DTGS L W C +Y C N N
Sbjct: 70 VPMHPAADYGIGQYFVAFKVGTPSQKFM-LVADTGSDLTWMSC--KYHCRSRNCSNRKAR 126
Query: 151 RIP---AFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG 207
RI F SSS + I C C +E S N PL Y +Y
Sbjct: 127 RIRHKRVFHANLSSSFKTIPCLTDMC------KIE-LMDLFSLTNCPTPLTPCGYDYRYS 179
Query: 208 LGFTA-GLLLSETLRFPSKT-----VPNFLAGCSI----LSDRQPAGIAGFGRSSESLPS 257
G TA G +ET+ K + N L GCS S + G+ G G S S
Sbjct: 180 DGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAI 239
Query: 258 QLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
+ K KFSYCL+ VS+ L GS SK L+ + + +G ++F
Sbjct: 240 KAAEKFGGKFSYCLVDH-LSHKNVSNYLTF----GSSRSKEALLNNMTYTELVLGMVNSF 294
Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
Y V + I +G +KIP V G GG I+DSGS+ TF+ P ++ V
Sbjct: 295 ---YAVNMMGISIGGAMLKIPSE--VWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRV 349
Query: 375 QMGNYSRAADVEKKSG-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV 433
+ + + VE G L CF+ +G + +P L+ F GA+ P ++Y + V
Sbjct: 350 SLLKFRK---VEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGV 406
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
CL + G + ++G+ QN EFDL + GFA C
Sbjct: 407 RCLGFVSVAWPGTS-------VVGNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 113/389 (29%), Positives = 162/389 (41%), Gaps = 55/389 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G+PP+ S + D+GS +VW C +C + P DP+ +F S
Sbjct: 138 GEYFVRIGVGSPPR-SQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCS 196
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
SS +N C RC+ Y + YG G +T G L ETL
Sbjct: 197 SSVCDRLENAGCH-------AGRCR---------------YEVSYGDGSYTKGTLALETL 234
Query: 221 RFPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
F V + GC + AG+ G G S S QLG + FSYCL+SR D
Sbjct: 235 TFGRTMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTD 294
Query: 275 DAPVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
S +LV G P G ++ P +NP S FYY+GL + VG V
Sbjct: 295 S---SGSLVF------GREALPAGAAWVPLVRNPRAPS-----FYYIGLAGLGVGGIRVP 340
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
I G+GGV++D+G+ T + ++A F+ Q N RA V
Sbjct: 341 ISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAI---FDT 397
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
C+D+ G SV +P + F GG + LP N+ + + F + +G +
Sbjct: 398 CYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLS------ 451
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
ILG+ Q + + FD AN GF C
Sbjct: 452 -ILGNIQQEGIQISFDGANGYVGFGPNIC 479
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 115/397 (28%), Positives = 166/397 (41%), Gaps = 91/397 (22%)
Query: 99 HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
+S G Y+++LS GTPP + + DTGSSL+W C C +C P+ P F P
Sbjct: 85 NSAGAYNMNLSIGTPP-VTFSVLADTGSSLIWTQCAP---CTECA---ARPA--PPFQPA 135
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
SS+ + C + C ++ P GC Y YG+GFTAG L +E
Sbjct: 136 SSSTFSKLPCASSLCQFLTSPYRTCNATGCV------------YYYPYGMGFTAGYLATE 183
Query: 219 TLRFPSKTVPNFLAGCSILS--DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR-KFDD 275
TL + P GCS + +GI G GRS SL SQ+G+ +FSYCL S D
Sbjct: 184 TLHVGGASFPGVTFGCSTENGVGNSSSGIVGLGRSPLSLVSQVGVARFSYCLRSNADAGD 243
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
+P+ + G+ S TP +NP SS+ +YYV L I VG+ + +
Sbjct: 244 SPILFGSLAKVTGGNVQS-------TPLLENPEMPSSS---YYYVNLTGITVGATDLPMA 293
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
+ L T + G F G CF
Sbjct: 294 MANL------------------TTVNGTRF------------------------GFDLCF 311
Query: 396 D---ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE------VLCLILFTDNAAGP 446
D G V +P L+L+F GGA+ A+ +YF +V + V CL++ P
Sbjct: 312 DATAAGGGGGVPVPTLVLRFAGGAEYAVRRRSYFGVVEVDSQGRAAVECLLVL------P 365
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A + I+G+ + ++ +DL F FA CA
Sbjct: 366 ASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 402
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 119/410 (29%), Positives = 170/410 (41%), Gaps = 73/410 (17%)
Query: 89 NSLIKTPLSVHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNV 147
N+ + PL + GG Y + S GTPPQ T + DTGS L+W C C +
Sbjct: 75 NNTQRIPLRMDDSGGAYDMEFSMGTPPQKLTA-LADTGSDLIWAKCGGA-----CT-TSC 127
Query: 148 DPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG 207
+P P+++P SS+ + C + CS + +V C+ C Y YG
Sbjct: 128 EPQGSPSYLPNASSTFAKLPCSDRLCSLLRSDSV----AWCAAAGAEC-----DYRYSYG 178
Query: 208 LG-----FTAGLLLSETLRFPSKTVPNFLAGCSI---LSDRQPAGIAGFGRSSESLPSQL 259
LG +T G L ET + VP+ GC+ +G+ G GR SL SQL
Sbjct: 179 LGDDDHHYTQGFLARETFTLGADAVPSVRFGCTTASEGGYGSGSGLVGLGRGPLSLVSQL 238
Query: 260 GLKKFSYCLLSRKFDDAPVSSNLV---LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE 316
F YCL S DA +S L+ L + G+ T L+ T FY
Sbjct: 239 NASTFMYCLTS----DASKASPLLFGSLASLTGAQVQSTGLLASTTFYA----------- 283
Query: 317 FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
V LR I +GS PG GV+ DSG+T T++ P + F+ Q
Sbjct: 284 ---VNLRSISIGSA--------TTPGVGEPEGVVFDSGTTLTYLAEPAYSEAKAAFLSQ- 331
Query: 377 GNYSRAADVEKKSGLRPCFD--ISGKKS-VYLPELILKFKGGAKMALPPENYFALVGNEV 433
+ VE G CF +G+ S +P ++L F GA MALP NY V + V
Sbjct: 332 ---TSLDQVEDTDGFEACFQKPANGRLSNAAVPTMVLHFD-GADMALPVANYVVEVEDGV 387
Query: 434 LCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+C I + R P++ I+G+ N+ + D+ F C
Sbjct: 388 VCWI----------VQRSPSLSIIGNIMQVNYLVLHDVHRSVLSFQPANC 427
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 113/396 (28%), Positives = 166/396 (41%), Gaps = 49/396 (12%)
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
++SL+ GTPPQ T + DTGS L W + CN S F P SSS
Sbjct: 74 TVSLTVGTPPQNVT-MVIDTGSELSW---------LHCNTSQNSSSSSSTFNPVWSSSYS 123
Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS 224
I C + C+ ++R P + C + L + G L ++T S
Sbjct: 124 PIPCSSSTCT------DQTRDFPIRPSCDSNQF-CHATLSYADASSSEGNLATDTFYIGS 176
Query: 225 KTVPNFLAGC--SILS-----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
+PN + GC SI S D + G+ G R S S SQ+G KFSYC+ F
Sbjct: 177 SGIPNVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDF---- 232
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S L+L G + P L+YTP + Y V L I V K + IP S
Sbjct: 233 --SGLLL-LGDANFSWLAP-LNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPES 288
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK---KSGLRPC 394
P G G +VDSG+ FTF+ GP + A+ F+ + R + + + C
Sbjct: 289 VFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLC 348
Query: 395 FDISGKKSVY--LPELILKFKGGAKMALPPENYFALV-----GNE-VLCLILFTDNAAGP 446
+ + ++ LP + L F+ GA+M + + V GN+ + C + G
Sbjct: 349 YRVPTNQTRLPPLPSVTLVFR-GAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGV 407
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A ++G QN ++EFDL R G A+ +C
Sbjct: 408 E-----AFVIGHLHQQNVWMEFDLKKSRIGLAEIRC 438
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 126/438 (28%), Positives = 189/438 (43%), Gaps = 62/438 (14%)
Query: 58 SLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQAS 117
S +S LSR R+ + P T SNI YS +LI +SL GTP Q S
Sbjct: 50 SFKTSLLSR-RNPSPPSSPYTFRSNI--KYSMALI-------------LSLPIGTPSQ-S 92
Query: 118 TPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIF 177
+ DTGS L W C + P +F P SSS + C +P C
Sbjct: 93 QELVLDTGSQLSWIQCHPKKIKKPLPPPTT------SFDPSLSSSFSDLPCSHPLCK--- 143
Query: 178 GPNVESRCKGCSPR--NKTCPLACPS-----YLLQYGLG-FTAGLLLSETLRFP-SKTVP 228
PR + T P +C S Y Y G F G L+ E F S+T P
Sbjct: 144 ------------PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTP 191
Query: 229 NFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSS-NLVLDTG 287
+ GC+ S + GI G S SQ + KFSYC+ +R S+ + L
Sbjct: 192 PLILGCAKESTDE-KGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGDN 250
Query: 288 PGS-GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGN 346
P S G L++ + P A Y V L+ I +G K + IP S P + G+
Sbjct: 251 PNSRGFKYVSLLTFPQSQRMPNLDPLA----YTVPLQGIRIGQKRLNIPGSVFRPDAGGS 306
Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV--Y 404
G +VDSGS FT + ++ V +E +R +G+ + V + CFD + +
Sbjct: 307 GQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTA-DMCFDGNHSMEIGRL 365
Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
+ +L+ +F G ++ + ++ VG + C+ + + G A + I+G+ QN
Sbjct: 366 IGDLVFEFGRGVEILVEKQSLLVNVGGGIHCVGIGRSSMLGAA-----SNIIGNVHQQNL 420
Query: 465 YLEFDLANDRFGFAKQKC 482
++EFD+ N R GF+K +C
Sbjct: 421 WVEFDVTNRRVGFSKAEC 438
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 114/394 (28%), Positives = 168/394 (42%), Gaps = 56/394 (14%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
ISL GTPPQA + DTGS L W + C+ + P +F P SSS
Sbjct: 74 ISLPIGTPPQAQQ-MVLDTGSQLSW---------IQCHRKKLPPKPKTSFDPSLSSSFST 123
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPR--NKTCPLACPS-----YLLQYGLG-FTAGLLLS 217
+ C +P C PR + T P +C S Y Y G F G L+
Sbjct: 124 LPCSHPLCK---------------PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVK 168
Query: 218 ETLRFPSKTV-PNFLAGCSILS--DRQPAGIAGFGRSSESLPSQLGLKKFSYCL--LSRK 272
E + F + + P + GC+ S DR GI G R S SQ + KFSYC+ S +
Sbjct: 169 EKITFSNTEITPPLILGCATESSDDR---GILGMNRGRLSFVSQAKISKFSYCIPPKSNR 225
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
P S + D G L++ + P A Y V + I G K +
Sbjct: 226 PGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLA----YTVPMIGIRFGLKKL 281
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
I S P + G+G +VDSGS FT + ++ V E + ++G + V +
Sbjct: 282 NISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADM 341
Query: 393 PCFDISGKKSV---YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
CFD G ++ + +L+ F G ++ +P E VG + C+ + + G A
Sbjct: 342 -CFD--GNVAMIPRLIGDLVFVFTRGVEILVPKERVLVNVGGGIHCVGIGRSSMLGAA-- 396
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ I+G+ QN ++EFD+ N R GFAK C+
Sbjct: 397 ---SNIIGNVHQQNLWVEFDVTNRRVGFAKADCS 427
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 107/378 (28%), Positives = 167/378 (44%), Gaps = 61/378 (16%)
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
I DTGS L W C RC + + P F P S S + + C +P C + +
Sbjct: 149 IVDTGSDLSWVQCQPCKRCYN--------QQDPVFNPSTSPSYRTVLCSSPTCQSL--QS 198
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGCSILS 238
C +C +Y++ YG G +T G L +E L + T V NF+ GC
Sbjct: 199 ATGNLGVCGSNPPSC-----NYVVNYGDGSYTRGELGTEHLDLGNSTAVNNFIFGCG--R 251
Query: 239 DRQ-----PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGS 290
+ Q +G+ G GRSS SL SQ FSYCL + S +LV+
Sbjct: 252 NNQGLFGGASGLVGLGRSSLSLISQTSAMFGGVFSYCL---PITETEASGSLVMGGNSSV 308
Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVI 350
+ TP +SYT NP FY++ L I VGS V+ P S G G++
Sbjct: 309 YKNTTP-ISYTRMIPNPQL------PFYFLNLTGITVGSVAVQAP-------SFGKDGMM 354
Query: 351 VDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELIL 410
+DSG+ T + +++A+ EF++Q + A L CF++SG + V +P + +
Sbjct: 355 IDSGTVITRLPPSIYQALKDEFVKQFSGFPSAPAFMI---LDTCFNLSGYQEVEIPNIKM 411
Query: 411 KFKGGAKMALPPENYFALVGNEV--LCLILFT---DNAAGPALGRGPAIILGDFQLQNFY 465
F+G A++ + F V + +CL + + +N G I+G++Q +N
Sbjct: 412 HFEGNAELNVDVTGVFYFVKTDASQVCLAIASLSYENEVG---------IIGNYQQKNQR 462
Query: 466 LEFDLANDRFGFAKQKCA 483
+ +D GFA + C
Sbjct: 463 VIYDTKGSMLGFAAEACT 480
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 124/487 (25%), Positives = 191/487 (39%), Gaps = 75/487 (15%)
Query: 14 LLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARH---- 69
+L LLFT G + P + H D L++ H + S R
Sbjct: 7 VLFLLFTIAKGLHN----------PKCDATHQHDHDGSTLQVFHVFSPCSPFRPSKPMSW 56
Query: 70 ----LKTKTKPKTKD---SNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIF 122
LK + K + + S++ + S I + + Y + GTP Q +
Sbjct: 57 EESVLKLQAKDQARMQYLSSLVARRSIVPIASGRQITQSPTYIVKAKIGTPAQ-TLLLAM 115
Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVE 182
DT + W PCT+ CV C S F P +S++ + +GC +C +
Sbjct: 116 DTSNDASWVPCTA---CVGC-------STTTPFAPAKSTTFKKVGCGASQCKQV------ 159
Query: 183 SRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGC------SI 236
RN TC + ++ YG A L+ +T+ + VP + GC S
Sbjct: 160 --------RNPTCDGSACAFNFTYGTSSVAASLVQDTVTLATDPVPAYAFGCIQKVTGSS 211
Query: 237 LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP 296
+ + G+ S + +L FSYCL S F S +L L GP + +
Sbjct: 212 VPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPS--FKTLNFSGSLRL--GPVAQPKR-- 265
Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
+ +TP KNP SS YYV L I VG + V IP L ++ G + DSG+
Sbjct: 266 -IKFTPLLKNPRRSS-----LYYVNLVAIRVGRRIVDIPPEALAFNANTGAGTVFDSGTV 319
Query: 357 FTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGA 416
FT + P + AV EF R++ + + V G C+ + P + F G
Sbjct: 320 FTRLVEPAYNAVRNEFRRRIAVHKKLT-VTSLGGFDTCY----TAPIVAPTITFMF-SGM 373
Query: 417 KMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRF 475
+ LPP+N V CL + A P ++ + Q QN + FD+ N R
Sbjct: 374 NVTLPPDNILIHSTAGSVTCLAM----APAPDNVNSVLNVIANMQQQNHRVLFDVPNSRL 429
Query: 476 GFAKQKC 482
G A++ C
Sbjct: 430 GVARELC 436
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 127/457 (27%), Positives = 185/457 (40%), Gaps = 63/457 (13%)
Query: 38 PLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTK-TKPKTKDSNIGSNYSNSLIKTPL 96
P ST L H D+ + LA++S + +R T KPK G +SL PL
Sbjct: 66 PFST--VLTHDDARAAHLASRLATTSNAPSRRPTTSLRKPKAAAGASGGPLDDSLASVPL 123
Query: 97 SVHS---YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIP 153
+ + G Y L GTP S + DTGSSL W C+ CV V P
Sbjct: 124 TPGTSVGVGNYVTELGLGTP-ATSYAMVVDTGSSLTWLQCSP---CVVSCHRQVGP---- 175
Query: 154 AFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTA 212
+ P+ SS+ + C +C + + CS RN Y YG F+
Sbjct: 176 LYDPRASSTYATVPCSASQCDELQAATLNP--SACSVRNVCI------YQASYGDSSFSV 227
Query: 213 GLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---KFSY 266
G L +T+ F S + PNF GC ++ + AG+ G R+ SL QL FSY
Sbjct: 228 GYLSRDTVSFGSGSYPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSY 287
Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
CL P S+ L GP + SYTP + SSS Y+V L +
Sbjct: 288 CL------PTPASTGY-LSIGPYTSGH----YSYTP-----MASSSLDASLYFVTLSGMS 331
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
VG + + P + I+DSG+ T + ++ A++K M A
Sbjct: 332 VGGSPLAVS-----PAEYSSLPTIIDSGTVITRLPTAVYTALSKAVAAAMVGVQSA---P 383
Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
S L CF + + +P + + F GGA + L +N V + CL ++
Sbjct: 384 AFSILDTCFQGQASQ-LRVPAVAMAFAGGATLKLATQNVLIDVDDSTTCLAFAPTDST-- 440
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G+ Q Q F + +D+A R GFA C+
Sbjct: 441 -------TIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 470
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 112/386 (29%), Positives = 158/386 (40%), Gaps = 56/386 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + FGTPPQ + DT S W PC+ CV C S F P +S+S
Sbjct: 97 YIVKAKFGTPPQ-TLLLALDTSSDAAWIPCSG---CVGC-------STSKPFAPIKSTSF 145
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ + C +P C + P C G AC ++ YG A ++ +TL
Sbjct: 146 RNVSCGSPHCKQVPNPT----CGGS---------AC-AFNFTYGSSSIAASVVQDTLTLA 191
Query: 224 SKTVPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
+ +P + GC S + G+ S S L FSYCL S F
Sbjct: 192 ADPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPS--FKSIN 249
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S +L L GP + + YTP +NP SS YYV L I VG K V IP +
Sbjct: 250 FSGSLRL--GPVYQPKR---IKYTPLLRNPRRSS-----LYYVNLVAIKVGRKIVDIPPA 299
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
L G I DSG+ FT + P++ AV EF R++G V G C+++
Sbjct: 300 ALAFNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVG---PKLPVTTLGGFDTCYNV 356
Query: 398 SGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGPAIIL 456
+ +P + F G +ALPP+N CL + A P ++
Sbjct: 357 ----PIVVPTITFLF-SGMNVALPPDNIVIHSTAGSTTCLAM----AGAPDNVNSVLNVI 407
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
+ Q QN + FD+ N R G A++ C
Sbjct: 408 ANMQQQNHRVLFDVPNSRIGIARELC 433
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 133/474 (28%), Positives = 206/474 (43%), Gaps = 75/474 (15%)
Query: 29 AATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYS 88
++T TVP P + + YL H L ++ +RA L+ + KPK S + S
Sbjct: 114 SSTATVPDHPAARERYLKH-----------LLAADSARAASLQLR-KPKPASSTTTTQAS 161
Query: 89 NSLIKTPLSV---HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP 145
+ + PL + Y +++ G + I DTGS L W +C C
Sbjct: 162 AAAAEVPLGSGIRYQTLNYVTTIALGGGGAKNLTVIVDTGSDLTWV------QCEPCPGS 215
Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIF-----GPNVESRCKGCSPRNKTCPLACP 200
+ R P F P S + + C +P C+ P +R G S + C
Sbjct: 216 SCYAQRDPLFDPAASPTFAAVPCGSPACAASLKDATGAPGSCARSAGNS--EQRC----- 268
Query: 201 SYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGCSILSDRQ----PAGIAGFGRSSES 254
Y L YG G F+ G+L +TL + T + F+ GC LS+R AG+ G GR+ S
Sbjct: 269 YYALSYGDGSFSRGVLAQDTLGLGTTTKLDGFVFGCG-LSNRGLFGGTAGLMGLGRTDLS 327
Query: 255 LPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS 311
L SQ + FSYCL A +S L GPG S P ++YT +P
Sbjct: 328 LVSQTAARFGGVFSYCL------PATTTSTGSLSLGPGP-SSSFPNMAYTRMIADPTQP- 379
Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
FY++ + + + L G G V+VDSG+ T + +++AV E
Sbjct: 380 ----PFYFINI------TGAAVGGGAALTAPGFGAGNVLVDSGTVITRLAPSVYKAVRAE 429
Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV-- 429
F R+ Y A S L C+D++G+ V +P L L +GGA++ + +V
Sbjct: 430 FARRF-EYPAAPGF---SILDACYDLTGRDEVNVPLLTLTLEGGAQVTVDAAGMLFVVRK 485
Query: 430 -GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G++V CL + A+ P + P I+G++Q +N + +D R GFA + C
Sbjct: 486 DGSQV-CLAM----ASLPYEDQTP--IIGNYQQRNKRVVYDTVGSRLGFADEDC 532
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 121/413 (29%), Positives = 174/413 (42%), Gaps = 95/413 (23%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA----FIPKR 159
Y ++++ G+PP+ S I DTGS LVW V C N D S A F P R
Sbjct: 101 YLMTVNLGSPPR-SMLAIADTGSDLVW---------VKCKKGNNDTSSAAAPTTQFDPSR 150
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSE 218
SS+ + CQ C E+ + C +YL YG G T G+L +E
Sbjct: 151 SSTYGRVSCQTDAC--------EALGRATCDDGSNC-----AYLYAYGDGSNTTGVLSTE 197
Query: 219 TLRFPSKTVPNFLAGCSILSDRQ-PAGIAGFGRSSE------------------SLPSQL 259
T F G S S RQ G FG S+ SL +QL
Sbjct: 198 TFTFDD--------GGSGRSPRQVRVGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQL 249
Query: 260 GL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
G ++FSYCL+ V+++ L+ G D PG + TP V +
Sbjct: 250 GGATSLGRRFSYCLVPHS-----VNASSALNFG-ALADVTEPGAASTPLVAGDVDT---- 299
Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
+Y V L + VG+K V S + +IVDSG+T TF++ L + E R
Sbjct: 300 --YYTVVLDSVKVGNKTVA---------SAASSRIIVDSGTTLTFLDPSLLGPIVDELSR 348
Query: 375 QMGNYSRAADVEKKSGL-RPCFDISGKK---SVYLPELILKFKGGAKMALPPENYFALVG 430
++ V+ GL + C++++G++ +P+L L+F GGA +AL PEN F V
Sbjct: 349 RI----TLPPVQSPDGLLQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQ 404
Query: 431 NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
LCL + + P ILG+ QN ++ +DL FA CA
Sbjct: 405 EGTLCLAIVATTE------QQPVSILGNLAQQNIHVGYDLDAGTVTFAGADCA 451
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 114/394 (28%), Positives = 168/394 (42%), Gaps = 56/394 (14%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
ISL GTPPQA + DTGS L W + C+ + P +F P SSS
Sbjct: 74 ISLPIGTPPQAQQ-MVLDTGSQLSW---------IQCHRKKLPPKPKTSFDPSLSSSFST 123
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPR--NKTCPLACPS-----YLLQYGLG-FTAGLLLS 217
+ C +P C PR + T P +C S Y Y G F G L+
Sbjct: 124 LPCSHPLCK---------------PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVK 168
Query: 218 ETLRFPSKTV-PNFLAGCSILS--DRQPAGIAGFGRSSESLPSQLGLKKFSYCL--LSRK 272
E + F + + P + GC+ S DR GI G R S SQ + KFSYC+ S +
Sbjct: 169 EKITFSNTEITPPLILGCATESSDDR---GILGMNRGRLSFVSQAKISKFSYCIPPKSNR 225
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
P S + D G L++ + P A Y V + I G K +
Sbjct: 226 PGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLA----YTVPMIGIRFGLKKL 281
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
I S P + G+G +VDSGS FT + ++ V E + ++G + V +
Sbjct: 282 NISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADM 341
Query: 393 PCFDISGKKSVY---LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
CFD G ++ + +L+ F G ++ +P E VG + C+ + + G A
Sbjct: 342 -CFD--GNVAMIPRLIGDLVFVFTRGVEIFVPKERVLVNVGGGIHCVGIGRSSMLGAA-- 396
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ I+G+ QN ++EFD+ N R GFAK C+
Sbjct: 397 ---SNIIGNVHQQNLWVEFDVTNRRVGFAKADCS 427
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 123/407 (30%), Positives = 173/407 (42%), Gaps = 48/407 (11%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
L H ++SL+ GTPPQ T + DTGS L W C + + +F
Sbjct: 55 LRFHHNVSLTVSLAVGTPPQNVT-MVLDTGSELSWLLCATGRQGSAAAGAAAAMGE--SF 111
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GL 214
P+ S++ + C + +CS P S C G S + C ++ L Y G + G
Sbjct: 112 RPRASATFAAVPCGSTQCSSRDLPAPPS-CDGAS---RQCHVS-----LSYADGSASDGA 162
Query: 215 LLSETLRFPSKTVPNFLAGC-SILSDRQPAGIA-----GFGRSSESLPSQLGLKKFSYCL 268
L ++ GC S D P G+A G R + S +Q ++FSYC+
Sbjct: 163 LATDVFAVGEAPPLRSAFGCMSTAYDSSPDGVATAGLLGMNRGTLSFVTQASTRRFSYCI 222
Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY-VGLRQIIV 327
R DDA V L+L G D L+YTP Y+ P F Y V L I V
Sbjct: 223 SDR--DDAGV---LLL----GHSDLPFLPLNYTPLYQ-PTLPLPYFDRVAYSVQLLGIRV 272
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
G K + IP S L P G G +VDSG+ FTF+ G + A+ EF++Q RA D
Sbjct: 273 GGKALPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFLKQTKPLLRALDDPS 332
Query: 388 ---KSGLRPCFDISGKK---SVYLPELILKFKGGAKMALPPENYFALVGNE------VLC 435
+ L CF + + S LP + L F GA+M++ + V E V C
Sbjct: 333 FAFQEALDTCFRVPAGRPPPSARLPPVTLLFN-GAEMSVAGDRLLYKVPGEHRGADGVWC 391
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
L F + P A ++G N ++E+DL R G A KC
Sbjct: 392 LT-FGNADMVPLT----AYVIGHHHQMNLWVEYDLERGRVGLAPVKC 433
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 121/414 (29%), Positives = 178/414 (42%), Gaps = 81/414 (19%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y +++ GTPP I DTGS LVW C + D + + P + F+P SS+
Sbjct: 110 YLMAIEVGTPP-VRVLAIADTGSDLVWVKCKGK----DNDNNSTAPPSV-YFVPSASSTY 163
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRF 222
+GC C + S CSP + +C YL YG G A G L +ET F
Sbjct: 164 GRVGCDTKACRAL------SSAASCSP-DGSC-----EYLYSYGDGSRASGQLSTETFTF 211
Query: 223 P-----SKT-----------------VPNFLAGCSILSDR--QPAGIAGFGRSSESLPSQ 258
SKT + GCS + + G+ G G SL SQ
Sbjct: 212 STIADSSKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRADGLVGLGGGPVSLASQ 271
Query: 259 LGL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSA 313
LG +KFSYCL A +++ L+ G + S+ PG + TP V +
Sbjct: 272 LGATTSLGRKFSYCLAPY----ANTNASSALNFGSRAVVSE-PGAASTPLITGEVET--- 323
Query: 314 FGEFYYVGLRQI-IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
+Y + L I + G+K P + +IVDSG+T T+++ L + K+
Sbjct: 324 ---YYTIALDSINVAGTKR---------PTTAAQAHIIVDSGTTLTYLDSALLTPLVKDL 371
Query: 373 IRQMGNYSRAADVEKKSGLRPCFDISG---KKSVYLPELILKFKGGAKMALPPENYFALV 429
R++ RA EK L C+DISG + ++ +P++ L GG ++ L P+N F +V
Sbjct: 372 TRRI-KLPRAESPEKI--LDLCYDISGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVV 428
Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
VLCL L + R ILG+ QN ++ +DL FA CA
Sbjct: 429 QEGVLCLALVATSE------RQSVSILGNIAQQNLHVGYDLEKGTVTFAAADCA 476
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 113/388 (29%), Positives = 157/388 (40%), Gaps = 58/388 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G+PP + D+GS ++W C +C + DP F P SS
Sbjct: 128 GEYFVRVGVGSPP-TDQYLVVDSGSDVIWVQCRPCEQC----YAQTDP----LFDPAASS 178
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C + C + G C Y + YG G +T G L ETL
Sbjct: 179 SFSGVSCGSAICR-----TLSGTGCGGGGDAGKC-----DYSVTYGDGSYTKGELALETL 228
Query: 221 RFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFD 274
V GC + AG+ G G + SL QLG FSYCL SR
Sbjct: 229 TLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASR--- 285
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
G+G + + L T +SS FYYVGL I VG + + +
Sbjct: 286 --------------GAGGAGSLVLGRTEAVPRGRRASS----FYYVGLTGIGVGGERLPL 327
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
S DG GGV++D+G+ T + + A+ F MG R+ V S L C
Sbjct: 328 QDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAV---SLLDTC 384
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
+D+SG SV +P + F GA + LP N VG V CL F +++G +
Sbjct: 385 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFCLA-FAPSSSGIS------- 436
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
ILG+ Q + + D AN GF C
Sbjct: 437 ILGNIQQEGIQITVDSANGYVGFGPNTC 464
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 158/383 (41%), Gaps = 46/383 (12%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +S S GTPPQ T + D S VW C++ C C + P F SS
Sbjct: 95 GMYVLSFSVGTPPQVVTG-VLDITSDFVWMQCSA---CATCGADAPAATSAPPFYAFLSS 150
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF---TAGLLLSE 218
+ + + C N C + P CS + C Y YG G TAGLL +
Sbjct: 151 TIREVRCANRGCQRLV-PQT------CSADDSPC-----GYSYVYGGGAANTTAGLLAVD 198
Query: 219 TLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
F + + GC++ ++ G+ G GR SL SQL + +FSY L DDA
Sbjct: 199 AFAFATVRADGVIFGCAVATEGDIGGVIGLGRGELSLVSQLQIGRFSYYLAP---DDAVD 255
Query: 279 SSNLVL---DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
+ +L D P +T TP N S YYV L I V + + IP
Sbjct: 256 VGSFILFLDDAKP-----RTSRAVSTPLVANRASRS-----LYYVELAGIRVDGEDLAIP 305
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
+DG+GGV++ TF++ ++ V + ++G RAAD + GL C+
Sbjct: 306 RGTFDLQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKIG--LRAAD-GSELGLDLCY 362
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGPAI 454
+ +P + L F GGA M L NYF + L CL + A G
Sbjct: 363 TSESLATAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPA-------GDGS 415
Query: 455 ILGDFQLQNFYLEFDLANDRFGF 477
+LG ++ +D++ R F
Sbjct: 416 LLGSLIQVGTHMIYDISGSRLVF 438
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 109/392 (27%), Positives = 164/392 (41%), Gaps = 51/392 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +++ GTP + IFDTGS L W C CV + + P F P S
Sbjct: 152 GNYIVNVGLGTPKK-DLSLIFDTGSDLTWTQCQP---CVKSCYAQ----QQPIFDPSTSK 203
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ I C + CS + + GCS N Y +QYG FT G + L
Sbjct: 204 TYSNISCTSAACSSL--KSATGNSPGCSSSNCV-------YGIQYGDSSFTIGFFAKDKL 254
Query: 221 RFPSKTV-PNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKF 273
V F+ GC + + AG+ G GR S+ Q K FSYCL + +
Sbjct: 255 TLTQNDVFDGFMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRG 314
Query: 274 DDAPVSSNLVLDTGPGSGDSKT--PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
+ +L G G SK G+++TPF +SS +Y++ + I VG K
Sbjct: 315 SNG----HLTFGNGNGVKASKAVKNGITFTPF------ASSQGTAYYFIDVLGISVGGKA 364
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
+ I P N G I+DSG+ T + + ++ F + M Y A + S L
Sbjct: 365 LSIS-----PMLFQNAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPAL---SLL 416
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C+D+S S+ +P++ F G A + L P G +CL F N ++G
Sbjct: 417 DTCYDLSNYTSISIPKISFNFNGNANVELDPNGILITNGASQVCLA-FAGNGDDDSIG-- 473
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+ Q Q + +D+A + GF + C+
Sbjct: 474 ---IFGNIQQQTLEVVYDVAGGQLGFGYKGCS 502
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 121/458 (26%), Positives = 184/458 (40%), Gaps = 94/458 (20%)
Query: 54 KILHSLASSSLSRARHLKTKTKPKTKDSNIGSNY---------SNSLIKTPLSVHSYGGY 104
++LH + + +RAR L ++ + S++ + L +TP+S + G Y
Sbjct: 65 ELLHEVVTHDFARARALASRLVSSNSPNRSSSDHRHLAEEEEVEHDLAQTPVSFTNGGVY 124
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
S++ G+PP+ + + DTGS L W C DP
Sbjct: 125 YSSITLGSPPKDFS-LVMDTGSDLTWVRC--------------DPC-------------- 155
Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCP--LACPSYLLQYGLGFTAGLLLSETLRF 222
+P CS F R + + TC L P L + F +G L +TL+
Sbjct: 156 -----SPDCSSTF-----DRLASNTYKALTCADDLRLPVLLRLWRRLFHSGRSLRDTLKM 205
Query: 223 PS------KTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLS 270
+ P F+ GC L GI S S PSQ+G K KFSYCLL
Sbjct: 206 AGAASDELEEFPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLR 265
Query: 271 RKFDDAPVSSNLVLDTG------PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
+ ++ S +V PGSG K L YTP +G SS + Y V L
Sbjct: 266 QTAQNSLKKSPMVFGEAAVELKEPGSG--KPQELQYTP-----IGESSIY---YTVRLDG 315
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
I VG++ + + S + G D I DSG+T T + + +++ + + + A+
Sbjct: 316 ISVGNQRLDLSPSTFLNGQDKP--TIFDSGTTLTMLPSGVCDSIKQ----SLASMVSGAE 369
Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA 444
GL CF + LP++ F GGA P NY +G+ + CLI N
Sbjct: 370 FVAIKGLDACFRVPPSSGQGLPDITFHFNGGADFVTRPSNYVIDLGS-LQCLIFVPTNEV 428
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I G+ Q Q+F++ D+ N R GF + C
Sbjct: 429 S---------IFGNLQQQDFFVLHDMDNRRIGFKETDC 457
>gi|226500708|ref|NP_001149229.1| aspartic proteinase nepenthesin-2 [Zea mays]
gi|195625632|gb|ACG34646.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 102/404 (25%), Positives = 166/404 (41%), Gaps = 65/404 (16%)
Query: 102 GGYSISLSFGTP-PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
Y I+ G P+ + + DTGS + W T+ C RS
Sbjct: 108 ASYIITFYLGNQRPEDNISAVVDTGSDIFW---TTEKEC------------------SRS 146
Query: 161 SSSQLIGCQNPKCSWIFGPNV-ESRCKGCSPRNKTCPLACPSYLLQYGLGF---TAGLLL 216
+ ++ C +PKC S K + + C +Y + YG TAG++
Sbjct: 147 KTRSMLPCCSPKCEQRASCGCGRSELKAEAEKETKC-----TYAIIYGGNANDSTAGVMY 201
Query: 217 SETLRF---PSKTVPN------FLAGCSI-----LSDRQPAGIAGFGRSSESLPSQLGLK 262
+ L SK VP+ GCS D G+ G GRS+ SLP QL
Sbjct: 202 EDKLTIVAVASKAVPSSQSFKEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFS 261
Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
KFSYCL S + D P S L+L P + + +S + Y+V L
Sbjct: 262 KFSYCLSSYQEPDLP--SYLLLTAAPDM--ATGAVGGGAAVATTALQPNSDYKTLYFVHL 317
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
+ I +G + + G + VD+G++FT +EG +F + E R M
Sbjct: 318 QNISIGGTR------FPAVSTKSGGNMFVDTGASFTRLEGTVFAKLVTELDRIMKERKYV 371
Query: 383 ADVEKKSGLRPCF---DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
+ ++ + C+ + +S LP+++L F A M LP ++Y ++ LCL ++
Sbjct: 372 KEQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTTSK-LCLAIY 430
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N +G +LG+FQ+QN ++ D N++ F + C+
Sbjct: 431 KSNI------KGGISVLGNFQMQNTHMLLDTGNEKLSFVRADCS 468
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 115/398 (28%), Positives = 169/398 (42%), Gaps = 71/398 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS +VW C C +C + DP F P S
Sbjct: 6 GEYFTRIGIGTPTREQY-MVLDTGSDVVWIQCEP---CREC-YSQADP----IFNPSSSV 56
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S +GC + CS + + GC Y + YG G +T G +ETL
Sbjct: 57 SFSTVGCDSAVCSQLDANDCHG--GGCL------------YEVSYGDGSYTVGSYATETL 102
Query: 221 RFPSKTVPNFLAGCSILSDRQPAGI-------AGFGRSSESLPSQLGL---KKFSYCLLS 270
F + ++ N GC G+ G G S S P+QLG + FSYCL+
Sbjct: 103 TFGTTSIQNVAIGCG----HDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVD 158
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
R + S+ L+ GP +S G +TP NP FYY+ + I VG
Sbjct: 159 RDSE-----SSGTLEFGP---ESVPIGSIFTPLVANPF-----LPTFYYLSMVAISVGGV 205
Query: 331 HVK-IP-YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
+ +P ++ + + G GG+I+DSG+ T ++ ++A+ FI + RA +
Sbjct: 206 ILDSVPSEAFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGI--- 262
Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
S C+D+S +SV +P + F GA LP +N CLI D+
Sbjct: 263 SIFDTCYDLSALQSVSIPAVGFHFSNGAGFILPAKN----------CLIPM-DSMGTFCF 311
Query: 449 GRGPA----IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
PA I+G+ Q Q + FD AN GFA +C
Sbjct: 312 AFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 112/378 (29%), Positives = 167/378 (44%), Gaps = 59/378 (15%)
Query: 115 QASTPF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
Q + PF + DTGS + W C C DC + DP F P+ SSS + C++ +
Sbjct: 163 QPAKPFYMVLDTGSDINWLQCQP---CTDC-YQQTDP----IFDPRSSSSFASLPCESQQ 214
Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFL 231
C + GC R C Y + YG G FT G ++ETL F + + N +
Sbjct: 215 CQAL-------ETSGC--RASKCL-----YQVSYGDGSFTVGEFVTETLTFGNSGMINDV 260
Query: 232 A-GCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTG 287
A GC ++ AG+ G G SL SQ+ FSYCL+ R + SS+L ++
Sbjct: 261 AVGCGHDNEGLFVGSAGLLGLGGGPLSLTSQMKASSFSYCLVDRD---SSSSSDLEFNSA 317
Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
S P L S FYYVGL + VG + + IP + G G
Sbjct: 318 APSDSVNAPLLK-----------SGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYG 366
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG---LRPCFDISGKKSVY 404
G+IVDSG+ T ++ + + F+ SR ++K +G C+D+S + V
Sbjct: 367 GIIVDSGTAITRLQTQAYNTLRDAFV------SRTPYLKKTNGFALFDTCYDLSSQSRVT 420
Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
+P + +F GG + LPP+NY V + F + + I+G+ Q Q
Sbjct: 421 IPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLS-------IIGNVQQQGT 473
Query: 465 YLEFDLANDRFGFAKQKC 482
+ +DLAN GF+ KC
Sbjct: 474 RVHYDLANSVVGFSPHKC 491
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 170/391 (43%), Gaps = 46/391 (11%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
+SL GTP Q S + DTGS L W C + P +F P SSS
Sbjct: 83 LSLPIGTPSQ-SQELVLDTGSQLSWIQCHPKKIKKPLPPPTT------SFDPSLSSSFSD 135
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPR--NKTCPLACPS-----YLLQYGLG-FTAGLLLS 217
+ C +P C PR + T P +C S Y Y G F G L+
Sbjct: 136 LPCSHPLCK---------------PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVK 180
Query: 218 ETLRFP-SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
E F S+T P + GC+ S GI G S SQ + KFSYC+ +R
Sbjct: 181 EKFTFSNSQTTPPLILGCAKEST-DVKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPG 239
Query: 277 PVSS-NLVLDTGPGS-GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
S+ + L P S G L++ + P A Y V L I +G K + I
Sbjct: 240 LASTGSFYLGENPNSRGFKYVSLLTFPQSQRMPNLDPLA----YTVPLLGIRIGQKRLNI 295
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
P S P + G+G +VDSGS FT + ++ V +E +R +G+ + V + C
Sbjct: 296 PSSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTA-DMC 354
Query: 395 FDISGKKSV--YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
FD + + + + +L+ +F G ++ + + VG + C+ + + G A
Sbjct: 355 FDGNHQMVIGRLIGDLVFEFGRGVEILVEKQRLLVNVGGGIHCVGIGRSSMLGAA----- 409
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ I+G+ QN ++EFD+AN R GF+K +C+
Sbjct: 410 SNIIGNVHQQNLWVEFDVANRRVGFSKAECS 440
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 113/398 (28%), Positives = 170/398 (42%), Gaps = 63/398 (15%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
+SL GTPPQ S I DTGS L W C + P P F P SSS +
Sbjct: 79 VSLPIGTPPQ-SQQMILDTGSQLSWIQCHKK-------VPRKPPPST-VFDPSLSSSFSV 129
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPR--NKTCPLACP-----SYLLQYGLGFTA-GLLLS 217
+ C +P C PR + T P +C Y Y G A G L+
Sbjct: 130 LPCNHPLCK---------------PRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVR 174
Query: 218 ETLRF-PSKTVPNFLAGCSI-LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD 275
E + F S++ P + GC+ SD + GI G S SQ + KFSYC+ +R+
Sbjct: 175 EKITFSTSQSTPPLILGCAEDASDDK--GILGMNLGRLSFASQAKITKFSYCVPTRQVRP 232
Query: 276 A--PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
P S + + +G L+++ + P A + V L+ I +G+K +
Sbjct: 233 GFTPTGSFYLGENPNSAGFQYISLLTFSQSQRMPNLDPLA----HTVALQGIRIGNKKLN 288
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG-------NYSRAADVE 386
IP S G G ++DSGS FT++ + V +E +R G YS +D+
Sbjct: 289 IPVSAFRADPSGAGQSMIDSGSEFTYLVDVAYNKVREEVVRLAGPRLKKGYVYSGVSDM- 347
Query: 387 KKSGLRPCFDISGKK-SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
CFD + + + ++ +F G ++ + A VG V C+ + G
Sbjct: 348 -------CFDGNAMEIGRLIGNMVFEFDKGVEIVIEKGRVLADVGGGVHCVGIGRSEMLG 400
Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A + I+G+F QN ++EFD+AN R GF K C+
Sbjct: 401 AA-----SNIIGNFHQQNLWVEFDIANRRVGFGKADCS 433
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 117/399 (29%), Positives = 173/399 (43%), Gaps = 69/399 (17%)
Query: 98 VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
V + G Y +++ GTP + T FDTGS L W C C+ FP P F P
Sbjct: 134 VPTGGAYVVTVGLGTPKKDFT-LSFDTGSDLTWTQCEP---CLGGCFPQ----NQPKFDP 185
Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLS 217
S+S + + C + C I +G P C Y +QYG G+T G L +
Sbjct: 186 TTSTSYKNVSCSSEFCKLI--------AEGNYPAQDCISNTC-LYGIQYGSGYTIGFLAT 236
Query: 218 ETLRFPSKTV-PNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKK---FSYCLLS 270
ETL S V NFL GCS S G+ G GRS +LPSQ K FSYCL
Sbjct: 237 ETLAIASSDVFKNFLFGCSEESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCL-- 294
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKN-PVGSSSAFGEFYYVGLRQIIVGS 329
P S P S + G+ + K+ P+ S + Y GL + +
Sbjct: 295 ------PAS--------PSSTGHLSFGVEVSQAAKSTPI--SPKLKQLY--GLNTVGISV 336
Query: 330 KHVKIPYSYLVPGSDGNGGV---IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
+ ++P NG + I+DSG+TFTF+ P + A+ F M NY+
Sbjct: 337 RGRELPI---------NGSISRTIIDSGTTFTFLPSPTYSALGSAFREMMANYTLT---N 384
Query: 387 KKSGLRPCFDIS--GKKSVYLPELILKFKGGAKMALPPENYFALV-GNEVLCLILFTDNA 443
S +PC+D S G ++ +P + + F+GG ++ + V G + +CL F D
Sbjct: 385 GTSSFQPCYDFSNIGNGTLTIPGISIFFEGGVEVEIDVSGIMIPVNGLKEVCLA-FADTG 443
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ I G++Q + + + +D+A GFA + C
Sbjct: 444 SDSDFA-----IFGNYQQKTYEVIYDVAKGMVGFAPKGC 477
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 121/408 (29%), Positives = 163/408 (39%), Gaps = 69/408 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y++ + G+PP+ I DTGS LVW C +C + P DPS F
Sbjct: 2 GAYTMEIELGSPPKKFNA-IVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTF------ 54
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
K S GCS KTC Y QYG T G ETL
Sbjct: 55 ---------AKTSCSTSSCQSLPASGCSSSAKTCI-----YGYQYGDSSSTQGDFALETL 100
Query: 221 RF-----PSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGL---KKFSYCLL 269
SK PNF GC L+ AGI G G+ SL +QLG KFSYCL+
Sbjct: 101 TLRSSGGSSKAFPNFQFGCGRLNSGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLV 160
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
DD+ +S L+ GS S G TP N S +Y+VGL I VG
Sbjct: 161 DFD-DDSSKTSPLIF----GSSASTGSGAISTPIIPN-----SGRSTYYFVGLEGISVGG 210
Query: 330 KHVKIPYSYL------------VPGSDGN-GGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
K + + + V + N GG I DSG+T T ++ ++ V F +
Sbjct: 211 KQLSLATRAIDFLSVRSKKKLRVRALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSV 270
Query: 377 GNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV--GNEVL 434
S SG C+D+S K+ P L L FK G K + P +NYF +V V
Sbjct: 271 ---SLPTVDASSSGFDLCYDVSKSKNFKFPALTLAFK-GTKFSPPQKNYFVIVDTAETVA 326
Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
CL + + I+G+ QN+++ +D + +C
Sbjct: 327 CLAMGGSGSL-------GLGIIGNLMQQNYHVVYDRGTSTISMSPAQC 367
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 112/385 (29%), Positives = 172/385 (44%), Gaps = 51/385 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G PP + + DTGS + W C C +C + DP F P S+
Sbjct: 147 GEYFLRVGIGKPPSQAY-VVLDTGSDVSWIQCAP---CSEC-YQQSDP----IFDPISSN 197
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S I C P+C + S C RN TC Y + YG G +T G +ET+
Sbjct: 198 SYSPIRCDEPQCKSL----DLSEC-----RNGTC-----LYEVSYGDGSYTVGEFATETV 243
Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
S V N GC ++ AG+ G G S P+Q+ FSYCL++R D+
Sbjct: 244 TLGSAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVNR---DSD 300
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S L ++ P ++ T P +NP FYY+GL+ I VG + + IP S
Sbjct: 301 AVSTLEFNS-PLPRNAAT-----APLMRNP-----ELDTFYYLGLKGISVGGEALPIPES 349
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
+ G GG+I+DSG+ T + +++A+ F++ +A V S C+D+
Sbjct: 350 SFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGV---SLFDTCYDL 406
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILG 457
S ++SV +P + +F G ++ LP NY V + F + + I+G
Sbjct: 407 SSRESVEIPTVSFRFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLS-------IIG 459
Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
+ Q Q + FD+AN GF+ C
Sbjct: 460 NVQQQGTRVGFDIANSLVGFSVDSC 484
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 110/397 (27%), Positives = 178/397 (44%), Gaps = 53/397 (13%)
Query: 110 FGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169
GTPP+ + DT S L W TS C +C+ P+++P F P SSS C
Sbjct: 5 IGTPPR-EVLLLVDTASELTWVQGTS---CTNCS-----PTKVPPFNPGLSSSFISEPCT 55
Query: 170 NPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFPS---- 224
+ C G + C+ +C S+ + Y G A G++ E S
Sbjct: 56 SSVC---LGRSKLGFQSACNRSTGSC-----SFQVAYLDGSEAYGVIAREIFSLQSWDGA 107
Query: 225 -KTVPNFLAGCSILSDRQP----AGIAGFGRSSESLPSQLGLK-------KFSYCLLSRK 272
T+ + + GC+ ++P +G G R S S P+Q+G + +FSYC +R
Sbjct: 108 ASTLGDVIFGCASKDLQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRA 167
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPG--LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
+ SS +++ GDS P Y + P +S +FYYVGL+ I VG +
Sbjct: 168 --EHLNSSGVII-----FGDSGIPAHHFQYLSLEQEPPIASIV--DFYYVGLQGISVGGE 218
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+ IP S GNGG DSG+T +F+ P A+ + F R++ + +R + +
Sbjct: 219 LLHIPRSAFKIDRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKE 278
Query: 391 LRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALVGN--EVLCLILFTDNAAGP 446
L C+D++ + P + L FK M L + + + +V+ + L NA
Sbjct: 279 L--CYDVAAGDARLPTAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAG-- 334
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A+ +G ++G++Q Q++ +E DL R GFA C
Sbjct: 335 AVAQGGVNVIGNYQQQDYLIEHDLERSRIGFAPANCV 371
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 111/386 (28%), Positives = 157/386 (40%), Gaps = 56/386 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + FGTPPQ + DT S W PC+ CV C S F P +S+S
Sbjct: 97 YIVKAKFGTPPQ-TLLLALDTSSDAAWIPCSG---CVGC-------STSKPFAPIKSTSF 145
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ + C +P C + P C G AC ++ YG A ++ +TL
Sbjct: 146 RNVSCGSPHCKQVPNPT----CGGS---------AC-AFNFTYGSSSIAASVVQDTLTLA 191
Query: 224 SKTVPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
+ +P + GC S + G+ S S L FSYCL S F
Sbjct: 192 TDPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPS--FKSIN 249
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S +L L GP + + YTP +NP SS YYV L I VG K V IP +
Sbjct: 250 FSGSLRL--GPVYQPKR---IKYTPLLRNPRRSS-----LYYVNLVAIKVGRKIVDIPPA 299
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
L G I DSG+ FT + P++ AV EF R++G V G C+++
Sbjct: 300 ALAFNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVG---PKLPVTTLGGFDTCYNV 356
Query: 398 SGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGPAIIL 456
+ +P + F G + LPP+N CL + A P ++
Sbjct: 357 ----PIVVPTITFLF-SGMNVTLPPDNIVIHSTAGSTTCLAM----AGAPDNVNSVLNVI 407
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
+ Q QN + FD+ N R G A++ C
Sbjct: 408 ANMQQQNHRVLFDVPNSRIGIARELC 433
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 110/410 (26%), Positives = 176/410 (42%), Gaps = 55/410 (13%)
Query: 82 NIGSNYSNSLIKTPL---SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYR 138
N + Y + TP+ + G Y + GTP + + DTGS + W C
Sbjct: 137 NEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAK-DMYLVLDTGSDVNWIQCEP--- 192
Query: 139 CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA 198
C DC + DP F P SS+ + + C P+CS + C R+ C
Sbjct: 193 CADC-YQQSDP----VFNPTSSSTYKSLTCSAPQCSLL-------ETSAC--RSNKCL-- 236
Query: 199 CPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSE 253
Y + YG G FT G L ++T+ F S + N GC ++ AG+ G G
Sbjct: 237 ---YQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVL 293
Query: 254 SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGP-GSGDSKTPGLSYTPFYKNPVGSSS 312
S+ +Q+ FSYCL+ R D+ SS+L ++ G GD+ P L +
Sbjct: 294 SITNQMKATSFSYCLVDR---DSGKSSSLDFNSVQLGGGDATAPLLR-----------NK 339
Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
FYYVGL VG + V +P + + G+GGVI+D G+ T ++ + ++ F
Sbjct: 340 KIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAF 399
Query: 373 IRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE 432
++ N + + S C+D S +V +P + F GG + LP +NY V +
Sbjct: 400 LKLTVNLKKGS--SSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDS 457
Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
F ++ + I+G+ Q Q + +DL+ + G + KC
Sbjct: 458 GTFCFAFAPTSSSLS-------IIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 110/410 (26%), Positives = 176/410 (42%), Gaps = 55/410 (13%)
Query: 82 NIGSNYSNSLIKTPL---SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYR 138
N + Y + TP+ + G Y + GTP + + DTGS + W C
Sbjct: 137 NEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAK-EMYLVLDTGSDVNWIQCEP--- 192
Query: 139 CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA 198
C DC + DP F P SS+ + + C P+CS + C R+ C
Sbjct: 193 CADC-YQQSDP----VFNPTSSSTYKSLTCSAPQCSLL-------ETSAC--RSNKCL-- 236
Query: 199 CPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSE 253
Y + YG G FT G L ++T+ F S + N GC ++ AG+ G G
Sbjct: 237 ---YQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVL 293
Query: 254 SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGP-GSGDSKTPGLSYTPFYKNPVGSSS 312
S+ +Q+ FSYCL+ R D+ SS+L ++ G GD+ P L +
Sbjct: 294 SITNQMKATSFSYCLVDR---DSGKSSSLDFNSVQLGGGDATAPLLR-----------NK 339
Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
FYYVGL VG + V +P + + G+GGVI+D G+ T ++ + ++ F
Sbjct: 340 KIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAF 399
Query: 373 IRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE 432
++ N + + S C+D S +V +P + F GG + LP +NY V +
Sbjct: 400 LKLTVNLKKGS--SSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDS 457
Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
F ++ + I+G+ Q Q + +DL+ + G + KC
Sbjct: 458 GTFCFAFAPTSSSLS-------IIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 113/378 (29%), Positives = 165/378 (43%), Gaps = 59/378 (15%)
Query: 115 QASTPF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
Q + PF + DTGS + W C C DC + DP F P+ SSS + C++ +
Sbjct: 163 QPAKPFYMVLDTGSDINWLQCQP---CTDC-YQQTDP----IFDPRSSSSFASLPCESQQ 214
Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNF 230
C + GC R C Y + YG G FT G + ETL F S + N
Sbjct: 215 CQAL-------ETSGC--RASKCL-----YQVSYGDGSFTVGEFVIETLTFGNSGMINNV 260
Query: 231 LAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTG 287
GC ++ AG+ G G S SL SQ+ FSYCL+ R + SS+L ++
Sbjct: 261 AVGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMKASSFSYCLVDRD---SSSSSDLEFNSA 317
Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
S P L S FYYVGL + VG + + IP + G G
Sbjct: 318 APSDSVNAPLLK-----------SGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYG 366
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG---LRPCFDISGKKSVY 404
G+IVDSG+ T ++ + + F+ SR ++K +G C+D+S + V
Sbjct: 367 GIIVDSGTAITRLQTQAYNTLRDAFV------SRTPYLKKTNGFALFDTCYDLSSQSRVT 420
Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
+P + +F GG + LPP+NY V + F + + I+G+ Q Q
Sbjct: 421 IPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLS-------IIGNVQQQGT 473
Query: 465 YLEFDLANDRFGFAKQKC 482
+ +DLAN GF+ KC
Sbjct: 474 RVHYDLANSVVGFSPHKC 491
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 119/405 (29%), Positives = 168/405 (41%), Gaps = 47/405 (11%)
Query: 95 PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIP- 153
P + + G YS++ GTP Q + DTGS L W C +Y C N N RI
Sbjct: 3 PAADYGIGQYSVAFKVGTPSQKFM-LVADTGSDLTWMSC--KYHCRSRNCSNRKARRIRH 59
Query: 154 --AFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT 211
F SSS + I C C +E S N PL Y +Y G T
Sbjct: 60 KRVFHANLSSSFKTIPCLTDMC------KIE-LMDLFSLTNCPTPLTPCGYDYRYSDGST 112
Query: 212 A-GLLLSETLRFPSKT-----VPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGL 261
A G +ET+ K + N L GCS S + G+ G G S S +
Sbjct: 113 ALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAE 172
Query: 262 K---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
K KFSYCL+ VS+ L GS SK L+ + + +G ++F Y
Sbjct: 173 KFGGKFSYCLVDH-LSHKNVSNYLTF----GSSRSKEALLNNMTYTELVLGMVNSF---Y 224
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
V + I +G +KIP V G GG I+DSGS+ TF+ P ++ V +
Sbjct: 225 AVNMMGISIGGAMLKIPSE--VWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLK 282
Query: 379 YSRAADVEKKSG-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
+ + VE G L CF+ +G + +P L+ F GA+ P ++Y + V CL
Sbjct: 283 FRK---VEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLG 339
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ G + ++G+ QN EFDL + GFA C
Sbjct: 340 FVSVAWPGTS-------VVGNIMQQNHLWEFDLGLKKLGFAPSSC 377
>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
Length = 454
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 119/410 (29%), Positives = 174/410 (42%), Gaps = 63/410 (15%)
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA-FIPKRSSSS 163
++ ++ G PPQ T + DTGS L W RC P+ P + PA F SS+
Sbjct: 63 TVPVAVGAPPQNVT-MVLDTGSELSWL------RCNGSRVPSTPPPQAPAAFNGSASSTY 115
Query: 164 QLIGCQNPKCSWIFGPN--VESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETL 220
C +P+C W G + V C G P + +C ++ L Y +A G+L ++T
Sbjct: 116 AAAHCSSPECQW-RGRDLPVPPFCAG--PPSNSCRVS-----LSYADASSADGILAADTF 167
Query: 221 RFPSKTVPNFLAGC---------SILSDRQPA-GIAGFGRSSESLPSQLGLKKFSYCLLS 270
L GC + SD + A G+ G R S S +Q +F+YC+
Sbjct: 168 LLGGAPPVRALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATLRFAYCI-- 225
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYK--NPVGSSSAFGEFYYVGLRQIIVG 328
AP +L G G G + P L+YTP + P+ Y V L I VG
Sbjct: 226 -----APGDGPGLLVLG-GDGAALAPQLNYTPLIQISRPLPYFDRVA--YSVQLEGIRVG 277
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR---AADV 385
+ + IP S L P G G +VDSG+ FTF+ + + EF+ Q +D
Sbjct: 278 AALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDF 337
Query: 386 EKKSGLRPCFDISGKK----SVYLPELILKFKGGAKMALPPENYFALVGNE--------- 432
+ CF S + S LPE+ L + GA++A+ E V E
Sbjct: 338 VFQGAFDACFRASEARVAAASQMLPEVGLVLR-GAEVAVGGEKLLYRVPGERRGEGGAEA 396
Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
V CL + AG + A ++G QN ++E+DL N R GFA +C
Sbjct: 397 VWCLTFGNSDMAGMS-----AYVIGHHHQQNVWVEYDLQNGRVGFAPARC 441
>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 444
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 118/464 (25%), Positives = 177/464 (38%), Gaps = 80/464 (17%)
Query: 44 YLHHSDSDPLKILHSLASSSLSR----------ARHLKTKTKPKTKD-SNIGSNYSNSLI 92
Y H D L++ H + S R L+ K + + + SN+ + S I
Sbjct: 35 YQHDHDGSTLQVFHVFSPCSPFRPSKPMSWEESVLQLQAKDQARMQYLSNLVARRSIVPI 94
Query: 93 KTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRI 152
+ + Y + FGTP Q + DT + W PCT+ CV C S
Sbjct: 95 ASGRQITQSPTYIVRAKFGTPAQ-TLLLAMDTSNDAAWVPCTA---CVGC-------STT 143
Query: 153 PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA 212
F P +S++ + +GC +C + RN TC + ++ YG A
Sbjct: 144 TPFAPPKSTTFKKVGCGASQCKQV--------------RNPTCDGSACAFNFTYGTSSVA 189
Query: 213 GLLLSETLRFPSKTVPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSY 266
L+ +T+ + VP + GC S L + G+ S + +L FSY
Sbjct: 190 ASLVQDTVTLATDPVPAYTFGCIQKATGSSLPPQGLLGLGRGPLSLLAQTQKLYQSTFSY 249
Query: 267 CLLSRK-------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
CL S K D PV+ P P +KNP SS YY
Sbjct: 250 CLPSFKTLNFSGHXDLXPVAQ---------------PRDQVYPSFKNPRRSS-----LYY 289
Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
V L I VG + V IP L G + DSG+ FT + P + AV EF R++ +
Sbjct: 290 VNLVAIRVGRRIVDIPPEALAFNPXTGAGTVFDSGTVFTRLVEPAYTAVRNEFRRRVSVH 349
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLIL 438
+ V G C+ + + P + F G + LPP+N V CL +
Sbjct: 350 KKLT-VTSLGGFDTCYTV----PIVAPTITFMF-SGMNVTLPPDNILIHSTAGSVTCLAM 403
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A P ++ + Q QN + FD+ N R G A++ C
Sbjct: 404 ----APAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGVARELC 443
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 119/407 (29%), Positives = 169/407 (41%), Gaps = 79/407 (19%)
Query: 94 TPLSVHSYGGYSISLSFGTPPQASTPFIF--DTGSSLVWFPCT-SRYRCVDCNFPNVDPS 150
TP + + G Y + GTP + P+I DTGSSL W C+ R C
Sbjct: 127 TPGTSYGVGNYVTRMGLGTPAK---PYIMVVDTGSSLTWLQCSPCRVSC--------HRQ 175
Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS-----YLLQ 205
P F PK SSS + C P+C+ + + P AC S Y
Sbjct: 176 SGPVFDPKTSSSYAAVSCSTPQCNDLSTATLN-------------PAACSSSDVCIYQAS 222
Query: 206 YG-LGFTAGLLLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL 261
YG F+ G L +T+ F S +VPNF GC ++ + AG+ G R+ SL QL
Sbjct: 223 YGDSSFSVGYLSKDTVSFGSNSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAP 282
Query: 262 K---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
FSYCL S + PG SYTP SS+ Y
Sbjct: 283 TLGYSFSYCLPSSSSSGYLSIGSY----NPGQ-------YSYTPMV-----SSTLDDSLY 326
Query: 319 YVGLRQIIVGSKHVKI---PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
++ L + V K + + YS L I+DSG+ T + +++A++K
Sbjct: 327 FIKLSGMTVAGKPLAVSSSEYSSL--------PTIIDSGTVITRLPTTVYDALSKAVAGA 378
Query: 376 MGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLC 435
M RA + S L CF + S+ +P + + F GGA + L +N V + C
Sbjct: 379 MKGTKRA---DAYSILDTCF-VGQASSLRVPAVSMAFSGGAALKLSAQNLLVDVDSSTTC 434
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
L A PA R AII G+ Q Q F + +D+ ++R GFA C
Sbjct: 435 L------AFAPA--RSAAII-GNTQQQTFSVVYDVKSNRIGFAAGGC 472
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 120/454 (26%), Positives = 174/454 (38%), Gaps = 51/454 (11%)
Query: 39 LSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSV 98
LS H +H S PL+ + +LA +R L +K + + S S P
Sbjct: 27 LSVYHNVHPSSPSPLESIIALARDDDARLLFLSSKAA----TAGVSSAPVASGQAPP--- 79
Query: 99 HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
Y + G+P Q DT + W C+ C + F P
Sbjct: 80 ----SYVVRAGLGSPSQ-QLLLALDTSADATWAHCSPCGTCPSSSL----------FAPA 124
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
SSS + C + C G + G L ++ + L S+
Sbjct: 125 NSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASD 184
Query: 219 TLRFPSKTVPNFLAGCSILSDRQPA------GIAGFGRSSESLPSQLGL---KKFSYCLL 269
TLR +PN+ GC + S P G+ G GR +L SQ G FSYCL
Sbjct: 185 TLRLGKDAIPNYTFGC-VSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLP 243
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
S + S +L L G+G + + YTP +NP SS YYV + + VG
Sbjct: 244 S--YRSYYFSGSLRL----GAGGGQPRSVRYTPMLRNPHRSS-----LYYVNVTGLSVGR 292
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
VK+P + G +VDSG+ T P++ A+ +EF RQ+ S
Sbjct: 293 AWVKVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPS---GYTSLG 349
Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPAL 448
CF+ + P + + GG +ALP EN L CL + A P
Sbjct: 350 AFDTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAM----AEAPQN 405
Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++ + Q QN + FD+AN R GFAK+ C
Sbjct: 406 VNSVVNVIANLQQQNIRVVFDVANSRIGFAKESC 439
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 118/397 (29%), Positives = 162/397 (40%), Gaps = 59/397 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTS--RYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y G PPQ + I DTGS LVW C++ R C P + S F P
Sbjct: 90 YVAEYLIGDPPQRAEALI-DTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPV--- 145
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLGFTAGLLLSETL 220
P C C LA S + YG G AG L +E
Sbjct: 146 -----------------PCAARICAANDDIIHFCDLAAGCSVIAGYGAGVVAGTLGTEAF 188
Query: 221 RFPSKTVPNFLAGCSILSD------RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
F S T GC + +G+ G GR SL SQ G KFSYCL + F
Sbjct: 189 AFQSGTA-ELAFGCVTFTRIVQGALHGASGLIGLGRGRLSLVSQTGATKFSYCL-TPYFH 246
Query: 275 DAPVSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
+ + +L + G GD T T F K P GS FYY+ L + VG
Sbjct: 247 NNGATGHLFVGASASLGGHGDVMT-----TQFVKGPKGS-----PFYYLPLIGLTVGETR 296
Query: 332 VKIPYSY-----LVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
+ IP + + PG +GGVI+DSGS FT + ++A+A E ++ A +
Sbjct: 297 LPIPATVFDLREVAPGLF-SGGVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPD 355
Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
G C V +P ++ F+GGA MA+P E+Y+A V +AGP
Sbjct: 356 ADDGAL-CVARRDVGRV-VPAVVFHFRGGADMAVPAESYWAPVDKAA---ACMAIASAGP 410
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ ++G++Q QN + +DLAN F F C+
Sbjct: 411 YRRQS---VIGNYQQQNMRVLYDLANGDFSFQPADCS 444
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 111/400 (27%), Positives = 174/400 (43%), Gaps = 71/400 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+PP + DTGSSL+W C+ + C P P F P +SS
Sbjct: 87 GEYLMRFYIGSPPVERLAMV-DTGSSLIWLQCSPCHNCF--------PQETPLFEPLKSS 137
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTC-PLACPSYLLQYG-LGFTAGLLLSET 219
+ + C + C+ + P + C L Y + YG F+ G+L +ET
Sbjct: 138 TYKYATCDSQPCTLL------------QPSQRDCGKLGQCIYGIMYGDKSFSVGILGTET 185
Query: 220 LRFPS----KTV--PNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLK---KF 264
L F S +TV PN + GC +I + + GIAG G SL SQLG + KF
Sbjct: 186 LSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIGHKF 245
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
SYCLL D+ +S L + T G+ TP P + +Y++ L
Sbjct: 246 SYCLLPY---DSTSTSKLKFGS---EAIITTNGVVSTPLIIKP-----SLPTYYFLNLEA 294
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
+ +G K +V +G +++DSG+ T++E + +G D
Sbjct: 295 VTIGQK--------VVSTGQTDGNIVIDSGTPLTYLENTFYNNFVASLQETLG-VKLLQD 345
Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA-LVGNEVLCLILFTDNA 443
+ S L+ CF + ++ +P++ +F GA +AL P+N L + +LCL A
Sbjct: 346 L--PSPLKTCF--PNRANLAIPDIAFQFT-GASVALRPKNVLIPLTDSNILCL------A 394
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
P+ G G + + G +F +E+DL + FA CA
Sbjct: 395 VVPSSGIGIS-LFGSIAQYDFQVEYDLEGKKVSFAPTDCA 433
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 113/391 (28%), Positives = 164/391 (41%), Gaps = 61/391 (15%)
Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y + GTP Q P + DT S + W PC+ CV C PS AF P +S+
Sbjct: 115 YIVKALIGTPAQ---PLLLAMDTSSDVAWIPCSG---CVGC------PSNT-AFSPAKST 161
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
S + + C P+C + P +R AC S+ L YG A L +T+R
Sbjct: 162 SFKNVSCSAPQCKQVPNPTCGAR-------------AC-SFNLTYGSSSIAANLSQDTIR 207
Query: 222 FPSKTVPNFLAGC--------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKF 273
+ + F GC +I + G+ S S + FSYCL S F
Sbjct: 208 LAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPS--F 265
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
S +L L GP S + + YT +NP SS YYV L I VG K V
Sbjct: 266 RSLTFSGSLRL--GPTSQPQR---VKYTQLLRNPRRSS-----LYYVNLVAIRVGRKVVD 315
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
+P + + G I DSG+ +T + P++EAV EF +++ + A V G
Sbjct: 316 LPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTT--AVVTSLGGFDT 373
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGP 452
C+ SG+ V +P + FK G M +P +N CL + AA P
Sbjct: 374 CY--SGQ--VKVPTITFMFK-GVNMTMPADNLMLHSTAGSTSCLAM----AAAPENVNSV 424
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
++ Q QN + D+ N R G A+++C+
Sbjct: 425 VNVIASMQQQNHRVLIDVPNGRLGLARERCS 455
>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
Length = 452
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 119/410 (29%), Positives = 174/410 (42%), Gaps = 63/410 (15%)
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA-FIPKRSSSS 163
++ ++ G PPQ T + DTGS L W RC P+ P + PA F SS+
Sbjct: 61 TVPVAVGAPPQNVT-MVLDTGSELSWL------RCNGSRVPSTPPPQAPAAFNGSASSTY 113
Query: 164 QLIGCQNPKCSWIFGPN--VESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETL 220
C +P+C W G + V C G P + +C ++ L Y +A G+L ++T
Sbjct: 114 AAAHCSSPECQW-RGRDLPVPPFCAG--PPSXSCRVS-----LSYADASSADGILAADTF 165
Query: 221 RFPSKTVPNFLAGC---------SILSDRQPA-GIAGFGRSSESLPSQLGLKKFSYCLLS 270
L GC + SD + A G+ G R S S +Q +F+YC+
Sbjct: 166 LLGGAPPVXALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATLRFAYCI-- 223
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYK--NPVGSSSAFGEFYYVGLRQIIVG 328
AP +L G G G + P L+YTP + P+ Y V L I VG
Sbjct: 224 -----APGDGPGLLVLG-GDGAALAPQLNYTPLIQISRPLPYFDRVA--YSVQLEGIRVG 275
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR---AADV 385
+ + IP S L P G G +VDSG+ FTF+ + + EF+ Q +D
Sbjct: 276 AALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDF 335
Query: 386 EKKSGLRPCFDISGKK----SVYLPELILKFKGGAKMALPPENYFALVGNE--------- 432
+ CF S + S LPE+ L + GA++A+ E V E
Sbjct: 336 VFQGAFDACFRASEARVAAASXMLPEVGLVLR-GAEVAVGGEKLLYRVPGERRGEGGAEA 394
Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
V CL + AG + A ++G QN ++E+DL N R GFA +C
Sbjct: 395 VWCLTFGNSDMAGMS-----AYVIGHHHQQNVWVEYDLQNGRVGFAPARC 439
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 115/440 (26%), Positives = 199/440 (45%), Gaps = 56/440 (12%)
Query: 54 KILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTP 113
++ L S L R R ++ + + N+ ++ + + + +++ + Y +++ G+
Sbjct: 17 RLQKQLISDDL-RVRSMQNRIRRVVSSHNVEASQTQIPLSSGINLQTLN-YIVTMGLGS- 73
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
+ I DTGS L W C C + + P F P SSS Q + C + C
Sbjct: 74 --TNMTVIIDTGSDLTWVQCEPCMSCYN--------QQGPIFKPSTSSSYQSVSCNSSTC 123
Query: 174 -SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFL 231
S F C G +P TC +Y++ YG G +T G L E L F +V +F+
Sbjct: 124 QSLQFATGNTGAC-GSNP--STC-----NYVVNYGDGSYTNGELGVEQLSFGGVSVSDFV 175
Query: 232 AGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLD 285
GC + +G+ G GRS SL SQ FSYCL + ++ S +LV+
Sbjct: 176 FGCGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTT---ESGASGSLVMG 232
Query: 286 TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG 345
+ TP ++YT NP FY + L I V +++P S G
Sbjct: 233 NESSVFKNVTP-ITYTRMLPNP-----QLSNFYILNLTGIDVDGVALQVP-------SFG 279
Query: 346 NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYL 405
NGGV++DSG+ T + +++A+ F++Q + A S L CF+++G V +
Sbjct: 280 NGGVLIDSGTVITRLPSSVYKALKALFLKQFTGFPSAPGF---SILDTCFNLTGYDEVSI 336
Query: 406 PELILKFKGGAKMALPPENYFALVGNEV--LCLILFTDNAAGPALGRGPAIILGDFQLQN 463
P + + F+G A++ + F +V + +CL L + + A I+G++Q +N
Sbjct: 337 PTISMHFEGNAELKVDATGTFYVVKEDASQVCLALASLSDA------YDTAIIGNYQQRN 390
Query: 464 FYLEFDLANDRFGFAKQKCA 483
+ +D + GFA++ C+
Sbjct: 391 QRVIYDTKQSKVGFAEESCS 410
>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 342
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 80/243 (32%), Positives = 115/243 (47%), Gaps = 16/243 (6%)
Query: 243 AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTP 302
+G+ G + SL SQL + +FSYCL F + S L + T + T
Sbjct: 110 SGLMGLSPGTMSLISQLSVPRFSYCLTP--FAERKTSPMLFGAMADLRKYNTTGPIQTTA 167
Query: 303 FYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEG 362
+NP + +YYV L + +G+K +++P + L DG GG IVDSGST + G
Sbjct: 168 ILRNPAMDTF----YYYVPLVGLSLGTKRLRVPAASLAINPDGTGGTIVDSGSTMAHLAG 223
Query: 363 PLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS---GKKSVYLPELILKFKGGAKMA 419
F+AV K + + VE CF + +V P L+L F GGA MA
Sbjct: 224 KAFDAVKKAVLEAVKLPVFNGTVED---YELCFAVPSGVAMAAVKTPPLVLHFDGGAAMA 280
Query: 420 LPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
LP +NYF ++CL + A P P I+G+ Q QN ++ FD+ N +F FA
Sbjct: 281 LPRDNYFQEPRAGLMCLAV----ARSPEDLGAPISIIGNVQQQNMHVLFDVHNQKFSFAP 336
Query: 480 QKC 482
KC
Sbjct: 337 TKC 339
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 113/391 (28%), Positives = 162/391 (41%), Gaps = 55/391 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y ++L GTP T I DTGS L W +C CN + P + P F P +SS+
Sbjct: 125 YVVTLGIGTPAVQQTVLI-DTGSDLSWV------QCKPCNASDCYPQKDPLFDPSKSSTF 177
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
I C + C + V+ GC+ P C Y ++YG G T G+ +ETL
Sbjct: 178 ATIPCASDACKQL---PVDGYDNGCTNNTSGMPPQC-GYAIEYGNGAITEGVYSTETLAL 233
Query: 223 -PSKTVPNFLAGCSILSDRQP-----AGIAGFGRSSESLPSQLGL---KKFSYCLLSRKF 273
S V +F GC SD+ G+ G G + ESL SQ FSYCL
Sbjct: 234 GSSAVVKSFRFGCG--SDQHGPYDKFDGLLGLGGAPESLVSQTASVYGGAFSYCL----- 286
Query: 274 DDAPVSSNLVLDT--GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
P++S T P S ++ G +TP + S FY V L I VG K
Sbjct: 287 --PPLNSGAGFLTLGAPNSTNNSNSGFVFTPMHA----FSPKIATFYVVTLTGISVGGKA 340
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
+ IP + G+ IVDSG+ T + ++A+ F M Y + S L
Sbjct: 341 LDIPPAVFAKGN------IVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPAD--SAL 392
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C++ +G +V +P++ L F GGA + L +V +L D A G G
Sbjct: 393 DTCYNFTGHGTVTVPKVALTFVGGATVDL-----------DVPSGVLVEDCLAFADAGDG 441
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ + + +D GF C
Sbjct: 442 SFGIIGNVNTRTIEVLYDSGKGHLGFRAGAC 472
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 130/488 (26%), Positives = 202/488 (41%), Gaps = 80/488 (16%)
Query: 23 AGAGSSAATVTVPLTPLSTKHYLHHSDSDPL---KILHSLASSSLSRARHLKTKTKPKTK 79
AGA +A TV L + D DP + L L ++ SRA + + +
Sbjct: 105 AGAARTATTVL----ELKRHSLVAIPDDDPAAHDRYLRRLLAADESRANSFQLRIR---N 157
Query: 80 DSNIGSNYSNSLIKTPLSV---HSYGGYSISLSFGT----PPQASTPFIFDTGSSLVWFP 132
D ++ + + PL+ Y +++ G P A+ I DTGS L W
Sbjct: 158 DRAAAASTQSGSAEVPLTSGIRFQTLNYVTTIALGGGSSGSPAANLTVIVDTGSDLTWVQ 217
Query: 133 CTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRN 192
C C C R P F P S++ + C C+ + C N
Sbjct: 218 CKP---CSACY-----AQRDPLFDPAGSATYAAVRCNASACAASLKAATGTP-GSCGGGN 268
Query: 193 KTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR----QPAGIAG 247
+ C Y L YG G F+ G+L ++T+ ++ F+ GC LS+R AG+ G
Sbjct: 269 ERC-----YYALAYGDGSFSRGVLATDTVALGGASLDGFVFGCG-LSNRGLFGGTAGLMG 322
Query: 248 FGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY 304
GR+ SL SQ L+ FSYCL + DA S +L L S + TP ++YT
Sbjct: 323 LGRTELSLVSQTALRYGGVFSYCLPATTSGDA--SGSLSLGGDASSYRNTTP-VAYTRMI 379
Query: 305 KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL 364
+P A FY++ + VG + L G V++DSG+ T + +
Sbjct: 380 ADP-----AQPPFYFLNVTGAAVGG-------TALAAQGLGASNVLIDSGTVITRLAPSV 427
Query: 365 FEAVAKEFIRQMGNYSRAADVEKKSG---LRPCFDISGKKSVYLPELILKFKGGAKMALP 421
+ V EF RQ AA G L C+D++G V +P L L+ +GGA++ +
Sbjct: 428 YRGVRAEFTRQFA----AAGYPTAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGAEVTVD 483
Query: 422 PENYFALV---GNEVLCLIL----FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
+V G++V CL + + D I+G++Q +N + +D R
Sbjct: 484 AAGMLFVVRKDGSQV-CLAMASLSYEDQTP----------IIGNYQQKNKRVVYDTVGSR 532
Query: 475 FGFAKQKC 482
GFA + C
Sbjct: 533 LGFADEDC 540
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 162/391 (41%), Gaps = 59/391 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +++ GTP T +FDTGS W C CV + + F P RSS
Sbjct: 184 GNYVVTIGLGTPAGRYT-VVFDTGSDTTWVQCEP---CVVVCYEQQEK----LFDPARSS 235
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ I C P CS ++ KGCS + Y +QYG G ++ G +TL
Sbjct: 236 TDANISCAAPACSDLY-------TKGCSGGHCL-------YGVQYGDGSYSIGFFAMDTL 281
Query: 221 RFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKF 273
S + F GC ++ + AG+ G GR SLP Q K F++C +R
Sbjct: 282 TLSSYDAIKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARS- 340
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
S LD GPGS + + L+ N + FYYVGL I VG K +
Sbjct: 341 -----SGTGYLDFGPGSSPAVSTKLTTPMLVDNGL-------TFYYVGLTGIRVGGKLLS 388
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGL 391
IP S G IVDSG+ T + + ++ F + Y +A + S L
Sbjct: 389 IPPSVFT-----TAGTIVDSGTVITRLPPAAYSSLRSAFASAIAARGYKKAPAL---SLL 440
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C+D +G V +P + L F+GGA + + CL F N +G
Sbjct: 441 DTCYDFTGMSQVAIPTVSLLFQGGASLDVDASGIIYAASVSQACL-GFAANEEDDDVG-- 497
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ QL+ F + +D+ GF+ C
Sbjct: 498 ---IVGNTQLKTFGVVYDIGKKVVGFSPGAC 525
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 110/392 (28%), Positives = 169/392 (43%), Gaps = 65/392 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G PP + + DTGS + W C C +C + DP F P S+
Sbjct: 147 GEYFLRVGIGKPPSQAY-VVLDTGSDVSWIQCAP---CSEC-YQQSDP----IFDPVSSN 197
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S I C P+C + S C RN TC Y + YG G +T G +ET+
Sbjct: 198 SYSPIRCDAPQCKSL----DLSEC-----RNGTC-----LYEVSYGDGSYTVGEFATETV 243
Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD--- 274
+ V N GC ++ AG+ G G S P+Q+ FSYCL++R D
Sbjct: 244 TLGTAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVNRDSDAVS 303
Query: 275 ----DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
++P+ N+V P +NP FYY+GL+ I VG +
Sbjct: 304 TLEFNSPLPRNVVT----------------APLRRNP-----ELDTFYYLGLKGISVGGE 342
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+ IP S + G GG+I+DSG+ T + +++A+ F++ +A V S
Sbjct: 343 ALPIPESIFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGV---SL 399
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
C+D+S ++SV +P + F G ++ LP NY V + F + +
Sbjct: 400 FDTCYDLSSRESVQVPTVSFHFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLS--- 456
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q Q + FD+AN GF+ C
Sbjct: 457 ----IMGNVQQQGTRVGFDIANSLVGFSADSC 484
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 113/391 (28%), Positives = 164/391 (41%), Gaps = 61/391 (15%)
Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y + GTP Q P + DT S + W PC+ CV C PS AF P +S+
Sbjct: 99 YIVKALIGTPAQ---PLLLAMDTSSDVAWIPCSG---CVGC------PSNT-AFSPAKST 145
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
S + + C P+C + P +R AC S+ L YG A L +T+R
Sbjct: 146 SFKNVSCSAPQCKQVPNPTCGAR-------------AC-SFNLTYGSSSIAANLSQDTIR 191
Query: 222 FPSKTVPNFLAGC--------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKF 273
+ + F GC +I + G+ S S + FSYCL S F
Sbjct: 192 LAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPS--F 249
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
S +L L GP S + + YT +NP SS YYV L I VG K V
Sbjct: 250 RSLTFSGSLRL--GPTSQPQR---VKYTQLLRNPRRSS-----LYYVNLVAIRVGRKVVD 299
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
+P + + G I DSG+ +T + P++EAV EF +++ + A V G
Sbjct: 300 LPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTT--AVVTSLGGFDT 357
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGP 452
C+ SG+ V +P + FK G M +P +N CL + AA P
Sbjct: 358 CY--SGQ--VKVPTITFMFK-GVNMTMPADNLMLHSTAGSTSCLAM----AAAPENVNSV 408
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
++ Q QN + D+ N R G A+++C+
Sbjct: 409 VNVIASMQQQNHRVLIDVPNGRLGLARERCS 439
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 124/437 (28%), Positives = 175/437 (40%), Gaps = 79/437 (18%)
Query: 12 FSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLK 71
F ++ LL ++AATV + LT H+D+ LA+ L + L+
Sbjct: 6 FVIVTLLAALAISRCNAAATVRMQLT---------HADAG-----RGLAARELMQRMALR 51
Query: 72 TKTKPKTK------DSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
+K + + Y N + T VH L+ GTPPQ DTG
Sbjct: 52 SKARAARRLSSSASAPVSPGTYDNGVPTTEYLVH--------LAIGTPPQ-PVQLTLDTG 102
Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG-----CQNPKCSWIFGPN 180
S L+W C C D P DPS S+ G C +PK F PN
Sbjct: 103 SDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLCQGLPVASCGSPK----FWPN 158
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF--PSKTVPNFLAGCSIL 237
+TC Y YG T G L + F +VP GC +
Sbjct: 159 ------------QTC-----VYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLF 201
Query: 238 SD----RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
++ GIAGFGR SLPSQL + FS+C + + S ++LD S
Sbjct: 202 NNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAV---NGLKPSTVLLDLPADLYKS 258
Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDS 353
+ TP +NP + FYY+ L+ I VGS + +P S +G GG I+DS
Sbjct: 259 GRGAVQSTPLIQNPANPT-----FYYLSLKGITVGSTRLPVPESEFAL-KNGTGGTIIDS 312
Query: 354 GSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG--KKSVYLPELILK 411
G+ T + ++ V F Q+ V + P F +S + Y+P+L+L
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQV-----KLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLH 367
Query: 412 FKGGAKMALPPENYFAL 428
F+ GA M LP ENY L
Sbjct: 368 FE-GATMDLPRENYVWL 383
>gi|413923876|gb|AFW63808.1| hypothetical protein ZEAMMB73_793799 [Zea mays]
Length = 415
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 104/405 (25%), Positives = 166/405 (40%), Gaps = 64/405 (15%)
Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
S GG P+ + + DTGS++ W T+ C R
Sbjct: 51 SGGGCHYRFELTHRPKDNISAVVDTGSNIFW---TTEKEC------------------SR 89
Query: 160 SSSSQLIGCQNPKCSWIFGPNVE-SRCKGCSPRNKTCPLACPSYLLQYGLGF---TAGLL 215
S + ++ C +PKC S K + + C +Y ++YG TAG+L
Sbjct: 90 SKTRSMLPCCSPKCEQRASCGCRRSELKAEAEKETKC-----TYAIKYGGNANDSTAGVL 144
Query: 216 LSETLRF---PSKTVP------NFLAGCSI-----LSDRQPAGIAGFGRSSESLPSQLGL 261
+ L SK VP GCS D G+ G GRS+ SLP QL
Sbjct: 145 YEDKLTIVAVASKAVPGSQSFEEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNF 204
Query: 262 KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
KFSYCL S + D P S L+L P + +S + Y+V
Sbjct: 205 SKFSYCLSSYQKPDLP--SYLLLTAAPDMATGAV--GGAAAVATTALQPNSDYKTRYFVD 260
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
L+ I +G ++P + G + VD+G++FT +EG +F + E R M
Sbjct: 261 LQGISIGG--TRLP----AVSTKSGGNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKY 314
Query: 382 AADVEKKSGLRPCF---DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
+ ++ + C+ + +S LP+++L F A M LP ++Y ++ LCL +
Sbjct: 315 VKEQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTTSK-LCLAI 373
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N +G +LG+FQ+QN ++ D N++ F + C+
Sbjct: 374 DKSNI------KGGISVLGNFQMQNTHMLLDTGNEKLSFVRADCS 412
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 127/460 (27%), Positives = 185/460 (40%), Gaps = 68/460 (14%)
Query: 35 PLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSN--SLI 92
P +PL+ H L + ++ +RA+ ++ + T S G N SL
Sbjct: 97 PCSPLADAH------DGKLPSHEEILAADQNRAKSIQRRVSTTTTVSR-GKPKRNRPSLP 149
Query: 93 KTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRI 152
+ S G Y +++ GTP T +FDTGS W C CV + +
Sbjct: 150 ASSGSALGTGNYVVTIGLGTPAGRYT-VVFDTGSDTTWVQCEP---CVVVCYKQQEK--- 202
Query: 153 PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FT 211
F P RSS+ I C P CS ++ KGCS + Y +QYG G ++
Sbjct: 203 -LFDPARSSTYANISCAAPACSDLY-------IKGCSGGHCL-------YGVQYGDGSYS 247
Query: 212 AGLLLSETLRFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---F 264
G +TL S + F GC ++ + AG+ G GR SLP Q K F
Sbjct: 248 IGFFAMDTLTLSSYDAIKGFRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVF 307
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
++C +R S LD GPGS + + L+ N FYYVGL
Sbjct: 308 AHCFPARS------SGTGYLDFGPGSLPAVSAKLTTPMLVDNGP-------TFYYVGLTG 354
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN--YSRA 382
I VG K + IP S G IVDSG+ T + + ++ F M Y +A
Sbjct: 355 IRVGGKLLSIPQSVFT-----TSGTIVDSGTVITRLPPAAYSSLRSAFASAMAERGYKKA 409
Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
+ S L C+D +G V +P + L F+GGA + + CL F N
Sbjct: 410 PAL---SLLDTCYDFTGMSEVAIPTVSLLFQGGASLDVHASGIIYAASVSQACL-GFAGN 465
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+G I+G+ QL+ F + +D+ GF C
Sbjct: 466 KEDDDVG-----IVGNTQLKTFGVVYDIGKKVVGFCPGAC 500
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 158/391 (40%), Gaps = 44/391 (11%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + S GTP Q I DTGS L + C C + + P PS F P
Sbjct: 32 GQYFVDFSLGTPEQKFH-LIVDTGSDLAFVQCAPCDLCYEQDGPLYQPSNSSTFTP---- 86
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
+ C + +C I P V + C P + P SY +YG T G+ ET
Sbjct: 87 ----VPCDSAECLLIPAP-VGAPCSSSYPESP--PQGACSYEYRYGDNSSTVGVFAYETA 139
Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFD 274
V + GC + G+ G G+ + S SQ G KF+YCL S
Sbjct: 140 TVGGIRVNHVAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAYCLTSY-LS 198
Query: 275 DAPVSSNLVLDTGPGSGD---SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
V S+L+ GD S L +TP NP+ S YYV + +I G +
Sbjct: 199 PTSVFSSLIF------GDDMMSTIHDLQFTPLVSNPLNPS-----VYYVQIVRICFGGET 247
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
+ IP S S GNGG I DSG+T T+ + + F + + Y RA GL
Sbjct: 248 LLIPDSAWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSV-PYPRAP--PSPQGL 304
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C ++SG P ++F GA NYF V + CL + ++ G
Sbjct: 305 PLCVNVSGIDHPIYPSFTIEFDQGATYRPNQGNYFIEVSPNIDCLAMLESSSDG------ 358
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++G+ QN+ +++D R GFA C
Sbjct: 359 -FNVIGNIIQQNYLVQYDREEHRIGFAHANC 388
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 116/435 (26%), Positives = 180/435 (41%), Gaps = 61/435 (14%)
Query: 56 LHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQ 115
+H S L + + K + + GS+ + + + G Y + + G+PP+
Sbjct: 1 MHRDVKRVASLIHRLSSGSAAKYEVEDFGSDVVSGMNQGS------GEYFVRIGLGSPPR 54
Query: 116 ASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSW 175
S + D+GS +VW C +C P DP+ +F+ SS+ +N C+
Sbjct: 55 -SQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDRVENAGCN- 112
Query: 176 IFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGC 234
RC+ Y + YG G +T G L ETL F V N GC
Sbjct: 113 ------SGRCR---------------YEVSYGDGSYTKGTLALETLTFGRTVVRNVAIGC 151
Query: 235 SILSDR----QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTG 287
S+R AG+ G G S S QL + FSYCL+SR ++N L+ G
Sbjct: 152 G-HSNRGMFVGAAGLLGLGGGSMSFMGQLSGQTGNAFSYCLVSRG-----TNTNGFLEFG 205
Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
++ G ++ P +NP S FYY+ L + VG V + G+G
Sbjct: 206 S---EAMPVGAAWIPLVRNPRAPS-----FYYIRLLGLGVGDTRVPVSEDVFQLNELGSG 257
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
GV++D+G+ T +EA FI Q N RA+ V S C+++ G SV +P
Sbjct: 258 GVVMDTGTAVTRFPTVAYEAFRNAFIEQTQNLPRASGV---SIFDTCYNLFGFLSVRVPT 314
Query: 408 LILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLE 467
+ F GG + +P N+ V + F + +G + ILG+ Q + +
Sbjct: 315 VSFYFSGGPILTIPANNFLIPVDDAGTFCFAFAPSPSGLS-------ILGNIQQEGIQIS 367
Query: 468 FDLANDRFGFAKQKC 482
D AN+ GF C
Sbjct: 368 VDEANEFVGFGPNIC 382
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 101/372 (27%), Positives = 159/372 (42%), Gaps = 52/372 (13%)
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
I DT S L W C C D P DP+ P++ ++ C + C +
Sbjct: 141 IVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYA--------VLPCNSSSCDALQVAT 192
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSD 239
+ +C SY L Y G ++ G+L + L + + F+ GC S+
Sbjct: 193 GSAAGACGGGEQPSC-----SYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGT-SN 246
Query: 240 RQP----AGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
+ P +G+ G GRS SL SQ + FSYCL ++ S +LVL
Sbjct: 247 QGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCL---PLKESESSGSLVLGDDTSVYR 303
Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVD 352
+ TP + YT +PV G FY+V L I +G + V+ G VIVD
Sbjct: 304 NSTP-IVYTTMVSDPVQ-----GPFYFVNLTGITIGGQEVE----------SSAGKVIVD 347
Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
SG+ T + ++ AV EF+ Q Y +A S L CF+++G + V +P L F
Sbjct: 348 SGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGF---SILDTCFNLTGFREVQIPSLKFVF 404
Query: 413 KGGAKMALPPEN--YFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDL 470
+G ++ + YF + +CL L + + I+G++Q +N + FD
Sbjct: 405 EGNVEVEVDSSGVLYFVSSDSSQVCLALASLKS------EYETSIIGNYQQKNLRVIFDT 458
Query: 471 ANDRFGFAKQKC 482
+ GFA++ C
Sbjct: 459 LGSQIGFAQETC 470
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 115/414 (27%), Positives = 165/414 (39%), Gaps = 67/414 (16%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
L H +IS++ GTPPQ + + DTGS L W C + P P F
Sbjct: 58 LRFHHNVSLTISITVGTPPQ-NMSMVIDTGSELSWLHCNTN---TTATIP------YPFF 107
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSY-LLQYGLGF---- 210
P SSS I C +P C+ R+ P +C S L L +
Sbjct: 108 NPNISSSYTPISCSSPTCT-------------TRTRDFPIPASCDSNNLCHATLSYADAS 154
Query: 211 -TAGLLLSETLRFPSKTVPNFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLK 262
+ G L S+T F S P + GC + SD G+ G S SL SQL +
Sbjct: 155 SSEGNLASDTFGFGSSFNPGIVFGCMNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIP 214
Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKN----PVGSSSAFGEFY 318
KFSYC+ F S L+L S S L+YTP + P SA Y
Sbjct: 215 KFSYCISGSDF-----SGILLLGE---SNFSWGGSLNYTPLVQISTPLPYFDRSA----Y 262
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
V L I + K + I + VP G G + D G+ F+++ GP++ A+ EF+ Q
Sbjct: 263 TVRLEGIKISDKLLNISGNLFVPDHTGAGQTMFDLGTQFSYLLGPVYNALRDEFLNQTNG 322
Query: 379 YSRAADVEK---KSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALVG--- 430
RA D + + C+ + +S LP + L F+G + + + G
Sbjct: 323 TLRALDDPNFVFQIAMDLCYRVPVNQSELPELPSVSLVFEGAEMRVFGDQLLYRVPGFVW 382
Query: 431 --NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ V C + G A I+G Q+ ++EFDL R G A +C
Sbjct: 383 GNDSVYCFTFGNSDLLGVE-----AFIIGHHHQQSMWMEFDLVEHRVGLAHARC 431
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 123/473 (26%), Positives = 198/473 (41%), Gaps = 76/473 (16%)
Query: 28 SAATVTVPLT----PLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNI 83
S+AT++VPL P + Y SD P S +R ++K++ + +
Sbjct: 51 SSATLSVPLVHRYGPCAASQY---SDM-PTPSFSETLRHSRARTNYIKSRAS-----TGM 101
Query: 84 GSNYSNSLIKTPLSVHSYGG---YSISLSFGTP--PQASTPFIFDTGSSLVWFPCTSRYR 138
S ++ + P + + Y ++L FGTP PQ + DTGS + W +
Sbjct: 102 ASTPDDAAVTVPTRLGGFVDSLEYMVTLGFGTPSVPQV---LLMDTGSDVSWV------Q 152
Query: 139 CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA 198
C CN P + P F P +SS+ I C C+ + + GC+ C
Sbjct: 153 CAPCNSTECYPQKDPLFDPSKSSTYAPIACGADACNKLG----DHYRNGCTSGGTQC--- 205
Query: 199 CPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSILSDRQPA----GIAGFGRSS 252
Y ++YG G T G+ +ET+ F P TV +F GC R P+ G+ G G +
Sbjct: 206 --GYRVEYGDGSSTRGVYSNETITFAPGITVKDFHFGCG-HDQRGPSDKFDGLLGLGGAP 262
Query: 253 ESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVG 309
ESL Q FSYCL + + + L L P S + T +TP + P+
Sbjct: 263 ESLVVQTASVYGGAFSYCLPALNSE----AGFLALGVRP-SAATNTSAFVFTPMWHLPMD 317
Query: 310 SSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVA 369
++S Y V + I VG K + IP S GG+++DSG+ T + + A+
Sbjct: 318 ATS-----YMVNMTGISVGGKPLDIPRSAF------RGGMLIDSGTIVTELPETAYNALN 366
Query: 370 KEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
+ Y A + + C++ +G +V +P + L F GGA + L V
Sbjct: 367 AALRKAFAAYPMVASEDFDT----CYNFTGYSNVTVPRVALTFSGGATIDLD-------V 415
Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
N +L +GP +G G I+G+ + + +D + + GF C
Sbjct: 416 PNGILVKDCLAFRESGPDVGLG---IIGNVNQRTLEVLYDAGHGKVGFRAGAC 465
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 111/388 (28%), Positives = 153/388 (39%), Gaps = 71/388 (18%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G+PP + D+GS ++W C +C + DP F P SS
Sbjct: 128 GEYFVRVGVGSPP-TDQYLVVDSGSDVIWVQCRPCEQC----YAQTDP----LFDPAASS 178
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C + C + G C Y + YG G +T G L ETL
Sbjct: 179 SFSGVSCGSAICR-----TLSGTGCGGGGDAGKC-----DYSVTYGDGSYTKGELALETL 228
Query: 221 RFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFD 274
V GC + AG+ G G + SL QLG FSYCL SR
Sbjct: 229 TLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASR--- 285
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
G+G G+ S FYYVGL I VG + + +
Sbjct: 286 --------------GAG-----------------GAGSLASSFYYVGLTGIGVGGERLPL 314
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
S DG GGV++D+G+ T + + A+ F MG R+ V S L C
Sbjct: 315 QDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAV---SLLDTC 371
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
+D+SG SV +P + F GA + LP N VG V CL F +++G +
Sbjct: 372 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFCLA-FAPSSSGIS------- 423
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
ILG+ Q + + D AN GF C
Sbjct: 424 ILGNIQQEGIQITVDSANGYVGFGPNTC 451
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 121/473 (25%), Positives = 188/473 (39%), Gaps = 77/473 (16%)
Query: 35 PLTPLSTKH-------YLHHSDSDPLKILHSLASSSLSRARHLKT--------KTKPKTK 79
P +PL+ H + +D + ++ + S++ R + K K P
Sbjct: 79 PCSPLADAHGKPPAHDEILAADQNRVESIQRRVSATTGRDKLTKHAAPVQPGPKKSPGIH 138
Query: 80 DSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC 139
+ S+ + SL T S G Y +++ GTP T +FDTGS W C R
Sbjct: 139 PGHSASSSTPSLPATSGRAVSTGNYVVTVGLGTPASKYT-VVFDTGSDTTWVQC--RPCV 195
Query: 140 VDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLAC 199
V C + P F P +SS+ + C + C+ + GC+ +
Sbjct: 196 VKCY-----KQKEPLFDPAKSSTYANVSCTDSACADL-------DTNGCTGGHCL----- 238
Query: 200 PSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESL 255
Y +QYG G +T G +TL + F GC ++ + AG+ G GR SL
Sbjct: 239 --YAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMGLGRGKTSL 296
Query: 256 PSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
Q K F+YCL A + LD GPGS + TP + +
Sbjct: 297 TVQAYNKYGGAFAYCL------PALTTGTGYLDFGPGSAGNNA---RLTPMLTDKGQT-- 345
Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
FYYVG+ I VG + V + S G +VDSG+ T + + A++ F
Sbjct: 346 ----FYYVGMTGIRVGGQQVPVAESVF-----STAGTLVDSGTVITRLPATAYTALSSAF 396
Query: 373 IRQMGNYSRAADVEKKSG---LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
+ M A +K G L C+D +G V LP + L F+GGA + + +
Sbjct: 397 DKVM----LARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYAI 452
Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+CL F N ++ I+G+ Q + + + +DL GFA C
Sbjct: 453 SEAQVCLA-FASNGDDESVA-----IVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 106/387 (27%), Positives = 157/387 (40%), Gaps = 55/387 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y +++ GTP S + DTGS L W +C CN P + P F P RSS+
Sbjct: 120 YVVTVGLGTP-AVSQVLLIDTGSDLSWV------QCAPCNSTTCYPQKDPLFDPSRSSTY 172
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
I C C + S C S C Y + YG G T G+ +ETL
Sbjct: 173 APIPCNTDACRDLTRDGYGSDCTSGSGGGAQC-----GYAITYGDGSQTTGVYSNETLTM 227
Query: 223 -PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDD 275
P TV +F GC D + G+ G G + ESL Q FSYCL
Sbjct: 228 APGVTVKDFHFGCGHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCL------- 280
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
P +++ G+ + G +TP + FY V + I VG + + +P
Sbjct: 281 -PAANDQAGFLALGAPVNDASGFVFTPMVREQQ-------TFYVVNMTGITVGGEPIDVP 332
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
S +GG+I+DSG+ T ++ + A+ F + M Y + E L C+
Sbjct: 333 PSAF------SGGMIIDSGTVVTELQHTAYAALQAAFRKAMAAYPLLPNGE----LDTCY 382
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
+ +G +V +P + L F GGA + L V + +L AGP G I
Sbjct: 383 NFTGHSNVTVPRVALTFSGGATVDLD-------VPDGILLDNCLAFQEAGPDNQPG---I 432
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
LG+ + + +D+ + R GF C
Sbjct: 433 LGNVNQRTLEVLYDVGHGRVGFGADAC 459
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 110/390 (28%), Positives = 168/390 (43%), Gaps = 71/390 (18%)
Query: 123 DTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
DTGS ++W PC+ R N P + + P+ SS++ L+ C +P C + G
Sbjct: 47 DTGSDVLWVNCRPCSGCPRKSALNIP------LTMYDPRESSTTSLVSCSDPLC--VRGR 98
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFP-------SKTVPNFL 231
CS C Y+ YG G T+ G + + +++ + T L
Sbjct: 99 RFAE--AQCSQTTNNC-----EYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTSQVL 151
Query: 232 AGCSI-----LSDRQPA--GIAGFGRSSESLPSQLGLKK-----FSYCLLSRKFDDAPVS 279
GCSI LS Q A GI GFG+ S+P+QL ++ FS+CL K +
Sbjct: 152 FGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGGGILV 211
Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
+ + PG++YTP + V Y V LR I V S ++P
Sbjct: 212 IGGIAE----------PGMTYTPLVPDSV--------HYNVVLRGISVNSN--RLPIDAE 251
Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG 399
S + GVI+DSG+T + + F++ + + A V + CF +SG
Sbjct: 252 DFSSTNDTGVIMDSGTTLAYFPSGAYNV----FVQAIREATSATPVRVQGMDTQCFLVSG 307
Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVG------NEVLCL-ILFTDNAAGPALGRGP 452
+ S P + L F+GGA M L P+NY G +V C+ + ++AGP G
Sbjct: 308 RLSDLFPNVTLNFEGGA-MELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDG-SQ 365
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
ILGD L++ + +DL N R G+ C
Sbjct: 366 LTILGDIVLKDKLVVYDLDNSRIGWMSYNC 395
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 111/389 (28%), Positives = 167/389 (42%), Gaps = 69/389 (17%)
Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIP--AFIPKRSSSSQLIGCQNPKCSWIFGPN 180
DTGS ++W C C C P IP + P+ SS++ L+ C +P C + G
Sbjct: 20 DTGSDVLWVNCRP---CSGC--PRKSALNIPLTMYDPRESSTTSLVSCSDPLC--VRGRR 72
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFP-------SKTVPNFLA 232
CS C Y+ YG G T+ G + + +++ + T L
Sbjct: 73 FAE--AQCSQATNNC-----EYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTSQVLF 125
Query: 233 GCSI-----LSDRQPA--GIAGFGRSSESLPSQLGLKK-----FSYCLLSRKFDDAPVSS 280
GCSI LS Q A GI GFG+ S+P+QL ++ FS+CL K +
Sbjct: 126 GCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGGGILVI 185
Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
+ + PG++YTP + V Y V LR I V S ++P
Sbjct: 186 GGIAE----------PGMTYTPLVPDSV--------HYNVVLRGISVNSN--RLPIDAED 225
Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGK 400
S + GVI+DSG+T + + F++ + + A V + CF +SG+
Sbjct: 226 FSSTNDTGVIMDSGTTLAYFPSGAYNV----FVQAIREATSATPVRVQGMDTQCFLVSGR 281
Query: 401 KSVYLPELILKFKGGAKMALPPENYFALVG------NEVLCL-ILFTDNAAGPALGRGPA 453
S P + L F+GGA M L P+NY G +V C+ + ++AGP G
Sbjct: 282 LSDLFPNVTLNFEGGA-MELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDG-SQL 339
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
ILGD L++ + +DL N R G+ C
Sbjct: 340 TILGDIVLKDKLVVYDLDNSRIGWMSYNC 368
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 126/459 (27%), Positives = 187/459 (40%), Gaps = 65/459 (14%)
Query: 38 PLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNY--SNSLIKTP 95
P ST L H D+ + LA+S R + + K G ++ +SL P
Sbjct: 65 PFST--VLTHDDARVAHLASRLAASDPPSRRPTSLRKQKKAAGGASGGHHLDDDSLASVP 122
Query: 96 LSVHS---YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRI 152
LS + G Y L GTP S + DTGSSL W C+ CV V
Sbjct: 123 LSPGTSVGVGNYVTQLGLGTP-STSYAMVVDTGSSLTWLQCSP---CVVSCHRQVG---- 174
Query: 153 PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFT 211
P F P+ SS+ + C +C + + CS N Y YG F+
Sbjct: 175 PLFDPRASSTYTSVRCSASQCDELQAATLNP--SACSASNVCI------YQASYGDSSFS 226
Query: 212 AGLLLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFS 265
G L ++T+ F S + P+F GC ++ + AG+ G R+ SL QL FS
Sbjct: 227 VGYLSTDTVSFGSTSYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFS 286
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
YCL + +S L GP + SYT P+ SSS Y++ L +
Sbjct: 287 YCLPT-------AASTGYLSIGPYNTGHY---YSYT-----PMASSSLDASLYFITLSGM 331
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
VG + + P + I+DSG+ T + + A++K + M RA
Sbjct: 332 SVGGSPLAVS-----PSEYSSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRA--- 383
Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF-TDNAA 444
S L CF+ + + +P +++ F GGA M L N V + CL TD+ A
Sbjct: 384 PAFSILDTCFEGQASQ-LRVPTVVMAFAGGASMKLTTRNVLIDVDDSTTCLAFAPTDSTA 442
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G+ Q Q F + +D+A R GF+ C+
Sbjct: 443 ----------IIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 129/439 (29%), Positives = 195/439 (44%), Gaps = 72/439 (16%)
Query: 61 SSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPF 120
+SS+ R L++K K ++G+ +SLI P + S G+ ++LS G+PP +
Sbjct: 68 TSSIERFDFLESKIKEL---KSVGNEARSSLI--PFNRGS--GFLVNLSIGSPP-VTQLV 119
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
+ DTGSSL+W C C++C F P +S S + +GC P ++I G
Sbjct: 120 VVDTGSSLLWVQCLP---CINCF-----QQSTSWFDPLKSVSFKTLGCGFPGYNYINGYK 171
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGLLLSETLRFPSKTV-----PNFLAGC 234
C+ N+ Y L+Y G + G+L E+L F + N GC
Sbjct: 172 -------CNRFNQ------AEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKSNITFGC 218
Query: 235 SILS-----DRQPAGIAGFGRSSE-SLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGP 288
++ D G+ G G ++ +QLG KFSYC+ + P+ ++ L G
Sbjct: 219 GHMNIKTNNDDAYNGVFGLGAYPHITMATQLG-NKFSYCIGDI---NNPLYTHNHLVLGQ 274
Query: 289 GS---GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG 345
GS GDS TP + FG YYV L+ I VGSK +KI + SDG
Sbjct: 275 GSYIEGDS-------TPLQIH-------FGH-YYVTLQSISVGSKTLKIDPNAFKISSDG 319
Query: 346 NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPCFD-ISGKKSV 403
+GGV++DSG T+T + FE + E + M G R K GL CF + + V
Sbjct: 320 SGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGL--CFKGVVSRDLV 377
Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
P + F GGA + L + F G + CL + N+ L ++G QN
Sbjct: 378 GFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLNLS-----VIGILAQQN 432
Query: 464 FYLEFDLANDRFGFAKQKC 482
+ + FDL + F + C
Sbjct: 433 YNVGFDLEQMKVFFRRIDC 451
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 126/408 (30%), Positives = 178/408 (43%), Gaps = 71/408 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--FIPKR 159
G Y L GTPP+ I DTGS ++W C S C C P IP F P
Sbjct: 50 GLYYTRLQLGTPPRDFYVQI-DTGSDVLWVSCGS---CNGC--PVNSGLHIPLNFFDPGS 103
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSE 218
S ++ LI C + +CS ++S CS +N C Y QYG G T+G +S+
Sbjct: 104 SPTASLISCSDQRCSL----GLQSSDSVCSAQNNLC-----GYNFQYGDGSGTSGYYVSD 154
Query: 219 TLRFPS---KTVPN-----FLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL-- 261
L F + +V N + GCS L SDR GI GFG+ S+ SQL
Sbjct: 155 LLHFDTVLGGSVMNNSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQG 214
Query: 262 ---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
+ FS+CL K DD+ LVL G+ P + YTP + Y
Sbjct: 215 ISPRAFSHCL---KGDDSG-GGILVL------GEIVEPNIVYTPLVPSQ--------PHY 256
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
+ ++ I V + + I S V G+ + G I+DSG+T + L EA FI + +
Sbjct: 257 NLNMQSISVNGQTLAIDPS--VFGTSSSQGTIIDSGTTLAY----LAEAAYDPFISAITS 310
Query: 379 YSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---ALVGNEVLC 435
+ S C+ IS + P++ L F GGA M L P++Y + +G L
Sbjct: 311 IVSPSVRPYLSKGNHCYLISSSINDIFPQVSLNFAGGASMILIPQDYLIQQSSIGGAALW 370
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I F G+G ILGD L++ +D+AN R G+A C+
Sbjct: 371 CIGFQ-----KIQGQG-ITILGDLVLKDKIFVYDIANQRIGWANYDCS 412
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 109/396 (27%), Positives = 177/396 (44%), Gaps = 71/396 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
+ +++ FGTP Q T IFDTGS + W +C+ C+ + P F P +S++
Sbjct: 135 FVVTVGFGTPAQTYT-VIFDTGSDVSWI------QCLPCS-GHCYKQHDPIFDPTKSATY 186
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT-AGLLLSETLRF 222
++ C +P+C+ G S+C N TC Y ++YG G + AG+L ETL
Sbjct: 187 SVVPCGHPQCAAADG----SKCS-----NGTC-----LYKVEYGDGSSSAGVLSHETLSL 232
Query: 223 PS-KTVPNFLAGC--SILSD-RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDD 275
S + +P F GC + L D G+ G GR SL SQ FSYCL S D+
Sbjct: 233 TSTRALPGFAFGCGQTNLGDFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPS---DN 289
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
+ T P S D + YT + + FY+V L I +G + +P
Sbjct: 290 TTHGYLTIGPTTPASNDD----VQYTAMVQK-----QDYPSFYFVELVSIDIGGYILPVP 340
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
+ + G +DSG+ T++ + A+ F M Y A + C+
Sbjct: 341 PTLFT-----DDGTFLDSGTILTYLPPEAYTALRDRFKFTMTQYKPAPAYDP---FDTCY 392
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG------ 449
D +G+ ++++P + KF G+ L ++F + ++F D+ A PA+G
Sbjct: 393 DFTGQSAIFIPAVSFKFSDGSVFDL---SFFGI--------LIFPDDTA-PAIGCLGFVA 440
Query: 450 ---RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
P I+G+ Q +N + +D+A ++ GFA C
Sbjct: 441 RPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 121/473 (25%), Positives = 188/473 (39%), Gaps = 77/473 (16%)
Query: 35 PLTPLSTKH-------YLHHSDSDPLKILHSLASSSLSRARHLKT--------KTKPKTK 79
P +PL+ H + +D + ++ + S++ R + K K P
Sbjct: 79 PCSPLADAHGKPPAHDEILAADQNRVESIQRRVSATTGRDKLTKHAAPVQPGPKKSPGIH 138
Query: 80 DSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC 139
+ S+ + SL T S G Y +++ GTP T +FDTGS W C R
Sbjct: 139 PGHSASSSTPSLPATSGRAVSTGNYVVTVGLGTPASKYT-VVFDTGSDTTWVQC--RPCV 195
Query: 140 VDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLAC 199
V C + P F P +SS+ + C + C+ + GC+ +
Sbjct: 196 VKCY-----KQKGPLFDPAKSSTYANVSCTDSACADL-------DTNGCTGGHCL----- 238
Query: 200 PSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESL 255
Y +QYG G +T G +TL + F GC ++ + AG+ G GR SL
Sbjct: 239 --YAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMGLGRGKTSL 296
Query: 256 PSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
Q K F+YCL A + LD GPGS + TP + +
Sbjct: 297 TVQAYNKYGGAFAYCL------PALTTGTGYLDFGPGSAGNNA---RLTPMLTDKGQT-- 345
Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
FYYVG+ I VG + V + S G +VDSG+ T + + A++ F
Sbjct: 346 ----FYYVGMTGIRVGGQQVPVAESVF-----STAGTLVDSGTVITRLPATAYTALSSAF 396
Query: 373 IRQMGNYSRAADVEKKSG---LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
+ M A +K G L C+D +G V LP + L F+GGA + + +
Sbjct: 397 DKVM----LARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYAI 452
Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+CL F N ++ I+G+ Q + + + +DL GFA C
Sbjct: 453 SEAQVCLA-FASNGDDESVA-----IVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 101/372 (27%), Positives = 159/372 (42%), Gaps = 52/372 (13%)
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
I DT S L W C C D P DP+ P++ ++ C + C +
Sbjct: 140 IVDTASELTWVQCAPCASCHDQQGPLFDPASSPSY--------AVLPCNSSSCDALQVAT 191
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSD 239
+ +C SY L Y G ++ G+L + L + + F+ GC S+
Sbjct: 192 GSAAGACGGGEQPSC-----SYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGT-SN 245
Query: 240 RQP----AGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
+ P +G+ G GRS SL SQ + FSYCL ++ S +LVL
Sbjct: 246 QGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCL---PLKESESSGSLVLGDDTSVYR 302
Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVD 352
+ TP + YT +PV G FY+V L I +G + V+ G VIVD
Sbjct: 303 NSTP-IVYTTMVSDPVQ-----GPFYFVNLTGITIGGQEVE----------SSAGKVIVD 346
Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
SG+ T + ++ AV EF+ Q Y +A S L CF+++G + V +P L F
Sbjct: 347 SGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGF---SILDTCFNLTGFREVQIPSLKFVF 403
Query: 413 KGGAKMALPPEN--YFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDL 470
+G ++ + YF + +CL L + + I+G++Q +N + FD
Sbjct: 404 EGNVEVEVDSSGVLYFVSSDSSQVCLALASLKS------EYETSIIGNYQQKNLRVIFDT 457
Query: 471 ANDRFGFAKQKC 482
+ GFA++ C
Sbjct: 458 LGSQIGFAQETC 469
>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 122/465 (26%), Positives = 195/465 (41%), Gaps = 95/465 (20%)
Query: 52 PLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFG 111
P ++L + S++ A ++ + D+ + + L TP Y ++++ G
Sbjct: 57 PARVLEAARRSTVRAAALSRSYVR---VDAPSADGFVSELTSTPFE------YLMAVNIG 107
Query: 112 TPPQASTPFIFDTGSSLVWFPCT--------SRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
TPP I DTGS L+W C+ + R D P V F P +S++
Sbjct: 108 TPPTRMVA-IADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQ------FDPSKSTTF 160
Query: 164 QLIGCQNPKCSWIFGPN--VESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+L+ C + CS + + +S+C+ Y YG G T+G+L +ET
Sbjct: 161 RLVDCDSVACSELPEASCGADSKCR---------------YSYSYGDGSHTSGVLSTETF 205
Query: 221 RFP----------SKTVPNFLAGCSI--LSDRQPAGIAGFGRSSESLPSQLGL-----KK 263
F + V N GCS + G+ G G SL SQLG ++
Sbjct: 206 TFADAPGARGDGTTTRVANVNFGCSTTFVGSSVGDGLVGLGGGDLSLVSQLGADTSLGRR 265
Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
FSYCL+ V ++ L+ GP + + PG TP + V + +Y V LR
Sbjct: 266 FSYCLVPYS-----VKASSALNFGPRAAVTD-PGAVTTPLIPSQVKA------YYIVELR 313
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
+ VG+K + P +IVDSG+T TF+ EA+ ++++ +
Sbjct: 314 SVKVGNKTFEAP---------DRSPLIVDSGTTLTFLP----EALVDPLVKELTGRIKLP 360
Query: 384 DVEKKSGLRP-CFDISGKK----SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
+ L P CFD+SG + + +P++ + GGA + L EN F V LCL
Sbjct: 361 PAQSPERLLPLCFDVSGVREGQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQEGTLCL-- 418
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A + PA I+G+ QN ++ +DL FA CA
Sbjct: 419 ----AVSAMSEQFPASIIGNIAQQNMHVGYDLDKGTVTFAPAACA 459
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 119/431 (27%), Positives = 179/431 (41%), Gaps = 61/431 (14%)
Query: 65 SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDT 124
+R + +K K ++ + S L S G Y +++ GTP + IFDT
Sbjct: 65 ARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTP-KNDLSLIFDT 123
Query: 125 GSSLVWFPCTSRYR-CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVES 183
GS L W C R C D + P F P +S+S + C + C + +
Sbjct: 124 GSDLTWTQCQPCVRTCYD--------QKEPIFNPSKSTSYYNVSCSSAACGSL--SSATG 173
Query: 184 RCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTV-PNFLAGCSILSDRQ 241
CS N Y +QYG F+ G L E + V GC + Q
Sbjct: 174 NAGSCSASNCI-------YGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCG--ENNQ 224
Query: 242 -----PAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
AG+ G GR S PSQ K FSYCL S A + +L + +G S
Sbjct: 225 GLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS----SASYTGHLTFGS---AGIS 277
Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIVD 352
++ + +TP G+S FY + + I VG + + IP + + PG+ ++D
Sbjct: 278 RS--VKFTPISTITDGTS-----FYGLNIVAITVGGQKLPIPSTVFSTPGA------LID 324
Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
SG+ T + + A+ F +M Y + V S L CFD+SG K+V +P++ F
Sbjct: 325 SGTVITRLPPKAYAALRSSFKAKMSKYPTTSGV---SILDTCFDLSGFKTVTIPKVAFSF 381
Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAN 472
GGA + L + F + +CL F N+ A I G+ Q Q + +D A
Sbjct: 382 SGGAVVELGSKGIFYVFKISQVCLA-FAGNS-----DDSNAAIFGNVQQQTLEVVYDGAG 435
Query: 473 DRFGFAKQKCA 483
R GFA C+
Sbjct: 436 GRVGFAPNGCS 446
>gi|223950045|gb|ACN29106.1| unknown [Zea mays]
Length = 392
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 102/391 (26%), Positives = 164/391 (41%), Gaps = 64/391 (16%)
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
P+ + + DTGS++ W T+ C RS + ++ C +PKC
Sbjct: 42 PKDNISAVVDTGSNIFW---TTEKEC------------------SRSKTRSMLPCCSPKC 80
Query: 174 SWIFGPNVE-SRCKGCSPRNKTCPLACPSYLLQYGLGF---TAGLLLSETLRF---PSKT 226
S K + + C +Y ++YG TAG+L + L SK
Sbjct: 81 EQRASCGCRRSELKAEAEKETKC-----TYAIKYGGNANDSTAGVLYEDKLTIVAVASKA 135
Query: 227 VP------NFLAGCSI-----LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD 275
VP GCS D G+ G GRS+ SLP QL KFSYCL S + D
Sbjct: 136 VPGSQSFEEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFSKFSYCLSSYQKPD 195
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
P S L+L P + + +S + Y+V L+ I +G ++P
Sbjct: 196 LP--SYLLLTAAPDM--ATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIGG--TRLP 249
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
G G + VD+G++FT +EG +F + E R M + ++ + C+
Sbjct: 250 AVSTKSG----GNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNNGQICY 305
Query: 396 ---DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
+ +S LP+++L F A M LP ++Y ++ LCL + N +G
Sbjct: 306 SPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTTSK-LCLAIDKSNI------KGG 358
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+LG+FQ+QN ++ D N++ F + C+
Sbjct: 359 ISVLGNFQMQNTHMLLDTGNEKLSFVRADCS 389
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 162/386 (41%), Gaps = 57/386 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + GTPPQ + DT + W PCT+ C F P++S++
Sbjct: 78 YIVRAKIGTPPQ-TLLLAMDTSNDAAWIPCTACDGCAST-----------LFAPEKSTTF 125
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ + C P+C + P C ++ ++ L YG A L+ +T+
Sbjct: 126 KNVSCAAPECKQVPNPG--------------CGVSSCNFNLTYGSSSIAANLVQDTITLA 171
Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
+ VP++ GC + + P G+ G GR SL SQ L FSYCL S F
Sbjct: 172 TDPVPSYTFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLN 229
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S +L L GP + + + YTP KNP SS YYV L I VG K V IP +
Sbjct: 230 FSGSLRL--GPVAQPKR---IKYTPLLKNPRRSS-----LYYVNLEAIRVGRKVVDIPPA 279
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
L G I DSG+ FT + P++ AV EF R++G V G C+++
Sbjct: 280 ALAFNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEFRRRVG---PKLTVTSLGGFDTCYNV 336
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
+ +P + F G + LP +N CL + A P ++
Sbjct: 337 ----PIVVPTITFIFT-GMNVTLPQDNILIHSTAGSTTCLAM----AGAPDNVNSVLNVI 387
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
+ Q QN + +D+ N R G A++ C
Sbjct: 388 ANMQQQNHRVLYDVPNSRVGVARELC 413
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 119/453 (26%), Positives = 173/453 (38%), Gaps = 51/453 (11%)
Query: 40 STKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVH 99
S H +H S PL+ + +LA +R L +K + + S S P
Sbjct: 26 SVYHNVHPSSPSPLESIIALARDDDARLLFLSSKAA----TAGVSSAPVASGQAPP---- 77
Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
Y + G+P Q DT + W C+ C + F P
Sbjct: 78 ---SYVVRAGLGSPSQ-QLLLALDTSADATWAHCSPCGTCPSSSL----------FAPAN 123
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSET 219
SSS + C + C G + G L ++ + L S+T
Sbjct: 124 SSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASDT 183
Query: 220 LRFPSKTVPNFLAGCSILSDRQPA------GIAGFGRSSESLPSQLGL---KKFSYCLLS 270
LR +PN+ GC + S P G+ G GR +L SQ G FSYCL S
Sbjct: 184 LRLGKDAIPNYTFGC-VSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPS 242
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
+ S +L L G+G + + YTP +NP SS YYV + + VG
Sbjct: 243 --YRSYYFSGSLRL----GAGGGQPRSVRYTPMLRNPHRSS-----LYYVNVTGLSVGHA 291
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
VK+P + G +VDSG+ T P++ A+ +EF RQ+ S
Sbjct: 292 WVKVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPS---GYTSLGA 348
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALG 449
CF+ + P + + GG +ALP EN L CL + A P
Sbjct: 349 FDTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAM----AEAPQNV 404
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++ + Q QN + FD+AN R GFAK+ C
Sbjct: 405 NSVVNVIANLQQQNIRVVFDVANSRVGFAKESC 437
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 110/391 (28%), Positives = 164/391 (41%), Gaps = 56/391 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G + + ++ GTP S I DTGS L W C C DC P P + P +SS
Sbjct: 113 GEFLMKMAIGTP-SLSFSAILDTGSDLTWTQCKP---CTDCY-----PQPTPIYDPSQSS 163
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ + C + C + +C A YL YG T G+L E+
Sbjct: 164 TYSKVPCSSSMCQAL--------------PMYSCSGANCEYLYSYGDQSSTQGILSYESF 209
Query: 221 RFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLP----SQLGL---KKFSYCLLSRKF 273
S+++P+ GC ++ G P SQLG KFSYCL+S
Sbjct: 210 TLTSQSLPHIAFGCGQENEGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVS--I 267
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
D+P ++ + S ++KT +S TP + S + FYY+ L I VG + +
Sbjct: 268 TDSPSKTSPLFIGKTASLNAKT--VSSTPLVQ-----SRSRPTFYYLSLEGISVGGQLLD 320
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS-GLR 392
I DG GGVI+DSG+T T++E ++ V K I + V+ + GL
Sbjct: 321 IADGTFDLQLDGTGGVIIDSGTTVTYLEQSGYDVVKKAVISSI----NLPQVDGSNIGLD 376
Query: 393 PCFD-ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
CF+ SG + + P + F+ GA LP ENY + + CL + N
Sbjct: 377 LCFEPQSGSSTSHFPTITFHFE-GADFNLPKENYIYTDSSGIACLAMLPSNGMS------ 429
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I G+ Q QN+ + +D + FA C
Sbjct: 430 ---IFGNIQQQNYQILYDNERNVLSFAPTVC 457
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 108 bits (269), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 119/431 (27%), Positives = 179/431 (41%), Gaps = 61/431 (14%)
Query: 65 SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDT 124
+R + +K K ++ + S L S G Y +++ GTP + IFDT
Sbjct: 93 ARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTP-KNDLSLIFDT 151
Query: 125 GSSLVWFPCTSRYR-CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVES 183
GS L W C R C D + P F P +S+S + C + C + +
Sbjct: 152 GSDLTWTQCQPCVRTCYD--------QKEPIFNPSKSTSYYNVSCSSAACGSL--SSATG 201
Query: 184 RCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTV-PNFLAGCSILSDRQ 241
CS N Y +QYG F+ G L E + V GC + Q
Sbjct: 202 NAGSCSASNCI-------YGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCG--ENNQ 252
Query: 242 P-----AGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
AG+ G GR S PSQ K FSYCL S A + +L + +G S
Sbjct: 253 GLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS----SASYTGHLTFGS---AGIS 305
Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIVD 352
++ + +TP G+S FY + + I VG + + IP + + PG+ ++D
Sbjct: 306 RS--VKFTPISTITDGTS-----FYGLNIVAITVGGQKLPIPSTVFSTPGA------LID 352
Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
SG+ T + + A+ F +M Y + V S L CFD+SG K+V +P++ F
Sbjct: 353 SGTVITRLPPKAYAALRSSFKAKMSKYPTTSGV---SILDTCFDLSGFKTVTIPKVAFSF 409
Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAN 472
GGA + L + F + +CL F N+ A I G+ Q Q + +D A
Sbjct: 410 SGGAVVELGSKGIFYVFKISQVCL-AFAGNS-----DDSNAAIFGNVQQQTLEVVYDGAG 463
Query: 473 DRFGFAKQKCA 483
R GFA C+
Sbjct: 464 GRVGFAPNGCS 474
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 113/391 (28%), Positives = 174/391 (44%), Gaps = 59/391 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYR-CVDCNFPNVDPSRIPAFIPKRS 160
G Y++++ GTP + T IFDTGS L W C + C P +DP++ S
Sbjct: 131 GDYAVTVGLGTPKKEFT-LIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTK--------S 181
Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
+S + I C + C + ES CS + TC Y +QYG G ++ G +ET
Sbjct: 182 TSYKNISCSSAFCKLLDTEGGES----CS--SPTCL-----YQVQYGDGSYSIGFFATET 230
Query: 220 LRFPSKTV-PNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRK 272
L S V NFL GC + R AG+ G GR+ SLPSQ K FSYCL
Sbjct: 231 LTLSSSNVFKNFLFGCGQQNSGLFRGAAGLLGLGRTKLSLPSQTAQKYKKLFSYCL---- 286
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
P SS+ G SKT + +TP ++ + FY + + ++ VG +
Sbjct: 287 ----PASSSSKGYLSFGGQVSKT--VKFTPLSEDFKST-----PFYGLDITELSVGGNKL 335
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
I S G ++DSG+ T + + A++ F + M +Y + S
Sbjct: 336 SIDASIF-----STSGTVIDSGTVITRLPSTAYSALSSAFQKLMTDY---PSTDGYSIFD 387
Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFTDNAAGPALGRG 451
C+D S +++ +P++ + FKGG +M + + + G + +CL F N
Sbjct: 388 TCYDFSKNETIKIPKVGVSFKGGVEMDIDVSGILYPVNGLKKVCLA-FAGNGDDV----- 441
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A I G+ Q + + + +D A R GFA C
Sbjct: 442 KAAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 118/430 (27%), Positives = 187/430 (43%), Gaps = 57/430 (13%)
Query: 66 RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVH---SYGGYSISLSFGTPPQASTPFIF 122
R R ++ + + K N S+ +S I+ PL+ Y +++ G + I
Sbjct: 94 RVRSMQNRIRAKVSGHN--SSEQSSEIQIPLASGINLETLNYIVTIGLGNQ---NMTVII 148
Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVE 182
DTGS L W C C+ C + P F P SSS + C + C +
Sbjct: 149 DTGSDLTWVQCDP---CMSCY-----SQQGPVFNPSNSSSYNSLLCNSSTCQNL--QFTT 198
Query: 183 SRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR- 240
+ C N P +C ++ + YG G FT G L E L F +V NF+ GC +
Sbjct: 199 GNTEACESNN---PSSC-NHTVSYGDGSFTDGELGVEHLSFGGISVSNFVFGCGRNNKGL 254
Query: 241 --QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKT 295
+GI G GRS+ S+ SQ FSYCL + D+ S +LV+ + T
Sbjct: 255 FGGVSGIMGLGRSNLSMISQTNTTFGGVFSYCLPTT---DSGASGSLVIGNESSLFKNLT 311
Query: 296 PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGS 355
P ++YT NP FY + L I VG ++ S GNGG+++DSG+
Sbjct: 312 P-IAYTSMVSNP-----QLSNFYVLNLTGIDVGGVAIQ-------DTSFGNGGILIDSGT 358
Query: 356 TFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGG 415
T + L+ A+ EF++Q Y A + S L CF+++G + V +P L + F+
Sbjct: 359 VITRLAPSLYNALKAEFLKQFSGYPIAPAL---SILDTCFNLTGIEEVSIPTLSMHFENN 415
Query: 416 AKMALPPEN--YFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
+ + Y G++V CL A I+G++Q +N + +D
Sbjct: 416 VDLNVDAVGILYMPKDGSQV-CL------ALASLSDENDMAIIGNYQQRNQRVIYDAKQS 468
Query: 474 RFGFAKQKCA 483
+ GFA++ C+
Sbjct: 469 KIGFAREDCS 478
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 125/461 (27%), Positives = 203/461 (44%), Gaps = 68/461 (14%)
Query: 36 LTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTP 95
+ P S++ ++ L+ + ++ + S+ RA +L +++ S N L K
Sbjct: 32 IHPDSSRSPFYNIRETQLQRISNVVTHSIKRAHYL----------NHVFSLSHNDLPKPT 81
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
+ ++ Y +S S GTPP + DTGS +WF C C++ P F
Sbjct: 82 IIPYAGSYYVMSYSIGTPP-FQLYGVVDTGSDGIWFQCKPCKPCLN--------QTSPIF 132
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLL 215
P +SS+ + I C +P C ++RC S R + C +YL + G + G +
Sbjct: 133 NPSKSSTYKNIRCSSPICK----RGEKTRCS--SNRKRKCEYEI-TYLDRSG---SQGDI 182
Query: 216 LSETLRFPSK-----TVPNFLAGC----SILSDRQPAGIAGFGRSSESLPSQLGLK---K 263
+TL S + P + GC S+ ++ +GI GFGR + S+ SQLG K
Sbjct: 183 SKDTLTLNSNDGSPISFPKIVIGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGK 242
Query: 264 FSYCLLSRKFDDAPVSSNLVL-DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
FSYCL S F A +SS L D SG G+ TP + S + Y+ L
Sbjct: 243 FSYCLASL-FSKANISSKLYFGDMAVVSGH----GVVSTPLIQ------SFYVGNYFTNL 291
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
VG +K+ S L+P ++GN ++DSGST T + ++ + I M R
Sbjct: 292 EAFSVGDHIIKLKDSSLIPDNEGNA--VIDSGSTITQLPNDVYSQLETAVI-SMVKLKRV 348
Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
D ++ L C+ + KK +P + F+ GA + L N F + +EV+C F N
Sbjct: 349 KDPTQQLSL--CYKTTLKK-YEVPIITAHFR-GADVKLNAFNTFIQMNHEVMC---FAFN 401
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
++ P ++ G+ QNF + +D + F C
Sbjct: 402 SSA-----FPWVVYGNIAQQNFLVGYDTLKNIISFKPTNCT 437
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 119/431 (27%), Positives = 179/431 (41%), Gaps = 61/431 (14%)
Query: 65 SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDT 124
+R + +K K +++ + S L S G Y +++ GTP + IFDT
Sbjct: 94 ARVNSIHSKLSKKLTTNHVSQSQSTDLPAKDGSTLGSGNYIVTVGLGTP-KNDLSLIFDT 152
Query: 125 GSSLVWFPCTSRYR-CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVES 183
GS L W C R C D + P F P +S+S + C + C + +
Sbjct: 153 GSDLTWTQCQPCVRTCYD--------QKEPIFNPSKSTSYYNVSCSSAACGSL--SSATG 202
Query: 184 RCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTV-PNFLAGCSILSDRQ 241
CS N Y +QYG F+ G L + S V GC + Q
Sbjct: 203 NAGSCSASNCI-------YGIQYGDQSFSVGFLAKDKFTLTSSDVFDGVYFGCG--ENNQ 253
Query: 242 -----PAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
AG+ G GR S PSQ K FSYCL S A + +L + +G S
Sbjct: 254 GLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPS----SASYTGHLTFGS---AGIS 306
Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIVD 352
++ + +TP G+S FY + + I VG + + IP + + PG+ ++D
Sbjct: 307 RS--VKFTPISTITDGTS-----FYGLNIVAITVGGQKLPIPSTVFSTPGA------LID 353
Query: 353 SGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKF 412
SG+ T + + A+ F +M Y + V S L CFD+SG K+V +P++ F
Sbjct: 354 SGTVITRLPPKAYAALRSSFKAKMSKYPTTSGV---SILDTCFDLSGFKTVTIPKVAFSF 410
Query: 413 KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAN 472
GGA + L + F +CL F N+ A I G+ Q Q + +D A
Sbjct: 411 SGGAVVELGSKGIFYAFKISQVCL-AFAGNS-----DDSNAAIFGNVQQQTLEVVYDGAG 464
Query: 473 DRFGFAKQKCA 483
R GFA C+
Sbjct: 465 GRVGFAPNGCS 475
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 110/396 (27%), Positives = 160/396 (40%), Gaps = 70/396 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
+ +++ GTP Q S IFDTGS L W PC S C P + P F P +S
Sbjct: 149 FVVAVGLGTPAQPSA-LIFDTGSDLSWVQCQPCGSSGHC--------HPQQDPLFDPSKS 199
Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSET 219
S+ + C P+C+ G CS N TC YL+ YG G T G+L +T
Sbjct: 200 STYAAVHCGEPQCAAAGGL--------CSEDNTTC-----LYLVHYGDGSSTTGVLSRDT 246
Query: 220 LRFPS-KTVPNFLAGCSILSDRQPAGIAGFGR---------SSESLPSQLGLK---KFSY 266
L S + + F GC + + FGR SLPSQ FSY
Sbjct: 247 LALTSSRALAGFPFGCGTRN------LGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSY 300
Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
CL S + L + P + T YT + P F FY+V L I
Sbjct: 301 CLPSSN----STTGYLTIGATPAT---DTGAAQYTAMLRKP-----QFPSFYFVELVSID 348
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
+G + +P P GG ++DSG+ T++ +E + F M Y+ A
Sbjct: 349 IGGYILPVP-----PAVFTRGGTLLDSGTVLTYLPAQAYELLRDRFRLTMERYTPA---P 400
Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
L C+D +G+ V +P + +F GA L + V CL +A G
Sbjct: 401 PNDVLDACYDFAGESEVIVPAVSFRFGDGAVFELDFFGVMIFLDENVGCLAFAAMDAGGL 460
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
P I+G+ Q ++ + +D+A ++ GF C
Sbjct: 461 -----PLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 491
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 121/404 (29%), Positives = 173/404 (42%), Gaps = 68/404 (16%)
Query: 92 IKTPLSVHSYGG---YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVD 148
I P + Y G Y I++ FGTP + T IFDTGS++ W C CV +P +
Sbjct: 1 ISIPARIGLYIGTANYVITVGFGTPKKNQT-VIFDTGSNVNWIQCKP---CVVSCYPQQE 56
Query: 149 PSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGL 208
P F P SS+ + I C + C+ + +GCS TC Y + YG
Sbjct: 57 P----LFDPTLSSTYRNISCTSAACTGL-------SSRGCS--GSTCV-----YGVTYGD 98
Query: 209 GF-TAGLLLSETLRFPSKTV-PNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGL 261
G T G L +ET + V NF+ GC + Q AG+ G GRS SL SQL
Sbjct: 99 GSSTVGFLATETFTLAAGNVFNNFIFGCG--QNNQGLFTGAAGLIGLGRSPYSLNSQLAT 156
Query: 262 ---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
FSYCL S S+ L+ G +TPG YT N S Y
Sbjct: 157 SLGNIFSYCLPSTS------SATGYLNIG---NPLRTPG--YTAMLTN-----SRAPTLY 200
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
++ L I VG + + + + G I+DSG+ T + + A+ F M
Sbjct: 201 FIDLIGISVGGTRLALSSTVFQ-----SVGTIIDSGTVITRLPPTAYGALRTAFRAAMTQ 255
Query: 379 YSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
Y+RAA S L C+D S +V P + L + G + +P F ++ + +CL
Sbjct: 256 YTRAA---AASILDTCYDFSRTTTVTFPTIKLHYT-GLDVTIPGAGVFYVISSSQVCL-A 310
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
F N+ +G I+G+ Q + + +D A R GFA C
Sbjct: 311 FAGNSDSTQIG-----IIGNVQQRTMEVTYDNALKRIGFAAGAC 349
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 129/457 (28%), Positives = 191/457 (41%), Gaps = 78/457 (17%)
Query: 52 PLKILHSLASSSLSRAR--HLKTKTKPKTKDSNIGSNYSN-------SLIKTPLSVHSYG 102
PL H S +S+ + H +T + + + +NI + S+ L ++ +++ +
Sbjct: 62 PLVHRHGPCSPVMSKEKPSHEETLGRDQLRAANIHAKLSSPRNSSAKELQQSGVTIPTSS 121
Query: 103 GYS-------ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
GYS I++S GTP I DTGS + W +C C + + F
Sbjct: 122 GYSLGTPEYVITVSLGTPAVTQVMSI-DTGSDVSWV------QCAPCAAQSCSSQKDKLF 174
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGL 214
P +S++ C + +C+ + G GC N C Y+++Y T G
Sbjct: 175 DPAKSATYSAFSCSSAQCAQLGGEG-----NGC--LNSHC-----QYIVKYVDHSNTTGT 222
Query: 215 LLSETLRFP-SKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGL---KKFSYC 267
S+TL S V NF GCS ++ Q G+ G G +ESL SQ K FSYC
Sbjct: 223 YGSDTLGLTTSDAVKNFQFGCSHRANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYC 282
Query: 268 LLSRKFDDAPVSSNL--VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
L P SS+ L G +G + + S TP + V + FY V L+ I
Sbjct: 283 L-------PPSSSSAGGFLTLGAAAGGTSSSRYSRTPLVRFNVPT------FYGVFLQAI 329
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
V + +P S +G +VDSG+ T + ++A+ F ++M Y AA V
Sbjct: 330 TVAGTKLNVPASVF------SGASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPV 383
Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
L CFD SG K+V +P + L F GA M L F CL FT A
Sbjct: 384 GI---LDTCFDFSGIKTVRVPVVTLTFSRGAVMDLDVSGIF-----YAGCLA-FTATAQ- 433
Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G ILG+ Q + F + FD+ GF C
Sbjct: 434 ----DGDTGILGNVQQRTFEMLFDVGGSTLGFRPGAC 466
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 138/496 (27%), Positives = 203/496 (40%), Gaps = 95/496 (19%)
Query: 11 LFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHL 70
+ +L L+ T A S ++ L P HS PL + + L ++ L
Sbjct: 5 VLTLFFLVSTMLVDASKSLMGFSIDLIP-------RHSPISPL-YNSQMTQTELVKSAAL 56
Query: 71 KTKTKPKTKDSNIGSNYSNSL--IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSL 128
++ T+ +K N S L I TP+ H G Y + S GTP IFDTGS L
Sbjct: 57 RSITR--SKRVNFIGQISPPLSPIITPIPDH--GEYLMRFSLGTP-SVERLAIFDTGSDL 111
Query: 129 VWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGC 188
W CT C P P F P +SS+ + C++ C+ P + C
Sbjct: 112 SWLQCTPCKTCY--------PQEAPLFDPTQSSTYVDVPCESQPCTLF--PQNQRECGS- 160
Query: 189 SPRNKTCPLACPSYLLQYGL-GFTAGLLLSETLRFPSK-------TVPNFLAGCSILSD- 239
+K C YL QYG FT G L +T+ F S T P + GC+ S+
Sbjct: 161 ---SKQC-----IYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSVFGCAFYSNF 212
Query: 240 -----RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
+ G G G SL SQLG + KFSYC++ P SS TG
Sbjct: 213 TFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMV-------PFSST---STGKLKF 262
Query: 292 DSKTPG--LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGV 349
S P + TPF NP ++ +Y + L I VG K V + G G G +
Sbjct: 263 GSMAPTNEVVSTPFMINP-----SYPSYYVLNLEGITVGQKKV-------LTGQIG-GNI 309
Query: 350 IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD--ISGKKSVYLPE 407
I+DS T +E + + +FI + A +VE F+ + ++ PE
Sbjct: 310 IIDSVPILTHLE----QGIYTDFISSV---KEAINVEVAEDAPTPFEYCVRNPTNLNFPE 362
Query: 408 LILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLE 467
+ F GA + L P+N F + N ++C+ + P+ G I G++ NF +E
Sbjct: 363 FVFHFT-GADVVLGPKNMFIALDNNLVCMTVV------PSKGIS---IFGNWAQVNFQVE 412
Query: 468 FDLANDRFGFAKQKCA 483
+DL + FA C+
Sbjct: 413 YDLGEKKVSFAPTNCS 428
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 90/287 (31%), Positives = 134/287 (46%), Gaps = 42/287 (14%)
Query: 211 TAGLLLSETLRFPS-KTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFS 265
T GLL + F + +VP GC + ++ GIAGFGR SLPSQL + FS
Sbjct: 226 TTGLLEVDKFTFGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFS 285
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
+C + + S ++LD + + TP +N SA YY+ L+ I
Sbjct: 286 HCFTAV---NGLKQSTVLLDLLADLYKNGRGAVQSTPLIQN-----SANPTLYYLSLKGI 337
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM------GNY 379
VGS + +P S ++G GG I+DSG++ T + +++ V EF Q+ GN
Sbjct: 338 TVGSTRLPVPESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGN- 395
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLC 435
+G CF + +P+L+L F+ GA M LP ENY V GN ++C
Sbjct: 396 --------ATGPYTCFSAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSMIC 446
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
L + LG A I G+FQ QN ++ +DL N+ F +C
Sbjct: 447 LAI-------NELGDERATI-GNFQQQNMHVLYDLQNNMLSFVAAQC 485
Score = 61.6 bits (148), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 73/152 (48%), Gaps = 32/152 (21%)
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM------GN 378
I VGS + +P S ++G GG I+DSG++ T + +++ V EF Q+ GN
Sbjct: 42 ITVGSTRLPVPESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGN 100
Query: 379 YSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVL 434
+G CF + +P+L+L F+ GA M LP ENY V GN ++
Sbjct: 101 ---------ATGPYTCFSAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSII 150
Query: 435 CLILFTDNAAGPALGRG-PAIILGDFQLQNFY 465
CL A+ +G I+G+FQ QN +
Sbjct: 151 CL----------AINKGDETTIIGNFQQQNMH 172
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 134/471 (28%), Positives = 195/471 (41%), Gaps = 77/471 (16%)
Query: 27 SSAATVTVPLTPLSTKHYLHHSDSD-PLKILHSLASS---SLSRARHLKTK-TKPKTKDS 81
SS+ TVPL H+ H S P K + SL RA ++K K + KD
Sbjct: 52 SSSGATTVPL------HHRHGPCSPLPTKKMPSLEDRLHRDQLRAAYIKRKFSGDVKKDG 105
Query: 82 NIGSNYSNSLIKTPLSVHSYGG---YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYR 138
S + P ++ + Y I++ G+P + T I D+GS + W C
Sbjct: 106 QGAGGVEQSHVTVPTTLGTSLNTLEYLITVRLGSPAKTQTVLI-DSGSDVSWVQCKP--- 161
Query: 139 CVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA 198
C+ C+ VDP F P SS+ C + C+ + GCS ++
Sbjct: 162 CLQCH-SQVDP----LFDPSLSSTYSPFSCSSAACAQL-----GQDGNGCSSSSQC---- 207
Query: 199 CPSYLLQYGLGF-TAGLLLSETLRFPSKTVPNFLAGCSILS---DRQPAGIAGFGRSSES 254
Y+++Y G T G S+TL S T+ NF GCS + + G+ G G + S
Sbjct: 208 --QYIVRYADGSSTTGTYSSDTLALGSNTISNFQFGCSHVESGFNDLTDGLMGLGGGAPS 265
Query: 255 LPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS 311
L SQ FSYCL P SS + T G + F K P+ S
Sbjct: 266 LASQTAGTFGTAFSYCL-----PPTPSSSGFL-----------TLGAGTSGFVKTPMLRS 309
Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
S FY V L I VG + IP S + G+++DSG+ T + + A++
Sbjct: 310 SPVPTFYGVRLEAIRVGGTQLSIPTSVF------SAGMVMDSGTIITRLPRTAYSALSSA 363
Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
F M Y A +S + CFD SG+ SV LP + L F GGA + L ++GN
Sbjct: 364 FKAGMKQYRPA---PPRSIMDTCFDFSGQSSVRLPSVALVFSGGAVVNLDANGI--ILGN 418
Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
CL F N+ + G I+G+ Q + F + +D+ GF C
Sbjct: 419 ---CLA-FAANSDDSSPG-----IVGNVQQRTFEVLYDVGGGAVGFKAGAC 460
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 109/368 (29%), Positives = 162/368 (44%), Gaps = 54/368 (14%)
Query: 123 DTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
DTGS + W PC S C + DP F PK SSS + C + +C +
Sbjct: 166 DTGSDVTWLQCQPCASENTC----YKQFDP----IFDPKSSSSYSPLSCNSQQCKLLDKA 217
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
N S TC Y + YG G FT G L +ETL F S ++PN GC
Sbjct: 218 NCNS---------DTCI-----YQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHD 263
Query: 238 SD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
++ AG+ G G + SL SQL FSYCL++ D+ SS L ++ S DS
Sbjct: 264 NEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL---DSDSSSTLEFNSNMPS-DSL 319
Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
T +P KN F + YV + I VG K + I + G GG+IVDSG
Sbjct: 320 T-----SPLVKN-----DRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSG 369
Query: 355 STFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
+ + + ++E++ + F++ + S A + S C++ SG+ +V +P +
Sbjct: 370 TIISRLPSDVYESLREAFVKLTSSLSPAPGI---SVFDTCYNFSGQSNVEVPTIAFVLSE 426
Query: 415 GAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
G + LP NY ++ + F + + I+G FQ Q + +DL N
Sbjct: 427 GTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLS-------IIGSFQQQGIRVSYDLTNSL 479
Query: 475 FGFAKQKC 482
GF+ KC
Sbjct: 480 VGFSTNKC 487
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 111/387 (28%), Positives = 158/387 (40%), Gaps = 55/387 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + + GTPPQ ++ I D LVW C RC + P F P S++
Sbjct: 51 YVANFTIGTPPQPASAVI-DLAGELVWTQCKQCSRCFE--------QDTPLFDPTASNTY 101
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ C P C I P+ C G C +Y G T G + ++T F
Sbjct: 102 RAEPCGTPLCESI--PSDSRNCSG-----NVC-----AYQASTNAGDTGGKVGTDT--FA 147
Query: 224 SKTVPNFLA-GCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
T LA GC + SD P+GI G GR+ SL +Q G+ FSYCL DA
Sbjct: 148 VGTAKASLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPH---DAGR 204
Query: 279 SSNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
+S L L + G G + + TPF N G+ + +Y V L + G + +P
Sbjct: 205 NSALFLGSSAKLAGGGKAAS-----TPFV-NISGNGNDLSNYYKVQLEGLKAGDAMIPLP 258
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
S GS V++D+ S +F+ ++AV K +G A VE CF
Sbjct: 259 PS----GST----VLLDTFSPISFLVDGAYQAVKKAVTAAVGAPPMATPVEP---FDLCF 307
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
SG P+L+ F+GGA M +P NY N +CL + + +
Sbjct: 308 PKSGASGAA-PDLVFTFRGGAAMTVPATNYLLDYKNGTVCLAMLSSARLNSTTELS---L 363
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
LG Q +N + FDL + F C
Sbjct: 364 LGSLQQENIHFLFDLDKETLSFEPADC 390
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 111/391 (28%), Positives = 164/391 (41%), Gaps = 59/391 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +++ GTP T +FDTGS W C CV + + F P RSS
Sbjct: 180 GNYVVTIGLGTPASRYT-VVFDTGSDTTWVQCQP---CVVVCYKQQEK----LFDPARSS 231
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ + C P CS ++ +GCS + Y +QYG G ++ G +TL
Sbjct: 232 TYANVSCAAPACSDLY-------TRGCSGGHCL-------YSVQYGDGSYSIGFFAMDTL 277
Query: 221 RFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKF 273
S V F GC ++ + AG+ G GR SLP Q K F++CL +R
Sbjct: 278 TLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARS- 336
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
S LD GPGS + TP + + FYYVG+ I VG + +
Sbjct: 337 -----SGTGYLDFGPGS-PAAVGARQTTPMLTDNGPT------FYYVGMTGIRVGGQLLS 384
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGL 391
IP S G IVDSG+ T + + ++ F M Y +A + S L
Sbjct: 385 IPQSVF-----STAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAL---SLL 436
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C+D +G V +P++ L F+GGA + + +CL F N +G
Sbjct: 437 DTCYDFTGMSEVAIPKVSLLFQGGAYLDVNASGIMYAASLSQVCL-GFAANEDDDDVG-- 493
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ QL+ F + +D+ GF+ C
Sbjct: 494 ---IVGNTQLKTFGVVYDIGKKTVGFSPGAC 521
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 165/391 (42%), Gaps = 61/391 (15%)
Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y + + GTP Q P + DT S + W PC+ CV C PS AF P +S+
Sbjct: 99 YIVKVLIGTPAQ---PLLLAMDTSSDVAWIPCSG---CVGC------PSNT-AFSPAKST 145
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
S + + C P+C + P +R AC S+ L YG A L +T+R
Sbjct: 146 SFKNVSCSAPQCKQVPNPACGAR-------------AC-SFNLTYGSSSIAANLSQDTIR 191
Query: 222 FPSKTVPNFLAGC--------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKF 273
+ + F GC +I + G+ S S + FSYCL S F
Sbjct: 192 LAADPIKAFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPS--F 249
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
S +L L GP S + + YT +NP SS YYV L I VG K V
Sbjct: 250 RSLTFSGSLRL--GPTSQPQR---VKYTQLLRNPRRSS-----LYYVNLVAIRVGRKVVD 299
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
+P + + G I DSG+ +T + P++EAV EF +++ + A V G
Sbjct: 300 LPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPPT--AVVTSLGGFDT 357
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGP 452
C+ SG+ V +P + FK G M +P +N CL + A+ P
Sbjct: 358 CY--SGQ--VKVPTITFMFK-GVNMTMPADNLMLHSTAGSTSCLAM----ASAPENVNSV 408
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
++ Q QN + D+ N R G A+++C+
Sbjct: 409 VNVIASMQQQNHRVLIDVPNGRLGLARERCS 439
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 122/467 (26%), Positives = 193/467 (41%), Gaps = 68/467 (14%)
Query: 42 KHYLHHSDSDPL---KILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSV 98
+H L DP+ + L L ++ SRA + + ++ S + + + + +
Sbjct: 80 RHSLTAIPEDPVARDRYLRRLLAADESRANSFQPRRNKDRASASTQSASAEVPLTSGIRL 139
Query: 99 HSYGGYSISLSFGTP---PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
+ Y ++S G P A+ I DTGS L W C C C R P F
Sbjct: 140 QTLN-YVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKP---CSACY-----AQRDPLF 190
Query: 156 IPKRSSSSQLIGCQNPKC--SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTA 212
P S++ + C C S C ++ C Y L YG G F+
Sbjct: 191 DPAGSATYAAVRCNASACADSLRAATGTPGSCGSTGAGSEKC-----YYALAYGDGSFSR 245
Query: 213 GLLLSETLRFPSKTVPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLK---KFS 265
G+L ++T+ ++ F+ GC LS+R AG+ G GR+ SL SQ + FS
Sbjct: 246 GVLATDTVALGGASLGGFVFGCG-LSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFS 304
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
YCL + DA S +L S T ++YT +P A FY++ +
Sbjct: 305 YCLPAATSGDASGSLSLGGGDDAASSYRNTTPVAYTRMIADP-----AQPPFYFLNVTGA 359
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
VG + L G V++DSG+ T + ++ AV EF+RQ G AA
Sbjct: 360 AVGG-------TALAAQGLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQFG----AAGY 408
Query: 386 EKKSG---LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV---GNEVLCLIL- 438
G L C+D++G V +P L L+ +GGA + + +V G++V CL +
Sbjct: 409 PAAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQV-CLAMA 467
Query: 439 ---FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ D I+G++Q +N + +D R GFA + C
Sbjct: 468 SLSYEDETP----------IIGNYQQKNKRVVYDTLGSRLGFADEDC 504
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 114/393 (29%), Positives = 166/393 (42%), Gaps = 60/393 (15%)
Query: 104 YSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y + L+ GTPP PFI DTGS L W C C + P D + +F P
Sbjct: 83 YLMELAIGTPP---VPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSP---- 135
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
+ C + C I+ SRC + PS +Y + G E
Sbjct: 136 ----LPCSSATCLPIW----SSRC------------STPSATCRYRYAYDDGAYSPECAG 175
Query: 222 FPSKTVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS--RKFDDA 276
+V GC + + G G GR S SL +QLG+ KFSYCL +
Sbjct: 176 I---SVGGIAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLSS 232
Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
PV + + S + + TP ++P S YYV L I +G + IP
Sbjct: 233 PVFFGSLAELAASSASADAAVVQSTPLVQSPYNPSR-----YYVSLEGISLGDARLPIPN 287
Query: 337 -SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
++ + DG+GG+IVDSG+ FT + F V +G V S RPCF
Sbjct: 288 GTFDLNDDDGSGGMIVDSGTIFTILVETGFRVVVDHVAGVLGQ----PVVNASSLDRPCF 343
Query: 396 --DISGKKSV-YLPELILKFKGGAKMALPPENYFALVGNE-VLCL-ILFTDNAAGPALGR 450
+G + + +P+++L F GGA M L +NY + E CL I+ T++A+G
Sbjct: 344 PAPAAGVQELPDMPDMVLHFAGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGS---- 399
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+LG+FQ QN + FD+ + F C+
Sbjct: 400 ----VLGNFQQQNIQMLFDITVGQLSFMPTDCS 428
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 124/407 (30%), Positives = 180/407 (44%), Gaps = 69/407 (16%)
Query: 89 NSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVW---FPCTSRYRCVDCNFP 145
+ L +TP++ + G Y I +S+G PPQ ST I DTGS L W PC S Y + F
Sbjct: 76 DQLFETPVASGN-GEYLIDISYGNPPQKSTA-IVDTGSDLNWVQCLPCKSCYETLSAKF- 132
Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
DPS+ S+S + +GC + C + ++C +C Y
Sbjct: 133 --DPSK--------SASYKTLGCGSNFCQDL--------------PFQSCAASC-QYDYM 167
Query: 206 YGLGF-TAGLLLSETLRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQLG- 260
YG G T+G L ++ + + +PN GC ++ + G+ G G+ SL SQLG
Sbjct: 168 YGDGSSTSGALSTDDVTIGTGKIPNVAFGCGNSNLGTFAGAGGLVGLGKGPLSLVSQLGG 227
Query: 261 --LKKFSYCLLSRKFDDAPVSSNLVLDTGP-GSGDSKTPG-LSYTPFYKNPVGSSSAFGE 316
KKFSYCL+ P+ S T P GDS G ++YTP N + +
Sbjct: 228 TATKKFSYCLV-------PLGST---KTSPLYIGDSTLAGGVAYTPMLTN-----NNYPT 272
Query: 317 FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
FYY L+ I V K V P + + G GG+I+DSG+T T+++ F + ++
Sbjct: 273 FYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLTYLDVDAFNPMVAA-LKAA 331
Query: 377 GNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLC 435
Y A GL CF +G + P ++ F GA +AL P+N F AL C
Sbjct: 332 LPYPEADG--SFYGLEYCFSTAGVANPTYPTVVFHFN-GADVALAPDNTFIALDFEGTTC 388
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
L + + I G+ Q N + DL N R GF C
Sbjct: 389 LAMASSTGFS---------IFGNIQQLNHVIVHDLVNKRIGFKSANC 426
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 118/408 (28%), Positives = 168/408 (41%), Gaps = 71/408 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y L GTPP+ + DTGS ++W C S C C + ++ F P S
Sbjct: 79 GLYYTKLRLGTPPRDFYVQV-DTGSDVLWVSCAS---CNGCPQTSGLQIQLNFFDPGSSV 134
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
++ I C + +CSW ++S GCS +N C +Y QYG G T+G +S+ L
Sbjct: 135 TASPISCSDQRCSW----GIQSSDSGCSVQNNLC-----AYTFQYGDGSGTSGFYVSDVL 185
Query: 221 RFP----SKTVPNFLA----GCS-------ILSDRQPAGIAGFGRSSESLPSQLGL---- 261
+F S VPN A GCS + SDR GI GFG+ S+ SQL
Sbjct: 186 QFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIA 245
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
+ FS+CL LVL G+ P + +TP + Y V
Sbjct: 246 PRVFSHCLKGENGGGGI----LVL------GEIVEPNMVFTPLVPSQ--------PHYNV 287
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNG-GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
L I V + + I S S NG G I+D+G+T ++ EA F+ + N
Sbjct: 288 NLLSISVNGQALPINPSVF---STSNGQGTIIDTGTTLAYLS----EAAYVPFVEAITNA 340
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA----LVGNEVLC 435
+ S C+ I+ P + L F GGA M L P++Y + G V C
Sbjct: 341 VSQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWC 400
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ G ILGD L++ +DL R G+A C+
Sbjct: 401 IGFQRIQNQG-------ITILGDLVLKDKIFVYDLVGQRIGWANYDCS 441
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 130/446 (29%), Positives = 190/446 (42%), Gaps = 61/446 (13%)
Query: 49 DSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISL 108
D + ++ LHS R + ++ + T D G + ++ +K+ LS+ S G Y + +
Sbjct: 60 DEERVRFLHS-------RLTNKESASNSATTDKLGGPSLVSTPLKSGLSIGS-GNYYVKI 111
Query: 109 SFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGC 168
GTP + + I DTGSSL W C CV VDP F P S + + + C
Sbjct: 112 GVGTPAKYFS-MIVDTGSSLSWLQCQP---CVIYCHVQVDP----IFTPSVSKTYKALSC 163
Query: 169 QNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTV 227
+ +CS + + + GCS C Y YG F+ G L + L
Sbjct: 164 SSSQCSSLKSSTLNA--PGCSNATGAC-----VYKASYGDTSFSIGYLSQDVLTLTPSAA 216
Query: 228 PN--FLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAP 277
P+ F+ GC D Q AGI G S+ QL K FSYCL S F P
Sbjct: 217 PSSGFVYGCG--QDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPS-SFSAQP 273
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI-PY 336
SS + S S +P +TP KNP Y++GL I V K + +
Sbjct: 274 NSSVSGFLSIGASSLSSSP-YKFTPLVKNP-----KIPSLYFLGLTTITVAGKPLGVSAS 327
Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
SY VP I+DSG+ T + ++ A+ K F+ M + A S L CF
Sbjct: 328 SYNVP-------TIIDSGTVITRLPVAIYNALKKSFVMIMSK--KYAQAPGFSILDTCFK 378
Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIIL 456
S K+ +PE+ + F+GGA + L N + CL + A P I+
Sbjct: 379 GSVKEMSTVPEIRIIFRGGAGLELKVHNSLVEIEKGTTCLAI--------AASSNPISII 430
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
G++Q Q F + +D+AN + GFA C
Sbjct: 431 GNYQQQTFTVAYDVANSKIGFAPGGC 456
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 122/402 (30%), Positives = 169/402 (42%), Gaps = 72/402 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIF----DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
G Y ++ GTP + + F D GS + W C +RC P P +
Sbjct: 123 GEYIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYH------QPG--PVYNR 174
Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT-AGLLL 216
+SSS+ +GC P C + GC C Y ++YG G + AG
Sbjct: 175 LKSSSASDVGCYAPACRAL------GSSGGCVQFLNEC-----QYKVEYGDGSSSAGDFG 223
Query: 217 SETLRFPSKT-VPNFLAGCSILSDRQ------PAGIAGFGRSSESLPSQLGLK---KFSY 266
ETL FP VP GC SD Q AGI G GR S S PSQ+ + FSY
Sbjct: 224 VETLTFPPGVRVPGVAIGCG--SDNQGLFPAPAAGILGLGRGSLSFPSQIAGRYGRSFSY 281
Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
CL + SS L +G + + T S+TP N S FYYVGL I
Sbjct: 282 CLAGQG--TGGRSSTLTFGSGASATTTTTTPPSFTPMLTN-----SRMYTFYYVGLVGIS 334
Query: 327 VGSKHVK-IPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
VG V+ + S L + S G+GGVIVDSG+ T + GP + A F R A
Sbjct: 335 VGGVRVRGVTESDLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAF--------RVAA 386
Query: 385 VEKKSGLRP---------CF-DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-- 432
V++ P C+ + G+ +P + + F GG ++ LPP+NY V +
Sbjct: 387 VKELGWPSPGGPFAFFDTCYSSVRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKG 446
Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
+C G + I+G+ QLQ F + +D+ R
Sbjct: 447 TMCFAFAGSGDRGVS-------IIGNIQLQGFRVVYDVDGQR 481
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 128/431 (29%), Positives = 179/431 (41%), Gaps = 71/431 (16%)
Query: 66 RARHLKTKTKPKTKDSNIGSNYSNSLI--KTPLSVHSYGGYSISLSFGTPPQASTPFIFD 123
RA++++ K + G S ++ T S Y I++S GTP I D
Sbjct: 85 RAKYIQAKLSVNSGSGTDGVQQSAAITLPTTLGSALDTLAYVITVSIGTPAMTQAVMI-D 143
Query: 124 TGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVES 183
TGS + W C +R F F P +SS+ C + C+ +E
Sbjct: 144 TGSDVSWVHCHARAGAGSSLF----------FDPGKSSTYTPFSCSSAACT-----RLEG 188
Query: 184 RCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRFPS-KTVPNFLAGCSILSD-- 239
R GCS N TC Y ++YG G T G S+TL S + V NF GCS SD
Sbjct: 189 RDNGCS-LNSTC-----QYTVRYGDGSNTTGTYGSDTLALNSTEKVENFQFGCSETSDPG 242
Query: 240 -----RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
Q G+ G G + SL SQ FSYCL A S+ L G +G
Sbjct: 243 EGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCL------PATTRSSGFLTLGASTG 296
Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
T G TP +++ + FY+V L+ I VG V I + GS I+
Sbjct: 297 ---TSGFVTTPMFRSRRAPT-----FYFVILQGINVGGDPVAISPTVFAAGS------IM 342
Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
DSG+ T + + A++ F M Y RA S L CFD +G+ +V +P + L
Sbjct: 343 DSGTIITRLPPRAYSALSAAFRAGMRRYPRA---RAFSILDTCFDFTGQDNVSIPAVELV 399
Query: 412 FKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
F GGA + L + + G+ CL A PA G G I+G+ Q + F + D+
Sbjct: 400 FSGGAVVDLDADGI--MYGS---CL------AFAPATG-GIGSIIGNVQQRTFEVLHDVG 447
Query: 472 NDRFGFAKQKC 482
GF C
Sbjct: 448 QSVLGFRPGAC 458
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 126/437 (28%), Positives = 175/437 (40%), Gaps = 74/437 (16%)
Query: 47 HSDSDPLKILHSLASSSL---SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG 103
+ D + +K ++S S +L S L + T P S IGS G
Sbjct: 102 NQDKERVKYINSRLSKNLGQDSSVEELDSATLPAKSGSLIGS----------------GN 145
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + + GTP + IFDTGS L W C C + D F P +S+S
Sbjct: 146 YFVVVGLGTPKR-DLSLIFDTGSDLTWTQCEP---CARSCYKQQDV----IFDPSKSTSY 197
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
I C + C+ + GCS K C Y +QYG F+ G E L
Sbjct: 198 SNITCTSALCTQL--STATGNDPGCSASTKACI-----YGIQYGDSSFSVGYFSRERLTV 250
Query: 223 -PSKTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKF 273
+ V NFL GC + Q AG+ G GR S Q K FSYCL S
Sbjct: 251 TATDVVDNFLFGCG--QNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFSYCLPSTS- 307
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
SS L GP + L YTPF GSS FY + + I VG VK
Sbjct: 308 -----SSTGHLSFGPAATGRY---LKYTPFSTISRGSS-----FYGLDITAIAVGG--VK 352
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
+P S + GG I+DSG+ T + + A+ F + M Y A ++ S L
Sbjct: 353 LPVS---SSTFSTGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGEL---SILDT 406
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
C+D+SG K +P + F GG + LPP+ + + +CL F N +
Sbjct: 407 CYDLSGYKVFSIPTIEFSFAGGVTVKLPPQGILFVASTKQVCLA-FAANGDDSDV----- 460
Query: 454 IILGDFQLQNFYLEFDL 470
I G+ Q + + +D+
Sbjct: 461 TIYGNVQQRTIEVVYDV 477
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 106/380 (27%), Positives = 156/380 (41%), Gaps = 40/380 (10%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +S S GTPPQ T + D S VW C++ C C + P F SS
Sbjct: 95 GMYVLSFSVGTPPQVVTG-VLDITSDFVWMQCSA---CATCGADAPAATSAPPFYAFLSS 150
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF---TAGLLLSE 218
+ + + C N C + P CS + C Y YG G TAGLL +
Sbjct: 151 TIREVRCANRGCQRLV-PQT------CSADDSPC-----GYSYVYGGGAANTTAGLLAVD 198
Query: 219 TLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
F + + GC++ ++ G+ G GR S SQL + +FSY L DDA
Sbjct: 199 AFAFATVRADGVIFGCAVATEGDIGGVIGLGRGELSPVSQLQIGRFSYYLAP---DDAVD 255
Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSY 338
+ +L D P S P+ +S A YYV L I V + + IP
Sbjct: 256 VGSFILFL-----DDAKPRTSRA--VSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGT 308
Query: 339 LVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS 398
+DG+GGV++ TF++ ++ V + ++ RAAD + GL C+
Sbjct: 309 FDLQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKI--ELRAAD-GSELGLDLCYTSE 365
Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGPAIILG 457
+ +P + L F GGA M L NYF + L CL + A G +LG
Sbjct: 366 SLATAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPA-------GDGSLLG 418
Query: 458 DFQLQNFYLEFDLANDRFGF 477
++ +D++ R F
Sbjct: 419 SLIQVGTHMIYDISGSRLVF 438
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 121/408 (29%), Positives = 171/408 (41%), Gaps = 57/408 (13%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPC-TSRYRCVDCNFPNVDPSRIPA 154
L H ++SL+ GTPPQ T + DTGS L W C T R + +
Sbjct: 53 LRFHHNVSLTVSLAVGTPPQNVT-MVLDTGSELSWLLCATGRAAAAAAD----------S 101
Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-G 213
F P+ S++ + C + +CS P S C ++ C ++ L Y G + G
Sbjct: 102 FRPRASATFAAVPCGSARCSSRDLPAPPS----CDAASRRCRVS-----LSYADGSASDG 152
Query: 214 LLLSETLRFPSKTVPNFLAGC-SILSDRQP-----AGIAGFGRSSESLPSQLGLKKFSYC 267
L ++ GC S D P AG+ G R + S +Q ++FSYC
Sbjct: 153 ALATDVFAVGDAPPLRSAFGCMSAAYDSSPDAVATAGLLGMNRGALSFVTQASTRRFSYC 212
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY-VGLRQII 326
+ R DDA V L+L G D L+YTP Y+ P F Y V L I
Sbjct: 213 ISDR--DDAGV---LLL----GHSDLPFLPLNYTPLYQ-PTPPLPYFDRVAYSVQLLGIR 262
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
VG K + IP S L P G G +VDSG+ FTF+ G + AV EF++Q A +
Sbjct: 263 VGGKPLPIPPSVLAPDHTGAGQTMVDSGTQFTFLLGDAYSAVKAEFLKQTKPLLPALEDP 322
Query: 387 K---KSGLRPCFDISGKK---SVYLPELILKFKGGAKMALPPENYFALVGNE------VL 434
+ CF + + S LP + L F GA+M++ + V E V
Sbjct: 323 SFAFQEAFDTCFRVPKGRPPPSARLPPVTLLFN-GAQMSVAGDRLLYKVPGERRGADGVW 381
Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
CL F + P A ++G N ++E+DL R G A KC
Sbjct: 382 CLT-FGNADMVPLT----AYVIGHHHQMNLWVEYDLERGRVGLAPVKC 424
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 105/390 (26%), Positives = 163/390 (41%), Gaps = 56/390 (14%)
Query: 104 YSISLSFGTPPQASTPFI--FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y SL GTP +T + DTGS W C C DC F P +SS
Sbjct: 134 YFTSLRLGTP---ATDLLVELDTGSDQSWIQCKP---CPDCY-----EQHEALFDPSKSS 182
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGL-GFTAGLLLSETL 220
+ I C + +C + G + + C +K CP Y + Y +T G L +TL
Sbjct: 183 TYSDITCSSRECQEL-GSSHKHNCS----SDKKCP-----YEITYADDSYTVGNLARDTL 232
Query: 221 RF-PSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKF 273
P+ VP F+ GC + S + G+ G GR SL SQ+ + FSYCL S
Sbjct: 233 TLSPTDAVPGFVFGCGHNNAGSFGEIDGLLGLGRGKASLSSQVAARYGAGFSYCLPS--- 289
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
+P ++ + +G + + ++P FYY+ L I V + +K
Sbjct: 290 --SPSATGYLSFSGAAAAAPTNAQFTEMVAGQHP--------SFYYLNLTGITVAGRAIK 339
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
+P S + G I+DSG+ F+ + + A+ MG Y RA +
Sbjct: 340 VPPSVFATAA----GTIIDSGTAFSCLPPSAYAALRSSVRSAMGRYKRA---PSSTIFDT 392
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
C+D++G ++V +P + L F GA + L P N + F N +LG
Sbjct: 393 CYDLTGHETVRIPSVALVFADGATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLG---- 448
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+LG+ Q + + +D+ N + GF CA
Sbjct: 449 -VLGNTQQRTLAVIYDVDNQKVGFGANGCA 477
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 118/408 (28%), Positives = 168/408 (41%), Gaps = 71/408 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y L GTPP+ + DTGS ++W C S C C + ++ F P S
Sbjct: 79 GLYYTKLRLGTPPRDFYVQV-DTGSDVLWVSCAS---CNGCPQTSGLQIQLNFFDPGSSV 134
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
++ I C + +CSW ++S GCS +N C +Y QYG G T+G +S+ L
Sbjct: 135 TASPISCSDQRCSW----GIQSSDSGCSVQNNLC-----AYTFQYGDGSGTSGFYVSDVL 185
Query: 221 RFP----SKTVPNFLA----GCS-------ILSDRQPAGIAGFGRSSESLPSQLGL---- 261
+F S VPN A GCS + SDR GI GFG+ S+ SQL
Sbjct: 186 QFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIA 245
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
+ FS+CL LVL G+ P + +TP + Y V
Sbjct: 246 PRVFSHCLKGENGGGGI----LVL------GEIVEPNMVFTPLVPSQ--------PHYNV 287
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNG-GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
L I V + + I S S NG G I+D+G+T ++ EA F+ + N
Sbjct: 288 NLLSISVNGQALPINPSVF---STSNGQGTIIDTGTTLAYLS----EAAYVPFVEAITNA 340
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA----LVGNEVLC 435
+ S C+ I+ P + L F GGA M L P++Y + G V C
Sbjct: 341 VSQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWC 400
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ G ILGD L++ +DL R G+A C+
Sbjct: 401 IGFQRIQNQG-------ITILGDLVLKDKIFVYDLVGQRIGWANYDCS 441
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 111/391 (28%), Positives = 158/391 (40%), Gaps = 49/391 (12%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y S+ GTPP + + DTGS +VW C CV C P + P+ SS
Sbjct: 97 GEYFASVGVGTPPTPAL-LVIDTGSDVVWLQCKP---CVHCYR-----QLSPLYDPRGSS 147
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ C P+C + C C Y + YG T+G L ++ L
Sbjct: 148 TYAQTPCSPPQCR---------NPQTCDGTTGGC-----GYRIVYGDASSTSGNLATDRL 193
Query: 221 RFPSKT-VPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKF 273
F + T V N GC ++ AG+ G R + S +Q+ + F+YCL R
Sbjct: 194 VFSNDTSVGNVTLGCGHDNEGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTR 253
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
+ S + T P + P +TP NP S YYV + VG + V
Sbjct: 254 SGSSSSYLVFGRTAP-----EPPSSVFTPLRSNPRRPS-----LYYVDMVGFSVGGEPVT 303
Query: 334 --IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
S + + G GGV+VDSG++ T + A+ F + S
Sbjct: 304 GFSNASLSLDPATGRGGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVF 363
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C+D+ G P ++L F GGA +ALPPENY LV E F AA G
Sbjct: 364 DACYDLRGVAVADAPGVVLHFAGGADVALPPENY--LVPEESGRYHCFALEAA----GHD 417
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++G+ Q F + FD+ N+R GF C
Sbjct: 418 GLSVIGNVLQQRFRVVFDVENERVGFEPNGC 448
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 119/407 (29%), Positives = 163/407 (40%), Gaps = 76/407 (18%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y I L+ GTPPQ + + DTGS L+W C C C DP F P SSS
Sbjct: 103 YLIDLAIGTPPQPVSALL-DTGSDLIWTQCAP---CASC-LAQPDP----LFAPAASSSY 153
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRF 222
+ C C+ I + C+ R TC +Y YG G T G+ +E F
Sbjct: 154 VPMRCSGQLCNDI----LHHSCQ----RPDTC-----TYRYNYGDGTTTLGVYATERFTF 200
Query: 223 PSKTVPNFLA----GCSIL---SDRQPAGIAGFGRSSESLPSQLGLKKFSYCLL----SR 271
S + GC + S +GI GFGR SL SQL +++FSYCL +R
Sbjct: 201 ASSSGEKLSVPLGFGCGTMNVGSLNNGSGIVGFGRDPLSLVSQLSIRRFSYCLTPYTSTR 260
Query: 272 KFDDAPVSSNLV---LDTGPGSGDSKTPGLSYTPFY----KNPVGSSSAFGEFYYVGLRQ 324
K S L+ L G GD G T +NP FYYV
Sbjct: 261 K-------STLMFGSLSDGVFEGDDAATGQVQTTRLLQSRQNPT--------FYYVPFTG 305
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
+ VG++ ++IP S DG+GGVIVDSG+ T + V + F Q+ +
Sbjct: 306 VTVGTRRLRIPLSAFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRAQL-RLPFTSS 364
Query: 385 VEKKSGLRPCFDI---------SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLC 435
G+ CF S V +P + F+ GA + LP NY
Sbjct: 365 SSPDDGV--CFATPMAAGGRRASAATVVSVPRMAFHFQ-GADLELPRRNYVLDDPRRGSL 421
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
IL D+ A +G+F Q+ + +DL + FA +C
Sbjct: 422 CILLADSGDSGA-------TIGNFVQQDMRVLYDLEAETLSFAPAQC 461
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 86/276 (31%), Positives = 131/276 (47%), Gaps = 33/276 (11%)
Query: 211 TAGLLLSETLRFPS-KTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLKKFS 265
T GL+ + F + +VP GC + ++ GIAGFGR SLPSQL + FS
Sbjct: 74 TTGLIEVDKFTFGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFS 133
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
+C + + S ++LD + + TP +N SA FYY+ L+ I
Sbjct: 134 HCFTAV---NGLKQSTVLLDLPADLYKNGRGAVQSTPLIQN-----SANPTFYYLSLKGI 185
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
VGS + +P S ++G GG I+DSG++ T + +++ V EF Q+
Sbjct: 186 TVGSTRLPVPESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI---KLPVVP 241
Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCLILFTD 441
+G CF + +P+L+L F+ GA M LP ENY V GN ++CL
Sbjct: 242 GNATGPYTCFSAPSQAKPDVPKLVLHFE-GATMDLPRENYVFEVPDDAGNSIICL----- 295
Query: 442 NAAGPALGRG-PAIILGDFQLQNFYLEFDLANDRFG 476
A+ +G I+G+FQ QN ++ +DL N G
Sbjct: 296 -----AINKGDETTIIGNFQQQNMHVLYDLQNMHRG 326
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 107/368 (29%), Positives = 161/368 (43%), Gaps = 54/368 (14%)
Query: 123 DTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
DTGS + W PC S C + DP F PK SSS + C + +C +
Sbjct: 166 DTGSDVTWLQCQPCASENTC----YKQFDP----IFDPKSSSSYSPLSCNSQQCKLLDKA 217
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
N S TC Y + YG G FT G L +ETL F S ++PN GC
Sbjct: 218 NCNS---------DTCI-----YQVHYGDGSFTTGELATETLSFGNSNSIPNLPIGCGHD 263
Query: 238 SD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
++ AG+ G G + SL SQL FSYCL++ D+ SS L + S
Sbjct: 264 NEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNL---DSDSSSTLEFN-------SY 313
Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
P S T +P+ + F + YV + I VG K + I + G GG+IVDSG
Sbjct: 314 MPSDSLT----SPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSG 369
Query: 355 STFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
+ + + ++E++ + F++ + S A + S C++ SG+ +V +P +
Sbjct: 370 TIISRLPSDVYESLREAFVKLTSSLSPAPGI---SVFDTCYNFSGQSNVEVPTIAFVLSE 426
Query: 415 GAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
G + LP NY ++ + F + + I+G FQ Q + +DL N
Sbjct: 427 GTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLS-------IIGSFQQQGIRVSYDLTNSI 479
Query: 475 FGFAKQKC 482
GF+ KC
Sbjct: 480 VGFSTNKC 487
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 130/473 (27%), Positives = 190/473 (40%), Gaps = 79/473 (16%)
Query: 35 PLTPLSTKH--------YLHHSDSDPLKILHSLASSSLSR-----ARHLKTKTKPKTKDS 81
P +PL+ H L S I H +++++ R +RH + + +
Sbjct: 97 PCSPLAAAHGEPPSHGEILAADQSRAESIQHRVSTTTTDRVNPKRSRHRQQQPPSAPAPA 156
Query: 82 NIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD 141
S+ + SL +P G Y +++ GTP T +FDTGS W C CV
Sbjct: 157 ASLSSSTASLPASPGRALGTGNYVVTVGLGTPASRYT-VVFDTGSDTTWVQCQP---CVV 212
Query: 142 CNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS 201
+ R F P SS+ + C P CS + GCS +
Sbjct: 213 ACYEQ----REKLFDPASSSTYANVSCAAPACSDL-------DVSGCSGGHCL------- 254
Query: 202 YLLQYGLG-FTAGLLLSETLRFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLP 256
Y +QYG G ++ G +TL S V F GC +D + AG+ G GR SLP
Sbjct: 255 YGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLP 314
Query: 257 SQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY--KNPVGSS 311
Q K F++CL +R + LD G GS P + TP P
Sbjct: 315 VQTYGKYGGVFAHCLPARS------TGTGYLDFGAGS----PPATTTTPMLTGNGPT--- 361
Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
FYYVG+ I VG + + I S G IVDSG+ T + + ++
Sbjct: 362 -----FYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVITRLPPAAYSSLRSA 411
Query: 372 FIRQMG--NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
F M Y +AA V S L C+D +G V +P + L F+GGA + + V
Sbjct: 412 FAAAMAARGYRKAAAV---SLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV 468
Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+CL F N G +G I+G+ QL+ F + +D+ GF+ C
Sbjct: 469 SASQVCLA-FAGNEDGGDVG-----IVGNTQLKTFGVAYDIGKKVVGFSPGAC 515
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 122/415 (29%), Positives = 174/415 (41%), Gaps = 83/415 (20%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPK 158
G Y + L GTP + I DT S LVW PC S YR +D P F PK
Sbjct: 90 GEYLVKLGTGTPQHFFSAAI-DTASDLVWMQCQPCVSCYRQLD-----------PVFNPK 137
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGLLLS 217
SSS ++ C + C+ + G G AC Y +Y G G T G L
Sbjct: 138 LSSSYAVVPCTSDTCAQLDGHRCHEDDDG----------AC-QYTYKYSGHGVTKGTLAI 186
Query: 218 ETLRFPSKTVPNFLAGCSILSDRQPA----GIAGFGRSSESLPSQLGLKKFSYCL---LS 270
+ L + GCS S PA G+ G GR SL SQL + +F YCL +S
Sbjct: 187 DKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQLSVHRFMYCLPPPMS 246
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
R S LVL G + + + ++ T + SS+ + +YY+ L + VG +
Sbjct: 247 R------TSGKLVLGAGADAVRNMSDRVTVT------MSSSTRYPSYYYLNLDGLAVGDQ 294
Query: 331 HVKIPYSYLVPGSDGNG-------------------GVIVDSGSTFTFMEGPLFEAVAKE 371
+ P S G G G+IVD ST +F+E L++ +A +
Sbjct: 295 TPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADD 354
Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENYFAL 428
++ RA + GL CF + G VY+P + L F G + L + F
Sbjct: 355 LEEEI-RLPRATP-SLRLGLDLCFILPEGVGMDRVYVPTVSLSFD-GRWLELDRDRLFVT 411
Query: 429 VGNEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G ++CL+ +GR + ILG+FQLQN + F+L + FAK C
Sbjct: 412 DG-RMMCLM----------IGRTSGVSILGNFQLQNMRVLFNLRRGKITFAKASC 455
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 126/459 (27%), Positives = 185/459 (40%), Gaps = 65/459 (14%)
Query: 38 PLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNY--SNSLIKTP 95
P ST L H D+ + LA+S R + + K G ++ +SL P
Sbjct: 65 PFST--VLTHDDARVAHLASRLAASDPPSRRPTSLRKQKKAAGGASGGHHLDDDSLASVP 122
Query: 96 LSVHS---YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRI 152
LS + G Y L GTP S + DTGSSL W C+ CV V
Sbjct: 123 LSPGTSVGVGNYVTQLGLGTP-STSYAMVVDTGSSLTWLQCSP---CVVSCHRQVG---- 174
Query: 153 PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFT 211
P F P+ SS+ + C +C + + CS N Y YG F+
Sbjct: 175 PLFDPRASSTYASVRCSASQCDELQAATLNP--SACSASNVCI------YQASYGDSSFS 226
Query: 212 AGLLLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFS 265
G L ++T+ F S P+F GC ++ + AG+ G R+ SL QL FS
Sbjct: 227 VGSLSTDTVSFGSTRYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFS 286
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
YCL + +S L GP + SYT P+ SSS Y++ L +
Sbjct: 287 YCLPT-------AASTGYLSIGPYNTGHY---YSYT-----PMASSSLDASLYFITLSGM 331
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
VG + + P + I+DSG+ T + + A++K + M RA
Sbjct: 332 SVGGSPLAVS-----PSEYSSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRA--- 383
Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF-TDNAA 444
S L CF+ + + +P + + F GGA M L N V + CL TD+ A
Sbjct: 384 PAFSILDTCFEGQASQ-LRVPTVAMAFAGGASMKLTTRNVLIDVDDSTTCLAFAPTDSTA 442
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G+ Q Q F + +D+A R GF+ C+
Sbjct: 443 ----------IIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 130/429 (30%), Positives = 185/429 (43%), Gaps = 55/429 (12%)
Query: 63 SLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG--YSISLSFGTPPQASTPF 120
SL+R L K + GS+ +NSL S S G Y + G P Q S F
Sbjct: 141 SLNRKLELSLKGGKQFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQ-SYFF 199
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRI-PAFIPKRSSSSQLIGCQNPKCSWIFGP 179
+ DTGS + W +C C+ N +I P F PK SSS + C + +C +
Sbjct: 200 VPDTGSDVSWL------QCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLL--- 250
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
E+ C +C Y ++YG G FT G L +ET F S ++PN GC
Sbjct: 251 -DEAACDA-----NSC-----IYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIGCGHD 299
Query: 238 SD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
++ AG+ G G + SL SQL FSYCL+ D+ SS L + S DS
Sbjct: 300 NEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLVDL---DSESSSTLDFNADQPS-DSL 355
Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
T +P KN F F YV + + VG K + I S G+GG+IVDSG
Sbjct: 356 T-----SPLVKN-----DRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSG 405
Query: 355 STFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
+T T + +++ + F+ N A V S C+D+S + +V +P + G
Sbjct: 406 TTITEIPSDVYDVLRDAFVGLTKNLPPAPGV---SPFDTCYDLSSQSNVEVPTIAFILPG 462
Query: 415 GAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
+ LP +N F + CL P I+G+ Q Q + +DLAN
Sbjct: 463 ENSLQLPAKNCLFQVDSAGTFCLAFLPSTF--------PLSIIGNVQQQGIRVSYDLANS 514
Query: 474 RFGFAKQKC 482
GF+ KC
Sbjct: 515 LVGFSTDKC 523
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 117/408 (28%), Positives = 171/408 (41%), Gaps = 71/408 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+PP+ + DTGS ++W C S C C + ++ F P S
Sbjct: 79 GLYYTKIRLGSPPRDFYVQV-DTGSDVLWVSCAS---CNGCPQTSGLQIQLNFFDPGSSV 134
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
++ + C + +CSW ++S GCS +N C +Y QYG G T+G +S+ L
Sbjct: 135 TATPVSCSDQRCSW----GIQSSDSGCSVQNNLC-----AYTFQYGDGSGTSGFYVSDVL 185
Query: 221 RFP----SKTVPNFLA----GCS-------ILSDRQPAGIAGFGRSSESLPSQL---GL- 261
+F S VPN A GCS + SDR GI GFG+ S+ SQL GL
Sbjct: 186 QFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLA 245
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
+ FS+CL LVL G+ P + +TP + Y V
Sbjct: 246 PRVFSHCLKGENGGGGI----LVL------GEIVEPNMVFTPLVPSQ--------PHYNV 287
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNG-GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
L I V + + I S S NG G I+D+G+T ++ EA F+ + N
Sbjct: 288 NLLSISVNGQALPINPSVF---STSNGQGTIIDTGTTLAYLS----EAAYVPFVEAITNA 340
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA----LVGNEVLC 435
+ S C+ I+ + P + L F GGA M L P++Y + G V C
Sbjct: 341 VSQSVRPVVSKGNQCYVIATSVADIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWC 400
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ G ILGD L++ +DL R G+A C+
Sbjct: 401 IGFQRIQNQG-------ITILGDLVLKDKIFVYDLVGQRIGWANYDCS 441
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 130/473 (27%), Positives = 189/473 (39%), Gaps = 79/473 (16%)
Query: 35 PLTPLSTKH--------YLHHSDSDPLKILHSLASSSLSRA-----RHLKTKTKPKTKDS 81
P +PL+ H L S I H +++++ R RH + + +
Sbjct: 101 PCSPLAAAHGEPPSHGEILAADQSRAESIQHRVSTTTTGRVNPKRRRHRQQQPPSAPAPA 160
Query: 82 NIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD 141
S+ + SL +P G Y +++ GTP T +FDTGS W C CV
Sbjct: 161 ASLSSSTASLPASPGRALGTGNYVVTVGLGTPASRYT-VVFDTGSDTTWVQCQP---CVV 216
Query: 142 CNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS 201
+ R F P SS+ + C P CS + GCS +
Sbjct: 217 ACYEQ----REKLFDPASSSTYANVSCAAPACS-------DLDVSGCSGGHCL------- 258
Query: 202 YLLQYGLG-FTAGLLLSETLRFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLP 256
Y +QYG G ++ G +TL S V F GC +D + AG+ G GR SLP
Sbjct: 259 YGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLP 318
Query: 257 SQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY--KNPVGSS 311
Q K F++CL +R + LD G GS P + TP P
Sbjct: 319 VQTYGKYGGVFAHCLPARS------TGTGYLDFGAGS----PPATTTTPMLTGNGPT--- 365
Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
FYYVG+ I VG + + I S G IVDSG+ T + + ++
Sbjct: 366 -----FYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVITRLPPAAYSSLRSA 415
Query: 372 FIRQMG--NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
F M Y +AA V S L C+D +G V +P + L F+GGA + + V
Sbjct: 416 FAAAMAARGYRKAAAV---SLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV 472
Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+CL F N G +G I+G+ QL+ F + +D+ GF+ C
Sbjct: 473 SASQVCLA-FAGNEDGGDVG-----IVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 105/396 (26%), Positives = 155/396 (39%), Gaps = 51/396 (12%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + S GTPPQ DT + W PC + C P+ P+F P S++
Sbjct: 94 YLVRASLGTPPQ-RLLLAVDTSNDAAWVPCAGCHGC---------PTTAPSFNPASSATF 143
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ + C P CS P+ C + +C + L YG L + L
Sbjct: 144 RPVPCGAPPCSQAPNPS----CTSLAKSKNSC-----GFSLSYGDSSLDATLSQDNLAVT 194
Query: 224 SK--TVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKK------FSYCLLSRKFDD 275
+ + + GC S+ A G + K FSYCL S
Sbjct: 195 ANGGVIKGYTFGCLTKSNGSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSA 254
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
A S +L L G + TP +P S YYV + + +G K V IP
Sbjct: 255 ANFSGSLTLGR---KGQPAPEKMKTTPLLASPHRPS-----LYYVAMTGVRIGKKSVPIP 306
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-------YSRAADVEKK 388
S L + G ++DSG+ F + P + AV E R++ + V
Sbjct: 307 PSALAFDAATGAGTVLDSGTMFARLAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSL 366
Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPA 447
G C+++S +V P + L F GG ++ LP EN CL + AA PA
Sbjct: 367 GGFDTCYNVS---TVAWPAVTLVFGGGMEVRLPEENVVIRSTYGSTSCLAM----AASPA 419
Query: 448 LGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G A+ ++G Q QN + FD+ N R GFA+++C
Sbjct: 420 DGVNAALNVIGSLQQQNHRVLFDVPNARVGFARERC 455
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 115/411 (27%), Positives = 180/411 (43%), Gaps = 80/411 (19%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTPP+ I DTGS ++W C+S C +C + ++ F SS
Sbjct: 79 GLYFTRVKLGTPPREFNVQI-DTGSDVLWVTCSS---CSNCPQTSGLGIQLNYFDTTSSS 134
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
+++L+ C +P C+ +++ C P++ C SY QYG G T+G +S+T
Sbjct: 135 TARLVPCSHPICT----SQIQTTATQCPPQSNQC-----SYAFQYGDGSGTSGYYVSDTF 185
Query: 221 RFPSKTVPNFLA--------GCSIL-------SDRQPAGIAGFGRSSESLPSQL---GL- 261
F + + +A GCS +D+ GI GFG+ S+ SQL G+
Sbjct: 186 YFDAVLGESLIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGIT 245
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
+ FS+CL K +D+ LVL G+ PG+ Y+P + Y +
Sbjct: 246 PRVFSHCL---KGEDSG-GGILVL------GEILEPGIVYSPLVPSQ--------PHYNL 287
Query: 321 GLRQIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GN 378
L+ I V + + I P ++ + N G I+D+G+T + L E F+ +
Sbjct: 288 DLQSIAVSGQLLPIDPAAF---ATSSNRGTIIDTGTTLAY----LVEEAYDPFVSAITAA 340
Query: 379 YSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
S+ A G C+ +S S P + F GGA M L PE Y ++
Sbjct: 341 VSQLATPTINKG-NQCYLVSNSVSEVFPPVSFNFAGGATMLLKPEEY-----------LM 388
Query: 439 FTDNAAGPALG-------RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ N AG AL +G ILGD L++ +DLA+ R G+A C
Sbjct: 389 YLTNYAGAALWCIGFQKIQGGITILGDLVLKDKIFVYDLAHQRIGWANYDC 439
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 115/393 (29%), Positives = 160/393 (40%), Gaps = 50/393 (12%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y ++ GTP DT S L W C RC P P F P+ S+
Sbjct: 136 GEYIAKIAVGTP-GVEALLALDTASDLTWLQCQPCRRCY--------PQSGPVFDPRHST 186
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + + C + R G + TC Y + YG G T G + ETL
Sbjct: 187 SYREMSFNAADCQAL------GRSGGGDAKRGTC-----VYTVGYGDGSTTVGDFIEETL 235
Query: 221 RFPSKT-VPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLK-KFSYCLLSRKFD 274
F +P GC L AGI G GR S P+Q+ FSYCL+ F
Sbjct: 236 TFAGGVRLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQIDHNGTFSYCLV--DFL 293
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
P S + L G G+ D+ P +S+TP N FYYV L I VG V++
Sbjct: 294 SGPGSLSSTLTFGAGAVDTSPP-VSFTPTVLN-----LNMPTFYYVRLTGISVGG--VRV 345
Query: 335 P----YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
P + G GGVIVDSG+ T + P + A F R + + SG
Sbjct: 346 PGVTERDLQLDPYTGRGGVIVDSGTAVTRLARPAYTAFRDAF-RAVAVDLGQVSIGGPSG 404
Query: 391 -LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
C+ + G+ +P + + F G ++ L P+NY L+ + + + F A A G
Sbjct: 405 FFDTCYTVGGRGMKKVPTVSMHFAGSVEVKLQPKNY--LIPVDSMGTVCF----AFAATG 458
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q Q F + +D+ R GFA C
Sbjct: 459 DHSVSIIGNIQQQGFRIVYDIGG-RVGFAPNSC 490
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 114/390 (29%), Positives = 170/390 (43%), Gaps = 56/390 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +++ GTP + T +FDTGS + W C C+ +P + F P +S+
Sbjct: 133 GNYVVTVGLGTPKEDFT-LVFDTGSGITWTQCQP---CLGSCYPQ----KEQKFDPTKST 184
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
S + C + C+ + P E +GCS N TC Y + YG ++ G +ETL
Sbjct: 185 SYNNVSCSSASCNLL--PTSE---RGCSASNSTCL-----YQIIYGDQSYSQGFFATETL 234
Query: 221 RFPSKTV-PNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLG---LKKFSYCLLSRKF 273
S V NFL GC ++ Q AG+ G SS SLPSQ K+FSYCL S
Sbjct: 235 TISSSDVFTNFLFGCGQSNNGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPS--- 291
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
P S+ + G S+T G +TP S AF FY + + I V +
Sbjct: 292 --TPSSTGYL---NFGGKVSQTAG--FTPI-------SPAFSSFYGIDIVGISVAGSQLP 337
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
I P G I+DSG+ T + ++A+ + F +M NY + E L
Sbjct: 338 ID-----PSIFTTSGAIIDSGTVITRLPPTAYKALKEAFDEKMSNYPKTNGDEL---LDT 389
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
C+D S +V P++ + FKGG ++ + LV + + F N G
Sbjct: 390 CYDFSNYTTVSFPKVSVSFKGGVEVDIDASGILYLVNGVKMVCLAFAANKDDSEFG---- 445
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+ Q + + + +D A GFA C+
Sbjct: 446 -IFGNHQQKTYEVVYDGAKGMIGFAAGACS 474
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 105 bits (261), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 132/432 (30%), Positives = 188/432 (43%), Gaps = 61/432 (14%)
Query: 63 SLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG--YSISLSFGTPPQASTPF 120
SL+R L K + GS+ +NSL S S G Y + G P Q S F
Sbjct: 141 SLNRKLELSLKGGKQFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQ-SYFF 199
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRI-PAFIPKRSSSSQLIGCQNPKCSWIFGP 179
+ DTGS + W +C C+ N +I P F PK SSS + C + +C +
Sbjct: 200 VPDTGSDVSWL------QCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLL--- 250
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSIL 237
E+ C +C Y ++YG G FT G L +ET F S ++PN GC
Sbjct: 251 -DEAACDA-----NSC-----IYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIGCGHD 299
Query: 238 SD---RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSK 294
++ G+ G G + SL SQL FSYCL+ D+ SS L + S DS
Sbjct: 300 NEGLFVGADGLIGLGGGAISLSSQLEATSFSYCLVDL---DSESSSTLDFNADQPS-DSL 355
Query: 295 TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSG 354
T +P KN F F YV + + VG K + I S G+GG+IVDSG
Sbjct: 356 T-----SPLVKN-----DRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSG 405
Query: 355 STFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
+T T + +++ + F+ N A V S C+D+S + +V +P + G
Sbjct: 406 TTITEIPSDVYDVLRDAFVGLTKNLPPAPGV---SPFDTCYDLSSQSNVEVPTIAFILPG 462
Query: 415 GAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI----ILGDFQLQNFYLEFDL 470
+ LP +N CLI D+A L P+ I+G+ Q Q + +DL
Sbjct: 463 ENSLQLPAKN----------CLIQ-VDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDL 511
Query: 471 ANDRFGFAKQKC 482
AN GF+ KC
Sbjct: 512 ANSLVGFSTDKC 523
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 105/387 (27%), Positives = 164/387 (42%), Gaps = 55/387 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + G+PPQ + DT + W PCT+ C F P++S++
Sbjct: 98 YIVRAKIGSPPQ-TLLLAMDTSNDAAWIPCTACDGCTST-----------LFAPEKSTTF 145
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ + C +P+C+ + P +C + ++ L YG A ++ +T+
Sbjct: 146 KNVSCGSPQCNQVPNP--------------SCGTSACTFNLTYGSSSIAANVVQDTVTLA 191
Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
+ +P++ GC + + P G+ G GR SL SQ L FSYCL S F
Sbjct: 192 TDPIPDYTFGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLN 249
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S +L L GP + + + YTP KNP SS YYV L I VG K V IP
Sbjct: 250 FSGSLRL--GPVAQPIR---IKYTPLLKNPRRSS-----LYYVNLVAIRVGRKVVDIPPE 299
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA-DVEKKSGLRPCFD 396
L + G + DSG+ FT + P + AV EF R++ ++A V G C+
Sbjct: 300 ALAFNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYT 359
Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAII 455
+ + P + F G + LP +N CL + A+ P +
Sbjct: 360 V----PIVAPTITFMF-SGMNVTLPEDNILIHSTAGSTTCLAM----ASAPDNVNSVLNV 410
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ + Q QN + +D+ N R G A++ C
Sbjct: 411 IANMQQQNHRVLYDVPNSRLGVARELC 437
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 114/446 (25%), Positives = 181/446 (40%), Gaps = 70/446 (15%)
Query: 49 DSDPLKILHSLASSSLSR---ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYS 105
D++ +K + S S +L R + L + T P S IGS Y
Sbjct: 4 DNERVKYIQSRLSKNLGRENTVKDLDSTTLPAESGSLIGS----------------ANYV 47
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
+ + GTP + +FDTGS L W C C + D F P +SSS
Sbjct: 48 VVVGLGTPKR-DLSLVFDTGSDLTWTQCEP---CAGSCYKQQDA----IFDPSKSSSYTN 99
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRF-P 223
I C + C+ + ++S C S + +C Y +YG T+ G L E L
Sbjct: 100 ITCTSSLCTQLTSDGIKSECS--SSTDASCI-----YDAKYGDNSTSVGFLSQERLTITA 152
Query: 224 SKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAP 277
+ V +FL GC ++ AG+ G GR S+ Q K FSYCL P
Sbjct: 153 TDIVDDFLFGCGQDNEGLFNGSAGLMGLGRHPISIVQQTSSNYNKIFSYCL--------P 204
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
+S+ + G+ + L YTP S+ G+ + GL + + K+P
Sbjct: 205 ATSSSLGHLTFGASAATNASLIYTPL-------STISGDNSFYGLDIVSISVGGTKLPA- 256
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL-RPCFD 396
+ + GG I+DSG+ T + ++ A+ F R M Y V ++GL C+D
Sbjct: 257 -VSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXMEKYP----VANEAGLLDTCYD 311
Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIIL 456
+SG K + +P + +F GG + L + + +CL F N + + +
Sbjct: 312 LSGYKEISVPRIDFEFSGGVTVELXHRGILXVESEQQVCLA-FAANGSDNDI-----TVF 365
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
G+ Q + + +D+ R GF C
Sbjct: 366 GNVQQKTLEVVYDVKGGRIGFGAAGC 391
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 113/389 (29%), Positives = 168/389 (43%), Gaps = 55/389 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G+PP+ S + D+GS +VW C C +C + DP F P S+
Sbjct: 135 GEYFVRIGVGSPPR-SQYVVIDSGSDIVWVQCQP---CSEC-YQQSDP----VFDPAGSA 185
Query: 162 SSQLIGCQNPKCSWIFGPNV-ESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
+ I C + C + + RC+ Y + YG G +T G L ET
Sbjct: 186 TYAGISCDSSVCDRLDNAGCNDGRCR---------------YEVSYGDGSYTRGTLALET 230
Query: 220 LRFPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKF 273
L F + N GC ++ AG+ G G + S QLG + FSYCL+SR
Sbjct: 231 LTFGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRG- 289
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
S L+ G G+ G ++ P +NP S FYYVGL + VG V
Sbjct: 290 ----TESTGTLEFGRGA---MPVGAAWVPLIRNPRAPS-----FYYVGLSGLGVGGIRVP 337
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
IP G GGV++D+G+ T + P +EA FI Q N R+ ++ S
Sbjct: 338 IPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRS---DRVSIFDT 394
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
C++++G SV +P + F GG + LP N+ V E F +A+G +
Sbjct: 395 CYNLNGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLS------ 448
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + + D +N GF C
Sbjct: 449 -IIGNIQQEGIQISIDGSNGFVGFGPTIC 476
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 111/387 (28%), Positives = 158/387 (40%), Gaps = 55/387 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + + GTPPQ ++ I D LVW C RC + P F P S++
Sbjct: 51 YVANFTIGTPPQPASAVI-DLAGELVWTQCKQCGRCFE--------QGTPLFDPTASNTY 101
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ C P C I P+ C G C +Y G T G + ++T F
Sbjct: 102 RAEPCGTPLCESI--PSDVRNCSG-----NVC-----AYEASTNAGDTGGKVGTDT--FA 147
Query: 224 SKTVPNFLA-GCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
T LA GC + SD P+GI G GR+ SL +Q G+ FSYCL DA
Sbjct: 148 VGTAKASLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPH---DAGK 204
Query: 279 SSNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
+S L L + G G + + TPF N G+ + +Y V L + G + +P
Sbjct: 205 NSALFLGSSAKLAGGGKAAS-----TPFV-NISGNGNDLSNYYKVQLEGLKAGDAMIPLP 258
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
S GS V++D+ S +F+ ++AV K +G A VE CF
Sbjct: 259 PS----GST----VLLDTFSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEP---FDLCF 307
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
SG P+L+ F+GGA M +P NY N +CL + + +
Sbjct: 308 PKSGASGAA-PDLVFTFRGGAAMTVPATNYLLDYKNGTVCLAMLSSARLNSTTELS---L 363
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
LG Q +N + FDL + F C
Sbjct: 364 LGSLQQENIHFLFDLDKETLSFEPADC 390
>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
gi|238008190|gb|ACR35130.1| unknown [Zea mays]
gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
Length = 269
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 133/284 (46%), Gaps = 32/284 (11%)
Query: 211 TAGLLLSETLRFPSKT--VPNFLAGCSILSDRQPAG---IAGFGRSSESLPSQLGLKKFS 265
+ G+L +ET F + N GC L++ AG I G S+ QL + KFS
Sbjct: 3 STGVLATETFTFGAHQNFSANLTFGCGKLTNGTIAGASGIMGVSPGPLSVLKQLSITKFS 62
Query: 266 YCLLSRKFDD---APVSSNLVLDTGPGSGDSKTPGLSYT-PFYKNPVGSSSAFGEFYYVG 321
YCL F D +PV + D G KT G T P KNPV +YYV
Sbjct: 63 YCL--TPFTDHKTSPVMFGAMADLG----KYKTTGKVQTIPLLKNPVEDI-----YYYVP 111
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
+ I +GSK + +P + L DG GG ++DS +T ++ P F+ + K + M +
Sbjct: 112 MVGISIGSKRLDVPEAILALRPDGTGGTVLDSATTLAYLVEPAFKELKKAVMEGMKLPAA 171
Query: 382 AADVEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
++ CF++ + V +P L+L F G A+M+LP ++YF ++CL
Sbjct: 172 NRSIDDYP---VCFELPRGMSMEGVQVPPLVLHFAGDAEMSLPRDSYFQEPSPGMMCL-- 226
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A A G ++G+ Q QN ++ +DL N +F +A KC
Sbjct: 227 ----AVMQAPFEGAPNVIGNVQQQNMHVLYDLGNRKFSYAPTKC 266
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 130/473 (27%), Positives = 189/473 (39%), Gaps = 79/473 (16%)
Query: 35 PLTPLSTKH--------YLHHSDSDPLKILHSLASSSLSR-----ARHLKTKTKPKTKDS 81
P +PL+ H L S I H +++++ R +RH + + +
Sbjct: 98 PCSPLAAAHGEPPSHGEILAADQSRAESIQHRVSTTTTGRVNPKRSRHRQQQPPSAPAPA 157
Query: 82 NIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD 141
S+ + SL +P G Y +++ GTP T +FDTGS W C CV
Sbjct: 158 ASLSSSTASLPASPGRALGTGNYVVTVGLGTPASRYT-VVFDTGSDTTWVQCQP---CVV 213
Query: 142 CNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS 201
+ R F P SS+ + C P CS + GCS +
Sbjct: 214 ACYEQ----REKLFDPASSSTYANVSCAAPACSDL-------DVSGCSGGHCL------- 255
Query: 202 YLLQYGLG-FTAGLLLSETLRFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLP 256
Y +QYG G ++ G +TL S V F GC +D + AG+ G GR SLP
Sbjct: 256 YGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLP 315
Query: 257 SQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY--KNPVGSS 311
Q K F++CL R + LD G GS P + TP P
Sbjct: 316 VQTYGKYGGVFAHCLPPRS------TGTGYLDFGAGS----PPATTTTPMLTGNGPT--- 362
Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
FYYVG+ I VG + + I S G IVDSG+ T + + ++
Sbjct: 363 -----FYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVITRLPPAAYSSLRSA 412
Query: 372 FIRQMG--NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
F M Y +AA V S L C+D +G V +P + L F+GGA + + V
Sbjct: 413 FAAAMAARGYRKAAAV---SLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV 469
Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+CL F N G +G I+G+ QL+ F + +D+ GF+ C
Sbjct: 470 SASQVCLA-FAGNEDGGDVG-----IVGNTQLKTFGVAYDIGKKVVGFSPGAC 516
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 100/387 (25%), Positives = 159/387 (41%), Gaps = 55/387 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + GTPPQ + DT + W PCT+ C F P++S++
Sbjct: 97 YIVRAKIGTPPQ-TLLLAIDTSNDAAWIPCTACDGCTST-----------LFAPEKSTTF 144
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ + C +P+C+ + P +C + ++ L YG A ++ +T+
Sbjct: 145 KNVSCGSPECNKVPSP--------------SCGTSACTFNLTYGSSSIAANVVQDTVTLA 190
Query: 224 SKTVPNFLAGCSILSD------RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
+ +P + GC + + G+ S S L FSYCL S F
Sbjct: 191 TDPIPGYTFGCVAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLN 248
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S +L L GP + + + YTP KNP SS YYV L I VG K V IP +
Sbjct: 249 FSGSLRL--GPVAQPIR---IKYTPLLKNPRRSS-----LYYVNLFAIRVGRKIVDIPPA 298
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA-DVEKKSGLRPCFD 396
L + G + DSG+ FT + P++ AV EF R++ ++A V G C+
Sbjct: 299 ALAFNAATGAGTVFDSGTVFTRLVAPVYTAVRDEFRRRVAMAAKANLTVTSLGGFDTCYT 358
Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAII 455
+ + P + F G + LP +N CL + A+ P +
Sbjct: 359 V----PIVAPTITFMF-SGMNVTLPQDNILIHSTAGSTSCLAM----ASAPDNVNSVLNV 409
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ + Q QN + +D+ N R G A++ C
Sbjct: 410 IANMQQQNHRVLYDVPNSRLGVARELC 436
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 119/406 (29%), Positives = 170/406 (41%), Gaps = 68/406 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G + +S++ GTPP I DTGS L W C +C N P F K+SS
Sbjct: 83 GEFFMSITIGTPP-IKVFAIADTGSDLTWVQCKPCQQCYKENGP--------IFDKKKSS 133
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ + C + C + S +GC N C Y YG F+ G + +ET+
Sbjct: 134 TYKSEPCDSRNCQAL-----SSTERGCDESNNICK-----YRYSYGDQSFSKGDVATETV 183
Query: 221 RFPSKT-----VPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCL 268
S + P + GC + D +GI G G SL SQLG KKFSYCL
Sbjct: 184 SIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCL 243
Query: 269 LSRKFDDAPVSSNLVLDTGPGS---GDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQ 324
+ A + V++ G S SK G+ TP K P+ +YY+ L
Sbjct: 244 SHKS---ATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPL-------TYYYLTLEA 293
Query: 325 IIVGSKHVKIPY--SYLVPGSDG-----NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
I VG K KIPY S P DG +G +I+DSG+T T +E F+ + +
Sbjct: 294 ISVGKK--KIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVT 351
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
R +D + L CF SG + LPE+ + F GA + L P N F + +++CL
Sbjct: 352 GAKRVSD--PQGLLSHCFK-SGSAEIGLPEITVHFT-GADVRLSPINAFVKLSEDMVCLS 407
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ I G+F +F + +DL F C+
Sbjct: 408 MVPTTEVA---------IYGNFAQMDFLVGYDLETRTVSFQHMDCS 444
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 107/389 (27%), Positives = 164/389 (42%), Gaps = 54/389 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G+P + + I DTGSSL W C CV DP F P S
Sbjct: 11 GNYYVKVGLGSPARYYS-MIVDTGSSLSWLQCKP---CVVYCHVQADP----LFDPSASK 62
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
+ + + C + +CS + + + C + C Y +G+ + LL+
Sbjct: 63 TYKSLSCTSSQCSSLVDATLNNPL--CETSSNVCVYTASYGDSSYSMGYLSQDLLTLA-- 118
Query: 222 FPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDD 275
PS+T+P F+ GC S+ + AGI G GR+ S+ Q+ K FSYCL +R
Sbjct: 119 -PSQTLPGFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTR---- 173
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
L G S +TP +P S Y++ L I VG + + +
Sbjct: 174 ---GGGGFLSIGKAS--LAGSAYKFTPMTTDPGNPS-----LYFLRLTAITVGGRALGVA 223
Query: 336 YS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAADVEKKSGLRP 393
+ Y VP I+DSG+ T + ++ + F++ M + Y+RA S L
Sbjct: 224 AAQYRVP-------TIIDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGF---SILDT 273
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
CF + K +PE+ L F+GGA + L P N V + CL +N
Sbjct: 274 CFKGNLKDMQSVPEVRLIFQGGADLNLRPVNVLLQVDEGLTCLAFAGNNGVA-------- 325
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q Q F + D++ R GFA C
Sbjct: 326 -IIGNHQQQTFKVAHDISTARIGFATGGC 353
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 119/406 (29%), Positives = 170/406 (41%), Gaps = 51/406 (12%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
L H ++SL+ GTPPQ T + DTGS L W C + +F
Sbjct: 58 LRFHHNVSLTVSLAVGTPPQNVT-MVLDTGSELSWLLCAPGGGGGGGGRSAL------SF 110
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GL 214
P+ S + + C + +C P+ + C G S K C ++ L Y G ++ G
Sbjct: 111 RPRASLTFASVPCDSAQCRSRDLPSPPA-CDGAS---KQCRVS-----LSYADGSSSDGA 161
Query: 215 LLSETLRFPSKTVPNFLAGCSILS-DRQPAGIA-----GFGRSSESLPSQLGLKKFSYCL 268
L +E GC + D P G+A G R + S SQ ++FSYC+
Sbjct: 162 LATEVFTVGQGPPLRAAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSYCI 221
Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY-VGLRQIIV 327
R DDA V L+L G D L+YTP Y+ P F Y V L I V
Sbjct: 222 SDR--DDAGV---LLL----GHSDLPFLPLNYTPLYQ-PAMPLPYFDRVAYSVQLLGIRV 271
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
G K + IP S L P G G +VDSG+ FTF+ G + A+ EF RQ + A +
Sbjct: 272 GGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPN 331
Query: 388 ---KSGLRPCFDISGKKS--VYLPELILKFKGGAKMALPPENYFALV------GNEVLCL 436
+ CF + ++ LP + L F GA+M + + V G+ V CL
Sbjct: 332 FAFQEAFDTCFRVPQGRAPPARLPAVTLLFN-GAQMTVAGDRLLYKVPGERRGGDGVWCL 390
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
F + P A ++G N ++E+DL R G A +C
Sbjct: 391 T-FGNADMVPIT----AYVIGHHHQMNVWVEYDLERGRVGLAPIRC 431
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 109/428 (25%), Positives = 183/428 (42%), Gaps = 55/428 (12%)
Query: 66 RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
R R L+++ K +NI + S + + + + + Y +++ G + I DTG
Sbjct: 30 RVRSLQSRIKSIFSGNNIDALDSQIPLSSGVRLQTLN-YIVTVEIGG---RNMTVIVDTG 85
Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
S L W C C + + P F P S S Q I C + C +
Sbjct: 86 SDLTWVQCQPCRLCYN--------QQDPLFNPSGSPSYQTILCNSSTCQSL--QYATGNL 135
Query: 186 KGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR---Q 241
C TC +Y++ YG G +T G L E L + V NF+ GC +
Sbjct: 136 GVCGSNTPTC-----NYVVNYGDGSYTRGDLGMEQLNLGTTHVSNFIFGCGRNNKGLFGG 190
Query: 242 PAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGL 298
+G+ G G+S SL SQ FSYCL + D S +L+L + TP +
Sbjct: 191 ASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAAD---ASGSLILGGNSSVYKNTTP-I 246
Query: 299 SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFT 358
SYT NP FY++ L I +G ++ P + G+++DSG+ T
Sbjct: 247 SYTRMIANP-----QLPTFYFLNLTGISIGGVALQAP-------NYRQSGILIDSGTVIT 294
Query: 359 FMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKM 418
+ P++ + EF++Q + A S L CF+++G V +P + ++F+G A++
Sbjct: 295 RLPPPVYRDLKAEFLKQFSGFPSAPPF---SILDTCFNLNGYDEVDIPTIRMQFEGNAEL 351
Query: 419 ALPPENYFALVGNEV--LCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRF 475
+ F V + +CL L +L I I+G++Q +N + ++ +
Sbjct: 352 TVDVTGIFYFVKTDASQVCLAL-------ASLSFDDEIPIIGNYQQRNQRVIYNTKESKL 404
Query: 476 GFAKQKCA 483
GFA + C+
Sbjct: 405 GFAAEACS 412
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 137/499 (27%), Positives = 209/499 (41%), Gaps = 85/499 (17%)
Query: 27 SSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSN 86
SSA+T P H ++L LA+ S +RA L + + + G+
Sbjct: 19 SSASTPAAPAVRADLTHVDSGRGFTSRELLRRLATRSRARASRLYSSSSSSSSARPAGAG 78
Query: 87 YSNSLIKTPLSVHSYGG------YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCV 140
+ + PL+ + G Y I LS GTP DTGS LVW C
Sbjct: 79 --SHAVTAPLARGTVGDADIDSEYLIHLSIGTPRPQRVALTLDTGSDLVWTQCA------ 130
Query: 141 DCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP 200
C+ P P F S ++ + C +P C+ P GC+ + TC
Sbjct: 131 -CHVCFAQP--FPTFDALASQTTLAVPCSDPICTSGKYP-----LSGCTFNDNTC----- 177
Query: 201 SYLLQYG-LGFTAGLLLSETLRFPSK------------TVPNFLAGCSILSD----RQPA 243
YL Y T+G ++ +T F S VPN GC + +
Sbjct: 178 FYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGVAVPNVRFGCGQYNKGIFKSNES 237
Query: 244 GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGS---GDSKTPGLSY 300
GIAGF R SLPSQL + +FS+C + DA +S + L PG G T +
Sbjct: 238 GIAGFSRGPMSLPSQLKVARFSHCFTA--IADAR-TSPVFLGGAPGPDNLGAHATGPVQS 294
Query: 301 TPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV----PGSDGNGGVIVDSGST 356
TPF +++ G YY+ L+ I VG ++P + L G+GG I+DSG+
Sbjct: 295 TPF-------ANSNGSLYYLTLKGITVGK--TRLPLNALAFAGKGTGSGSGGTIIDSGTG 345
Query: 357 FTFMEGPLFEAVAKEFIRQ----MGNYSRAADVEKK---SGLRPCFDISGKKSVYLPELI 409
+ GP++ ++ F+ + + N S AAD E R + LP+++
Sbjct: 346 IRTLPGPMYRSLRAAFVARVKLPVANES-AADAESTLCFEAARSASLPPEAPAPALPKVV 404
Query: 410 LKFKGGAKMALPPENY-FALVGNEV-----LCLILFTDNAAGPALGRGPAIILGDFQLQN 463
L GA LP E+Y L+ +E LCL++ N+AG + I+G+FQ QN
Sbjct: 405 LHV-AGADWDLPRESYVLDLLEDEDGSGSGLCLVM---NSAGDS----DLTIIGNFQQQN 456
Query: 464 FYLEFDLANDRFGFAKQKC 482
++ +DL ++ F +C
Sbjct: 457 MHVAYDLEKNKLVFVPARC 475
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 131/499 (26%), Positives = 202/499 (40%), Gaps = 86/499 (17%)
Query: 5 PFSLICL-FSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSS 63
PF CL F + LF+T+A S TV L H DS PL ++ + +
Sbjct: 3 PFVFFCLAFYSVSSLFSTEANESPSGFTVD-----------LIHRDS-PLSPFYNPSLTP 50
Query: 64 LSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFD 123
R + ++ + + + +N L ++ L +H+ G Y + GTPP D
Sbjct: 51 SQRIINAALRSISRLNRVSNLLDQNNKLPQSVLILHN-GEYLMRFYIGTPPVERLA-TAD 108
Query: 124 TGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVES 183
TGS L+W C+ C C P P F P +SS+ C++ C+ +
Sbjct: 109 TGSDLIWVQCSP---CASCF-----PQSTPLFQPLKSSTFMPTTCRSQPCTLLLPEQ--- 157
Query: 184 RCKGCSPRNKTCPLACPSYLLQYG--LGFTAGLLLSETLRFPSK------TVPNFLAGC- 234
KGC + Y +YG F+ GLL +ETLRF S+ PN GC
Sbjct: 158 --KGCGKSGECI------YTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSFFGCG 209
Query: 235 -----SILSDRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDT 286
++ + GI G G SL SQ+G + KFSYCLL P+ S
Sbjct: 210 LYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCLL-------PLGSTSTSKL 262
Query: 287 GPGSGDSKT-PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG 345
G+ T G+ TP P +Y++ L + V K VP
Sbjct: 263 KFGNESIITGEGVVSTPMIIKP-----WLPTYYFLNLEAVTVAQKT--------VPTGST 309
Query: 346 NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYL 405
+G VI+DSG+ T++ + A + DV S L CF + +
Sbjct: 310 DGNVIIDSGTLLTYLGESFYYNFAASLQESLA-VELVQDV--LSPLPFCFPY--RDNFVF 364
Query: 406 PELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
PE+ +F GA+++L P N F + + +CL++ + +G + I G F +F
Sbjct: 365 PEIAFQFT-GARVSLKPANLFVMTEDRNTVCLMIAPSSVSGIS-------IFGSFSQIDF 416
Query: 465 YLEFDLANDRFGFAKQKCA 483
+E+DL + F C+
Sbjct: 417 QVEYDLEGKKVSFQPTDCS 435
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 126/449 (28%), Positives = 181/449 (40%), Gaps = 64/449 (14%)
Query: 44 YLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG 103
+ H S S +LH +ASS R +L + K+K +++ N L G
Sbjct: 54 HTHVSASVIDTVLH-MASSDSHRFTYLSSLVAGKSKPTSVPVASGNQL--------HIGN 104
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + GTPPQ + DT + VW PC+ C C+ N S S+
Sbjct: 105 YVVRARLGTPPQLMF-MVLDTSNDAVWLPCSG---CSGCS--NASTSFNTNSSSTYST-- 156
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG--LGFTAGLLLSETLR 221
+ C +C+ G C +P+ C S+ YG F+A L+ +TL
Sbjct: 157 --VSCSTTQCTQARGLT----CPSSTPQPSIC-----SFNQSYGGDSSFSANLV-QDTLT 204
Query: 222 FPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDD 275
+PNF GC + + P G+ G GR SL SQ L FSYCL S +
Sbjct: 205 LSPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFY 264
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
S L L P S + YTP +NP S YYV L + VGS V +
Sbjct: 265 FSGSLKLGLLGQPKS-------IRYTPLLRNPRRPS-----LYYVNLTGVSVGSVQVPVD 312
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPC 394
YL S+ G I+DSG+ T P++EA+ EF +Q+ G++S + C
Sbjct: 313 PVYLTFDSNSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNGSFSTLGAFDT------C 366
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGPA 453
F S P++ L + LP EN L CL + A+
Sbjct: 367 F--SADNENVTPKITLHMT-SLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLN--- 420
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++ + Q QN + FD+ N R G A + C
Sbjct: 421 -VIANLQQQNLRILFDVPNSRIGIAPEPC 448
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 126/442 (28%), Positives = 180/442 (40%), Gaps = 65/442 (14%)
Query: 55 ILHSLASSSLSRARHLKTKTKPKTK-DSNIGSNYSNSLIKTPLSVHS---YGGYSISLSF 110
+ H A S AR KT + T D++ + + SL PLS + G Y +
Sbjct: 69 LTHDDARISSLAARLAKTPSARATSLDADADAGLAGSLASVPLSPGASVGVGNYVTRMGL 128
Query: 111 GTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
GTP + DTGSSL W C+ V C+ + P F PK SS+ +GC
Sbjct: 129 GTPATQYV-MVVDTGSSLTWLQCSPCL--VSCHRQSG-----PVFNPKSSSTYASVGCSA 180
Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTVPN 229
+CS + P+ CS N Y YG F+ G L +T+ F S ++PN
Sbjct: 181 QQCSDL--PSATLNPSACSSSNVCI------YQASYGDSSFSVGYLSKDTVSFGSTSLPN 232
Query: 230 FLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLV 283
F GC ++ + AG+ G R+ SL QL F+YCL S +
Sbjct: 233 FYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSS----SGYLSL 288
Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV--KIPYSYLVP 341
PG SYTP SSS Y++ L + V + +P
Sbjct: 289 GSYNPGQ-------YSYTPMV-----SSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLP 336
Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKK 401
I+DSG+ T + ++ A++K M SRA+ S L CF +
Sbjct: 337 -------TIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRAS---AYSILDTCFKGQASR 386
Query: 402 SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQL 461
V P + + F GGA + L +N V + CL A PA R AII G+ Q
Sbjct: 387 -VSAPAVTMSFAGGAALKLSAQNLLVDVDDSTTCL------AFAPA--RSAAII-GNTQQ 436
Query: 462 QNFYLEFDLANDRFGFAKQKCA 483
Q F + +D+ + R GFA C+
Sbjct: 437 QTFSVVYDVKSSRIGFAAGGCS 458
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 157/387 (40%), Gaps = 55/387 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + + GTPPQ ++ I D LVW C RC + P F P S++
Sbjct: 51 YVANFTIGTPPQPASAVI-DLAGELVWTQCKQCSRCFE--------QDTPLFDPTASNTY 101
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ C P C I P+ C G C +Y G T G + ++T F
Sbjct: 102 RAEPCGTPLCESI--PSDSRNCSG-----NVC-----AYQASTNAGDTGGKVGTDT--FA 147
Query: 224 SKTVPNFLA-GCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
T LA GC + SD P+GI G GR+ SL +Q G+ FSYCL DA
Sbjct: 148 VGTAKASLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPH---DAGK 204
Query: 279 SSNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
+S L L + G G + + TPF N G+ + +Y V L + G + +P
Sbjct: 205 NSALFLGSSAKLAGGGKAAS-----TPFV-NISGNGNDLSNYYKVQLEGLKAGDAMIPLP 258
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
S GS V++D+ S +F+ ++AV K +G A VE CF
Sbjct: 259 PS----GST----VLLDTFSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEP---FDLCF 307
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
SG P+L+ F+GGA M + NY N +CL + + +
Sbjct: 308 PKSGASGAA-PDLVFTFRGGAAMTVAASNYLLDYKNGTVCLAMLSSARLNSTTELS---L 363
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
LG Q +N + FDL + F C
Sbjct: 364 LGSLQQENIHFLFDLDKETLSFEPADC 390
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 118/445 (26%), Positives = 193/445 (43%), Gaps = 80/445 (17%)
Query: 54 KILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTP 113
+ + +L + S +R R + + + S G+ + +++PL GGY + +S GTP
Sbjct: 10 EAIRALVAKSHARVRWMAARANSSSWSSMAGT----TDVESPLHPDG-GGYVMDISVGTP 64
Query: 114 PQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
+ I DTGS LVW PCT C+ + F P++SS+ + + C +
Sbjct: 65 GKRFRA-IADTGSDLVWVQSEPCTG------CSGGTI-------FDPRQSSTFREMDCSS 110
Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP-----SK 225
C+ + G C P + TC SY +YG G T G +T+ S+
Sbjct: 111 QLCAELPGS--------CEPGSSTC-----SYSYEYGSGETEGEFARDTISLGTTSDGSQ 157
Query: 226 TVPNFLAGCSILSD--RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSS 280
P+F GC +++ G+ G G+ SL SQL KFSYCL+ + SS
Sbjct: 158 KFPSFAVGCGMVNSGFDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLV--DINSQSESS 215
Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
L+ GP + T G+ T S + +Y + + I V + + P
Sbjct: 216 PLLF--GPSAALHGT-GIQSTKITP----PSDTYPTYYLLTVNGIAVAGQTMGSP----- 263
Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS-GLRPCFDISG 399
G I+DSG+T T++ ++ V + +M + V+ S GL C+D S
Sbjct: 264 ------GTTIIDSGTTLTYVPSGVYGRV----LSRMESMVTLPRVDGSSMGLDLCYDRSS 313
Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVGN--EVLCLILFTDNAAGPALGRGPAIILG 457
++ P L ++ G A M P NYF +V + + +CL A G A G P I+G
Sbjct: 314 NRNYKFPALTIRLAG-ATMTPPSSNYFLVVDDSGDTVCL------AMGSASGL-PVSIIG 365
Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
+ Q +++ +D + F + KC
Sbjct: 366 NVMQQGYHILYDRGSSELSFVQAKC 390
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 162/391 (41%), Gaps = 59/391 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +++ GTP T +FDTGS W C CV + R F P RSS
Sbjct: 178 GNYVVTVGLGTPASRYT-VVFDTGSDTTWVQCQP---CVVVCYEQ----REKLFDPARSS 229
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ + C P CS + N+ GCS + Y +QYG G ++ G +TL
Sbjct: 230 TYANVSCAAPACSDL---NIH----GCSGGHCL-------YGVQYGDGSYSIGFFAMDTL 275
Query: 221 RFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKF 273
S V F GC ++ + AG+ G GR SLP Q K F++CL +R
Sbjct: 276 TLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARS- 334
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
+ LD G GS + L+ +N FYYVG+ I VG + +
Sbjct: 335 -----TGTGYLDFGAGSLAAARARLTTPMLTENGP-------TFYYVGMTGIRVGGQLLS 382
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV--AKEFIRQMGNYSRAADVEKKSGL 391
IP S G IVDSG+ T + + ++ A Y +A V S L
Sbjct: 383 IPQSVFA-----TAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAV---SLL 434
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C+D +G V +P + L F+GGA++ + +CL F N G +G
Sbjct: 435 DTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLA-FAANEDGGDVG-- 491
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ QL+ F + +D+ GF C
Sbjct: 492 ---IVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 129/452 (28%), Positives = 195/452 (43%), Gaps = 85/452 (18%)
Query: 61 SSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPF 120
+SS+ R L++K K ++G+ +SLI P + S G+ ++LS G+PP +
Sbjct: 68 TSSIERFDFLESKIKEL---KSVGNEARSSLI--PFNRGS--GFLVNLSIGSPP-VTQLV 119
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
+ DTGSSL+W C C++C F P +S S + +GC P ++I G
Sbjct: 120 VVDTGSSLLWVQCLP---CINCF-----QQSTSWFDPLKSVSFKTLGCGFPGYNYINGYK 171
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGLLLSETLRFPSKTV------------ 227
C+ N+ Y L+Y G + G+L E+L F +
Sbjct: 172 -------CNRFNQ------AEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAISTQ 218
Query: 228 ------PNFLAGCSILS-----DRQPAGIAGFGRSSE-SLPSQLGLKKFSYCLLSRKFDD 275
N GC ++ D G+ G G ++ +QLG KFSYC+ +
Sbjct: 219 ISKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLG-NKFSYCIGDI---N 274
Query: 276 APVSSNLVLDTGPGS---GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
P+ ++ L G GS GDS TP + FG YYV L+ I VGSK +
Sbjct: 275 NPLYTHNHLVLGQGSYIEGDS-------TPLQIH-------FGH-YYVTLQSISVGSKTL 319
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGL 391
KI + SDG+GGV++DSG T+T + FE + E + M G R K GL
Sbjct: 320 KIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGL 379
Query: 392 RPCFD-ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
CF + + V P + F GGA + L + F G + CL + N+ L
Sbjct: 380 --CFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLNLS- 436
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++G QN+ + FDL + F + C
Sbjct: 437 ----VIGILAQQNYNVGFDLEQMKVFFRRIDC 464
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 119/406 (29%), Positives = 170/406 (41%), Gaps = 51/406 (12%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
L H ++SL+ GTPPQ T + DTGS L W C + +F
Sbjct: 57 LRFHHNVSLTVSLAVGTPPQNVT-MVLDTGSELSWLLCAPGGGGGGGGRSAL------SF 109
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GL 214
P+ S + + C + +C P+ + C G S K C ++ L Y G ++ G
Sbjct: 110 RPRASLTFASVPCGSAQCRSRDLPSPPA-CDGAS---KQCRVS-----LSYADGSSSDGA 160
Query: 215 LLSETLRFPSKTVPNFLAGCSILS-DRQPAGIA-----GFGRSSESLPSQLGLKKFSYCL 268
L +E GC + D P G+A G R + S SQ ++FSYC+
Sbjct: 161 LATEVFTVGQGPPLRAAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSYCI 220
Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY-VGLRQIIV 327
R DDA V L+L G D L+YTP Y+ P F Y V L I V
Sbjct: 221 SDR--DDAGV---LLL----GHSDLPFLPLNYTPLYQ-PAMPLPYFDRVAYSVQLLGIRV 270
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
G K + IP S L P G G +VDSG+ FTF+ G + A+ EF RQ + A +
Sbjct: 271 GGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPN 330
Query: 388 ---KSGLRPCFDISGKKS--VYLPELILKFKGGAKMALPPENYFALV------GNEVLCL 436
+ CF + ++ LP + L F GA+M + + V G+ V CL
Sbjct: 331 FAFQEAFDTCFRVPQGRAPPARLPAVTLLFN-GAQMTVAGDRLLYKVPGERRGGDGVWCL 389
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
F + P A ++G N ++E+DL R G A +C
Sbjct: 390 T-FGNADMVPIT----AYVIGHHHQMNVWVEYDLERGRVGLAPIRC 430
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 109/405 (26%), Positives = 157/405 (38%), Gaps = 60/405 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y ++ GTP + DT S L W C RC P P F P+ S+
Sbjct: 139 GDYIAKIAVGTPAVEAL-LALDTASDLTWLQCQPCRRCY--------PQSGPVFDPRHST 189
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-------FTAGL 214
S + P C + R G + TC Y + YG G + G
Sbjct: 190 SYGEMNYDAPDCQAL------GRSGGGDAKRGTC-----IYTVLYGDGDGHGSTSTSVGD 238
Query: 215 LLSETLRFPSKTVPNFLA-GCSI----LSDRQPAGIAGFGRSSESLPSQLGL----KKFS 265
L+ ETL F +L+ GC L AGI G R S+P Q+ FS
Sbjct: 239 LVEETLTFAGGVRQAYLSIGCGHDNKGLFGAPAAGILGLSRGQISIPHQIAFLGYNASFS 298
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
YCL+ F P S + L G G+ D+ P S+TP N FYYV L +
Sbjct: 299 YCLV--DFISGPGSPSSTLTFGAGAVDTSPPA-SFTPTVLN-----QNMPTFYYVRLIGV 350
Query: 326 IVGSKHVKIP----YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
VG V++P + G+GGVI+DSG+T T + P + A F +
Sbjct: 351 SVGG--VRVPGVTERDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQ 408
Query: 382 AADVEKKSGLRPCFDISGKKS----VYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
+ C+ + G+ V +P + + F GG +++L P+NY V +
Sbjct: 409 VSTGGPSGLFDTCYTVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCF 468
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
F G ++G+ Q F + +D+ R GFA C
Sbjct: 469 AFAGT------GDRSVSVIGNILQQGFRVVYDIGGQRVGFAPNSC 507
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 130/483 (26%), Positives = 188/483 (38%), Gaps = 82/483 (16%)
Query: 16 ILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTK 75
ILL + ++A T T PL ++ L H DS IL S S + +T+
Sbjct: 19 ILLSVSVTSTTTTAMTDTKPLRLVTG---LIHQDS----ILSSYQSLDRNNVERRRTRRA 71
Query: 76 PKTKDSNIGSNYSNSLIKTPLSVHSYG-GYSISLSFGTPPQASTPFIFDTGSSLVWFPCT 134
D I+ + G + ++ S G PP I DTGS L+W C
Sbjct: 72 AFITDE----------IQANMVADDRGQAFLVNFSVGRPPVPQLVGI-DTGSDLLWVQCR 120
Query: 135 SRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKT 194
C DC P F P +SS+ + +P C PN SP+ K
Sbjct: 121 P---CADCF-----RQSTPIFDPSKSSTYVDLSYDSPIC-----PN--------SPQKKY 159
Query: 195 CPLACPSYLLQYGLGFTA-GLLLSETLRFPSK-----TVPNFLAGCSILS----DRQPAG 244
L Y Y G T+ G L +E + F + TV + + GC + D Q +G
Sbjct: 160 NHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQSG 219
Query: 245 IAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY 304
I G +S+ S+LG +FSYC+ FD + LVL GD S TPF+
Sbjct: 220 ILGLSAGDQSIVSRLG-SRFSYCI-GDLFDPHYTHNQLVL------GDGVKMEGSSTPFH 271
Query: 305 KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL 364
F FYYV L I VG + I G GGV++DSG+T TF+
Sbjct: 272 --------TFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDG 323
Query: 365 FEAVAKEFIRQMGNYSRAADVEKKSGL-----RPCFDISGKKSVYLPELILKFKGGAKMA 419
F+ ++ E R + + + G R D+ G PEL F GA +
Sbjct: 324 FDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRG-----FPELAFHFAEGADLV 378
Query: 420 LPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
L + F +V CL + N + ++G Q++ + +DL R F +
Sbjct: 379 LDANSLFVQKNQDVFCLAVLESNL------KNIGSVIGIMAQQHYNVAYDLIGKRVYFQR 432
Query: 480 QKC 482
C
Sbjct: 433 TDC 435
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 122/450 (27%), Positives = 181/450 (40%), Gaps = 81/450 (18%)
Query: 49 DSDPLKILHSLAS---SSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYS 105
D + +HS + S+ R R K P + IGS G Y
Sbjct: 89 DQSRVDFIHSKIAGELESVDRLRGSKATKIPAKSGATIGS----------------GNYI 132
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCT--SRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
+S+ GTP + + IFDTGS L W C +RY C + + P F+P +S++
Sbjct: 133 VSVGLGTPKKYLS-LIFDTGSDLTWTQCQPCARY-CYN--------QKDPVFVPSQSTTY 182
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
I C +P CS + + GCS AC Y +QYG F+ G ETL
Sbjct: 183 SNISCSSPDCSQL--ESGTGNQPGCSAAR-----ACI-YGIQYGDQSFSVGYFAKETLTL 234
Query: 223 PSKTV-PNFLAGCSILSDR----QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
S V NFL GC ++R AG+ G G+ S+ Q K FSYCL
Sbjct: 235 TSTDVIENFLFGCG-QNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQVFSYCL------ 287
Query: 275 DAPVSSNLVLDTG--PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
P +S+ TG G L YTP K + FY V + + VG +
Sbjct: 288 --PKTSS---STGYLTFGGGGGGGALKYTPITK-----AHGVANFYGVDIVGMKVGGTQI 337
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
I S G I+DSG+ T + + A+ F + M Y +A ++ S L
Sbjct: 338 PISSSVF-----STSGAIIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAPEL---SILD 389
Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
C+D+S ++ +P++ FKGG ++ L +CL F N +
Sbjct: 390 TCYDLSKYSTIQIPKVGFVFKGGEELDLDGIGIMYGASTSQVCL-AFAGNQDPSTVA--- 445
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + + +D+ + GF C
Sbjct: 446 --IIGNVQQKTLQVVYDVGGGKIGFGYNGC 473
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 111/405 (27%), Positives = 173/405 (42%), Gaps = 56/405 (13%)
Query: 86 NYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP 145
N +N + LS+ S G Y + L G+PP+ T I DTGSSL W C CV
Sbjct: 103 NSANIPLNPGLSIGS-GNYYLKLGLGSPPKYYT-MILDTGSSLSWLQCKP---CVVYCHS 157
Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV-ESRCKGCSPRNKTCPLACPSYLL 204
VDP F P S++ + + C + +CS + + + C + C
Sbjct: 158 QVDP----LFEPSASNTYRPLYCSSSECSLLKAATLNDPLCTA----SGVCVYTASYGDA 209
Query: 205 QYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGL 261
Y +G+ + LL+ T PS+T+P+F GC ++ + AGI G R S+ +QL
Sbjct: 210 SYSMGYLSRDLLTLT---PSQTLPSFTYGCGQDNEGLFGKAAGIVGLARDKLSMLAQLSP 266
Query: 262 K---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
K FSYCL P S T G G +S + + P+ +S Y
Sbjct: 267 KYGYAFSYCL--------PTS------TSSGGGFLSIGKISPSSYKFTPMIRNSQNPSLY 312
Query: 319 YVGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
++ L I V + V + + Y VP I+DSG+ T + ++ A+ + F++ M
Sbjct: 313 FLRLAAITVAGRPVGVAAAGYQVP-------TIIDSGTVVTRLPISIYAALREAFVKIMS 365
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
R S L CF S K PE+ + F+GGA ++L N + CL
Sbjct: 366 R--RYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQGGADLSLRAPNILIEADKGIACLA 423
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ N I+G+ Q Q + + +D++ + GFA C
Sbjct: 424 FASSNQIA---------IIGNHQQQTYNIAYDVSASKIGFAPGGC 459
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 154/386 (39%), Gaps = 58/386 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y +++S GTP A T DTGS L W CT C P + P F P +SSS
Sbjct: 140 YVVTVSLGTPGVAQT-LEVDTGSDLSWVQCT------PCAAPACYSQKDPLFDPAQSSSY 192
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
+ C P C G +C A Y++ YG G T G+ S+TL
Sbjct: 193 AAVPCGGPVCG------------GLGIYASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTL 240
Query: 223 -PSKTVPNFLAGCSILSD--RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDA 276
P+ V F GC G+ G GR SL Q FSYCL +R
Sbjct: 241 SPNDAVRGFFFGCGHAQSGFTGNDGLLGLGREEASLVEQTAGTYGGVFSYCLPTR----- 295
Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
P ++ + GP + PG S T +P ++ +Y V L I VG + + +P
Sbjct: 296 PSTTGYLTLGGPSG--AAPPGFSTTQLLSSPNAAT-----YYVVMLTGISVGGQQLSVPS 348
Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
S GG +VD+G+ T + + A+ F M +Y + L C++
Sbjct: 349 SVFA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPS-APATGILDTCYN 401
Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIIL 456
SG +V LP + L F GGA + L + ++ F A P+ G IL
Sbjct: 402 FSGYGTVTLPNVALTFSGGATVTLGADG-----------ILSFGCLAFAPSGSDGGMAIL 450
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
G+ Q ++F + D GF C
Sbjct: 451 GNVQQRSFEVRID--GTSVGFKPSSC 474
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 124/446 (27%), Positives = 175/446 (39%), Gaps = 61/446 (13%)
Query: 46 HHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYS 105
H S S +LH +ASS R +L + K K +++ N L G Y
Sbjct: 55 HVSASVIDTVLH-MASSDSHRLTYLSSLVAGKPKPTSVPVASGNQL--------HIGNYV 105
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
+ GTPPQ + DT + VW PC+ C C+ N S S+
Sbjct: 106 VRAKLGTPPQLMF-MVLDTSNDAVWLPCSG---CSGCS--NASTSFNTNSSSTYST---- 155
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG--LGFTAGLLLSETLRFP 223
+ C +C+ G C SP+ C S+ YG F+A L+ +TL
Sbjct: 156 VSCSTAQCTQARGLT----CPSSSPQPSVC-----SFNQSYGGDSSFSASLV-QDTLTLA 205
Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
+PNF GC + + P G+ G GR SL SQ L FSYCL S +
Sbjct: 206 PDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFS 265
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S L L P S + YTP +NP S YYV L + VGS V +
Sbjct: 266 GSLKLGLLGQPKS-------IRYTPLLRNPRRPS-----LYYVNLTGVSVGSVQVPVDPV 313
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
YL ++ G I+DSG+ T P++EA+ EF +Q+ + CF
Sbjct: 314 YLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQV----NVSSFSTLGAFDTCF-- 367
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGPAIIL 456
S P++ L + LP EN L CL + A+ ++
Sbjct: 368 SADNENVAPKITLHMT-SLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLN----VI 422
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
+ Q QN + FD+ N R G A + C
Sbjct: 423 ANLQQQNLRILFDVPNSRIGIAPEPC 448
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 117/402 (29%), Positives = 164/402 (40%), Gaps = 73/402 (18%)
Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
PPQ + + DTGS L W C + +P+ + F P RSSS I C +P
Sbjct: 82 PPQ-NISMVIDTGSELSWLRCNR----------SSNPNPVNNFDPTRSSSYSPIPCSSPT 130
Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPS-YLLQYGLGF-----TAGLLLSETLRFPSKT 226
C R+ P +C S L L + + G L +E F + T
Sbjct: 131 CR-------------TRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNST 177
Query: 227 VP-NFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
N + GC D + G+ G R S S SQ+G KFSYC+ DD P
Sbjct: 178 NDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCI--SGTDDFP- 234
Query: 279 SSNLVLDTGPGSGDSK----TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
L+L GDS TP L+YTP + Y V L I V K + I
Sbjct: 235 -GFLLL------GDSNFTWLTP-LNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPI 286
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG---NYSRAADVEKKSGL 391
P S LVP G G +VDSG+ FTF+ GP++ A+ F+ + D + +
Sbjct: 287 PKSVLVPDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTM 346
Query: 392 RPCFDISGKKSV-----YLPELILKFKGGAKMALPPENYF-----ALVGNE-VLCLILFT 440
C+ IS + LP + L F+ GA++A+ + VGN+ V C
Sbjct: 347 DLCYRISPVRIRSGILHRLPTVSLVFE-GAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGN 405
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ G A ++G QN ++EFDL R G A +C
Sbjct: 406 SDLMGME-----AYVIGHHHQQNMWIEFDLQRSRIGLAPVEC 442
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 111/392 (28%), Positives = 164/392 (41%), Gaps = 61/392 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPK 158
G Y + + G+PP+ + + D+GS ++W PCT Y D P F P
Sbjct: 134 GEYFVRIGVGSPPR-NQYVVMDSGSDIIWVQCEPCTQCYHQSD-----------PVFNPA 181
Query: 159 RSSSSQLIGCQNPKCSWIFGPNV-ESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLL 216
SSS + C + CS + E RC+ Y + YG G +T G L
Sbjct: 182 DSSSFSGVSCASTVCSHVDNAACHEGRCR---------------YEVSYGDGSYTKGTLA 226
Query: 217 SETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSE---SLPSQLGLKK---FSYCLLS 270
ET+ F + N GC + G AG S QLG + FSYCL+S
Sbjct: 227 LETITFGRTLIRNVAIGCGHHNQGMFVGAAGLLGLGGGPMSFVGQLGGQTGGAFSYCLVS 286
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
R + S+ +L+ G ++ G ++ P NP S FYY+GL + VG
Sbjct: 287 RG-----IESSGLLEFGR---EAMPVGAAWVPLIHNPRAQS-----FYYIGLSGLGVGGL 333
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
V I G+GGV++D+G+ T + +EA FI Q N RA+ V S
Sbjct: 334 RVSISEDVFKLSELGDGGVVMDTGTAVTRLPTVAYEAFRDGFIAQTTNLPRASGV---SI 390
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
C+D+ G SV +P + F GG + LP N+ V + F +++G +
Sbjct: 391 FDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGTFCFAFAPSSSGLS--- 447
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + + D AN GF C
Sbjct: 448 ----IIGNIQQEGIQISVDGANGFVGFGPNVC 475
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 108/405 (26%), Positives = 171/405 (42%), Gaps = 60/405 (14%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
+ P+ + S G Y + + GTPPQ + + TG LVW CT C + + P DP++
Sbjct: 45 VAVPIYLSSQGLYVANFTIGTPPQPVSAVVDLTGE-LVWTQCTPCQPCFEQDLPLFDPTK 103
Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT 211
SS+ + + C + C I P C + C P+ G T
Sbjct: 104 --------SSTFRGLPCGSHLCESI--PESSRNCT-----SDVCIYEAPTKA-----GDT 143
Query: 212 AGLLLSETLRF-PSKTVPNFLAGCSILSDRQ------PAGIAGFGRSSESLPSQLGLKKF 264
G+ ++T +K F GC +++D++ P+GI G GR+ SL +Q+ + F
Sbjct: 144 GGMAGTDTFAIGAAKETLGF--GCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAF 201
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLR 323
SYCL + S+ L G + S TPF K GSS YY
Sbjct: 202 SYCLAGK--------SSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYY---- 249
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
+V +K + L S V++D+ S +++ ++A+ K +G A+
Sbjct: 250 --MVKLAGIKAGGAPLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPVAS 307
Query: 384 DVEKKSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
+ +D+ K+V PEL+ F GGA + +PP NY GN +CL + +
Sbjct: 308 PPKP-------YDLCFSKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGTVCLTIGSS 360
Query: 442 ---NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N G G A ILG Q +N ++ FDL + F C+
Sbjct: 361 ASLNLTGELEG---ASILGSLQQENVHVLFDLKEETLSFKPADCS 402
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 114/399 (28%), Positives = 158/399 (39%), Gaps = 65/399 (16%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y I++ G+PP S + DTGS + W C ++ C P VDP F P SS+
Sbjct: 140 YVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQ--QCR-PQVDP----LFDPSLSSTY 192
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF--TAGLLLSETLR 221
C + C+ +F E GCS + Y+ YG G T G S+TL
Sbjct: 193 SPFSCSSAACAQLF---QEGNANGCSSSGQC------QYIAMYGDGSVGTTGTYSSDTLA 243
Query: 222 FPSKT----VPNFLAGCSILSDRQPAGIAGFGRSS-------ESLPSQ----LGLKKFSY 266
S + V F GCS GI G +SL SQ G FSY
Sbjct: 244 LGSNSNTVVVSKFRFGCS----HAETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTAFSY 299
Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
CL P SS L G G S F K P+ SS FY V L I
Sbjct: 300 CL-----PPTPSSSGF-LTLGAA-------GTSSAGFVKTPMLRSSQVPAFYGVRLEAIR 346
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
VG + + IP + + G+I+DSG+ T + + +++ F M Y A
Sbjct: 347 VGGRQLSIPTTVF------SAGMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSA 400
Query: 387 KKSGLRPCFDISGKKSVYLPELILKFK--GGAKMALPPENY-FALVGNEVLCLILFTDNA 443
L CFD+SG+ SV +P + L F GGA + L + + + CL +
Sbjct: 401 GGGFLDTCFDMSGQSSVSMPTVALVFSGAGGAVVNLDASGILLQMETSSIFCLAFVATS- 459
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G I+G+ Q + F + +D+A GF C
Sbjct: 460 -----DDGSTGIIGNVQQRTFQVLYDVAGGAVGFKAGAC 493
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 120/453 (26%), Positives = 181/453 (39%), Gaps = 69/453 (15%)
Query: 48 SDSDPLKILHSL--ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYG-GY 104
+D+ PL+++ L S LS + L + + + + I+ + G +
Sbjct: 2 TDTKPLRLVTGLIHQDSILSSYQSLDRNNVERRRTRR--AAFITDEIQANMVADDRGQAF 59
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
++ S G PP I DTGS L+W C C DC + P F P +SS+
Sbjct: 60 LVNFSVGRPPVPQLVGI-DTGSDLLWVQCRP---CADCFRQST-----PIFDPSKSSTYV 110
Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFP 223
+ +P C PN SP+ K L Y Y G T+ G L +E + F
Sbjct: 111 DLSYDSPIC-----PN--------SPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFE 157
Query: 224 SK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
+ TV + + GC + D Q +GI G +S+ S+LG +FSYC+ FD
Sbjct: 158 TSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRLG-SRFSYCI-GDLFD 215
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
+ LVL GD S TPF+ F FYYV L I VG + I
Sbjct: 216 PHYTHNQLVL------GDGVKMEGSSTPFH--------TFNGFYYVTLEGISVGETRLDI 261
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL--- 391
G GGV++DSG+T TF+ F+ ++ E R + + + G
Sbjct: 262 NPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCY 321
Query: 392 --RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
R D+ G PEL F GA + L + F +V CL + N
Sbjct: 322 KGRVNEDLRG-----FPELAFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNL------ 370
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ ++G Q++ + +DL R F + C
Sbjct: 371 KNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 119/402 (29%), Positives = 168/402 (41%), Gaps = 73/402 (18%)
Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
PPQ + + DTGS L W C + +P+ + F P RSSS I C +P
Sbjct: 82 PPQ-NISMVIDTGSELSWLRCNR----------SSNPNPVNNFDPTRSSSYSPIPCSSPT 130
Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPS-YLLQYGLGF-----TAGLLLSETLRFPSKT 226
C R+ P +C S L L + + G L +E F + T
Sbjct: 131 CR-------------TRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNST 177
Query: 227 VP-NFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
N + GC D + G+ G R S S SQ+G KFSYC+ DD P
Sbjct: 178 NDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGT--DDFP- 234
Query: 279 SSNLVLDTGPGSGDSK----TPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
L+L GDS TP L+YTP + Y V L I V K + I
Sbjct: 235 -GFLLL------GDSNFTWLTP-LNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPI 286
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVE--KKSGL 391
P S L+P G G +VDSG+ FTF+ GP++ A+ +F+ Q G + D E + +
Sbjct: 287 PKSVLLPDHTGAGQTMVDSGTQFTFLLGPVYTALRSDFLNQTNGILTVYEDPEFVFQGTM 346
Query: 392 RPCFDISGKKSV-----YLPELILKFKGGAKMALPPENYFALV-----GNE-VLCLILFT 440
C+ IS + LP + L F+ GA++A+ + V GN+ V C
Sbjct: 347 DLCYRISPFRIRTGILHRLPTVSLVFE-GAEIAVSGQPLLYRVPHLTAGNDSVYCFTFGN 405
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ G A ++G QN ++EFDL R G A +C
Sbjct: 406 SDLMGME-----AYVIGHHHQQNMWIEFDLQRSRIGLAPVQC 442
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 129/454 (28%), Positives = 191/454 (42%), Gaps = 78/454 (17%)
Query: 57 HSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQA 116
H++ S L RAR + + + SN ++S P V G Y + GTPP
Sbjct: 33 HTVELSQL-RARD-ALRHRRMLQSSNGVVDFSVQGTFDPFQV---GLYYTKVQLGTPPVE 87
Query: 117 STPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
I DTGS ++W C S C C + ++ F P SS+S +I C + +C+
Sbjct: 88 FNVQI-DTGSDVLWVSCNS---CSGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCN-- 141
Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLR----FPSKTVPNFL 231
++S CS +N C SY QYG G T+G +S+ + F N
Sbjct: 142 --NGIQSSDATCSSQNNQC-----SYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNST 194
Query: 232 A----GCS-------ILSDRQPAGIAGFGRSSESLPSQL---GL--KKFSYCLLSRKFDD 275
A GCS SDR GI GFG+ S+ SQL G+ + FS+CL D
Sbjct: 195 APVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKG----D 250
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
+ LVL G+ P + YT P Y + L+ I V + ++I
Sbjct: 251 SSGGGILVL------GEIVEPNIVYTSLVPAQP---------HYNLNLQSIAVNGQTLQI 295
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
S V + + G IVDSG+T + L E F+ + + S C
Sbjct: 296 DSS--VFATSNSRGTIVDSGTTLAY----LAEEAYDPFVSAITASIPQSVHTVVSRGNQC 349
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEVLCLILFTDNAAGPALGR 450
+ I+ + P++ L F GGA M L P++Y ++ G V C+ G +
Sbjct: 350 YLITSSVTEVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCI--------GFQKIQ 401
Query: 451 GPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G I ILGD L++ + +DLA R G+A C+
Sbjct: 402 GQGITILGDLVLKDKIVVYDLAGQRIGWANYDCS 435
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 115/388 (29%), Positives = 168/388 (43%), Gaps = 63/388 (16%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y I++ G+P + T + DTGS + W C C C+ DP F P SS+
Sbjct: 198 YLITVGLGSPATSQT-MLIDTGSDVSWVQCKP---CSQCH-SQADP----LFDPSSSSTY 248
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
C + C+ + GCS ++ Y++ YG G T G S+TL
Sbjct: 249 SPFSCGSADCA-----QLGQEGNGCSSSSQC------QYIVTYGDGSSTTGTYSSDTLAL 297
Query: 223 PSKTVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQ----LGLKKFSYCLLSRKFDD 275
S V +F GCS + + Q G+ G G ++SL SQ LG + FSYCL
Sbjct: 298 GSSAVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLG-RAFSYCL-----PP 351
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
P SS + G + F K P+ SS FY V L+ I VG + + IP
Sbjct: 352 TPSSSGFLTLG-------AAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIP 404
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG-LRPC 394
S + G ++DSG+ T + + A++ F M Y A + SG L C
Sbjct: 405 ASVF------SAGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPA----QPSGILDTC 454
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
FD SG+ SV +P + L F GGA ++L ++ N CL F N+ +LG
Sbjct: 455 FDFSGQSSVSIPSVALVFSGGAVVSLDASGI--ILSN---CLA-FAGNSDDSSLG----- 503
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + F + +D+ GF C
Sbjct: 504 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 531
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 125/446 (28%), Positives = 188/446 (42%), Gaps = 59/446 (13%)
Query: 49 DSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISL 108
D + ++ LHS ++ S R+ T K + S + S + +K+ LS+ S G Y + +
Sbjct: 64 DEERVRFLHSRLTNKES-VRNSATTDKLRGGPSLV----STTPLKSGLSIGS-GNYYVKI 117
Query: 109 SFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGC 168
GTP + + I DTGSSL W C CV VDP F P S + + + C
Sbjct: 118 GLGTPAKYFS-MIVDTGSSLSWLQCQP---CVIYCHVQVDP----IFTPSTSKTYKALPC 169
Query: 169 QNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTV 227
+ +CS + + + GCS C Y YG F+ G L + L
Sbjct: 170 SSSQCSSLKSSTLNA--PGCSNATGAC-----VYKASYGDTSFSIGYLSQDVLTLTPSEA 222
Query: 228 PN--FLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAP 277
P+ F+ GC D Q +GI G S+ QL K FSYCL S
Sbjct: 223 PSSGFVYGCG--QDNQGLFGRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNS 280
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI-PY 336
S + L G S L+ +P+ P+ + Y++ L I V K + +
Sbjct: 281 SSLSGFLSIGASS-------LTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSAS 333
Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
SY VP I+DSG+ T + ++ A+ K F+ M + A S L CF
Sbjct: 334 SYNVP-------TIIDSGTVITRLPVAVYNALKKSFVLIMSK--KYAQAPGFSILDTCFK 384
Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIIL 456
S K+ +PE+ + F+GGA + L N + CL + A P I+
Sbjct: 385 GSVKEMSTVPEIQIIFRGGAGLELKAHNSLVEIEKGTTCLAI--------AASSNPISII 436
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
G++Q Q F + +D+AN + GFA C
Sbjct: 437 GNYQQQTFKVAYDVANFKIGFAPGGC 462
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 115/388 (29%), Positives = 168/388 (43%), Gaps = 63/388 (16%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y I++ G+P + T + DTGS + W C C C+ DP F P SS+
Sbjct: 128 YLITVGLGSPATSQT-MLIDTGSDVSWVQCKP---CSQCH-SQADP----LFDPSSSSTY 178
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
C + C+ + GCS ++ Y++ YG G T G S+TL
Sbjct: 179 SPFSCGSAACAQL-----GQEGNGCSSSSQC------QYIVTYGDGSSTTGTYSSDTLAL 227
Query: 223 PSKTVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQ----LGLKKFSYCLLSRKFDD 275
S V +F GCS + + Q G+ G G ++SL SQ LG + FSYCL
Sbjct: 228 GSSAVKSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLG-RAFSYCL-----PP 281
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
P SS + G + F K P+ SS FY V L+ I VG + + IP
Sbjct: 282 TPSSSGFLTLG-------AAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIP 334
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG-LRPC 394
S + G ++DSG+ T + + A++ F M Y A + SG L C
Sbjct: 335 ASVF------SAGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPA----QPSGILDTC 384
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
FD SG+ SV +P + L F GGA ++L ++ N CL F N+ +LG
Sbjct: 385 FDFSGQSSVSIPSVALVFSGGAVVSLDASGI--ILSN---CLA-FAANSDDSSLG----- 433
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + F + +D+ GF C
Sbjct: 434 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 120/453 (26%), Positives = 181/453 (39%), Gaps = 69/453 (15%)
Query: 48 SDSDPLKILHSL--ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYG-GY 104
+D+ PL+++ L S LS + L + + + + I+ + G +
Sbjct: 2 TDTKPLRLVTGLIHQDSILSSYQSLDRNNVERRRTRR--AAFIXDEIQANMVADDRGQAF 59
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
++ S G PP I DTGS L+W C C DC + P F P +SS+
Sbjct: 60 LVNFSVGRPPVPQLVGI-DTGSDLLWVQCRP---CADCFRQST-----PIFDPSKSSTYV 110
Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFP 223
+ +P C PN SP+ K L Y Y G T+ G L +E + F
Sbjct: 111 DLSYDSPIC-----PN--------SPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFE 157
Query: 224 SK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
+ TV + + GC + D Q +GI G +S+ S+LG +FSYC+ FD
Sbjct: 158 TSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRLG-SRFSYCI-GDLFD 215
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
+ LVL GD S TPF+ F FYYV L I VG + I
Sbjct: 216 PHYTHNQLVL------GDGVKMEGSSTPFH--------TFNGFYYVTLEGISVGETRLDI 261
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL--- 391
G GGV++DSG+T TF+ F+ ++ E R + + + G
Sbjct: 262 NPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCY 321
Query: 392 --RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
R D+ G PEL F GA + L + F +V CL + N
Sbjct: 322 KGRVNEDLRG-----FPELAFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNL------ 370
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ ++G Q++ + +DL R F + C
Sbjct: 371 KNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 104/342 (30%), Positives = 147/342 (42%), Gaps = 49/342 (14%)
Query: 153 PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA 212
P+F P RSS+ + + C P+CS P+ C G +C ++ L Y
Sbjct: 145 PSFDPTRSSTYRPVRCGAPQCSQAPAPS----CPGG--LGSSC-----AFNLSYAASTFQ 193
Query: 213 GLLLSETLRFPSKT--VPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGL---KKF 264
LL + L V + GC + P G+ GFGR S PSQ F
Sbjct: 194 ALLGQDALALHDDVDAVAAYTFGCLHVVTGGSVPPQGLVGFGRGPLSFPSQTKDVYGSVF 253
Query: 265 SYCLLSRKFDDAPVSSNL--VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
SYCL S K SSN L GP +G K + TP NP S YYV +
Sbjct: 254 SYCLPSYK------SSNFSGTLRLGP-AGQPKR--IKTTPLLSNPHRPS-----LYYVNM 299
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
I VG + V +P S L G IVD+G+ FT + P++ AV F ++ RA
Sbjct: 300 VGIRVGGRPVPVPASALAFDPTSGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRV----RA 355
Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTD 441
G C+++ ++ +P + F G + LP EN + + CL +
Sbjct: 356 PVAGPLGGFDTCYNV----TISVPTVTFSFDGRVSVTLPEENVVIRSSSGGIACLAM--- 408
Query: 442 NAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
AAGP G A+ +L Q QN + FD+AN R GF+++ C
Sbjct: 409 -AAGPPDGVDAALNVLASMQQQNHRVLFDVANGRVGFSRELC 449
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 124/438 (28%), Positives = 178/438 (40%), Gaps = 77/438 (17%)
Query: 47 HSDSDPLKILHSLASSSL---SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG 103
+ D + +K ++S S +L S L + T P S IGS G
Sbjct: 101 NQDKERVKYINSRISKNLGQDSSVSELDSVTLPAKSGSLIGS----------------GN 144
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + + GTP + IFDTGS L W C C + D F P +S+S
Sbjct: 145 YFVVVGLGTPKR-DLSLIFDTGSDLTWTQCEP---CARSCYKQQDA----IFDPSKSTSY 196
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
I C + C+ + GCS K C Y +QYG F+ G E L
Sbjct: 197 SNITCTSTLCTQL--STATGNEPGCSASTKACI-----YGIQYGDSSFSVGYFSRERLSV 249
Query: 223 -PSKTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKF 273
+ V NFL GC + Q AG+ G GR S Q K FSYCL
Sbjct: 250 TATDIVDNFLFGCG--QNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKIFSYCL----- 302
Query: 274 DDAPVSSNLVLDTGPGS-GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
P +S+ TG S G + T + YTPF GSS FY + + I VG
Sbjct: 303 ---PATSS---STGRLSFGTTTTSYVKYTPFSTISRGSS-----FYGLDITGISVGG--A 349
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
K+P S + GG I+DSG+ T + + A+ F + M Y A ++ S L
Sbjct: 350 KLPVS---SSTFSTGGAIIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGEL---SILD 403
Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
C+D+SG + +P++ F GG + LPP+ + + +CL F N +
Sbjct: 404 TCYDLSGYEVFSIPKIDFSFAGGVTVQLPPQGILYVASAKQVCL-AFAANGDDSDV---- 458
Query: 453 AIILGDFQLQNFYLEFDL 470
I G+ Q + + +D+
Sbjct: 459 -TIYGNVQQKTIEVVYDV 475
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 112/411 (27%), Positives = 171/411 (41%), Gaps = 71/411 (17%)
Query: 99 HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
+S G Y + GTPP+ I DTGS ++W C + C +C + + F
Sbjct: 73 NSVGLYYTKVKMGTPPKEFNVQI-DTGSDILWVNCNT---CSNCPQSSQLGIELNFFDTV 128
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLS 217
SS++ LI C +P C+ V+ CSPR C SY QYG G T+G +S
Sbjct: 129 GSSTAALIPCSDPICT----SRVQGAAAECSPRVNQC-----SYTFQYGDGSGTSGYYVS 179
Query: 218 ETLRF--------PSKTVPNFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQL--- 259
+ + F + + GCSI +D+ GI GFG S+ SQL
Sbjct: 180 DAMYFSLIMGQPPAVNSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSR 239
Query: 260 GL--KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF 317
G+ K FS+CL + +L+ P + Y+P +
Sbjct: 240 GITPKVFSHCLKGDGDGGGVLVLGEILE----------PSIVYSPLVPSQ--------PH 281
Query: 318 YYVGLRQIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
Y + L+ I V + + I P + + S+ GG IVD G+T ++ ++ + +
Sbjct: 282 YNLNLQSIAVNGQLLPINPAVFSI--SNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAV 339
Query: 377 GNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA----LVGNE 432
+R + S C+ +S P + L F+GGA M L PE Y L G E
Sbjct: 340 SQSAR----QTNSKGNQCYLVSTSIGDIFPSVSLNFEGGASMVLKPEQYLMHNGYLDGAE 395
Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ C I F G A ILGD L++ + +D+A R G+A C+
Sbjct: 396 MWC-IGFQKFQEG-------ASILGDLVLKDKIVVYDIAQQRIGWANYDCS 438
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 114/409 (27%), Positives = 172/409 (42%), Gaps = 73/409 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+P + I DTGS ++W C + C +C + + F SS
Sbjct: 81 GLYFTKVKLGSPAKDFYVQI-DTGSDILWINCIT---CSNCPHSSGLGIELDFFDTAGSS 136
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
++ L+ C +P CS+ V++ GCS + C SY QYG G T G +S+T+
Sbjct: 137 TAALVSCADPICSY----AVQTATSGCSSQANQC-----SYTFQYGDGSGTTGYYVSDTM 187
Query: 221 RFPS-----KTVPN----FLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL--- 261
F + V N + GCS +D+ GI GFG + S+ SQL
Sbjct: 188 YFDTVLLGQSMVANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGV 247
Query: 262 --KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
K FS+CL + LVL G+ P + Y+P + Y
Sbjct: 248 TPKVFSHCLKGGENGGGV----LVL------GEILEPSIVYSPLVPSL--------PHYN 289
Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
+ L+ I V + +P V + N G IVDSG+T ++ + + +
Sbjct: 290 LNLQSIAVNGQ--LLPIDSNVFATTNNQGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQF 347
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
S+ + K + C+ +S P++ L F GGA M L PE+Y G F
Sbjct: 348 SKPI-ISKGN---QCYLVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMHYG--------F 395
Query: 440 TDNAAGPALG-----RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
D+AA +G RG ILGD L++ +DLAN R G+A C+
Sbjct: 396 LDSAAMWCIGFQKVERGFT-ILGDLVLKDKIFVYDLANQRIGWADYNCS 443
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 118/410 (28%), Positives = 173/410 (42%), Gaps = 63/410 (15%)
Query: 87 YSNSLIKTPLS-VHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
+ +SL TP S V+ GG Y ++ S GTPP + + DTGS +VW C +C
Sbjct: 68 FKDSLSNTPESTVYVNGGEYLMTYSVGTPP-FNVYGVVDTGSDIVWLQCKPCEQCY---- 122
Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
P F P +SSS + I C + C + R C+ +N +C +
Sbjct: 123 ----KQTTPIFNPSKSSSYKNIPCSSNLCQSV-------RYTSCNKQN-----SCEYTIN 166
Query: 205 QYGLGFTAGLLLSETLRFPSKT-----VPNFLAGCSI----LSDRQPAGIAGFGRSSESL 255
++ G L ETL S T P + GC + + +GI G G SL
Sbjct: 167 FSDQSYSQGELSVETLTLDSTTGHSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSL 226
Query: 256 PSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
+QL KFSYCLL D S D SGD G+ TPF K +
Sbjct: 227 TTQLKSSIGGKFSYCLLPLLVDSNKTSKLNFGDAAVVSGD----GVVSTPFVKKDPQA-- 280
Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
FYY+ L VG+K +I + L +GN +I+DSG+T T + ++ +
Sbjct: 281 ----FYYLTLEAFSVGNK--RIEFEVLDDSEEGN--IILDSGTTLTLLPSHVYTNLESA- 331
Query: 373 IRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE 432
+ Q+ R D + L C+ I+ + + P + FK GA + L P + FA V +
Sbjct: 332 VAQLVKLDRVDDPNQL--LNLCYSITSDQYDF-PIITAHFK-GADIKLNPISTFAHVADG 387
Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
V+CL FT + GP I G+ N + +DL + F C
Sbjct: 388 VVCLA-FTSSQTGP--------IFGNLAQLNLLVGYDLQQNIVSFKPSDC 428
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 107/396 (27%), Positives = 159/396 (40%), Gaps = 70/396 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
+ +++ GTP Q S IFDTGS L W PC S C P + P F P +S
Sbjct: 144 FVVAVGLGTPAQPSA-LIFDTGSDLSWVQCQPCGSSGHC--------HPQQDPLFDPSKS 194
Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
S+ + C P+C+ + CS N TC YL++YG G T G+L +T
Sbjct: 195 STYAAVHCGEPQCA--------AAGDLCSEDNTTC-----LYLVRYGDGSSTTGVLSRDT 241
Query: 220 LRFP-SKTVPNFLAGCSILSDRQPAGIAGFGR---------SSESLPSQLGLK---KFSY 266
L S+ + F GC + + FGR SLPSQ FSY
Sbjct: 242 LALTSSRALTGFPFGCGTRN------LGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSY 295
Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
CL S + L + P + T YT + P F FY+V L I
Sbjct: 296 CLPSSN----STTGYLTIGATPA---TDTGAAQYTAMLRKP-----QFPSFYFVELVSID 343
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
+G + +P P GG ++DSG+ T++ + + F M Y+ A
Sbjct: 344 IGGYVLPVP-----PAVFTRGGTLLDSGTVLTYLPAQAYALLRDRFRLTMERYTPA---P 395
Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
L C+D +G+ V +P + +F GA L + V CL + G
Sbjct: 396 PNDVLDACYDFAGESEVVVPAVSFRFGDGAVFELDFFGVMIFLDENVGCLAFAAMDTGG- 454
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
P I+G+ Q ++ + +D+A ++ GF C
Sbjct: 455 ----LPLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 486
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 115/388 (29%), Positives = 168/388 (43%), Gaps = 63/388 (16%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y I++ G+P + T + DTGS + W C C C+ DP F P SS+
Sbjct: 128 YLITVGLGSPATSQT-MLIDTGSDVSWVQCKP---CSQCH-SQADP----LFDPSSSSTY 178
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
C + C+ + GCS ++ Y++ YG G T G S+TL
Sbjct: 179 SPFSCGSADCAQL-----GQEGNGCSSSSQC------QYIVTYGDGSSTTGTYSSDTLAL 227
Query: 223 PSKTVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQ----LGLKKFSYCLLSRKFDD 275
S V +F GCS + + Q G+ G G ++SL SQ LG + FSYCL
Sbjct: 228 GSSAVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLG-RAFSYCL-----PP 281
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
P SS + G + F K P+ SS FY V L+ I VG + + IP
Sbjct: 282 TPSSSGFLTLG-------AAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIP 334
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG-LRPC 394
S + G ++DSG+ T + + A++ F M Y A + SG L C
Sbjct: 335 ASVF------SAGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPA----QPSGILDTC 384
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
FD SG+ SV +P + L F GGA ++L ++ N CL F N+ +LG
Sbjct: 385 FDFSGQSSVSIPSVALVFSGGAVVSLDASGI--ILSN---CLA-FAGNSDDSSLG----- 433
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + F + +D+ GF C
Sbjct: 434 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 128/498 (25%), Positives = 213/498 (42%), Gaps = 94/498 (18%)
Query: 7 SLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSR 66
SLI ++ LIL F TV V LS +H + P+ I L++S++S
Sbjct: 7 SLIVIYYPLILFFLD---------TVVV----LSATDIPNH-NHRPMIIPLHLSTSNISS 52
Query: 67 ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGS 126
R T + + N S+ N+ ++ + S G Y+ L GTPPQ I DTGS
Sbjct: 53 HRKPFTSNYHRRQLHN--SDLPNAHMRLYDDLLSNGYYTTRLFIGTPPQEFA-LIVDTGS 109
Query: 127 SLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCK 186
++ + PC++ C C + P F P+ SS+ + + C NP C+
Sbjct: 110 TVTYVPCST---CEQCG-----KHQDPRFQPESSSTYKPMQC-NPSCN------------ 148
Query: 187 GCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK---TVPNFLAGCSILS---- 238
C K C +Y +Y + ++GLL + L F ++ T + GC +
Sbjct: 149 -CDDEGKQC-----TYERRYAEMSSSSGLLAEDVLSFGNESELTPQRAIFGCETVETGEL 202
Query: 239 -DRQPAGIAGFGRSSESLPSQLGLKK-----FSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
++ GI G GR S+ QL +K+ FS C V +VL P
Sbjct: 203 FSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDV----VGGAMVLGNIP---- 254
Query: 293 SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI-PYSYLVPGSDGNGGVIV 351
P + + + +P S+ +Y + L+++ V K +K+ P + DG G ++
Sbjct: 255 -PPPDMVFA--HSDPYRSA-----YYNIELKELHVAGKRLKLNPRVF-----DGKHGTVL 301
Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKK----SVYLPE 407
DSG+T+ ++ F A I+++ + + S CF +G+ S PE
Sbjct: 302 DSGTTYAYLPEEAFVAFKDAIIKEI-KFLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPE 360
Query: 408 LILKFKGGAKMALPPENYF--ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFY 465
+ + F G K++L PENY + CL +F + G+ P +LG ++N
Sbjct: 361 VNMVFGNGQKLSLSPENYLFRHTKVSGAYCLGIFQN-------GKDPTTLLGGIVVRNTL 413
Query: 466 LEFDLANDRFGFAKQKCA 483
+ +D ND+ GF K C+
Sbjct: 414 VTYDRDNDKIGFWKTNCS 431
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 119/425 (28%), Positives = 172/425 (40%), Gaps = 58/425 (13%)
Query: 67 ARHLKTKTKPKTKDSNIG-SNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
R L KDS +N++ +I + + S G Y + + G+PP+ + + D+G
Sbjct: 107 VRRLSHGAPAAVKDSRYKVANFATDVI-SGMEAGS-GEYFVRIGVGSPPR-NQYMVIDSG 163
Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
S +VW C RC + DP F P SSS + C + C +
Sbjct: 164 SDIVWVQCKPCSRC----YQQSDP----VFDPADSSSFAGVSCGSDVCDRL--------- 206
Query: 186 KGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQ--- 241
N C Y + YG G +T G L ETL + + GC +
Sbjct: 207 -----ENTGCNAGRCRYEVSYGDGSYTKGTLALETLTVGQVMIRDVAIGCGHTNQGMFIG 261
Query: 242 PAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGL 298
AG+ G G S S QLG + FSYCL+SR S L+ G G+ G
Sbjct: 262 AAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRG-----TGSTGALEFGRGA---LPVGA 313
Query: 299 SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFT 358
++ +NP S FYY+GL I VG V +P G GV++D+G+ T
Sbjct: 314 TWISLIRNPRAPS-----FYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTGTAVT 368
Query: 359 FMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKM 418
+ A F Q N RA V S C+D++G +SV +P + F G +
Sbjct: 369 RFPTAAYVAFRDSFTAQTSNLPRAPGV---SIFDTCYDLNGFESVRVPTVSFYFSDGPVL 425
Query: 419 ALPPENYFALV-GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
LP N+ V G CL F + +G + I+G+ Q + + FD AN GF
Sbjct: 426 TLPARNFLIPVDGGGTFCL-AFAPSPSGLS-------IIGNIQQEGIQISFDGANGFVGF 477
Query: 478 AKQKC 482
C
Sbjct: 478 GPNIC 482
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 114/397 (28%), Positives = 184/397 (46%), Gaps = 70/397 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y ++LS GTPP + DTGS+L+W C C DC + VD P F PK SS
Sbjct: 92 GEYLMNLSLGTPPSPIMA-VADTGSNLIWTQCKP---CDDC-YTQVD----PLFDPKASS 142
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ + + C + +C+ + E++ CS +KTC SYL+ Y G +T G +TL
Sbjct: 143 TYKDVSCSSSQCTAL-----ENQA-SCSTEDKTC-----SYLVSYADGSYTMGKFAVDTL 191
Query: 221 RFPSK-----TVPNFLAGC----SILSDRQPAGIAGFGRSSESLPSQLGLK---KFSYCL 268
S + N + GC ++ + +G+ G G + SL QLG KFSYCL
Sbjct: 192 TLGSTDNRPVQLKNIIIGCGQNNAVTFRNKSSGVVGLGGGAVSLIKQLGDSIDGKFSYCL 251
Query: 269 LSRKFDDAPVS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
+ + ++ +N V+ SG PG TP + FYY+ L+ I
Sbjct: 252 VPENDQTSKINFGTNAVV-----SG----PGTVSTPLV------VKSRDTFYYLTLKSIS 296
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
VGSK+++ P S+ G +++DSG+T T + + + + + + N ++ D
Sbjct: 297 VGSKNMQ------TPDSNIKGNMVIDSGTTLTLLPVKYYIEI-ENAVASLINADKSKDER 349
Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGP 446
S L C++ + + +P + + F+ GA + L P N F V +++CL A G
Sbjct: 350 IGSSL--CYNATA--DLNIPVITMHFE-GADVKLYPYNSFFKVTEDLVCL------AFGM 398
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ R I G+ +NF + +D A+ F CA
Sbjct: 399 SFYRNG--IYGNVAQKNFLVGYDTASKTMSFKPTDCA 433
>gi|56784900|dbj|BAD82194.1| aspartic proteinase nepenthesin I-like [Oryza sativa Japonica
Group]
Length = 260
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 85/268 (31%), Positives = 122/268 (45%), Gaps = 28/268 (10%)
Query: 216 LSETLRF--PSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
++ET F + P GC++ S+ +G+ G GR SL +QL ++ F Y L S
Sbjct: 1 MTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRLSS 60
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
+P+S + D G+GDS TP NPV FYYVGL I VG K
Sbjct: 61 DLSAPSPISFGSLADVTGGNGDS----FMSTPLLTNPVVQDL---PFYYVGLTGISVGGK 113
Query: 331 HVKIPY-SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
V+IP ++ S G GGVI DSG+T T + P + V E + QMG + +
Sbjct: 114 LVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMG-FQKPPPAANDD 172
Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV----GNEVLCLILFTDNAAG 445
L CF G + P ++L F GGA M L ENY + G C + + A
Sbjct: 173 DLI-CF-TGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQA- 229
Query: 446 PALGRGPAIILGDFQLQNFYLEFDLAND 473
I+G+ +F++ FDL+ +
Sbjct: 230 -------LTIIGNIMQMDFHVVFDLSGN 250
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 114/427 (26%), Positives = 169/427 (39%), Gaps = 99/427 (23%)
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCT-SRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
++ ++ GTPPQ T + DTGS L W C SR+ F SSS
Sbjct: 64 TVPVAVGTPPQNVT-MVLDTGSELSWLLCNGSRHDA--------------PFDASASSSY 108
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRF 222
+ C +P C+W+ G ++ P C + L Y +A GLL ++T
Sbjct: 109 APVPCSSPACTWL-GRDL--------PVRPFCDSSACRVSLSYADASSADGLLAADTFLL 159
Query: 223 PSKTVPNFLAGC-------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD 275
S +P GC + S+ P G+ G R S +Q ++F+YC+ + +
Sbjct: 160 GSSPMPALF-GCITSYSSSTDPSETPPTGLLGMNRGGLSFVTQTATRRFAYCIAAGQ--- 215
Query: 276 APVSSNLVLDTGPG-----SGDSKTP-------GLSYTPFYKNPVGSSSAFGEFYYVGLR 323
GPG D++TP L+YTP + Y V L
Sbjct: 216 -----------GPGILLLGGNDTETPLTSPPQQQLNYTPLVEISQPLPYFDRAAYTVQLE 264
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
I VGS + IP L P G G +VDSG+ FTF+ + A+ EF Q+ +R+
Sbjct: 265 GIRVGSALLAIPKHLLTPDHTGAGQTMVDSGTRFTFLLPDAYAALKAEFANQL---TRSL 321
Query: 384 DVEKKSGLRP--------------CFDISGKKSV------YLPELILKFKGGAKMALPPE 423
D GL P CF + + LPE+ L +G + E
Sbjct: 322 D----GGLAPLGEPGFVFQGAFDACFRGTEARVSAAAAGGLLPEVGLVLRGAEVVVAGAE 377
Query: 424 NYFALV-------GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
V G V CL + + AG + A ++G Q+ ++E+DL N R G
Sbjct: 378 KLLYRVPGERRGEGEGVWCLTFGSSDMAGVS-----AYVIGHHHQQDVWVEYDLRNARLG 432
Query: 477 FAKQKCA 483
FA +CA
Sbjct: 433 FAAARCA 439
>gi|343161843|dbj|BAK57511.1| extracellular dermal glycoprotein [Nicotiana benthamiana]
Length = 440
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 115/413 (27%), Positives = 174/413 (42%), Gaps = 78/413 (18%)
Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
+ + T Q TP + D G +W VDC+ V S PA R
Sbjct: 46 TFQYLTQIQQRTPLVPVSLTLDLGGQFLW---------VDCDQGYVSSSYKPA----RCR 92
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
S+Q + C F P GC+ N TC L + + Q T+G L S+T++
Sbjct: 93 SAQCSLARAGGCGQCFSPPKP----GCN--NDTCGLIPDNTVTQTA---TSGELASDTVQ 143
Query: 222 FPSKTVPN-----------FLAGCSILSDRQPAGI---AGFGRSSESLPSQLGL-----K 262
S N F+ G + L R +G+ AG GR+ SLPSQ +
Sbjct: 144 VQSSNGKNPGRNVVDKDFLFVCGSTFLLKRLASGVKGMAGLGRTRISLPSQFSAEFSFPR 203
Query: 263 KFSYCLLSRK-------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF- 314
KF+ CL S F D P S L + D SYTP + NPV ++SAF
Sbjct: 204 KFAVCLSSSTKSKGVVLFGDGPYS---FLPNREFANDD----FSYTPLFINPVSTASAFS 256
Query: 315 -GE---FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAK 370
GE Y++G++ I + K V I + L + G GG + + + +T +E ++ AV
Sbjct: 257 SGEPSSEYFIGVKSIKINQKVVSINTTLLSIDNQGVGGTKISTVNPYTILETSIYNAVTN 316
Query: 371 EFIRQMGNYSRAADVEKKSGLRPCFD----ISGKKSVYLPELILKFKG-GAKMALPPENY 425
F++++ N +R A V CFD +S + +P + L + + N
Sbjct: 317 FFVKELVNITRVASVAP---FGACFDSRNIVSTRVGPTVPPIDLVLQNENVFWTIFGANS 373
Query: 426 FALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
V VLCL F D P +I++G + +++ L+FDLA+ R GF
Sbjct: 374 MVQVSENVLCL-GFVDGGVNPR----TSIVIGGYTIEDNLLQFDLASSRLGFT 421
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 109/378 (28%), Positives = 157/378 (41%), Gaps = 57/378 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + GTPPQ + DT + W PCT+ C F P++S++
Sbjct: 93 YIVRAKIGTPPQ-TLLLAMDTSNDAAWIPCTACDGCAST-----------LFAPEKSTTF 140
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ + C P+C + P G S RN + L YG A L+ +T+
Sbjct: 141 KNVSCAAPECKQVPNPGC-----GVSSRN---------FNLTYGSSSIAANLVQDTITLA 186
Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
+ VP++ GC + + P G+ G GR SL SQ L FSYCL S F
Sbjct: 187 TDPVPSYTFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLN 244
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S +L L GP + + + YTP KNP SS YYV L I VG K V IP +
Sbjct: 245 FSGSLRL--GPVAQPKR---IKYTPLLKNPRRSS-----LYYVNLEAIRVGRKVVDIPPA 294
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
L G I DSG+ FT + P++ AV EF R++G V G C+++
Sbjct: 295 ALAFNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEFRRRVG---PKLTVTSLGGFDTCYNV 351
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
+ +P + F G + LP +N CL + A P ++
Sbjct: 352 ----PIVVPTITFIFT-GMNVTLPQDNILIHSTAGSTTCLAM----AGAPDNVNSVLNVI 402
Query: 457 GDFQLQNFYLEFDLANDR 474
+ Q QN + +D+ N R
Sbjct: 403 ANMQQQNHRVLYDVPNSR 420
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 111/399 (27%), Positives = 158/399 (39%), Gaps = 64/399 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y ++ GTP + + I DTGS L W C+ C N FIP S+
Sbjct: 1 GEYLATVRLGTPERVFS-VIVDTGSDLTWVQCSPCGTCYSQN--------DSLFIPNTST 51
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG------FTAGLL 215
S + C C+ + P C Y YG G F +
Sbjct: 52 SFTKLACGTELCNGLPYP--------------MCNQTTCVYWYSYGDGSLSTGDFVYDTI 97
Query: 216 LSETLRFPSKTVPNFLAGCSILSDRQPAG---IAGFGRSSESLPSQLGL---KKFSYCLL 269
+ + + VPNF GC ++ AG I G G+ S PSQL KFSYCL+
Sbjct: 98 TMDGINGQKQQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLV 157
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
+ P ++ +L GD+ P G+ Y NP +YYV L I
Sbjct: 158 --DWLAPPTQTSPLL-----FGDAAVPTFPGVKYISLLTNP-----KVPTYYYVKLNGIS 205
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
VG K + I + S G G I DSG+T T + G + + V +Y R +D
Sbjct: 206 VGGKLLNISSTAFDIDSVGRAGTIFDSGTTVTQLAGEVHQEVLAAMNASTMDYPRKSD-- 263
Query: 387 KKSGLRPCFDISGKKSV-YLPELILKFKGGAKMALPPENYFA-LVGNEVLCLILFTDNAA 444
SGL C + + +P + F+GG M LPP NYF L ++ C + +
Sbjct: 264 DSSGLDLCLGGFAEGQLPTVPSMTFHFEGG-DMELPPSNYFIFLESSQSYCFSM----VS 318
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
P + I+G Q QNF + +D + GF + C
Sbjct: 319 SPDV-----TIIGSIQQQNFQVYYDTVGRKIGFVPKSCV 352
>gi|56542455|gb|AAV92892.1| Avr9/Cf-9 rapidly elicited protein 36, partial [Nicotiana tabacum]
Length = 191
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 58/177 (32%), Positives = 96/177 (54%), Gaps = 13/177 (7%)
Query: 309 GSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
G + FYYV ++ +IVG + + IP ++G GG I+DSG+T ++ P +E +
Sbjct: 24 GKENHLETFYYVQIKSVIVGGEVLNIPEETWNLSTEGVGGTIIDSGTTLSYFAEPAYEII 83
Query: 369 AKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF-A 427
+ F+ ++ Y D L+PC+++SG + + LP + F GA P ENYF
Sbjct: 84 KQAFVNKVKRYPILDDFPI---LKPCYNVSGVEKLELPSFGIVFGDGAIWTFPVENYFIK 140
Query: 428 LVGNEVLCL-ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
L +++CL IL T ++A I+G++Q QNF++ +D R GFA ++CA
Sbjct: 141 LEPEDIVCLAILGTPHSAMS--------IIGNYQQQNFHILYDTKRSRLGFAPRRCA 189
>gi|2245012|emb|CAB10432.1| hypothetical protein [Arabidopsis thaliana]
gi|7268406|emb|CAB78698.1| hypothetical protein [Arabidopsis thaliana]
Length = 1046
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 118/411 (28%), Positives = 169/411 (41%), Gaps = 86/411 (20%)
Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS----------------SSQL 165
DTGS LVWFPC + C+ C + PS + ++ SS L
Sbjct: 130 LDTGSDLVWFPCRP-FTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSAAHSSLPSSDL 188
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSK 225
N +I C+ T CP + YG G L S++L PS
Sbjct: 189 CAISNCPLDFI-------ETGDCN----TSSYPCPPFYYAYGDGSLVAKLYSDSLSLPSV 237
Query: 226 TVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGL------KKFSYCLLSRKFDDAPVS 279
+V NF GC+ + +P G+AGFGR SLP+QL + FSYCL+S FD V
Sbjct: 238 SVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDRVR 297
Query: 280 --SNLVLDTGPGSGDSKT----------------PGLSYTPFYKNPVGSSSAFGEFYYVG 321
S L+L + + +T +NP FY V
Sbjct: 298 RPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENP-----KHPYFYSVS 352
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YS 380
L+ I +G +++ P +G GGV+VDSG+TFT + + +V +EF ++G +
Sbjct: 353 LQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHE 412
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGG-AKMALPPENYFALVGN-------- 431
RA VE S L+L F G + + LP NYF +
Sbjct: 413 RADRVEPSSA-----------------LVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEK 455
Query: 432 -EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQK 481
++ CL+L G G ILG++Q Q F + +DL N R GFAK+
Sbjct: 456 RKIGCLMLMNGGDESELRG-GTGAILGNYQQQGFEVVYDLLNRRVGFAKRN 505
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 112/408 (27%), Positives = 169/408 (41%), Gaps = 71/408 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+P + I DTGS ++W C + C +C + + F SS
Sbjct: 81 GLYFTKVKLGSPAKEFYVQI-DTGSDILWINCIT---CSNCPHSSGLGIELDFFDTAGSS 136
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
++ L+ C +P CS+ V++ CS + C SY QYG G T G +S+T+
Sbjct: 137 TAALVSCGDPICSYA----VQTATSECSSQANQC-----SYTFQYGDGSGTTGYYVSDTM 187
Query: 221 RFPS-----KTVPN----FLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL--- 261
F + V N + GCS +D+ GI GFG + S+ SQL
Sbjct: 188 YFDTVLLGQSVVANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGV 247
Query: 262 --KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
K FS+CL + LVL G+ P + Y+P + Y
Sbjct: 248 TPKVFSHCLKGGENGGGV----LVL------GEILEPSIVYSPLVPSQ--------PHYN 289
Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
+ L+ I V + +P V + N G IVDSG+T ++ + K + +
Sbjct: 290 LNLQSIAVNGQ--LLPIDSNVFATTNNQGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQF 347
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
S+ + K + C+ +S P++ L F GGA M L PE+Y G F
Sbjct: 348 SKPI-ISKGN---QCYLVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMHYG--------F 395
Query: 440 TDNAAGPALGRGPA----IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
D AA +G ILGD L++ +DLAN R G+A C+
Sbjct: 396 LDGAAMWCIGFQKVEQGFTILGDLVLKDKIFVYDLANQRIGWADYDCS 443
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 117/431 (27%), Positives = 180/431 (41%), Gaps = 70/431 (16%)
Query: 66 RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSV-HSYGG--YSISLSFGTPPQASTPFIF 122
RA +++ K ++ +N+ S + P S +S G Y I+++ GTP I
Sbjct: 90 RAAYIQAKVS--SRYNNVAKELQQSAVTIPTSSGYSLGTTEYVITVTIGTPAVTQVMSI- 146
Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN-- 180
DTGS + W +C C + + F P S++ C + +C+ +
Sbjct: 147 DTGSDVSWV------QCAPCAAQSCSSQKDKLFDPAMSATYSAFSCGSAQCAQLGDEGNG 200
Query: 181 -VESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRFPS-KTVPNFLAGCSIL 237
++S+C+ Y+++YG G TAG S+TL S V +F GCS
Sbjct: 201 CLKSQCQ---------------YIVKYGDGSNTAGTYGSDTLSLTSSDAVKSFQFGCSHR 245
Query: 238 SDR---QPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
+ + G+ G G +ESL SQ K FSYCL P SS T +G
Sbjct: 246 AAGFVGELDGLMGLGGDTESLVSQTAATYGKAFSYCL------PPPSSSGGGFLTLGAAG 299
Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
+ + S+TP + V + FY V L+ I V + +P S +G +V
Sbjct: 300 GASSSRYSHTPMVRFSVPT------FYGVFLQGITVAGTMLNVPASVF------SGASVV 347
Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
DSG+ T + ++A+ F ++M Y AA V L CFD SG ++ +P + L
Sbjct: 348 DSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGS---LDTCFDFSGFNTITVPTVTLT 404
Query: 412 FKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
F GA M L + G + FT A G ILG+ Q + F + FD+
Sbjct: 405 FSRGAAMDLD------ISGILYAGCLAFTATAH-----DGDTGILGNVQQRTFEMLFDVG 453
Query: 472 NDRFGFAKQKC 482
GF C
Sbjct: 454 GRTIGFRSGAC 464
>gi|449432731|ref|XP_004134152.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
gi|449527081|ref|XP_004170541.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
Length = 429
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 122/409 (29%), Positives = 177/409 (43%), Gaps = 58/409 (14%)
Query: 95 PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA 154
P++ H Y I + TP D G L+W VDC+ V S PA
Sbjct: 35 PVTKHPSLQYIIQIHQRTP-LVPVNLTVDLGGWLMW---------VDCDRGFVSSSYKPA 84
Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG--FTA 212
R S+Q ++ C + P GC+ N TC L+ + ++Q G T+
Sbjct: 85 ----RCRSAQCSLAKSISCGKCYLP----PHPGCN--NYTCSLSARNTIIQLSSGGEVTS 134
Query: 213 GLL-LSETLRFPSK---TVPNFLAGCS---ILSDRQ--PAGIAGFGRSSESLPSQLGLKK 263
L+ +S T F S +VPNFL CS +L G+AGFGR+ SLPSQ
Sbjct: 135 DLVSVSSTNGFNSTRALSVPNFLFICSSTFLLEGLAGGVTGMAGFGRTRISLPSQFA-AA 193
Query: 264 FSYCLLSRKFDDAPVSSNL---VLDTGPGS-----GDSKTPGLSYTPFYKNPVGSSSAFG 315
FS+ SRKF S V+ +G G T L+YTP NPVG +
Sbjct: 194 FSF---SRKFTMCLSGSTGFPGVIFSGYGPYHFLPNIDLTNSLTYTPLLINPVGFAGEKS 250
Query: 316 EFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
Y++G++ I SK V + + L S+GNGG + + + +T +E ++ A+ K F +
Sbjct: 251 SEYFIGVKSIEFNSKTVPLNTTLLKIDSNGNGGTKISTVNPYTVLETSIYRALVKTFTSE 310
Query: 376 MGNYSRAADVEKKSGLRPCFDISGKKSVYL------PELILKFKGGAKMALPPENYFALV 429
+GN R A V C+ S L +LIL+ K + N +V
Sbjct: 311 LGNIPRVAAVAP---FEVCYSSKSFGSTELGPSVPSIDLILQNK-KVIWRMFGANSMVVV 366
Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
EVLCL A+++G Q+++ LEFDLA R GF+
Sbjct: 367 TEEVLCLGFVEGGVEAET-----AMVIGGHQIEDNLLEFDLATSRLGFS 410
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 109/350 (31%), Positives = 150/350 (42%), Gaps = 52/350 (14%)
Query: 93 KTPLSVHSYGG-YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
K P++ GG Y + S G PP + DTGS L+W C+ C CN P PS
Sbjct: 75 KAPVTKSQKGGKYIMQFSIGEPPLLIWAEV-DTGSDLMWVKCSP---CNGCNPP---PS- 126
Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-- 209
P + P RS SS + C + C + R + S + P C Y YG
Sbjct: 127 -PLYDPARSRSSGKLPCSSQLCQAL------GRGRIISDQCSDDPPLC-GYHYAYGHSGD 178
Query: 210 -FTAGLLLSETLRFPSKTVPNFLA-GCSILSDRQP----AGIAGFGRSSESLPSQLGLKK 263
T G+L +ET F V N ++ G S D AG+ G GR SL SQLG +
Sbjct: 179 HSTQGVLGTETFTFGDGYVANNVSFGRSDTIDGSQFGGTAGLVGLGRGHLSLVSQLGAGR 238
Query: 264 FSYCLLSRKFDDAPVSSNLV------LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF 317
F+YCL + D V S ++ LDT G +S TP NP
Sbjct: 239 FAYCLAA----DPNVYSTILFGSLAALDTSAGD-------VSSTPLVTNPKPDRDTH--- 284
Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
YYV L+ I VG + I SDG+GGV DSG+ T ++ ++ V + ++
Sbjct: 285 YYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFDSGAIDTSLKDAAYQVVRQAITSEIQ 344
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSV-YLPELILKFKGGAKMALPPENYF 426
A G CF + +++V +P L+L F GA M+L NY
Sbjct: 345 RLGYDA------GDDTCFVAANQQAVAQMPPLVLHFDDGADMSLNGRNYL 388
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 116/388 (29%), Positives = 168/388 (43%), Gaps = 63/388 (16%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y I++ G+P + T I DTGS + W C C C+ DP F P SS+
Sbjct: 52 YLITVGLGSPATSQTMLI-DTGSDVSWVQCKP---CSQCH-SQADP----LFDPSSSSTY 102
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
C + C+ + GCS ++ Y++ YG G T G S+TL
Sbjct: 103 SPFSCGSADCAQL-----GQEGNGCSSSSQC------QYIVTYGDGSSTTGTYSSDTLAL 151
Query: 223 PSKTVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQ----LGLKKFSYCLLSRKFDD 275
S V +F GCS + + Q G+ G G ++SL SQ LG + FSYCL
Sbjct: 152 GSSAVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLG-RAFSYCL-----PP 205
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
P SS + G + F K P+ SS FY V L+ I VG + + IP
Sbjct: 206 TPSSSGFLTLG-------AAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIP 258
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG-LRPC 394
S + G ++DSG+ T + + A++ F M Y A + SG L C
Sbjct: 259 ASVF------SAGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPA----QPSGILDTC 308
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
FD SG+ SV +P + L F GGA ++L ++ N CL F N+ +LG
Sbjct: 309 FDFSGQSSVSIPSVALVFSGGAVVSLDASGI--ILSN---CLA-FAGNSDDSSLG----- 357
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + F + +D+ GF C
Sbjct: 358 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 385
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 117/445 (26%), Positives = 191/445 (42%), Gaps = 80/445 (17%)
Query: 54 KILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTP 113
+ + L + S +R R + + + S G+ + +++PL GGY + +S GTP
Sbjct: 10 EAIRGLVAKSHARVRWMAARANSSSWSSMAGT----TDVESPLHPDG-GGYVMDISVGTP 64
Query: 114 PQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
+ I DTGS LVW PCT C+ + F P++SS+ + + C +
Sbjct: 65 GKRFRA-IADTGSDLVWVQSEPCTG------CSGGTI-------FDPRQSSTFREMDCSS 110
Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP-----SK 225
C+ + G C P + C SY +YG G T G +T+ S+
Sbjct: 111 QLCTELPGS--------CEPGSSAC-----SYSYEYGSGETEGEFARDTISLGTTSGGSQ 157
Query: 226 TVPNFLAGCSILSD--RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSS 280
P+F GC +++ G+ G G+ SL SQL KFSYCL+ + SS
Sbjct: 158 KFPSFAVGCGMVNSGFDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLV--DINSQSESS 215
Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
L+ GP + T G+ T S + +Y + + I V + + P
Sbjct: 216 PLLF--GPSAALHGT-GIQSTKITP----PSDTYPTYYLLTVNGIAVAGQTMGSP----- 263
Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS-GLRPCFDISG 399
G I+DSG+T T++ ++ V + +M + V+ S GL C+D S
Sbjct: 264 ------GTTIIDSGTTLTYVPSGVYGRV----LSRMESMVTLPRVDGSSMGLDLCYDRSS 313
Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVGN--EVLCLILFTDNAAGPALGRGPAIILG 457
++ P L ++ G A M P NYF +V + + +CL A G A G P I+G
Sbjct: 314 NRNYKFPALTIRLAG-ATMTPPSSNYFLVVDDSGDTVCL------AMGSAGGL-PVSIIG 365
Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
+ Q +++ +D + F + KC
Sbjct: 366 NVMQQGYHILYDRGSSELSFVQAKC 390
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 114/387 (29%), Positives = 155/387 (40%), Gaps = 58/387 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + + GTP Q DT + W PC CV C+ F S++
Sbjct: 90 YIVKANVGTPAQTFL-MALDTSNDAAWIPCNG---CVGCSST--------VFNSVTSTTF 137
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ +GC P+C + P C G TC ++ YG L +T+
Sbjct: 138 KTLGCDAPQCKQVPNPT----CGG-----STC-----TWNTTYGGSTILSNLTRDTIALS 183
Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
+ VP + GC + S P G+ G GR S SQ L FSYCL S F
Sbjct: 184 TDIVPGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPS--FRTLN 241
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S L L GP + + TP KNP SS YYV L I VG K V IP S
Sbjct: 242 FSGTLRL--GPAGQPLR---IKTTPLLKNPRRSS-----LYYVNLIGIRVGRKIVDIPAS 291
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
L G I DSG+ FT + P++ AV EF +++GN A V G C+
Sbjct: 292 ALAFNPTTGAGTIFDSGTVFTRLVAPVYTAVRDEFRKRVGN----AIVSSLGGFDTCY-- 345
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
+ P + F G + LPP+N CL + AA P ++
Sbjct: 346 --TGPIVAPTMTFMFS-GMNVTLPPDNLLIRSTAGSTSCLAM----AAAPDNVNSVLNVI 398
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ Q QN + FD+ N R G A++ C+
Sbjct: 399 ANMQQQNHRILFDVPNSRIGVAREPCS 425
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 115/404 (28%), Positives = 166/404 (41%), Gaps = 73/404 (18%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPK 158
G Y + GTPPQ + DTGS + W PCT+ R + P I F P+
Sbjct: 46 GLYYTRIYLGTPPQQFYVHV-DTGSDVAWVNCVPCTNCKRASNVALP------ISIFDPE 98
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLS 217
+S+S I C + +C + S K CS + +CP Y YG G TAG L++
Sbjct: 99 KSTSKTSISCTDEECY------LASNSK-CSFNSMSCP-----YSTLYGDGSSTAGYLIN 146
Query: 218 ETLRFPSKTVPNFLA---------GC------SILSDRQPAGIAGFGRSSESLPSQLGLK 262
+ L F N A GC + L+D G+ GFG++ SLPSQL +
Sbjct: 147 DVLSFNQVPSGNSTATSGTARLTFGCGSNQTGTWLTD----GLVGFGQAEVSLPSQLSKQ 202
Query: 263 KFSYCLLSRKFD-DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
S + + D S LV+ G + PGL YTP Y V
Sbjct: 203 NVSVNIFAHCLQGDNKGSGTLVI------GHIREPGLVYTPIVPKQ--------SHYNVE 248
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
L I V +V P ++ + S GGVI+DSG+T T++ P ++ + M
Sbjct: 249 LLNIGVSGTNVTTPTAFDLSNS---GGVIMDSGTTLTYLVQPAYDQFQAKVRDCM----- 300
Query: 382 AADVEKKSGLRP-CFDISGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILF 439
+SG+ P F Y P + L F GGA M L P +Y + + L F
Sbjct: 301 ------RSGVLPVAFQFFCTIEGYFPNVTLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCF 354
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ + G I GD L++ + +D N+R G+ C
Sbjct: 355 SWLESTSVYGYLSYTIFGDNVLKDQLVVYDNVNNRIGWKNFDCT 398
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 121/445 (27%), Positives = 174/445 (39%), Gaps = 77/445 (17%)
Query: 53 LKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGT 112
+ + +AS +R R+L + T KT + I S V + G Y + + GT
Sbjct: 53 MNTVIDMASKDPARIRYLSSLTAQKTVAAPIASGQQ---------VLNVGNYVVRVQLGT 103
Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
P Q + + DT + W PC+ C+ C S F + SS+ + C P+
Sbjct: 104 PGQ-TMYMVLDTSNDAAWAPCSG---CIGC-------SSTTTFSAQNSSTFATLDCSKPE 152
Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPS-------YLLQYGLGFT-AGLLLSETLRFPS 224
C+ G L+CP+ + YG T + L+ ++L
Sbjct: 153 CTQARG------------------LSCPTTGNVDCLFNQTYGGDSTFSATLVQDSLHLGP 194
Query: 225 KTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPV 278
+PNF GC + S P G+ G GR SL SQ G FSYCL S F
Sbjct: 195 NVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPS--FKSYYF 252
Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSY 338
S +L L GP G K + TP NP S YYV L I VG V I
Sbjct: 253 SGSLKL--GP-VGQPK--AIRTTPLLHNPHRPS-----LYYVNLTGISVGRVLVPISPEL 302
Query: 339 LVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS 398
L + G I+DSG+ T ++ AV EF +Q+G CF +
Sbjct: 303 LAFDPNTGAGTIIDSGTVITRFVPAIYTAVRDEFRKQVG-----GSFSPLGAFDTCFATN 357
Query: 399 GKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGPAIILG 457
+ S P + L G + LP EN + CL + AA P ++
Sbjct: 358 NEVSA--PAITLHLS-GLDLKLPMENSLIHSSAGSLACLAM----AAAPNNVNSVVNVIA 410
Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
+ Q QN + FD+ N + G A++ C
Sbjct: 411 NLQQQNHRILFDINNSKLGIARELC 435
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 119/404 (29%), Positives = 182/404 (45%), Gaps = 79/404 (19%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +++S GTPP I DTGS L+W C C DC + VD P F PK SS
Sbjct: 92 GEYLMNISLGTPPFPIMA-IADTGSDLLWTQCKP---CDDC-YTQVD----PLFDPKASS 142
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ + + C + +C+ + E++ CS + TC SY YG +T G + +TL
Sbjct: 143 TYKDVSCSSSQCTAL-----ENQA-SCSTEDNTC-----SYSTSYGDRSYTKGNIAVDTL 191
Query: 221 RFPSK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCL 268
S + N + GC + +++ +GI G G + SL +QLG KFSYCL
Sbjct: 192 TLGSTDTRPVQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCL 251
Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
+ ++ S ++ G + S T G+ TP + + FYY+ L+ I VG
Sbjct: 252 VPLTSENDRTSK---INFGTNAVVSGT-GVVSTPLI------AKSQETFYYLTLKSISVG 301
Query: 329 SKHVKIPYSYLVPGSD---GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAAD 384
SK V+ PGSD G G +I+DSG+T T + EF ++ + + + D
Sbjct: 302 SKEVQ------YPGSDSGSGEGNIIIDSGTTLTLL--------PTEFYSELEDAVASSID 347
Query: 385 VEKK----SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
EKK +GL C+ +G V P + + F GA + L P N F + +++C
Sbjct: 348 AEKKQDPQTGLSLCYSATGDLKV--PAITMHFD-GADVNLKPSNCFVQISEDLVCF---- 400
Query: 441 DNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A P+ I G+ NF + +D + F CA
Sbjct: 401 ------AFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 438
>gi|449527083|ref|XP_004170542.1| PREDICTED: LOW QUALITY PROTEIN: basic 7S globulin-like [Cucumis
sativus]
Length = 432
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 121/419 (28%), Positives = 178/419 (42%), Gaps = 73/419 (17%)
Query: 95 PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA 154
P++ H G Y + TP D G +W VDC+ V S
Sbjct: 33 PVTKHPSGQYITQIRQRTP-LVPVKLTVDLGGQFMW---------VDCDRGYVSSS---- 78
Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGL 214
+ P R S+Q ++ C F P GC+ N TC + ++Q T+G
Sbjct: 79 YKPVRCRSAQCSLSKSTSCGDCFSPPXP----GCN--NNTCGHFPGNTIIQLS---TSGE 129
Query: 215 LLSETLRFPSK---------TVPNFLAGC--SILSDRQPAGI---AGFGRSSESLPSQLG 260
+ S+ L S ++PNFL C + L + G+ AGFGR+ SLPSQ
Sbjct: 130 VTSDVLSVSSTNGFNPTRAVSIPNFLFVCGPTFLLEGLAGGVSGMAGFGRTGISLPSQFS 189
Query: 261 L-----KKFSYCLLSRKFDDAPVSSNLVLDTGPG-----SGDSKTPGLSYTPFYKNPVGS 310
+KF+ CL S V+ +G G T L+YTP + NPV +
Sbjct: 190 AAFSFNRKFAVCL------SGSTRSPGVIFSGNGPYHFLQNVDVTKSLTYTPLFINPVST 243
Query: 311 S--SAFGE---FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF 365
+ S GE Y++G++ I+ SK V I + L S+GNGG + + +T +E ++
Sbjct: 244 AGVSTSGEKSSEYFIGVKSIVFNSKTVPINTTLLKIDSNGNGGTKISTVHPYTVLESSIY 303
Query: 366 EAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP------ELILKFKGGAKMA 419
A+ K R++ N R A V C+ S L +LIL+ K
Sbjct: 304 NALVKTITRELRNIPRVAAVAP---FGVCYKSKSFGSTRLGPGMPSIDLILQNK-KVIWR 359
Query: 420 LPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
+ N V EVLCL F D G R AI++G +Q+++ LEFDLA R GF+
Sbjct: 360 IFGANSMVQVNEEVLCL-GFVD---GGVEAR-TAIVIGAYQMEDNLLEFDLATSRLGFS 413
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 110/393 (27%), Positives = 152/393 (38%), Gaps = 70/393 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y IS+ GTP T I DTGS + W +C C P F P +SS+
Sbjct: 127 YVISVGLGTPAVTQTVTI-DTGSDVSWV------QCNPCPNPPCHAQTGALFDPAKSSTY 179
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE---TL 220
+ + C +C+ + E + GC N C Y +QYG G T S TL
Sbjct: 180 RAVSCAAAECAQL-----EQQGNGCGATNYEC-----QYGVQYGDGSTTNGTYSRDTLTL 229
Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFD 274
S V F GCS L Q G+ G G ++SL SQ FSYCL
Sbjct: 230 SGASDAVKGFQFGCSHLESGFSDQTDGLMGLGGGAQSLVSQTAAAYGNSFSYCLP----- 284
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS----SSAFGEFYYVGLRQIIVGSK 330
P SG S L V + S FY L+ I VG K
Sbjct: 285 -------------PTSGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGK 331
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+ + S GS +VDSG+ T + + A++ F M Y A +S
Sbjct: 332 QLGLSPSVFAAGS------VVDSGTIITRLPPTAYSALSSAFKAGMKQYRSA---PARSI 382
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG- 449
L CFD +G+ + +P + L F GGA + L P I++ + A A G
Sbjct: 383 LDTCFDFAGQTQISIPTVALVFSGGAAIDLDPNG------------IMYGNCLAFAATGD 430
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G I+G+ Q + F + +D+ + GF C
Sbjct: 431 DGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 108/396 (27%), Positives = 173/396 (43%), Gaps = 70/396 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
+ +++ FGTP Q T +FDTGS + W +C+ C+ + P F P +S++
Sbjct: 120 FVVTVGFGTPAQTYT-LMFDTGSDVSWI------QCLPCSG-HCYKQHDPIFDPTKSATY 171
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
+ C +P+C+ G +C N TC Y +QYG G TAG+L ETL
Sbjct: 172 SAVPCGHPQCAAAGG-----KCS----SNGTC-----LYKVQYGDGSSTAGVLSHETLSL 217
Query: 223 PS-KTVPNFLAGC--SILSD-RQPAGIAGFGRSSESLPSQLGLKKFS---YCLLSRKFDD 275
S + +P F GC + L D G+ G GR SL SQ + YCL S
Sbjct: 218 TSARALPGFAFGCGETNLGDFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYN--- 274
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
+S+ L G + S + G+ YT + + FY+V L I+VG + +P
Sbjct: 275 ---TSHGYLTIGTTTPASGSDGVRYTAMIQK-----QDYPSFYFVDLVSIVVGGFVLPVP 326
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
P G ++DSG+ T++ + A+ F M Y A + C+
Sbjct: 327 -----PILFTRDGTLLDSGTVLTYLPPEAYTALRDRFKFTMTQYKPAPAYDP---FDTCY 378
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG------ 449
D +G+ ++++P + KF G+ L P +++F D+ A PA G
Sbjct: 379 DFAGQNAIFMPLVSFKFSDGSSFDLSP-----------FGVLIFPDDTA-PATGCLAFVP 426
Query: 450 ---RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
P I+G+ Q +N + +D+A ++ GF C
Sbjct: 427 RPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSGSC 462
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 107/376 (28%), Positives = 154/376 (40%), Gaps = 59/376 (15%)
Query: 117 STPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
S + DT S + W +C+ C P + P + P +SS+ I C +P C +
Sbjct: 168 SQTVVVDTSSDIPWV------QCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKEL 221
Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGC 234
S GCSP C Y++ YG G T G +++TL P+ V +F GC
Sbjct: 222 G----SSYGNGCSPTTDEC-----KYIVNYGDGKATTGTYVTDTLTMSPTIVVKDFRFGC 272
Query: 235 SILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTG 287
S Q AGI G SL Q FSYC+ P S+ + G
Sbjct: 273 SHAVRGSFSNQNAGILALGGGRGSLLEQTADAYGNAFSYCI------PKPSSAGFLSLGG 326
Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
P K SYTP KN + FY V L IIV K + +P + G+
Sbjct: 327 PVEASLK---FSYTPLIKNKHAPT-----FYIVHLEAIIVAGKQLAVPPTAFATGA---- 374
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS-RAADVEKKSGLRPCFDISGKKSVYLP 406
++DSG+ T + ++ A+ F M Y AA V L C+D + V +P
Sbjct: 375 --VMDSGAVVTQLPPQVYAALRAAFRSAMAAYGPLAAPVRN---LDTCYDFTRFPDVKVP 429
Query: 407 ELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYL 466
++ L F GGA + L P + L G CL AA P G +G+ Q Q + +
Sbjct: 430 KVSLVFAGGATLDLEPASII-LDG----CLAF----AATP--GEESVGFIGNVQQQTYEV 478
Query: 467 EFDLANDRFGFAKQKC 482
+D+ + GF + C
Sbjct: 479 LYDVGGGKVGFRRGAC 494
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 129/439 (29%), Positives = 180/439 (41%), Gaps = 63/439 (14%)
Query: 57 HSLASSSLSRARHLKTKTKPKTKDSNIGSN--YSNSLIKTPLSVHSYGGYSISLSFGTPP 114
+SL SSSL A K KT N S+ YS +LI +SL GTPP
Sbjct: 44 NSLFSSSL--ASQFKQNPNTKTTSYNYRSSFKYSMALI-------------VSLPIGTPP 88
Query: 115 QASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIP-AFIPKRSSSSQLIGCQNPKC 173
Q + DTGS L W C V P P AF P SSS ++ C + C
Sbjct: 89 QTQQ-MVLDTGSQLSWIQC------------KVPPKTPPTAFDPLLSSSFSVLPCNHSLC 135
Query: 174 SWIFGPNVESRCKGCS-PRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS-KTVPNFL 231
P V S +N+ C SY G + G L+ E F S +T P +
Sbjct: 136 K----PRVPDYTLPTSCDQNRLCHY---SYFYADGT-YAEGNLVREKFTFSSSQTTPPLI 187
Query: 232 AGCSI-LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDD--APVSSNLVLDTGP 288
GC+ SD Q GI G S S + KFSYC+ R+ +P S L P
Sbjct: 188 LGCATDSSDTQ--GILGMNLGRLSFSSLAKISKFSYCVPPRRSQSGSSPTGS-FYLGPNP 244
Query: 289 GSGDSKTPGL-SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
S K L +Y + P A Y + + I + K + I S G G
Sbjct: 245 SSAGFKYVNLMTYRQSQRMPNLDPLA----YTLPMLGIRINGKKLNISTSAFRADPSGAG 300
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV---Y 404
++DSG+ FTF+ + V +E ++ G + V S L CFD G V
Sbjct: 301 QTLIDSGTWFTFLVDEAYSKVKEEIVKLAGPKLKKGYVYGGS-LDMCFD--GDAMVIGRM 357
Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
+ + +F+ G ++ + E A VG V CL + + G A + I+G+F Q+
Sbjct: 358 IGNMAFEFENGVEIVVEREKMLADVGGGVQCLGIGRSDLLGVA-----SNIIGNFHQQDL 412
Query: 465 YLEFDLANDRFGFAKQKCA 483
++EFDL R GF + C+
Sbjct: 413 WVEFDLVGRRVGFGRTDCS 431
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 108/405 (26%), Positives = 170/405 (41%), Gaps = 60/405 (14%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
+ P+ + S G Y + + GTPPQ + + TG LVW CT C + + P DP++
Sbjct: 45 VAVPIYLSSQGLYVANFTIGTPPQPVSAVVDLTGE-LVWTQCTPCQPCFEQDLPLFDPTK 103
Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT 211
SS+ + + C + C I P C + C P+ G T
Sbjct: 104 --------SSTFRGLPCGSHLCESI--PESSRNCT-----SDVCIYEAPTKA-----GDT 143
Query: 212 AGLLLSETLRF-PSKTVPNFLAGCSILSDRQ------PAGIAGFGRSSESLPSQLGLKKF 264
G ++T +K F GC +++D++ P+GI G GR+ SL +Q+ + F
Sbjct: 144 GGKAGTDTFAIGAAKETLGF--GCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAF 201
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLR 323
SYCL + S+ L G + S TPF K GSS YY
Sbjct: 202 SYCLAGK--------SSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYY---- 249
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
+V +K + L S V++D+ S +++ ++A+ K +G A+
Sbjct: 250 --MVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPVAS 307
Query: 384 DVEKKSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
+ +D+ K+V PEL+ F GGA + +PP NY GN +CL + +
Sbjct: 308 PPKP-------YDLCFPKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGTVCLTIGSS 360
Query: 442 ---NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N G G A ILG Q +N ++ FDL + F C+
Sbjct: 361 ASLNLTGELEG---ASILGSLQQENVHVLFDLKEETLSFKPADCS 402
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 165/387 (42%), Gaps = 65/387 (16%)
Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
DTGS ++W C+ C + N+ ++ +F P SS++ I C + +C+ F
Sbjct: 22 IDTGSDILWVTCSPCTGCPTSSGLNI---QLESFNPDSSSTASRITCSDDRCTAGFQTG- 77
Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRFPS--------KTVPNFLA 232
E+ C+ + ++ C Y YG G T+G +S+T+ F + + + +
Sbjct: 78 EAICQTSNSQSSPC-----GYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVF 132
Query: 233 GCS-------ILSDRQPAGIAGFGRSSESLPSQLGL-----KKFSYCLLSRKFDDAPVSS 280
GCS +DR GI GFG+ S+ SQL K FS+CL + D+
Sbjct: 133 GCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL--KGSDNG--GG 188
Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
LVL G+ PGL YTP + Y + L I V + + I S
Sbjct: 189 ILVL------GEIVEPGLVYTPLVPSQ--------PHYNLNLESIAVNGQKLPIDSSLFT 234
Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGK 400
+ G IVDSG+T ++ ++ + R+ V K S CF S
Sbjct: 235 --TSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSL-VSKGS---QCFITSSS 288
Query: 401 KSVYLPELILKFKGGAKMALPPENYF---ALVGNEVLCLILFTDNAAGPALGRGPAI-IL 456
P + L F GG M++ PENY A V N VL I + N +G I IL
Sbjct: 289 VDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRN-------QGQEITIL 341
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
GD L++ +DLAN R G+A C+
Sbjct: 342 GDLVLKDKIFVYDLANMRMGWADYDCS 368
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 102/384 (26%), Positives = 153/384 (39%), Gaps = 44/384 (11%)
Query: 110 FGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169
G PPQ + I DTGS+L+W C + C +P + RSS+ + C
Sbjct: 90 IGDPPQRAAALI-DTGSNLIWTQCGT-----TCGLKACAKQDLPYYNLSRSSTFAAVPCA 143
Query: 170 NPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPN 229
+ + + N C + +C A YG G G L +E F S
Sbjct: 144 DS--AKLCAANGVHLCG----LDGSCTFAA-----SYGAGSVFGSLGTEAFTFQSGAA-K 191
Query: 230 FLAGCSILSD------RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLV 283
GC L+ +G+ G GR SL SQ G KFSYCL + S V
Sbjct: 192 LGFGCVSLTRITKGALNGASGLIGLGRGRLSLVSQTGATKFSYCLTPYLRNHGASSHLFV 251
Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL---- 339
+ SG ++ PF K+P + FYY+ L I VG + IP +
Sbjct: 252 GASASLSGGGGA--VTSIPFVKSP--EDYPYSTFYYLPLVGISVGETKLPIPSAAFELRR 307
Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG 399
V +GGVI+D+GS T + + A++ E RQ+ +GL C
Sbjct: 308 VAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLNR--SLVQPPADTGLDLCVARQD 365
Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDF 459
V +P L+ F GGA MA+ +Y+ V C+++ G ++G+F
Sbjct: 366 VDKV-VPVLVFHFGGGADMAVSAGSYWGPVDKSTACMLIEEG---------GYETVIGNF 415
Query: 460 QLQNFYLEFDLANDRFGFAKQKCA 483
Q Q+ +L +D+ F C+
Sbjct: 416 QQQDVHLLYDIGKGELSFQTADCS 439
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 105/388 (27%), Positives = 153/388 (39%), Gaps = 72/388 (18%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G+PP+ S + D+GS +VW C +C + P DP+ +F S
Sbjct: 199 GEYFVRIGVGSPPR-SQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCS 257
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
SS +N C RC+ Y + YG G +T G L ETL
Sbjct: 258 SSVCDRLENAGCH-------AGRCR---------------YEVSYGDGSYTKGTLALETL 295
Query: 221 RFPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFD 274
F V + GC + AG+ G G S S QLG + FSYCL+S
Sbjct: 296 TFGRTMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVS---- 351
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
++ P +NP S FYY+GL + VG V I
Sbjct: 352 -----------------------AAWVPLVRNPRAPS-----FYYIGLAGLGVGGIRVPI 383
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
G+GGV++D+G+ T + ++A F+ Q N RA V C
Sbjct: 384 SEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAI---FDTC 440
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
+D+ G SV +P + F GG + LP N+ + + F + +G +
Sbjct: 441 YDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLS------- 493
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
ILG+ Q + + FD AN GF C
Sbjct: 494 ILGNIQQEGIQISFDGANGYVGFGPNIC 521
>gi|449432733|ref|XP_004134153.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
Length = 432
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 121/419 (28%), Positives = 178/419 (42%), Gaps = 73/419 (17%)
Query: 95 PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA 154
P++ H G Y + TP D G +W VDC+ V S
Sbjct: 33 PVTKHPSGQYITQIRQRTP-LVPVKLTVDLGGQFMW---------VDCDRGYVSSS---- 78
Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGL 214
+ P R S+Q ++ C F P GC+ N TC + ++Q T+G
Sbjct: 79 YKPVRCRSAQCSLSKSTSCGDCFSPPRP----GCN--NNTCGHFPGNTIIQLS---TSGE 129
Query: 215 LLSETLRFPSK---------TVPNFLAGC--SILSDRQPAGI---AGFGRSSESLPSQLG 260
+ S+ L S ++PNFL C + L + G+ AGFGR+ SLPSQ
Sbjct: 130 VTSDVLSVSSTNGFNPTRAVSIPNFLFVCGPTFLLEGLAGGVSGMAGFGRTGISLPSQFS 189
Query: 261 L-----KKFSYCLLSRKFDDAPVSSNLVLDTGPG-----SGDSKTPGLSYTPFYKNPVGS 310
+KF+ CL S V+ +G G T L+YTP + NPV +
Sbjct: 190 AAFSFNRKFAVCL------SGSTRSPGVIFSGNGPYHFLQNVDVTKSLTYTPLFINPVST 243
Query: 311 S--SAFGE---FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF 365
+ S GE Y++G++ I+ SK V I + L S+GNGG + + +T +E ++
Sbjct: 244 AGVSTSGEKSSEYFIGVKSIVFNSKTVPINTTLLKIDSNGNGGTKISTVHPYTVLESSIY 303
Query: 366 EAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP------ELILKFKGGAKMA 419
A+ K R++ N R A V C+ S L +LIL+ K
Sbjct: 304 NALVKTITRELRNIPRVAAVAP---FGVCYKSKSFGSTRLGPGMPSIDLILQNK-KVIWR 359
Query: 420 LPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
+ N V EVLCL F D G R AI++G +Q+++ LEFDLA R GF+
Sbjct: 360 IFGANSMVQVNEEVLCL-GFVD---GGVEAR-TAIVIGAYQMEDNLLEFDLATSRLGFS 413
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 116/399 (29%), Positives = 159/399 (39%), Gaps = 60/399 (15%)
Query: 91 LIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
L++TP Y + GTPPQ DT + W PC C C S
Sbjct: 104 LLQTPT-------YVVRARLGTPPQQLL-LAVDTSNDAAWIPCAG---CAGCPT-----S 147
Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF 210
P F P S+S + + C +P C+ PN C P K C + L Y
Sbjct: 148 SAPPFDPAASTSYRSVPCGSPLCAQ--APNA-----ACPPGGKAC-----GFSLTYADSS 195
Query: 211 TAGLLLSETLRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKF 264
L ++L V + GC + + P G+ G GR S SQ + F
Sbjct: 196 LQAALSQDSLAVAGDAVKTYTFGCLQKATGTAAPPQGLLGLGRGPLSFLSQTRDMYQGTF 255
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
SYCL S F S L L + + P + TP NP SS YYV +
Sbjct: 256 SYCLPS--FKSLNFSGTLRLGR-----NGQPPRIKTTPLLANPHRSS-----LYYVNMTG 303
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
I VG K V IP L G ++DSG+ FT + P + AV E R++G A
Sbjct: 304 IRVGRKVVPIPPPALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRVG-----AP 358
Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNA 443
V G CF+ + +V P + L F G ++ LP EN + CL + A
Sbjct: 359 VSSLGGFDTCFNTT---AVAWPPVTLLFD-GMQVTLPEENVVIHSTYGTISCLAM----A 410
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A P ++ Q QN + FD+ N R GFA+++C
Sbjct: 411 AAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 449
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 99/316 (31%), Positives = 137/316 (43%), Gaps = 52/316 (16%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
L S G Y + L+ GTPP T I DTGS L+W C C D P F
Sbjct: 81 LVTASSGEYLVDLAIGTPPLYYTA-IMDTGSDLIWTQCAPCLLCAD--------QPTPYF 131
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGL 214
K+S++ + + C++ +C+ + P S K C Y YG TAG+
Sbjct: 132 DVKKSATYRALPCRSSRCASLSSP---------SCFKKMC-----VYQYYYGDTASTAGV 177
Query: 215 LLSETLRFPSKT-----VPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFSY 266
L +ET F + N GC L+ A G+ GFGR SL SQLG +FSY
Sbjct: 178 LANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGPSRFSY 237
Query: 267 CLLSRKFDDAP------VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
CL S P V +NL T SG + TPF NP A Y++
Sbjct: 238 CLTSY-LSATPSRLYFGVYANLS-STNTSSGSP----VQSTPFVINP-----ALPNMYFL 286
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L+ I +G+K + I DG GGVI+DSG++ T+++ +EAV + + +
Sbjct: 287 SLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAI---P 343
Query: 381 RAADVEKKSGLRPCFD 396
A + GL CF
Sbjct: 344 LTAMNDTDIGLDTCFQ 359
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 119/447 (26%), Positives = 179/447 (40%), Gaps = 73/447 (16%)
Query: 49 DSDPLKILHSLASSSLSR---ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYS 105
D++ +K + S S +L R + L + T P S IGS Y
Sbjct: 94 DNERVKYIQSRLSKNLGRENSVKELDSTTLPAKSGSLIGS----------------ANYF 137
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
+ + GTP + +FDTGS L W C C + D F P +SSS
Sbjct: 138 VVVGLGTPKR-DLSLVFDTGSDLTWTQCEP---CAGSCYKQQDA----IFDPSKSSSYIN 189
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRF-P 223
I C + C+ + ++SRC + AC Y +QYG T+ G L E L
Sbjct: 190 ITCTSSLCTQLTSAGIKSRCSSSTT-------ACI-YGIQYGDKSTSVGFLSQERLTITA 241
Query: 224 SKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAP 277
+ V +FL GC ++ AG+ G GR S Q K FSYCL P
Sbjct: 242 TDIVDDFLFGCGQDNEGLFSGSAGLIGLGRHPISFVQQTSSIYNKIFSYCL--------P 293
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
+S+ + G+ + L YTP S+ G+ + GL + + K+P
Sbjct: 294 STSSSLGHLTFGASAATNANLKYTPL-------STISGDNTFYGLDIVGISVGGTKLPA- 345
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL-RPCFD 396
+ + GG I+DSG+ T + + A+ F + M Y V + GL C+D
Sbjct: 346 -VSSSTFSAGGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYP----VANEDGLFDTCYD 400
Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI-I 455
SG K + +P++ +F GG + LP + +CL A A G I I
Sbjct: 401 FSGYKEISVPKIDFEFAGGVTVELPLVGILIGRSAQQVCL-------AFAANGNDNDITI 453
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
G+ Q + + +D+ R GF C
Sbjct: 454 FGNVQQKTLEVVYDVEGGRIGFGAAGC 480
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 116/409 (28%), Positives = 171/409 (41%), Gaps = 73/409 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTPP I DTGS ++W C S C C + ++ F P SS
Sbjct: 76 GLYYTKVQLGTPPVEFNVQI-DTGSDVLWVSCNS---CNGCPQTSGLQIQLNFFDPGSSS 131
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
+S +I C + +C+ +S CS +N C SY QYG G T+G +S+ +
Sbjct: 132 TSSMIACSDQRCN----NGKQSSDATCSSQNNQC-----SYTFQYGDGSGTSGYYVSDMM 182
Query: 221 R----FPSKTVPNFLA----GCS-------ILSDRQPAGIAGFGRSSESLPSQL---GL- 261
F N A GCS SDR GI GFG+ S+ SQL G+
Sbjct: 183 HLNTIFEGSMTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIA 242
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYY 319
+ FS+CL D+ LVL G+ P + YT P Y
Sbjct: 243 PRIFSHCLKG----DSSGGGILVL------GEIVEPNIVYTSLVPAQP---------HYN 283
Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
+ L+ I V + ++I S V + + G IVDSG+T ++ ++ +
Sbjct: 284 LNLQSISVNGQTLQIDSS--VFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITAAIPQS 341
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEVLC 435
R S C+ I+ + P++ L F GGA M L P++Y ++ G V C
Sbjct: 342 VRTV----VSRGNQCYLITSSVTDVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWC 397
Query: 436 LILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ G +G I ILGD L++ + +DLA R G+A C+
Sbjct: 398 I--------GFQKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDCS 438
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 126/487 (25%), Positives = 190/487 (39%), Gaps = 75/487 (15%)
Query: 12 FSLLILLFTTDAGA---GSSAATVTVPLTPLSTK--HYLHHSDSDPLKILHSLASSSLSR 66
F L LLF+T + + T + + P+ +K ++ + + ++AS R
Sbjct: 10 FFLFALLFSTTKAVDPCATQSDTSDLSVIPIYSKCSPFVPPKQESWVNTVITMASKDPER 69
Query: 67 ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGS 126
++L T KT I V Y + + GTP Q + DT +
Sbjct: 70 LKYLSTLADQKTTAVPIAPGQQ---------VLKIANYVVRVKLGTPGQQMF-MVLDTSN 119
Query: 127 SLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCK 186
W PC+ C C+ F+P S++ + C +CS +
Sbjct: 120 DAAWVPCSG---CTGCSSTT--------FLPNASTTLGSLDCSGAQCSQV---------- 158
Query: 187 GCSPRNKTCPLACPSY-LLQYGLGFTAGL---LLSETLRFPSKTVPNFLAGC-SILSDRQ 241
R +CP S L G + L L+ + + + +P F GC + +S
Sbjct: 159 ----RGFSCPATGSSACLFNQSYGGDSSLTATLVQDAITLANDVIPGFTFGCINAVSGGS 214
Query: 242 --PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP 296
P G+ G GR SL SQ G FSYCL S F S +L L GP G K+
Sbjct: 215 IPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPS--FKSYYFSGSLKL--GP-VGQPKS- 268
Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
+ TP +NP S YYV L + VG V IP LV + G I+DSG+
Sbjct: 269 -IRTTPLLRNPHRPS-----LYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTV 322
Query: 357 FTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGA 416
T P++ A+ EF +Q+ + CF + + P + L F+ G
Sbjct: 323 ITRFVQPVYFAIRDEFRKQVN-----GPISSLGAFDTCFAATNEAEA--PAITLHFE-GL 374
Query: 417 KMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRF 475
+ LP EN + L CL + AA P ++ + Q QN + FD N R
Sbjct: 375 NLVLPMENSLIHSSSGSLACLSM----AAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRL 430
Query: 476 GFAKQKC 482
G A++ C
Sbjct: 431 GIARELC 437
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 118/400 (29%), Positives = 163/400 (40%), Gaps = 62/400 (15%)
Query: 91 LIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
L++TP Y + GTP Q DT + W PC+ C C P P
Sbjct: 101 LLQTPT-------YVVRARLGTPAQ-QLLLAVDTSNDAAWIPCSG---CAGC--PTSSP- 146
Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF 210
F P S+S + + C +P+C + PN CSP K+C + L Y
Sbjct: 147 ----FNPAASASYRPVPCGSPQC--VLAPN-----PSCSPNAKSC-----GFSLSYADSS 190
Query: 211 TAGLLLSETLRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKF 264
L +TL V + GC + + P G+ G GR S SQ + F
Sbjct: 191 LQAALSQDTLAVAGDVVKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATF 250
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLR 323
SYCL S F S L L G + P + TP NP SS YYV +
Sbjct: 251 SYCLPS--FKSLNFSGTLRL------GRNGQPRRIKTTPLLANPHRSS-----LYYVNMT 297
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
I VG K V IP S L G ++DSG+ FT + P++ A+ E R++G + AA
Sbjct: 298 GIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLALRDEVRRRVG--AGAA 355
Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG-NEVLCLILFTDN 442
V G C++ +V P + L F G ++ LP EN CL +
Sbjct: 356 AVSSLGGFDTCYN----TTVAWPPVTLLFD-GMQVTLPEENVVIHTTYGTTSCLAM---- 406
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
AA P ++ Q QN + FD+ N R GFA++ C
Sbjct: 407 AAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 446
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 165/387 (42%), Gaps = 65/387 (16%)
Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
DTGS ++W C+ C + N+ ++ +F P SS++ I C + +C+ F
Sbjct: 108 IDTGSDILWVTCSPCTGCPTSSGLNI---QLESFNPDSSSTASRITCSDDRCTAGFQTG- 163
Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRFPS--------KTVPNFLA 232
E+ C+ + ++ C Y YG G T+G +S+T+ F + + + +
Sbjct: 164 EAICQTSNSQSSPC-----GYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVF 218
Query: 233 GCS-------ILSDRQPAGIAGFGRSSESLPSQLGL-----KKFSYCLLSRKFDDAPVSS 280
GCS +DR GI GFG+ S+ SQL K FS+CL + D+
Sbjct: 219 GCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL--KGSDNG--GG 274
Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
LVL G+ PGL YTP + Y + L I V + + I S
Sbjct: 275 ILVL------GEIVEPGLVYTPLVPSQ--------PHYNLNLESIAVNGQKLPIDSSLFT 320
Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGK 400
+ G IVDSG+T ++ ++ + R+ V K S CF S
Sbjct: 321 --TSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSL-VSKGS---QCFITSSS 374
Query: 401 KSVYLPELILKFKGGAKMALPPENYF---ALVGNEVLCLILFTDNAAGPALGRGPAI-IL 456
P + L F GG M++ PENY A V N VL I + N +G I IL
Sbjct: 375 VDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRN-------QGQEITIL 427
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
GD L++ +DLAN R G+A C+
Sbjct: 428 GDLVLKDKIFVYDLANMRMGWADYDCS 454
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 165/387 (42%), Gaps = 65/387 (16%)
Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
DTGS ++W C+ C + N+ ++ +F P SS++ I C + +C+ F
Sbjct: 106 IDTGSDILWVTCSPCTGCPTSSGLNI---QLESFNPDSSSTASRITCSDDRCTAGFQTG- 161
Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRFPS--------KTVPNFLA 232
E+ C+ + ++ C Y YG G T+G +S+T+ F + + + +
Sbjct: 162 EAICQTSNSQSSPC-----GYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVF 216
Query: 233 GCS-------ILSDRQPAGIAGFGRSSESLPSQLGL-----KKFSYCLLSRKFDDAPVSS 280
GCS +DR GI GFG+ S+ SQL K FS+CL + D+
Sbjct: 217 GCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL--KGSDNG--GG 272
Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
LVL G+ PGL YTP + Y + L I V + + I S
Sbjct: 273 ILVL------GEIVEPGLVYTPLVPSQ--------PHYNLNLESIAVNGQKLPIDSSLFT 318
Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGK 400
+ G IVDSG+T ++ ++ + R+ V K S CF S
Sbjct: 319 --TSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSL-VSKGS---QCFITSSS 372
Query: 401 KSVYLPELILKFKGGAKMALPPENYF---ALVGNEVLCLILFTDNAAGPALGRGPAI-IL 456
P + L F GG M++ PENY A V N VL I + N +G I IL
Sbjct: 373 VDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRN-------QGQEITIL 425
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
GD L++ +DLAN R G+A C+
Sbjct: 426 GDLVLKDKIFVYDLANMRMGWADYDCS 452
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 133/484 (27%), Positives = 189/484 (39%), Gaps = 83/484 (17%)
Query: 31 TVTVPLTPLSTKHYLHHSDSD-PLKILHSLASSSLSRARH---LKTKTKPKTK------- 79
TVT L + H+ S S L++LH S++ H L + + T
Sbjct: 38 TVTATLPDFNNTHFSDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILR 97
Query: 80 ----------DSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
DS N S I + + S G Y + + G+PP+ + D+GS +V
Sbjct: 98 RISGKVIPSSDSRYEVNDFGSDIVSGMDQGS-GEYFVRIGVGSPPRDQY-MVIDSGSDMV 155
Query: 130 WF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCK 186
W PC Y+ D P F P +S S + C + C I
Sbjct: 156 WVQCQPCKLCYKQSD-----------PVFDPAKSGSYTGVSCGSSVCDRI---------- 194
Query: 187 GCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSD---RQP 242
N C Y + YG G +T G L ETL F V N GC +
Sbjct: 195 ----ENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIGA 250
Query: 243 AGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP-GL 298
AG+ G G S S QL + F YCL+SR D + +LV G P G
Sbjct: 251 AGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDS---TGSLVF------GREALPVGA 301
Query: 299 SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFT 358
S+ P +NP S FYYVGL+ + VG + +P G+GGV++D+G+ T
Sbjct: 302 SWVPLVRNPRAPS-----FYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVT 356
Query: 359 FMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKM 418
+ + A F Q N RA+ V S C+D+SG SV +P + F G +
Sbjct: 357 RLPTAAYVAFRDGFKSQTANLPRASGV---SIFDTCYDLSGFVSVRVPTVSFYFTEGPVL 413
Query: 419 ALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
LP N+ V + F + G + I+G+ Q + + FD AN GF
Sbjct: 414 TLPARNFLMPVDDSGTYCFAFAASPTGLS-------IIGNIQQEGIQVSFDGANGFVGFG 466
Query: 479 KQKC 482
C
Sbjct: 467 PNVC 470
>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
Length = 204
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 76/224 (33%), Positives = 104/224 (46%), Gaps = 31/224 (13%)
Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
KFSYCL S DD+ S L+ GS T TP NP S FYY+ L
Sbjct: 5 KFSYCLTS--MDDSKASVLLL-----GSLAKATKDAISTPLLTNPSQPS-----FYYLSL 52
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
I VG + I S DG+GGVI+DSG+T T++E +F+ + KEFI Q +
Sbjct: 53 EGIPVGGTQLSIEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEFISQ---SNLQ 109
Query: 383 ADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE---VLCLIL 438
D +GL CF + S V +P+L+ FKGG + LP E+Y ++ + V CL +
Sbjct: 110 LDKSSSTGLDVCFSLPSETTQVEVPKLVFHFKGG-DLELPAESY--MIADSKLGVACLAM 166
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
N I G+ Q QN + DL + F +C
Sbjct: 167 GASNGMS---------IFGNVQQQNILVNHDLEKETISFVPTQC 201
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 126/470 (26%), Positives = 185/470 (39%), Gaps = 74/470 (15%)
Query: 31 TVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNS 90
TV+VPL H P ++ SS R R + ++K + G ++
Sbjct: 55 TVSVPLVH-------RHGPCAPTQLSSDKPSSFTDRLRRNRARSKYIMSRVSKGMMGDDA 107
Query: 91 LIKTPL----SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
+ P SV S Y +++ GTP S + DTGS L W +C CN
Sbjct: 108 DVSIPTHLGGSVDSLE-YVVTVGLGTP-SVSQVLLIDTGSDLSWV------QCQPCNSTT 159
Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
P + P F P +SS+ I C C + + GC+ + A + + Y
Sbjct: 160 CYPQKDPLFDPSKSSTYAPIPCNTDACRDL---TDDGYGGGCASGDGA---AQCGFAITY 213
Query: 207 GLGF-TAGLLLSETLRF-PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL 261
G G T G+ +ETL P V +F GC D + G+ G G + ESL Q
Sbjct: 214 GDGSQTRGVYSNETLALAPGVAVKDFRFGCGHDQDGANDKYDGLLGLGGAPESLVVQTAS 273
Query: 262 ---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS------KTPGLSYTPFYKNPVGSSS 312
FSYCL P +N V G G + T G +TP +
Sbjct: 274 VYGGAFSYCL--------PALNNQVGFLALGGGGAPSGGVVNTSGFVFTPMIREEE---- 321
Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
FY V + I VG + + +P S +GG+I+DSG+ T ++ + A+ F
Sbjct: 322 ---TFYVVNMTGITVGGEPIDVPPSAF------SGGMIIDSGTVVTELQHTAYNALQAAF 372
Query: 373 IRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE 432
+ M Y + E L C+D SG +V LP++ L F GGA + L V N
Sbjct: 373 RKAMAAYPLVRNGE----LDTCYDFSGYSNVTLPKVALTFSGGATIDLD-------VPNG 421
Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+L +GP G ILG+ + + +D R GF C
Sbjct: 422 ILLDDCLAFQESGPDDQPG---ILGNVNQRTLEVLYDAGRGRVGFRAAVC 468
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 127/460 (27%), Positives = 179/460 (38%), Gaps = 73/460 (15%)
Query: 38 PLSTKHYLHHSDSDPLKILHSLASSSLSRA----RHLKTKTKPKTKDSNIGSNYSNSLIK 93
P T HH LH+ R R + K + DS N S +
Sbjct: 70 PSVTYRNHHHR-------LHARMRRDTDRVSAILRRISGKVVVASSDSRYEVNDFGSDVV 122
Query: 94 TPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPS 150
+ + S G Y + + G+PP+ + D+GS +VW PC Y+ D
Sbjct: 123 SGMDQGS-GEYFVRIGVGSPPRDQY-MVIDSGSDMVWVQCQPCKLCYKQSD--------- 171
Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG- 209
P F P +S S + C + C I N C Y + YG G
Sbjct: 172 --PVFDPAKSGSYTGVSCGSSVCDRI--------------ENSGCHSGGCRYEVMYGDGS 215
Query: 210 FTAGLLLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKK--- 263
+T G L ETL F V N GC + AG+ G G S S QL +
Sbjct: 216 YTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGA 275
Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGL 322
F YCL+SR D + +LV G P G S+ P +NP S FYYVGL
Sbjct: 276 FGYCLVSRGTDS---TGSLVF------GREALPVGASWVPLVRNPRAPS-----FYYVGL 321
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
+ + VG + +P G+GGV++D+G+ T + + A F Q N RA
Sbjct: 322 KGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRA 381
Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
+ V S C+D+SG SV +P + F G + LP N+ V + F +
Sbjct: 382 SGV---SIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAAS 438
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G + I+G+ Q + + FD AN GF C
Sbjct: 439 PTGLS-------IIGNIQQEGIQVSFDGANGFVGFGPNVC 471
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 118/400 (29%), Positives = 163/400 (40%), Gaps = 62/400 (15%)
Query: 91 LIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
L++TP Y + GTP Q DT + W PC+ C C P P
Sbjct: 48 LLQTPT-------YVVRARLGTPAQ-QLLLAVDTSNDAAWIPCSG---CAGC--PTSSP- 93
Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF 210
F P S+S + + C +P+C + PN CSP K+C + L Y
Sbjct: 94 ----FNPAASASYRPVPCGSPQC--VLAPN-----PSCSPNAKSC-----GFSLSYADSS 137
Query: 211 TAGLLLSETLRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKF 264
L +TL V + GC + + P G+ G GR S SQ + F
Sbjct: 138 LQAALSQDTLAVAGDVVKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATF 197
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLR 323
SYCL S F S L L G + P + TP NP SS YYV +
Sbjct: 198 SYCLPS--FKSLNFSGTLRL------GRNGQPRRIKTTPLLANPHRSS-----LYYVNMT 244
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
I VG K V IP S L G ++DSG+ FT + P++ A+ E R++G + AA
Sbjct: 245 GIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLALRDEVRRRVG--AGAA 302
Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG-NEVLCLILFTDN 442
V G C++ +V P + L F G ++ LP EN CL +
Sbjct: 303 AVSSLGGFDTCYN----TTVAWPPVTLLFD-GMQVTLPEENVVIHTTYGTTSCLAM---- 353
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
AA P ++ Q QN + FD+ N R GFA++ C
Sbjct: 354 AAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 393
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 110/413 (26%), Positives = 169/413 (40%), Gaps = 64/413 (15%)
Query: 85 SNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
++ S + I++P+ + G + +S+ GTPP + I DTGS L W C C +
Sbjct: 72 TSVSTACIRSPI-IPDSGEFLMSIFIGTPP-VNVIAIADTGSDLTWTQCLPCRECFN--- 126
Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
P F P+RSSS + + C + C ++ES C P ++C SY
Sbjct: 127 -----QSQPIFNPRRSSSYRKVSCASDTCR-----SLES--YHCGPDLQSC-----SYGY 169
Query: 205 QYG-LGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSS--------ESL 255
YG FT G L S+ + S +P + GC + G+ +
Sbjct: 170 SYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQM 229
Query: 256 PSQLGLK-KFSYCLLSRKFDDAPVSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSS 311
+ G+K +FSYCL + F +A ++ + G TP + +P
Sbjct: 230 RTIAGVKPRFSYCLPTF-FSNANITGTISFGRKAVVSGRQVVSTPLVPRSP--------- 279
Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
FY++ L I VG K K + +G +I+DSG+T T + L+ V
Sbjct: 280 ---DTFYFLTLEAISVGKKRFKAANG--ISAMTNHGNIIIDSGTTLTLLPRSLYYGVFST 334
Query: 372 FIRQMGNYSRAADVEKKSG-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG 430
R + +A V+ SG L C+ + +P + F GGA + L P N FA V
Sbjct: 335 LARVI----KAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVA 390
Query: 431 NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ V CL PA I G+ NF + +DL N R F + CA
Sbjct: 391 DNVTCLTF------APAT---QVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 434
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 161/387 (41%), Gaps = 52/387 (13%)
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
+ + GTPPQ ++ I D LVW C+ RC +P F+P SS+ +
Sbjct: 70 NFTIGTPPQPASAII-DVAGELVWTQCSMCSRCFK--------QDLPLFVPNASSTFRPE 120
Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT 226
C C I N CS T S L G T G++ ++T + T
Sbjct: 121 PCGTDACKSIPTSN-------CSSNMCTYEGTINSKLG----GHTLGIVATDTFAIGTAT 169
Query: 227 VPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNL 282
+ GC + S P+G+ G GR+ SL SQ+ + KFSYCL D+ +S L
Sbjct: 170 A-SLGFGCVVASGIDTMGGPSGLIGLGRAPSSLVSQMNITKFSYCLTPH---DSGKNSRL 225
Query: 283 VLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
+L + G G+S T TPF K G ++Y + L I G + +P S
Sbjct: 226 LLGSSAKLAGGGNSTT-----TPFVKTSPGDD--MSQYYPIQLDGIKAGDAAIALPPS-- 276
Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG 399
GN V+V + + +F+ ++A+ KE + +G A ++ CF +G
Sbjct: 277 -----GN-TVLVQTLAPMSFLVDSAYQALKKEVTKAVGAAPTATPLQP---FDLCFPKAG 327
Query: 400 KKSVYLPELILKF-KGGAKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPAIIL 456
+ P+L+ F +G A + +PP Y VG E +C+ + + + IL
Sbjct: 328 LSNASAPDLVFTFQQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNIL 387
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
G Q +N + DL F C+
Sbjct: 388 GSLQQENTHFLLDLEKKTLSFEPADCS 414
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 120/409 (29%), Positives = 169/409 (41%), Gaps = 80/409 (19%)
Query: 98 VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPA 154
V S G Y ++L GTPP I DTGS L W PCT Y+ V +P
Sbjct: 86 VPSAGEYLMNLYIGTPPVPVIA-IVDTGSDLTWTQCRPCTHCYKQV-----------VPL 133
Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAG 213
F PK SS+ + C C + + + CS + K C ++ Y G FT G
Sbjct: 134 FDPKNSSTYRDSSCGTSFCLAL------GKDRSCS-KEKKC-----TFRYSYADGSFTGG 181
Query: 214 LLLSETLRFPSK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQL----- 259
L SETL S + P F GC S D+ +GI G G SL SQL
Sbjct: 182 NLASETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTIN 241
Query: 260 GLKKFSYCLLSRKFDDAPVSSNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGE 316
GL FSYCLL D+ +SS + G G TP + +P
Sbjct: 242 GL--FSYCLLPVS-TDSSISSRINFGASGRVSGYGTVSTPLVQKSP------------DT 286
Query: 317 FYYVGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
FYY+ L I VG K ++PY Y G +IVDSG+T+TF+ + + K
Sbjct: 287 FYYLTLEGISVGKK--RLPYKGYSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEK----S 340
Query: 376 MGNYSRAADVEKKSGL-RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL 434
+ N + V +G+ C++ + + + P + FK A + L P N F + +++
Sbjct: 341 VANSIKGKRVRDPNGIFSLCYNTTAE--INAPIITAHFK-DANVELQPLNTFMRMQEDLV 397
Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
C + + G +LG+ NF + FDL R F C
Sbjct: 398 CFTVAPTSDIG---------VLGNLAQVNFLVGFDLRKKRVSFKAADCT 437
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 119/397 (29%), Positives = 169/397 (42%), Gaps = 82/397 (20%)
Query: 97 SVH-SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
SVH S Y + ++ GTPP T + DTGS L+W C + C C FP P+ P +
Sbjct: 84 SVHASTATYLVDIAIGTPPLPLTA-VLDTGSDLIWTQCDAP--CRRC-FPQ--PA--PLY 135
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGL 214
P RS++ + C++P C + P SR CSP + C +Y YG G T G+
Sbjct: 136 APARSATYANVSCRSPMCQALQSP--WSR---CSPPDTGC-----AYYFSYGDGTSTDGV 185
Query: 215 LLSETLRFPSKTVPNFLA-GC---SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLS 270
L +ET S T +A GC ++ S +G+ G GR SL SQLG+ +
Sbjct: 186 LATETFTLGSDTAVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTR------- 238
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
P S P +P L I VG
Sbjct: 239 ------PRRSCRA---------RAAARGGGAPTTTSP--------------LEGITVGDT 269
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+ I + G+GGVI+DSG+TFT +E F A+A+ ++ A+ G
Sbjct: 270 LLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERAFVALARALASRV-RLPLASGAHL--G 326
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA-LG 449
L CF + ++V +P L+L F GA M L E+Y + D +AG A LG
Sbjct: 327 LSLCFAAASPEAVEVPRLVLHFD-GADMELRRESY------------VVEDRSAGVACLG 373
Query: 450 ----RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
RG + +LG Q QN ++ +DL F KC
Sbjct: 374 MVSARGMS-VLGSMQQQNTHILYDLERGILSFEPAKC 409
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 120/446 (26%), Positives = 175/446 (39%), Gaps = 74/446 (16%)
Query: 49 DSDPLKILHSLASSSL---SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYS 105
D++ +K + S S +L +R + L + T P IGS Y
Sbjct: 98 DNERVKYIQSRLSKNLGGENRVKELDSTTLPAKSGRLIGS----------------ADYY 141
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
+ + GTP + IFDTGS L W C C + DP F P +SSS
Sbjct: 142 VVVGLGTPKR-DLSLIFDTGSYLTWTQCEP---CAGSCYKQQDP----IFDPSKSSSYTN 193
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-P 223
I C + C+ R GCS + Y ++YG + G L E L
Sbjct: 194 IKCTSSLCTQF-------RSAGCSSSTDASCI----YDVKYGDNSISRGFLSQERLTITA 242
Query: 224 SKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
+ V +FL GC ++ R AG+ G R S Q + K FSYCL P
Sbjct: 243 TDIVHDFLFGCGQDNEGLFRGTAGLMGLSRHPISFVQQTSSIYNKIFSYCL--------P 294
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
+ + + G+ + L YTPF S+ GE + GL + + K+P
Sbjct: 295 STPSSLGHLTFGASAATNANLKYTPF-------STISGENSFYGLDIVGISVGGTKLPA- 346
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
+ + GG I+DSG+ T + + A+ F + M Y A L C+D
Sbjct: 347 -VSSSTFSAGGSIIDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRL---LDTCYDF 402
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI-IL 456
SG K + +P + +F GG K+ LP + LCL A A G G I I
Sbjct: 403 SGYKEISVPRIDFEFAGGVKVELPLVGILYGESAQQLCL-------AFAANGNGNDITIF 455
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
G+ Q + + +D+ R GF C
Sbjct: 456 GNVQQKTLEVVYDVEGGRIGFGAAGC 481
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 118/409 (28%), Positives = 168/409 (41%), Gaps = 62/409 (15%)
Query: 82 NIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD 141
+ + ++ + T + V ++ Y +++S GTP + T + DTGS + W +C
Sbjct: 122 QLATGSRSATVPTTMGVGTFQ-YVVTVSLGTPGVSQTVEV-DTGSDVSWV------QCKP 173
Query: 142 CNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS 201
C+ P + R F P +SS+ + C CS + E+ C G C
Sbjct: 174 CSAPACNSQRDQLFDPAKSSTYSAVPCGADACSEL--RIYEAGCSG-----SQC-----G 221
Query: 202 YLLQYGLGF-TAGLLLSETLRF-PSKTVPNFLAGCSILSDRQPAGIAGF---GRSSESLP 256
Y++ YG G T G+ S+TL P TV FL GC AGI G GR S SL
Sbjct: 222 YVVSYGDGSNTTGVYGSDTLALAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLK 281
Query: 257 SQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSA 313
SQ FSYCL S++ + L L GP S + F + ++ A
Sbjct: 282 SQAAGAYGGVFSYCLPSKQ----SAAGYLTLG-GP---------TSASGFATTGLLTAWA 327
Query: 314 FGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
FY V L I VG + V +P S GG +VD+G+ T + + A+ F
Sbjct: 328 APTFYMVMLTGISVGGQQVAVPASAFA------GGTVVDTGTVITRLPPTAYAALRSAFR 381
Query: 374 RQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV 433
+ Y + L C+D S V LP + L F GGA +AL
Sbjct: 382 GAIAPYGYPS-APANGILDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGIL-----SS 435
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
CL A P G G A ILG+ Q ++F + FD GF C
Sbjct: 436 GCL------AFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 109/406 (26%), Positives = 161/406 (39%), Gaps = 75/406 (18%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y +++ GTP + T +FDTGS L W C C D + +P F P +SS+
Sbjct: 126 YVVTIGIGTPARNFT-VLFDTGSDLTWVQCKP---CTDSCYQQQEP----LFDPSKSSTY 177
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
+ C P+C G + C G TC Y ++YG T G L E
Sbjct: 178 VDVPCGTPQCK--IGGGQDLTCGG-----TTC-----EYSVKYGDQSVTRGNLAQEAFTL 225
Query: 223 PSKTVP--NFLAGCS---------ILSDRQPAGIAGFGRSSESLPSQLGLKK----FSYC 267
P + GCS + AG+ G GR S+ SQ FSYC
Sbjct: 226 SPSAPPAAGVVFGCSHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYC 285
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV 327
L R SS L G + LS+TP V +S Y V L I V
Sbjct: 286 LPPRG------SSAGYLTIGAAA--PPQSNLSFTPL----VTDNSQLSSVYVVNLVGISV 333
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
+ I S G+ ++DSG+ T M + + EF R MG Y+ +
Sbjct: 334 SGAALPIDASAFYIGT------VIDSGTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHV 387
Query: 388 KSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
+S L C+D++G V P + L+F GGA++ + L++F +A+G +
Sbjct: 388 ES-LDTCYDVTGHDVVTAPPVALEFGGGARIDVDASG----------ILLVFAVDASGQS 436
Query: 448 LGRG---------PA-IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
L P +I+G+ Q + + + FD+ R GF C+
Sbjct: 437 LTLACLAFVPTNLPGFVIIGNMQQRAYNVVFDVEGRRIGFGANGCS 482
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 117/414 (28%), Positives = 164/414 (39%), Gaps = 69/414 (16%)
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
++ ++ GTPPQ T + DTGS L W C Y P PAF SSS
Sbjct: 56 TVPVAVGTPPQNVT-MVLDTGSELSWLLCNGSYA----------PPLTPAFNASGSSSYG 104
Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCP--------------LACPSYLLQYGLGF 210
+ C + C W G ++ +P + C LA ++LL G
Sbjct: 105 AVPCPSTACEW-RGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPP 163
Query: 211 TA-GLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLL 269
A G + S T N + +S+ G+ G R + S +Q G ++F+YC+
Sbjct: 164 VAVGAYFGCITSYSSTTATNSNGTGTDVSEAA-TGLLGMNRGTLSFVTQTGTRRFAYCI- 221
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF----YYVGLRQI 325
AP VL G G P L+YTP + S F Y V L I
Sbjct: 222 ------APGEGPGVLLLGDDGG--VAPPLNYTPLIE----ISQPLPYFDRVAYSVQLEGI 269
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
VG + IP S L P G G +VDSG+ FTF+ + A+ EF Q A
Sbjct: 270 RVGCALLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQ-ARLLLAPLG 328
Query: 386 EKKSGLRPCFDIS--------GKKSVYLPELILKFKGGAKMALPPENYFALVGNE----- 432
E + FD S LPE+ L + GA++A+ E +V E
Sbjct: 329 EPGFVFQGAFDACFRGPEARVAAASGLLPEVGLVLR-GAEVAVSGEKLLYMVPGERRGEG 387
Query: 433 ----VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
V CL + AG + A ++G QN ++E+DL N R GFA +C
Sbjct: 388 GAEAVWCLTFGNSDMAGMS-----AYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 122/390 (31%), Positives = 171/390 (43%), Gaps = 71/390 (18%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y I++ G+P A T I DTGS + W C C C+ D F P SS+
Sbjct: 127 YLITVGMGSPAVAQTMLI-DTGSDVSWVQCKP---CSQCH-SQADS----LFDPSSSSTY 177
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT-AGLLLSETLRF 222
C + C+ + R +GCS + C Y ++YG G T +G S+TL
Sbjct: 178 SAFSCTSAACAQL-------RQRGCS--SSQC-----QYTVKYGDGSTGSGTYSSDTLAL 223
Query: 223 PSKTVPNFLAGCS------ILSDRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKF 273
S TV NF GCS +L D Q AG+ G G +ESL +Q K FSYCL
Sbjct: 224 GSSTVENFQFGCSQSESGNLLQD-QTAGLMGLGGGAESLATQTAGTFGKAFSYCL----- 277
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
P PGS T G S + F K P+ S+ +Y V L+ I VG + +
Sbjct: 278 --PPT---------PGSSGFLTLGASTSGFVVKTPMLRSTQVPSYYGVLLQAIRVGGRQL 326
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
IP S GS I+DSG+ T + + A++ F M Y A +
Sbjct: 327 NIPASAFSAGS------IMDSGTIITRLPRTAYSALSSAFKAGMKQYPPA---QPMGIFD 377
Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
CFD SG+ SV +P + L F GGA + L + ++G+ CL F N+ +LG
Sbjct: 378 TCFDFSGQSSVSIPTVALVFSGGAVVDLASDGI--ILGS---CLA-FAANSDDTSLG--- 428
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + F + +D+ GF C
Sbjct: 429 --IIGNVQQRTFEVLYDVGGGAVGFKAGAC 456
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 111/388 (28%), Positives = 170/388 (43%), Gaps = 51/388 (13%)
Query: 108 LSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG 167
++ G Q ST I DTGS L W C C + + P F P SSS +
Sbjct: 68 VTVGIGGQNST-LIVDTGSDLTWVQCLPCRLCYN--------QQEPLFNPSNSSSFLSLP 118
Query: 168 CQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT 226
C +P C + P S CS +N T +C Y + YG G ++ G L E L
Sbjct: 119 CNSPTCVAL-QPTAGSSGL-CSNKNST---SC-DYQIDYGDGSYSRGELGFEKLTLGKTE 172
Query: 227 VPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAPVSS 280
+ NF+ GC + +G+ G RS SL SQ L FSYCL + S
Sbjct: 173 IDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGS---SG 229
Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
+L L S +SYT +NP S+ FY++ L I +G ++ V
Sbjct: 230 SLTLGGADFSNFKNISPISYTRMIQNPQMSN-----FYFLNLTGISIGGVNLN------V 278
Query: 341 PGSDGNGGV--IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS 398
P N GV ++DSG+ T + +++A EF +Q Y S L CF+++
Sbjct: 279 PRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGF---SILNTCFNLT 335
Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNEV--LCLILFTDNAAGPALG-RGPAII 455
G + V +P + F+G A+M + E F V ++ +CL A +LG +I
Sbjct: 336 GYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICL-------AFASLGYEDQTMI 388
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+G++Q +N + ++ + GFA + C+
Sbjct: 389 IGNYQQKNQRVIYNSKESKVGFAGEPCS 416
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 111/388 (28%), Positives = 170/388 (43%), Gaps = 51/388 (13%)
Query: 108 LSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG 167
++ G Q ST I DTGS L W C C + + P F P SSS +
Sbjct: 147 VTVGIGGQNST-LIVDTGSDLTWVQCLPCRLCYN--------QQEPLFNPSNSSSFLSLP 197
Query: 168 CQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT 226
C +P C + P S CS +N T +C Y + YG G ++ G L E L
Sbjct: 198 CNSPTCVAL-QPTAGSSGL-CSNKNST---SC-DYQIDYGDGSYSRGELGFEKLTLGKTE 251
Query: 227 VPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAPVSS 280
+ NF+ GC + +G+ G RS SL SQ L FSYCL + S
Sbjct: 252 IDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGS---SG 308
Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
+L L S +SYT +NP S+ FY++ L I +G ++ V
Sbjct: 309 SLTLGGADFSNFKNISPISYTRMIQNPQMSN-----FYFLNLTGISIGGVNLN------V 357
Query: 341 PGSDGNGGV--IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS 398
P N GV ++DSG+ T + +++A EF +Q Y S L CF+++
Sbjct: 358 PRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGF---SILNTCFNLT 414
Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNEV--LCLILFTDNAAGPALG-RGPAII 455
G + V +P + F+G A+M + E F V ++ +CL A +LG +I
Sbjct: 415 GYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICL-------AFASLGYEDQTMI 467
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+G++Q +N + ++ + GFA + C+
Sbjct: 468 IGNYQQKNQRVIYNSKESKVGFAGEPCS 495
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 98.6 bits (244), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 115/403 (28%), Positives = 167/403 (41%), Gaps = 63/403 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +S+ GTP + T +FDTGS L W +C C+ + P F P SS
Sbjct: 83 GNYVVSVGLGTPARDLT-VVFDTGSDLSWV------QCGPCSSGGCYHQQDPLFAPSSSS 135
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ + C P+C P C SP + CP Y + YG T G L ++TL
Sbjct: 136 TFSAVRCGEPEC-----PRARQSCSS-SPGDDRCP-----YEVVYGDKSRTVGHLGNDTL 184
Query: 221 RF-----------PSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---K 263
S +P F+ GC + + G+ G GR SL SQ K
Sbjct: 185 TLGTTPSTNASENNSNKLPGFVFGCGENNTGLFGKADGLFGLGRGKVSLSSQAAGKYGEG 244
Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
FSYCL S SSN G S + P ++ F P+ + S FYYV L
Sbjct: 245 FSYCLPSS-------SSNA---HGYLSLGTPAPAPAHARF--TPMLNRSNTPSFYYVKLV 292
Query: 324 QIIVGSKHVKIPYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
I V + +K+ L P G+IVDSG+ T + + A+ F+ MG Y
Sbjct: 293 GIRVAGRAIKVSSRPALWPA-----GLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGY- 346
Query: 383 ADVEKKSGLRPCFDIS--GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
+ S L C+D + +V +P + L F GGA +++ + CL F
Sbjct: 347 KRAPRLSILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQACLA-FA 405
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N G + G ILG+ Q + + +D+ + GFA + C+
Sbjct: 406 PNGNGRSAG-----ILGNTQQRTVAVVYDVGRQKIGFAAKGCS 443
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 98.6 bits (244), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 107/395 (27%), Positives = 161/395 (40%), Gaps = 61/395 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + L GTPP+ I DTGSSL W C C DP + P S
Sbjct: 123 GNYYVKLGLGTPPKYYA-MILDTGSSLSWLQCQP---CAVYCHAQADP----LYDPSVSK 174
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ + + C + +CS + + P +T AC Y YG F+ G L + L
Sbjct: 175 TYKKLSCASVECSRLKAATLND------PLCETDSNAC-LYTASYGDTSFSIGYLSQDLL 227
Query: 221 RFPS-KTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
S +T+P F GC D Q AGI G R S+ +QL K FSYCL +
Sbjct: 228 TLTSSQTLPQFTYGCG--QDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCLPTA 285
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY---KNPVGSSSAFGEFYYVGLRQIIVG 328
+ + P S +TP KNP Y++ L I V
Sbjct: 286 NSGSSGGGFLSIGSISPTS-------YKFTPMLTDSKNP--------SLYFLRLTAITVS 330
Query: 329 SKHVKIPYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
+ + + + Y VP ++DSG+ T + ++ A+ + F++ M ++ A
Sbjct: 331 GRPLDLAAAMYRVP-------TLIDSGTVITRLPMSMYAALRQAFVKIMS--TKYAKAPA 381
Query: 388 KSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
S L CF S K +PE+ + F+GGA + L + + CL AG +
Sbjct: 382 YSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSILIEADKGITCLAF-----AGSS 436
Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G I+G+ Q Q + + +D++ R GFA C
Sbjct: 437 -GTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 470
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 98.6 bits (244), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 113/393 (28%), Positives = 166/393 (42%), Gaps = 59/393 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y L GTP + + D+GSSL W +C C + P P + P+ SS
Sbjct: 106 GNYITRLGLGTP-TTTYVMVVDSGSSLTWL------QCAPCAV-SCHPQAGPLYDPRASS 157
Query: 162 SSQLIGCQNPKCSWIFGPNVE-SRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
+ + C P+C+ + + S C G + C Y YG G F+ G L +T
Sbjct: 158 TYAAVPCSAPQCAELQAATLNPSSCSG----SGVC-----QYQASYGDGSFSFGYLSKDT 208
Query: 220 LRFPSK-TVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRK 272
+ S + P F GC ++ + AG+ G R+ SL SQL F+YCL +
Sbjct: 209 VSLSSSGSFPGFYYGCGQDNVGLFGRAAGLIGLARNKLSLLSQLAPSVGNSFAYCLPTSA 268
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
+S L G S D+K PG SYT SSS Y+V L + V
Sbjct: 269 -----AASAGYLSFGSNS-DNKNPGKYSYTSMV-----SSSLDASLYFVSLAGMSVAGSP 317
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
+ +P S G+ I+DSG+ T + P++ A++K +G A S L
Sbjct: 318 LAVPSSEY-----GSLPTIIDSGTVITRLPTPVYTALSKA----VGAALAAPSAPAYSIL 368
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF-TDNAAGPALGR 450
+ CF K + +P + + F GGA + L P N V CL TD+ A
Sbjct: 369 QTCFKGQVAK-LPVPAVNMAFAGGATLRLTPGNVLVDVNETTTCLAFAPTDSTA------ 421
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I+G+ Q Q F + +D+ R GFA C+
Sbjct: 422 ----IIGNTQQQTFSVVYDVKGSRIGFAAGGCS 450
>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
Length = 423
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 113/441 (25%), Positives = 177/441 (40%), Gaps = 85/441 (19%)
Query: 51 DPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLS----VHSYGGYSI 106
D +I+ L S A L T+ KPK K N +N + P++ + S Y
Sbjct: 57 DTARIVSMLTSG----AGPLTTRAKPKPK------NRANPPV--PIAPGRQILSIPNYIA 104
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
GTP Q + D + W PC++ C C + P+F P +SS+ + +
Sbjct: 105 RAGLGTPAQ-TLLVAIDPSNDAAWVPCSA---CAGCA------ASSPSFSPTQSSTYRTV 154
Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS---YLLQYGLGFTAGLLLSETLRFP 223
C +P+C+ + P +CP S + L Y +L ++L
Sbjct: 155 PCGSPQCAQVPSP--------------SCPAGVGSSCGFNLTYAASTFQAVLGQDSLALE 200
Query: 224 SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLV 283
+ V ++ GC + + AG R L+ + L LV
Sbjct: 201 NNVVVSYTFGCLRVVNGNSRAAAGAHR----------LRPRAALL-------------LV 237
Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
D G + + TP NP S YYV + I VGSK V++P S L
Sbjct: 238 ADQGHLGPIGQPKRIKTTPLLYNPHRPS-----LYYVNMIGIRVGSKVVQVPQSALAFNP 292
Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
G I+D+G+ FT + P++ AV F ++ R G C+++ +V
Sbjct: 293 VTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRV----RTPVAPPLGGFDTCYNV----TV 344
Query: 404 YLPELILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAI-ILGDFQL 461
+P + F G + LP EN + V CL + AAGP+ G A+ +L Q
Sbjct: 345 SVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAM----AAGPSDGVNAALNVLASMQQ 400
Query: 462 QNFYLEFDLANDRFGFAKQKC 482
QN + FD+AN R GF+++ C
Sbjct: 401 QNQRVLFDVANGRVGFSRELC 421
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 116/407 (28%), Positives = 170/407 (41%), Gaps = 75/407 (18%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--FIPKRSS 161
Y L G+PP+ I DTGS ++W C+S C C P IP F P S
Sbjct: 90 YYTRLQLGSPPRDFYVQI-DTGSDVLWVSCSS---CNGC--PVSSGLHIPLNFFDPGSSP 143
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
++ LI C + +CS ++S C+ +N C Y QYG G T+G +S+ L
Sbjct: 144 TASLISCSDQRCSL----GLQSSDSVCAAQNNQC-----GYTFQYGDGSGTSGYYVSDLL 194
Query: 221 RFPS--------KTVPNFLAGCSILS-------DRQPAGIAGFGRSSESLPSQLGL---- 261
F + + + GCS L DR GI GFG+ S+ SQL
Sbjct: 195 HFDTILGGSVMKNSSAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGIT 254
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
+ FS+CL K DD+ LVL G+ P + YTP + Y +
Sbjct: 255 PRVFSHCL---KGDDSG-GGILVL------GEIVEPNIVYTPLVPSQ--------PHYNL 296
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L+ I V + + I S S N G I+DSG+T ++ EA FI + +
Sbjct: 297 NLQSIYVNGQTLAIDPSVFATSS--NQGTIIDSGTTLAYLT----EAAYDPFISAITSTV 350
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEVLCL 436
+ S C+ S + P++ L F GG M L P++Y ++ G + C+
Sbjct: 351 SPSVSPYLSKGNQCYLTSSSINDVFPQVSLNFAGGTSMILIPQDYLIQQSSINGAALWCV 410
Query: 437 ILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G +G I ILGD L++ +D+A R G+A C
Sbjct: 411 --------GFQKIQGQEITILGDLVLKDKIFVYDIAGQRIGWANYDC 449
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 106/401 (26%), Positives = 165/401 (41%), Gaps = 75/401 (18%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + GTPP I DTGS L+W C +CV P P F P++SS+
Sbjct: 92 YLMRFYIGTPPVERFA-IADTGSDLIWVQCAPCEKCV--------PQNAPLFDPRKSSTF 142
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG---FTAGLLLSETL 220
+ + C + C+ + P + C QY G +G+L E++
Sbjct: 143 KTVPCDSQPCTLL------------PPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESI 190
Query: 221 RFPSKT----VPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGL---KKFSYC 267
F SK P GC ++ ++ G+ G G SL SQLG +KFSYC
Sbjct: 191 NFGSKNNAIKFPKLTFGCTFSNNDTVDESKRNMGLVGLGVGPLSLISQLGYQIGRKFSYC 250
Query: 268 LLSRKFDDAPVSSNLV--LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
P+SSN + G + + G+ TP +G S +YY+ L +
Sbjct: 251 F-------PPLSSNSTSKMRFGNDAIVKQIKGVVSTPLIIKSIGPS-----YYYLNLEGV 298
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL---FEAVAKEFIRQMGNYSRA 382
+G+K VK S +DGN +++DSG++FT ++ F A+ KE Y
Sbjct: 299 SIGNKKVKTSES----QTDGN--ILIDSGTSFTILKQSFYNKFVALVKEV------YGVE 346
Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
A CF+ GK+ + P+++ F GAK+ + N F N +LC++ +
Sbjct: 347 AVKIPPLVYNFCFENKGKRKRF-PDVVFLFT-GAKVRVDASNLFEAEDNNLLCMVALPTS 404
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ I G+ + +E+DL FA CA
Sbjct: 405 DEDDS-------IFGNHAQIGYQVEYDLQGGMVSFAPADCA 438
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 112/400 (28%), Positives = 156/400 (39%), Gaps = 87/400 (21%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +++ G+PP+ + + DTGS L W RC C+
Sbjct: 1 GVYYSTITLGSPPKDFS-LVMDTGSDLTWV------RCDPCS------------------ 35
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
P CS F R + + TC Y YG G FT G L +TL
Sbjct: 36 ---------PDCSSTF-----DRLASNTYKALTC---ADDYSYGYGDGSFTQGDLSVDTL 78
Query: 221 RFPS------KTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCL 268
+ + P F+ GC L GI S S PSQ+G K KFSYCL
Sbjct: 79 KMAGAASDELEEFPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCL 138
Query: 269 LSRKFDDAPVSSNLVLDTG------PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
L + ++ S +V PGSG K L YTP +G SS + Y V L
Sbjct: 139 LRQTAQNSLKKSPMVFGEAAVELKEPGSG--KLQELQYTP-----IGESSIY---YTVRL 188
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
I VG++ + + S + G D I DSG+T T + V + + +
Sbjct: 189 DGISVGNQRLDLSPSAFLNGQDKP--TIFDSGTTLTMLP----PGVCDSIKQSLASMVSG 242
Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
A+ GL CF + LP++ F GGA P NY +G+ + CLI N
Sbjct: 243 AEFVAIKGLDACFRVPPSSGQGLPDITFHFNGGADFVTRPSNYVIDLGS-LQCLIFVPTN 301
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I G+ Q Q+F++ D+ N R GF + C
Sbjct: 302 EVS---------IFGNLQQQDFFVLHDMDNRRIGFKETDC 332
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 115/407 (28%), Positives = 171/407 (42%), Gaps = 68/407 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+PP+ I DTGS ++W C+ C + N+ ++ F P SS
Sbjct: 89 GLYFTRVKLGSPPKEYFVQI-DTGSDILWVACSPCTGCPSSSGLNI---QLEFFNPDTSS 144
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
+S I C + +C+ + E+ C+ + N C Y YG G T+G +S+T+
Sbjct: 145 TSSKIPCSDDRCTAALQTS-EAVCQ--TSDNSPC-----GYTFTYGDGSGTSGYYVSDTM 196
Query: 221 RFPS--------KTVPNFLAGCS-------ILSDRQPAGIAGFGRSSESLPSQLGL---- 261
F S + + + GCS +DR GI GFG+ S+ SQL
Sbjct: 197 YFDSVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVS 256
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
K FS+CL + D+ LVL G+ PGL YTP + Y +
Sbjct: 257 PKVFSHCL--KGSDNG--GGILVL------GEIVEPGLVYTPLVPSQ--------PHYNL 298
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L I+V + + I S + G IVDSG+T ++ ++ +
Sbjct: 299 NLESIVVNGQKLPIDSSLFT--TSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSV 356
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---ALVGNEVLCLI 437
R+ V K + CF S P + L F GG M + PENY A + N VL I
Sbjct: 357 RSL-VSKGN---QCFVTSSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCI 412
Query: 438 LFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ N +G I ILGD L++ +DLAN R G+ C+
Sbjct: 413 GWQRN-------QGQQITILGDLVLKDKIFVYDLANMRMGWTDYDCS 452
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 98.2 bits (243), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 116/415 (27%), Positives = 172/415 (41%), Gaps = 80/415 (19%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--FIPKR 159
G Y + G+PP+ I DTGS ++W C+S C C P +IP F P
Sbjct: 82 GLYFTRVQLGSPPKDFYVQI-DTGSDVLWVSCSS---CNGC--PVTSGLQIPLTFFDPGS 135
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG------FTAG 213
S+++ L+ C + +C+ ++S CS R C Y QYG G + A
Sbjct: 136 STTAALVSCSDQRCT----AGIQSSDSLCSSRTNQC-----GYTFQYGDGSGTSGYYVAD 186
Query: 214 LLLSETLRFPSKTVPNFLAG--------CSIL-------SDRQPAGIAGFGRSSESLPSQ 258
L+ +TL S + CS L SDR GI GFG+ S+ SQ
Sbjct: 187 LMHLDTLLLSSGELSQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQ 246
Query: 259 LGL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSA 313
L + FS+CL K DD+ LVL G+ P + YTP +
Sbjct: 247 LASQGITPRVFSHCL---KGDDSG-GGVLVL------GEIVEPNIVYTPLVPSQ------ 290
Query: 314 FGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
Y + L+ I V + + I S V G+ N G IVDSG+T + L E F+
Sbjct: 291 --PHYNLYLQSISVAGQTLAIDPS--VFGASSNQGTIVDSGTTLAY----LAEGAYDPFV 342
Query: 374 RQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF----ALV 429
+ + S C+ ++ + P++ L F GGA + L P++Y ++
Sbjct: 343 SAITSVVSLNARTYLSKGNQCYLVTSSVNDVFPQVSLNFAGGASLILNPQDYLLQQNSVG 402
Query: 430 GNEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G V C+ G G I ILGD L++ +D+AN R G+ C+
Sbjct: 403 GAAVWCV--------GFQKTPGQQITILGDLVLKDKIFVYDIANQRVGWTNYDCS 449
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 98.2 bits (243), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 113/387 (29%), Positives = 154/387 (39%), Gaps = 58/387 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + + GTP Q DT + W PC CV C+ F S++
Sbjct: 90 YIVKANVGTPAQTFL-MALDTSNDAAWIPCNG---CVGCSST--------VFNSVTSTTF 137
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ +GC P+C + P C G TC ++ YG L +T+
Sbjct: 138 KTLGCDAPQCKQVPNPT----CGG-----STC-----TWNTTYGGSTILSNLTRDTIALS 183
Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
+ VP + GC + S P G+ G GR S SQ L FSYCL S F
Sbjct: 184 TDIVPGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPS--FRTLN 241
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S L L GP + + TP KNP SS YYV L I VG K V IP S
Sbjct: 242 FSGTLRL--GPAGQPLR---IKTTPLLKNPRRSS-----LYYVNLIGIRVGRKIVDIPAS 291
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
L G I DSG+ FT + P++ AV EF +++GN A V G C+
Sbjct: 292 ALAFNPTTGAGTIFDSGTVFTRLVAPVYTAVRDEFRKRVGN----AIVSSLGGFDTCY-- 345
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
+ P + F G + LP +N CL + AA P ++
Sbjct: 346 --TGPIVAPTMTFMFS-GMNVTLPTDNLLIRSTAGSTSCLAM----AAAPDNVNSVLNVI 398
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ Q QN + FD+ N R G A++ C+
Sbjct: 399 ANMQQQNHRILFDVPNSRIGVAREPCS 425
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 117/464 (25%), Positives = 205/464 (44%), Gaps = 74/464 (15%)
Query: 35 PLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKT 94
PL+P + +H+ +D +I + S SR +L K + N + +
Sbjct: 18 PLSP-----FYNHTMTDTARI-EATVHRSRSRLNYLYYINKLSE------NALDNDVSLS 65
Query: 95 PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR--- 151
P V+ G Y +S + G P F+ DT + L+W C+ +CN +P +
Sbjct: 66 PTLVNEGGEYLMSFNIGNPSSQVMGFL-DTSNGLIWVQCS------NCN-SQCEPEKRGL 117
Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-F 210
F+ +S + ++ C + C+ + G + C+ +K C Y L YG
Sbjct: 118 TTKFLSSKSFTYEMEPCGSNFCNSLTG------FQTCNSSDKWC-----KYRLVYGDNKA 166
Query: 211 TAGLLLSETLRFPSKT-----VPNFLAGCS---ILSDRQP-AGIAGFGRSSESLPSQLGL 261
T+G+L S++ F + V GCS + D Q G G ++ SL SQLG+
Sbjct: 167 TSGILSSDSFGFDTSDGMLVDVGFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQLGI 226
Query: 262 KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
KKFSYCL+ F++ +S + + P + +TP L Y N + YYV
Sbjct: 227 KKFSYCLV--PFNNLGSTSKMYFGSLPVTSGGQTPLL-----YPN--------SDAYYVK 271
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
+ I +G+ + V + G I+D+G T++ +E F+++ +F+ + ++ +
Sbjct: 272 VLGISIGNDEPHFDGVFDV--YEVRDGWIIDTGITYSSLETDAFDSLLAKFLT-LKDFPQ 328
Query: 382 AADVEKKSGLRPCFDISGKKSVY-LPELILKFKGGAKMALPPENYFALVGNE-VLCLILF 439
D + K CF++ + P++ + F G A + L E+ F + ++ + CL L
Sbjct: 329 RKD-DPKERFELCFELQNANDLESFPDVTVHFDG-ADLILNVESTFVKIEDDGIFCLALL 386
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ P ILG+FQLQN+++ +DL FA CA
Sbjct: 387 RSGS--------PVSILGNFQLQNYHVGYDLEAQVISFAPVDCA 422
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 130/467 (27%), Positives = 179/467 (38%), Gaps = 93/467 (19%)
Query: 40 STKHYLHHSDSDPLKILHSLASSSLSRARHLK---TKTKPKTKDSNIGSNYSNSLIKTPL 96
S K L++S L+ + S+SR H + PK +S I +N
Sbjct: 40 SPKSPLYNSQQTHLQRWNKAMRRSVSRVHHFQRTAATVSPKEVESEIIANG--------- 90
Query: 97 SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
G Y +SLS GTPP I DTGS L+W CT +C P F
Sbjct: 91 -----GEYLMSLSLGTPP-FEILAIADTGSDLIWTQCTPCDKCY--------KQIAPLFD 136
Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLL 215
PK S + + + C +C + CS + C Y YG FT G L
Sbjct: 137 PKSSKTYRDLSCDTRQCQNL------GESSSCSSE-QLC-----QYSYYYGDRSFTNGNL 184
Query: 216 LSETLRFPSKT-----VPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---K 263
+T+ PS P + GC + D++ +GI G G SL SQ+G K
Sbjct: 185 AVDTVTLPSTNGGPVYFPKTVIGCGRRNNGTFDKKDSGIIGLGGGPMSLISQMGSSVGGK 244
Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
FSYCL+ + A SS L GSG TP +S KNP FYY+
Sbjct: 245 FSYCLVPFSSESAGNSSKLHFGRNAVVSGSGVQSTPLIS-----KNP-------DTFYYL 292
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L + VG K ++ S +I+DSG++ T F A + N
Sbjct: 293 TLEAMSVGDKKIEFGGSSFGGSEG---NIIIDSGTSLTLFPVNFFTEFATAVENAVINGE 349
Query: 381 RAADVEKKSGL-----RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLC 435
R D SGL RP D+ +P + F GA + L N F L+ ++VLC
Sbjct: 350 RTQDA---SGLLSHCYRPTPDLK------VPVITAHFN-GADVVLQTLNTFILISDDVLC 399
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
L F +G I G+ NF + +D+ F C
Sbjct: 400 LA-FNSTQSGA--------IFGNVAQMNFLIGYDIQGKSVSFKPTDC 437
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 110/392 (28%), Positives = 160/392 (40%), Gaps = 62/392 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + GTP + T +FDTGS W C CV + +P F P +S+
Sbjct: 159 GNYVVPVRLGTPAERFT-VVFDTGSDTTWVQCQP---CVAYCYRQKEP----LFDPTKSA 210
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ I C + CS ++ GCS + Y +QYG G +T G +TL
Sbjct: 211 TYANISCSSSYCSDLY-------VSGCSGGHCL-------YGIQYGDGSYTIGFYAQDTL 256
Query: 221 RFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFD 274
T+ NF GC + + AG+ G GR SLP Q K F+YCL
Sbjct: 257 TLAYDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCL------ 310
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFY--KNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
A + LD GPG+ + TP + P FYYVG+ I VG +
Sbjct: 311 PATSAGTGFLDLGPGAPAANA---RLTPMLVDRGPT--------FYYVGMTGIKVGGHVL 359
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
IP S G +VDSG+ T + + + F + M +A S L
Sbjct: 360 PIPGSVF-----STAGTLVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSA-APAFSILD 413
Query: 393 PCFDISGKK--SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
C+D++G K S+ LP + L F+GGA + + + CL F NA +
Sbjct: 414 TCYDLTGHKGGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLA-FAPNADDTDVA- 471
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + + +D+ GFA C
Sbjct: 472 ----IVGNTQQKTHGVLYDIGKKIVGFAPGAC 499
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 117/406 (28%), Positives = 168/406 (41%), Gaps = 68/406 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G + +S++ GTPP I DTGS L W C +C N P F K+SS
Sbjct: 83 GEFFMSITIGTPPMKVFA-IADTGSDLTWVQCKPCQQCYKENGP--------IFDKKKSS 133
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ + C + C + S +GC C Y YG F+ G + +ET+
Sbjct: 134 TYKSEPCDSRNCHAL-----SSSERGCDESKNVC-----KYRYSYGDQSFSKGDVATETI 183
Query: 221 RFPSKT-----VPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCL 268
S + P + GC + D +GI G G SL SQLG KKFSYCL
Sbjct: 184 SIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCL 243
Query: 269 LSRKFDDAPVSSNLVLDTGPGS---GDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQ 324
+ A + V++ G S SK G+ TP K P +YY+ L
Sbjct: 244 SHKS---ATTNGTSVINLGTNSIPSSLSKDSGVISTPLVDKEPR-------TYYYLTLEA 293
Query: 325 IIVGSKHVKIPY--SYLVPG-----SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
I VG K KIPY S P S+ +G +I+DSG+T T ++ F+ +
Sbjct: 294 ISVGKK--KIPYTGSSYNPNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVT 351
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
R +D + L CF SG + LPE+ + F GA + L P N F V +++CL
Sbjct: 352 GAKRVSD--PQGLLSHCFK-SGSAEIGLPEITVHFT-GADVRLSPINAFVKVSEDMVCLS 407
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ I G+F +F + +DL F + C+
Sbjct: 408 MVPTTEVA---------IYGNFAQMDFLVGYDLETRTVSFQRMDCS 444
>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
Length = 467
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 120/425 (28%), Positives = 170/425 (40%), Gaps = 75/425 (17%)
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
++ ++ G PPQ T + DTGS L W C V P P AF SS+
Sbjct: 60 TVPVAVGAPPQNVT-MVLDTGSELSWLLCNGSR--VPSTPPQ--PQAPAAFNGSASSTYA 114
Query: 165 LIGCQN-PKCSWIFGPN--VESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETL 220
C + P+C W G + V C G P + +C ++ L Y +A G+L ++T
Sbjct: 115 AAHCSSSPECQW-RGRDLPVPPFCAG--PPSNSCRVS-----LSYADASSADGVLAADTF 166
Query: 221 RFPSKTVPNFLAGC--------------------SILSDRQPAGIAGFGRSSESLPSQLG 260
L GC + S G+ G R S S +Q G
Sbjct: 167 LLGGAPPVRALFGCITSYSSSSTADGNGNGNDASATNSSEAATGLLGMNRGSLSFVTQTG 226
Query: 261 LKKFSYCLLSRKFDDAPVSSNLVLD-TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF-- 317
+F+YC+ D P LVL G G+ S P L+YTP + S F
Sbjct: 227 TLRFAYCIAP---GDGP--GLLVLGGDGDGAALSAAPQLNYTPLIE----MSQPLPYFDR 277
Query: 318 --YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
Y V L I VG+ + IP S L P G G +VDSG+ FTF+ + + EF+ Q
Sbjct: 278 VAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQ 337
Query: 376 MGNYSR---AADVEKKSGLRPCFDISGKK------SVYLPELILKFKGGAKMALPPENYF 426
D + CF S + S LPE+ L + GA++A+ E
Sbjct: 338 TSALLAPLGEPDFVFQGAFDACFRASEARVAAATASQLLPEVGLVLR-GAEVAVGGEKLL 396
Query: 427 ALVGNE---------VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
+V E V CL + AG + A ++G QN ++E+DL N R GF
Sbjct: 397 YMVPGERRGEGGSEAVWCLTFGNSDMAGMS-----AYVIGHHHQQNVWVEYDLQNSRVGF 451
Query: 478 AKQKC 482
A +C
Sbjct: 452 APARC 456
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 127/460 (27%), Positives = 195/460 (42%), Gaps = 83/460 (18%)
Query: 44 YLHHSDSDPLKILHS-LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYG 102
Y+ D + ++ HS LA +S + A K +G + +K+ LS+ S G
Sbjct: 54 YMFAKDEERIRYFHSRLAKNSDANASSKK-----------VGPKLAGIPLKSGLSMGS-G 101
Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
Y + + G+P + T I DTGSS W PCT + C+ P F P
Sbjct: 102 NYYVKMGLGSPTKYYT-MIVDTGSSFSWLQCQPCT-----IYCHI-----QEDPVFNPSA 150
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
S + + + C + +CS + + CS ++ C Y YG F+ G L +
Sbjct: 151 SKTYKTVPCSSSQCSSLKSATLNEPT--CSKQSNAC-----VYKASYGDSSFSLGYLSQD 203
Query: 219 TLRF-PSKTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGLK---KFSYCL- 268
L PS+T+ +F+ GC D Q GI G + S+ SQL K FSYCL
Sbjct: 204 VLTLTPSQTLSSFVYGCG--QDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLP 261
Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSY--TPFYKNPVGSSSAFGEFYYVGLRQII 326
S ++P L + T S TP SY TP KNP S Y++ L I
Sbjct: 262 TSFSTPNSPKEGFLSIGT-----SSLTPSSSYKFTPLLKNPNNPS-----LYFIDLESIT 311
Query: 327 VGSKHVKIPYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAAD 384
V + + + S Y VP I+DSG+ T + P++ + ++ + Y +A
Sbjct: 312 VAGRPLGVAASSYKVP-------TIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPG 364
Query: 385 VEKKSGLRPCF--DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
+ S L CF ++G V P++ + FKGGA + L N + + CL +
Sbjct: 365 I---SLLDTCFKGSLAGISEV-APDIRIIFKGGADLQLKGHNSLVELETGITCLAM---- 416
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G I+G++Q Q + +D+ N R GFA C
Sbjct: 417 -----AGSSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 110/390 (28%), Positives = 154/390 (39%), Gaps = 52/390 (13%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTPPQ + DT + VW PC+ C C+ N S S+
Sbjct: 28 GNYVVRAKLGTPPQLMF-MVLDTSNDAVWLPCSG---CSGCS--NASTSFNTNSSSTYST 81
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG--LGFTAGLLLSET 219
+ C +C+ G C SP+ C S+ YG F+A L+ +T
Sbjct: 82 ----VSCSTAQCTQARGLT----CPSSSPQPSVC-----SFNQSYGGDSSFSASLV-QDT 127
Query: 220 LRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKF 273
L +PNF GC + + P G+ G GR SL SQ L FSYCL S +
Sbjct: 128 LTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRS 187
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
S L L P S + YTP +NP S YYV L + VGS V
Sbjct: 188 FYFSGSLKLGLLGQPKS-------IRYTPLLRNPRRPS-----LYYVNLTGVSVGSVQVP 235
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
+ YL ++ G I+DSG+ T P++EA+ EF +Q+ +
Sbjct: 236 VDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQV----NVSSFSTLGAFDT 291
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGP 452
CF S P++ L + LP EN L CL + A+
Sbjct: 292 CF--SADNENVAPKITLHMT-SLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLN-- 346
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++ + Q QN + FD+ N R G A + C
Sbjct: 347 --VIANLQQQNLRILFDVPNSRIGIAPEPC 374
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 110/392 (28%), Positives = 160/392 (40%), Gaps = 62/392 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + GTP + T +FDTGS W C CV + +P F P +S+
Sbjct: 94 GNYVVPVRLGTPAERFT-VVFDTGSDTTWVQCQP---CVAYCYRQKEP----LFDPTKSA 145
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ I C + CS ++ GCS + Y +QYG G +T G +TL
Sbjct: 146 TYANISCSSSYCSDLY-------VSGCSGGHCL-------YGIQYGDGSYTIGFYAQDTL 191
Query: 221 RFPSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFD 274
T+ NF GC + + AG+ G GR SLP Q K F+YCL
Sbjct: 192 TLAYDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCL------ 245
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFY--KNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
A + LD GPG+ + TP + P FYYVG+ I VG +
Sbjct: 246 PATSAGTGFLDLGPGAPAANA---RLTPMLVDRGPT--------FYYVGMTGIKVGGHVL 294
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
IP S G +VDSG+ T + + + F + M +A S L
Sbjct: 295 PIPGSVF-----STAGTLVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSA-APAFSILD 348
Query: 393 PCFDISGKK--SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
C+D++G K S+ LP + L F+GGA + + + CL F NA +
Sbjct: 349 TCYDLTGHKGGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLA-FAPNADDTDVA- 406
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + + +D+ GFA C
Sbjct: 407 ----IVGNTQQKTHGVLYDIGKKIVGFAPGAC 434
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 117/408 (28%), Positives = 172/408 (42%), Gaps = 71/408 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+PP+ I DTGS ++W C S C DC + + F P SS
Sbjct: 84 GLYFTKVKLGSPPREFNVQI-DTGSDILWVTCNS---CNDCPRTSGLGIELSFFDPSSSS 139
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
++ L+ C +P C+ + V++ CSP++ C SY YG G T G +S+ L
Sbjct: 140 TTSLVSCSHPICTSL----VQTTAAECSPQSNQC-----SYSFHYGDGSGTTGYYVSDML 190
Query: 221 RFPSKTVPNFLA--------GCSILS-------DRQPAGIAGFGRSSESLPSQL---GL- 261
F + + +A GCS D+ GI GFG+ S+ SQL G+
Sbjct: 191 YFDTVLGDSLIANSSASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGIT 250
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
K FS+CL + LVL G+ P + Y+P + Y +
Sbjct: 251 PKVFSHCLKG----EGDGGGKLVL------GEILEPNIIYSPLVPSQ--------SHYNL 292
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L+ I V + +P V + N G IVDSG+T T+ L E F+ +
Sbjct: 293 NLQSISVNGQ--LLPIDPAVFATSNNQGTIVDSGTTLTY----LVETAYDPFVSAITATV 346
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
++ S C+ +S P + L F GGA M L P Y +G F+
Sbjct: 347 SSSTTPVLSKGNQCYLVSTSVDEIFPPVSLNFAGGASMVLKPGEYLMHLG--------FS 398
Query: 441 DNAAGPALG----RGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
D AA +G P I ILGD L++ +DLA+ R G+A C+
Sbjct: 399 DGAAMWCIGFQKVAEPGITILGDLVLKDKIFVYDLAHQRIGWANYDCS 446
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 108/389 (27%), Positives = 152/389 (39%), Gaps = 62/389 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y IS+ GTP T I DTGS + W C C P F P +SS+
Sbjct: 127 YVISVGLGTPAVTQTVTI-DTGSDVSWVQCNP------CPNPPCYAQTGALFDPAKSSTY 179
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE---TL 220
+ + C +C+ + E + GC N C Y +QYG G T S TL
Sbjct: 180 RAVSCAAAECAQL-----EQQGNGCGATNYEC-----QYGVQYGDGSTTNGTYSRDTLTL 229
Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFD 274
S V F GCS + Q G+ G G ++SL SQ FSYCL
Sbjct: 230 SGASDAVKGFQFGCSHVESGFSDQTDGLMGLGGGAQSLVSQTAAAYGNSFSYCL------ 283
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
P S G + G + F + S FY L+ I VG K + +
Sbjct: 284 -PPTS-------GSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGL 335
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
S GS +VDSG+ T + + A++ F M Y A +S L C
Sbjct: 336 SPSVFAAGS------VVDSGTIITRLPPTAYSALSSAFKAGMKQYRSA---PARSILDTC 386
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG-RGPA 453
FD +G+ + +P + L F GGA + L P I++ + A A G G
Sbjct: 387 FDFAGQTQISIPTVALVFSGGAAIDLDPNG------------IMYGNCLAFAATGDDGTT 434
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + F + +D+ + GF C
Sbjct: 435 GIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 122/407 (29%), Positives = 173/407 (42%), Gaps = 74/407 (18%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
++ PL+ S G Y +S+S GTPP + DTGS L+W C +C SR
Sbjct: 81 LQAPLTPGS-GEYLMSVSIGTPP-VDYIGMADTGSDLMWAQCLPCLKCYK-------QSR 131
Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGF 210
P F P +S+S + C + C I +S C C Y YG +
Sbjct: 132 -PIFDPLKSTSFSHVPCNSQNCKAI----DDSHCGA----QGVC-----DYSYTYGDQTY 177
Query: 211 TAGLLLSETLRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQLGL-----K 262
T G L E + S +V + + GC S +G+ G G SL SQ+ +
Sbjct: 178 TKGDLGFEKITIGSSSVKSVI-GCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISR 236
Query: 263 KFSYCL---LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFY 318
+FSYCL LS N V+ SG PG+ TP KNPV +Y
Sbjct: 237 RFSYCLPTLLSHANGKINFGQNAVV-----SG----PGVVSTPLISKNPV-------TYY 280
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
YV L I +G++ ++ GN VI+DSG+T +F+ L++ V ++ +
Sbjct: 281 YVTLEAISIGNER------HMASAKQGN--VIIDSGTTLSFLPKELYDGVVSSLLKVV-- 330
Query: 379 YSRAADVEKKSGLRP-CFD--ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLC 435
+A V+ CFD I+ S +P + +F GGA + L P N F V N V C
Sbjct: 331 --KAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNC 388
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
L L A P G I+G+ L NF + +DL R F C
Sbjct: 389 LTL---TPASPTDEFG---IIGNLALANFLIGYDLEAKRLSFKPTVC 429
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 112/389 (28%), Positives = 162/389 (41%), Gaps = 66/389 (16%)
Query: 104 YSISLSFGTP--PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y + +SFGTP PQ + DTGS + W +C C+ P + P + P SS
Sbjct: 79 YVVRVSFGTPAVPQV---VVIDTGSDVSWL------QCKPCSSGQCFPQKDPLYDPSHSS 129
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
+ + C + C + S C K C A + Y G T G + L
Sbjct: 130 TYSAVPCASDVCKKLAADAYGSGCT----SGKQCGFA-----ISYADGTSTVGAYSQDKL 180
Query: 221 RF-PSKTVPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
P V NF GC G+ G GR ESL ++ G FSYCL S
Sbjct: 181 TLAPGAIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGARYG-GVFSYCLPS------ 233
Query: 277 PVSSN---LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
VSS L L G K P + F P+G+ F V L I VG K +
Sbjct: 234 -VSSKPGFLALGAG------KNP----SGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLD 282
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
+ S +GG+IVDSG+ T ++ + A+ F + M Y + + L
Sbjct: 283 LRPSAF------SGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD----LDT 332
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
C++++G K+V +P++ L F GGA + L N + G CL +GP G A
Sbjct: 333 CYNLTGYKNVVVPKIALTFTGGATINLDVPNGILVNG----CLAF---AESGP---DGSA 382
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+LG+ + F + FD + +FGF + C
Sbjct: 383 GVLGNVNQRAFEVLFDTSTSKFGFRAKAC 411
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 112/389 (28%), Positives = 162/389 (41%), Gaps = 66/389 (16%)
Query: 104 YSISLSFGTP--PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y + +SFGTP PQ + DTGS + W +C C+ P + P + P SS
Sbjct: 113 YVVRVSFGTPAVPQV---VVIDTGSDVSWL------QCKPCSSGQCFPQKDPLYDPSHSS 163
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
+ + C + C + S C K C A + Y G T G + L
Sbjct: 164 TYSAVPCASDVCKKLAADAYGSGCT----SGKQCGFA-----ISYADGTSTVGAYSQDKL 214
Query: 221 RF-PSKTVPNFLAGCSILSDRQPA---GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
P V NF GC G+ G GR ESL ++ G FSYCL S
Sbjct: 215 TLAPGAIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGARYG-GVFSYCLPS------ 267
Query: 277 PVSSN---LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
VSS L L G K P + F P+G+ F V L I VG K +
Sbjct: 268 -VSSKPGFLALGAG------KNP----SGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLD 316
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
+ S +GG+IVDSG+ T ++ + A+ F + M Y + + L
Sbjct: 317 LRPSAF------SGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD----LDT 366
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
C++++G K+V +P++ L F GGA + L N + G CL +GP G A
Sbjct: 367 CYNLTGYKNVVVPKIALTFTGGATINLDVPNGILVNG----CLAF---AESGP---DGSA 416
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+LG+ + F + FD + +FGF + C
Sbjct: 417 GVLGNVNQRAFEVLFDTSTSKFGFRAKAC 445
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 127/460 (27%), Positives = 195/460 (42%), Gaps = 83/460 (18%)
Query: 44 YLHHSDSDPLKILHS-LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYG 102
Y+ D + ++ HS LA +S + A K +G + +K+ LS+ S G
Sbjct: 54 YMFAKDEERIRYFHSRLAKNSDANASFKK-----------VGPKLAGIPLKSGLSMGS-G 101
Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
Y + + G+P + T I DTGSS W PCT + C+ P F P
Sbjct: 102 NYYVKMGLGSPTKYYT-MIVDTGSSFSWLQCQPCT-----IYCHI-----QEDPVFNPSA 150
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
S + + + C + +CS + + CS ++ C Y YG F+ G L +
Sbjct: 151 SKTYKTVPCSSSQCSSLKSATLNEPT--CSKQSNAC-----VYKASYGDSSFSLGYLSQD 203
Query: 219 TLRF-PSKTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGLK---KFSYCL- 268
L PS+T+ +F+ GC D Q GI G + S+ SQL K FSYCL
Sbjct: 204 VLTLTPSQTLSSFVYGCG--QDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLP 261
Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSY--TPFYKNPVGSSSAFGEFYYVGLRQII 326
S ++P L + T S TP SY TP KNP S Y++ L I
Sbjct: 262 TSFSTPNSPKEGFLSIGT-----SSLTPSSSYKFTPLLKNPNNPS-----LYFIDLESIT 311
Query: 327 VGSKHVKIPYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRAAD 384
V + + + S Y VP I+DSG+ T + P++ + ++ + Y +A
Sbjct: 312 VAGRPLGVAASSYKVP-------TIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPG 364
Query: 385 VEKKSGLRPCF--DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
+ S L CF ++G V P++ + FKGGA + L N + + CL +
Sbjct: 365 I---SLLDTCFKGSLAGISEV-APDIRIIFKGGADLQLKGHNSLVELETGITCLAM---- 416
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G I+G++Q Q + +D+ N R GFA C
Sbjct: 417 -----AGSSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 122/412 (29%), Positives = 170/412 (41%), Gaps = 68/412 (16%)
Query: 82 NIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD 141
+ + ++ + T + V ++ Y +++S GTP + T + DTGS + W +C
Sbjct: 122 QLATGSRSATVPTTMGVGTFQ-YVVTVSLGTPGVSQTVEV-DTGSDVSWV------QCKP 173
Query: 142 CNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS 201
C+ P + R F P +SS+ + C CS + E+ C G C
Sbjct: 174 CSAPACNSQRDQLFDPAKSSTYSAVPCGADACSEL--RIYEAGCSG-----SQC-----G 221
Query: 202 YLLQYGLGF-TAGLLLSETLRF-PSKTVPNFLAGCSILSDRQPAGIAGF---GRSSESLP 256
Y++ YG G T G+ S+TL P TV FL GC AGI G GR S SL
Sbjct: 222 YVVSYGDGSNTTGVYGSDTLALAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLK 281
Query: 257 SQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSA 313
SQ FSYCL S++ S+ L G G S G + T ++ A
Sbjct: 282 SQAAGAYGGVFSYCLPSKQ------SAAGYLTLG---GPSSASGFATTGLL-----TAWA 327
Query: 314 FGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF- 372
FY V L I VG + V +P S GG +VD+G+ T + + A+ F
Sbjct: 328 APTFYMVMLTGISVGGQQVAVPASAFA------GGTVVDTGTVITRLPPTAYAALRSAFR 381
Query: 373 --IRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG 430
I G S A+ L C+D S V LP + L F GGA +AL
Sbjct: 382 GAIAPCGYPSAPAN----GILDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGIL---- 433
Query: 431 NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
CL A P G G A ILG+ Q ++F + FD GF C
Sbjct: 434 -SSGCL------AFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 114/407 (28%), Positives = 171/407 (42%), Gaps = 68/407 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+PP+ I DTGS ++W C+ C + N+ ++ F P SS
Sbjct: 89 GLYFTRVKLGSPPKEYFVQI-DTGSDILWVACSPCTGCPSSSGLNI---QLEFFNPDTSS 144
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
+S I C + +C+ + E+ C+ + N C Y YG G T+G +S+T+
Sbjct: 145 TSSKIPCSDDRCTAALQTS-EAVCQ--TSDNSPC-----GYTFTYGDGSGTSGYYVSDTM 196
Query: 221 RFPS--------KTVPNFLAGCS-------ILSDRQPAGIAGFGRSSESLPSQLGL---- 261
F + + + + GCS +DR GI GFG+ S+ SQL
Sbjct: 197 YFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVS 256
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
K FS+CL + D+ LVL G+ PGL YTP + Y +
Sbjct: 257 PKVFSHCL--KGSDNG--GGILVL------GEIVEPGLVYTPLVPSQ--------PHYNL 298
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L I+V + + I S + G IVDSG+T ++ ++ +
Sbjct: 299 NLESIVVNGQKLPIDSSLFT--TSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSV 356
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---ALVGNEVLCLI 437
R+ V K + CF S P + L F GG M + PENY A + N VL I
Sbjct: 357 RSL-VSKGN---QCFVTSSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCI 412
Query: 438 LFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ N +G I ILGD L++ +DLAN R G+ C+
Sbjct: 413 GWQRN-------QGQQITILGDLVLKDKIFVYDLANMRMGWTDYDCS 452
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 116/426 (27%), Positives = 165/426 (38%), Gaps = 93/426 (21%)
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
++ ++ GTPPQ T + DTGS L W C Y P PAF SSS
Sbjct: 56 TVPVAVGTPPQNVT-MVLDTGSELSWLLCNGSYA----------PPLTPAFNASGSSSYG 104
Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCP--------------LACPSYLLQYGLGF 210
+ C + C W G ++ +P + C LA ++LL G
Sbjct: 105 AVPCPSTACEW-RGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPP 163
Query: 211 TA-GLLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLL 269
A G + S T N + +S+ G+ G R + S +Q G ++F+YC+
Sbjct: 164 VAVGAYFGCITSYSSTTATNSNGTGTDVSEAA-TGLLGMNRGTLSFVTQTGTRRFAYCI- 221
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF----YYVGLRQI 325
AP VL G G + P L+YTP + S F Y V L I
Sbjct: 222 ------APGEGPGVLLLGDDGGVA--PPLNYTPLIE----ISQPLPYFDRVAYSVQLEGI 269
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ---------- 375
VG + IP S L P G G +VDSG+ FTF+ + A+ EF Q
Sbjct: 270 RVGCALLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGE 329
Query: 376 -----MGNYSR-----AADVEKKSGLRPCFD---------ISGKKSVYLPELILKFKGGA 416
G + A V SGL P +SG+K +Y+ + +GGA
Sbjct: 330 PGFVFQGAFDACFRGPEARVAAASGLLPVVGLVLRGAEVAVSGEKLLYMVPGERRGEGGA 389
Query: 417 KMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
+ V CL + AG + A ++G QN ++E+DL N R G
Sbjct: 390 EA--------------VWCLTFGNSDMAGMS-----AYVIGHHHQQNVWVEYDLQNGRVG 430
Query: 477 FAKQKC 482
FA +C
Sbjct: 431 FAPARC 436
>gi|357131275|ref|XP_003567264.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like, partial [Brachypodium distachyon]
Length = 364
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/262 (32%), Positives = 119/262 (45%), Gaps = 40/262 (15%)
Query: 243 AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG---LS 299
AG+ G R + S SQ G ++FSYC+ R DDA V L+L G S P L+
Sbjct: 110 AGLLGMNRGALSFVSQAGTRRFSYCISDR--DDAGV---LLL------GHSDLPNFLPLN 158
Query: 300 YTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTF 359
YTP Y+ + Y V L I+VGSK + IP S L P G G +VDSG+ FTF
Sbjct: 159 YTPLYQPSLPLPYFDRVAYSVQLLGILVGSKPLPIPASVLAPDHTGAGQTMVDSGTQFTF 218
Query: 360 MEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD--------ISGKKSVYLPELILK 411
+ G + A+ EF RQ + RA D E + FD +S LP + L+
Sbjct: 219 LLGDAYAALKAEFYRQSTPFLRALD-EPSFAFQGAFDTCFRVPRGMSPPPGRLLPSVTLR 277
Query: 412 FKGGAKMALPPENYFALVGNE-----------VLCLILFTDNAAGPALGRGPAIILGDFQ 460
F GA+M + + V E V CL F + P + A ++G
Sbjct: 278 FN-GAEMVVGGDRLLYKVPGERRGGAGADDDAVWCLT-FGNADMVPIM----AYVIGHHH 331
Query: 461 LQNFYLEFDLANDRFGFAKQKC 482
N ++E+DL R G A+ +C
Sbjct: 332 QMNLWVEYDLERGRVGLAQVRC 353
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 113/400 (28%), Positives = 162/400 (40%), Gaps = 77/400 (19%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +S+S GTPP I DTGS L W C +C P F P +S+
Sbjct: 90 GEYLMSVSIGTPP-VDYLGIADTGSDLTWAQCLPCLKCYQ--------QLRPIFNPLKST 140
Query: 162 SSQLIGCQNPKCSWIFGPN--VESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
S + C C + + V+ C Y YG ++ G L E
Sbjct: 141 SFSHVPCNTQTCHAVDDGHCGVQGVCD---------------YSYTYGDRTYSKGDLGFE 185
Query: 219 TLRFPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGL-----KKFSYCL-- 268
+ S +V + + GC S +G+ G G SL SQ+ ++FSYCL
Sbjct: 186 KITIGSSSVKSVI-GCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPT 244
Query: 269 -LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQII 326
LS N V+ PG+ TP KN V +YY+ L I
Sbjct: 245 LLSHANGKINFGENAVV---------SGPGVVSTPLISKNTV-------TYYYITLEAIS 288
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
+G++ ++ GN VI+DSG+T T + L++ V ++ + +A V+
Sbjct: 289 IGNER------HMAFAKQGN--VIIDSGTTLTILPKELYDGVVSSLLKVV----KAKRVK 336
Query: 387 KKSG-LRPCFD--ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNA 443
G L CFD I+ S+ +P + F GGA + L P N F V + V CL L A
Sbjct: 337 DPHGSLDLCFDDGINAAASLGIPVITAHFSGGANVNLLPINTFRKVADNVNCLTL---KA 393
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A P G I+G+ NF + +DL R F CA
Sbjct: 394 ASPTTEFG---IIGNLAQANFLIGYDLEAKRLSFKPTVCA 430
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 109/391 (27%), Positives = 162/391 (41%), Gaps = 55/391 (14%)
Query: 104 YSISLSFGTP--PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y ++L FGTP PQ + DTGS L W +C CN P + P F P SS
Sbjct: 122 YVVTLGFGTPAVPQV---LLIDTGSDLSWV------QCQPCNSSTCYPQKDPVFDPSASS 172
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ + C + C + + +S GC+ N + + Y +QYG G T G+ +ETL
Sbjct: 173 TYAPVPCGSEACRDL---DPDSYANGCT--NSSSGASLCQYGIQYGNGDTTVGVYSTETL 227
Query: 221 RF---PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
+ V NF GC ++ G+ G G + ESL SQ FSYCL +
Sbjct: 228 TLSPEAATVVNNFSFGCGLVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFSYCLPAG 287
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
++ + P +G + T G +TP FY V L I VG K
Sbjct: 288 N-----STAGFLALGAPATGGNNTAGFQFTPLQVVET-------TFYLVKLTGISVGGKQ 335
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
+ I + GG+I+DSG+ T + + A+ F M Y + + L
Sbjct: 336 LDIEPTVFA------GGMIIDSGTIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDED-L 388
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C+D +G +V +P + L F+GG + L + L G CL AG + G
Sbjct: 389 DTCYDFTGNTNVTVPTVALTFEGGVTIDLDVPSGVLLDG----CLAFV----AGAS--DG 438
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ + F + +D A GF C
Sbjct: 439 DTGIIGNVNQRTFEVLYDSARGHVGFRAGAC 469
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 117/404 (28%), Positives = 162/404 (40%), Gaps = 74/404 (18%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y ++ GTP + + I DTGS L W C+ +C N F+P S+
Sbjct: 11 GEYLATVRLGTPERVFS-VIVDTGSDLTWVQCSPCGKCYSQN--------DALFLPNTST 61
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
S + C + C+ + P C Y YG G T G + +T+
Sbjct: 62 SFTKLACGSALCNGLPFP--------------MCNQTTCVYWYSYGDGSLTTGDFVYDTI 107
Query: 221 RF-----PSKTVPNFLAGCSILSDRQPAG---IAGFGRSSESLPSQLGL---KKFSYCLL 269
+ VPNF GC ++ AG I G G+ S SQL KFSYCL+
Sbjct: 108 TMDGINGQKQQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLV 167
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGL---SYTPFYKNPVGSSSAFGEFYYVGLRQII 326
+ P ++ +L GD+ P L Y P NP +YYV L I
Sbjct: 168 --DWLAPPTQTSPLL-----FGDAAVPILPDVKYLPILANP-----KVPTYYYVKLNGIS 215
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG----NYSRA 382
VG + I + S G G I DSG+T T L EA KE + M YSR
Sbjct: 216 VGDNLLNISSTVFDIDSVGGAGTIFDSGTTVT----QLAEAAYKEVLAAMNASTMAYSRK 271
Query: 383 ADVEKKSGLRPCFDISGKKSV-YLPELILKFKGGAKMALPPENYFA-LVGNEVLCLILFT 440
++ S L C K + +P + F+GG M LPP NYF L ++ C
Sbjct: 272 --IDDISRLDLCLSGFPKDQLPTVPAMTFHFEGG-DMVLPPSNYFIYLESSQSYCF---- 324
Query: 441 DNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A+ P + I+G Q QNF + +D A + GF + C
Sbjct: 325 ------AMTSSPDVNIIGSVQQQNFQVYYDTAGRKLGFVPKDCV 362
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 117/439 (26%), Positives = 178/439 (40%), Gaps = 78/439 (17%)
Query: 60 ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTP 119
A S++RA H + T +S + + GGY ++ S GTPP
Sbjct: 57 ARRSINRANHFFKDSDTSTPESTV--------------IPDRGGYLMTYSVGTPP-TKIY 101
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
I DTGS +VW C +C + P F P +SSS + I C + C +
Sbjct: 102 GIADTGSDIVWLQCEPCEQCYN--------QTTPIFNPSKSSSYKNIPCSSKLCHSV--- 150
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK-----TVPNFLAG 233
R CS +N +C Y + YG + G L +TL S + P + G
Sbjct: 151 ----RDTSCSDQN-----SC-QYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKIVIG 200
Query: 234 CSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVL-D 285
C + +GI G G SL +QLG KFSYCL+ ++ SS L D
Sbjct: 201 CGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFGD 260
Query: 286 TGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSD 344
SGD G+ TP K+PV FY++ L+ VG+K V+ S G D
Sbjct: 261 AAVVSGD----GVVSTPLIKKDPV--------FYFLTLQAFSVGNKRVEFGGS--SEGGD 306
Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY 404
G +I+DSG+T T + ++ + + + R D ++ L C+ + + +
Sbjct: 307 DEGNIIIDSGTTLTLIPSDVYTNLESAVV-DLVKLDRVDDPNQQFSL--CYSLKSNEYDF 363
Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
P + + FK GA + L + F + + ++C P LG I G+ QN
Sbjct: 364 -PIITVHFK-GADVELHSISTFVPITDGIVCFAF----QPSPQLGS----IFGNLAQQNL 413
Query: 465 YLEFDLANDRFGFAKQKCA 483
+ +DL F C
Sbjct: 414 LVGYDLQQKTVSFKPTDCT 432
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 113/405 (27%), Positives = 164/405 (40%), Gaps = 67/405 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y ++ GTP + DTGS + W C RC P P F P+ S+
Sbjct: 132 GEYMAKIAVGTPAVEAL-LAMDTGSDITWLQCQPCRRCY--------PQSGPVFDPRHST 182
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGL--GFTAGLLLSET 219
S + +G P C + R G + TC Y + YG T G + ET
Sbjct: 183 SYREMGYDAPDCQAL------GRSGGGDAKRMTC-----VYAVGYGDDGSTTVGDFIEET 231
Query: 220 LRFPSKT-VPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLG-----LKKFSYCLL 269
L F VP+ GC L AGI G GR S PSQ+ + FSYCL
Sbjct: 232 LTFAGGVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCL- 290
Query: 270 SRKFDDAP---VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
+ F +P VSS L + G +G +P S+TP +N FYYV L +
Sbjct: 291 ADFFLSSPGRSVSSTLTIGDGAAAG---SPPPSFTPTVQN-----LNMATFYYVRLVGVS 342
Query: 327 VG--------SKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
VG +K+ PY+ G GGVI+DSG+ T + + A F
Sbjct: 343 VGGVRVPGVTEDDLKLDPYT-------GRGGVILDSGTAVTRLARRAYIAFRDAFRAAAV 395
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
+ + + C+ + G+ ++ +P + + F GG ++ LPP+NY V +
Sbjct: 396 DLGQVSIGGPSGFFDTCYTMGGR-AMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCF 454
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
F G I+G+ Q Q F + +++ R GFA C
Sbjct: 455 AFA------GTGDRSVSIIGNIQQQGFRVVYNIGGGRVGFAPNSC 493
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 101/401 (25%), Positives = 162/401 (40%), Gaps = 67/401 (16%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
+ LS GTPPQ F S W C+S ++C ++ F P S+S
Sbjct: 1 MDLSLGTPPQP-LNFTLAVDSGFSWVACSSSC-AINCTTASL-------FQPGLSTSHTK 51
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT-AGLLLSETLRFPS 224
+ C +P CS V + C P + SY YG F+ AG L+S+ S
Sbjct: 52 LPCGSPSCSAFSA--VSTSCG---------PSSSCSYNTSYGTNFSSAGDLVSDIATMDS 100
Query: 225 ----KTVPNFLAGCS-----ILSDRQPAGIAGFGRSSESLPSQLGL----KKFSYCLLSR 271
K N GC +L +G GF + + S QL KF YCL S
Sbjct: 101 VRNRKVAANLSLGCGRDSGGLLELLDTSGFVGFDKGNVSFMGQLSALGYRSKFIYCLPSD 160
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
F V N L S + ++YTP NP + E Y++ L I +
Sbjct: 161 TFRGKLVIGNYKLRNA-----SISSSMAYTPMITNPQAA-----ELYFINLSTISIDKNK 210
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR-----AADVE 386
++P + S+G GG ++D+ + +++ + + ++ + NY+ ++ V
Sbjct: 211 FQVPIQGFL--SNGTGGTVIDTTTFLSYLTSDFY----TQLVQAIKNYTTNLVEVSSSVA 264
Query: 387 KKSGLRPCFDISGKKSVYLP-ELILKFKGGAKMALPPENYFALVG----NEVLCLILFTD 441
G+ C++IS P L F GGA + + +F L N +C+ +
Sbjct: 265 DALGVELCYNISANSDFPPPATLTYHFLGGAGVEV--STWFLLDDSDSVNNTICMAIGRS 322
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ GP L ++G +Q + +E+DL R+GF Q C
Sbjct: 323 ESVGPNLN-----VIGTYQQLDLTVEYDLEQMRYGFGAQGC 358
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 113/406 (27%), Positives = 171/406 (42%), Gaps = 70/406 (17%)
Query: 104 YSISLSFGTPPQASTPFI-FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSS 162
Y + G+PP+ F+ DTGS ++W C+ C + N+ ++ F P SS+
Sbjct: 117 YFTRVKLGSPPKEY--FVQIDTGSDILWVACSPCTGCPSSSGLNI---QLEFFNPDTSST 171
Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLR 221
S I C + +C+ + E+ C+ + N C Y YG G T+G +S+T+
Sbjct: 172 SSKIPCSDDRCTAALQTS-EAVCQ--TSDNSPC-----GYTFTYGDGSGTSGYYVSDTMY 223
Query: 222 FPS--------KTVPNFLAGCS-------ILSDRQPAGIAGFGRSSESLPSQLGL----- 261
F + + + + GCS +DR GI GFG+ S+ SQL
Sbjct: 224 FDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSP 283
Query: 262 KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
K FS+CL + D+ LVL G+ PGL YTP + Y +
Sbjct: 284 KVFSHCL--KGSDNG--GGILVL------GEIVEPGLVYTPLVPSQ--------PHYNLN 325
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
L I+V + + I S + G IVDSG+T ++ ++ + R
Sbjct: 326 LESIVVNGQKLPIDSSLFT--TSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVR 383
Query: 382 AADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---ALVGNEVLCLIL 438
+ V K + CF S P + L F GG M + PENY A + N VL I
Sbjct: 384 SL-VSKGN---QCFVTSSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIG 439
Query: 439 FTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ N +G I ILGD L++ +DLAN R G+ C+
Sbjct: 440 WQRN-------QGQQITILGDLVLKDKIFVYDLANMRMGWTDYDCS 478
>gi|297740344|emb|CBI30526.3| unnamed protein product [Vitis vinifera]
Length = 379
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 78/252 (30%), Positives = 112/252 (44%), Gaps = 19/252 (7%)
Query: 239 DRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGL 298
D + G+ G R S S SQ+ KFSYC+ F VL G + P L
Sbjct: 128 DSKNTGLMGMNRGSLSFVSQMDFPKFSYCISDSDFSG-------VLLLGDANFSWLMP-L 179
Query: 299 SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFT 358
+YTP + Y V L I V SK + +P S VP G G +VDSG+ FT
Sbjct: 180 NYTPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGTQFT 239
Query: 359 FMEGPLFEAVAKEFIRQMGNYSRAADVEK---KSGLRPCFDI--SGKKSVYLPELILKFK 413
F+ GP++ A+ EF+ Q R + + G+ C+ + S +LP + L F+
Sbjct: 240 FLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSLMFR 299
Query: 414 GGAKMALPPENYFALVGNEVL---CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDL 470
GA+M + + V EV + FT L A ++G QN ++EFDL
Sbjct: 300 -GAEMKVSGDRLLYRVPGEVRGSDSVYCFT--FGNSDLLAVEAYVIGHHHQQNVWMEFDL 356
Query: 471 ANDRFGFAKQKC 482
R GFA+ +C
Sbjct: 357 EKSRIGFAQVQC 368
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 126/487 (25%), Positives = 191/487 (39%), Gaps = 75/487 (15%)
Query: 12 FSLLILLFTTDAGA---GSSAATVTVPLTPLSTK--HYLHHSDSDPLKILHSLASSSLSR 66
F L+ LLF+T + + T + + P+ +K ++ + + ++AS R
Sbjct: 10 FFLVALLFSTTKAVDPCATQSDTSDLSVIPIYSKCSPFVPPKQESWVNTVITMASKDPER 69
Query: 67 ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGS 126
++L T KT I V Y + + GTP Q + DT +
Sbjct: 70 LKYLSTLADQKTTAVPIAPGQQ---------VLKIANYVVRVKLGTPGQQMF-MVLDTSN 119
Query: 127 SLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCK 186
W PC+ C F + F+P S++ + C +CS +
Sbjct: 120 DAAWVPCSG---CT--GFSST------TFLPNASTTLGSLDCSGAQCSQV---------- 158
Query: 187 GCSPRNKTCPLACPSY-LLQYGLGFTAGL---LLSETLRFPSKTVPNFLAGC-SILSDRQ 241
R +CP S L G + L L+ + + + +P F GC + +S
Sbjct: 159 ----RGFSCPATGSSACLFNQSYGGDSSLTATLVQDAITLANDVIPGFTFGCINAVSGGS 214
Query: 242 --PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP 296
P G+ G GR SL SQ G FSYCL S F S +L L GP G K+
Sbjct: 215 IPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPS--FKSYYFSGSLKL--GP-VGQPKS- 268
Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
+ TP +NP S YYV L + VG V IP LV + G I+DSG+
Sbjct: 269 -IRTTPLLRNPHRPS-----LYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTV 322
Query: 357 FTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGA 416
T P++ A+ EF +Q+ + CF + + P + L F+ G
Sbjct: 323 ITRFVQPVYFAIRDEFRKQVN-----GPISSLGAFDTCFAATNEAEA--PAITLHFE-GL 374
Query: 417 KMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRF 475
+ LP EN + L CL + AA P ++ + Q QN + FD N R
Sbjct: 375 NLVLPMENSLIHSSSGSLACLSM----AAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRL 430
Query: 476 GFAKQKC 482
G A++ C
Sbjct: 431 GIARELC 437
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 105/405 (25%), Positives = 171/405 (42%), Gaps = 69/405 (17%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
+ + ++G Y + GTPP I DT S L+W C+ C P P F
Sbjct: 82 VRIPNHGEYLMRFYIGTPPVERLA-IADTASDLIWVQCSPCETCF--------PQDTPLF 132
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA---CPSYLLQYGLG-FT 211
P +SS+ + C + C+ S CPL C Y YG G T
Sbjct: 133 EPHKSSTFANLSCDSQPCT--------------SSNIYYCPLVGNLC-LYTNTYGDGSST 177
Query: 212 AGLLLSETLRFPSKTV--PNFLAGCSILSD------RQPAGIAGFGRSSESLPSQLGLK- 262
G+L +E++ F S+TV P + GC +D + GI G G SL SQLG +
Sbjct: 178 KGVLCTESIHFGSQTVTFPKTIFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQI 237
Query: 263 --KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKT-PGLSYTPFYKNPVGSSSAFGEFYY 319
KFSYCLL P +S + G+ + T G+ TP +P + +Y+
Sbjct: 238 GHKFSYCLL-------PFTSTSTIKLKFGNDTTITGNGVVSTPLIIDP-----HYPSYYF 285
Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
+ L I +G K +++ + NG +I+D G+ T++E + +G
Sbjct: 286 LHLVGITIGQKMLQVRTT-----DHTNGNIIIDLGTVLTYLEVNFYHNFVTLLREALGIS 340
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLIL 438
D+ CF + ++ P+++ +F GAK+ L P+N +F ++CL +
Sbjct: 341 ETKDDIPYPFDF--CF--PNQANITFPKIVFQFT-GAKVFLSPKNLFFRFDDLNMICLAV 395
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
D A +G + + G+ +F +E+D + FA C+
Sbjct: 396 LPDFYA-----KGFS-VFGNLAQVDFQVEYDRKGKKVSFAPADCS 434
>gi|361067987|gb|AEW08305.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125859|gb|AFG43520.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125865|gb|AFG43523.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125875|gb|AFG43528.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
Length = 134
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 56/143 (39%), Positives = 83/143 (58%), Gaps = 15/143 (10%)
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPF---YKNPVGSSSAFGEFYYVGLRQI 325
+FD+ S +VL GD P L+YTPF Y+ P SS +G +YY+GLR +
Sbjct: 1 RFDEENQKSLMVL------GDKAFPNGIPLNYTPFLTNYRAP--PSSQYGVYYYIGLRAV 52
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
+G K +K+P L + GNGG I+DSG+TFT +F+ +A F Q+ Y RA DV
Sbjct: 53 SIGGKRMKLPSKLLRFDTKGNGGTIIDSGTTFTVFHDEIFKHIAAGFASQI-EYRRAVDV 111
Query: 386 EKKSGLRPCFDISGKKSVYLPEL 408
E +G+ C+++SG +++ LPE
Sbjct: 112 EALTGMGLCYNVSGLENIVLPEF 134
>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
Length = 761
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 77/264 (29%), Positives = 118/264 (44%), Gaps = 33/264 (12%)
Query: 234 CSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS 293
C + + G+ G R S S +Q+GL+KFSYC+ + SS ++L S S
Sbjct: 431 CRTRTHSKTTGLIGMNRGSLSFVTQMGLQKFSYCISGQD------SSGILLFGE--SSFS 482
Query: 294 KTPGLSYTPFYKNPVGSSSAFGEF----YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGV 349
L YTP V S+ F Y V L I V + +++P S P G G
Sbjct: 483 WLKALKYTPL----VQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQT 538
Query: 350 IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK---KSGLRPCFDISGKKSVY-- 404
+VDSG+ FTF+ GP++ A+ EF+RQ + + + + C+ + +
Sbjct: 539 MVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPP 598
Query: 405 LPELILKFKGGAKMALPPENYFALV------GNEVLCLILFTDNAAGPALGRGPAIILGD 458
LP + L F+ GA+M++ E V + V C G + I+G
Sbjct: 599 LPTVTLMFR-GAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVE-----SYIIGH 652
Query: 459 FQLQNFYLEFDLANDRFGFAKQKC 482
QN ++EFDLA R GFA+ +C
Sbjct: 653 HHQQNVWMEFDLAKSRVGFAEVRC 676
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 35/78 (44%), Gaps = 13/78 (16%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
LS H ++SL+ G+PPQ T + DTGS L W C P+ F
Sbjct: 367 LSFHHNVSLTVSLTVGSPPQTVT-MVLDTGSELSWLHCKK------------APNLHSVF 413
Query: 156 IPKRSSSSQLIGCQNPKC 173
P RSSS I C +P C
Sbjct: 414 DPLRSSSYSPIPCTSPTC 431
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 107/405 (26%), Positives = 160/405 (39%), Gaps = 87/405 (21%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +++ GTP T +FDTGS W C CV + + F P RSS
Sbjct: 176 GNYVVTVGLGTPASRYT-VVFDTGSDTTWVQCQP---CVVVCYEQQEK----LFDPVRSS 227
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ + C P CS + N+ GCS + Y +QYG G ++ G +TL
Sbjct: 228 TYANVSCAAPACSDL---NIH----GCSGGHCL-------YGVQYGDGSYSIGFFAMDTL 273
Query: 221 RFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKF 273
S V F GC ++ + AG+ G GR SLP Q K F++CL +R
Sbjct: 274 TLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARST 333
Query: 274 DDA--------------PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
+++ ++ D GP FYY
Sbjct: 334 GTGYLDFGAGSPAAASARLTTPMLTDNGP---------------------------TFYY 366
Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV--AKEFIRQMG 377
+G+ I VG + + IP S G IVDSG+ T + P + ++ A
Sbjct: 367 IGMTGIRVGGQLLSIPQSVFA-----TAGTIVDSGTVITRLPPPAYSSLRYAFAAAMAAR 421
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
Y +A V S L C+D +G V +P + L F+GGA++ + +CL
Sbjct: 422 GYKKAPAV---SLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLA 478
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
F N G +G I+G+ QL+ F + +D+ GF C
Sbjct: 479 -FAANEDGGDVG-----IVGNTQLKTFGVAYDIGKKVVGFYPGVC 517
>gi|224090425|ref|XP_002308984.1| predicted protein [Populus trichocarpa]
gi|222854960|gb|EEE92507.1| predicted protein [Populus trichocarpa]
Length = 416
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 95/347 (27%), Positives = 151/347 (43%), Gaps = 55/347 (15%)
Query: 169 QNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSK--- 225
+NP C+ R K C+ K C L+ + + G T+ L + + S
Sbjct: 71 KNPSCNTAQCSLAVYRLKTCTVDKKFCVLSPDNTATRTG---TSDYLTQDVVSIQSTDGS 127
Query: 226 ------TVPNFLAGCS---ILSD--RQPAGIAGFGRSSESLPSQLGL-----KKFSYCLL 269
+VPNFL C+ IL + G+AG GR+ SLPSQ KKF+ CL
Sbjct: 128 NPGRVVSVPNFLFSCAPTFILQGLAKGVKGMAGLGRTKISLPSQFSAAFSFPKKFAICLT 187
Query: 270 SRKFDDAPVSSNLVLDTGP----GSGDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYV 320
S ++ GP D + L YTP NPV ++S + E Y++
Sbjct: 188 SSNAKGV-----VIFGDGPYVLLPHADDLSQSLIYTPLILNPVSTASGYFEGEPSTDYFI 242
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
G++ I + V + S L +G GG + + + +T ME ++ AV F+R++
Sbjct: 243 GVKSIKINENVVPLNASLLSINREGYGGTKISTVNAYTVMETTIYNAVTDSFVRELAK-- 300
Query: 381 RAADVEKKSGLRP---CFDISGKKSVYL------PELILKFKGGAKMALPPENYFALVGN 431
A+V + + + P CF+ S + +L+L+ K + N V +
Sbjct: 301 --ANVPRVASVAPFGACFNSKNIGSTRVGPAVPQIDLVLQSK-NVYWRIFGANSMVQVKD 357
Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
+VLCL F D P +I++G QL++ L+FDLA R GF+
Sbjct: 358 DVLCL-GFVDGGVNPR----TSIVIGGHQLEDNLLQFDLAASRLGFS 399
>gi|383125861|gb|AFG43521.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
Length = 134
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 56/143 (39%), Positives = 83/143 (58%), Gaps = 15/143 (10%)
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPF---YKNPVGSSSAFGEFYYVGLRQI 325
+FD+ S +VL GD P L+YTPF Y+ P SS +G +YY+GLR +
Sbjct: 1 RFDEENQKSLMVL------GDKAFPNGIPLNYTPFLTNYRAP--PSSQYGVYYYIGLRAV 52
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
+G K +K+P L + GNGG I+DSG+TFT +F+ +A F Q+ Y RA DV
Sbjct: 53 SIGGKRMKLPSKLLRFDAKGNGGTIIDSGTTFTVFHDEIFKHIAAGFASQI-EYRRAVDV 111
Query: 386 EKKSGLRPCFDISGKKSVYLPEL 408
E +G+ C+++SG +++ LPE
Sbjct: 112 EALTGMGLCYNVSGLENIVLPEF 134
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 113/391 (28%), Positives = 165/391 (42%), Gaps = 59/391 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +++ GTP T +FDTGS W C CV + R F P RSS
Sbjct: 178 GNYVVTVGLGTPVSRYT-VVFDTGSDTTWVQCQP---CVVVCYEQ----REKLFDPARSS 229
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ + C P CS + N+ GCS + Y +QYG G ++ G +TL
Sbjct: 230 TYANVSCAAPACSDL---NIH----GCSGGHCL-------YGVQYGDGSYSIGFFAMDTL 275
Query: 221 RFPS-KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKF 273
S V F GC ++ + AG+ G GR SLP Q K F++CL +R
Sbjct: 276 TLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARS- 334
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
+ LD G GS + + L+ TP + + FYYVG+ I VG + +
Sbjct: 335 -----TGTGYLDFGAGSLAAASARLT-TPMLTDNGPT------FYYVGMTGIRVGGQLLS 382
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV--AKEFIRQMGNYSRAADVEKKSGL 391
IP S G IVDSG+ T + + ++ A Y +A V S L
Sbjct: 383 IPQSVFA-----TAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAV---SLL 434
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C+D +G V +P + L F+GGA++ + +CL F N G +G
Sbjct: 435 DTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLA-FAANEDGGDVG-- 491
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ QL+ F + +D+ GF C
Sbjct: 492 ---IVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 133/473 (28%), Positives = 193/473 (40%), Gaps = 79/473 (16%)
Query: 27 SSAATVTVPLTPLSTKHYLHH------SDSDPLKILHSLASSSLSRARHLKTKTKPKTKD 80
S++ +TVPL H+ H S+ P + L L RA ++K K K
Sbjct: 56 STSGGITVPL------HHRHGPCSPVPSNKMPASLEERLQRDQL-RAAYIKRKFS-GAKG 107
Query: 81 SNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCV 140
++ + + ++ T + S Y I++ G+P T DTGS + W C C
Sbjct: 108 GDVEQSDAATVPTTLGTSLSTLEYVITVGIGSPAVTQT-MSMDTGSDVSWVQCKP---CS 163
Query: 141 DCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP 200
C+ VD F P SS+ C + C + + + GCS + C
Sbjct: 164 QCH-SEVDS----LFDPSASSTYSPFSCSSAACVQL---SQSQQGNGCS--SSQC----- 208
Query: 201 SYLLQYGLGF-TAGLLLSETLRFPSKTVPNFLAGCSI-----LSDRQPAGIAGFGRSSES 254
Y++ Y G T G S+TL S + F GCS SD Q G+ G G ++S
Sbjct: 209 QYIVSYVDGSSTTGTYSSDTLTLGSNAIKGFQFGCSQSESGGFSD-QTDGLMGLGGDAQS 267
Query: 255 LPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGL-SYTPFYKNPVGS 310
L SQ K FSYCL P PGS T G S + F K P+
Sbjct: 268 LVSQTAGTFGKAFSYCL-------PPT---------PGSSGFLTLGAASRSGFVKTPMLR 311
Query: 311 SSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAK 370
S+ +Y V L I VG + + IP S GS ++DSG+ T + + A++
Sbjct: 312 STQIPTYYGVLLEAIRVGGQQLNIPTSVFSAGS------VMDSGTVITRLPPTAYSALSS 365
Query: 371 EFIRQMGNYSRAADVEKKSG-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
F M Y A + SG L CFD SG+ SV +P + L F GGA + L +
Sbjct: 366 AFKAGMKKYPPA----QPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVNLDFNGIMLEL 421
Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
N L F N+ +LG +G+ Q + F + +D+ GF C
Sbjct: 422 DNWCLA---FAANSDDSSLG-----FIGNVQQRTFEVLYDVGGGAVGFRAGAC 466
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 119/437 (27%), Positives = 171/437 (39%), Gaps = 82/437 (18%)
Query: 55 ILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPP 114
+ H LA + +RA + + T+ G +S ++ G Y S+ GTPP
Sbjct: 99 LAHRLARDA-ARAEAISVSARNVTR---AGGGFSAPVVSG--LAQGSGEYFASVGVGTPP 152
Query: 115 QASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS 174
+ + DTGS +VW +C C R+ F P+RS S + C P C
Sbjct: 153 TPAL-LVLDTGSDVVWL------QCAPCRQCYAQSGRV--FDPRRSRSYAAVRCGAPPCR 203
Query: 175 WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFP-SKTVPNFLA 232
+ C R TC Y + YG G TAG L +ETL F VP
Sbjct: 204 GLDAGGGGG----CDRRRGTC-----LYQVAYGDGSVTAGDLATETLWFARGARVPRVAV 254
Query: 233 GCSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDT 286
GC ++ AG+ G GR SLP+Q ++FSYC D + +
Sbjct: 255 GCGHDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFSYCFQGSDLDHRTIIRTVHQHV 314
Query: 287 GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGN 346
G G R VG + ++ L P S G
Sbjct: 315 G---------------------------------GARVRGVGERSLR-----LDP-STGR 335
Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP 406
GGVI+DSG++ T + P++ AV + F G A S C+D+ G++ V +P
Sbjct: 336 GGVILDSGTSVTRLARPVYVAVREAFRAAAGGLRLAPG--GFSLFDTCYDLRGRRVVKVP 393
Query: 407 ELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNFY 465
+ + GGA++ALPPENY V CL L + G I+G+ Q Q F
Sbjct: 394 TVSVHLAGGAEVALPPENYLIPVDTRGTFCLALAGTD--------GGVSIVGNIQQQGFR 445
Query: 466 LEFDLANDRFGFAKQKC 482
+ FD R + C
Sbjct: 446 VVFDGDRQRVALVPKSC 462
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 114/410 (27%), Positives = 171/410 (41%), Gaps = 75/410 (18%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPK 158
G Y + L GTP + I DT S LVW PC S YR +D P F P+
Sbjct: 86 GEYLVKLGIGTPQHYFSAAI-DTASDLVWLQCQPCVSCYRQLD-----------PIFNPR 133
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
SSS ++ C + CS + G RC ++ AC G T G L +
Sbjct: 134 LSSSYAVVPCSSDTCSQLDG----HRC------DEDDDQACRYNYKYSGNAVTNGTLAID 183
Query: 219 TLRFPSKTVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYCL---LSR 271
L + GCS S Q +G+ G R SL SQL +++F YCL +SR
Sbjct: 184 KLAVGGNVFHAVVLGCSDSSVGGPPPQASGLVGLARGPLSLLSQLSVRRFMYCLPPPMSR 243
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK- 330
LVL G G+ + T + SS+ + +YY+ + VG +
Sbjct: 244 ------TPGKLVLGAGAGADAVRNVSDRVTV----TMSSSTRYPSYYYLNFDGLAVGDQT 293
Query: 331 --HVKIPYS-----------YLVPGSDGNG-GVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
++ P S GS N G+IVD ST +F+E L++ +A + ++
Sbjct: 294 PGTIRRPTSPPATGGGVGGGGGDGGSGANAYGMIVDVASTISFLEASLYDELADDLEEEI 353
Query: 377 GNYSRAADVEKKSGLRPCFDI---SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV 433
RA + GL CF + G VY+P + + F G + L + F L +
Sbjct: 354 -RLPRATP-STRLGLDLCFILPEGVGIDRVYVPTVSMSFD-GRWLELERDRLF-LEDGRM 409
Query: 434 LCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+CL+ +GR + ILG++Q QN ++ ++L + FAK C
Sbjct: 410 MCLM----------IGRTSGVSILGNYQQQNMHVLYNLRRGKITFAKASC 449
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 103/376 (27%), Positives = 163/376 (43%), Gaps = 54/376 (14%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
I DTGS L W C C + P DPS SSS + + C + C +
Sbjct: 151 LIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSV--------SSSYKTVFCNSSTCQDLVAA 202
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILS 238
S C N C Y++ YG G +T G L SE++ + N + GC +
Sbjct: 203 TGNS--GPCGGFNGVVKTTCE-YVVSYGDGSYTRGDLASESIVLGDTKLENLVFGCGRNN 259
Query: 239 DR---QPAGIAGFGRSSESLPSQLGLKKF----SYCLLSRKFDDAPVSSNLVLDTGPGSG 291
+G+ G GRSS SL SQ LK F SYCL S + D A + + D S
Sbjct: 260 KGLFGGASGLMGLGRSSVSLVSQT-LKTFNGVFSYCLPSLE-DGASGTLSFGNDF---SV 314
Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
+ + YTP +NP FY + L +G +K L G G+++
Sbjct: 315 YKNSTSVFYTPLVQNP-----QLRSFYILNLTGASIGGVELKT----LSFGR----GILI 361
Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
DSG+ T + +++AV EF++Q + A S L CF+++ + + +P + +
Sbjct: 362 DSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGY---SILDTCFNLTSYEDISIPTIKMI 418
Query: 412 FKGGAKMALPPENYFALVGNE--VLCLILFT---DNAAGPALGRGPAIILGDFQLQNFYL 466
F+G A++ + F V + ++CL L + +N G I+G++Q +N +
Sbjct: 419 FEGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---------IIGNYQQKNQRV 469
Query: 467 EFDLANDRFGFAKQKC 482
+D +R G A + C
Sbjct: 470 IYDTTQERLGIAGENC 485
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 104/390 (26%), Positives = 157/390 (40%), Gaps = 64/390 (16%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y +S+ GTP + +FDTGS L W +C CN N P F P +S++
Sbjct: 188 YIVSVGLGTP-RRDLLVVFDTGSDLSWV------QCKPCN--NCYKQHDPLFDPSQSTTY 238
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
+ C +C + TC Y + YG + T G L +TL
Sbjct: 239 SAVPCGAQECL----------------DSGTCSSGKCRYEVVYGDMSQTDGNLARDTLTL 282
Query: 223 --PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
S + F+ GC + G+ G GR SL SQ + FSYCL S
Sbjct: 283 GPSSDQLQGFVFGCGDDDTGLFGRADGLFGLGRDRVSLASQAAARYGAGFSYCLPSSWRA 342
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
+ +S + P +T + S FYY+ L I V + V++
Sbjct: 343 EGYLSLGSA---------AAPPHAQFTAMV-----TRSDTPSFYYLDLVGIKVAGRTVRV 388
Query: 335 -PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
P + PG+ ++DSG+ T + + A+ F M Y RA + S L
Sbjct: 389 APAVFKAPGT------VIDSGTVITRLPSRAYSALRSSFAGFMRRYKRAPAL---SILDT 439
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
C+D +G+ V +P + L F GGA + L + CL F N ++G
Sbjct: 440 CYDFTGRTKVQIPSVALLFDGGATLNLGFGGVLYVANRSQACLA-FASNGDDTSVG---- 494
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
ILG+ Q + F + +DLAN + GF + C+
Sbjct: 495 -ILGNMQQKTFAVVYDLANQKIGFGAKGCS 523
>gi|357440775|ref|XP_003590665.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
gi|355479713|gb|AES60916.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
Length = 435
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 107/398 (26%), Positives = 168/398 (42%), Gaps = 84/398 (21%)
Query: 121 IFDTGSSLVWFPCTSRY----------RCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
I D G +W C ++Y R C+ N D PK GC N
Sbjct: 63 IVDLGGQFLWVDCENKYISSTYRPARCRSAQCSLANSDGCGDCFSSPKP-------GCNN 115
Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYL------LQYGLGFTAGLLLSETLRFPS 224
C G +P N A L +Q GF G + + RF
Sbjct: 116 NTC-------------GVTPDNSITHTATSGELAEDVLSIQSSNGFNPGQNVVVS-RFLF 161
Query: 225 KTVPNFL-AGCSILSDRQPAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRK----FD 274
P FL G + +G+AG GR+ +LPSQL +KF+ CL S K F
Sbjct: 162 SCAPTFLLKGLAT----GASGMAGLGRTKIALPSQLASAFSFARKFAICLSSSKGVVLFG 217
Query: 275 DAPVS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIV 327
D P N+V D+ DS L+YTP NPV ++SAF + Y++G++ I +
Sbjct: 218 DGPYGFLPNVVFDS-----DS----LTYTPLLINPVSTASAFSQGQPSAEYFIGVKTIKI 268
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
K V + S L ++G GG + + +T +E +++AV F++ S A ++++
Sbjct: 269 DEKVVSLNTSLLSIDNNGVGGTKISTVDPYTVLEASIYKAVTDAFVKA----SAARNIKR 324
Query: 388 KSGLRP---CF-DISGKK---SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
+ P C+ +++G + +V EL L+ + N + +EVLCL
Sbjct: 325 VGSVAPFEFCYTNLTGTRLGAAVPTIELFLQ-NENVVWRIFGANSMVSINDEVLCLGFVN 383
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
+I++G +QL+N L+FDLA + GF+
Sbjct: 384 GGK-----NTRTSIVIGGYQLENNLLQFDLAASKLGFS 416
>gi|383125857|gb|AFG43519.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125863|gb|AFG43522.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125867|gb|AFG43524.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125869|gb|AFG43525.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125871|gb|AFG43526.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125873|gb|AFG43527.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125877|gb|AFG43529.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
Length = 134
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 56/143 (39%), Positives = 83/143 (58%), Gaps = 15/143 (10%)
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPF---YKNPVGSSSAFGEFYYVGLRQI 325
+FD+ S +VL GD P L+YTPF Y+ P SS +G +YY+GLR +
Sbjct: 1 RFDEENQKSLMVL------GDKAFPTGIPLNYTPFLTNYRAP--PSSQYGVYYYIGLRAV 52
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
+G K +K+P L + GNGG I+DSG+TFT +F+ +A F Q+ Y RA DV
Sbjct: 53 SIGGKRMKLPSKLLRFDTKGNGGTIIDSGTTFTVFHDEIFKHIAAGFASQI-EYRRAVDV 111
Query: 386 EKKSGLRPCFDISGKKSVYLPEL 408
E +G+ C+++SG +++ LPE
Sbjct: 112 EALTGMGLCYNVSGLENIVLPEF 134
>gi|388505490|gb|AFK40811.1| unknown [Medicago truncatula]
Length = 193
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 65/184 (35%), Positives = 87/184 (47%), Gaps = 20/184 (10%)
Query: 301 TPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFM 360
TP NP+ S FYY+ L I VG + I S DG+GGVI+DSG+T T++
Sbjct: 25 TPLITNPLQPS-----FYYISLEVISVGDTKLSIEQSTFEVSDDGSGGVIIDSGTTITYI 79
Query: 361 EGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI-SGKKSVYLPELILKFKGGAKMA 419
E F+++ KEF Q D +GL CF + SGK V +P+L+ FKGG +
Sbjct: 80 EENAFDSLKKEFTSQT---KLPVDKSGSTGLDVCFSLPSGKTEVEIPKLVFHFKGG-DLE 135
Query: 420 LPPENYF-ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
LP ENY A V CL + N I G+ Q QN + DL + F
Sbjct: 136 LPGENYMIADSSLGVACLAMGASNGMS---------IFGNIQQQNILVNHDLQKETITFI 186
Query: 479 KQKC 482
+C
Sbjct: 187 PTQC 190
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 156/379 (41%), Gaps = 51/379 (13%)
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC--SWIFG 178
I DTGS L W C C C R P F P S+S + C C S
Sbjct: 180 IVDTGSDLTWVQCKP---CSVCY-----AQRDPLFDPSGSASYAAVPCNASACEASLKAA 231
Query: 179 PNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSIL 237
V C Y L YG G F+ G+L ++T+ +V F+ GC L
Sbjct: 232 TGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCG-L 290
Query: 238 SDRQ----PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGS 290
S+R AG+ G GR+ SL SQ + FSYCL + DA S +L DT S
Sbjct: 291 SNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTS--S 348
Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVI 350
+ TP +SYT +P A FY++ + VG V V+
Sbjct: 349 YRNATP-VSYTRMIADP-----AQPPFYFMNVTGASVGGAAVAAAGLGAA-------NVL 395
Query: 351 VDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELIL 410
+DSG+ T + ++ AV EF RQ G R S L C++++G V +P L L
Sbjct: 396 LDSGTVITRLAPSVYRAVRAEFARQFGA-ERYPAAPPFSLLDACYNLTGHDEVKVPLLTL 454
Query: 411 KFKGGAKMALPPEN--YFALVGNEVLCLIL----FTDNAAGPALGRGPAIILGDFQLQNF 464
+ +GGA M + + A +CL + F D I+G++Q +N
Sbjct: 455 RLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTP----------IIGNYQQKNK 504
Query: 465 YLEFDLANDRFGFAKQKCA 483
+ +D R GFA + C+
Sbjct: 505 RVVYDTVGSRLGFADEDCS 523
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 105/406 (25%), Positives = 167/406 (41%), Gaps = 69/406 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTPP+ + DTGS ++W C + +C + +D + + PK SS
Sbjct: 84 GLYYTEIKLGTPPKHYYVQV-DTGSDILWVNCITCEQCPHKSGLGLD---LTLYDPKASS 139
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
+ ++ C C+ FG + +C P C Y + YG G T G +++ L
Sbjct: 140 TGSMVMCDQAFCAATFGGKLP-KCGANVP--------C-EYSVTYGDGSSTIGSFVTDAL 189
Query: 221 RFPS-----KTVP---NFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL---- 261
+F +T P + + GC S++ GI GFG ++ S+ SQL
Sbjct: 190 QFDQVTRDGQTQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKV 249
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGS---GDSKTPGLSYTPFYKNPVGSSSAFGEF 317
K F++CL + K G G GD P + TP +
Sbjct: 250 KKIFAHCLDTIK--------------GGGIFSIGDVVQPKVKTTPLVADK--------PH 287
Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
Y V L+ I VG +++P PG G I+DSG+T T++ E V KE + +
Sbjct: 288 YNVNLKTIDVGGTTLQLPAHIFEPGE--KKGTIIDSGTTLTYLP----ELVFKEVMLAVF 341
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
N + G CF G P + F+ + + P YF GN+V C +
Sbjct: 342 NKHQDITFHDVQGFL-CFQYPGSVDDGFPTITFHFEDDLALHVYPHEYFFANGNDVYC-V 399
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
F + A+ G+ +++GD L N + +DL N G+ C+
Sbjct: 400 GFQNGASQSKDGK-DIVLMGDLVLSNKLVIYDLENRVIGWTDYNCS 444
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 156/379 (41%), Gaps = 51/379 (13%)
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC--SWIFG 178
I DTGS L W C C C R P F P S+S + C C S
Sbjct: 179 IVDTGSDLTWVQCKP---CSVCY-----AQRDPLFDPSGSASYAAVPCNASACEASLKAA 230
Query: 179 PNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSIL 237
V C Y L YG G F+ G+L ++T+ +V F+ GC L
Sbjct: 231 TGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCG-L 289
Query: 238 SDRQ----PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGS 290
S+R AG+ G GR+ SL SQ + FSYCL + DA S +L DT S
Sbjct: 290 SNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTS--S 347
Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVI 350
+ TP +SYT +P A FY++ + VG V V+
Sbjct: 348 YRNATP-VSYTRMIADP-----AQPPFYFMNVTGASVGGAAVAAAGLGAA-------NVL 394
Query: 351 VDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELIL 410
+DSG+ T + ++ AV EF RQ G R S L C++++G V +P L L
Sbjct: 395 LDSGTVITRLAPSVYRAVRAEFARQFGA-ERYPAAPPFSLLDACYNLTGHDEVKVPLLTL 453
Query: 411 KFKGGAKMALPPEN--YFALVGNEVLCLIL----FTDNAAGPALGRGPAIILGDFQLQNF 464
+ +GGA M + + A +CL + F D I+G++Q +N
Sbjct: 454 RLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTP----------IIGNYQQKNK 503
Query: 465 YLEFDLANDRFGFAKQKCA 483
+ +D R GFA + C+
Sbjct: 504 RVVYDTVGSRLGFADEDCS 522
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 99/389 (25%), Positives = 149/389 (38%), Gaps = 55/389 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSR-YRCVDCNFPNVDPSRIPAFIPKRSSS 162
Y ++L+ GTPPQ + I D G LVW C RC + P D + F P+
Sbjct: 51 YVVNLTIGTPPQPVSAII-DIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPE---- 105
Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS-YLLQYGLGFTAGLLLSETLR 221
P + C+ R+ Y G T G + ++ +
Sbjct: 106 ----------------PCGAAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVA 149
Query: 222 FPSKTVPNFLAGCSILSDRQP----AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
+ GC++ S+ +G G GR++ SL +Q+ FSYCL D
Sbjct: 150 IGTAATARLAFGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYCLAP---PDTG 206
Query: 278 VSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
SS L L G+G G TPF K S Y + L I G+ + +
Sbjct: 207 KSSALFLGASAKLAGAGK----GAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAGNATIAM 262
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
P S GN ++V + + T + ++ + K +G V+ P
Sbjct: 263 PQS-------GNT-IMVSTATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPK 314
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
SG P+L+L F+GGA+M +P +Y GN+ C+ + PALG
Sbjct: 315 ASASGGA----PDLVLAFQGGAEMTVPVSSYLFDAGNDTACVAIL----GSPALGG--VS 364
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
ILG Q N +L FDL + F C+
Sbjct: 365 ILGSLQQVNIHLLFDLDKETLSFEPADCS 393
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 127/503 (25%), Positives = 213/503 (42%), Gaps = 87/503 (17%)
Query: 1 MAACPFSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKIL---H 57
MA+ LI + SLL+ L +G S+ + +P +P HH S P IL H
Sbjct: 1 MASLWTQLISMASLLLSLARWVPVSGDSSNVLLLP-SP-------HHEGSRPAMILPLHH 52
Query: 58 SLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQAS 117
S+ SS S H + + K DS ++ N+ ++ + G Y+ L GTPPQ
Sbjct: 53 SVPDSSFS---HFNPRRQLKESDSE---HHPNARMRLYDDLLRNGYYTARLWIGTPPQRF 106
Query: 118 TPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIF 177
I DTGS++ + PC++ C C + P F P+ S + Q + KC+W
Sbjct: 107 A-LIVDTGSTVTYVPCST---CRHCG-----SHQDPKFRPEDSETYQPV-----KCTW-- 150
Query: 178 GPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTV---PNFLAG 233
+C C K C +Y +Y + ++G L + + F ++T + G
Sbjct: 151 ------QC-NCDNDRKQC-----TYERRYAEMSTSSGALGEDVVSFGNQTELSPQRAIFG 198
Query: 234 CS-----ILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGP 288
C + +++ GI G GR S+ QL KK ++S F +
Sbjct: 199 CENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKK----VISDSFSLCYGGMGVGGGAMV 254
Query: 289 GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS-DGNG 347
G S + +T +PV S +Y + L++I V K + +L P DG
Sbjct: 255 LGGISPPADMVFT--RSDPVRSP-----YYNIDLKEIHVAGKRL-----HLNPKVFDGKH 302
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF-----DISGKKS 402
G ++DSG+T+ ++ F A +++ + R + + + CF D+S + S
Sbjct: 303 GTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRYN-DICFSGAEIDVS-QIS 360
Query: 403 VYLPELILKFKGGAKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPAIILGDFQ 460
P + + F G K++L PENY CL +F++ G P +LG
Sbjct: 361 KSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSN-------GNDPTTLLGGIV 413
Query: 461 LQNFYLEFDLANDRFGFAKQKCA 483
++N + +D + + GF K C+
Sbjct: 414 VRNTLVMYDREHTKIGFWKTNCS 436
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 113/393 (28%), Positives = 164/393 (41%), Gaps = 64/393 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC-VDCNFPNVDPSRIPAFIPKRS 160
G Y + GTP + S + DTGSSL W C+ C V C+ + P F P+ S
Sbjct: 119 GNYVTRMGLGTPAK-SYVMVVDTGSSLTWLQCSP---CLVSCHRQSG-----PVFNPRSS 169
Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSET 219
SS + C P+C + + CS N Y YG F+ G L +T
Sbjct: 170 SSYASVSCSAPQCDALTTATLNPST--CSTSNVCI------YQASYGDSSFSVGYLSKDT 221
Query: 220 LRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKF 273
+ F S +VPNF GC ++ Q AG+ G R+ SL QL FSYCL +
Sbjct: 222 VSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSS 281
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
+S PG SYTP K+ + S Y++ + I V K +
Sbjct: 282 SSGYLSIGSY---NPGQ-------YSYTPMAKSSLDDS-----LYFIKMTGITVAGKPLS 326
Query: 334 I---PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+ YS L I+DSG+ T + ++ A++K M RA+ S
Sbjct: 327 VSASAYSSL--------PTIIDSGTVITRLPTDVYSALSKAVAGAMKGTPRASAF---SI 375
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
L CF + + +P++ + F GGA + L N V + CL A PA
Sbjct: 376 LDTCFQGQASR-LRVPQVSMAFAGGAALKLKATNLLVDVDSATTCL------AFAPARS- 427
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A I+G+ Q Q F + +D+ N + GFA C+
Sbjct: 428 --AAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 458
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 119/476 (25%), Positives = 190/476 (39%), Gaps = 87/476 (18%)
Query: 48 SDSDPLKIL-HSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGG-YS 105
SD++ L + H L ++ R+R P+ + ++ N ++ V S GG Y
Sbjct: 34 SDTESLNLTDHELLRRAIQRSRDRLASIAPRL----LPTSSRNKVVVAEAPVLSAGGEYL 89
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
+ L GTP T I DT S L+W C CV C + +DP F P S+S +
Sbjct: 90 VKLGLGTPQHCFTAAI-DTASDLIWTQCQP---CVKC-YKQLDP----VFNPVASTSYAV 140
Query: 166 IGCQNPKCSWIFGPNVESRC--KGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
+ C + C + RC G S C Y YG T G+L + L
Sbjct: 141 VPCNSDTCDELD----THRCARDGDSDDEDAC-----QYTYSYGGNATTRGILAVDRLAI 191
Query: 223 PSKTVPNFLAGCSILSDRQP----AGIAGFGRSSESLPSQLGLKKFSYCL---LSRKFDD 275
+ GCS S P +G+ G GR + SL SQL +++F YCL +SR
Sbjct: 192 GDDVFRGVVFGCSSSSVGGPPPQVSGVVGLGRGALSLVSQLSVRRFMYCLPPPVSRS--- 248
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV--- 332
+ LVL D+ + + P+ + S + +YY+ L I +G + +
Sbjct: 249 ---AGRLVL-----GADAAATVRNASERVVVPMSTGSRYPSYYYLNLDGISIGDRAMSFR 300
Query: 333 -KIPYSYLVPGSDGNG---------------------GVIVDSGSTFTFMEGPLFEAVAK 370
+ + PG+ G+I+D ST TF+E L+E +
Sbjct: 301 SRNRMNATTPGTAAGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLYEEMVD 360
Query: 371 EFIRQMGNYSRAADVEKKSGLRPCFDISG---KKSVYLPELILKFKGGAKMALPPENYFA 427
+ ++ R + + GL CF + VY P + L F+ G + L E F
Sbjct: 361 DLEEEI-RLPRGSGSDL--GLDLCFILPEGVPMSRVYAPPVSLAFE-GVWLRLDKEQMFV 416
Query: 428 L-VGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ ++CL++ + ILG++Q QN + ++L R F K C
Sbjct: 417 EDRASGMMCLMVGKTDGVS---------ILGNYQQQNMQVMYNLRRGRITFIKTAC 463
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 117/400 (29%), Positives = 161/400 (40%), Gaps = 66/400 (16%)
Query: 91 LIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
L++TP Y + S GTPPQ DT + W PC C C + P
Sbjct: 106 LLQTPT-------YVVRASLGTPPQQLL-LAVDTSNDASWIPCAG---CAGCPTSSAAP- 153
Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF 210
F P S+S + + C +P C+ PN C P K C + L Y
Sbjct: 154 ----FDPASSASYRTVPCGSPLCAQ--APNA-----ACPPGGKAC-----GFSLTYADSS 197
Query: 211 TAGLLLSETLRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKF 264
L ++L V + GC + + P G+ G GR S SQ + F
Sbjct: 198 LQAALSQDSLAVAGNAVKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYEATF 257
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLR 323
SYCL S F S L L G + P + TP NP SS YYV +
Sbjct: 258 SYCLPS--FKSLNFSGTLRL------GRNGQPQRIKTTPLLANPHRSS-----LYYVNMT 304
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
I VG K V IP G+ G ++DSG+ FT + P + AV E R++G A
Sbjct: 305 GIRVGRKVVPIPAFDPATGA----GTVLDSGTMFTRLVAPAYVAVRDEVRRRVG-----A 355
Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFTDN 442
V G CF+ + +V P + L F G ++ LP EN + CL +
Sbjct: 356 PVSSLGGFDTCFNTT---AVAWPPVTLLFDG-MQVTLPEENVVIHSTYGTISCLAM---- 407
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
AA P ++ Q QN + FD+ N R GFA+++C
Sbjct: 408 AAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 105/380 (27%), Positives = 163/380 (42%), Gaps = 62/380 (16%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
I DTGS L W C C + P DPS SSS + + C + C +
Sbjct: 148 LIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSV--------SSSYKTVFCNSSTCQDLVAA 199
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILS 238
S C N C Y++ YG G +T G L SE++ + NF+ GC
Sbjct: 200 T--SNSGPCGGNNGVVKTPCE-YVVSYGDGSYTRGDLASESILLGDTKLENFVFGCG--- 253
Query: 239 DRQPAGI-------AGFGRSSESLPSQLGLKKF----SYCLLSRKFDDAPVSSNLVLDTG 287
R G+ G GRSS SL SQ LK F SYCL S + D A S + D+
Sbjct: 254 -RNNKGLFGGSSGLMGLGRSSVSLVSQT-LKTFNGVFSYCLPSLE-DGASGSLSFGNDS- 309
Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
S + + +SYTP +NP FY + L +G +K S
Sbjct: 310 --SVYTNSTSVSYTPLVQNP-----QLRSFYILNLTGASIGGVELK--------SSSFGR 354
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
G+++DSG+ T + +++AV EF++Q + A S L CF+++ + + +P
Sbjct: 355 GILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGY---SILDTCFNLTSYEDISIPI 411
Query: 408 LILKFKGGAKMALPPENYFALV--GNEVLCLILFT---DNAAGPALGRGPAIILGDFQLQ 462
+ + F+G A++ + F V ++CL L + +N G I+G++Q +
Sbjct: 412 IKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---------IIGNYQQK 462
Query: 463 NFYLEFDLANDRFGFAKQKC 482
N + +D +R G + C
Sbjct: 463 NQRVIYDTTQERLGIVGENC 482
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 119/407 (29%), Positives = 180/407 (44%), Gaps = 68/407 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTPP+ I DTGS ++W C S C C + ++ F P SS
Sbjct: 75 GLYYTKVKLGTPPRELYVQI-DTGSDVLWVSCGS---CNGCPQTSGLQIQLNYFDPGSSS 130
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
+S LI C + +C V++ CS RN C +Y QYG G T+G +S+ +
Sbjct: 131 TSSLISCLDRRCR----SGVQTSDASCSGRNNQC-----TYTFQYGDGSGTSGYYVSDLM 181
Query: 221 RFPS--------KTVPNFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQL---GL- 261
F S + + + GCSIL S+R GI GFG+ S+ SQL G+
Sbjct: 182 HFASIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIA 241
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
+ FS+CL K D++ LVL G+ P + Y+P + Y +
Sbjct: 242 PRVFSHCL---KGDNSG-GGVLVL------GEIVEPNIVYSPLVPSQ--------PHYNL 283
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L+ I V + V+I S V + N G IVDSG+T + L E F+ +
Sbjct: 284 NLQSISVNGQIVRIAPS--VFATSNNRGTIVDSGTTLAY----LAEEAYNPFVIAIAAVI 337
Query: 381 RAADVEKKSGLRPCFDISGKKSVYL-PELILKFKGGAKMALPPENYFA---LVGNEVLCL 436
+ S C+ I+ +V + P++ L F GGA + L P++Y +G +
Sbjct: 338 PQSVRSVLSRGNQCYLITTSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNFIGEGSVWC 397
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I F +G ++ ILGD L++ +DLA R G+A C+
Sbjct: 398 IGF-QKISGQSI-----TILGDLVLKDKIFVYDLAGQRIGWANYDCS 438
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 111/385 (28%), Positives = 155/385 (40%), Gaps = 55/385 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + S GTPPQ DT + W PC C C + P F P S+S
Sbjct: 112 YVVRASLGTPPQ-QLLLAVDTSNDASWIPCAG---CAGCPTSSAAP-----FDPAASASY 162
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ + C +P C+ PN C P K C + L Y L ++L
Sbjct: 163 RTVPCGSPLCAQ--APNA-----ACPPGGKAC-----GFSLTYADSSLQAALSQDSLAVA 210
Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAP 277
V + GC + + P G+ G GR S SQ + FSYCL S F
Sbjct: 211 GNAVKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPS--FKSLN 268
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S L L + + + TP NP SS YYV + + VG K V IP
Sbjct: 269 FSGTLRLGR-----NGQPQRIKTTPLLANPHRSS-----LYYVNMTGVRVGRKVVPIPAF 318
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
G+ G ++DSG+ FT + P + AV E R++G A V G CF+
Sbjct: 319 DPATGA----GTVLDSGTMFTRLVAPAYVAVRDEVRRRVG-----APVSSLGGFDTCFNT 369
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILG 457
+ +V P + L F G ++ LP EN +V + I AA P ++
Sbjct: 370 T---AVAWPPMTLLFDG-MQVTLPEEN---VVIHSTYGTISCLAMAAAPDGVNTVLNVIA 422
Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
Q QN + FD+ N R GFA+++C
Sbjct: 423 SMQQQNHRVLFDVPNGRVGFARERC 447
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 118/409 (28%), Positives = 177/409 (43%), Gaps = 69/409 (16%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYR-----------CVDCNFPNVDPSRI 152
Y +++ GTPP + DTGS LVW C + + + P P +
Sbjct: 82 YLAAVNVGTPPVRFLA-VADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPPPEAV 140
Query: 153 PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA 212
F P SSS +GC P C + C G S AC + Y G +A
Sbjct: 141 VYFNPFDSSSYSRVGCDGPSC---LALATNASCNGDSH-------AC-DFRYSYRDGASA 189
Query: 213 -GLLLSETLRF------PSKTVPNFLAGCSILS---DRQPAGIAGFGRSSESLPSQLGLK 262
GLL ++T F + + + GC+ + + Q G+ G G SL SQLG +
Sbjct: 190 TGLLAADTFTFGGNINNDTTSTASIDFGCATGTAGREFQADGMVGLGAGPLSLASQLG-R 248
Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
KFS+CL + DDA + +L+ G + S PG + TP + SSS +Y + +
Sbjct: 249 KFSFCLTAYDIDDA----SSILNFGARAVVSD-PGAATTPL----IASSSNAAAYYAISI 299
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFME-----GPLFEAVAKEFIRQMG 377
+ V + V PG+ VIVD+G+ TF++ PL E++A+ +
Sbjct: 300 DSLKVAGQPV--------PGTTSVSKVIVDTGTVLTFLDRAALLAPLTESLAR--VMDGA 349
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSV--YLPE--LILKFKGGAKMALPPENYFALVGNEV 433
RA ++ L C+D+S K V +P+ L+L GG ++ L E F LV V
Sbjct: 350 GLPRAPPPDET--LELCYDVSRVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKEGV 407
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
LCL + T P L P +LG+ LQ+ ++ DL FA C
Sbjct: 408 LCLAVVT---TSPEL--QPLSVLGNVALQDLHVGIDLDARTATFATANC 451
>gi|62362434|gb|AAX81588.1| nectarin IV [Nicotiana langsdorffii x Nicotiana sanderae]
Length = 437
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 107/390 (27%), Positives = 166/390 (42%), Gaps = 67/390 (17%)
Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
D G +W VDC+ V S PA RS+ L G C F P
Sbjct: 63 LDLGGQFLW---------VDCDQGYVSSSYKPARC--RSAQCSLAGAGG--CGQCFSPPK 109
Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPN-----------F 230
GC+ N TC L + + + T+G L S+ ++ S N F
Sbjct: 110 ----PGCN--NNTCSLLPDNTITRTA---TSGELASDIVQVQSSNGKNPGRNVTDKDFLF 160
Query: 231 LAGCSILSDRQPAGI---AGFGRSSESLPSQLGL-----KKFSYCLLSRKFDDAPVSSNL 282
+ G + L + +G+ AG GR+ SLPSQ +KF+ CL S V
Sbjct: 161 VCGSTFLLEGLASGVKGMAGLGRTRISLPSQFSAEFSFPRKFAVCLSSSTNSKGVV---- 216
Query: 283 VLDTGPGS----GDSKTPGLSYTPFYKNPVGSSSAF--GE---FYYVGLRQIIVGSKHVK 333
+ GP S + SYTP + NPV ++SAF GE Y++G++ I + K V
Sbjct: 217 LFGDGPYSFLPNREFSNNDFSYTPLFINPVSTASAFSSGEPSSEYFIGVKSIKINQKVVP 276
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
I + L + G GG + + + +T +E ++ AV F++++ N +R A V
Sbjct: 277 INTTLLSIDNQGVGGTKISTVNPYTILETSMYNAVTNFFVKELVNITRVASVAP---FGA 333
Query: 394 CFD----ISGKKSVYLPELILKFKG-GAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
CFD +S + +P++ L + + N V VLCL F D P
Sbjct: 334 CFDSRTIVSTRVGPAVPQIDLVLQNENVFWTIFGANSMVQVSENVLCL-GFVDGGINPR- 391
Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
+I++G + +++ L+FDLA+ R GF
Sbjct: 392 ---TSIVIGGYTIEDNLLQFDLASSRLGFT 418
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 105/380 (27%), Positives = 164/380 (43%), Gaps = 62/380 (16%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
I DTGS L W C C + P DPS SSS + + C + C +
Sbjct: 100 LIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSV--------SSSYKTVFCNSSTCQDLVAA 151
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILS 238
S C N C Y++ YG G +T G L SE++ + NF+ GC
Sbjct: 152 T--SNSGPCGGNNGVVKTPCE-YVVSYGDGSYTRGDLASESILLGDTKLENFVFGCG--- 205
Query: 239 DRQPAGI-------AGFGRSSESLPSQLGLKKF----SYCLLSRKFDDAPVSSNLVLDTG 287
R G+ G GRSS SL SQ LK F SYCL S + D A S + D+
Sbjct: 206 -RNNKGLFGGSSGLMGLGRSSVSLVSQT-LKTFNGVFSYCLPSLE-DGASGSLSFGNDS- 261
Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
S + + +SYTP +NP FY + L +G +K S
Sbjct: 262 --SVYTNSTSVSYTPLVQNP-----QLRSFYILNLTGASIGGVELK--------SSSFGR 306
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
G+++DSG+ T + +++AV EF++Q + A S L CF+++ + + +P
Sbjct: 307 GILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGY---SILDTCFNLTSYEDISIPI 363
Query: 408 LILKFKGGAKMALPPENYFALVGNE--VLCLILFT---DNAAGPALGRGPAIILGDFQLQ 462
+ + F+G A++ + F V + ++CL L + +N G I+G++Q +
Sbjct: 364 IKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---------IIGNYQQK 414
Query: 463 NFYLEFDLANDRFGFAKQKC 482
N + +D +R G + C
Sbjct: 415 NQRVIYDTTQERLGIVGENC 434
>gi|356518052|ref|XP_003527698.1| PREDICTED: basic 7S globulin 2-like [Glycine max]
Length = 447
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 99/399 (24%), Positives = 167/399 (41%), Gaps = 64/399 (16%)
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
++ GTP Q ST + D G +W C++R SSS + I
Sbjct: 59 TIGIGTP-QHSTNLVIDLGGENLWHDCSNRRY--------------------NSSSKRKI 97
Query: 167 GCQNPKCSWIFGPNVESRC-----KGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
C++ KC V + C GC+ + T ++ P L Q+ +T ++ +T+
Sbjct: 98 VCKSKKCPE-GAACVSTGCIGPYKPGCAISDCTITVSNP--LAQFSSSYT---MVEDTIF 151
Query: 222 FPSKTVPNFLAGCSILSD-----------RQPAGIAGFGRSSESLPSQLGLK-----KFS 265
+P FLAGC L D R GI GF S +LPSQL L KFS
Sbjct: 152 LSHTYIPGFLAGCVDLDDGLSGNALQGLPRTSKGIIGFSHSELALPSQLVLSNKLIPKFS 211
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPV--GSSSAFGE---FYYV 320
C S ++ N+ + G G ++ L TP NPV G+ S +G Y++
Sbjct: 212 LCFPSS--NNLKGFGNIFIGAGGGHPQVESKFLQTTPLVVNPVATGAVSIYGAPSIEYFI 269
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
++ I + + + S L GNGG + + + +T + L++ +EFI +
Sbjct: 270 DVKAIKIDGHVLNLNSSLLSIDKKGNGGTKISTMTPWTELHSSLYKPFVQEFINK-AEGR 328
Query: 381 RAADVEKKSGLRPCFDISGKKS----VYLPELILKFKGGAKMALPPENYFALVGNEVLCL 436
R V CFD S ++ + +P + L GGA+ + N ++ ++ +
Sbjct: 329 RMKRVAPVPPFDACFDTSTIRNSITGLAVPSIDLVLPGGAQWTIYGANSMTVMTSKNVAC 388
Query: 437 ILFTDNAAGP----ALGRGPAIILGDFQLQNFYLEFDLA 471
+ F D P ++ ++++G QL++ L D+A
Sbjct: 389 LAFVDGGMKPKEMHSIQLEASVVIGGHQLEDNLLVIDMA 427
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 99/389 (25%), Positives = 149/389 (38%), Gaps = 55/389 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSR-YRCVDCNFPNVDPSRIPAFIPKRSSS 162
Y ++L+ GTPPQ + I D G LVW C RC + P D + F P+
Sbjct: 51 YVVNLTIGTPPQPVSAII-DIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPE---- 105
Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS-YLLQYGLGFTAGLLLSETLR 221
P + C+ R+ Y G T G + ++ +
Sbjct: 106 ----------------PCGAAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVA 149
Query: 222 FPSKTVPNFLAGCSILSDRQP----AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
+ GC++ S+ +G G GR++ SL +Q+ FSYCL D
Sbjct: 150 IGTAATARLAFGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYCLAP---PDTG 206
Query: 278 VSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
SS L L G+G G TPF K +S Y + L I G+ + +
Sbjct: 207 KSSALFLGASAKLAGAGK----GAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAGNATIAM 262
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
P S GN + V + + T + ++ + K +G V+ P
Sbjct: 263 PQS-------GNT-ITVSTATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPK 314
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
SG P+L+L F+GGA+M +P +Y GN+ C+ + PALG
Sbjct: 315 ASASGGA----PDLVLAFQGGAEMTVPVSSYLFDAGNDTACVAIL----GSPALGG--VS 364
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
ILG Q N +L FDL + F C+
Sbjct: 365 ILGSLQQVNIHLLFDLDKETLSFEPADCS 393
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 150/360 (41%), Gaps = 62/360 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y L GTPP+ + DTGS ++W C S C C + ++ F P S
Sbjct: 79 GLYYTKLRLGTPPRDFYVQV-DTGSDVLWVSCAS---CNGCPQTSGLQIQLNFFDPGSSV 134
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
++ I C + +CSW ++S GCS +N C +Y QYG G T+G +S+ L
Sbjct: 135 TASPISCSDQRCSW----GIQSSDSGCSVQNNLC-----AYTFQYGDGSGTSGFYVSDVL 185
Query: 221 RFP----SKTVPNFLA----GCS-------ILSDRQPAGIAGFGRSSESLPSQLGL---- 261
+F S VPN A GCS + SDR GI GFG+ S+ SQL
Sbjct: 186 QFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIA 245
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
+ FS+CL LVL G+ P + +TP + Y V
Sbjct: 246 PRVFSHCLKGENGGGGI----LVL------GEIVEPNMVFTPLVPSQ--------PHYNV 287
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNG-GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
L I V + + I S S NG G I+D+G+T ++ EA F+ + N
Sbjct: 288 NLLSISVNGQALPINPSVF---STSNGQGTIIDTGTTLAYLS----EAAYVPFVEAITNA 340
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN--EVLCLI 437
+ S C+ I+ P + L F GGA M L P++Y N LC +
Sbjct: 341 VSQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVASALCFL 400
>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 109/392 (27%), Positives = 166/392 (42%), Gaps = 46/392 (11%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
+SL GTPPQ T + DTGS L W C + + P + + +F P SSS L
Sbjct: 68 VSLPIGTPPQP-TDLVLDTGSQLSWIQCHDKK--IKKRLPPLPKPKTTSFDPSLSSSFSL 124
Query: 166 IGCQNPKCS-WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF-P 223
+ C +P C I + + C +N+ C SY G G L+ E F
Sbjct: 125 LPCNHPICKPRIPDFTLPTSCD----QNRLCHY---SYFYADGT-LAEGNLVREKFTFSK 176
Query: 224 SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLV 283
S + P + GC+ S GI G R S SQ + KFSYC+ SR + +
Sbjct: 177 SLSTPPVILGCAQASTEN-RGILGMNRGRLSFISQAKISKFSYCVPSRTGSNP--TGLFY 233
Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE--FYYVGLRQIIVGSKHVKIPYSYLVP 341
L P S K Y P SS + Y + ++ I + K + +P + P
Sbjct: 234 LGDNPNSSKFK-----YVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNVPPAAFKP 288
Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-------YSRAADVEKKSGLRPC 394
+ G+G ++DSGS T++ +E V +E +R +G Y+ AD+ C
Sbjct: 289 DAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADM--------C 340
Query: 395 FD--ISGKKSVYLPELILKFKGGAKMAL-PPENYFALVGNEVLCLILFTDNAAGPALGRG 451
FD ++ + + + +F G ++ + E V V C+ + LG G
Sbjct: 341 FDAGVTAEVGRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGI----GRSERLGIG 396
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
II G QN ++E+DLAN R GF +C+
Sbjct: 397 SNII-GTVHQQNMWVEYDLANKRVGFGGAECS 427
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 116/414 (28%), Positives = 169/414 (40%), Gaps = 75/414 (18%)
Query: 100 SYGGYSISLSF-----GTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA 154
+Y Y + L F G+PP+ I DTGS ++W C S C C P IP
Sbjct: 74 TYDPYRVGLYFTRVLLGSPPKEFYVQI-DTGSDVLWVSCGS---CNGC--PQSSGLHIPL 127
Query: 155 --FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-T 211
F P SS++ LI C + +CS V+S GCS + C Y QYG G T
Sbjct: 128 NFFDPGSSSTASLISCSDQRCSL----GVQSSDAGCSSQGNQCI-----YTFQYGDGSGT 178
Query: 212 AGLLLSETLRFPS---KTVPN----FLAGCSI-------LSDRQPAGIAGFGRSSESLPS 257
+G +S+ L F + +V N + GCSI SDR GI GFG+ S+ S
Sbjct: 179 SGYYVSDLLNFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVIS 238
Query: 258 QL---GL--KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
Q+ G+ K FS+CL + +++ + Y+P +
Sbjct: 239 QMSSQGITPKVFSHCLKGDGGGGGILVLGEIVE----------EDIVYSPLVPSQ----- 283
Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
Y + L+ I V K + I V + N G IVDSG+T ++ ++
Sbjct: 284 ---PHYNLNLQSISVNGKSLAIDPE--VFATSTNRGTIVDSGTTLAYLAEEAYDPFVSAI 338
Query: 373 IRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL---V 429
+ R S C+ I+ P + L F GG M L PE+Y +
Sbjct: 339 TEAVSQSVRPL----LSKGTQCYLITSSVKGIFPTVSLNFAGGVSMNLKPEDYLLQQNSI 394
Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G+ + I F G+G ILGD L++ +DLA R G+A C+
Sbjct: 395 GDAAVWCIGFQK-----IQGQG-ITILGDLVLKDKIFVYDLAGQRIGWANYDCS 442
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 105/380 (27%), Positives = 163/380 (42%), Gaps = 62/380 (16%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
I DTGS L W C C + P DPS SSS + + C + C +
Sbjct: 148 LIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSV--------SSSYKTVFCNSSTCQDLVAA 199
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILS 238
S C N C Y++ YG G +T G L SE++ + NF+ GC
Sbjct: 200 T--SNSGPCGGNNGVVKTPCE-YVVSYGDGSYTRGDLASESILLGDTKLENFVFGCG--- 253
Query: 239 DRQPAGI-------AGFGRSSESLPSQLGLKKF----SYCLLSRKFDDAPVSSNLVLDTG 287
R G+ G GRSS SL SQ LK F SYCL S + D A S + D+
Sbjct: 254 -RNNKGLFGGSSGLMGLGRSSVSLVSQT-LKTFNGVFSYCLPSLE-DGASGSLSFGNDS- 309
Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
S + + +SYTP +NP FY + L +G +K S
Sbjct: 310 --SVYTNSTSVSYTPLVQNP-----QLRSFYILNLTGASIGGVELK--------SSSFGR 354
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
G+++DSG+ T + +++AV EF++Q + A S L CF+++ + + +P
Sbjct: 355 GILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGY---SILDTCFNLTSYEDISIPI 411
Query: 408 LILKFKGGAKMALPPENYFALV--GNEVLCLILFT---DNAAGPALGRGPAIILGDFQLQ 462
+ + F+G A++ + F V ++CL L + +N G I+G++Q +
Sbjct: 412 IKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVG---------IIGNYQQK 462
Query: 463 NFYLEFDLANDRFGFAKQKC 482
N + +D +R G + C
Sbjct: 463 NQRVIYDSTQERLGIVGENC 482
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 111/385 (28%), Positives = 157/385 (40%), Gaps = 61/385 (15%)
Query: 108 LSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIG 167
+ GTP + DTGSSL W C+ V C+ + P F PK SS+ +G
Sbjct: 1 MGLGTPATQYV-MVVDTGSSLTWLQCSPCL--VSCHRQSG-----PVFNPKSSSTYASVG 52
Query: 168 CQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKT 226
C +CS + P+ CS N Y YG F+ G L +T+ F S +
Sbjct: 53 CSAQQCSDL--PSATLNPSACSSSNVCI------YQASYGDSSFSVGYLSKDTVSFGSTS 104
Query: 227 VPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSS 280
+PNF GC ++ + AG+ G R+ SL QL F+YCL S
Sbjct: 105 LPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSS----SGY 160
Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK--HVKIPYSY 338
+ PG SYTP SSS Y++ L + V V
Sbjct: 161 LSLGSYNPGQ-------YSYTPMV-----SSSLDDSLYFIKLSGMTVAGNPLSVSSSAYS 208
Query: 339 LVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS 398
+P I+DSG+ T + ++ A++K M SRA+ S L CF
Sbjct: 209 SLP-------TIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRAS---AYSILDTCFKGQ 258
Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
+ V P + + F GGA + L +N V + CL A PA R AII G+
Sbjct: 259 ASR-VSAPAVTMSFAGGAALKLSAQNLLVDVDDSTTCL------AFAPA--RSAAII-GN 308
Query: 459 FQLQNFYLEFDLANDRFGFAKQKCA 483
Q Q F + +D+ + R GFA C+
Sbjct: 309 TQQQTFSVVYDVKSSRIGFAAGGCS 333
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 113/400 (28%), Positives = 154/400 (38%), Gaps = 82/400 (20%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS +VW P + + PA P+ +
Sbjct: 120 GEYFAQVGVGTPATTAL-MVLDTGSDVVWAPVRALPPLLRAVRQGSSTGAAPAPTPRWN- 177
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
C P C + GC R +C Y + YG G TAG SETL
Sbjct: 178 ------CVAPICRRL-------DSAGCDRRRNSC-----LYQVAYGDGSVTAGDFASETL 219
Query: 221 RFP-SKTVPNFLAGCSILSDRQPAGIAG-----FGRSSESLPSQLGL---KKFSYCLLSR 271
F V GC D + IA GR S PSQ+ + FSYCL+ R
Sbjct: 220 TFARGARVQRVAIGCG--HDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDR 277
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
S G + FYYV L VG
Sbjct: 278 TSSRRARPSRRW-------------------------GGTPRMATFYYVHLLGFSVGGAR 312
Query: 332 VK-IPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK- 388
VK + S L + + G GGVI+DSG++ T + P++EAV F RAA V +
Sbjct: 313 VKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAF--------RAAAVGLRV 364
Query: 389 -----SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDN 442
S C+++SG++ V +P + + GGA +ALPPENY V C + +
Sbjct: 365 SPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTD 424
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G I+G+ Q Q F + FD R GF + C
Sbjct: 425 --------GGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 456
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 107/397 (26%), Positives = 155/397 (39%), Gaps = 54/397 (13%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y +S+ GTP + DTGS L W C Y C C PN P R+ F SSS
Sbjct: 119 YFVSIRIGTPRPQKFILVTDTGSDLTWMNC--EYWCKSCPKPNPHPGRV--FRANDSSSF 174
Query: 164 QLIGCQNPKC--------SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLL 215
+ I C + C S PN + C R P A + + T GL
Sbjct: 175 RTIPCSSDDCKIELQDYFSLTECPNPNAPCL-FDYRYLNGPRAIGVFANET---VTVGLN 230
Query: 216 LSETLRFPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGL---KKFSYCLL 269
+ +R + + L GC+ + P G+ G G SL +L KFSYCL+
Sbjct: 231 DHKKIR-----LFDVLIGCTESFNETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLV 285
Query: 270 SRKFDDAPVSSN----LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
D SSN L P + K P + +T + + FY V + I
Sbjct: 286 -----DHLSSSNHKNFLSFGDIP---EMKLPKMQHTELLLGYINA------FYPVNVSGI 331
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
VG + I S + G GG+IVDSG++ T + G ++ V + + +
Sbjct: 332 SVGGSMLSI--SSDIWNVTGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPI 389
Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
E CF+ G +P L++ F GA P ++Y V + CL + + G
Sbjct: 390 ELPELNNFCFEDKGFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAEGIKCLGIIKADFPG 449
Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ ILG+ QN E+DL + GF C
Sbjct: 450 SS-------ILGNVMQQNHLWEYDLGRGKLGFGPSSC 479
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 116/414 (28%), Positives = 169/414 (40%), Gaps = 75/414 (18%)
Query: 100 SYGGYSISLSF-----GTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA 154
+Y Y + L F G+PP+ I DTGS ++W C S C C P IP
Sbjct: 59 TYDPYRVGLYFTRVLLGSPPKEFYVQI-DTGSDVLWVSCGS---CNGC--PQSSGLHIPL 112
Query: 155 --FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-T 211
F P SS++ LI C + +CS V+S GCS + C Y QYG G T
Sbjct: 113 NFFDPGSSSTASLISCSDQRCSL----GVQSSDAGCSSQGNQCI-----YTFQYGDGSGT 163
Query: 212 AGLLLSETLRFPS---KTVPN----FLAGCSI-------LSDRQPAGIAGFGRSSESLPS 257
+G +S+ L F + +V N + GCSI SDR GI GFG+ S+ S
Sbjct: 164 SGYYVSDLLNFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVIS 223
Query: 258 QL---GL--KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
Q+ G+ K FS+CL + +++ + Y+P +
Sbjct: 224 QMSSQGITPKVFSHCLKGDGGGGGILVLGEIVE----------EDIVYSPLVPSQ----- 268
Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
Y + L+ I V K + I V + N G IVDSG+T ++ ++
Sbjct: 269 ---PHYNLNLQSISVNGKSLAIDPE--VFATSTNRGTIVDSGTTLAYLAEEAYDPFVSAI 323
Query: 373 IRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL---V 429
+ R S C+ I+ P + L F GG M L PE+Y +
Sbjct: 324 TEAVSQSVRPL----LSKGTQCYLITSSVKGIFPTVSLNFAGGVSMNLKPEDYLLQQNSI 379
Query: 430 GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G+ + I F G+G ILGD L++ +DLA R G+A C+
Sbjct: 380 GDAAVWCIGFQ-----KIQGQG-ITILGDLVLKDKIFVYDLAGQRIGWANYDCS 427
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 112/407 (27%), Positives = 166/407 (40%), Gaps = 68/407 (16%)
Query: 102 GGYSISLSFGTPPQASTPFI-FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
G Y + G P A F+ DTGS ++W C+ C + N+ ++ F P S
Sbjct: 87 GLYFTRVKLGNP--AKEYFVQIDTGSDILWVACSPCTGCPTSSGLNI---QLEFFNPDSS 141
Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSET 219
S+S I C + +C+ E+ C+ + C Y YG G T+G +S+T
Sbjct: 142 STSSRIPCSDDRCTAALQTG-EAVCQSSDSPSSPC-----GYTFTYGDGSGTSGFYVSDT 195
Query: 220 LRFPS--------KTVPNFLAGCS-------ILSDRQPAGIAGFGRSSESLPSQL---GL 261
+ F + + + + GCS + +DR GI GFG+ S+ SQL G+
Sbjct: 196 MYFDTVMGNEQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGV 255
Query: 262 --KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
K FS+CL + D+ LVL G+ PGL +TP + Y
Sbjct: 256 SPKTFSHCL--KGSDNG--GGILVL------GEIVEPGLVFTPLVPSQ--------PHYN 297
Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
+ L I V + K+P + + G IVDSG+T + L + FI +
Sbjct: 298 LNLESIAVSGQ--KLPIDSSLFATSNTQGTIVDSGTTLVY----LVDGAYDPFINAIAAA 351
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---ALVGNEVLCL 436
+ S CF + P L FKGG M + PENY V N VL
Sbjct: 352 VSPSVRSVVSKGIQCFVTTSSVDSSFPTATLYFKGGVSMTVKPENYLLQQGSVDNNVLWC 411
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I + + ILGD L++ +DLAN R G+A C+
Sbjct: 412 IGWQRSQG--------ITILGDLVLKDKIFVYDLANMRMGWADYDCS 450
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 115/407 (28%), Positives = 168/407 (41%), Gaps = 72/407 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTPP+ I DTGS ++W CTS C C + ++ F P SS
Sbjct: 82 GLYYTKVKLGTPPREFNVQI-DTGSDVLWVSCTS---CNGCPKTSELQIQLSFFDPGVSS 137
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
S+ L+ C + +C F ES GCSP N C SY +YG G T+G +S+ +
Sbjct: 138 SASLVSCSDRRCYSNF--QTES---GCSP-NNLC-----SYSFKYGDGSGTSGFYISDFM 186
Query: 221 RFPSKTVPN--------FLAGCSILSD-------RQPAGIAGFGRSSESLPSQLGL---- 261
F + F+ GCS L R GI G G+ S S+ SQL +
Sbjct: 187 SFDTVITSTLAINSSAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLA 246
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
+ FS+CL D +VL G K P YTP + Y V
Sbjct: 247 PRVFSHCLKG----DKSGGGIMVL------GQIKRPDTVYTPLVPSQ--------PHYNV 288
Query: 321 GLRQIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
L+ I V + + I P + + DG I+D+G+T ++ + + + Y
Sbjct: 289 NLQSIAVNGQILPIDPSVFTIATGDGT---IIDTGTTLAYLPDEAYSPFIQAIANAVSQY 345
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENY---FALVGNEVLCL 436
R E CF+I+ PE+ L F GGA M L P Y F+ G+ + C+
Sbjct: 346 GRPITYESYQ----CFEITAGDVDVFPEVSLSFAGGASMVLRPHAYLQIFSSSGSSIWCI 401
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ ILGD L++ + +DL R G+A+ C+
Sbjct: 402 -------GFQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 117/439 (26%), Positives = 177/439 (40%), Gaps = 78/439 (17%)
Query: 60 ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTP 119
A S++RA H + T +S + + GGY ++ S GTPP
Sbjct: 57 ARRSINRANHFFKDSDTSTPESTV--------------IPDRGGYLMTYSVGTPP-TKIY 101
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
I DTGS +VW C +C + P F P +SSS + I C + C +
Sbjct: 102 GIADTGSDIVWLQCEPCEQCYN--------QTTPIFNPSKSSSYKNIPCLSKLCHSV--- 150
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK-----TVPNFLAG 233
R CS +N +C Y + YG + G L +TL S + P + G
Sbjct: 151 ----RDTSCSDQN-----SC-QYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKTVIG 200
Query: 234 CSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVL-D 285
C + +GI G G SL +QLG KFSYCL+ ++ SS L D
Sbjct: 201 CGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFGD 260
Query: 286 TGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSD 344
SGD G+ TP K+PV FY++ L+ VG+K V+ S G D
Sbjct: 261 AAVVSGD----GVVSTPLIKKDPV--------FYFLTLQAFSVGNKRVEFGGS--SEGGD 306
Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY 404
G +I+DSG+T T + ++ + + + R D ++ L C+ + + +
Sbjct: 307 DEGNIIIDSGTTLTLIPSDVYTNLESAVV-DLVKLDRVDDPNQQFSL--CYSLKSNEYDF 363
Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
P + FK GA + L + F + + ++C P LG I G+ QN
Sbjct: 364 -PIITAHFK-GADIELHSISTFVPITDGIVCFAF----QPSPQLGS----IFGNLAQQNL 413
Query: 465 YLEFDLANDRFGFAKQKCA 483
+ +DL F C
Sbjct: 414 LVGYDLQQKTVSFKPTDCT 432
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 372
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 110/395 (27%), Positives = 170/395 (43%), Gaps = 63/395 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPSRIPAFIPKRSSS 162
Y + +S GTPP + I DTGS+L W C + + +C D +I F P SS+
Sbjct: 25 YFMGISLGTPPVFNLVTI-DTGSTLSWVQCKNCQIKCYD---QAAKAGQI--FNPYNSST 78
Query: 163 SQLIGCQNPKCSWIFGPNVESRCK-GCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+GC C+ G +++ + GC + TC Y L+YG G ++ G L + L
Sbjct: 79 YSKVGCSTEACN---GMHMDLAVEYGCVEEDDTCI-----YSLRYGSGEYSVGYLGKDRL 130
Query: 221 RFPS-KTVPNFLAGC--SILSDRQPAGIAGFGRSSESLPSQL----GLKKFSYCLLSRKF 273
S +++ NF+ GC L + AGI GFG S S +Q+ FSYC
Sbjct: 131 TLASNRSIDNFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHE 190
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
++ L GP + D + P Y + ++V ++
Sbjct: 191 NEGS------LTIGPYARDINLMWTKLIYYDHKPA---------YAIQQLDMMVNGIRLE 235
Query: 334 I-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSG 390
I PY Y+ + IVDSG+ T++ P+F+A+ K ++M Y+R D
Sbjct: 236 IDPYIYISKMT------IVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDER---- 285
Query: 391 LRPCFDISGKKSVY---LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
R CF IS S P + +K + + LP EN F N V+C D+A
Sbjct: 286 -RICF-ISNSGSANWNDFPTVEMKLI-RSTLKLPVENAFYESSNNVICSTFLPDDA---- 338
Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G +LG+ +++F L FD+ FGF + C
Sbjct: 339 -GVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 372
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 110/444 (24%), Positives = 169/444 (38%), Gaps = 81/444 (18%)
Query: 69 HLKTKTKPKTKDSNIGSNYSNSLIKTPLSVH---------SYGGYSISLSFGTPPQASTP 119
H ++ P I +YS+ ++K S Y + ++ S G PP
Sbjct: 49 HHESSLSPYNSKDTIWDHYSHKILKQTFSNDYISNLVPSPRYVVFLMNFSIGEPPIPQLA 108
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
+ DTGSSL W C + C C+ +P F P +SS+ + C
Sbjct: 109 -VMDTGSSLTWVMC---HPCSSCS-----QQSVPIFDPSKSSTYSNLSC----------- 148
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGLLLSETLRFPSKT-----VPNFLAG 233
S C C N CP Y ++Y G G + G+ E L + VP+ + G
Sbjct: 149 ---SECNKCDVVNGECP-----YSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFG 200
Query: 234 C----SILSDRQP----AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLD 285
C SI S+ P G+ G G SL G KKFSYC+ + + + + ++ D
Sbjct: 201 CGRKFSISSNGYPYQGINGVFGLGSGRFSLLPSFG-KKFSYCIGNLRNTNYKFNRLVLGD 259
Query: 286 TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI-PYSYLVPGSD 344
GDS T + YYV L I +G + + I P + +D
Sbjct: 260 KANMQGDSTTL---------------NVINGLYYVNLEAISIGGRKLDIDPTLFERSITD 304
Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF------DIS 398
N GVI+DSG+ T++ FE ++ E + A +K + C+ D+S
Sbjct: 305 NNSGVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLS 364
Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
G P + F GA + L + F C+ + N G +I G
Sbjct: 365 G-----FPLVTFHFAEGAVLDLDVTSMFIQTTENEFCMAMLPGNYFGDDYESFSSI--GM 417
Query: 459 FQLQNFYLEFDLANDRFGFAKQKC 482
QN+ + +DL R F + C
Sbjct: 418 LAQQNYNVGYDLNRMRVYFQRIDC 441
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 99/410 (24%), Positives = 172/410 (41%), Gaps = 75/410 (18%)
Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC---VDCNFPNVDPSRIPAFI 156
S G Y + G+PP+ + DTGS ++W C +C D P + +
Sbjct: 74 SIGLYFTKIKLGSPPKEYYVQV-DTGSDILWVNCAPCPKCPVKTDLGIP------LSLYD 126
Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLGFTA-GL 214
K SS+S+ +GC++ CS+I +++TC P SY + YG G T+ G
Sbjct: 127 SKTSSTSKNVGCEDDFCSFIM-------------QSETCGAKKPCSYHVVYGDGSTSDGD 173
Query: 215 LLSETLRFPS-----KTVP---NFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQL 259
+ + + +T P + GC +D GI GFG+S+ S+ SQL
Sbjct: 174 FIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQL 233
Query: 260 GL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
+ FS+CL N+ G+ ++P + TP N V
Sbjct: 234 AAGGSTKRIFSHCL-----------DNMNGGGIFAVGEVESPVVKTTPIVPNQV------ 276
Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
Y V L+ + V + +P S + ++G+GG I+DSG+T ++ L+ ++ ++
Sbjct: 277 --HYNVILKGMDVDGDPIDLPPS--LASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITA 332
Query: 375 QMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL 434
+ + + CF + P + L F+ K+++ P +Y + ++
Sbjct: 333 K-----QQVKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMY 387
Query: 435 CLILFTDNAAGPALGRGP-AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
C F + G G I+LGD L N + +DL N+ G+A C+
Sbjct: 388 C---FGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCS 434
>gi|356500210|ref|XP_003518926.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 435
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 105/411 (25%), Positives = 176/411 (42%), Gaps = 75/411 (18%)
Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
+L + T + TP + D G +W C + Y + P R
Sbjct: 42 TLQYITQIKQRTPLVPENLVLDIGGQFLWVDCDNNYVS-------------STYRPARCG 88
Query: 162 SSQLIGCQNPKCSWIFG---PNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE 218
S+Q ++ C F P + G +P N A L Q + L
Sbjct: 89 SAQCSLARSDSCGNCFSAPKPGCNNNTCGVTPDNTVTGTATSGELAQDVVS------LQS 142
Query: 219 TLRF---PSKTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLG-----LKKFS 265
T F + TV FL C+ Q +G+AG GR+ +LPSQL +KF+
Sbjct: 143 TNGFNPIQNATVSRFLFSCAPTFLLQGLATGVSGMAGLGRTRIALPSQLASAFSFRRKFA 202
Query: 266 YCLLSRK----FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE----- 316
CL S F D P ++L P S+ L++TP NPV ++SAF +
Sbjct: 203 VCLSSSNGVAFFGDGPY---VLL---PNVDASQL--LTFTPLLINPVSTASAFSQGEPSA 254
Query: 317 FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
Y++G++ I + K V + + L S G GG + S + +T +E +F+AV + F++
Sbjct: 255 EYFIGVKSIKIDEKTVPLNTTLLSINSKGVGGTKISSVNPYTVLEDSIFKAVTEAFVKA- 313
Query: 377 GNYSRAADVEKKSGLRP---CFD----ISGKKSVYLP--ELILKFKGGAKMALPPENYFA 427
S A ++ + + + P CF ++ + +P EL+L+ + + +
Sbjct: 314 ---SSARNITRVASVAPFEVCFSRENVLATRLGAAVPTIELVLQNQKTVWRIFGANSMVS 370
Query: 428 LVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
+ ++VLCL F + P +I++G +QL++ L+FDLA R GF+
Sbjct: 371 VSDDKVLCL-GFVNGGENPR----TSIVIGGYQLEDNLLQFDLATSRLGFS 416
>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 110/392 (28%), Positives = 165/392 (42%), Gaps = 46/392 (11%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
+SL GTPPQ T + DTGS L W C + V P + + +F P SSS L
Sbjct: 68 VSLPIGTPPQP-TDLVLDTGSQLSWIQCHDKK--VKKRLPPLPKPKTASFDPSLSSSFSL 124
Query: 166 IGCQNPKCS-WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF-P 223
+ C +P C I + + C +N+ C SY G G L+ E F
Sbjct: 125 LPCNHPICKPRIPDFTLPTSCD----QNRLCHY---SYFYADGT-LAEGNLVREKFTFSK 176
Query: 224 SKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLV 283
S + P + GC+ S GI G S SQ + KFSYC+ SR + +
Sbjct: 177 SLSTPPVILGCAQASTEN-RGILGMNHGRLSFISQAKISKFSYCVPSRTGSNP--TGLFY 233
Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE--FYYVGLRQIIVGSKHVKIPYSYLVP 341
L P S K Y P SS + Y + ++ I + K + IP + P
Sbjct: 234 LGDNPNSSKFK-----YVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKP 288
Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-------YSRAADVEKKSGLRPC 394
+ G+G ++DSGS T++ +E V +E +R +G Y+ AD+ C
Sbjct: 289 DAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADM--------C 340
Query: 395 FD--ISGKKSVYLPELILKFKGGAKMAL-PPENYFALVGNEVLCLILFTDNAAGPALGRG 451
FD ++ + + + +F G ++ + E V V C+ + LG G
Sbjct: 341 FDAGVTAEVGRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGI----GRSERLGIG 396
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
II G QN ++E+DLAN R GF +C+
Sbjct: 397 SNII-GTVHQQNMWVEYDLANKRVGFGGAECS 427
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 99/410 (24%), Positives = 172/410 (41%), Gaps = 75/410 (18%)
Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC---VDCNFPNVDPSRIPAFI 156
S G Y + G+PP+ + DTGS ++W C +C D P + +
Sbjct: 70 SIGLYFTKIKLGSPPKEYYVQV-DTGSDILWVNCAPCPKCPVKTDLGIP------LSLYD 122
Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLGFTA-GL 214
K SS+S+ +GC++ CS+I +++TC P SY + YG G T+ G
Sbjct: 123 SKTSSTSKNVGCEDDFCSFIM-------------QSETCGAKKPCSYHVVYGDGSTSDGD 169
Query: 215 LLSETLRFPS-----KTVP---NFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQL 259
+ + + +T P + GC +D GI GFG+S+ S+ SQL
Sbjct: 170 FIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQL 229
Query: 260 GL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
+ FS+CL N+ G+ ++P + TP N V
Sbjct: 230 AAGGSTKRIFSHCL-----------DNMNGGGIFAVGEVESPVVKTTPIVPNQV------ 272
Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
Y V L+ + V + +P S + ++G+GG I+DSG+T ++ L+ ++ ++
Sbjct: 273 --HYNVILKGMDVDGDPIDLPPS--LASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITA 328
Query: 375 QMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL 434
+ + + CF + P + L F+ K+++ P +Y + ++
Sbjct: 329 K-----QQVKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMY 383
Query: 435 CLILFTDNAAGPALGRGP-AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
C F + G G I+LGD L N + +DL N+ G+A C+
Sbjct: 384 C---FGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCS 430
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 126/454 (27%), Positives = 180/454 (39%), Gaps = 90/454 (19%)
Query: 52 PLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFG 111
P + + S S+SR H TK+S+I ++ + S + + + G Y + S G
Sbjct: 50 PTQRIVSAVRRSMSRVHHFS-----PTKNSDIFTDTAQSEM-----ISNQGEYLMKFSLG 99
Query: 112 TPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNP 171
TP I DTGS L+W C +C + P F PK SS+ + I C
Sbjct: 100 TP-AFDILAIADTGSDLIWTQCKPCDQCYE--------QDAPLFDPKSSSTYRDISCSTK 150
Query: 172 KCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKT---- 226
+C + + C G NKTC Y YG FT+G + ++T+ S +
Sbjct: 151 QCDLL---KEGASCSG--EGNKTC-----HYSYSYGDRSFTSGNVAADTITLGSTSGRPV 200
Query: 227 -VPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPV 278
+P + GC + + +GI G G SL SQLG KFSYCL+ P+
Sbjct: 201 LLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLV-------PL 253
Query: 279 SSNLV----LDTG-----PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
SSN L+ G G G TP +S P FY++ L + VGS
Sbjct: 254 SSNATNSSKLNFGSNGIVSGGGVQSTPLISKDP------------DTFYFLTLEAVSVGS 301
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
+ +K P S G +I+DSG+T T F E + + VE S
Sbjct: 302 ERIKFPGSSF---GTSEGNIIIDSGTTLTLFPEDFF----SELSSAVQDAVAGTPVEDPS 354
Query: 390 G-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
G L C+ I + P + F GA + L P N F V + VLC N+
Sbjct: 355 GILSLCYSIDAD--LKFPSITAHFD-GADVKLNPLNTFVQVSDTVLCFAFNPINSGA--- 408
Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I G+ NF + +DL F C
Sbjct: 409 ------IFGNLAQMNFLVGYDLEGKTVSFKPTDC 436
>gi|449432735|ref|XP_004134154.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
gi|449527085|ref|XP_004170543.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
Length = 435
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 119/409 (29%), Positives = 179/409 (43%), Gaps = 72/409 (17%)
Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
SL + T TP + D G +W VDC+ V S PA R
Sbjct: 41 SLQYITEIHQRTPLVPVKLTVDLGGQFMW---------VDCDRGYVSSSYKPA----RCR 87
Query: 162 SSQL-IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY---GLGFTAGLLLS 217
S+Q + ++ C F P GC+ N TC L + +++ G + + +S
Sbjct: 88 SAQCSLASKSSACGQCFSPPRP----GCN--NNTCSLFPGNTIIRLSTSGEVASDVVSVS 141
Query: 218 ETLRF-PSKTV--PNFLAGCS---ILSDRQPA--GIAGFGRSSESLPSQLGL-----KKF 264
T F P++ V PNFL C +L P G+AGFGR+ SLPSQ +KF
Sbjct: 142 STNGFNPTRAVSIPNFLFVCGSTFLLEGLAPGVTGMAGFGRNGISLPSQFAAAFSFNRKF 201
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGS-----GDSKTPGLSYTPFYKNPVGSS--SAFGE- 316
+ CL SS V+ +G G T +YTP + NPV ++ S+ GE
Sbjct: 202 AVCL------SGSTSSPGVIFSGNGPYHFLPNIDLTNSFTYTPLFINPVSTAGVSSAGEK 255
Query: 317 --FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
Y++G+ I+V SK V + + L S+GNGG + + + FT +E +++A+ K F
Sbjct: 256 STEYFIGVTSIVVNSKPVPLNTTLLKIDSNGNGGTKISTVNPFTVLESSIYKALVKAFTT 315
Query: 375 QMGNYSRAADVEKKSGLRPCFDISGKKSVYLP------ELILKFKGGAKMALPPENYFAL 428
++ R V C+ S L +L+L+ K ++ N
Sbjct: 316 EVSKVPRVGAVAP---FEVCYSSKSFPSTRLGAGVPTIDLVLQNK-KVIWSMFGANSMVQ 371
Query: 429 VGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
V +EVLCL F D + AI++G Q+++ LEFDLA R GF
Sbjct: 372 VNDEVLCL-GFVDG----GVDVRTAIVIGAHQIEDKLLEFDLATSRLGF 415
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 110/404 (27%), Positives = 162/404 (40%), Gaps = 66/404 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +S+S GTPP I DTGS L W C +C N P F K+SS
Sbjct: 83 GEYFMSISIGTPPSKFLA-IADTGSDLTWVQCKPCQQCYKQN--------TPLFDKKKSS 133
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ + C + C+ + +GC C Y YG FT G + +ET+
Sbjct: 134 TYKTESCDSITCNAL-----SEHEEGCDESRNACK-----YRYSYGDESFTKGEVATETI 183
Query: 221 RFPSK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCL 268
S + P GC + + +GI G G SL SQLG KKFSYCL
Sbjct: 184 SIDSSSGSPVSFPGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCL 243
Query: 269 LSRKFDDAPVSSNLVLDTGPGSGD---SKTPGLSYTPF-YKNPVGSSSAFGEFYYVGLRQ 324
A + V++ G S SK + TP K+P +Y++ L
Sbjct: 244 ---SHTSATTNGTSVINLGTNSMTSKPSKDSAILTTPLIQKDP-------ETYYFLTLEA 293
Query: 325 IIVGSKHVKIPYS-----YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
I VG K+PY+ L S G +I+DSG+T T ++ ++ +
Sbjct: 294 ITVG--KTKLPYTGGGGYSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGA 351
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
R +D + L CF SG K + LP + + F GA + L P N F + +++CL +
Sbjct: 352 KRVSD--PQGILTHCFK-SGDKEIGLPTITMHFT-GADVKLSPINSFVKLSEDIVCLSMI 407
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+ +F + +DL F + C+
Sbjct: 408 PTTEVA---------IYGNMVQMDFLVGYDLETKTVSFQRMDCS 442
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 110/395 (27%), Positives = 170/395 (43%), Gaps = 63/395 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPSRIPAFIPKRSSS 162
Y + +S GTPP + I DTGS+L W C + + +C D +I F P SS+
Sbjct: 6 YFMGISLGTPPVFNLVTI-DTGSTLSWVQCKNCQIKCYD---QAAKAGQI--FNPYNSST 59
Query: 163 SQLIGCQNPKCSWIFGPNVESRCK-GCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+GC C+ G +++ + GC + TC Y L+YG G ++ G L + L
Sbjct: 60 YSKVGCSTEACN---GMHMDLAVEYGCVEEDDTCI-----YSLRYGSGEYSVGYLGKDRL 111
Query: 221 RFPS-KTVPNFLAGC--SILSDRQPAGIAGFGRSSESLPSQL----GLKKFSYCLLSRKF 273
S +++ NF+ GC L + AGI GFG S S +Q+ FSYC
Sbjct: 112 TLASNRSIDNFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHE 171
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
++ L GP + D + P Y + ++V ++
Sbjct: 172 NEGS------LTIGPYARDINLMWTKLIYYDHKPA---------YAIQQLDMMVNGIRLE 216
Query: 334 I-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM--GNYSRAADVEKKSG 390
I PY Y+ + IVDSG+ T++ P+F+A+ K ++M Y+R D
Sbjct: 217 IDPYIYISKMT------IVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDER---- 266
Query: 391 LRPCFDISGKKSVY---LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
R CF IS S P + +K + + LP EN F N V+C D+A
Sbjct: 267 -RICF-ISNSGSANWNDFPTVEMKLI-RSTLKLPVENAFYESSNNVICSTFLPDDA---- 319
Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G +LG+ +++F L FD+ FGF + C
Sbjct: 320 -GVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 353
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 103/371 (27%), Positives = 158/371 (42%), Gaps = 57/371 (15%)
Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
DT + W +C C P P R P F P SS++ + C++P C + GP
Sbjct: 152 IDTTVDVPWI------QCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSL-GPY- 203
Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-VPNFLAGCS---- 235
GCS N++ C YL++Y TAG +++TL T V NF GCS
Sbjct: 204 ---GNGCS--NRSANAEC-RYLIEYSDDRATAGTYMTDTLTISGTTAVRNFRFGCSHAVR 257
Query: 236 -ILSDRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
SD AG G ++SL +Q FSYC+ +S + GP +
Sbjct: 258 GRFSDLT-AGTMSLGGGAQSLLAQTARSLGNAFSYCVPQAS------ASGFLSIGGPATT 310
Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
+S T + TP ++ + S Y V L+ I+V + + IP + G ++
Sbjct: 311 NSTTV-FATTPLVRSAINPS-----LYLVRLQGIVVAGRRLGIPPVAF------SAGAVM 358
Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
DS + T + + A+ + F M Y R+ L C+D G +V +P + L
Sbjct: 359 DSSAVITQLPPTAYRALRRAFRNAMRAYPRSG---ATGTLDTCYDFLGLTNVRVPAVSLV 415
Query: 412 FKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
F GGA + L P ++G CL FT ++ ALG +G+ Q Q + +D+A
Sbjct: 416 FGGGAVVVLDPPA--VMIGG---CLA-FTATSSDLALG-----FIGNVQQQTHEVLYDVA 464
Query: 472 NDRFGFAKQKC 482
GF + C
Sbjct: 465 AGGVGFRRGAC 475
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 108/388 (27%), Positives = 158/388 (40%), Gaps = 60/388 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y I++ G+P T DTGS + W C C C+ VD F P SS+
Sbjct: 122 YVITVGIGSPAVTQT-MSMDTGSDVSWVQCKP---CSQCH-SEVDS----LFDPSSSSTY 172
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSET-LRF 222
C + C+ + + GC + C Y++ YG + S L
Sbjct: 173 SPFSCSSAPCAQL---SQSQEGNGC--MSSQC-----QYIVNYGDSSSTTGTYSSDTLTL 222
Query: 223 PSKTVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDD 275
S + +F GCS + Q G+ G G ++SL SQ FSYCL
Sbjct: 223 GSSAMTDFQFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCL------- 275
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
P S GS T G + F K P+ S+ +Y V L I VGS+ + +P
Sbjct: 276 PPTS---------GSSGFLTLGTGSSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLP 326
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG-LRPC 394
S GS ++DSG+ T + + A++ F M Y A SG L C
Sbjct: 327 TSVFSAGS------LMDSGTIITRLPPTAYSALSSAFKAGMQQYPPAT----PSGILDTC 376
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
FD SG+ S+ +P + L F GGA + L + + + + CL FT N +LG
Sbjct: 377 FDFSGQSSISIPTVTLVFSGGAAVDLAFDGIMLEISSSIRCLA-FTPNGDDSSLG----- 430
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + F + +D+ GF C
Sbjct: 431 IIGNVQQRTFEVLYDVGGGAVGFKAGAC 458
>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
Length = 469
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 102/356 (28%), Positives = 149/356 (41%), Gaps = 35/356 (9%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
I+++ GTP + + D S VW C P AF P S++
Sbjct: 90 INITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAA-----GCLPPPATAFRPNGSATFSP 144
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA-CPSYLLQYG--LGFTAGLLLSETLRF 222
+ C + C P + C A C SY L YG T+G L ++T F
Sbjct: 145 LPCSSDMCL----PVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTF 200
Query: 223 PSKTVPNFLAGCSILSDRQPAG---IAGFGRSSESLPSQLGLKKFSYCLLS-RKFDDAPV 278
+ VP + GCS S AG + G GR + SL SQL KFSY LL+ DD
Sbjct: 201 GATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLLAPEATDDGSA 260
Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV-GSKHVKIPYS 337
S + GD P P+ SS+ + +FYYV L + V G++ IP
Sbjct: 261 DSVIRF------GDDAVPKTKRG--QSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAG 312
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG----NYSRAADVEKKSGLRP 393
++G GGVI+ S + T++E ++ V ++G N S A +++
Sbjct: 313 TFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAALELDL------ 366
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
C++ S V +P+L L F GGA M L NYF + + L + + G LG
Sbjct: 367 CYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGSVLG 422
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 116/415 (27%), Positives = 173/415 (41%), Gaps = 74/415 (17%)
Query: 97 SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFI 156
+V YG + +L GTP + I DTGS++ + PC S R +C + D AF
Sbjct: 55 AVKDYGYFYATLHLGTPAR-QFAVIVDTGSTITYVPCASCGR--NCGPHHKDA----AFD 107
Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLL 216
P SSSS +IGC + KC P C GCS + + C +AGLL+
Sbjct: 108 PASSSSSAVIGCDSDKCICGRPP-----C-GCSEKRE-----CTYQRTYAEQSSSAGLLV 156
Query: 217 SETLRFPSKTVPNFLAGCSI-----LSDRQPAGIAGFGRSSESLPSQLGLKK-----FSY 266
S+ L+ V + GC + +++ GI G G S SL +QL F+
Sbjct: 157 SDQLQLRDGAV-EVVFGCETKETGEIYNQEADGILGLGNSEVSLVNQLAGSGVIDDVFAL 215
Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
C S + D A L G L YT SS A +Y V L +
Sbjct: 216 CFGSVEGDGA-------LMLGDVDAAEYDVALQYTALL-----SSLAHPHYYSVQLEALW 263
Query: 327 VGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF----EAVAKEFIRQMGNYSR 381
VG + + + P Y + G ++DSG+TFT++ F EAV+ + N +
Sbjct: 264 VGGQQLPVKPERY-----EEGYGTVLDSGTTFTYLPSEAFQLFKEAVSAYALEHGLNSVK 318
Query: 382 AADVEKKSGLR---PCF---------DISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
D ++KS + CF D S + V+ P L+F G ++ P NY +
Sbjct: 319 GPDPKEKSFAQFHDICFGGAPHAGHADQSKLEKVF-PVFELQFADGVRLRTGPLNYLFMH 377
Query: 430 GNEV--LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
E+ CL +F + A+G +LG +N +++D N R GF C
Sbjct: 378 TGEMGAYCLGVFDNGASG--------TLLGGISFRNILVQYDRRNRRVGFGAASC 424
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 114/417 (27%), Positives = 175/417 (41%), Gaps = 71/417 (17%)
Query: 84 GSNYSNSLIKTPLSVHSYGGYSI--SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD 141
GS SN+ K +S S G +I ++S G PP + DTGS ++W CT C +
Sbjct: 80 GSLVSNNEYKARVSP-SLTGRTIMANISIGQPPIPQL-VVMDTGSDILWVMCTP---CTN 134
Query: 142 CNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC--KGCSPRNKTCPLAC 199
C+ + L +P S F P ++ C KGCS R P
Sbjct: 135 CD-------------------NHLGLLFDPSMSSTFSPLCKTPCDFKGCS-RCDPIP--- 171
Query: 200 PSYLLQYGLGFTA-GLLLSETLRFPSKT-----VPNFLAGC--SILSDRQPA--GIAGFG 249
+ + Y TA G+ +T+ F + +P+ L GC +I D P GI G
Sbjct: 172 --FTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLFGCGHNIGQDTDPGHNGILGLN 229
Query: 250 RSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVG 309
+SL +++G +KFSYC+ D L+L G + G S TPF
Sbjct: 230 NGPDSLATKIG-QKFSYCI-GDLADPYYNYHQLILGEG-----ADLEGYS-TPF------ 275
Query: 310 SSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVA 369
FYYV + I VG K + I + GGVI+D+GST TF+ + ++
Sbjct: 276 --EVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTITFLVDSVHRLLS 333
Query: 370 KEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV 429
KE +G R +EK ++ + + V P + F GA +AL ++F +
Sbjct: 334 KEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFHFADGADLALDSGSFFNQL 393
Query: 430 GNEVLCLILFTDNAAGPA----LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ V C+ + GP L P++I G Q++ + +DL N F + C
Sbjct: 394 NDNVFCMTV------GPVSSLNLKSKPSLI-GLLAQQSYSVGYDLVNQFVYFQRIDC 443
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 110/389 (28%), Positives = 158/389 (40%), Gaps = 58/389 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + + GTPP T +FDTGS W C CV + D R+ F P +SS+
Sbjct: 163 YVVPIGLGTPPSRFT-VVFDTGSDTTWVQCRP---CVVSCYKQKD--RL--FDPAKSSTY 214
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
+ C +P C+ + GC+ + Y +QYG G +T G +TL
Sbjct: 215 ANVSCADPACADL-------DASGCNAGHCL-------YGIQYGDGSYTVGFFAKDTLAV 260
Query: 223 PSKTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDA 276
+ F GC + Q AG+ G GR S+ Q K FSYCL +
Sbjct: 261 AQDAIKGFKFGCGEKNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPASSAATG 320
Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-KIP 335
+ + + GS TP L+ K P FYYVGL I VG K + IP
Sbjct: 321 YLEFGPLSPSSSGSNAKTTPMLTD----KGPT--------FYYVGLTGIRVGGKQLGAIP 368
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFM--EGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
S N G +VDSG+ T + + A Y +AA S L
Sbjct: 369 ESVF-----SNSGTLVDSGTVITRLPDTAYAALSSAFAAAMAASGYKKAA---AYSILDT 420
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
C+D +G V LP + L F+GGA + L + +CL F N ++G
Sbjct: 421 CYDFTGLSQVSLPTVSLVFQGGACLDLDASGIVYAISQSQVCL-GFASNGDDESVG---- 475
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + + + +D++ GFA C
Sbjct: 476 -IVGNTQQRTYGVLYDVSKKVVGFAPGAC 503
>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
Length = 469
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 102/356 (28%), Positives = 149/356 (41%), Gaps = 35/356 (9%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
I+++ GTP + + D S VW C P AF P S++
Sbjct: 90 INITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAA-----GCLPPPATAFRPNGSATFSP 144
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLA-CPSYLLQYG--LGFTAGLLLSETLRF 222
+ C + C P + C A C SY L YG T+G L ++T F
Sbjct: 145 LPCSSDMCL----PVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTF 200
Query: 223 PSKTVPNFLAGCSILSDRQPAG---IAGFGRSSESLPSQLGLKKFSYCLLS-RKFDDAPV 278
+ VP + GCS S AG + G GR + SL SQL KFSY LL+ DD
Sbjct: 201 GATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLLAPEATDDGSA 260
Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV-GSKHVKIPYS 337
S + GD P P+ SS+ + +FYYV L + V G++ IP
Sbjct: 261 DSVIRF------GDDAVPKTKRG--RSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAG 312
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG----NYSRAADVEKKSGLRP 393
++G GGVI+ S + T++E ++ V ++G N S A +++
Sbjct: 313 TFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAALELDL------ 366
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
C++ S V +P+L L F GGA M L NYF + + L + + G LG
Sbjct: 367 CYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGSVLG 422
>gi|388516731|gb|AFK46427.1| unknown [Medicago truncatula]
Length = 435
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 106/398 (26%), Positives = 167/398 (41%), Gaps = 84/398 (21%)
Query: 121 IFDTGSSLVWFPCTSRY----------RCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
I D G +W C ++Y R C+ N D PK GC N
Sbjct: 63 IVDLGGQFLWVDCENKYISSTYRPARCRSAQCSLANSDGCGDCFSSPKP-------GCNN 115
Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYL------LQYGLGFTAGLLLSETLRFPS 224
C G +P N A L +Q GF G + + RF
Sbjct: 116 NTC-------------GVTPDNSITHTATSGELAEDVLSIQSSNGFNPGQNVVVS-RFLF 161
Query: 225 KTVPNFL-AGCSILSDRQPAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRK----FD 274
P FL G + +G+AG GR+ +LPSQL +KF+ CL S K F
Sbjct: 162 SCAPTFLLKGLAT----GASGMAGLGRTKIALPSQLASAFSFARKFAICLSSSKGVVLFG 217
Query: 275 DAPVS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIV 327
D P N+V D+ DS L+YTP NPV ++SAF + Y++G++ I +
Sbjct: 218 DGPYGFLPNVVFDS-----DS----LTYTPLLINPVSTASAFSQGQPSAEYFIGVKTIKI 268
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
K V + S L ++G GG + + +T +E +++AV F++ A ++++
Sbjct: 269 DEKVVSLNTSLLSIDNNGVGGTKISTVDPYTVLEASIYKAVTDAFVKAPA----ARNIKR 324
Query: 388 KSGLRP---CF-DISGKK---SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
+ P C+ +++G + +V EL L+ + N + +EVLCL
Sbjct: 325 VGSVAPFEFCYTNLTGTRLGAAVPTIELFLQ-NENVVWRIFGANSMVSINDEVLCLGFVN 383
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
+I++G +QL+N L+FDLA + GF+
Sbjct: 384 GGK-----NTRTSIVIGGYQLENNLLQFDLAASKLGFS 416
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 109/392 (27%), Positives = 163/392 (41%), Gaps = 96/392 (24%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA----FIPKR 159
Y ++++ G+PP+ S I DTGS LVW V C N D S A F P R
Sbjct: 101 YLMTVNLGSPPR-SMLAIADTGSDLVW---------VKCKKGNNDTSSAAAPTTQFDPSR 150
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSE 218
SS+ + CQ C E+ + C +YL YG G T G+L +E
Sbjct: 151 SSTYGRVSCQTDAC--------EALGRATCDDGSNC-----AYLYAYGDGSNTTGVLSTE 197
Query: 219 TLRF--------PSKT-VPNFLAGCSILSDRQ--PAGIAGFGRSSESLPSQLGL-----K 262
T F P + + GCS + G+ G G + SL +QLG +
Sbjct: 198 TFTFDDGGAGRSPRQVRIGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGR 257
Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
+FSYCL+ V+++ L+ G D PG + TP N +S+A
Sbjct: 258 RFSYCLVPHS-----VNASSALNFG-ALADVTEPGAASTPLVGNKTVASAASSR------ 305
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
+IVDSG+T TF++ L + E R++
Sbjct: 306 --------------------------IIVDSGTTLTFLDPSLLGPIVDELSRRI----TL 335
Query: 383 ADVEKKSGL-RPCFDISGKK---SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
V+ GL + C++++G++ +P+L L+F GGA +AL PEN F V LCL +
Sbjct: 336 PPVQSPDGLLQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAI 395
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDL 470
+ P ILG+ QN ++ +DL
Sbjct: 396 VATTE------QQPVSILGNLAQQNIHVGYDL 421
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 70/139 (50%), Gaps = 14/139 (10%)
Query: 349 VIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL-RPCFDISGKK---SVY 404
+IVDSG+T TF++ L + E R++ V+ GL + C++++G++
Sbjct: 439 IIVDSGTTLTFLDPSLLGPIVDELSRRI----TLPPVQSPDGLLQLCYNVAGREVEAGES 494
Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
+P+L L+F GGA +AL PEN F V LCL + + P ILG+ QN
Sbjct: 495 IPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTE------QQPVSILGNLAQQNI 548
Query: 465 YLEFDLANDRFGFAKQKCA 483
++ +DL FA CA
Sbjct: 549 HVGYDLDAGTVTFAVADCA 567
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 103/390 (26%), Positives = 157/390 (40%), Gaps = 62/390 (15%)
Query: 104 YSISLSFGTP--PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y ++L FGTP PQ + DTGS + W CT CN P + P F P +SS
Sbjct: 131 YVVTLGFGTPSVPQV---LLMDTGSDVSWVQCTP------CNSTKCYPQKDPLFDPSKSS 181
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ I C C + + GC+ C Y ++Y G + G+ +ETL
Sbjct: 182 TYAPIACNTDACRKLG----DHYHNGCTSGGTQC-----GYSVEYADGSHSRGVYSNETL 232
Query: 221 RF-PSKTVPNFLAGCSILSDRQPA----GIAGFGRSSESLPSQLGL---KKFSYCLLSRK 272
P TV +F GC R P+ G+ G G + SL Q FSYCL +
Sbjct: 233 TLAPGITVEDFHFGCG-RDQRGPSDKYDGLLGLGGAPVSLVVQTSSVYGGAFSYCLPALN 291
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
+ + LVL + P S +TP P + FY V + I VG K +
Sbjct: 292 SE----AGFLVLGSPPSGNKSA---FVFTPMRHLP-----GYATFYMVTMTGISVGGKPL 339
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
IP S GG+I+DSG+ T + + A+ + + +A +
Sbjct: 340 HIPQSAF------RGGMIIDSGTVDTELPETAYNALEAALRKAL----KAYPLVPSDDFD 389
Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
C++ +G ++ +P + F GGA + L N ++ N+ L +GP G G
Sbjct: 390 TCYNFTGYSNITVPRVAFTFSGGATIDLDVPN--GILVNDCLAF-----QESGPDDGLG- 441
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ + + +D GF C
Sbjct: 442 --IIGNVNQRTLEVLYDAGRGNVGFRAGAC 469
>gi|357491945|ref|XP_003616260.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517595|gb|AES99218.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 441
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 122/466 (26%), Positives = 187/466 (40%), Gaps = 88/466 (18%)
Query: 42 KHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY 101
KH HH+ +D + S L+ + + TKT P I +S + Y
Sbjct: 26 KHNKHHNVNDSFSL-----SFPLTLSINSTTKTNP---------------IVPSISPYKY 65
Query: 102 G-GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
++L GTPPQ + DTGS + W C ++ P +F P S
Sbjct: 66 SMALVVTLPIGTPPQLQQ-MVLDTGSQVSWIHCDNKK-----GPQKKQPPTTSSFDPSLS 119
Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPS-YLLQYGLGFTAGL----- 214
SS + C +P C P V + + P C + L Y +T G
Sbjct: 120 SSFFALPCNHPLCK----PQVP---------DISLPTDCDANRLCHYSFSYTDGTVVEGN 166
Query: 215 LLSETLRF-PSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKF 273
L+ E + PS T P + GC+ SD GI G S P+Q + KFSY + ++
Sbjct: 167 LVRENIALSPSLTTPPIILGCANQSD-DARGILGMNLGRLSFPNQAKITKFSYFVPVKQ- 224
Query: 274 DDAPVSSNLVLDTGPGSG---DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
P S +L L P S K S + + P AF + ++ I +G K
Sbjct: 225 -TQPGSGSLYLGNNPNSSCFRYVKLLTFSKSQSQRMPNLDPLAF----TLPMQGISIGGK 279
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-------YSRAA 383
+ IP S P + G G I+DSGS F++M + + E ++++G+ Y A
Sbjct: 280 KLNIPPSVFKPDTTGFGQTIIDSGSEFSYMVDKAYNVIRNELVKKVGSKIKKDYIYGGVA 339
Query: 384 DVEKKSGLRPCFDISGKKSVYLP-ELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
D+ CFD + L +++ +F+ G ++ +P E V V C
Sbjct: 340 DI--------CFDGDATEIGRLVGDMVFEFEKGVEIVIPKERVLIEVDGGVHCF------ 385
Query: 443 AAGPALGRGP-----AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+GR I+G+F QN ++EFDLA R GF C+
Sbjct: 386 ----GIGRAEGLGGGGNIIGNFYQQNLWVEFDLAKHRVGFRGANCS 427
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 107/407 (26%), Positives = 164/407 (40%), Gaps = 70/407 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+PP+ I DTGS ++W C S C +C + ++ F SS
Sbjct: 64 GLYFTKVKLGSPPREFNVQI-DTGSDVLWVCCNS---CNNCPRTSGLGIQLNFFDSSSSS 119
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
++ L+ C +P C+ V++ CSP+ C SY QY G T+G +S+TL
Sbjct: 120 TAGLVHCSDPICT----SAVQTTVTQCSPQTNQC-----SYTFQYEDGSGTSGYYVSDTL 170
Query: 221 RFPS----KTVPN----FLAGCSI-------LSDRQPAGIAGFGRSSESLPSQLGL---- 261
F + V N + GCS ++D+ GI GFG+ S+ SQL
Sbjct: 171 YFDAILGESLVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGIT 230
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
+ FS+CL + +L+ PG+ Y+P + Y +
Sbjct: 231 PRVFSHCLKGEGIGGGILVLGEILE----------PGMVYSPLVPSQ--------PHYNL 272
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L+ I V K + I S V + + G IVDSG+T ++ ++ F+ +
Sbjct: 273 NLQSIAVNGKLLPIDPS--VFATSNSQGTIVDSGTTLAYLVAEAYDP----FVSAVNVIV 326
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFA----LVGNEVLCL 436
+ S C+ +S S P F GGA M L PE+Y G V+
Sbjct: 327 SPSVTPIISKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYLIPFGPSQGGSVMWC 386
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I F ILGD L++ +DL R G+A C+
Sbjct: 387 IGFQKVQG--------VTILGDLVLKDKIFVYDLVRQRIGWANYDCS 425
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 115/397 (28%), Positives = 154/397 (38%), Gaps = 64/397 (16%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y ++L GTP T I DTGS L W +C C + P F P SSS
Sbjct: 91 YVVTLGIGTPAVQQTVLI-DTGSDLSWV------QCKPCGAGECYAQKDPLFDPSSSSSY 143
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
+ C + C + C G S A Y ++YG T G+ +ETL
Sbjct: 144 ASVPCDSDACRKLAAGAYGHGCTGVSGGAA----ALCEYGIEYGNRATTTGVYSTETLTL 199
Query: 223 -PSKTVPNFLAGCSILSDRQPA------GIAGFGRSSESLPSQLGLK---KFSYCLLSRK 272
P V +F GC D Q G+ G G + ESL SQ + FSYCL
Sbjct: 200 KPGVVVADFGFGCG---DHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCL---- 252
Query: 273 FDDAPVSSN---LVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
P S L L P S S GLS+TP + P + FY V L I VG
Sbjct: 253 ---PPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLP-----SVPTFYIVTLTGISVG 304
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
+ IP S + G+++DSG+ T + + A+ F M Y R
Sbjct: 305 GAPLAIPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEY-RLLPPSNG 357
Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF---TDNAAG 445
L C+D +G +V +P + L F GGA + L + G CL TDNA G
Sbjct: 358 GVLDTCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVLVDG----CLAFAGAGTDNAIG 413
Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ + F + +D GF C
Sbjct: 414 ---------IIGNVNQRTFEVLYDSGKGTVGFRAGAC 441
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 121/423 (28%), Positives = 184/423 (43%), Gaps = 82/423 (19%)
Query: 89 NSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVD 148
NS + +V YG + +L GTP + I DTGS++ + PC+S C PN
Sbjct: 63 NSTMPLHGAVKDYGYFYATLYLGTPAKKFA-VIVDTGSTMTYVPCSS---CGSGCGPN-- 116
Query: 149 PSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGL 208
+ AF P+ SS++ I C +PKCS RC GCS + T SY Q
Sbjct: 117 -HQDAAFDPEASSTASRISCTSPKCSC-----GSPRC-GCSTQQCT---YTRSYAEQSS- 165
Query: 209 GFTAGLLLSETLRFPSKTVPN--FLAGCSILSD----RQPA-GIAGFGRSSESLPSQL-- 259
++G+LL + L +P + GC RQ A G+ G G S S+ +QL
Sbjct: 166 --SSGILLEDVLAL-HDGLPGAPIIFGCETRETGEIFRQRADGLFGLGNSDASVVNQLVK 222
Query: 260 -GL--KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPFYKNPVGSSSA 313
G+ FS C + D A L+L GD++ PG L YTP +S+
Sbjct: 223 AGVIDDVFSLCFGMVEGDGA-----LLL------GDAEVPGSISLQYTPLL-----TSTT 266
Query: 314 FGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
+Y V + + V + + + S D G ++DSG+TFT+M P+F+A F
Sbjct: 267 HPFYYNVKMLSLAVEGQLLPVSQSLF----DQGYGTVLDSGTTFTYMPSPVFKA----FA 318
Query: 374 RQMGNYSRAADVEKKSGLRP-----CF-------DISGKKSVYLPELILKFKGGAKMALP 421
+ Y+ + +++ G P CF D+ SV+ P + ++F G + L
Sbjct: 319 GAVEKYALSHGLKRVPGPDPQFDDICFGQAPSHDDLEALSSVF-PSMEVQFDQGTSLVLG 377
Query: 422 PENYFAL--VGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
P NY + + CL +F + AG +LG +N + +D AN R GF
Sbjct: 378 PLNYLFVHTFNSGKYCLGVFDNGRAG--------TLLGGITFRNVLVRYDRANQRVGFGP 429
Query: 480 QKC 482
C
Sbjct: 430 ALC 432
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 106/388 (27%), Positives = 161/388 (41%), Gaps = 70/388 (18%)
Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
DTGS ++W C + C +C + + F SS++ LI C + C+ V
Sbjct: 85 IDTGSDILWVNCNT---CSNCPQSSQLGIELNFFDTVGSSTAALIPCSDLICT----SGV 137
Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF--------PSKTVPNFLA 232
+ CSPR C SY QYG G T+G +S+ + F + +
Sbjct: 138 QGAAAECSPRVNQC-----SYTFQYGDGSGTSGYYVSDAMYFNLIMGQPPAVNSTATIVF 192
Query: 233 GCSI-------LSDRQPAGIAGFGRSSESLPSQL---GL--KKFSYCLLSRKFDDAPVSS 280
GCSI +D+ GI GFG S+ SQL G+ K FS+CL D
Sbjct: 193 GCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKG----DGNGGG 248
Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI-PYSYL 339
LVL G+ P + Y+P + Y + L+ I V + + I P +
Sbjct: 249 ILVL------GEILEPSIVYSPLVPSQ--------PHYNLNLQSIAVNGQPLPINPAVFS 294
Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG 399
+ S+ GG IVD G+T ++ ++ + + +R + S C+ +S
Sbjct: 295 I--SNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSAR----QTNSKGNQCYLVST 348
Query: 400 KKSVYLPELILKFKGGAKMALPPENYFA----LVGNEVLCLILFTDNAAGPALGRGPAII 455
P + L F+GGA M L PE Y L G E+ C + F G A I
Sbjct: 349 SIGDIFPLVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWC-VGFQKLQEG-------ASI 400
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
LGD L++ + +D+A R G+A C+
Sbjct: 401 LGDLVLKDKIVVYDIAQQRIGWANYDCS 428
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 111/400 (27%), Positives = 168/400 (42%), Gaps = 61/400 (15%)
Query: 93 KTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRI 152
K+ LS+++ G Y + + GTP A +FDTGS W C CV + +
Sbjct: 155 KSGLSLNT-GNYVVPIRLGTP-AARFTVVFDTGSDTTWVQCQP---CVAYCYQQ----KE 205
Query: 153 PAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FT 211
P F P +S++ I C + CS ++++R GCS + Y +QYG G +T
Sbjct: 206 PLFTPTKSATYANISCTSSYCS-----DLDTR--GCSGGHCL-------YAVQYGDGSYT 251
Query: 212 AGLLLSETLRFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLKK---FS 265
G +TL TV +F GC + + AG+ G GR S+P Q K F+
Sbjct: 252 VGFYAQDTLTLGYDTVKDFRFGCGEKNRGLFGKAAGLMGLGRGKTSVPVQAYDKYSGVFA 311
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
YC+ A S LD GPG+ + L+ P FYYVG+ I
Sbjct: 312 YCI------PATSSGTGFLDFGPGAPAAANARLTPMLVDNGPT--------FYYVGMTGI 357
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
VG + IP + + G +VDSG+ T + +E + F + M
Sbjct: 358 KVGGHLLSIPATVF-----SDAGALVDSGTVITRLPPSAYEPLRSAFAKGMEGLGYKT-A 411
Query: 386 EKKSGLRPCFDISG-KKSVYLPELILKFKGGAKMALPPEN--YFALVGNEVLCLILFTDN 442
S L C+D++G + S+ LP + L F+GGA + + Y A V L D+
Sbjct: 412 PAFSILDTCYDLTGYQGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLAFAANDDD 471
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + + + +DL GFA C
Sbjct: 472 T--------DMTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 100/388 (25%), Positives = 152/388 (39%), Gaps = 47/388 (12%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + + GTPPQ ++ I D LVW C++ RC + +P F+P SS+
Sbjct: 62 YVANFTIGTPPQPASA-IVDVAGELVWTQCSACRRCFKQD--------LPVFVPNASSTF 112
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ C C I P + CS C P L+ G T+G ++T
Sbjct: 113 KPEPCGTAVCESI--PT-----RSCS--GDVCSYKGPPTQLR---GNTSGFAATDTFAIG 160
Query: 224 SKTVPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
+ TV GC + SD P+G G GR+ SL +Q+ L +FSYCL R S
Sbjct: 161 TATV-RLAFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCLSPRNTGK---S 216
Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
S L L G + + S PF K S +Y + L I G+ + S
Sbjct: 217 SRLFL--GSSAKLAGGESTSTAPFIKTSPDDDSH--HYYLLSLDAIRAGNTTIATAQS-- 270
Query: 340 VPGSDGNGGVIV-DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF-DI 397
GG++V + S F+ + + A K +G + CF
Sbjct: 271 -------GGILVMHTVSPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKA 323
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPAII 455
+G P+L+ F+G A + +PP Y VG E C + + A G +
Sbjct: 324 AGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILS-MAWLNRTGLEGVSV 382
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
LG Q ++ + +DL + F C+
Sbjct: 383 LGSLQQEDVHFLYDLKKETLSFEPADCS 410
>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
gi|224030351|gb|ACN34251.1| unknown [Zea mays]
Length = 342
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 109/382 (28%), Positives = 159/382 (41%), Gaps = 79/382 (20%)
Query: 132 PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPR 191
PC S YR +D P F PK SSS ++ C + C+ + G G
Sbjct: 5 PCVSCYRQLD-----------PVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDG---- 49
Query: 192 NKTCPLACPSYLLQY-GLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQPA----GIA 246
AC Y +Y G G T G L + L + GCS S PA G+
Sbjct: 50 ------AC-QYTYKYSGHGVTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLV 102
Query: 247 GFGRSSESLPSQLGLKKFSYCL---LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPF 303
G GR SL SQL + +F YCL +SR S LVL G + + + ++ T
Sbjct: 103 GLGRGPLSLVSQLSVHRFMYCLPPPMSR------TSGKLVLGAGADAVRNMSDRVTVT-- 154
Query: 304 YKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG---------------- 347
+ SS+ + +YY+ L + VG + + P S G G
Sbjct: 155 ----MSSSTRYPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGA 210
Query: 348 ---GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI---SGKK 401
G+IVD ST +F+E L++ +A + ++ RA + GL CF + G
Sbjct: 211 NAYGMIVDVASTISFLETSLYDELADDLEEEI-RLPRATP-SLRLGLDLCFILPEGVGMD 268
Query: 402 SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI-ILGDFQ 460
VY+P + L F G + L + F G ++CL+ +GR + ILG+FQ
Sbjct: 269 RVYVPTVSLSFD-GRWLELDRDRLFVTDG-RMMCLM----------IGRTSGVSILGNFQ 316
Query: 461 LQNFYLEFDLANDRFGFAKQKC 482
LQN + F+L + FAK C
Sbjct: 317 LQNMRVLFNLRRGKITFAKASC 338
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 99/388 (25%), Positives = 154/388 (39%), Gaps = 47/388 (12%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + + GTPPQ ++ I D LVW C++ RC + +P F+P SS+
Sbjct: 45 YVANFTIGTPPQPASA-IVDVAGELVWTQCSACRRCFKQD--------LPVFVPNASSTF 95
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ C C I P + CS C P L+ G T+G ++T
Sbjct: 96 KPEPCGTAVCESI--PT-----RSCS--GDVCSYKGPPTQLR---GNTSGFAATDTFAIG 143
Query: 224 SKTVPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
+ TV GC + SD P+G G GR+ SL +Q+ L +FSYCL R S
Sbjct: 144 TATV-RLAFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCLSPRNTGK---S 199
Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
S L L G + + + S PF K + +Y + L I G+ + S
Sbjct: 200 SRLFL--GSSAKLAGSESTSTAPFIKTSPDDDGS--NYYLLSLDAIRAGNTTIATAQS-- 253
Query: 340 VPGSDGNGGVIV-DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF-DI 397
GG++V + S F+ + ++A K +G + CF
Sbjct: 254 -------GGILVMHTVSPFSLLVDSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKA 306
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPAII 455
+G P+L+ F+G A + +PP Y VG E C + + A G +
Sbjct: 307 AGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILS-MAWLNRTGLEGVSV 365
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
LG Q ++ + +DL + F C+
Sbjct: 366 LGSLQQEDVHFLYDLKKETLSFEPADCS 393
>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
Length = 346
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 109/393 (27%), Positives = 169/393 (43%), Gaps = 63/393 (16%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTS-RYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
+ +S GTPP + I DTGS+L W C + + +C D +I F P SS+
Sbjct: 1 MGISLGTPPVFNLVTI-DTGSTLSWVQCKNCQIKCYD---QAAKAGQI--FNPYNSSTYS 54
Query: 165 LIGCQNPKCSWIFGPNVESRCK-GCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
+GC C+ G +++ + GC + TC Y L+YG G ++ G L + L
Sbjct: 55 KVGCSTEACN---GMHMDLAVEYGCVEEDDTCI-----YSLRYGSGEYSVGYLGKDRLTL 106
Query: 223 PS-KTVPNFLAGC--SILSDRQPAGIAGFGRSSESLPSQL----GLKKFSYCLLSRKFDD 275
S +++ NF+ GC L + AGI GFG S S +Q+ FSYC ++
Sbjct: 107 ASNRSIDNFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENE 166
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI- 334
L GP + D + P Y + ++V ++I
Sbjct: 167 GS------LTIGPYARDINLMWTKLIYYDHKPA---------YAIQQLDMMVNGIRLEID 211
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGLR 392
PY Y+ + IVDSG+ T++ P+F+A+ K ++M Y+R D R
Sbjct: 212 PYIYISKMT------IVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDER-----R 260
Query: 393 PCFDISGKKSVY---LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
CF IS S P + +K + + LP EN F N V+C D+A G
Sbjct: 261 ICF-ISNSGSANWNDFPTVEMKLI-RSTLKLPVENAFYESSNNVICSTFLPDDA-----G 313
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+LG+ +++F L FD+ FGF + C
Sbjct: 314 VRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 346
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 125/452 (27%), Positives = 174/452 (38%), Gaps = 102/452 (22%)
Query: 60 ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLS--VHSYGGYSISLSFGTPPQAS 117
A S++RA H Y SL P S + G Y ++ S GTPP
Sbjct: 57 ARRSINRANHF----------------YKYSLANIPQSTVIPDIGEYLMTYSVGTPP-FK 99
Query: 118 TPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIF 177
I DTGS +VW C C + P F P +SSS + I C + C +
Sbjct: 100 LYGIVDTGSDIVWLQCEPCQECYN--------QTTPMFNPSKSSSYKNIPCPSKLCQSM- 150
Query: 178 GPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLS-ETLRFPSK-----TVPNFL 231
C+ +N Y YG +G LS +TL S + PN +
Sbjct: 151 ------EDTSCNDKNYC------EYSTYYGDNSHSGGDLSVDTLTLESTNGLTVSFPNIV 198
Query: 232 AGC---SILS-DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVL 284
GC +ILS + +GI GFG S +QLG KFSYC L+ F + SN
Sbjct: 199 IGCGTNNILSYEGASSGIVGFGSGPASFITQLGSSTGGKFSYC-LTPLFSVTNIQSNATS 257
Query: 285 -----DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
D SGD G+ TP K + FYY+ L VG++ V+I
Sbjct: 258 KLNFGDAATVSGD----GVVTTPILKKDPET------FYYLTLEAFSVGNRRVEIGG--- 304
Query: 340 VPGSDGNGGVIVDSGSTFT--------FMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
VP D G +I+DSG+T T F+E + + V E R D + L
Sbjct: 305 VPNGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLE---------RVDDPTQT--L 353
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C+ + + + P + + FK GA + L P + F V + V CL +
Sbjct: 354 NLCYSVKAEGYDF-PIITMHFK-GADVDLHPISTFVSVADGVFCLAFESSQDHA------ 405
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+ QN + +DL F C
Sbjct: 406 ---IFGNLAQQNLMVGYDLQQKIVSFKPSDCT 434
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 107/406 (26%), Positives = 167/406 (41%), Gaps = 69/406 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTPP+ + DTGS ++W C + +C + +D + + PK SS
Sbjct: 86 GLYYTEVRLGTPPKRFYVQV-DTGSDILWVNCITCDQCPHKSGLGLD---LTLYDPKASS 141
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
+ + C C+ FG R CS N C Y + YG G T G +++ L
Sbjct: 142 TGSTVMCDQGFCADTFG----GRLPKCSA-NVPC-----EYSVTYGDGSSTVGSFVNDAL 191
Query: 221 RFPS-----KTVP---NFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL---- 261
+F +T P + + GC S + GI GFG ++ S+ SQL
Sbjct: 192 QFDQVTGDGQTQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKV 251
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGS---GDSKTPGLSYTPFYKNPVGSSSAFGEF 317
K F++CL + K G G GD P + TP +
Sbjct: 252 KKIFAHCLDTIK--------------GGGIFAIGDVVQPKVKTTPLVADK--------PH 289
Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
Y V L+ I VG +++P PG G I+DSG+T T++ +F+ V +
Sbjct: 290 YNVNLKTIDVGGTTLELPADIFKPGE--KRGTIIDSGTTLTYLPELVFKKVMLAVFNKHQ 347
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
+ + DV+ CF+ SG P L F+ + + P YF GN+V C +
Sbjct: 348 DIT-FHDVQD----FLCFEYSGSVDDGFPTLTFHFEDDLALHVYPHEYFFPNGNDVYC-V 401
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
F + A G+ +++GD L N + +DL N G+ C+
Sbjct: 402 GFQNGALQSKDGK-DIVLMGDLVLSNKLVVYDLENRVIGWTDYNCS 446
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 105/414 (25%), Positives = 165/414 (39%), Gaps = 82/414 (19%)
Query: 98 VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC---VDCNFPNVDPSRIPA 154
V S G Y + G+PP+ + DTGS ++W C C + NF +
Sbjct: 68 VDSVGLYFTKIKLGSPPKEYHVQV-DTGSDILWVNCKPCPECPSKTNLNF------HLSL 120
Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGL 214
F SS+S+ +GC + CS+I S+ C P + C +++ + G
Sbjct: 121 FDVNASSTSKKVGCDDDFCSFI------SQSDSCQP-----AVGCSYHIVYADESTSEGN 169
Query: 215 LLSETLRFPS-----KTVP---NFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQL 259
+ + L +T P + GC SD G+ GFG+S+ S+ SQL
Sbjct: 170 FIRDKLTLEQVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQL 229
Query: 260 GL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPG---SGDSKTPGLSYTPFYKNPVGSS 311
+ FS+CL + K G G G +P + TP N +
Sbjct: 230 AATGDAKRVFSHCLDNVK--------------GGGIFAVGVVDSPKVKTTPMVPNQM--- 272
Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
Y V L + V + +P S + NGG IVDSG+T + L++++ +
Sbjct: 273 -----HYNVMLMGMDVDGTALDLPPSIM-----RNGGTIVDSGTTLAYFPKVLYDSLIET 322
Query: 372 FI-RQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG 430
+ RQ D + CF S V P + +F+ K+ + P +Y +
Sbjct: 323 ILARQPVKLHIVEDTFQ------CFSFSENVDVAFPPVSFEFEDSVKLTVYPHDYLFTLE 376
Query: 431 NEVLCLILFTDNAAGPALG-RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
E+ C F A G G R I+LGD L N + +DL N+ G+A C+
Sbjct: 377 KELYC---FGWQAGGLTTGERTEVILLGDLVLSNKLVVYDLENEVIGWADHNCS 427
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 129/500 (25%), Positives = 196/500 (39%), Gaps = 97/500 (19%)
Query: 11 LFSLLILLFTTDAGAGSSAAT-----VTVPLTPLSTKHYLHHSDS------DPLKILHSL 59
+FSL+I++ + A SAAT TV L H DS +PL+ +
Sbjct: 4 IFSLVIVIIFLISTAVVSAATGPDYGFTVELI---------HRDSPKSPMYNPLENHYHR 54
Query: 60 ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTP 119
+ +L R+ T T ++ I +N G Y + LS GTPP
Sbjct: 55 VADTLRRSISHNTGLVTNTVEAPIYNNR--------------GEYLMKLSVGTPPFPIIA 100
Query: 120 FIFDTGSSLVW---FPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
+ DTGS ++W PCT+ Y+ +P F P +S++ + + C +P CS+
Sbjct: 101 -VADTGSDIIWTQCVPCTNCYQ-----------QDLPMFNPSKSTTYRKVSCSSPVCSFT 148
Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-----VPNF 230
N S C +Y + YG + G +TL S + P
Sbjct: 149 GEDNSCSFKPDC------------TYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRT 196
Query: 231 LAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLV 283
GC + D +GI G G SL Q+G KFSYCL DD +
Sbjct: 197 AIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNK--- 253
Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
L+ G + S + +S TP Y S F FY + L+ + VG + + + G
Sbjct: 254 LNFGSNANVSGSGAVS-TPIYI-----SDKFKSFYSLKLKAVSVGRNNTFYSTANSILG- 306
Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
G +I+DSG+T T + L+ AK + N R D + L CF+ +
Sbjct: 307 -GKANIIIDSGTTLTLLPVDLYHNFAKAISNSI-NLQRTDDPNQF--LEYCFETT-TDDY 361
Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
+P + + F+ GA + L EN V + V+CL A I G+ N
Sbjct: 362 KVPFIAMHFE-GANLRLQRENVLIRVSDNVICL-------AFAGAQDNDISIYGNIAQIN 413
Query: 464 FYLEFDLANDRFGFAKQKCA 483
F + +D+ N F C
Sbjct: 414 FLVGYDVTNMSLSFKPMNCV 433
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 126/510 (24%), Positives = 209/510 (40%), Gaps = 101/510 (19%)
Query: 1 MAACPFSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKIL---H 57
MA+ LI SL++ L A +G S + P HH S P IL H
Sbjct: 1 MASLWTQLISTVSLILSLARWVAVSGDSGNVLLFPSR--------HHEGSRPAMILPLHH 52
Query: 58 SLASSSLSR---ARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPP 114
S+ SSLS RHL+ ++ N+ ++ + G Y+ L GTPP
Sbjct: 53 SVPESSLSHFNPRRHLQGSQ---------SEHHPNARMRLFDDLLRNGYYTTRLWIGTPP 103
Query: 115 QASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS 174
Q I DTGS++ + PC++ C C + P F P+ S + Q + KC+
Sbjct: 104 QRFA-LIVDTGSTVTYVPCST---CKHCG-----SHQDPKFRPEASETYQPV-----KCT 149
Query: 175 WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTV---PNF 230
W +C C K C +Y +Y + ++G+L + + F +++
Sbjct: 150 W--------QC-NCDDDRKQC-----TYERRYAEMSTSSGVLGEDVVSFGNQSELSPQRA 195
Query: 231 LAGCS-----ILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLD 285
+ GC + +++ GI G GR S+ QL KK ++S F +
Sbjct: 196 IFGCENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKK----VISDAFSLCYGGMGVGGG 251
Query: 286 TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS-D 344
G S + +T + +PV S +Y + L++I V K + +L P D
Sbjct: 252 AMVLGGISPPADMVFT--HSDPVRSP-----YYNIDLKEIHVAGKRL-----HLNPKVFD 299
Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD---ISGKK 401
G G ++DSG+T+ ++ F A +++ + R SG P ++ SG +
Sbjct: 300 GKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRI------SGPDPHYNDICFSGAE 353
Query: 402 ------SVYLPELILKFKGGAKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPA 453
S P + + F G K++L PENY CL +F++ G P
Sbjct: 354 INVSQLSKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSN-------GNDPT 406
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+LG ++N + +D + + GF K C+
Sbjct: 407 TLLGGIVVRNTLVMYDREHSKIGFWKTNCS 436
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 112/417 (26%), Positives = 168/417 (40%), Gaps = 71/417 (17%)
Query: 88 SNSLIKTPLSVHSYGG---YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
S+S + P+S +Y G Y + L GTP Q T + DTGS L W C
Sbjct: 97 SSSAVSLPMSSGAYSGTGQYFVKLRVGTPVQEFT-LVADTGSDLTWVKCAG--------- 146
Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
P R+ F PK S S I C + C +V CS C Y
Sbjct: 147 -ASPPGRV--FRPKTSRSWAPIPCSSDTCKL----DVPFTLANCSSPASPCTY---DYRY 196
Query: 205 QYGLGFTAGLLLSE--TLRFPSKTVP---NFLAGCSILSD----RQPAGIAGFGRSSESL 255
+ G G++ +E T+ P V + + GCS D R G+ G + S
Sbjct: 197 KEGSAGARGIVGTESATIALPGGKVAQLKDVVLGCSSSHDGQSFRSADGVLSLGNAKISF 256
Query: 256 PSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
+Q + FSYCL+ AP ++ L GPG +TP + T + +P
Sbjct: 257 ATQAAARFGGSFSYCLVDHL---APRNATGYLAFGPGQ-VPRTPA-TQTKLFLDPEM--- 308
Query: 313 AFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
FY V + I V K + IP S GGVI+DSG+T T + P ++AV
Sbjct: 309 ---PFYGVKVDAIHVAGKALDIPAEVWDAKS---GGVILDSGNTLTVLAAPAYKAVVAAL 362
Query: 373 IRQMGNYSRAADVEKKSGLRP---CFDISGKK---SVYLPELILKFKGGAKMALPPENYF 426
+ + D K P C++ + ++ +P+L ++F G A++ P ++Y
Sbjct: 363 SKHL-------DGVPKVSFPPFEHCYNWTARRPGAPEIIPKLAVQFAGSARLEPPAKSYV 415
Query: 427 ALVGNEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
V V C+ G G P + ++G+ Q EFDL N + F + C
Sbjct: 416 IDVKPGVKCI--------GVQEGEWPGLSVIGNIMQQEHLWEFDLKNMQVRFKQSNC 464
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 100/410 (24%), Positives = 175/410 (42%), Gaps = 75/410 (18%)
Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC---VDCNFPNVDPSRIPAFI 156
S G Y + G+PP+ + DTGS ++W C +C D P + +
Sbjct: 73 SIGLYFTKIKLGSPPKEYYVQV-DTGSDILWVNCAPCPKCPVKTDLGIP------LSLYD 125
Query: 157 PKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACP-SYLLQYGLGFTA-GL 214
K SS+S+ +GC++ CS+I +++TC P SY + YG G T+ G
Sbjct: 126 SKASSTSKNVGCEDAFCSFIM-------------QSETCGAKKPCSYHVVYGDGSTSDGD 172
Query: 215 LLSETLRFPS-----KTVP---NFLAGCS-----ILSDRQPA--GIAGFGRSSESLPSQL 259
+ + + +T P + GC L + A GI GFG+S+ S+ SQL
Sbjct: 173 FVKDNITLDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQL 232
Query: 260 GL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
+ FS+CL D+ + G+ ++P + TP N V
Sbjct: 233 AAGGSVKRIFSHCL-----DNMNGGGIFAI------GEVESPVVKTTPLVPNQV------ 275
Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
Y V L+ + V + + +P S + ++G+GG I+DSG+T ++ L+ ++ ++
Sbjct: 276 --HYNVILKGMDVDGEPIDLPPS--LASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITA 331
Query: 375 QMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL 434
+ + + CF + P + L F+ K+++ P +Y + ++
Sbjct: 332 K-----QQVKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMY 386
Query: 435 CLILFTDNAAGPALGRGP-AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
C F + G G I+LGD L N + +DL N+ G+A C+
Sbjct: 387 C---FGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCS 433
>gi|357440781|ref|XP_003590668.1| Basic 7S globulin [Medicago truncatula]
gi|355479716|gb|AES60919.1| Basic 7S globulin [Medicago truncatula]
Length = 434
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 106/396 (26%), Positives = 173/396 (43%), Gaps = 81/396 (20%)
Query: 121 IFDTGSSLVWFPCTSRY----------RCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
I D G +W C ++Y R C+ D + PK GC N
Sbjct: 63 IVDLGGLFLWVDCENQYISSTYRPARCRSAQCSLAKFDDCGVCFSSPKP-------GCNN 115
Query: 171 PKCSWIFGPNV-ESRCKGCSPRNKTCPLACPSYLLQYGLGFTAG--LLLSETLRFPSKTV 227
CS G +V +S G LA +Q GF G +++S L ++T
Sbjct: 116 NTCSVAPGNSVTQSAMSG--------ELAEDILSIQSSNGFNPGQNVMVSRFLFSCARTF 167
Query: 228 PNFLAGCSILSDRQPAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRK----FDDAPV 278
L G + +G+AG GR+ +LPSQL KKF+ CL S K F D P
Sbjct: 168 --LLEGLA----SGASGMAGLGRNKLALPSQLASAFSFAKKFAICLSSSKGVVLFGDGPY 221
Query: 279 S--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF-----YYVGLRQIIVGSKH 331
N+V D+ L+YTP NP S++AF + Y++G++ I + K
Sbjct: 222 GFLPNVVFDS---------KSLTYTPLLINPF-STAAFAKSEPSAEYFIGVKTIKIDGKV 271
Query: 332 VKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
V + S L + S+G GG + + +T +E +++AV F++ S A ++++
Sbjct: 272 VSLDTSLLSIDSSNGAGGTKISTVDPYTVLEASIYKAVTDAFVKA----SAARNIKRVDS 327
Query: 391 LRP---CF-DISGKK-SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAG 445
+ P C+ +++G + +P + L + + N + +EVLCL G
Sbjct: 328 VAPFEFCYTNVTGTRLGADVPTIELYLQNNVIWRIFGANSMVNINDEVLCL--------G 379
Query: 446 PALG---RGPAIILGDFQLQNFYLEFDLANDRFGFA 478
+G +I++G +QL+N L+FDLA + GF+
Sbjct: 380 FVIGGENTWASIVIGGYQLENNLLQFDLAASKLGFS 415
>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Brachypodium distachyon]
Length = 429
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 112/405 (27%), Positives = 175/405 (43%), Gaps = 72/405 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC-VDCNFPNVDPSRIPAFIPKRS 160
G + + +S GTPP A+ + DTGS+L W C RC + C+ P F P +S
Sbjct: 73 GKFFMDISLGTPPVANLVTV-DTGSTLSWVVCQ---RCQISCH--TTAPEAGSVFDPDKS 126
Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG----FTAGLLL 216
++ +L+GC + C+ + V GC TC Y L+YG G ++AG L
Sbjct: 127 TTYELVGCSSRDCADVQRSLVAPF--GCIEETDTC-----LYSLRYGSGPSGQYSAGRLG 179
Query: 217 SETLRFPSKT--VPNFLAGCSILSDRQ--PAGIAGFGRSSESLPSQLG----LKKFSYCL 268
++ L S + + F+ GCS + +G+ GFG ++ S +Q+ + FSYC
Sbjct: 180 TDKLTLASSSSIIDGFIFGCSGDDSFKGYESGVIGFGGANFSFFNQVARQTNYRAFSYCF 239
Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSS---AFGEFYYVGLRQ 324
GD G LS + K+ + ++ FG+ L+Q
Sbjct: 240 ---------------------PGDHTAEGFLSIGAYPKDELVYTNLIPHFGDRSVYSLQQ 278
Query: 325 I--IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
I +V +++ S ++VDSG+ TF+ GP+F+A +K M
Sbjct: 279 IDMMVDGNRLQVDQSEYTKRM-----MVVDSGTVDTFLLGPVFDAFSKAMASAMQAKGFL 333
Query: 383 ADVEKKSGLRPCFDISGKKSV---YLPELILKFKGGAKMALPPENYFA--LVGNEVLCLI 437
+D G CF +G SV LP + ++F G + LPPEN F L ++ +CL
Sbjct: 334 SDT---VGTETCFRPNGGDSVDSGDLPTVEMRFI-GTTLKLPPENVFHDLLPSHDKICLA 389
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
D A G ILG+ +F + +DL FGF C
Sbjct: 390 FKPDVA-----GVRNVQILGNKATXSFRVVYDLQAMYFGFQAGAC 429
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 114/397 (28%), Positives = 154/397 (38%), Gaps = 64/397 (16%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y ++L GTP T + DTGS L W +C C + P F P SSS
Sbjct: 171 YVVTLGIGTPAVQQT-VLIDTGSDLSWV------QCKPCGAGECYAQKDPLFDPSSSSSY 223
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
+ C + C + C G S A Y ++YG T G+ +ETL
Sbjct: 224 ASVPCDSDACRKLAAGAYGHGCTGVSGGAA----ALCEYGIEYGNRATTTGVYSTETLTL 279
Query: 223 -PSKTVPNFLAGCSILSDRQPA------GIAGFGRSSESLPSQLGLK---KFSYCLLSRK 272
P V +F GC D Q G+ G G + ESL SQ + FSYCL
Sbjct: 280 KPGVVVADFGFGCG---DHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCL---- 332
Query: 273 FDDAPVSSN---LVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
P S L L P S S GLS+TP + P + FY V L I VG
Sbjct: 333 ---PPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLP-----SVPTFYIVTLTGISVG 384
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
+ IP S + G+++DSG+ T + + A+ F M Y R
Sbjct: 385 GAPLAIPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEY-RLLPPSNG 437
Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF---TDNAAG 445
L C+D +G +V +P + L F GGA + L + G CL TDNA G
Sbjct: 438 GVLDTCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVLVDG----CLAFAGAGTDNAIG 493
Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ + F + +D GF C
Sbjct: 494 ---------IIGNVNQRTFEVLYDSGKGTVGFRAGAC 521
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 101/384 (26%), Positives = 156/384 (40%), Gaps = 53/384 (13%)
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
++S G PP + DTGS ++W CT C + DPS+ F P +
Sbjct: 104 NISIGQPPIPQL-VVMDTGSDILWVMCTPCTNCDNDLGLLFDPSKSSTFSPLCKTPCDFE 162
Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT 226
GC+ C P T A S F ++ ET +
Sbjct: 163 GCR------------------CDPIPFTVTYADNS---TASGTFGRDTVVFETTDEGTSR 201
Query: 227 VPNFLAGC--SILSDRQPA--GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNL 282
+ + L GC +I D P GI G +SL ++LG +KFSYC+ D L
Sbjct: 202 ISDVLFGCGHNIGHDTDPGHNGILGLNNGPDSLVTKLG-QKFSYCI-GNLADPYYNYHQL 259
Query: 283 VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPG 342
+L G + G S TPF + FYYV + I VG K + I
Sbjct: 260 ILGEG-----ADLEGYS-TPF--------EVYNGFYYVTMEGISVGEKRLDIAPETFEMK 305
Query: 343 SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS 402
+ GGVI+D+GST TF+ + + ++KE +G R A +EK ++ + +
Sbjct: 306 ENRAGGVIIDTGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDL 365
Query: 403 VYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPA----LGRGPAIILGD 458
V P + F GA +AL ++F + + V C+ + GP + P++I G
Sbjct: 366 VGFPVVTFHFSDGADLALDSGSFFNQLNDNVFCMTV------GPVSSLNIKSKPSLI-GL 418
Query: 459 FQLQNFYLEFDLANDRFGFAKQKC 482
Q++ + +DL N F + C
Sbjct: 419 LAQQSYNVGYDLVNQFVYFQRIDC 442
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 109/407 (26%), Positives = 166/407 (40%), Gaps = 69/407 (16%)
Query: 101 YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
+G Y S+ G+P Q + I DTGS L W C C P+VD + RS
Sbjct: 97 FGEYYTSIKLGSPGQEAI-LIVDTGSELTWLQCLPCKVCA----PSVDT----IYDAARS 147
Query: 161 SSSQLIGCQNPK-CSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
+S + + C N + CS N R C A YG G F+ G L ++
Sbjct: 148 ASYRPVTCNNSQLCS-----NSSQGTYAYCARGSQCQFAA-----FYGDGSFSYGSLSTD 197
Query: 219 TLRFPSK------TVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLK---KFS 265
TL + TV +F GC+ L +GI G +LP QLG + KFS
Sbjct: 198 TLIMETVVGGKPVTVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFS 257
Query: 266 YCLLSRKFDDAPVSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
+C P S+ + TG G+ + + YT + +S +FY+V L
Sbjct: 258 HCF--------PDRSSHLNSTGVVFFGNAELPHEQVQYTSV---ALTNSELQRKFYHVAL 306
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
+ + + S + +L GS VI+DSGS+F+ P + + F++ +
Sbjct: 307 KGVSINSHEL----VFLPRGSV----VILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKH 358
Query: 383 ADVEKKSGLRPCFDISGKK----SVYLPELILKFKGGAKMALPPENYF---ALVGNEVLC 435
+ + L CF +S LP L L F+ G + +P A N V
Sbjct: 359 LEGDSFGDLGTCFKVSNDDIDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARFQNHVKM 418
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
F D G P ++G++Q QN ++E+D+ R GFA+ C
Sbjct: 419 CFAFEDG------GPNPVNVIGNYQQQNLWVEYDIQRSRVGFARASC 459
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 106/392 (27%), Positives = 162/392 (41%), Gaps = 59/392 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y ++L GTP T I DTGS L W +C CN + P + P + P SS+
Sbjct: 127 YVVTLGIGTPAVQQTVLI-DTGSDLSWV------QCKPCNSSSCYPQKDPLYDPTASSTY 179
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-----TAGLLLSE 218
+ C + C + P+ GC+ + T L QYG+ + T G+ +E
Sbjct: 180 APVPCDSKACKDLV-PDAYDH--GCTNSSGTS-------LCQYGIEYGNRDTTVGVYSTE 229
Query: 219 TLRF-PSKTVPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSR 271
TL P +V +F GC ++ + G+ G G + ESL SQ FSYCL
Sbjct: 230 TLTLSPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYCLPPG 289
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
S+ L G + ++ T G +TP + P ++ FY V L + VG K
Sbjct: 290 N------STTGFLALGAPTNNNDTAGFLFTPLHSLPEQAT-----FYLVNLTGVSVGGKP 338
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
+ IP + L +GG+I+DSG+ T + + A+ F M Y L
Sbjct: 339 LDIPPTVL------SGGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPP-NNDDVL 391
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA-GPALGR 450
C++ +G +V +P + L F GGA + L +V +L D A
Sbjct: 392 DTCYNFTGIANVTVPTVALTFDGGATIDL-----------DVPSGVLIQDCLAFAGGASD 440
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G I+G+ + F + +D GF C
Sbjct: 441 GDVGIIGNVNQRTFEVLYDSGRGHVGFRPGAC 472
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 106/411 (25%), Positives = 172/411 (41%), Gaps = 61/411 (14%)
Query: 85 SNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
S N+ +K + S G Y+ L GTPPQ I DTGS++ + PC++ C C
Sbjct: 57 SQLPNAHMKLYDDLLSNGYYTTRLWIGTPPQEFA-LIVDTGSTVTYVPCST---CKQCG- 111
Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
+ P F P+ S+S Q + C NP C N + K C + ++ S +L
Sbjct: 112 ----KHQDPKFQPELSTSYQALKC-NPDC------NCDDEGKLCVYERRYAEMSSSSGVL 160
Query: 205 QYGLGFTAGLLLSETLRFPSKTVPNFLAGCS-----ILSDRQPAGIAGFGRSSESLPSQL 259
L + G +E+ P + V GC L ++ GI G GR S+ QL
Sbjct: 161 SEDL-ISFG---NESQLSPQRAV----FGCENEETGDLFSQRADGIMGLGRGKLSVVDQL 212
Query: 260 GLKKFSYCLLSRKFDDAPVSSN-LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
K + S + V +VL S PG+ ++ + +P S +Y
Sbjct: 213 VDKGVIEDVFSLCYGGMEVGGGAMVL-----GKISPPPGMVFS--HSDPFRSP-----YY 260
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
+ L+Q+ V K +K+ +G G ++DSG+T+ + F A+ I+++ +
Sbjct: 261 NIDLKQMHVAGKSLKLNPKVF----NGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPS 316
Query: 379 YSRAADVEKKSGLRPCFDISGKKSV----YLPELILKFKGGAKMALPPENYF--ALVGNE 432
R + CF +G+ + PE+ ++F G K+ L PENY
Sbjct: 317 LKRIHGPDPNYD-DVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRG 375
Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
CL +F D R +LG ++N + +D ND+ GF K C+
Sbjct: 376 AYCLGIFPD--------RDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCS 418
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 91.7 bits (226), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 115/387 (29%), Positives = 152/387 (39%), Gaps = 61/387 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y +++S GTP A T DTGS + W +C C P R P F P RSSS
Sbjct: 142 YVVTVSLGTPAVAQT-LEVDTGSDVSWV------QCKPCPSPPCYSQRDPLFDPTRSSSY 194
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
+ C CS + GCS C Y++ YG G T G+ S+TL
Sbjct: 195 SAVPCAAASCS-----QLALYSNGCS--GGQC-----GYVVSYGDGSTTTGVYSSDTLTL 242
Query: 223 P-SKTVPNFLAGCSILSDRQPAGIA---GFGRSSESLPSQLGLK---KFSYCLLSRKFDD 275
S + FL GC AG+ G GR +SL SQ FSYCL
Sbjct: 243 TGSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCL------- 295
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
P + N V G G S T G S TP ++S +Y V L I VG + + I
Sbjct: 296 -PPTQNSVGYISLG-GPSSTAGFSTTPLL-----TASNDPTYYIVMLAGISVGGQPLSID 348
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
S G+ +VD+G+ T + + A+ F M Y + L C+
Sbjct: 349 ASVFASGA------VVDTGTVVTRLPPTAYSALRSAFRAAMAPYGYPS-APATGILDTCY 401
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
D + +V LP + + F GGA M L CL A P G A I
Sbjct: 402 DFTRYGTVTLPTISIAFGGGAAMDLGTSGIL-----TSGCL------AFAPTGGDSQASI 450
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
LG+ Q ++F + FD GF C
Sbjct: 451 LGNVQQRSFEVRFD--GSTVGFMPASC 475
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 107/412 (25%), Positives = 174/412 (42%), Gaps = 63/412 (15%)
Query: 85 SNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
S N+ +K + S G Y+ L GTPPQ I DTGS++ + PC++ C C
Sbjct: 57 SQLPNAHMKLYDDLLSNGYYTTRLWIGTPPQEFA-LIVDTGSTVTYVPCST---CKQCG- 111
Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
+ P F P+ S+S Q + C NP C N + K C + ++ S +L
Sbjct: 112 ----KHQDPKFQPELSTSYQALKC-NPDC------NCDDEGKLCVYERRYAEMSSSSGVL 160
Query: 205 QYGLGFTAGLLLSETLRFPSKTVPNFLAGCS-----ILSDRQPAGIAGFGRSSESLPSQL 259
L + G +E+ P + V GC L ++ GI G GR S+ QL
Sbjct: 161 SEDL-ISFG---NESQLSPQRAV----FGCENEETGDLFSQRADGIMGLGRGKLSVVDQL 212
Query: 260 GLKKFSYCLLSRKFDDAPVSSN-LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
K + S + V +VL S PG+ ++ + +P S +Y
Sbjct: 213 VDKGVIEDVFSLCYGGMEVGGGAMVL-----GKISPPPGMVFS--HSDPFRSP-----YY 260
Query: 319 YVGLRQIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
+ L+Q+ V K +K+ P + +G G ++DSG+T+ + F A+ I+++
Sbjct: 261 NIDLKQMHVAGKSLKLNPKVF-----NGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIP 315
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSV----YLPELILKFKGGAKMALPPENYF--ALVGN 431
+ R + CF +G+ + PE+ ++F G K+ L PENY
Sbjct: 316 SLKRIHGPDPNYD-DVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVR 374
Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
CL +F D R +LG ++N + +D ND+ GF K C+
Sbjct: 375 GAYCLGIFPD--------RDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCS 418
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 112/448 (25%), Positives = 190/448 (42%), Gaps = 105/448 (23%)
Query: 66 RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
R RHL+ KP + SN+ ++ + + G Y+ L G+PPQ I DTG
Sbjct: 60 RLRHLQNLVKPHS---------SNARMRLHDDLLTNGYYTTRLWIGSPPQ-EFALIVDTG 109
Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
S++ + PC++ CV C + P F P+ SS+ Q + C N C+
Sbjct: 110 STVTYVPCSN---CVQCG-----NHQDPRFQPELSSTYQPVKC-NADCN----------- 149
Query: 186 KGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF--PSKTVPN-FLAGCSILSD-- 239
C C +Y +Y + ++G+L + + F S+ VP + GC +
Sbjct: 150 --CDENGVQC-----TYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETMESGD 202
Query: 240 ---RQPAGIAGFGRSSESLPSQLGLK-----KFSYCLLSRKFDDAPVSSNLVLDTGPGS- 290
++ GI G GR + S+ QL K FS C +D G G+
Sbjct: 203 LYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGG-------------MDVGGGAM 249
Query: 291 ---GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI-PYSYLVPGSDGN 346
G S PG+ ++ + +P S +Y + L++I V K +K+ P ++ DG
Sbjct: 250 VLGGISSPPGMVFS--HSDPSRSP-----YYNIELKEIHVAGKPLKLNPRTF-----DGK 297
Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP-----CFDISGKK 401
G I+DSG+T+ + + A ++++ + +++ SG P CF +G+
Sbjct: 298 YGAILDSGTTYAYFPEKAYYAFKDAIMKKI------SFLKQISGPDPNFKDICFSGAGRD 351
Query: 402 SVYL----PELILKFKGGAKMALPPENYF--ALVGNEVLCLILFTDNAAGPALGRGPAII 455
L PE+ + F G K++L PENY + CL +F + G +
Sbjct: 352 VTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKN-------GNDQTTL 404
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
LG ++N + ++ N GF K C+
Sbjct: 405 LGGIIVRNTLVTYNRENSTIGFWKTNCS 432
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 113/399 (28%), Positives = 160/399 (40%), Gaps = 62/399 (15%)
Query: 91 LIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS 150
L++TP Y + GTPPQ DT + W PC+ C C P+
Sbjct: 102 LLQTPT-------YVVRARLGTPPQQLL-LAVDTSNDAAWIPCSG---CAGC------PT 144
Query: 151 RIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF 210
P F P S S + + C +P CS P+ CS K+C + L Y
Sbjct: 145 TTP-FNPAASKSYRAVPCGSPACSRAPNPS-------CSLNTKSC-----GFSLTYADSS 191
Query: 211 TAGLLLSETLRFPSKTVPNFLAGC---SILSDRQPAGIAGFGRSSESLPSQ---LGLKKF 264
L ++L + V ++ GC + + P G+ G GR S SQ + F
Sbjct: 192 LEAALSQDSLAVANDVVKSYTFGCLQKATGTATPPQGLLGLGRGPLSFLSQTKDMYEGTF 251
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLR 323
SYCL S F S L L G P + TP NP SS YYV +
Sbjct: 252 SYCLPS--FKSLNFSGTLRL------GRKGQPLRIKTTPLLVNPHRSS-----LYYVSMT 298
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
I VG K V IP + L G ++DSG+ FT + P + AV E R++ R A
Sbjct: 299 GIRVGKKVVPIPPAALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRI----RGA 354
Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNA 443
+ G C++ +V P + F G ++ LP +N LV + A
Sbjct: 355 PLSSLGGFDTCYNT----TVKWPPVTFMFT-GMQVTLPADN---LVIHSTYGTTSCLAMA 406
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A P ++ Q QN + FD+ N R GFA+++C
Sbjct: 407 AAPDGVNTVLNVIASMQQQNHRILFDVPNGRVGFAREQC 445
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 116/407 (28%), Positives = 179/407 (43%), Gaps = 68/407 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTPP+ I DTGS ++W C S C C + ++ F P+ SS
Sbjct: 75 GLYYTKVKLGTPPREFYVQI-DTGSDVLWVSCGS---CNGCPQTSGLQIQLNYFDPRSSS 130
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
+S LI C + +C V++ CS +N C +Y QYG G T+G +S+ +
Sbjct: 131 TSSLISCSDRRCR----SGVQTSDASCSSQNNQC-----TYTFQYGDGSGTSGYYVSDLM 181
Query: 221 RFP--------SKTVPNFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL---- 261
F + + + + GCSIL S+R GI GFG+ S+ SQL L
Sbjct: 182 HFAGIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIA 241
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
+ FS+CL K D++ LVL G+ P + Y+P ++ Y +
Sbjct: 242 PRVFSHCL---KGDNSG-GGVLVL------GEIVEPNIVYSPLVQSQ--------PHYNL 283
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L+ I V + V P + V + N G IVDSG+T + L E F+ +
Sbjct: 284 NLQSISVNGQIV--PIAPAVFATSNNRGTIVDSGTTLAY----LAEEAYNPFVNAITALV 337
Query: 381 RAADVEKKSGLRPCFDISGKKSVYL-PELILKFKGGAKMALPPENYFA---LVGNEVLCL 436
+ S C+ I+ +V + P++ L F GGA + L P++Y +G +
Sbjct: 338 PQSVRSVLSRGNQCYLITTSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNYIGEGSVWC 397
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I F G ++ ILGD L++ +DLA R G+A C+
Sbjct: 398 IGF-QRIPGQSI-----TILGDLVLKDKIFVYDLAGQRIGWANYDCS 438
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 110/391 (28%), Positives = 160/391 (40%), Gaps = 64/391 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCT-SRYRCVDCNFPNVDPSRIPAFIPKRS 160
G Y I++ FGTP + T +FDTGS + W C RC + P F P S
Sbjct: 14 GNYVITVGFGTPTRTQT-VVFDTGSDVNWLQCKPCAVRCY--------AQQEPLFDPSLS 64
Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
S+ + + C P C V +GCS + TC Y + YG G T G L +T
Sbjct: 65 STYRNVSCTEPAC-------VGLSTRGCS--SSTCL-----YGVFYGDGSSTIGFLAMDT 110
Query: 220 LRF-PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSE-SLPSQLG---LKKFSYCLLSR 271
P++ NF+ GC + + AG+ G GRSS SL SQ+ FSYCL S
Sbjct: 111 FMLTPAQKFKNFIFGCGQNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPST 170
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
S+ L+ G TPG YT + + Y++ L I VG
Sbjct: 171 S------SATGYLNIG---NPQNTPG--YTAMLTD-----TRVPTLYFIDLIGISVGGTR 214
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
+ + + + G I+DSG+ T + + A+ M Y+ A V L
Sbjct: 215 LSLSSTVF-----QSVGTIIDSGTVITRLPPTAYSALKTAVRAAMTQYTLAPAVTI---L 266
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C+D S SV P ++L F G + +P F + + +CL F N +G
Sbjct: 267 DTCYDFSRTTSVVYPVIVLHFA-GLDVRIPATGVFFVFNSSQVCL-AFAGNTDSTMIG-- 322
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + +D R GF+ C
Sbjct: 323 ---IIGNVQQLTMEVTYDNELKRIGFSAGAC 350
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 129/500 (25%), Positives = 196/500 (39%), Gaps = 97/500 (19%)
Query: 11 LFSLLILLFTTDAGAGSSAAT-----VTVPLTPLSTKHYLHHSDS------DPLKILHSL 59
+FSL+I++ + A SAAT TV L H DS +PL+ +
Sbjct: 4 IFSLVIVIIFLISTAVVSAATGPDYGFTVELI---------HRDSPKSPMYNPLENHYHR 54
Query: 60 ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTP 119
+ +L R+ T T ++ I +N G Y + LS GTPP
Sbjct: 55 VADTLRRSISHNTGLVTNTVEAPIYNNR--------------GEYLMKLSVGTPPFPIIA 100
Query: 120 FIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
+ DTGS ++W PCT+ Y+ +P F P +S++ + + C +P CS+
Sbjct: 101 -VADTGSDIIWTQCEPCTNCYQ-----------QDLPMFNPSKSTTYRKVSCSSPVCSFT 148
Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT-----VPNF 230
N S C +Y + YG + G +TL S + P
Sbjct: 149 GEDNSCSFKPDC------------TYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRT 196
Query: 231 LAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLV 283
GC + D +GI G G SL Q+G KFSYCL DD +
Sbjct: 197 AIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNK--- 253
Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
L+ G + S + +S TP Y S F FY + L+ + VG + + + G
Sbjct: 254 LNFGSNANVSGSGAVS-TPIYI-----SDKFKSFYSLKLKAVSVGRNNTFYSTANSILG- 306
Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
G +I+DSG+T T + L+ AK + N R D + L CF+ +
Sbjct: 307 -GKANIIIDSGTTLTLLPVDLYHNFAKAISNSI-NLQRTDDPNQF--LEYCFETT-TDDY 361
Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
+P + + F+ GA + L EN V + V+CL A I G+ N
Sbjct: 362 KVPFIAMHFE-GANLRLQRENVLIRVSDNVICL-------AFAGAQDNDISIYGNIAQIN 413
Query: 464 FYLEFDLANDRFGFAKQKCA 483
F + +D+ N F C
Sbjct: 414 FLVGYDVTNMSLSFKPMNCV 433
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 114/407 (28%), Positives = 168/407 (41%), Gaps = 72/407 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTPP+ I DTGS ++W CTS C C + ++ F P SS
Sbjct: 82 GLYYTKVKLGTPPREFNVQI-DTGSDVLWVSCTS---CNGCPKTSELQIQLSFFDPGVSS 137
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
S+ L+ C + +C F ES GCSP N C SY +YG G T+G +S+ +
Sbjct: 138 SASLVSCSDRRCYSNF--QTES---GCSP-NNLC-----SYSFKYGDGSGTSGYYISDFM 186
Query: 221 RFPSKTVPN--------FLAGCSILSD-------RQPAGIAGFGRSSESLPSQLGL---- 261
F + F+ GCS L R GI G G+ S S+ SQL +
Sbjct: 187 SFDTVITSTLAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLA 246
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
+ FS+CL D +VL G K P YTP + Y V
Sbjct: 247 PRVFSHCLKG----DKSGGGIMVL------GQIKRPDTVYTPLVPSQ--------PHYNV 288
Query: 321 GLRQIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
L+ I V + + I P + + DG I+D+G+T ++ + + + Y
Sbjct: 289 NLQSIAVNGQILPIDPSVFTIATGDGT---IIDTGTTLAYLPDEAYSPFIQAVANAVSQY 345
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENY---FALVGNEVLCL 436
R E CF+I+ P++ L F GGA M L P Y F+ G+ + C+
Sbjct: 346 GRPITYESYQ----CFEITAGDVDVFPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCI 401
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ ILGD L++ + +DL R G+A+ C+
Sbjct: 402 -------GFQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 131/468 (27%), Positives = 193/468 (41%), Gaps = 90/468 (19%)
Query: 37 TPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPL 96
+PLS + +H+D D L+ + S S+SR KTK +I S + N L+
Sbjct: 43 SPLSPLYNPNHTDFDRLR---NAFSRSISRVNVFKTKA------VDINS-FQNDLVPNG- 91
Query: 97 SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVW---FPCTSRYRCVDCNFPNVDPSRIP 153
G Y + +S GTP I DTGS L W PC YR + P
Sbjct: 92 -----GEYFMKMSIGTP-LVEVIVIADTGSDLTWVQCLPCDPCYR-----------QKSP 134
Query: 154 AFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTA 212
F P RSSS + + C + C+ + + + C+ C Y YG +T
Sbjct: 135 LFDPSRSSSYRHMLCGSRFCNAL-----DVSEQACTMDTNIC-----EYHYSYGDKSYTN 184
Query: 213 GLLLSETLRFPSKT-----VPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLG--L 261
G L +E S + + + GC + D +GI G G + SL SQL +
Sbjct: 185 GNLATEKFTIGSTSSRPVHLSPIVFGCGTGNGGTFDELGSGIVGLGGGALSLVSQLSSII 244
Query: 262 K-KFSYCLLSRKFDDAPVSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSSSAFGEF 317
K KFSYCL+ + + V+S + T G TP +S P +
Sbjct: 245 KGKFSYCLVPLS-EQSNVTSKIKFGTDSVISGPQVVSTPLVSKQP------------DTY 291
Query: 318 YYVGLRQIIVGSKHVKIPYSY-LVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
YYV L I VG+K ++PY+ L+ G+ G VI+DSG+T TF++ F E R +
Sbjct: 292 YYVTLEAISVGNK--RLPYTNGLLNGNVEKGNVIIDSGTTLTFLDSEFF----TELERVL 345
Query: 377 GNYSRAADVEKKSGL-RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLC 435
+A V GL CF +G + LP + + F A + L P N F ++LC
Sbjct: 346 EETVKAERVSDPRGLFSVCFRSAGD--IDLPVIAVHFN-DADVKLQPLNTFVKADEDLLC 402
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ + N G I G+ +F + +DL F C
Sbjct: 403 FTMISSNQIG---------IFGNLAQMDFLVGYDLEKRTVSFKPTDCT 441
>gi|242095592|ref|XP_002438286.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
gi|241916509|gb|EER89653.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
Length = 495
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 92/340 (27%), Positives = 136/340 (40%), Gaps = 59/340 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDC----NFPNVDPSRIPAFIPKR 159
Y++ +GTP Q P FD S RC C + + AF P
Sbjct: 138 YTVLAGYGTPAQ-QLPLFFDVSG-------MSNMRCKPCFSGSSGGETTTTCDVAFDPSM 189
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSET 219
SSS + + C +P C CS +C L F G ++ +T
Sbjct: 190 SSSFRSVLCGSPDCGG----------HSCSAGG-----SCTFTLQNSTFVFGNGTIVMDT 234
Query: 220 LRF-PSKTVPNFLAGC-----SILSDRQPAGIAGFGRSSESLPSQL------GLKKFSYC 267
L PS T NF GC + +D G S SL +++ G+ FSYC
Sbjct: 235 LTLSPSATFENFAVGCMQLDNDLFTDGVAVGNIDLSLSRHSLATRVLNSSPPGMAAFSYC 294
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGD-SKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
L A ++ L P D S G+ Y P NP G + FYYV L I
Sbjct: 295 L------PADTDTHGFLTIAPALSDYSDHAGVKYVPLVTNPTGPN-----FYYVDLVAIA 343
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
+ + + IP + GNG ++DS S FT++ P++ A+ EF + M Y V
Sbjct: 344 INGEDLPIPPALFT----GNG-TMIDSQSAFTYLNPPIYAALRDEFRKAMLQYQ---PVP 395
Query: 387 KKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF 426
GL C++ + +++YLP++ L+F G M L +
Sbjct: 396 AFGGLDTCYNFTLAENIYLPDITLRFSNGETMDLDDRQFM 435
>gi|125579874|gb|EAZ21020.1| hypothetical protein OsJ_36669 [Oryza sativa Japonica Group]
Length = 382
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 81/250 (32%), Positives = 118/250 (47%), Gaps = 28/250 (11%)
Query: 242 PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGP---GSGDSKTPGL 298
P+G+ G GR SL SQ G KFSYC L+ F + + +L + G GD T
Sbjct: 151 PSGLMGLGRGRLSLVSQTGATKFSYC-LTPYFHNNGATGHLFVGASASLGGHGDVMT--- 206
Query: 299 SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSY-----LVPGSDGNGGVIVDS 353
T F K P GS FYY+ L + VG + IP + + PG +GGVI+DS
Sbjct: 207 --TQFVKGPKGS-----PFYYLPLIGLTVGETRLPIPATVFDLREVAPGL-FSGGVIIDS 258
Query: 354 GSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFK 413
GS FT + ++A+A E ++ A + G C V +P ++ F+
Sbjct: 259 GSPFTSLVHDAYDALASELAARLNGSLVAPPPDADDGAL-CVARRDVGRV-VPAVVFHFR 316
Query: 414 GGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
GGA MA+P E+Y+A V +AGP + ++G++Q QN + +DLAN
Sbjct: 317 GGADMAVPAESYWAPVDKAA---ACMAIASAGPYRRQS---VIGNYQQQNMRVLYDLANG 370
Query: 474 RFGFAKQKCA 483
F F C+
Sbjct: 371 DFSFQPADCS 380
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 121/418 (28%), Positives = 173/418 (41%), Gaps = 61/418 (14%)
Query: 77 KTKDSNIGSNY--SNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCT 134
K K +G + S+S+ TP + + G Y L GTP S + DTGSSL W C+
Sbjct: 102 KKKAGGVGGSQASSSSVPLTPGASVAVGNYVTRLGLGTP-ATSYVMVVDTGSSLTWLQCS 160
Query: 135 SRYRC-VDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNK 193
C V C+ P F P+ S + + C + +C + + CS N
Sbjct: 161 P---CSVSCHR-----QAGPVFDPRASGTYAAVQCSSSECGELQAATLNP--SACSVSNV 210
Query: 194 TCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIAGFG 249
Y YG ++ G L +T+ F S + P F GC ++ + AG+ G
Sbjct: 211 CI------YQASYGDSSYSVGYLSKDTVSFGSGSFPGFYYGCGQDNEGLFGRSAGLIGLA 264
Query: 250 RSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYK 305
++ SL QL FSYCL P SS G S S PG SYTP
Sbjct: 265 KNKLSLLYQLAPSLGYAFSYCL--------PTSSAAA---GYLSIGSYNPGQYSYTP--- 310
Query: 306 NPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF 365
+ SSS Y+V L I V + +P P + I+DSG+ T + ++
Sbjct: 311 --MASSSLDASLYFVTLSGISVAGAPLAVP-----PSEYRSLPTIIDSGTVITRLPPNVY 363
Query: 366 EAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENY 425
A+++ M S A S L CF S + +P + + F GGA +AL P N
Sbjct: 364 TALSRAVAAAM--ASAAPRAPTYSILDTCFRGS-AAGLRVPRVDMAFAGGATLALSPGNV 420
Query: 426 FALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
V + CL A P G I+G+ Q Q F + +D+A R GFA C+
Sbjct: 421 LIDVDDSTTCL------AFAPT---GGTAIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 469
>gi|224066523|ref|XP_002302122.1| predicted protein [Populus trichocarpa]
gi|222843848|gb|EEE81395.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 104/410 (25%), Positives = 168/410 (40%), Gaps = 89/410 (21%)
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
PQ + D G +W VDC+ V + PA C + C
Sbjct: 54 PQVPINLVVDLGGQFLW---------VDCDKNYVSSTYRPA------------RCGSALC 92
Query: 174 SWIFGPNVESRCKGCS-----PR----NKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS 224
S +R GC PR N TC + + + + G G L ++ + S
Sbjct: 93 SL-------ARAGGCGDCFSGPRPGCNNNTCGVIPDNTVTRTATG---GELATDVVSVNS 142
Query: 225 K---------TVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGL-----KKFS 265
+VP FL C+ Q G+AG GR+ + PSQ +KF+
Sbjct: 143 TNGSNPGREASVPRFLFSCAPTFLLQGLASGVVGMAGLGRTRIAFPSQFASAFSFNRKFA 202
Query: 266 YCLLSRKFDDAPVSSNLVLDTGP----GSGDSKTPGLSYTPFYKNPVGSSSAFGE----- 316
CL S AP ++ GP + + LS+TP + NPV ++SAF +
Sbjct: 203 ICLTS----PAPAKGVIIFGDGPYNFLPNIQLTSQSLSFTPLFINPVSTASAFSQGEPSA 258
Query: 317 FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
Y++G++ I + K V + + L S G GG + + + +T +E +F AV + FI
Sbjct: 259 EYFIGVKSIRISDKTVPLNATLLSIDSQGKGGTKISTVNPYTVLESSIFNAVTRAFI--- 315
Query: 377 GNYSRAADVEKKSGLRP---CFD----ISGKKSVYLPELILKFKG-GAKMALPPENYFAL 428
N S A ++ + + + P CF S + +P + L + + N
Sbjct: 316 -NESAARNITRVASVAPFDVCFSSDNIFSTRLGAAVPTISLVLQNENVIWRIFGANSMVQ 374
Query: 429 VGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
V + VLCL F + + P +I++G +QL++ +FDLA R GF+
Sbjct: 375 VSDNVLCL-GFVNGGSNPTT----SIVIGGYQLEDNLFQFDLAASRLGFS 419
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 108/418 (25%), Positives = 170/418 (40%), Gaps = 62/418 (14%)
Query: 90 SLIKTPLSVHSY---GGYSISLSFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNF 144
S PL+ +Y G Y + GTP Q PF+ DTGS L W C R +
Sbjct: 93 SAFAMPLTSGAYTGTGQYFVQFRVGTPAQ---PFVLVADTGSDLTWVKCRGR----RASS 145
Query: 145 PNVDPSRIP-AFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYL 203
P+ P P F P S S I C + C ++ + G +P P C Y
Sbjct: 146 PDASPLASPRVFRPANSKSWAPIPCSSDTCKSYVPFSLANCSAGTTP-----PAPC-GYD 199
Query: 204 LQY-------GLGFT--AGLLLSETLRFPSKTVPNFLAGCSILSDRQP----AGIAGFGR 250
+Y G+ T A + LS + + + GC+ D Q G+ G
Sbjct: 200 YRYKDKSSARGVVGTDAATIALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSSDGVLSLGN 259
Query: 251 SSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNP 307
S+ S S+ + +FSYCL+ AP ++ L GP G +++P + P
Sbjct: 260 SNISFASRAAARFGGRFSYCLVDHL---APRNATSYLTFGP-------VGAAHSP-SRTP 308
Query: 308 VGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEA 367
+ + FY V + + V K + IP V NGG I+DSG++ T + P ++A
Sbjct: 309 LLLDAQVAPFYAVTVDAVSVAGKALNIPAE--VWDVKKNGGAILDSGTSLTILATPAYKA 366
Query: 368 VAKEFIRQMGNYSRAADVEKKSGLRPCFDISG-KKSVYLPELILKFKGGAKMALPPENYF 426
V +Q+ R C++ + ++ +P L ++F G A++ P ++Y
Sbjct: 367 VVAALSKQLARVPRV----TMDPFEYCYNWTATRRPPAVPRLEVRFAGSARLRPPTKSYV 422
Query: 427 ALVGNEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
V C+ G G P + ++G+ Q EFDLAN F + +CA
Sbjct: 423 IDAAPGVKCI--------GLQEGVWPGVSVIGNILQQEHLWEFDLANRWLRFQESRCA 472
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 112/448 (25%), Positives = 190/448 (42%), Gaps = 105/448 (23%)
Query: 66 RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
R RHL+ KP + SN+ ++ + + G Y+ L G+PPQ I DTG
Sbjct: 60 RLRHLQNLVKPHS---------SNARMRLHDDLLTNGYYTTRLWIGSPPQ-EFALIVDTG 109
Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
S++ + PC++ CV C + P F P+ SS+ Q + C N C+
Sbjct: 110 STVTYVPCSN---CVQCG-----NHQDPRFQPELSSTYQPVKC-NADCN----------- 149
Query: 186 KGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF--PSKTVPN-FLAGCSILSD-- 239
C C +Y +Y + ++G+L + + F S+ VP + GC +
Sbjct: 150 --CDENGVQC-----TYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETMESGD 202
Query: 240 ---RQPAGIAGFGRSSESLPSQLGLK-----KFSYCLLSRKFDDAPVSSNLVLDTGPGS- 290
++ GI G GR + S+ QL K FS C +D G G+
Sbjct: 203 LYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGG-------------MDVGGGAM 249
Query: 291 ---GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI-PYSYLVPGSDGN 346
G S PG+ ++ + +P S +Y + L++I V K +K+ P ++ DG
Sbjct: 250 VLGGISSPPGMVFS--HSDPSRSP-----YYNIELKEIHVAGKPLKLNPRTF-----DGK 297
Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP-----CFDISGKK 401
G I+DSG+T+ + + A ++++ + +++ SG P CF +G+
Sbjct: 298 YGAILDSGTTYAYFPEKAYYAFKDAIMKKI------SFLKQISGPDPNFKDICFSGAGRD 351
Query: 402 SVYL----PELILKFKGGAKMALPPENYF--ALVGNEVLCLILFTDNAAGPALGRGPAII 455
L PE+ + F G K++L PENY + CL +F + G +
Sbjct: 352 VTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKN-------GNDQTTL 404
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
LG ++N + ++ N GF K C+
Sbjct: 405 LGGIIVRNTLVTYNRENSTIGFWKTNCS 432
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 134/510 (26%), Positives = 204/510 (40%), Gaps = 95/510 (18%)
Query: 1 MAACPFSLICLFSLLILLFTTDAGAGSSAATVTV-----PLTPLSTKHYLHHSDSDPLKI 55
MA F L C + F +++ A TV + P +PL + HH+ SD L
Sbjct: 1 MATKTF-LYCSLLAISFFFASNSSANRENLTVELIHRDSPHSPL---YNPHHTVSDRL-- 54
Query: 56 LHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQ 115
++ S+SR+R TKT +++ L + + G Y +S+S GTPP
Sbjct: 55 -NAAFLRSISRSRRFTTKTD----------------LQSGL-ISNGGEYFMSISIGTPP- 95
Query: 116 ASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSW 175
+ I DTGS L W C +C N P F K+SS+ + C + C
Sbjct: 96 SKVFAIADTGSDLTWVQCKPCQQCYKQN--------SPLFDKKKSSTYKTESCDSKTCQA 147
Query: 176 IFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTV-----PN 229
+ +GC C Y YG FT G + +ET+ S + P
Sbjct: 148 L-----SEHEEGCDESKDICK-----YRYSYGDNSFTKGDVATETISIDSSSGSSVSFPG 197
Query: 230 FLAGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNL 282
+ GC + + +GI G G SL SQLG KKFSYCL A +
Sbjct: 198 TVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCL---SHTAATTNGTS 254
Query: 283 VLDTGPG---SGDSKTPGLSYTPF-YKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS- 337
V++ G S SK TP K+P +Y++ L + VG K+PY+
Sbjct: 255 VINLGTNSIPSNPSKDSATLTTPLIQKDP-------ETYYFLTLEAVTVG--KTKLPYTG 305
Query: 338 --YLVPG--SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
Y + G S G +I+DSG+T T ++ ++ + R +D + L
Sbjct: 306 GGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSD--PQGLLTH 363
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
CF SG K + LP + + F A + L P N F + + +CL +
Sbjct: 364 CFK-SGDKEIGLPAITMHFT-NADVKLSPINAFVKLNEDTVCLSMIPTTEVA-------- 413
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+ +F + +DL F + C+
Sbjct: 414 -IYGNMVQMDFLVGYDLETKTVSFQRMDCS 442
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 102/413 (24%), Positives = 163/413 (39%), Gaps = 80/413 (19%)
Query: 98 VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC---VDCNFPNVDPSRIPA 154
V S G Y + G+PP+ + DTGS ++W C +C + NF R+
Sbjct: 68 VDSVGLYFTKIKLGSPPKEYHVQV-DTGSDILWINCKPCPKCPTKTNLNF------RLSL 120
Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGL 214
F SS+S+ +GC + CS+I S+ C P L C +++ + G
Sbjct: 121 FDMNASSTSKKVGCDDDFCSFI------SQSDSCQP-----ALGCSYHIVYADESTSDGK 169
Query: 215 LLSETLRFPS-----KTVP---NFLAGCSILS-------DRQPAGIAGFGRSSESLPSQL 259
+ + L KT P + GC D G+ GFG+S+ S+ SQL
Sbjct: 170 FIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQL 229
Query: 260 GL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPG---SGDSKTPGLSYTPFYKNPVGSS 311
+ FS+CL + K G G G +P + TP N +
Sbjct: 230 AATGDAKRVFSHCLDNVK--------------GGGIFAVGVVDSPKVKTTPMVPNQM--- 272
Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
Y V L + V + +P S + NGG IVDSG+T + L++++ +
Sbjct: 273 -----HYNVMLMGMDVDGTSLDLPRSIV-----RNGGTIVDSGTTLAYFPKVLYDSLIET 322
Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
+ + + + CF S P + +F+ K+ + P +Y +
Sbjct: 323 ILAR-----QPVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEE 377
Query: 432 EVLCLILFTDNAAGPALG-RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
E+ C F A G R I+LGD L N + +DL N+ G+A C+
Sbjct: 378 ELYC---FGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCS 427
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 116/453 (25%), Positives = 179/453 (39%), Gaps = 83/453 (18%)
Query: 49 DSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISL 108
D D + +H LA+ AR T P + + + PL +Y +S+
Sbjct: 94 DQDRVDSIHRLAA-----ARPSSTADDPSSASKGVSLPARRGV---PLGTANY---IVSV 142
Query: 109 SFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGC 168
GTP + +FDTGS L W C C C + D P F P +S++ + C
Sbjct: 143 GLGTPKR-DLLVVFDTGSDLSWVQCKP---CDGC-YQQHD----PLFDPSQSTTYSAVPC 193
Query: 169 QNPKCSWIFGPNVES-RCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF---- 222
+C + + S +C+ Y + YG + T G L +TL
Sbjct: 194 GAQECRRLDSGSCSSGKCR---------------YEVVYGDMSQTDGNLARDTLTLGPSS 238
Query: 223 ---PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKF 273
S + F+ GC + G+ G GR SL SQ K FSYCL S
Sbjct: 239 SSSSSDQLQEFVFGCGDDDTGLFGKADGLFGLGRDRVSLASQAAAKYGAGFSYCLPSSST 298
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
+ +S G + P +T + S FYY+ L I V + V+
Sbjct: 299 AEGYLS----------LGSAAPPNARFTAMV-----TRSDTPSFYYLNLVGIKVAGRTVR 343
Query: 334 I-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS--RAADVEKKSG 390
+ P + PG+ ++DSG+ T + + A+ F M YS RA + S
Sbjct: 344 VSPAVFRTPGT------VIDSGTVITRLPSRAYAALRSSFAGLMRRYSYKRAPAL---SI 394
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
L C+D +G+ V +P + L F GGA + L + CL F N ++
Sbjct: 395 LDTCYDFTGRNKVQIPSVALLFDGGATLNLGFGEVLYVANKSQACLA-FASNGDDTSIA- 452
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
ILG+ Q + F + +D+AN + GF + C+
Sbjct: 453 ----ILGNMQQKTFAVVYDVANQKIGFGAKGCS 481
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 115/387 (29%), Positives = 152/387 (39%), Gaps = 61/387 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y +++S GTP A T DTGS + W +C C P R P F P RSSS
Sbjct: 131 YVVTVSLGTPAVAQT-LEVDTGSDVSWV------QCKPCPSPPCYSQRDPLFDPTRSSSY 183
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF 222
+ C CS + GCS C Y++ YG G T G+ S+TL
Sbjct: 184 SAVPCAAASCS-----QLALYSNGCS--GGQC-----GYVVSYGDGSTTTGVYSSDTLTL 231
Query: 223 P-SKTVPNFLAGCSILSDRQPAGIA---GFGRSSESLPSQLGLK---KFSYCLLSRKFDD 275
S + FL GC AG+ G GR +SL SQ FSYCL
Sbjct: 232 TGSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCL------- 284
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
P + N V G G S T G S TP ++S +Y V L I VG + + I
Sbjct: 285 -PPTQNSVGYISLG-GPSSTAGFSTTPLL-----TASNDPTYYIVMLAGISVGGQPLSID 337
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
S G+ +VD+G+ T + + A+ F M Y + L C+
Sbjct: 338 ASVFASGA------VVDTGTVVTRLPPTAYSALRSAFRAAMAPYGYPS-APATGILDTCY 390
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
D + +V LP + + F GGA M L CL A P G A I
Sbjct: 391 DFTRYGTVTLPTISIAFGGGAAMDLGTSGIL-----TSGCL------AFAPTGGDSQASI 439
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
LG+ Q ++F + FD GF C
Sbjct: 440 LGNVQQRSFEVRFD--GSTVGFMPASC 464
>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 451
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 107/391 (27%), Positives = 162/391 (41%), Gaps = 56/391 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G+P Q + DT + W PCT C C+ S + P+ S+
Sbjct: 106 GSYVVRVKLGSPNQLFF-MVLDTSTDEAWVPCTG---CTGCS------SSSTYYSPQAST 155
Query: 162 S-SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETL 220
+ + C P+C+ + +G P T AC G F+A L+ ++L
Sbjct: 156 TYGGAVACYAPRCA---------QARGALPCPYTGSKACTFNQSYAGSTFSA-TLVQDSL 205
Query: 221 RFPSKTVPNFLAGCS------ILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
R T+P++ GC L + G+ S S S+L FSYCL S F
Sbjct: 206 RLGIDTLPSYAFGCVNSASGWTLPAQGLLGLGRGPLSLPSQSSKLYSGIFSYCLPS--FQ 263
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
+ S +L L GP + + TP +NP S YYV L + VG V +
Sbjct: 264 SSYFSGSLKL--GPTGQPRR---IRTTPLLQNPRRPS-----LYYVNLTGVTVGRVKVPL 313
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN--YSRAADVEKKSGLR 392
P YL + G I+DSG+ T GP++ A+ EF Q+ +SR G
Sbjct: 314 PIEYLAFDPNKGSGTILDSGTVITRFVGPVYSAIRDEFRNQVKGPFFSRG-------GFD 366
Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG-NEVLCLILFTDNAAGPALGRG 451
CF + + P + L+F G + LP EN + CL + AA P
Sbjct: 367 TCFVKTYEN--LTPLIKLRFT-GLDVTLPYENTLIHTAYGGMACLAM----AAAPNNVNS 419
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++ ++Q QN + FD N+R G A++ C
Sbjct: 420 VLNVIANYQQQNLRVLFDTVNNRVGIARELC 450
>gi|326490597|dbj|BAJ89966.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 450
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 115/413 (27%), Positives = 175/413 (42%), Gaps = 74/413 (17%)
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA-FIPKRSSSS 163
++S+ GTPPQ T + DTGS L + CN ++ P PA F S +
Sbjct: 66 TVSVVVGTPPQNVT-MVLDTGSEL---------SGLLCNGSSLSP---PAPFNASASLTY 112
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRF 222
+ C +P C W G ++ R +P + +C ++ + Y +A G L+++T
Sbjct: 113 SAVDCSSPACVW-RGRDLPVRPFCDAPPSTSCRVS-----ISYADASSADGHLVADTFIL 166
Query: 223 PSKTVPNFLAGC-----------SILSDRQPA--GIAGFGRSSESLPSQLGLKKFSYCLL 269
++ VP GC S +D A G+ G R S S +Q +F+YC+
Sbjct: 167 GTQAVPALF-GCITSYSSSTAINSSATDPSEAATGLLGMNRGSLSFVTQTATLRFAYCI- 224
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYK--NPVGSSSAFGEFYYVGLRQIIV 327
AP +L G P L+YTP + P+ Y V L I V
Sbjct: 225 ------APGQGPGILLLG--GDGGAAPPLNYTPLIEISQPLPYFDRVA--YSVQLEGIRV 274
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
GS ++IP S L P G G +VDSG+ FTF+ + A+ EF+ Q S A + +
Sbjct: 275 GSALLQIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFLNQA--RSLLAPLGE 332
Query: 388 -----KSGLRPCF----DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE------ 432
+ CF + S LPE+ L + GA++A+ E V E
Sbjct: 333 PGFVFQGAFDACFRGPEERVSAASRLLPEVGLVLR-GAEVAVAGEKLLYSVPGERRGEEG 391
Query: 433 ---VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
V CL + AG + A ++G Q+ ++E+DL N R GFA +C
Sbjct: 392 AEAVWCLTFGNSDMAGMS-----AYVIGHHHQQDVWVEYDLQNGRVGFAPARC 439
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 105/397 (26%), Positives = 162/397 (40%), Gaps = 62/397 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + +S GTPP S + DTGS ++W C C N P DPS +S+
Sbjct: 81 GEYLVEISVGTPP-FSIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPS--------KST 131
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ + + C +P CS+ CS ++ Y + YG + G L +T+
Sbjct: 132 TYKNVACSSPVCSY------SGDGSSCSDDSECL------YSIAYGDDSHSQGNLAVDTV 179
Query: 221 RFPSKT-----VPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCL 268
S + P + GC + + +GI G GR SL +QLG KFSYCL
Sbjct: 180 TMQSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCL 239
Query: 269 LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
+ + + L+ G + S + G TP Y SS+ + FY + L + VG
Sbjct: 240 I--PIGTGSTNDSTKLNFGSNANVSGS-GTVSTPIY-----SSAQYKTFYSLKLEAVSVG 291
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
P G + N +I+DSG+T T++ L + I Q + A D +
Sbjct: 292 DTKFNFPEGASKLGGESN--IIIDSGTTLTYLPSALLNSFGSA-ISQSMSLPHAQDPSEF 348
Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
L CF + +P + + F+ GA + L EN F + ++ +CL A
Sbjct: 349 --LDYCFATT-TDDYEMPPVTMHFE-GADVPLQRENLFVRLSDDTICL----------AF 394
Query: 449 GRGP---AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G P I G+ NF + +D+ N F C
Sbjct: 395 GSFPDDNIFIYGNIAQSNFLVGYDIKNLAVSFQPAHC 431
>gi|225432542|ref|XP_002277699.1| PREDICTED: basic 7S globulin-like [Vitis vinifera]
Length = 435
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 109/401 (27%), Positives = 170/401 (42%), Gaps = 78/401 (19%)
Query: 116 ASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSW 175
S P D G +W VDC+ V S + P R S+Q ++ C
Sbjct: 56 VSIPLTLDLGGQFLW---------VDCDQGYVSSS----YRPVRCGSAQCSLTRSKACGE 102
Query: 176 IF-GPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPN----- 229
F GP KGC+ TC L+ + + T+G + + + S N
Sbjct: 103 CFSGP-----VKGCN--YSTCVLSPDNTVTGTA---TSGEVGEDAVSIQSTDGSNPGRVV 152
Query: 230 ------FLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL-----KKFSYCLLSRK--- 272
F G + L + + G+AG GRS +LPSQ +KFS CL S
Sbjct: 153 SVRRLLFTCGSTFLLEGLASRVKGMAGLGRSRVALPSQFSSAFSFNRKFSICLSSSTKST 212
Query: 273 ----FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF--GEF---YYVGLR 323
F D P +D + L+YTP NPV ++SA+ GE Y++G++
Sbjct: 213 GVVFFGDGPYVLLPKVD--------ASQSLTYTPLITNPVSTASAYFQGEASVEYFIGVK 264
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
I + K V + + L S G GG + + +T +E +++AV + F++++ +R A
Sbjct: 265 SIKINGKAVPLNATLLSIDSQGYGGTKISTVHPYTVLETSIYKAVTQAFLKELSTITRVA 324
Query: 384 DVEKKSGLRPCF---DISGKK---SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
V S CF DI + +V +L+L+ + + N V + VLCL
Sbjct: 325 SV---SPFGACFSSKDIGSTRVGPAVPPIDLVLQ-RQSVYWRVFGANSMVQVSDNVLCL- 379
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
F D P +I++G QL++ L+FDLA R GF+
Sbjct: 380 GFVDGGVNPR----TSIVIGGRQLEDNLLQFDLATSRLGFS 416
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 119/406 (29%), Positives = 181/406 (44%), Gaps = 82/406 (20%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +++S GTPP I DTGS L+W C C DC + VD P F PK SS
Sbjct: 88 GEYLMNVSIGTPPFPIMA-IADTGSDLLWTQCAP---CDDC-YTQVD----PLFDPKTSS 138
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ + + C + +C+ + E++ CS + TC SY L YG +T G + +TL
Sbjct: 139 TYKDVSCSSSQCTAL-----ENQA-SCSTNDNTC-----SYSLSYGDNSYTKGNIAVDTL 187
Query: 221 RF-PSKTVP----NFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCL 268
S T P N + GC + +++ +GI G G SL QLG KFSYCL
Sbjct: 188 TLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCL 247
Query: 269 L---SRKFDDAPVS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
+ S+K + ++ +N ++ GSG TP + + ++ FYY+ L+
Sbjct: 248 VPLTSKKDQTSKINFGTNAIV---SGSGVVSTPLI-----------AKASQETFYYLTLK 293
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRA 382
I VGSK ++ S G +I+DSG+T T + EF ++ + + +
Sbjct: 294 SISVGSKQIQYSGSDSESSE---GNIIIDSGTTLTLL--------PTEFYSELEDAVASS 342
Query: 383 ADVEKK----SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
D EKK SGL C+ +G V P + + F GA + L N F V +++C
Sbjct: 343 IDAEKKQDPQSGLSLCYSATGDLKV--PVITMHFD-GADVKLDSSNAFVQVSEDLVCF-- 397
Query: 439 FTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A P+ I G+ NF + +D + F CA
Sbjct: 398 --------AFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 118/465 (25%), Positives = 185/465 (39%), Gaps = 88/465 (18%)
Query: 57 HSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQA 116
H L ++ R+R+ ++ S + +TP+ + + G Y + L GTPP
Sbjct: 45 HELLRRAIQRSRYRLAGIGMARGEA--ASARKAVVAETPI-MPAGGEYLVKLGIGTPPYK 101
Query: 117 STPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
T I DT S L+W PCT Y VD P F P+ SS+ + C + C
Sbjct: 102 FTAAI-DTASDLIWTQCQPCTGCYHQVD-----------PMFNPRVSSTYAALPCSSDTC 149
Query: 174 SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGLLLSETLRFPSKTVPNFLA 232
+ +V RC +++C Y Y G T G L + L
Sbjct: 150 DEL---DVH-RCG--HDDDESC-----QYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAF 198
Query: 233 GCSILSDR-----QPAGIAGFGRSSESLPSQLGLKKFSYCL---LSRKFDDAPVSSNLVL 284
GCS S Q +G+ G GR SL SQL +++F+YCL SR + LVL
Sbjct: 199 GCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYCLPPPASR------IPGKLVL 252
Query: 285 DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI---------- 334
+ + T ++ P ++P + +YY+ L +++G + + +
Sbjct: 253 GADADAARNATNRIA-VPMRRDP-----RYPSYYYLNLDGLLIGDRAMSLPPTTTTTATA 306
Query: 335 ------------PYSYLVPGSDGNG-GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
P + V D N G+I+D ST TF+E L++ + + ++ R
Sbjct: 307 TATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEI-RLPR 365
Query: 382 AADVEKKSGLRPCF---DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLI 437
GL CF D VY+P + L F G + L FA ++CL+
Sbjct: 366 GTG--SSLGLDLCFILPDGVAFDRVYVPAVALAFD-GRWLRLDKARLFAEDRESGMMCLM 422
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ A G ILG+FQ QN + ++L R F + C
Sbjct: 423 VGRAEA-------GSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 118/465 (25%), Positives = 185/465 (39%), Gaps = 88/465 (18%)
Query: 57 HSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQA 116
H L ++ R+R+ ++ S + +TP+ + + G Y + L GTPP
Sbjct: 45 HELLRRAIQRSRYRLAGIGMARGEA--ASARKAVVAETPI-MPAGGEYLVKLGIGTPPYK 101
Query: 117 STPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
T I DT S L+W PCT Y VD P F P+ SS+ + C + C
Sbjct: 102 FTAAI-DTASDLIWTQCQPCTGCYHQVD-----------PMFNPRVSSTYAALPCSSDTC 149
Query: 174 SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGLLLSETLRFPSKTVPNFLA 232
+ +V RC +++C Y Y G T G L + L
Sbjct: 150 DEL---DVH-RCG--HDDDESC-----QYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAF 198
Query: 233 GCSILSDR-----QPAGIAGFGRSSESLPSQLGLKKFSYCL---LSRKFDDAPVSSNLVL 284
GCS S Q +G+ G GR SL SQL +++F+YCL SR + LVL
Sbjct: 199 GCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYCLPPPASR------IPGKLVL 252
Query: 285 DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI---------- 334
+ + T ++ P ++P + +YY+ L +++G + + +
Sbjct: 253 GADADAARNATNRIA-VPMRRDP-----RYPSYYYLNLDGLLIGDRTMSLPPTTTTTATA 306
Query: 335 ------------PYSYLVPGSDGNG-GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
P + V D N G+I+D ST TF+E L++ + + ++ R
Sbjct: 307 TATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEI-RLPR 365
Query: 382 AADVEKKSGLRPCF---DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLI 437
GL CF D VY+P + L F G + L FA ++CL+
Sbjct: 366 GTG--SSLGLDLCFILPDGVAFDRVYVPAVALAFD-GRWLRLDKARLFAEDRESGMMCLM 422
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ A G ILG+FQ QN + ++L R F + C
Sbjct: 423 VGRAEA-------GSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 140/513 (27%), Positives = 208/513 (40%), Gaps = 101/513 (19%)
Query: 1 MAACPFSLICLFSLLILL-----FTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKI 55
MAA + + LF + + L T GS A++ +P+S L++ +
Sbjct: 1 MAAFSITHLSLFVIFVALISKTSLTASMNNGSFTASLIHRDSPISP---LYNPKNTYFDR 57
Query: 56 LHSLASSSLSRARHL--KTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTP 113
L S S+SRA + + KT + +I + G Y + +S GTP
Sbjct: 58 LQSSFHRSISRANRFTPNSVSAAKTLEYDI--------------IPGGGEYFMRISIGTP 103
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
P I DTGS L+W C C +C + P F PK+SS+ + + C+ C
Sbjct: 104 P-IEVLVIADTGSDLIWVQCQP---CQECY-----KQKSPIFNPKQSSTYRRVLCETRYC 154
Query: 174 SWIFGPNVESRCKGCSPRN--KTCPLACPSYLLQYG-LGFTAGLLLSETLRFPS--KTVP 228
+ + S + CS K C Y YG FT G L +E S ++
Sbjct: 155 NAL-----NSDMRACSAHGFFKAC-----GYSYSYGDHSFTMGYLATERFIIGSTNNSIQ 204
Query: 229 NFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPV--S 279
GC + D +GI G G S SL SQLG K KFSYCL+ P+
Sbjct: 205 ELAFGCGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLV-------PILEK 257
Query: 280 SNLVLDT---GPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
SN L G S S + TP K P FYY+ L I VG++ +
Sbjct: 258 SNFSLGKIVFGDNSFISGSDTYVSTPLVSKEP-------ETFYYLTLEAISVGNERL--- 307
Query: 336 YSYLVPGSDGN---GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
+Y +DGN G +I+DSG+T TF++ L+ + E + + +A + E+ S
Sbjct: 308 -AYENSRNDGNVEKGNIIIDSGTTLTFLDSKLYNKL--ELVLE-----KAVEGERVSDPN 359
Query: 393 PCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGR 450
F I K + LP + + F A + L P N FA ++LC + N
Sbjct: 360 GIFSICFRDKIGIELPIITVHFT-DADVELKPINTFAKAEEDLLCFTMIPSNGIA----- 413
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+ NF + +DL + F C+
Sbjct: 414 ----IFGNLAQMNFLVGYDLDKNCVSFMPTDCS 442
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 119/406 (29%), Positives = 181/406 (44%), Gaps = 82/406 (20%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +++S GTPP I DTGS L+W C C DC + VD P F PK SS
Sbjct: 88 GEYLMNVSIGTPPFPIMA-IADTGSDLLWTQCAP---CDDC-YTQVD----PLFDPKTSS 138
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ + + C + +C+ + E++ CS + TC SY L YG +T G + +TL
Sbjct: 139 TYKDVSCSSSQCTAL-----ENQA-SCSTNDNTC-----SYSLSYGDNSYTKGNIAVDTL 187
Query: 221 RF-PSKTVP----NFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCL 268
S T P N + GC + +++ +GI G G SL QLG KFSYCL
Sbjct: 188 TLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCL 247
Query: 269 L---SRKFDDAPVS--SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
+ S+K + ++ +N ++ GSG TP + + ++ FYY+ L+
Sbjct: 248 VPLTSKKDQTSKINFGTNAIV---SGSGVVSTPLI-----------AKASQETFYYLTLK 293
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN-YSRA 382
I VGSK ++ S G +I+DSG+T T + EF ++ + + +
Sbjct: 294 SISVGSKQIQYSGSDSESSE---GNIIIDSGTTLTLL--------PTEFYSELEDAVASS 342
Query: 383 ADVEKK----SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
D EKK SGL C+ +G V P + + F GA + L N F V +++C
Sbjct: 343 IDAEKKQDPQSGLSLCYSATGDLKV--PVITMHFD-GADVKLDSSNAFVQVSEDLVCF-- 397
Query: 439 FTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A P+ I G+ NF + +D + F CA
Sbjct: 398 --------AFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 112/440 (25%), Positives = 172/440 (39%), Gaps = 81/440 (18%)
Query: 59 LASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQAST 118
+ S+ SR + + T P + + S ++L G Y + G+P Q
Sbjct: 78 VVSNYDSRRKGFEMTTTPAEVEMPMHSGRDDAL----------GEYFAEVKVGSPGQ-RF 126
Query: 119 PFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFG 178
+ DTGS W C+ + V C R L S +F
Sbjct: 127 WLVVDTGSEFTWLNCSKSFEAVTC--------------ASRKCKVDL--------SELFS 164
Query: 179 PNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRF-----PSKTVPNFLA 232
+V C + C Y + Y G +A G ++++ + N
Sbjct: 165 LSV------CPKPSDPCL-----YDISYADGSSAKGFFGTDSITVGLTNGKQGKLNNLTI 213
Query: 233 GC--SILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLV 283
GC S+L+ + + GI G G + +S + K KFSYCL+ VSSNL
Sbjct: 214 GCTKSMLNGVNFNEETGGILGLGFAKDSFIDKAANKYGAKFSYCLVDH-LSHRSVSSNLT 272
Query: 284 LDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPG 342
+ G ++K G + T P FY V + I +G + +KIP V
Sbjct: 273 IG---GHHNAKLLGEIRRTELILFP--------PFYGVNVVGISIGGQMLKIPPQ--VWD 319
Query: 343 SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS 402
+ GG ++DSG+T T + P +EAV + + + R E L CFD G
Sbjct: 320 FNAEGGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTG-EDFDALEFCFDAEGFDD 378
Query: 403 VYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQ 462
+P L+ F GGA+ P ++Y V V C+ + P G G A ++G+ Q
Sbjct: 379 SVVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGIV------PIDGIGGASVIGNIMQQ 432
Query: 463 NFYLEFDLANDRFGFAKQKC 482
N EFDL+ + GFA C
Sbjct: 433 NHLWEFDLSTNTVGFAPSTC 452
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 108/403 (26%), Positives = 165/403 (40%), Gaps = 66/403 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +S+ GTP + T +FDTGS L W +C C+ + P F P SS
Sbjct: 152 GNYVVSVGLGTPARDLT-VVFDTGSDLSWV------QCGPCSSGGCYKQQDPLFAPSDSS 204
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ + C +C C G SP + CP Y + YG T G L ++TL
Sbjct: 205 TFSAVRCGAREC------RARQSCGG-SPGDDRCP-----YEVVYGDKSRTQGHLGNDTL 252
Query: 221 RFPS-----------KTVPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGLK---K 263
+ +P F+ GC + Q G+ G GR SL SQ K
Sbjct: 253 TLGTMAPANASAENDNKLPGFVFGCGENNTGLFGQADGLFGLGRGKVSLSSQAAGKFGEG 312
Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
FSYCL AP +L + P ++ F P+ + + FYYV L
Sbjct: 313 FSYCL-PSSSSSAPGYLSL---------GTPVPAPAHAQF--TPMLNRTTTPSFYYVKLV 360
Query: 324 QIIVGSKHVKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
I V + +++ + +P +IVDSG+ T + + A+ F+ MG Y
Sbjct: 361 GIRVAGRAIRVSSPRVALP-------LIVDSGTVITRLAPRAYRALRAAFLSAMGKYGYK 413
Query: 383 ADVEKKSGLRPCFDISG--KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
+ S L C+D + +V +P + L F GGA +++ + CL F
Sbjct: 414 -RAPRLSILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQACLA-FA 471
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N G + G ILG+ Q + + +D+A + GFA + C+
Sbjct: 472 PNGDGRSAG-----ILGNTQQRTLAVVYDVARQKIGFAAKGCS 509
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 107/388 (27%), Positives = 150/388 (38%), Gaps = 63/388 (16%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y ++ S GTP A T DTGS L W +C C P+ + P F P +SSS
Sbjct: 137 YVVTASLGTPGMAQT-LEVDTGSDLSWV------QCKPCAAPSCYRQKDPLFDPAQSSSY 189
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
+ C S C G C A Y++ YG G T G+ S+TL
Sbjct: 190 AAVPCG------------RSACAGLGIYASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL 237
Query: 223 PSK-TVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
+ TV FL GC G+ GFGR SL Q FSYCL ++
Sbjct: 238 AANATVQGFLFGCGHAQSGGLFTGIDGLLGFGREQPSLVQQTAGAYGGVFSYCLPTKS-- 295
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
S+ L G SG PG S T +P + +Y V L I VG + + +
Sbjct: 296 ----STTGYLTLGGPSG--VAPGFSTTQLLPSPNAPT-----YYVVMLTGISVGGQPLSV 344
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
P S G+ +VD+G+ T + + A+ F M +Y A + L C
Sbjct: 345 PASAFAAGT------VVDTGTVITRLPPAAYAALRSAFRSGMASYPSAPPIGI---LDTC 395
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
+ +G +V L + L F GA M L + + CL + + G
Sbjct: 396 YSFAGYGTVNLTSVALTFSSGATMTLGADGIMSFG-----CLAFASSGS------DGSMA 444
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
ILG+ Q ++F + D GF C
Sbjct: 445 ILGNVQQRSFEVRID--GSSVGFRPSSC 470
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 124/420 (29%), Positives = 170/420 (40%), Gaps = 79/420 (18%)
Query: 89 NSLIKTPLS-VHSY-GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
NSL TP S V SY G Y +S S GTPP S I DTGS +VW C +C +
Sbjct: 70 NSLASTPESTVISYEGDYIMSYSVGTPPIKSYG-IVDTGSDIVWLQCEPCEQCYN----- 123
Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
P F P +SSS + I C + C + R C+ + K C Y + Y
Sbjct: 124 ---QTTPKFNPSKSSSYKNISCSSKLCQSV-------RDTSCNDK-KNCE-----YSINY 167
Query: 207 G-LGFTAGLLLSETLRFPSKT-----VPNFLAGCSILSDRQPAGIAGFGRSS-------- 252
G + G L ETL S T P + GC I F R S
Sbjct: 168 GNQSHSQGDLSLETLTLESTTGRPVSFPKTVIGCG------TNNIGSFKRVSSGVVGLGG 221
Query: 253 --ESLPSQLGLK---KFSYCL--LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYK 305
SL +QLG KFSYCL +S + + S+ L+ G + S LS TP K
Sbjct: 222 GPASLITQLGPSIGGKFSYCLVRMSITLKNMSMGSSK-LNFGDVAIVSGHNVLS-TPIVK 279
Query: 306 NPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG--NGGVIVDSGSTFTFMEGP 363
+F FYY+ + VG K V+ S S G G +I+DS + TF+
Sbjct: 280 K----DHSF--FYYLTIEAFSVGDKRVEFAGS-----SKGVEEGNIIIDSSTIVTFVPSD 328
Query: 364 LFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPE 423
++ + + + R D ++ L C+++S + P + FK GA + L
Sbjct: 329 VYTKLNSAIV-DLVTLERVDDPNQQFSL--CYNVSSDEEYDFPYMTAHFK-GADILLYAT 384
Query: 424 NYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N F V +VLC A P+ G I G F Q+F + +DL F C
Sbjct: 385 NTFVEVARDVLCF------AFAPSNG---GAIFGSFSQQDFMVGYDLQQKTVSFKSVDCT 435
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 108/406 (26%), Positives = 166/406 (40%), Gaps = 72/406 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPC------TSRYRCVDCNFPNVDPSRIPAFIP 157
Y + G P Q I DTGS ++WF C +S+ + C+ + I + P
Sbjct: 88 YYAQIGVGHPVQFLNA-IVDTGSDILWFKCKLCQGCSSKKNVIVCS-SIIMQGPITLYDP 145
Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGLLL 216
+ S ++ C +P CS C N +C +Y + Y + G+
Sbjct: 146 ELSITASPATCSDPLCS---------EGGSCRGNNNSC-----AYDISYEDTSSSTGIYF 191
Query: 217 SETLRFPSKTVPN---FLAGCSILSDRQPA-GIAGFGRSSESLPSQLGLKKFSYCLLSRK 272
+ + K N FL + +S P GI GFGRS S+P+QL + SY +
Sbjct: 192 RDVVHLGHKASLNTTMFLGCATSISGLWPVDGIMGFGRSKVSVPNQLAAQAGSYNIFYHC 251
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
+++ G D + P + YTP N + Y V L + V SK +
Sbjct: 252 LSGEKEGGGILV---LGKND-EFPEMVYTPMLANDI--------VYNVKLVSLSVNSKAL 299
Query: 333 KIPYS-YLVPGSDGNGGVIVDSG-STFTFMEGPLFEAVAKEFIRQMGNYSRAA-DVEKKS 389
I S + + GNGG I+DSG S+ TF L F++ + ++ A +S
Sbjct: 300 PIEASEFEYNATVGNGGTIIDSGTSSATFPSKAL-----ALFVKAVSKFTTAIPTAPLES 354
Query: 390 GLRPCF-DISGKKSVYL--PELILKFKGGAKMALPPENYFALV------------GNEVL 434
PCF IS + SV + P + LKF GGA M L NY V G ++
Sbjct: 355 SGSPCFISISDRNSVEVDFPNVTLKFDGGATMELTAHNYLEAVVSRKLSESTHFQGVRLV 414
Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQ 480
C+ + G + ILGD L++ + +D+ R G+ KQ
Sbjct: 415 CI----------SWSVGNSTILGDAILKDKVVVYDMEKSRIGWVKQ 450
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 109/407 (26%), Positives = 165/407 (40%), Gaps = 69/407 (16%)
Query: 101 YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
+G Y S+ G+P Q + I DTGS L W C C P+VD + RS
Sbjct: 97 FGEYYTSIKLGSPGQEAI-LIVDTGSELTWLKCLPCKVCA----PSVDT----IYDAARS 147
Query: 161 SSSQLIGCQNPK-CSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
S + + C N + CS N R C A YG G F+ G L ++
Sbjct: 148 VSYKPVTCNNSQLCS-----NSSQGTYAYCARGSQCQFAA-----FYGDGSFSYGSLSTD 197
Query: 219 TLRFPSK------TVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLK---KFS 265
TL + TV +F GC+ L +GI G +LP QLG + KFS
Sbjct: 198 TLIMETVVGGKPVTVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFS 257
Query: 266 YCLLSRKFDDAPVSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
+C P S+ + TG G+ + + YT + +S +FY+V L
Sbjct: 258 HCF--------PDRSSHLNSTGVVFFGNAELPHEQVQYTSV---ALTNSELQRKFYHVAL 306
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
+ + + S + + L GS VI+DSGS+F+ P + + F++ +
Sbjct: 307 KGVSINSHELVL----LPRGSV----VILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKH 358
Query: 383 ADVEKKSGLRPCFDISGKK----SVYLPELILKFKGGAKMALPPENYF---ALVGNEVLC 435
+ + L CF +S LP L L F+ G + +P A N V
Sbjct: 359 LEGDSFGDLGTCFKVSNDDIDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARYQNHVKM 418
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
F D G P ++G++Q QN ++E+D+ R GFA+ C
Sbjct: 419 CFAFEDG------GPNPVNVIGNYQQQNLWVEYDIQRSRVGFARASC 459
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 135/500 (27%), Positives = 200/500 (40%), Gaps = 85/500 (17%)
Query: 8 LICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSR- 66
L+C+F L AG GS VTVP + + P + ++ L R
Sbjct: 6 LLCIFLCFYLSIVNGAGNGS---FVTVPSSSFVPDTVCSGALVKPEQNGSAVYVPLLHRH 62
Query: 67 ---ARHLKTKTKPKTKDSNIGSNYSNSLIKT--PLSVHSYGGYSI-------SLSFGTP- 113
A L T T P + S+ S I + +SV ++ G S+ ++SFGTP
Sbjct: 63 GPCAPSLSTDTPPSMSEMFRRSHARLSYIVSGKKVSVPAHLGTSVKSLEYVATVSFGTPA 122
Query: 114 -PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
PQ + DTGS L W +C C+ P + P F P SS+ + C + +
Sbjct: 123 VPQV---VVIDTGSDLTWL------QCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCASGE 173
Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF-PSKTVPNF 230
C + S C P C A + Y G T G+ + L P V +F
Sbjct: 174 CKKLAADAYGSGCSNGQP----CGFA-----ISYVDGTSTVGVYGKDKLTLAPGAIVKDF 224
Query: 231 LAGCSILSDRQPAGIAGFGRS---SESLPSQLGLKK-FSYCLLSRKFDDAPVSSNLVLDT 286
GC P G SESL +Q G FSYCL + +++
Sbjct: 225 YFGCGHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPA-------------VNS 271
Query: 287 GPGS---GDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPG 342
PG G + P G +TP + P + F V L I VG K + + S
Sbjct: 272 KPGFLAFGAGRNPSGFVFTPMGRVPGQPT-----FSTVTLAGITVGGKKLDLRPSAF--- 323
Query: 343 SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS 402
+GG+IVDSG+ T ++ ++ A+ F M Y L C+D++G K+
Sbjct: 324 ---SGGMIVDSGTVVTVLQSTVYRALRAAFREAMKAYRLV-----HGDLDTCYDLTGYKN 375
Query: 403 VYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQ 462
V +P++ L F GGA + L N + G CL F + G A +LG+ +
Sbjct: 376 VVVPKIALTFSGGATINLDVPNGILVNG----CLA-FAETGK-----DGTAGVLGNVNQR 425
Query: 463 NFYLEFDLANDRFGFAKQKC 482
F + FD + +FGF + C
Sbjct: 426 TFEVLFDTSASKFGFRAKAC 445
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 116/406 (28%), Positives = 156/406 (38%), Gaps = 75/406 (18%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + +S G P Q I DTGS L+W C C N P F P+RSS
Sbjct: 91 GEYLMRISIGNP-QVEILAIADTGSDLIWVQCQPCEMCYKQN--------SPIFDPRRSS 141
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRN--KTCPLACPSYLLQYG-LGFTAGLLLSE 218
S + + C N C+ + G + C R KTC Y YG F+ G L E
Sbjct: 142 SYRNVLCGNEFCNKLDG-----EARSCDARGFVKTC-----GYTYSYGDQSFSDGHLAIE 191
Query: 219 TLRFPSKTVPNFLA---------GCSILS----DRQPAGIAGFGRSSESLPSQLGLK--- 262
S A GC + D +GI G G S SL SQLG K
Sbjct: 192 RFGIGSTNSNTSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGPKLSG 251
Query: 263 KFSYCLLSRKFDDAPVSS----NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
KFSYCL+ S N + +G TP L P +Y
Sbjct: 252 KFSYCLVPTSEQSNYTSKINFGNDINISGSNYNVVSTPLLPKKP------------ETYY 299
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
Y+ L I V +K ++PY+ L G G +I+DSG+T TF++ F +
Sbjct: 300 YLTLEAISVENK--RLPYTNLWNGEVEKGNIIIDSGTTLTFLDSEFFNNLDSAV------ 351
Query: 379 YSRAADVEKKSGLRPCFDI--SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL 436
A E+ S F+I +K++ LP + F GA + L P N FA V ++LC
Sbjct: 352 -EEAVKGERVSDPHGLFNICFKDEKAIELPIITAHFT-GADVELQPVNTFAKVEEDLLCF 409
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ N I G+ NF + +DL F C
Sbjct: 410 TMIPSNDIA---------IFGNLAQMNFLVGYDLEKKAVSFLPTDC 446
>gi|302141829|emb|CBI19032.3| unnamed protein product [Vitis vinifera]
Length = 382
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 77/229 (33%), Positives = 111/229 (48%), Gaps = 25/229 (10%)
Query: 257 SQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFG 315
SQLG +KFSYCL S + SS L G + + PG + TP +NP S
Sbjct: 173 SQLGTQKFSYCLTS--IHENKTSSLLF---GSLAYSNFNPGKIPRTPLIQNPFLPS---- 223
Query: 316 EFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
+YY+ L+ I VG + IP G DG+GG+I+DSG+T T+++ F+ + FI Q
Sbjct: 224 -YYYLALKGITVGYTLLPIPEFAFQLGKDGSGGMILDSGTTITYLQEDAFDVLKNAFISQ 282
Query: 376 MGNYSRAADVEKKSGLRPCFDISGKKS--VYLPELILKFKGGAKMALPPENYFALVGNEV 433
+ A+ +GL CF + K + V +P+LI FK G +ALP ENY +V +
Sbjct: 283 --TELQVAN-SSTTGLDLCFHLPVKNAAEVKVPKLIFHFK-GLDLALPVENY--MVSDPE 336
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ LI +A G I G+ Q QN + DL +C
Sbjct: 337 MGLICLAIDAT------GSLSIFGNIQQQNMLVLHDLKKSTLSLVPTQC 379
>gi|255552241|ref|XP_002517165.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543800|gb|EEF45328.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 434
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 107/416 (25%), Positives = 171/416 (41%), Gaps = 85/416 (20%)
Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
+L + T TP + D G +W C Y SS
Sbjct: 41 TLQYLTSINQRTPLVPVKLTLDLGGQYLWVDCDQGYV---------------------SS 79
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPR----NKTCPLACPSYLLQYGLGFTAGLLLS 217
S + + C++ +CS + S C SPR N TC L + + G T+G +
Sbjct: 80 SYKPVRCRSAQCSLAKSKSCISECFS-SPRPGCNNDTCALLPDNTVTHSG---TSGEVGQ 135
Query: 218 ETLRFPSK---------TVPNFLAGC--SILSDRQPAGI---AGFGRSSESLPSQLGL-- 261
+ + S +VP + C + L + +G+ AG GR+ SLPSQ
Sbjct: 136 DVVTVQSTDGFSPGRVVSVPKLIFTCATTFLLEGLASGVKGMAGLGRTKISLPSQFSAAF 195
Query: 262 ---KKFSYCLLSRK------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSS 312
+KF+ CL S F D P +D + L YTP NPV ++S
Sbjct: 196 SFDRKFAICLTSSNAKGIVFFGDGPYVFLPNIDV--------SKSLIYTPLILNPVSTAS 247
Query: 313 AF-----GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEA 367
AF Y++G++ I + K V + S L +G GG + + +T +E +++A
Sbjct: 248 AFFKGDPSSEYFIGVKSIKINGKAVPLNTSLLFIDKEGVGGTKISTVDPYTVLETTIYQA 307
Query: 368 VAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY----LPELILKFKGGAKM-ALPP 422
V K FI+++ R A V S CF+ S S +P++ L + + +
Sbjct: 308 VTKVFIKELAEVPRVAPV---SPFGVCFNSSNIGSTRVGPAVPQIDLVLQSSSVFWRIFG 364
Query: 423 ENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
N V ++VLCL F D P +I++G Q+++ L+FDLA + GF+
Sbjct: 365 ANSMVQVKSDVLCL-GFVDGGLNPR----TSIVIGGHQIEDNLLQFDLAASKLGFS 415
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 122/403 (30%), Positives = 156/403 (38%), Gaps = 68/403 (16%)
Query: 102 GGYSISLSFGTPPQASTP--FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
G Y + GTP TP + DTGS +VW C RC D + D P+
Sbjct: 145 GEYFTKIGVGTP---VTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFD--------PRA 193
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
S S + C P C + GC R K C Y + YG G TAG +E
Sbjct: 194 SHSYGAVDCAAPLCRRL-------DSGGCDLRRKAC-----LYQVAYGDGSVTAGDFATE 241
Query: 219 TLRFPSKT-VPNFLAGCSILSDR---QPAGIAGFGRSSESLPSQLGL---KKFSYCLLSR 271
TL F S VP GC ++ AG+ G GR S S PSQ+ + FSYCL+ R
Sbjct: 242 TLTFASGARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDR 301
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
A +S T GSG G +P G G+ +
Sbjct: 302 TSSSASATSRSSTVTF-GSGARGALGRRVL----HPDGEEPQDGDVLLRAAHGHQRRRRA 356
Query: 332 VKIPYSYLVP--GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
P S G GGVIVDSG P + + SRAA +
Sbjct: 357 RPGRGRVRPPPDPSTGRGGVIVDSG-----RPSPAWARAGR--TPPCATRSRAA----AA 405
Query: 390 GLR----------PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
GLR C+D+SG K V +P + + F GGA+ ALPPENY V + F
Sbjct: 406 GLRLSPGGFSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAF 465
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G + I+G+ Q Q F + FD R GF + C
Sbjct: 466 AGTDGGVS-------IIGNIQQQGFRVVFDGDGQRLGFVPKGC 501
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 115/447 (25%), Positives = 173/447 (38%), Gaps = 67/447 (14%)
Query: 46 HHSDSDPLKILHSLASSSLSRARHLKT-KTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGY 104
H + + ++AS +R +L + PK I S V + G Y
Sbjct: 49 QHKAGSWVNTVINMASKDPARVTYLSSLVASPKATSVPIASGQQ---------VLNIGNY 99
Query: 105 SISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
+ + GTP Q + DT W PC C C+ P F P SS+
Sbjct: 100 VVRVKLGTPGQLMF-MVLDTSRDAAWVPCAD---CAGCSSPT--------FSPNTSSTYA 147
Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSE-TLRFP 223
+ C P+C+ + G + + N+T YG + +LS+ +L
Sbjct: 148 SLQCSVPQCTQVRGLSCPTTGTAACFFNQT-----------YGGDSSFSAMLSQDSLGLA 196
Query: 224 SKTVPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAP 277
T+P++ GC S P G+ G GR SL SQ G FSYC S F
Sbjct: 197 VDTLPSYSFGCVNAVSGSTLPPQGLLGLGRGPMSLLSQSGSLYSGVFSYCFPS--FKSYY 254
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S +L L GP G K + TP +NP + YYV L + VG V +
Sbjct: 255 FSGSLRL--GP-LGQPK--NIRTTPLLRNPHRPT-----LYYVNLTGVSVGRVLVPVAPE 304
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM-GNYSRAADVEKKSGLRPCFD 396
L + G I+DSG+ T P++ A+ EF +Q+ G ++ + CF
Sbjct: 305 LLAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKGPFATIGAFDT------CFA 358
Query: 397 ISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAII 455
+ + P + F G + LP EN + CL + AA P +
Sbjct: 359 ATNED--IAPPVTFHFT-GMDLKLPLENTLIHSSAGSLACLAM----AAAPNNVNSVLNV 411
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ + Q QN + FD+ N R G A++ C
Sbjct: 412 IANLQQQNLRIMFDVTNSRLGIARELC 438
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 106/407 (26%), Positives = 162/407 (39%), Gaps = 71/407 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+PP I DTGS ++W C+S C + +D + F S
Sbjct: 98 GLYFTKVKLGSPPTEFNVQI-DTGSDILWVTCSSCSNCPHSSGLGID---LHFFDAPGSF 153
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
++ + C +P CS +F ++ CS N+ Y +YG G T+G +++T
Sbjct: 154 TAGSVTCSDPICSSVF----QTTAAQCSENNQC------GYSFRYGDGSGTSGYYMTDTF 203
Query: 221 RFPSKTVPNFLA--------GCSIL-------SDRQPAGIAGFGRSSESLPSQLGLKK-- 263
F + + +A GCS SD+ GI GFG+ S+ SQL +
Sbjct: 204 YFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGIT 263
Query: 264 ---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
FS+CL D VL G+ PG+ Y+P + +
Sbjct: 264 PPVFSHCLKG----DGSGGGVFVL------GEILVPGMVYSPLLPSQP----------HY 303
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L + +G +P V + G IVD+G+T T+ L + F+ + N
Sbjct: 304 NLNLLSIGVNGQILPIDAAVFEASNTRGTIVDTGTTLTY----LVKEAYDPFLNAISNSV 359
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
S C+ +S S P + L F GGA M L P++Y G F
Sbjct: 360 SQLVTLIISNGEQCYLVSTSISDMFPPVSLNFAGGASMMLRPQDYLFHYG--------FY 411
Query: 441 DNAAGPALGRGPA----IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
D A+ +G A ILGD L++ +DLA R G+A C+
Sbjct: 412 DGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWANYDCS 458
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 116/458 (25%), Positives = 171/458 (37%), Gaps = 65/458 (14%)
Query: 39 LSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPL-S 97
LS H +H PL+ + +LA + +R L +K S + P+ S
Sbjct: 24 LSVYHNVHPPSPSPLESIIALARADDARLLFLSSKAA-----------SSGGITSAPVAS 72
Query: 98 VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--- 154
+ Y + GTP Q DT + W C C C PA
Sbjct: 73 GQTPPSYVVRAGLGTPVQQLL-LALDTSADATWSHCAP---CDTC----------PAGSR 118
Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGL 214
FIP SSS + C + C G + ++ + PL ++ +
Sbjct: 119 FIPASSSSYASLPCASDWCPLFEGQPCPAN------QDASAPLPACAFSKPFADTSFQAS 172
Query: 215 LLSETLRFPSKTVPNFLAGC-----SILSDRQPAGIAGFGRSSESLPSQLGLK---KFSY 266
L S+TLR + + GC ++ G+ G GR SL SQ G + FSY
Sbjct: 173 LGSDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSY 232
Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
CL S + S +L L G + P + YTP NP S YYV + +
Sbjct: 233 CLPS--YRSYYFSGSLRL------GAAGQPRNVRYTPLLTNPHRPS-----LYYVNVTGL 279
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
VG VK+P G ++DSG+ T P++ A+ +EF RQ+ S
Sbjct: 280 SVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPS---GY 336
Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAA 444
CF+ + P + L GG + LP EN L CL + A
Sbjct: 337 TSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAM----AE 392
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
P ++ + Q QN + D+A R GFA++ C
Sbjct: 393 APQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 103/417 (24%), Positives = 174/417 (41%), Gaps = 73/417 (17%)
Query: 85 SNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
S N+ +K + S G Y+ L GTPPQ I DTGS++ + PC++ C C
Sbjct: 61 SQLPNAHMKLYDDLLSNGYYTTRLWIGTPPQEFA-LIVDTGSTVTYVPCST---CKQCG- 115
Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
+ P F P+ SSS + + C NP C+ C K C Y
Sbjct: 116 ----KHQDPKFQPELSSSYKALKC-NPDCN-------------CDDEGKLC-----VYER 152
Query: 205 QYG-LGFTAGLLLSETLRFPSK---TVPNFLAGCS-----ILSDRQPAGIAGFGRSSESL 255
+Y + ++G+L + + F ++ T + GC L ++ GI G GR S+
Sbjct: 153 RYAEMSSSSGVLSEDLISFGNESQLTPQRAVFGCENVETGDLFSQRADGIMGLGRGKLSV 212
Query: 256 PSQLGLKKFSYCLLSRKFDDAPVSSNLVL--DTGPGSGDSKTPGLSYTPFYKNPVGSSSA 313
QL K + S + V ++ P +G S++ +++P
Sbjct: 213 VDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKISPPAGMV----FSHSDPFRSP------ 262
Query: 314 FGEFYYVGLRQIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
+Y + L+Q+ V K +K+ P + +G G ++DSG+T+ + F A+
Sbjct: 263 ---YYNIDLKQMHVAGKSLKLNPKVF-----NGKHGTVLDSGTTYAYFPKEAFIAIKDAI 314
Query: 373 IRQMGNYSRAADVEKKSGLRPCFDISGKKSV----YLPELILKFKGGAKMALPPENYF-- 426
I+++ + R + CF +G+ + PE+ ++F G K+ L PENY
Sbjct: 315 IKEIPSLKRIHGPDPNYD-DVCFSGAGRDVAEIHNFFPEIDMEFGNGQKLILSPENYLFR 373
Query: 427 ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
CL +F D R +LG ++N + +D ND+ GF K C+
Sbjct: 374 HTKVRGAYCLGIFPD--------RDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCS 422
>gi|255552253|ref|XP_002517171.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543806|gb|EEF45334.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 437
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 112/401 (27%), Positives = 165/401 (41%), Gaps = 71/401 (17%)
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
P D G SL+W C Y SSS + + C + C
Sbjct: 54 PLVPVKLTVDLGGSLMWINCEEGYV---------------------SSSYRPLSCDSALC 92
Query: 174 SWIFGPNVESRCKGC--SPR----NKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSK-- 225
S N +S K C SP+ N TC + + ++ G G G + F K
Sbjct: 93 SL---SNSQSCNKECYSSPKPGCYNNTCGQSSNNRVVYIGTGGDLGQDVVALQSFDGKNL 149
Query: 226 ----TVPNFLAGCSI---LSDRQPA--GIAGFGRSSESLP----SQLGLKK-FSYCLLSR 271
+VPNF C I L D G+AG GRS+ SLP S +G K FS CL
Sbjct: 150 GRIVSVPNFPFVCGITWLLDDLADGVTGMAGLGRSNISLPAYFSSAIGFSKTFSICL--- 206
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS--SSAFGEF---YYVGLRQII 326
+ SN V+ G G + L Y NPVG+ S+ GE YY+G++ I
Sbjct: 207 ---SSSTKSNGVIVFGDGPSSIVSNDLIYIRLILNPVGTPGYSSLGESSADYYIGVKSIR 263
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
V K VK + L DGNGG ++ + + +T + +++A+ K FI+++
Sbjct: 264 VDGKEVKFDKTLLSIDKDGNGGTMLSTVNPYTVLHTSIYKALLKAFIKKLVFRFSLVVPS 323
Query: 387 KKSGLRPCFDISGKKSV-----YLPELILKFK----GGAKMALPPENYFALVGNEVLCLI 437
C +G ++ Y+P + L+ + + N V + +CL
Sbjct: 324 VPVPFGACVFSNGFRTTEEFLSYVPIINLELESEQGNSVYWRILGANSMVAVNSYTMCLA 383
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
F D + P R P II+G QL++ L FDLA+ R GF+
Sbjct: 384 -FIDGGSQP---RTP-IIIGGHQLEDNLLHFDLASSRLGFS 419
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 129/461 (27%), Positives = 192/461 (41%), Gaps = 83/461 (18%)
Query: 45 LHHSDSDPLKILHSLASSSLSRARHLKTKTKPKT---KDSNIGSNYSNSLIKTPLSVHSY 101
L H DS P ++ A +S R R+ ++ T + + N S I + +
Sbjct: 30 LIHRDS-PKSPFYNSAETSSQRMRNAIRRSARSTLQFSNDDASPNSPQSFITS-----NR 83
Query: 102 GGYSISLSFGTPPQASTPF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
G Y +++S GTPP P I DTGS L+W C C DC P F PK
Sbjct: 84 GEYLMNISIGTPP---VPILAIADTGSDLIWTQCNP---CEDCY-----QQTSPLFDPKE 132
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
SS+ + + C + +C + CS TC SY + YG +T G + +
Sbjct: 133 SSTYRKVSCSSSQCRAL-------EDASCSTDENTC-----SYTITYGDNSYTKGDVAVD 180
Query: 219 TLRFPSK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSY 266
T+ S ++ N + GC + D +GI G G S SL SQL KFSY
Sbjct: 181 TVTMGSSGRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSY 240
Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
CL+ + S G SGD +S + K+P +Y++ L I
Sbjct: 241 CLVPFTSETGLTSKINFGTNGIVSGDGV---VSTSMVKKDP-------ATYYFLNLEAIS 290
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF---EAVAKEFIRQMGNYSRAA 383
VGSK KI ++ + G+ G G +++DSG+T T + + E+V I +A
Sbjct: 291 VGSK--KIQFTSTIFGT-GEGNIVIDSGTTLTLLPSNFYYELESVVASTI-------KAE 340
Query: 384 DVEKKSG-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
V+ G L C+ S S +P++ + FKGG + L N F V +V C
Sbjct: 341 RVQDPDGILSLCYRDS--SSFKVPDITVHFKGG-DVKLGNLNTFVAVSEDVSCFAF---- 393
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
AA L I G+ NF + +D + F K C+
Sbjct: 394 AANEQL-----TIFGNLAQMNFLVGYDTVSGTVSFKKTDCS 429
>gi|255552245|ref|XP_002517167.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543802|gb|EEF45330.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 435
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 103/406 (25%), Positives = 166/406 (40%), Gaps = 89/406 (21%)
Query: 114 PQASTPFIFDTGSSLVWFPC----TSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI--- 166
P + D G + +W C +S Y V C+ S + S +++
Sbjct: 58 PLVAVKLTVDLGGTFMWVDCDNYVSSSYTPVRCD------SALCKLADSHSCTTECYSSP 111
Query: 167 --GCQNPKCSWI-FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
GC N CS I + P V G +G L S ++P
Sbjct: 112 KPGCYNNTCSHIPYNPVVHVSTSG-------------------DIGLDVVSLQSMDGKYP 152
Query: 224 SK--TVPN--FLAGCSILSDRQP---AGIAGFGRSSESLP----SQLGLK-KFSYCLLSR 271
+ +VPN F+ G + + G+AG GR + SLP S LGL+ KF+ CL S
Sbjct: 153 GRNVSVPNVPFVCGTGFMLENLADGVLGVAGLGRGNISLPAYFSSALGLQSKFAICLSSL 212
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQII 326
+S+ V+ G G + L YTP +NPV ++ A+ E Y++ ++ +
Sbjct: 213 ------TNSSGVIYFGDSIGPLSSDFLIYTPLVRNPVSTAGAYFEGQSSTDYFIAVKTLR 266
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--------- 377
VG K +K + L ++G GG + + +T + +++AV K F +QM
Sbjct: 267 VGGKEIKFNKTLLSIDNEGKGGTRISTVHPYTLLHTSIYKAVIKAFAKQMKFLIEVNPPI 326
Query: 378 ------NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
S A D+ + + P D L+L+ +G + N + +
Sbjct: 327 APFGLCYQSAAMDINEYGPVVPFID-----------LVLESQGSVYWRIWGANSMVKISS 375
Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
V+CL F D P +II+G QL++ L+FDLA+ R GF
Sbjct: 376 YVMCL-GFVDGGLKP----DSSIIIGGRQLEDNLLQFDLASARLGF 416
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 117/452 (25%), Positives = 182/452 (40%), Gaps = 82/452 (18%)
Query: 40 STKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLS-- 97
S+K L+ + + + + A S++RA H Y +L TP S
Sbjct: 37 SSKSPLYQPTQNKYQHIVNAARRSINRANHF----------------YKTALTNTPQSTV 80
Query: 98 VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
+ +G Y ++ S GTPP I DTGS +VW C C + P F P
Sbjct: 81 IPDHGEYLMTYSVGTPP-FKLYGIADTGSDIVWLQCEPCKECYN--------QTTPKFKP 131
Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLS 217
+SS+ + I C + CK N L+ + L+ G
Sbjct: 132 SKSSTYKNIPCSS------------DLCKSGQQGN----LSVDTLTLESSTG-------- 167
Query: 218 ETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
+ FP KTV ++ + +GI G G SL +QLG KFSYCLL +
Sbjct: 168 HPISFP-KTVIGCGTDNTVSFEGASSGIVGLGGGPASLITQLGSSIDAKFSYCLLPNPVE 226
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
S DT SGD G+ TP K+P+ FYY+ L VG+K ++
Sbjct: 227 SNTTSKLNFGDTAVVSGD----GVVSTPIVKKDPI-------VFYYLTLEAFSVGNKRIE 275
Query: 334 IPYSYLVPGSDG--NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
S S+G G +I+DSG+T T + ++ + + + ++ R D + L
Sbjct: 276 FEGS-----SNGGHEGNIIIDSGTTLTVIPTDVYNNL-ESAVLELVKLKRVNDPTRLFNL 329
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
C+ ++ + P + FK GA + L P + F V + ++CL T +A P+
Sbjct: 330 --CYSVTSDGYDF-PIITTHFK-GADVKLHPISTFVDVADGIVCLAFATTSAFIPS---D 382
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+ QN + +DL F C+
Sbjct: 383 VVSIFGNLAQQNLLVGYDLQQKIVSFKPTDCS 414
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 116/458 (25%), Positives = 171/458 (37%), Gaps = 65/458 (14%)
Query: 39 LSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPL-S 97
LS H +H PL+ + +LA + +R L +K S + P+ S
Sbjct: 24 LSVYHNVHPPSPSPLESIIALARADDARLLFLSSKAA-----------SSGGVTSAPVAS 72
Query: 98 VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--- 154
+ Y + GTP Q DT + W C C C PA
Sbjct: 73 GQTPPSYVVRAGLGTPVQQLL-LALDTSADATWSHCAP---CDTC----------PAGSR 118
Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGL 214
FIP SSS + C + C G + ++ + PL ++ +
Sbjct: 119 FIPASSSSYASLPCASDWCPLFEGQPCPAN------QDASAPLPACAFSKPFADTSFQAS 172
Query: 215 LLSETLRFPSKTVPNFLAGC-----SILSDRQPAGIAGFGRSSESLPSQLGLK---KFSY 266
L S+TLR + + GC ++ G+ G GR SL SQ G + FSY
Sbjct: 173 LGSDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSY 232
Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
CL S + S +L L G + P + YTP NP S YYV + +
Sbjct: 233 CLPS--YRSYYFSGSLRL------GAAGQPRNVRYTPLLTNPHRPS-----LYYVNVTGL 279
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
VG VK+P G ++DSG+ T P++ A+ +EF RQ+ S
Sbjct: 280 SVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPS---GY 336
Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAA 444
CF+ + P + L GG + LP EN L CL + A
Sbjct: 337 TSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAM----AE 392
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
P ++ + Q QN + D+A R GFA++ C
Sbjct: 393 APQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430
>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
gi|194689376|gb|ACF78772.1| unknown [Zea mays]
gi|224031455|gb|ACN34803.1| unknown [Zea mays]
gi|238011528|gb|ACR36799.1| unknown [Zea mays]
gi|238015454|gb|ACR38762.1| unknown [Zea mays]
Length = 304
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 93/334 (27%), Positives = 137/334 (41%), Gaps = 51/334 (15%)
Query: 168 CQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT 226
C CS I + E R TC +Y YG G T G+ +E F S
Sbjct: 3 CAGTLCSDILHHSCE--------RPDTC-----TYRYNYGDGTMTVGVYATERFTFASSG 49
Query: 227 VPNFLA-------GC---SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
GC ++ S +GI GFGR+ SL SQL +++FSYCL S +
Sbjct: 50 GGGLTTTTVPLGFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTS--YASR 107
Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
S+ L G T + TP ++P + FYYV + VG++ ++IP
Sbjct: 108 RQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQNPT-----FYYVHFTGLTVGARRLRIPE 162
Query: 337 SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
S DG+GGVIVDSG+ T + + V + F RQ A + G+ CF
Sbjct: 163 SAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAF-RQQLRLPFANGGNPEDGV--CFL 219
Query: 397 I-------SGKKSVYLPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPAL 448
+ S + +P ++L F+ GA + LP NY LCL+L G
Sbjct: 220 VPAAWRRSSSTSQMPVPRMVLHFQ-GADLDLPRRNYVLDDHRRGRLCLLLADSGDDGST- 277
Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+G+ Q+ + +DL + A +C
Sbjct: 278 -------IGNLVQQDMRVLYDLEAETLSIAPARC 304
>gi|359806276|ref|NP_001241217.1| uncharacterized protein LOC100818868 precursor [Glycine max]
gi|255644718|gb|ACU22861.1| unknown [Glycine max]
Length = 450
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 106/416 (25%), Positives = 163/416 (39%), Gaps = 72/416 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
YS S+ GTPP + + D +WF C + Y N P R K++ +
Sbjct: 50 YSTSIDMGTPP-LTLDLVIDIRERFLWFECGNDY-----NSSTYYPVRCGTKKCKKAKGT 103
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETL--- 220
I C N GC+ N TC + +G F +G + + L
Sbjct: 104 ACITCTNHPLK-----------TGCT--NNTCGV---DPFNPFGEFFVSGDVGEDILSSL 147
Query: 221 ------RFPSKT-VPNFLAGCSILSDR------------QPAGIAGFGRSSESLPSQLGL 261
R PS VP F++ C + D+ G+ G R++ SLP+QL
Sbjct: 148 HSTSGARAPSTLHVPRFVSTC-VYPDKFGVEGFLQGLAKGKKGVLGLARTAISLPTQLAA 206
Query: 262 K-----KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG------LSYTPFYKNPVGS 310
K KF+ CL S N + D G G P LSYTP NP +
Sbjct: 207 KYNLEPKFALCLPSTS------KYNKLGDLFVGGGPYYLPPHDASKFLSYTPILTNPQST 260
Query: 311 SSAF----GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFE 366
F Y++ ++ I + K V + S L GNGG + + +T +++
Sbjct: 261 GPIFDADPSSEYFIDVKSIKLDGKIVNVNTSLLSIDRQGNGGCKLSTVVPYTKFHTSIYQ 320
Query: 367 AVAKEFIRQMGNYSRAADVEKKSGLRPCFD--ISGKKSV--YLPELILKFKGGAKMALPP 422
+ +F++Q + V + CFD GK +P + L KGG + +
Sbjct: 321 PLVNDFVKQAA-LRKIKRVTSVAPFGACFDSRTIGKTVTGPNVPTIDLVLKGGVQWRIYG 379
Query: 423 ENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
N V VLCL F D P +I++G +Q+++ LEFDL + + GF+
Sbjct: 380 ANSMVKVSKNVLCL-GFVDGGLEPGSPIATSIVIGGYQMEDNLLEFDLVSSKLGFS 434
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 104/407 (25%), Positives = 162/407 (39%), Gaps = 71/407 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+PP I DTGS ++W C+S C + +D + F S
Sbjct: 98 GLYFTKVKLGSPPTEFNVQI-DTGSDILWVTCSSCSNCPHSSGLGID---LHFFDAPGSL 153
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
++ + C +P CS +F ++ CS N+ Y +YG G T+G +++T
Sbjct: 154 TAGSVTCSDPICSSVF----QTTAAQCSENNQC------GYSFRYGDGSGTSGYYMTDTF 203
Query: 221 RFPSKTVPNFLA--------GCSIL-------SDRQPAGIAGFGRSSESLPSQLGLKK-- 263
F + + +A GCS SD+ GI GFG+ S+ SQL +
Sbjct: 204 YFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGIT 263
Query: 264 ---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
FS+CL D VL G+ PG+ Y+P + +
Sbjct: 264 PPVFSHCLKG----DGSGGGVFVL------GEILVPGMVYSPLVPSQP----------HY 303
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L + +G +P V + G IVD+G+T T++ ++ F+ + N
Sbjct: 304 NLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDL----FLNAISNSV 359
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
S C+ +S S P + L F GGA M L P++Y G
Sbjct: 360 SQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYG--------IY 411
Query: 441 DNAAGPALGRGPA----IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
D A+ +G A ILGD L++ +DLA R G+A C+
Sbjct: 412 DGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDCS 458
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 119/405 (29%), Positives = 165/405 (40%), Gaps = 70/405 (17%)
Query: 98 VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPA 154
V S G Y ++LS GTPP I DTGS L W PCT Y+ V +P
Sbjct: 86 VPSAGEYIMNLSIGTPPVPVIA-IVDTGSDLTWTQCRPCTHCYKQV-----------VPF 133
Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAG 213
F PK SS+ + C C + C+ K C +++ Y G FT G
Sbjct: 134 FDPKNSSTYRDSSCGTSFC---LALGNDRSCR----NGKKC-----TFMYSYADGSFTGG 181
Query: 214 LLLSETLRFPSK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK-- 262
L ETL S + P F GC S D +GI G G + S+ SQL
Sbjct: 182 NLAVETLTVASTAGKPVSFPGFAFGCVHRSGGIFDEHSSGIVGLGVAELSMISQLKSTIN 241
Query: 263 -KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPF-YKNPVGSSSAFGEFYYV 320
+FSYCLL F D+ +SS + SG G TP K P +Y +
Sbjct: 242 GRFSYCLLP-VFTDSSMSSRINFGR---SGIVSGAGTVSTPLVMKGPD------TYYYLI 291
Query: 321 GLRQIIVGSKHVKIP-YSYLVPGSDGNGGVIVDSGSTFTFMEGPL-FEAVAKEFIRQMGN 378
L VG K + +S +GN +IVDSG+T+T++ PL F +E +
Sbjct: 292 TLEGFSVGKKRLSYKGFSKKAEVEEGN--IIVDSGTTYTYL--PLEFYVKLEESVAHSIK 347
Query: 379 YSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
R D S L C++ + + + P + FK A + L P N F + +++C +
Sbjct: 348 GKRVRDPNGISSL--CYNTTVDQ-IDAPIITAHFK-DANVELQPWNTFLRMQEDLVCFTV 403
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ G ILG+ NF + FDL R F C
Sbjct: 404 LPTSDIG---------ILGNLAQVNFLVGFDLRKKRVSFKAADCT 439
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 122/440 (27%), Positives = 178/440 (40%), Gaps = 56/440 (12%)
Query: 58 SLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQAS 117
+LAS +R +L+ + P S+ S S I + H G Y + + G+PP
Sbjct: 81 ALASRDTARVAYLQRRLSPSPSPSSTSSVESGGTIVS----HGSGEYLVRVGIGSPPLEQ 136
Query: 118 TPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIF 177
+ DTGS ++W C+ C DC + DP F P S+S + C + C
Sbjct: 137 H-LVADTGSDVIWVQCSP---CSDC-YAQGDP----LFDPANSASFSPVPCNSGVC---- 183
Query: 178 GPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKT-VPNFLAGCS 235
+R S Y + YG +T G+L ETL T V GC
Sbjct: 184 --RAAARYSSSSCGGGG---GECEYKVSYGDKSYTNGVLALETLTLDGGTEVQGVAMGCG 238
Query: 236 ILSD---RQPAGIAGFGRSSESLPSQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPG 289
+ + AG+ G G SL QLG FSYCL + S +LVL G
Sbjct: 239 HENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGSGSGSLVL----G 294
Query: 290 SGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGV 349
D+ G + P +NP S FYYVG+ + V + +++ G DG GGV
Sbjct: 295 REDAAPTGAVWVPLVRNPDAPS-----FYYVGVNGLGVAGERLQLQDGLFDLGDDGGGGV 349
Query: 350 IVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL-RPCFDISGKKSVYLPEL 408
++D+G+ T + + A+ F G + A L C+D+SG SV +P +
Sbjct: 350 VMDTGTAVTRLPAEAYAALRGAF---AGAFEEGAPRAPGVSLFDTCYDLSGYASVRVPTV 406
Query: 409 ILKFKG------GAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQ 462
L F G A + LP N V + + F A+GP+ ILG+ Q Q
Sbjct: 407 ALYFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPS-------ILGNIQQQ 459
Query: 463 NFYLEFDLANDRFGFAKQKC 482
+ D A+ GF C
Sbjct: 460 GIEITVDSASGYVGFGPATC 479
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 104/406 (25%), Positives = 161/406 (39%), Gaps = 71/406 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+PP I DTGS ++W C+S C + +D + F S
Sbjct: 98 GLYFTKVKLGSPPTEFNVQI-DTGSDILWVTCSSCSNCPHSSGLGID---LHFFDAPGSL 153
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
++ + C +P CS +F ++ CS N+ Y +YG G T+G +++T
Sbjct: 154 TAGSVTCSDPICSSVF----QTTAAQCSENNQC------GYSFRYGDGSGTSGYYMTDTF 203
Query: 221 RFPSKTVPNFLA--------GCSIL-------SDRQPAGIAGFGRSSESLPSQLGLKK-- 263
F + + +A GCS SD+ GI GFG+ S+ SQL +
Sbjct: 204 YFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGIT 263
Query: 264 ---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
FS+CL D VL G+ PG+ Y+P + +
Sbjct: 264 PPVFSHCLKG----DGSGGGVFVL------GEILVPGMVYSPLVPSQP----------HY 303
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L + +G +P V + G IVD+G+T T++ ++ F+ + N
Sbjct: 304 NLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDL----FLNAISNSV 359
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
S C+ +S S P + L F GGA M L P++Y G
Sbjct: 360 SQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYG--------IY 411
Query: 441 DNAAGPALGRGPA----IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
D A+ +G A ILGD L++ +DLA R G+A C
Sbjct: 412 DGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 457
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 87/305 (28%), Positives = 142/305 (46%), Gaps = 44/305 (14%)
Query: 194 TCPLACP--SYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIAG 247
C A P +Y + YG G FT G L E L+F + V +F+ GC + +G+ G
Sbjct: 125 VCGSAAPICNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMG 184
Query: 248 FGRSSESLPSQL-GL--KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY 304
GRS SL SQ G+ FSYCL S + S +L+L + +P +SY
Sbjct: 185 LGRSDLSLISQTSGIFGGVFSYCLPS---TERKGSGSLILGGNSSVYRNSSP-ISYAKMI 240
Query: 305 KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL 364
+NP FY++ L I +G ++ P S G ++VDSG+ T + +
Sbjct: 241 ENP-----QLYNFYFINLTGISIGGVALQAP-------SVGPSRILVDSGTVITRLPPTI 288
Query: 365 FEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPEN 424
++A+ EF++Q + A S L CF++S + V +P + + F+G A++ +
Sbjct: 289 YKALKAEFLKQFTGFPPAPAF---SILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTG 345
Query: 425 YFALVGNEV--LCLIL----FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
F V ++ +CL L + D A ILG++Q +N + +D + GFA
Sbjct: 346 VFYFVKSDASQVCLALASLEYQDEVA----------ILGNYQQKNLRVIYDTKETKVGFA 395
Query: 479 KQKCA 483
+ C+
Sbjct: 396 LETCS 400
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 101/374 (27%), Positives = 156/374 (41%), Gaps = 54/374 (14%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
I DTGSSL W C C DP + P S + + + C + +CS +
Sbjct: 1 MILDTGSSLSWLQCQP---CAVYCHAQADP----LYDPSVSKTYKKLSCASVECSRLKAA 53
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPS-KTVPNFLAGCSIL 237
+ P +T AC Y YG F+ G L + L S +T+P F GC
Sbjct: 54 TLND------PLCETDSNACL-YTASYGDTSFSIGYLSQDLLTLTSSQTLPQFTYGCG-- 104
Query: 238 SDRQ-----PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPG 289
D Q AGI G R S+ +QL K FSYCL P +++ G
Sbjct: 105 QDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCL--------PTANSGSSGGGFL 156
Query: 290 SGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS-YLVPGSDGNGG 348
S S +P T + P+ + S Y++ L I V + + + + Y VP
Sbjct: 157 SIGSISP----TSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVP------- 205
Query: 349 VIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPEL 408
++DSG+ T + ++ A+ + F++ M ++ A S L CF S K +PE+
Sbjct: 206 TLIDSGTVITRLPMSMYAALRQAFVKIMS--TKYAKAPAYSILDTCFKGSLKSISAVPEI 263
Query: 409 ILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
+ F+GGA + L + + CL AG + G I+G+ Q Q + + +
Sbjct: 264 KMIFQGGADLTLRAPSILIEADKGITCLAF-----AGSS-GTNQIAIIGNRQQQTYNIAY 317
Query: 469 DLANDRFGFAKQKC 482
D++ R GFA C
Sbjct: 318 DVSTSRIGFAPGSC 331
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 108/407 (26%), Positives = 163/407 (40%), Gaps = 71/407 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+PP+ I DTGS ++W C S C +C + ++ F SS
Sbjct: 64 GLYFTKVKLGSPPREFNVQI-DTGSDVLWVCCNS---CNNCPRTSGLGIQLNFFDSSSSS 119
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
++ + C +P C+ V++ CS + C SY QYG G T+G +S+TL
Sbjct: 120 TAGQVRCSDPICT----SAVQTTATQCSSQTDQC-----SYTFQYGDGSGTSGYYVSDTL 170
Query: 221 RFPS----KTVPN----FLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGLKK-- 263
F + + N + GCS +D+ GI GFG+ S+ SQL +
Sbjct: 171 YFDAILGQSLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGIT 230
Query: 264 ---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
FS+CL D LVL G+ PG+ Y+P + Y +
Sbjct: 231 PRVFSHCLKG----DGSGGGILVL------GEILEPGIVYSPLVPSQ--------PHYNL 272
Query: 321 GLRQIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
L I V + + I P ++ S G IVDSG+T ++ ++ F+ +
Sbjct: 273 NLLSIAVNGQLLPIDPAAFATSNSQGT---IVDSGTTLAYLVAEAYD----PFVSAVNAI 325
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENY---FALVGNEVLCL 436
+ S C+ +S S P F GGA M L PE+Y F G +
Sbjct: 326 VSPSVTPITSKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYLIPFGSSGGSAMWC 385
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I F ILGD L++ +DL R G+A C+
Sbjct: 386 IGFQKVQG--------VTILGDLVLKDKIFVYDLVRQRIGWANYDCS 424
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 85/302 (28%), Positives = 133/302 (44%), Gaps = 51/302 (16%)
Query: 98 VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
+ + G Y +S GTPPQ + DTGS++ W C C C P + F P
Sbjct: 35 IFAMGLYYTRISLGTPPQQFYVDV-DTGSNVAWVKCAP---CTGCEHSGDVPVPMSTFDP 90
Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLL 216
++S++ I C + +C V ++ CSP +CP Y L YG G TAG L
Sbjct: 91 RKSTTKISISCTDAEC------GVLNKKLQCSPERLSCP-----YSLLYGDGSSTAGYYL 139
Query: 217 SETLRFPSKTVPN-----------FLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFS 265
++ F N F G + G+ GFG ++ SLP+QL + S
Sbjct: 140 NDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTGSWSVDGLLGFGPTTVSLPNQLAQQNIS 199
Query: 266 YCLLSRKFD-DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY-VGLR 323
+ + D +LV+ G + P L YTP FGE +Y V L
Sbjct: 200 VNIFAHCLQGDVSGRGSLVI------GTIREPDLVYTPM---------VFGEDHYNVQLL 244
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
I + ++V P S+ + + GGVI+DSG+T T++ P ++ EF R + + +++
Sbjct: 245 NIGISGRNVTTPASFDL---EYTGGVIIDSGTTLTYLVQPAYD----EFRRGVSVFKQSS 297
Query: 384 DV 385
D+
Sbjct: 298 DL 299
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 115/435 (26%), Positives = 174/435 (40%), Gaps = 78/435 (17%)
Query: 63 SLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPF-- 120
S++RA H N S YSN+ +++P+++ G Y +S S GTPP P
Sbjct: 59 SMNRANHF-----------NQISVYSNA-VESPVTLLDDGDYLMSYSLGTPP---FPVYG 103
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
I DT S ++W C C + P DPS + SS+ Q CS
Sbjct: 104 IVDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPCSSTTCKSVQGTSCS------ 157
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-----PSKTVPNFLAGC 234
S K C + + Y G + G L+ ET+ P P + GC
Sbjct: 158 --------SDERKIC-----EHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIGC 204
Query: 235 SILSDR--QPAGIAGFGRSSESLPSQLG---LKKFSYCLLSRKFDDAPVSS-NLVLDTGP 288
++ GI G G SL QL KKFSYCL AP+S + L G
Sbjct: 205 IRNTNVSFDSIGIVGLGGGPVSLVPQLSSSISKKFSYCL-------APISDRSSKLKFGD 257
Query: 289 GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGG 348
+ S +S +K+ + +FYY+ L VG+ +I + S G G
Sbjct: 258 AAMVSGDGTVSTRIVFKD-------WKKFYYLTLEAFSVGNN--RIEFRSSSSRSSGKGN 308
Query: 349 VIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPEL 408
+I+DSG+TFT + ++ + + + + RA D K+ L C+ + K V +P +
Sbjct: 309 IIIDSGTTFTVLPDDVYSKL-ESAVADVVKLERAEDPLKQFSL--CYKSTYDK-VDVPVI 364
Query: 409 ILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEF 468
F GA + L N F + + V+CL + + I G+ QNF + +
Sbjct: 365 TAHF-SGADVKLNALNTFIVASHRVVCLAFLSSQSGA---------IFGNLAQQNFLVGY 414
Query: 469 DLANDRFGFAKQKCA 483
DL F C
Sbjct: 415 DLQRKIVSFKPTDCT 429
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 115/439 (26%), Positives = 169/439 (38%), Gaps = 81/439 (18%)
Query: 66 RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFI--FD 123
R + + KPK S + +N+ SL T Y SL GTP +T + D
Sbjct: 110 RRKVTASSNKPKGGVSLL-ANWGKSLSTT--------NYVASLRLGTP---ATELVVELD 157
Query: 124 TGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVES 183
TGS W C C DC R P F P SS+ + C +C + +
Sbjct: 158 TGSDQSWVQCKP---CADCY-----EQRDPVFDPTASSTYSAVPCGARECQELASSSSSR 209
Query: 184 RCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP-------SKTVPNFLAGCSI 236
C + +N CP + T G L +TL + TVP F+ GC
Sbjct: 210 NCSSDNNKN------CPYEVSYDDDSHTVGDLARDTLTLSPSPSPSPADTVPGFVFGCG- 262
Query: 237 LSDRQPAGIAG-------FGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDT 286
AG G G SLPSQ+ + FSYCL S S+ L
Sbjct: 263 ---HSNAGTFGEVDGLLGLGLGKASLPSQVAARYGAAFSYCLPSSP------SAAGYLSF 313
Query: 287 GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGN 346
G G + +T +S YY+ L I+V + +K+P S +
Sbjct: 314 G---GAAARANAQFTEMVTGQDPTS------YYLNLTGIVVAGRAIKVPASAFATAA--- 361
Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP 406
G I+DSG+ F+ + + A+ F MG Y R C+D +G ++V +P
Sbjct: 362 -GTIIDSGTAFSRLPPSAYAALRSSFRSAMGRY-RYKRAPSSPIFDTCYDFTGHETVRIP 419
Query: 407 ELILKFKGGAKMALPPENYFALVGNEV--LCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
+ L F GA + L P N+V CL ++ G ILG+ Q +
Sbjct: 420 AVELVFADGATVHLHPSGVL-YTWNDVAQTCLAFVPNHDLG---------ILGNTQQRTL 469
Query: 465 YLEFDLANDRFGFAKQKCA 483
+ +D+ + R GF ++ CA
Sbjct: 470 AVIYDVGSQRIGFGRKGCA 488
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 104/406 (25%), Positives = 161/406 (39%), Gaps = 73/406 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + G+PP I DTGS ++W C+S C + +D + F S ++
Sbjct: 105 YFTKVKLGSPPTEFNVQI-DTGSDILWVTCSSCSNCPHSSGLGID---LHFFDAPGSLTA 160
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
+ C +P CS +F ++ CS N+ Y +YG G T+G +++T F
Sbjct: 161 GSVTCSDPICSSVF----QTTAAQCSENNQC------GYSFRYGDGSGTSGYYMTDTFYF 210
Query: 223 PSKTVPNFLA--------GCSIL-------SDRQPAGIAGFGRSSESLPSQLGLKK---- 263
+ + +A GCS SD+ GI GFG+ S+ SQL +
Sbjct: 211 DAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPP 270
Query: 264 -FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVG 321
FS+CL D VL G+ PG+ Y+P P +
Sbjct: 271 VFSHCLKG----DGSGGGVFVL------GEILVPGMVYSPLVPSQP-----------HYN 309
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
L + +G +P V + G IVD+G+T T++ ++ F+ + N
Sbjct: 310 LNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDL----FLNAISNSVS 365
Query: 382 AADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
S C+ +S S P + L F GGA M L P++Y G D
Sbjct: 366 QLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYG--------IYD 417
Query: 442 NAAGPALGRGPA----IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A+ +G A ILGD L++ +DLA R G+A C+
Sbjct: 418 GASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDCS 463
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 115/410 (28%), Positives = 176/410 (42%), Gaps = 75/410 (18%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--FIPKR 159
G Y + G PP+ I DTGS ++W C S C C P +IP F P
Sbjct: 81 GLYYTRVQLGNPPKDFYVQI-DTGSDVLWVSCNS---CNGC--PATSGLQIPLNFFDPGS 134
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSE 218
S+++ L+ C + C+ V+S C ++ C +Y+ QYG G T+G + +
Sbjct: 135 STTASLVSCSDQICAL----GVQSSDSACFGQSNQC-----AYVFQYGDGSGTSGYYVMD 185
Query: 219 TLRFP--------SKTVPNFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQL---G 260
+ S + + + GCS SDR GI GFG+ S+ SQL G
Sbjct: 186 MIHLDVVIDSSVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRG 245
Query: 261 L--KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
+ K FS+CL K DD+ LVL G+ P + YTP + Y
Sbjct: 246 IAPKVFSHCL---KGDDSG-GGILVL------GEIVEPNVVYTPLVPSQ--------PHY 287
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
+ L+ I V + +P S V + + G I+DSG+T ++ + A F+ + N
Sbjct: 288 NLNLQSISVNGQ--VLPISPAVFATSSSQGTIIDSGTTLAYLAEEAYNA----FVVAVTN 341
Query: 379 -YSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEV 433
S++ G R C+ S S P++ L F GGA + L ++Y ++ G V
Sbjct: 342 IVSQSTQSVVLKGNR-CYVTSSSVSDIFPQVSLNFAGGASLVLGAQDYLIQQNSVGGTTV 400
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
C+ G+G I LGD L++ +DLAN R G+ C+
Sbjct: 401 WCI------GFQKIPGQGITI-LGDLVLKDKIFIYDLANQRIGWTNYDCS 443
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 117/427 (27%), Positives = 169/427 (39%), Gaps = 95/427 (22%)
Query: 87 YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIF---DTGSSLVWFPCTSRYRCVDCN 143
Y S I++P+S + Y + LS GTPP I+ DTGS LVWF C +C
Sbjct: 44 YKPSTIQSPVSAYDCE-YLMELSIGTPPIK----IYAEADTGSDLVWFQCIPCTKCY--- 95
Query: 144 FPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYL 203
+ P F P+ SSS I C C+ + CS KTC +Y
Sbjct: 96 -----KQQNPMFDPRSSSSYTNITCGTESCNKL-------DSSLCSTDQKTC-----NYT 138
Query: 204 LQYGLG-FTAGLLLSETLRFPSKT-----VPNFLAGC----SILSDRQPAGIAGFGRSSE 253
Y T G+L ETL S T + GC S +DR+ G+ G GR
Sbjct: 139 YSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGCGHNNSGFNDRE-MGLIGLGRGPL 197
Query: 254 SLPSQLGL------KKFSYCLLSRKFDDAPVSSNLVLDTGP---GSGDSKTPGLSYTPFY 304
SL SQ+G FS CL+ D ++S + G G+G TP +S
Sbjct: 198 SLISQIGSSLGAGGNMFSQCLVPFN-TDPSITSQMNFGKGSEVLGNGTVSTPLIS----- 251
Query: 305 KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVP-GSDGNGGVIVDSGSTFTFMEGP 363
K+ G Y+ L I V + + +P+S G+ G +++DSG+T T++
Sbjct: 252 KDGTG--------YFATLLGISV--EDINLPFSNGSSLGTITKGNILIDSGTTITYL--- 298
Query: 364 LFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYL-------PELILKFKGGA 416
+EF ++ V K L P F I G + Y P L + F+GG
Sbjct: 299 -----PEEFYHRL-----IEQVRNKVALEP-FRIDGYELCYQTPTNLNGPTLTIHFEGG- 346
Query: 417 KMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
+ L P F V ++ C +F N + G++ N+ + FDL
Sbjct: 347 DVLLTPAQMFIPVQDDNFCFAVFDTNEE--------YVTYGNYAQSNYLIGFDLERQVVS 398
Query: 477 FAKQKCA 483
F C
Sbjct: 399 FKATDCT 405
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 101/403 (25%), Positives = 159/403 (39%), Gaps = 63/403 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTPP+ + DTGS ++W C S +C + +D + + PK SS
Sbjct: 82 GLYFTEIKLGTPPKRYYVQV-DTGSDILWVNCISCEKCPRKSGLGLD---LTFYDPKASS 137
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
S + C C+ +G + GC+ N C Y + YG G T G +++ L
Sbjct: 138 SGSTVSCDQGFCAATYG----GKLPGCTA-NVPC-----EYSVMYGDGSSTTGFFVTDAL 187
Query: 221 RF----------PSKTVPNFLAGCSILSD-----RQPAGIAGFGRSSESLPSQLGL---- 261
+F P F G D + GI GFG+++ S+ SQL
Sbjct: 188 QFDQVTGDGQTQPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKV 247
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
K F++CL + K N+V P + TP + Y V
Sbjct: 248 KKIFAHCLDTIKGGGIFAIGNVV-----------QPKVKTTPLVADM--------PHYNV 288
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L+ I VG +++P G G I+DSG+T T++ E V KE + + N
Sbjct: 289 NLKSIDVGGTTLQLPAHVFETGE--RKGTIIDSGTTLTYLP----ELVFKEVMAAIFNKH 342
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
+ CF G P + F+ + + P YF GN++ C + F
Sbjct: 343 QDIVFHNVQDFM-CFQYPGSVDDGFPTITFHFEDDLALHVYPHEYFFPNGNDMYC-VGFQ 400
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ A G+ +++GD L N + +DL N G+ C+
Sbjct: 401 NGALQSKDGK-DIVLMGDLVLSNKLVIYDLENQVIGWTDYNCS 442
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 104/407 (25%), Positives = 159/407 (39%), Gaps = 71/407 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCV-DCNFPNVDPSRIPAFIPKRS 160
G Y + GTPP + DTGS + W C CV + P++ ++ + P RS
Sbjct: 35 GLYYTKIYLGTPPVGYYVQV-DTGSDVTWLNCAPCTSCVTETQLPSI---KLTTYDPSRS 90
Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSET 219
S+ + C++ C G N E C T C +Y YG G T G + +
Sbjct: 91 STDGALSCRDSNCGAALGSN-EVSC--------TSAGYC-AYSTTYGDGSSTQGYFIQDV 140
Query: 220 LRFP---SKTVPNFLA----GCS-------ILSDRQPAGIAGFGRSSESLPSQLGL---- 261
+ F + T N A GC ++S R G+ GFG+++ S+PSQL
Sbjct: 141 MTFQEIHNNTQVNGTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKV 200
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
+F++CL D +V+ G P +SYTP Y V
Sbjct: 201 GNRFAHCLQG----DNQGGGTIVI------GSVSEPNISYTPIVSR---------NHYAV 241
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
G++ I V ++V P S+ S GGVI+DSG+T ++ P + +F+ + +
Sbjct: 242 GMQNIAVNGRNVTTPASFDTT-STSAGGVIMDSGTTLAYLVDPAY----TQFVNAVSTFE 296
Query: 381 RAADVEKKSGLRPCFDISG-KKSVYLPELILKFKGGAKMALPPENYF----ALVGNEVLC 435
+ S C ++ P + L F GA M L P NY G C
Sbjct: 297 SS----MFSSHSQCLQLAWCSLQADFPTVKLFFDAGAVMNLTPRNYLYSQPLQNGQAAYC 352
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ L ILGD L++ + +D N G+ C
Sbjct: 353 MGWQKSTTKAGYLSYS---ILGDIVLKDHLVVYDNDNRVVGWKSFDC 396
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 110/398 (27%), Positives = 163/398 (40%), Gaps = 70/398 (17%)
Query: 96 LSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF 155
LS + Y +++ GTP + P IFDTGS L+W C C +P ++P F
Sbjct: 124 LSKITASDYIVNVGIGTP-KKEMPLIFDTGSGLIWTQCKPCKAC----YP-----KVPVF 173
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGL 214
P +S+S + + C + C +S +GCS T YL Y + G
Sbjct: 174 DPTKSASFKGLPCSSKLC--------QSIRQGCSSPKCT-------YLTAYVDNSSSTGT 218
Query: 215 LLSETLRFP--SKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLG---LKKFSY 266
L +ET+ F N L GCS + +GI G RS SL SQ K FSY
Sbjct: 219 LATETISFSHLKYDFKNILIGCSDQVSGESLGESGIMGLNRSPISLASQTANIYDKLFSY 278
Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
C+ S P S TG + K P + ++P K S Y + + I
Sbjct: 279 CIPS-----TPGS------TGHLTFGGKVPNDVRFSPVSKTAPSSD------YDIKMTGI 321
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
VG + + I S S +DSG+ T + + A+ F M Y +
Sbjct: 322 SVGGRKLLIDASAFKIAS------TIDSGAVLTRLPPKAYSALRSVFREMMKGYPL---L 372
Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV-GNEVLCLILFTDNAA 444
++ L C+D S +V +P + + F+GG +M + V G++V CL
Sbjct: 373 DQDDFLDTCYDFSNYSTVAIPSISVFFEGGVEMDIDVSGIMWQVPGSKVYCLAF------ 426
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A I G+FQ + + + FD A +R GFA C
Sbjct: 427 --AELDDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462
>gi|255552239|ref|XP_002517164.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543799|gb|EEF45327.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 433
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 104/393 (26%), Positives = 157/393 (39%), Gaps = 73/393 (18%)
Query: 121 IFDTGSSLVWFPCTSRY--------RC--VDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
I D G +W C Y RC CN N + F R GC N
Sbjct: 60 ILDLGGLYLWVDCDRGYVSSTYRPARCNSAQCNLANANGCITACFDAPRP------GCNN 113
Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNF 230
C+ + V + L LQ G G ++S V NF
Sbjct: 114 NTCALLVDNTVTNI-------GTDGELGQDVVSLQSTDGSNPGRVVS---------VSNF 157
Query: 231 LAGC--SILSDRQPAG---IAGFGRSSESLPSQLGL-----KKFSYCLLSRK----FDDA 276
L C S + + P+G +AG GR+ SLPSQ +KF+ CL S K F
Sbjct: 158 LFVCAPSFILNGLPSGTEGMAGLGRTKVSLPSQFAAAFSFNRKFAICLSSSKGVVFFGKE 217
Query: 277 PVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIVGSKH 331
P +D SK L+YTP NPV +++AF + Y++G++ I + K
Sbjct: 218 PYIIQPNIDV------SKI--LTYTPLIINPVSTAAAFVQGDPSSDYFIGVKSININGKP 269
Query: 332 VKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
V + + L + G GG ++ + +T ME ++ A F++++ + R A V
Sbjct: 270 VPLNTTLLSINSQTGFGGTMISTVVPYTVMETTIYNAFVNAFVKELVDVPRVASVAP--- 326
Query: 391 LRPCFD----ISGKKSVYLPELILKFKG-GAKMALPPENYFALVGNEVLCLILFTDNAAG 445
CFD + + +P + L + + N V +VLCL F D
Sbjct: 327 FGACFDASKIVGTRLGAAVPSIDLVLQSSNVFWRIVGANSMVQVNEDVLCL-GFVDGGEN 385
Query: 446 PALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
P +I++G QL++ L+FDLA R GF+
Sbjct: 386 PR----TSIVIGGHQLEDNLLQFDLATSRLGFS 414
>gi|21717171|gb|AAM76364.1|AC074196_22 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433290|gb|AAP54828.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125532789|gb|EAY79354.1| hypothetical protein OsI_34483 [Oryza sativa Indica Group]
Length = 382
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 92/382 (24%), Positives = 155/382 (40%), Gaps = 36/382 (9%)
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
S + GTPPQ ++ FI D G LVW C+ N +P F P +SS+ +
Sbjct: 27 SFTIGTPPQPASAFI-DVGGLLVWTQCSQCSSSSCFN------QELPPFDPTKSSTYRPE 79
Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT 226
C C F P C G C + L ++ T+G + ++ + + T
Sbjct: 80 PCGTALCE--FFPASIRNCSG-----DVCAYEASTQLFEH----TSGKIGTDAVAIGTAT 128
Query: 227 VPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSN 281
+ GC + SD + P+G G R+ SL +Q+ + FS+CL +S
Sbjct: 129 AASVAFGCVMASDIKLMDGGPSGFVGLARTPLSLVAQMNVTAFSHCLAPHDGGGGK-NSR 187
Query: 282 LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVP 341
L L TPF K+ + +Y + L I G + + VP
Sbjct: 188 LFLGAAAKLAGGGKSAAMTTPFVKSSPDDIKSL--YYLINLEGIKAGDEAI-----ITVP 240
Query: 342 GSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKK 401
S V++ + S +F+ +++ + K +G + + +S CF G
Sbjct: 241 QSGRT--VLLQTFSPVSFLVDGVYQDLKKAVTAAVGGPTATPPEQFQSIFDLCFKRGGVS 298
Query: 402 SVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQL 461
P+++L F+G A + +PP NY VG++ +C+ + + G + ILG Q
Sbjct: 299 GA--PDVVLTFQGAAALTVPPTNYLLDVGDDTVCVAIASSARLNSTEVAGMS-ILGGLQQ 355
Query: 462 QNFYLEFDLANDRFGFAKQKCA 483
QN + +DL + F C+
Sbjct: 356 QNVHFLYDLEKETLSFEAADCS 377
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 118/448 (26%), Positives = 175/448 (39%), Gaps = 104/448 (23%)
Query: 63 SLSRARHL-KTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFI 121
S++RA HL ++ P + ++ + + + G Y IS S GTP I
Sbjct: 61 SINRANHLNQSFVSPNSPETTV--------------ISALGEYLISYSVGTP-SLQVFGI 105
Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
DTGS ++W C +C + P F +S + + + C + C + G
Sbjct: 106 LDTGSDIIWLQCQPCKKCYE--------QTTPIFDSSKSQTYKTLPCPSNTCQSVQGTFC 157
Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFPSKT-----VPNFLAGC- 234
SR K C Y + Y G + G L ETL S P + GC
Sbjct: 158 SSR--------KHCL-----YSIHYVDGSQSLGDLSVETLTLGSTNGSPVQFPGTVIGCG 204
Query: 235 ---SILSDRQPAGIAGFGRSSESLPSQLGLK---KFSYCLL--------SRKFDDAPVSS 280
+I + + +GI G GR SL +QL KFSYCL+ F +A V S
Sbjct: 205 RYNAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYCLVPGLSTASSKLNFGNAAVVS 264
Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
G G TP F KN + FY++ L VG ++ +
Sbjct: 265 --------GRGTVSTP-----LFSKNGL-------VFYFLTLEAFSVGRNRIE----FGS 300
Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFE----AVAKEFIRQMGNYSRAADVEKKSGLRPCFD 396
PGS G G +I+DSG+T T + ++ AVAK I Q R D + GL C+
Sbjct: 301 PGSGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQ-----RVRDPNQVLGL--CYK 353
Query: 397 IS-GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
++ K +P + F GA + L N F V ++V+C F G +
Sbjct: 354 VTPDKLDASVPVITAHFS-GADVTLNAINTFVQVADDVVCFA-FQPTETGA--------V 403
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G+ QN + +DL + F C
Sbjct: 404 FGNLAQQNLLVGYDLQMNTVSFKHTDCT 431
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 116/458 (25%), Positives = 170/458 (37%), Gaps = 65/458 (14%)
Query: 39 LSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPL-S 97
LS H +H PL+ + +LA + +R L +K S + P+ S
Sbjct: 24 LSVYHNVHPPSPSPLESIIALARADDARLLFLSSKAA-----------SSGGVTSAPVAS 72
Query: 98 VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--- 154
+ Y + GTP Q DT + W C C C PA
Sbjct: 73 GQTPPSYVVRAGLGTPVQQLL-LALDTSADATWSHCAP---CDTC----------PAGSR 118
Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGL 214
FIP SSS + C + C G + ++ + PL ++ +
Sbjct: 119 FIPASSSSYASLPCASDWCPLFEGQPCPAN------QDASAPLPACAFSKPFADTSFQAS 172
Query: 215 LLSETLRFPSKTVPNFLAGC-----SILSDRQPAGIAGFGRSSESLPSQLGLK---KFSY 266
L S+TLR + + GC ++ G+ G GR SL SQ G FSY
Sbjct: 173 LGSDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSY 232
Query: 267 CLLSRKFDDAPVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQI 325
CL S + S +L L G + P + YTP NP S YYV + +
Sbjct: 233 CLPS--YRSYYFSGSLRL------GAAGQPRNVRYTPLLTNPHRPS-----LYYVNVTGL 279
Query: 326 IVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
VG VK+P G ++DSG+ T P++ A+ +EF RQ+ S
Sbjct: 280 SVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPS---GY 336
Query: 386 EKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAA 444
CF+ + P + L GG + LP EN L CL + A
Sbjct: 337 TSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAM----AE 392
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
P ++ + Q QN + D+A R GFA++ C
Sbjct: 393 APQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 108/395 (27%), Positives = 160/395 (40%), Gaps = 61/395 (15%)
Query: 103 GYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSS 162
GY IS GTPP + DT + +WF C C + P DPS +SS+
Sbjct: 88 GYIISFLIGTPP-FQLYGVMDTANDNIWFQCNPCKPCFNTTSPMFDPS--------KSST 138
Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF 222
+ I C +PKC NVE+ CS +K C G ++ G L +TL
Sbjct: 139 YKTIPCSSPKCK-----NVEN--THCSSDDKK---VCEYSFTYGGEAYSQGDLSIDTLTL 188
Query: 223 PSK-----TVPNFLAGCSILSDRQP-----AGIAGFGRSSESLPSQLGLK---KFSYCLL 269
S + N + GC ++ P +G G GR S SQL KFSYCL+
Sbjct: 189 NSNNDTPISFKNIVIGCG-HRNKGPLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLV 247
Query: 270 SRKFDDAPVSSNLVL-DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
F + +S L D SG G TP +G Y L + VG
Sbjct: 248 PL-FSNEGISGKLHFGDKSVVSG----VGTVSTPITAGEIG--------YSTTLNALSVG 294
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
+K S +D G I+DSG+T T + ++ + + + M RA ++
Sbjct: 295 DHIIKFENS--TSKNDNLGNTIIDSGTTLTILPENVYSRL-ESIVTSMVKLERAKSPNQQ 351
Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPAL 448
+ C+ + K++ +P + F GA + L N F + +EV+C A ++
Sbjct: 352 --FKLCYKAT-LKNLDVPIITAHFN-GADVHLNSLNTFYPIDHEVVCF-------AFVSV 400
Query: 449 GRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G P I+G+ QNF + FDL + F C
Sbjct: 401 GNFPGTIIGNIAQQNFLVGFDLQKNIISFKPTDCT 435
>gi|255552237|ref|XP_002517163.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543798|gb|EEF45326.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 469
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 100/397 (25%), Positives = 162/397 (40%), Gaps = 66/397 (16%)
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
P I D G+ +W C Y SSS + C + C
Sbjct: 89 PLVPVKLIVDLGARFMWVDCEEGYV---------------------SSSYTPVSCDSLLC 127
Query: 174 SWIFGPNVESRCK-----GCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT-- 226
+ C GC N TC + + +++ G G + F KT
Sbjct: 128 KLANSLACATECNSTPKPGC--HNNTCAHSPENPVIRLGTSGQIGQDVVSLQSFNGKTPD 185
Query: 227 ----VPNF--LAGCSILSDRQP---AGIAGFGRSSESLPSQLGL-----KKFSYCLLSRK 272
VPNF + G + L + G+AG G S+ SLP+Q KKF+ CL +
Sbjct: 186 RIVSVPNFPFVCGPTFLLENLADGVTGLAGLGNSNISLPAQFSSAFGFPKKFAVCLSNS- 244
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS--SAFGEF---YYVGLRQIIV 327
SN ++ G G + L+YTP NPV ++ S GE Y++G++ I +
Sbjct: 245 -----TKSNGLIFFGDGPYSNLPNDLTYTPLIHNPVSTAGGSYLGEASVEYFIGVKSIRI 299
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
G K VK + L S+G GG + + +T + +++AV K F+++M
Sbjct: 300 GGKDVKFNKTLLSIDSEGKGGTKISTVDPYTVLHTSIYKAVVKAFVKEMDKKFIPQVQPP 359
Query: 388 KSGLRPCFDI----SGKKSVYLP--ELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
+ CF S + LP +L+L+ +G + N + + V+CL F D
Sbjct: 360 IAPFGACFQSIVIDSNEFGPVLPFIDLVLEGQGSVTWRIWGANSMVKISSLVMCL-GFVD 418
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
P +I++G Q+++ L+FDLA+ + GF+
Sbjct: 419 GGIEPRT----SIVIGGRQIEDNLLQFDLASSKLGFS 451
>gi|147801500|emb|CAN61502.1| hypothetical protein VITISV_011733 [Vitis vinifera]
Length = 415
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 106/389 (27%), Positives = 155/389 (39%), Gaps = 80/389 (20%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
+ D G +W C Y V S P + GC N CS +
Sbjct: 59 LVVDLGGQFLWVDCEQNY---------VSSSYRPGAVQP--------GCNNNTCS-VLPD 100
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSD 239
N +R LA + +Q G G S +V FL C+ S
Sbjct: 101 NTVTRTASSDE------LAEDAVSVQSTDGSNPG---------RSVSVSKFLFSCAPTSL 145
Query: 240 RQ-----PAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRKFDDAPVSSNLVLDTGPG 289
+ G+AG GR+ +LPSQ +KF+ CL S D V+ G G
Sbjct: 146 LEGLASGAKGMAGLGRTRIALPSQFASAFSFHRKFAICLSSSTTADG------VILLGDG 199
Query: 290 S-----GDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIVGSKHVKIPYSYL 339
S + L YTP NPV ++SA + Y++G++ I + K V + S L
Sbjct: 200 SYGLLPNVDASQLLIYTPLILNPVSTASAHSQGEPSAEYFIGVKSIQINEKAVPLNTSLL 259
Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGLRPCFDI 397
S G GG + + + +T ME ++ A K FI N +R A V S CF
Sbjct: 260 SINSKGVGGTKISTVNPYTVMETSIYSAFTKAFISAAASMNITRVAAVAPFS---VCFS- 315
Query: 398 SGKKSVY-------LPELILKFKGGAKM-ALPPENYFALVGNEVLCLILFTDNAAGPALG 449
K+VY +P + L + + + + N V +VLCL F D A P
Sbjct: 316 --SKNVYSTRGGAAVPTIGLVLQNNSVVWRIFGANSMVFVNGDVLCL-GFVDGGANPR-- 370
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFA 478
+I++G +QL++ L+FDLA R GF+
Sbjct: 371 --TSIVIGGYQLEDNLLQFDLAASRLGFS 397
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 103/404 (25%), Positives = 165/404 (40%), Gaps = 65/404 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS ++W C S RC + ++ + + PK SS
Sbjct: 87 GLYYTEIGIGTPTKRYYVQV-DTGSDILWVNCISCDRCPRKSGLGLE---LTLYDPKDSS 142
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
+ + C C+ +G GC T L C Y + YG G T G +S+ L
Sbjct: 143 TGSKVSCDQGFCAATYG----GLLPGC-----TTSLPC-EYSVTYGDGSSTTGYFVSDLL 192
Query: 221 RF----------PSKTVPNFLAGCSILSD-----RQPAGIAGFGRSSESLPSQLGL---- 261
+F P+ + F G D + GI GFG+S+ S+ SQL
Sbjct: 193 QFDQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKV 252
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
K F++CL + N+V P + TP N Y V
Sbjct: 253 KKIFAHCLDTINGGGIFAIGNVV-----------QPKVKTTPLVPNM--------PHYNV 293
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L+ I VG +K+P G G I+DSG+T T++ E V KE + + ++
Sbjct: 294 NLKSIDVGGTALKLPSHMFDTGE--KKGTIIDSGTTLTYLP----EIVYKEIM--LAVFA 345
Query: 381 RAADVEKKSGLR-PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
+ D+ + CF G+ P++ F+ + + P +YF G+ + C + F
Sbjct: 346 KHKDITFHNVQEFLCFQYVGRVDDDFPKITFHFENDLPLNVYPHDYFFENGDNLYC-VGF 404
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ G+G ++LGD L N + +DL N G+ + C+
Sbjct: 405 QNGGLQSKDGKG-MVLLGDLVLSNKLVVYDLENQVIGWTEYNCS 447
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 96/349 (27%), Positives = 145/349 (41%), Gaps = 52/349 (14%)
Query: 154 AFIPKRSSSSQLIGCQNPKC----SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG 209
F P RS S Q + C + KC S +F ++ C + C Y + Y G
Sbjct: 190 VFCPHRSKSFQAVTCASQKCKIDLSQLFSLSL------CPKPSDPCL-----YDISYADG 238
Query: 210 FTA-GLLLSETLRFPSKT-----VPNFLAGCS------ILSDRQPAGIAGFGRSSESLPS 257
+A G ++T+ K + N GC+ + + GI G G + +S
Sbjct: 239 SSAKGFFGTDTITVDLKNGKEGKLNNLTIGCTKSMENGVNFNEDTGGILGLGFAKDSFID 298
Query: 258 QLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG-LSYTPFYKNPVGSSSA 313
+ + KFSYCL+ VSS L + G ++K G + T P
Sbjct: 299 KAAYEYGAKFSYCLVDH-LSHRNVSSYLTIG---GHHNAKLLGEIKRTELILFP------ 348
Query: 314 FGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
FY V + I +G + +KIP V + GG ++DSG+T T + P +E V + I
Sbjct: 349 --PFYGVNVVGISIGGQMLKIPPQ--VWDFNSQGGTLIDSGTTLTALLVPAYEPVFEALI 404
Query: 374 RQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV 433
+ + R E L CFD G +P L+ F GGA+ P ++Y V V
Sbjct: 405 KSLTKVKRVTG-EDFGALDFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPLV 463
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
C+ + P G G A ++G+ QN EFDL+ + GFA C
Sbjct: 464 KCIGIV------PIDGIGGASVIGNIMQQNHLWEFDLSTNTIGFAPSIC 506
>gi|125552105|gb|EAY97814.1| hypothetical protein OsI_19735 [Oryza sativa Indica Group]
Length = 424
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 99/418 (23%), Positives = 149/418 (35%), Gaps = 93/418 (22%)
Query: 95 PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVD------ 148
PL Y S G PPQ + + DTGS LVW C++ C P
Sbjct: 69 PLRWSGKTQYIASYGIGDPPQPAEAVV-DTGSDLVWTQCST------CRLPAAAAAGGGG 121
Query: 149 --PSRIPAFIPKRSSSSQLIGCQNPKCSWI-FGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
P +P + S +++ + C + + P +G + C +A
Sbjct: 122 CFPQNLPYYNFSLSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAA-----S 176
Query: 206 YGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQP------AGIAGFGRSSESLPSQL 259
YG G G+L ++ FPS + GC + P +GI G GR + SL
Sbjct: 177 YGAGVALGVLGTDAFTFPSSSSVTLAFGCVSQTRISPGALTGASGIIGLGRGALSL---- 232
Query: 260 GLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
NP S F FYY
Sbjct: 233 ----------------------------------------------NP--KDSPFSTFYY 244
Query: 320 VGLRQIIVGSKHVKIPYSYL----VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
+ L + G+ V +P GG ++DSGS FT + P A+ KE RQ
Sbjct: 245 LPLVGLAAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQ 304
Query: 376 M-GNYSRAADVEKKSG-LRPCF----DISGKKSVYLPELILKFK----GGAKMALPPENY 425
+ G+ S K G L C D + +P L+L+F GG ++ +P E Y
Sbjct: 305 LRGSGSLVPPPAKLGGALELCVEAGDDGDSLAAAAVPSLVLRFDDGVGGGRELVIPAEKY 364
Query: 426 FALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+A V C+ + + + L I+G+F Q+ + +DLAN F C+
Sbjct: 365 WARVEASTWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 422
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 107/391 (27%), Positives = 159/391 (40%), Gaps = 58/391 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA-FIPKRSSS 162
Y IS+ G+P + + DTGS + W +C C P+ + A F P SS+
Sbjct: 135 YVISVGLGSPAM-TQRVVIDTGSDVSWV------QCEPCPAPSPCHAHAGALFDPAASST 187
Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLR 221
C C+ + G + E+ GC +++ Y+++YG G T G S+ L
Sbjct: 188 YAAFNCSAAACAQL-GDSGEA--NGCDAKSRC------QYIVKYGDGSNTTGTYSSDVLT 238
Query: 222 FP-SKTVPNFLAGCSILS-----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRK 272
S V F GCS D + G+ G G ++SL SQ K FSYCL +
Sbjct: 239 LSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCLPA-- 296
Query: 273 FDDAPVSSN-LVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
P SS L L G + TP + S +Y+ L I VG K
Sbjct: 297 ---TPASSGFLTLGAPASGGGGGASRFATTPMLR-----SKKVPTYYFAALEDIAVGGKK 348
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
+ + S GS +VDSG+ T + + A++ F M Y+RA E L
Sbjct: 349 LGLSPSVFAAGS------LVDSGTVITRLPPAAYAALSSAFRAGMTRYARA---EPLGIL 399
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
CF+ +G V +P + L F GGA + L + +V L D+ A G
Sbjct: 400 DTCFNFTGLDKVSIPTVALVFAGGAVVDL---DAHGIVSGGCLAFAPTRDD---KAFG-- 451
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+G+ Q + F + +D+ FGF C
Sbjct: 452 ---TIGNVQQRTFEVLYDVGGGVFGFRAGAC 479
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 105/410 (25%), Positives = 171/410 (41%), Gaps = 72/410 (17%)
Query: 101 YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
YG Y+ + GTPP+ T I DTGS ++W C + C +C + + F S
Sbjct: 81 YGLYTTKVKMGTPPREFTVQI-DTGSDILWINCNT---CSNCPKSSGLGIELNFFDTVGS 136
Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSET 219
S++ L+ C +P C+ ++ CSP+ C SY QY G T+G+ +S+
Sbjct: 137 STAALVPCSDPMCA----SAIQGAAAQCSPQVNQC-----SYTFQYEDGSGTSGVYVSDA 187
Query: 220 LRFP---SKTVPNFLA-------GCSIL-------SDRQPAGIAGFGRSSESLPSQL--- 259
+ F ++ P +A GCS +D+ GI GFG S+ SQL
Sbjct: 188 MYFDMILGQSTPANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSR 247
Query: 260 GL--KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF 317
G+ K FS+CL D LVL G+ P + Y+P +
Sbjct: 248 GITPKVFSHCLKG----DGNGGGILVL------GEILEPSIVYSPLVPSQ--------PH 289
Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
Y + L+ I V + + I + V + G I+DSG+T +++ ++ + +
Sbjct: 290 YNLNLQSIAVNGQVLSINPA--VFATSDKRGTIIDSGTTLSYLVQEAYDPLVNAVDTAVS 347
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
++ + + K S C+ + P + F+GGA M L P Y G
Sbjct: 348 QFATSF-ISKGS---QCYLVLTSIDDSFPTVSFNFEGGASMDLKPSQYLLNRG------- 396
Query: 438 LFTDNAAGPALG----RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
F D A +G + ILGD L++ + +DLA + G+ C+
Sbjct: 397 -FQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVVYDLARQQIGWTNYDCS 445
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 86/291 (29%), Positives = 126/291 (43%), Gaps = 36/291 (12%)
Query: 202 YLLQYGLG-FTAGLLLSETLRFPSK-TVPNFLAGCSILSDR---QPAGIAGFGRSSESLP 256
Y +QYG G +T G +TL S + F GC ++ + AG+ G GR SLP
Sbjct: 23 YGVQYGDGSYTIGFFAMDTLTLSSHDAIKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLP 82
Query: 257 SQLGLKK---FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSA 313
Q K F++C +R S L+ GPGS + + LS TP + G +
Sbjct: 83 VQTYDKYGGVFAHCFPARS------SGTGYLEFGPGSSPAVSAKLSTTPMLID-TGPT-- 133
Query: 314 FGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
FYYVG+ I VG K + IP S G IVDSG+ T + + ++ F
Sbjct: 134 ---FYYVGMTGIRVGGKLLPIPQSVFAAA-----GTIVDSGTVITRLPPAAYSSLRSAFA 185
Query: 374 RQMG--NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
M Y RA + S L C+D++G V +P + L F+GG + +
Sbjct: 186 ASMAARGYKRAPAL---SLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASGIIYAASV 242
Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
CL + AA I+G+ QL+ F + +D+A+ GF C
Sbjct: 243 SQACLGFAGNEAA------DDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287
>gi|24796804|gb|AAN64480.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 161
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/140 (37%), Positives = 80/140 (57%), Gaps = 3/140 (2%)
Query: 197 LACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSD-RQPAGIAGFGRSSESL 255
LA + + Y G T LL+S+TLR P +T+ NF+ GCS++S +Q +G+ GF S+
Sbjct: 3 LAADAIGVVYSSGSTTRLLISDTLRTPGRTIRNFVVGCSLMSVYQQSSGLTGFSCGVPSV 62
Query: 256 PSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFG 315
PSQLGL KF Y LL+R+FDD +S+ ++ G G D + Y P ++ +
Sbjct: 63 PSQLGLTKFFYFLLARRFDDNATASDELILGGAGGKDDNVR-MQYIPLARS-ASTRPLCS 120
Query: 316 EFYYVGLRQIIVGSKHVKIP 335
+YY+ L I V K V++P
Sbjct: 121 VYYYLALIAITVRRKSVQLP 140
>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 440
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 108/390 (27%), Positives = 158/390 (40%), Gaps = 57/390 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + GTP Q + DT + + PC+ C D F PK S+
Sbjct: 98 GNYVVRVKLGTPGQLLF-MVLDTSTDEAFVPCSGCTGCSDTTFS-----------PKAST 145
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
S + C P+C + G + + G N+ SY G F+A L+ ++LR
Sbjct: 146 SYGPLDCSVPQCGQVRGLSCPATGTGACSFNQ-------SYA---GSSFSA-TLVQDSLR 194
Query: 222 FPSKTVPNFLAGC--SILSDRQPA------GIAGFGRSSESLPSQLGLKKFSYCLLSRKF 273
+ +PN+ GC +I PA G S+S + G+ FSYCL S F
Sbjct: 195 LATDVIPNYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGI--FSYCLPS--F 250
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
S +L L GP G K+ + TP ++P S YYV I VG V
Sbjct: 251 KSYYFSGSLKL--GP-VGQPKS--IRTTPLLRSPHRPS-----LYYVNFTGISVGRVLVP 300
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
P YL + G I+DSG+ T P++ AV +EF +Q+G
Sbjct: 301 FPSEYLGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFRKQVG----GTTFTSIGAFDT 356
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTDNAAGPALGRGP 452
CF P + L F+ G + LP EN + CL + AA P
Sbjct: 357 CF--VKTYETLAPPITLHFE-GLDLKLPLENSLIHSSAGSLACLAM----AAAPDNVNSV 409
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++ +FQ QN + FD N++ G A++ C
Sbjct: 410 LNVIANFQQQNLRILFDTVNNKVGIAREVC 439
>gi|356576537|ref|XP_003556387.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 438
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 106/404 (26%), Positives = 175/404 (43%), Gaps = 79/404 (19%)
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
P + D G +W C Y SS+S+ C + +C
Sbjct: 55 PLVAVKLTVDLGGGYLWVNCEKGYV---------------------SSTSRPARCGSAQC 93
Query: 174 SW--IFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETL-RFPSK--TVP 228
S ++G + E + G SP N ++ YG + ++ T P+K +VP
Sbjct: 94 SLFGLYGCSTEDKICGRSPSNTVTGVS------TYGDIHADVVAVNSTDGNNPTKVVSVP 147
Query: 229 NFL--AGCSILSDRQPAGI---AGFGRSSESLPSQLG-----LKKFSYCLLSRKFDDAPV 278
FL G +++ +G+ AG GR+ SLPSQ +KF+ CL S +
Sbjct: 148 KFLFICGSNVVQKGLASGVTGMAGLGRTKVSLPSQFASAFSFHRKFAICLSSSTMTNGV- 206
Query: 279 SSNLVLDTGP------GSGDSKTPGLSYTPFYKNPVGSSSAF--GE---FYYVGLRQIIV 327
+ GP S SK L++TP NPV ++ ++ GE Y++G++ I V
Sbjct: 207 ---MFFGDGPYNFGYLNSDLSKV--LTFTPLISNPVSTAPSYFQGEPSVEYFIGVKSIKV 261
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEK 387
K+V + + L +G GG + + + +T ME +++AV++ F++++G A V
Sbjct: 262 SDKNVALNTTLLSIDRNGIGGTKISTVNPYTVMETTIYKAVSEVFVKEVG----APTVAP 317
Query: 388 KSGLRPCF---DI-SGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNA 443
+ CF DI S + +P + L + + N V N+V+CL F D
Sbjct: 318 VAPFGTCFATKDIGSTRMGPAVPGIDLVLQNDVVWTIIGANSMVYV-NDVICL-GFVDAG 375
Query: 444 AGPAL--------GRGP--AIILGDFQLQNFYLEFDLANDRFGF 477
+ P++ G P +I +G QL+N L+FDLA R GF
Sbjct: 376 SSPSVAQVGFVAGGSHPRTSITIGAHQLENNLLQFDLATSRLGF 419
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 102/402 (25%), Positives = 164/402 (40%), Gaps = 65/402 (16%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + GTP + + DTGS ++W C S RC + ++ + + PK SS+
Sbjct: 4 YYTEIGIGTPTKRYYVQV-DTGSDILWVNCISCDRCPRKSGLGLE---LTLYDPKDSSTG 59
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
+ C C+ +G GC T L C Y + YG G T G +S+ L+F
Sbjct: 60 SKVSCDQGFCAATYG----GLLPGC-----TTSLPC-EYSVTYGDGSSTTGYFVSDLLQF 109
Query: 223 ----------PSKTVPNFLAGCSILSD-----RQPAGIAGFGRSSESLPSQLGL-----K 262
P+ + F G D + GI GFG+S+ S+ SQL K
Sbjct: 110 DQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKK 169
Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
F++CL + N+V P + TP N Y V L
Sbjct: 170 IFAHCLDTINGGGIFAIGNVV-----------QPKVKTTPLVPNM--------PHYNVNL 210
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
+ I VG +K+P G G I+DSG+T T++ E V KE + + +++
Sbjct: 211 KSIDVGGTALKLPSHMFDTGE--KKGTIIDSGTTLTYLP----EIVYKEIM--LAVFAKH 262
Query: 383 ADVEKKSGLR-PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
D+ + CF G+ P++ F+ + + P +YF G+ + C + F +
Sbjct: 263 KDITFHNVQEFLCFQYVGRVDDDFPKITFHFENDLPLNVYPHDYFFENGDNLYC-VGFQN 321
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G+G ++LGD L N + +DL N G+ + C+
Sbjct: 322 GGLQSKDGKG-MVLLGDLVLSNKLVVYDLENQVIGWTEYNCS 362
>gi|225436984|ref|XP_002272235.1| PREDICTED: basic 7S globulin [Vitis vinifera]
Length = 436
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 103/400 (25%), Positives = 162/400 (40%), Gaps = 70/400 (17%)
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
P + D G+ +W C Y ++ P R S+Q + C
Sbjct: 53 PLVPVKLVVDLGAQFLWVDCEQNYVS-------------SSYRPARCRSAQCSLARANGC 99
Query: 174 SWIFG---PNVESRCKGCSPRNKTCPLACPSYL------LQYGLGFTAGLLLSETLRFPS 224
F P + G P N A L +Q G G ++S + +F
Sbjct: 100 GDCFSAPRPGCNNNTCGVLPDNTVTRTATSGELAEDFVSVQSTDGSNPGRVVSVS-KFLF 158
Query: 225 KTVPNFL-AGCSILSDRQPAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRKFDDAPV 278
P FL G + G+AG GR+ + PSQ +KF+ CL S
Sbjct: 159 SCAPTFLLEGLA----SSAMGMAGLGRTRIAFPSQFASAFSFHRKFATCLSSS------T 208
Query: 279 SSNLVLDTGPGS-----GDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIVG 328
++N V+ G G + L YTP Y NPV ++SA+ + Y++ ++ I +
Sbjct: 209 TANGVVFFGDGPYRLLPNIDASQSLIYTPLYINPVSTASAYTQGEPSAEYFIRVKSIRIN 268
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVE 386
K + + S L S+G GG + + + +T ME +++A K FI N +R A V
Sbjct: 269 EKAISLNTSLLSIDSEGVGGTKISTVNPYTVMETSIYKAFTKAFISAAAAINITRVAAVA 328
Query: 387 KKSGLRPCFDISGKKSVY-------LPELILKFKGGAKM-ALPPENYFALVGNEVLCLIL 438
CF K+VY +P + L + + + N V ++VLCL
Sbjct: 329 P---FNVCFS---SKNVYSTRVGPSVPSIDLVLQNESVFWRIFGANSMVYVSDDVLCL-G 381
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
F D A P +I++G +QL++ L+FDLA R GF+
Sbjct: 382 FVDGGANPR----TSIVIGGYQLEDNLLQFDLATSRLGFS 417
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 104/404 (25%), Positives = 164/404 (40%), Gaps = 64/404 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + S DTGS ++W C +C C + + + S
Sbjct: 78 GLYYAKIGIGTPAK-SYYVQVDTGSDIMWVNCI---QCKQCPRRSTLGIELTLYNIDESD 133
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
S +L+ C + C I G + S CK N +CP YL YG G TAG + + +
Sbjct: 134 SGKLVSCDDDFCYQISGGPL-SGCKA----NMSCP-----YLEIYGDGSSTAGYFVKDVV 183
Query: 221 RFPS-----KTVP---NFLAGC------SILSDRQPA--GIAGFGRSSESLPSQLG---- 260
++ S KT + + GC + S + A GI GFG+++ S+ SQL
Sbjct: 184 QYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGR 243
Query: 261 -LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
K F++CL R +V P ++ TP N Y
Sbjct: 244 VKKIFAHCLDGRNGGGIFAIGRVV-----------QPKVNMTPLVPNQ--------PHYN 284
Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
V + + VG + + IP PG G I+DSG+T ++ ++E + K+ Q
Sbjct: 285 VNMTAVQVGQEFLTIPADLFQPGD--RKGAIIDSGTTLAYLPEIIYEPLVKKITSQ---- 338
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
A V CF SG+ P + F+ + + P +Y L +E + I +
Sbjct: 339 EPALKVHIVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHDY--LFPHEGMWCIGW 396
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N+A + R +LGD L N + +DL N G+ + C+
Sbjct: 397 -QNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCS 439
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 150/361 (41%), Gaps = 64/361 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTPP I DTGS ++W C S C C + ++ F P SS
Sbjct: 23 GLYYTKVQLGTPPVEFNVQI-DTGSDVLWVSCNS---CSGCPQTSGLQIQLNFFDPGSSS 78
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
+S +I C + +C+ ++S CS +N C SY QYG G T+G +S+ +
Sbjct: 79 TSSMIACSDQRCN----NGIQSSDATCSSQNNQC-----SYTFQYGDGSGTSGYYVSDMM 129
Query: 221 R----FPSKTVPNFLA----GCS-------ILSDRQPAGIAGFGRSSESLPSQL---GL- 261
F N A GCS SDR GI GFG+ S+ SQL G+
Sbjct: 130 HLNTIFEGSVTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIA 189
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYY 319
+ FS+CL D+ LVL G+ P + YT P Y
Sbjct: 190 PRVFSHCLKG----DSSGGGILVL------GEIVEPNIVYTSLVPAQP---------HYN 230
Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
+ L+ I V + ++I S V + + G IVDSG+T ++ ++ +
Sbjct: 231 LNLQSIAVNGQTLQIDSS--VFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASIPQS 288
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF----ALVGNEVLC 435
A S C+ I+ + P++ L F GGA M L P++Y ++ G V C
Sbjct: 289 VHTA----VSRGNQCYLITSSVTEVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWC 344
Query: 436 L 436
+
Sbjct: 345 I 345
>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
Length = 371
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 72/248 (29%), Positives = 108/248 (43%), Gaps = 33/248 (13%)
Query: 242 PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYT 301
P+G G GR+ SL +Q+ L +FSYCL D +S L L G+ G ++T
Sbjct: 148 PSGFIGLGRTPWSLVAQMKLTRFSYCLAPH---DTGKNSRLFL----GASAKLAGGGAWT 200
Query: 302 PFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFME 361
PF K + ++Y + L +I G + +P G V+V + +
Sbjct: 201 PFVKT--SPNDGMSQYYPIELEEIKAGDATITMPR--------GRNTVLVQTAVVRVSL- 249
Query: 362 GPLFEAVAKEFIRQMGNYSRAADVEKKSG--LRPCFDISGKKSVYLPELILKFKGGAKMA 419
L ++V +EF + + AA G CF +G P+L+ F+ GA +
Sbjct: 250 --LVDSVYQEFKKAVMASVGAAPTATPVGAPFEVCFPKAGVSGA--PDLVFTFQAGAALT 305
Query: 420 LPPENYFALVGNEVLCL----ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRF 475
+PP NY VGN+ +CL I + A L ILG FQ +N +L FDL D
Sbjct: 306 VPPANYLFDVGNDTVCLSVMSIALLNITALDGLN-----ILGSFQQENVHLLFDLDKDML 360
Query: 476 GFAKQKCA 483
F C+
Sbjct: 361 SFEPADCS 368
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 114/405 (28%), Positives = 162/405 (40%), Gaps = 62/405 (15%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVD 148
I + LS+ S G Y + G P Q S DTGS + W PC+S Y VD
Sbjct: 1 ISSGLSLGS-GEYFARMGIGNP-QRSYYLELDTGSDVTWIQCAPCSSCYSQVD------- 51
Query: 149 PSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG- 207
P + P SSS + + C + C + S C+G + C SY + YG
Sbjct: 52 ----PIYDPSNSSSYRRVYCGSALCQAL----DYSACQG---------MGC-SYRVVYGD 93
Query: 208 LGFTAGLLLSETLRF---PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGL 261
++G L E+ S + N GC + R AG+ G G + S SQ+
Sbjct: 94 SSASSGDLGIESFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAA 153
Query: 262 K---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEF 317
FSYCL+ R SS L+ G + P +TP KNP F
Sbjct: 154 SIGPAFSYCLVDRYSQLQSRSSPLIF------GRTAIPFAARFTPLLKNP-----RINTF 202
Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
YY L I VG + IP + +G GG I+DSG++ T + P + + +
Sbjct: 203 YYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGTSVTRVVPPAYAVLRDAYRAASR 262
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
N A V L CF+ G +V +P L+L F G M LP N V +
Sbjct: 263 NLPPAPGVYL---LDTCFNFQGLPTVQIPSLVLHFDNGVDMVLPGGNILIPVDRSGTFCL 319
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
F ++ P ++G+ Q Q F + FDL A ++C
Sbjct: 320 AFAPSSM-------PISVIGNVQQQTFRIGFDLQRSLIAIAPREC 357
>gi|125572774|gb|EAZ14289.1| hypothetical protein OsJ_04213 [Oryza sativa Japonica Group]
Length = 492
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/342 (27%), Positives = 138/342 (40%), Gaps = 51/342 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +S S GTPPQ T + D S VW C++ C C + P F
Sbjct: 95 GMYVLSFSVGTPPQVVTG-VLDITSDFVWMQCSA---CATCGADAPAATSAPPF------ 144
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF---TAGLLLSE 218
+ F ++R +P C Y YG G TAGLL +
Sbjct: 145 -------------YAFLSFHDTR----APTTPPC-----GYSYVYGGGAANTTAGLLAVD 182
Query: 219 TLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
F + + GC++ ++ G+ G GR S SQL + +FSY L DDA
Sbjct: 183 AFAFATVRADGVIFGCAVATEGDIGGVIGLGRGELSPVSQLQIGRFSYYLAP---DDAVD 239
Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSY 338
+ +L D P S P+ +S A YYV L I V + + IP
Sbjct: 240 VGSFILFL-----DDAKPRTSRA--VSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGT 292
Query: 339 LVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS 398
+DG+GGV++ TF++ ++ V + ++ RAAD + GL C+
Sbjct: 293 FDLQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKI--ELRAAD-GSELGLDLCYTSE 349
Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFAL---VGNEVLCLI 437
+ +P + L F GGA M L NYF + G E L ++
Sbjct: 350 SLATAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTIL 391
>gi|357440767|ref|XP_003590661.1| Basic 7S globulin [Medicago truncatula]
gi|355479709|gb|AES60912.1| Basic 7S globulin [Medicago truncatula]
Length = 500
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 117/466 (25%), Positives = 182/466 (39%), Gaps = 103/466 (22%)
Query: 56 LHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSL-IKTPLSVHSYGGYSISLSFGTPP 114
LH L+ ++ S +HL + P TKDS Y + +TPL
Sbjct: 21 LHQLSLNNHSDPKHLFS---PVTKDSATTLQYIAQINQRTPL------------------ 59
Query: 115 QASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS 174
+ D G +W C + Y + P R S+Q ++ C
Sbjct: 60 -VPLNLVVDLGGKFLWVDCENHYTS-------------STYRPVRCPSAQCSLAKSDSC- 104
Query: 175 WIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT-------- 226
G S GC N TC L P + + T G L + L S +
Sbjct: 105 ---GDCFSSPKPGC---NNTCGLI-PDNTITHSA--TRGDLAEDVLSIQSTSGFNTGQNV 155
Query: 227 -VPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRK--- 272
V FL C+ S + +G+AG GR+ +LPSQL +KF++C S
Sbjct: 156 VVSRFLFSCAPTSLLRGLAGGASGMAGLGRTKIALPSQLASAFIFKRKFAFCFSSSDGVI 215
Query: 273 -FDDAPVS--------SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF--GEF---Y 318
F D P S N+V D+ L+YTP N V ++SAF GE Y
Sbjct: 216 IFGDGPYSFLADNPSLPNVVFDS---------KSLTYTPLLINHVSTASAFLQGESSVEY 266
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
++G++ I + K V + S L + G GG + + +T +E +++AV F++ +
Sbjct: 267 FIGVKTIKIDGKVVSLNSSLLSIDNKGVGGTKISTVDPYTVLEASIYKAVTDAFVK--AS 324
Query: 379 YSRAADVEKKS-GLRPCFDISGKKSVYL----PELILKFKGGAKMALPPENYFALVGNEV 433
+R E S C+ L P + L + ++ N + +EV
Sbjct: 325 VARNITTEDSSPPFEFCYSFDNLPGTPLGASVPTIELLLQNNVIWSMFGANSMVNINDEV 384
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
LCL + +I++G +QL+N L+FDLA R GF+
Sbjct: 385 LCL-----GFVNGGVNLRTSIVIGGYQLENNLLQFDLAASRLGFSN 425
>gi|388509650|gb|AFK42891.1| unknown [Lotus japonicus]
Length = 347
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 90/288 (31%), Positives = 136/288 (47%), Gaps = 52/288 (18%)
Query: 226 TVPNFL--AGCSILSD---RQPAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRK--- 272
+VPNFL G ++ + + G+AG GR+ SLPSQ +KF+ CL +
Sbjct: 57 SVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTANSGAD 116
Query: 273 ----FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS-SAF-GE---FYYVGLR 323
F D P NL D SK L+YTP NPV ++ SAF GE Y++G++
Sbjct: 117 GVMFFGDGPY--NLNQDV------SKV--LTYTPLITNPVSTAPSAFLGEPSVEYFIGVK 166
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
I V K+V + + L +G GG + + + +T ME +++AVA F++ +G A
Sbjct: 167 SIKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLG----AP 222
Query: 384 DVEKKSGLRPCF---DIS-GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
V + CF DIS + +P + L + G + + N ++V+CL F
Sbjct: 223 TVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQF-DDVICL-GF 280
Query: 440 TDNAAGPAL--------GRGP--AIILGDFQLQNFYLEFDLANDRFGF 477
D + P G P +I +G QL+N L+FDLA R GF
Sbjct: 281 VDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 328
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 104/406 (25%), Positives = 166/406 (40%), Gaps = 71/406 (17%)
Query: 104 YSISLSFGTPPQASTPF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
Y + GTPP+ PF DTGS ++W C S +C + +D + + PK SS
Sbjct: 87 YYTKIEIGTPPK---PFHVQVDTGSDILWVNCVSCDKCPTKSGLGID---LALYDPKGSS 140
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
S + C N C+ +G + GC+ K C Y +YG G TAG +S++L
Sbjct: 141 SGSAVSCDNKFCAATYGSG--EKLPGCTA-GKPC-----EYRAEYGDGSSTAGSFVSDSL 192
Query: 221 RFPS--------KTVPNFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLG----- 260
++ N + GC +++ GI GFG+S+ S SQL
Sbjct: 193 QYNQLSGNAQTRHAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEV 252
Query: 261 LKKFSYCLLSRKFDDAPVSSNLVLDTGPGS---GDSKTPGLSYTPFYKNPVGSSSAFGEF 317
K FS+CL + K G G G+ P + TP N
Sbjct: 253 KKIFSHCLDTIK--------------GGGIFAIGEVVQPKVKSTPLLPNM--------SH 290
Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
Y V L+ I V +++P ++ S+ G I+DSG+T T++ E V K+ + +
Sbjct: 291 YNVNLQSIDVAGNALQLP-PHIFETSEKRG-TIIDSGTTLTYLP----ELVYKDILAAVF 344
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
+ G CF+ S P++ F+ + + P +YF G+ + CL
Sbjct: 345 QKHQDITFRTIQGFL-CFEYSESVDDGFPKITFHFEDDLGLNVYPHDYFFQNGDNLYCL- 402
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
F + P + ++LGD L N + +DL G+ C+
Sbjct: 403 GFQNGGFQPKDAK-DMVLLGDLVLSNKVVVYDLEKQVIGWTDYNCS 447
>gi|388508700|gb|AFK42416.1| unknown [Lotus japonicus]
Length = 440
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 85/288 (29%), Positives = 134/288 (46%), Gaps = 52/288 (18%)
Query: 226 TVPNFL--AGCSILSD---RQPAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRK--- 272
+VPNFL G ++ + + G+AG GR+ SLPSQ +KF+ CL +
Sbjct: 150 SVPNFLFICGSKVVQNGLAKGVTGMAGLGRTRVSLPSQFSSAFSFHRKFAICLTANSGAD 209
Query: 273 ----FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS-SAF-GE---FYYVGLR 323
F D P + N + L+YTP NPV ++ SAF GE Y++G++
Sbjct: 210 GVMFFGDGPYNLN----------QDVSKVLTYTPLITNPVSTAPSAFLGEPSVEYFIGVK 259
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
+ V K+V + + L +G GG + + + +T ME +++AVA F++ +G A
Sbjct: 260 SVKVSEKNVPLNTTLLSINKNGVGGTKISTVNPYTVMETTIYKAVADAFVKSLG----AP 315
Query: 384 DVEKKSGLRPCF---DIS-GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
V + CF DIS + +P + L + G + + N ++V+CL F
Sbjct: 316 TVSPVAPFGTCFATKDISFSRIGPGVPAIDLVLQNGVEWPIIGANSMVQF-DDVICL-GF 373
Query: 440 TDNAAGPAL--------GRGP--AIILGDFQLQNFYLEFDLANDRFGF 477
D + P G P +I +G QL+N L+FDLA R GF
Sbjct: 374 VDAGSNPKASQVGFVNGGSHPVTSITIGAHQLENNLLKFDLAASRLGF 421
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 103/413 (24%), Positives = 170/413 (41%), Gaps = 67/413 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTPP+ + DTGS ++W C S +C + +D + + PK SS
Sbjct: 85 GLYFTEIKLGTPPKRYYVQV-DTGSDILWVNCISCSKCPRKSGLGLD---LTFYDPKASS 140
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
S + C C+ +G + GC+ N C Y + YG G T G +++ L
Sbjct: 141 SGSTVSCDQGFCAATYG----GKLPGCTA-NVPC-----EYSVMYGDGSSTTGFFITDAL 190
Query: 221 RFPS-----KTVP---NFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL---- 261
+F +T P GC S++ GI GFG+++ S+ SQL
Sbjct: 191 QFDQVTGDGQTQPGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKA 250
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF------ 314
K F++CL + K N+V P + F+ + + + F
Sbjct: 251 KKIFAHCLDTIKGGGIFAIGNVV-----------QPKCYFVFFFAHGLLNIPLFLLVMIL 299
Query: 315 --GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF 372
Y V L+ I VG +++P G G I+DSG+T T++ +F+ V
Sbjct: 300 LSRPHYNVNLKSIDVGGTTLQLPAHVFETGE--KKGTIIDSGTTLTYLPELVFKQVMDVV 357
Query: 373 IRQMGNYSRAADVEKKSGLRP--CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVG 430
+S+ D+ + L+ CF SG P + F+ + + P YF G
Sbjct: 358 ------FSKHRDIAFHN-LQDFLCFQYSGSVDDGFPTITFHFEDDLALHVYPHEYFFPNG 410
Query: 431 NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N++ C + F + A G+ +++GD L N + +DL N G+ C+
Sbjct: 411 NDIYC-VGFQNGALQSKDGK-DIVLMGDLVLSNKLVVYDLENQVIGWTDYNCS 461
>gi|383143511|gb|AFG53183.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
Length = 135
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 79/152 (51%), Gaps = 20/152 (13%)
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPFYKNPVGSSSAFGEFYYVGL 322
YCL D SS +V+ G+ PG L+YTP NP+ + FYY+GL
Sbjct: 1 YCL-----DYVNNSSKIVV------GNKAVPGDISLTYTPLIINPI-----YPFFYYLGL 44
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
+ +G K + +P++ S GNGG I+DSG++FT ++ +A EF Q+G Y R
Sbjct: 45 EAVSIGRKRMNLPFNSATFDSKGNGGTIIDSGTSFTIFPEAMYSQIAGEFASQIG-YKRV 103
Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
E +GL C+++SG ++ P+ FKG
Sbjct: 104 PGAESTTGLGLCYNVSGVENTQFPQFAFHFKG 135
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 110/409 (26%), Positives = 161/409 (39%), Gaps = 87/409 (21%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAF--IPKR 159
G Y IS S G PP I DTGS ++W C +C + DPS+ + +P
Sbjct: 84 GEYLISYSVGIPP-FQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKILPFS 142
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
S++ Q + ++ CS S K C Y + YG G ++ G L E
Sbjct: 143 STTCQSV--EDTSCS--------------SDNRKMCE-----YTIYYGDGSYSQGDLSVE 181
Query: 219 TLRFPSKTVPNF-----LAGC----SILSDRQPAGIAGFGRSSESLPSQLGL------KK 263
TL S + + GC ++ + + +GI G G SL +QL +K
Sbjct: 182 TLTLGSTNGSSVKFRRTVIGCGRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRK 241
Query: 264 FSYCLLSR-------KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE 316
FSYCL S F DA V S G G TP +++ P
Sbjct: 242 FSYCLASMSNISSKLNFGDAAVVS--------GDGTVSTPIVTHDP------------KV 281
Query: 317 FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
FYY+ L VG+ ++ S G GN +I+DSG+T T + ++ + + + +
Sbjct: 282 FYYLTLEAFSVGNNRIEFTSSSFRFGEKGN--IIIDSGTTLTLLPNDIYSKL-ESAVADL 338
Query: 377 GNYSRAADVEKKSGL--RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL 434
R D K+ L R FD + P ++ F GA + L N F V V
Sbjct: 339 VELDRVKDPLKQLSLCYRSTFD-----ELNAPVIMAHF-SGADVKLNAVNTFIEVEQGVT 392
Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
CL F + GP I G+ QNF + +DL F C+
Sbjct: 393 CL-AFISSKIGP--------IFGNMAQQNFLVGYDLQKKIVSFKPTDCS 432
>gi|297843130|ref|XP_002889446.1| EDGP precursor [Arabidopsis lyrata subsp. lyrata]
gi|297335288|gb|EFH65705.1| EDGP precursor [Arabidopsis lyrata subsp. lyrata]
Length = 433
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 76/255 (29%), Positives = 121/255 (47%), Gaps = 33/255 (12%)
Query: 243 AGIAGFGRSSESLPSQLGL-----KKFSYCLLSRK----FDDAPVSSNLVLDTGPGSGDS 293
G+AG GR + LPSQ +KF+ CL S + F + P + L PG
Sbjct: 174 VGMAGMGRHNIGLPSQFAAAFSFNRKFAVCLTSGRGVAFFGNGPY---VFL---PGI--- 224
Query: 294 KTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIVGSKHVKI-PYSYLVPGSDGNG 347
+ GL TP NPV ++SAF + Y++G+ I + K V I P + S G G
Sbjct: 225 QISGLQTTPLLINPVSTASAFSQGEKSSEYFIGVTAIKIVEKTVPINPTLLKINASTGFG 284
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGLRPCFDISGKKSVY- 404
G + S + +T +E ++ A EF++Q N +R A V+ S ++ + Y
Sbjct: 285 GTKISSVNPYTVLESSIYNAFTSEFVKQAAARNITRVASVKPFSACFSTKNVGVTRLGYA 344
Query: 405 LPELILKFKGG-AKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
+PE+ L + N V ++V+CL F D ++++G FQL++
Sbjct: 345 VPEIQLVLHSNDVVWRIFGANSMVSVSDDVICL-GFVDGGVNAR----TSVVIGGFQLED 399
Query: 464 FYLEFDLANDRFGFA 478
+EFDLA++RFGF+
Sbjct: 400 NLIEFDLASNRFGFS 414
>gi|383143501|gb|AFG53178.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143503|gb|AFG53179.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143507|gb|AFG53181.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143509|gb|AFG53182.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143517|gb|AFG53186.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143519|gb|AFG53187.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
Length = 135
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 79/152 (51%), Gaps = 20/152 (13%)
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPFYKNPVGSSSAFGEFYYVGL 322
YCL D SS +V+ G+ PG L+YTP NP+ + FYY+GL
Sbjct: 1 YCL-----DYVNNSSKIVV------GNKAVPGDISLTYTPLIINPI-----YPFFYYLGL 44
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
+ +G K + +P++ S GNGG I+DSG++FT ++ +A EF Q+G Y R
Sbjct: 45 EAVSIGRKRLNLPFNSATFDSKGNGGTIIDSGTSFTIFPEAMYSQIAGEFASQIG-YKRV 103
Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
E +GL C+++SG ++ P+ FKG
Sbjct: 104 PGAESTTGLGLCYNVSGVENTQFPQFAFHFKG 135
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 87/296 (29%), Positives = 121/296 (40%), Gaps = 65/296 (21%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + L+ GTPP+ DTGS LVW C C D P +DP+ SS+
Sbjct: 86 YLVHLAVGTPPR-PVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAA--------SSTY 136
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
+ C P+C + + C G ++C Y+ YG T G + ++ F
Sbjct: 137 AALPCGAPRCRAL----PFTSCGG-----RSC-----VYVYHYGDKSVTVGKIATDRFTF 182
Query: 223 PSKTVPN----------FLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLKKFSYCL 268
N GC + GIAGFGR SLPSQL FSYC
Sbjct: 183 GDNGRRNGDGSLPATRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNATSFSYCF 242
Query: 269 LSRKFDDAPVSSNLVLDTGPG-------SGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
S FD SS + L P SG+ +T TP +KNP S Y++
Sbjct: 243 TS-MFDSK--SSIVTLGGAPAALYSHAHSGEVRT-----TPLFKNPSQPS-----LYFLS 289
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
L+ I VG + +P + I+DSG++ T + ++EAV EF Q+G
Sbjct: 290 LKGISVGKTRLPVPETKFR-------STIIDSGASITTLPEEVYEAVKAEFAAQVG 338
>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 409
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 76/248 (30%), Positives = 114/248 (45%), Gaps = 23/248 (9%)
Query: 211 TAGLLLSETLRFPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKKFSYC 267
T+G L ++T F + VP + GCS S +G+ G GR + SL SQL KFSY
Sbjct: 129 TSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQ 188
Query: 268 LLS-RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
LL+ DD S + GD P P+ SS+ + +FYYV L +
Sbjct: 189 LLAPEATDDGSADSVIRF------GDDAVPKTKRG--RSTPLLSSTLYPDFYYVNLTGVR 240
Query: 327 V-GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG----NYSR 381
V G++ IP ++G GGVI+ S + T++E ++ V ++G N S
Sbjct: 241 VDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSA 300
Query: 382 AADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
A +++ C++ S V +P+L L F GGA M L NYF + + L +
Sbjct: 301 ALELDL------CYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLP 354
Query: 442 NAAGPALG 449
+ G LG
Sbjct: 355 SQGGSVLG 362
>gi|413937238|gb|AFW71789.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 598
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 78/249 (31%), Positives = 112/249 (44%), Gaps = 35/249 (14%)
Query: 242 PAGIAGFGRSSESLPSQ----LGLKKFSYCLLSRKFDDAPVSSNL--VLDTGPGSGDSKT 295
P G+ GFG S PSQ G FSYCL S K SSN L GP +
Sbjct: 375 PQGLVGFGCGPLSFPSQNKDVYGFV-FSYCLPSYK------SSNFSSTLRLGPAGQPKR- 426
Query: 296 PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGS 355
+ TP NP S YYV + I VG + + +P S L G IVD+G+
Sbjct: 427 --IKMTPLLSNPHRPS-----LYYVNMVGIHVGGRPMLVPASALAFDPASGRGTIVDAGT 479
Query: 356 TFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGG 415
FT + P++ AV F ++ RA G C+++ ++ +P + F G
Sbjct: 480 MFTRLSAPVYAAVRDVFRSRV----RAPVTGPLGGFDTCYNV----TISVPTVTFSFDGR 531
Query: 416 AKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLAND 473
+ LP EN ++ + CL + AAGP+ G + +L Q QN + FD+AN
Sbjct: 532 VSVTLPEENVVIRSSSDGIACLAM----AAGPSDGVDAVLNVLASMQQQNHRVLFDVANG 587
Query: 474 RFGFAKQKC 482
R GF+++ C
Sbjct: 588 RVGFSRELC 596
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 110/400 (27%), Positives = 173/400 (43%), Gaps = 72/400 (18%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y ++L GTPP I DTGS L+W C+ C P P F P +SS
Sbjct: 90 GEYLMTLYIGTPPVERLA-IADTGSDLIWVQCSPCQNCF--------PQDTPLFEPLKSS 140
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ + C + C+ + P + +C + C Y YG FT G++ +ETL
Sbjct: 141 TFKAATCDSQPCTSV--PPSQRQCG----KVGQC-----IYSYSYGDKSFTVGVVGTETL 189
Query: 221 RFPS----KTV--PNFLAGCSILSD------RQPAGIAGFGRSSESLPSQLGLK---KFS 265
F S +TV P+ + GC + ++ + G+ G G SL SQLG + KFS
Sbjct: 190 SFGSTGDAQTVSFPSSIFGCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGYKFS 249
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGS-GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQ 324
YCLL P SSN GS T G+ TP P+ F FY++ L
Sbjct: 250 YCLL-------PFSSNSTSKLKFGSEAIVTTNGVVSTPLIIKPL-----FPSFYFLNLEA 297
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
+ +G K +VP +G +I+DSG+ T++E + ++++ + A D
Sbjct: 298 VTIGQK--------VVPTGRTDGNIIIDSGTVLTYLEQTFYNNFVAS-LQEVLSVESAQD 348
Query: 385 VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF-ALVGNEVLCLILFTDNA 443
+ + CF + + +P + +F GA +AL P+N L +LCL + +
Sbjct: 349 LPFP--FKFCFPY---RDMTIPVIAFQFT-GASVALQPKNLLIKLQDRNMLCLAVVPSSL 402
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+G + I G+ +F + +DL + FA C
Sbjct: 403 SGIS-------IFGNVAQFDFQVVYDLEGKKVSFAPTDCT 435
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 101/410 (24%), Positives = 161/410 (39%), Gaps = 80/410 (19%)
Query: 98 VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC---VDCNFPNVDPSRIPA 154
V S G Y + G+PP+ + DTGS ++W C +C + NF R+
Sbjct: 68 VDSVGLYFTKIKLGSPPKEYHVQV-DTGSDILWINCKPCPKCPTKTNLNF------RLSL 120
Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGL 214
F SS+S+ +GC + CS+I S+ C P L C +++ + G
Sbjct: 121 FDMNASSTSKKVGCDDDFCSFI------SQSDSCQP-----ALGCSYHIVYADESTSDGK 169
Query: 215 LLSETLRFPS-----KTVP---NFLAGCSILS-------DRQPAGIAGFGRSSESLPSQL 259
+ + L KT P + GC D G+ GFG+S+ S+ SQL
Sbjct: 170 FIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQL 229
Query: 260 GL-----KKFSYCLLSRKFDDAPVSSNLVLDTGPG---SGDSKTPGLSYTPFYKNPVGSS 311
+ FS+CL + K G G G +P + TP N +
Sbjct: 230 AATGDAKRVFSHCLDNVK--------------GGGIFAVGVVDSPKVKTTPMVPNQM--- 272
Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
Y V L + V + +P S + NGG IVDSG+T + L++++ +
Sbjct: 273 -----HYNVMLMGMDVDGTSLDLPRSIV-----RNGGTIVDSGTTLAYFPKVLYDSLIET 322
Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGN 431
+ + + + CF S P + +F+ K+ + P +Y +
Sbjct: 323 ILAR-----QPVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEE 377
Query: 432 EVLCLILFTDNAAGPALG-RGPAIILGDFQLQNFYLEFDLANDRFGFAKQ 480
E+ C F A G R I+LGD L N + +DL N+ G+A
Sbjct: 378 ELYC---FGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADH 424
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 103/404 (25%), Positives = 164/404 (40%), Gaps = 64/404 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP ++ + DTGS ++W C +C C + + + S
Sbjct: 78 GLYYAKIGIGTPAKSYYVQV-DTGSDIMWVNCI---QCKQCPRRSTLGIELTLYNIDESD 133
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
S +L+ C + C I G + S CK N +CP YL YG G TAG + + +
Sbjct: 134 SGKLVSCDDDFCYQISGGPL-SGCKA----NMSCP-----YLEIYGDGSSTAGYFVKDVV 183
Query: 221 RFPS-----KTVP---NFLAGC------SILSDRQPA--GIAGFGRSSESLPSQLG---- 260
++ S KT + + GC + S + A GI GFG+++ S+ SQL
Sbjct: 184 QYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGR 243
Query: 261 -LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
K F++CL R +V P ++ TP N Y
Sbjct: 244 VKKIFAHCLDGRNGGGIFAIGRVV-----------QPKVNMTPLVPNQ--------PHYN 284
Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
V + + VG + + IP PG G I+DSG+T ++ ++E + K+ Q
Sbjct: 285 VNMTAVQVGQEFLNIPADLFQPGD--RKGAIIDSGTTLAYLPEIIYEPLVKKITSQ---- 338
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
A V CF SG+ P + F+ + + P +Y L E + I +
Sbjct: 339 EPALKVHIVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHDY--LFPYEGMWCIGW 396
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N+A + R +LGD L N + +DL N G+ + C+
Sbjct: 397 -QNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCS 439
>gi|222635172|gb|EEE65304.1| hypothetical protein OsJ_20543 [Oryza sativa Japonica Group]
Length = 274
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 77/248 (31%), Positives = 107/248 (43%), Gaps = 77/248 (31%)
Query: 244 GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD-------SKTP 296
GIAGFGR SLPSQL + FSYC S FD S+ V+ G + + + T
Sbjct: 88 GIAGFGRGRWSLPSQLNVTSFSYCFTS-MFD---TKSSSVVTLGAAAAELLHTHHAAHTG 143
Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
+ T KNP S Y+V LR I VG V +P S L I+DSG++
Sbjct: 144 DVRTTRLIKNPSQPS-----LYFVPLRGISVGGARVAVPESRL------RSSTIIDSGAS 192
Query: 357 FTFMEGPLFEAVAKEFIRQM--GNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
T + ++EAV EF+ Q+ GNY V+
Sbjct: 193 ITTLPEDVYEAVKAEFVSQLPRGNY-----------------------VF---------- 219
Query: 415 GAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
E+Y A VLC++L D AA G +++G++Q QN ++ +DL ND
Sbjct: 220 --------EDYAA----RVLCVVL--DAAA------GEQVVIGNYQQQNTHVVYDLENDV 259
Query: 475 FGFAKQKC 482
FA +C
Sbjct: 260 LSFAPARC 267
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 104/404 (25%), Positives = 160/404 (39%), Gaps = 72/404 (17%)
Query: 99 HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
HSY + +L GTP + + I DTGS++ + PC C C + F P
Sbjct: 10 HSY--FYTTLKLGTP-ERTFSVIIDTGSTITYIPCKD---CSHCGKHTAE-----WFDPD 58
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLS 217
+S++++ + C +P C+ C + TC Y Y + G ++
Sbjct: 59 KSTTAKKLACGDPLCN-------------CGTPSCTCNNDRCYYSRTYAERSSSEGWMIE 105
Query: 218 ETLRFPSKTVPNFLA-GCSILSD----RQPA-GIAGFGRSSESLPSQLGLKKFSYCLLSR 271
+T FP P L GC RQ A GI G G + + SQL +K + S
Sbjct: 106 DTFGFPDSDSPVRLVFGCENGETGEIYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSL 165
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTP---GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
F P L+L GD P YTP + +Y V + I V
Sbjct: 166 CFG-YPKDGILLL------GDVTLPEGANTVYTPLL------THLHLHYYNVKMDGITVN 212
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
+ + S D G ++DSG+TFT++ F+A+AK +G+Y ++
Sbjct: 213 GQTLAFDASVF----DRGYGTVLDSGTTFTYLPTDAFKAMAK----AVGDYVEKKGLQST 264
Query: 389 SGLRPCF-DISGKKS--------VYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
G P + DI K + Y P F GGAK+ LPP Y L CL +F
Sbjct: 265 PGADPQYNDICWKGAPDQFKDLDKYFPPAEFVFGGGAKLTLPPLRYLFLSKPAEYCLGIF 324
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ +G ++G +++ + +D N + GF CA
Sbjct: 325 DNGNSG--------ALVGGVSVRDVVVTYDRRNSKVGFTTMACA 360
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 107/415 (25%), Positives = 165/415 (39%), Gaps = 67/415 (16%)
Query: 88 SNSLIKTPLSVHSYGG---YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF 144
S+S + P+S +Y G Y + + GTP Q T + DTGS L W C
Sbjct: 72 SSSAVSLPMSSGAYAGTGQYFVKVLVGTPAQEFT-LVADTGSELTWVKCAG--------- 121
Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
P + F P+ S S + C + C +V CS C SY
Sbjct: 122 -GASPPGL-VFRPEASKSWAPVPCSSDTCKL----DVPFSLANCSSSASPC-----SYDY 170
Query: 205 QYGLGFTAGLLL----SETLRFPSKTVP---NFLAGCSILSDRQP----AGIAGFGRSSE 253
+Y G L + S T+ P V + + GCS D Q G+ G +
Sbjct: 171 RYKEGSAGALGVVGTDSATIALPGGKVAQLQDVVLGCSSTHDGQSFKSVDGVLSLGNAKI 230
Query: 254 SLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGS 310
S S+ + FSYCL+ AP ++ L GPG +TP + T + +P
Sbjct: 231 SFASRAAARFGGSFSYCLVDHL---APRNATGYLAFGPGQ-VPRTPA-TQTKLFLDPAM- 284
Query: 311 SSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAK 370
FY V + + V + + IP P S GGVI+DSG+T T + P ++AV
Sbjct: 285 -----PFYGVKVDAVHVAGQALDIPAEVWDPKS---GGVILDSGTTLTVLATPAYKAVVA 336
Query: 371 EFIRQMGNYSRAADVEKKSGLRPCFDISGKK--SVYLPELILKFKGGAKMALPPENYFAL 428
+ + + D C++ + + + +P+L ++F G A++ P ++Y
Sbjct: 337 ALTKLLAGVPK-VDFPP---FEHCYNWTAPRPGAPEIPKLAVQFTGCARLEPPAKSYVID 392
Query: 429 VGNEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
V V C+ G G P + ++G+ Q EFDL N F C
Sbjct: 393 VKPGVKCI--------GLQEGEWPGVSVIGNIMQQEHLWEFDLKNMEVRFMPSTC 439
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 106/462 (22%), Positives = 173/462 (37%), Gaps = 73/462 (15%)
Query: 52 PLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY---GGYSISL 108
P L A L R +++++ ++ + S PLS +Y G Y +
Sbjct: 47 PGASLSDRARDDLHRHAYIRSQLA-SSRRGRRAAEVGASAFAMPLSSGAYTGTGQYFVRF 105
Query: 109 SFGTPPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
GTP Q PF+ DTGS L W C R P+R+ F S S I
Sbjct: 106 RVGTPAQ---PFVLVADTGSDLTWVKC--RGAGAAAGTGAGSPARV--FRTAASKSWAPI 158
Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFP-- 223
C + C+ V CS C +Y +Y G A G++ +++
Sbjct: 159 ACSSDTCT----SYVPFSLANCSSPASPC-----AYDYRYRDGSAARGVVGTDSATIALS 209
Query: 224 --------------SKTVPNFLAGCSILSDRQP----AGIAGFGRSSESLPSQLGLK--- 262
+ + GC+ D Q G+ G S+ S S+ +
Sbjct: 210 SGSGRGGGDSSGGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFGG 269
Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
+FSYCL+ AP ++ L GPG+ P + P+ FY V +
Sbjct: 270 RFSYCLVDHL---APRNATSYLTFGPGA---------TAPAAQTPLLLDRRMTPFYAVTV 317
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
+ V + + IP V D NGG I+DSG++ T + P + AV + + R
Sbjct: 318 DAVYVAGEALDIPAD--VWDVDRNGGAILDSGTSLTILATPAYRAVVTALSKHLAGLPRV 375
Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDN 442
C++ + ++ +P++ + F G A++ P ++Y V C+
Sbjct: 376 ----TMDPFEYCYNWTDAGALEIPKMEVHFAGSARLEPPAKSYVIDAAPGVKCI------ 425
Query: 443 AAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G G P + ++G+ Q EFDL + F +CA
Sbjct: 426 --GVQEGSWPGVSVIGNILQQEHLWEFDLRDRWLRFKHTRCA 465
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 113/435 (25%), Positives = 182/435 (41%), Gaps = 75/435 (17%)
Query: 69 HLKTKT-KPKTKDSNIGSNYSNSLIKTPLSVH---SYGGYSISLSFGTPPQASTPFIFDT 124
L+ K+ + + N GS++ P+ G Y + ++ GTP + S DT
Sbjct: 6 QLRVKSMHARFSNKNAGSHFKEMQADIPVQSGIPLGAGNYLVKMALGTP-KLSLSLALDT 64
Query: 125 GSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESR 184
GS + W C CV + F P++SSS + + C + C I
Sbjct: 65 GSDITWTQCEP---CVGSCYRQAQTK----FDPRKSSSYKNVSCSSSSCRII---TDSGG 114
Query: 185 CKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCSILSDRQP 242
+GC + TC Y +QYG G ++ G +E L PS + NFL GC +Q
Sbjct: 115 ARGCV--SSTCI-----YKVQYGDGSYSVGFFATEKLTISPSDVISNFLFGCG----QQN 163
Query: 243 AGIAGFGRSSESL------------PSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGS 290
AG FGR + L S+ F+YCL S + + +L L
Sbjct: 164 AG--RFGRIAGLLGLGRGKLSLALQTSEKYNNLFTYCLPSFS---SSSTGHLTL------ 212
Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGE--FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGG 348
G + +TP S AF FY + ++ + VG + I S N G
Sbjct: 213 GGQVPKSVKFTPL-------SPAFKNTPFYGIDIKGLSVGGHVLPIDASVF-----SNAG 260
Query: 349 VIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPEL 408
I+DSG+ T ++ ++ A++ +F + M +Y + + S L C+D SG +S+ +P +
Sbjct: 261 AIIDSGTVITRLQPTVYSALSSKFQQLMKDYPK---TDGFSILDTCYDFSGNESISVPRI 317
Query: 409 ILKFKGGAKMALPPENYFALVGN-EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLE 467
FKGG ++ + ++ + +CL A P G ++ G+ Q Q + +
Sbjct: 318 SFFFKGGVEVDIKFFGILTVINAWDKVCL------AFAPNDDDGDFVVFGNSQQQTYDVV 371
Query: 468 FDLANDRFGFAKQKC 482
DLA R GFA C
Sbjct: 372 HDLAKGRIGFAPSGC 386
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 150/357 (42%), Gaps = 54/357 (15%)
Query: 92 IKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSR 151
+ P+ + S G Y + + GTPPQ + + TG LVW CT C + + P DP++
Sbjct: 45 VAVPIYLSSQGLYVANFTIGTPPQPVSAVVDLTGE-LVWTQCTPCQPCFEQDLPLFDPTK 103
Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT 211
SS+ + + C + C I P C + C P+ G T
Sbjct: 104 --------SSTFRGLPCGSHLCESI--PESSRNCT-----SDVCIYEAPTKA-----GDT 143
Query: 212 AGLLLSETLRF-PSKTVPNFLAGCSILSDRQ------PAGIAGFGRSSESLPSQLGLKKF 264
G ++T +K F GC +++D++ P+GI G GR+ SL +Q+ + F
Sbjct: 144 GGKAGTDTFAIGAAKETLGF--GCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAF 201
Query: 265 SYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLR 323
SYCL + S+ L G + S TPF K GSS YY
Sbjct: 202 SYCLAGK--------SSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYY---- 249
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
+V +K + L S V++D+ S +++ ++A+ K +G A+
Sbjct: 250 --MVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPVAS 307
Query: 384 DVEKKSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALVGNEVLCLIL 438
+ +D+ K+V PEL+ F GGA + +PP NY GN +CL +
Sbjct: 308 PPKP-------YDLCFPKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGTVCLTI 357
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 100/403 (24%), Positives = 160/403 (39%), Gaps = 63/403 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS ++W C S C C + + + P+ S
Sbjct: 88 GLYFTRIGIGTPAKRYYVQV-DTGSDILWVNCVS---CDGCPRKSNLGIELTMYDPRGSQ 143
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
S +L+ C C +G V C SP C Y + YG G TAG +++ L
Sbjct: 144 SGELVTCDQQFCVANYG-GVLPSCTSTSP--------C-EYSISYGDGSSTAGFFVTDFL 193
Query: 221 RF----------PSKTVPNFLAGCSILSDRQPA-----GIAGFGRSSESLPSQLGL---- 261
++ P+ +F G + D + GI GFG+S+ S+ SQL
Sbjct: 194 QYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKV 253
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
K F++CL + N+V P + TP + Y V
Sbjct: 254 RKMFAHCLDTVNGGGIFAIGNVV-----------QPKVKTTPLVSDM--------PHYNV 294
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L+ I VG + +P + G+ + G I+DSG+T ++ E V K + +
Sbjct: 295 ILKGIDVGGTALGLPTNIFDSGN--SKGTIIDSGTTLAYVP----EGVYKALFAMVFDKH 348
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
+ V+ CF SG PE+ F+G + + P +Y G + C+ F
Sbjct: 349 QDISVQTLQDF-SCFQYSGSVDDGFPEVTFHFEGDVSLIVSPHDYLFQNGKNLYCM-GFQ 406
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ G+ ++LGD L N + +DL N G+A C+
Sbjct: 407 NGGVQTKDGK-DMVLLGDLVLSNKLVLYDLENQAIGWADYNCS 448
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 103/385 (26%), Positives = 153/385 (39%), Gaps = 56/385 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y S GTPPQ + + D S LVW C + F P RS+
Sbjct: 98 GMYVFSYGIGTPPQQVSGAL-DISSDLVWTACGA----------------TAPFNPVRST 140
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
+ + C + C F P ++ G + C +Y+ G T GLL +E
Sbjct: 141 TVADVPCTDDACQQ-FAP--QTCGAGAGAGSSECAY---TYMYGGGAANTTGLLGTEAFT 194
Query: 222 FPSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
F + + GC + + +G+ G GR + SL SQL + +FSY DD+
Sbjct: 195 FGDTRIDGVVFGCGLQNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAP---DDSVD 251
Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY-S 337
+ + +L GD TP S+T + +S A YYV L I V K + IP +
Sbjct: 252 TQSFIL-----FGDDATPQTSHT--LSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGT 304
Query: 338 YLVPGSDGNGGVIVDSGSTFTFME----GPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
+ + DG+GGV + T +E PL +AVA + N S GL
Sbjct: 305 FDLRNKDGSGGVFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSAL-------GLDL 357
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGP 452
C+ +P + L F GGA M L NYF + L CL + +A G
Sbjct: 358 CYTGESLAKAKVPSMALVFAGGAVMELELGNYFYMDSTTGLACLTILPSSA-------GD 410
Query: 453 AIILGDFQLQNFYLEFDLANDRFGF 477
+LG ++ +D+ + F
Sbjct: 411 GSVLGSLIQVGTHMMYDINGSKLVF 435
>gi|240255485|ref|NP_189841.4| aspartyl protease family protein [Arabidopsis thaliana]
gi|332644216|gb|AEE77737.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 430
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 100/432 (23%), Positives = 162/432 (37%), Gaps = 96/432 (22%)
Query: 63 SLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIF 122
+ ARH + P N S++ + L Y ++ GTPP+ +
Sbjct: 44 TFDSARHGRLLQSPVHGSFNWKVERDTSILLSAL-------YYTTVQIGTPPR-ELDVVI 95
Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVE 182
DTGS LVW C S CV C NV F P SSS+ + C + +CS +
Sbjct: 96 DTGSDLVWVSCNS---CVGCPLHNV-----TFFDPGASSSAVKLACSDKRCS--SDLQKK 145
Query: 183 SRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDRQ 241
SRC L +Y ++YG G T+G +S+ + F + + ++A
Sbjct: 146 SRCS---------LLESCTYKVEYGDGSVTSGYYISDLISFDTMSDWTYIA--------- 187
Query: 242 PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYT 301
F +S P V ++ T P + +S
Sbjct: 188 ------FRDNSTWHPW--------------------VRQGAIIGTFPALCSTPCSTVSSQ 221
Query: 302 PFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFME 361
P Y NP S + V +++P V G I+DSG+T
Sbjct: 222 PLYYNPQFS------------HMMTVAVNDLRLPIDPSVFSVAKGYGTIIDSGTTLVHFP 269
Query: 362 GPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYL------PELILKFKGG 415
G ++ + + + + Y R E CF+I+ S +L PE+ L F GG
Sbjct: 270 GEAYDPLIQAILNVVSQYGRPIPYESFQ----CFNITSGISSHLVIADMFPEVHLGFAGG 325
Query: 416 AKMALPPENY----FALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
A M + PE Y F + N + CL ++ + I+G+ +++ +DL
Sbjct: 326 ASMVIKPEAYLFQKFLDLTNAIWCLGFYSSTSR-------RITIIGEVAIRDKMFVYDLD 378
Query: 472 NDRFGFAKQKCA 483
+ R G+A+ C+
Sbjct: 379 HQRIGWAEYNCS 390
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 104/385 (27%), Positives = 151/385 (39%), Gaps = 60/385 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y S GTPPQ + + D S LVW C + F P RS+
Sbjct: 98 GMYVFSYGIGTPPQQVSGAL-DISSDLVWTACGA----------------TAPFNPVRST 140
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
+ + C + C F P C C +Y+ G T GLL +E
Sbjct: 141 TVADVPCTDDACQQ-FAPQT------CGAGASECAY---TYMYGGGAANTTGLLGTEAFT 190
Query: 222 FPSKTVPNFLAGCSI--LSD-RQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
F + + GC + + D +G+ G GR + SL SQL + +FSY DD+
Sbjct: 191 FGDTRIDGVVFGCGLKNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAP---DDSVD 247
Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY-S 337
+ + +L GD TP S+T + +S A YYV L I V K + IP +
Sbjct: 248 TQSFIL-----FGDDATPQTSHT--LSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGT 300
Query: 338 YLVPGSDGNGGVIVDSGSTFTFME----GPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
+ + DG+GGV + T +E PL +AVA + N S GL
Sbjct: 301 FDLRNKDGSGGVFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSAL-------GLDL 353
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL-CLILFTDNAAGPALGRGP 452
C+ +P + L F GGA M L NYF + L CL + +A G
Sbjct: 354 CYTGESLAKAKVPSMALVFAGGAVMELELGNYFYMDSTTGLACLTILPSSA-------GD 406
Query: 453 AIILGDFQLQNFYLEFDLANDRFGF 477
+LG ++ +D+ + F
Sbjct: 407 GSVLGSLIQVGTHMMYDINGSKLVF 431
>gi|357443039|ref|XP_003591797.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
gi|355480845|gb|AES62048.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
Length = 436
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 103/363 (28%), Positives = 165/363 (45%), Gaps = 71/363 (19%)
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSET 219
SS+ + I C + +CS +FG + GCS + K C + Y + G+ T+G + S+
Sbjct: 81 SSTLKPILCSSSQCS-LFGSH------GCSDK-KICGRS--PYNIVTGVS-TSGDIQSDI 129
Query: 220 LRFPSK---------TVPNFL--AGCSILSD---RQPAGIAGFGRSSESLPSQLG----- 260
+ S +VPNFL G +++ + + G+AG GR+ SLPSQ
Sbjct: 130 VSVQSTNGNYSGRFVSVPNFLFICGSNVVQNGLAKGVKGMAGLGRTKVSLPSQFSSAFSF 189
Query: 261 LKKFSYCLLSRK----FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSS--SAF 314
KF+ CL ++ F D P N ++ L YTP NPV +S S
Sbjct: 190 KNKFAICLGTQNGVLFFGDGPYLFNF----------DESKNLIYTPLITNPVSTSPSSFL 239
Query: 315 GE---FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
GE Y++G++ I V SK+VK+ + L +G GG + + + +T ME +++AVA
Sbjct: 240 GEKSVEYFIGVKSIRVSSKNVKLNTTLLSIDQNGFGGTKISTVNPYTIMETSIYKAVADA 299
Query: 372 FIRQMGNYSRAADVEKKSGLRPCF---DISGKK---SVYLPELILKFKGGAKMALPPENY 425
F++ + + VE + CF IS + V +L+L+ + +
Sbjct: 300 FVKAL----NVSTVEPVAPFGTCFASQSISSSRMGPDVPSIDLVLQNENVVWNIIGANAM 355
Query: 426 FALVGNEVLCLILFTDNAAGPAL---------GRGP--AIILGDFQLQNFYLEFDLANDR 474
+ +V+CL F D + A G P +I +G QL+N L+FDLA R
Sbjct: 356 VRINDKDVICL-GFVDAGSDFAKTSQVGFVVGGSKPMTSITIGAHQLENNLLQFDLATSR 414
Query: 475 FGF 477
GF
Sbjct: 415 LGF 417
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 109/391 (27%), Positives = 156/391 (39%), Gaps = 77/391 (19%)
Query: 110 FGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQ 169
GTPP I DTGS L W C +C P F P +S+S + C
Sbjct: 86 IGTPP-VDYLGIADTGSDLTWAQCLPCLKCYQ--------QLRPIFNPLKSTSFSHVPCN 136
Query: 170 NPKCSWIFGPN--VESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKT 226
C + + V+ C Y YG ++ G L E + S +
Sbjct: 137 TQTCHAVDDGHCGVQGVCD---------------YSYTYGDRTYSKGDLGFEKITIGSSS 181
Query: 227 VPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGL-----KKFSYCL---LSRKFDD 275
V + + GC S +G+ G G SL SQ+ ++FSYCL LS
Sbjct: 182 VKSVI-GCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGK 240
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFY-KNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
N V+ SG PG+ TP KN V +YY+ L I +G++
Sbjct: 241 INFGQNAVV-----SG----PGVVSTPLISKNTV-------TYYYITLEAISIGNER--- 281
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP- 393
++ GN VI+DSG+T +F+ L++ V ++ + +A V+
Sbjct: 282 ---HMAFAKQGN--VIIDSGTTLSFLPKELYDGVVSSLLKVV----KAKRVKDPGNFWDL 332
Query: 394 CFD--ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRG 451
CFD I+ S +P + +F GGA + L P N F V N V CL L A P G
Sbjct: 333 CFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTL---TPASPTDEFG 389
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ L NF + +DL R F C
Sbjct: 390 ---IIGNLALANFLIGYDLEAKRLSFKPTVC 417
>gi|383143497|gb|AFG53176.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143499|gb|AFG53177.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143505|gb|AFG53180.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143513|gb|AFG53184.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143515|gb|AFG53185.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
Length = 135
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 51/152 (33%), Positives = 79/152 (51%), Gaps = 20/152 (13%)
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPG---LSYTPFYKNPVGSSSAFGEFYYVGL 322
YCL D SS +V+ G+ PG L+YTP NP+ + FYY+GL
Sbjct: 1 YCL-----DYVNNSSKIVV------GNKAVPGDISLTYTPLIINPI-----YPFFYYLGL 44
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
+ +G K + +P++ S GNGG I+DSG++FT ++ +A EF Q+G Y R
Sbjct: 45 EAVSIGRKRLNLPFNSATFDSKGNGGTIIDSGTSFTIFPEAMYSQIAGEFASQIG-YKRV 103
Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKG 414
E + L C+++SG +++ P+ FKG
Sbjct: 104 PGAESTTALGLCYNVSGVENIQFPQFAFHFKG 135
>gi|147857949|emb|CAN80378.1| hypothetical protein VITISV_038701 [Vitis vinifera]
Length = 436
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 102/400 (25%), Positives = 161/400 (40%), Gaps = 70/400 (17%)
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
P + D G+ +W C Y ++ P R S+Q + C
Sbjct: 53 PLVPVKLVVDLGAQFLWVDCEQNYVS-------------SSYRPARCRSAQCSLARANGC 99
Query: 174 SWIFG---PNVESRCKGCSPRNKTCPLACPSYL------LQYGLGFTAGLLLSETLRFPS 224
F P + G P N A L +Q G G ++S + +F
Sbjct: 100 GDCFSAPRPGCNNNTCGVLPDNTVTRTATSGELAEDFVSVQSTDGSNPGRVVSVS-KFLF 158
Query: 225 KTVPNFL-AGCSILSDRQPAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRKFDDAPV 278
P FL G + G+AG GR+ + PSQ +KF+ CL S
Sbjct: 159 SCAPTFLLEGLA----SSAMGMAGLGRTRIAFPSQFASAFSFHRKFATCLSSS------T 208
Query: 279 SSNLVLDTGPGS-----GDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIVG 328
++N V+ G G + L YTP Y NPV ++SA+ + Y++ ++ I +
Sbjct: 209 TANGVVFFGDGPYRLLPNIDASQSLIYTPLYINPVSTASAYTQGEPSAEYFIRVKSIRIN 268
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVE 386
K + + S L S+G GG + + + +T ME +++ K FI N +R A V
Sbjct: 269 EKAISLNTSLLSIDSEGVGGTKISTVNPYTVMETSIYKXFTKAFISAAAAINITRVAAVA 328
Query: 387 KKSGLRPCFDISGKKSVY-------LPELILKFKGGAKM-ALPPENYFALVGNEVLCLIL 438
CF K+VY +P + L + + + N V ++VLCL
Sbjct: 329 P---FNVCFS---SKNVYSTRVGPSVPSIDLVLQNESVFWRIFGANSMVYVSDDVLCL-G 381
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
F D A P +I++G +QL++ L+FDLA R GF+
Sbjct: 382 FVDGGANPR----TSIVIGGYQLEDNLLQFDLATSRLGFS 417
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 112/427 (26%), Positives = 177/427 (41%), Gaps = 94/427 (22%)
Query: 86 NYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP 145
N SN+ ++ + G Y+ L GTPPQ I DTGS++ + PC++ C C
Sbjct: 65 NLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFA-LIVDTGSTVTYVPCST---CEQCG-- 118
Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
+ P F P+ SS+ + I C N C C C Y Q
Sbjct: 119 ---RHQDPKFDPESSSTYKPIKC-NIDCI-------------CDSDGVQC-----VYERQ 156
Query: 206 YG-LGFTAGLLLSETLRF--PSKTVPN-FLAGCSILS-----DRQPAGIAGFGRSSESLP 256
Y + ++G+L + + F S+ +P + GC + ++ GI G G SL
Sbjct: 157 YAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLV 216
Query: 257 SQLGLK-----KFSYCLLSRKFDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNP 307
QL K FS C +D G G+ G S + +T Y +P
Sbjct: 217 DQLVEKGAINDSFSLCYGG-------------MDIGGGAMVLGGISPPSDMIFT--YSDP 261
Query: 308 VGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEA 367
V S +Y V L++I V K K+P S + DG G ++DSG+T+ ++ F A
Sbjct: 262 VRSP-----YYNVDLKEIHVAGK--KLPLSSGI--FDGRYGAVLDSGTTYAYLPAEAFSA 312
Query: 368 VAKEFIRQMGNYSRAADVEKKSGLRP-----CFDISGKKSVYL----PELILKFKGGAKM 418
+ ++ + ++K G P CF +G + L P + + F+ G K+
Sbjct: 313 FKDAIMDEIHS------LKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQKL 366
Query: 419 ALPPENYFALVG--NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
+L PENYF + CL +F + G +LG ++N + +D AN + G
Sbjct: 367 SLTPENYFFRHSKVHGAYCLGIFEN-------GNDQTTLLGGIVVRNTLVMYDRANSKIG 419
Query: 477 FAKQKCA 483
F K C+
Sbjct: 420 FWKTNCS 426
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 112/427 (26%), Positives = 177/427 (41%), Gaps = 94/427 (22%)
Query: 86 NYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFP 145
N SN+ ++ + G Y+ L GTPPQ I DTGS++ + PC++ C C
Sbjct: 65 NLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFA-LIVDTGSTVTYVPCST---CEQCG-- 118
Query: 146 NVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQ 205
+ P F P+ SS+ + I C N C C C Y Q
Sbjct: 119 ---RHQDPKFDPESSSTYKPIKC-NIDCI-------------CDSDGVQC-----VYERQ 156
Query: 206 YG-LGFTAGLLLSETLRF--PSKTVPN-FLAGCSILS-----DRQPAGIAGFGRSSESLP 256
Y + ++G+L + + F S+ +P + GC + ++ GI G G SL
Sbjct: 157 YAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLV 216
Query: 257 SQLGLK-----KFSYCLLSRKFDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNP 307
QL K FS C +D G G+ G S + +T Y +P
Sbjct: 217 DQLVEKGAINDSFSLCYGG-------------MDIGGGAMVLGGISPPSDMIFT--YSDP 261
Query: 308 VGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEA 367
V S +Y V L++I V K K+P S + DG G ++DSG+T+ ++ F A
Sbjct: 262 VRSP-----YYNVDLKEIHVAGK--KLPLSSGI--FDGRYGAVLDSGTTYAYLPAEAFSA 312
Query: 368 VAKEFIRQMGNYSRAADVEKKSGLRP-----CFDISGKKSVYL----PELILKFKGGAKM 418
+ ++ + ++K G P CF +G + L P + + F+ G K+
Sbjct: 313 FKDAIMDEIHS------LKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQKL 366
Query: 419 ALPPENYFALVG--NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
+L PENYF + CL +F + G +LG ++N + +D AN + G
Sbjct: 367 SLTPENYFFRHSKVHGAYCLGIFEN-------GNDQTTLLGGIVVRNTLVMYDRANSKIG 419
Query: 477 FAKQKCA 483
F K C+
Sbjct: 420 FWKTNCS 426
>gi|413937239|gb|AFW71790.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 537
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 78/249 (31%), Positives = 112/249 (44%), Gaps = 35/249 (14%)
Query: 242 PAGIAGFGRSSESLPSQ----LGLKKFSYCLLSRKFDDAPVSSNL--VLDTGPGSGDSKT 295
P G+ GFG S PSQ G FSYCL S K SSN L GP +
Sbjct: 314 PQGLVGFGCGPLSFPSQNKDVYGFV-FSYCLPSYK------SSNFSSTLRLGPAGQPKR- 365
Query: 296 PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGS 355
+ TP NP S YYV + I VG + + +P S L G IVD+G+
Sbjct: 366 --IKMTPLLSNPHRPS-----LYYVNMVGIHVGGRPMLVPASALAFDPASGRGTIVDAGT 418
Query: 356 TFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGG 415
FT + P++ AV F ++ RA G C+++ ++ +P + F G
Sbjct: 419 MFTRLSAPVYAAVRDVFRSRV----RAPVTGPLGGFDTCYNV----TISVPTVTFSFDGR 470
Query: 416 AKMALPPENYFALVGNE-VLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLAND 473
+ LP EN ++ + CL + AAGP+ G + +L Q QN + FD+AN
Sbjct: 471 VSVTLPEENVVIRSSSDGIACLAM----AAGPSDGVDAVLNVLASMQQQNHRVLFDVANG 526
Query: 474 RFGFAKQKC 482
R GF+++ C
Sbjct: 527 RVGFSRELC 535
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 121/459 (26%), Positives = 169/459 (36%), Gaps = 105/459 (22%)
Query: 53 LKILHSLASSSL----SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSV--HSYGGYSI 106
L+++H +S S ++ ++ + + + + Y SL TP S G Y +
Sbjct: 31 LELIHRDSSKSPFYQPTQNKYERIANAVRRSINRVNHFYKYSLTSTPQSTVNSDKGEYLM 90
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
S S GTPP F+ DTGS LVW C +C P P F P SSS Q I
Sbjct: 91 SYSIGTPPFKVFGFV-DTGSDLVWLQCEPCKQCY--------PQITPIFDPSLSSSYQNI 141
Query: 167 GCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT 226
C + C + R C R G L ETL S T
Sbjct: 142 PCLSDTCHSM-------RTTSCDVR---------------------GYLSVETLTLDSTT 173
Query: 227 -----VPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLK---KFSYCL------ 268
P + GC + +GI G G SLPSQLG KFSYCL
Sbjct: 174 GYSVSFPKTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCLGPWLPN 233
Query: 269 --LSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
F DA + GD G TP K S YY+ L
Sbjct: 234 STSKLNFGDAAIV----------YGD----GAMTTPIVKKDAQSG------YYLTLEAFS 273
Query: 327 VGSKHVKIPYSYLVPGSDGN-GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADV 385
VG+K ++ + P GN G +++DSG+TFTF+ V F + Y V
Sbjct: 274 VGNKLIE----FGGPTYGGNEGNILIDSGTTFTFLP----YDVYYRFESAVAEYINLEHV 325
Query: 386 EKKSG-LRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAA 444
E +G + C++++ P + FK GA + L + F V + + CL A
Sbjct: 326 EDPNGTFKLCYNVA-YHGFEAPLITAHFK-GADIKLYYISTFIKVSDGIACLAFIPSQTA 383
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+ QN + ++L + F C
Sbjct: 384 ----------IFGNVAQQNLLVGYNLVQNTVTFKPVDCT 412
>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
Length = 435
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 104/398 (26%), Positives = 157/398 (39%), Gaps = 69/398 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + +G P Q P FDT + C C+ PAF P RSSS
Sbjct: 88 YRVLAGYGAPAQ-RFPVAFDTNFGVSVLRCKPCVGGAPCD---------PAFEPSRSSSF 137
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
I C +P+C+ C G S CP + +Q+G + G L+ +TL
Sbjct: 138 AAIPCGSPECAV--------ECTGAS-----CP-----FTIQFGNVTVANGTLVRDTLTL 179
Query: 223 -PSKTVPNFLAGC-SILSDRQ----PAGIAGFGRSSESLPSQL-------GLKKFSYCLL 269
PS T F GC + +D G+ RSS SL S++ FSYCL
Sbjct: 180 PPSATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLP 239
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
S + SS L G + + Y P NP +S Y+V L I VG
Sbjct: 240 S----SSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNS-----YFVELVGISVGG 290
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
+ + +P P G ++++ + FTF+ + A+ F R M Y A
Sbjct: 291 EDLPVP-----PAVFAAHGTLLEAATEFTFLAPAAYAALRDAFRRDMAPYPAAPPFRV-- 343
Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPEN--YFA---LVGNEVLCLILFTDNAA 444
L C++++G S+ +P + L+F GG ++ L YFA V + V CL
Sbjct: 344 -LDTCYNLTGLASLAVPTVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLP 402
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ ++G ++ + +DL R GF +C
Sbjct: 403 AFPVS-----VIGTLAQRSTEVVYDLRGGRVGFIPGRC 435
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 100/405 (24%), Positives = 162/405 (40%), Gaps = 64/405 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+P + + DTGS ++W C C C + + + PKRS
Sbjct: 67 GLYFTKIGLGSPSKDYYVQV-DTGSDILWVNCV---ECTRCPRKSDIGIGLTLYDPKRSK 122
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+S+ + C++ CS + E R GC N CP Y + YG G T G + + L
Sbjct: 123 TSEFVSCEHNFCSSTY----EGRILGCKAEN-----PCP-YSISYGDGSATTGYYVQDYL 172
Query: 221 RF------PSKTVPN--FLAGCSIL--------SDRQPAGIAGFGRSSESLPSQLGL--- 261
F P N + GC S+ GI GFG+++ S+ SQL
Sbjct: 173 TFNRVNGNPHTATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGK 232
Query: 262 --KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
K FS+CL D V + G+ P + TP N Y
Sbjct: 233 VKKIFSHCL------DTNVGGGIF-----SIGEVVEPKVKTTPLVPNMA--------HYN 273
Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
V L+ I V +++P S+ G ++DSG+T ++ +++ + + + +
Sbjct: 274 VILKNIEVDGDILQLPSDTF--DSENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRL 331
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLIL 438
E+ S CF +G P + L F+ + + P +Y F G+ C I
Sbjct: 332 KVYLVEEQYS----CFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWC-IG 386
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ +A+ G+ +LGDF L N + +DL N G+ C+
Sbjct: 387 WQKSASETKNGK-DMTLLGDFVLSNKLVVYDLENMTIGWTDYNCS 430
>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 117/437 (26%), Positives = 177/437 (40%), Gaps = 72/437 (16%)
Query: 58 SLASSSLSRARHLKTKTKPKTKDSN-IGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQA 116
++AS R ++L T KT + I S + ++ G Y + + GTP Q
Sbjct: 62 NMASKDPVRVKYLSTLVSQKTVSTAPIASGQAFNI----------GNYVVRVKLGTPGQL 111
Query: 117 STPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI 176
+ DT + + PC+ C D F PK S+S + C P+C +
Sbjct: 112 LF-MVLDTSTDEAFVPCSGCTGCSDTTFS-----------PKASTSYGPLDCSVPQCGQV 159
Query: 177 FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGC-- 234
G + + G N+ SY G F+A L+ + LR + +P + GC
Sbjct: 160 RGLSCPATGTGACSFNQ-------SYA---GSSFSA-TLVQDALRLATDVIPYYSFGCVN 208
Query: 235 SILSDRQPA------GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGP 288
+I PA G S+S + G+ FSYCL S F S +L L GP
Sbjct: 209 AITGASVPAQGLLGLGRGPLSLLSQSGSNYSGI--FSYCLPS--FKSYYFSGSLKL--GP 262
Query: 289 GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGG 348
G K+ + TP ++P S YYV I VG V P YL + G
Sbjct: 263 -VGQPKS--IRTTPLLRSPHRPS-----LYYVNFTGISVGRVLVPFPSEYLGFNPNTGSG 314
Query: 349 VIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPEL 408
I+DSG+ T P++ AV +EF +Q+G CF P +
Sbjct: 315 TIIDSGTVITRFVEPVYNAVREEFRKQVG----GTTFTSIGAFDTCF--VKTYETLAPPI 368
Query: 409 ILKFKGGAKMALPPENYFALV---GNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFY 465
L F+ G + LP EN +L+ + CL + AA P ++ +FQ QN
Sbjct: 369 TLHFE-GLDLKLPLEN--SLIHSSAGSLACLAM----AAAPDNVNSVLNVIANFQQQNLR 421
Query: 466 LEFDLANDRFGFAKQKC 482
+ FD+ N++ G A++ C
Sbjct: 422 ILFDIVNNKVGIAREVC 438
>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
Length = 416
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 77/285 (27%), Positives = 125/285 (43%), Gaps = 32/285 (11%)
Query: 209 GFTAGLLLSETLRFPSKTVPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKF 264
G T G++ ++T + T + GC + S P+G+ G GR+ SL SQ+ + KF
Sbjct: 136 GHTLGIVATDTFAIGTATA-SLGFGCVVASGIDTMGGPSGLIGLGRAPSSLVSQMNITKF 194
Query: 265 SYCLLSRKFDDAPVSSNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
SYCL D+ +S L+L + G G+S T TPF K G ++Y +
Sbjct: 195 SYCLTPH---DSGKNSRLLLGSSAKLAGGGNSTT-----TPFVKTSPGDD--MSQYYPIQ 244
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
L I G + +P S GN V+V + + +F+ ++A+ KE + +G
Sbjct: 245 LDGIKAGDAAIALPPS-------GN-TVLVQTLAPMSFLVDSAYQALKKEVTKAVGAAPT 296
Query: 382 AADVEKKSGLRPCFDISGKKSVYLPELILKF-KGGAKMALPPENYFALVGNE--VLCLIL 438
A ++ CF +G + P+L+ F +G A + +PP Y VG E +C+ +
Sbjct: 297 ATPLQP---FDLCFPKAGLSNASAPDLVFTFQQGAAALTVPPPKYLIDVGEEKGTVCMAI 353
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ + ILG Q +N + DL F CA
Sbjct: 354 LSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPADCA 398
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 100/403 (24%), Positives = 160/403 (39%), Gaps = 63/403 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS ++W C S C C + + + P+ S
Sbjct: 88 GLYFTRIGIGTPAKRYYVQV-DTGSDILWVNCVS---CDGCPRKSNLGIELTMYDPRGSQ 143
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
S +L+ C C +G V C SP C Y + YG G TAG +++ L
Sbjct: 144 SGELVTCDQQFCVANYG-GVLPSCTSTSP--------C-EYSISYGDGSSTAGFFVTDFL 193
Query: 221 RF----------PSKTVPNFLAGCSILSDRQPA-----GIAGFGRSSESLPSQLGL---- 261
++ P+ +F G + D + GI GFG+S+ S+ SQL
Sbjct: 194 QYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKV 253
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
K F++CL + N+V P + TP + Y V
Sbjct: 254 RKMFAHCLDTVNGGGIFAIGNVV-----------QPKVKTTPLVPDM--------PHYNV 294
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L+ I VG + +P + G+ + G I+DSG+T ++ E V K + +
Sbjct: 295 ILKGIDVGGTALGLPTNIFDSGN--SKGTIIDSGTTLAYVP----EGVYKALFAMVFDKH 348
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
+ V+ CF SG PE+ F+G + + P +Y G + C+ F
Sbjct: 349 QDISVQTLQDF-SCFQYSGSVDDGFPEVTFHFEGDVSLIVSPHDYLFQNGKNLYCM-GFQ 406
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ G+ ++LGD L N + +DL N G+A C+
Sbjct: 407 NGGVQTKDGK-DMVLLGDLVLSNKLVLYDLENQAIGWADYNCS 448
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 105/394 (26%), Positives = 165/394 (41%), Gaps = 62/394 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +++ GTP + T IFDTGS + W C CV + + P P S+
Sbjct: 69 GDYVVTVGLGTPKKEFT-LIFDTGSDITWTQCEP---CVKTCYKQ----KEPRLNPSTST 120
Query: 162 SSQLIGCQNPKCSWIF-GPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
S + I C + C + G C + TC Y +QYG G ++ G +ET
Sbjct: 121 SYKNISCSSALCKLVASGKKFSQSCS-----SSTCL-----YQVQYGDGSYSIGFFATET 170
Query: 220 LRFPSKTV-PNFLAGCSILSDRQPAGIAGFG---RSSESLPSQLGL---KKFSYCLLSRK 272
L S V NFL GC ++ G AG R+ +LPSQ K FSYCL
Sbjct: 171 LTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCL---- 226
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE--FYYVGLRQIIVGSK 330
P SS+ G SK+ + +TP S+ F FY + + + VG +
Sbjct: 227 ----PASSSSKGYLSLGGQVSKS--VKFTPL-------SADFDSTPFYGLDITGLSVGGR 273
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+ I S + G ++DSG+ T + + ++ F M +Y + S
Sbjct: 274 QLSIDESAF------SAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGY---SI 324
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFTDNAAGPALG 449
C+D S +V +P++ + FKGG +M + + + G + +CL ++
Sbjct: 325 FDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGND------D 378
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+ Q + + + +D A R GFA C+
Sbjct: 379 DSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 412
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 105/394 (26%), Positives = 165/394 (41%), Gaps = 62/394 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +++ GTP + T IFDTGS + W C CV + + P P S+
Sbjct: 129 GDYVVTVGLGTPKKEFT-LIFDTGSDITWTQCEP---CVKTCYKQ----KEPRLNPSTST 180
Query: 162 SSQLIGCQNPKCSWIF-GPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
S + I C + C + G C + TC Y +QYG G ++ G +ET
Sbjct: 181 SYKNISCSSALCKLVASGKKFSQSCS-----SSTCL-----YQVQYGDGSYSIGFFATET 230
Query: 220 LRFPSKTV-PNFLAGCSILSDRQPAGIAGFG---RSSESLPSQLGL---KKFSYCLLSRK 272
L S V NFL GC ++ G AG R+ +LPSQ K FSYCL
Sbjct: 231 LTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCL---- 286
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE--FYYVGLRQIIVGSK 330
P SS+ G SK+ + +TP S+ F FY + + + VG +
Sbjct: 287 ----PASSSSKGYLSLGGQVSKS--VKFTPL-------SADFDSTPFYGLDITGLSVGGR 333
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+ I S + G ++DSG+ T + + ++ F M +Y + S
Sbjct: 334 KLSIDESAF------SAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGY---SI 384
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFTDNAAGPALG 449
C+D S +V +P++ + FKGG +M + + + G + +CL ++
Sbjct: 385 FDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGND------D 438
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+ Q + + + +D A R GFA C+
Sbjct: 439 DSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 472
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 105/394 (26%), Positives = 165/394 (41%), Gaps = 62/394 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +++ GTP + T IFDTGS + W C CV + + P P S+
Sbjct: 117 GDYVVTVGLGTPKKEFT-LIFDTGSDITWTQCEP---CVKTCYKQ----KEPRLNPSTST 168
Query: 162 SSQLIGCQNPKCSWIF-GPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
S + I C + C + G C + TC Y +QYG G ++ G +ET
Sbjct: 169 SYKNISCSSALCKLVASGKKFSQSCS-----SSTCL-----YQVQYGDGSYSIGFFATET 218
Query: 220 LRFPSKTV-PNFLAGCSILSDRQPAGIAGFG---RSSESLPSQLGL---KKFSYCLLSRK 272
L S V NFL GC ++ G AG R+ +LPSQ K FSYCL
Sbjct: 219 LTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCL---- 274
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE--FYYVGLRQIIVGSK 330
P SS+ G SK+ + +TP S+ F FY + + + VG +
Sbjct: 275 ----PASSSSKGYLSLGGQVSKS--VKFTPL-------SADFDSTPFYGLDITGLSVGGR 321
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+ I S + G ++DSG+ T + + ++ F M +Y + S
Sbjct: 322 KLSIDESAF------SAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGY---SI 372
Query: 391 LRPCFDISGKKSVYLPELILKFKGGAKMALPPENY-FALVGNEVLCLILFTDNAAGPALG 449
C+D S +V +P++ + FKGG +M + + + G + +CL ++
Sbjct: 373 FDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGND------D 426
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+ Q + + + +D A R GFA C+
Sbjct: 427 DSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 460
>gi|222822564|gb|ACM68431.1| xyloglucan-specific endoglucanase inhibitor protein [Capsicum
annuum]
Length = 437
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 107/411 (26%), Positives = 166/411 (40%), Gaps = 77/411 (18%)
Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
+L + T Q TP + D G +W VDC+ V S PA RS+
Sbjct: 44 TLQYLTQIQQRTPLVPVSLTLDLGGQFLW---------VDCDQGYVSSSYKPARC--RSA 92
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
L G C F P GC+ N TC L + + + T+G L S+ +
Sbjct: 93 QCSLAGATG--CGECFSPPRP----GCN--NNTCGLFPDNTVTRTA---TSGELASDVVS 141
Query: 222 FPSKTVPN-----------FLAGCSILSDRQPAGI---AGFGRSSESLPSQLGL-----K 262
S N F+ G + L +G+ AG GR+ SLPSQ +
Sbjct: 142 VQSSNGKNPGRNVSDKNFLFVCGATFLLQGLASGVKGMAGLGRTRISLPSQFSAEFSFPR 201
Query: 263 KFSYCLLSRK------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFG- 315
KF+ CL S K F D P + +T + D YTP NPV ++SAF
Sbjct: 202 KFAVCLSSSKSKGVVLFGDGPYF--FLPNTEFSNND-----FQYTPLLINPVSTASAFSA 254
Query: 316 ----EFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
Y++G++ + + K V I + L + G GG + + + +T +E L+ A+
Sbjct: 255 GQPSSEYFIGVKSVKINQKVVPINTTLLSIDNQGVGGTKISTVNPYTVLETSLYNAITNF 314
Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY----LPELILKFKG-GAKMALPPENYF 426
F++++ N +R A V CFD S +P++ L + + N
Sbjct: 315 FVKELANVTRVASVAP---FGACFDSRNIGSTRVGPAVPQIDLVLQNENVIWTIFGANSM 371
Query: 427 ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
V VLCL F D + +I++G +++ L+ D+A R GF
Sbjct: 372 VQVSENVLCL-GFVDG----GVNSRTSIVIGGHTIEDNLLQLDIARSRLGF 417
>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
Length = 397
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 99/388 (25%), Positives = 150/388 (38%), Gaps = 50/388 (12%)
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
+ + GTPPQ ++ I D LVW C+ RC + +P FIP SS+ +
Sbjct: 46 NFTIGTPPQPASAII-DVAGELVWTQCSRCSRCFKQD--------LPLFIPNASSTFRPE 96
Query: 167 GCQNPKCSWIFGPNVESRCKG--CSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS 224
C C S C G C+ + T ++ T G++ +ET +
Sbjct: 97 PCGTDACK----STPTSNCSGDVCTYESTTN--------IRLDRHTTLGIVGTETFAIGT 144
Query: 225 KTVPNFLAGCSILSDRQP----AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSS 280
T + GC + SD +G G GR+ SL +Q+ L KFSYCL R SS
Sbjct: 145 ATA-SLAFGCVVASDIDTMDGTSGFIGLGRTPRSLVAQMKLTKFSYCLSPRGTGK---SS 200
Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
L L G + + S PF K S +Y + L I G+ + S
Sbjct: 201 RLFL--GSSAKLAGGESTSTAPFIKTSPDDDSH--HYYLLSLDAIRAGNTTIATAQS--- 253
Query: 341 PGSDGNGGVIV-DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF-DIS 398
GG++V + S F+ + + A K +G + CF +
Sbjct: 254 ------GGILVMHTVSPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAA 307
Query: 399 GKKSVYLPELILKFK-GGAKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPAII 455
G P+L+ F+ GGA + +PP Y VG E C + + A G +
Sbjct: 308 GFSRATAPDLVFTFQGGGAALTVPPAKYLIDVGEEKDTACAAILS-MARLNRTGLEGVSV 366
Query: 456 LGDFQLQNFYLEFDLANDRFGFAKQKCA 483
LG Q +N + +DL + F C+
Sbjct: 367 LGSLQQENVHFLYDLKKETLSFEPADCS 394
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 106/405 (26%), Positives = 162/405 (40%), Gaps = 76/405 (18%)
Query: 102 GGYSISLSFGTPPQASTPF--IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
G Y +++S GTPP P I DTGS L+W +C+ C PN P F PK
Sbjct: 92 GAYLMNISLGTPP---VPMLGIADTGSDLIW------RQCLPC--PNCYEQVEPLFDPKE 140
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
S + + + C N C + +G + TC +Y YG +T G L S+
Sbjct: 141 SETYKTLDCDNEFCQDL-------GQQGSCDDDNTC-----TYSYSYGDRSYTRGDLSSD 188
Query: 219 TLRFPSK-----TVPNFLAGC-----SILSDRQPAGIAGFGRSSE---SLPSQLGLKKFS 265
TL S + P GC +++ I G L S++G +FS
Sbjct: 189 TLTIGSTEGDPASFPGIAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVG-GQFS 247
Query: 266 YCLLSRKFDDAPVSSNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
YCL+ D+ VSS + GSG TP + TP FYY+ L
Sbjct: 248 YCLVPLS-SDSTVSSKINFGKSGVVSGSGTVSTPLIKGTP------------DTFYYLTL 294
Query: 323 RQIIVGSKHVKIP---YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
+ VGS+ V + P + G +I+DSG+T T + + V +G
Sbjct: 295 EGLSVGSETVAFKGFSENKSSPAAVEEGNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQ 354
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
+ + C+ S ++ +P + F GA + LPP N F V +++C +
Sbjct: 355 TT---TDPNGIFSLCY--SSVNNLEIPTITAHFT-GADVQLPPLNTFVQVQEDLVCFSMI 408
Query: 440 -TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ N A I G+ NF + +DL N++ F + C
Sbjct: 409 PSSNLA----------IFGNLAQINFLVGYDLKNNKVSFKQTDCT 443
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 100/401 (24%), Positives = 164/401 (40%), Gaps = 82/401 (20%)
Query: 106 ISLSFGTP--PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
++ S G P PQ + I DTGS+++W C RC N P +DPS +SS+
Sbjct: 101 VNFSMGQPATPQLA---IMDTGSNILWVRCAPCKRCTQQNGPLLDPS--------KSSTY 149
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
+ C N C + P+ C+ N+ Y L Y G +AG+L +E L F
Sbjct: 150 ASLPCTNTMCH--YAPSAY-----CNRLNQC------GYNLSYATGLSSAGVLATEQLIF 196
Query: 223 PS-----KTVPNFLAGCSI----LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKF 273
S VP+ + GCS DR+ G+ G G+ S +++G KFSYCL
Sbjct: 197 HSSDEGVNAVPSVVFGCSHENGDYKDRRFTGVFGLGKGITSFVTRMG-SKFSYCL----- 250
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPF----------YKNPVGSSSAFGEFYYVGLR 323
G+ P Y Y P+ YYV L
Sbjct: 251 -----------------GNIADPHYGYNQLVFGEKANFEGYSTPL---KVVNGHYYVTLE 290
Query: 324 QIIVGSKHVKIP-YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
I VG K + I ++ + G++ + ++DSG+ T++ F A+ E +RQ+ +
Sbjct: 291 GISVGEKRLDIDSTAFSMKGNEKSA--LIDSGTALTWLAESAFRALDNE-VRQLLD---G 344
Query: 383 ADVEKKSGLRPCFD-ISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
+ G C+ + + P + F GGA + L E+ F ++LC+ +
Sbjct: 345 VLMPFWRGSFACYKGTVSQDLIGFPVVTFHFSGGADLDLDTESMFYQATPDILCIAVRQA 404
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+A G ++G Q + + +DL +++ F + C
Sbjct: 405 SAYGNDFKSFS--VIGLMAQQYYNMAYDLNSNKLFFQRIDC 443
>gi|32482806|gb|AAP84703.1| putative xyloglucanase inhibitor [Solanum tuberosum]
Length = 437
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 106/411 (25%), Positives = 167/411 (40%), Gaps = 77/411 (18%)
Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
+L + T Q TP + D G +W VDC+ V S PA RS+
Sbjct: 44 TLQYLTQIQQRTPLVPISLTLDLGGQFLW---------VDCDQGYVSSSYKPARC--RSA 92
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETL- 220
L G C F P GC+ N TC L + + + T+G L S+ +
Sbjct: 93 QCSLGGASG--CGECFSPPR----PGCN--NNTCGLLPDNTVTRTA---TSGELASDIVS 141
Query: 221 ------RFPSKTVPN----FLAGCSILSDRQPAGI---AGFGRSSESLPSQLGL-----K 262
+ P ++V + F+ G + L +G+ AG GR+ SLPSQ +
Sbjct: 142 VQSTNGKNPGRSVSDKNFLFVCGATFLLQGLASGVKGMAGLGRTRISLPSQFSAEFSFPR 201
Query: 263 KFSYCLLSRK------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE 316
KF+ CL S F D P L S + YTP + NPV ++SAF
Sbjct: 202 KFALCLTSSNSKGVVLFGDGPY---FFLPNREFSNND----FQYTPLFINPVSTASAFSS 254
Query: 317 -----FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
Y++G++ I + K V I + L + G GG + + + +T +E L+ A+
Sbjct: 255 GQPSSEYFIGVKSIKINQKVVPINTTLLSIDNQGVGGTKISTVNPYTILETSLYNAITNF 314
Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY----LPELILKFKG-GAKMALPPENYF 426
F++++ N +R A V + CFD S +P + L + + N
Sbjct: 315 FVKELANVTRVAAVAP---FKVCFDSRNIGSTRVGPAVPSIDLVLQNENVVWTIFGANSM 371
Query: 427 ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
V VLCL + + +I++G +++ L+FD A R GF
Sbjct: 372 VQVSENVLCLGVLDG-----GVNSRTSIVIGGHTIEDNLLQFDHAASRLGF 417
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 142/378 (37%), Gaps = 57/378 (15%)
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
P + P DT L W +C C P P + F P+RS +S + C + C
Sbjct: 158 PILAQPMSIDTSIDLPWI------QCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAAC 211
Query: 174 SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFL 231
+ GCS N C Y + YG G T+G + + L PS V NF
Sbjct: 212 G-----ELGRYGAGCS--NNQC-----QYFVDYGDGRATSGTYMVDALTLNPSTVVMNFR 259
Query: 232 AGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVL 284
GCS +G G +SL SQ FSYC+ P SS L
Sbjct: 260 FGCSHAVRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCVPD------PSSSGF-L 312
Query: 285 DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSD 344
G + + TP +NP S Y V LR I VG + + +P
Sbjct: 313 SLGGPADGGGAGRFARTPLVRNP----SIIPTLYLVRLRGIEVGGRRLNVPPVVFA---- 364
Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY 404
GG ++DS T + + A+ F M Y R A ++GL C+D SV
Sbjct: 365 --GGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYPRVAG--GRAGLDTCYDFVRFTSVT 420
Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
+P + L F GGA + L +G V + F ALG +G+ Q Q
Sbjct: 421 VPAVSLVFDGGAVVRLDA------MGVMVEGCLAFVPTPGDFALG-----FIGNVQQQTH 469
Query: 465 YLEFDLANDRFGFAKQKC 482
+ +D+ GF + C
Sbjct: 470 EVLYDVGGGSVGFRRGAC 487
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 142/378 (37%), Gaps = 57/378 (15%)
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
P + P DT L W +C C P P + F P+RS +S + C + C
Sbjct: 142 PILAQPMSIDTSIDLPWI------QCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAAC 195
Query: 174 SWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFL 231
+ GCS N C Y + YG G T+G + + L PS V NF
Sbjct: 196 G-----ELGRYGAGCS--NNQC-----QYFVDYGDGRATSGTYMVDALTLNPSTVVMNFR 243
Query: 232 AGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVL 284
GCS +G G +SL SQ FSYC+ P SS L
Sbjct: 244 FGCSHAVRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCVPD------PSSSGF-L 296
Query: 285 DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSD 344
G + + TP +NP S Y V LR I VG + + +P
Sbjct: 297 SLGGPADGGGAGRFARTPLVRNP----SIIPTLYLVRLRGIEVGGRRLNVPPVVFA---- 348
Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY 404
GG ++DS T + + A+ F M Y R A ++GL C+D SV
Sbjct: 349 --GGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYPRVAG--GRAGLDTCYDFVRFTSVT 404
Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
+P + L F GGA + L +G V + F ALG +G+ Q Q
Sbjct: 405 VPAVSLVFDGGAVVRLDA------MGVMVEGCLAFVPTPGDFALG-----FIGNVQQQTH 453
Query: 465 YLEFDLANDRFGFAKQKC 482
+ +D+ GF + C
Sbjct: 454 EVLYDVGGGSVGFRRGAC 471
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 106/404 (26%), Positives = 155/404 (38%), Gaps = 68/404 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + L GTPP I DTGS+++W PC + C DC N S F P SS
Sbjct: 96 GNYLMKLLIGTPPTEIHAAI-DTGSNVIWIPCIN---CKDCF--NQSSS---IFNPLASS 146
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
+ Q C + +C E+ C N C +C ++ L G + +T+
Sbjct: 147 TYQDAPCDSYQC--------ETTSSSCQSDN-VCLYSCDE---KHQLNCPNGRIAVDTMT 194
Query: 222 FPSKT-------VPNFLAGCSILSDRQPAGIAGFGRSSESLPSQ---LGLKKFSYCLLSR 271
S +F+ G SI G+ G GR + SL S+ L KFSYC L+
Sbjct: 195 LTSSDGRPFPLPYSDFVCGNSIYKTFAGVGVIGLGRGALSLTSKLYHLSDGKFSYC-LAD 253
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
+ P N L + D + + +G G YYV L I VG K
Sbjct: 254 YYSKQPSKINFGLQSFISDDDLEVVSTT--------LGHHRHSGN-YYVTLEGISVGEKR 304
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFE--------AVAKEFIRQMGNYSRAA 383
+ Y P + G +++DSG+ FT + ++ A+ + N
Sbjct: 305 QDL-YYVDDPFAPPVGNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPF 363
Query: 384 DVEKKSGLRPCFDISGKKSVYLPEL----ILKFKGGAKMALPPENYFALVGNEVLCLILF 439
++ L PCF Y PEL I A + L +N F V +V+C
Sbjct: 364 SMDNTLKLSPCF-------WYYPELKFPKITIHFTDADVELSDDNSFIRVAEDVVCF--- 413
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A A G + + G +Q NF L +DL F + C+
Sbjct: 414 ----AFAATQPGQSTVYGSWQQMNFILGYDLKRGTVSFKRTDCS 453
>gi|357492303|ref|XP_003616440.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517775|gb|AES99398.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 521
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 109/429 (25%), Positives = 161/429 (37%), Gaps = 90/429 (20%)
Query: 70 LKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLV 129
LK KT P+T SL L ++SL+ G+PPQ T + DTGS L
Sbjct: 13 LKVKTLPQT-----------SLSPRKLPFQHNVTLTVSLTVGSPPQRVT-MVLDTGSELS 60
Query: 130 WFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCS 189
W C + + NF F P SSS C +P C+
Sbjct: 61 WLHCK---KLPNLNF---------IFNPLVSSSYTPTPCTSPICT-------------TQ 95
Query: 190 PRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGC------SILSDRQPA 243
R+ P++C + L + + F G + F GC S D +
Sbjct: 96 TRDLINPVSCDANKLCHIITFFVGGPAQRGMVF----------GCMDTGTSSGDEDSKTT 145
Query: 244 GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPF 303
G+ G S S +Q+ L KFSYC+ ++ V N+ + + L YTP
Sbjct: 146 GLMGMDLGSLSFSNQMRLPKFSYCISNKDSTGVLVLENI-------ANPPRLGPLHYTPL 198
Query: 304 YKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGP 363
K ++ F R + K S +P G G +VDS + FTF+ P
Sbjct: 199 VKK----TTPLPYFN----RNCCLFQK------SAFLPDHTGAGQTMVDSATQFTFLRQP 244
Query: 364 LFEAVAKEFIRQMGNYSRAADVEK---KSGLRPCFDIS-GKKSVYLPELILKFKGGAKMA 419
++ A+ EF Q N K + + CF + G LP + L F GA++
Sbjct: 245 VYTALKNEFAIQTKNILTPLGDPKFVFQGVMDLCFRVPIGSTLPVLPVVTLMF-DGAELR 303
Query: 420 LPPENYFALVGN------EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
+ E V N + C + G A I+G +N ++E+DLAN
Sbjct: 304 VTGERLLYKVSNVAKSNSWIYCFTFGNSDLLGIE-----AFIIGHHHQRNVWMEYDLANS 358
Query: 474 RFGFAKQKC 482
R GF+ C
Sbjct: 359 RIGFSDTNC 367
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 97/387 (25%), Positives = 149/387 (38%), Gaps = 49/387 (12%)
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLI 166
+ + GTPPQ ++ I D LVW C+ RC + +P FIP SS+ +
Sbjct: 46 NFTIGTPPQPASAII-DVAGELVWTQCSRCSRCFKQD--------LPLFIPNASSTFRPE 96
Query: 167 GCQNPKCSWIFGPNVESRCKG--CSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS 224
C C S C G C+ + T ++ T G++ +ET +
Sbjct: 97 PCGTDACK----STPTSNCSGDVCTYESTTN--------IRLDRHTTLGIVGTETFAIGT 144
Query: 225 KTVPNFLAGCSILSDRQP----AGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSS 280
T + GC + SD +G G GR+ SL +Q+ L KFSYCL R SS
Sbjct: 145 ATA-SLAFGCVVASDIDTMDGTSGFIGLGRTPRSLVAQMKLTKFSYCLSPRGTGK---SS 200
Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
L L G + + S PF K S +Y + L I G+ + S
Sbjct: 201 RLFL--GSSAKLAGGESTSTAPFIKTSPDDDSH--HYYLLSLDAIRAGNTTIATAQS--- 253
Query: 341 PGSDGNGGVIV-DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF-DIS 398
GG++V + S F+ + + A K +G + CF +
Sbjct: 254 ------GGILVMHTVSPFSLLVDSAYRAFKKAVTEAVGGAAEQPMATPPQPFDLCFKKAA 307
Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPAIIL 456
G P+L+ F+G A + +PP Y VG E C + + A G +L
Sbjct: 308 GFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILS-MAWLNRTGLEGVSVL 366
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKCA 483
G Q ++ + +DL + F C+
Sbjct: 367 GSLQQEDVHFLYDLKKETLSFEPADCS 393
>gi|350536487|ref|NP_001234249.1| xyloglucan-specific fungal endoglucanase inhibitor protein
precursor [Solanum lycopersicum]
gi|27372527|gb|AAN87262.1| xyloglucan-specific fungal endoglucanase inhibitor protein
precursor [Solanum lycopersicum]
Length = 438
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 104/410 (25%), Positives = 165/410 (40%), Gaps = 74/410 (18%)
Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
+L + T Q TP + D G +W VDC+ V S PA R
Sbjct: 44 TLQYLTQIQQRTPLVPISLTLDLGGQFLW---------VDCDQGYVSSSYKPA----RCG 90
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETL- 220
S+Q C F P GC+ N TC L + + T+G L S+ +
Sbjct: 91 SAQCSLGGASGCGECFSPPR----PGCN--NNTCGLLPDNTVTGTA---TSGELASDVVS 141
Query: 221 ------RFPSKTVPN----FLAGCSILSDRQPAGI---AGFGRSSESLPSQLGL-----K 262
+ P ++V + F+ G + L +G+ AG GR+ SLPSQ +
Sbjct: 142 VESSNGKNPGRSVSDKNFLFVCGATFLLQGLASGVKGMAGLGRTKISLPSQFSAEFSFPR 201
Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGP----GSGDSKTPGLSYTPFYKNPVGSSSAFGE-- 316
KF+ CL S V + GP + YTP + NPV ++SAF
Sbjct: 202 KFALCLTSSSNSKGVV----LFGDGPYFFLPNRQFSNNDFQYTPLFINPVSTASAFSSGQ 257
Query: 317 ---FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
Y++G++ I + K V I + L + G GG + + + +T +E L+ A+ F+
Sbjct: 258 PSSEYFIGVKSIKINQKVVPINTTLLSIDNQGVGGTKISTVNPYTILETSLYNAITNFFV 317
Query: 374 RQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP------ELILKFKGGAKMALPPENYFA 427
+++ N +R A V R CFD S + +L+L+ + N
Sbjct: 318 KELANVTRVAVVAP---FRVCFDSRDIGSTRVGPAVPSIDLVLQ-NANVVWTIFGANSMV 373
Query: 428 LVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
V VLCL + + +I++G +++ L+FD A R GF
Sbjct: 374 QVSENVLCLGVLDG-----GVNARTSIVIGGHTIEDNLLQFDHAASRLGF 418
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 114/403 (28%), Positives = 165/403 (40%), Gaps = 96/403 (23%)
Query: 98 VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPA 154
V S G Y ++L GTPP I DTGS L W PCT Y+ V +P
Sbjct: 86 VPSAGEYLMNLYIGTPPVPVIA-IVDTGSDLTWTQCRPCTHCYKQV-----------VPL 133
Query: 155 FIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAG 213
F PK SS+ + C C + + + CS + K C ++ Y G FT G
Sbjct: 134 FDPKNSSTYRDSSCGTSFCLAL------GKDRSCS-KEKKC-----TFRYSYADGSFTGG 181
Query: 214 LLLSETLRFPSK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQL----- 259
L SETL S + P F GC S D+ +GI G G SL SQL
Sbjct: 182 NLASETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTIN 241
Query: 260 GLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
GL FSYCLL PVS++ + + G S G S +G
Sbjct: 242 GL--FSYCLL-------PVSTDSSISSRINFGAS---------------GRVSGYGTV-- 275
Query: 320 VGLRQIIVGSKHVKIPYS-YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
S +++PY Y G +IVDSG+T+TF+ + + K + N
Sbjct: 276 ---------STPLRLPYKGYSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEK----SVAN 322
Query: 379 YSRAADVEKKSGL-RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
+ V +G+ C++ + + + P + FK A + L P N F + +++C
Sbjct: 323 SIKGKRVRDPNGIFSLCYNTTAE--INAPIITAHFK-DANVELQPLNTFMRMQEDLVCFT 379
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQ 480
+ + G +LG+ NF + FDL R GF+K+
Sbjct: 380 VAPTSDIG---------VLGNLAQVNFLVGFDLRKKR-GFSKK 412
Score = 43.5 bits (101), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 60/139 (43%), Gaps = 16/139 (11%)
Query: 346 NGGVIVDSGSTFTFMEGPL-FEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY 404
G +IVDSG+T+T++ PL F +E + R D S L C++ + + +
Sbjct: 417 EGNIIVDSGTTYTYL--PLEFYVKLEESVAHSIKGKRVRDPNGISSL--CYNTTVDQ-ID 471
Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
P + FK A + L P N F + +++C + + G ILG+ NF
Sbjct: 472 APIITAHFK-DANVELQPWNTFLRMQEDLVCFTVLPTSDIG---------ILGNLAQVNF 521
Query: 465 YLEFDLANDRFGFAKQKCA 483
+ FDL R F C
Sbjct: 522 LVGFDLRKKRVSFKAADCT 540
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 96/406 (23%), Positives = 166/406 (40%), Gaps = 68/406 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G PP+ + DTGS ++W C + C C + ++ + P+ S+
Sbjct: 80 GLYFAKIGLGNPPKDYYVQV-DTGSDILWVNCAN---CDKCPTKSDLGVKLTLYDPQSST 135
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
S+ I C + C+ + +GC T L C Y + YG G TAG + + L
Sbjct: 136 SATRIYCDDDFCAATY----NGVLQGC-----TKDLPC-QYSVVYGDGSSTAGFFVKDNL 185
Query: 221 RFP--------SKTVPNFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQLGL---- 261
+F S + + GC S GI GFG+++ S+ SQL
Sbjct: 186 QFDRVTGNLQTSSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKV 245
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGS---GDSKTPGLSYTPFYKNPVGSSSAFGEF 317
+ F++CL + K G G G+ +P ++ TP N
Sbjct: 246 KRVFAHCLDNVK--------------GGGIFAIGEVVSPKVNTTPMVPNQ--------PH 283
Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
Y V +++I VG +++P G G I+DSG+T ++ ++E++ + + +
Sbjct: 284 YNVVMKEIEVGGNVLELPTDIFDTGD--RRGTIIDSGTTLAYLPEVVYESMMTKIVSEQP 341
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
+ VE++ CF +G + P + F G + + P +Y + EV C
Sbjct: 342 GL-KLHTVEEQ---FTCFQYTGNVNEGFPVVKFHFNGSLSLTVNPHDYLFQIHEEVWCF- 396
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ ++ GR +LGD L N + +DL N G+ C+
Sbjct: 397 GWQNSGMQSKDGR-DMTLLGDLVLSNKLVLYDLENQAIGWTDYNCS 441
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 140/517 (27%), Positives = 212/517 (41%), Gaps = 115/517 (22%)
Query: 6 FSLICLFSLLILLF---------TTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPL-KI 55
FSL LF L ++F T + G + + +PLS + + D L K
Sbjct: 4 FSLKFLFYTLAVIFFIHFSGLSHTEASNKGGFSTDLISRDSPLSPFYNPSETQFDRLQKA 63
Query: 56 LHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQ 115
H S+SRA H + + S + I++P+ + + G Y +++S GTPP
Sbjct: 64 FHR----SISRANHFR------------ANGVSTNSIQSPV-ISNNGEYLMNISLGTPP- 105
Query: 116 ASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
S I DTGS L+W PC S Y ++ P F P +S + Q++ C+
Sbjct: 106 VSMHGIADTGSDLLWRQCKPCDSCYEQIE-----------PIFDPAKSKTYQILSCEGKS 154
Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPSKT----- 226
CS + G GCS N TC Y YG G T+G L +TL S T
Sbjct: 155 CSNLGGQG------GCSDDN-TCI-----YSYSYGDGSHTSGDLAVDTLTIGSTTGRPVS 202
Query: 227 VPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLG---LKKFSYCLLSRKFDDAPVS 279
VP + GC + + +G+ G G S+ SQL +FSYCL+ +D VS
Sbjct: 203 VPKVVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLIGGRFSYCLVPLG-NDPSVS 261
Query: 280 SNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPY 336
S + + G+G TP S P FYY+ L + VGSK K+ Y
Sbjct: 262 SKMHFGSRGIVSGAGAVSTPLASRQP------------DTFYYLTLESMSVGSK--KLAY 307
Query: 337 -------SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
S L +GN +I+DSG+T T + + + + +G + +
Sbjct: 308 KGFSKVGSPLADADEGN--IIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVR---DPNN 362
Query: 390 GLRPCF-DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLC--LILFTDNAAGP 446
C+ ++SG + +P + F GA + L P N F V ++ C +I +D A
Sbjct: 363 VFSLCYSNLSGLR---IPTITAHFV-GADLELKPLNTFVQVQEDLFCFAMIPVSDLA--- 415
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I G+ NF + +DL + F C
Sbjct: 416 --------IFGNLAQMNFLVGYDLKSRTVSFKPTDCT 444
>gi|414871328|tpg|DAA49885.1| TPA: hypothetical protein ZEAMMB73_545054 [Zea mays]
Length = 565
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 75/246 (30%), Positives = 109/246 (44%), Gaps = 33/246 (13%)
Query: 244 GIAGFGRSSESLPSQ---LGLKKFSYCLLSRKFDDAPVSSNL--VLDTGPGSGDSKTPGL 298
G+ GF R S PSQ + FSYCL S K SSN L GP + +
Sbjct: 344 GLVGFNRGPLSFPSQNKNVYGSVFSYCLPSYK------SSNFSGTLRLGPAGQPKR---I 394
Query: 299 SYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFT 358
TP NP S YYV + I VG + V +P S L G IVD+G+ FT
Sbjct: 395 KTTPLLSNPHRPS-----LYYVNMVGIRVGGRPVAVPASALAFDPASGHGTIVDAGTMFT 449
Query: 359 FMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKM 418
+ P++ AV F ++ RA G C+++ ++ +P + F G +
Sbjct: 450 RLSAPVYAAVCDVFRSRV----RAPVAGPLGGFDTCYNV----TISVPTVTFLFDGRVSV 501
Query: 419 ALPPENYFALVG-NEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFG 476
LP EN + + CL + AAGP+ + ++ Q QN + FD+AN R G
Sbjct: 502 TLPEENVVIRSSLDGIACLAM----AAGPSDSVDAVLNVMASMQQQNHRVLFDVANGRVG 557
Query: 477 FAKQKC 482
F+++ C
Sbjct: 558 FSRELC 563
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 98/391 (25%), Positives = 150/391 (38%), Gaps = 62/391 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y ++ GTP T I DTGSSL W +C CN P R+P F P SSS
Sbjct: 129 YVATVGLGTPAVPQT-LILDTGSSLTWV------QCKPCNSSQCYPQRLPLFDPNTSSSY 181
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT-AGLLLSETLRF 222
+ C + +C ++ GC+ C +Y + YG G T AG ++ L
Sbjct: 182 SPVPCDSQECR-ALAAGIDG--DGCTSDGD---WGC-AYEIHYGSGATPAGEYSTDALTL 234
Query: 223 -PSKTVPNFLAGCSILSDR----QPAGIAGFGRSSESLPSQLGLKK----FSYCLLSRKF 273
P V F GC R G+ G GR +SL Q ++ FS+CL
Sbjct: 235 GPGAIVKRFHFGCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGVFSHCLPPTGV 294
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
++ DT + F P+ + FY + I V + +
Sbjct: 295 STGFLALGAPHDT--------------SAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLD 340
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP 393
IP + GVI DSG+ + ++ + A+ F M Y A V L
Sbjct: 341 IPPAVF------REGVITDSGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGH---LDT 391
Query: 394 CFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT--DNAAGPALGRG 451
CF+ +G +V +P + L F+GGA + L + + G CL ++ D G
Sbjct: 392 CFNFTGYDNVTVPTVSLTFRGGATVHLDASSGVLMDG----CLAFWSSGDEYTG------ 441
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++G + + +D+ + GF C
Sbjct: 442 ---LIGSVSQRTIEVLYDMPGRKVGFRTGAC 469
>gi|222822566|gb|ACM68432.1| xyloglucanase-specific endoglucanase inhibitor protein [Petunia x
hybrida]
Length = 436
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 111/412 (26%), Positives = 164/412 (39%), Gaps = 79/412 (19%)
Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
+L + T TP + D G +W VDC+ V S IPA RS+
Sbjct: 43 TLQYLTQISQRTPLVPVSLTLDLGGQFLW---------VDCDQGYVSSSYIPARC--RSA 91
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
L G C F P GC+ N TC + + + T+G L S+ +
Sbjct: 92 KCSLAGSSG--CGDCFSP----PSPGCN--NNTCGAFPDNSITRTA---TSGELASDIVS 140
Query: 222 FPSKTVPN-----------FLAGCSILSDRQPAGI---AGFGRSSESLPSQLGL-----K 262
S N F+ G + L + +G+ AG GR+ SLPSQ +
Sbjct: 141 VQSSNGKNPGRNVSDKDFLFVCGATFLLNGLASGVKGMAGLGRTRISLPSQFSAEFSFPR 200
Query: 263 KFSYCLLSRK-------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFG 315
KF+ CL S F D P S L S D SYTP + NPV ++SAF
Sbjct: 201 KFAVCLSSTSNSKGVVLFGDGPYS---FLPNREYSSDD----FSYTPLFINPVSTASAFS 253
Query: 316 E-----FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAK 370
Y++G++ I + K V I + L S G GG + + + +T +E ++ AV
Sbjct: 254 SGTPSSEYFIGVKSIKINEKVVPINTTLLSIDSQGVGGTKISTVNPYTILETSIYNAVTN 313
Query: 371 EFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY----LPELILKFKG-GAKMALPPENY 425
F++++ V + CFD S +P + L + + N
Sbjct: 314 FFVKELA----IPTVPSVAPFGVCFDSRNITSTRVGPGVPSIDLVLQNENVFWRIFGANS 369
Query: 426 FALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
LV VLCL F D P +I++G +++ L+FDLA R GF
Sbjct: 370 MVLVSENVLCL-GFVDGGVNPR----TSIVIGGHTIEDNLLQFDLAASRLGF 416
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 108/393 (27%), Positives = 153/393 (38%), Gaps = 73/393 (18%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y I++S G+P A T FI DTGS + W C SR + P SS+
Sbjct: 131 YVITVSIGSPAVAXTMFI-DTGSDVSWLRCKSRL-----------------YDPGTSSTY 172
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
C P C+ + R GCS TC Y ++YG G T G S+TL
Sbjct: 173 APFSCSAPACA-----QLGRRGTGCS-SGSTC-----VYSVKYGDGSNTTGTYGSDTLTL 221
Query: 223 PSKTVP---NFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRK 272
+ P F GCS + + G+ G G ++S SQ FSYCL
Sbjct: 222 AGTSEPLISGFQFGCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCL---- 277
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
+S+ L G + F P+ S FY + LR I VG K +
Sbjct: 278 --PPTWNSSGFLTLG------APSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTL 329
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
+IP S GS IVDSG+ T + + A++ F M Y + + L
Sbjct: 330 EIPSSVFSAGS------IVDSGTVITRLPPTAYGALSAAFRDGMARY-QYQPAAPRGLLD 382
Query: 393 PCFDISGK---KSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
CFD +G + +P + L GGA + L P +V + L D+
Sbjct: 383 TCFDFTGHGEGNNFTVPSVALVLDGGAVVDLHPN---GIVQDGCLAFAATDDD------- 432
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G I+G+ Q + F + +D+ FGF C
Sbjct: 433 -GRTGIIGNVQQRTFEVLYDVGQSVFGFRPGAC 464
>gi|350536203|ref|NP_001234746.1| xyloglucan-specific fungal endoglucanase inhibitor protein
precursor [Solanum lycopersicum]
gi|68449754|gb|AAY97864.1| xyloglucan-specific fungal endoglucanase inhibitor protein
precursor [Solanum lycopersicum]
Length = 438
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 104/410 (25%), Positives = 164/410 (40%), Gaps = 74/410 (18%)
Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
+L + T Q TP + D G +W VDC+ V S PA R
Sbjct: 44 TLQYLTQIQQRTPLVPISLTLDLGGQFLW---------VDCDQGYVSSSYKPA----RCG 90
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETL- 220
S+Q C F P GC N TC L + + T+G L S+ +
Sbjct: 91 SAQCSLGGASGCGECFSPPR----PGCD--NNTCGLLPDNTVTGTA---TSGELASDVVS 141
Query: 221 ------RFPSKTVPN----FLAGCSILSDRQPAGI---AGFGRSSESLPSQLGL-----K 262
+ P ++V + F+ G + L +G+ AG GR+ SLPSQ +
Sbjct: 142 VESSNGKNPGRSVSDKNFLFVCGATFLLQGLASGVKGMAGLGRTKISLPSQFSAEFSFPR 201
Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGP----GSGDSKTPGLSYTPFYKNPVGSSSAFGE-- 316
K + CL S V + GP + YTP + NPV ++SAF
Sbjct: 202 KSALCLTSSSNSKGVV----LFGDGPYFFLPNRQFSNNDFQYTPLFINPVSTASAFSSGQ 257
Query: 317 ---FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI 373
Y++G++ I + K V I + L + G GG + + + +T +E L+ A+ F+
Sbjct: 258 PSSEYFIGVKSIKINQKVVPINTTLLSIDNQGVGGTKISTVNPYTILETSLYNAITNFFV 317
Query: 374 RQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP------ELILKFKGGAKMALPPENYFA 427
+++ N +R A V R CFD S + +L+L+ + N
Sbjct: 318 KELANVTRVAVVAP---FRVCFDSRDIGSTRVGPAVPSIDLVLQ-NANVVWTIFGANSMV 373
Query: 428 LVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
V VLCL + + G +I++G +++ L+FD A R GF
Sbjct: 374 QVSENVLCLGVLDG-----GVNAGTSIVIGGHTIEDNLLQFDHAASRLGF 418
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 101/413 (24%), Positives = 172/413 (41%), Gaps = 94/413 (22%)
Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
S G Y+ L GTPPQ I DTGS++ + PC+S C C + P F P
Sbjct: 73 SNGYYTTRLFIGTPPQEFA-LIVDTGSTVTYVPCSS---CEQCG-----KHQDPRFQPDL 123
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSE 218
SS+ + + C NP C+ C K C +Y +Y + ++G++ +
Sbjct: 124 SSTYRPVKC-NPSCN-------------CDDEGKQC-----TYERRYAEMSSSSGVIAED 164
Query: 219 TLRFPSKTV---PNFLAGCS-----ILSDRQPAGIAGFGRSSESLPSQLGLK-----KFS 265
+ F +++ + GC L ++ GI G GR S+ QL K FS
Sbjct: 165 VVSFGNESELKPQRAVFGCENVETGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFS 224
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVG 321
C +D G G+ S P + ++ + NP S +Y +
Sbjct: 225 LCYGG-------------MDVGGGAMVLGQISPPPNMVFS--HSNPYRSP-----YYNIE 264
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
L+++ V K +K+ D G ++DSG+T+ + F A+ ++++ +
Sbjct: 265 LKELHVAGKPLKLKPKVF----DEKHGTVLDSGTTYAYFPEAAFHALKDAIMKEIRH--- 317
Query: 382 AADVEKKSGLRP-----CFDISGKKSVYL----PELILKFKGGAKMALPPENYF--ALVG 430
+++ G P CF +G++ +L PE+ + F G K++L PENY
Sbjct: 318 ---LKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVFGSGQKLSLSPENYLFRHTKV 374
Query: 431 NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ CL +F + G +LG ++N + +D ND+ GF K C+
Sbjct: 375 SGAYCLGIFQN-------GNDLTTLLGGIVVRNTLVTYDRENDKIGFWKTNCS 420
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 115/407 (28%), Positives = 164/407 (40%), Gaps = 70/407 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTPP T I DTGS ++W C S C C + ++ F SS
Sbjct: 77 GLYFTKVKLGTPPMEFTVQI-DTGSDILWVNCNS---CNGCPRSSGLGIQLNFFDASSSS 132
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
SS L+ C +P C+ F ++ C ++ C SY QYG G T+G +SE++
Sbjct: 133 SSSLVSCSDPICNSAF----QTTATQCLTQSNQC-----SYTFQYGDGSGTSGYYVSESM 183
Query: 221 RFPSKTVPNFLA--------GCSIL-------SDRQPAGIAGFGRSSESLPSQLGL---- 261
F + +A GCS SD GI GFG S+ SQL
Sbjct: 184 YFDMVMGQSMIANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGIT 243
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
K FS+CL + LVL G+ PG+ Y+P + Y +
Sbjct: 244 PKVFSHCLKG----EGNGGGILVL------GEVLEPGIVYSPLVPSQ--------PHYNL 285
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L+ I V + + I S V + N G I+DSG+T + L E F+ +
Sbjct: 286 YLQSISVNGQTLPIDPS--VFATSINRGTIIDSGTTLAY----LVEEAYTPFVSAITAAV 339
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
+ S C+ +S P + L F G A M L PE Y +G F
Sbjct: 340 SQSVTPTISKGNQCYLVSTSVGEIFPLVSLNFAGSASMVLKPEEYLMHLG--------FY 391
Query: 441 DNAAGPALG----RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
D AA +G + ILGD +++ +DLA R G+A C+
Sbjct: 392 DGAALWCIGFQKVQEGVTILGDLVMKDKIFVYDLARQRIGWASYDCS 438
>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
Length = 435
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 103/398 (25%), Positives = 157/398 (39%), Gaps = 69/398 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + +G P Q P FDT + C C+ PAF P RSSS
Sbjct: 88 YRVLAGYGAPAQ-RFPVAFDTNFGVSVLRCKPCVGGAPCD---------PAFEPSRSSSF 137
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
I C +P+C+ C G S CP + +Q+G + G L+ +TL
Sbjct: 138 AAIPCGSPECAV--------ECTGAS-----CP-----FTIQFGNVTVANGTLVRDTLTL 179
Query: 223 -PSKTVPNFLAGC-SILSDRQ----PAGIAGFGRSSESLPSQL-------GLKKFSYCLL 269
PS T F GC + +D G+ RSS SL S++ FSYCL
Sbjct: 180 PPSATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLP 239
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
S + SS L G + + Y P NP +S Y+V L I VG
Sbjct: 240 S----SSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNS-----YFVDLVGISVGG 290
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
+ + +P P G ++++ + FTF+ + A+ F + M Y A
Sbjct: 291 EDLPVP-----PAVFAAHGTLLEAATEFTFLAPAAYAALRDAFRKDMAPYPAAPPFRV-- 343
Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPEN--YFA---LVGNEVLCLILFTDNAA 444
L C++++G S+ +P + L+F GG ++ L YFA V + V CL
Sbjct: 344 -LDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLP 402
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ ++G ++ + +DL R GF +C
Sbjct: 403 AFPVS-----VIGTLAQRSTEVVYDLRGGRVGFIPGRC 435
>gi|384482417|pdb|3VLA|A Chain A, Crystal Structure Of Edgp
Length = 413
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 105/407 (25%), Positives = 167/407 (41%), Gaps = 83/407 (20%)
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
P S + D G +W C Y + P R +SQ + C
Sbjct: 31 PLVSENLVVDLGGRFLWVDCDQNYVS-------------STYRPVRCRTSQCSLSGSIAC 77
Query: 174 SWIF-GPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSK------- 225
F GP GC+ N TC + + ++ G G + + + S
Sbjct: 78 GDCFNGPR-----PGCN--NNTCGVFPENPVINTATG---GEVAEDVVSVESTDGSSSGR 127
Query: 226 --TVPNFLAGCSILSDRQP-----AGIAGFGRSSESLPSQLG-----LKKFSYCLLSRKF 273
TVP F+ C+ S Q G+AG GR+ +LPSQ +KF+ CL
Sbjct: 128 VVTVPRFIFSCAPTSLLQNLASGVVGMAGLGRTRIALPSQFASAFSFKRKFAMCL----- 182
Query: 274 DDAPVSSNLVLDTGPGSGDSKT---------PGLSYTPFYKNPVGSS--SAFGE---FYY 319
SSN V+ G D T L+YTP NPV +S S GE Y+
Sbjct: 183 -SGSTSSNSVIIFG---NDPYTFLPNIIVSDKTLTYTPLLTNPVSTSATSTQGEPSVEYF 238
Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
+G++ I + SK V + S L S G GG + + + +T +E +++AV + FI++
Sbjct: 239 IGVKSIKINSKIVALNTSLLSISSAGLGGTKISTINPYTVLETSIYKAVTEAFIKE---- 294
Query: 380 SRAADVEKKSGLRP---CFDISGKKSVYL----PELILKFKGGAKM-ALPPENYFALVGN 431
S A ++ + + + P CF S L P + L + + + + N + +
Sbjct: 295 SAARNITRVASVAPFGACFSTDNILSTRLGPSVPSIDLVLQSESVVWTITGSNSMVYIND 354
Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
V+CL + G L +I++G QL++ ++FDLA R GF+
Sbjct: 355 NVVCLGVVD---GGSNLRT--SIVIGGHQLEDNLVQFDLATSRVGFS 396
>gi|285741|dbj|BAA03413.1| EDGP precursor [Daucus carota]
Length = 433
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 82/285 (28%), Positives = 131/285 (45%), Gaps = 50/285 (17%)
Query: 226 TVPNFLAGCSILSDRQP-----AGIAGFGRSSESLPSQLG-----LKKFSYCLLSRKFDD 275
TVP F+ C+ S Q G+AG GR+ +LPSQ +KF+ CL
Sbjct: 150 TVPRFIFSCAPTSLLQNLASGVVGMAGLGRTRIALPSQFASAFSFKRKFAMCL------S 203
Query: 276 APVSSNLVLDTGPGSGDSKT---------PGLSYTPFYKNPVGSS--SAFGE---FYYVG 321
SSN V+ G D T L+YTP NPV +S S GE Y++G
Sbjct: 204 GSTSSNSVIIFG---NDPYTFLPNIIVSDKTLTYTPLLTNPVSTSATSTQGEPSVEYFIG 260
Query: 322 LRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSR 381
++ I + SK V + S L S G GG + + + +T +E +++AV + FI++ S
Sbjct: 261 VKSIKINSKIVALNTSLLSISSAGLGGTKISTINPYTVLETSIYKAVTEAFIKE----SA 316
Query: 382 AADVEKKSGLRP---CFDISGKKSVYL----PELILKFKGGAKM-ALPPENYFALVGNEV 433
A ++ + + + P CF S L P + L + + + + N + + V
Sbjct: 317 ARNITRVASVAPFGACFSTDNILSTRLGPSVPSIDLVLQSESVVWTITGSNSMVYINDNV 376
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
+CL + G L +I++G QL++ ++FDLA R GF+
Sbjct: 377 VCLGVVD---GGSNLRT--SIVIGGHQLEDNLVQFDLATSRVGFS 416
>gi|356535355|ref|XP_003536212.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 444
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 78/287 (27%), Positives = 134/287 (46%), Gaps = 47/287 (16%)
Query: 226 TVPNFL--AGCSILSDRQPAGI---AGFGRSSESLPSQLG-----LKKFSYCLLSRKFDD 275
+VP FL G +++ + +G+ AG GR+ SLPSQ L+KF+ CL S +
Sbjct: 151 SVPKFLFICGANVVQNGLASGVTGMAGLGRTKVSLPSQFSSAFSFLRKFAICLSSSTMTN 210
Query: 276 APVSSNLVLDTGP------GSGDSKTPGLSYTPFYKNPVGSSSAF--GE---FYYVGLRQ 324
+ GP S SK L++TP NPV ++ ++ GE Y++G++
Sbjct: 211 GV----MFFGDGPYNFGYLNSDLSKV--LTFTPLITNPVSTAPSYFQGEPSVEYFIGVKS 264
Query: 325 IIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAAD 384
I V K+V + + L +G GG + + + +T +E +++AV++ F++ +G A
Sbjct: 265 IRVSDKNVPLNTTLLSIDRNGIGGTKISTVNPYTVLETTIYKAVSEAFVKAVG----APT 320
Query: 385 VEKKSGLRPCFDISGKKSVYL----PELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
V + CF +S + P++ L + ++ N N+V+CL F
Sbjct: 321 VAPVAPFGTCFATKDIQSTRMGPAVPDINLVLQNEVVWSIIGANSMVYT-NDVICL-GFV 378
Query: 441 DNAAGPALGRG----------PAIILGDFQLQNFYLEFDLANDRFGF 477
D + P+ + +I +G QL+N L+FDLA R GF
Sbjct: 379 DAGSDPSTAQVGFVVGYSQPITSITIGAHQLENNMLQFDLATSRLGF 425
>gi|384482418|pdb|3VLB|A Chain A, Crystal Structure Of Xeg-Edgp
gi|384482420|pdb|3VLB|C Chain C, Crystal Structure Of Xeg-Edgp
Length = 413
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 105/407 (25%), Positives = 167/407 (41%), Gaps = 83/407 (20%)
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
P S + D G +W C Y + P R +SQ + C
Sbjct: 31 PLVSENLVVDLGGRFLWVDCDQNYVS-------------STYRPVRCRTSQCSLSGSIAC 77
Query: 174 SWIF-GPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSK------- 225
F GP GC+ N TC + + ++ G G + + + S
Sbjct: 78 GDCFNGPR-----PGCN--NNTCGVFPENPVINTATG---GEVAEDVVSVESTDGSSSGR 127
Query: 226 --TVPNFLAGCSILSDRQP-----AGIAGFGRSSESLPSQLG-----LKKFSYCLLSRKF 273
TVP F+ C+ S Q G+AG GR+ +LPSQ +KF+ CL
Sbjct: 128 VVTVPRFIFSCAPTSLLQNLASGVVGMAGLGRTRIALPSQFASAFSFKRKFAMCL----- 182
Query: 274 DDAPVSSNLVLDTGPGSGDSKT---------PGLSYTPFYKNPVGSS--SAFGE---FYY 319
SSN V+ G D T L+YTP NPV +S S GE Y+
Sbjct: 183 -SGSTSSNSVIIFG---NDPYTFLPNIIVSDKTLTYTPLLTNPVSTSATSTQGEPSVEYF 238
Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
+G++ I + SK V + S L S G GG + + + +T +E +++AV + FI++
Sbjct: 239 IGVKSIKINSKIVALNTSLLSISSAGLGGTKISTINPYTVLETSIYKAVTEAFIKE---- 294
Query: 380 SRAADVEKKSGLRP---CFDISGKKSVYL----PELILKFKGGAKM-ALPPENYFALVGN 431
S A ++ + + + P CF S L P + L + + + + N + +
Sbjct: 295 SAARNITRVASVAPFGACFSTDNILSTRLGPSVPSIDLVLQSESVVWTITGSNSMVYIND 354
Query: 432 EVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
V+CL + G L +I++G QL++ ++FDLA R GF+
Sbjct: 355 NVVCLGVVD---GGSNLRT--SIVIGGHQLEDNLVQFDLATSRVGFS 396
>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
Length = 165
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 53/170 (31%), Positives = 80/170 (47%), Gaps = 18/170 (10%)
Query: 317 FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
+YYVGL I VG + + IP + S GNGG+IVDSG+ T ++ ++ V F++
Sbjct: 10 YYYVGLVGISVGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNVVRDAFVKGT 69
Query: 377 GNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL 436
+ +V S C+D+S K SV +P + F G + LP +NY V
Sbjct: 70 KDLLATNEV---SLFDTCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPV------- 119
Query: 437 ILFTDNAAGPALGRGPAI----ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
D+ P + I+G+ Q Q + FDLAN GF+ +C
Sbjct: 120 ----DSVGTFCFAFAPTMSSLSIIGNIQQQGTRVSFDLANSLVGFSPNRC 165
>gi|291002742|gb|ADD71503.1| xyloglucanase inhibitor 1 [Humulus lupulus]
Length = 443
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 104/412 (25%), Positives = 171/412 (41%), Gaps = 73/412 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y ++ TPP + D G +W C Y+ SS+
Sbjct: 48 YITQITQRTPP-VQLKVVLDVGGEFLWIDCEKGYK---------------------SSTK 85
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQ--YGLGFTAGLLLSETLR 221
+ + C +P+C V S C+ + + + + T+G L + L
Sbjct: 86 RPVPCGSPQC-------VLSGSGACTTSDNPSDVGVCGVMPNNPFSSVGTSGDLFEDILY 138
Query: 222 FPSK---------TVPNFLAGC---SILSDRQPA--GIAGFGRSSESLPSQLGLKKFSYC 267
S +VPN L C S+L G+AGFGR+ +LPS L FS+
Sbjct: 139 IQSTNGFNPGKQVSVPNLLFSCAPNSLLEGLASGIIGMAGFGRNKVALPS-LFSSAFSF- 196
Query: 268 LLSRKFDDAPVSSNLVLDTG-------PGSGDSKTPGLSYTPFYKNPVGSSSAF----GE 316
RKF SSN V+ G PG S L+YTP +NP S+F
Sbjct: 197 --PRKFGVCLSSSNGVIFFGKEPYVLLPGIDVSDPTSLTYTPLIQNPRSLVSSFEGNPSA 254
Query: 317 FYYVGLRQIIVGSKHVKIPYSYLVPGSDG-NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
Y++G++ I V K +++ + L ++G +GG + + FT +E +++AV F++
Sbjct: 255 EYFIGVKSIKVDGKPLRLNTTLLTFDNEGGHGGTKISTVDPFTTLETSIYKAVVGAFVKA 314
Query: 376 MGNYSRAADVEKKSGLRPCFDIS--GKKSV--YLPELILKFKGGAKMALPPENYFALVGN 431
+G + V+ + CF+ G V +P++ L + ++ N VG+
Sbjct: 315 LG--PKVPRVKAVAPFGACFNAKYIGNTRVGPAVPQIDLVLRNDKLWSIFGANSMVSVGD 372
Query: 432 EVLCLILFTDNAAGPALGRG-----PAIILGDFQLQNFYLEFDLANDRFGFA 478
+VLCL F D + G A+++G Q++N +L FDL R GF+
Sbjct: 373 DVLCL-GFVDGGPLNFVDWGVKFTPTAVVIGGHQIENNFLLFDLGASRLGFS 423
>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 523
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 105/398 (26%), Positives = 159/398 (39%), Gaps = 69/398 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + +G P Q P FDT + C C+ PAF P RSSS
Sbjct: 176 YRVLAGYGAPAQ-RFPVAFDTNFGVSVLRCKPCVGGAPCD---------PAFEPSRSSSF 225
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF 222
I C +P+C+ VE C G S CP + +Q+G + G L+ +TL
Sbjct: 226 AAIPCGSPECA------VE--CTGAS-----CP-----FTIQFGNVTVANGTLVRDTLTL 267
Query: 223 -PSKTVPNFLAGC-SILSDRQ----PAGIAGFGRSSESLPSQL-------GLKKFSYCLL 269
PS T F GC + +D G+ RSS SL S++ FSYCL
Sbjct: 268 PPSATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLP 327
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
S + SS L G + + Y P NP +S Y+V L I VG
Sbjct: 328 S----SSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNS-----YFVDLVGISVGG 378
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
+ + +P P G ++++ + FTF+ + A+ F + M Y A
Sbjct: 379 EDLPVP-----PAVFAAHGTLLEAATEFTFLAPAAYAALRDAFRKDMAPYPAAPPFRV-- 431
Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPEN--YFA---LVGNEVLCLILFTDNAA 444
L C++++G S+ +P + L+F GG ++ L YFA V + V CL
Sbjct: 432 -LDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLP 490
Query: 445 GPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ ++G ++ + +DL R GF +C
Sbjct: 491 AFPVS-----VIGTLAQRSTEVVYDLRGGRVGFIPGRC 523
>gi|223974335|gb|ACN31355.1| unknown [Zea mays]
Length = 91
Score = 82.4 bits (202), Expect = 4e-13, Method: Composition-based stats.
Identities = 42/84 (50%), Positives = 53/84 (63%), Gaps = 6/84 (7%)
Query: 405 LPELILKFKGGAKMALPPENYFALVGN---EVLCLILFTDNAAGPALGR---GPAIILGD 458
LPEL +F+GGA M LP ENYF + G E +CL + TD + G G GPAIILG
Sbjct: 3 LPELSFRFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGS 62
Query: 459 FQLQNFYLEFDLANDRFGFAKQKC 482
FQ QN+ +E+DL +R GF +Q C
Sbjct: 63 FQQQNYLVEYDLEKERLGFRRQSC 86
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 95/388 (24%), Positives = 145/388 (37%), Gaps = 65/388 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + + G+P + D+GS +VW C +C + P +P+ +FI
Sbjct: 127 GEYFVRIGIGSPAIYQY-MVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSASFIG---- 181
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+ C + C+ + +V R C Y + YG G +T G L ET+
Sbjct: 182 ----VACSSNVCNQL-DDDVACRKGRCG------------YQVAYGDGSYTKGTLALETI 224
Query: 221 RFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSE---SLPSQLGLKK---FSYCLLSRKFD 274
+ + GC ++ G AG S QLG + F YCL+SR
Sbjct: 225 TIGRTVIQDTAIGCGHWNEGMFVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVSRAM- 283
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
PV G + P NP + FYYV L + VG V I
Sbjct: 284 --PV------------------GAMWVPLIHNPF-----YPSFYYVSLSGLAVGGIRVPI 318
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
G GGV++D+G+ T + + A FI Q N RA V S C
Sbjct: 319 SEQIFQLTDIGTGGVVMDTGTAITRLPTVAYNAFRDAFIAQTTNLPRAPGV---SIFDTC 375
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
+D++G +V +P + F GG + P N+ + F + +G +
Sbjct: 376 YDLNGFVTVRVPTVSFYFSGGQILTFPARNFLIPADDVGTFCFAFAPSPSGLS------- 428
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+G+ Q + + D N GF C
Sbjct: 429 IIGNIQQEGIQVSIDGTNGFVGFGPNVC 456
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 119/448 (26%), Positives = 175/448 (39%), Gaps = 75/448 (16%)
Query: 49 DSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISL 108
D L+ +H SS RH + ++ +T + + LS+ S G Y +
Sbjct: 4 DEARLRWIHHRIQSS--DHRHRRGRSLLQTAQ-----------VSSGLSLGS-GEYFARM 49
Query: 109 SFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
G+P Q S DTGS + W PC+S Y VD P + P SSS +
Sbjct: 50 GIGSP-QRSYYLELDTGSDVTWIQCAPCSSCYSQVD-----------PIYDPSNSSSYRR 97
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF-- 222
+ C + C + S C+G + C SY + YG ++G L E+
Sbjct: 98 VYCGSALCQAL----DYSACQG---------MGC-SYRVVYGDSSASSGDLGIESFYLGP 143
Query: 223 -PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDD 275
S + N GC + R AG+ G G + S SQ+ FSYCL+ R
Sbjct: 144 NSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQL 203
Query: 276 APVSSNLVLDTGPGSGDSKTP-GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
SS L+ G + P +TP KNP FYY L I VG + I
Sbjct: 204 QSRSSPLIF------GRTAIPFAARFTPLLKNP-----RIDTFYYAILTGISVGGTALPI 252
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
P + +G GG I+DSG++ T + + + + N A V L C
Sbjct: 253 PPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAYRAASRNLPPAPGVYL---LDTC 309
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
F+ G +V +P L+L F M LP N V + F ++ P
Sbjct: 310 FNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTFCLAFAPSSM-------PIS 362
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++G+ Q Q F + FDL A ++C
Sbjct: 363 VIGNVQQQTFRIGFDLQRSLIAIAPREC 390
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 112/434 (25%), Positives = 164/434 (37%), Gaps = 86/434 (19%)
Query: 95 PLSVHSY---GGYSISLSFGTPPQASTPFIF--DTGSSLVWFPC-----TSRYRCVDCNF 144
PL+ +Y G Y + GTP Q PF+ DTGS L W C + +
Sbjct: 83 PLTSAAYTGIGQYFVRFRVGTPAQ---PFLLVADTGSDLTWVKCRPAKAAAASTNSSSSA 139
Query: 145 PNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL 204
P R AF P++S + I C + CS K TCP P
Sbjct: 140 SASSPRR--AFRPEKSKTWAPIPCASDTCS-----------KSLPFSLSTCP--TPGSPC 184
Query: 205 QYGLGFTAGLLLSETLRFPSKTVP------------------NFLAGCSIL----SDRQP 242
Y + G T+ S T+ + GC+ S
Sbjct: 185 AYDYRYKDGSAARGTVGTESATIALSSSSSSSKNKVKKAKLQGLVLGCTGSYTGPSFEAS 244
Query: 243 AGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDS------ 293
G+ G S+ S S + +FSYCL+ +P ++ L GP S S
Sbjct: 245 DGVLSLGYSNVSFASHAASRFGGRFSYCLVDHL---SPRNATSYLTFGPNSALSGPCPAA 301
Query: 294 KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDS 353
PG TP + S FY V ++ I V + +KIP V DG GGVIVDS
Sbjct: 302 AGPGARQTPLVLD-----SRMRPFYDVSIKAISVDGELLKIPRD--VWEVDGGGGVIVDS 354
Query: 354 GSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG----KKSVYLPELI 409
G++ T + P + AV +++ + R A C++ + + LP+L
Sbjct: 355 GTSLTVLAKPAYRAVVAALGKKLARFPRVA----MDPFEYCYNWTSPSRKDEGDDLPKLA 410
Query: 410 LKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEF 468
+ F G A++ P ++Y V C+ G G P I ++G+ Q EF
Sbjct: 411 VHFAGSARLEPPSKSYVIDAAPGVKCI--------GVQEGPWPGISVIGNILQQEHLWEF 462
Query: 469 DLANDRFGFAKQKC 482
DL N R F + +C
Sbjct: 463 DLKNRRLRFKRSRC 476
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 102/379 (26%), Positives = 149/379 (39%), Gaps = 71/379 (18%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
+ DT S + W +C C P P + + P +SSSS + C +P C+ + GP
Sbjct: 171 MVLDTASDVTWV------QCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQL-GP 223
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF-PSKTVPNFLAGC--- 234
GC+ N+ Y ++Y G TAG +S+ L P+ V +F GC
Sbjct: 224 YA----NGCTNNNQC------QYRVRYPDGTSTAGTYISDLLTITPATAVRSFQFGCSHG 273
Query: 235 ---SILSDRQPAGIAGFGRSSESLPSQLGL---KKFSYCL---LSRKFDDAPVSSNLVLD 285
S AGI G ESL SQ + FS+C R F L
Sbjct: 274 VQGSFSFGSSAAGIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGF--------FTL- 324
Query: 286 TGPGSGDSKTPGLSY--TPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
G + Y TP KNP + FY V L I V + + +P +
Sbjct: 325 -----GVPRVAAWRYVLTPMLKNPAIPPT----FYMVRLEAIAVAGQRIAVPPTVFA--- 372
Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
G +DS + T + ++A+ + F +M Y A K L C+D++G +S
Sbjct: 373 ---AGAALDSRTAITRLPPTAYQALRQAFRDRMAMYQPA---PPKGPLDTCYDMAGVRSF 426
Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
LP + L F A + L P G + FT AGP + P II G+ QLQ
Sbjct: 427 ALPRITLVFDKNAAVELDPS------GVLFQGCLAFT---AGPN-DQVPGII-GNIQLQT 475
Query: 464 FYLEFDLANDRFGFAKQKC 482
+ +++ GF C
Sbjct: 476 LEVLYNIPAALVGFRHAAC 494
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 103/379 (27%), Positives = 149/379 (39%), Gaps = 71/379 (18%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
+ DT S + W +C C P P + + P +SSSS + C +P C+ + GP
Sbjct: 146 MVLDTASDVTWV------QCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQL-GP 198
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF-PSKTVPNFLAGC--- 234
GC+ N C Y ++Y G TAG +S+ L P+ V +F GC
Sbjct: 199 YA----NGCT-NNNQC-----QYRVRYPDGTSTAGTYISDLLTITPATAVRSFQFGCSHG 248
Query: 235 ---SILSDRQPAGIAGFGRSSESLPSQLGL---KKFSYCL---LSRKFDDAPVSSNLVLD 285
S AGI G ESL SQ + FS+C R F L
Sbjct: 249 VQGSFSFGSSAAGIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGF--------FTL- 299
Query: 286 TGPGSGDSKTPGLSY--TPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
G + Y TP KNP + FY V L I V + + +P +
Sbjct: 300 -----GVPRVAAWRYVLTPMLKNPAIPPT----FYMVRLEAIAVAGQRIAVPPTVFA--- 347
Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
G +DS + T + ++A+ + F +M Y A K L C+D++G +S
Sbjct: 348 ---AGAALDSRTAITRLPPTAYQALRQAFRDRMAMYQPA---PPKGPLDTCYDMAGVRSF 401
Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
LP + L F A + L P G + FT AGP + P II G+ QLQ
Sbjct: 402 ALPRITLVFDKNAAVELDPS------GVLFQGCLAFT---AGPN-DQVPGII-GNIQLQT 450
Query: 464 FYLEFDLANDRFGFAKQKC 482
+ +++ GF C
Sbjct: 451 LEVLYNIPAALVGFRHAAC 469
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 104/404 (25%), Positives = 166/404 (41%), Gaps = 80/404 (19%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y+ L GTPPQ I D+GS++ + PC S +C + + P F P SS
Sbjct: 86 GYYTTRLHIGTPPQEFA-LIVDSGSTVTYVPCASCEQCGN--------HQDPRFQPDLSS 136
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ + C NV+ C S +N+ +Y QY + ++G+L + +
Sbjct: 137 TYSPVKC-----------NVDCTCD--SDKNQC------TYERQYAEMSSSSGVLGEDIV 177
Query: 221 RFPSKTV---PNFLAGCS-----ILSDRQPAGIAGFGRSSESLPSQLGLK-----KFSYC 267
F +++ + GC L + GI G GR S+ QL K FS C
Sbjct: 178 SFGTESELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMC 237
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV 327
+VL P PG+ YT + N V S +Y + L+++ V
Sbjct: 238 YGGMDIG----GGAMVLGAMPAP-----PGMIYT--HSNAVRSP-----YYNIELKEMHV 281
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS--RAADV 385
K +++ DG G ++DSG+T+ ++ F A Q+ R D
Sbjct: 282 AGKALRVDPRIF----DGKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDS 337
Query: 386 EKKSGLRPCFDISGKK----SVYLPELILKFKGGAKMALPPENY-FALVGNE-VLCLILF 439
K CF +G+ S P++ + F G K++L PENY F E CL +F
Sbjct: 338 NYKD---ICFAGAGRNVSQLSEVFPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVF 394
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ G+ P +LG ++N + +D N++ GF K C+
Sbjct: 395 QN-------GKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCS 431
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 101/377 (26%), Positives = 150/377 (39%), Gaps = 65/377 (17%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
+ DT S + W +C C + P +S SS+ C +P C + GP
Sbjct: 184 MLLDTASDVAWV------QCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQL-GP 236
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF-PSKTVPNFLAGCS-- 235
GCS + + Y ++Y G T+G L+++ L P+ VP F GCS
Sbjct: 237 YA----NGCSSSSNSAGQC--QYRVRYPDGSTTSGTLVADQLSLSPTSQVPKFEFGCSHA 290
Query: 236 ---ILSDRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSN---LVLDT 286
S + AGI GR +SL SQ K FSYC P +S+ VL
Sbjct: 291 ARGSFSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCF-------PPTASHKGFFVLGV 343
Query: 287 GPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGN 346
P S+ + TP K P+ Y V L I V + + +P +
Sbjct: 344 -PRRSSSR---YAVTPMLKTPM--------LYQVRLEAIAVAGQRLDVPPTVFA------ 385
Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP 406
G +DS + T + ++A+ F +M Y AA L C+D +G S+ LP
Sbjct: 386 AGAALDSRTVITRLPPTAYQALRSAFRDKMSMYRPAA---ANGQLDTCYDFTGVSSIMLP 442
Query: 407 ELILKF-KGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFY 465
+ L F + GA + L P L G+ CL + A G I+G QLQ
Sbjct: 443 TISLVFDRTGAGVQLDPSG--VLFGS---CLAFASTAGDDRATG-----IIGFLQLQTIE 492
Query: 466 LEFDLANDRFGFAKQKC 482
+ +++A GF + C
Sbjct: 493 VLYNVAGGSVGFRRGAC 509
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 111/417 (26%), Positives = 172/417 (41%), Gaps = 82/417 (19%)
Query: 93 KTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDP 149
+TP+SVH Y Y + LS GTPP + + DTGS L+W PCT+ Y+ ++
Sbjct: 49 QTPVSVHHYD-YLMELSIGTPPVKTYAQV-DTGSDLIWLQCIPCTNCYKQLN-------- 98
Query: 150 SRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY-GL 208
P F P+ SS+ I + CS ++ CSP C +Y Y
Sbjct: 99 ---PMFDPQSSSTYSNIAYGSESCSKLYS-------TSCSPDQNNC-----NYTYSYEDD 143
Query: 209 GFTAGLLLSETLRFPSKT-----VPNFLAGC-----SILSDRQPAGIAGFGRSSESLPSQ 258
T G+L ETL S T + + GC + +D++ GI G GR SL SQ
Sbjct: 144 SITEGVLAQETLTLTSTTGKPVALKGVIFGCGHNNNGVFNDKE-MGIIGLGRGPLSLVSQ 202
Query: 259 LGL----KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
+G K FS CL+ + + S + G GS + G+ TP S +
Sbjct: 203 IGSSFGGKMFSQCLVPFHTNPSITSP---MSFGKGS-EVLGNGVVSTPLV-----SKNTH 253
Query: 315 GEFYYVGLRQIIVGSKHVKIPY---SYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
FY+V L I V + + +P+ S L P + GN +++DSG+ T + + + +E
Sbjct: 254 QAFYFVTLLGISV--EDINLPFNDGSSLEPITKGN--MVIDSGTPTTLLPEDFYHRLVEE 309
Query: 372 FIRQMGNYSRAAD---VEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFAL 428
++ A D ++ G + C+ ++ L F+ GA + L P F
Sbjct: 310 VRNKV-----ALDPIPIDPTLGYQLCYRT--PTNLKGTTLTAHFE-GADVLLTPTQIFIP 361
Query: 429 VGNEVLCLILFT--DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
V + + C + N G I G+ N+ + FDL F C
Sbjct: 362 VQDGIFCFAFTSTFSNEYG---------IYGNHAQSNYLIGFDLEKQLVSFKATDCT 409
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 161/391 (41%), Gaps = 59/391 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + S + DTGSSL W C+ V C+ + P F PK SS
Sbjct: 125 GNYVTRMGLGTPAK-SYVMVVDTGSSLTWLQCSPCV--VSCHRQSG-----PVFNPKASS 176
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
S + C +CS + + CS N Y YG F+ G L +T+
Sbjct: 177 SYASVSCSAQQCSDLTTATLNP--ASCSTSNVCI------YQASYGDSSFSVGYLSKDTV 228
Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
F S +VPNF GC ++ Q AG+ G R+ SL QL FSYCL +
Sbjct: 229 SFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSS 288
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-- 332
+ S + G SYT P+ SSS Y++ + I V K +
Sbjct: 289 SSGYLSIGSYNPGQ---------YSYT-----PMASSSLDDSLYFIKMTGIKVAGKPLSV 334
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
+P I+DSG+ T + ++ A++K M RA+ S L
Sbjct: 335 SSSAYSSLP-------TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAF---SILD 384
Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
CF + + +PE+ + F GGA + L N V + CL A PA
Sbjct: 385 TCFQGQAAR-LRVPEVTMAFAGGAALKLAARNLLVDVDSATTCL------AFAPARS--- 434
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A I+G+ Q Q F + +D+ N + GFA C+
Sbjct: 435 AAIIGNTQQQTFSVVYDVKNSKIGFAAAGCS 465
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 153/378 (40%), Gaps = 71/378 (18%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
+ DTGS + W C +C + F P S++ + + C + C
Sbjct: 3 LLIDTGSDITWIQCDPCPQCYK--------QQDSLFQPAGSATYKPLPCNSTMCQ----- 49
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSK-----TVPNFLAG 233
++S C N +C +Y++ YG T G ETL S +VPNF G
Sbjct: 50 QLQSFSHSC--LNSSC-----NYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFG 102
Query: 234 CSILSD---RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNL---VL 284
C + AG+ G G+SS P+Q + K FSYCL S VSS + +L
Sbjct: 103 CGHANKGLFNGAAGLMGLGKSSIGFPAQTSVAFGKVFSYCLPS-------VSSTIPSGIL 155
Query: 285 DTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSD 344
G + L Y + V SSS + Y+V + I VG + + I
Sbjct: 156 HFG------EAAMLDYDVRFTPLVDSSSGPSQ-YFVSMTGINVGDELLPI---------- 198
Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVY 404
+ V+VDSG+ + E +E + F + + A V CF +S +
Sbjct: 199 -SATVMVDSGTVISRFEQSAYERLRDAFTQILPGLQTAVSVAP---FDTCFRVSTVDDIN 254
Query: 405 LPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNF 464
+P + L F+ A++ L P + V + V+C A + GR +LG+FQ QN
Sbjct: 255 IPLITLHFRDDAELRLSPVHILYPVDDGVMCFAF-----APSSSGRS---VLGNFQQQNL 306
Query: 465 YLEFDLANDRFGFAKQKC 482
+D+ R G + +C
Sbjct: 307 RFVYDIPKSRLGISAFEC 324
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 149/382 (39%), Gaps = 65/382 (17%)
Query: 53 LKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGT 112
+ + ++AS R ++L T KT I V Y + + GT
Sbjct: 3 VNTVITMASKDPERLKYLSTLADQKTTAVPIAPGQQ---------VLKIANYVVRVKLGT 53
Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
P Q + DT + W PC+ C C+ F+P S++ + C +
Sbjct: 54 PGQQMF-MVLDTSNDAAWVPCSG---CTGCSSTT--------FLPNASTTLGSLDCSEAQ 101
Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPSYLL---QYGLGFT-AGLLLSETLRFPSKTVP 228
CS + R +CP S L YG + A L+ + + + +P
Sbjct: 102 CSQV--------------RGFSCPATGSSACLFNQSYGGDSSLAATLVQDAITLANDVIP 147
Query: 229 NFLAGC-SILSDRQ--PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNL 282
F GC + +S P G+ G GR SL SQ G FSYCL S F S +L
Sbjct: 148 GFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPS--FKSYYFSGSL 205
Query: 283 VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPG 342
L GP G K+ + TP +NP S YYV L + VG V IP LV
Sbjct: 206 KL--GP-VGQPKS--IRTTPLLRNPHRPS-----LYYVNLTGVSVGRIKVPIPSEQLVFD 255
Query: 343 SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS 402
+ G I+DSG+ T P++ A+ EF +Q+ + CF + +
Sbjct: 256 PNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVN-----GPISSLGAFDTCFAATNEAE 310
Query: 403 VYLPELILKFKGGAKMALPPEN 424
P + L F+ G + LP EN
Sbjct: 311 A--PAVTLHFE-GLNLVLPMEN 329
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 105/388 (27%), Positives = 151/388 (38%), Gaps = 58/388 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNF-PNVDPSRIPAFIPKRSSS 162
Y ++ S GTP A T DTGS L W +C C+ P+ + P F P +SSS
Sbjct: 140 YVVTASLGTPGVAQT-MEVDTGSDLSWV------QCKPCSAAPSCYSQKDPLFDPAQSSS 192
Query: 163 SQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLR 221
+ C P C+ G + + C A Y++ YG G T G+ S+TL
Sbjct: 193 YAAVPCGGPVCA---GLGIYA--------ASACSAAQCGYVVSYGDGSNTTGVYSSDTLT 241
Query: 222 F-PSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
S V F GC G+ G GR SL Q FSYCL ++
Sbjct: 242 LSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKP-- 299
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
S+ L G G PG S T +P + +Y V L I VG + + +
Sbjct: 300 ----STAGYLTLGLGGPSGAAPGFSTTQLLPSPNAPT-----YYVVMLTGISVGGQQLSV 350
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
P S GG +VD+G+ T + + A+ F M +Y L C
Sbjct: 351 PASAFA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGY-PTAPSNGILDTC 403
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAI 454
++ +G +V LP + L F GA + L + ++ F A P+ G
Sbjct: 404 YNFAGYGTVTLPNVALTFGSGATVMLGADG-----------ILSFGCLAFAPSGSDGGMA 452
Query: 455 ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
ILG+ Q ++F + D GF C
Sbjct: 453 ILGNVQQRSFEVRID--GTSVGFKPSSC 478
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 97/371 (26%), Positives = 147/371 (39%), Gaps = 60/371 (16%)
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
+ D+ S + W +CV C P P + P RS SS C +P C+ + GP
Sbjct: 162 VLDSASDVPWV------QCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTAL-GPY 214
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPS-KTVPNFLAGCSILS 238
GC+ N C YL++Y G T+G +++ L + V F GCS
Sbjct: 215 A----NGCA--NNQC-----QYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAE 263
Query: 239 ----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
D + AGI G ESL SQ + FSYC+ + D G
Sbjct: 264 QGSFDARAAGIMALGGGPESLLSQTASRYGNAFSYCIPATASDS-------------GFF 310
Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
P + + + P+ FY V LR I VG + + + + GS ++
Sbjct: 311 TLGVPRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGS------VL 364
Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
DS + T + ++A+ F M Y A K L C+D +G ++ LP++ L
Sbjct: 365 DSRTAITRLPPTAYQALRSAFRSSMTMYRSA---PPKGYLDTCYDFTGVVNIRLPKISLV 421
Query: 412 FKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
F A + L P ++ N+ L FT NA G +LG Q Q + +D+
Sbjct: 422 FDRNAVLPLDPS---GILFNDCLA---FTSNADDRMPG-----VLGSVQQQTIEVLYDVG 470
Query: 472 NDRFGFAKQKC 482
GF + C
Sbjct: 471 GGAVGFRQGAC 481
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 128/486 (26%), Positives = 200/486 (41%), Gaps = 83/486 (17%)
Query: 25 AGSSAATVTVPLT-----PLSTKHYLHHSDSDPLKILHS-LASSSLSRARHLKTKTKPKT 78
A S + T+P T P S L H DS P +S + S L R +++ ++
Sbjct: 9 AASCSLLATLPFTEPSKTPSSFTIDLIHHDSPPSPFYNSSMTRSQLIRNAAMRSISR-AN 67
Query: 79 KDSNIGSNYSNSLIKT---PLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTS 135
+ S S+ N L ++ P+ + + G Y + + GTP I DTGS L W C+
Sbjct: 68 QLSLSLSHSLNQLKESSPEPIIIPNNGNYLMRIYIGTP-SVERLAIADTGSDLTWVQCSP 126
Query: 136 RYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWI-FGPNVESRCKGCSPRNKT 194
C+ P + P SS+ L+ C + C+ + + V S C
Sbjct: 127 ------CDNTKCFAQNTPLYDPLNSSTFTLLPCDSQPCTQLPYSQYVCSDYGDCI----- 175
Query: 195 CPLACPSYLLQYGLG-FTAGLLLSETLRFPSKTV---PNFLAGC----SILSDR--QPAG 244
Y YG ++ G L S+++R + GC +D+ + G
Sbjct: 176 -------YAYTYGDNSYSYGGLSSDSIRLMLLQLHYNSKICFGCGFQNKFTADKSGKTTG 228
Query: 245 IAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGD-SKTPGLSY 300
I G G SL SQLG + KFSYCLL P SSN G + G+
Sbjct: 229 IVGLGAGPLSLVSQLGDEIGHKFSYCLL-------PFSSNSNSKLKFGEAAIVQGNGVVS 281
Query: 301 TPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFM 360
TP P FYY+ L I VG+K VK +DGN +I+DSGST T++
Sbjct: 282 TPLIIKPDL------PFYYLNLEGITVGAKTVK------TGQTDGN--IIIDSGSTLTYL 327
Query: 361 EGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFD--ISGKKSVYL-PELILKFKGGAK 417
E E+ EF+ + VE+ + FD + K+ + P+++ F GG
Sbjct: 328 E----ESFYNEFVSLV---KETVAVEEDQYIPYPFDFCFTYKEGMSTPPDVVFHFTGG-D 379
Query: 418 MALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
+ L P N L+ + ++C + + G A I G+ +F++ +D+ + F
Sbjct: 380 VVLKPMNTLVLIEDNLICSTVVPSHFDGIA-------IFGNLGQIDFHVGYDIQGGKVSF 432
Query: 478 AKQKCA 483
A C+
Sbjct: 433 APTDCS 438
>gi|21537233|gb|AAM61574.1| EDGP precursor [Arabidopsis thaliana]
Length = 433
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 76/260 (29%), Positives = 121/260 (46%), Gaps = 43/260 (16%)
Query: 243 AGIAGFGRSSESLPSQLGL-----KKFSYCLLSRK----FDDAPVSSNLVLDTGPGSGDS 293
G+AG GR + LPSQ +KF+ CL S K F + P + L PG
Sbjct: 174 VGMAGMGRHNIGLPSQFAAAFSFHRKFAVCLTSGKGVAFFGNGPY---VFL---PGI--- 224
Query: 294 KTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIVGSKHVKI-PYSYLVPGSDGNG 347
+ L TP NPV ++SAF + Y++G+ I + K V I P + S G G
Sbjct: 225 QISSLQTTPLLINPVSTASAFSQGEKSSEYFIGVTAIQIVEKTVPINPTLLKINASTGFG 284
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP---CFDISGKKSVY 404
G + S + +T +E ++ A EF++Q + A +++ + ++P CF
Sbjct: 285 GTKISSVNPYTVLESSIYNAFTSEFVKQ----ALARSIKRVASVKPFGACFSTKNVGVTR 340
Query: 405 LP------ELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
L EL+L K + N V ++V+CL F D ++++G
Sbjct: 341 LGYAVPEIELVLHSKD-VVWRIFGANSMVSVSDDVICL-GFVDGGVNAR----TSVVIGG 394
Query: 459 FQLQNFYLEFDLANDRFGFA 478
FQL++ +EFDLA++RFGF+
Sbjct: 395 FQLEDNLIEFDLASNRFGFS 414
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 116/462 (25%), Positives = 193/462 (41%), Gaps = 104/462 (22%)
Query: 52 PLKILHSLASSSLSRAR-HLKTKTKPKTKDSNIGSNYSNSLIKTPL--SVHSYGGYSISL 108
PL + +S +LS +R HL+ ++S + + PL + YG Y+ +
Sbjct: 48 PLTLSAPNSSRTLSHSRRHLQRS-----------ESHSTATARMPLYDDLIPYGYYTTRI 96
Query: 109 SFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGC 168
GTPPQ + I DTGS+L + PC++ +C PN F P SS+ Q + C
Sbjct: 97 WIGTPPQ-TFALIVDTGSTLTYVPCSTCEQCGKHQDPN--------FQPDWSSTYQPLKC 147
Query: 169 QNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF----- 222
+ +C+ C C Y QY + ++G+L + + F
Sbjct: 148 -SMECT-------------CDSEMMHC-----VYDRQYAEMSSSSGVLGEDIVSFGKQSE 188
Query: 223 --PSKTV---PNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK-----KFSYCLLSRK 272
P +TV N G I S R GI G GR S+ QL K FS C
Sbjct: 189 LKPQRTVFGCENVETG-DIYSQRAD-GIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGG-- 244
Query: 273 FDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
+D G G+ G S G+ +T + +P S+ +Y + L++I +
Sbjct: 245 -----------MDVGGGAMVLGGISPPAGMVFT--HSDPARSA-----YYNIDLKEIHIA 286
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
K ++P + +V DG G I+DSG+T+ ++ P F+A ++++ + ++
Sbjct: 287 GK--QLPINPMV--FDGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRN 342
Query: 389 SGLRPCF-----DISGKKSVYLPELILKFKGGAKMALPPENYF--ALVGNEVLCLILFTD 441
CF D+S + S P + L F G +++L PENY + CL +F +
Sbjct: 343 YN-DICFSGVGSDVS-QLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQN 400
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+LG ++N + +D + + GF K C+
Sbjct: 401 E-------NDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNCS 435
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 104/404 (25%), Positives = 166/404 (41%), Gaps = 80/404 (19%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y+ L GTPPQ I D+GS++ + PC S +C + + P F P SS
Sbjct: 86 GYYTTRLHIGTPPQEFA-LIVDSGSTVTYVPCASCEQCGN--------HQDPRFQPDLSS 136
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ + C NV+ C S +N+ +Y QY + ++G+L + +
Sbjct: 137 TYSPVKC-----------NVDCTCD--SDKNQC------TYERQYAEMSSSSGVLGEDIV 177
Query: 221 RFPSKTV---PNFLAGCS-----ILSDRQPAGIAGFGRSSESLPSQLGLK-----KFSYC 267
F +++ + GC L + GI G GR S+ QL K FS C
Sbjct: 178 SFGTESELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMC 237
Query: 268 LLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV 327
+VL P PG+ YT + N V S +Y + L+++ V
Sbjct: 238 YGGMDIG----GGAMVLGAMPAP-----PGMIYT--HSNAVRSP-----YYNIELKEMHV 281
Query: 328 GSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS--RAADV 385
K +++ DG G ++DSG+T+ ++ F A Q+ R D
Sbjct: 282 AGKALRVDPRIF----DGKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDP 337
Query: 386 EKKSGLRPCFDISGKK----SVYLPELILKFKGGAKMALPPENY-FALVGNE-VLCLILF 439
K CF +G+ S P++ + F G K++L PENY F E CL +F
Sbjct: 338 NYKD---ICFAGAGRNVSQLSEVFPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVF 394
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ G+ P +LG ++N + +D N++ GF K C+
Sbjct: 395 QN-------GKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCS 431
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 100/407 (24%), Positives = 155/407 (38%), Gaps = 68/407 (16%)
Query: 99 HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC-VDCNFPNVDPSRIPAFIP 157
S G Y + GTP + DTGS ++W C RC + + P + A
Sbjct: 80 ESIGLYFAKIGLGTPSR-DFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDVDA--- 135
Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLL 216
SS+++ + C + CS++ N S C S TC Y++ YG G T G L+
Sbjct: 136 --SSTAKSVSCSDNFCSYV---NQRSECHSGS----TCQ-----YVIMYGDGSSTNGYLV 181
Query: 217 SETL--------RFPSKTVPNFLAGCS-----ILSDRQPA--GIAGFGRSSESLPSQLG- 260
+ + R T + GC L + Q A GI GFG+S+ S SQL
Sbjct: 182 KDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLAS 241
Query: 261 ----LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE 316
+ F++CL D G G + P+ S SA
Sbjct: 242 QGKVKRSFAHCL----------------DNNNGGGIFAIGEVVSPKVKTTPMLSKSAH-- 283
Query: 317 FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM 376
Y V L I VG+ +++ + G D GVI+DSG+T ++ ++ + E +
Sbjct: 284 -YSVNLNAIEVGNSVLELSSNAFDSGDDK--GVIIDSGTTLVYLPDAVYNPLLNEILASH 340
Query: 377 GNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL 436
+ E + CF + K + P + +F +A+ P Y V + C
Sbjct: 341 PELTLHTVQESFT----CFHYTDKLDRF-PTVTFQFDKSVSLAVYPREYLFQVREDTWCF 395
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N G ILGD L N + +D+ N G+ C+
Sbjct: 396 GW--QNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 440
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 116/462 (25%), Positives = 193/462 (41%), Gaps = 104/462 (22%)
Query: 52 PLKILHSLASSSLSRAR-HLKTKTKPKTKDSNIGSNYSNSLIKTPL--SVHSYGGYSISL 108
PL + +S +LS +R HL+ ++S + + PL + YG Y+ +
Sbjct: 48 PLTLSAPNSSRTLSHSRRHLQRS-----------ESHSTATARMPLYDDLIPYGYYTTRI 96
Query: 109 SFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGC 168
GTPPQ + I DTGS+L + PC++ +C PN F P SS+ Q + C
Sbjct: 97 WIGTPPQ-TFALIVDTGSTLTYVPCSTCEQCGKHQDPN--------FQPDWSSTYQPLKC 147
Query: 169 QNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRF----- 222
+ +C+ C C Y QY + ++G+L + + F
Sbjct: 148 -SMECT-------------CDSEMMHC-----VYDRQYAEMSSSSGVLGEDIVSFGKQSE 188
Query: 223 --PSKTV---PNFLAGCSILSDRQPAGIAGFGRSSESLPSQLGLK-----KFSYCLLSRK 272
P +TV N G I S R GI G GR S+ QL K FS C
Sbjct: 189 LKPQRTVFGCENVETG-DIYSQRAD-GIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGG-- 244
Query: 273 FDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
+D G G+ G S G+ +T + +P S+ +Y + L++I +
Sbjct: 245 -----------MDVGGGAMVLGGISPPAGMVFT--HSDPARSA-----YYNIDLKEIHIA 286
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
K ++P + +V DG G I+DSG+T+ ++ P F+A ++++ + ++
Sbjct: 287 GK--QLPINPMV--FDGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRN 342
Query: 389 SGLRPCF-----DISGKKSVYLPELILKFKGGAKMALPPENYF--ALVGNEVLCLILFTD 441
CF D+S + S P + L F G +++L PENY + CL +F +
Sbjct: 343 YN-DICFSGVGSDVS-QLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQN 400
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+LG ++N + +D + + GF K C+
Sbjct: 401 E-------NDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNCS 435
>gi|295646769|gb|ADG23123.1| xyloglucan specific endoglucanase inhibitor [Solanum melongena]
Length = 437
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 106/412 (25%), Positives = 162/412 (39%), Gaps = 79/412 (19%)
Query: 107 SLSFGTPPQASTPFI-----FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
+L + T Q TP + D G +W VDC+ V S PA RS+
Sbjct: 44 TLQYLTQIQQRTPLVPISLTLDLGGQFLW---------VDCDQGYVSSSYKPARC--RSA 92
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
L G C F P GC+ N TC L + + G G L S+ +
Sbjct: 93 QCSLAGAS--ACGECFSPPRP----GCN--NNTCSLFPDNTVTGTATG---GELASDIVS 141
Query: 222 FPSKTVPN-----------FLAGCSILSDRQPAGI---AGFGRSSESLPSQLGL-----K 262
S N F+ G + L +G+ AG GR+ SLPSQ +
Sbjct: 142 VQSSNGKNPGRNVSDKNFLFVCGATFLLQGLASGVKGMAGLGRTRISLPSQFSAEFSFPR 201
Query: 263 KFSYCLLSRK------FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE 316
KF+ CL S F D P L S + YTP + NPV +++AF
Sbjct: 202 KFALCLTSSNSKGVVLFGDGPY---FFLPNKEFSNND----FQYTPLFINPVSTAAAFSS 254
Query: 317 -----FYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
Y++G++ I + K V I + L + G GG + + + +T ME L+ A+
Sbjct: 255 GQPSSEYFIGVKSIKINQKVVPINTTLLSIDNQGVGGTKLSTVNPYTVMETSLYNAITNF 314
Query: 372 FIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP------ELILKFKGGAKMALPPENY 425
F++++ N +R A V CFD S + +L+L+ + + N
Sbjct: 315 FVKELANVTRVAPVTP---FGACFDSRNIGSTRVGPAVPWIDLVLQNQ-NVVWTIFGANS 370
Query: 426 FALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
V VLCL + + +I++G +++ L+FD A R GF
Sbjct: 371 MVQVSENVLCLGIVDG-----GVNARTSIVIGGHTIEDNLLQFDHAASRLGF 417
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 149/382 (39%), Gaps = 65/382 (17%)
Query: 53 LKILHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGT 112
+ + ++AS R ++L T KT I V Y + + GT
Sbjct: 3 VNTVITMASKDPERLKYLSTLADQKTTAVPIAPGQQ---------VLKIANYVVRVKLGT 53
Query: 113 PPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPK 172
P Q + DT + W PC+ C C+ F+P S++ + C +
Sbjct: 54 PGQQMF-MVLDTSNDAAWVPCSG---CTGCSSTT--------FLPNASTTLGSLDCSEAQ 101
Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPSYLL---QYGLGFT-AGLLLSETLRFPSKTVP 228
CS + R +CP S L YG + A L+ + + + +P
Sbjct: 102 CSQV--------------RGFSCPATGSSACLFNQSYGGDSSLAATLVQDAITLANDVIP 147
Query: 229 NFLAGC-SILSDRQ--PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNL 282
F GC + +S P G+ G GR SL SQ G FSYCL S F S +L
Sbjct: 148 GFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPS--FKSYYFSGSL 205
Query: 283 VLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPG 342
L GP G K+ + TP +NP S YYV L + VG V IP LV
Sbjct: 206 KL--GP-VGQPKS--IRTTPLLRNPHRPS-----LYYVNLTGVSVGRIKVPIPSEQLVFD 255
Query: 343 SDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKS 402
+ G I+DSG+ T P++ A+ EF +Q+ + CF + +
Sbjct: 256 PNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVN-----GPISSLGAFDTCFAETNEAE 310
Query: 403 VYLPELILKFKGGAKMALPPEN 424
P + L F+ G + LP EN
Sbjct: 311 A--PAVTLHFE-GLNLVLPMEN 329
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 161/391 (41%), Gaps = 59/391 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + S + DTGSSL W C+ V C+ + P F PK SS
Sbjct: 127 GNYVTRMGLGTPAK-SYVMVVDTGSSLTWLQCSPC--VVSCHRQSG-----PVFNPKASS 178
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
S + C +CS + + CS N Y YG F+ G L +T+
Sbjct: 179 SYTSVSCSAQQCSDLTTATLNP--ASCSTSNVCI------YQASYGDSSFSVGYLSKDTV 230
Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
F S +VPNF GC ++ Q AG+ G R+ SL QL FSYCL +
Sbjct: 231 SFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSS 290
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-- 332
+ S + G SYT P+ SSS Y++ + I V K +
Sbjct: 291 SSGYLSIGSYNPGQ---------YSYT-----PMASSSLDDSLYFIKMTGIKVAGKPLSV 336
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
+P I+DSG+ T + ++ A++K M RA+ S L
Sbjct: 337 SSSAYSSLP-------TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAF---SILD 386
Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
CF + + +PE+ + F GGA + L N V + CL A PA
Sbjct: 387 TCFQGQAAR-LRVPEVTMAFAGGAALKLAARNLLVDVDSATTCL------AFAPARS--- 436
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A I+G+ Q Q F + +D+ N + GFA C+
Sbjct: 437 AAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 95/371 (25%), Positives = 148/371 (39%), Gaps = 60/371 (16%)
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
+ D+ S + W +CV C P P + P RS +S C +P C+ + GP
Sbjct: 32 VLDSASDVPWV------QCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTAL-GPY 84
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPS-KTVPNFLAGCSILS 238
GC+ N C YL++Y G T+G +++ L + V F GCS
Sbjct: 85 A----NGCA--NNQC-----QYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAE 133
Query: 239 ----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
D + AGI G ESL SQ + FSYC+ + D + +
Sbjct: 134 QGSFDARAAGIMALGGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGV--------- 184
Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
P + + + P+ FY V LR I VG + + + + GS ++
Sbjct: 185 ----PRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGS------VL 234
Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
DS + T + ++A+ F M Y A K L C+D +G ++ LP++ L
Sbjct: 235 DSRTAITRLPPTAYQALRAAFRSSMTMYRSA---PPKGYLDTCYDFTGVVNIRLPKISLV 291
Query: 412 FKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
F A + L P ++ N+ L FT NA G +LG Q Q + +D+
Sbjct: 292 FDRNAVLPLDPSG---ILFNDCLA---FTSNADDRMPG-----VLGSVQQQTIEVLYDVG 340
Query: 472 NDRFGFAKQKC 482
GF + C
Sbjct: 341 GGAVGFRQGAC 351
>gi|15218740|ref|NP_171821.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|13272443|gb|AAK17160.1|AF325092_1 unknown protein [Arabidopsis thaliana]
gi|3850579|gb|AAC72119.1| Strong similarity to gb|D14550 extracellular dermal glycoprotein
(EDGP) precursor from Daucus carota. ESTs gb|H37281,
gb|T44167, gb|T21813, gb|N38437, gb|Z26470, gb|R65072,
gb|N76373, gb|F15470, gb|Z35182, gb|H76373, gb|Z34678
and gb|Z35387 come from this gene [Arabidopsis thaliana]
gi|14334706|gb|AAK59531.1| unknown protein [Arabidopsis thaliana]
gi|16323420|gb|AAL15204.1| unknown protein [Arabidopsis thaliana]
gi|332189425|gb|AEE27546.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 433
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 75/260 (28%), Positives = 122/260 (46%), Gaps = 43/260 (16%)
Query: 243 AGIAGFGRSSESLPSQLGL-----KKFSYCLLSRK----FDDAPVSSNLVLDTGPGSGDS 293
G+AG GR + LPSQ +KF+ CL S K F + P + L PG
Sbjct: 174 VGMAGMGRHNIGLPSQFAAAFSFHRKFAVCLTSGKGVAFFGNGPY---VFL---PGI--- 224
Query: 294 KTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIVGSKHVKI-PYSYLVPGSDGNG 347
+ L TP NPV ++SAF + Y++G+ I + K V I P + S G G
Sbjct: 225 QISSLQTTPLLINPVSTASAFSQGEKSSEYFIGVTAIQIVEKTVPINPTLLKINASTGIG 284
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP---CFDISGKKSVY 404
G + S + +T +E ++ A EF++Q + A +++ + ++P CF
Sbjct: 285 GTKISSVNPYTVLESSIYNAFTSEFVKQ----AAARSIKRVASVKPFGACFSTKNVGVTR 340
Query: 405 LP------ELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
L EL+L K + N V ++V+CL F D + ++++G
Sbjct: 341 LGYAVPEIELVLHSKD-VVWRIFGANSMVSVSDDVICL-GFVDGG----VNARTSVVIGG 394
Query: 459 FQLQNFYLEFDLANDRFGFA 478
FQL++ +EFDLA+++FGF+
Sbjct: 395 FQLEDNLIEFDLASNKFGFS 414
>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
Length = 372
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 102/386 (26%), Positives = 146/386 (37%), Gaps = 57/386 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + GTP Q + DT S + W PC C+ C+ F S++
Sbjct: 36 YIVRAKIGTPAQ-TMLMAMDTSSDVAWIPCNG---CLGCSST--------LFNSPASTTY 83
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ +GCQ +C + P TC S+ L YG A L +T+
Sbjct: 84 KSLGCQAAQCKQVPKP--------------TCGGGVCSFNLTYGGSSLAANLSQDTITLA 129
Query: 224 SKTVPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
+ VP + GC L + G+ S S L FSYCL S F
Sbjct: 130 TDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLN 187
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S +L L GP + + YTP KNP S Y+V L + VG + V +P
Sbjct: 188 FSGSLRL--GPVGQPKR---IKYTPLLKNPRRPS-----LYFVNLMAVRVGRRVVDVPPG 237
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
G I DSG+ FT + P + AV F ++G R V G C+ +
Sbjct: 238 SFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVG---RNLTVTSLGGFDTCYTV 294
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
+ P + F G + LPP+N CL + AA P ++
Sbjct: 295 ----PIAAPTITFMFT-GMNVTLPPDNLLIHSTAGSTTCLAM----AAAPDNVNSVLNVI 345
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
+ Q QN L +D+ N R G A++ C
Sbjct: 346 ANLQQQNHRLLYDVPNSRLGVARELC 371
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 161/391 (41%), Gaps = 59/391 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + S + DTGSSL W C+ V C+ + P F PK SS
Sbjct: 125 GNYVTRMGLGTPAK-SYVMVVDTGSSLTWLQCSPC--VVSCHRQSG-----PVFNPKASS 176
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
S + C +CS + + CS N Y YG F+ G L +T+
Sbjct: 177 SYASVSCSAQQCSDLTTATLNP--ASCSTSNVCI------YQASYGDSSFSVGYLSKDTV 228
Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
F S +VPNF GC ++ Q AG+ G R+ SL QL FSYCL +
Sbjct: 229 SFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSS 288
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-- 332
+ S + G SYT P+ SSS Y++ + I V K +
Sbjct: 289 SSGYLSIGSYNPGQ---------YSYT-----PMASSSLDDSLYFIKMTGIKVAGKPLSV 334
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
+P I+DSG+ T + ++ A++K M RA+ S L
Sbjct: 335 SSSAYSSLP-------TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAF---SILD 384
Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
CF + + +PE+ + F GGA + L N V + CL A PA
Sbjct: 385 TCFQGQAAR-LRVPEVTMAFAGGAALKLAARNLLVDVDSATTCL------AFAPARS--- 434
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A I+G+ Q Q F + +D+ N + GFA C+
Sbjct: 435 AAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 465
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 106/407 (26%), Positives = 165/407 (40%), Gaps = 69/407 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y L G+PP+ + DTGS ++W C RC + +D + + PK S
Sbjct: 68 GLYFTKLGLGSPPKDYYVQV-DTGSDILWVNCVKCSRCPRKSDLGID---LTLYDPKGSE 123
Query: 162 SSQLIGCQNPKCSWIF-GPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
+S+LI C CS + GP GC + CP Y + YG G T G + +
Sbjct: 124 TSELISCDQEFCSATYDGP-----IPGCKSE-----IPCP-YSITYGDGSATTGYYVQDY 172
Query: 220 LRFPS-----KTVPN---FLAGCSIL--------SDRQPAGIAGFGRSSESLPSQLGL-- 261
L + +T P + GC + S+ GI GFG+S+ S+ SQL
Sbjct: 173 LTYNHVNDNLRTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASG 232
Query: 262 ---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
K FS+CL N+ G+ P +S TP Y
Sbjct: 233 KVKKIFSHCL-----------DNIRGGGIFAIGEVVEPKVSTTPLVPRMA--------HY 273
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNG-GVIVDSGSTFTFMEGPLF-EAVAKEFIRQM 376
V L+ I V + +++P GNG G I+DSG+T ++ ++ E + K RQ
Sbjct: 274 NVVLKSIEVDTDILQLPSDIF---DSGNGKGTIIDSGTTLAYLPAIVYDELIPKVMARQ- 329
Query: 377 GNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCL 436
+ VE++ CF +G P + L F+ + + P +Y + + C
Sbjct: 330 -PRLKLYLVEQQ---FSCFQYTGNVDRGFPVVKLHFEDSLSLTVYPHDYLFQFKDGIWC- 384
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
I + + A G+ +LGD L N + +DL N G+ C+
Sbjct: 385 IGWQKSVAQTKNGK-DMTLLGDLVLSNKLVIYDLENMAIGWTDYNCS 430
>gi|115442113|ref|NP_001045336.1| Os01g0937500 [Oryza sativa Japonica Group]
gi|20160770|dbj|BAB89711.1| putative xylanase inhibitor [Oryza sativa Japonica Group]
gi|113534867|dbj|BAF07250.1| Os01g0937500 [Oryza sativa Japonica Group]
gi|125573257|gb|EAZ14772.1| hypothetical protein OsJ_04701 [Oryza sativa Japonica Group]
gi|215766348|dbj|BAG98576.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 443
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 106/408 (25%), Positives = 163/408 (39%), Gaps = 59/408 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y+IS+ G PP + D +LVW C S + V C D + P+R
Sbjct: 48 YTISVKNGAPP-----LVVDLAGALVWSTCPSTHSTVPCQSAACD--AVNRQQPRR---- 96
Query: 164 QLIGCQNPKCSWIF-GPNVESRCKGCS-----PRNKTCPLA-CPSYLLQYGLGFTAGLLL 216
C+ W + G SRC C+ P C ++ + LL
Sbjct: 97 ----CRYVDGGWFWAGREPGSRC-ACTAHPFNPVTGECSTGDLTTFTMSANTTNGTDLLY 151
Query: 217 SETLRFPSKTVPNFLAGCSILSDRQPAGIAGF-GRSSESLPSQLGLKK-----FSYCL-L 269
E+ P L L + AG+AGF G + SLPSQL ++ F+ CL +
Sbjct: 152 PESFTAVGACAPERLLASPSLP-QAAAGVAGFSGTTPLSLPSQLAAQRRFGSTFALCLPV 210
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV-- 327
F D PV + + P T L TPF NP + YY+ +++I V
Sbjct: 211 FATFGDTPV---YLPNYNPYGPFDYTKMLRRTPFLTNPRRNGG-----YYLPVKRISVSW 262
Query: 328 ---GSKHVKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF--IRQMGNYSR 381
G V +P L + G GGV++ + + + M +F A K F + G SR
Sbjct: 263 RGPGDVPVSLPAGALDLNARTGRGGVVLSTTTPYAIMRTDVFRAFGKAFDTVVTRGTESR 322
Query: 382 AADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPE-------NYFALVGNEVL 434
A V ++ C+ +G + P ++K G A+ E N+ L GN ++
Sbjct: 323 MARVARQKQFELCYGGAGDTMLSFP--MMKRTGFDAPAITLELDAGATGNWTILNGNYLV 380
Query: 435 ---CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
C+ + G + PA++LG QL+N + FDL GF++
Sbjct: 381 RETCVGVVEMGPEGMPVDGEPAVVLGGMQLENILMVFDLDKRTLGFSR 428
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 97/396 (24%), Positives = 166/396 (41%), Gaps = 68/396 (17%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
+ +++ FG+P Q T I DTGS + W +C+ C+ + P F P +S++
Sbjct: 161 FVVTVGFGSPAQNYTLSI-DTGSDVSWI------QCLPCS-GHCYKQHDPVFDPTKSATY 212
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
+ C +P+C+ G +C + TC Y + YG G TAG+L ETL
Sbjct: 213 SAVPCGHPQCAAAGG-----KCS----NSGTC-----LYKVTYGDGSSTAGVLSHETLSL 258
Query: 223 PS-KTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDD 275
S + +P F GC + + G+ G GR + SLPSQ FSYCL S
Sbjct: 259 SSTRDLPGFAFGCGQTNLGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSYD--- 315
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
L + + + + + YT + + Y+V + I +G + +P
Sbjct: 316 -TTHGYLTMGSTTPAASNDDDDVQYTAMIQK-----EDYPSLYFVEVVSIDIGGYILPVP 369
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
P G + DSG+ T++ + ++ F M Y A + C+
Sbjct: 370 -----PTVFTRDGTLFDSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDP---FDTCY 421
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG------ 449
D +G ++++P + KF GA L P + ++++ D+ A PA G
Sbjct: 422 DFTGHNAIFMPAVAFKFSDGAVFDLSP-----------VAILIYPDDTA-PATGCLAFVP 469
Query: 450 ---RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
P I+G+ Q + + +D+A ++ GF + C
Sbjct: 470 RPSTMPFNIIGNTQQRGTEVIYDVAAEKIGFGQFTC 505
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 161/391 (41%), Gaps = 59/391 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + S + DTGSSL W C+ V C+ + P F PK SS
Sbjct: 127 GNYVTRMGLGTPAK-SYVMVVDTGSSLTWLQCSPC--VVSCHRQSG-----PVFNPKASS 178
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
S + C +CS + + CS N Y YG F+ G L +T+
Sbjct: 179 SYTSVSCSAQQCSDLTTATLSP--ASCSTSNVCI------YQASYGDSSFSVGYLSKDTV 230
Query: 221 RFPSKTVPNFLAGCSILSD---RQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFD 274
F S +VPNF GC ++ Q AG+ G R+ SL QL FSYCL +
Sbjct: 231 SFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSS 290
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV-- 332
+ S + G SYT P+ SSS Y++ + I V K +
Sbjct: 291 SSGYLSIGSYNPGQ---------YSYT-----PMASSSLDDSLYFIKMTGIKVAGKPLSV 336
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLR 392
+P I+DSG+ T + ++ A++K M RA+ S L
Sbjct: 337 SSSAYSSLP-------TIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAF---SILD 386
Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGP 452
CF + + +PE+ + F GGA + L N V + CL A PA
Sbjct: 387 TCFQGQAAR-LRVPEVTMAFAGGAALKLAARNLLVDVDSATTCL------AFAPARS--- 436
Query: 453 AIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A I+G+ Q Q F + +D+ N + GFA C+
Sbjct: 437 AAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467
>gi|218189700|gb|EEC72127.1| hypothetical protein OsI_05116 [Oryza sativa Indica Group]
Length = 443
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 106/408 (25%), Positives = 163/408 (39%), Gaps = 59/408 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y+IS+ G PP + D +LVW C S + V C D + P+R
Sbjct: 48 YTISVKNGAPP-----LVVDLAGALVWSTCPSTHSTVPCQSAACD--AVNRQQPRR---- 96
Query: 164 QLIGCQNPKCSWIF-GPNVESRCKGCS-----PRNKTCPLA-CPSYLLQYGLGFTAGLLL 216
C+ W + G SRC C+ P C ++ + LL
Sbjct: 97 ----CRYVDGGWFWAGREPGSRC-ACTAHPFNPVTGECSTGDLTTFAMSANTTNGTDLLY 151
Query: 217 SETLRFPSKTVPNFLAGCSILSDRQPAGIAGF-GRSSESLPSQLGLKK-----FSYCL-L 269
E+ P L L + AG+AGF G + SLPSQL ++ F+ CL +
Sbjct: 152 PESFTAVGACAPERLLASPSLP-QAAAGVAGFSGTTPLSLPSQLAAQRRFGSTFALCLPV 210
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV-- 327
F D PV + + P T L TPF NP + YY+ +++I V
Sbjct: 211 FATFGDTPV---YLPNYNPYGPFDYTKMLRRTPFLTNPRRNGG-----YYLPVKRISVSW 262
Query: 328 ---GSKHVKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF--IRQMGNYSR 381
G V +P L + G GGV++ + + + M +F A K F + G SR
Sbjct: 263 RGPGDVPVSLPAGALDLNARTGRGGVVLSTTTPYAIMRTDVFRAFGKAFDTVVTRGTESR 322
Query: 382 AADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPE-------NYFALVGNEVL 434
A V ++ C+ +G + P ++K G A+ E N+ L GN ++
Sbjct: 323 MARVARQKQFELCYGGAGDTMLSFP--MMKRTGFDAPAITLELDAGATGNWTILNGNYLV 380
Query: 435 ---CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAK 479
C+ + G + PA++LG QL+N + FDL GF++
Sbjct: 381 RETCVGVVEMGPEGMPVDGEPAVVLGGMQLENILMVFDLDKRTLGFSR 428
>gi|222624645|gb|EEE58777.1| hypothetical protein OsJ_10300 [Oryza sativa Japonica Group]
Length = 431
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 79/260 (30%), Positives = 109/260 (41%), Gaps = 41/260 (15%)
Query: 244 GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPF 303
G+ G R + S +Q G ++F+YC+ AP VL G G P L+YTP
Sbjct: 181 GLLGMNRGTLSFVTQTGTRRFAYCI-------APGEGPGVLLLGDDGG--VAPPLNYTPL 231
Query: 304 YKNPVGSSSAFGEF----YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTF 359
+ S F Y V L I VG + IP S L P G G +VDSG+ FTF
Sbjct: 232 IE----ISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTGAGQTMVDSGTQFTF 287
Query: 360 MEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI--------SGKKSVYLPELILK 411
+ + A+ EF Q A E + FD S LPE+ L
Sbjct: 288 LLADAYAALKAEFTSQ-ARLLLAPLGEPGFVFQGAFDACFRGPEARVAAASGLLPEVGLV 346
Query: 412 FKGGAKMALPPENYFALVGNE---------VLCLILFTDNAAGPALGRGPAIILGDFQLQ 462
+ GA++A+ E +V E V CL + AG + A ++G Q
Sbjct: 347 LR-GAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGMS-----AYVIGHHHQQ 400
Query: 463 NFYLEFDLANDRFGFAKQKC 482
N ++E+DL N R GFA +C
Sbjct: 401 NVWVEYDLQNGRVGFAPARC 420
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 108/406 (26%), Positives = 160/406 (39%), Gaps = 92/406 (22%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWF---PCTSR-YRCVDCNFPNVDPSRIPAFIPKR 159
+ + + FGTP Q + I DTGS L W PC+ YR D P F P +
Sbjct: 137 FVVVVGFGTPAQTAA-IILDTGSDLSWIQCKPCSGHCYRQHD-----------PDFDPAK 184
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSE 218
SSS + C P C+ G C G TC Y +QYG G T G+L +
Sbjct: 185 SSSYAAVPCGTPVCAAAGG-----MCNG-----TTC-----LYGVQYGDGSSTTGVLSRD 229
Query: 219 TLRFPSKT-VPNFLAGCSILSDRQPAGIAGFGR--------------SSESLPSQLGLKK 263
TL F S + F GC I FG S++ PS G+
Sbjct: 230 TLTFNSSSKFTGFTFGCG------EKNIGDFGEVDGLLGLGRGKLSLPSQAAPSFGGV-- 281
Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
FSYCL S ++ L+ G S P + YT K P + FY++ L
Sbjct: 282 FSYCLPSYN------TTPGYLNIGATKPTSTVP-VQYTAMIKKP-----QYPSFYFIELV 329
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
I +G + +P S G ++DSG+ T++ P + ++ F M A
Sbjct: 330 SINIGGYILPVPPSVFT-----KTGTLLDSGTILTYLPPPAYTSLRDRFKFTMQGNKPAP 384
Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALP-------PENYFALVGNEVLCL 436
E L C+D +G+ ++ +P + F GA L P++ L+G CL
Sbjct: 385 PYEP---LDTCYDFTGQGAIVIPAVSFNFSDGAVFDLDFYGIMIFPDDAKPLIG----CL 437
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ AA P I+G+ Q + + +D+ + + GF C
Sbjct: 438 AFVSRPAAMPFS------IVGNTQQRAAEVIYDVPSQKIGFIPISC 477
>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
vinifera]
Length = 437
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 102/386 (26%), Positives = 145/386 (37%), Gaps = 57/386 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + GTP Q DT S + W PC C+ C+ F S++
Sbjct: 101 YIVRAKIGTPAQTML-MAMDTSSDVAWIPCNG---CLGCSST--------LFNSPASTTY 148
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ +GCQ +C + P TC S+ L YG A L +T+
Sbjct: 149 KSLGCQAAQCKQVPKP--------------TCGGGVCSFNLTYGGSSLAANLSQDTITLA 194
Query: 224 SKTVPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAP 277
+ VP + GC L + G+ S S L FSYCL S F
Sbjct: 195 TDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLN 252
Query: 278 VSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYS 337
S +L L GP + + YTP KNP S Y+V L + VG + V +P
Sbjct: 253 FSGSLRL--GPVGQPKR---IKYTPLLKNPRRPS-----LYFVNLMAVRVGRRVVDVPPG 302
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
G I DSG+ FT + P + AV F ++G R V G C+ +
Sbjct: 303 SFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVG---RNLTVTSLGGFDTCYTV 359
Query: 398 SGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIIL 456
+ P + F G + LPP+N CL + AA P ++
Sbjct: 360 ----PIAAPTITFMFT-GMNVTLPPDNLLIHSTAGSTTCLAM----AAAPDNVNSVLNVI 410
Query: 457 GDFQLQNFYLEFDLANDRFGFAKQKC 482
+ Q QN L +D+ N R G A++ C
Sbjct: 411 ANLQQQNHRLLYDVPNSRLGVARELC 436
>gi|148907857|gb|ABR17052.1| unknown [Picea sitchensis]
Length = 422
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 81/278 (29%), Positives = 124/278 (44%), Gaps = 41/278 (14%)
Query: 223 PSKTVPNFLAGCSILSDRQ---PAGIAGFGRSSESLPSQLGL-----KKFSYCLLSRK-- 272
P P C + S+R G+AG S+ +LPSQL +KF+ CL S
Sbjct: 145 PLARFPQLAFACDLSSNRVISGTVGVAGMTSSTLALPSQLSAAEGFSRKFAMCLPSGNAP 204
Query: 273 ----FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVG 328
F D P LV PG S + TP KN S + + +Y+G+++I VG
Sbjct: 205 GALFFGDEP----LVFLPPPGRDLSSQ--IIRTPLIKN-----SVYTDVFYLGVQRIEVG 253
Query: 329 SKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEF--IRQMGNYSRAADVE 386
+V I L DG GG + + +T + P++ ++ F + + N +R A V
Sbjct: 254 GVNVAIDAEKLRFDKDGRGGTKLSTVVRYTQLASPIYNSLEGVFTSVAKKMNITRVASV- 312
Query: 387 KKSGLRPCFDISGKKSVYLP------ELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
S CFD SG S + +++L+ + N V N+VLCL F
Sbjct: 313 --SPFGACFDSSGVGSTRVGPAVPTIDIVLQGNSTTTWRIFGANSMVRVNNKVLCL-GFV 369
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
D G L + +I++G +Q+Q+ L+FDLA GF+
Sbjct: 370 D--GGDNLQQ--SIVIGTYQMQDNLLQFDLATSTLGFS 403
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 103/423 (24%), Positives = 172/423 (40%), Gaps = 82/423 (19%)
Query: 89 NSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD--CNFPN 146
N+ ++ + + G Y+ L GTP Q I D+GS++ + PC + +C + PN
Sbjct: 77 NARMRLHDDLLTNGYYTTRLYIGTPSQEFA-LIVDSGSTVTYVPCATCEQCGNHQSESPN 135
Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
+ + P F P SS+ + C N C+ C C +Y QY
Sbjct: 136 IIEAHDPRFQPDLSSTYSPVKC-NVDCT-------------CDNERSQC-----TYERQY 176
Query: 207 G-LGFTAGLLLSETLRF--PSKTVPNFLA-GCS-----ILSDRQPAGIAGFGRSSESLPS 257
+ ++G+L + + F S+ P GC L + GI G GR S+
Sbjct: 177 AEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMD 236
Query: 258 QLGLK-----KFSYCLLSRKFDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNPV 308
QL K FS C +D G G+ G P + ++ + NPV
Sbjct: 237 QLVEKGVISDSFSLCYGG-------------MDVGGGTMVLGGMPAPPDMVFS--HSNPV 281
Query: 309 GSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
S +Y + L++I V K +++ + G ++DSG+T+ ++ F A
Sbjct: 282 RSP-----YYNIELKEIHVAGKALRLDPKIF----NSKHGTVLDSGTTYAYLPEQAFVAF 332
Query: 369 AKEFIRQMGNYS--RAADVEKKSGLRPCFDISGKK----SVYLPELILKFKGGAKMALPP 422
++ + R D K CF +G+ S P++ + F G K++L P
Sbjct: 333 KDAVTNKVNSLKKIRGPDPNYKD---ICFAGAGRNVSQLSEVFPDVDMVFGNGQKLSLSP 389
Query: 423 ENY-FALVGNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQ 480
ENY F E CL +F + G+ P +LG ++N + +D N++ GF K
Sbjct: 390 ENYLFRHSKVEGAYCLGVFQN-------GKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKT 442
Query: 481 KCA 483
C+
Sbjct: 443 NCS 445
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 102/408 (25%), Positives = 168/408 (41%), Gaps = 73/408 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTPP+ + DTGS ++W C S +C + +D + + PK SS
Sbjct: 81 GLYYTEIEIGTPPKQYHVQV-DTGSDILWVNCISCNKCPRKSDLGID---LRLYDPKGSS 136
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
S + C C+ +G + GC+ +N C Y + YG G T G +S++L
Sbjct: 137 SGSTVSCDQKFCAATYG----GKLPGCA-KNIPC-----EYSVMYGDGSSTTGYFVSDSL 186
Query: 221 RFPS--------KTVPNFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL---- 261
++ + + GC +++ GI GFG+S+ S+ SQL
Sbjct: 187 QYNQVSGDGQTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEV 246
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGS---GDSKTPGLSYTPFYKNPVGSSSAFGEF 317
K FS+CL + K G G GD P + TP +
Sbjct: 247 KKIFSHCLDTIK--------------GGGIFAIGDVVQPKVKSTPLVPDM--------PH 284
Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
Y V L I VG +++P G G I+DSG+T T++ E V K+ + +
Sbjct: 285 YNVNLESINVGGTTLQLPSHMFETGE--KKGTIIDSGTTLTYLP----ELVYKDVLAAV- 337
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALVGNEVLC 435
+++ D S ++ I +SV P++ F+ + + P +YF G+ + C
Sbjct: 338 -FAKHPDTTFHS-VQDFLCIQYFQSVDDGFPKITFHFEDDLGLNVYPHDYFFQNGDNLYC 395
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
F + G+ ++LGD L N + +DL N G+ C+
Sbjct: 396 F-GFQNGGLQSKDGK-DMVLLGDLVLSNKVVVYDLENQVVGWTDYNCS 441
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 103/423 (24%), Positives = 172/423 (40%), Gaps = 82/423 (19%)
Query: 89 NSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD--CNFPN 146
N+ ++ + + G Y+ L GTP Q I D+GS++ + PC + +C + PN
Sbjct: 76 NARMRLHDDLLTNGYYTTRLYIGTPSQEFA-LIVDSGSTVTYVPCATCEQCGNHQSESPN 134
Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
+ + P F P SS+ + C N C+ C C +Y QY
Sbjct: 135 IIEAHDPRFQPDLSSTYSPVKC-NVDCT-------------CDNERSQC-----TYERQY 175
Query: 207 G-LGFTAGLLLSETLRF--PSKTVPNFLA-GCS-----ILSDRQPAGIAGFGRSSESLPS 257
+ ++G+L + + F S+ P GC L + GI G GR S+
Sbjct: 176 AEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMD 235
Query: 258 QLGLK-----KFSYCLLSRKFDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNPV 308
QL K FS C +D G G+ G P + ++ + NPV
Sbjct: 236 QLVEKGVISDSFSLCYGG-------------MDVGGGTMVLGGMPAPPDMVFS--HSNPV 280
Query: 309 GSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
S +Y + L++I V K +++ + G ++DSG+T+ ++ F A
Sbjct: 281 RSP-----YYNIELKEIHVAGKALRLDPKIF----NSKHGTVLDSGTTYAYLPEQAFVAF 331
Query: 369 AKEFIRQMGNYS--RAADVEKKSGLRPCFDISGKK----SVYLPELILKFKGGAKMALPP 422
++ + R D K CF +G+ S P++ + F G K++L P
Sbjct: 332 KDAVTNKVNSLKKIRGPDPNYKD---ICFAGAGRNVSQLSEVFPDVDMVFGNGQKLSLSP 388
Query: 423 ENY-FALVGNE-VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQ 480
ENY F E CL +F + G+ P +LG ++N + +D N++ GF K
Sbjct: 389 ENYLFRHSKVEGAYCLGVFQN-------GKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKT 441
Query: 481 KCA 483
C+
Sbjct: 442 NCS 444
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 105/405 (25%), Positives = 152/405 (37%), Gaps = 67/405 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS LVW C+ RC R F P+RSS
Sbjct: 84 GEYFALVGVGTPSTKAM-LVIDTGSDLVWLQCSPCRRCY--------AQRGQVFDPRRSS 134
Query: 162 SSQLIGCQNPKCSWIFGPNVES---RCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLS 217
+ + + C +P+C + P +S GC Y++ YG G ++ G L +
Sbjct: 135 TYRRVPCSSPQCRALRFPGCDSGGAAGGGCR------------YMVAYGDGSSSTGDLAT 182
Query: 218 ETLRFPSKT-VPNFLAGCSILSDRQPAGIAGFGRSSESL-PSQLGLKKFSYCLLSRKFDD 275
+ L F + T V N GC GR +E L S GL L R
Sbjct: 183 DKLAFANDTYVNNVTLGC--------------GRDNEGLFDSAAGL------LGRRAAAR 222
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNP------------------VGSSSAFGEF 317
P T P S + G + + A +
Sbjct: 223 YPSRRRWPRRTAPSSSTASATGRRAQRAARTSCSAARRSRRPRRSPPCCRTRGARACTTW 282
Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
+ G GS + P S G GGV+VDSG+ + + A+ F +
Sbjct: 283 TWPGSASAARGSPGSRTPASRWT-RRRGRGGVVVDSGTAISRFARDAYAALRDAFDARAR 341
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
+ S C+D+ G+ + P ++L F GGA MALPPENYF V
Sbjct: 342 AAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAA 401
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ A G ++I G+ Q Q F + FD+ +R GFA + C
Sbjct: 402 SYRRCLGFEAADDGLSVI-GNVQQQGFRVVFDVEKERIGFAPKGC 445
>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 556
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 104/401 (25%), Positives = 163/401 (40%), Gaps = 80/401 (19%)
Query: 108 LSFGTPPQASTPFIFDTGSSLVWF---PCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQ 164
+ GTPP + + DTG++L + PCT R C+ D I F P +S S
Sbjct: 210 IKLGTPPVWNLVAV-DTGATLSFVQCEPCTLR-----CH-KQTDAGEI--FDPSKSESFS 260
Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG--LGFTAGLLLSETLRF 222
+GC KC + + + K C + +C Y + +G ++ G L+ + L
Sbjct: 261 RVGCSENKCRTV-QRALHLQSKACMEKEDSCL-----YSMTFGGTSSYSVGKLVRDRLAI 314
Query: 223 ----PSKTVPNFLAGCSILSD--RQPAGIAGFGRSSESLPSQLG----LKKFSYCLLSRK 272
+ P+FL GCS+ ++ + AG+ GF S Q+ K FSYC S +
Sbjct: 315 GKYAKGYSFPDFLFGCSLDTEYHQYEAGLVGFADEPFSFFEQVAPLVNYKAFSYCFPSDR 374
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIV-GSKH 331
+S GD +YTP + S Y + L +++V G
Sbjct: 375 RKTGYLSI----------GDYTRVNSTYTPLFLARQQSR------YALKLDEVLVNGMAL 418
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF---EAVAKEFIRQMG---NYSRAADV 385
V P +IVDSGS +T + F +A E +R +G NY R +D
Sbjct: 419 VTTP-----------SEMIVDSGSRWTILLSDTFTQLDAAITEAMRPLGYNRNYYRGSDY 467
Query: 386 EKKSGLRPCFDISGKKS----VYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTD 441
CF+ + + LP + LKF G KM L P++ F + LC D
Sbjct: 468 -------ICFEDAHFQQFSDWAALPVVELKFDMGVKMVLQPQSSFHFNNDYGLCTYFMRD 520
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ G + +LG+ ++ + FD+ +FGF K C
Sbjct: 521 ASLGSGVQ-----LLGNTMTRSVGITFDIQGGQFGFRKGDC 556
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 109/393 (27%), Positives = 161/393 (40%), Gaps = 79/393 (20%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y I++ G+P T I DTGS + W C S + F P +S++
Sbjct: 129 YVITVGIGSPAVTQTMMI-DTGSDVSWVRCNST-------------DGLTLFDPSKSTTY 174
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETLRF 222
C + C+ + G N + GCS N C Y +QYG G T G S+TL
Sbjct: 175 APFSCSSAACAQL-GNNGD----GCS--NSGC-----QYRVQYGDGSNTTGTYSSDTLAL 222
Query: 223 -PSKTVPNFLAGCSILSD----RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFD 274
S TV +F GCS + + G+ G G ++SL SQ K FSYCL
Sbjct: 223 SASDTVTDFHFGCSHHEEDFDGEKIDGLMGLGGDAQSLVSQTAATYGKSFSYCL------ 276
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
P ++ G+ + + G TP + P + Y V L+ I VG + I
Sbjct: 277 --PPTNRTSGFLTFGAPNGTSGGFVTTPMLRWPKAPT-----LYGVLLQDISVGGTPLGI 329
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN--YSRAADVEKKSGLR 392
S L GS ++DSG+ T++ + A++ F M + RAA + L
Sbjct: 330 QPSVLSNGS------VMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQRAAPLGI---LD 380
Query: 393 PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVL---CLILFTDNAAGPALG 449
C+D +G +V +P + L GGA + L GN ++ CL A
Sbjct: 381 TCYDFTGLVNVSIPAVSLVLDGGAVVDLD--------GNGIMIQDCLAF--------AAT 424
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
G +II G+ Q + F + D+ FGF C
Sbjct: 425 SGDSII-GNVQQRTFEVLHDVGQGVFGFRSGAC 456
>gi|125561847|gb|EAZ07295.1| hypothetical protein OsI_29543 [Oryza sativa Indica Group]
Length = 205
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 86/179 (48%), Gaps = 8/179 (4%)
Query: 245 IAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY 304
+ G GR SL SQLG +FSYCL S P N + ++ + GL P
Sbjct: 1 MVGLGRGLLSLVSQLGPSRFSYCLTSF-LSPEPSRLNFGVFATLNGTNASSSGL---PVQ 56
Query: 305 KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL 364
P+ ++A Y++ L+ I +G K + I DG GGV +DSG++ T+++ +
Sbjct: 57 STPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDV 116
Query: 365 FEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYL--PELILKFKGGAKMALP 421
++AV +E + + A D E GL CF +V + P++ L F GGA M P
Sbjct: 117 YDAVRRELVSVLRPLPPANDTEI--GLETCFPWPPPPTVTMTVPDMELHFDGGANMLHP 173
>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 491
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 101/393 (25%), Positives = 149/393 (37%), Gaps = 54/393 (13%)
Query: 101 YGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRS 160
YG P S DT + W +C+ C P P R F P+RS
Sbjct: 142 YGAVIDGDDDDDPMILSQTMAIDTTEDVPWI------QCLPCLIPQCYPQRNAFFDPRRS 195
Query: 161 SSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSET 219
S+ + C + C + G GCS N T Y ++Y T G +++T
Sbjct: 196 STGAPVRCGSRACRTLGG-----YANGCSKPNSTGDCL---YRIEYSDHRLTLGTYMTDT 247
Query: 220 LRF-PSKTVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSR 271
L PS T NF GCS Q +G G +SL SQ FSYC+
Sbjct: 248 LTISPSTTFLNFRFGCSHAVRGKFSAQASGTMSLGGGPQSLLSQTARAYGNAFSYCV--- 304
Query: 272 KFDDAPVSSNLVLDTGPGSGDS--KTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
P ++ + GP +GD + + TP ++ ++ Y V L+ I V
Sbjct: 305 ---PGPSAAGFLSIGGPVNGDDGGGSGAFATTPLVRS---ANVINPTIYVVRLQGIEVAG 358
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
+ + +P +GG ++DS + T + + A+ F M Y A
Sbjct: 359 RRLNVPPVVF------SGGTVMDSSAVITQLPPTAYRALRLAFRNAMRAYKTRA---PTG 409
Query: 390 GLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG 449
L CFD G V +P + L F GGA + L L+ + + F AA ALG
Sbjct: 410 NLDTCFDFVGVSKVTVPTVSLVFDGGAVIEL------GLLSVLLDSCLAFAPMAADFALG 463
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+G+ Q Q + +D+A GF C
Sbjct: 464 -----FIGNVQQQTHEVLYDVAGGAVGFRHGAC 491
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 99/385 (25%), Positives = 148/385 (38%), Gaps = 73/385 (18%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
+ DT S + W +C C P+ + P +SSSS C +P C
Sbjct: 158 MVIDTASDVPWV------QCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACR----- 206
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRF----PSKTVPNFLAGC 234
N+ GC+P C Y +QY G +AG +S+ L P+ + F GC
Sbjct: 207 NLGPYANGCTPAGDQC-----QYRVQYPDGSASAGTYISDVLTLNPAKPASAISEFRFGC 261
Query: 235 S--ILS----DRQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLD 285
S +L + +GI GR ++SLP+Q FSYCL PV S +
Sbjct: 262 SHALLQPGSFSNKTSGIMALGRGAQSLPTQTKATYGDVFSYCL-----PPTPVHSGFFI- 315
Query: 286 TGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDG 345
P ++ + + P+ S A Y V L I V K + +P +
Sbjct: 316 -------LGVPRVAASRYAVTPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFA----- 363
Query: 346 NGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDIS-----GK 400
G ++DS + T + + A+ F+ +M Y AA K L C+D S G
Sbjct: 364 -AGAVMDSRTIVTRLPPTAYMALRAAFVAEMRAYRAAA---PKEHLDTCYDFSGAAPGGG 419
Query: 401 KSVYLPELILKFKGGAKMALPPENYFALVGNEVL---CLILFTDNAAGPALGRGPAIILG 457
V LP++ L F G P L + VL CL F N G I+G
Sbjct: 420 GGVKLPKITLVFDG-------PNGAVELDPSGVLLDGCLA-FAPNTDDQMTG-----IIG 466
Query: 458 DFQLQNFYLEFDLANDRFGFAKQKC 482
+ Q Q + +++ GF + C
Sbjct: 467 NVQQQALEVLYNVDGATVGFRRGAC 491
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 99/406 (24%), Positives = 154/406 (37%), Gaps = 66/406 (16%)
Query: 99 HSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPK 158
S G Y + GTP + + DTGS ++W C RC D + +
Sbjct: 80 ESIGLYFAKIGLGTPSRDFHVQV-DTGSDILWVNCAGCIRCP----RKSDLVELTPYDAD 134
Query: 159 RSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLS 217
SS+++ + C + CS++ N S C S TC Y++ YG G T G L+
Sbjct: 135 ASSTAKSVSCSDNFCSYV---NQRSECHSGS----TCQ-----YVILYGDGSSTNGYLVR 182
Query: 218 ETL--------RFPSKTVPNFLAGCS-----ILSDRQPA--GIAGFGRSSESLPSQLG-- 260
+ + R T + GC L + Q A GI GFG+S+ S SQL
Sbjct: 183 DVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQ 242
Query: 261 ---LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF 317
+ F++CL D G G + P+ S SA
Sbjct: 243 GKVKRSFAHCL----------------DNNNGGGIFAIGEVVSPKVKTTPMLSKSAH--- 283
Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
Y V L I VG+ +++ G D GVI+DSG+T ++ +AV + Q+
Sbjct: 284 YSVNLNAIEVGNSVLQLSSDAFDSGDDK--GVIIDSGTTLVYLP----DAVYNPLMNQIL 337
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
+ ++ CF + + P + +F +A+ P+ Y V + C
Sbjct: 338 ASHQELNLHTVQDSFTCFHYIDRLDRF-PTVTFQFDKSVSLAVYPQEYLFQVREDTWCFG 396
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N G ILGD L N + +D+ N G+ C+
Sbjct: 397 W--QNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 440
>gi|388493426|gb|AFK34779.1| unknown [Medicago truncatula]
Length = 454
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 98/414 (23%), Positives = 160/414 (38%), Gaps = 64/414 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
YS S+ GTP + D +WF C Y N +P + K++ +
Sbjct: 50 YSTSIKLGTP-AVPLDLVIDIRERFLWFECDDSY-----NSTTYNPIQCGTKKCKQARGT 103
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
I C N GC+ N TC + +G F +G + + L FP
Sbjct: 104 GCIDCTN-----------HPSKTGCT--NNTCGV---EPFNPFGGFFVSGDVGEDILSFP 147
Query: 224 SKT----------VPNFLAGCSILSDR------------QPAGIAGFGRSSESLPSQLGL 261
T VP F++ C + D+ G+ G R+ SLP+Q+
Sbjct: 148 RVTSDGRRVTNVRVPRFISSC-VYPDKFGVQGFLEGLSKGKKGVLGLARTLISLPTQIAT 206
Query: 262 K-----KFSYCLLSRKFDDAPVSSNLVLDTGP----GSGDSKTPGLSYTPFYKNPVGSSS 312
+ KF+ CL S + +L + GP + D + L YTP N +
Sbjct: 207 RFKLDRKFTLCLPSTSQKNGLGPGSLFVGGGPYNLGSNKDDASKFLKYTPLITNRRSTGP 266
Query: 313 AFGEF----YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
F F Y++ ++ I V + V + L G GG + + T + ++ +
Sbjct: 267 IFDNFPSTEYFIKVKSIKVDNNVVNFNTTLLSINKLGEGGTKLSTVIPHTTLHTSIYNPL 326
Query: 369 AKEFIRQMGNYSRAADVEKKSGLRPCFD-ISGKKSVY---LPELILKFKGGAKMALPPEN 424
F+++ + V+ + CFD + KSV +P + L KGG + + N
Sbjct: 327 LNAFVKK-AEIRKIKRVKAVAPFGACFDSRTISKSVNGPNVPTIDLVLKGGVEWRIFGAN 385
Query: 425 YFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
V VLCL F D + +II+G QL++ +EFDL + + GF+
Sbjct: 386 SMVKVNENVLCL-GFVDAGSEEVGPSATSIIIGGHQLEDNLVEFDLVSSKLGFS 438
>gi|125575539|gb|EAZ16823.1| hypothetical protein OsJ_32295 [Oryza sativa Japonica Group]
Length = 383
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 91/384 (23%), Positives = 153/384 (39%), Gaps = 39/384 (10%)
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCN--FPNVDPSRIPAFIPKRSSSSQ 164
S + GTPPQ ++ FI D G LVW C+ N P V P ++ +P
Sbjct: 27 SFTIGTPPQPASAFI-DVGGLLVWTQCSQCSSSSCFNQGAPAVRPDQV---VPPTGPEP- 81
Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS 224
C C F P C G C + L ++ T+G + ++ + +
Sbjct: 82 ---CGTALCE--FFPASIRNCSG-----DVCAYEASTQLFEH----TSGKIGTDAVAIGT 127
Query: 225 KTVPNFLAGCSILSDRQ-----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
T + GC + SD + P+G G R+ SL +Q+ + FS+CL +
Sbjct: 128 ATAASVAFGCVMASDIKLMDGGPSGFVGLARTPLSLVAQMNVTAFSHCLAPHDGGGGK-N 186
Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
S L L TPF K+ + +Y + L I G + +
Sbjct: 187 SRLFLGAAAKLAGGGKSAAMTTPFVKSSPDDIKSL--YYLINLEGIKAGDEAI-----IT 239
Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISG 399
VP S V++ + S +F+ +++ + K +G + + +S CF G
Sbjct: 240 VPQSGRT--VLLQTFSPVSFLVDGVYQDLKKAVTAAVGGPTATPPEQFQSIFDLCFKRGG 297
Query: 400 KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDF 459
P+++L F+G A + +PP NY VG++ +C+ + + G + ILG
Sbjct: 298 VSGA--PDVVLTFQGAAALTVPPTNYLLDVGDDTVCVAIASSARLNSTEVAGMS-ILGGL 354
Query: 460 QLQNFYLEFDLANDRFGFAKQKCA 483
Q QN + +DL + F C+
Sbjct: 355 QQQNVHFLYDLEKETLSFEAADCS 378
>gi|326496543|dbj|BAJ94733.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326511583|dbj|BAJ91936.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 98/402 (24%), Positives = 163/402 (40%), Gaps = 76/402 (18%)
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVD-PSRIPAFIPKRSSSSQLIGCQNPK 172
PQ + D G + +W C + Y V ++ V S++ + ++ +G +P
Sbjct: 52 PQVPVTAVLDLGGASLWVDCDAGY--VSSSYAGVPCASKLCRLAKSVACATSCVGKPSPG 109
Query: 173 CSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSK------- 225
C + C G P N ++ T G L+++ L P+
Sbjct: 110 C-------LNDTCSG-FPENTVTRVS------------TGGNLITDVLSVPTTFRPAPGP 149
Query: 226 --TVPNFL--AGCSILSDRQPAGIAGFG---RSSESLPSQLGL-----KKFSYCLLSRK- 272
T P FL G + L+D AG G R+ +LP+QL +KF+ CL S
Sbjct: 150 LATAPAFLFTCGATFLTDGLAAGATGMASLSRARFALPTQLAATFRFSRKFALCLTSTSA 209
Query: 273 -----FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGL 322
F DAP + PG SK+ L+YTP N V ++ G+ Y++G+
Sbjct: 210 AGVVVFGDAPYAFQ------PGVDLSKS--LTYTPLLVNNVSTAGVSGQKDKSNEYFIGV 261
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
I V + V + S L G GG + + + +T +E + +AV F + R
Sbjct: 262 TAIKVNGRAVPLNASLLAIDKQGGGGTKLSTVAPYTVLETSIHKAVTDAFAAETAMIPR- 320
Query: 383 ADVEKKSGLRPCFDISGKKSVYL------PELILKFKGGAKMALPPENYFALVGNEVLCL 436
V + + C+D S S + EL+L+ + + + + A G LCL
Sbjct: 321 --VRAVAPFKLCYDGSKVGSTRVGPAVPTVELVLQNEAASWVVFGANSMVAAKGGA-LCL 377
Query: 437 ILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
+ D A P ++++G +++ LEFDL R GF+
Sbjct: 378 GVV-DGGAAPRT----SVVIGGHTMEDNLLEFDLQRARLGFS 414
>gi|42407406|dbj|BAD09564.1| nucleoid DNA-binding protein-like [Oryza sativa Japonica Group]
Length = 205
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 86/179 (48%), Gaps = 8/179 (4%)
Query: 245 IAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY 304
+ G GR SL SQLG +FSYCL S P N + ++ + GL P
Sbjct: 1 MVGLGRGLLSLVSQLGPSRFSYCLTSF-LSPEPSRLNFGVFATLNGTNASSSGL---PVQ 56
Query: 305 KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL 364
P+ ++A Y++ L+ I +G K + I DG GGV +DSG++ T+++ +
Sbjct: 57 STPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDV 116
Query: 365 FEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYL--PELILKFKGGAKMALP 421
++AV +E + + A D E GL CF +V + P++ L F GGA M P
Sbjct: 117 YDAVRRELVSVLRPLPPANDTEI--GLETCFPWPPPPTVTMTVPDMELHFDGGANMLHP 173
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 101/408 (24%), Positives = 155/408 (37%), Gaps = 74/408 (18%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP ++ + DTGS ++W C C C + + + P SS
Sbjct: 79 GLYFTQIGIGTPAKSYYVQV-DTGSDILWVNCV---FCDTCPRKSGLGIELTLYDPSGSS 134
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
S + C C G + S P A Y + YG G T G +++ L
Sbjct: 135 SGTGVTCGQDFCVATHGGVIPS----------CVPAAPCQYSISYGDGSSTTGFFVTDFL 184
Query: 221 RFP----------SKTVPNFLAGCSILSD-----RQPAGIAGFGRSSESLPSQLGL---- 261
++ + T F G I D + GI GFG+S+ S+ SQL
Sbjct: 185 QYNQVSGNSQTTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKV 244
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSG-----DSKTPGLSYTPFYKNPVGSSSAFG 315
K F++CL DT G G D P +S TP
Sbjct: 245 RKVFAHCL----------------DTINGGGIFAIGDVVQPKVSTTPLVPGM-------- 280
Query: 316 EFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
Y V L I VG +++P + G + G I+DSG+T ++ G ++ A+ + Q
Sbjct: 281 PHYNVNLEAIDVGGVKLQLPTNIFDIGE--SKGTIIDSGTTLAYLPGVVYNAIMSKVFAQ 338
Query: 376 MGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLC 435
G+ D + + CF SG P + F+GG + + P +Y L N L
Sbjct: 339 YGDMPLKNDQDFQ-----CFRYSGSVDDGFPIITFHFEGGLPLNIHPHDY--LFQNGELY 391
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ F G+ ++LGD N + +DL N G+ C+
Sbjct: 392 CMGFQTGGLQTKDGK-DMVLLGDLAFSNRLVLYDLENQVIGWTDYNCS 438
>gi|413923981|gb|AFW63913.1| hypothetical protein ZEAMMB73_837345 [Zea mays]
Length = 414
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 78/154 (50%), Gaps = 19/154 (12%)
Query: 296 PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGS 355
P L YT F +SS FYYV L+ ++VG + +KI G DG+GG I+DSG+
Sbjct: 33 PELKYTAF----TPTSSPADTFYYVKLKGVLVGGELLKISSDTWDVGKDGSGGTIIDSGT 88
Query: 356 TFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGG 415
T ++ P+++AV + G PC+++SG + +PEL L F G
Sbjct: 89 TLSYFVEPVYQAVPSD--------------PGLLGAEPCYNVSGMERPEVPELSLLFPDG 134
Query: 416 AKMALPPENYFALVG-NEVLCLILFTDNAAGPAL 448
A P ENYF + ++++CL + + G ++
Sbjct: 135 AVWDFPAENYFVRLDPDDIMCLAVLGTSRTGMSI 168
>gi|302783208|ref|XP_002973377.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
gi|300159130|gb|EFJ25751.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
Length = 472
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 103/429 (24%), Positives = 163/429 (37%), Gaps = 82/429 (19%)
Query: 75 KPKTKDSNIGSNYS----NSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVW 130
K + +NI +++S + ++T V + ++++L+ GTPP F S W
Sbjct: 55 KQRRTLANITTDFSVRGGDKGLETSFYVDNGLNFAMNLNLGTPP-VQHNFTMALNSEFFW 113
Query: 131 FPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSP 190
C+ CVDCN DP F S+S I C +P CS P + G S
Sbjct: 114 AACSP---CVDCNVSTNDP----LFSSASSTSYTRIPCTSPFCS--TSPGFSTNACGSSA 164
Query: 191 RNKTCPLACPSYLLQYGLGFTAGLLLSET--LRFPSKTVPN----FLAGC-----SILSD 239
T L SY Y +AG + S+ ++ P KT N GC ++L
Sbjct: 165 VGSTTCLYNFSYSTDYS---SAGEMASDVVAMKTPRKTRGNKSLRMSLGCGRESTTLLGI 221
Query: 240 RQPAGIAGFGRSSESLPSQLG----LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKT 295
+G+ GF ++ +S QL KF YC+ S F V N + S
Sbjct: 222 LNTSGLVGFAKTDKSFIGQLAEMDYTSKFIYCVPSDTFSGKIVLGNYKI--------SSH 273
Query: 296 PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGS 355
LSYTP N YY+GLR I + + + P ++ +DG GG I+DS
Sbjct: 274 SSLSYTPMIVNSTA-------LYYIGLRSISI-TDTLTFPVQGIL--ADGTGGTIIDSTF 323
Query: 356 TFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS--GLRPCFDISGKKSVYLPELILKFK 413
F++ + + + N ++ + E + G C+++S
Sbjct: 324 AFSYFTPDSYTPLVQAIQNLNSNLTKVSSNETAALLGNDICYNVSVNDDD---------- 373
Query: 414 GGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLAND 473
N +CL + G +L ++G +Q + +EFDL
Sbjct: 374 ---------------AENATVCLAVGDSEKVGFSLN-----VIGTYQQLDVAVEFDLEKQ 413
Query: 474 RFGFAKQKC 482
GF C
Sbjct: 414 EIGFGTAGC 422
>gi|358347314|ref|XP_003637703.1| Basic 7S globulin [Medicago truncatula]
gi|355503638|gb|AES84841.1| Basic 7S globulin [Medicago truncatula]
Length = 454
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 98/414 (23%), Positives = 160/414 (38%), Gaps = 64/414 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
YS S+ GTP + D +WF C Y N +P + K++ +
Sbjct: 50 YSTSIKLGTP-AVPLDLVIDIRERFLWFECDDSY-----NSTTYNPIQCGTKKCKQARGT 103
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
I C N GC+ N TC + +G F +G + + L FP
Sbjct: 104 GCIDCTNHPFK-----------TGCT--NNTCGV---EPFNPFGGFFVSGDVGEDILSFP 147
Query: 224 SKT----------VPNFLAGCSILSDR------------QPAGIAGFGRSSESLPSQLGL 261
T VP F++ C + D+ G+ G R+ SLP+Q+
Sbjct: 148 RVTSDGRRVTNVRVPRFISSC-VYPDKFGVQGFLEGLSKGKKGVLGLARTLISLPTQIAT 206
Query: 262 K-----KFSYCLLSRKFDDAPVSSNLVLDTGP----GSGDSKTPGLSYTPFYKNPVGSSS 312
+ KF+ CL S + +L + GP + D + L YTP N +
Sbjct: 207 RFKLDRKFTLCLPSTSQKNGLGPGSLFVGGGPYNLGSNKDDASKFLKYTPLITNRRSTGP 266
Query: 313 AFGEF----YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
F F Y++ ++ I V + V + L G GG + + T + ++ +
Sbjct: 267 IFDNFPSTEYFIKVKSIKVDNNVVNFNTTLLSINKLGEGGTKLSTVIPHTTLHTSIYNPL 326
Query: 369 AKEFIRQMGNYSRAADVEKKSGLRPCFD-ISGKKSVY---LPELILKFKGGAKMALPPEN 424
F+++ + V+ + CFD + KSV +P + L KGG + + N
Sbjct: 327 LNAFVKK-AEIRKIKRVKAVAPFGACFDSRTISKSVNGPNVPTIDLVLKGGVEWRIFGAN 385
Query: 425 YFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
V VLCL F D + +II+G QL++ +EFDL + + GF+
Sbjct: 386 SMVKVNENVLCL-GFVDAGSEEVGPSATSIIIGGHQLEDNLVEFDLVSSKLGFS 438
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 95/385 (24%), Positives = 163/385 (42%), Gaps = 64/385 (16%)
Query: 123 DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVE 182
DTGS +W C C C + + + P S +S+ + C + C+ + +
Sbjct: 92 DTGSDTLWVNCVG---CTACPKKSGLGMDLTLYDPNLSKTSKAVPCDDEFCTSTYDGQIS 148
Query: 183 SRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFPS-----KTVPN---FLAG 233
KG ++CP Y + YG G T+G + + L F +TVP+ + G
Sbjct: 149 GCTKG---------MSCP-YSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFG 198
Query: 234 C--------SILSDRQPAGIAGFGRSSESLPSQLGL-----KKFSYCLLSRKFDDAPVSS 280
C S +D GI GFG+++ S+ SQL + FS+CL S +S
Sbjct: 199 CGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLDS-------ISG 251
Query: 281 NLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLV 340
+ G+ P + TP + Y V L+ I V +++P S ++
Sbjct: 252 GGIF----AIGEVVQPKVKTTPLLQGMA--------HYNVVLKDIEVAGDPIQLP-SDIL 298
Query: 341 PGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGK 400
S G G I+DSG+T ++ +++ + ++ + Q + VE + CF S +
Sbjct: 299 DSSSGRG-TIIDSGTTLAYLPVSIYDQLLEKILAQRSGM-KLYLVEDQ---FTCFHYSDE 353
Query: 401 KSV--YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
+SV P + F+ G + P +Y L ++ C + + + A G+ I+LGD
Sbjct: 354 ESVDDLFPTVKFTFEEGLTLTTYPRDYLFLFKEDMWC-VGWQKSMAQTKDGK-ELILLGD 411
Query: 459 FQLQNFYLEFDLANDRFGFAKQKCA 483
L N + +DL N G+A C+
Sbjct: 412 LVLANKLVVYDLDNMAIGWADYNCS 436
>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
Length = 389
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 135/291 (46%), Gaps = 44/291 (15%)
Query: 194 TCPLACP--SYLLQYGLG-FTAGLLLSETLRFPSKTVPNFLAGCSILSDR---QPAGIAG 247
C A P +Y + YG G FT G L E L+F + V +F+ GC + +G+ G
Sbjct: 68 VCGSAAPICNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMG 127
Query: 248 FGRSSESLPSQL-GL--KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFY 304
GRS SL SQ G+ FSYCL S + S +L+L + +P +SY
Sbjct: 128 LGRSDLSLISQTSGIFGGVFSYCLPS---TERKGSGSLILGGNSSVYRNSSP-ISYAKMI 183
Query: 305 KNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPL 364
+NP FY++ L I +G ++ P S G ++VDSG+ T + +
Sbjct: 184 ENP-----QLYNFYFINLTGISIGGVALQAP-------SVGPSRILVDSGTVITRLPPTI 231
Query: 365 FEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPEN 424
++A+ EF++Q + A S L CF++S + V +P + + F+G A++ +
Sbjct: 232 YKALKAEFLKQFTGFPPAPAF---SILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTG 288
Query: 425 YFALVGNEV--LCLIL----FTDNAAGPALGRGPAIILGDFQLQNFYLEFD 469
F V ++ +CL L + D A ILG++Q +N + +D
Sbjct: 289 VFYFVKSDASQVCLALASLEYQDEVA----------ILGNYQQKNLRVIYD 329
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 97/404 (24%), Positives = 160/404 (39%), Gaps = 64/404 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+PP + DTGS ++W C C + VD + + PK SS
Sbjct: 71 GLYYARIGIGSPPNDFHVQV-DTGSDILWVNCVGCSNCPKKSDIGVD---LQLYNPKSSS 126
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+S LI C P CS + ++ GC P L C Y + YG G TAG +++ +
Sbjct: 127 TSTLITCDQPFCSATY----DAPIPGCKP-----DLLC-QYKVIYGDGSATAGYFVNDYI 176
Query: 221 RFP--------SKTVPNFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL---- 261
+ S+T + + GC S GI GFG+++ S+ SQL
Sbjct: 177 QLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKV 236
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
K F++CL S +S + G+ P L TP N Y V
Sbjct: 237 KKIFAHCLDS-------ISGGGIF----AIGEVVEPKLKTTPVVPNQA--------HYNV 277
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L + VG + +P + G I+DSG+T ++ ++ + ++ + +
Sbjct: 278 VLNGVKVGDTALDLPLGLF--ETSYKRGAIIDSGTTLAYLPDSIYLPLMEKILGAQPDLK 335
Query: 381 -RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
R D + CF P + KF+ + + P Y + ++V C +
Sbjct: 336 LRTVDDQ-----FTCFVFDKNVDDGFPTVTFKFEESLILTIYPHEYLFQIRDDVWC--VG 388
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N+ + +LGD LQN + ++L N G+ + C+
Sbjct: 389 WQNSGAQSKDGNEVTLLGDLVLQNKLVYYNLENQTIGWTEYNCS 432
>gi|356548993|ref|XP_003542883.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 473
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 95/398 (23%), Positives = 160/398 (40%), Gaps = 61/398 (15%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y S+ GTP + + + D +W+ C + Y N R A K+
Sbjct: 87 YYTSVGIGTP-RHNFDLVIDLSGENLWYDCDTHY--------NSSSYRPIACGSKQCPEI 137
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+GC P F P GC+ N TCP + L ++ +G L + +
Sbjct: 138 GCVGCNGP-----FKP-------GCT--NNTCPANVINQLAKF---IYSGGLGEDFIFIR 180
Query: 224 SKTVPNFLAGC-------SILSDRQP--------AGIAGFGRSSESLPSQLGL-----KK 263
V L+ C S D P GI G +S +LP QL K
Sbjct: 181 QNKVSGLLSSCIDTDAFPSFSDDELPLFGLPNNTKGIIGLSKSQLALPIQLASANKVPSK 240
Query: 264 FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPV--GSSSAFG---EFY 318
FS CL S +NL++ G + L TP N V G+ S G + Y
Sbjct: 241 FSLCLPSLNNQGF---TNLLVRAGEEHPQGISKFLKTTPLIVNNVSTGAISVEGVPSKEY 297
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN 378
++ ++ + + V + S L + GNGG + + S FT ++ +++ ++FI++ +
Sbjct: 298 FIDVKAVQIDGNVVNLKPSLLAIDNKGNGGTKLSTMSPFTELQTTVYKTFIRDFIKKASD 357
Query: 379 YSRAADVEKKSGLRPCFDISGKKS----VYLPELILKFKGGAKMALPPENYFALVGNEVL 434
R V + C+D + ++ + +P + L +GG + + N + V
Sbjct: 358 -RRLKRVASVAPFEACYDSTSIRNSSTGLVVPTIDLVLRGGVQWTIYGANSMVMAKKNVA 416
Query: 435 CLILFTDNAAGPALGRGPA-IILGDFQLQNFYLEFDLA 471
CL + D P + A I++G +QL++ LEFD+A
Sbjct: 417 CLAI-VDGGTEPRMSFVKASIVIGGYQLEDNLLEFDVA 453
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 101/404 (25%), Positives = 158/404 (39%), Gaps = 63/404 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS ++W C S C + +D + + P S+
Sbjct: 87 GLYFTQIGIGTPSKGYYVQV-DTGSDILWVNCISCDSCPRKSGLGID---LTLYDPTASA 142
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
SS+ + C C+ V C SP C Y + YG G T G +++ L
Sbjct: 143 SSKTVTCGQEFCATATNGGVPPSCAANSP--------C-QYSITYGDGSSTTGFFVADFL 193
Query: 221 RFP--SKTVPNFLAGCSIL-------------SDRQPAGIAGFGRSSESLPSQLG----- 260
++ S LA S+ S+ GI GFG+++ S+ SQL
Sbjct: 194 QYDQVSGDGQTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKV 253
Query: 261 LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
K FS+CL + N+V P + TP Y V
Sbjct: 254 TKIFSHCLDTVNGGGIFAIGNVV-----------QPKVKTTPLVPGM--------PHYNV 294
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L+ I VG +++P + G G+ G I+DSG+T ++ +++AV +S
Sbjct: 295 VLKTIDVGGSTLQLPTNIFDIGG-GSRGTIIDSGTTLAYLPEVVYKAVLSAV------FS 347
Query: 381 RAADVEKKSGLR-PCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
DV K+ CF SG PE+ F G + + P +Y +V C + F
Sbjct: 348 NHPDVTLKNVQDFLCFQYSGSVDNGFPEVTFHFDGDLPLVVYPHDYLFQNTEDVYC-VGF 406
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
G+ ++LGD L N + +DL N G+ C+
Sbjct: 407 QSGGVQSKDGK-DMVLLGDLALSNKLVVYDLENQVIGWTNYNCS 449
>gi|297818546|ref|XP_002877156.1| hypothetical protein ARALYDRAFT_484681 [Arabidopsis lyrata subsp.
lyrata]
gi|297322994|gb|EFH53415.1| hypothetical protein ARALYDRAFT_484681 [Arabidopsis lyrata subsp.
lyrata]
Length = 420
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 79/271 (29%), Positives = 124/271 (45%), Gaps = 41/271 (15%)
Query: 227 VPNFLAGCSILS-----DRQPAGIAGFGRSSESLPSQLGL-----KKFSYCLLSRK---- 272
+PN + C S + G+AG GR SLPSQ +KF+ CL S +
Sbjct: 153 IPNIIFSCGSTSLLKGLAKGTVGMAGMGRHKISLPSQFAAAFSFNRKFAVCLTSGRGVTF 212
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
F + P + L PG S+ L TP NP GE Y++G+R+I + K V
Sbjct: 213 FGNGPY---VFL---PGIQISR---LQKTPLLINP-------GE-YFIGVREIKIVEKTV 255
Query: 333 KIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKS 389
I L + G GG + S + +T +E +F++ F+RQ N +R A V+ S
Sbjct: 256 PINQMLLKINKETGFGGTKISSVNPYTVLESSIFKSFTSMFVRQATARNMTRVASVKPFS 315
Query: 390 GLRPCFDISGKKSVY-LPELILKFKGG-AKMALPPENYFALVGNEVLCLILFTDNAAGPA 447
++ + Y +PE+ L + N V ++V+CL F D
Sbjct: 316 ACFSTQNVGVTRLGYAVPEIQLVLHSNDVVWRIFGGNSMVSVSDDVICL-GFVDGGVNAR 374
Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
++++G FQL++ +EFDLA++RFGF+
Sbjct: 375 ----TSVVIGGFQLEDNLIEFDLASNRFGFS 401
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 101/379 (26%), Positives = 148/379 (39%), Gaps = 71/379 (18%)
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
I D+GS + W +C C P R P F P S++ + C + C+ + GP
Sbjct: 171 IIDSGSDVSWV------QCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQL-GPY 223
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF------PSKTVPNFLAGC 234
+GCS N C Q+G+ + G + T F P + F GC
Sbjct: 224 R----RGCS-ANAQC---------QFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGC 269
Query: 235 SILS-----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNL---V 283
+ D AG G S+SL Q + FSYCL P +S+L V
Sbjct: 270 AHADRGSAFDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCL-------PPTASSLGFLV 322
Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
L P F P+ SSS FY V LR IIV + + +P + S
Sbjct: 323 LGVPPERAQL------IPSFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS 376
Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
++DS + + + ++A+ F M Y A V S L C+D +G +S+
Sbjct: 377 ------VIDSSTIISRLPPTAYQALRAAFRSAMTMYRAAPPV---SILDTCYDFTGVRSI 427
Query: 404 YLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQN 463
LP + L F GGA + L + L+G+ CL A A R P I G+ Q +
Sbjct: 428 TLPSIALVFDGGATVNL--DAAGILLGS---CLAF-----APTASDRMPGFI-GNVQQKT 476
Query: 464 FYLEFDLANDRFGFAKQKC 482
+ +D+ F C
Sbjct: 477 LEVVYDVPAKAMRFRTAAC 495
>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 523
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 106/437 (24%), Positives = 170/437 (38%), Gaps = 85/437 (19%)
Query: 77 KTKDSNIGSNYSNSLIKTPLSVHSYGG-----YSISLSFGTPPQASTPFI--FDTGSSLV 129
K + NIGS Y V +G + + GTP S PF+ D GS L+
Sbjct: 71 KRRRLNIGSKYDVLFPSEGSQVIFFGNEFNWLHYTWIDLGTP---SVPFLVALDVGSDLL 127
Query: 130 WFPCTSRYRCVDC-----NFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESR 184
W PC C+ C N+ +V + + P SS+S+ + C + C+W
Sbjct: 128 WVPCD----CIQCAPLSANYYSVLDRDLSEYNPALSSTSKHLFCGHQLCAW--------- 174
Query: 185 CKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKT--------VPNFLAGC-- 234
C N C Y T+G ++ + L+ S + + + GC
Sbjct: 175 STTCKSANDPCTYKRDYYSDNTS---TSGFMIEDKLQLTSFSKHGTHSLLQASVVFGCGR 231
Query: 235 ----SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGS 290
S L P G+ G G + S+P+ L + S FD+ L D GP +
Sbjct: 232 KQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSGRILFGDDGPAT 291
Query: 291 GDSKTPGLSYTPFYKNPVGSSSAFGEF--YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGG 348
+ + P FGEF Y++G+ VGS ++
Sbjct: 292 QQTT----QFLPL----------FGEFAAYFIGVESFCVGSSCLQ----------RSGFQ 327
Query: 349 VIVDSGSTFTFMEGPLFEAVAKEFIRQMG-NYSRAADVEKKSGLRPCFDISGKKSVYLPE 407
+VDSGS+FT++ +++ + EF +Q+ N +R V ++ C++IS S +P
Sbjct: 328 ALVDSGSSFTYLPAEVYKKIVFEFDKQVKVNATRI--VLRELPWNYCYNISTLVSFNIPS 385
Query: 408 LILKFKGGAKMALPPENYF-ALVGNEVLCLIL-FTDNAAGPALGRGPAIILGDFQLQNFY 465
+ L F P A G +V CL L TD G ++G + +
Sbjct: 386 MQLVFPLNQIFIHDPVYVLPANQGYKVFCLTLEETDEDYG---------VIGQNLMVGYR 436
Query: 466 LEFDLANDRFGFAKQKC 482
+ FD N + G++K KC
Sbjct: 437 MVFDRENLKLGWSKSKC 453
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 100/403 (24%), Positives = 161/403 (39%), Gaps = 63/403 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS ++W C RC + VD + + K S+
Sbjct: 153 GLYFAKIGIGTPSKDYYVQV-DTGSDILWVNCAGCDRCPTKSDLGVD---LTLYDMKAST 208
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+S +GC + CS GP GC P L C Y + YG G T G + + +
Sbjct: 209 TSDAVGCDDNFCSLYDGP-----LPGCKP-----GLQCL-YSVLYGDGSSTTGYFVQDFV 257
Query: 221 RFPS-----KTVP---NFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLG----- 260
++ +T P + GC S GI GFG+++ S+ SQL
Sbjct: 258 QYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKV 317
Query: 261 LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
K FS+CL D+ + G+ P ++ TP +N Y V
Sbjct: 318 KKVFSHCL-----DNVDGGGIFAI------GEVVEPKVNITPLVQNQA--------HYNV 358
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
+++I VG + +P G G I+DSG+T + ++ + ++ + Q +
Sbjct: 359 VMKEIEVGGDPLDVPSDAFESGD--RKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDL- 415
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
R VE+ CFD +G P + L F + + P Y V C I +
Sbjct: 416 RLHTVEQAF---TCFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWC-IGWQ 471
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
++ A G+ +LGD L N + +DL G+ + C+
Sbjct: 472 NSGAQTKDGK-DLTLLGDLVLSNKLVVYDLEKQGIGWVEYNCS 513
>gi|296086729|emb|CBI32364.3| unnamed protein product [Vitis vinifera]
Length = 400
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 103/395 (26%), Positives = 155/395 (39%), Gaps = 96/395 (24%)
Query: 114 PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKC 173
P + D G+ +W C Y SSS + C++ +C
Sbjct: 53 PLVPVKLVVDLGAQFLWVDCEQNYV---------------------SSSYRPARCRSAQC 91
Query: 174 SWIFGPNVESRCKGC-----SPR----NKTCPLACPSYLLQYGLGFTAGLLLSETLRFPS 224
S +R GC +PR N TC LA +Q G G ++S + +F
Sbjct: 92 SL-------ARANGCGDCFSAPRPGCNNNTCGLAEDFVSVQSTDGSNPGRVVSVS-KFLF 143
Query: 225 KTVPNFL-AGCSILSDRQPAGIAGFGRSSESLPSQLG-----LKKFSYCLLSRKFDDAPV 278
P FL G + G+AG GR+ + PSQ +KF+ CL S
Sbjct: 144 SCAPTFLLEGLA----SSAMGMAGLGRTRIAFPSQFASAFSFHRKFATCLSSS------T 193
Query: 279 SSNLVLDTGPGS-----GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
++N V+ G G + L YTP Y NP I + K +
Sbjct: 194 TANGVVFFGDGPYRLLPNIDASQSLIYTPLYINP----------------SIRINEKAIS 237
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG--NYSRAADVEKKSGL 391
+ S L S+G GG + + + +T ME +++A K FI N +R A V
Sbjct: 238 LNTSLLSIDSEGVGGTKISTVNPYTVMETSIYKAFTKAFISAAAAINITRVAAVAP---F 294
Query: 392 RPCFDISGKKSVY-------LPELILKFKGGAKM-ALPPENYFALVGNEVLCLILFTDNA 443
CF K+VY +P + L + + + N V ++VLCL F D
Sbjct: 295 NVCFS---SKNVYSTRVGPSVPSIDLVLQNESVFWRIFGANSMVYVSDDVLCL-GFVDGG 350
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
A P +I++G +QL++ L+FDLA R GF+
Sbjct: 351 ANPR----TSIVIGGYQLEDNLLQFDLATSRLGFS 381
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 100/403 (24%), Positives = 161/403 (39%), Gaps = 63/403 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS ++W C RC + VD + + K S+
Sbjct: 72 GLYFAKIGIGTPSKDYYVQV-DTGSDILWVNCAGCDRCPTKSDLGVD---LTLYDMKAST 127
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+S +GC + CS GP GC P L C Y + YG G T G + + +
Sbjct: 128 TSDAVGCDDNFCSLYDGP-----LPGCKP-----GLQCL-YSVLYGDGSSTTGYFVQDFV 176
Query: 221 RFPS-----KTVP---NFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLG----- 260
++ +T P + GC S GI GFG+++ S+ SQL
Sbjct: 177 QYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKV 236
Query: 261 LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
K FS+CL D+ + G+ P ++ TP +N Y V
Sbjct: 237 KKVFSHCL-----DNVDGGGIFAI------GEVVEPKVNITPLVQNQA--------HYNV 277
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
+++I VG + +P G G I+DSG+T + ++ + ++ + Q +
Sbjct: 278 VMKEIEVGGDPLDVPSDAFESGD--RKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDL- 334
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
R VE+ CFD +G P + L F + + P Y V C I +
Sbjct: 335 RLHTVEQA---FTCFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWC-IGWQ 390
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
++ A G+ +LGD L N + +DL G+ + C+
Sbjct: 391 NSGAQTKDGK-DLTLLGDLVLSNKLVVYDLEKQGIGWVEYNCS 432
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 114/427 (26%), Positives = 172/427 (40%), Gaps = 54/427 (12%)
Query: 65 SRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDT 124
SR + +K + S++ + + +L S+ G Y +++ GTP + IFDT
Sbjct: 114 SRVDSIHSKLSKDSGLSDVKATAATTLPAKDGSIIGSGNYFVTVGLGTPKK-DFSLIFDT 172
Query: 125 GSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESR 184
GS L W C CV + + F P +S+S I C + C + +
Sbjct: 173 GSDLTWTQCEP---CVKSCYNQ----KEAIFNPSQSTSYANISCGSTLCDSL--ASATGN 223
Query: 185 CKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETLRFPSKTVPN-FLAGC---SILSD 239
C+ + TC Y +QYG F+ G E L + V N F GC +
Sbjct: 224 IFNCA--SSTCV-----YGIQYGDSSFSIGFFGKEKLSLTATDVFNDFYFGCGQNNKGLF 276
Query: 240 RQPAGIAGFGRSSESLPSQLGL---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTP 296
AG+ G GR SL SQ K FSYCL S ++ G S +
Sbjct: 277 GGAAGLLGLGRDKLSLVSQTAQRYNKIFSYCLPSSSSSTGFLT----------FGGSTSK 326
Query: 297 GLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGST 356
S+TP GSS FY + L I VG + + I P G I+DSG+
Sbjct: 327 SASFTPLATISGGSS-----FYGLDLTGISVGGRKLAIS-----PSVFSTAGTIIDSGTV 376
Query: 357 FTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGA 416
T + + A++ F + M Y A + S L CFD S ++ +P++ L F GG
Sbjct: 377 ITRLPPAAYSALSSTFRKLMSQYPAAPAL---SILDTCFDFSNHDTISVPKIGLFFSGGV 433
Query: 417 KMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFG 476
+ + F + +CL F N+ + I G+ Q + + +D A R G
Sbjct: 434 VVDIDKTGIFYVNDLTQVCLA-FAGNSDASDVA-----IFGNVQQKTLEVVYDGAAGRVG 487
Query: 477 FAKQKCA 483
FA C+
Sbjct: 488 FAPAGCS 494
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 100/403 (24%), Positives = 162/403 (40%), Gaps = 64/403 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS ++W C RC + VD + + K S+
Sbjct: 153 GLYFAKIGIGTPSKDYYVQV-DTGSDILWVNCAGCDRCPTKSDLGVD---LTLYDMKAST 208
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+S +GC + CS GP GC P L C Y + YG G T G + + +
Sbjct: 209 TSDAVGCDDNFCSLYDGP-----LPGCKP-----GLQCL-YSVLYGDGSSTTGYFVQDFV 257
Query: 221 RFPS-----KTVP---NFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLG----- 260
++ +T P + GC S GI GFG+++ S+ SQL
Sbjct: 258 QYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKV 317
Query: 261 LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
K FS+CL D+ + G+ P ++ TP +N Y V
Sbjct: 318 KKVFSHCL-----DNVDGGGIFAI------GEVVEPKVNITPLVQNQA--------HYNV 358
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
+++I VG + +P G G I+DSG+T + ++ + ++ + Q +
Sbjct: 359 VMKEIEVGGDPLDVPSDAFESGD--RKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDL- 415
Query: 381 RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFT 440
R VE+ CFD +G P + L F + + P Y L +E I +
Sbjct: 416 RLHTVEQAF---TCFDYTGNVDDGFPTVTLHFDKSISLTVYPHEY--LFQHEFEWCIGWQ 470
Query: 441 DNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
++ A G+ +LGD L N + +DL G+ + C+
Sbjct: 471 NSGAQTKDGK-DLTLLGDLVLSNKLVVYDLEKQGIGWVEYNCS 512
>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
vinifera]
Length = 451
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 104/389 (26%), Positives = 148/389 (38%), Gaps = 49/389 (12%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + GTP Q DT S + W PC C+ C+ F S++
Sbjct: 101 YIVRAKIGTPAQTML-MAMDTSSDVAWIPCNG---CLGCSST--------LFNSPASTTY 148
Query: 164 QLIGCQNPKCSWIF---GPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETL 220
+ +GCQ +C + P + S P TC S+ L YG A L +T+
Sbjct: 149 KSLGCQAAQCKQVLHLLSPLLTSPSVVPKP---TCGGGVCSFNLTYGGSSLAANLSQDTI 205
Query: 221 RFPSKTVPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFD 274
+ VP + GC L + G+ S S L FSYCL S F
Sbjct: 206 TLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FK 263
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
S +L L GP + + YTP KNP S Y+V L + VG + V +
Sbjct: 264 SLNFSGSLRL--GPVGQPKR---IKYTPLLKNPRRPS-----LYFVNLMAVRVGRRVVDV 313
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC 394
P G I DSG+ FT + P + AV F ++G R V G C
Sbjct: 314 PPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVG---RNLTVTSLGGFDTC 370
Query: 395 FDISGKKSVYLPELILKFKGGAKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPA 453
+ + + P + F G + LPP+N CL + AA P
Sbjct: 371 YTV----PIAAPTITFMFT-GMNVTLPPDNLLIHSTAGSTTCLAM----AAAPDNVNSVL 421
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++ + Q QN L +D+ N R G A++ C
Sbjct: 422 NVIANLQQQNHRLLYDVPNSRLGVARELC 450
>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 108/469 (23%), Positives = 179/469 (38%), Gaps = 72/469 (15%)
Query: 27 SSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNIGSN 86
+S +T T+ LT L +H + P++ S +R R L + D S
Sbjct: 53 NSPSTSTIRLTILHREHPCAPASKRPVRRSPSALQEYHTRVRRLANRLSSCPADEATAS- 111
Query: 87 YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
LI Y Y + GTP + + DT SSL W C P
Sbjct: 112 ---GLIFANGVPWDYYSYVTQVQLGTPAKTHNVLV-DTASSLSWVGCE----------PC 157
Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
++ IP F P SS+ +++GC + C+ + P+ K C + C SY Y
Sbjct: 158 INACLIPTFNPNASSTYKVVGCGSALCNAV--PSATMARKSCMAPTEGC-----SYRQSY 210
Query: 207 -GLGFTAGLLLSETLRFPSKTVPNFLAGCSIL---SDRQPAGIAGFGRSSESLPSQLGL- 261
+ G++ S+TL + + F+ GC L + +GI G + SL SQ+ +
Sbjct: 211 HDYSLSVGVVSSDTLTYGLGS-QKFIFGCCNLFRGVGGRYSGILGMSVNKFSLFSQMTVG 269
Query: 262 ---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
+ SYC P + + G D L +TP Y + G Y
Sbjct: 270 HRYRAMSYCF------PHPRNQGFL---QFGRYDEHKSLLRFTPLYID--------GNNY 312
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGV--IVDSGSTFTFMEGPLFEAVAKEFIRQM 376
+V + ++V + + + S GN + D+G+ +T + LF +++ +
Sbjct: 313 FVHVSNVMVETMSLDVQ-------SSGNQTMRCFFDTGTPYTMLPQSLFVSLSDTVGNLV 365
Query: 377 GNYSRAADVEKKSGLRPCFDISG---KKSVYLPELILKFKGGAKMALPPENYFALVGNEV 433
Y R S + CF G + +Y+P + ++F+ GA++ L E+ + V
Sbjct: 366 EGYYRVG----ASTGQTCFQADGNWIEGDLYMPTVKIEFQNGARITLNSEDLMFMEEPNV 421
Query: 434 LCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
CL + G I+LG L + DL G Q C
Sbjct: 422 FCLAF--------KMNDGGDIVLGSRHLMGVHTVVDLEMMTMGLRGQGC 462
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 97/404 (24%), Positives = 160/404 (39%), Gaps = 64/404 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+PP + DTGS ++W C C + VD + + PK SS
Sbjct: 71 GLYYARIGIGSPPNDFHVQV-DTGSDILWVNCVGCSNCPKKSDIGVD---LQLYNPKSSS 126
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETL 220
+S LI C P CS + ++ GC P L C Y + YG G TAG +++ +
Sbjct: 127 TSTLITCDQPFCSATY----DAPIPGCKP-----DLLC-QYKVIYGDGSATAGYFVNDYI 176
Query: 221 RFP--------SKTVPNFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQLGL---- 261
+ S+T + + GC S GI GFG+++ S+ SQL
Sbjct: 177 QLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKV 236
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
K F++CL S +S + G+ P L TP N Y V
Sbjct: 237 KKIFAHCLDS-------ISGGGIF----AIGEVVEPKLXNTPVVPNQA--------HYNV 277
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYS 380
L + VG + +P + G I+DSG+T ++ ++ + ++ + +
Sbjct: 278 VLNGVKVGDTALDLPLGLF--ETSYKRGAIIDSGTTLAYLPESIYLPLMEKILGAQPDLK 335
Query: 381 -RAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
R D + CF P + KF+ + + P Y + ++V C +
Sbjct: 336 LRTVDDQ-----FTCFVFDKNVDDGFPTVTFKFEESLILTIYPHEYLFQIRDDVWC--VG 388
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N+ + +LGD LQN + ++L N G+ + C+
Sbjct: 389 WQNSGAQSKDGNEVTLLGDLVLQNKLVYYNLENQTIGWTEYNCS 432
>gi|340810959|gb|AEK75406.1| S5 [Oryza sativa]
gi|340810971|gb|AEK75412.1| S5 [Oryza rufipogon]
Length = 357
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 112/405 (27%), Positives = 170/405 (41%), Gaps = 76/405 (18%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC-VDCNFPNVDPSRIPAFIPKRSSSSQ 164
+++S G PP + I DTGS+L W C C V C+ + P F P RS +S+
Sbjct: 1 MAVSLGKPPVVNLVAI-DTGSTLSWVQCQP---CAVHCHTQSAKAG--PIFDPGRSYTSR 54
Query: 165 LIGCQNPKCSWIFGPNVESRCK--GCSPRNKTCPLACPSYLLQYGLG--FTAGLLLSETL 220
+ C + KC P + R + C + +C +Y + YG G ++ G ++++TL
Sbjct: 55 RVRCSSVKCG---EPRYDLRLQQANCMEKEDSC-----TYSVTYGNGWAYSVGKMVTDTL 106
Query: 221 RFPSKTVPNFLAGCS--ILSDRQPAGIAGFGRSSESLPSQLG-------LKKFSYCLLSR 271
R + + + GCS + AGI GFG SS S QL K FSYCL +
Sbjct: 107 RI-GDSFMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPT- 164
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
D ++L G D YTP ++ S Y + + +I +
Sbjct: 165 ---DETKPGYMIL----GRYDRAAMDGGYTPLFR------SINRPTYSLTMEMLIANGQR 211
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN--YSRAADVEKKS 389
+ S + IVDSG+ T + F + K + M + Y R + ++S
Sbjct: 212 LVTSSSEM----------IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQES 261
Query: 390 GLRPCF----DISGKKSVY--------LPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
+ C+ D SG LP L + F GGA +ALPP N F + LC+
Sbjct: 262 YI--CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMT 319
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A PAL + ILG+ ++F FD+ +FGF C
Sbjct: 320 F----AQNPAL---RSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 357
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 110/473 (23%), Positives = 171/473 (36%), Gaps = 89/473 (18%)
Query: 56 LHSLASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY---GGYSISLSFGT 112
L LA S R + + + + +++ GS S + + PL+ +Y G Y + GT
Sbjct: 45 LADLARSDRQRMAFIASHGRRRARETAAGS--SAAAFEMPLTSGAYTGIGQYFVRFRVGT 102
Query: 113 PPQASTPFIF--DTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQN 170
P Q PF+ DTGS L W C R N AF P+ S + I C +
Sbjct: 103 PAQ---PFLLVADTGSDLTWVKC----RRPAANSSESGSGSGRAFRPEDSRTWAPISCAS 155
Query: 171 PKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVP-- 228
C+ K TCP P Y + G T+ S T+
Sbjct: 156 DTCT-----------KSLPFSLATCP--TPGSPCAYDYRYKDGSAARGTVGTESATIALS 202
Query: 229 ------------NFLAGCSI--------LSDRQPAGIAGFGRSSESLPSQLGLK---KFS 265
+ GC+ +SD G+ G S S S + +FS
Sbjct: 203 GRGREERKAKLKGLVLGCTSSYTGPSFEVSD----GVLSLGYSDVSFASHAASRFAGRFS 258
Query: 266 YCLLSRKFDDAPVSSNLVLDTGPGSGD--------------SKTPGLSYTPFYKNPVGSS 311
YCL+ +P ++ L GP + + P+
Sbjct: 259 YCLVDHL---SPRNATSYLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLD 315
Query: 312 SAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKE 371
FY V ++ + V + +KIP + V D GGVI+DSG++ T + P + AV
Sbjct: 316 RRMRPFYDVAVKAVSVAGQFLKIPRA--VWDVDAGGGVILDSGTSLTVLAKPAYRAVVAA 373
Query: 372 FIRQMGNYSRAADVEKKSGLRPCFD-ISGKKSVYLPELILKFKGGAKMALPPENYFALVG 430
+ R C++ S V LP++ + F G A++ P ++Y
Sbjct: 374 LSEGLAGLPRV----TMDPFEYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVIDAA 429
Query: 431 NEVLCLILFTDNAAGPALGRGPAI-ILGDFQLQNFYLEFDLANDRFGFAKQKC 482
V C+ G G P I ++G+ Q EFD+ N R F + +C
Sbjct: 430 PGVKCI--------GLQEGPWPGISVIGNILQQEHLWEFDIKNRRLKFQRSRC 474
>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
Length = 649
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 105/431 (24%), Positives = 165/431 (38%), Gaps = 92/431 (21%)
Query: 97 SVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC-VDCNFPNVDPSRIPAF 155
SV +G Y +++ G P + I DTGS+L + PC + +C DP
Sbjct: 105 SVKEHGYYYANIALGDPSPRTFQVIVDTGSTLTYVPCATCAKCGTHTGGTRFDP------ 158
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGL 214
+ + + CQ +C GP + + +G + T Y Y G +G
Sbjct: 159 ------TGKWLTCQEKQCKAAGGPGICAGGRGAAANRCT-------YSRTYAEGSGVSGD 205
Query: 215 LLSETLRFPSKTVP------NFLAGCS-----ILSDRQPAGIAGFGRSS-ESLPSQL--- 259
L+ + + F P + + GC+ + D++ G+ G G + S+P+QL
Sbjct: 206 LVRDKMHFGGDIAPATNGTLDVVFGCTNAESGTIHDQEADGLIGLGNNQFASIPNQLADT 265
Query: 260 -GLKK-FSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF 317
GL + FS C S F+ S L P TP L YT N A +
Sbjct: 266 HGLPRVFSLCFGS--FEGGGALSFGRLPATP-----HTPPLVYTDMRVN-----EAHPAY 313
Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMG 377
Y V + +G V P V G ++DSG+TFT++ +F A A +
Sbjct: 314 YVVSTAAMKIGDVAVATPSDLAV-----GYGTVMDSGTTFTYVPTKVFHATAAALDAAVT 368
Query: 378 NYSRAADVEKKSGLRP----------CFDISGKKSV-----------YLPELILKFKG-G 415
++ EKK P CF G + Y P L + F G G
Sbjct: 369 TNAKP---EKKLAKVPGPDPSYPDDVCFQREGATEIEPIVTMANLGEYYPPLTIAFDGEG 425
Query: 416 AKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFD--LA 471
A + LPP NY + G + CL + + G ++G +++ +E+D +
Sbjct: 426 ASLVLPPSNYLFVHGKKPGAFCLGVMDNKQQG--------TLIGGISVRDVLVEYDKTVG 477
Query: 472 NDRFGFAKQKC 482
R GFA C
Sbjct: 478 GGRIGFAATDC 488
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 99/381 (25%), Positives = 154/381 (40%), Gaps = 70/381 (18%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G + + ++FGTPPQ T I DTGSS+ W C RC+ + + DPS
Sbjct: 160 GNFLVDVAFGTPPQKFT-LILDTGSSITWTQCKPCVRCLKASRRHFDPS----------- 207
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETL 220
++ G + S +Y + YG T+ G +T+
Sbjct: 208 ---------ASLTYSLGSCIPSTVGN-------------TYNMTYGDKSTSVGNYGCDTM 245
Query: 221 RFPSKTV-PNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLG---LKKFSYCLLSRK 272
V P F GC ++ G+ G G+ S SQ K FSYCL
Sbjct: 246 TLEHSDVFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLP--- 302
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHV 332
++ + S L + S++ L +T P S +Y+V L I VG+K +
Sbjct: 303 -EEDSIGSLLFGE----KATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRL 357
Query: 333 KIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG-L 391
IP S + G I+DSG+ T + + A+ F + M Y + KK L
Sbjct: 358 NIPSSVFA-----SPGTIIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDIL 412
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEV--LCLILFTDNAAGPALG 449
C+++SG+K V LPE++L F GA + L + + GN+ LCL G
Sbjct: 413 DTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKR--VIWGNDASRLCLAF---------AG 461
Query: 450 RGPAIILGDFQLQNFYLEFDL 470
I+G+ Q + + +D+
Sbjct: 462 NSELTIIGNRQQVSLTVLYDI 482
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 105/401 (26%), Positives = 161/401 (40%), Gaps = 68/401 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y +++S GTPP S I DTGS L+W C C DC + V+ P F PK+S
Sbjct: 92 GSYLMNISLGTPP-VSMLGIADTGSDLIWRQC---LPCDDC-YKQVE----PLFDPKKSK 142
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLR 221
+ + +GC N C + + C N C S +T L SET
Sbjct: 143 TYKTLGCNNDFCQDL------GQQGSCGDDN-----TCTSSYSYGDQSYTRRDLSSETFT 191
Query: 222 FPSK-----TVPNFLAGCSILS----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLL 269
S + P GC + + + +G+ G G SL QL K +FSYCL+
Sbjct: 192 IGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLV 251
Query: 270 SRKFDDAPVSSNLVLDTGP---GSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
D+ SS + GSG TP + TP FYY+ L +
Sbjct: 252 PLS-SDSTASSKINFGKSAVVSGSGTVSTPLIKGTP------------DTFYYLTLEGMS 298
Query: 327 VGSKHVKIP---YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
+GS+ V + P + +I+DSG+T T + + + + +G +
Sbjct: 299 LGSEKVAFKGFSKNKSSPAAAEESNIIIDSGTTLTLLPRDFYTDMESALTKVIGGQTT-- 356
Query: 384 DVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF-TDN 442
+ + C+ SG K + +P + F GA + LPP N F +++C + + N
Sbjct: 357 -TDPRGTFSLCY--SGVKKLEIPTITAHFI-GADVQLPPLNTFVQAQEDLVCFSMIPSSN 412
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A I G+ NF + +DL N++ F C
Sbjct: 413 LA----------IFGNLSQMNFLVGYDLKNNKVSFKPTDCT 443
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 104/426 (24%), Positives = 176/426 (41%), Gaps = 94/426 (22%)
Query: 87 YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
+ N+ ++ + + G Y+ L GTPPQ I D+GS++ + PC+S +C +
Sbjct: 71 HPNARMRLHDDLLTNGYYTTRLYIGTPPQEFA-LIVDSGSTVTYVPCSSCEQCGN----- 124
Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
+ P F P SSS + C N C+ C K C +Y QY
Sbjct: 125 ---HQDPRFQPDLSSSYSPVKC-NVDCT-------------CDSDKKQC-----TYERQY 162
Query: 207 G-LGFTAGLLLSETLRF--PSKTVPNF-LAGCS-----ILSDRQPAGIAGFGRSSESLPS 257
+ ++G+L + + F S+ P + GC L + GI G GR S+
Sbjct: 163 AEMSSSSGVLGEDIVSFGRESELKPQHAIFGCENSETGDLFSQHADGIMGLGRGQLSIMD 222
Query: 258 QLGLK-----KFSYCLLSRKFDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNPV 308
QL K FS C +D G G+ G P + ++ +P+
Sbjct: 223 QLVEKGVISDSFSLCYGG-------------MDIGGGAMVLGGMLAPPDMIFS--NSDPL 267
Query: 309 GSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV 368
S +Y + L++I V K +++ + G ++DSG+T+ ++ F A
Sbjct: 268 RSP-----YYNIELKEIHVAGKALRVESRIF----NSKHGTVLDSGTTYAYLPEQAFVAF 318
Query: 369 AKEFIRQMGNYSRAADVEKKSGLRP-----CFDISGKKSVYL----PELILKFKGGAKMA 419
KE + S+ ++K G P CF +G+ L P++ + F G K++
Sbjct: 319 -KEAVT-----SKVHSLKKIRGPDPSYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQKLS 372
Query: 420 LPPENYFALVG--NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
L PENY + CL +F + G+ P +LG ++N + +D N++ GF
Sbjct: 373 LTPENYLFRHSKVDGAYCLGVFQN-------GKDPTTLLGGIIVRNTLVTYDRHNEKIGF 425
Query: 478 AKQKCA 483
K C+
Sbjct: 426 WKTNCS 431
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 103/412 (25%), Positives = 170/412 (41%), Gaps = 96/412 (23%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y+ L GTPPQ I DTGS++ + PC++ C C + P F P S
Sbjct: 87 GYYTTRLWIGTPPQRFA-LIVDTGSTVTYVPCST---CEHCG-----RHQDPKFQPDLSE 137
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ Q + C P C+ C C Y QY + ++G+L + +
Sbjct: 138 TYQPVKC-TPDCN-------------CDGDTNQC-----MYDRQYAEMSSSSGVLGEDVV 178
Query: 221 RFP--SKTVPN-FLAGCS-----ILSDRQPAGIAGFGRSSESLPSQLGLKK-----FSYC 267
F S+ P + GC L ++ GI G GR S+ QL KK FS C
Sbjct: 179 SFGNLSELAPQRAVFGCENDETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLC 238
Query: 268 LLSRKFDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
+D G G+ G S + +T + +P S +Y + L+
Sbjct: 239 YGG-------------MDVGGGAMILGGISPPEDMVFT--HSDPDRSP-----YYNINLK 278
Query: 324 QIIVGSKHVKI-PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
++ V K +++ P + DG G ++DSG+T+ ++ F A + +++ +
Sbjct: 279 EMHVAGKKLQLNPKVF-----DGKHGTVLDSGTTYAYLPETAFLAFKRAIMKERNS---- 329
Query: 383 ADVEKKSGLRP-----CFDISGKKSVYL----PELILKFKGGAKMALPPENYFALVGNE- 432
+++ +G P CF +G L P + + F+ G K++L PENY
Sbjct: 330 --LKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENGHKLSLSPENYLFRHSKVR 387
Query: 433 -VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
CL +F++ GR P +LG ++N + +D N + GF K C+
Sbjct: 388 GAYCLGVFSN-------GRDPTTLLGGIFVRNTLVMYDRENSKIGFWKTNCS 432
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 101/420 (24%), Positives = 170/420 (40%), Gaps = 82/420 (19%)
Query: 87 YSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPN 146
+ N+ ++ + + G Y+ L GTPPQ I D+GS++ + PC S +C +
Sbjct: 72 HPNARMRLHDDLLTNGYYTTRLYIGTPPQEFA-LIVDSGSTVTYVPCASCEQCGN----- 125
Query: 147 VDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQY 206
+ P F P SSS + C N C+ C K C +Y QY
Sbjct: 126 ---HQDPRFQPDLSSSYSPVKC-NVDCT-------------CDSDKKQC-----TYERQY 163
Query: 207 G-LGFTAGLLLSETLRF--PSKTVPNFLA-GCS-----ILSDRQPAGIAGFGRSSESLPS 257
+ ++G+L + + F S+ P GC L + GI G GR S+
Sbjct: 164 AEMSSSSGVLGEDIVSFGRESELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMD 223
Query: 258 QL---GLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF 314
QL G+ S+ L D +VL P D S++ ++P
Sbjct: 224 QLVEKGVISDSFSLCYGGMDIG--GGAMVLGGVPAPSDMV---FSHSDPLRSP------- 271
Query: 315 GEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR 374
+Y + L++I V K +++ + G ++DSG+T+ ++ F A
Sbjct: 272 --YYNIELKEIHVAGKALRVDSRVF----NSKHGTVLDSGTTYAYLPEQAFVAFKDAVT- 324
Query: 375 QMGNYSRAADVEKKSGLRP-----CFDISGKKSVYL----PELILKFKGGAKMALPPENY 425
S+ ++K G P CF +G+ L P++ + F G K++L PENY
Sbjct: 325 -----SKVHSLKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQKLSLTPENY 379
Query: 426 FALVG--NEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ CL +F + G+ P +LG ++N + +D N++ GF K C+
Sbjct: 380 LFRHSKVDGAYCLGVFQN-------GKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNCS 432
>gi|316927704|gb|ADU58605.1| xyloglucan-specific endoglucanase inhibitor 4 [Solanum tuberosum]
Length = 440
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 73/259 (28%), Positives = 114/259 (44%), Gaps = 38/259 (14%)
Query: 244 GIAGFGRSSESLPSQLGL-----KKFSYCLLSRK-------FDDAPVSSNLVLDTGPGSG 291
GI G G P+QL +KF+ CL S F D+P + L PG
Sbjct: 177 GILGLGNGYVGFPTQLANAFSVPRKFAICLTSSTTSRGVIFFGDSPY---VFL---PGMD 230
Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGE-----FYYVGLRQIIVGSKHVKIPYSYLVPGSDGN 346
SK L YTP KNPV +S ++ E Y++G+ I + V I + L DG
Sbjct: 231 VSKR--LVYTPLLKNPVSTSGSYFEGEPSTDYFIGVTSIKINGNVVPINTTLLNITKDGK 288
Query: 347 GGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLP 406
GG + + +T +E ++ A+ K F++ + R V + + C++ + S +
Sbjct: 289 GGTKISTVDPYTKLETSIYNALTKAFVKSLAKVPRVKPV---APFKVCYNRTSLGSTRVG 345
Query: 407 ------ELILKFKGG-AKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDF 459
EL+L K + N + N+VLCL F D G +I++G
Sbjct: 346 RGVPPIELVLGNKNATTSWTIWGVNSMVAMNNDVLCL-GFLD--GGVEFEPTTSIVIGAH 402
Query: 460 QLQNFYLEFDLANDRFGFA 478
Q+++ L+FD+AN R GF
Sbjct: 403 QIEDNLLQFDIANKRLGFT 421
>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 437
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 107/401 (26%), Positives = 153/401 (38%), Gaps = 71/401 (17%)
Query: 98 VHSYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIP 157
V + G Y + + GTP Q + DT + W PC+ C C+ + +
Sbjct: 91 VLNIGNYVVRVKLGTPGQFMF-MVLDTSNDAAWVPCSG---CTGCSSTTFSTNTSSTY-- 144
Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLL---QYG--LGFTA 212
+ C +C+ + R +CP S + YG F+A
Sbjct: 145 ------GSLDCSMAQCTQV--------------RGFSCPATGSSSCVFNQSYGGDSSFSA 184
Query: 213 GLLLSETLRFPSKTVPNFLAGC--SILSDRQPAGIAGFGR--------SSESLPSQLGLK 262
L+ ++LR + +PNF GC SI P S SL S L
Sbjct: 185 -TLVEDSLRLVNDVIPNFAFGCINSISGGSVPPQGLLGLGRGPLSLIAQSGSLYSGL--- 240
Query: 263 KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGL 322
FSYCL S F S +L L GP +G K+ + YTP +NP S YYV L
Sbjct: 241 -FSYCLPS--FKSYYFSGSLKL--GP-AGQPKS--IRYTPLLRNPHRPS-----LYYVNL 287
Query: 323 RQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRA 382
+ VG V I L + G I+DSG+ T P++ A+ EF +Q+ A
Sbjct: 288 TGVSVGRTLVPIAPELLAFNPNTGAGTIIDSGTVITRFVQPIYTAIRDEFRKQV-----A 342
Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPEN-YFALVGNEVLCLILFTD 441
CF + P + L F G + LP EN + CL +
Sbjct: 343 GPFSSLGAFDTCF--AATNEAVAPAVTLHFT-GLNLVLPMENSLIHSSAGSLACLAM--- 396
Query: 442 NAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
AA P ++ + Q QN L FD+ N R G A++ C
Sbjct: 397 -AAAPNNVNSVLNVIANLQQQNLRLLFDVPNSRLGIARELC 436
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 114/486 (23%), Positives = 192/486 (39%), Gaps = 73/486 (15%)
Query: 6 FSLICLFSLLILLFTTDAGAGSSAATVTVPLTPLSTKHYLHHSDSDPLKILHSLASSSLS 65
FS++ L + F + A ++ + S K L+H + +++ S++
Sbjct: 4 FSVLTLIFFYLCCFIYFSHASKKGLSIEMIHRDFS-KSPLYHPTVTKFQRAYNVVHRSIN 62
Query: 66 RARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTG 125
R + TK+ ++ N S + L G Y IS S GTPP F+ DTG
Sbjct: 63 RVNYF-------TKEFSLNKNQPVSTLTPEL-----GEYLISYSVGTPPFKVYGFM-DTG 109
Query: 126 SSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRC 185
S++VW C C + P F P +SSS + I C + C +
Sbjct: 110 SNIVWLQCQPCNTCFN--------QTSPIFNPSKSSSYKNIPCTSSTCK-----DTNDTH 156
Query: 186 KGCSPRNKTCPLACPSYLLQY-GLGFTAGLLLSETLRFPSKT-----VPNFLAGC---SI 236
CS C Y + Y G + G L +++L S + PN + GC ++
Sbjct: 157 ISCSNGGDVC-----EYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNIVIGCGHINV 211
Query: 237 LSDR-QPAGIAGFGRSSESLPSQLGL----KKFSYCLLSRKFDDAPVSSNLVLDTGPGSG 291
L D Q +G+ G GR SL Q+G KFSYCL+ D+ SS L+
Sbjct: 212 LQDNSQSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYN-SDSNSSSKLIF------- 263
Query: 292 DSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIV 351
+ +S P+ + +Y++ L VG+ +I Y S N +++
Sbjct: 264 -GEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNN--RIEYGERSNASTQN--ILI 318
Query: 352 DSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILK 411
DSG+ T + LF + ++ Q R + L C++ +GK+ + +P++
Sbjct: 319 DSGTPLTMLPN-LFLSKLVSYVAQEVKLPRIEPPDHH--LSLCYNTTGKQ-LNVPDITAH 374
Query: 412 FKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
F GA + L F + ++C + N I G+ N +++DL
Sbjct: 375 FN-GADVKLNSNGTFFPFEDGIMCFGFISSNGLE---------IFGNIAQNNLLIDYDLE 424
Query: 472 NDRFGF 477
+ F
Sbjct: 425 KEIISF 430
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 99/395 (25%), Positives = 154/395 (38%), Gaps = 60/395 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--FIPKR 159
G Y + GTPP+ DTGS L+W C + C+ C P +IP + K
Sbjct: 34 GLYFTQVQLGTPPRTYN-LQVDTGSDLLWVNC---HPCIGC--PAFSDLKIPIVPYDVKA 87
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSE 218
S+SS + C +P C+ I + GC+ +N+ Y QYG G T G L+ +
Sbjct: 88 SASSSKVPCSDPSCTLI----TQISESGCNDQNQC------GYSFQYGDGSGTLGYLVED 137
Query: 219 TLRFPSKTVPNFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
L + + GC S+R GI GFG S S SQL + + + +
Sbjct: 138 VLHYMVNATATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAH 197
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
D +++ G+ P + YTP + Y V L+ I V + +
Sbjct: 198 CLDGGERGGGILV-----LGNVIEPDIQYTPLVP--------YMYHYNVVLQSISVNNAN 244
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
+ I +D G I DSG+T ++ ++A +++A + L
Sbjct: 245 LTIDPKLF--SNDVMQGTIFDSGTTLAYLPDEAYQA-----------FTQAVSLVVAPFL 291
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---ALVGNE-VLCLILFTDNAAGPA 447
+S P ++L F+ GA M L P Y A N + C+ + G A
Sbjct: 292 LCDTRLSRFIYKLFPNVVLYFE-GASMTLTPAEYLIRQASAANAPIWCMGW---QSMGSA 347
Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I GD L+N + +DL R G+ C
Sbjct: 348 ESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 99/395 (25%), Positives = 155/395 (39%), Gaps = 60/395 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPA--FIPKR 159
G Y + GTPP+ + DTGS L+W C + C+ C P +IP + K
Sbjct: 34 GLYFTQVQLGTPPRTYNLQV-DTGSDLLWVNC---HPCIGC--PAFSDLKIPIVPYDVKA 87
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSE 218
S+SS + C +P C+ I + GC+ +N+ Y QYG G T G L+ +
Sbjct: 88 SASSSKVPCSDPSCTLI----TQISESGCNDQNQC------GYSFQYGDGSGTLGYLVED 137
Query: 219 TLRFPSKTVPNFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSR 271
L + + GC S+R GI GFG S S SQL + + + +
Sbjct: 138 VLHYMVNATATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAH 197
Query: 272 KFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKH 331
D +++ G+ P + YTP + Y V L+ I V + +
Sbjct: 198 CLDGGERGGGILV-----LGNVIEPDIQYTPLVP--------YMSHYNVVLQSISVNNAN 244
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGL 391
+ I +D G I DSG+T ++ ++A +++A + L
Sbjct: 245 LTIDPKLF--SNDVMQGTIFDSGTTLAYLPDEAYQA-----------FTQAVSLVVAPFL 291
Query: 392 RPCFDISGKKSVYLPELILKFKGGAKMALPPENYF---ALVGNE-VLCLILFTDNAAGPA 447
+S P ++L F+ GA M L P Y A N + C+ + G A
Sbjct: 292 LCDTRLSRFIYKLFPNVVLYFE-GASMTLTPAEYLIRQASAANAPIWCMGW---QSMGSA 347
Query: 448 LGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I GD L+N + +DL R G+ C
Sbjct: 348 ESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382
>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 97/392 (24%), Positives = 162/392 (41%), Gaps = 78/392 (19%)
Query: 120 FIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGP 179
I DTGS+ + PC RC + D R F + + C + +
Sbjct: 53 LIVDTGSARTYVPCKGCARCGEHAHGYYDYDRSMEF--------ERLDCGEASDATL--- 101
Query: 180 NVESRCKGCSPRNKTCPLACPSYLLQYGLGFTA-GLLLSETLRFPSKTVPNFLA-GC--- 234
E KG + C SY++ Y G ++ G ++ + +R T+ LA GC
Sbjct: 102 -CEETMKGTCQSDGRC-----SYVVSYAEGSSSRGYVVRDRVRLGEGTLSAMLAFGCEEA 155
Query: 235 --SILSDRQPAGIAGFGRSSESLPSQL---GLKK--FSYCLLSRKFDDAPVSSNLVLDTG 287
+ + +++ G+ GFGR + ++ +QL GL + FS+C+ + ++ VL G
Sbjct: 156 ETNAIYEQKADGLFGFGRGTATVHAQLASAGLIENVFSFCV------EGFGANGGVLTLG 209
Query: 288 PGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNG 347
+ P L+ TP +P + F+ V +G ++ SY
Sbjct: 210 RFDFGADAPALARTPLVADPANPA-----FHNVRTSSWKLGDSLIEHLNSYTT------- 257
Query: 348 GVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP-----CFDISGKK- 401
+DSG+TFTF+ +V F ++ + A +E +G P C+ +S
Sbjct: 258 --TLDSGTTFTFVP----RSVWVSFKTRLDTQATQAGLEIVAGPDPQYDDVCYGVSAAAM 311
Query: 402 ---------SVYLPELILKFKGGAKMALPPENY-FALVGNEV-LCLILFTDNAAGPALGR 450
S + P L + ++GG + L PENY FA N C+ +F A P
Sbjct: 312 NMTLSQSTVSEWFPPLTIAYEGGVSLTLGPENYLFAHETNSAAFCVGIF----ANP---- 363
Query: 451 GPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
I+LG +++ +EFD+AN R G A C
Sbjct: 364 NNQILLGQITMRDTLMEFDVANSRVGMAPANC 395
>gi|108707516|gb|ABF95311.1| hypothetical protein LOC_Os03g17280 [Oryza sativa Japonica Group]
Length = 353
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 43/97 (44%), Positives = 62/97 (63%), Gaps = 1/97 (1%)
Query: 197 LACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILSD-RQPAGIAGFGRSSESL 255
LA + + Y G T LL+S+TLR P +T+ NF+ GCS++S +Q +G+ GF S+
Sbjct: 3 LAADAIGVVYSSGSTTRLLISDTLRTPGRTIRNFVVGCSLMSVYQQSSGLTGFSCGVPSV 62
Query: 256 PSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGD 292
PSQLGL KF Y LL+R+FDD +S+ ++ G G D
Sbjct: 63 PSQLGLTKFFYFLLARRFDDNATASDELILGGAGGKD 99
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 37/67 (55%), Gaps = 11/67 (16%)
Query: 386 EKKSGLRPCFDISGK-KSVYLPELILKFKGGAKMALPPENYFALVG----------NEVL 434
EK GL P +S + K++ LP++ L FKGG+ M LP ENYF + G E +
Sbjct: 124 EKGLGLSPYIAMSSRTKTMELPKISLYFKGGSVMNLPVENYFMVAGPAPSASVPAMAEAI 183
Query: 435 CLILFTD 441
CL + +D
Sbjct: 184 CLAVVSD 190
>gi|168008086|ref|XP_001756738.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691976|gb|EDQ78335.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 174
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 55/192 (28%), Positives = 84/192 (43%), Gaps = 27/192 (14%)
Query: 298 LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTF 357
L +TP K+P+ + FY+V L + V + I L S+GNGG I+D + F
Sbjct: 2 LEFTPLLKHPLVET-----FYFVNLVAVAVNGAKLPISSKVLKMNSEGNGGAILDMSTRF 56
Query: 358 TFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP----CFDISGKKSVYLPELILKFK 413
T F+ + K A + + + P C+ ++ +P + L F+
Sbjct: 57 TRFPNSAFDHLVKAL---------KALIRLPTMVVPRFQLCYSTVNTGTLIIPTVTLIFE 107
Query: 414 GGAKMALPPENYFALVGNE--VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLA 471
G +M LP EN F V + V+CL + N G A ++G Q QNF + D
Sbjct: 108 NGVRMRLPMENTFVSVTEQGDVMCLAMVPGNP-------GTATVIGSAQQQNFLIVIDRE 160
Query: 472 NDRFGFAKQKCA 483
R GFA +CA
Sbjct: 161 ASRLGFAPLQCA 172
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 109/444 (24%), Positives = 172/444 (38%), Gaps = 71/444 (15%)
Query: 60 ASSSLSRARHLKTKTKPKTKDSNIGSNYSNSLIKTPLSVHSY--GGYSISLSFGTPPQAS 117
AS SRAR + + K + S I SNS K P+S S Y + + G+PP
Sbjct: 70 ASVRTSRARGDRIR---KIRSSGI----SNSR-KYPVSRISIIDKVYVMKFNIGSPP-VE 120
Query: 118 TPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCS--- 174
T I DTGS++VW C S C +C +IP F P +SS+ + C + +C
Sbjct: 121 TYAIPDTGSNIVWIQCGSPI-CTNCY-----KQKIPLFNPTKSSTYAIRLCGHRECKQAL 174
Query: 175 WIFGPNVESRCKGCSPRNKTCPLACPSYLLQY-GLGFTAGLLLSETLRFPSKTVP----- 228
W G + GC + C Y + Y F+ G + ++ + FP
Sbjct: 175 WGLGEYL-----GCKSSVQVC-----RYHISYEDHSFSEGTISTDIITFPEHIAEFGNYS 224
Query: 229 -NFLAGCSILSDRQPA---------GIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPV 278
GC + P G+ G G SL QL L +FSYC+ +
Sbjct: 225 LRMFFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASLVGQLTLGQFSYCISTPDVQK--- 281
Query: 279 SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK-IPYS 337
P GL+ + + +++ G + + + I V VK P
Sbjct: 282 ---------PNGTIEIRFGLAASISGHSTALANNLEGWYIFQNVDGIYVDDTKVKGYPEW 332
Query: 338 YLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDI 397
G GG+I+DSG+T+T + +A+ E Q+ D S C++
Sbjct: 333 VFQFAEGGIGGLIMDSGTTYTELYFSALDALIGELKEQIELAPDTQD-HSNSNYSLCYNA 391
Query: 398 SGKKSVYLPELILKFKGGAKMALP--PENYFALVGNEVLCLILFTDNAAGPALGRGPAII 455
+ Y+P + LKF + P N + GN+ CL +F G I
Sbjct: 392 ANFLLTYVPAIELKFTDNKEAYFPFTLRNAWIDNGNDQYCLAMF---------GTSGISI 442
Query: 456 LGDFQLQNFYLEFDLANDRFGFAK 479
+G +Q ++ + +DL + F +
Sbjct: 443 IGIYQHRDIKIGYDLKYNLVSFTE 466
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 95/390 (24%), Positives = 149/390 (38%), Gaps = 57/390 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y +L+ GTPPQ ++ I G VW C+ RC + P + S + P+
Sbjct: 28 YMANLTIGTPPQPASAIIHLAGE-FVWTQCSPCRRCFKQDLPLFNRSASSTYRPEP---- 82
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
C C + S C G + C SY ++ G T+G+ ++T
Sbjct: 83 ----CGTALCESV----PASTCSG----DGVC-----SYEVETMFGDTSGIGGTDTFAIG 125
Query: 224 SKTVPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
+ T + GC++ S+ + +G+ G GR+ SL Q+ FSYCL A
Sbjct: 126 TATA-SLAFGCAMDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAPHG--AAGKK 182
Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
S L+L K+ + TP SS Y + L I G + P
Sbjct: 183 SALLLGASAKLAGGKSA--ATTPLVNTSDDSSD-----YMIHLEGIKFGDVIIAPP---- 231
Query: 340 VPGSDGNGGVI-VDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF--- 395
NG V+ VD+ +F+ F+A+ K +G A + CF
Sbjct: 232 -----PNGSVVLVDTIFGVSFLVDAAFQAIKKAVTVAVGAAPMATPTKP---FDLCFPKA 283
Query: 396 --DISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPA 453
S+ LP+++L F+G A + +PP Y GN +CL + +
Sbjct: 284 AAAAGANSSLPLPDVVLTFQGAAALTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELS-- 341
Query: 454 IILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
ILG +N + FDL + F C+
Sbjct: 342 -ILGRLHQENIHFLFDLDKETLSFEPADCS 370
>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
Length = 472
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 110/403 (27%), Positives = 170/403 (42%), Gaps = 72/403 (17%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC-VDCNFPNVDPSRIPAFIPKRSSSSQ 164
+++S G PP + I DTGS+L W C C V C+ + P F P RS +S+
Sbjct: 116 MAVSLGKPPVVNLVAI-DTGSTLSWVQCQP---CAVHCHTQSAKAG--PIFDPGRSYTSR 169
Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG--FTAGLLLSETLRF 222
+ C + KC + ++ + C + +C +Y + YG G ++ G ++++TLR
Sbjct: 170 RVRCSSVKCGEL-RYDLRLQQANCMEKENSC-----TYSVTYGNGWAYSVGKMVTDTLRI 223
Query: 223 PSKTVPNFLAGCS--ILSDRQPAGIAGFGRSSESLPSQLG-------LKKFSYCLLSRKF 273
+ + + GCS + AGI GFG SS S QL K FSYCL +
Sbjct: 224 -GDSFMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPT--- 279
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
D ++L G D YTP ++ S Y + + +I + +
Sbjct: 280 -DETKPGYMIL----GRYDRAAMDGGYTPLFR------SINRPTYSLTMEMLIANGQRLV 328
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN--YSRAADVEKKSGL 391
S + IVDSG+ T + F + K + M + Y R + ++S +
Sbjct: 329 TSSSEM----------IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYI 378
Query: 392 RPCF----DISGKKSVY--------LPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
C+ D SG LP L + F GGA +ALPP N F + LC+
Sbjct: 379 --CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTF- 435
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A PAL + ILG+ ++F FD+ +FGF C
Sbjct: 436 ---AQNPAL---RSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 112/452 (24%), Positives = 170/452 (37%), Gaps = 111/452 (24%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + G+P + I DTGS ++W C + C + +D + F SS
Sbjct: 69 GLYFTKVKMGSPAKEFYVQI-DTGSDILWLNCNTCNNCPKSSGLGID---LNYFDTASSS 124
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
++ L+ C +P CS+ V++ CS + C SY QYG G T+G + + +
Sbjct: 125 TAALVSCSDPVCSYA----VQTATSQCSSQANQC-----SYTFQYGDGSGTSGYYVYDAM 175
Query: 221 RFP--------SKTVPNFLAGCSIL-------SDRQPAGIAGFGRSSESLPSQL---GL- 261
F S + + GCS +++ GI GFG + S+ SQ+ G+
Sbjct: 176 YFDVIMGQSVFSNSSSTVVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMA 235
Query: 262 -KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYV 320
K FS+CL + LVL G+ P + YTP Y +
Sbjct: 236 PKVFSHCLKGQ----GSGGGILVL------GEILEPNIVYTPLVP--------LQPHYNL 277
Query: 321 GLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAV------------ 368
L+ I V + + I G+ N G IVDSG+T ++ ++
Sbjct: 278 NLQSIAVNGQILPIDQDVFATGN--NRGTIVDSGTTLAYLVQEAYDPFLNAGSPCHFFTH 335
Query: 369 ----AKEFIRQMGNYSRAADVEK----KSGLRPCFD----------------ISGKKSVY 404
+ GN + + V++ + LR IS Y
Sbjct: 336 FNEPTNNIKYEDGNNNHQSRVKRHYYDEVTLRLVLKHSAIITTTVSQFSKPIISKGNQCY 395
Query: 405 L---------PELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALG----RG 451
L P + L F GGA M L PE Y G F D AA +G +
Sbjct: 396 LVPTSLGDIFPLVSLNFMGGASMVLKPEQYLIHYG--------FLDGAAMWCIGFQKVQK 447
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
ILGD L++ +DLAN R G+ C+
Sbjct: 448 GYTILGDLVLKDKIFVYDLANQRIGWTDYDCS 479
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 105/393 (26%), Positives = 163/393 (41%), Gaps = 60/393 (15%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQL 165
+ +S GTP + I DTGS++ W C +Y V C D P F SS+ +
Sbjct: 25 MGISLGTPAVFNLVTI-DTGSTISWVQC--QYCIVHC--YTQDQRAGPTFNTSSSSTYRR 79
Query: 166 IGCQNPKCSWI-FGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSETLRFP 223
+GC C + N+ S GC +C Y L+Y G ++AG L + L
Sbjct: 80 VGCSAQVCHDMHVSQNIPS---GCVEEEDSCI-----YSLRYASGEYSAGYLSQDRLTLA 131
Query: 224 -SKTVPNFLAGCSILSDRQ----PAGIAGFGRSSESLPSQLG----LKKFSYCLLSRKFD 274
S ++ F+ GC SD + AGI GFG S S +Q+ FSYC S + +
Sbjct: 132 NSYSIQKFIFGCG--SDNRYNGHSAGIIGFGNKSYSFFNQIAQLTNYSAFSYCFPSNQEN 189
Query: 275 DAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKI 334
+ L GP DS L+ Y A Y + ++V +++
Sbjct: 190 EG------FLSIGPYVRDSNKLILTQLFDY-------GAHLPVYALQQFDMMVNGMRLQV 236
Query: 335 PYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQM--GNYSRAADVEKKSGLR 392
P +VDSG+ TF+ P+F A+ + + M Y R +D ++
Sbjct: 237 D-----PPVYTTRMTVVDSGTVETFVLSPVFRALDRALTKAMVAEGYVRGSDSKE----- 286
Query: 393 PCFDISGKKSVY--LPELILKFKGGAKMALPPENYFALVGNE-VLCLILFTDNAAGPALG 449
CF +G + LP + +KF + + LP EN F ++ +C D+A P +
Sbjct: 287 ICFHSNGDSVDWSKLPVVEIKFS-RSILKLPAENVFYYETSDGSICSTFQPDDAGVPGVQ 345
Query: 450 RGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
ILG+ ++F + FD+ FGF C
Sbjct: 346 -----ILGNRATRSFRVVFDIQQRNFGFEAGAC 373
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 97/384 (25%), Positives = 155/384 (40%), Gaps = 52/384 (13%)
Query: 106 ISLSFGTP--PQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
++LS G P PQ + DTGS ++W C C + DPS F P +
Sbjct: 103 VNLSIGQPSIPQL---VVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFSPLCKTPC 159
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
GC KC I P S S F +L+ ET
Sbjct: 160 GFKGC---KCDPI--PFTISYVDNSSASGT----------------FGRDILVFETTDEG 198
Query: 224 SKTVPNFLAGCS----ILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVS 279
+ + + + GC SD GI G SL +Q+G +KFSYC+ + P
Sbjct: 199 TSQISDVIIGCGHNIGFNSDPGYNGILGLNNGPNSLATQIG-RKFSYCIGNLA---DPYY 254
Query: 280 SNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYL 339
+ L G G+ G S TPF + FYYV + I VG K + I
Sbjct: 255 NYNQLRLGEGA---DLEGYS-TPF--------EVYHGFYYVTMEGISVGEKRLDIALETF 302
Query: 340 VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPC-FDIS 398
+G GGVI+DSG+T T++ + + E +R + +S + + + + C + I
Sbjct: 303 EMKRNGTGGVILDSGTTITYLVDSAHKLLYNE-VRNLLKWSFRQVIFENAPWKLCYYGII 361
Query: 399 GKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
+ V P + F GA +AL ++F+ +++ C+ + + + P++I G
Sbjct: 362 SRDLVGFPVVTFHFVDGADLALDTGSFFSQ-RDDIFCMTVSPASILNTTI--SPSVI-GL 417
Query: 459 FQLQNFYLEFDLANDRFGFAKQKC 482
Q++ + +DL N F + C
Sbjct: 418 LAQQSYNVGYDLVNQFVYFQRIDC 441
>gi|242092874|ref|XP_002436927.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
gi|241915150|gb|EER88294.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
Length = 484
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 100/391 (25%), Positives = 153/391 (39%), Gaps = 63/391 (16%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y ++ FGTP Q T FDT ++ ++ +C C D AF P SSS
Sbjct: 145 YHVTAGFGTPVQQFT-VGFDTTTT-----GATQLQCKPCA---ADEPCHHAFDPSASSSI 195
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ C +P C + KGCS + T ++ + LL FT L L+ P
Sbjct: 196 AHVPCGSPDCPF---------NKGCSGHSCTLSVSINNTLLGNATFFTDKLTLT-----P 241
Query: 224 SKTVPNFLAGC---SILSDRQPAGIAGFGRSSESL-----PSQLGLKKFSYCLLSRKFDD 275
V +F C D GI R+S SL PS FSYCL S D
Sbjct: 242 WNIVDDFRFVCLEAGFRPDDDSTGILDLSRNSHSLASRAAPSSPDAVAFSYCLPSYPSDV 301
Query: 276 APVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIP 335
L G + +SYTP N G Y V L + +G + +P
Sbjct: 302 G------FLSLGATKPELLGRKVSYTPLRSN-----RHNGNLYVVELVGLGLGGVDLPVP 350
Query: 336 YSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCF 395
+ + GG I++ +TFT+++ ++ A+ EF + M Y A + L C+
Sbjct: 351 RAAIA-----GGGTILELHTTFTYLKPKVYAALRDEFRKSMSQYPVA---PPQGSLDTCY 402
Query: 396 DISGKKSVYLPELILKFKGGAKMALPPEN--YFALVGN--EVLCLILFTDNAAGPALGRG 451
+ + S +P + LKF GGA+ L + YF G+ V CL +
Sbjct: 403 NFTALSSYSVPAVTLKFDGGAEFDLWIDEMMYFPEPGSYFSVGCLAFVAQDGGA------ 456
Query: 452 PAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
++G + + +D+ + GF +C
Sbjct: 457 ---VIGSMAQMSTEVVYDVRGGKVGFVPYRC 484
>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
Length = 472
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 110/403 (27%), Positives = 170/403 (42%), Gaps = 72/403 (17%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC-VDCNFPNVDPSRIPAFIPKRSSSSQ 164
+++S G PP + I DTGS+L W C C V C+ + P F P RS +S+
Sbjct: 116 MAVSLGKPPVVNLVAI-DTGSTLSWVQCQP---CAVHCHTQSAKAG--PIFDPGRSYTSR 169
Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG--FTAGLLLSETLRF 222
+ C + KC + ++ + C + +C +Y + YG G ++ G ++++TLR
Sbjct: 170 RVRCSSVKCGEL-RYDLRLQQANCMEKEDSC-----TYSVTYGNGWAYSVGKMVTDTLRI 223
Query: 223 PSKTVPNFLAGCS--ILSDRQPAGIAGFGRSSESLPSQLG-------LKKFSYCLLSRKF 273
+ + + GCS + AGI GFG SS S QL K FSYCL +
Sbjct: 224 -GDSFMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPT--- 279
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
D ++L G D YTP ++ S Y + + +I + +
Sbjct: 280 -DETKPGYMIL----GRYDRAAMDGGYTPLFR------SINRPTYSLTMEMLIANGQRLV 328
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN--YSRAADVEKKSGL 391
S + IVDSG+ T + F + K + M + Y R + ++S +
Sbjct: 329 TSSSEM----------IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYI 378
Query: 392 RPCF----DISGKKSVY--------LPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
C+ D SG LP L + F GGA +ALPP N F + LC+
Sbjct: 379 --CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTF- 435
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A PAL + ILG+ ++F FD+ +FGF C
Sbjct: 436 ---AQNPAL---RSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 100/408 (24%), Positives = 160/408 (39%), Gaps = 66/408 (16%)
Query: 100 SYGGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKR 159
S G Y + GTP + DTG+ ++W C +C +C + + + K
Sbjct: 69 SVGLYYAKIGIGTPSK-DYYLQVDTGTDMMWVNCI---QCKECPTRSNLGMDLTLYNIKE 124
Query: 160 SSSSQLIGCQNPKCSWIFGPNVESRCKGCSPR-NKTCPLACPSYLLQYGLGF-TAGLLLS 217
SSS +L+ C C I G GC+ + N +CP YL YG G TAG +
Sbjct: 125 SSSGKLVPCDQELCKEING----GLLTGCTSKTNDSCP-----YLEIYGDGSSTAGYFVK 175
Query: 218 ETLRFPS-----KTVP---NFLAGC--------SILSDRQPAGIAGFGRSSESLPSQLG- 260
+ + F KT + + GC S ++ GI GFG+++ S+ SQL
Sbjct: 176 DVVLFDQVSGDLKTASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSS 235
Query: 261 ----LKKFSYCLLSRKFDDAPVSSNLVLDTGP-GSGDSKTPGLSYTPFYKNPVGSSSAFG 315
K F++CL N V G G P ++ TP +
Sbjct: 236 SGKVKKMFAHCL------------NGVNGGGIFAIGHVVQPTVNTTPLLPDQ-------- 275
Query: 316 EFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQ 375
Y V + I VG H + S + G I+DSG+T ++ +++ + + + Q
Sbjct: 276 PHYSVNMTAIQVG--HTFLNLSTDASEQRDSKGTIIDSGTTLAYLPDGIYQPLVYKILSQ 333
Query: 376 MGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLC 435
N V+ CF SG P + F+ G + + P +Y L N + C
Sbjct: 334 QPNLK----VQTLHDEYTCFQYSGSVDDGFPNVTFYFENGLSLKVYPHDYLFLSEN-LWC 388
Query: 436 LILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ N+ + +LGD L N + +DL N G+ + C+
Sbjct: 389 IGW--QNSGAQSRDSKNMTLLGDLVLSNKLVFYDLENQVIGWTEYNCS 434
>gi|196212948|gb|ACG76110.1| S5 [Oryza sativa Japonica Group]
gi|340810887|gb|AEK75370.1| S5 [Oryza sativa]
gi|340810903|gb|AEK75378.1| S5 [Oryza sativa]
gi|340810921|gb|AEK75387.1| S5 [Oryza sativa]
gi|340810955|gb|AEK75404.1| S5 [Oryza sativa]
gi|340811079|gb|AEK75466.1| S5 [Oryza nivara]
gi|340811090|gb|AEK75471.1| S5 [Oryza rufipogon]
gi|340811116|gb|AEK75484.1| S5 [Oryza nivara]
Length = 357
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 110/403 (27%), Positives = 170/403 (42%), Gaps = 72/403 (17%)
Query: 106 ISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRC-VDCNFPNVDPSRIPAFIPKRSSSSQ 164
+++S G PP + I DTGS+L W C C V C+ + P F P RS +S+
Sbjct: 1 MAVSLGKPPVVNLVAI-DTGSTLSWVQCQP---CAVHCHTQSAKAG--PIFDPGRSYTSR 54
Query: 165 LIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG--FTAGLLLSETLRF 222
+ C + KC + ++ + C + +C +Y + YG G ++ G ++++TLR
Sbjct: 55 RVRCSSVKCGEL-RYDLRLQQANCMEKEDSC-----TYSVTYGNGWAYSVGKMVTDTLRI 108
Query: 223 PSKTVPNFLAGCS--ILSDRQPAGIAGFGRSSESLPSQLG-------LKKFSYCLLSRKF 273
+ + + GCS + AGI GFG SS S QL K FSYCL +
Sbjct: 109 -GDSFMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCLPT--- 164
Query: 274 DDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVK 333
D ++L G D YTP ++ S Y + + +I + +
Sbjct: 165 -DETKPGYMIL----GRYDRAAMDGGYTPLFR------SINRPTYSLTMEMLIANGQRLV 213
Query: 334 IPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGN--YSRAADVEKKSGL 391
S + IVDSG+ T + F + K + M + Y R + ++S +
Sbjct: 214 TSSSEM----------IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYI 263
Query: 392 RPCF----DISGKKSVY--------LPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
C+ D SG LP L + F GGA +ALPP N F + LC+
Sbjct: 264 --CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTF- 320
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A PAL + ILG+ ++F FD+ +FGF C
Sbjct: 321 ---AQNPAL---RSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 357
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 99/404 (24%), Positives = 161/404 (39%), Gaps = 64/404 (15%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + GTP + + DTGS +VW C +C +C + + + + S+
Sbjct: 85 GLYYAKIGIGTPSKDYYVQV-DTGSDIVWVNCI---QCRECPRTSSLGMELTPYDLEEST 140
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGF-TAGLLLSETL 220
+ +L+ C C + G + GC+ N +CP YL YG G TAG + + +
Sbjct: 141 TGKLVSCDEQFCLEVNGGPL----SGCTT-NMSCP-----YLQIYGDGSSTAGYFVKDYV 190
Query: 221 RFP--SKTVPNFLAGCSI---LSDRQPA-----------GIAGFGRSSESLPSQLG---- 260
++ S + A SI RQ GI GFG+S+ S+ SQL
Sbjct: 191 QYNRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRK 250
Query: 261 -LKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
K F++CL D + G P ++ TP N Y
Sbjct: 251 VKKMFAHCL------DGTNGGGIF-----AMGHVVQPKVNMTPLVPNQ--------PHYN 291
Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNY 379
V + + VG H+ + S V + G I+DSG+T ++ ++E + + + Q N
Sbjct: 292 VNMTGVQVG--HIILNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKILSQQHN- 348
Query: 380 SRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILF 439
+V+ G CF S + P +I F+ + + P Y L E L I +
Sbjct: 349 ---LEVQTIHGEYKCFQYSERVDDGFPPVIFHFENSLLLKVYPHEY--LFQYENLWCIGW 403
Query: 440 TDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N+ + R + GD L N + +DL N G+ + C+
Sbjct: 404 -QNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCS 446
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 106/406 (26%), Positives = 167/406 (41%), Gaps = 67/406 (16%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y L G+PP+ + DTGS ++W C RC + +D + + PK S
Sbjct: 68 GLYFTKLGLGSPPRDYYVQV-DTGSDILWVNCVECSRCPRKSDLGID---LTLYDPKGSE 123
Query: 162 SSQLIGCQNPKCSWIF-GPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
+S ++ C CS F GP GC + CP Y + YG G T G + +
Sbjct: 124 TSDVVSCDQDFCSATFDGP-----IPGCKSE-----IPCP-YSITYGDGSATTGYYVQDY 172
Query: 220 LRFPS-----KTVPN---FLAGCSIL--------SDRQPAGIAGFGRSSESLPSQLGL-- 261
L + +T P + GC + S+ GI GFG+++ S+ SQL
Sbjct: 173 LTYNRINGNLRTSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASG 232
Query: 262 ---KKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFY 318
K FS+CL D+ + G+ P +S TP Y
Sbjct: 233 KVKKIFSHCL-----DNVRGGGIFAI------GEVVEPKVSTTPLVPRMA--------HY 273
Query: 319 YVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLF-EAVAKEFIRQMG 377
V L+ I V + +++P S + +G G VI DSG+T ++ ++ E + K RQ G
Sbjct: 274 NVVLKSIEVDTDILQLP-SDIFDSVNGKGTVI-DSGTTLAYLPDIVYDELIQKVLARQPG 331
Query: 378 NYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLI 437
+ VE++ CF +G P + L FK + + P +Y + + C I
Sbjct: 332 --LKLYLVEQQF---RCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKDGIWC-I 385
Query: 438 LFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
+ + A G+ +LGD L N + +DL N G+ C+
Sbjct: 386 GWQRSVAQTKNGK-DMTLLGDLVLSNKLVIYDLENMVIGWTDYNCS 430
>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 527
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 96/408 (23%), Positives = 152/408 (37%), Gaps = 94/408 (23%)
Query: 107 SLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVD-CNFPNVDPSRIPAFIPKRSSSSQL 165
++S GTP +S DTGS L W PC +CV + K SS+S+
Sbjct: 116 NVSVGTPA-SSYLVALDTGSDLFWLPCNCT-KCVHGIQLSTGQKIAFNIYDNKESSTSKN 173
Query: 166 IGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSK 225
+ C + C E + + S TCP YL + T G L+ + L +
Sbjct: 174 VACNSSLC--------EQKTQCSSSSGGTCPYQV-EYLSENTS--TTGFLVEDVLHLITD 222
Query: 226 T-------VPNFLAGC------SILSDRQPAGIAGFGRSSESLPSQLGLK-----KFSYC 267
P GC + L P G+ G G S S+PS L + FS C
Sbjct: 223 NDDQTQHANPLITFGCGQVQTGAFLDGAAPNGLFGLGMSDVSVPSILAKQGLTSNSFSMC 282
Query: 268 LLSRKFDDAPV-SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQII 326
+ +N LD G TPF P S+ Y + + QII
Sbjct: 283 FAADGLGRITFGDNNSSLDQGK------------TPFNIRPSHST------YNITVTQII 324
Query: 327 VGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIR----QMGNYSRA 382
VG + ++ I D+G++FT++ P ++ + + F Q ++S +
Sbjct: 325 VGGNSADLEFN-----------AIFDTGTSFTYLNNPAYKQITQSFDSKIKLQRHSFSNS 373
Query: 383 ADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYFALV--------GNEVL 434
D+ C+D+ +++ +P + L KGG +NYF + N VL
Sbjct: 374 DDLP----FEYCYDLRTNQTIEVPNINLTMKGG-------DNYFVMDPIITSGGGNNGVL 422
Query: 435 CLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
CL + N I+G + + + FD N G+ + C
Sbjct: 423 CLAVLKSNNVN---------IIGQNFMTGYRIVFDRENMTLGWKESNC 461
>gi|242044812|ref|XP_002460277.1| hypothetical protein SORBIDRAFT_02g025885 [Sorghum bicolor]
gi|241923654|gb|EER96798.1| hypothetical protein SORBIDRAFT_02g025885 [Sorghum bicolor]
Length = 369
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/185 (31%), Positives = 83/185 (44%), Gaps = 17/185 (9%)
Query: 298 LSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTF 357
+ TP NP SS YYV + I VG K V IP L G ++DSG+ F
Sbjct: 199 IKTTPLLANPHRSS-----LYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMF 253
Query: 358 TFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAK 417
T + P + AV E R++G A V G CF+ + +V P + L F G +
Sbjct: 254 TRLVAPAYVAVRDEVRRRVG-----APVSSLGGFDTCFNTT---AVAWPPVTLLFD-GMQ 304
Query: 418 MALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGF 477
+ LP EN +V + I AA P ++ Q QN + FD+ N R GF
Sbjct: 305 VTLPEEN---VVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGF 361
Query: 478 AKQKC 482
A+++C
Sbjct: 362 ARERC 366
>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 97/368 (26%), Positives = 139/368 (37%), Gaps = 56/368 (15%)
Query: 122 FDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNV 181
DT S + W PC C+ C+ F S++ + +GCQ +C + P
Sbjct: 1 MDTSSDVAWIPCNG---CLGCS--------STLFNSPASTTYKSLGCQAAQCKQVPKP-- 47
Query: 182 ESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGC------S 235
TC S+ L YG A L +T+ + VP + GC
Sbjct: 48 ------------TCGGGVCSFNLTYGGSSLAANLSQDTITLATDAVPGYSFGCIQKATGG 95
Query: 236 ILSDRQPAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKT 295
L + G+ S S L FSYCL S F S +L L GP +
Sbjct: 96 SLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS--FKSLNFSGSLRL--GPVGQPKR- 150
Query: 296 PGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGS 355
+ YTP KNP S Y+V L + VG + V +P G I DSG+
Sbjct: 151 --IKYTPLLKNPRRPS-----LYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGT 203
Query: 356 TFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGG 415
FT + P + AV F ++G R V G C+ + + P + F G
Sbjct: 204 VFTRLVTPAYIAVRDAFRNRVG---RNLTVTSLGGFDTCYTV----PIAAPTITFMFT-G 255
Query: 416 AKMALPPENYFAL-VGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDR 474
+ LPP+N CL + AA P ++ + Q QN L +D+ N R
Sbjct: 256 MNVTLPPDNLLIHSTAGSTTCLAM----AAAPDNVNSVLNVIANLQQQNHRLLYDVPNSR 311
Query: 475 FGFAKQKC 482
G A++ C
Sbjct: 312 LGVARELC 319
>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 533
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 110/470 (23%), Positives = 184/470 (39%), Gaps = 86/470 (18%)
Query: 45 LHHSDSDPLKILHSLASSSLSRARHLKTKTKPKTKDSNI-GSNYSNSLIKTPLSVHSYGG 103
LHH SDP+K + L+ L L +D I G + TPL+ S
Sbjct: 45 LHHRYSDPVKGM--LSVDDLPEKGSLHYYASMAHRDILIHGRKLVSDNTSTPLTFFSGNE 102
Query: 104 ----------YSISLSFGTPPQASTPFIFDTGSSLVWFPCT-SRYRCVD-CNFPNVDPSR 151
+ ++S GTP S DTGS L W PC + CV FP+ +
Sbjct: 103 TYRFSSLGFLHYANVSIGTP-SLSYLVALDTGSDLFWLPCDCTNSGCVQGLQFPSGEQID 161
Query: 152 IPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFT 211
+ P SS+SQ I C N CS +SRC TCP Y +QY T
Sbjct: 162 FNIYRPNASSTSQTIPCNNTLCSR------QSRCPSA---QSTCP-----YQVQYLSNGT 207
Query: 212 A--GLLLSETLRFPSKTVPN------FLAGC------SILSDRQPAGIAGFGRSSESLPS 257
+ G+L+ + L + + + GC S L P G+ G G ++ S+PS
Sbjct: 208 SSTGVLVEDLLHLTTDDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMTNISVPS 267
Query: 258 QLGLKKFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEF 317
L + ++ S F + DTG + G TPF + +
Sbjct: 268 TLAREGYTSNSFSMCFGRDGIGRISFGDTG-------SSGQGETPFNLRQLHPT------ 314
Query: 318 YYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFI--RQ 375
Y V + +I VG + + +S I DSG++FT++ P + +++ F +
Sbjct: 315 YNVSITKINVGGRDADLEFS-----------AIFDSGTSFTYLNDPAYTLISESFNIGAK 363
Query: 376 MGNYSRAADVEKKSGLRPCFDISGKKS-VYLPELILKFKGGAKMALPPENYFALV--GNE 432
YS +D+ C+++S ++ + +P + L +GG++ + ++ G
Sbjct: 364 EKRYSSISDIP----FEYCYEMSSNQTNLEIPTVNLVMQGGSQFNVTDPIVIVILQGGAS 419
Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ CL + + G I+G + + + F+ + G+ C
Sbjct: 420 IYCLAI---------VKSGDVNIIGQNFMTGYRIVFNRERNVLGWKASDC 460
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 126/317 (39%), Gaps = 60/317 (18%)
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
I D+GS + W +C C P R P F P S++ + C + C+ + GP
Sbjct: 171 IIDSGSDVSWV------QCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQL-GPY 223
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF------PSKTVPNFLAGC 234
+GCS N C Q+G+ + G + T F P + F GC
Sbjct: 224 R----RGCS-ANAQC---------QFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGC 269
Query: 235 SILS-----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNL---V 283
+ D AG G S+SL Q + FSYCL P +S+L V
Sbjct: 270 AHADRGSAFDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCL-------PPTASSLGFLV 322
Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
L P F P+ SSS FY V LR IIV + + +P + S
Sbjct: 323 LGVPPERAQL------IPSFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS 376
Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
++DS + + + ++A+ F M Y A V S L C+D +G +S+
Sbjct: 377 ------VIDSSTIISRLPPTAYQALRAAFRSAMTMYRAAPPV---SILDTCYDFTGVRSI 427
Query: 404 YLPELILKFKGGAKMAL 420
LP + L F GGA + L
Sbjct: 428 TLPSIALVFDGGATVNL 444
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 76/181 (41%), Gaps = 21/181 (11%)
Query: 303 FYKNPVGSSSAFG-EFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFME 361
F P+ SSS+ FY V LR IIV + + +P + S ++ S + + +
Sbjct: 560 FVSTPLLSSSSMPPTFYRVLLRAIIVAGRPLPVPPTVFSTSS------VIASTTVISRLP 613
Query: 362 GPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALP 421
++A+ F R M Y A V S L C+D +G +S+ LP + L F GGA + L
Sbjct: 614 PTAYQALRAAFRRAMTMYRTAPPV---SILDTCYDFTGVRSITLPSIALVFDGGATVNLD 670
Query: 422 PENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQK 481
L G CL A A R P I G+ Q + + +D+ F
Sbjct: 671 AAGIL-LQG----CLAF-----APTATDRMPGFI-GNVQQRTLEVVYDVPGKAIRFRSAA 719
Query: 482 C 482
C
Sbjct: 720 C 720
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 104/400 (26%), Positives = 156/400 (39%), Gaps = 68/400 (17%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y+ + GTP Q I DTGS++ + PC+S C C P F P SS
Sbjct: 97 GYYTSRVFIGTPAQ-EFALIVDTGSTVTYVPCSS---CTHCGHHQA--CFDPRFKPDNSS 150
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
S Q + C +P C K C R C Y Y + + G+L + L
Sbjct: 151 SYQTVSCNSPDCI----------TKMCDARVHQC-----KYERVYAEMSSSKGVLGKDLL 195
Query: 221 RF--PSKTVPN-FLAGCSI-----LSDRQPAGIAGFGRSSESLPSQL---GLKKFSYCLL 269
F S+ P+ L GC L + GI G GR S+ QL G + S+ L
Sbjct: 196 GFGNGSRLQPHPLLFGCETAETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLC 255
Query: 270 SRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGS 329
D+ ++VL P P + + S +Y + L +I V
Sbjct: 256 YGGMDEG--GGSMVLGAIP-----PPPAMVFAK-------SDPNRSNYYNLELSEIQVQG 301
Query: 330 KHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKS 389
+ +P +G G ++DSG+T+ ++ F+A +Q+G+ +A S
Sbjct: 302 VSLNVPSEVF----NGRLGTVLDSGTTYAYLPDKAFDAFKDAITQQLGSL-QAVPGPDPS 356
Query: 390 GLRPCFDISGKKS----VYLPELILKFKGGAKMALPPENYFALVGNEV---LCLILFTDN 442
CF +G S + P + F G K+ L PENY +V CL F +
Sbjct: 357 YPDVCFAGAGSDSKALGKHFPPVDFVFSGNQKVFLAPENYL-FKHTKVPGAYCLGFFKNQ 415
Query: 443 AAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A +LG ++N + +D AN + GF K C
Sbjct: 416 DA--------TTLLGGIVVRNTLVTYDRANHQIGFFKTNC 447
>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
Length = 419
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 102/397 (25%), Positives = 146/397 (36%), Gaps = 59/397 (14%)
Query: 104 YSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSS 163
Y + + GTPPQA + I D LVW +C C +P F P S++
Sbjct: 62 YVANFTIGTPPQAVS-GIVDLSGELVW------TQCAACRSSGCFKQELPVFDPSASNTY 114
Query: 164 QLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRFP 223
+ C +P C I N C G + C PS G T G+ ++ +
Sbjct: 115 RAEQCGSPLCKSIPTRN----CSG----DGECGYEAPSMF-----GDTFGIASTDAIAI- 160
Query: 224 SKTVPNFLAGCSILSDRQ-------PAGIAGFGRSSESLPSQLGLKKFSYCLLSRKFDDA 276
GC + SD P+G G GR+ SL Q + FSYCL
Sbjct: 161 GNAEGRLAFGCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVTAFSYCLAPHGPGK- 219
Query: 277 PVSSNLVLDTG---PGSGDSKTPGLSYTPFYKNPVGSSSAFGE--FYYVGLRQIIVGSKH 331
S L L G+G S P TP ++S G +Y V L I G
Sbjct: 220 --KSALFLGASAKLAGAGKSNPP----TPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVA 273
Query: 332 VKIPYSYLVPGSDGNGGVIVDSGSTF---TFMEGPLFEAVAKEFIRQMGNYSRAADVEKK 388
V S G G + + TF +++ ++A+ K +G+ S A E
Sbjct: 274 VAA-------ASSGGGAITILQLETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEP- 325
Query: 389 SGLRPCFDISGKKSVYLPELILKFKGGAKMALPPENYF--ALVGNEVLCLILFTDNAAGP 446
CF + V P+L+ F+GGA + PP Y GN +CL + +
Sbjct: 326 --FDLCFQNAAVSGV--PDLVFTFQGGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLDS 381
Query: 447 ALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
A ILG +N + FDL + F C+
Sbjct: 382 A--DDGVSILGSLLQENVHFLFDLEKETLSFEPADCS 416
>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
Length = 446
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 105/416 (25%), Positives = 166/416 (39%), Gaps = 85/416 (20%)
Query: 101 YGGYSISLSFGTPPQASTPFIFD--TGSSLVWFPCTSRYRCVDC-NFPNVDPSRIPAFIP 157
YG Y +++ G P S P+ D +GS L W C + C+ C P+ P +
Sbjct: 76 YGLYYVTMLVGNP---SKPYFLDVDSGSELTWIQCDAP--CISCAKGPH------PLYKL 124
Query: 158 KRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLS 217
K+ S L+ ++P C+ + G +K C + G++ G L+
Sbjct: 125 KKGS---LVPSKDPLCAAV------QAGSGHYHNHKEASQRCDYDVAYADHGYSEGFLVR 175
Query: 218 ETLR--FPSKTV--PNFLAGCSI-------LSDRQPAGIAGFGRSSESLPSQL---GLKK 263
+++R +KTV N + GC +SD + GI G G SLPSQ GL K
Sbjct: 176 DSVRALLTNKTVLTANSVFGCGYNQRESLPVSDARTDGILGLGSGMASLPSQWAKQGLIK 235
Query: 264 --FSYCLLSRKFDDAPV--SSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYY 319
+C+ D + +LV T +++ P P + YY
Sbjct: 236 NVIGHCIFGAGRDGGYMFFGDDLV----------STSAMTWVPMLGRPSI------KHYY 279
Query: 320 VGLRQIIVGSKHVKIPYSYLVPGSDGN--GGVIVDSGSTFTFMEGPLFEA---VAKEFIR 374
VG Q+ G+K L DG GG+I DSGST+T+ + A V KE +
Sbjct: 280 VGAAQMNFGNKP-------LDKDGDGKKLGGIIFDSGSTYTYFTNQAYGAFLSVVKENLS 332
Query: 375 QMGNYSRAAD------VEKKSGLRPCFDISGKKSVYLPELILKFKG--GAKMALPPENYF 426
++D +K G R + + Y L LKF+ +M + PE Y
Sbjct: 333 GKQLEQDSSDSFLSLCWRRKEGFRSV----AEAAAYFKPLTLKFRSTKTKQMEIFPEGYL 388
Query: 427 ALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
+ +CL + G A+G +LGD Q + +D ++ G+A+ C
Sbjct: 389 VVNKKGNVCLGILN----GTAIGIVDTNVLGDISFQGQLVVYDNEKNQIGWARSDC 440
>gi|238479902|ref|NP_001154646.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332643534|gb|AEE77055.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 350
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 67/145 (46%), Gaps = 19/145 (13%)
Query: 345 GNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRP----CFDISG- 399
GNGG +VDSG+T F+ P + +V + R + L P C ++SG
Sbjct: 217 GNGGTVVDSGTTLAFLAEPAYRSV-------IAAVRRRVKLPIADALTPGFDLCVNVSGV 269
Query: 400 -KKSVYLPELILKFKGGAKMALPPENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGD 458
K LP L +F GGA PP NYF ++ CL + + P +G ++G+
Sbjct: 270 TKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAI---QSVDPKVGFS---VIGN 323
Query: 459 FQLQNFYLEFDLANDRFGFAKQKCA 483
Q F EFD R GF+++ CA
Sbjct: 324 LMQQGFLFEFDRDRSRLGFSRRGCA 348
Score = 46.2 bits (108), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 54/119 (45%), Gaps = 15/119 (12%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y + L G PPQ S I DTGS LVW C++ C +C+ + P+ + F P+ SS
Sbjct: 82 GQYFVDLRIGQPPQ-SLLLIADTGSDLVWVKCSA---CRNCS--HHSPATV--FFPRHSS 133
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYGLG-FTAGLLLSET 219
+ C +P C + P+ C N T + Y Y G T+GL ET
Sbjct: 134 TFSPAHCYDPVCRLVPKPDRAPIC------NHTRIHSTCHYEYGYADGSLTSGLFARET 186
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 105/399 (26%), Positives = 160/399 (40%), Gaps = 58/399 (14%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPS------RIPAF 155
G Y+ + GTPP I DTGS++ + PC+S C C S R P F
Sbjct: 38 GYYTSRVFIGTPPN-EFALIVDTGSTVTYVPCSS---CTHCGHHQASFSTHRLFCRDPRF 93
Query: 156 IPKRSSSSQLIGCQNPKCSWIFGPNVESRCK--GCSPRNKTCPLACPSYLLQYGLGFTAG 213
P+ SSS Q IGC++ C + +CK T LL +G A
Sbjct: 94 KPENSSSYQKIGCRSSDCITGLCDSNSHQCKYERMYAEMSTSKGVLGKDLLDFG---PAS 150
Query: 214 LLLSETLRFPSKTVPNFLAGCSILSDRQPAGIAGFGRSSESLPSQL---GLKKFSYCLLS 270
L S+ L F +T A L + GI G GR S+ QL G + S+ L
Sbjct: 151 RLQSQLLSFGCET-----AESGDLYLQVADGIMGLGRGPLSIVDQLVGNGAIEDSFSLCY 205
Query: 271 RKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSK 330
D+ ++VL P S F K+ S+ +Y + L +I V
Sbjct: 206 GGMDEG--GGSMVL--------GAIPAPSGMVFAKSDPRRSN----YYNLELTEIQVQGA 251
Query: 331 HVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSG 390
+K+ + +G G I+DSG+T+ ++ FEA + Q+G+ +A D +
Sbjct: 252 SLKLDSNVF----NGKFGTILDSGTTYAYLPDRAFEAFTDAVVAQLGSL-QAVDGPDPNY 306
Query: 391 LRPCFDISGKKS----VYLPELILKFKGGAKMALPPENYFALVGNEV---LCLILFTDNA 443
C+ +G + + P + F K++L PENY +V CL F +
Sbjct: 307 PDICYAGAGTDTKELGKHFPLVDFVFAENQKVSLAPENYL-FKHTKVPGAYCLGFFKNQD 365
Query: 444 AGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKC 482
A +LG ++N + +D N + GF K C
Sbjct: 366 A--------TTLLGGIIVRNMLVTYDRYNHQIGFLKTNC 396
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 126/317 (39%), Gaps = 60/317 (18%)
Query: 121 IFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPN 180
I D+GS + W +C C P R P F P S++ + C + C+ + GP
Sbjct: 80 IIDSGSDVSWV------QCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQL-GPY 132
Query: 181 VESRCKGCSPRNKTCPLACPSYLLQYGLGFTAGLLLSETLRF------PSKTVPNFLAGC 234
+GCS N C Q+G+ + G + T F P + F GC
Sbjct: 133 R----RGCS-ANAQC---------QFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGC 178
Query: 235 SILS-----DRQPAGIAGFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNL---V 283
+ D AG G S+SL Q + FSYCL P +S+L V
Sbjct: 179 AHADRGSAFDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCL-------PPTASSLGFLV 231
Query: 284 LDTGPGSGDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGS 343
L P F P+ SSS FY V LR IIV + + +P + S
Sbjct: 232 LGVPPERAQL------IPSFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS 285
Query: 344 DGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSV 403
++DS + + + ++A+ F M Y A V S L C+D +G +S+
Sbjct: 286 ------VIDSSTIISRLPPTAYQALRAAFRSAMTMYRAAPPV---SILDTCYDFTGVRSI 336
Query: 404 YLPELILKFKGGAKMAL 420
LP + L F GGA + L
Sbjct: 337 TLPSIALVFDGGATVNL 353
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 76/181 (41%), Gaps = 21/181 (11%)
Query: 303 FYKNPVGSSSAFG-EFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFME 361
F P+ SSS+ FY V LR IIV + + +P + S ++ S + + +
Sbjct: 469 FVSTPLLSSSSMPPTFYRVLLRAIIVAGRPLPVPPTVFSTSS------VIASTTVISRLP 522
Query: 362 GPLFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALP 421
++A+ F R M Y A V S L C+D +G +S+ LP + L F GGA + L
Sbjct: 523 PTAYQALRAAFRRAMTMYRTAPPV---SILDTCYDFTGVRSITLPSIALVFDGGATVNLD 579
Query: 422 PENYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQK 481
L G CL A A R P I G+ Q + + +D+ F
Sbjct: 580 AAGIL-LQG----CLAF-----APTATDRMPGFI-GNVQQRTLEVVYDVPGKAIRFRSAA 628
Query: 482 C 482
C
Sbjct: 629 C 629
>gi|18379072|ref|NP_563679.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12083230|gb|AAG48774.1|AF332411_1 unknown protein [Arabidopsis thaliana]
gi|3850580|gb|AAC72120.1| Strong similarity to gb|D14550 extracellular dermal glycoprotein
(EDGP) precursor from Daucus carota. ESTs gb|84105 and
gb|AI100071 come from this gene [Arabidopsis thaliana]
gi|332189426|gb|AEE27547.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 434
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 79/280 (28%), Positives = 128/280 (45%), Gaps = 46/280 (16%)
Query: 227 VPNFLAGCSILS-----DRQPAGIAGFGRSSESLPSQLGL-----KKFSYCLLSRK---- 272
+PN + C S + G+AG GR + LP Q +KF+ CL S +
Sbjct: 154 IPNLIFSCGSTSLLKGLAKGAVGMAGMGRHNIGLPLQFAAAFSFNRKFAVCLTSGRGVAF 213
Query: 273 FDDAPVSSNLVLDTGPGSGDSKTPGLSYTPFYKNPVGSSSAF--GE---FYYVGLRQIIV 327
F + P + L PG S+ L TP NP + F GE Y++G+ I +
Sbjct: 214 FGNGPY---VFL---PGIQISR---LQKTPLLINPGTTVFEFSKGEKSPEYFIGVTAIKI 264
Query: 328 GSKHVKIPYSYL-VPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAADVE 386
K + I + L + S G GG + S + +T +E +++A EFIRQ + A ++
Sbjct: 265 VEKTLPIDPTLLKINASTGIGGTKISSVNPYTVLESSIYKAFTSEFIRQ----AAARSIK 320
Query: 387 KKSGLRP---CFDISGKKSVYL----PELILKFKG-GAKMALPPENYFALVGNEVLCLIL 438
+ + ++P CF L PE+ L + N V ++V+CL
Sbjct: 321 RVASVKPFGACFSTKNVGVTRLGYAVPEIQLVLHSKDVVWRIFGANSMVSVSDDVICL-G 379
Query: 439 FTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFA 478
F D P G ++++G FQL++ +EFDLA+++FGF+
Sbjct: 380 FVDGGVNP----GASVVIGGFQLEDNLIEFDLASNKFGFS 415
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 113/480 (23%), Positives = 180/480 (37%), Gaps = 84/480 (17%)
Query: 12 FSLLILLFTTDAGAGSSAATVTVPLT-PLSTKHYLHHSDSDPLKILHSLASSSLSRARHL 70
F LL+ F + + V L P+S++ ++ ++ + S+ + S++R R+L
Sbjct: 7 FVLLLFCFCRLSLTKTQNHGFNVELIHPISSRSPFYNPKETQIQRISSILNYSINRVRYL 66
Query: 71 KTKTKPKTKDSNIGSNYSNSLIKTPLSVHSYGGYSISLSFGTPPQASTPFIFDTGSSLVW 130
+++ S N + PLS GY +S S GTPP I DTG+ +W
Sbjct: 67 ----------NHVFSFSPNKIQDVPLSSFMGAGYVMSYSIGTPPFQLYSLI-DTGNDNIW 115
Query: 131 FPCTSRYRCVDCNFPNVDPSRIPAFIPKRSSSSQLIGCQNPKCSWIFGPNVESRCKGCSP 190
F C C++ P F P +SS+ + I C +P C G + +
Sbjct: 116 FQCKPCKPCLN--------QTSPMFHPSKSSTYKTIPCTSPICKNADGHYLGVDTLTLNS 167
Query: 191 RNKTCPLACPSYLLQYGLGFTAGLLLSETLRFPSKTVPNFLAGCSILS----DRQPAGIA 246
N T P++ N + GC + + +G
Sbjct: 168 NNGT-PIS----------------------------FKNIVIGCGHRNQGPLEGYVSGNI 198
Query: 247 GFGRSSESLPSQLGLK---KFSYCLLSRKFDDAPVSSNLVLDTGPGSGDSKTPGLSYTPF 303
G R S SQL KFSYCL+ F VSS L GD T +S
Sbjct: 199 GLARGPLSFISQLNSSIGGKFSYCLVPL-FSKENVSSKLHF------GDKST--VSGLGT 249
Query: 304 YKNPVGSSSAFGEFYYVGLRQIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGP 363
P+ + Y+V L VG +K+ SD G I+DSG+T T +
Sbjct: 250 VSTPIKEENG----YFVSLEAFSVGDHIIKLE------NSDNRGNSIIDSGTTMTILPKD 299
Query: 364 LFEAVAKEFIRQMGNYSRAADVEKKSGLRPCFDISGKKSVYLPELILKFKGGAKMALPPE 423
++ + + + M R D ++ L C+ + + +I G+++ L
Sbjct: 300 VYSRL-ESVVLDMVKLKRVKDPSQQFNL--CYQTTSTTLLTKVLIITAHFSGSEVHLNAL 356
Query: 424 NYFALVGNEVLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
N F + +EV+C F +L I G+ QNF + FDL F C
Sbjct: 357 NTFYPITDEVICF-AFVSGGNFSSLA-----IFGNVVQQNFLVGFDLNKKTISFKPTDCT 410
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 103/411 (25%), Positives = 162/411 (39%), Gaps = 94/411 (22%)
Query: 102 GGYSISLSFGTPPQASTPFIFDTGSSLVWFPCTSRYRCVDCNFPNVDPSRIPAFIPKRSS 161
G Y+ L GTPPQ I DTGS++ + PC++ C C + P F P+ SS
Sbjct: 82 GYYTTRLWIGTPPQMFA-LIVDTGSTVTYVPCST---CEQCG-----RHQDPKFQPESSS 132
Query: 162 SSQLIGCQNPKCSWIFGPNVESRCKGCSPRNKTCPLACPSYLLQYG-LGFTAGLLLSETL 220
+ Q + C C+ C C Y QY + ++G+L + +
Sbjct: 133 TYQPVKC-TIDCN-------------CDSDRMQC-----VYERQYAEMSTSSGVLGEDLI 173
Query: 221 RF--PSKTVPN-FLAGCS-----ILSDRQPAGIAGFGRSSESLPSQLGLKK-----FSYC 267
F S+ P + GC L + GI G GR S+ QL K FS C
Sbjct: 174 SFGNQSELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLC 233
Query: 268 LLSRKFDDAPVSSNLVLDTGPGS----GDSKTPGLSYTPFYKNPVGSSSAFGEFYYVGLR 323
+D G G+ G S +++ Y +PV S +Y + L+
Sbjct: 234 YGG-------------MDVGGGAMVLGGISPPSDMAFA--YSDPVRSP-----YYNIDLK 273
Query: 324 QIIVGSKHVKIPYSYLVPGSDGNGGVIVDSGSTFTFMEGPLFEAVAKEFIRQMGNYSRAA 383
+I V K + + + DG G ++DSG+T+ ++ F A ++++
Sbjct: 274 EIHVAGKRLPLNANVF----DGKHGTVLDSGTTYAYLPEAAFLAFKDAIVKEL------Q 323
Query: 384 DVEKKSGLRP-----CFDISG----KKSVYLPELILKFKGGAKMALPPENYFALVGNE-- 432
++K SG P CF +G + S P + + F+ G K L PENY
Sbjct: 324 SLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMVFENGQKYTLSPENYMFRHSKVRG 383
Query: 433 VLCLILFTDNAAGPALGRGPAIILGDFQLQNFYLEFDLANDRFGFAKQKCA 483
CL +F + G +LG ++N + +D + GF K CA
Sbjct: 384 AYCLGVFQN-------GNDQTTLLGGIIVRNTLVVYDREQTKIGFWKTNCA 427
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.137 0.420
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,104,121,173
Number of Sequences: 23463169
Number of extensions: 361954812
Number of successful extensions: 733658
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 539
Number of HSP's successfully gapped in prelim test: 1693
Number of HSP's that attempted gapping in prelim test: 726736
Number of HSP's gapped (non-prelim): 3060
length of query: 483
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 336
effective length of database: 8,910,109,524
effective search space: 2993796800064
effective search space used: 2993796800064
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)