BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 011649
(480 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 332 bits (850), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 192/468 (41%), Positives = 273/468 (58%), Gaps = 24/468 (5%)
Query: 22 AYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPG-KVSLEVLGRYGPCSKLNQ-GKS 79
A N+L + V ++SL P + + ++ +GP K SLEV+ ++GPCS+LN GK+
Sbjct: 30 ATKESNNLRQYHFVHLNSLFPSS----SCSSSAKGPKRKASLEVVHKHGPCSQLNHSGKA 85
Query: 80 RNTPSLEEILRRDQQRLHLKNSRRLQKAIPDN-FKKTKAFTFPAKTG-IVAADEYYIVVA 137
T S +I+ D +R+ SR + +N K+ + T PAK+G ++ + +YY+VV
Sbjct: 86 EATISHNDIMNLDNERVKYIQSRLSKNLGGENRVKELDSTTLPAKSGRLIGSADYYVVVG 145
Query: 138 IGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILL 196
+G PK+ +SL+ DTGS +TWTQC+PC C +Q+DP FDPSKS +++ I C S+ C
Sbjct: 146 LGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQDPIFDPSKSSSYTNIKCTSSLCT--- 202
Query: 197 EWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDN 256
F G + C YD+ Y D S GF + +R+TI + + FL GC +
Sbjct: 203 -QFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITATD-----IVHDFLFGCGQD 256
Query: 257 NTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFV 313
N G G +G+MGL R P+S + +T+ Y F YCL S S G++TFG N +
Sbjct: 257 NEGLFRGTAGLMGLSRHPISFVQQTSSIYNKIFSYCLPSTPSSLGHLTFGASAATNAN-L 315
Query: 314 KYTPIVTTPEQSEFYHITLTGISVGGERLP-LKASYFTKLSTEIDSGTIITRFPAPVYSA 372
KYTP T ++ FY + + GISVGG +LP + +S F+ + IDSGT+ITR P Y+A
Sbjct: 316 KYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLPPTAYAA 375
Query: 373 LRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVE 432
LRSAFR+ M KY + G L DTCYD S YK + VP+I F GGV +EL + G L E
Sbjct: 376 LRSAFRQFMMKYPVAYGTR-LLDTCYDFSGYKEISVPRIDFEFAGGVKVELPLVGILYGE 434
Query: 433 SVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
S +Q+CL FA + + + GNVQQ+ EV YDV G R+GFG CN
Sbjct: 435 SAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFGAAGCN 482
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 328 bits (840), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 194/471 (41%), Positives = 277/471 (58%), Gaps = 27/471 (5%)
Query: 22 AYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPG-KVSLEVLGRYGPCSKLNQ-GKS 79
A N+L + V ++SL P + + ++ +GP K SLEV+ ++GPCS+LN GK+
Sbjct: 26 ATKESNNLRQYHFVHLNSLFPSS----SCSSSAKGPKRKASLEVVHKHGPCSQLNHNGKA 81
Query: 80 RNTPSLEEILRRDQQRLHLKNSRRLQKAIPDN-FKKTKAFTFPAKTG-IVAADEYYIVVA 137
+ T S +I+ D +R+ SR + +N K+ + T PAK+G ++ + Y++VV
Sbjct: 82 KTTISHTDIMNLDNERVKYIQSRLSKNLGRENSVKELDSTTLPAKSGSLIGSANYFVVVG 141
Query: 138 IGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILL 196
+G PK+ +SL+ DTGS +TWTQC+PC C +Q+D FDPSKS ++ I C S+ C L
Sbjct: 142 LGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYINITCTSSLCTQLT 201
Query: 197 EWFPPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
+ +CSS C Y I Y D S GF + +R+TI + FL GC
Sbjct: 202 S---AGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLTITATD-----IVDDFLFGCG 253
Query: 255 DNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKK 311
+N G +G++G++GL R P+S + +T+ Y F YCL S S G++TFG N
Sbjct: 254 QDNEGLFSGSAGLIGLGRHPISFVQQTSSIYNKIFSYCLPSTSSSLGHLTFGASAATNAN 313
Query: 312 FVKYTPIVTTPEQSEFYHITLTGISVGGERLP-LKASYFTKLSTEIDSGTIITRFPAPVY 370
+KYTP+ T + FY + + GISVGG +LP + +S F+ + IDSGT+ITR Y
Sbjct: 314 -LKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLAPTAY 372
Query: 371 SALRSAFRKRMKKYKMGKGIED-LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTL 429
+ALRSAFR+ M+KY + ED LFDTCYD S YK + VPKI F GGV +EL + G L
Sbjct: 373 AALRSAFRQGMEKYPVAN--EDGLFDTCYDFSGYKEISVPKIDFEFAGGVTVELPLVGIL 430
Query: 430 VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ S +QVCL FA +D + + GNVQQ+ EV YDV G R+GFG CN
Sbjct: 431 IGRSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEGGRIGFGAAGCN 481
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 327 bits (838), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 191/489 (39%), Positives = 282/489 (57%), Gaps = 34/489 (6%)
Query: 1 MRILFKAFLLFIWLLRSSNNGAYANDNDLSHSY--IVSVSSLIPPTVCNRTRTALPQGPG 58
+ AFLL +L N G +++++ Y I+ V SL+P T CN+T
Sbjct: 10 LTFFVNAFLLLCYL----NKGHAVGEDEITKGYLHIIKVKSLLPSTACNQTFKV----SN 61
Query: 59 KVSLEVLGRYGPCSK-LNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKA 117
+SLEV+ R GPC + LNQ K+ N PS EIL +D+ R+ ++R + F++ +A
Sbjct: 62 SLSLEVVHRSGPCIQVLNQEKAANAPSNMEILLQDRHRVDSIHARLSSHGV---FQEKQA 118
Query: 118 FTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH-CSQQRDPFFD 175
T P ++G + + +Y + V +G PK+ +L+ DTGS +TWTQC+PC C +Q++P D
Sbjct: 119 -TLPVQSGASIGSGDYAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKTCYKQKEPRLD 177
Query: 176 PSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTI 235
P+KS ++ I C+S CK+L G + CSS C Y + Y DGS GF+AT+ +T+
Sbjct: 178 PTKSTSYKNISCSSAFCKLL----DTEGGESCSSPTCLYQVQYGDGSYSIGFFATETLTL 233
Query: 236 QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS 292
N FL GC N+G GA+G++GL R +S+ S+T Y F YCL +
Sbjct: 234 SSSN-----VFKNFLFGCGQQNSGLFRGAAGLLGLGRTKLSLPSQTAQKYKKLFSYCLPA 288
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL 352
S GY++FG + K VK+TP+ + + FY + +T +SVGG +L + AS F+
Sbjct: 289 SSSSKGYLSFGGQVS---KTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIFSTS 345
Query: 353 STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKIT 412
T IDSGT+ITR P+ YSAL SAF+K M Y G +FDTCYD S +T+ +PK+
Sbjct: 346 GTVIDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGYS-IFDTCYDFSKNETIKIPKVG 404
Query: 413 IHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRR 471
+ F GGV++++DV G L V +++VCL FA D + + GN QQ+ Y+V YD A R
Sbjct: 405 VSFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNGDDVKAAIFGNTQQKTYQVVYDDAKGR 464
Query: 472 LGFGPGNCN 480
+GF P CN
Sbjct: 465 VGFAPSGCN 473
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 323 bits (829), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 189/487 (38%), Positives = 273/487 (56%), Gaps = 24/487 (4%)
Query: 3 ILFKAFLLFIWLLRSSNNGAYANDNDL----SHSYIVSVSSLIPPTVCNRTRTALPQGPG 58
I FLL+ LL S A+ S + V ++SL+P +VC+ + P+G
Sbjct: 8 IFLLKFLLYSALLSSKRGLAFQGRKTALSTPSTLHNVHITSLMPSSVCSPS----PKGDD 63
Query: 59 K-VSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKA 117
K SLEV+ ++GPCSKL+Q K R +PS ++L +D+ R++ SR + K
Sbjct: 64 KRASLEVIHKHGPCSKLSQDKGR-SPSRTQMLDQDESRVNSIRSRLAKNPADGGKLKGSK 122
Query: 118 FTFPAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFD 175
T P+K+G + Y + V +G PK+ ++ + DTGS +TWTQC+PC +C Q++P F+
Sbjct: 123 VTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFN 182
Query: 176 PSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTI 235
PSKS +++ I C+S TC L CS+ C Y I Y D S GF+A D++ +
Sbjct: 183 PSKSTSYTNISCSSPTCDELKSG--TGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLAL 240
Query: 236 QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS 292
+ FL GC NN G G +G++GL R +S++S+T Y F YCL S
Sbjct: 241 TSTD-----VFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLVSQTAQKYGKLFSYCLPS 295
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL 352
STGY+TFG +K VK+TP + + FY + L ISVGG +L AS F+
Sbjct: 296 TSSSTGYLTFGSGGGTSKA-VKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFSTA 354
Query: 353 STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKIT 412
T IDSGT+I+R P YS LR++F+++M KY + DTCYD S Y TV VPKI
Sbjct: 355 GTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKA-APASILDTCYDFSQYDTVDVPKIN 413
Query: 413 IHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRL 472
++F G +++LD G + ++ QVCL FA + +LGNVQQ+ ++V YDVAG R+
Sbjct: 414 LYFSDGAEMDLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRI 473
Query: 473 GFGPGNC 479
GF PG C
Sbjct: 474 GFAPGGC 480
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 318 bits (814), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 191/490 (38%), Positives = 277/490 (56%), Gaps = 26/490 (5%)
Query: 3 ILFKAFLLFIWLLRSSNNGAYANDNDLSHSYI------VSVSSLIPPTVCNRTRTALPQG 56
I FLL+ LL + A ++ V ++SL+P + C+ + Q
Sbjct: 15 ICLLRFLLYASLLSLKSGFAIEGRESAESHHVQPIHHNVHITSLMPSSACSPSPKGHDQ- 73
Query: 57 PGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQK-AIPDNFKKT 115
+ SLEV+ ++GPCSKL K+ N+PS +IL +D+ R+ SR + A N K +
Sbjct: 74 --RASLEVVHKHGPCSKLRPHKA-NSPSHTQILAQDESRVASIQSRLAKNLAGGSNLKAS 130
Query: 116 KAFTFPAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPF 173
KA T P+K+ + + Y + V +G PK+ ++ + DTGS +TWTQC+PC+ +C QQR+
Sbjct: 131 KA-TLPSKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHI 189
Query: 174 FDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRM 233
FDPS S ++S + C+S +C+ L CSS C Y I Y DGS GF+A +++
Sbjct: 190 FDPSTSLSYSNVSCDSPSCEKLESA--TGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKL 247
Query: 234 TIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL 290
++ + F + F GC NN G G +G++GL R P+S++S+T Y F YCL
Sbjct: 248 SLTSTD---VFNNFQF--GCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKVFSYCL 302
Query: 291 HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT 350
S STGY++FG D + K VK+TP + FY + + GISVG +LP+ S F+
Sbjct: 303 PSSSSSTGYLSFGSGDG-DSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFS 361
Query: 351 KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPK 410
T IDSGT+I+R P VYS+++ FR+ M Y KG+ + DTCYDLS YKTV VPK
Sbjct: 362 TAGTIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVS-ILDTCYDLSKYKTVKVPK 420
Query: 411 ITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR 470
I ++F GG +++L G + V V QVCL FA D ++GNVQQ+ V YD A
Sbjct: 421 IILYFSGGAEMDLAPEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEG 480
Query: 471 RLGFGPGNCN 480
R+GF P CN
Sbjct: 481 RVGFAPSGCN 490
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 317 bits (811), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 173/427 (40%), Positives = 249/427 (58%), Gaps = 16/427 (3%)
Query: 59 KVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAF 118
K SL V R+G CS+LN GK+ +P EILR DQ R++ +S+ +K D+ ++K+
Sbjct: 59 KSSLHVTHRHGTCSRLNNGKA-TSPDHVEILRLDQARVNSIHSKLSKKLATDHVSESKST 117
Query: 119 TFPAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH-CSQQRDPFFDP 176
PAK G + + Y + V +G PK +SL+ DTGS +TWTQC+PC+ C Q++P F+P
Sbjct: 118 DLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNP 177
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQ 236
SKS ++ + C+S C L G CS+ C Y I Y D S GF A ++ T+
Sbjct: 178 SKSTSYYNVSCSSAACGSLSSATGNAGS--CSASNCIYGIQYGDQSFSVGFLAKEKFTL- 234
Query: 237 EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSP 293
N + + Y GC +NN G G +G++GL R +S S+T +Y F YCL S
Sbjct: 235 -TNSDVFDGVY---FGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPSS 290
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
TG++TFG + VK+TPI T + + FY + + I+VGG++LP+ ++ F+
Sbjct: 291 ASYTGHLTFGSAGI--SRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPG 348
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITI 413
IDSGT+ITR P Y+ALRS+F+ +M KY G+ + DTC+DLS +KTV +PK+
Sbjct: 349 ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVS-ILDTCFDLSGFKTVTIPKVAF 407
Query: 414 HFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
F GG +EL +G V + QVCL FA D N+ + GNVQQ+ EV YD AG R+G
Sbjct: 408 SFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVG 467
Query: 474 FGPGNCN 480
F P C+
Sbjct: 468 FAPNGCS 474
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 316 bits (810), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 184/480 (38%), Positives = 266/480 (55%), Gaps = 23/480 (4%)
Query: 8 FLLFIWLLRSSNNGAYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGR 67
+L + L N GA + D SH+ VS + C + A K SL V R
Sbjct: 12 IILCVCLNLGCNEGAQEREIDDSHTIQVSSLFPASSSSCVLSPRASTT---KSSLHVTHR 68
Query: 68 YGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTG-I 126
+G CS+LN GK+ +P EILR DQ R++ +S+ +K ++ ++++ PAK G
Sbjct: 69 HGTCSRLNNGKA-TSPDHVEILRLDQARVNSIHSKLSKKLTTNHVSQSQSTDLPAKDGST 127
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH-CSQQRDPFFDPSKSKTFSKI 185
+ + Y + V +G PK +SL+ DTGS +TWTQC+PC+ C Q++P F+PSKS ++ +
Sbjct: 128 LGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNV 187
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTI--QEVNGNGY 243
C+S C L G CS+ C Y I Y D S GF A D+ T+ +V Y
Sbjct: 188 SCSSAACGSLSSATGNAGS--CSASNCIYGIQYGDQSFSVGFLAKDKFTLTSSDVFDGVY 245
Query: 244 FARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYI 300
F GC +NN G G +G++GL R +S S+T +Y F YCL S TG++
Sbjct: 246 F-------GCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYTGHL 298
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGT 360
TFG + VK+TPI T + + FY + + I+VGG++LP+ ++ F+ IDSGT
Sbjct: 299 TFGSAGI--SRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGT 356
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVD 420
+ITR P Y+ALRS+F+ +M KY G+ + DTC+DLS +KTV +PK+ F GG
Sbjct: 357 VITRLPPKAYAALRSSFKAKMSKYPTTSGVS-ILDTCFDLSGFKTVTIPKVAFSFSGGAV 415
Query: 421 LELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+EL +G + QVCL FA D N+ + GNVQQ+ EV YD AG R+GF P C+
Sbjct: 416 VELGSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 475
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 315 bits (807), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 172/425 (40%), Positives = 248/425 (58%), Gaps = 16/425 (3%)
Query: 61 SLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTF 120
SL V R+G CS+LN GK+ +P EILR DQ R++ +S+ +K D+ ++K+
Sbjct: 33 SLHVTHRHGTCSRLNNGKA-TSPDHVEILRLDQARVNSIHSKLSKKLATDHVSESKSTDL 91
Query: 121 PAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH-CSQQRDPFFDPSK 178
PAK G + + Y + V +G PK +SL+ DTGS +TWTQC+PC+ C Q++P F+PSK
Sbjct: 92 PAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSK 151
Query: 179 SKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEV 238
S ++ + C+S C L G CS+ C Y I Y D S GF A ++ T+
Sbjct: 152 STSYYNVSCSSAACGSLSSATGNAGS--CSASNCIYGIQYGDQSFSVGFLAKEKFTL--T 207
Query: 239 NGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYG 295
N + + Y GC +NN G G +G++GL R +S S+T +Y F YCL S
Sbjct: 208 NSDVFDGVY---FGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSAS 264
Query: 296 STGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE 355
TG++TFG + VK+TPI T + + FY + + I+VGG++LP+ ++ F+
Sbjct: 265 YTGHLTFGSAGI--SRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGAL 322
Query: 356 IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHF 415
IDSGT+ITR P Y+ALRS+F+ +M KY G+ + DTC+DLS +KTV +PK+ F
Sbjct: 323 IDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVS-ILDTCFDLSGFKTVTIPKVAFSF 381
Query: 416 LGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
GG +EL +G V + QVCL FA D N+ + GNVQQ+ EV YD AG R+GF
Sbjct: 382 SGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFA 441
Query: 476 PGNCN 480
P C+
Sbjct: 442 PNGCS 446
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 313 bits (802), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 178/422 (42%), Positives = 249/422 (59%), Gaps = 18/422 (4%)
Query: 55 QGPG-KVSLEVLGRYGPCSKLNQ--GKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDN 111
+GP K SLEV+ ++GPCS+LN GK+++ EIL +D++R+ NSR + D+
Sbjct: 63 KGPKRKASLEVVHKHGPCSQLNNHDGKAKSKTPHSEILNQDKERVKYINSRISKNLGQDS 122
Query: 112 -FKKTKAFTFPAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH-CSQ 168
+ + T PAK+G ++ + Y++VV +G PK+ +SL+ DTGS +TWTQC+PC C +
Sbjct: 123 SVSELDSVTLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYK 182
Query: 169 QRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFW 228
Q+D FDPSKS ++S I C ST C L S+K C Y I Y D S G++
Sbjct: 183 QQDAIFDPSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYF 242
Query: 229 ATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY--- 285
+ +R+++ + FL GC NN G G++G++GL R P+S + +T Y
Sbjct: 243 SRERLSVTATD-----IVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKI 297
Query: 286 FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLK 345
F YCL + STG ++FG T +VKYTP T S FY + +TGISVGG +LP+
Sbjct: 298 FSYCLPATSSSTGRLSFG---TTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVS 354
Query: 346 ASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT 405
+S F+ IDSGT+ITR P Y+ALRSAFR+ M KY G + DTCYDLS Y+
Sbjct: 355 SSTFSTGGAIIDSGTVITRLPPTAYTALRSAFRQGMSKYP-SAGELSILDTCYDLSGYEV 413
Query: 406 VVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHY 465
+PKI F GGV ++L +G L V S +QVCL FA D + + GNVQQ+ EV Y
Sbjct: 414 FSIPKIDFSFAGGVTVQLPPQGILYVASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVY 473
Query: 466 DV 467
DV
Sbjct: 474 DV 475
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 311 bits (798), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 175/417 (41%), Positives = 247/417 (59%), Gaps = 16/417 (3%)
Query: 59 KVSLEVLGRYGPCSKLNQ--GKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDN-FKKT 115
K SLEV+ ++GPCS+LN GK+++T +IL +D++R+ NSR + D+ ++
Sbjct: 69 KASLEVVHKHGPCSQLNDHDGKAKSTTPHSDILNQDKERVKYINSRLSKNLGQDSSVEEL 128
Query: 116 KAFTFPAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH-CSQQRDPF 173
+ T PAK+G ++ + Y++VV +G PK+ +SL+ DTGS +TWTQC+PC C +Q+D
Sbjct: 129 DSATLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDVI 188
Query: 174 FDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRM 233
FDPSKS ++S I C S C L + S+K C Y I Y D S G+++ +R+
Sbjct: 189 FDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRERL 248
Query: 234 TIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL 290
T+ + FL GC NN G G++G++GL R P+S + +T Y F YCL
Sbjct: 249 TVTATD-----VVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFSYCL 303
Query: 291 HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT 350
S STG+++FG T +++KYTP T S FY + +T I+VGG +LP+ +S F+
Sbjct: 304 PSTSSSTGHLSFGPAAT--GRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFS 361
Query: 351 KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPK 410
IDSGT+ITR P Y ALRSAFR+ M KY G + DTCYDLS YK +P
Sbjct: 362 TGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYP-SAGELSILDTCYDLSGYKVFSIPT 420
Query: 411 ITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
I F GGV ++L +G L V S +QVCL FA D + + GNVQQR EV YDV
Sbjct: 421 IEFSFAGGVTVKLPPQGILFVASTKQVCLAFAANGDDSDVTIYGNVQQRTIEVVYDV 477
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 311 bits (796), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 193/504 (38%), Positives = 272/504 (53%), Gaps = 40/504 (7%)
Query: 3 ILFKAFLLFIWLLR---SSNNGAYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGK 59
+LF +F + LL ++ A + SH + + ++SL+P + CN +G
Sbjct: 13 LLFSSFTFLLILLSFPVEKSHALEAKETIESHFHTLQLTSLLPSSSCNTATKGKRRG--- 69
Query: 60 VSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAF- 118
SLEV+ R GPC++LNQ K P+L EIL DQ R+ +R ++ D FKK
Sbjct: 70 ASLEVVNRQGPCTQLNQ-KGAKAPTLTEILAHDQARVDSIQARVTDQSY-DLFKKKDKKS 127
Query: 119 ------------TFPAKTGIVAADEYYIV-VAIGKPKQYVSLLLDTGSGITWTQCKPCIH 165
PA++G+ YIV V +G PK+ +SL+ DTGS +TWTQC+PC+
Sbjct: 128 SNKKKSVKDSKANLPAQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVK 187
Query: 166 -CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGE 224
C Q+ P FDPS SKT+S I C ST C L CSS C Y I Y D S
Sbjct: 188 SCYAQQQPIFDPSASKTYSNISCTSTACSGLKS--ATGNSPGCSSSNCVYGIQYGDSSFT 245
Query: 225 TGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNIS 284
GF+A D +T+ + N F F+ GC NN G +G++GL R P+SI+ +T
Sbjct: 246 VGFFAKDTLTLTQ---NDVFDG--FMFGCGQNNRGLFGKTAGLIGLGRDPLSIVQQTAQK 300
Query: 285 ---YFFYCLHSPYGSTGYITFGKPDTVN-----KKFVKYTPIVTTPEQSEFYHITLTGIS 336
YF YCL + GS G++TFG + V K + +TP ++ + + FY I + GIS
Sbjct: 301 FGKYFSYCLPTSRGSNGHLTFGNGNGVKTSKAVKNGITFTPFASS-QGATFYFIDVLGIS 359
Query: 337 VGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDT 396
VGG+ L + F T IDSGT+ITR P+ VY +L+S F++ M KY + L DT
Sbjct: 360 VGGKALSISPMLFQNAGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALS-LLDT 418
Query: 397 CYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNV 456
CYDLS Y ++ +PKI+ +F G +++L+ G L+ QVCL FA D + GN+
Sbjct: 419 CYDLSNYTSISIPKISFNFNGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIGIFGNI 478
Query: 457 QQRGYEVHYDVAGRRLGFGPGNCN 480
QQ+ EV YDVAG +LGFG C+
Sbjct: 479 QQQTLEVVYDVAGGQLGFGYKGCS 502
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 306 bits (785), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 186/459 (40%), Positives = 265/459 (57%), Gaps = 22/459 (4%)
Query: 31 HSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILR 90
HS+ + VSSL+P C + L K SL+V+ ++GPCSKL+Q ++ P+ EIL
Sbjct: 45 HSHSIEVSSLLPSASCKPSTKVLSNNDNKASLKVVHKHGPCSKLSQDEASAAPTHTEILL 104
Query: 91 RDQQRLHLKNSR--RLQKAIPDNFKKTKAFTFPAKTG-IVAADEYYIVVAIGKPKQYVSL 147
+DQ R+ +SR + + + K T + T PAK G V + Y + V +G PK+ +SL
Sbjct: 105 QDQSRVKSIHSRLSNSKTSGGKDVKVTDSTTIPAKDGSTVGSGNYIVTVGLGTPKKDLSL 164
Query: 148 LLDTGSGITWTQCKPCIH-CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDK 206
+ DTGS ITWTQC+PC C +Q++ FDPS+S +++ I C+S+ C L
Sbjct: 165 IFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSSICNSLTS--ATGNTPG 222
Query: 207 CSSKECPYDIAYVDGSGETGFWATDRMTIQEVNG--NGYFARYPFLLGCTDNNTGDQNGA 264
C+S C Y I Y D S GF+ T+++T+ + N YF GC NN G G+
Sbjct: 223 CASSACVYGIQYGDSSFSVGFFGTEKLTLTSTDAFNNIYF-------GCGQNNQGLFGGS 275
Query: 265 SGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTT 321
+G++GL R +S++S+T Y F YCL S STG++TFG + N KF TP+ T
Sbjct: 276 AGLLGLGRDKLSVVSQTAQKYNKIFSYCLPSSSSSTGFLTFGGSASKNAKF---TPLSTI 332
Query: 322 PEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRM 381
FY + TGISVGG++L + AS F+ IDSGT+ITR P YSALR++FR M
Sbjct: 333 SAGPSFYGLDFTGISVGGKKLAISASVFSTAGAIIDSGTVITRLPPAAYSALRASFRNLM 392
Query: 382 KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGF 441
KY M K + + DTCYD S+Y T+ VPKI F G+++++D G L S+ QVCL F
Sbjct: 393 SKYPMTKALS-ILDTCYDFSSYTTISVPKIGFSFSSGIEVDIDATGILYASSLSQVCLAF 451
Query: 442 ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
A + + GNVQQ+ EV YD + ++GF PG C+
Sbjct: 452 AGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFAPGGCS 490
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 306 bits (783), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 180/463 (38%), Positives = 267/463 (57%), Gaps = 37/463 (7%)
Query: 24 ANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTP 83
A +N L + + +S+L+P C + T + Q K SL+V+ ++GPCS+LNQ ++ N P
Sbjct: 32 AQENHLQLIHAIEISNLLPSADCEHS-TKVAQN--KASLKVVHKHGPCSQLNQ-QNGNAP 87
Query: 84 SLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIV-VAIGKPK 142
+L EIL DQ R+ +S + + K+T A P K+G+ YIV + +G PK
Sbjct: 88 NLVEILLEDQSRV---DSIHAKLSDHSGVKETDAAKLPTKSGMSLGTGNYIVSIGLGSPK 144
Query: 143 QYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
+ + L+ DTGS +TW +C FDP+KS +++ + C++ C ++
Sbjct: 145 KDLMLIFDTGSDLTWARCSAA--------ETFDPTKSTSYANVSCSTPLCSSVIS--ATG 194
Query: 203 GQDKCSSKECPYDIAYVDGSGETGFWATDRMTI--QEVNGNGYFARYPFLLGCTDNNTGD 260
+C++ C Y I Y DGS GF +R+TI ++ N YF GC + G
Sbjct: 195 NPSRCAASTCVYGIQYGDGSYSIGFLGKERLTIGSTDIFNNFYF-------GCGQDVDGL 247
Query: 261 QNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTP 317
A+G++GL R +S++S+T Y F YCL S STG+++FG + K K+TP
Sbjct: 248 FGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPSS-SSTGFLSFGSSQS---KSAKFTP 303
Query: 318 IVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAF 377
+ + P S FY++ LTGI+VGG++L + S F+ T IDSGT++TR P YSALRSAF
Sbjct: 304 LSSGP--SSFYNLDLTGITVGGQKLAIPLSVFSTAGTIIDSGTVVTRLPPAAYSALRSAF 361
Query: 378 RKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV 437
RK M Y MGK + + DTCYD S YKT+ VPKI I F GGVD+++D G V ++QV
Sbjct: 362 RKAMASYPMGKPLS-ILDTCYDFSKYKTIKVPKIVISFSGGVDVDVDQAGIFVANGLKQV 420
Query: 438 CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
CL FA ++ + GN QQR +EV YDV+G ++GF P +C+
Sbjct: 421 CLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGKVGFAPASCS 463
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 189/489 (38%), Positives = 280/489 (57%), Gaps = 48/489 (9%)
Query: 7 AFLLFIWLLRSSNNGAYANDNDLSHSYI--VSVSSLIPPTVCNRTRTALPQGPGKVSLEV 64
+F+++ +LL S N N ++ + +Y + +SSL VC + AL +G SL++
Sbjct: 8 SFVIYGFLLLSPCNSLKDNADEGTRAYFHTLKISSLPSTEVCKESSKALNEGSS--SLKL 65
Query: 65 LGRYGPCSKLNQGKSRNTP--SLEEILRRDQQR----LHLKNSRRLQKAIPDNFKKTKAF 118
+ R+GPC N ++ P S EILRRD+ R + + S L ++ ++ K + F
Sbjct: 66 VHRFGPC---NPHRTSTAPASSFNEILRRDKLRVDSIIQARRSMNLTSSV-EHMKSSVPF 121
Query: 119 TFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSK 178
+K + A +Y + V IG PK+ + L+ DTGSG+ WTQCKPC C + P FDP+K
Sbjct: 122 YGLSK---ITASDYIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPCKAC-YPKVPVFDPTK 177
Query: 179 SKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEV 238
S +F +PC+S C+ + + CSS +C Y AYVD S TG AT+ ++ +
Sbjct: 178 SASFKGLPCSSKLCQSI--------RQGCSSPKCTYLTAYVDNSSSTGTLATETISFSHL 229
Query: 239 NGNGYFARYPF---LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS 292
+Y F L+GC+D +G+ G SGIMGL+R P+S+ S+T Y F YC+ S
Sbjct: 230 -------KYDFKNILIGCSDQVSGESLGESGIMGLNRSPISLASQTANIYDKLFSYCIPS 282
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL 352
GSTG++TFG + V+++P+ T S+ Y I +TGISVGG +L + AS F K+
Sbjct: 283 TPGSTGHLTFGGKVPND---VRFSPVSKTAPSSD-YDIKMTGISVGGRKLLIDASAF-KI 337
Query: 353 STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKIT 412
++ IDSG ++TR P YSALRS FR+ MK Y + +D DTCYD S Y TV +P I+
Sbjct: 338 ASTIDSGAVLTRLPPKAYSALRSVFREMMKGYPLLDQ-DDFLDTCYDFSNYSTVAIPSIS 396
Query: 413 IHFLGGVDLELDVRGTL-VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRR 471
+ F GGV++++DV G + V + CL FA L D + GN QQ+ Y V +D A R
Sbjct: 397 VFFEGGVEMDIDVSGIMWQVPGSKVYCLAFAEL--DDEVSIFGNFQQKTYTVVFDGAKER 454
Query: 472 LGFGPGNCN 480
+GF PG C+
Sbjct: 455 IGFAPGGCD 463
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 300 bits (769), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 186/476 (39%), Positives = 271/476 (56%), Gaps = 28/476 (5%)
Query: 14 LLRSSNNGAYANDNDLSHSY--IVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPC 71
LL S G +N+ + SY I+ V+SL+P T CN + +SLEV+ R+GPC
Sbjct: 4 LLFSLEKGYAVEENEATKSYLHIIKVNSLLPTTACNHSSKV----SNSLSLEVVHRHGPC 59
Query: 72 -SKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAA 129
+NQ K + PS EI RDQ R+ ++R + + F + +A T P ++G + A
Sbjct: 60 IGIVNQEKGADAPSNMEIFLRDQNRVDSIHARLSSRGM---FPEKQATTLPVQSGASIGA 116
Query: 130 DEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH-CSQQRDPFFDPSKSKTFSKIPCN 188
+Y + V +G PK+ +L+ DTGS ITWTQC+PC+ C +Q++P +PS S ++ I C+
Sbjct: 117 GDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCS 176
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
S CK++ CSS C Y + Y DGS GF+AT+ +T+ N
Sbjct: 177 SALCKLVASG--KKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN-----VFKN 229
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKP 305
FL GC N G GA+G++GL R +++ S+T +Y F YCL + S GY++ G
Sbjct: 230 FLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSLGGQ 289
Query: 306 DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRF 365
+ K VK+TP+ + + FY + +TG+SVGG +L + S F+ T IDSGT+ITR
Sbjct: 290 VS---KSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSA-GTVIDSGTVITRL 345
Query: 366 PAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDV 425
YS L SAF+ M Y G +FDTCYD S Y TV +PK+ + F GGV++++DV
Sbjct: 346 SPTAYSELSSAFQNLMTDYPSTSGYS-IFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDV 404
Query: 426 RGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G L V +++VCL FA D ++ + GNVQQR Y+V YD A R+GF PG C+
Sbjct: 405 SGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 460
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 299 bits (765), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 166/398 (41%), Positives = 231/398 (58%), Gaps = 14/398 (3%)
Query: 89 LRRDQQRLHLKNSRRLQKAIPDN-FKKTKAFTFPAKTG-IVAADEYYIVVAIGKPKQYVS 146
+ D +R+ SR + +N K + T PA++G ++ + Y +VV +G PK+ +S
Sbjct: 1 MNLDNERVKYIQSRLSKNLGRENTVKDLDSTTLPAESGSLIGSANYVVVVGLGTPKRDLS 60
Query: 147 LLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
L+ DTGS +TWTQC+PC C +Q+D FDPSKS +++ I C S+ C L +
Sbjct: 61 LVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSLCTQLTSDGIKSECS 120
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGAS 265
+ C YD Y D S GF + +R+TI + FL GC +N G NG++
Sbjct: 121 SSTDASCIYDAKYGDNSTSVGFLSQERLTITATD-----IVDDFLFGCGQDNEGLFNGSA 175
Query: 266 GIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTP 322
G+MGL R P+SI+ +T+ +Y F YCL + S G++TFG N + YTP+ T
Sbjct: 176 GLMGLGRHPISIVQQTSSNYNKIFSYCLPATSSSLGHLTFGASAATNASLI-YTPLSTIS 234
Query: 323 EQSEFYHITLTGISVGGERLP-LKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRM 381
+ FY + + ISVGG +LP + +S F+ + IDSGT+ITR VY+ALRSAFR+ M
Sbjct: 235 GDNSFYGLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXM 294
Query: 382 KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGF 441
+KY + L DTCYDLS YK + VP+I F GGV +EL RG L VES +QVCL F
Sbjct: 295 EKYPVANE-AGLLDTCYDLSGYKEISVPRIDFEFSGGVTVELXHRGILXVESEQQVCLAF 353
Query: 442 ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
A SD + + GNVQQ+ EV YDV G R+GFG C
Sbjct: 354 AANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAAGC 391
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 298 bits (762), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 184/469 (39%), Positives = 269/469 (57%), Gaps = 28/469 (5%)
Query: 21 GAYANDNDLSHSY--IVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPC-SKLNQG 77
G +N+ + SY I+ V+SL+P T CN + +SLEV+ R+GPC +NQ
Sbjct: 23 GYAVEENEATKSYLHIIKVNSLLPTTACNHSSKV----SNSLSLEVVHRHGPCIGIVNQE 78
Query: 78 KSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVV 136
K + PS EI RDQ R+ ++R + + F + +A T P ++G + A +Y + V
Sbjct: 79 KGADAPSNMEIFLRDQNRVDSIHARLSSRGM---FPEKQATTLPVQSGASIGAGDYVVTV 135
Query: 137 AIGKPKQYVSLLLDTGSGITWTQCKPCIH-CSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
+G PK+ +L+ DTGS ITWTQC+PC+ C +Q++P +PS S ++ I C+S CK++
Sbjct: 136 GLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLV 195
Query: 196 LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTD 255
CSS C Y + Y DGS GF+AT+ +T+ N F FL GC
Sbjct: 196 ASG--KKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN---VFKN--FLFGCGQ 248
Query: 256 NNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKF 312
N G GA+G++GL R +++ S+T +Y F YCL + S GY++ G + K
Sbjct: 249 QNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSLGGQVS---KS 305
Query: 313 VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSA 372
VK+TP+ + + FY + +TG+SVGG +L + S F+ T IDSGT+ITR YS
Sbjct: 306 VKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSA-GTVIDSGTVITRLSPTAYSE 364
Query: 373 LRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-V 431
L SAF+ M Y G +FDTCYD S Y TV +PK+ + F GGV++++DV G L V
Sbjct: 365 LSSAFQNLMTDYPSTSGYS-IFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPV 423
Query: 432 ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+++VCL FA D ++ + GNVQQR Y+V YD A R+GF PG C+
Sbjct: 424 NGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 472
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 297 bits (760), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 194/504 (38%), Positives = 272/504 (53%), Gaps = 40/504 (7%)
Query: 3 ILFKAFLLFIWLLRSSNNGAYA---NDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGK 59
+LF + + LL S ++A + SH + + +SSL+P + CN +G
Sbjct: 13 LLFSSSAFLLILLSFSVEKSHALETRETIESHFHTLQLSSLLPSSSCNPATKGKRRG--- 69
Query: 60 VSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAF- 118
SLEV+ R GPC+ LNQ K P+L EIL DQ R+ +R ++ D FKK
Sbjct: 70 ASLEVVNRQGPCTLLNQ-KGAKAPTLTEILAHDQARVDSIQARITDQSY-DLFKKKDKKS 127
Query: 119 ------------TFPAKTGIVAADEYYIV-VAIGKPKQYVSLLLDTGSGITWTQCKPCIH 165
PA++G+ YIV V +G PK+ +SL+ DTGS +TWTQC+PC+
Sbjct: 128 SNKKKSVKDSKANLPAQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVK 187
Query: 166 -CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGE 224
C Q+ P FDPS SKT+S I C S C L CSS C Y I Y D S
Sbjct: 188 SCYAQQQPIFDPSTSKTYSNISCTSAACSSLKS--ATGNSPGCSSSNCVYGIQYGDSSFT 245
Query: 225 TGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNIS 284
GF+A D++T+ + N F F+ GC NN G +G++GL R P+SI+ +T
Sbjct: 246 IGFFAKDKLTLTQ---NDVFDG--FMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQK 300
Query: 285 ---YFFYCLHSPYGSTGYITFGKPDTVN-----KKFVKYTPIVTTPEQSEFYHITLTGIS 336
YF YCL + GS G++TFG + V K + +TP ++ + + +Y I + GIS
Sbjct: 301 FGKYFSYCLPTSRGSNGHLTFGNGNGVKASKAVKNGITFTPFASS-QGTAYYFIDVLGIS 359
Query: 337 VGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDT 396
VGG+ L + F T IDSGT+ITR P+ Y +L+SAF++ M KY + L DT
Sbjct: 360 VGGKALSISPMLFQNAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALS-LLDT 418
Query: 397 CYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNV 456
CYDLS Y ++ +PKI+ +F G ++ELD G L+ QVCL FA D + + GN+
Sbjct: 419 CYDLSNYTSISIPKISFNFNGNANVELDPNGILITNGASQVCLAFAGNGDDDSIGIFGNI 478
Query: 457 QQRGYEVHYDVAGRRLGFGPGNCN 480
QQ+ EV YDVAG +LGFG C+
Sbjct: 479 QQQTLEVVYDVAGGQLGFGYKGCS 502
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 295 bits (755), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 180/461 (39%), Positives = 253/461 (54%), Gaps = 27/461 (5%)
Query: 30 SHSYIVSVSSLIPPTVCNRTRTALPQGP--GKVSLEVLGRYGPCSKLNQGKSRNTPSLEE 87
SH V ++ L P C R + + SLEV+ R+GPC + N P+ E
Sbjct: 29 SHFLTVDLAGLFPSASCTRRSPQVHTSSLGEQSSLEVIHRHGPCGD----EVSNAPTAAE 84
Query: 88 ILRRDQQR---LHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIV-VAIGKPKQ 143
+L +DQ R +H K + L+ D + +KA PAK+G YIV V +G PK+
Sbjct: 85 MLVKDQSRVDFIHSKIAGELESV--DRLRGSKATKIPAKSGATIGSGNYIVSVGLGTPKK 142
Query: 144 YVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
Y+SL+ DTGS +TWTQC+PC +C Q+DP F PS+S T+S I C+S C LE N
Sbjct: 143 YLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQSTTYSNISCSSPDCS-QLESGTGN 201
Query: 203 GQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQN 262
+++ C Y I Y D S G++A + +T+ + FL GC NN G
Sbjct: 202 QPGCSAARACIYGIQYGDQSFSVGYFAKETLTLTSTD-----VIENFLFGCGQNNRGLFG 256
Query: 263 GASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIV 319
A+G++GL + +SI+ +T Y F YCL STGY+TFG +KYTPI
Sbjct: 257 SAAGLIGLGQDKISIVKQTAQKYGQVFSYCLPKTSSSTGYLTFGGGGGGGA--LKYTPIT 314
Query: 320 TTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRK 379
+ FY + + G+ VGG ++P+ +S F+ IDSGT+ITR P YSAL+SAF K
Sbjct: 315 KAHGVANFYGVDIVGMKVGGTQIPISSSVFSTSGAIIDSGTVITRLPPDAYSALKSAFEK 374
Query: 380 RMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCL 439
M KY + + DTCYDLS Y T+ +PK+ F GG +L+LD G + S QVCL
Sbjct: 375 GMAKYPKAPELS-ILDTCYDLSKYSTIQIPKVGFVFKGGEELDLDGIGIMYGASTSQVCL 433
Query: 440 GFALLPSDPNSI-LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
FA DP+++ ++GNVQQ+ +V YDV G ++GFG C
Sbjct: 434 AFA-GNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGFGYNGC 473
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 294 bits (753), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 185/488 (37%), Positives = 266/488 (54%), Gaps = 26/488 (5%)
Query: 3 ILFKAFLLFIWLLRSSNN-----GAYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGP 57
+ F L +WLL S NN G ++ +H+ I ++SL+P C + T +P
Sbjct: 23 VSFIKHFLSLWLLFSFNNCYAFEGRKFAESQHTHTTI-HLTSLLPAASC-KPSTQVPSIE 80
Query: 58 GKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKA 117
K L+V+ ++GPCS L QG + IL +DQ R+ +S+ + + + K T A
Sbjct: 81 NKAFLKVVHKHGPCSDLRQGHKAEA---QYILLQDQSRVDSIHSKLSKDSGLSDVKATAA 137
Query: 118 FTFPAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH-CSQQRDPFFD 175
T PAK G I+ + Y++ V +G PK+ SL+ DTGS +TWTQC+PC+ C Q++ F+
Sbjct: 138 TTLPAKDGSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEAIFN 197
Query: 176 PSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTI 235
PS+S +++ I C ST C L C+S C Y I Y D S GF+ +++++
Sbjct: 198 PSQSTSYANISCGSTLCDSLAS--ATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLSL 255
Query: 236 QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS 292
+ F GC NN G GA+G++GL R +S++S+T Y F YCL S
Sbjct: 256 TATD-----VFNDFYFGCGQNNKGLFGGAAGLLGLGRDKLSLVSQTAQRYNKIFSYCLPS 310
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL 352
STG++TFG + K +TP+ T S FY + LTGISVGG +L + S F+
Sbjct: 311 SSSSTGFLTFGGSTS---KSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVFSTA 367
Query: 353 STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKIT 412
T IDSGT+ITR P YSAL S FRK M +Y + + DTC+D S + T+ VPKI
Sbjct: 368 GTIIDSGTVITRLPPAAYSALSSTFRKLMSQYPAAPALS-ILDTCFDFSNHDTISVPKIG 426
Query: 413 IHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRL 472
+ F GGV +++D G V + QVCL FA + + GNVQQ+ EV YD A R+
Sbjct: 427 LFFSGGVVVDIDKTGIFYVNDLTQVCLAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGRV 486
Query: 473 GFGPGNCN 480
GF P C+
Sbjct: 487 GFAPAGCS 494
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 282 bits (721), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 171/427 (40%), Positives = 248/427 (58%), Gaps = 22/427 (5%)
Query: 61 SLEVLGRYGPC-SKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFT 119
SLEV+ R+GPC +NQ K + PS EI RDQ R+ ++R + + F + +A T
Sbjct: 1 SLEVVHRHGPCIGIVNQEKGADAPSNMEIFLRDQNRVDSIHARLSSRGM---FPEKQATT 57
Query: 120 FPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH-CSQQRDPFFDPS 177
P ++G + A +Y + V +G PK+ +L+ DTGS ITWTQC+PC+ C +Q++P +PS
Sbjct: 58 LPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPS 117
Query: 178 KSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE 237
S ++ I C+S CK++ CSS C Y + Y DGS GF+AT+ +T+
Sbjct: 118 TSTSYKNISCSSALCKLVASG--KKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSS 175
Query: 238 VNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPY 294
N FL GC N G GA+G++GL R +++ S+T +Y F YCL +
Sbjct: 176 SN-----VFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASS 230
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLST 354
S GY++ G + K VK+TP+ + + FY + +TG+SVGG +L + S F+ T
Sbjct: 231 SSKGYLSLGGQVS---KSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAFSA-GT 286
Query: 355 EIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
IDSGT+ITR YS L SAF+ M Y G +FDTCYD S Y TV +PK+ +
Sbjct: 287 VIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYS-IFDTCYDFSKYDTVRIPKVGVT 345
Query: 415 FLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
F GGV++++DV G L V +++VCL FA D ++ + GNVQQR Y+V YD A R+G
Sbjct: 346 FKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVG 405
Query: 474 FGPGNCN 480
F PG C+
Sbjct: 406 FAPGGCS 412
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 183/483 (37%), Positives = 261/483 (54%), Gaps = 35/483 (7%)
Query: 8 FLLFIWLLRSSNNG--AYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVL 65
FLLF+ L S G AN++ + + + V+SL+ C+++ + + SL+VL
Sbjct: 17 FLLFLCPLCSLKKGYAVEANEHIKKYVHTLEVNSLLASDSCDQSSKVIDKAS---SLQVL 73
Query: 66 GRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTG 125
+YGPC ++ N S E L +DQ R+ +R L K + PA++G
Sbjct: 74 HKYGPCMQV-----LNDRSHVEFLLQDQLRVDSIQAR-LSKISGHGIFEEMVTKLPAQSG 127
Query: 126 I-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFS 183
I + Y + V +G PK+ +L+ DTGSGITWTQC+PC+ C Q++ FDP+KS +++
Sbjct: 128 IAIGTGNYVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQKFDPTKSTSYN 187
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGN 241
+ C+S +C +L P + CS+ C Y I Y D S GF+AT+ +TI +
Sbjct: 188 NVSCSSASCNLL-----PTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTISSSD-- 240
Query: 242 GYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTG 298
FL GC +N G A+G++GL VS+ S+T Y F YCL S STG
Sbjct: 241 ---VFTNFLFGCGQSNNGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPSTPSSTG 297
Query: 299 YITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDS 358
Y+ FG + F TPI +P S FY I + GISV G +LP+ S FT IDS
Sbjct: 298 YLNFGGKVSQTAGF---TPI--SPAFSSFYGIDIVGISVAGSQLPIDPSIFTTSGAIIDS 352
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGG 418
GT+ITR P Y AL+ AF ++M Y G ++L DTCYD S Y TV PK+++ F GG
Sbjct: 353 GTVITRLPPTAYKALKEAFDEKMSNYPKTNG-DELLDTCYDFSNYTTVSFPKVSVSFKGG 411
Query: 419 VDLELDVRGTL-VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
V++++D G L +V V+ VCL FA D + GN QQ+ YEV YD A +GF G
Sbjct: 412 VEVDIDASGILYLVNGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAG 471
Query: 478 NCN 480
C+
Sbjct: 472 ACS 474
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 274 bits (701), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 167/457 (36%), Positives = 241/457 (52%), Gaps = 24/457 (5%)
Query: 33 YIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRD 92
++VSV++L+P VC R A +L V+ R+GPCS L Q + PS EIL RD
Sbjct: 40 HVVSVAALLPDAVCTPKRAAASN---SSALSVVHRHGPCSPL-QARG-GEPSHAEILDRD 94
Query: 93 QQR---LHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIV-VAIGKPKQYVSLL 148
Q R +H + R D +K + PA+ G+ YIV V +G PK+ + ++
Sbjct: 95 QDRVDSIHRLAAARPSSTADDPSSASKGVSLPARRGVPLGTANYIVSVGLGTPKRDLLVV 154
Query: 149 LDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCS 208
DTGS ++W QCKPC C QQ DP FDPS+S T+S +PC + C+ L CS
Sbjct: 155 FDTGSDLSWVQCKPCDGCYQQHDPLFDPSQSTTYSAVPCGAQECRRL-------DSGSCS 207
Query: 209 SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY-PFLLGCTDNNTGDQNGASGI 267
S +C Y++ Y D S G A D +T+ + + + F+ GC D++TG A G+
Sbjct: 208 SGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFGCGDDDTGLFGKADGL 267
Query: 268 MGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQ 324
GL R VS+ S+ Y F YCL S + GY++ G N +F T +VT +
Sbjct: 268 FGLGRDRVSLASQAAAKYGAGFSYCLPSSSTAEGYLSLGSAAPPNARF---TAMVTRSDT 324
Query: 325 SEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKY 384
FY++ L GI V G + + + F T IDSGT+ITR P+ Y+ALRS+F M++Y
Sbjct: 325 PSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVIDSGTVITRLPSRAYAALRSSFAGLMRRY 384
Query: 385 KMGKGIE-DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFAL 443
+ + DTCYD + V +P + + F GG L L L V + Q CL FA
Sbjct: 385 SYKRAPALSILDTCYDFTGRNKVQIPSVALLFDGGATLNLGFGEVLYVANKSQACLAFAS 444
Query: 444 LPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
D + +LGN+QQ+ + V YDVA +++GFG C+
Sbjct: 445 NGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGCS 481
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 183/490 (37%), Positives = 262/490 (53%), Gaps = 34/490 (6%)
Query: 1 MRILFKAFLLFIWLLRSSNNGAYANDNDLSHSYI--VSVSSLIPPTVCNRTRTALPQGPG 58
+ + FL+ + L S G + + +YI V V+SL+P VC+++ L +
Sbjct: 11 LTFILYVFLVLLCPLCSLKKGLTVEGKETTKNYIRTVRVNSLLPSNVCSQSTRVLNRAS- 69
Query: 59 KVSLEVLGRYGPCSKLNQG-KSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKA 117
SL+V+ +YGPC + K+ N PS E L +DQ R+ R FK+ +
Sbjct: 70 --SLKVVNKYGPCIPVTGAPKTINVPSTAEFLLQDQLRVKSFQVRLSMNPSSGVFKEMQT 127
Query: 118 FTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH-CSQQRDPFFDP 176
T PA + Y + V +G PK+ +L DTGS +TWTQC+PC+ C Q P FDP
Sbjct: 128 -TIPASI-VPTGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDP 185
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGET-GFWATDRMTI 235
+ S ++ + C+S CK++ E P QD C S C Y I Y GSG T GF AT+ + I
Sbjct: 186 TTSTSYKNVSCSSEFCKLIAEGNYP-AQD-CISNTCLYGIQY--GSGYTIGFLATETLAI 241
Query: 236 QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS 292
+ F FL GC++ + G NG +G++GL R P+++ S+T Y F YCL +
Sbjct: 242 ASSD---VFKN--FLFGCSEESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCLPA 296
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL 352
STG+++FG + + K TPI +P+ + Y + GISV G LP+ S
Sbjct: 297 SPSSTGHLSFGVEVS---QAAKSTPI--SPKLKQLYGLNTVGISVRGRELPINGSI---S 348
Query: 353 STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS--AYKTVVVPK 410
T IDSGT T P+P YSAL SAFR+ M Y + G F CYD S T+ +P
Sbjct: 349 RTIIDSGTTFTFLPSPTYSALGSAFREMMANYTLTNGTSS-FQPCYDFSNIGNGTLTIPG 407
Query: 411 ITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
I+I F GGV++E+DV G ++ V +++VCL FA SD + + GN QQ+ YEV YDVA
Sbjct: 408 ISIFFEGGVEVEIDVSGIMIPVNGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYDVAK 467
Query: 470 RRLGFGPGNC 479
+GF P C
Sbjct: 468 GMVGFAPKGC 477
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 271 bits (694), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 177/476 (37%), Positives = 264/476 (55%), Gaps = 36/476 (7%)
Query: 15 LRSSNNGAYANDNDLSHSYI--VSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCS 72
L S G N+++ Y V+V+SL+P +VC+ + L + SL+V+ +YGPC+
Sbjct: 21 LCSLKKGHTVAANEITKGYFRNVNVNSLLPSSVCDHSNKVLNKAS---SLKVVSKYGPCT 77
Query: 73 KLNQGKSRNTPSLEEILRRDQQRLH-LKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADE 131
G + PS EILRRDQ R+ ++ + + F + K G
Sbjct: 78 V--TGDPKTFPSAAEILRRDQLRVKSIRAKHSMNSSTTGVFNEMKTRVPTTHFG----GG 131
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH-CSQQRDPFFDPSKSKTFSKIPCNST 190
Y + V +G PK+ SLL DTGS +TWTQC+PC C Q D FDP+KS ++ + C+S
Sbjct: 132 YAVTVGLGTPKKDFSLLFDTGSDLTWTQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSE 191
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYPF 249
CK + + + Q SS C Y + Y G+G T GF AT+ +TI + F F
Sbjct: 192 PCKSIGKE---SAQGCSSSNSCLYGVKY--GTGYTVGFLATETLTITPSD---VFEN--F 241
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPD 306
++GC + N G +G +G++GL R PV++ S+T+ +Y F YCL + STG+++FG
Sbjct: 242 VIGCGERNGGRFSGTAGLLGLGRSPVALPSQTSSTYKNLFSYCLPASSSSTGHLSFGGGV 301
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFP 366
+ + K+TPI T + E Y + ++GISVGG +LP+ S F T IDSGT +T P
Sbjct: 302 S---QAAKFTPI--TSKIPELYGLDVSGISVGGRKLPIDPSVFRTAGTIIDSGTTLTYLP 356
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS--AYKTVVVPKITIHFLGGVDLELD 424
+ +SAL SAF++ M Y + KG L CYD S A + +P+I+I F GGV++++D
Sbjct: 357 STAHSALSSAFQEMMTNYTLTKGTSGL-QPCYDFSKHANDNITIPQISIFFEGGVEVDID 415
Query: 425 VRGTLVVES-VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
G + + + +VCL F +D + + GNVQQ+ YEV YDVA +GF PG C
Sbjct: 416 DSGIFIAANGLEEVCLAFKDNGNDTDVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 471
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 176/485 (36%), Positives = 244/485 (50%), Gaps = 28/485 (5%)
Query: 4 LFKAFLLFIWLLRSSNNGAYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLE 63
L A L+ L GA A + + ++VSV+SL+P TVC T+ A P +L
Sbjct: 11 LLAASLVLATLASPHRLGAAAGEGSETKWHVVSVNSLLPSTVCTPTKAA----PSSSALT 66
Query: 64 VLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAK 123
V+ +GPCS Q R PS EIL RDQ R+ RR A+ +K P +
Sbjct: 67 VVHGHGPCSP--QESRRGAPSHTEILGRDQDRVDAI--RRKVAAVTTAASSSKPKGVPLQ 122
Query: 124 TG---IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSK 180
G + Y+ + +G P + + LDTGS +W QCKPC C +Q + FDPSKS
Sbjct: 123 VGWGKYLDTTNYFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSS 182
Query: 181 TFSKIPCNSTTCKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQEVN 239
T+S I C+S C+ L + + CSS K+CPY+I Y D S G A D +T+ +
Sbjct: 183 TYSDITCSSRECQELGS----SHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPTD 238
Query: 240 GNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS 296
A F+ GC NN G G++GL RG S+ S+ Y F YCL S +
Sbjct: 239 -----AVPGFVFGCGHNNAGSFGEIDGLLGLGRGKASLSSQVAARYGAGFSYCLPSSPSA 293
Query: 297 TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-TKLSTE 355
TGY++F ++T +V + FY++ LTGI+V G + + S F T T
Sbjct: 294 TGYLSFSGAAAAAPTNAQFTEMVAG-QHPSFYYLNLTGITVAGRAIKVPPSVFATAAGTI 352
Query: 356 IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHF 415
IDSGT + P Y+ALRS+ R M +YK +FDTCYDL+ ++TV +P + + F
Sbjct: 353 IDSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPS-STIFDTCYDLTGHETVRIPSVALVF 411
Query: 416 LGGVDLELDVRGTLVVES-VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
G + L G L S V Q CL F P D + +LGN QQR V YDV +++GF
Sbjct: 412 ADGATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGF 471
Query: 475 GPGNC 479
G C
Sbjct: 472 GANGC 476
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 269 bits (687), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 162/475 (34%), Positives = 240/475 (50%), Gaps = 47/475 (9%)
Query: 34 IVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQ 93
++SV+SL P C T P + ++ ++GPCS L + P+ +EIL DQ
Sbjct: 43 LLSVASLFPGPACPATAEHGPSAAASARMRIVHQHGPCSPLADAHGK-PPAHDEILAADQ 101
Query: 94 QRLHLKNSR--------RLQKAI--------------PDNFKKTKAFTFPAKTG-IVAAD 130
R+ R +L K P + + + PA +G V+
Sbjct: 102 NRVESIQRRVSATTGRDKLTKHAAPVQPGPKKSPGIHPGHSASSSTPSLPATSGRAVSTG 161
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNS 189
Y + V +G P +++ DTGS TW QC+PC+ C +Q++P FDP+KS T++ + C
Sbjct: 162 NYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVSCTD 221
Query: 190 TTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
+ C L + C+ C Y + Y DGS GF+A D +TI G F
Sbjct: 222 SACADL-------DTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKG------F 268
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPD 306
GC + N G +G+MGL RG S+ + Y F YCL + TGY+ FG
Sbjct: 269 RFGCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGTGYLDFGPGS 328
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFP 366
N + TP++T Q+ FY++ +TGI VGG+++P+ S F+ T +DSGT+ITR P
Sbjct: 329 AGNN--ARLTPMLTDKGQT-FYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLP 385
Query: 367 APVYSALRSAFRKRM--KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELD 424
A Y+AL SAF K M + YK G + DTCYD + V +P +++ F GG L++D
Sbjct: 386 ATAYTALSSAFDKVMLARGYKKAPGYS-ILDTCYDFTGLSDVELPTVSLVFQGGACLDVD 444
Query: 425 VRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
V G + S QVCL FA D + ++GN QQ+ Y V YD+ + +GF PG+C
Sbjct: 445 VSGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 268 bits (684), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 162/475 (34%), Positives = 239/475 (50%), Gaps = 47/475 (9%)
Query: 34 IVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQ 93
++SV+SL P C T P + ++ ++GPCS L + P+ +EIL DQ
Sbjct: 43 LLSVASLFPGPACPATAEHGPSAAASARMRIVHQHGPCSPLADAHGK-PPAHDEILAADQ 101
Query: 94 QRLHLKNSR--------RLQKAI--------------PDNFKKTKAFTFPAKTG-IVAAD 130
R+ R +L K P + + + PA +G V+
Sbjct: 102 NRVESIQRRVSATTGRDKLTKHAAPVQPGPKKSPGIHPGHSASSSTPSLPATSGRAVSTG 161
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNS 189
Y + V +G P +++ DTGS TW QC+PC+ C +Q+ P FDP+KS T++ + C
Sbjct: 162 NYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANVSCTD 221
Query: 190 TTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
+ C L + C+ C Y + Y DGS GF+A D +TI G F
Sbjct: 222 SACADL-------DTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKG------F 268
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPD 306
GC + N G +G+MGL RG S+ + Y F YCL + TGY+ FG
Sbjct: 269 RFGCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGTGYLDFGPGS 328
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFP 366
N + TP++T Q+ FY++ +TGI VGG+++P+ S F+ T +DSGT+ITR P
Sbjct: 329 AGNN--ARLTPMLTDKGQT-FYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLP 385
Query: 367 APVYSALRSAFRKRM--KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELD 424
A Y+AL SAF K M + YK G + DTCYD + V +P +++ F GG L++D
Sbjct: 386 ATAYTALSSAFDKVMLARGYKKAPGYS-ILDTCYDFTGLSDVELPTVSLVFQGGACLDVD 444
Query: 425 VRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
V G + S QVCL FA D + ++GN QQ+ Y V YD+ + +GF PG+C
Sbjct: 445 VSGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 265 bits (678), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 167/462 (36%), Positives = 246/462 (53%), Gaps = 26/462 (5%)
Query: 24 ANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTP 83
A+ D ++SV SL C+ + P G +++ + R+GPCS + K
Sbjct: 25 AHAADHRTHKVLSVGSLKSAATCSEPKATPPSTSGGITVPLHHRHGPCSPVPSNK--MPA 82
Query: 84 SLEEILRRDQQRL-HLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKP 141
SLEE L+RDQ R ++K R+ A + +++ A T P G ++ EY I V IG P
Sbjct: 83 SLEERLQRDQLRAAYIK--RKFSGAKGGDVEQSDAATVPTTLGTSLSTLEYVITVGIGSP 140
Query: 142 KQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPP 201
++ +DTGS ++W QCKPC C + D FDPS S T+S C+S C L +
Sbjct: 141 AVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSASSTYSPFSCSSAACVQLSQSQQG 200
Query: 202 NGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD- 260
NG CSS +C Y ++YVDGS TG +++D +T+ G A F GC+ + +G
Sbjct: 201 NG---CSSSQCQYIVSYVDGSSTTGTYSSDTLTL------GSNAIKGFQFGCSQSESGGF 251
Query: 261 QNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTP 317
+ G+MGL S++S+T ++ F YCL GS+G++T G FVK TP
Sbjct: 252 SDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCLPPTPGSSGFLTLGAAS--RSGFVK-TP 308
Query: 318 IVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAF 377
++ + + +Y + L I VGG++L + S F+ S +DSGT+ITR P YSAL SAF
Sbjct: 309 MLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSAGSV-MDSGTVITRLPPTAYSALSSAF 367
Query: 378 RKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV 437
+ MKKY + + DTC+D S +V +P + + F GG + LD G ++ +
Sbjct: 368 KAGMKKYPPAQ-PSGILDTCFDFSGQSSVSIPSVALVFSGGAVVNLDFNGIML--ELDNW 424
Query: 438 CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL FA D + +GNVQQR +EV YDV G +GF G C
Sbjct: 425 CLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGFRAGAC 466
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 265 bits (677), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 166/476 (34%), Positives = 237/476 (49%), Gaps = 45/476 (9%)
Query: 28 DLSHSYI-VSVSSLIPPTVCN-----------RTRTALPQGPGKVSLEVLGRYGPCSKLN 75
D + Y+ VS SS + C R A P+ L + R+GPC+
Sbjct: 21 DAARGYVTVSTSSFAVSSTCADELPGRDWDSLRVSAASPRNGTSAVLRLTHRHGPCAPAG 80
Query: 76 QGKSRNTP-SLEEILRRDQQRLHLKNSRRLQKAIPD----NFKKTKAFTFPAKTGI-VAA 129
+ + +P S + LR DQ+R RR+ A +KA T PA G +
Sbjct: 81 KASALGSPPSFLDTLRADQRRAEYIQ-RRVSGAAAAAPGMQLAGSKAATVPANLGFSIGT 139
Query: 130 DEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH--CSQQRDPFFDPSKSKTFSKIPC 187
+Y + V++G P +L +DTGS ++W QCKPC C QRDP FDP++S ++S +PC
Sbjct: 140 LQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPC 199
Query: 188 NSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY 247
+ +C L + + CS +C Y ++Y DGS TG +++D +T+ N A
Sbjct: 200 AAASCSQLALY-----SNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSN-----ALK 249
Query: 248 PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGK 304
FL GC G G G++GL R S++S+ + +Y F YCL S GYI+ G
Sbjct: 250 GFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQNSVGYISLGG 309
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITR 364
P + TP++T +Y + L GISVGG+ L + AS F +D+GT++TR
Sbjct: 310 PSSTAG--FSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFAS-GAVVDTGTVVTR 366
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKG-IEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLEL 423
P YSALRSAFR M Y + DTCYD + Y TV +P I+I F GG ++L
Sbjct: 367 LPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDL 426
Query: 424 DVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
G L CL FA D + +LGNVQQR +EV +D G +GF P +C
Sbjct: 427 GTSGILTSG-----CLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 475
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 264 bits (675), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 159/444 (35%), Positives = 227/444 (51%), Gaps = 33/444 (7%)
Query: 48 RTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTP-SLEEILRRDQQRLHLKNSRRLQK 106
R A P+ L + R+GPC+ + + +P S + LR DQ+R RR+
Sbjct: 42 RVSAASPRNGTSAVLRLTHRHGPCAPAGKASALGSPPSFLDTLRADQRRAEYIQ-RRVSG 100
Query: 107 AIPD----NFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCK 161
A +KA T PA G + +Y + V++G P +L +DTGS ++W QCK
Sbjct: 101 AAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCK 160
Query: 162 PCIH--CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYV 219
PC C QRDP FDP++S ++S +PC + +C L + + CS +C Y ++Y
Sbjct: 161 PCPSPPCYSQRDPLFDPTRSSSYSAVPCAAASCSQLALY-----SNGCSGGQCGYVVSYG 215
Query: 220 DGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIIS 279
DGS TG +++D +T+ N A FL GC G G G++GL R S++S
Sbjct: 216 DGSTTTGVYSSDTLTLTGSN-----ALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVS 270
Query: 280 KTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGIS 336
+ + +Y F YCL S GYI+ G P + TP++T +Y + L GIS
Sbjct: 271 QASSTYGGVFSYCLPPTQNSVGYISLGGPSSTAG--FSTTPLLTASNDPTYYIVMLAGIS 328
Query: 337 VGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKG-IEDLFD 395
VGG+ L + AS F +D+GT++TR P YSALRSAFR M Y + D
Sbjct: 329 VGGQPLSIDASVFAS-GAVVDTGTVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILD 387
Query: 396 TCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGN 455
TCYD + Y TV +P I+I F GG ++L G L CL FA D + +LGN
Sbjct: 388 TCYDFTRYGTVTLPTISIAFGGGAAMDLGTSGILTSG-----CLAFAPTGGDSQASILGN 442
Query: 456 VQQRGYEVHYDVAGRRLGFGPGNC 479
VQQR +EV +D G +GF P +C
Sbjct: 443 VQQRSFEVRFD--GSTVGFMPASC 464
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 261 bits (668), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 150/428 (35%), Positives = 227/428 (53%), Gaps = 25/428 (5%)
Query: 61 SLEVLGRYGPCSKLNQGKSRNTPSLE-EILRRDQQRLHLKNSRRLQKAIP--DNFKKTKA 117
+L V+ R GPCS L ++R P E+L DQ R+ + + A P D + K
Sbjct: 74 ALNVVHRQGPCSPL---QARGAPPPHAELLNDDQARVDSIHRKIAAAASPVLDQARGKKG 130
Query: 118 FTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDP 176
T PA+ GI + Y + + +G P + ++++ DTGS ++W QC PC C +Q+DP FDP
Sbjct: 131 VTLPAQRGISLGTGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFDP 190
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQ 236
++S T+S +PC S C+ L + + K+C Y++ Y D S G A D +T+
Sbjct: 191 ARSSTYSAVPCASPECQGL------DSRSCSRDKKCRYEVVYGDQSQTDGALARDTLTLT 244
Query: 237 EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSP 293
+ + F+ GC + +TG A G++GL R VS+ S+ Y F YCL S
Sbjct: 245 QSD-----VLPGFVFGCGEQDTGLFGRADGLVGLGREKVSLSSQAASKYGAGFSYCLPSS 299
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
+ GY++ G P N +F T + T + FY++ L G+ V G + + F+
Sbjct: 300 PSAAGYLSLGGPAPANARF---TAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAAG 356
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE-DLFDTCYDLSAYKTVVVPKIT 412
T IDSGT+ITR P VY+ALRSAF + M +Y + + DTCYD + + TV +P +
Sbjct: 357 TVIDSGTVITRLPPRVYAALRSAFARSMGRYGYKRAPALSILDTCYDFTGHTTVRIPSVA 416
Query: 413 IHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRL 472
+ F GG + LD G L V V Q CL FA ++ ++GN QQ+ V YDVA +++
Sbjct: 417 LVFAGGAAVGLDFSGVLYVAKVSQACLAFAPNGDGADAGIIGNTQQKTLAVVYDVARQKI 476
Query: 473 GFGPGNCN 480
GFG C+
Sbjct: 477 GFGANGCS 484
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 258 bits (658), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 151/421 (35%), Positives = 218/421 (51%), Gaps = 22/421 (5%)
Query: 64 VLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAK 123
V+ R+GPCS L PS EIL RDQ R+ + +K + PA
Sbjct: 121 VVHRHGPCSPLL--ARGGEPSHAEILDRDQDRVDSIHRMTAGPWTAGQSSASKGVSLPAH 178
Query: 124 TGIVAADEYYIV-VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTF 182
G+ YIV V +G P++ + ++ DTGS ++W QCKPC +C +Q DP FDPS+S T+
Sbjct: 179 RGLRLGTANYIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTTY 238
Query: 183 SKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNG 242
S +PC + C CSS +C Y++ Y D S G A D +T+ G
Sbjct: 239 SAVPCGAQECL---------DSGTCSSGKCRYEVVYGDMSQTDGNLARDTLTL----GPS 285
Query: 243 YFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGY 299
F+ GC D++TG A G+ GL R VS+ S+ Y F YCL S + + GY
Sbjct: 286 SDQLQGFVFGCGDDDTGLFGRADGLFGLGRDRVSLASQAAARYGAGFSYCLPSSWRAEGY 345
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
++ G ++T +VT + FY++ L GI V G + + + F T IDSG
Sbjct: 346 LSLGS--AAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAPGTVIDSG 403
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
T+ITR P+ YSALRS+F M++YK + + DTCYD + V +P + + F GG
Sbjct: 404 TVITRLPSRAYSALRSSFAGFMRRYKRAPALS-ILDTCYDFTGRTKVQIPSVALLFDGGA 462
Query: 420 DLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L L G L V + Q CL FA D + +LGN+QQ+ + V YD+A +++GFG C
Sbjct: 463 TLNLGFGGVLYVANRSQACLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKIGFGAKGC 522
Query: 480 N 480
+
Sbjct: 523 S 523
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 255 bits (652), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 163/446 (36%), Positives = 235/446 (52%), Gaps = 66/446 (14%)
Query: 41 IPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKN 100
+P + C+ + Q + SLEV+ ++GPCSKL K+ N+PS +IL +D+ R+
Sbjct: 1 MPSSACSPSPKGHDQ---RASLEVVHKHGPCSKLRPHKA-NSPSHTQILAQDESRVASIQ 56
Query: 101 SRRLQK-AIPDNFKKTKAFTFPAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWT 158
SR + A N K +KA T P+K+ + + Y + V +G PK+ ++ + DTGS +TWT
Sbjct: 57 SRLAKNLAGGSNLKASKA-TLPSKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWT 115
Query: 159 QCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIA 217
QC+PC+ +C QQR+ FDPS S ++S + C+S +C+ L CSS C Y I
Sbjct: 116 QCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSPSCEKLES--ATGNSPGCSSSTCLYGIR 173
Query: 218 YVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSI 277
Y DGS GF+A +++++ + F + F GC NN G G +G++GL R P+S+
Sbjct: 174 YGDGSYSIGFFAREKLSLTSTD---VFNNFQF--GCGQNNRGLFGGTAGLLGLARNPLSL 228
Query: 278 ISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTG 334
+S+T Y F YCL S STGY++FG D + K VK+TP
Sbjct: 229 VSQTAQKYGKVFSYCLPSSSSSTGYLSFGSGDG-DSKAVKFTP----------------- 270
Query: 335 ISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF 394
R P VYS+++ FR+ M Y KG+ +
Sbjct: 271 -----------------------------RLPPTVYSSVQKVFRELMSDYPRVKGVS-IL 300
Query: 395 DTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLG 454
DTCYDLS YKTV VPKI ++F GG +++L G + V V QVCL FA D ++G
Sbjct: 301 DTCYDLSKYKTVKVPKIILYFSGGAEMDLAPEGIIYVLKVSQVCLAFAGNSDDDEVAIIG 360
Query: 455 NVQQRGYEVHYDVAGRRLGFGPGNCN 480
NVQQ+ V YD A R+GF P CN
Sbjct: 361 NVQQKTIHVVYDDAEGRVGFAPSGCN 386
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 254 bits (650), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 159/457 (34%), Positives = 233/457 (50%), Gaps = 36/457 (7%)
Query: 35 VSVSSLIPPTVCNRTRTALPQGPGKVS-LEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQ 93
VS +S P + C+ + PQ + L + R+GPC+ L + S PS+ + LR DQ
Sbjct: 38 VSAASFAPSSTCSASDPVAPQQNDTFTVLRLTHRHGPCAPL-RASSLAAPSVADTLRADQ 96
Query: 94 QRLHLKNSRRLQKAIPDNFK-KTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDT 151
+R R + P + K A T PA G + Y + ++G P +L +DT
Sbjct: 97 RRAEHILRRVSGRGAPQLWDYKAAAATVPANWGYDIGTSNYVVTASLGTPGMAQTLEVDT 156
Query: 152 GSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS 209
GS ++W QCKPC C +Q+DP FDP++S +++ +PC + C L + CS+
Sbjct: 157 GSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYAAVPCGRSACAGLGIY-----ASACSA 211
Query: 210 KECPYDIAYVDGSGETGFWATDRMTIQE---VNGNGYFARYPFLLGCTDNNTGDQ-NGAS 265
+C Y ++Y DGS TG +++D +T+ V G FL GC +G G
Sbjct: 212 AQCGYVVSYGDGSNTTGVYSSDTLTLAANATVQG--------FLFGCGHAQSGGLFTGID 263
Query: 266 GIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTP 322
G++G R S++ +T +Y F YCL + +TGY+T G P V F T ++ +P
Sbjct: 264 GLLGFGREQPSLVQQTAGAYGGVFSYCLPTKSSTTGYLTLGGPSGVAPGF-STTQLLPSP 322
Query: 323 EQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMK 382
+Y + LTGISVGG+ L + AS F T +D+GT+ITR P Y+ALRSAFR M
Sbjct: 323 NAPTYYVVMLTGISVGGQPLSVPASAFAA-GTVVDTGTVITRLPPAAYAALRSAFRSGMA 381
Query: 383 KYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFA 442
Y I + DTCY + Y TV + + + F G + L G + CL FA
Sbjct: 382 SYPSAPPI-GILDTCYSFAGYGTVNLTSVALTFSSGATMTLGADGIMSFG-----CLAFA 435
Query: 443 LLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
SD + +LGNVQQR +EV D G +GF P +C
Sbjct: 436 SSGSDGSMAILGNVQQRSFEVRID--GSSVGFRPSSC 470
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 252 bits (643), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 164/475 (34%), Positives = 234/475 (49%), Gaps = 39/475 (8%)
Query: 21 GAYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSR 80
G A ND + ++ SVSSL+P + C TA +L V+ R+GPCS + Q + R
Sbjct: 35 GPAARTND-PNWHVFSVSSLLPSSAC----TASKAASNSSALGVVHRHGPCSPV-QARPR 88
Query: 81 N---TPSLEEILRRDQQR---LHLK--NSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADE 131
+ EIL RDQ R +H K + + + + PA+ GI +
Sbjct: 89 GGGGAVTHAEILERDQARVDSIHRKVAGAGGAPSVVDPARASEQGVSLPAQRGISLGTGN 148
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + V +G P + +++ DTGS ++W QCKPC C +Q+DP FDPS S T++ + C +
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPE 208
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C+ L + S C Y++ Y D S G D +T+ + P F+
Sbjct: 209 CQEL------DASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD------TLPGFV 256
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDT 307
GC D N G G+ GL R VS+ S+ SY F YCL S GY++ G
Sbjct: 257 FGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSSGRGYLSLGGAPP 316
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYFTKLSTEIDSGTIITRF 365
N +F T FY+I L GI VGG R+P A + IDSGT+ITR
Sbjct: 317 ANAQFTALADGATP----SFYYIDLVGIKVGGRAIRIPATAFAAAGGTV-IDSGTVITRL 371
Query: 366 PAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDV 425
P Y+ LR+AF + M +YK + + DTCYD + ++T +P + + F GG + LD
Sbjct: 372 PPRAYAPLRAAFARSMAQYKKAPALS-ILDTCYDFTGHRTAQIPTVELAFAGGATVSLDF 430
Query: 426 RGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G L V V Q CL FA D + +LGN QQ+ + V YDVA +R+GFG C+
Sbjct: 431 TGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGCS 485
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 252 bits (643), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 166/476 (34%), Positives = 235/476 (49%), Gaps = 41/476 (8%)
Query: 21 GAYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSR 80
G A ND + ++ SVSSL+P + C TA +L V+ R+GPCS + Q + R
Sbjct: 35 GPAARTND-PNWHVFSVSSLLPSSAC----TASKAASNSSALGVVHRHGPCSPV-QARRR 88
Query: 81 N---TPSLEEILRRDQQR---LHLK--NSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADE 131
+ EIL RDQ R +H K + + + + PA+ GI +
Sbjct: 89 GGGGAVTHAEILERDQARVDSIHRKVAGAGGAPSVVDPARASEQGVSLPAQRGISLGTGN 148
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + V +G P + +++ DTGS ++W QCKPC C +Q+DP FDPS S T++ + C +
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPE 208
Query: 192 CKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-F 249
C+ L CSS C Y++ Y D S G D +T+ + P F
Sbjct: 209 CQEL-------DASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD------TLPGF 255
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPD 306
+ GC D N G G+ GL R VS+ S+ SY F YCL S GY++ G
Sbjct: 256 VFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSSGRGYLSLGGAP 315
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYFTKLSTEIDSGTIITR 364
N +F T FY+I L GI VGG R+P A + IDSGT+ITR
Sbjct: 316 PANAQFTALADGATP----SFYYIDLVGIKVGGRAIRIPATAFAAAGGTV-IDSGTVITR 370
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELD 424
P Y+ LR+AF + M +YK + + DTCYD + ++T +P + + F GG + LD
Sbjct: 371 LPPRAYAPLRAAFARSMAQYKKAPALS-ILDTCYDFTGHRTAQIPTVELAFAGGATVSLD 429
Query: 425 VRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G L V V Q CL FA D + +LGN QQ+ + V YDVA +R+GFG C+
Sbjct: 430 FTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGCS 485
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 251 bits (642), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 165/471 (35%), Positives = 250/471 (53%), Gaps = 40/471 (8%)
Query: 22 AYANDNDLSHSY-IVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSR 80
A+A D+ SY ++S+ SL +VC+ ++ A+ G ++ + R+GPCS L ++
Sbjct: 23 AHAGDHG---SYKVLSLGSLRTKSVCSESK-AVKSSTGAATVPLHHRHGPCSPL---PTK 75
Query: 81 NTPSLEEILRRDQ------QRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYY 133
P+LEE L RDQ QR + + +++ A T P G + EY
Sbjct: 76 KMPTLEERLHRDQLRAAYIQRKFSGGGVNGSRGGAGDVQQSHA-TVPTTLGTSLDTLEYL 134
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
I V +G P + ++L+DTGS ++W QCKPC C Q DP FDPS S T+S C+S C
Sbjct: 135 ITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCSSAACA 194
Query: 194 ILLEWFPPNGQD--KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
L GQ+ CSS +C Y + Y DGS TG +++D + + G A F
Sbjct: 195 QL-------GQEGNGCSSSQCQYTVTYGDGSSTTGTYSSDTLAL------GSNAVRKFQF 241
Query: 252 GCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTV 308
GC++ +G + G+MGL G S++S+T ++ F YCL + S+G++T G
Sbjct: 242 GCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSYCLPATSSSSGFLTLGAG--- 298
Query: 309 NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAP 368
FVK TP++ + + FY + + I VGG +L + S F+ T +DSGT++TR P
Sbjct: 299 TSGFVK-TPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFSA-GTIMDSGTVLTRLPPT 356
Query: 369 VYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGT 428
YSAL SAF+ MK+Y + DTC+D S +V +P + + F GG +++ G
Sbjct: 357 AYSALSSAFKAGMKQYPSAP-PSGILDTCFDFSGQSSVSIPTVALVFSGGAVVDIASDGI 415
Query: 429 LVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++ S +CL FA D + ++GNVQQR +EV YDV G +GF G C
Sbjct: 416 MLQTSNSILCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 466
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 251 bits (641), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 166/464 (35%), Positives = 233/464 (50%), Gaps = 39/464 (8%)
Query: 28 DLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEE 87
D +V+ SSL P VC+ + P G +L + R+GPCS + S+ PS EE
Sbjct: 28 DAQRYIVVATSSLKPSEVCSGHKVT-PSKNGS-TLALSHRHGPCSPV---ISKEKPSHEE 82
Query: 88 ILRRDQQR---LHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQ 143
LRRDQ R + K S R + + A T P +G + EY I V IG P
Sbjct: 83 TLRRDQLRAAYIQAKVSSRYNNVAKE--LQQSAVTIPTSSGYSLGTTEYVITVTIGTPAV 140
Query: 144 YVSLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPP 201
+ +DTGS ++W QC PC CS Q+D FDP+ S T+S C S C L +
Sbjct: 141 TQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMSATYSAFSCGSAQCAQLGD--EG 198
Query: 202 NGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ 261
NG C +C Y + Y DGS G + +D +++ + A F GC+ G
Sbjct: 199 NG---CLKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSD-----AVKSFQFGCSHRAAGFV 250
Query: 262 NGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTG-YITFGKPDTVNKKFVKYTP 317
G+MGL S++S+T +Y F YCL P S G ++T G + +TP
Sbjct: 251 GELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSGGGFLTLGAAGGASSSRYSHTP 310
Query: 318 IV--TTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRS 375
+V + P FY + L GI+V G L + AS F+ S +DSGT+IT+ P Y ALR+
Sbjct: 311 MVRFSVPT---FYGVFLQGITVAGTMLNVPASVFSGASV-VDSGTVITQLPPTAYQALRT 366
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVR 435
AF+K MK Y + L DTC+D S + T+ VP +T+ F G ++LD+ G L
Sbjct: 367 AFKKEMKAYPSAAPVGSL-DTCFDFSGFNTITVPTVTLTFSRGAAMDLDISGILYAG--- 422
Query: 436 QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL F D ++ +LGNVQQR +E+ +DV GR +GF G C
Sbjct: 423 --CLAFTATAHDGDTGILGNVQQRTFEMLFDVGGRTIGFRSGAC 464
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 250 bits (639), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 154/450 (34%), Positives = 230/450 (51%), Gaps = 33/450 (7%)
Query: 44 TVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRR 103
TVC+ ++ L VS+ ++ RYGPC+ +Q + TPS+ E LRR + R + S+
Sbjct: 39 TVCSASKVNLEPSSATVSMSLVHRYGPCAP-SQYSNVPTPSISETLRRSRARTNYIMSQA 97
Query: 104 LQK------AIPDNFKKTKAFTFPAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGIT 156
+ + PD+ A T P + G V + EY + + G P LL+DTGS ++
Sbjct: 98 SKSMGMGMASTPDD--DDAAVTIPTRLGGFVDSLEYVVTLGFGTPSVPQVLLMDTGSDVS 155
Query: 157 WTQCKPC--IHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS--KEC 212
W QC PC C Q+DP FDPSKS T++ I CN+ C+ L + + + C+S +C
Sbjct: 156 WVQCTPCNSTKCYPQKDPLFDPSKSSTYAPIACNTDACRKLGDHY----HNGCTSGGTQC 211
Query: 213 PYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDR 272
Y + Y DGS G ++ + +T+ F GC + G + G++GL
Sbjct: 212 GYSVEYADGSHSRGVYSNETLTLAPG-----ITVEDFHFGCGRDQRGPSDKYDGLLGLGG 266
Query: 273 GPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYH 329
PVS++ +T+ Y F YCL + G++ G P + NK +TP+ P + FY
Sbjct: 267 APVSLVVQTSSVYGGAFSYCLPALNSEAGFLVLGSPPSGNKSAFVFTPMRHLPGYATFYM 326
Query: 330 ITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKG 389
+T+TGISVGG+ L + S F + IDSGT+ T P Y+AL +A RK +K Y +
Sbjct: 327 VTMTGISVGGKPLHIPQSAF-RGGMIIDSGTVDTELPETAYNALEAALRKALKAYPLVP- 384
Query: 390 IEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPN 449
D FDTCY+ + Y + VP++ F GG ++LDV ++V CL F D
Sbjct: 385 -SDDFDTCYNFTGYSNITVPRVAFTFSGGATIDLDVPNGILVND----CLAFQESGPDDG 439
Query: 450 SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++GNV QR EV YD +GF G C
Sbjct: 440 LGIIGNVNQRTLEVLYDAGRGNVGFRAGAC 469
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 149/428 (34%), Positives = 222/428 (51%), Gaps = 30/428 (7%)
Query: 62 LEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIP---DNFKKTKAF 118
L + ++GPC+ ++ S TPS+ + LR DQ+R R + P D+ +
Sbjct: 67 LRLTHKHGPCAP-SRASSLATPSVADTLRADQRRAEYILRRVSGRGTPQLWDSKAEAATA 125
Query: 119 TFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH--CSQQRDPFFD 175
T PA G + Y + V++G P +L +DTGS ++W QC PC C Q+DP FD
Sbjct: 126 TVPANWGFNIGTLNYVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDPLFD 185
Query: 176 PSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTI 235
P++S +++ +PC C L + CS+ +C Y ++Y DGS TG +++D +T+
Sbjct: 186 PAQSSSYAAVPCGGPVCGGLGIY-----ASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTL 240
Query: 236 QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS 292
+ A F GC +G G G++GL R S++ +T +Y F YCL +
Sbjct: 241 SPND-----AVRGFFFGCGHAQSG-FTGNDGLLGLGREEASLVEQTAGTYGGVFSYCLPT 294
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL 352
+TGY+T G P T ++++P + +Y + LTGISVGG++L + +S F
Sbjct: 295 RPSTTGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFAG- 353
Query: 353 STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKG-IEDLFDTCYDLSAYKTVVVPKI 411
T +D+GT+ITR P Y+ALRSAFR M Y + DTCY+ S Y TV +P +
Sbjct: 354 GTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGTVTLPNV 413
Query: 412 TIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRR 471
+ F GG + L G L CL FA SD +LGNVQQR +EV D G
Sbjct: 414 ALTFSGGATVTLGADGILSFG-----CLAFAPSGSDGGMAILGNVQQRSFEVRID--GTS 466
Query: 472 LGFGPGNC 479
+GF P +C
Sbjct: 467 VGFKPSSC 474
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 145/367 (39%), Positives = 189/367 (51%), Gaps = 23/367 (6%)
Query: 118 FTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFD 175
+ PA+ G+ + Y I V G PK+ +++ DTGS + W QCKPC+ C Q++P FD
Sbjct: 1 ISIPARIGLYIGTANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFD 60
Query: 176 PSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTI 235
P+ S T+ I C S C L CS C Y + Y DGS GF AT+ T+
Sbjct: 61 PTLSSTYRNISCTSAACTGL-------SSRGCSGSTCVYGVTYGDGSSTVGFLATETFTL 113
Query: 236 QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS 292
N F+ GC NN G GA+G++GL R P S+ S+ S F YCL S
Sbjct: 114 AAGN-----VFNNFIFGCGQNNQGLFTGAAGLIGLGRSPYSLNSQLATSLGNIFSYCLPS 168
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL 352
+TGY+ G P + YT ++T Y I L GISVGG RL L ++ F +
Sbjct: 169 TSSATGYLNIGNP----LRTPGYTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSV 224
Query: 353 STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKIT 412
T IDSGT+ITR P Y ALR+AFR M +Y + DTCYD S TV P I
Sbjct: 225 GTIIDSGTVITRLPPTAYGALRTAFRAAMTQYTRAAA-ASILDTCYDFSRTTTVTFPTIK 283
Query: 413 IHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRL 472
+H+ G+D+ + G V S QVCL FA ++GNVQQR EV YD A +R+
Sbjct: 284 LHYT-GLDVTIPGAGVFYVISSSQVCLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRI 342
Query: 473 GFGPGNC 479
GF G C
Sbjct: 343 GFAAGAC 349
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 170/430 (39%), Positives = 235/430 (54%), Gaps = 36/430 (8%)
Query: 59 KVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAF 118
K SL V+ +G CS L+ + +EI+RRDQ R+ S+ L K + + K+
Sbjct: 62 KSSLRVVHMHGACSHLSSDARVDH---DEIIRRDQARVESIYSK-LSKNSANEVSEAKST 117
Query: 119 TFPAKTGIVAADEYYIV-VAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDP 176
PAK+GI YIV + IG PK +SL+ DTGS +TWTQC+PC+ C Q++P F+P
Sbjct: 118 ELPAKSGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNP 177
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTI- 235
S S T+ + C+S C+ + CS+ C Y I Y D S GF A ++ T+
Sbjct: 178 SSSSTYQNVSCSSPMCE---------DAESCSASNCVYSIGYGDKSFTQGFLAKEKFTLT 228
Query: 236 -QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLH 291
+V + YF GC +NN G +G +G++GL G +S+ ++T +Y F YCL
Sbjct: 229 NSDVLEDVYF-------GCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLP 281
Query: 292 S-PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEF-YHITLTGISVGGERLPLKASYF 349
S STG++TFG + VK+TPI + P S F Y I + GISVG + L + + F
Sbjct: 282 SFTSNSTGHLTFGSAGI--SESVKFTPISSFP--SAFNYGIDIIGISVGDKELAITPNSF 337
Query: 350 TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVP 409
+ IDSGT+ TR P VY+ LRS F+++M YK G LFDTCYD + TV P
Sbjct: 338 STEGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGY-GLFDTCYDFTGLDTVTYP 396
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
I F GG +ELD G + + QVCL FA +D + GNVQQ +V YDVAG
Sbjct: 397 TIAFSFAGGTVVELDGSGISLPIKISQVCLAFA--GNDDLPAIFGNVQQTTLDVVYDVAG 454
Query: 470 RRLGFGPGNC 479
R+GF P C
Sbjct: 455 GRVGFAPNGC 464
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 159/480 (33%), Positives = 236/480 (49%), Gaps = 51/480 (10%)
Query: 35 VSVSSLIPPTV--CNRTRTALPQGPGK-VSLEVLGRYGPCSKLNQGKSRNTPSLEEILRR 91
+ V SL+P C + QG + V+ ++GPCS L ++ PS EIL
Sbjct: 36 LDVESLLPSAAAPCPTPQAEQKQGAAPPTRMPVVHQHGPCSPLADNRNGKAPSHAEILAA 95
Query: 92 DQQR---LHLK------NSRRLQKAIPDNFK---------------KTKAFTFPAKTGIV 127
DQ+R +H + +RR ++ P + T PA G+
Sbjct: 96 DQRRAEYIHRRVAETTGRARRRKQGAPVELRPGTPPSSIVVPSSSSATSTTDLPASYGVA 155
Query: 128 AADEYYIV-VAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKI 185
Y+V V +G P + +++ DTGS TW QC+PC+ +C +Q++P FDP+KS T++ I
Sbjct: 156 LGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANI 215
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C+S+ C L CS C Y I Y DGS GF+A D +T+ Y
Sbjct: 216 SCSSSYCSDLYV-------SGCSGGHCLYGIQYGDGSYTIGFYAQDTLTL------AYDT 262
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITF 302
F GC + N G A+G++GL RG S+ + Y F YCL + TG++
Sbjct: 263 IKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAGTGFLDL 322
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTII 362
G + TP++ FY++ +TGI VGG LP+ S F+ T +DSGT+I
Sbjct: 323 GP--GAPAANARLTPMLVD-RGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVI 379
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIE-DLFDTCYDLSAYK--TVVVPKITIHFLGGV 419
TR P Y+ LRSAF K M+ + DTCYDL+ +K ++ +P +++ F GG
Sbjct: 380 TRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGA 439
Query: 420 DLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L++D G L V V Q CL FA D + ++GN QQ+ + V YD+ + +GF PG C
Sbjct: 440 CLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 499
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 248 bits (632), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 161/466 (34%), Positives = 244/466 (52%), Gaps = 37/466 (7%)
Query: 24 ANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTP 83
A+ D ++S+ SL +VC+ ++ A+ G ++ + R+GPCS L ++ P
Sbjct: 22 AHAGDHGSYKVLSIGSLRTKSVCSESK-AVRSSSGATTVPLHHRHGPCSPL---PTKKMP 77
Query: 84 SLEEILRRDQQRL-HLKNSRRLQKAIPDNFK-----KTKAFTFPAKTGI-VAADEYYIVV 136
SLE+ L RDQ R ++K R+ + + + + T P G + EY I V
Sbjct: 78 SLEDRLHRDQLRAAYIK--RKFSGDVKKDGQGAGGVEQSHVTVPTTLGTSLNTLEYLITV 135
Query: 137 AIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILL 196
+G P + ++L+D+GS ++W QCKPC+ C Q DP FDPS S T+S C+S C L
Sbjct: 136 RLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVDPLFDPSLSSTYSPFSCSSAACAQLG 195
Query: 197 EWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDN 256
+ +G SS +C Y + Y DGS TG +++D + + G+ + + F GC+
Sbjct: 196 Q----DGNGCSSSSQCQYIVRYADGSSTTGTYSSDTLAL----GSNTISNFQF--GCSHV 245
Query: 257 NTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFV 313
+G + G+MGL G S+ S+T ++ F YCL S+G++T G FV
Sbjct: 246 ESGFNDLTDGLMGLGGGAPSLASQTAGTFGTAFSYCLPPTPSSSGFLTLGAG---TSGFV 302
Query: 314 KYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSAL 373
K TP++ + FY + L I VGG +L + S F+ +DSGTIITR P YSAL
Sbjct: 303 K-TPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFSA-GMVMDSGTIITRLPRTAYSAL 360
Query: 374 RSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVES 433
SAF+ MK+Y+ + DTC+D S +V +P + + F GG + LD G ++
Sbjct: 361 SSAFKAGMKQYRPAP-PRSIMDTCFDFSGQSSVRLPSVALVFSGGAVVNLDANGIILGN- 418
Query: 434 VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL FA D + ++GNVQQR +EV YDV G +GF G C
Sbjct: 419 ----CLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGGAVGFKAGAC 460
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 152/450 (33%), Positives = 226/450 (50%), Gaps = 48/450 (10%)
Query: 62 LEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQR---LHLK------NSRRLQKAIPDNF 112
+ V+ ++GPCS L ++ PS EIL DQ+R +H + +RR ++ P
Sbjct: 1 MPVVHQHGPCSPLADNRNGKAPSHAEILAADQRRAEYIHRRVAETTGRARRRKQGAPVEL 60
Query: 113 K---------------KTKAFTFPAKTGIVAADEYYIV-VAIGKPKQYVSLLLDTGSGIT 156
+ T PA G+ Y+V V +G P + +++ DTGS T
Sbjct: 61 RPGTPPSSIVVPSSSSATSTTDLPASYGVALGTGNYVVPVRLGTPAERFTVVFDTGSDTT 120
Query: 157 WTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYD 215
W QC+PC+ +C +Q++P FDP+KS T++ I C+S+ C L CS C Y
Sbjct: 121 WVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSSYCSDLYV-------SGCSGGHCLYG 173
Query: 216 IAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPV 275
I Y DGS GF+A D +T+ Y F GC + N G A+G++GL RG
Sbjct: 174 IQYGDGSYTIGFYAQDTLTLA------YDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKT 227
Query: 276 SIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITL 332
S+ + Y F YCL + TG++ G + TP++ FY++ +
Sbjct: 228 SLPVQAYDKYGGVFAYCLPATSAGTGFLDLGPGAPAAN--ARLTPMLVD-RGPTFYYVGM 284
Query: 333 TGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE- 391
TGI VGG LP+ S F+ T +DSGT+ITR P Y+ LRSAF K M+
Sbjct: 285 TGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPAF 344
Query: 392 DLFDTCYDLSAYK--TVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPN 449
+ DTCYDL+ +K ++ +P +++ F GG L++D G L V V Q CL FA D +
Sbjct: 345 SILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLAFAPNADDTD 404
Query: 450 SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++GN QQ+ + V YD+ + +GF PG C
Sbjct: 405 VAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 434
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 169/430 (39%), Positives = 234/430 (54%), Gaps = 36/430 (8%)
Query: 59 KVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAF 118
K SL V+ +G CS L+ + +EI+RRDQ R+ S+ L K + + K+
Sbjct: 62 KSSLRVVHMHGACSHLSSDARVDH---DEIIRRDQARVESIYSK-LSKNSANEVSEAKST 117
Query: 119 TFPAKTGIVAADEYYIV-VAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDP 176
PAK+GI YIV + IG PK +SL+ DTGS +TWTQC+PC+ C Q++P F+P
Sbjct: 118 ELPAKSGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNP 177
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTI- 235
S S T+ + C+S C+ + CS+ C Y I Y D S GF A ++ T+
Sbjct: 178 SSSSTYQNVSCSSPMCE---------DAESCSASNCVYSIVYGDKSFTQGFLAKEKFTLT 228
Query: 236 -QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLH 291
+V + YF GC +NN G +G +G++GL G +S+ ++T +Y F YCL
Sbjct: 229 NSDVLEDVYF-------GCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLP 281
Query: 292 S-PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEF-YHITLTGISVGGERLPLKASYF 349
S STG++TFG + VK+TPI + P S F Y I + GISVG + L + + F
Sbjct: 282 SFTSNSTGHLTFGSAGI--SESVKFTPISSFP--SAFNYGIDIIGISVGDKELAITPNSF 337
Query: 350 TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVP 409
+ IDSGT+ TR P VY+ LRS F+++M YK G LFDTCYD + TV P
Sbjct: 338 STEGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGY-GLFDTCYDFTGLDTVTYP 396
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
I F G +ELD G + + QVCL FA +D + GNVQQ +V YDVAG
Sbjct: 397 TIAFSFAGSTVVELDGSGISLPIKISQVCLAFA--GNDDLPAIFGNVQQTTLDVVYDVAG 454
Query: 470 RRLGFGPGNC 479
R+GF P C
Sbjct: 455 GRVGFAPNGC 464
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 165/465 (35%), Positives = 235/465 (50%), Gaps = 28/465 (6%)
Query: 26 DNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKV--SLEVLGRYGPCSKLNQGKSRNTP 83
D ++ ++VSV+SL+P TVC T+ GP SL V+ R+GPCS L + + P
Sbjct: 40 DGSETNWHVVSVNSLLPNTVCTSTK-----GPAAAPSSLTVVHRHGPCSPL-RSRGSGAP 93
Query: 84 SLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQ 143
S EILRRDQ R+ ++ R + N K ++ Y + +G P
Sbjct: 94 SHTEILRRDQDRV---DAIRRKVTASSNKPKGGVSLLANWGKSLSTTNYVASLRLGTPAT 150
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
+ + LDTGS +W QCKPC C +QRDP FDP+ S T+S +PC + C+ L
Sbjct: 151 ELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTYSAVPCGARECQELASSSSSRN 210
Query: 204 QDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTGDQN 262
++K CPY+++Y D S G A D +T+ P F+ GC +N G
Sbjct: 211 CSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPSPADTVPGFVFGCGHSNAGTFG 270
Query: 263 GASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIV 319
G++GL G S+ S+ Y F YCL S + GY++FG + ++T +V
Sbjct: 271 EVDGLLGLGLGKASLPSQVAARYGAAFSYCLPSSPSAAGYLSFGG--AAARANAQFTEMV 328
Query: 320 TTPEQSEFYHITLTGISVGGERLPLKASYF-TKLSTEIDSGTIITRFPAPVYSALRSAFR 378
T + + +Y + LTGI V G + + AS F T T IDSGT +R P Y+ALRS+FR
Sbjct: 329 TGQDPTSYY-LNLTGIVVAGRAIKVPASAFATAAGTIIDSGTAFSRLPPSAYAALRSSFR 387
Query: 379 KRMKKYKMGKGIED-LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVV-ESVRQ 436
M +Y+ + +FDTCYD + ++TV +P + + F G + L G L V Q
Sbjct: 388 SAMGRYRYKRAPSSPIFDTCYDFTGHETVRIPAVELVFADGATVHLHPSGVLYTWNDVAQ 447
Query: 437 VCLGFALLPSDPNSIL--LGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL F PN L LGN QQR V YDV +R+GFG C
Sbjct: 448 TCLAFV-----PNHDLGILGNTQQRTLAVIYDVGSQRIGFGRKGC 487
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 149/430 (34%), Positives = 216/430 (50%), Gaps = 24/430 (5%)
Query: 59 KVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAF 118
+ + ++ R+GPCS L PS EEIL DQ R RR+ + K K
Sbjct: 86 RTRMPIVHRHGPCSPLADAHDGKLPSHEEILAADQNRAK-SIQRRVSTTTTVSRGKPKRN 144
Query: 119 --TFPAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH-CSQQRDPFF 174
+ PA +G + Y + + +G P +++ DTGS TW QC+PC+ C +Q++ F
Sbjct: 145 RPSLPASSGSALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLF 204
Query: 175 DPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMT 234
DP++S T++ I C + C L CS C Y + Y DGS GF+A D +T
Sbjct: 205 DPARSSTYANISCAAPACSDLY-------IKGCSGGHCLYGVQYGDGSYSIGFFAMDTLT 257
Query: 235 IQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLH 291
+ Y A F GC + N G A+G++GL RG S+ + Y F +C
Sbjct: 258 LSS-----YDAIKGFRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFP 312
Query: 292 SPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTK 351
+ TGY+ FG P ++ K T + FY++ LTGI VGG+ L + S FT
Sbjct: 313 ARSSGTGYLDFG-PGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTT 371
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAFRKRM--KKYKMGKGIEDLFDTCYDLSAYKTVVVP 409
T +DSGT+ITR P YS+LRSAF M + YK + L DTCYD + V +P
Sbjct: 372 SGTIVDSGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALS-LLDTCYDFTGMSEVAIP 430
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
+++ F GG L++ G + SV Q CLGFA D + ++GN Q + + V YD+
Sbjct: 431 TVSLLFQGGASLDVHASGIIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYDIGK 490
Query: 470 RRLGFGPGNC 479
+ +GF PG C
Sbjct: 491 KVVGFCPGAC 500
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 170/496 (34%), Positives = 251/496 (50%), Gaps = 48/496 (9%)
Query: 7 AFLLFIWLLRSSNNGAYANDNDLSHSYIV-SVSSLIPPTVCNRTRTALPQGPGKVSLEVL 65
AF L + +L S N+ H ++V SS +P C+ P + S+ +
Sbjct: 2 AFPLLLCVLVCSYCSVALGGNE--HGFVVVPTSSFVPAAACSTPIGVGNPDPTRASVPLA 59
Query: 66 GRYGPCS-KLNQGKSRNTPSLEEILRRDQQR----LHLKNSRRLQKAIPDNFKKTKAFTF 120
R+GPC+ K + + PS E LR D+ R L + RR+ + +
Sbjct: 60 HRHGPCAPKGSSATDKKKPSFAERLRSDRARADHILRKASGRRM-------MSEGGGASI 112
Query: 121 PAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC--IHCSQQRDPFFDPS 177
P G V + EY + + IG P ++L+DTGS ++W QCKPC C Q+DP FDPS
Sbjct: 113 PTYLGGFVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDPS 172
Query: 178 KSKTFSKIPCNSTTCKILLEWFPPNGQDK-CSSK------ECPYDIAYVDGSGETGFWAT 230
KS TF+ IPC S CK L P +G D C++ +C Y I Y +G+ G ++T
Sbjct: 173 KSSTFATIPCASDACKQL----PVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYST 228
Query: 231 DRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FF 287
+ + + G+ + F GC + G + G++GL P S++S+T Y F
Sbjct: 229 ETLAL----GSSAVVKS-FRFGCGSDQHGPYDKFDGLLGLGGAPESLVSQTASVYGGAFS 283
Query: 288 YCLHSPYGSTGYITFGKPDTVNKK---FVKYTPI-VTTPEQSEFYHITLTGISVGGERLP 343
YCL G++T G P++ N FV +TP+ +P+ + FY +TLTGISVGG+ L
Sbjct: 284 YCLPPLNSGAGFLTLGAPNSTNNSNSGFV-FTPMHAFSPKIATFYVVTLTGISVGGKALD 342
Query: 344 LKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAY 403
+ + F K +DSGT+IT P Y ALR+AFR M +Y + + DTCY+ + +
Sbjct: 343 IPPAVFAK-GNIVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCYNFTGH 401
Query: 404 KTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEV 463
TV VPK+ + F+GG ++LDV ++VE CL FA D + ++GNV R EV
Sbjct: 402 GTVTVPKVALTFVGGATVDLDVPSGVLVED----CLAFADA-GDGSFGIIGNVNTRTIEV 456
Query: 464 HYDVAGRRLGFGPGNC 479
YD LGF G C
Sbjct: 457 LYDSGKGHLGFRAGAC 472
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 244 bits (624), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 158/471 (33%), Positives = 240/471 (50%), Gaps = 35/471 (7%)
Query: 14 LLRSSNNGAYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSK 73
+ R+ ++G+Y ++S+ S +VC++++ G ++ + R+GPCS
Sbjct: 21 IARAGDDGSYK---------VLSMGSPRTDSVCSQSKAVPSSSAGAATVPLHHRHGPCSP 71
Query: 74 LNQGKSRNTPSLEEILRRDQQRL-HLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADE 131
L ++ P+LEE L RDQ R +++ + +++ A T P G + E
Sbjct: 72 L---PTKKMPTLEETLHRDQLRAAYIQRKFSGGGGAGGDVQRSDA-TVPTALGTSLNTLE 127
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P ++L+DTGS ++W QCKPC C Q DP FDPS S T+S C S
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAA 187
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
C L + G SS +C Y + Y DGS TG +++D + + G A F
Sbjct: 188 CAQLGQ----EGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL------GSSAVKSFQF 237
Query: 252 GCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTV 308
GC++ +G + G+MGL G S++S+T + F YCL S+G++T G
Sbjct: 238 GCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 297
Query: 309 NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAP 368
TP++ + + FY + L I VGG +L + AS F+ T +DSGT+ITR P
Sbjct: 298 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSA-GTVMDSGTVITRLPPT 356
Query: 369 VYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGT 428
YSAL SAF+ MK+Y + + DTC+D S +V +P + + F GG + LD G
Sbjct: 357 AYSALSSAFKAGMKQYPPAQ-PSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGI 415
Query: 429 LVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++ CL FA D + ++GNVQQR +EV YDV +GF G C
Sbjct: 416 ILSN-----CLAFAANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 158/471 (33%), Positives = 240/471 (50%), Gaps = 35/471 (7%)
Query: 14 LLRSSNNGAYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSK 73
+ R+ ++G+Y ++S+ S +VC++++ G ++ + R+GPCS
Sbjct: 21 IARAGDDGSYK---------VLSMGSPRTDSVCSQSKAVPSSSAGAATVPLHHRHGPCSP 71
Query: 74 LNQGKSRNTPSLEEILRRDQQRL-HLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADE 131
L ++ P+LEE L RDQ R +++ + +++ A T P G + E
Sbjct: 72 L---PTKKMPTLEETLHRDQLRAAYIQRKFSGGGGAGGDVQRSDA-TVPTALGTSLNTLE 127
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P ++L+DTGS ++W QCKPC C Q DP FDPS S T+S C S
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAD 187
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
C L + G SS +C Y + Y DGS TG +++D + + G A F
Sbjct: 188 CAQLGQ----EGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL------GSSAVRSFQF 237
Query: 252 GCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTV 308
GC++ +G + G+MGL G S++S+T + F YCL S+G++T G
Sbjct: 238 GCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 297
Query: 309 NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAP 368
TP++ + + FY + L I VGG +L + AS F+ T +DSGT+ITR P
Sbjct: 298 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSA-GTVMDSGTVITRLPPT 356
Query: 369 VYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGT 428
YSAL SAF+ MK+Y + + DTC+D S +V +P + + F GG + LD G
Sbjct: 357 AYSALSSAFKAGMKQYPPAQ-PSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGI 415
Query: 429 LVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++ CL FA D + ++GNVQQR +EV YDV +GF G C
Sbjct: 416 ILSN-----CLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 158/471 (33%), Positives = 240/471 (50%), Gaps = 35/471 (7%)
Query: 14 LLRSSNNGAYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSK 73
+ R+ ++G+Y ++S+ S +VC++++ G ++ + R+GPCS
Sbjct: 91 IARAGDDGSYK---------VLSMGSPRTDSVCSQSKAVPSSSAGAATVPLHHRHGPCSP 141
Query: 74 LNQGKSRNTPSLEEILRRDQQRL-HLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADE 131
L ++ P+LEE L RDQ R +++ + +++ A T P G + E
Sbjct: 142 L---PTKKMPTLEETLHRDQLRAAYIQRKFSGGGGAGGDVQRSDA-TVPTALGTSLNTLE 197
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P ++L+DTGS ++W QCKPC C Q DP FDPS S T+S C S
Sbjct: 198 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAD 257
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
C L + G SS +C Y + Y DGS TG +++D + + G A F
Sbjct: 258 CAQLGQ----EGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL------GSSAVRSFQF 307
Query: 252 GCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTV 308
GC++ +G + G+MGL G S++S+T + F YCL S+G++T G
Sbjct: 308 GCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 367
Query: 309 NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAP 368
TP++ + + FY + L I VGG +L + AS F+ T +DSGT+ITR P
Sbjct: 368 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSA-GTVMDSGTVITRLPPT 426
Query: 369 VYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGT 428
YSAL SAF+ MK+Y + + DTC+D S +V +P + + F GG + LD G
Sbjct: 427 AYSALSSAFKAGMKQYPPAQ-PSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGI 485
Query: 429 LVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++ CL FA D + ++GNVQQR +EV YDV +GF G C
Sbjct: 486 ILSN-----CLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 531
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 159/482 (32%), Positives = 234/482 (48%), Gaps = 53/482 (10%)
Query: 35 VSVSSLIPPTVCNRTRT--ALPQGPGKVSLEVLGRYGPCSKLNQGK-SRNTPSLEEILRR 91
+ SL+P T P+ + ++ ++GPCS L K + PS EIL
Sbjct: 38 LDAESLLPSAAAASCHTPEQRPEAGTATRMPIVHQHGPCSPLADDKHGKKAPSHTEILVA 97
Query: 92 DQQR---LHLKNS------RRLQKAIP-----------------DNFKKTKAFTFPAKTG 125
DQ+R +H + S RR + + P + PAK+G
Sbjct: 98 DQRRVEYIHRRVSETTGRVRRQKHSAPVVELRPGTPSSTRSSSSSLSSSATSTNLPAKSG 157
Query: 126 IVAADEYYIV-VAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFS 183
+ Y+V + +G P +++ DTGS TW QC+PC+ +C QQ++P F P+KS T++
Sbjct: 158 LSLNTGNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSATYA 217
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGY 243
I C S+ C L CS C Y + Y DGS GF+A D +T+ GY
Sbjct: 218 NISCTSSYCSDL-------DTRGCSGGHCLYAVQYGDGSYTVGFYAQDTLTL------GY 264
Query: 244 FARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYI 300
F GC + N G A+G+MGL RG S+ + Y F YC+ + TG++
Sbjct: 265 DTVKDFRFGCGEKNRGLFGKAAGLMGLGRGKTSVPVQAYDKYSGVFAYCIPATSSGTGFL 324
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGT 360
FG + TP++ FY++ +TGI VGG L + A+ F+ +DSGT
Sbjct: 325 DFGP-GAPAAANARLTPMLVD-NGPTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGT 382
Query: 361 IITRFPAPVYSALRSAFRKRMKK--YKMGKGIEDLFDTCYDLSAYK-TVVVPKITIHFLG 417
+ITR P Y LRSAF K M+ YK + DTCYDL+ Y+ ++ +P +++ F G
Sbjct: 383 VITRLPPSAYEPLRSAFAKGMEGLGYKTAPAFS-ILDTCYDLTGYQGSIALPAVSLVFQG 441
Query: 418 GVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
G L++D G L V V Q CL FA D + ++GN QQ+ Y V YD+ + +GF PG
Sbjct: 442 GACLDVDASGILYVADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPG 501
Query: 478 NC 479
C
Sbjct: 502 AC 503
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 150/448 (33%), Positives = 221/448 (49%), Gaps = 45/448 (10%)
Query: 62 LEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRL----HLKNSRRLQKAIPDNFKKTKA 117
+ ++ R+GPCS L + PS E+IL DQ R H ++ + P ++ +
Sbjct: 87 MTIVHRHGPCSPLADAHGK-PPSHEDILAADQNRAESIQHRVSTTATGRGNPKRSRRAPS 145
Query: 118 -------------------FTFPAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITW 157
+ PA +G + Y + V +G P +++ DTGS TW
Sbjct: 146 RRQQPSSAPAPAASLSSSTASLPASSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTW 205
Query: 158 TQCKPCIH-CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDI 216
QC+PC+ C +QR+ FDP++S T++ I C + C L CS C Y +
Sbjct: 206 VQCQPCVVVCYEQREKLFDPARSSTYANISCAAPACSDL-------DTRGCSGGNCLYGV 258
Query: 217 AYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVS 276
Y DGS GF+A D +T+ Y A F GC + N G A+G++GL RG S
Sbjct: 259 QYGDGSYSIGFFAMDTLTLSS-----YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTS 313
Query: 277 IISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLT 333
+ +T Y F +CL + TGY+ FG P + + T + T FY++ +T
Sbjct: 314 LPVQTYDKYGGVFAHCLPARSSGTGYLDFG-PGSPAAAGARLTTPMLTDNGPTFYYVGMT 372
Query: 334 GISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRM--KKYKMGKGIE 391
GI VGG+ L + S FT T +DSGT+ITR P YS+LRSAF M + YK +
Sbjct: 373 GIRVGGQLLSIPQSVFTTAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVS 432
Query: 392 DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSI 451
L DTCYD + V +P +++ F GG L++D G + SV QVCLGFA +
Sbjct: 433 -LLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASVSQVCLGFAANEDGGDVG 491
Query: 452 LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++GN Q + + V YD+ + +GF PG C
Sbjct: 492 IVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 166/486 (34%), Positives = 248/486 (51%), Gaps = 47/486 (9%)
Query: 9 LLFIWLLRSSNNGAYANDNDLSHSYIVSVSSLI--PPTVCNRTRTA-LPQGPGKVSLEVL 65
LL ++L + N+ A+ N+ H + +S P C+ +R L +G VS+ ++
Sbjct: 6 LLVCFILCTYNSLAHGG-NEEEHVLVAVPTSRYSEPAATCSTSRVRWLDEGSNTVSVPLV 64
Query: 66 GRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSR--RLQKAIPDNFKKTKAFTFPAK 123
R+GPC+ +S + PSL E LRR + R SR + +IP +
Sbjct: 65 HRHGPCAPST--RSSDEPSLSERLRRSRARSKYIMSRASKSNVSIPTHLG---------- 112
Query: 124 TGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC--IHCSQQRDPFFDPSKSKT 181
G V + EY + V +G P LL+DTGS ++W QC PC C Q+DP FDPS+S T
Sbjct: 113 -GSVDSLEYVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSST 171
Query: 182 FSKIPCNSTTCKILLEWFPPNGQDKCSS-----KECPYDIAYVDGSGETGFWATDRMTIQ 236
++ IPCN+ C+ L G D C+S +C Y I Y DGS TG ++ + +T+
Sbjct: 172 YAPIPCNTDACRDLTR--DGYGSD-CTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMA 228
Query: 237 EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSP 293
G + F GC + G + G++GL P S++ +T+ Y F YCL +
Sbjct: 229 P----GVTVK-DFHFGCGHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPAA 283
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
G++ G P FV +TP+V EQ FY + +TGI+VGGE + + S F+
Sbjct: 284 NDQAGFLALGAPVNDASGFV-FTPMVR--EQQTFYVVNMTGITVGGEPIDVPPSAFSG-G 339
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITI 413
IDSGT++T Y+AL++AFRK M Y + E DTCY+ + + V VP++ +
Sbjct: 340 MIIDSGTVVTELQHTAYAALQAAFRKAMAAYPLLPNGE--LDTCYNFTGHSNVTVPRVAL 397
Query: 414 HFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
F GG ++LDV +++++ CL F D +LGNV QR EV YDV R+G
Sbjct: 398 TFSGGATVDLDVPDGILLDN----CLAFQEAGPDNQPGILGNVNQRTLEVLYDVGHGRVG 453
Query: 474 FGPGNC 479
FG C
Sbjct: 454 FGADAC 459
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 163/472 (34%), Positives = 238/472 (50%), Gaps = 44/472 (9%)
Query: 23 YANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNT 82
+ +D +V+ SSL P VC+ + + +L ++ R+GPCS + S+
Sbjct: 24 HGTADDAQRYMVVASSSLEPSEVCSGQK--VTSSKNGATLPLVHRHGPCSPV---MSKEK 78
Query: 83 PSLEEILRRDQQR---LHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAI 138
PS EE L RDQ R +H K S + + + T P +G + EY I V++
Sbjct: 79 PSHEETLGRDQLRAANIHAKLSSPRNSSAKE--LQQSGVTIPTSSGYSLGTPEYVITVSL 136
Query: 139 GKPKQYVSLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILL 196
G P + +DTGS ++W QC PC CS Q+D FDP+KS T+S C+S C L
Sbjct: 137 GTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAKSATYSAFSCSSAQCAQL- 195
Query: 197 EWFPPNGQ-DKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTD 255
G+ + C + C Y + YVD S TG + +D + + + A F GC+
Sbjct: 196 -----GGEGNGCLNSHCQYIVKYVDHSNTTGTYGSDTLGLTTSD-----AVKNFQFGCSH 245
Query: 256 NNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLH-SPYGSTGYITFGKP--DTVN 309
G G+MGL S++S+T +Y F YCL S + G++T G T +
Sbjct: 246 RANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCLPPSSSSAGGFLTLGAAAGGTSS 305
Query: 310 KKFVKYTPIV--TTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPA 367
++ + TP+V P FY + L I+V G +L + AS F+ S +DSGT+IT+ P
Sbjct: 306 SRYSR-TPLVRFNVPT---FYGVFLQAITVAGTKLNVPASVFSGASV-VDSGTVITQLPP 360
Query: 368 PVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRG 427
Y ALR+AF+K MK Y + + DTC+D S KTV VP +T+ F G ++LDV G
Sbjct: 361 TAYQALRTAFKKEMKAYPSAAPV-GILDTCFDFSGIKTVRVPVVTLTFSRGAVMDLDVSG 419
Query: 428 TLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL F D ++ +LGNVQQR +E+ +DV G LGF PG C
Sbjct: 420 IFYAG-----CLAFTATAQDGDTGILGNVQQRTFEMLFDVGGSTLGFRPGAC 466
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 155/484 (32%), Positives = 234/484 (48%), Gaps = 50/484 (10%)
Query: 31 HSYIVSVSSLIP-PTVCNRTRTALPQGPGKVS----LEVLGRYGPCSKLNQGKSRNTPSL 85
H ++SV + P P+ + + G S + ++ R+GPCS L + PS
Sbjct: 50 HHVMLSVEDMFPGPSSSSCDDASREHKHGATSSGTRMTIVHRHGPCSPLAAAHGK-PPSH 108
Query: 86 EEILRRDQQRL----HLKNSRRLQKAIPDNFKKTKA-------------------FTFPA 122
E+IL DQ R H ++ + P ++ + + PA
Sbjct: 109 EDILAADQNRAESIQHRVSTTATARGNPKRSRRAPSRRQQPSSAPAPAASLSSSTASLPA 168
Query: 123 KTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH-CSQQRDPFFDPSKSK 180
+G + Y + V +G P +++ DTGS TW QC+PC+ C +Q++ FDP++S
Sbjct: 169 SSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSS 228
Query: 181 TFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNG 240
T++ + C + C L CS C Y + Y DGS GF+A D +T+
Sbjct: 229 TYANVSCAAPACFDL-------DTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSS--- 278
Query: 241 NGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGST 297
Y A F GC + N G A+G++GL RG S+ +T Y F +CL + T
Sbjct: 279 --YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGT 336
Query: 298 GYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEID 357
GY+ FG P + + T + T FY++ +TGI VGG+ L + S F T +D
Sbjct: 337 GYLDFG-PGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVD 395
Query: 358 SGTIITRFPAPVYSALRSAFRKRM--KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHF 415
SGT+ITR P P YS+LRSAF M + YK + L DTCYD + V +P +++ F
Sbjct: 396 SGTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVS-LLDTCYDFTGMSQVAIPTVSLLF 454
Query: 416 LGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
GG L++D G + SV QVCLGFA + ++GN Q + + V YD+ + +GF
Sbjct: 455 QGGAILDVDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFS 514
Query: 476 PGNC 479
PG C
Sbjct: 515 PGAC 518
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 159/423 (37%), Positives = 216/423 (51%), Gaps = 48/423 (11%)
Query: 83 PSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKP 141
P ILRRD R+ + RRL A A T PA G+ + EY + + IG P
Sbjct: 83 PHYTGILRRDHNRVRSIH-RRLTGA------GDTAATIPASLGLAFHSLEYVVTIGIGTP 135
Query: 142 KQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFP 200
+ ++L DTGS +TW QCKPC C QQ++P FDPSKS T+ +PC + CKI
Sbjct: 136 ARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTYVDVPCGTPQCKI------ 189
Query: 201 PNGQD-KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTG 259
GQD C C Y + Y D S G A + T+ + GC+ +
Sbjct: 190 GGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPA----AGVVFGCSHEYSS 245
Query: 260 DQNGA------SGIMGLDRGPVSIISKT----NISYFFYCLHSPYGSTGYITFG--KPDT 307
GA +G++GL RG SI+S+T + F YCL S GY+T G P
Sbjct: 246 GVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLPPRGSSAGYLTIGAAAPPQ 305
Query: 308 VNKKFVKYTPIVTTPEQ-SEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFP 366
N F TP+VT Q S Y + L GISV G LP+ AS F + T IDSGT+IT P
Sbjct: 306 SNLSF---TPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAF-YIGTVIDSGTVITHMP 361
Query: 367 APVYSALRSAFRKRMKKYKM-GKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDV 425
A Y LR FR+ M Y M +G + DTCYD++ + V P + + F GG +++D
Sbjct: 362 AAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYDVTGHDVVTAPPVALEFGGGARIDVDA 421
Query: 426 RGTLVV-------ESVRQVCLGFALLPSD-PNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
G L+V +S+ CL F +P++ P +++GN+QQR Y V +DV GRR+GFG
Sbjct: 422 SGILLVFAVDASGQSLTLACLAF--VPTNLPGFVIIGNMQQRAYNVVFDVEGRRIGFGAN 479
Query: 478 NCN 480
C+
Sbjct: 480 GCS 482
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 153/482 (31%), Positives = 234/482 (48%), Gaps = 48/482 (9%)
Query: 31 HSYIVSVSSLIPPTVCNRTRTALPQGPGKVS----LEVLGRYGPCSKLNQGKSRNTPSLE 86
H ++ V ++P + T G S + ++ R+GPCS L + PS +
Sbjct: 55 HHVMLRVEDVLPAPSSSSCDTPREHEHGASSSGTRMTIVHRHGPCSPLADAHGK-PPSHD 113
Query: 87 EILRRDQQRLHLKN-------------------SRRLQKAIPDNFKKTKAFTFP---AKT 124
EIL DQ R+ + SRR Q+ + + + A +
Sbjct: 114 EILAADQNRVESIHHRVSTTATVRGKPKRRPSPSRRQQQPSAPAPAASLSSSTASLPASS 173
Query: 125 G-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH-CSQQRDPFFDPSKSKTF 182
G + Y + + +G P +++ DTGS TW QC+PC+ C +Q++ FDP++S T+
Sbjct: 174 GRALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSSTY 233
Query: 183 SKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNG 242
+ + C + C L CS C Y + Y DGS GF+A D +T+
Sbjct: 234 ANVSCAAPACSDLYTR-------GCSGGHCLYSVQYGDGSYSIGFFAMDTLTLSS----- 281
Query: 243 YFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGY 299
Y A F GC + N G A+G++GL RG S+ +T Y F +CL + TGY
Sbjct: 282 YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGY 341
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ FG + TP++T FY++ +TGI VGG+ L + S F+ T +DSG
Sbjct: 342 LDFGPGSPAAVGARQTTPMLTD-NGPTFYYVGMTGIRVGGQLLSIPQSVFSTAGTIVDSG 400
Query: 360 TIITRFPAPVYSALRSAFRKRM--KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG 417
T+ITR P YS+LRSAF M + YK + L DTCYD + V +PK+++ F G
Sbjct: 401 TVITRLPPAAYSSLRSAFASAMAARGYKKAPALS-LLDTCYDFTGMSEVAIPKVSLLFQG 459
Query: 418 GVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
G L+++ G + S+ QVCLGFA D + ++GN Q + + V YD+ + +GF PG
Sbjct: 460 GAYLDVNASGIMYAASLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGFSPG 519
Query: 478 NC 479
C
Sbjct: 520 AC 521
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 238 bits (608), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 146/416 (35%), Positives = 211/416 (50%), Gaps = 34/416 (8%)
Query: 77 GKSRNTPSLEEILRRDQQRLHLKNSR-------RLQKAIPDNFKKTKAFTFPAKTGIVAA 129
G S + S E+ R D+QR+ R + A+ +++ T P G V
Sbjct: 82 GPSTASASFAEVQRADEQRVEYIQRRVSGGGARGAKGALQQLATGSRSATVPTTMG-VGT 140
Query: 130 DEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH--CSQQRDPFFDPSKSKTFSKIPC 187
+Y + V++G P ++ +DTGS ++W QCKPC C+ QRD FDP+KS T+S +PC
Sbjct: 141 FQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPC 200
Query: 188 NSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY 247
+ C L + + CS +C Y ++Y DGS TG + +D + + N G
Sbjct: 201 GADACSELRIY-----EAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNTVG----- 250
Query: 248 PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGK 304
FL GC G G G++ L R +S+ S+ +Y F YCL S + GY+T G
Sbjct: 251 TFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGG 310
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITR 364
P + + T ++T FY + LTGISVGG+++ + AS F T +D+GT+ITR
Sbjct: 311 PTSASG--FATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAG-GTVVDTGTVITR 367
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKG-IEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLEL 423
P Y+ALRSAFR + Y + DTCYD S Y V +P + + F GG L L
Sbjct: 368 LPPTAYAALRSAFRGAIAPYGYPSAPANGILDTCYDFSRYGVVTLPTVALTFSGGATLAL 427
Query: 424 DVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ G L CL FA D ++ +LGNVQQR + V +D G +GF PG C
Sbjct: 428 EAPGIL-----SSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 145/416 (34%), Positives = 210/416 (50%), Gaps = 34/416 (8%)
Query: 77 GKSRNTPSLEEILRRDQQRLHLKNSR-------RLQKAIPDNFKKTKAFTFPAKTGIVAA 129
G S + S E+ R D+QR+ R + A+ +++ T P G V
Sbjct: 82 GPSTASASFAEVQRADEQRVEYIQRRVSGGGARGAKGALQQLATGSRSATVPTTMG-VGT 140
Query: 130 DEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH--CSQQRDPFFDPSKSKTFSKIPC 187
+Y + V++G P ++ +DTGS ++W QCKPC C+ QRD FDP+KS T+S +PC
Sbjct: 141 FQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPC 200
Query: 188 NSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY 247
+ C L + + CS +C Y ++Y DGS TG + +D + + N G
Sbjct: 201 GADACSELRIY-----EAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNTVGT---- 251
Query: 248 PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGK 304
FL GC G G G++ L R +S+ S+ +Y F YCL S + GY+T G
Sbjct: 252 -FLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGG 310
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITR 364
P + + T ++T FY + LTGISVGG+++ + AS F T +D+GT+ITR
Sbjct: 311 PSSASG--FATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAG-GTVVDTGTVITR 367
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKG-IEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLEL 423
P Y+ALRSAFR + + DTCYD S Y V +P + + F GG L L
Sbjct: 368 LPPTAYAALRSAFRGAIAPCGYPSAPANGILDTCYDFSRYGVVTLPTVALTFSGGATLAL 427
Query: 424 DVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ G L CL FA D ++ +LGNVQQR + V +D G +GF PG C
Sbjct: 428 EAPGIL-----SSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 234 bits (598), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 169/483 (34%), Positives = 243/483 (50%), Gaps = 48/483 (9%)
Query: 3 ILFKAFLLF-IWLLRSSNNGAYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVS 61
+L FL F + ++ + NG++ V SS +P TVC+ Q V
Sbjct: 5 LLLCIFLCFYLSIVNGAGNGSFVT---------VPSSSFVPDTVCSGALVKPEQNGSAVY 55
Query: 62 LEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFP 121
+ +L R+GPC+ + PS+ E+ RR RL S K + P
Sbjct: 56 VPLLHRHGPCAP--SLSTDTPPSMSEMFRRSHARLSYIVSG-------------KKVSVP 100
Query: 122 AKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSK 178
A G V + EY V+ G P +++DTGS +TW QCKPC CS Q+DP FDPS
Sbjct: 101 AHLGTSVKSLEYVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSH 160
Query: 179 SKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQE 237
S T+S +PC S CK L +G CS+ + C + I+YVDG+ G + D++T+
Sbjct: 161 SSTYSAVPCASGECKKLAADAYGSG---CSNGQPCGFAISYVDGTSTVGVYGKDKLTLAP 217
Query: 238 VNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISK-TNISYFFYCLHSPYGS 296
G + F GC + + G++GL R S+ ++ F YCL +
Sbjct: 218 ----GAIVK-DFYFGCGHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPAVNSK 272
Query: 297 TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEI 356
G++ FG N +TP+ P Q F +TL GI+VGG++L L+ S F+ +
Sbjct: 273 PGFLAFGAGR--NPSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFSG-GMIV 329
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFL 416
DSGT++T + VY ALR+AFR+ MK Y++ G DL DTCYDL+ YK VVVPKI + F
Sbjct: 330 DSGTVVTVLQSTVYRALRAAFREAMKAYRLVHG--DL-DTCYDLTGYKNVVVPKIALTFS 386
Query: 417 GGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
GG + LDV ++V CL FA D + +LGNV QR +EV +D + + GF
Sbjct: 387 GGATINLDVPNGILVNG----CLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGFRA 442
Query: 477 GNC 479
C
Sbjct: 443 KAC 445
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 234 bits (597), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 158/402 (39%), Positives = 232/402 (57%), Gaps = 24/402 (5%)
Query: 88 ILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVS 146
+L +DQ R+ ++R K +FK+ +A P ++GI + A Y + +A+G PK +S
Sbjct: 1 MLLQDQLRVKSMHARFSNKNAGSHFKEMQA-DIPVQSGIPLGAGNYLVKMALGTPKLSLS 59
Query: 147 LLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
L LDTGS ITWTQC+PC+ C +Q FDP KS ++ + C+S++C+I+ + G
Sbjct: 60 LALDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSSSCRIITD---SGGAR 116
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQ--EVNGNGYFARYPFLLGCTDNNTGDQNG 263
C S C Y + Y DGS GF+AT+++TI +V N FL GC N G
Sbjct: 117 GCVSSTCIYKVQYGDGSYSVGFFATEKLTISPSDVISN-------FLFGCGQQNAGRFGR 169
Query: 264 ASGIMGLDRGPVSIISKTNISY---FFYCLHS-PYGSTGYITFGKPDTVNKKFVKYTPIV 319
+G++GL RG +S+ +T+ Y F YCL S STG++T G K VK+TP+
Sbjct: 170 IAGLLGLGRGKLSLALQTSEKYNNLFTYCLPSFSSSSTGHLTLGGQ---VPKSVKFTPLS 226
Query: 320 TTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRK 379
+ + FY I + G+SVGG LP+ AS F+ IDSGT+ITR VYSAL S F++
Sbjct: 227 PAFKNTPFYGIDIKGLSVGGHVLPIDASVFSNAGAIIDSGTVITRLQPTVYSALSSKFQQ 286
Query: 380 RMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTL-VVESVRQVC 438
MK Y G + DTCYD S +++ VP+I+ F GGV++++ G L V+ + +VC
Sbjct: 287 LMKDYPKTDGFS-ILDTCYDFSGNESISVPRISFFFKGGVEVDIKFFGILTVINAWDKVC 345
Query: 439 LGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L FA D + ++ GN QQ+ Y+V +D+A R+GF P CN
Sbjct: 346 LAFAPNDDDGDFVVFGNSQQQTYDVVHDLAKGRIGFAPSGCN 387
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 234 bits (596), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 161/485 (33%), Positives = 237/485 (48%), Gaps = 37/485 (7%)
Query: 8 FLLFIWLLRSSNNGAYANDNDLSHSYIV-SVSSLIPPTVCNRTRTALPQGPGKVSLEVLG 66
LLF+ L + Y + D H ++V S P VC+ + L +S+ ++
Sbjct: 5 LLLFVVLCSYCS---YISHADNEHGFVVVPRRSYEPKAVCSASSVNLEPSSATLSVPLVH 61
Query: 67 RYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRL--QKAIPDNFKKTKAFTFPAKT 124
RYGPC+ +Q TPS E LR + R + SR + PD+ A T P +
Sbjct: 62 RYGPCAA-SQYSDMPTPSFSETLRHSRARTNYIKSRASTGMASTPDD----AAVTVPTRL 116
Query: 125 G-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC--IHCSQQRDPFFDPSKSKT 181
G V + EY + + G P LL+DTGS ++W QC PC C Q+DP FDPSKS T
Sbjct: 117 GGFVDSLEYMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDPLFDPSKSST 176
Query: 182 FSKIPCNSTTCKILLEWFPPNGQDKCSS--KECPYDIAYVDGSGETGFWATDRMTIQEVN 239
++ I C + C L + + ++ C+S +C Y + Y DGS G ++ + +T
Sbjct: 177 YAPIACGADACNKLGDHY----RNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITFAP-- 230
Query: 240 GNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS 296
G + F GC + G + G++GL P S++ +T Y F YCL +
Sbjct: 231 --GITVK-DFHFGCGHDQRGPSDKFDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNSE 287
Query: 297 TGYITFG-KPDTV-NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLST 354
G++ G +P N +TP+ P + Y + +TGISVGG+ L + S F +
Sbjct: 288 AGFLALGVRPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAF-RGGM 346
Query: 355 EIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
IDSGTI+T P Y+AL +A RK Y M ED FDTCY+ + Y V VP++ +
Sbjct: 347 LIDSGTIVTELPETAYNALNAALRKAFAAYPM-VASED-FDTCYNFTGYSNVTVPRVALT 404
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
F GG ++LDV ++V+ CL F D ++GNV QR EV YD ++GF
Sbjct: 405 FSGGATIDLDVPNGILVKD----CLAFRESGPDVGLGIIGNVNQRTLEVLYDAGHGKVGF 460
Query: 475 GPGNC 479
G C
Sbjct: 461 RAGAC 465
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 158/460 (34%), Positives = 233/460 (50%), Gaps = 35/460 (7%)
Query: 35 VSVSSLIPPTVCNRTRTALPQGPGKVS--LEVLGRYGPCSKLNQGKSRNTPSLEEILRRD 92
VS +S +P + C+ PQ S L + R+GPC+ ++ S PS+ + LR D
Sbjct: 39 VSAASFVPSSTCSSPDPVPPQRRNGTSAVLRLTHRHGPCAP-SRASSLAAPSVADTLRAD 97
Query: 93 QQRLHLKNSRRLQKAIP---DNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLL 148
Q+R RR+ P D+ A T PA G + Y + ++G P ++
Sbjct: 98 QRRAEYIL-RRVSGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLGTPGVAQTME 156
Query: 149 LDTGSGITWTQCKPCI---HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
+DTGS ++W QCKPC C Q+DP FDP++S +++ +PC C L +
Sbjct: 157 VDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYA----AS 212
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGAS 265
CS+ +C Y ++Y DGS TG +++D +T+ + A F GC +G NG
Sbjct: 213 ACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS-----AVQGFFFGCGHAQSGLFNGVD 267
Query: 266 GIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFG--KPDTVNKKFVKYTPIVT 320
G++GL R S++ +T +Y F YCL + + GY+T G P F T ++
Sbjct: 268 GLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGLGGPSGAAPGF-STTQLLP 326
Query: 321 TPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKR 380
+P +Y + LTGISVGG++L + AS F T +D+GT+ITR P Y+ALRSAFR
Sbjct: 327 SPNAPTYYVVMLTGISVGGQQLSVPASAFAG-GTVVDTGTVITRLPPTAYAALRSAFRSG 385
Query: 381 MKKYKMGKGIED-LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCL 439
M Y + + DTCY+ + Y TV +P + + F G + L G L CL
Sbjct: 386 MASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVMLGADGILSFG-----CL 440
Query: 440 GFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
FA SD +LGNVQQR +EV D G +GF P +C
Sbjct: 441 AFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 478
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 152/442 (34%), Positives = 223/442 (50%), Gaps = 33/442 (7%)
Query: 57 PGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLH-----LKNSRRLQKAIPDN 111
P + S+ ++ R+GPC+ S PSL E LRRD+ R + R A+ D
Sbjct: 14 PNRASVPLVHRHGPCAP--SAASGGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDA 71
Query: 112 FKK-TKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC--IHCSQ 168
T TF + V + EY + + IG P ++L+DTGS ++W QCKPC C
Sbjct: 72 AGGGTSIPTFLGDS--VNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYA 129
Query: 169 QRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE---CPYDIAYVDGSGET 225
Q+DP FDPS S +++ +PC+S C+ L +G S C Y I Y + + T
Sbjct: 130 QKDPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTT 189
Query: 226 GFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY 285
G ++T+ +T++ F GC D+ G G++GL P S++S+T+ +
Sbjct: 190 GVYSTETLTLKP-----GVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQF 244
Query: 286 ---FFYCLHSPYGSTGYITFGKP----DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVG 338
F YCL G G++T G P + + +TP+ P FY +TLTGISVG
Sbjct: 245 GGPFSYCLPPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVG 304
Query: 339 GERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYK-MGKGIEDLFDTC 397
G L + S F+ IDSGT+IT PA Y+ALRSAFR M +Y+ + + DTC
Sbjct: 305 GAPLAIPPSAFSS-GMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTC 363
Query: 398 YDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQ 457
YD + + V VP I++ F GG ++L ++V+ CL FA +D ++GNV
Sbjct: 364 YDFTGHANVTVPTISLTFSGGATIDLAAPAGVLVDG----CLAFAGAGTDNAIGIIGNVN 419
Query: 458 QRGYEVHYDVAGRRLGFGPGNC 479
QR +EV YD +GF G C
Sbjct: 420 QRTFEVLYDSGKGTVGFRAGAC 441
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 156/470 (33%), Positives = 235/470 (50%), Gaps = 35/470 (7%)
Query: 28 DLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEE 87
+L++ +V SS P C+ + P + S+ ++ R+GPC+ S PSL E
Sbjct: 13 NLNNFAVVPASSFEPEAACSTSSAN--SDPNRASVPLVHRHGPCAP--SAASGGKPSLAE 68
Query: 88 ILRRDQQRLHLKNSRRLQKA-----IPDNFKK--TKAFTFPAKTGIVAADEYYIVVAIGK 140
LRRD+ R + ++ + D T TF + V + EY + + IG
Sbjct: 69 RLRRDRARANYIVTKAAGGRTAATAVSDAVGGGGTSIPTFLGDS--VDSLEYVVTLGIGT 126
Query: 141 PKQYVSLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEW 198
P +L+DTGS ++W QCKPC C Q+DP FDPS S +++ +PC+S C+ L
Sbjct: 127 PAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSDACRKLAAG 186
Query: 199 FPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNT 258
+G ++ C Y I Y + + TG ++T+ +T++ F GC D+
Sbjct: 187 AYGHGCTSGAAALCEYGIEYGNRATTTGVYSTETLTLKP-----GVVVADFGFGCGDHQH 241
Query: 259 GDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPD-----TVNK 310
G G++GL P S++S+T+ + F YCL G G++ G P+ T
Sbjct: 242 GPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAGFLALGAPNSSSSSTAAA 301
Query: 311 KFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVY 370
F+ +TP+ P FY +TLTGISVGG L + S F+ IDSGT+IT PA Y
Sbjct: 302 GFL-FTPMRRIPSVPTFYVVTLTGISVGGAPLAVPPSAFSS-GMVIDSGTVITGLPATAY 359
Query: 371 SALRSAFRKRMKKYK-MGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTL 429
+ALRSAFR M +Y+ + + DTCYD + + V VP I + F GG ++L +
Sbjct: 360 AALRSAFRSAMSEYRLLPPSNGAVLDTCYDFTGHTNVTVPTIALTFSGGATIDLATPAGV 419
Query: 430 VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+V+ CL FA +D ++GNV QR +EV YD +GF G C
Sbjct: 420 LVDG----CLAFAGAGTDDTIGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 465
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 145/450 (32%), Positives = 209/450 (46%), Gaps = 46/450 (10%)
Query: 62 LEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFP 121
+ ++ R+GPCS L PS EEIL DQ R R K + P
Sbjct: 90 MPIVHRHGPCSPLADAHGGKPPSHEEILDADQNRAESIQRRVSTTTTAARGKPKRNRPSP 149
Query: 122 AK--------------------------TGIVAADEYYIVVAIGKPKQYVSLLLDTGSGI 155
++ + Y + + +G P +++ DTGS
Sbjct: 150 SRRQQPSSSAPAPGASLSSSAASLPASSGRALGTGNYVVTIGLGTPAGRYTVVFDTGSDT 209
Query: 156 TWTQCKPCIH-CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPY 214
TW QC+PC+ C +Q++ FDP++S T + I C + C L CS C Y
Sbjct: 210 TWVQCEPCVVVCYEQQEKLFDPARSSTDANISCAAPACSDLY-------TKGCSGGHCLY 262
Query: 215 DIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGP 274
+ Y DGS GF+A D +T+ Y A F GC + N G A+G++GL RG
Sbjct: 263 GVQYGDGSYSIGFFAMDTLTLSS-----YDAIKGFRFGCGERNEGLFGEAAGLLGLGRGK 317
Query: 275 VSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHIT 331
S+ + Y F +C + TGY+ FG P + K T + FY++
Sbjct: 318 TSLPVQAYDKYGGVFAHCFPARSSGTGYLDFG-PGSSPAVSTKLTTPMLVDNGLTFYYVG 376
Query: 332 LTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRM--KKYKMGKG 389
LTGI VGG+ L + S FT T +DSGT+ITR P YS+LRSAF + + YK
Sbjct: 377 LTGIRVGGKLLSIPPSVFTTAGTIVDSGTVITRLPPAAYSSLRSAFASAIAARGYKKAPA 436
Query: 390 IEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPN 449
+ L DTCYD + V +P +++ F GG L++D G + SV Q CLGFA D +
Sbjct: 437 LS-LLDTCYDFTGMSQVAIPTVSLLFQGGASLDVDASGIIYAASVSQACLGFAANEEDDD 495
Query: 450 SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++GN Q + + V YD+ + +GF PG C
Sbjct: 496 VGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 152/442 (34%), Positives = 223/442 (50%), Gaps = 33/442 (7%)
Query: 57 PGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLH-----LKNSRRLQKAIPDN 111
P + S+ ++ R+GPC+ S PSL E LRRD+ R + R A+ D
Sbjct: 94 PNRASVPLVHRHGPCAP--SAASGGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDA 151
Query: 112 FKK-TKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC--IHCSQ 168
T TF + V + EY + + IG P ++L+DTGS ++W QCKPC C
Sbjct: 152 AGGGTSIPTFLGDS--VNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYA 209
Query: 169 QRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE---CPYDIAYVDGSGET 225
Q+DP FDPS S +++ +PC+S C+ L +G S C Y I Y + + T
Sbjct: 210 QKDPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTT 269
Query: 226 GFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY 285
G ++T+ +T++ F GC D+ G G++GL P S++S+T+ +
Sbjct: 270 GVYSTETLTLKP-----GVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQF 324
Query: 286 ---FFYCLHSPYGSTGYITFGKP----DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVG 338
F YCL G G++T G P + + +TP+ P FY +TLTGISVG
Sbjct: 325 GGPFSYCLPPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVG 384
Query: 339 GERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYK-MGKGIEDLFDTC 397
G L + S F+ IDSGT+IT PA Y+ALRSAFR M +Y+ + + DTC
Sbjct: 385 GAPLAIPPSAFSS-GMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTC 443
Query: 398 YDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQ 457
YD + + V VP I++ F GG ++L ++V+ CL FA +D ++GNV
Sbjct: 444 YDFTGHANVTVPTISLTFSGGATIDLAAPAGVLVDG----CLAFAGAGTDNAIGIIGNVN 499
Query: 458 QRGYEVHYDVAGRRLGFGPGNC 479
QR +EV YD +GF G C
Sbjct: 500 QRTFEVLYDSGKGTVGFRAGAC 521
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 146/443 (32%), Positives = 214/443 (48%), Gaps = 42/443 (9%)
Query: 62 LEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHL---------------KNSRRLQK 106
+ ++ R+GPCS L S+ PS +EIL DQ R K SRR Q
Sbjct: 91 MTIVHRHGPCSPLAAAHSK-PPSHDEILAADQNRAESIQHRVSTTATSRGQPKRSRRQQP 149
Query: 107 AIPDNFKKTKAFTFPAKTG----IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKP 162
+ + + + + + Y + V +G P +++ DTGS TW QC+P
Sbjct: 150 SSAPAPAASLSSSTASLPASPGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQP 209
Query: 163 CIH-CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDG 221
C+ C +QR+ FDP++S T++ + C + C L CS C Y + Y DG
Sbjct: 210 CVVVCYEQREKLFDPARSSTYANVSCAAPACSDL-------DTRGCSGGHCLYGVQYGDG 262
Query: 222 SGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKT 281
S GF+A D +T+ Y A F GC + N G A+G++GL RG S+ +T
Sbjct: 263 SYSIGFFAMDTLTLSS-----YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQT 317
Query: 282 NISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVG 338
Y F +CL + TGY+ FG + + TP++ FY++ LTGI VG
Sbjct: 318 YDKYGGVFAHCLPARSTGTGYLDFGAGSPAAR--LTTTPMLVD-NGPTFYYVGLTGIRVG 374
Query: 339 GERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRM--KKYKMGKGIEDLFDT 396
G L + S F T +DSGT+ITR P YS+LRSAF M + YK + L DT
Sbjct: 375 GRLLYIPQSVFATAGTIVDSGTVITRLPPAAYSSLRSAFAAAMSARGYKKAPAVS-LLDT 433
Query: 397 CYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNV 456
CYD + V +P +++ F GG L++D G + S QVCL FA + ++GN
Sbjct: 434 CYDFAGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNT 493
Query: 457 QQRGYEVHYDVAGRRLGFGPGNC 479
Q + + V YD+ + + F PG C
Sbjct: 494 QLKTFGVAYDIGKKVVSFSPGAC 516
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 156/450 (34%), Positives = 226/450 (50%), Gaps = 38/450 (8%)
Query: 35 VSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQ 94
V SS P +VC+ Q V + ++ R+GPC+ S +T S +I RR +
Sbjct: 29 VPSSSFEPESVCSGEFVKPEQNGSTVYVPLVHRHGPCAPAPS-LSTDTRSFADIFRRSRA 87
Query: 95 RLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGS 153
R P + K + PA G V + EY + V+ G P +++DTGS
Sbjct: 88 R-------------PSYIVRGKKVSVPAHLGTSVMSLEYVVRVSFGTPAVPQVVVIDTGS 134
Query: 154 GITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE 211
++W QCKPC C Q+DP +DPS S T+S +PC S CK L G S K+
Sbjct: 135 DVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCASDVCKKLAA--DAYGSGCTSGKQ 192
Query: 212 CPYDIAYVDGSGETGFWATDRMTIQ--EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMG 269
C + I+Y DG+ G ++ D++T+ + N YF GC + G++G
Sbjct: 193 CGFAISYADGTSTVGAYSQDKLTLAPGAIVQNFYF-------GCGHGKHAVRGLFDGVLG 245
Query: 270 LDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYH 329
L R S+ ++ F YCL S G++ G N +TP+ T P Q F
Sbjct: 246 LGRLRESLGARYG-GVFSYCLPSVSSKPGFLALGAGK--NPSGFVFTPMGTVPGQPTFST 302
Query: 330 ITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKG 389
+TL GI+VGG++L L+ S F+ +DSGT+IT + Y ALRSAFRK M+ Y++
Sbjct: 303 VTLAGINVGGKKLDLRPSAFSG-GMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPN 361
Query: 390 IEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPN 449
+ DTCY+L+ YK VVVPKI + F GG + LDV ++V CL FA D +
Sbjct: 362 GD--LDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGILVNG----CLAFAESGPDGS 415
Query: 450 SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ +LGNV QR +EV +D + + GF C
Sbjct: 416 AGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 445
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 136/367 (37%), Positives = 186/367 (50%), Gaps = 24/367 (6%)
Query: 119 TFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDP 176
+ PA+ G+ + + Y I V G P + +++ DTGS + W QCKPC + C Q++P FDP
Sbjct: 2 SIPARIGLFIGSGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDP 61
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQ 236
S S T+ + C C L CSS C Y + Y DGS GF A D +
Sbjct: 62 SLSSTYRNVSCTEPACVGL-------STRGCSSSTCLYGVFYGDGSSTIGFLAMDTFMLT 114
Query: 237 EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPV----SIISKTNISYFFYCLHS 292
F+ GC NNTG G +G++GL R S ++ + + F YCL S
Sbjct: 115 PAQ-----KFKNFIFGCGQNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPS 169
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL 352
+TGY+ G P + YT ++T Y I L GISVGG RL L ++ F +
Sbjct: 170 TSSATGYLNIGNP----QNTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSV 225
Query: 353 STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKIT 412
T IDSGT+ITR P YSAL++A R M +Y + + + DTCYD S +VV P I
Sbjct: 226 GTIIDSGTVITRLPPTAYSALKTAVRAAMTQYTLAPAVT-ILDTCYDFSRTTSVVYPVIV 284
Query: 413 IHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRL 472
+HF G+D+ + G V + QVCL FA ++GNVQQ EV YD +R+
Sbjct: 285 LHF-AGLDVRIPATGVFFVFNSSQVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRI 343
Query: 473 GFGPGNC 479
GF G C
Sbjct: 344 GFSAGAC 350
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 228 bits (582), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 146/443 (32%), Positives = 213/443 (48%), Gaps = 40/443 (9%)
Query: 62 LEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHL---------------KNSRRLQK 106
+ ++ R+GPCS L R PS EIL DQ R K SRR Q
Sbjct: 92 MTIVHRHGPCSPLAAAH-RKPPSHGEILAADQNRAESIQHRVSTTATGRGKPKRSRRQQP 150
Query: 107 AIPDNFKKTKAFTFP---AKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKP 162
+ + + + A +G + Y + V +G P +++ DTGS TW QC+P
Sbjct: 151 SSAPAPAASLSSSTASLPASSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQP 210
Query: 163 CIH-CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDG 221
C+ C +QR+ FDP++S T++ + C + C L CS C Y + Y DG
Sbjct: 211 CVVVCYEQREKLFDPARSSTYANVSCAAPACSDL-------NIHGCSGGHCLYGVQYGDG 263
Query: 222 SGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKT 281
S GF+A D +T+ Y A F GC + N G A+G++GL RG S+ +T
Sbjct: 264 SYSIGFFAMDTLTLSS-----YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQT 318
Query: 282 NISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVG 338
Y F +CL + TGY+ FG + TP++T FY++ +TGI VG
Sbjct: 319 YDKYGGVFAHCLPARSTGTGYLDFGAGSLAAARARLTTPMLTE-NGPTFYYVGMTGIRVG 377
Query: 339 GERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALR--SAFRKRMKKYKMGKGIEDLFDT 396
G+ L + S F T +DSGT+ITR P YS+LR A + YK + L DT
Sbjct: 378 GQLLSIPQSVFATAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVS-LLDT 436
Query: 397 CYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNV 456
CYD + V +P +++ F GG L++D G + S QVCL FA + ++GN
Sbjct: 437 CYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNT 496
Query: 457 QQRGYEVHYDVAGRRLGFGPGNC 479
Q + + V YD+ + +GF PG C
Sbjct: 497 QLKTFGVAYDIGKKVVGFYPGAC 519
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 228 bits (581), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 165/465 (35%), Positives = 240/465 (51%), Gaps = 36/465 (7%)
Query: 22 AYANDNDLSHSYIVSVSSLIPPTV-CNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSR 80
A+A D DL ++ V SL V C+ + A G V++ + R+GPCS + S
Sbjct: 21 AHAGD-DLRSYKVLPVGSLKSAAVSCSLPKVA--PSSGVVTVPLHHRHGPCSTV---PST 74
Query: 81 NTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIG 139
N P+LE++LRRDQ R + + T P G + EY I V +G
Sbjct: 75 NAPTLEDMLRRDQLRAAYITRKYSGVNGSAGDVEGSDVTVPTTLGTSLDTLEYLITVGMG 134
Query: 140 KPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWF 199
P ++L+DTGS ++W QCKPC C Q D FDPS S T+S C S C L
Sbjct: 135 SPAVAQTMLIDTGSDVSWVQCKPCSQCHSQADSLFDPSSSSTYSAFSCTSAACAQLR--- 191
Query: 200 PPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTG 259
Q CSS +C Y + Y DGS +G +++D + + G F GC+ + +G
Sbjct: 192 ----QRGCSSSQCQYTVKYGDGSTGSGTYSSDTLAL------GSSTVENFQFGCSQSESG 241
Query: 260 D--QNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVK 314
+ Q+ +G+MGL G S+ ++T ++ F YCL GS+G++T G FV
Sbjct: 242 NLLQDQTAGLMGLGGGAESLATQTAGTFGKAFSYCLPPTPGSSGFLTLGAS---TSGFVV 298
Query: 315 YTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALR 374
TP++ + + +Y + L I VGG +L + AS F+ S +DSGTIITR P YSAL
Sbjct: 299 KTPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASAFSAGSI-MDSGTIITRLPRTAYSALS 357
Query: 375 SAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESV 434
SAF+ MK+Y + + +FDTC+D S +V +P + + F GG ++L G ++
Sbjct: 358 SAFKAGMKQYPPAQPM-GIFDTCFDFSGQSSVSIPTVALVFSGGAVVDLASDGIILGS-- 414
Query: 435 RQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL FA D + ++GNVQQR +EV YDV G +GF G C
Sbjct: 415 ---CLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 228 bits (581), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 154/463 (33%), Positives = 229/463 (49%), Gaps = 29/463 (6%)
Query: 33 YIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRD 92
++VSV+ L+P VC ++ A + V+ R+GPCS L + PS ++L +D
Sbjct: 61 HVVSVADLLPAAVCTASQAASNSS-SASAFSVMHRHGPCSPLQ--TPGDAPSDADLLDQD 117
Query: 93 QQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDT 151
Q R+ L + + PA+ GI V Y + V +G P + ++++ DT
Sbjct: 118 QARVD----SILGMITNETSAVGPGVSLPAERGISVGTGNYVVSVGLGTPARDLTVVFDT 173
Query: 152 GSGITWTQCKPCIH--CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS 209
GS ++W QC PC C +Q+DP F PS S TFS + C + C+ G D+C
Sbjct: 174 GSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSAVRCGARECRARQSCGGSPGDDRC-- 231
Query: 210 KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA----RYP-FLLGCTDNNTGDQNGA 264
PY++ Y D S G D +T+ + A + P F+ GC +NNTG A
Sbjct: 232 ---PYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAENDNKLPGFVFGCGENNTGLFGQA 288
Query: 265 SGIMGLDRGPVSIISKTNISY---FFYCL-HSPYGSTGYITFGKPDTVNKKFVKYTPIVT 320
G+ GL RG VS+ S+ + F YCL S + GY++ G P ++TP++
Sbjct: 289 DGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSAPGYLSLGTP-VPAPAHAQFTPMLN 347
Query: 321 TPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKR 380
FY++ L GI V G + + +S L +DSGT+ITR Y ALR+AF
Sbjct: 348 RTTTPSFYYVKLVGIRVAGRAIRV-SSPRVALPLIVDSGTVITRLAPRAYRALRAAFLSA 406
Query: 381 MKKYKMGKGIE-DLFDTCYDLSAYK--TVVVPKITIHFLGGVDLELDVRGTLVVESVRQV 437
M KY + + DTCYD +A+ TV +P + + F GG + +D G L V V Q
Sbjct: 407 MGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQA 466
Query: 438 CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
CL FA ++ +LGN QQR V YDVA +++GF C+
Sbjct: 467 CLAFAPNGDGRSAGILGNTQQRTLAVVYDVARQKIGFAAKGCS 509
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 228 bits (580), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 170/476 (35%), Positives = 251/476 (52%), Gaps = 31/476 (6%)
Query: 11 FIWLLRSSNNGAYANDNDLSHSYIVSVSSLI-PPTVCNRTRTALPQGPGKVSLEVLGRYG 69
F+ L S + A+ D ++SV SL+ T C+ + P V++ + RY
Sbjct: 7 FLLALLFSYHTLIAHAADDRRHKVLSVGSLMKSSTACSEPKVTPPST--GVTVPLHHRYD 64
Query: 70 PCSKLNQGKSRNTPSLEEILRRDQQRL-HLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-V 127
PCS + S+ P+LEE LRRDQ R ++K R+ A + +++ A T P G +
Sbjct: 65 PCSPV---PSKKVPTLEERLRRDQLRAAYIK--RKFSGA--GDIEQSDAATVPTTLGTSL 117
Query: 128 AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPC 187
+ EY I V IG P ++ +DTGS ++W QCKPC C + D FDPS S T+S C
Sbjct: 118 STLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSSSSTYSPFSC 177
Query: 188 NSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY 247
+S C L + NG C S +C Y + Y D S TG +++D +T+ G A
Sbjct: 178 SSAPCAQLSQSQEGNG---CMSSQCQYIVNYGDSSSTTGTYSSDTLTL------GSSAMT 228
Query: 248 PFLLGCTDNNTGDQNGAS-GIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFG 303
F GC+ + +G N + G+MGL G S+ S+T ++ F YCL GS+G++T G
Sbjct: 229 DFQFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCLPPTSGSSGFLTLG 288
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIIT 363
T + FVK TP++ + + +Y + L I VG ++L L S F+ S +DSGTIIT
Sbjct: 289 ---TGSSGFVK-TPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFSAGSL-MDSGTIIT 343
Query: 364 RFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLEL 423
R P YSAL SAF+ M++Y + DTC+D S ++ +P +T+ F GG ++L
Sbjct: 344 RLPPTAYSALSSAFKAGMQQYPPAT-PSGILDTCFDFSGQSSISIPTVTLVFSGGAAVDL 402
Query: 424 DVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
G ++ S CL F D + ++GNVQQR +EV YDV G +GF G C
Sbjct: 403 AFDGIMLEISSSIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 228 bits (580), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 149/442 (33%), Positives = 215/442 (48%), Gaps = 45/442 (10%)
Query: 64 VLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKT-KAFTFPA 122
V+ R+GPCS L + PS ++L DQ R+ + + I + + + PA
Sbjct: 22 VMHRHGPCSPLQ--TPDDAPSDADLLEHDQARVD-----SIHRMIANETAVVGQDVSLPA 74
Query: 123 KTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH--CSQQRDPFFDPSKS 179
+ GI V Y + V +G P + ++++ DTGS ++W QC PC C Q+DP F PS S
Sbjct: 75 ERGISVGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSS 134
Query: 180 KTFSKIPCNSTTCKILLEWFPPNGQDKCSSK----ECPYDIAYVDGSGETGFWATDRMTI 235
TFS + C C P + CSS CPY++ Y D S G D +T+
Sbjct: 135 STFSAVRCGEPEC--------PRARQSCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTL 186
Query: 236 --------QEVNGNGYFARYP-FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY- 285
E N N + P F+ GC +NNTG A G+ GL RG VS+ S+ Y
Sbjct: 187 GTTPSTNASENNSN----KLPGFVFGCGENNTGLFGKADGLFGLGRGKVSLSSQAAGKYG 242
Query: 286 --FFYCL-HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL 342
F YCL S + GY++ G P ++TP++ FY++ L GI V G +
Sbjct: 243 EGFSYCLPSSSSNAHGYLSLGTPAPA-PAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAI 301
Query: 343 PLKAS-YFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE-DLFDTCYDL 400
+ + +DSGT+ITR YSALR+AF M KY + + DTCYD
Sbjct: 302 KVSSRPALWPAGLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTCYDF 361
Query: 401 SAYK--TVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQ 458
+A+ TV +P + + F GG + +D G L V V Q CL FA + ++ +LGN QQ
Sbjct: 362 TAHANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQACLAFAPNGNGRSAGILGNTQQ 421
Query: 459 RGYEVHYDVAGRRLGFGPGNCN 480
R V YDV +++GF C+
Sbjct: 422 RTVAVVYDVGRQKIGFAAKGCS 443
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 227 bits (579), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 144/402 (35%), Positives = 207/402 (51%), Gaps = 23/402 (5%)
Query: 83 PSLEEILRRDQQRL-HLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGK 140
P+LEE L RDQ R +++ + +++ A T P G + EY I V +G
Sbjct: 2 PTLEETLHRDQLRAAYIQRKFSGGGGAGGDVQRSDA-TVPTALGTSLNTLEYLITVGLGS 60
Query: 141 PKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFP 200
P ++L+DTGS ++W QCKPC C Q DP FDPS S T+S C S C L +
Sbjct: 61 PATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADCAQLGQ--- 117
Query: 201 PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD 260
G SS +C Y + Y DGS TG +++D + + G A F GC++ +G
Sbjct: 118 -EGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL------GSSAVRSFQFGCSNVESGF 170
Query: 261 QNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTP 317
+ G+MGL G S++S+T + F YCL S+G++T G TP
Sbjct: 171 NDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGSGTSGFVKTP 230
Query: 318 IVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAF 377
++ + + FY + L I VGG +L + AS F+ T +DSGT+ITR P YSAL SAF
Sbjct: 231 MLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSA-GTVMDSGTVITRLPPTAYSALSSAF 289
Query: 378 RKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV 437
+ MK+Y + + DTC+D S +V +P + + F GG + LD G ++
Sbjct: 290 KAGMKQYPPAQ-PSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIILSN----- 343
Query: 438 CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL FA D + ++GNVQQR +EV YDV +GF G C
Sbjct: 344 CLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 150/430 (34%), Positives = 218/430 (50%), Gaps = 38/430 (8%)
Query: 55 QGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKK 114
Q V + ++ R+GPC+ S +T S +I RR + R P +
Sbjct: 15 QNGSTVYVPLVHRHGPCAPAPS-LSTDTRSFADIFRRSRAR-------------PSYIVR 60
Query: 115 TKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI--HCSQQRD 171
K + PA G V + EY + V+ G P +++DTGS ++W QCKPC C Q+D
Sbjct: 61 GKKVSVPAHLGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKD 120
Query: 172 PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATD 231
P +DPS S T+S +PC S CK L G S K+C + I+Y DG+ G ++ D
Sbjct: 121 PLYDPSHSSTYSAVPCASDVCKKLAA--DAYGSGCTSGKQCGFAISYADGTSTVGAYSQD 178
Query: 232 RMTIQ--EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYC 289
++T+ + N YF GC + G++GL R S+ ++ F YC
Sbjct: 179 KLTLAPGAIVQNFYF-------GCGHGKHAVRGLFDGVLGLGRLRESLGARYG-GVFSYC 230
Query: 290 LHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF 349
L S G++ G N +TP+ T P Q F +TL GI+VGG++L L+ S F
Sbjct: 231 LPSVSSKPGFLALGAGK--NPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF 288
Query: 350 TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVP 409
+ +DSGT+IT + Y ALRSAFRK M+ Y++ + DTCY+L+ YK VVVP
Sbjct: 289 SG-GMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD--LDTCYNLTGYKNVVVP 345
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
KI + F GG + LDV ++V CL FA D ++ +LGNV QR +EV +D +
Sbjct: 346 KIALTFTGGATINLDVPNGILVNG----CLAFAESGPDGSAGVLGNVNQRAFEVLFDTST 401
Query: 470 RRLGFGPGNC 479
+ GF C
Sbjct: 402 SKFGFRAKAC 411
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 140/404 (34%), Positives = 201/404 (49%), Gaps = 33/404 (8%)
Query: 87 EILRRDQQRLHLKNSRRLQKAI-PDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYV 145
+++ RD R SR A P F +++ + EY++ V IG P
Sbjct: 83 DLVARDNARAEYLASRLSPAAYQPTGFSGSESKVVSGLD--EGSGEYFVRVGIGSPPTEQ 140
Query: 146 SLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
L++D+GS + W QCKPC+ C Q DP FDP+ S TFS +PC S C+ L
Sbjct: 141 YLVVDSGSDVIWVQCKPCLECYAQADPLFDPATSATFSAVPCGSAVCRTLR-------TS 193
Query: 206 KCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGA 264
C S C Y+++Y DGS G A + +T+ G +GC N G GA
Sbjct: 194 GCGDSGGCDYEVSYGDGSYTKGALALETLTLGGTAVEG------VAIGCGHRNRGLFVGA 247
Query: 265 SGIMGLDRGPVSIISK---TNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTT 321
+G++GL GP+S++ + F YCL S G + G+ + V + V + P+V
Sbjct: 248 AGLLGLGWGPMSLVGQLGGAAGGAFSYCLASR--GAGSLVLGRSEAVPEGAV-WVPLVRN 304
Query: 322 PEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITRFPAPVYSALRS 375
P+ FY++ L+GI VG ERLPL+ F +L+ + +D+GT +TR P Y+ALR
Sbjct: 305 PQAPSFYYVGLSGIGVGDERLPLQEDLF-QLTEDGAGGVVMDTGTAVTRLPQEAYAALRD 363
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVR 435
AF + G+ L DTCYDLS Y +V VP ++ +F G L L R L+
Sbjct: 364 AFVAAVGALPRAPGVS-LLDTCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLLLEVDGG 422
Query: 436 QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL FA PS +LGN+QQ G ++ D A +GFGP C
Sbjct: 423 IYCLAFA--PSSSGPSILGNIQQEGIQITVDSANGYIGFGPTTC 464
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 150/424 (35%), Positives = 219/424 (51%), Gaps = 49/424 (11%)
Query: 88 ILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQYVS 146
ILRRD+ R+ R + + + T T PA+ G+ + EY + + IG P + +
Sbjct: 82 ILRRDRHRV-----RSIYRRLTAAETTTTTTTIPARLGLAFQSLEYVVTIGIGTPPRNFT 136
Query: 147 LLLDTGSGITWTQCKPC--IHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG- 203
+L DTGS +TW QC PC C Q++P FDPSKS T+ +PC++ C I G
Sbjct: 137 VLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDVPCSAPECHI-------GGV 189
Query: 204 -QDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTD------N 256
Q +C + C Y + Y D S G A + T+ + A + GC+ N
Sbjct: 190 QQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAP-AATGVVFGCSHEYISVFN 248
Query: 257 NTGDQNGASGIMGLDRGPVSIISKTNIS------YFFYCLHSPYGSTGYITFGKPDTVNK 310
+TG G +G++GL RG SI+S+T S F YCL STGY+T G +
Sbjct: 249 DTG--MGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPRGSSTGYLTIGGGAAAPQ 306
Query: 311 K---FVKYTPIVTTPEQ-SEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFP 366
+ + +TP++TT Q Y + L G+SV G + + AS F+ L IDSGT++T P
Sbjct: 307 QQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFS-LGAVIDSGTVVTHMP 365
Query: 367 APVYSALRSAFRKRMKKYKM-GKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDV 425
A Y LR FR M YKM +G L DTCYD++ V P++ + F GG +++D
Sbjct: 366 AAAYYPLRDEFRLHMGSYKMLPEGSMKLLDTCYDVTGQDVVTAPRVALEFGGGARIDVDA 425
Query: 426 RGTLVV--------ESVRQVCLGFALLPSDPNS-ILLGNVQQRGYEVHYDVAGRRLGFGP 476
G L+V +S+ CL F LP++ +++GN+QQR Y V +DV G R+GFGP
Sbjct: 426 SGILLVLPAEDGSGQSLTLACLAF--LPTNSAGLVIVGNMQQRAYNVVFDVDGGRIGFGP 483
Query: 477 GNCN 480
C+
Sbjct: 484 NGCS 487
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 144/421 (34%), Positives = 210/421 (49%), Gaps = 39/421 (9%)
Query: 83 PSLE----EILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAI 138
PSL +++ RD R +R P F +++ + EY + V++
Sbjct: 120 PSLRHAVLDLVARDNARAEYLATRLSPAYQPPGFSGSESKVVSGLD--EGSGEYLVRVSV 177
Query: 139 GKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEW 198
G P L++D+GS + W QCKPC+ C Q DP FDP+ S TFS + C S C+IL
Sbjct: 178 GSPPTEQYLVVDSGSDVMWVQCKPCLECYVQADPLFDPATSATFSGVSCGSAICRIL--- 234
Query: 199 FPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNT 258
P + C Y+++Y DGS G A + +T+ G A ++GC N
Sbjct: 235 -PTSACGDGELGGCEYEVSYADGSYTKGALALETLTL------GGTAVEGVVIGCGHRNR 287
Query: 259 GDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS--PYGS------TGYITFGKPDT 307
G GA+G+MGL GP+S++ + F YCL S YGS G++ G+ +
Sbjct: 288 GLFVGAAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASRGGYGSGAADDDAGWLVLGRSEA 347
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTI 361
V + V + P+V P FY++ L+GI VG ERLPL+A F +L+ + +D+GT
Sbjct: 348 VPEGAV-WVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLF-QLTEDGAGDVVMDTGTT 405
Query: 362 ITRFPAPVYSALRSAFRKRMK-KYKMGKGI-EDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+TR P Y+ALR AF + +G+ + DTCYDLS Y +V VP ++ F G
Sbjct: 406 VTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSGYASVRVPTVSFCFDGDA 465
Query: 420 DLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L L R L+ + CL FA PS ++GN QQ G ++ D A +GFGP NC
Sbjct: 466 RLILAARNVLLEVDMGIYCLAFA--PSSSGLSIMGNTQQAGIQITVDSANGYIGFGPANC 523
Query: 480 N 480
Sbjct: 524 G 524
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 225 bits (573), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 158/462 (34%), Positives = 237/462 (51%), Gaps = 31/462 (6%)
Query: 31 HSYIVSVSSLIPPTVCNRTRTA-LPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEIL 89
H ++V +S P+ + A + P + S+ ++ R+GPC+ + + N PS E+L
Sbjct: 26 HGFVVVQTSTSSPSNAACSPAAQVTSDPSRASMPLMYRHGPCAPASAAAT-NRPSPAEML 84
Query: 90 RRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTG-IVAADEYYIVVAIGKPKQYVSLL 148
RRD+ R + L+KA + T + P G V + +Y + + G P LL
Sbjct: 85 RRDRAR----RNHILRKA--SGRRITLGVSIPTSLGAFVDSLQYVVTLGFGTPAVPQVLL 138
Query: 149 LDTGSGITWTQCKPCIH--CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDK 206
+DTGS ++W QC+PC C Q+DP FDPS S T++ +PC S C+ L NG
Sbjct: 139 IDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPCGSEACRDLDPDSYANGCTN 198
Query: 207 CSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGA 264
SS C Y I Y +G G ++T+ +T+ F GC G +
Sbjct: 199 SSSGASLCQYGIQYGNGDTTVGVYSTETLTLSP---EAATVVNNFSFGCGLVQKGVFDLF 255
Query: 265 SGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDT--VNKKFVKYTPIV 319
G++GL P S++S+T +Y F YCL + + G++ G P T N ++TP+
Sbjct: 256 DGLLGLGGAPESLVSQTTGTYGGAFSYCLPAGNSTAGFLALGAPATGGNNTAGFQFTPLQ 315
Query: 320 TTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRK 379
++ FY + LTGISVGG++L ++ + F IDSGTI+T P YSALR+AFR
Sbjct: 316 VV--ETTFYLVKLTGISVGGKQLDIEPTVFAG-GMIIDSGTIVTGLPETAYSALRTAFRS 372
Query: 380 RMKKYKM--GKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV 437
M Y + EDL DTCYD + V VP + + F GGV ++LDV ++++
Sbjct: 373 AMSAYPLLPPNDDEDL-DTCYDFTGNTNVTVPTVALTFEGGVTIDLDVPSGVLLDG---- 427
Query: 438 CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL F SD ++ ++GNV QR +EV YD A +GF G C
Sbjct: 428 CLAFVAGASDGDTGIIGNVNQRTFEVLYDSARGHVGFRAGAC 469
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 224 bits (572), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 171/488 (35%), Positives = 254/488 (52%), Gaps = 48/488 (9%)
Query: 22 AYANDNDLSHSYIVSVSS-----LIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQ 76
A+A D D S+ ++S+ S VC+ +R P V L R+GPCS L
Sbjct: 24 AHAGD-DGSYKLVLSIGSHQSLRTNKSVVCSESRA--PAVHATVPLH--HRHGPCSPL-- 76
Query: 77 GKSRNTPSLEEILRRDQQR---LHLKNSRRLQKAIPDN-----FKKTKAFTFPAKTGI-V 127
++ P+LEE L RD+ R +H K SR ++ +++ A T P G +
Sbjct: 77 -PNKKMPTLEERLHRDKLRAAYIHRKLSRGKKQGGGGAGGDVVVQQSHAMTVPTTLGTSL 135
Query: 128 AADEYYIVVAIGKPK-QYVSLLLDTGSGITWTQCKPCIH-CSQQRDPFFDPSKSKTFSKI 185
EY I V +G P + ++L+DTGS I+W +CKPC C Q DP FDPS S T+S
Sbjct: 136 DTLEYVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQQCRPQVDPLFDPSLSSTYSPF 195
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVDGS-GETGFWATDRMTIQEVNGNGY 243
C+S C L + NG CSS +C Y Y DGS G TG +++D + + +
Sbjct: 196 SCSSAACAQLFQEGNANG---CSSSGQCQYIAMYGDGSVGTTGTYSSDTLALGSNSNTVV 252
Query: 244 FARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY----FFYCLHSPYGSTGY 299
+++ F GC+ TG +G+MGL G S++S+T ++ F YCL S+G+
Sbjct: 253 VSKFRF--GCSHAETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTAFSYCLPPTPSSSGF 310
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+T G T + FVK TP++ + + FY + L I VGG +L + + F+ +DSG
Sbjct: 311 LTLGAAGTSSAGFVK-TPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTVFSA-GMIMDSG 368
Query: 360 TIITRFPAPVYSALRSAFRKRMKKY-----KMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
T++TR P YS+L SAF+ MK+Y G G DTC+D+S +V +P + +
Sbjct: 369 TVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGG---FLDTCFDMSGQSSVSMPTVALV 425
Query: 415 F--LGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRR 471
F GG + LD G L+ +E+ CL F D ++ ++GNVQQR ++V YDVAG
Sbjct: 426 FSGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGIIGNVQQRTFQVLYDVAGGA 485
Query: 472 LGFGPGNC 479
+GF G C
Sbjct: 486 VGFKAGAC 493
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 224 bits (571), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 142/415 (34%), Positives = 207/415 (49%), Gaps = 47/415 (11%)
Query: 87 EILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVA-----ADEYYIVVAIGKP 141
+++ RD R SR P +F F +++ +V+ + EY++ V IG P
Sbjct: 82 DLVSRDNARAEYLASRLSPAYQPTDF-------FGSESKVVSGLDEGSGEYFVRVGIGSP 134
Query: 142 KQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPP 201
L++D+GS + W QCKPC+ C Q DP FDP+ S TFS + C S C+ L
Sbjct: 135 PTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPASSATFSAVSCGSAICRTLR----- 189
Query: 202 NGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD 260
C S C Y+++Y DGS G A + +T+ G A +GC N G
Sbjct: 190 --TSGCGDSGGCEYEVSYGDGSYTKGTLALETLTL------GGTAVEGVAIGCGHRNRGL 241
Query: 261 QNGASGIMGLDRGPVSIISK---TNISYFFYCLHSPYGS-------TGYITFGKPDTVNK 310
GA+G++GL GP+S++ + F YCL S GS G + G+ + V +
Sbjct: 242 FVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGSLVLGRSEAVPE 301
Query: 311 KFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITR 364
V + P+V P+ FY++ ++GI VG ERLPL+ F +L+ + +D+GT +TR
Sbjct: 302 GAV-WVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLF-QLTEDGGGGVVMDTGTAVTR 359
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELD 424
P Y+ALR AF + G+ L DTCYDLS Y +V VP ++ +F G L L
Sbjct: 360 LPQEAYAALRDAFVGAVGALPRAPGVS-LLDTCYDLSGYTSVRVPTVSFYFDGAATLTLP 418
Query: 425 VRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
R L+ CL FA PS +LGN+QQ G ++ D A +GFGP C
Sbjct: 419 ARNLLLEVDGGIYCLAFA--PSSSGLSILGNIQQEGIQITVDSANGYIGFGPATC 471
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 224 bits (570), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 158/473 (33%), Positives = 230/473 (48%), Gaps = 40/473 (8%)
Query: 34 IVSVSSLIP-PTVCNRT--RTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILR 90
++ V SL P P+ C T R + + ++ R+GPCS L + PS EIL
Sbjct: 44 LLRVDSLFPGPSSCTSTQERKPITATSSAARVPIVHRHGPCSPLAGAHAGKPPSHAEILA 103
Query: 91 RDQQRLHLKNSR------------RLQKAIPDNFKKTKAFTFPAKT-----GIVAADEYY 133
DQ R+ + R R +K P + + + + + G+ Y
Sbjct: 104 ADQNRVESLHHRVSSTTTGLGGKPRTKKKTPGHSSVPASSSSSSSSVPASSGLSLGTANY 163
Query: 134 IV-VAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNSTT 191
+V + +G P +++ DTGS TW QC+PC+ C +Q+D FDP+KS T++ + C
Sbjct: 164 VVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTYANVSCADPA 223
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
C L C++ C Y I Y DGS GF+A D + + + G F
Sbjct: 224 CADL-------DASGCNAGHCLYGIQYGDGSYTVGFFAKDTLAVAQDAIKG------FKF 270
Query: 252 GCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTV 308
GC + N G +G++GL RGP SI + Y F YCL + +TGY+ FG
Sbjct: 271 GCGEKNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPASSAATGYLEFGPLSPS 330
Query: 309 NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL-PLKASYFTKLSTEIDSGTIITRFPA 367
+ T + T + FY++ LTGI VGG++L + S F+ T +DSGT+ITR P
Sbjct: 331 SSGSNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESVFSNSGTLVDSGTVITRLPD 390
Query: 368 PVYSALRSAFRKRMKKYKMGKGIE-DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVR 426
Y+AL SAF M K + DTCYD + V +P +++ F GG L+LD
Sbjct: 391 TAYAALSSAFAAAMAASGYKKAAAYSILDTCYDFTGLSQVSLPTVSLVFQGGACLDLDAS 450
Query: 427 GTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
G + S QVCLGFA D + ++GN QQR Y V YDV+ + +GF PG C
Sbjct: 451 GIVYAISQSQVCLGFASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGFAPGAC 503
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 147/443 (33%), Positives = 212/443 (47%), Gaps = 40/443 (9%)
Query: 62 LEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHL---------------KNSRRLQK 106
+ ++ R+GPCS L R PS EIL DQ R K SRR Q
Sbjct: 90 MTIVHRHGPCSPLAAAH-RKPPSHGEILAADQNRAESIQHRVSTTATGRGKPKRSRRQQP 148
Query: 107 AIPDNFKKTKAFTFP---AKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKP 162
+ + + + A +G + Y + V +G P +++ DTGS TW QC+P
Sbjct: 149 SSAPAPAASLSSSTASLPASSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQP 208
Query: 163 CIH-CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDG 221
C+ C +Q++ FDP +S T++ + C + C L CS C Y + Y DG
Sbjct: 209 CVVVCYEQQEKLFDPVRSSTYANVSCAAPACSDL-------NIHGCSGGHCLYGVQYGDG 261
Query: 222 SGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKT 281
S GF+A D +T+ Y A F GC + N G A+G++GL RG S+ +T
Sbjct: 262 SYSIGFFAMDTLTLSS-----YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQT 316
Query: 282 NISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVG 338
Y F +CL + TGY+ FG TP++T FY+I +TGI VG
Sbjct: 317 YDKYGGVFAHCLPARSTGTGYLDFGAGSPAAASARLTTPMLTD-NGPTFYYIGMTGIRVG 375
Query: 339 GERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALR--SAFRKRMKKYKMGKGIEDLFDT 396
G+ L + S F T +DSGT+ITR P P YS+LR A + YK + L DT
Sbjct: 376 GQLLSIPQSVFATAGTIVDSGTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVS-LLDT 434
Query: 397 CYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNV 456
CYD + V +P +++ F GG L++D G + S QVCL FA + ++GN
Sbjct: 435 CYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNT 494
Query: 457 QQRGYEVHYDVAGRRLGFGPGNC 479
Q + + V YD+ + +GF PG C
Sbjct: 495 QLKTFGVAYDIGKKVVGFYPGVC 517
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 162/455 (35%), Positives = 233/455 (51%), Gaps = 52/455 (11%)
Query: 37 VSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRL 96
VSSL+P C+ + QG L + +YGPCS G S+ PS +EI RD+ R+
Sbjct: 46 VSSLLPKNKCSASARGGSQG-----LPITQKYGPCS--GSGHSQ-PPSPQEIFGRDESRV 97
Query: 97 HLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIV-VAIGKPKQYVSLLLDTGSGI 155
NS+ + N K + D ++V VA G P + L+LDTGS I
Sbjct: 98 SFINSK-CNQYTSGNLKN-----HAHNNNLFDEDGNFLVDVAFGTPXTEIXLILDTGSSI 151
Query: 156 TWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYD 215
TWTQCK C++C Q + +FD S S T+S C +T E Y+
Sbjct: 152 TWTQCKACVNCLQDSNRYFDSSASSTYSFGSCIPSTV------------------ENNYN 193
Query: 216 IAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-QNGASGIMGLDRGP 274
+ Y D S G + D MT++ + F ++ F GC NN GD +G G++GL +G
Sbjct: 194 MTYGDDSTSVGNYGCDTMTLEPSD---VFQKFQF--GCGRNNKGDFGSGVDGMLGLGQGQ 248
Query: 275 VSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTP---EQSEFY 328
+S +S+T + F YCL S G + FG+ T +K+T +V P ++S +Y
Sbjct: 249 LSTVSQTASKFNKVFSYCLPEE-DSIGSLLFGEKATSQSSSLKFTSLVNGPGTLQESGYY 307
Query: 329 HITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGK 388
+ L+ ISVG ERL + +S F T IDS T+ITR P YSAL++AF+K M KY +
Sbjct: 308 FVNLSDISVGNERLNIPSSVFASPGTIIDSRTVITRLPQRAYSALKAAFKKAMAKYPLSN 367
Query: 389 GIE---DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLP 445
G D+ DTCY+LS K V++P+I +HF GG D+ L+ + ++CL FA
Sbjct: 368 GRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTNIVWGSDASRLCLAFA--- 424
Query: 446 SDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
++GN QQ V YD+ GRR+GFG C+
Sbjct: 425 GTSELTIIGNRQQLSLTVLYDIQGRRIGFGGNGCS 459
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 131/369 (35%), Positives = 188/369 (50%), Gaps = 21/369 (5%)
Query: 119 TFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDP 176
T P TG + E+ + V G P Q +++ DTGS ++W QC PC HC +Q DP FDP
Sbjct: 121 TIPDSTGTSLDTLEFVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDP 180
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQ 236
+KS T+S +PC C +G KCS+ C Y + Y DGS G + + +++
Sbjct: 181 TKSATYSVVPCGHPQCAAA------DGS-KCSNGTCLYKVEYGDGSSSAGVLSHETLSLT 233
Query: 237 EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSP 293
A F GC N GD G++GL RG +S+ S+ S+ F YCL S
Sbjct: 234 STR-----ALPGFAFGCGQTNLGDFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPSD 288
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
+ GY+T G + V+YT +V + FY + L I +GG LP+ + FT
Sbjct: 289 NTTHGYLTIGPTTPASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDDG 348
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITI 413
T +DSGTI+T P Y+ALR F+ M +YK D FDTCYD + + +P ++
Sbjct: 349 TFLDSGTILTYLPPEAYTALRDRFKFTMTQYKPAPAY-DPFDTCYDFTGQSAIFIPAVSF 407
Query: 414 HFLGGVDLELDVRGTLVV--ESVRQV-CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR 470
F G +L G L+ ++ + CLGF PS ++GN+QQR EV YDVA
Sbjct: 408 KFSDGSVFDLSFFGILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAE 467
Query: 471 RLGFGPGNC 479
++GF +C
Sbjct: 468 KIGFASASC 476
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 146/443 (32%), Positives = 212/443 (47%), Gaps = 40/443 (9%)
Query: 62 LEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHL---------------KNSRRLQK 106
+ ++ R+GPCS L R PS EIL DQ R K SRR Q
Sbjct: 92 MTIVHRHGPCSPLAAAH-RKPPSHGEILAADQNRAESIQHRVSTTATGRGKPKRSRRQQP 150
Query: 107 AIPDNFKKTKAFTFP---AKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKP 162
+ + + + A +G + Y + V +G P +++ DTGS TW QC+P
Sbjct: 151 SSAPAPAASLSSSTASLPASSGRALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQP 210
Query: 163 CIH-CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDG 221
C+ C +QR+ FDP++S T++ + C + C L CS C Y + Y DG
Sbjct: 211 CVVVCYEQREKLFDPARSSTYANVSCAAPACSDL-------NIHGCSGGHCLYGVQYGDG 263
Query: 222 SGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKT 281
S GF+A D +T+ Y A F GC + N G A+G++GL RG S+ +T
Sbjct: 264 SYSIGFFAMDTLTLSS-----YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQT 318
Query: 282 NISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVG 338
Y F +CL + TGY+ FG TP++T FY++ +TGI VG
Sbjct: 319 YDKYGGVFAHCLPARSTGTGYLDFGAGSLAAASARLTTPMLTD-NGPTFYYVGMTGIRVG 377
Query: 339 GERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALR--SAFRKRMKKYKMGKGIEDLFDT 396
G+ L + S F T +DSGT+ITR P YS+LR A + YK + L DT
Sbjct: 378 GQLLSIPQSVFATAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVS-LLDT 436
Query: 397 CYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNV 456
CYD + V +P +++ F GG L++D G + S QVCL FA + ++GN
Sbjct: 437 CYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNT 496
Query: 457 QQRGYEVHYDVAGRRLGFGPGNC 479
Q + + V YD+ + +GF PG C
Sbjct: 497 QLKTFGVAYDIGKKVVGFYPGAC 519
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 147/450 (32%), Positives = 212/450 (47%), Gaps = 55/450 (12%)
Query: 62 LEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQR---------------LHLKNSRRLQK 106
+ ++ R+GPCS L PS EIL DQ R ++ K SR Q+
Sbjct: 89 MTIVHRHGPCSPLAAAHG-EPPSHGEILAADQSRAESIQHRVSTTTTDRVNPKRSRHRQQ 147
Query: 107 --------AIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWT 158
A + P + + Y + V +G P +++ DTGS TW
Sbjct: 148 QPPSAPAPAASLSSSTASLPASPGRA--LGTGNYVVTVGLGTPASRYTVVFDTGSDTTWV 205
Query: 159 QCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIA 217
QC+PC+ C +QR+ FDP+ S T++ + C + C L CS C Y +
Sbjct: 206 QCQPCVVACYEQREKLFDPASSSTYANVSCAAPACSDL-------DVSGCSGGHCLYGVQ 258
Query: 218 YVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSI 277
Y DGS GF+A D +T+ Y A F GC + N G A+G++GL RG S+
Sbjct: 259 YGDGSYSIGFFAMDTLTLSS-----YDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSL 313
Query: 278 ISKTNISY---FFYCLHSPYGSTGYITFGK---PDTVNKKFVKYTPIVTTPEQSEFYHIT 331
+T Y F +CL + TGY+ FG P T TP++T FY++
Sbjct: 314 PVQTYGKYGGVFAHCLPARSTGTGYLDFGAGSPPATTT------TPMLTG-NGPTFYYVG 366
Query: 332 LTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRS--AFRKRMKKYKMGKG 389
+TGI VGG LP+ S F T +DSGT+ITR P YS+LRS A + Y+
Sbjct: 367 MTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAA 426
Query: 390 IEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPN 449
+ L DTCYD + V +P +++ F GG L++D G + S QVCL FA +
Sbjct: 427 VS-LLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGD 485
Query: 450 SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++GN Q + + V YD+ + +GF PG C
Sbjct: 486 VGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 221 bits (564), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 147/450 (32%), Positives = 211/450 (46%), Gaps = 55/450 (12%)
Query: 62 LEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQR---------------LHLKNSRRLQK 106
+ ++ R+GPCS L PS EIL DQ R ++ K SR Q+
Sbjct: 90 MTIVHRHGPCSPLAAAHG-EPPSHGEILAADQSRAESIQHRVSTTTTGRVNPKRSRHRQQ 148
Query: 107 --------AIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWT 158
A + P + + Y + V +G P +++ DTGS TW
Sbjct: 149 QPPSAPAPAASLSSSTASLPASPGRA--LGTGNYVVTVGLGTPASRYTVVFDTGSDTTWV 206
Query: 159 QCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIA 217
QC+PC+ C +QR+ FDP+ S T++ + C + C L CS C Y +
Sbjct: 207 QCQPCVVACYEQREKLFDPASSSTYANVSCAAPACSDL-------DVSGCSGGHCLYGVQ 259
Query: 218 YVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSI 277
Y DGS GF+A D +T+ Y A F GC + N G A+G++GL RG S+
Sbjct: 260 YGDGSYSIGFFAMDTLTLSS-----YDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSL 314
Query: 278 ISKTNISY---FFYCLHSPYGSTGYITFGK---PDTVNKKFVKYTPIVTTPEQSEFYHIT 331
+T Y F +CL TGY+ FG P T TP++T FY++
Sbjct: 315 PVQTYGKYGGVFAHCLPPRSTGTGYLDFGAGSPPATTT------TPMLTG-NGPTFYYVG 367
Query: 332 LTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRS--AFRKRMKKYKMGKG 389
+TGI VGG LP+ S F T +DSGT+ITR P YS+LRS A + Y+
Sbjct: 368 MTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAA 427
Query: 390 IEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPN 449
+ L DTCYD + V +P +++ F GG L++D G + S QVCL FA +
Sbjct: 428 VS-LLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGD 486
Query: 450 SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++GN Q + + V YD+ + +GF PG C
Sbjct: 487 VGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 221 bits (562), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 144/450 (32%), Positives = 210/450 (46%), Gaps = 55/450 (12%)
Query: 62 LEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRL----HLKNSRRLQKAIPDNFKKTK- 116
+ ++ R+GPCS L PS EIL DQ R H ++ + P + +
Sbjct: 93 MTIVHRHGPCSPLAAAHG-EPPSHGEILAADQSRAESIQHRVSTTTTGRVNPKRRRHRQQ 151
Query: 117 ------------------AFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWT 158
P + + Y + V +G P +++ DTGS TW
Sbjct: 152 QPPSAPAPAASLSSSTASLPASPGRA--LGTGNYVVTVGLGTPASRYTVVFDTGSDTTWV 209
Query: 159 QCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIA 217
QC+PC+ C +QR+ FDP+ S T++ + C + C L CS C Y +
Sbjct: 210 QCQPCVVACYEQREKLFDPASSSTYANVSCAAPACSDL-------DVSGCSGGHCLYGVQ 262
Query: 218 YVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSI 277
Y DGS GF+A D +T+ Y A F GC + N G A+G++GL RG S+
Sbjct: 263 YGDGSYSIGFFAMDTLTLSS-----YDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSL 317
Query: 278 ISKTNISY---FFYCLHSPYGSTGYITFGK---PDTVNKKFVKYTPIVTTPEQSEFYHIT 331
+T Y F +CL + TGY+ FG P T TP++T FY++
Sbjct: 318 PVQTYGKYGGVFAHCLPARSTGTGYLDFGAGSPPATTT------TPMLTG-NGPTFYYVG 370
Query: 332 LTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRS--AFRKRMKKYKMGKG 389
+TGI VGG LP+ S F T +DSGT+ITR P YS+LRS A + Y+
Sbjct: 371 MTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAA 430
Query: 390 IEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPN 449
+ L DTCYD + V +P +++ F GG L++D G + S QVCL FA +
Sbjct: 431 VS-LLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGD 489
Query: 450 SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++GN Q + + V YD+ + +GF PG C
Sbjct: 490 VGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 149/439 (33%), Positives = 218/439 (49%), Gaps = 27/439 (6%)
Query: 53 LPQG---PGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIP 109
LPQ G + LE+ R G CS+ R +E+ L D + + ++
Sbjct: 44 LPQSRKEKGAIILEMKDR-GECSE----SERKGDWVEKQLVLDGLHVRSIQNHIRKRTSS 98
Query: 110 DNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQ 169
+ P +GI YIV +G Q +S+++DTGS +TW QC+PC C Q
Sbjct: 99 SQIADSSETQVPLTSGIKFQTLNYIVT-MGLGSQNMSVIVDTGSDLTWVQCEPCRSCYNQ 157
Query: 170 RDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWA 229
P F PS S ++ I CNSTTC+ L G D +S C Y + Y DGS +G
Sbjct: 158 NGPLFKPSTSPSYQPILCNSTTCQSL--ELGACGSDPSTSATCDYVVNYGDGSYTSGELG 215
Query: 230 TDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---F 286
+++ G G + F+ GC NN G GASG+MGL R +S+IS+TN ++ F
Sbjct: 216 IEKL------GFGGISVSNFVFGCGRNNKGLFGGASGLMGLGRSELSMISQTNATFGGVF 269
Query: 287 FYCLHS--PYGSTGYITFGKPDTVNKKF--VKYTPIVTTPEQSEFYHITLTGISVGGERL 342
YCL S G++G + G V K + YT ++ + S FY + LTGI VGG L
Sbjct: 270 SYCLPSTDQAGASGSLVMGNQSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSL 329
Query: 343 PLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSA 402
++AS F +DSGT+I+R VY AL++ F ++ + G + DTC++L+
Sbjct: 330 HVQASSFGNGGVILDSGTVISRLAPSVYKALKAKFLEQFSGFPSAPGF-SILDTCFNLTG 388
Query: 403 YKTVVVPKITIHFLGGVDLELDVRGT--LVVESVRQVCLGFALLPSDPNSILLGNVQQRG 460
Y V +P I+++F G +L +D G LV E +VCL A L + ++GN QQR
Sbjct: 389 YDQVNIPTISMYFEGNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRN 448
Query: 461 YEVHYDVAGRRLGFGPGNC 479
V YD ++GF C
Sbjct: 449 QRVLYDAKLSQVGFAKEPC 467
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 153/463 (33%), Positives = 229/463 (49%), Gaps = 34/463 (7%)
Query: 31 HSYI-VSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEIL 89
H ++ V ++ P VC+ + L G VS+ ++ R+GPC+ Q S S + L
Sbjct: 26 HGFVAVPTTASEPEPVCSTSGVTLDPGSNTVSVPLVHRHGPCAP-TQLSSDKPSSFTDRL 84
Query: 90 RRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTG-IVAADEYYIVVAIGKPKQYVSLL 148
RR++ R SR + + D+ + P G V + EY + V +G P LL
Sbjct: 85 RRNRARSKYIMSRVSKGMMGDDAD----VSIPTHLGGSVDSLEYVVTVGLGTPSVSQVLL 140
Query: 149 LDTGSGITWTQCKPC--IHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDK 206
+DTGS ++W QC+PC C Q+DP FDPSKS T++ IPCN+ C+ L + G
Sbjct: 141 IDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYAPIPCNTDACRDLTDDGYGGG--- 197
Query: 207 CSS----KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQN 262
C+S +C + I Y DGS G ++ + + + A F GC + G +
Sbjct: 198 CASGDGAAQCGFAITYGDGSQTRGVYSNETLALAP-----GVAVKDFRFGCGHDQDGAND 252
Query: 263 GASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIV 319
G++GL P S++ +T Y F YCL + G++ G + V + V
Sbjct: 253 KYDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNNQVGFLALGGGGAPSGGVVNTSGFV 312
Query: 320 TTP---EQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSA 376
TP E+ FY + +TGI+VGGE + + S F+ IDSGT++T Y+AL++A
Sbjct: 313 FTPMIREEETFYVVNMTGITVGGEPIDVPPSAFSG-GMIIDSGTVVTELQHTAYNALQAA 371
Query: 377 FRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQ 436
FRK M Y + + E DTCYD S Y V +PK+ + F GG ++LDV ++++
Sbjct: 372 FRKAMAAYPLVRNGE--LDTCYDFSGYSNVTLPKVALTFSGGATIDLDVPNGILLDD--- 426
Query: 437 VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL F D +LGNV QR EV YD R+GF C
Sbjct: 427 -CLAFQESGPDDQPGILGNVNQRTLEVLYDAGRGRVGFRAAVC 468
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 218 bits (555), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 143/433 (33%), Positives = 231/433 (53%), Gaps = 27/433 (6%)
Query: 58 GKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLH-LKNSRRLQKAIPDNFKKTK 116
G + LE+ R G CS+ +R L++ L D R+ ++N R + + ++ +++
Sbjct: 61 GAIVLEMKDR-GYCSERKINWNR---KLQKQLIFDDLRVRSMQNRIRAKVSGHNSSEQSS 116
Query: 117 AFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDP 176
P +GI YIV IG Q +++++DTGS +TW QC PC+ C Q+ P F+P
Sbjct: 117 EIQIPLASGINLETLNYIV-TIGLGNQNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNP 175
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE---CPYDIAYVDGSGETGFWATDRM 233
S S +++ + CNS+TC+ L F + C S C + ++Y DGS G + +
Sbjct: 176 SNSSSYNSLLCNSSTCQNL--QFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHL 233
Query: 234 TIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL 290
+ G + F+ GC NN G G SGIMGL R +S+IS+TN ++ F YCL
Sbjct: 234 SF------GGISVSNFVFGCGRNNKGLFGGVSGIMGLGRSNLSMISQTNTTFGGVFSYCL 287
Query: 291 -HSPYGSTGYITFGKPDTVNKKF--VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKAS 347
+ G++G + G ++ K + YT +V+ P+ S FY + LTGI VGG + ++ +
Sbjct: 288 PTTDSGASGSLVIGNESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGG--VAIQDT 345
Query: 348 YFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV 407
F IDSGT+ITR +Y+AL++ F K+ Y + + + DTC++L+ + V
Sbjct: 346 SFGNGGILIDSGTVITRLAPSLYNALKAEFLKQFSGYPIAPALS-ILDTCFNLTGIEEVS 404
Query: 408 VPKITIHFLGGVDLELDVRGTLVV-ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYD 466
+P +++HF VDL +D G L + + QVCL A L + + ++GN QQR V YD
Sbjct: 405 IPTLSMHFENNVDLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIYD 464
Query: 467 VAGRRLGFGPGNC 479
++GF +C
Sbjct: 465 AKQSKIGFAREDC 477
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 218 bits (555), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 149/445 (33%), Positives = 227/445 (51%), Gaps = 29/445 (6%)
Query: 45 VCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRL-HLKNSRR 103
VC+ R A+ ++ + R+GPCS + K R P+ EE+L+RDQ R H++
Sbjct: 38 VCSE-RNAISSSLSGTTVALNHRHGPCSPVPSSKKR--PTEEELLKRDQLRAEHIQRKFA 94
Query: 104 LQKAI--PDNFKKTK-AFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQ 159
+ A+ + +++K + + P K G + EY I V +G P ++ +DTGS ++W Q
Sbjct: 95 MNAAVDGAGDLQQSKVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQ 154
Query: 160 CKPCIH--CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIA 217
C PC + C Q FDP+KS T+ + C + C L + G ++ EC Y +
Sbjct: 155 CNPCPNPPCHAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCG---ATNYECQYGVQ 211
Query: 218 YVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSI 277
Y DGS G ++ D +T+ + A F GC+ +G + G+MGL G S+
Sbjct: 212 YGDGSTTNGTYSRDTLTLSGASD----AVKGFQFGCSHLESGFSDQTDGLMGLGGGAQSL 267
Query: 278 ISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTG 334
+S+T +Y F YCL GS+G++T G + T ++ + + FY L
Sbjct: 268 VSQTAAAYGNSFSYCLPPTSGSSGFLTLGGGGGASGFVT--TRMLRSKQIPTFYGARLQD 325
Query: 335 ISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF 394
I+VGG++L L S F S +DSGTIITR P YSAL SAF+ MK+Y+ +
Sbjct: 326 IAVGGKQLGLSPSVFAAGSV-VDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPA-RSIL 383
Query: 395 DTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLG 454
DTC+D + + +P + + F GG ++LD G + CL FA D + ++G
Sbjct: 384 DTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIMYGN-----CLAFAATGDDGTTGIIG 438
Query: 455 NVQQRGYEVHYDVAGRRLGFGPGNC 479
NVQQR +EV YDV LGF G C
Sbjct: 439 NVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 218 bits (554), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 162/461 (35%), Positives = 232/461 (50%), Gaps = 54/461 (11%)
Query: 36 SVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQR 95
+VSSL+P C+ + QG L + +YGPCS G S+ PS +EI RD+ R
Sbjct: 44 TVSSLLPKNKCSASARGGSQG-----LPITQKYGPCS--GSGHSQ-PPSPQEIFGRDESR 95
Query: 96 LHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIV-VAIGKPKQYVSLLLDTGSG 154
+ NS+ + N K + D ++V VA G P Q L+LDTGS
Sbjct: 96 VSFINSK-CNQYTSGNLK-----NHAHNNNLFDEDGNFLVDVAFGTPPQKFKLILDTGSS 149
Query: 155 ITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPY 214
ITWTQCK C+HC + FD S T+S C +T Y
Sbjct: 150 ITWTQCKACVHCLKDSHRHFDSLASSTYSFGSCIPSTVG------------------NTY 191
Query: 215 DIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-QNGASGIMGLDRG 273
++ Y D S G + D MT++ + F ++ F GC NN GD +GA G++GL +G
Sbjct: 192 NMTYGDKSTSVGNYGCDTMTLEPSD---VFQKFQF--GCGRNNEGDFGSGADGMLGLGQG 246
Query: 274 PVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTP-----EQS 325
+S +S+T + F YCL S G + FG+ T +K+T +V P E+S
Sbjct: 247 QLSTVSQTASKFKKVFSYCLPEE-NSIGSLLFGEKATSQSSSLKFTSLVNGPGTSGLEES 305
Query: 326 EFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYK 385
+Y + L ISVG +RL + +S F T IDSGT+ITR P YSAL++AF+K M KY
Sbjct: 306 GYYFVKLLDISVGNKRLNIPSSVFASPGTIIDSGTVITRLPQRAYSALKAAFKKAMAKYP 365
Query: 386 MGKG---IEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFA 442
+ G D+ DTCY+LS K V++P+ +HF G D+ L+ + + ++CL FA
Sbjct: 366 LSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHFGDGADVRLNGKRVVWGNDASRLCLAFA 425
Query: 443 ---LLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+P ++GN QQ V YD+ GRR+GFG C+
Sbjct: 426 GNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRIGFGGNGCS 466
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 218 bits (554), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 137/407 (33%), Positives = 212/407 (52%), Gaps = 28/407 (6%)
Query: 90 RRDQQRLHLKNSR------RLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQ 143
RR Q++L + R R+++ + + + P +GI YIV +G
Sbjct: 16 RRLQKQLISDDLRVRSMQNRIRRVVSSHNVEASQTQIPLSSGINLQTLNYIV-TMGLGST 74
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
+++++DTGS +TW QC+PC+ C Q+ P F PS S ++ + CNS+TC+ L F
Sbjct: 75 NMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSL--QFATGN 132
Query: 204 QDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ 261
C S C Y + Y DGS G ++++ G + F+ GC NN G
Sbjct: 133 TGACGSNPSTCNYVVNYGDGSYTNGELGVEQLSF------GGVSVSDFVFGCGRNNKGLF 186
Query: 262 NGASGIMGLDRGPVSIISKTNISY---FFYCL-HSPYGSTGYITFGKPDTVNKKF--VKY 315
G SG+MGL R +S++S+TN ++ F YCL + G++G + G +V K + Y
Sbjct: 187 GGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTESGASGSLVMGNESSVFKNVTPITY 246
Query: 316 TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRS 375
T ++ P+ S FY + LTGI V G + L+ F IDSGT+ITR P+ VY AL++
Sbjct: 247 TRMLPNPQLSNFYILNLTGIDVDG--VALQVPSFGNGGVLIDSGTVITRLPSSVYKALKA 304
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVV--ES 433
F K+ + G + DTC++L+ Y V +P I++HF G +L++D GT V E
Sbjct: 305 LFLKQFTGFPSAPGFS-ILDTCFNLTGYDEVSIPTISMHFEGNAELKVDATGTFYVVKED 363
Query: 434 VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
QVCL A L ++ ++GN QQR V YD ++GF +C+
Sbjct: 364 ASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEESCS 410
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 218 bits (554), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 154/460 (33%), Positives = 229/460 (49%), Gaps = 35/460 (7%)
Query: 35 VSVSSLIPPTVCNRTRTALPQGPGKVS--LEVLGRYGPCSKLNQGKSRNTPSLEEILRRD 92
VS +S +P + C+ PQ S L + R+GPC+ ++ S PS+ + LR D
Sbjct: 39 VSAASFVPSSTCSSPDPVPPQRRNGTSAVLRLTHRHGPCAP-SRASSLAAPSVADTLRAD 97
Query: 93 QQRLHLKNSRRLQKAIP---DNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLL 148
Q+R RR+ P D+ A T PA G + Y + ++G P ++
Sbjct: 98 QRRAEYIL-RRVSGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLGTPGVAQTME 156
Query: 149 LDTGSGITWTQCKPCI---HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
+DTGS ++W QCKPC C Q+DP FDP++S +++ +PC C L +
Sbjct: 157 VDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYA----AS 212
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGAS 265
CS+ +C Y ++Y DGS TG +++D +T+ + A F GC +G NG
Sbjct: 213 ACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS-----AVQGFFFGCGHAQSGLFNGVD 267
Query: 266 GIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFG--KPDTVNKKFVKYTPIVT 320
G++GL R S++ +T +Y F YCL + + GY+T G P F T ++
Sbjct: 268 GLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGF-STTQLLP 326
Query: 321 TPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKR 380
+P +Y + LTGISVGG++L + AS F + T++TR P Y+ALRSAFR
Sbjct: 327 SPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTG-TVVTRLPPTAYAALRSAFRSG 385
Query: 381 MKKYKMGKGIED-LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCL 439
M Y + + DTCY+ + Y TV +P + + F G + L G L CL
Sbjct: 386 MASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGILSFG-----CL 440
Query: 440 GFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
FA SD +LGNVQQR +EV D G +GF P +C
Sbjct: 441 AFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 478
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 217 bits (553), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 140/408 (34%), Positives = 209/408 (51%), Gaps = 28/408 (6%)
Query: 90 RRDQQRLHLKNSR------RLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQ 143
RR Q++L L + R R+++ + + P +GI YIV +G +
Sbjct: 16 RRLQKQLILDDLRVRSMQNRIRRVASTHNVEASQTQIPLSSGINLQTLNYIVT-MGLGSK 74
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
+++++DTGS +TW QC+PC+ C Q+ P F PS S ++ + CNS+TC+ L F
Sbjct: 75 NMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSL--QFATGN 132
Query: 204 QDKCSSKE---CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD 260
C S C Y + Y DGS G + ++ G + F+ GC NN G
Sbjct: 133 TGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSF------GGVSVSDFVFGCGRNNKGL 186
Query: 261 QNGASGIMGLDRGPVSIISKTNISY---FFYCL-HSPYGSTGYITFGKPDTV--NKKFVK 314
G SG+MGL R +S++S+TN ++ F YCL + GS+G + G +V N +
Sbjct: 187 FGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSSGSLVMGNESSVFKNANPIT 246
Query: 315 YTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALR 374
YT +++ P+ S FY + LTGI VGG L S F IDSGT+ITR P+ VY AL+
Sbjct: 247 YTRMLSNPQLSNFYILNLTGIDVGGVALKAPLS-FGNGGILIDSGTVITRLPSSVYKALK 305
Query: 375 SAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVV--E 432
+ F K+ + G + DTC++L+ Y V +P I++ F G L +D GT V E
Sbjct: 306 AEFLKKFTGFPSAPGFS-ILDTCFNLTGYDEVSIPTISLRFEGNAQLNVDATGTFYVVKE 364
Query: 433 SVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
QVCL A L ++ ++GN QQR V YD ++GF C+
Sbjct: 365 DASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEEPCS 412
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 217 bits (553), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 140/419 (33%), Positives = 206/419 (49%), Gaps = 33/419 (7%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVV-----AIG 139
L +L D+ R + RR K ++ + P +GI Y+ + G
Sbjct: 97 LRRLLAADESRANSFQPRR-NKDRASASTQSASAEVPLTSGIRLQTLNYVTTISLGGSSG 155
Query: 140 KPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWF 199
P +++++DTGS +TW QCKPC C QRDP FDP+ S T++ + CN++ C L
Sbjct: 156 SPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACADSLRAA 215
Query: 200 PPN----GQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTD 255
G S++C Y +AY DGS G ATD + + + G F+ GC
Sbjct: 216 TGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGASLGG------FVFGCGL 269
Query: 256 NNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYG--STGYITFGKPDTVNK 310
+N G G +G+MGL R +S++S+T Y F YCL + ++G ++ G D
Sbjct: 270 SNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGDDAAS 329
Query: 311 KF-----VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRF 365
+ V YT ++ P Q FY + +TG +VGG L A + IDSGT+ITR
Sbjct: 330 SYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGT--ALAAQGLGASNVLIDSGTVITRL 387
Query: 366 PAPVYSALRSAFRKRM--KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLEL 423
VY A+R+ F ++ Y G + DTCYDL+ + V VP +T+ GG D+ +
Sbjct: 388 APSVYRAVRAEFMRQFGAAGYPAAPGFS-ILDTCYDLTGHDEVKVPLLTLRLEGGADVTV 446
Query: 424 DVRGTLVV--ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
D G L V + QVCL A L + + ++GN QQ+ V YD G RLGF +CN
Sbjct: 447 DAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGNYQQKNKRVVYDTLGSRLGFADEDCN 505
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 139/419 (33%), Positives = 202/419 (48%), Gaps = 27/419 (6%)
Query: 68 YGPCSKLNQGKSRN-TPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI 126
+G CS L S + + + RD RL+ S+ +N + P + G
Sbjct: 79 HGACSPLRPINSSSWIDMVSQSFDRDNDRLNTIWSK-------NNGTYSTMSNLPLQPGS 131
Query: 127 VAADEYYIVVA-IGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKI 185
YIV A G P + L++DTGS +TW QCKPC C Q DP F+P +S ++ +
Sbjct: 132 KVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQSSSYKHL 191
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C S+ C L + C C Y+I Y DGS G ++ + +T+ G+ F
Sbjct: 192 SCLSSACTELTT------MNHCRLGGCVYEINYGDGSRSQGDFSQETLTL----GSDSFP 241
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITF 302
+ F GC NTG G++G++GL R +S S+T Y F YCL ST +F
Sbjct: 242 SFAF--GCGHTNTGLFKGSAGLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTSTGSF 299
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTII 362
+ P+V+ FY + L GISVGGERL + + + T +DSGT+I
Sbjct: 300 SVGQGSIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGGTIVDSGTVI 359
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
TR Y AL+++FR + + K + DTCYDLS+Y V +P IT HF D+
Sbjct: 360 TRLVPQAYDALKTSFRSKTRNLPSAKPFS-ILDTCYDLSSYSQVRIPTITFHFQNNADVA 418
Query: 423 LDVRGTL--VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ G L + QVCL FA ++ ++GN QQ+ V +D R+GF PG+C
Sbjct: 419 VSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRIGFAPGSC 477
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 148/445 (33%), Positives = 225/445 (50%), Gaps = 29/445 (6%)
Query: 45 VCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRL-HLKNSRR 103
VC+ R A+ ++ + R+GPCS + K R P+ EE+L+RDQ R H++
Sbjct: 38 VCSE-RNAISSSLSGTTVALNHRHGPCSPVPSSKKR--PTEEELLKRDQLRAEHIQRKFA 94
Query: 104 LQKAI--PDNFKKTK-AFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQ 159
+ A+ + +++K + + P K G + EY I V +G P ++ +DTGS ++W Q
Sbjct: 95 MNAAVDGAGDLQQSKVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQ 154
Query: 160 CKPCIH--CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIA 217
C PC + C Q FDP+KS T+ + C + C L + G ++ EC Y +
Sbjct: 155 CNPCPNPPCYAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCG---ATNYECQYGVQ 211
Query: 218 YVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSI 277
Y DGS G ++ D +T+ + A F GC+ +G + G+MGL G S+
Sbjct: 212 YGDGSTTNGTYSRDTLTLSGASD----AVKGFQFGCSHVESGFSDQTDGLMGLGGGAQSL 267
Query: 278 ISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTG 334
+S+T +Y F YCL GS+G++T T ++ + + FY L
Sbjct: 268 VSQTAAAYGNSFSYCLPPTSGSSGFLT--LGGGGGVSGFVTTRMLRSRQIPTFYGARLQD 325
Query: 335 ISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF 394
I+VGG++L L S F S +DSGTIITR P YSAL SAF+ MK+Y+ +
Sbjct: 326 IAVGGKQLGLSPSVFAAGSV-VDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPA-RSIL 383
Query: 395 DTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLG 454
DTC+D + + +P + + F GG ++LD G + CL FA D + ++G
Sbjct: 384 DTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIMYGN-----CLAFAATGDDGTTGIIG 438
Query: 455 NVQQRGYEVHYDVAGRRLGFGPGNC 479
NVQQR +EV YDV LGF G C
Sbjct: 439 NVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 141/379 (37%), Positives = 205/379 (54%), Gaps = 28/379 (7%)
Query: 112 FKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQR 170
F + A P +G+ + EY+ V IG P + + ++LDTGS +TW QC+PC C QQ
Sbjct: 145 FAASAAIQGPVVSGVGQGSGEYFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQS 204
Query: 171 DPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWAT 230
DP FDPS S +++ + C+S C+ L N ++ C Y++AY DGS G +AT
Sbjct: 205 DPVFDPSLSASYAAVSCDSQRCRDLDTAACRN-----ATGACLYEVAYGDGSYTVGDFAT 259
Query: 231 DRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL 290
+ +T+ + G A +GC +N G GA+G++ L GP+S S+ + S F YCL
Sbjct: 260 ETLTLGDSTPVGNVA-----IGCGHDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCL 314
Query: 291 ---HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKAS 347
SP ST + FG D + P+V +P S FY++ L+GISVGG+ L + AS
Sbjct: 315 VDRDSPAAST--LQFG--DGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPAS 370
Query: 348 YFTKLSTE------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS 401
F +T +DSGT +TR + Y+ALR AF + G+ LFDTCYDLS
Sbjct: 371 AFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVS-LFDTCYDLS 429
Query: 402 AYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRG 460
+V VP +++ F GG L L + L+ V+ CL FA P++ ++GNVQQ+G
Sbjct: 430 DRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFA--PTNAAVSIIGNVQQQG 487
Query: 461 YEVHYDVAGRRLGFGPGNC 479
V +D A +GF P C
Sbjct: 488 TRVSFDTARGAVGFTPNKC 506
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 137/402 (34%), Positives = 198/402 (49%), Gaps = 26/402 (6%)
Query: 88 ILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSL 147
++ RD R+ R + P + + P + EY++ V +G P L
Sbjct: 88 LVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGVDD--GSGEYFVRVGVGSPPTDQYL 145
Query: 148 LLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKC 207
++D+GS + W QC+PC C Q DP FDP+ S +FS + C S C+ L
Sbjct: 146 VVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICRTLSGT---GCGGGG 202
Query: 208 SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGI 267
+ +C Y + Y DGS G A + +T+ G A +GC N+G GA+G+
Sbjct: 203 DAGKCDYSVTYGDGSYTKGELALETLTL------GGTAVQGVAIGCGHRNSGLFVGAAGL 256
Query: 268 MGLDRGPVSIISKTNIS---YFFYCLHS-PYGSTGYITFGKPDTVNKKFVKYTPIVTTPE 323
+GL G +S++ + + F YCL S G G + G+ + V V + P+V +
Sbjct: 257 LGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGSLVLGRTEAVPVGAV-WVPLVRNNQ 315
Query: 324 QSEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITRFPAPVYSALRSAF 377
S FY++ LTGI VGGERLPL+ S F +L+ + +D+GT +TR P Y+ALR AF
Sbjct: 316 ASSFYYVGLTGIGVGGERLPLQDSLF-QLTEDGAGGVVMDTGTAVTRLPREAYAALRGAF 374
Query: 378 RKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV 437
M + L DTCYDLS Y +V VP ++ +F G L L R LV
Sbjct: 375 DGAMGALPRSPAVS-LLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVF 433
Query: 438 CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL FA PS +LGN+QQ G ++ D A +GFGP C
Sbjct: 434 CLAFA--PSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 473
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 159/485 (32%), Positives = 243/485 (50%), Gaps = 31/485 (6%)
Query: 8 FLLFIWLLRSSNNGAYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQ---GPGKVSLEV 64
LL + LL S + A N+ H ++V ++ T N + PQ P + S+ +
Sbjct: 6 MLLCVLLLCSYSLTALGGGNE-QHGFVVVPTTTGTSTSSNPACSPAPQVTSDPNRASMPL 64
Query: 65 LGRYGPCSKLNQGKSRNTPSLEEILRRDQQRL-HLKNSRRLQKAIPDNFKKTKAFTFPAK 123
R+GPC+ + + PSL E LRRD+ R H+ + + P
Sbjct: 65 AHRHGPCAP---ATTSSWPSLAERLRRDRARRDHITRKAKASG----RTTTLSDVSIPTS 117
Query: 124 TGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC--IHCSQQRDPFFDPSKSK 180
G V + EY + + IG P ++L+DTGS ++W QCKPC C Q+DP +DP+ S
Sbjct: 118 LGAAVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASS 177
Query: 181 TFSKIPCNSTTCKILL-EWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVN 239
T++ +PC+S CK L+ + + + + C Y I Y + G ++T+ +T+
Sbjct: 178 TYAPVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTLSP-- 235
Query: 240 GNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS 296
+ F GC G + G++GL P S++S+T +Y F YCL +
Sbjct: 236 ---QVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYCLPPGNST 292
Query: 297 TGYITFGKPDTVNKKF-VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE 355
TG++ G P N +TP+ + PEQ+ FY + LTG+SVGG+ L + + +
Sbjct: 293 TGFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVLSG-GMI 351
Query: 356 IDSGTIITRFPAPVYSALRSAFRKRMKKYK-MGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
IDSGTIIT P YSALR+AFR M Y + +D+ DTCY+ + V VP + +
Sbjct: 352 IDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYNFTGIANVTVPTVALT 411
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
F GG ++LDV +++ Q CL FA SD + ++GNV QR +EV YD +GF
Sbjct: 412 FDGGATIDLDVPSGVLI----QDCLAFAGGASDGDVGIIGNVNQRTFEVLYDSGRGHVGF 467
Query: 475 GPGNC 479
PG C
Sbjct: 468 RPGAC 472
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 141/419 (33%), Positives = 213/419 (50%), Gaps = 26/419 (6%)
Query: 77 GKSRNTPSLEEILRRDQQRLHLKNSRRLQKAI---PDNFKKTKAFTFPAKTGI-VAADEY 132
GKSR + +L D R+ R + D +K P +G + Y
Sbjct: 55 GKSRAEEA-HAVLASDAARVSSLQRRIGSYGLIRSSDAASASKLAQVPVTSGARLRTLNY 113
Query: 133 YIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTC 192
V IG + ++++DT S +TW QC+PC C Q++P FDPS S +++ +PCNS++C
Sbjct: 114 VATVGIGGGE--ATVIVDTASELTWVQCEPCDACHDQQEPLFDPSSSPSYAAVPCNSSSC 171
Query: 193 KILLEWFPPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
L +GQ C + C Y ++Y DGS G A DR+++ + G F+
Sbjct: 172 DALRVATGMSGQ-ACDDQPAACSYTLSYRDGSYSRGVLAHDRLSLAGEDIQG------FV 224
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL-HSPYGSTGYITFGKPD 306
GC +N G G SG+MGL R +S+IS+T + F YCL GS+G + G
Sbjct: 225 FGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPPKESGSSGSLVLGDDA 284
Query: 307 TV--NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKA-SYFTKLSTEIDSGTIIT 363
+V N + YT +V+ P Q FY LTGI+VGGE + S +DSGTIIT
Sbjct: 285 SVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGEDVQSPGFSAGGGGKAIVDSGTIIT 344
Query: 364 RFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLEL 423
VY+A+R+ F ++ +Y + DTC+DL+ + V VP + + F GG ++E+
Sbjct: 345 SLVPSVYAAVRAEFVSQLAEYPQAAPFS-ILDTCFDLTGLREVQVPSLKLVFDGGAEVEV 403
Query: 424 DVRGTL--VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
D +G L V QVCL A L S+ ++ ++GN QQ+ V +D G ++GF C+
Sbjct: 404 DSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQKNLRVIFDTVGSQIGFAQETCD 462
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 137/402 (34%), Positives = 197/402 (49%), Gaps = 26/402 (6%)
Query: 88 ILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSL 147
++ RD R+ R + P + + P + EY++ V +G P L
Sbjct: 88 LVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGVDD--GSGEYFVRVGVGSPPTDQYL 145
Query: 148 LLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKC 207
++D+GS + W QC+PC C Q DP FDP+ S +FS + C S C+ L
Sbjct: 146 VVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICRTLSGT---GCGGGG 202
Query: 208 SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGI 267
+ +C Y + Y DGS G A + +T+ G A +GC N+G GA+G+
Sbjct: 203 DAGKCDYSVTYGDGSYTKGELALETLTL------GGTAVQGVAIGCGHRNSGLFVGAAGL 256
Query: 268 MGLDRGPVSIISKTNIS---YFFYCLHS-PYGSTGYITFGKPDTVNKKFVKYTPIVTTPE 323
+GL G +S+I + + F YCL S G G + G+ + V V + P+V +
Sbjct: 257 LGLGWGAMSLIGQLGGAAGGVFSYCLASRGAGGAGSLVLGRTEAVPVGAV-WVPLVRNNQ 315
Query: 324 QSEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITRFPAPVYSALRSAF 377
S FY++ LTGI VGGERLPL+ F +L+ + +D+GT +TR P Y+ALR AF
Sbjct: 316 ASSFYYVGLTGIGVGGERLPLQDGLF-QLTEDGAGGVVMDTGTAVTRLPREAYAALRGAF 374
Query: 378 RKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV 437
M + L DTCYDLS Y +V VP ++ +F G L L R LV
Sbjct: 375 DGAMGALPRSPAVS-LLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVF 433
Query: 438 CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL FA PS +LGN+QQ G ++ D A +GFGP C
Sbjct: 434 CLAFA--PSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 473
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 138/419 (32%), Positives = 210/419 (50%), Gaps = 37/419 (8%)
Query: 85 LEEILRRDQQR-----LHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAI 138
L +L D+ R L ++N R + ++ + P +GI Y +A+
Sbjct: 137 LRRLLAADESRANSFQLRIRNDRAAAAST-----QSGSAEVPLTSGIRFQTLNYVTTIAL 191
Query: 139 G-----KPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
G P +++++DTGS +TW QCKPC C QRDP FDP+ S T++ + CN++ C
Sbjct: 192 GGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACA 251
Query: 194 ILLEWFPPN-GQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLG 252
L+ G ++ C Y +AY DGS G ATD + + + +G F+ G
Sbjct: 252 ASLKAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGASLDG------FVFG 305
Query: 253 CTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYG--STGYITFGKPDT 307
C +N G G +G+MGL R +S++S+T + Y F YCL + ++G ++ G +
Sbjct: 306 CGLSNRGLFGGTAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSLGGDAS 365
Query: 308 V--NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRF 365
N V YT ++ P Q FY + +TG +VGG L A + IDSGT+ITR
Sbjct: 366 SYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGT--ALAAQGLGASNVLIDSGTVITRL 423
Query: 366 PAPVYSALRSAFRKRMKK--YKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLEL 423
VY +R+ F ++ Y G + DTCYDL+ + V VP +T+ GG ++ +
Sbjct: 424 APSVYRGVRAEFTRQFAAAGYPTAPGFS-ILDTCYDLTGHDEVKVPLLTLRLEGGAEVTV 482
Query: 424 DVRGTLVV--ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
D G L V + QVCL A L + + ++GN QQ+ V YD G RLGF +CN
Sbjct: 483 DAAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCN 541
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 159/443 (35%), Positives = 226/443 (51%), Gaps = 52/443 (11%)
Query: 37 VSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRL 96
VSSL+P C+ + QG L + +YGPCS G S+ PS +EI RD+ R+
Sbjct: 80 VSSLLPKNKCSASARGGSQG-----LPITQKYGPCS--GSGHSQ-PPSPQEIFGRDESRV 131
Query: 97 HLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGIT 156
NS+ Q A P+N K P + + VA G P Q +L+LDTGS IT
Sbjct: 132 SFINSKFNQYA-PENLKDHT----PNNKLFDEDGNFLVDVAFGTPPQKFTLILDTGSSIT 186
Query: 157 WTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDI 216
WTQCKPC+ C + FDPS S T+S C +T Y++
Sbjct: 187 WTQCKPCVRCLKASRRHFDPSASLTYSLGSCIPSTVGNT------------------YNM 228
Query: 217 AYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-QNGASGIMGLDRGPV 275
Y D S G + D MT++ + F ++ F GC NN GD +GA G++GL +G +
Sbjct: 229 TYGDKSTSVGNYGCDTMTLEHSD---VFPKFQF--GCGRNNEGDFGSGADGMLGLGQGQL 283
Query: 276 SIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTP-----EQSEF 327
S +S+T + F YCL S G + FG+ T +K+T +V P E+S +
Sbjct: 284 STVSQTASKFKKVFSYCLPEE-DSIGSLLFGEKATSQSSSLKFTSLVNGPGTSGLEESGY 342
Query: 328 YHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMG 387
Y + L ISVG +RL + +S F T IDSGT+ITR P YSAL++AF+K M KY +
Sbjct: 343 YFVKLLDISVGNKRLNIPSSVFASPGTIIDSGTVITRLPQRAYSALKAAFKKAMAKYPLS 402
Query: 388 KGIE---DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALL 444
G D+ DTCY+LS K V++P+I +HF G D+ L+ + + ++CL FA
Sbjct: 403 NGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRVIWGNDASRLCLAFA-- 460
Query: 445 PSDPNSILLGNVQQRGYEVHYDV 467
+ ++GN QQ V YD+
Sbjct: 461 -GNSELTIIGNRQQVSLTVLYDI 482
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 138/407 (33%), Positives = 212/407 (52%), Gaps = 37/407 (9%)
Query: 101 SRRLQKAIPDN--FKKTKAFTFPAKTGIVAADEYYI-----------VVAIGKPKQYVSL 147
+R + AI N F K+ FP +T ++ + I +V +G Q +L
Sbjct: 99 NRIILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTL 158
Query: 148 LLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDK- 206
++DTGS +TW QC PC C Q++P F+PS S +F +PCNS TC L P G
Sbjct: 159 IVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQ---PTAGSSGL 215
Query: 207 CSSKE---CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNG 263
CS+K C Y I Y DGS G +++T+ + + F+ GC NN G G
Sbjct: 216 CSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDN------FIFGCGRNNKGLFGG 269
Query: 264 ASGIMGLDRGPVSIISKTNI---SYFFYCL-HSPYGSTGYITFGKPDTVNKKF---VKYT 316
ASG+MGL R +S++S+T+ S F YCL + GS+G +T G D N K + YT
Sbjct: 270 ASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYT 329
Query: 317 PIVTTPEQSEFYHITLTGISVGGERLPL-KASYFTKLSTEIDSGTIITRFPAPVYSALRS 375
++ P+ S FY + LTGIS+GG L + + S + + +DSGT+ITR +Y A ++
Sbjct: 330 RMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKA 389
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGT--LVVES 433
F K+ Y+ G + +TC++L+ Y+ V +P + F G ++ +DV G V
Sbjct: 390 EFEKQFSGYRTTPGF-SILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSD 448
Query: 434 VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
Q+CL FA L + ++++GN QQ+ V Y+ ++GF C+
Sbjct: 449 ASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 495
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 138/407 (33%), Positives = 212/407 (52%), Gaps = 37/407 (9%)
Query: 101 SRRLQKAIPDN--FKKTKAFTFPAKTGIVAADEYYI-----------VVAIGKPKQYVSL 147
+R + AI N F K+ FP +T ++ + I +V +G Q +L
Sbjct: 20 NRIILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTL 79
Query: 148 LLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDK- 206
++DTGS +TW QC PC C Q++P F+PS S +F +PCNS TC L P G
Sbjct: 80 IVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQ---PTAGSSGL 136
Query: 207 CSSKE---CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNG 263
CS+K C Y I Y DGS G +++T+ + + F+ GC NN G G
Sbjct: 137 CSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDN------FIFGCGRNNKGLFGG 190
Query: 264 ASGIMGLDRGPVSIISKTNI---SYFFYCL-HSPYGSTGYITFGKPDTVNKKF---VKYT 316
ASG+MGL R +S++S+T+ S F YCL + GS+G +T G D N K + YT
Sbjct: 191 ASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYT 250
Query: 317 PIVTTPEQSEFYHITLTGISVGGERLPL-KASYFTKLSTEIDSGTIITRFPAPVYSALRS 375
++ P+ S FY + LTGIS+GG L + + S + + +DSGT+ITR +Y A ++
Sbjct: 251 RMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKA 310
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGT--LVVES 433
F K+ Y+ G + +TC++L+ Y+ V +P + F G ++ +DV G V
Sbjct: 311 EFEKQFSGYRTTPGFS-ILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSD 369
Query: 434 VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
Q+CL FA L + ++++GN QQ+ V Y+ ++GF C+
Sbjct: 370 ASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 416
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 149/482 (30%), Positives = 230/482 (47%), Gaps = 34/482 (7%)
Query: 8 FLLFIWLLRSSNNGAYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGR 67
LL ++ + + A D ++S SSL P VC + G ++ + R
Sbjct: 7 LLLLPCIIMITYHALVARAGDEKSYKVLSASSLKPGAVCAEPKVRDSSSSG-ATVPLNHR 65
Query: 68 YGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIP--DNFKKTKAFTFPAKTG 125
+GPCS + GK + P+ E+LRRDQ R + + + P ++++A A
Sbjct: 66 HGPCSPVPSGKKKQ-PTFTELLRRDQLRANYIQRQFSDEHYPRTGGLQQSEATVPIALGS 124
Query: 126 IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKI 185
++ EY I V+IG P ++ +DTGS ++W +CK +DP S T++
Sbjct: 125 LLNTLEYVITVSIGSPAVAXTMFIDTGSDVSWLRCK---------SRLYDPGTSSTYAPF 175
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C++ C L G S C Y + Y DGS TG + +D +T+ G
Sbjct: 176 SCSAPACAQLGR----RGTGCSSGSTCVYSVKYGDGSNTTGTYGSDTLTLA---GTSEPL 228
Query: 246 RYPFLLGCTDNNTG-DQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYIT 301
F GC+ G +++ G+MGL S +S+T +Y F YCL + S+G++T
Sbjct: 229 ISGFQFGCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCLPPTWNSSGFLT 288
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTI 361
G P + TP++ + + + FY + L GISVGG+ L + +S F+ S +DSGT+
Sbjct: 289 LGAPSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVFSAGSI-VDSGTV 347
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGI-EDLFDTCYDLSAY---KTVVVPKITIHFLG 417
ITR P Y AL +AFR M +Y+ L DTC+D + + VP + + G
Sbjct: 348 ITRLPPTAYGALSAAFRDGMARYQYQPAAPRGLLDTCFDFTGHGEGNNFTVPSVALVLDG 407
Query: 418 GVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
G ++L G V+ CL FA D + ++GNVQQR +EV YDV GF PG
Sbjct: 408 GAVVDLHPNGI-----VQDGCLAFAATDDDGRTGIIGNVQQRTFEVLYDVGQSVFGFRPG 462
Query: 478 NC 479
C
Sbjct: 463 AC 464
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 126/363 (34%), Positives = 193/363 (53%), Gaps = 28/363 (7%)
Query: 135 VVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
V +G ++++DT S +TW QC+PC C Q+DP FDPS S +++ +PCNS++C
Sbjct: 121 VATVGLGAAEATVVVDTASELTWVQCQPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDA 180
Query: 195 LLEWFP----PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTI--QEVNGNGYFARYP 248
L P D C Y ++Y DGS G A D++ + Q++ G
Sbjct: 181 LRVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRLAGQDIEG-------- 232
Query: 249 FLLGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISY---FFYCL-HSPYGSTGYITFG 303
F+ GC +N G G SG+MGL R VS++S+T + F YCL GS+G + G
Sbjct: 233 FVFGCGTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGGVFSYCLPMRESGSSGSLVLG 292
Query: 304 KPDTV--NKKFVKYTPIVTT--PEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ N + YT +V+ P Q FY + LTGI+VGG+ +++ +F+ IDSG
Sbjct: 293 DDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQE--VESPWFSAGRVIIDSG 350
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
TIIT VY+A+R+ F ++ +Y + DTC++L+ K V VP + F G V
Sbjct: 351 TIITTLVPSVYNAVRAEFLSQLAEYPQAPAFS-ILDTCFNLTGLKEVQVPSLKFVFEGSV 409
Query: 420 DLELDVRGTL--VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
++E+D +G L V QVCL A L S+ ++ ++GN QQ+ V +D G ++GF
Sbjct: 410 EVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNYQQKNLRVIFDTLGSQIGFAQE 469
Query: 478 NCN 480
C+
Sbjct: 470 TCD 472
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 153/479 (31%), Positives = 233/479 (48%), Gaps = 47/479 (9%)
Query: 9 LLFIWLLRSSNNGAYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRY 68
LL + L + A+A D+ ++ +++V SL VC+ T P ++ + RY
Sbjct: 17 LLLVLLCGYYSGVAFAADDARTYK-VLAVGSLKAEVVCSVT----PASSSGTTVPLNHRY 71
Query: 69 GPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIP--DNFKKTKAFTFPAKTG- 125
GPCS K P++ E+L DQ R ++ +Q+ + D + T P G
Sbjct: 72 GPCSPAPSAK---VPTILELLEHDQLR-----AKYIQRKLSGTDGLQPLD-LTVPTTLGS 122
Query: 126 IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKI 185
+ EY I V IG P ++++DTGS ++W +C S FDPSKS T++
Sbjct: 123 ALDTMEYVITVGIGSPAVTQTMMIDTGSDVSWVRCN-----STDGLTLFDPSKSTTYAPF 177
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C+S C L N D CS+ C Y + Y DGS TG +++D + + +
Sbjct: 178 SCSSAACAQL-----GNNGDGCSNSGCQYRVQYGDGSNTTGTYSSDTLALSASD-----T 227
Query: 246 RYPFLLGCTDNNTG-DQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYIT 301
F GC+ + D G+MGL S++S+T +Y F YCL ++G++T
Sbjct: 228 VTDFHFGCSHHEEDFDGEKIDGLMGLGGDAQSLVSQTAATYGKSFSYCLPPTNRTSGFLT 287
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTI 361
FG P+ + FV TP++ P+ Y + L ISVGG L ++ S + S +DSGT+
Sbjct: 288 FGAPNGTSGGFVT-TPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVLSNGSV-MDSGTV 345
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIE-DLFDTCYDLSAYKTVVVPKITIHFLGGVD 420
IT P YSAL SAFR M + + + + DTCYD + V +P +++ GG
Sbjct: 346 ITWLPRRAYSALSSAFRSSMTRLRHQRAAPLGILDTCYDFTGLVNVSIPAVSLVLDGGAV 405
Query: 421 LELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++LD G ++ Q CL FA D ++GNVQQR +EV +DV GF G C
Sbjct: 406 VDLDGNGIMI-----QDCLAFAATSGDS---IIGNVQQRTFEVLHDVGQGVFGFRSGAC 456
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 141/430 (32%), Positives = 220/430 (51%), Gaps = 29/430 (6%)
Query: 59 KVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRL-HLKNSRRLQKAIPDNFKKTKA 117
+VS+ + R GPCS + + + E+LRRD++R ++ + + DN A
Sbjct: 60 RVSVPLAHRNGPCSPV---RGKGELPRAEMLRRDRERTEYIIRRASRSRRLQDN---NDA 113
Query: 118 FTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC--IHCSQQRDPFF 174
+ P + G + EY V +G P +L+LDTGS +TW QCKPC C QR P F
Sbjct: 114 VSVPTQLGSSYDSQEYVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLF 173
Query: 175 DPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMT 234
DP+ S ++S +PC+S C+ L +G C Y+I Y G+ G ++TD +T
Sbjct: 174 DPNTSSSYSPVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALT 233
Query: 235 IQEVNGNGYFARYPFLLGCTDNNT-GDQNGASGIMGLDRGPVSIISKTNI----SYFFYC 289
+ G G + F GC + G + A G++GL R P S+ + + F +C
Sbjct: 234 L----GPGAIVKR-FHFGCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGVFSHC 288
Query: 290 LHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF 349
L STG++ G P + FV +TP++T +Q FY + T ISV G+ L + + F
Sbjct: 289 LPPTGVSTGFLALGAPHDTS-AFV-FTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVF 346
Query: 350 TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVP 409
+ DSGT+++ Y+ALR+AFR M +Y + + L DTC++ + Y V VP
Sbjct: 347 -REGVITDSGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHL-DTCFNFTGYDNVTVP 404
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
+++ F GG + LD ++++ CL F D + L+G+V QR EV YD+ G
Sbjct: 405 TVSLTFRGGATVHLDASSGVLMDG----CLAF-WSSGDEYTGLIGSVSQRTIEVLYDMPG 459
Query: 470 RRLGFGPGNC 479
R++GF G C
Sbjct: 460 RKVGFRTGAC 469
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 135/374 (36%), Positives = 191/374 (51%), Gaps = 26/374 (6%)
Query: 117 AFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI---HCSQQRDP 172
A T P ++G + E+ + V +G P Q +L+ DTGS ++W QC+PC HC Q+DP
Sbjct: 128 AVTIPDRSGTYLDTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDP 187
Query: 173 FFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWAT 230
FDPSKS T++ + C C D CS C Y + Y DGS TG +
Sbjct: 188 LFDPSKSSTYAAVHCGEPQCA--------AAGDLCSEDNTTCLYLVRYGDGSSTTGVLSR 239
Query: 231 DRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FF 287
D + + +PF GC N GD G++GL RG +S+ S+ S+ F
Sbjct: 240 DTLALTSSRA---LTGFPF--GCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFS 294
Query: 288 YCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKAS 347
YCL S +TGY+T G + +YT ++ P+ FY + L I +GG LP+ +
Sbjct: 295 YCLPSSNSTTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPA 354
Query: 348 YFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV 407
FT+ T +DSGT++T PA Y+ LR FR M++Y D+ D CYD + VV
Sbjct: 355 VFTRGGTLLDSGTVLTYLPAQAYALLRDRFRLTMERYTPAPP-NDVLDACYDFAGESEVV 413
Query: 408 VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSD--PNSILLGNVQQRGYEVHY 465
VP ++ F G ELD G ++ CL FA + + P SI +GN QQR EV Y
Sbjct: 414 VPAVSFRFGDGAVFELDFFGVMIFLDENVGCLAFAAMDTGGLPLSI-IGNTQQRSAEVIY 472
Query: 466 DVAGRRLGFGPGNC 479
DVA ++GF P +C
Sbjct: 473 DVAAEKIGFVPASC 486
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 138/356 (38%), Positives = 192/356 (53%), Gaps = 26/356 (7%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY++ V IGKP ++LDTGS ++W QC PC C QQ DP FDP S ++S I C++
Sbjct: 148 EYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPVSSNSYSPIRCDAP 207
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
CK L +C + C Y+++Y DGS G +AT+ +T+ G A
Sbjct: 208 QCKSL-------DLSECRNGTCLYEVSYGDGSYTVGEFATETVTL------GTAAVENVA 254
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL-HSPYGSTGYITFGKPDTVN 309
+GC NN G GA+G++GL G +S ++ N + F YCL + + + F P N
Sbjct: 255 IGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVNRDSDAVSTLEFNSPLPRN 314
Query: 310 KKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-----TKLSTEIDSGTIITR 364
V P+ PE FY++ L GISVGGE LP+ S F IDSGT +TR
Sbjct: 315 ---VVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIIDSGTAVTR 371
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELD 424
+ VY ALR AF K K G+ LFDTCYDLS+ ++V VP ++ HF G +L L
Sbjct: 372 LRSEVYDALRDAFVKGAKGIPKANGVS-LFDTCYDLSSRESVQVPTVSFHFPEGRELPLP 430
Query: 425 VRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
R L+ V+SV C FA P+ + ++GNVQQ+G V +D+A +GF +C
Sbjct: 431 ARNYLIPVDSVGTFCFAFA--PTTSSLSIMGNVQQQGTRVGFDIANSLVGFSADSC 484
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 150/407 (36%), Positives = 212/407 (52%), Gaps = 46/407 (11%)
Query: 89 LRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQYVSL 147
L RD R+H NSR F+ +G+ + EY+ + +G P +Y+ +
Sbjct: 78 LHRDTLRVHALNSR------------AAGFSSSVVSGLSQGSGEYFTRLGVGTPPRYLYM 125
Query: 148 LLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKC 207
+LDTGS + W QC PC C Q DP F+P KSK+F+ IPC+S C+ L C
Sbjct: 126 VLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPCSSPLCRRL-------DSSGC 178
Query: 208 SSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGAS 265
S++ C Y ++Y DGS TG +AT+ +T + GN LGC +N G GA+
Sbjct: 179 STRRHTCLYQVSYGDGSFTTGDFATETLTFR---GNKI---AKVALGCGHHNEGLFVGAA 232
Query: 266 GIMGLDRGPVSIISKTNISY---FFYCL--HSPYGSTGYITFGKPDTVNKKFVKYTPIVT 320
G++GL RG +S S+T I + F YCL S + FG D + ++TP++
Sbjct: 233 GLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFG--DAAISRLARFTPLIR 290
Query: 321 TPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITRFPAPVYSALR 374
P+ FY++ L GISVGG R+ + KL + IDSGT +TR P Y+ALR
Sbjct: 291 NPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALR 350
Query: 375 SAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VES 433
AFR + K G LFDTCYDLS +V VP + +HF G D+ L L+ V+
Sbjct: 351 DAFRVGARHLKRGPEFS-LFDTCYDLSGQSSVKVPTVVLHFR-GADMALPATNYLIPVDE 408
Query: 434 VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
C FA S + ++GN+QQ+G+ V YD+AG R+GF P C
Sbjct: 409 NGSFCFAFAGTISGLS--IIGNIQQQGFRVVYDLAGSRIGFAPRGCT 453
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 211 bits (538), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 150/448 (33%), Positives = 220/448 (49%), Gaps = 41/448 (9%)
Query: 45 VCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRL 104
VC+ P G ++ + R+GPCS S P++ E+LRRDQ R ++
Sbjct: 39 VCSEPPVTPPSSSG-TTVPLSHRHGPCSP---APSTVEPTMAELLRRDQLRAKYIQAKLS 94
Query: 105 --QKAIPDNFKKTKAFTFPAKTGIVAAD--EYYIVVAIGKPKQYVSLLLDTGSGITWTQC 160
+ D +++ A T P G A D Y I V+IG P ++++DTGS ++W C
Sbjct: 95 VNSGSGTDGVQQSAAITLPTTLG-SALDTLAYVITVSIGTPAMTQAVMIDTGSDVSWVHC 153
Query: 161 KPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDK-CS-SKECPYDIAY 218
FFDP KS T++ C+S C L G+D CS + C Y + Y
Sbjct: 154 H--ARAGAGSSLFFDPGKSSTYTPFSCSSAACTRL------EGRDNGCSLNSTCQYTVRY 205
Query: 219 VDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTG----DQNGASGIMGLDRGP 274
DGS TG + +D + + N F GC++ + D++ G+MGL G
Sbjct: 206 GDGSNTTGTYGSDTLAL-----NSTEKVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGA 260
Query: 275 VSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHIT 331
S++S+T +Y F YCL + S+G++T G T FV TP+ + FY +
Sbjct: 261 PSLVSQTAATYGSAFSYCLPATTRSSGFLTLGA-STGTSGFVT-TPMFRSRRAPTFYFVI 318
Query: 332 LTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE 391
L GI+VGG+ + + + F S +DSGTIITR P YSAL +AFR M++Y +
Sbjct: 319 LQGINVGGDPVAISPTVFAAGSI-MDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFS 377
Query: 392 DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSI 451
+ DTC+D + V +P + + F GG ++LD G + CL FA SI
Sbjct: 378 -ILDTCFDFTGQDNVSIPAVELVFSGGAVVDLDADGIMYGS-----CLAFAPATGGIGSI 431
Query: 452 LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+GNVQQR +EV +DV LGF PG C
Sbjct: 432 -IGNVQQRTFEVLHDVGQSVLGFRPGAC 458
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 148/441 (33%), Positives = 222/441 (50%), Gaps = 44/441 (9%)
Query: 71 CSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIP-DNFK------KTKAFT---- 119
C + GK R + +LE R + +++++A+ DN + + KA T
Sbjct: 57 CFSRSLGKGRESTTLEMKHRELCSGKTIDWGKKMRRALLLDNIRVQSLQLRIKAMTSSTT 116
Query: 120 --------FPAKTGIVAADEYYIV-VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQR 170
P +GI YIV V +G +SL++DTGS +TW QC+PC C Q+
Sbjct: 117 EQSVSETQIPLTSGIKLETLNYIVTVELGGKN--MSLIVDTGSDLTWVQCQPCRSCYNQQ 174
Query: 171 DPFFDPSKSKTFSKIPCNSTTCKILLEWF----PPNGQDKCSSKECPYDIAYVDGSGETG 226
P +DPS S ++ + CNS+TC+ L+ P G + C Y ++Y DGS G
Sbjct: 175 GPLYDPSVSSSYKTVFCNSSTCQDLVAATGNSGPCGGFNGVVKTTCEYVVSYGDGSYTRG 234
Query: 227 FWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY- 285
A++ + + G + GC NN G GASG+MGL R VS++S+T ++
Sbjct: 235 DLASESIVL------GDTKLENLVFGCGRNNKGLFGGASGLMGLGRSSVSLVSQTLKTFN 288
Query: 286 --FFYCLHS-PYGSTGYITFGKPDTV--NKKFVKYTPIVTTPEQSEFYHITLTGISVGGE 340
F YCL S G++G ++FG +V N V YTP+V P+ FY + LTG S+GG
Sbjct: 289 GVFSYCLPSLEDGASGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGG- 347
Query: 341 RLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL 400
+ LK F + IDSGT+ITR P +Y A+++ F K+ + G + DTC++L
Sbjct: 348 -VELKTLSFGR-GILIDSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGY-SILDTCFNL 404
Query: 401 SAYKTVVVPKITIHFLGGVDLELDVRGT--LVVESVRQVCLGFALLPSDPNSILLGNVQQ 458
++Y+ + +P I + F G +LE+DV G V VCL A L + ++GN QQ
Sbjct: 405 TSYEDISIPTIKMIFEGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQ 464
Query: 459 RGYEVHYDVAGRRLGFGPGNC 479
+ V YD RLG NC
Sbjct: 465 KNQRVIYDTTQERLGIAGENC 485
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 143/438 (32%), Positives = 214/438 (48%), Gaps = 32/438 (7%)
Query: 58 GKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQR---LHLKNSRRLQKAIPDNFKK 114
G S+ + RYGPCS + P+ EE+LRRDQ R + K S A ++ +
Sbjct: 58 GTSSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQS 117
Query: 115 TKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH---CSQQR 170
+K + P G + EY I V +G P +++DTGS ++W QC+PC C
Sbjct: 118 SK-VSVPTTLGSSLDTLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHA 176
Query: 171 DPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWAT 230
FDP+ S T++ C++ C L + NG D + C Y + Y DGS TG +++
Sbjct: 177 GALFDPAASSTYAAFNCSAAACAQLGDSGEANGCD--AKSRCQYIVKYGDGSNTTGTYSS 234
Query: 231 DRMTIQEVNGNGYFARYPFLLGCT--DNNTGDQNGASGIMGLDRGPVSIISKTNISY--- 285
D +T+ +G F GC+ + G + G++GL S++S+T Y
Sbjct: 235 DVLTL-----SGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKS 289
Query: 286 FFYCLHSPYGSTGYITF----GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGER 341
F YCL + S+G++T +F TP++ + + +Y L I+VGG++
Sbjct: 290 FSYCLPATPASSGFLTLGAPASGGGGGASRFAT-TPMLRSKKVPTYYFAALEDIAVGGKK 348
Query: 342 LPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS 401
L L S F S +DSGT+ITR P Y+AL SAFR M +Y + + + DTC++ +
Sbjct: 349 LGLSPSVFAAGSL-VDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPL-GILDTCFNFT 406
Query: 402 AYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGY 461
V +P + + F GG ++LD G V CL FA D +GNVQQR +
Sbjct: 407 GLDKVSIPTVALVFAGGAVVDLDAHGI-----VSGGCLAFAPTRDDKAFGTIGNVQQRTF 461
Query: 462 EVHYDVAGRRLGFGPGNC 479
EV YDV G GF G C
Sbjct: 462 EVLYDVGGGVFGFRAGAC 479
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 133/372 (35%), Positives = 190/372 (51%), Gaps = 22/372 (5%)
Query: 117 AFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI---HCSQQRDP 172
A T P ++G + E+ + V +G P Q +L+ DTGS ++W QC+PC HC Q+DP
Sbjct: 133 AVTIPDRSGTYLDTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDP 192
Query: 173 FFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDR 232
FDPSKS T++ + C C G + C Y + Y DGS TG + D
Sbjct: 193 LFDPSKSSTYAAVHCGEPQCAAA------GGLCSEDNTTCLYLVHYGDGSSTTGVLSRDT 246
Query: 233 MTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYC 289
+ + A +PF GC N GD G++GL RG +S+ S+ S+ F YC
Sbjct: 247 LALTSSRA---LAGFPF--GCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYC 301
Query: 290 LHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF 349
L S +TGY+T G + +YT ++ P+ FY + L I +GG LP+ + F
Sbjct: 302 LPSSNSTTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVF 361
Query: 350 TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVP 409
T+ T +DSGT++T PA Y LR FR M++Y D+ D CYD + V+VP
Sbjct: 362 TRGGTLLDSGTVLTYLPAQAYELLRDRFRLTMERYTPAPP-NDVLDACYDFAGESEVIVP 420
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSD--PNSILLGNVQQRGYEVHYDV 467
++ F G ELD G ++ CL FA + + P SI +GN QQR EV YDV
Sbjct: 421 AVSFRFGDGAVFELDFFGVMIFLDENVGCLAFAAMDAGGLPLSI-IGNTQQRSAEVIYDV 479
Query: 468 AGRRLGFGPGNC 479
A ++GF P +C
Sbjct: 480 AAEKIGFVPASC 491
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 125/362 (34%), Positives = 194/362 (53%), Gaps = 23/362 (6%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKI 185
+ + YY+ V +G P +Y S+++DTGS ++W QCKPC+ +C Q DP FDPS SKT+ +
Sbjct: 8 IGSGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSL 67
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTI---QEVNGNG 242
C S+ C L++ N + SS C Y +Y D S G+ + D +T+ Q + G
Sbjct: 68 SCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPG-- 125
Query: 243 YFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGY 299
F+ GC ++ G A+GI+GL R +S++ + + + F YCL + G G+
Sbjct: 126 ------FVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPT-RGGGGF 178
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
++ GK + K+TP+ T P Y + LT I+VGG L + A+ + ++ T IDSG
Sbjct: 179 LSIGKASLAGSAY-KFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQY-RVPTIIDSG 236
Query: 360 TIITRFPAPVYSALRSAFRKRM-KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGG 418
T+ITR P VY+ + AF K M KY G + DTC+ + VP++ + F GG
Sbjct: 237 TVITRLPMSVYTPFQQAFVKIMSSKYARAPGFS-ILDTCFKGNLKDMQSVPEVRLIFQGG 295
Query: 419 VDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGN 478
DL L L+ CL FA + ++GN QQ+ ++V +D++ R+GF G
Sbjct: 296 ADLNLRPVNVLLQVDEGLTCLAFA---GNNGVAIIGNHQQQTFKVAHDISTARIGFATGG 352
Query: 479 CN 480
CN
Sbjct: 353 CN 354
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 149/460 (32%), Positives = 223/460 (48%), Gaps = 35/460 (7%)
Query: 35 VSVSSLIPPTVCNRTRTALPQGPGKVS--LEVLGRYGPCSKLNQGKSRNTPSLEEILRRD 92
VS +S +P + C+ P S L + R+GPC+ ++ S PS+ + LR D
Sbjct: 39 VSAASFVPSSTCSSPDRVPPHRRNGTSAVLRLTHRHGPCAP-SRASSLAAPSVADTLRAD 97
Query: 93 QQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD----EYYIVVAIGKPKQYVSLL 148
Q+R RR+ P + A D Y + ++G P ++
Sbjct: 98 QRRAEYIL-RRVSGRAPQLWDSKAAAAVATVPASWGYDIGTLNYVVTASLGTPGVAQTME 156
Query: 149 LDTGSGITWTQCKPCI---HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
+DTGS ++W QCKPC C Q+DP FDP++S +++ +PC C L +
Sbjct: 157 VDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYA----AS 212
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGAS 265
CS+ +C Y ++Y DGS TG +++D +T+ + A F GC +G NG
Sbjct: 213 ACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS-----AVQGFFFGCGHAQSGLFNGVD 267
Query: 266 GIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFG--KPDTVNKKFVKYTPIVT 320
G++GL R S++ +T +Y F YCL + + GY+T G P F T ++
Sbjct: 268 GLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGF-STTQLLP 326
Query: 321 TPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKR 380
+P +Y + LTGISVGG++L + AS F + T++TR P Y+ALRSAFR
Sbjct: 327 SPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTG-TVVTRLPPTAYAALRSAFRSG 385
Query: 381 MKKYKMGKGIED-LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCL 439
M Y + + DTCY+ + Y TV +P + + F G + L G L CL
Sbjct: 386 MASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGILSFG-----CL 440
Query: 440 GFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
FA SD +LGNVQQR +EV D G +GF P +C
Sbjct: 441 AFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 478
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 129/356 (36%), Positives = 189/356 (53%), Gaps = 21/356 (5%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNS 189
E+ +VV G P Q +++LDTGS ++W QCKPC HC +Q DP FDP+KS +++ +PC +
Sbjct: 136 EFVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAKSSSYAAVPCGT 195
Query: 190 TTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C C+ C Y + Y DGS TG + D +T N + F + F
Sbjct: 196 PVCAA--------AGGMCNGTTCLYGVQYGDGSSTTGVLSRDTLTF---NSSSKFTGFTF 244
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPD 306
GC + N GD G++GL RG +S+ S+ S+ F YCL S + GY+ G
Sbjct: 245 --GCGEKNIGDFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNTTPGYLNIGATK 302
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFP 366
+ V+YT ++ P+ FY I L I++GG LP+ S FTK T +DSGTI+T P
Sbjct: 303 PTSTVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKTGTLLDSGTILTYLP 362
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVR 426
P Y++LR F+ M+ K E L DTCYD + +V+P ++ +F G +LD
Sbjct: 363 PPAYTSLRDRFKFTMQGNKPAPPYEPL-DTCYDFTGQGAIVIPAVSFNFSDGAVFDLDFY 421
Query: 427 GTLVVESVRQV---CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
G ++ + CL F P+ ++GN QQR EV YDV +++GF P +C
Sbjct: 422 GIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKIGFIPISC 477
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 208 bits (529), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 144/430 (33%), Positives = 225/430 (52%), Gaps = 34/430 (7%)
Query: 74 LNQGKSRNTP-SLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAF--------TFPAKT 124
L+ ++ +P S +++ +D++R+ +SR K N T T P K+
Sbjct: 45 LDSSQTSTSPFSFSDMITKDEERVRFLHSRLTNKESVRNSATTDKLRGGPSLVSTTPLKS 104
Query: 125 GI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTF 182
G+ + + YY+ + +G P +Y S+++DTGS ++W QC+PC I+C Q DP F PS SKT+
Sbjct: 105 GLSIGSGNYYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTY 164
Query: 183 SKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTI--QEVNG 240
+PC+S+ C L ++ C Y +Y D S G+ + D +T+ E
Sbjct: 165 KALPCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSEAPS 224
Query: 241 NGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS- 296
+G F+ GC +N G +SGI+GL +S++ + + Y F YCL S + +
Sbjct: 225 SG------FVYGCGQDNQGLFGRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAP 278
Query: 297 -----TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTK 351
+G+++ G + + K+TP+V + Y + LT I+V G+ L + AS +
Sbjct: 279 NSSSLSGFLSIGASSLTSSPY-KFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSY-N 336
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAFRKRM-KKYKMGKGIEDLFDTCYDLSAYKTVVVPK 410
+ T IDSGT+ITR P VY+AL+ +F M KKY G + DTC+ S + VP+
Sbjct: 337 VPTIIDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFS-ILDTCFKGSVKEMSTVPE 395
Query: 411 ITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR 470
I I F GG LEL +LV CL A S+P SI +GN QQ+ ++V YDVA
Sbjct: 396 IQIIFRGGAGLELKAHNSLVEIEKGTTCLAIA-ASSNPISI-IGNYQQQTFKVAYDVANF 453
Query: 471 RLGFGPGNCN 480
++GF PG C
Sbjct: 454 KIGFAPGGCQ 463
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 207 bits (527), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 132/359 (36%), Positives = 196/359 (54%), Gaps = 23/359 (6%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ + IG P + + ++LDTGS +TW QC PC C Q DP FDP+ S +++ +PC+S
Sbjct: 195 EYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALSSSYATVPCDSP 254
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C+ L N +S C Y++AY DGS G +AT+ +T+ G+G A +
Sbjct: 255 HCRALDASACHNNAANGNSS-CVYEVAYGDGSYTVGDFATETLTL---GGDGSAAVHDVA 310
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL---HSPYGSTGYITFGKPDT 307
+GC +N G GA+G++ L GP+S S+ + + F YCL SP ST + FG D+
Sbjct: 311 IGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATEFSYCLVDRDSPSAST--LQFGASDS 368
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL---PLKASYFTKLSTE---IDSGTI 361
P++ +P + FY++ L GISVGGE L P A + + +DSGT
Sbjct: 369 STVT----APLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVIVDSGTA 424
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
+TR + YSALR AF + + G+ LFDTCYDL+ +V VP +++ F GG +L
Sbjct: 425 VTRLQSSAYSALRDAFVRGTQALPRASGVS-LFDTCYDLAGRSSVQVPAVSLRFEGGGEL 483
Query: 422 ELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+L + L+ V+ CL FA + ++GNVQQ+G V +D A +GF P C
Sbjct: 484 KLPAKNYLIPVDGAGTYCLAFAATGGAVS--IVGNVQQQGIRVSFDTAKNTVGFSPNKC 540
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 207 bits (527), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 122/367 (33%), Positives = 188/367 (51%), Gaps = 11/367 (2%)
Query: 119 TFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDP 176
+ P G+ + + YY+ + +G P +Y +++LDTGS ++W QC+PC ++C Q DP +DP
Sbjct: 111 SIPLNPGLSIGSGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDP 170
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQ 236
S SKT+ K+ C S C L + + S C Y +Y D S G+ + D +T+
Sbjct: 171 SVSKTYKKLSCASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTLT 230
Query: 237 EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSP 293
F GC +N G A+GI+GL R +S++++ + Y F YCL +
Sbjct: 231 SSQ-----TLPQFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCLPTA 285
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
+ F +++ K+TP++T + Y + LT I+V G L L A+ + ++
Sbjct: 286 NSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMY-RVP 344
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITI 413
T IDSGT+ITR P +Y+ALR AF K M + DTC+ S VP+I +
Sbjct: 345 TLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKM 404
Query: 414 HFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
F GG DL L L+ CL FA ++GN QQ+ Y + YDV+ R+G
Sbjct: 405 IFQGGADLTLRAPSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIG 464
Query: 474 FGPGNCN 480
F PG+C+
Sbjct: 465 FAPGSCH 471
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 134/402 (33%), Positives = 193/402 (48%), Gaps = 35/402 (8%)
Query: 88 ILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSL 147
++ RD R+ R + P + + P + EY++ V +G P L
Sbjct: 88 LVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGVDD--GSGEYFVRVGVGSPPTDQYL 145
Query: 148 LLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKC 207
++D+GS + W QC+PC C Q DP FDP+ S +FS + C S C+ L
Sbjct: 146 VVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICRTLSGT---GCGGGG 202
Query: 208 SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGI 267
+ +C Y + Y DGS G A + +T+ G A +GC N+G GA+G+
Sbjct: 203 DAGKCDYSVTYGDGSYTKGELALETLTL------GGTAVQGVAIGCGHRNSGLFVGAAGL 256
Query: 268 MGLDRGPVSIISKTNIS---YFFYCLHS-PYGSTGYITFGKPDTVNKKFVKYTPIVTTPE 323
+GL G +S++ + + F YCL S G G + G+ + V +
Sbjct: 257 LGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGSLVLGRTEAVPRG----------RR 306
Query: 324 QSEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITRFPAPVYSALRSAF 377
S FY++ LTGI VGGERLPL+ S F +L+ + +D+GT +TR P Y+ALR AF
Sbjct: 307 ASSFYYVGLTGIGVGGERLPLQDSLF-QLTEDGAGGVVMDTGTAVTRLPREAYAALRGAF 365
Query: 378 RKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV 437
M + L DTCYDLS Y +V VP ++ +F G L L R LV
Sbjct: 366 DGAMGALPRSPAVS-LLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVF 424
Query: 438 CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL FA PS +LGN+QQ G ++ D A +GFGP C
Sbjct: 425 CLAFA--PSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 464
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 131/377 (34%), Positives = 185/377 (49%), Gaps = 24/377 (6%)
Query: 113 KKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQR 170
+ A T P TG + E+ + V G P Q +L+ DTGS ++W QC PC HC +Q
Sbjct: 100 AEAPAVTIPDSTGTSLGTLEFVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQH 159
Query: 171 DPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVDGSGETGFWA 229
DP FDP+KS T+S +PC C KCSS C Y + Y DGS G +
Sbjct: 160 DPIFDPTKSATYSAVPCGHPQCAA--------AGGKCSSNGTCLYKVQYGDGSSTAGVLS 211
Query: 230 TDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFF-- 287
+ +++ FA GC + N GD G++GL RG +S+ S+ S+
Sbjct: 212 HETLSLTSARALPGFA-----FGCGETNLGDFGDVDGLIGLGRGQLSLSSQAAASFGAAF 266
Query: 288 -YCLHSPYGSTGYITFGKPDTVN-KKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLK 345
YCL S S GY+T G + V+YT ++ + FY + L I VGG LP+
Sbjct: 267 SYCLPSYNTSHGYLTIGTTTPASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVP 326
Query: 346 ASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT 405
FT+ T +DSGT++T P Y+ALR F+ M +YK D FDTCYD +
Sbjct: 327 PILFTRDGTLLDSGTVLTYLPPEAYTALRDRFKFTMTQYKPAPAY-DPFDTCYDFAGQNA 385
Query: 406 VVVPKITIHFLGGVDLELDVRGTLVVE---SVRQVCLGFALLPSDPNSILLGNVQQRGYE 462
+ +P ++ F G +L G L+ + CL F PS ++GN QQR E
Sbjct: 386 IFMPLVSFKFSDGSSFDLSPFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTE 445
Query: 463 VHYDVAGRRLGFGPGNC 479
+ YDVA ++GF G+C
Sbjct: 446 MIYDVAAEKIGFVSGSC 462
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 136/363 (37%), Positives = 197/363 (54%), Gaps = 35/363 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ V IG P + + ++LDTGS +TW QC+PC C QQ DP FDPS S +++ + C+S
Sbjct: 168 EYFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSP 227
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C+ L N ++ C Y++AY DGS G +AT+ +T+ + A
Sbjct: 228 RCRDLDTAACRN-----ATGACLYEVAYGDGSYTVGDFATETLTLGDSTPVTNVA----- 277
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL---HSPYGSTGYITFG---- 303
+GC +N G GA+G++ L GP+S S+ + S F YCL SP ST + FG
Sbjct: 278 IGCGHDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDRDSPAAST--LQFGADGA 335
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------ID 357
+ DTV P+V +P FY++ L+GISVGG+ L + +S F +T +D
Sbjct: 336 EADTVTA------PLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVD 389
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG 417
SGT +TR + Y+ALR AF + G+ LFDTCYDLS +V VP +++ F G
Sbjct: 390 SGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVS-LFDTCYDLSDRTSVEVPAVSLRFEG 448
Query: 418 GVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
G L L + L+ V+ CL FA P++ ++GNVQQ+G V +D A +GF P
Sbjct: 449 GGALRLPAKNYLIPVDGAGTYCLAFA--PTNAAVSIIGNVQQQGTRVSFDTAKGVVGFTP 506
Query: 477 GNC 479
C
Sbjct: 507 NKC 509
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 139/410 (33%), Positives = 213/410 (51%), Gaps = 29/410 (7%)
Query: 88 ILRRDQQRLHLKNSRRLQKAIPD-NFKKT--KAFTFPAKTGI-VAADEYYIVVAIGKPKQ 143
+ +D++R+ +SR + + + +FKK K P K+G+ + + YY+ + +G P +
Sbjct: 55 MFAKDEERIRYFHSRLAKNSDANASFKKVGPKLAGIPLKSGLSMGSGNYYVKMGLGSPTK 114
Query: 144 YVSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
Y ++++DTGS +W QC+PC I+C Q DP F+PS SKT+ +PC+S+ C L
Sbjct: 115 YYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKS--ATL 172
Query: 203 GQDKCS--SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD 260
+ CS S C Y +Y D S G+ + D +T+ F+ GC +N G
Sbjct: 173 NEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQ-----TLSSFVYGCGQDNQGL 227
Query: 261 QNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS-----TGYITFGKPDTVNKKF 312
GI+GL +S++S+ + Y F YCL + + + G+++ G
Sbjct: 228 FGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSS 287
Query: 313 VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSA 372
K+TP++ P Y I L I+V G L + AS + K+ T IDSGT+ITR P PVY+
Sbjct: 288 YKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSY-KVPTIIDSGTVITRLPTPVYTT 346
Query: 373 LRSAFRKRM-KKYKMGKGIEDLFDTCYDLS-AYKTVVVPKITIHFLGGVDLELDVRGTLV 430
L++A+ + KKY+ GI L DTC+ S A + V P I I F GG DL+L +LV
Sbjct: 347 LKNAYVTILSKKYQQAPGIS-LLDTCFKGSLAGISEVAPDIRIIFKGGADLQLKGHNSLV 405
Query: 431 VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
CL A + ++GN QQ+ +V YDV R+GF PG C
Sbjct: 406 ELETGITCLAMA---GSSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGCQ 452
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 138/374 (36%), Positives = 197/374 (52%), Gaps = 27/374 (7%)
Query: 114 KTKAFTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP 172
++ A P +G + EY++ V IGKP ++LDTGS ++W QC PC C QQ DP
Sbjct: 130 ESNALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDP 189
Query: 173 FFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDR 232
FDP S ++S I C+ CK L +C + C Y+++Y DGS G +AT+
Sbjct: 190 IFDPISSNSYSPIRCDEPQCKSL-------DLSECRNGTCLYEVSYGDGSYTVGEFATET 242
Query: 233 MTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL-H 291
+T+ G A +GC NN G GA+G++GL G +S ++ N + F YCL +
Sbjct: 243 VTL------GSAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVN 296
Query: 292 SPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-- 349
+ + F P N P++ PE FY++ L GISVGGE LP+ S F
Sbjct: 297 RDSDAVSTLEFNSPLPRN---AATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEV 353
Query: 350 ---TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV 406
IDSGT +TR + VY ALR AF K K G+ LFDTCYDLS+ ++V
Sbjct: 354 DAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVS-LFDTCYDLSSRESV 412
Query: 407 VVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHY 465
+P ++ F G +L L R L+ V+SV C FA P+ + ++GNVQQ+G V +
Sbjct: 413 EIPTVSFRFPEGRELPLPARNYLIPVDSVGTFCFAFA--PTTSSLSIIGNVQQQGTRVGF 470
Query: 466 DVAGRRLGFGPGNC 479
D+A +GF +C
Sbjct: 471 DIANSLVGFSVDSC 484
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 146/410 (35%), Positives = 214/410 (52%), Gaps = 40/410 (9%)
Query: 89 LRRDQQRLHLKNSRRLQKAIP----DNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQ 143
L+RD +R+ K+ L IP + +T F+ +G+ + EY+ + +G P +
Sbjct: 96 LQRDSRRV--KSIATLAAQIPGRNVTHAPRTGGFSSSVVSGLSQGSGEYFTRLGVGTPAR 153
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
YV ++LDTGS I W QC PC C Q DP FDP KSKT++ IPC+S C+ L
Sbjct: 154 YVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRL-------D 206
Query: 204 QDKCSS--KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ 261
C++ K C Y ++Y DGS G ++T+ +T + G LGC +N G
Sbjct: 207 SAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKG------VALGCGHDNEGLF 260
Query: 262 NGASGIMGLDRGPVSIISKTNISY---FFYCL--HSPYGSTGYITFGKPDTVNKKFVKYT 316
GA+G++GL +G +S +T + F YCL S + FG + + ++T
Sbjct: 261 VGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFG--NAAVSRIARFT 318
Query: 317 PIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS------TEIDSGTIITRFPAPVY 370
P+++ P+ FY++ L GISVGG R+P A+ KL IDSGT +TR P Y
Sbjct: 319 PLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIGNGGVIIDSGTSVTRLIRPAY 378
Query: 371 SALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV 430
A+R AFR K K LFDTC+DLS V VP + +HF G D+ L L+
Sbjct: 379 IAMRDAFRVGAKALKRAPDFS-LFDTCFDLSNMNEVKVPTVVLHFR-GADVSLPATNYLI 436
Query: 431 -VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
V++ + C FA + ++GN+QQ+G+ V YD+A R+GF PG C
Sbjct: 437 PVDTNGKFCFAFAGTMGGLS--IIGNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 205 bits (522), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 142/419 (33%), Positives = 204/419 (48%), Gaps = 23/419 (5%)
Query: 68 YGPCSKLNQGKSRNTPSL-EEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI 126
+G CS L S + L + RD RL N+ R + + P T P ++G
Sbjct: 78 HGACSPLRPINSSSWIDLVSQSFERDNARL---NTIRSKNSGP----YTTMSNLPLQSGT 130
Query: 127 VAADEYYIVVA-IGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKI 185
YIV A G P + L++DTGS +TW QCKPC C Q D F+P +S ++ +
Sbjct: 131 TVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSYKTL 190
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
PC S TC L+ + C C Y+I Y DGS G ++ + +T+ G+ F
Sbjct: 191 PCLSATCTELIT--SESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTL----GSDSFQ 244
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITF 302
+ F GC NTG G+SG++GL + +S S++ Y F YCL ST +F
Sbjct: 245 NFAF--GCGHTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAYCLPDFGSSTSTGSF 302
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTII 362
+TP+V+ FY + L GISVGG+RL + + + ST +DSGT+I
Sbjct: 303 SVGKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGSTIVDSGTVI 362
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
TR Y+AL+++FR + + K + DTCYDLS + V +P IT HF D+
Sbjct: 363 TRLLPQAYNALKTSFRSKTRDLPSAKPFS-ILDTCYDLSRHSQVRIPTITFHFQNNADVA 421
Query: 423 LDVRGTL--VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ G L V QVCL FA ++GN QQ+ V +D R+GF G+C
Sbjct: 422 VSDVGILVPVQNGGSQVCLAFASASQMDGFNIIGNFQQQRMRVAFDTGAGRIGFASGSC 480
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 129/408 (31%), Positives = 209/408 (51%), Gaps = 28/408 (6%)
Query: 87 EILRRDQQRLHLKNSRRLQKAIPD-NFKKTKA--------FTFPAKTGI-VAADEYYIVV 136
+IL RD++ + +SR +K + +F + K+ P G+ + + YY+ +
Sbjct: 65 DILSRDEEHVKFLSSRLRKKDVQGASFSRHKSGHLLEPNSANIPLNPGLSIGSGNYYLKL 124
Query: 137 AIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
+G P +Y +++LDTGS ++W QCKPC+ +C Q DP F+PS S T+ + C+S+ C L
Sbjct: 125 GLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSSECS-L 183
Query: 196 LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTD 255
L+ N +S C Y +Y D S G+ + D +T+ F GC
Sbjct: 184 LKAATLNDPLCTASGVCVYTASYGDASYSMGYLSRDLLTLTPSQ-----TLPSFTYGCGQ 238
Query: 256 NNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTG-YITFGKPDTVNKK 311
+N G A+GI+GL R +S++++ + Y F YCL + S G +++ GK ++
Sbjct: 239 DNEGLFGKAAGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSGGGFLSIGK---ISPS 295
Query: 312 FVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYS 371
K+TP++ + Y + L I+V G + + A+ + ++ T IDSGT++TR P +Y+
Sbjct: 296 SYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGY-QVPTIIDSGTVVTRLPISIYA 354
Query: 372 ALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVV 431
ALR AF K M + + DTC+ S P+I + F GG DL L L+
Sbjct: 355 ALREAFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQGGADLSLRAPNILIE 414
Query: 432 ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL FA S ++GN QQ+ Y + YDV+ ++GF PG C
Sbjct: 415 ADKGIACLAFA---SSNQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 132/377 (35%), Positives = 191/377 (50%), Gaps = 30/377 (7%)
Query: 121 PAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKS 179
P +G+ + EY+ + +G P ++LDTGS + W QC PC C Q FDP +S
Sbjct: 130 PVVSGLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRS 189
Query: 180 KTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVN 239
+++ + C++ C+ L +G K C Y +AY DGS G +AT+ +T
Sbjct: 190 RSYGAVGCSAPLCRRL-----DSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA--- 241
Query: 240 GNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL------ 290
G AR LGC +N G A+G++GL RG +S ++ + Y F YCL
Sbjct: 242 GGARVAR--IALGCGHDNEGLFVAAAGLLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSS 299
Query: 291 HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT 350
+P + +TFG + +TP+V P FY++ L GISVGG R+ A
Sbjct: 300 ANPASHSSTVTFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDL 359
Query: 351 KLSTE-------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAY 403
+L +DSGT +TR P YSALR AFR ++ G LFDTCYDLS
Sbjct: 360 RLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGR 419
Query: 404 KTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYE 462
K V VP +++HF GG + L L+ V+S C FA +D ++GN+QQ+G+
Sbjct: 420 KVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFA--GTDGGVSIIGNIQQQGFR 477
Query: 463 VHYDVAGRRLGFGPGNC 479
V +D G+R+GF P C
Sbjct: 478 VVFDGDGQRVGFVPKGC 494
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 204 bits (520), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 129/372 (34%), Positives = 193/372 (51%), Gaps = 28/372 (7%)
Query: 119 TFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDP 176
T P TG + E+ +VV G P Q + + DTGS ++W QC+PC HC +Q DP FDP
Sbjct: 98 TIPDHTGTNLKTPEFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQHDPVFDP 157
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQ 236
+KS +++ +PC +T C +C+ C Y + Y DGS TG A + +T
Sbjct: 158 AKSSSYAVVPCGTTECAA--------AGGECNGTTCVYGVEYGDGSSTTGVLARETLTFS 209
Query: 237 ---EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL 290
E G F+ GC + N GD G++GL RG +S+ S+ ++ F YCL
Sbjct: 210 SSSEFTG--------FIFGCGETNLGDFGEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCL 261
Query: 291 HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT 350
S + GY++ G + V+YT +V P+ FY I L I++GG LP+ S FT
Sbjct: 262 PSYNTTPGYLSIGATPVTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFT 321
Query: 351 KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPK 410
K T +DSGTI+T P P Y+ALR F+ M+ K ++L DTCYD + +++P
Sbjct: 322 KTGTLLDSGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDEL-DTCYDFTGQSGILIPG 380
Query: 411 ITIHFLGGVDLELDVRGTLVVESVRQV---CLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
++ +F G L+ G + + CL F P+D ++G+ QR EV YDV
Sbjct: 381 VSFNFSDGAVFNLNFFGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYDV 440
Query: 468 AGRRLGFGPGNC 479
+++GF P +C
Sbjct: 441 PAQKIGFIPASC 452
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 204 bits (520), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 140/426 (32%), Positives = 222/426 (52%), Gaps = 28/426 (6%)
Query: 74 LNQGKSRNTP-SLEEILRRDQQRLHLKNSRRLQK------AIPDNFKKTKAFTFPAKTGI 126
L+ ++ +P S +++ +D++R+ +SR K A D + P K+G+
Sbjct: 41 LDSSQTSTSPFSFSDMITKDEERVRFLHSRLTNKESASNSATTDKLGGPSLVSTPLKSGL 100
Query: 127 -VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTFSK 184
+ + YY+ + +G P +Y S+++DTGS ++W QC+PC I+C Q DP F PS SKT+
Sbjct: 101 SIGSGNYYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKA 160
Query: 185 IPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF 244
+ C+S+ C L ++ C Y +Y D S G+ + D +T+
Sbjct: 161 LSCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSAA---- 216
Query: 245 ARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS----- 296
F+ GC +N G ++GI+GL +S++ + + Y F YCL S + +
Sbjct: 217 PSSGFVYGCGQDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQPNSS 276
Query: 297 -TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE 355
+G+++ G + + K+TP+V P+ Y + LT I+V G+ L + AS + + T
Sbjct: 277 VSGFLSIGASSLSSSPY-KFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSY-NVPTI 334
Query: 356 IDSGTIITRFPAPVYSALRSAFRKRM-KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
IDSGT+ITR P +Y+AL+ +F M KKY G + DTC+ S + VP+I I
Sbjct: 335 IDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFS-ILDTCFKGSVKEMSTVPEIRII 393
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
F GG LEL V +LV CL A S+P SI +GN QQ+ + V YDVA ++GF
Sbjct: 394 FRGGAGLELKVHNSLVEIEKGTTCLAIA-ASSNPISI-IGNYQQQTFTVAYDVANSKIGF 451
Query: 475 GPGNCN 480
PG C
Sbjct: 452 APGGCQ 457
>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
distachyon]
Length = 836
Score = 204 bits (518), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 147/433 (33%), Positives = 224/433 (51%), Gaps = 36/433 (8%)
Query: 62 LEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKK------T 115
L + R+GPC+ +S + PS E+LR D++R R P ++ +
Sbjct: 425 LRLTHRHGPCA--GPSRSASAPSFAEVLRADERRAEYIQRRMSGAKGPGGLQQFTAASSS 482
Query: 116 KAFTFPAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQ--QRDP 172
K+ T PA G + +Y + V++G P ++ +DTGS ++W QC PC + Q+D
Sbjct: 483 KSVTIPANIGHSIGTLQYVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQKDQ 542
Query: 173 FFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDR 232
FDP+KS ++S +PC + C L + G + +C Y ++Y DGS TG + +D
Sbjct: 543 LFDPAKSSSYSAVPCAADACSELSTY----GHGCAAGSQCGYVVSYGDGSNTTGVYGSDT 598
Query: 233 MTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY----FFY 288
+T+ + + A FL GC G G G++ L R +S+ S+T+ +Y F Y
Sbjct: 599 LTLTDAD-----AVTGFLFGCGHAQAGLFAGIDGLLALGRKGMSLTSQTSGAYGGGVFSY 653
Query: 289 CLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLP-LKAS 347
CL STG++T G P + + T ++T + FY + LTGI VGG++L + AS
Sbjct: 654 CLPPSPSSTGFLTLGGPSSASG--FATTGLLTAWDVPTFYMVMLTGIGVGGQQLSGVPAS 711
Query: 348 YFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKM-GKGIEDLFDTCYDLSAYKTV 406
F T +D+GT+ITR P Y+ALR+AFR M Y + DTCY+ + Y TV
Sbjct: 712 AFAG-GTVVDTGTVITRLPPTAYAALRAAFRAAMAPYGYPAAPATGILDTCYNFTDYGTV 770
Query: 407 VVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYD 466
+P +++ F GG L+LD G L CL FA D + +LGNVQQR + V +D
Sbjct: 771 TLPTVSLTFSGGATLKLDAPGFL-----SSGCLAFATNSGDGDPAILGNVQQRSFAVRFD 825
Query: 467 VAGRRLGFGPGNC 479
G +GF P +C
Sbjct: 826 --GSSVGFMPHSC 836
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 137/410 (33%), Positives = 210/410 (51%), Gaps = 29/410 (7%)
Query: 88 ILRRDQQRLHLKNSRRLQKAIPDNFKKT---KAFTFPAKTGI-VAADEYYIVVAIGKPKQ 143
+ +D++R+ +SR + + + K K P K+G+ + + YY+ + +G P +
Sbjct: 55 MFAKDEERIRYFHSRLAKNSDANASSKKVGPKLAGIPLKSGLSMGSGNYYVKMGLGSPTK 114
Query: 144 YVSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
Y ++++DTGS +W QC+PC I+C Q DP F+PS SKT+ +PC+S+ C L
Sbjct: 115 YYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKS--ATL 172
Query: 203 GQDKCS--SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD 260
+ CS S C Y +Y D S G+ + D +T+ F+ GC +N G
Sbjct: 173 NEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQ-----TLSSFVYGCGQDNQGL 227
Query: 261 QNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS-----TGYITFGKPDTVNKKF 312
GI+GL +S++S+ + Y F YCL + + + G+++ G
Sbjct: 228 FGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSS 287
Query: 313 VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSA 372
K+TP++ P Y I L I+V G L + AS + K+ T IDSGT+ITR P PVY+
Sbjct: 288 YKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSY-KVPTIIDSGTVITRLPTPVYTT 346
Query: 373 LRSAFRKRM-KKYKMGKGIEDLFDTCYDLS-AYKTVVVPKITIHFLGGVDLELDVRGTLV 430
L++A+ + KKY+ GI L DTC+ S A + V P I I F GG DL+L +LV
Sbjct: 347 LKNAYVTILSKKYQQAPGIS-LLDTCFKGSLAGISEVAPDIRIIFKGGADLQLKGHNSLV 405
Query: 431 VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
CL A + ++GN QQ+ +V YDV R+GF PG C
Sbjct: 406 ELETGITCLAMA---GSSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGCQ 452
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 141/420 (33%), Positives = 205/420 (48%), Gaps = 40/420 (9%)
Query: 85 LEEILRRDQQRL-HLKNSRRLQKAIPDNFKKTK--AFTFPAKTGIV-AADEYYIVVAIGK 140
L L+RD++R + + A N +++ A P +G+ + EY+ + +G
Sbjct: 89 LRHRLQRDKRRAARISKAAAGGGAGAANGTRSRGGAVAAPVVSGLAQGSGEYFTKIGVGT 148
Query: 141 PKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFP 200
P ++LDTGS + W QC PC C Q P FDP +S ++ + C + C+ L
Sbjct: 149 PSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRSSSYGAVDCAAPLCRRL----- 203
Query: 201 PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD 260
+G + C Y +AY DGS G +AT+ +T G AR LGC +N G
Sbjct: 204 DSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFA---GGARVAR--VALGCGHDNEGL 258
Query: 261 QNGASGIMGLDRGPVSIISKTNISY---FFYCL----------HSPYGSTGYITFGKPDT 307
A+G++GL RG +S ++ + Y F YCL + + +TFG P
Sbjct: 259 FVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAASRSRSSTVTFGPPSA 318
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-------IDSGT 360
F TP+V P FY++ L GISVGG R+P A +L +DSGT
Sbjct: 319 SAASF---TPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGT 375
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVD 420
+TR P YSALR AFR ++ G LFDTCYDL K V VP +++HF GG +
Sbjct: 376 SVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLGGRKVVKVPTVSMHFAGGAE 435
Query: 421 LELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L L+ V+S C FA +D ++GN+QQ+G+ V +D G+R+GF P C
Sbjct: 436 AALPPENYLIPVDSRGTFCFAFA--GTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 493
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 133/369 (36%), Positives = 202/369 (54%), Gaps = 29/369 (7%)
Query: 121 PAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKS 179
P +G+ + + EY+ V +G P + + ++LDTGS +TW QC+PC C QQ DP FDPS S
Sbjct: 155 PVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 214
Query: 180 KTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVN 239
+++ + C++ C L N S+ C Y++AY DGS G +AT+ +T+ +
Sbjct: 215 TSYASVACDNPRCHDLDAAACRN-----STGACLYEVAYGDGSYTVGDFATETLTLGDSA 269
Query: 240 GNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL---HSPYGS 296
A +GC +N G GA+G++ L GP+S S+ + + F YCL SP S
Sbjct: 270 PVSSVA-----IGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPSSS 324
Query: 297 TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE- 355
T + FG D + + P++ +P S FY++ L+G+SVGG+ L + S F ST
Sbjct: 325 T--LQFG--DAADAEVTA--PLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGA 378
Query: 356 ----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
+DSGT +TR + Y+ALR AF + + G+ LFDTCYDLS +V VP +
Sbjct: 379 GGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVS-LFDTCYDLSDRTSVEVPAV 437
Query: 412 TIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR 470
++ F GG +L L + L+ V+ CL FA P++ ++GNVQQ+G V +D A
Sbjct: 438 SLRFAGGGELRLPAKNYLIPVDGAGTYCLAFA--PTNAAVSIIGNVQQQGTRVSFDTAKS 495
Query: 471 RLGFGPGNC 479
+GF C
Sbjct: 496 TVGFTTNKC 504
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 133/369 (36%), Positives = 201/369 (54%), Gaps = 29/369 (7%)
Query: 121 PAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKS 179
P +G+ + + EY+ V +G P + + ++LDTGS +TW QC+PC C QQ DP FDPS S
Sbjct: 151 PVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 210
Query: 180 KTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVN 239
+++ + C++ C L N S+ C Y++AY DGS G +AT+ +T+ +
Sbjct: 211 TSYASVACDNPRCHDLDAAACRN-----STGACLYEVAYGDGSYTVGDFATETLTLGDSA 265
Query: 240 GNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL---HSPYGS 296
A +GC +N G GA+G++ L GP+S S+ + + F YCL SP S
Sbjct: 266 PVSSVA-----IGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPSSS 320
Query: 297 TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE- 355
T + FG D + + P++ +P S FY++ L+GISVGG+ L + S F T
Sbjct: 321 T--LQFG--DAADAEVTA--PLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGA 374
Query: 356 ----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
+DSGT +TR + Y+ALR AF + + G+ LFDTCYDLS +V VP +
Sbjct: 375 GGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVS-LFDTCYDLSDRTSVEVPAV 433
Query: 412 TIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR 470
++ F GG +L L + L+ V+ CL FA P++ ++GNVQQ+G V +D A
Sbjct: 434 SLRFAGGGELRLPAKNYLIPVDGAGTYCLAFA--PTNAAVSIIGNVQQQGTRVSFDTAKS 491
Query: 471 RLGFGPGNC 479
+GF C
Sbjct: 492 TVGFTSNKC 500
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 190/365 (52%), Gaps = 27/365 (7%)
Query: 135 VVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
V +G ++++DT S +TW QC PC C Q+DP FDPS S +++ +PCNS++C
Sbjct: 154 VATVGLGGGEATVIVDTASELTWVQCAPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDA 213
Query: 195 LL--------EWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
L GQD+ S+ C Y ++Y DGS G A DR+++ +G
Sbjct: 214 LQLATGGTSGGAAACQGQDQ-SAAACSYTLSYRDGSYSRGVLAHDRLSLAGEVIDG---- 268
Query: 247 YPFLLGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISY---FFYCL-HSPYGSTGYIT 301
F+ GC +N G G SG+MGL R +S++S+T + F YCL S+G +
Sbjct: 269 --FVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCLPLKESDSSGSLV 326
Query: 302 FGKPDTV--NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE--ID 357
G +V N + Y +V+ P Q FY + LTGI+VGG+ + + ID
Sbjct: 327 IGDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSGGGGGKAIID 386
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG 417
SGT+IT +Y+A+++ F + +Y G + DTC++++ + V VP + + F G
Sbjct: 387 SGTVITSLVPSIYNAVKAEFLSQFAEYPQAPGFS-ILDTCFNMTGLREVQVPSLKLVFDG 445
Query: 418 GVDLELDVRGTL--VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
GV++E+D G L V QVCL A L S+ + ++GN QQ+ V +D +G ++GF
Sbjct: 446 GVEVEVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQKNLRVIFDTSGSQVGFA 505
Query: 476 PGNCN 480
C
Sbjct: 506 QETCG 510
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 134/407 (32%), Positives = 209/407 (51%), Gaps = 28/407 (6%)
Query: 90 RRDQQRLHLKN--SRRLQKAIPD-----NFKKTKAFTFPAKTGI-VAADEYYIVVAIGKP 141
++ Q+RL + N R LQ I + N + P +GI + + Y + V +G
Sbjct: 16 KKLQKRLIMDNFQLRSLQSRIKNIILSGNIDDSVDTQIPLTSGIRLQSLNYIVTVELGGR 75
Query: 142 KQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPP 201
K +++++DTGS ++W QC+PC C Q+DP F+PSKS ++ + CNS TC+ L
Sbjct: 76 K--MTVIVDTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSYRTVLCNSLTCRSLQLATGN 133
Query: 202 NGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ 261
+G + C Y + Y DGS +G + + + N F+ GC N G
Sbjct: 134 SGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNLGNTTVNN------FIFGCGRKNQGLF 187
Query: 262 NGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYG-STGYITFGKPDTV--NKKFVKY 315
GASG++GL R +S+IS+ + + F YCL + ++G + G +V N + Y
Sbjct: 188 GGASGLVGLGRTDLSLISQISPMFGGVFSYCLPTTEAEASGSLVMGGNSSVYKNTTPISY 247
Query: 316 TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRS 375
T ++ P FY + LTGI+VGG + ++A F K IDSGT+I+R P +Y AL++
Sbjct: 248 TRMIHNPLL-PFYFLNLTGITVGG--VEVQAPSFGKDRMIIDSGTVISRLPPSIYQALKA 304
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTL--VVES 433
F K+ Y + D+C++LS Y+ V +P I ++F G +L +DV G V
Sbjct: 305 EFVKQFSGYPSAPSFM-ILDSCFNLSGYQEVKIPDIKMYFEGSAELNVDVTGVFYSVKTD 363
Query: 434 VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
QVCL A LP + ++GN QQ+ + YD G LGF C+
Sbjct: 364 ASQVCLAIASLPYEDEVGIIGNYQQKNQRIIYDTKGSMLGFAEEACS 410
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 136/414 (32%), Positives = 209/414 (50%), Gaps = 22/414 (5%)
Query: 77 GKSRN-TPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYI 134
GKS + L++ L D R+ SR ++ N P +G+ + Y +
Sbjct: 11 GKSTDWNKKLQKSLILDDFRVRSLQSR-IKSIFSGNNIDALDSQIPLSSGVRLQTLNYIV 69
Query: 135 VVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
V IG + +++++DTGS +TW QC+PC C Q+DP F+PS S ++ I CNS+TC+
Sbjct: 70 TVEIGG--RNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSYQTILCNSSTCQS 127
Query: 195 LLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
L G ++ C Y + Y DGS G +++ + G F+ GC
Sbjct: 128 LQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNL------GTTHVSNFIFGCG 181
Query: 255 DNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL-HSPYGSTGYITFGKPDTV-- 308
NN G GASG+MGL + +S++S+T+ + F YCL + ++G + G +V
Sbjct: 182 RNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAADASGSLILGGNSSVYK 241
Query: 309 NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAP 368
N + YT ++ P+ FY + LTGIS+GG + L+A + + IDSGT+ITR P P
Sbjct: 242 NTTPISYTRMIANPQLPTFYFLNLTGISIGG--VALQAPNYRQSGILIDSGTVITRLPPP 299
Query: 369 VYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGT 428
VY L++ F K+ + + DTC++L+ Y V +P I + F G +L +DV G
Sbjct: 300 VYRDLKAEFLKQFSGFPSAPPFS-ILDTCFNLNGYDEVDIPTIRMQFEGNAELTVDVTGI 358
Query: 429 --LVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
V QVCL A L D ++GN QQR V Y+ +LGF C+
Sbjct: 359 FYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKLGFAAEACS 412
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 201 bits (512), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 135/381 (35%), Positives = 190/381 (49%), Gaps = 33/381 (8%)
Query: 118 FTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDP 176
F P +G+ EY+ VV +G P++ + L++DTGS ITW QC PC +C +Q+D F+P
Sbjct: 1 FEAPIFSGLAFGTGEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNP 60
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQ 236
S S +F + C+S+ C L C S +C Y Y DGS G TD + +
Sbjct: 61 SSSSSFKVLDCSSSLCLNLDVM-------GCLSNKCLYQADYGDGSFTMGELVTDNVVLD 113
Query: 237 EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNIS---YFFYCL--- 290
+ G G LGC +N G A+GI+GL RGP+S + + S F YCL
Sbjct: 114 DAFGPGQVVLTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFSYCLPDR 173
Query: 291 HSPYGSTGYITFGK---PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLP-LKA 346
S + FG P T VK+ P + P + +Y++ +TGISVGG L + A
Sbjct: 174 ESDPNHKSTLVFGDAAIPHTATGS-VKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPA 232
Query: 347 SYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS 401
S F S T DSGT ITR A Y+A+R AFR + +FDTCYD +
Sbjct: 233 SVFQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFK-IFDTCYDFT 291
Query: 402 AYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFA--LLPSDPNSILLGNVQQ 458
++ VP +T HF G VD+ L +V S + C FA + PS ++GNVQQ
Sbjct: 292 GMNSISVPTVTFHFQGDVDMRLPPSNYIVPVSNNNIFCFAFAASMGPS-----VIGNVQQ 346
Query: 459 RGYEVHYDVAGRRLGFGPGNC 479
+ + V YD +++G P C
Sbjct: 347 QSFRVIYDNVHKQIGLLPDQC 367
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 201 bits (512), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 137/404 (33%), Positives = 203/404 (50%), Gaps = 20/404 (4%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQ 143
L++ L D +L SR N + P +GI + Y + V +G K
Sbjct: 87 LKKHLIMDDFQLRSLQSRMKSIISGRNIDDSVDAPIPLTSGIRLQTLNYIVTVELGGRK- 145
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
+++++DTGS ++W QC+PC C Q+DP F+PS S ++ + C+S TC+ L G
Sbjct: 146 -MTVIVDTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATGNLG 204
Query: 204 QDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNG 263
+ C Y + Y DGS G T+ + + GN A F+ GC NN G G
Sbjct: 205 VCGSNPPSCNYVVNYGDGSYTRGELGTEHLDL----GNST-AVNNFIFGCGRNNQGLFGG 259
Query: 264 ASGIMGLDRGPVSIISKTNISY---FFYCLH-SPYGSTGYITFGKPDTV--NKKFVKYTP 317
ASG++GL R +S+IS+T+ + F YCL + ++G + G +V N + YT
Sbjct: 260 ASGLVGLGRSSLSLISQTSAMFGGVFSYCLPITETEASGSLVMGGNSSVYKNTTPISYTR 319
Query: 318 IVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAF 377
++ P Q FY + LTGI+VG + ++A F K IDSGT+ITR P +Y AL+ F
Sbjct: 320 MIPNP-QLPFYFLNLTGITVG--SVAVQAPSFGKDGMMIDSGTVITRLPPSIYQALKDEF 376
Query: 378 RKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGT--LVVESVR 435
K+ + + DTC++LS Y+ V +P I +HF G +L +DV G V
Sbjct: 377 VKQFSGFPSAPAFM-ILDTCFNLSGYQEVEIPNIKMHFEGNAELNVDVTGVFYFVKTDAS 435
Query: 436 QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
QVCL A L + ++GN QQ+ V YD G LGF C
Sbjct: 436 QVCLAIASLSYENEVGIIGNYQQKNQRVIYDTKGSMLGFAAEAC 479
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 201 bits (512), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 150/410 (36%), Positives = 217/410 (52%), Gaps = 41/410 (10%)
Query: 89 LRRDQQRLHLKNSRRLQKAIPDNFKKTKA----FTFPAKTGIV-AADEYYIVVAIGKPKQ 143
L RD R+ K+ L A+ +T+A F+ +G+ + EY+ + +G P +
Sbjct: 102 LARDASRV--KSLTSLAAAVGST-NRTRARGPGFSSSVTSGLAQGSGEYFTRLGVGTPAR 158
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
YV ++LDTGS + W QC PC C Q DP F+P+KS++F+ IPC S C+ L
Sbjct: 159 YVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIPCGSPLCRRL-------D 211
Query: 204 QDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ 261
CS+K+ C Y ++Y DGS G ++T+ +T + G A LGC +N G
Sbjct: 212 SPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTR-VGRVA-----LGCGHDNEGLF 265
Query: 262 NGASGIMGLDRGPVSIISKTNISY---FFYCL--HSPYGSTGYITFGKPDTVNKKFVKYT 316
GA+G++GL RG +S S+ + F YCL S Y+ FG D+ + ++T
Sbjct: 266 IGAAGLLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSASSKPSYMVFG--DSAISRTARFT 323
Query: 317 PIVTTPEQSEFYHITLTGISVGGERLP-LKASYFTKLSTE-----IDSGTIITRFPAPVY 370
P+V+ P+ FY++ L G+SVGG R+P + AS F ST IDSGT +TR P Y
Sbjct: 324 PLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTSVTRLTRPAY 383
Query: 371 SALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV 430
ALR AFR K LFDTC+DLS V VP + +HF G D+ L L+
Sbjct: 384 VALRDAFRVGASNLKRAPEFS-LFDTCFDLSGKTEVKVPTVVLHFR-GADVSLPASNYLI 441
Query: 431 -VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
V++ C FA S + ++GN+QQ+G+ V YD+A R+GF P C
Sbjct: 442 PVDNSGSFCFAFAGTMSGLS--IVGNIQQQGFRVVYDLAASRVGFAPRGC 489
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 132/365 (36%), Positives = 188/365 (51%), Gaps = 33/365 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY++ V +G P L++D+GS + W QC+PC C QQ DP FDP+ S +F+ +PC+S
Sbjct: 132 EYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQADPLFDPAASASFTAVPCDSG 191
Query: 191 TCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQE---VNGNGYFAR 246
C+ L P G C+ S C Y ++Y DGS G A + +T + V G
Sbjct: 192 VCRTL-----PGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTFGDSTPVQG------ 240
Query: 247 YPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISK---TNISYFFYCLHS--PYGSTGYIT 301
+GC N G GA+G++GL GP+S++ + F YCL S G +
Sbjct: 241 --VAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGADAGAGSLV 298
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL-----STEI 356
FG+ D + V + P++ +Q FY++ LTG+ VGGERLPL+ F +
Sbjct: 299 FGRDDAMPVGAV-WVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVM 357
Query: 357 DSGTIITRFPAPVYSALRSAFRKRM-KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHF 415
D+GT +TR P Y+ALR AF + G+ L DTCYDLS Y +V VP + ++F
Sbjct: 358 DTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVS-LLDTCYDLSGYASVRVPTVALYF 416
Query: 416 -LGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
G L L R LV CL FA S + +LGN+QQ+G ++ D A +GF
Sbjct: 417 GRDGAALTLPARNLLVEMGGGVYCLAFAASASGLS--ILGNIQQQGIQITVDSANGYVGF 474
Query: 475 GPGNC 479
GP C
Sbjct: 475 GPSTC 479
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 201 bits (511), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 148/437 (33%), Positives = 224/437 (51%), Gaps = 44/437 (10%)
Query: 75 NQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKA-IPDNFK------KTKAFT-------- 119
N GK R + +LE R + +++++A + DN + K KA T
Sbjct: 10 NLGKGRESTTLEMKHRELCSGKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSV 69
Query: 120 ----FPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFF 174
P +GI + + Y + V +G +SL++DTGS +TW QC+PC C Q+ P +
Sbjct: 70 SETQIPLTSGIKLESLNYIVTVELGGKN--MSLIVDTGSDLTWVQCQPCRSCYNQQGPLY 127
Query: 175 DPSKSKTFSKIPCNSTTCKILLEWF----PPNGQDKCSSKECPYDIAYVDGSGETGFWAT 230
DPS S ++ + CNS+TC+ L+ P G + C Y ++Y DGS G A+
Sbjct: 128 DPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLAS 187
Query: 231 DRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FF 287
+ + + G F+ GC NN G G+SG+MGL R VS++S+T ++ F
Sbjct: 188 ESILL------GDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFS 241
Query: 288 YCLHS-PYGSTGYITFGKPDTV--NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPL 344
YCL S G++G ++FG +V N V YTP+V P+ FY + LTG S+GG + L
Sbjct: 242 YCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VEL 299
Query: 345 KASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYK 404
K+S F + IDSGT+ITR P +Y A++ F K+ + G + DTC++L++Y+
Sbjct: 300 KSSSFGR-GILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYS-ILDTCFNLTSYE 357
Query: 405 TVVVPKITIHFLGGVDLELDVRGT--LVVESVRQVCLGFALLPSDPNSILLGNVQQRGYE 462
+ +P I + F G +LE+DV G V VCL A L + ++GN QQ+
Sbjct: 358 DISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQR 417
Query: 463 VHYDVAGRRLGFGPGNC 479
V YD RLG NC
Sbjct: 418 VIYDTTQERLGIVGENC 434
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 201 bits (511), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 141/420 (33%), Positives = 201/420 (47%), Gaps = 38/420 (9%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFK-----KTKAFTFPAKTGIV-AADEYYIVVAI 138
L LRRD++R ++ A + + F P +G+ + EY+ + +
Sbjct: 94 LAHRLRRDKRRASRISAAAGGAAAANGTRVGGGGGGSGFVAPVVSGLAQGSGEYFTKIGV 153
Query: 139 GKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEW 198
G P ++LDTGS + W QC PC C Q FDP S ++ + C + C+ L
Sbjct: 154 GTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAPLCRRL--- 210
Query: 199 FPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNN 257
+G K C Y +AY DGS G +AT+ +T AR P LGC +N
Sbjct: 211 --DSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAS------GARVPRVALGCGHDN 262
Query: 258 TGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL-------HSPYGSTGYITFGKPDT 307
G A+G++GL RG +S S+ + + F YCL S + +TFG
Sbjct: 263 EGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTVTFGSGAV 322
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-------IDSGT 360
+TP+V P FY++ L GISVGG R+P A +L +DSGT
Sbjct: 323 GPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPSTGRGGVIVDSGT 382
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVD 420
+TR P Y+ALR AFR ++ G LFDTCYDLS K V VP +++HF GG +
Sbjct: 383 SVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLKVVKVPTVSMHFAGGAE 442
Query: 421 LELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L L+ V+S C FA +D ++GN+QQ+G+ V +D G+RLGF P C
Sbjct: 443 AALPPENYLIPVDSRGTFCFAFA--GTDGGVSIIGNIQQQGFRVVFDGDGQRLGFVPKGC 500
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 201 bits (510), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 138/425 (32%), Positives = 208/425 (48%), Gaps = 32/425 (7%)
Query: 58 GKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQR---LHLKNSRRLQKAIPDNFKK 114
G S+ + RYGPCS + P+ EE+LRRDQ R + K S A ++ +
Sbjct: 31 GTSSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQS 90
Query: 115 TKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH---CSQQR 170
+K + P G + EY I V +G P +++DTGS ++W QC+PC C
Sbjct: 91 SK-VSVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHA 149
Query: 171 DPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWAT 230
FDP+ S T++ C++ C L + NG D + C Y + Y DGS TG +++
Sbjct: 150 GALFDPAASSTYAAFNCSAAACAQLGDSGEANGCD--AKSRCQYIVKYGDGSNTTGTYSS 207
Query: 231 DRMTIQEVNGNGYFARYPFLLGCT--DNNTGDQNGASGIMGLDRGPVSIISKTNISY--- 285
D +T+ +G F GC+ + G + G++GL S +S+T Y
Sbjct: 208 DVLTL-----SGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKS 262
Query: 286 FFYCLHSPYGSTGYITF----GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGER 341
FFYCL + S+G++T +F TP++ + + +Y L I+VGG++
Sbjct: 263 FFYCLPATPASSGFLTLGAPASGGGGGASRFAT-TPMLRSKKVPTYYFAALEDIAVGGKK 321
Query: 342 LPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS 401
L L S F S +DSGT+ITR P Y+AL SAFR M +Y + + + DTC++ +
Sbjct: 322 LGLSPSVFAAGSL-VDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPL-GILDTCFNFT 379
Query: 402 AYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGY 461
V +P + + F GG ++LD G V CL FA D +GNVQQR +
Sbjct: 380 GLDKVSIPTVALVFAGGAVVDLDAHGI-----VSGGCLAFAPTRDDKAFGTIGNVQQRTF 434
Query: 462 EVHYD 466
EV YD
Sbjct: 435 EVLYD 439
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 201 bits (510), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 117/356 (32%), Positives = 185/356 (51%), Gaps = 22/356 (6%)
Query: 135 VVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
V +G ++++DT S +TW QC PC C Q+ P FDP+ S +++ +PCNS++C
Sbjct: 128 VATVGLGGGEATVIVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDA 187
Query: 195 LLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLG 252
L ++ C Y ++Y DGS G A D++++ +G F+ G
Sbjct: 188 LQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDG------FVFG 241
Query: 253 CTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL-HSPYGSTGYITFGKPDTV 308
C +N G G SG+MGL R +S+IS+T + F YCL S+G + G +V
Sbjct: 242 CGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKESESSGSLVLGDDTSV 301
Query: 309 --NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFP 366
N + YT +V+ P Q FY + LTGI++GG+ + A +DSGTIIT
Sbjct: 302 YRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVI-----VDSGTIITSLV 356
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVR 426
VY+A+++ F + +Y G + DTC++L+ ++ V +P + F G V++E+D
Sbjct: 357 PSVYNAVKAEFLSQFAEYPQAPGF-SILDTCFNLTGFREVQIPSLKFVFEGNVEVEVDSS 415
Query: 427 GTL--VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G L V QVCL A L S+ + ++GN QQ+ V +D G ++GF C+
Sbjct: 416 GVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETCD 471
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 201 bits (510), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 129/366 (35%), Positives = 186/366 (50%), Gaps = 29/366 (7%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ + +G P ++LDTGS + W QC PC C +Q FDP +S++++ + C +
Sbjct: 139 EYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYNAVGCAAP 198
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C+ L +G C Y +AY DGS G +AT+ +T G AR
Sbjct: 199 LCRRL-----DSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFA---GGARVAR--VA 248
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS------TGYIT 301
LGC +N G A+G++GL RG +S ++ + Y F YCL S + +T
Sbjct: 249 LGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRSSTVT 308
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------ 355
FG + +TP+V P FY++ L GISVGG R+P A+ +L
Sbjct: 309 FGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSSGRGGV 368
Query: 356 -IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
+DSGT +TR P YSALR AFR ++ G LFDTCYDLS K V VP +++H
Sbjct: 369 IVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMH 428
Query: 415 FLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
F GG + L L+ V+S C FA +D ++GN+QQ+G+ V +D G+R+
Sbjct: 429 FAGGAEAALPPENYLIPVDSKGTFCFAFA--GTDGGVSIIGNIQQQGFRVVFDGDGQRVA 486
Query: 474 FGPGNC 479
F P C
Sbjct: 487 FTPKGC 492
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 152/423 (35%), Positives = 212/423 (50%), Gaps = 46/423 (10%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIP--------------DNFKKTKAFTFPAKTGIV-AA 129
L+E L+RD R+ N+R A+ D K F+ +G+ +
Sbjct: 91 LQERLKRDAARVDSINARVQLAAMGVSKAEMKPLNGSSIDARFDAKDFSSSIISGLAQGS 150
Query: 130 DEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNS 189
EY+ + +G P +Y ++LDTGS I W QC PC C Q DP F+P+ S T+ K+PC +
Sbjct: 151 GEYFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDPLFNPAASSTYRKVPCAT 210
Query: 190 TTCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
CK L C +K C Y ++Y DGS G ++T+ +T + G R
Sbjct: 211 PLCKKL-------DISGCRNKRYCEYQVSYGDGSFTVGDFSTETLTFR-----GQVIRR- 257
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL--HSPYGSTGYITFG 303
LGC +N G GA+G++GL RG +S S+T + F YCL S G+ + FG
Sbjct: 258 VALGCGHDNEGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFSYCLVDRSASGTASSLIFG 317
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL-PLKASYFTKLSTE-----ID 357
K K +TP+++ P+ FY++ L GISVGG RL + AS F +T ID
Sbjct: 318 KAAI--PKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATGNGGVIID 375
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG 417
SGT +TR YS +R AFR K G LFDTCYDLS KTV VP + HF G
Sbjct: 376 SGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGGFS-LFDTCYDLSGLKTVKVPTLVFHFQG 434
Query: 418 GVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
G + L L+ V+S C FA + ++GN+QQ+GY V +D R+GF
Sbjct: 435 GAHISLPATNYLIPVDSSATFCFAFA--GNTGGLSIIGNIQQQGYRVVFDSLANRVGFKA 492
Query: 477 GNC 479
G+C
Sbjct: 493 GSC 495
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 151/490 (30%), Positives = 233/490 (47%), Gaps = 49/490 (10%)
Query: 20 NGAYANDNDLSHSYIVSVSSL--IPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQG 77
N + ++L +V SSL IP +P G + + +GPCS +
Sbjct: 24 NAGAGDHHELKRFMVVPTSSLKHIPEDATCSGHKVIPSN-GTAWVPMNRPHGPCSSTSSR 82
Query: 78 KSRNTP-SLEEILRRDQQRL---------HLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV 127
S + ++++L DQ R H+ I + + P+ T V
Sbjct: 83 ASEDMGIDIDDMLMWDQLRTSYIRTQLSTHVGVVGGGMPVIARSTTVSNRDYTPSSTASV 142
Query: 128 AAD-------EYYIVVAIGKPKQYVS--LLLDTGSGITWTQCKPCI--HCSQQRDPFFDP 176
+ E A + + VS +++DT S I W QC PC C Q+DP +DP
Sbjct: 143 GTNSGTSKTIEKSDQTATNEHQDAVSQTVVVDTSSDIPWVQCLPCPIPQCHLQKDPLYDP 202
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCS--SKECPYDIAYVDGSGETGFWATDRMT 234
+KS TF+ IPC S CK L + + CS + EC Y + Y DG TG + TD +T
Sbjct: 203 AKSSTFAPIPCGSPACKELGSSY----GNGCSPTTDECKYIVNYGDGKATTGTYVTDTLT 258
Query: 235 IQEVNGNGYFARYPFLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISY---FFYCL 290
+ F GC+ G N +GI+ L G S++ +T +Y F YC+
Sbjct: 259 MSPT-----IVVKDFRFGCSHAVRGSFSNQNAGILALGGGRGSLLEQTADAYGNAFSYCI 313
Query: 291 HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT 350
P S G+++ G P + KF YTP++ FY + L I V G++L + + F
Sbjct: 314 PKP-SSAGFLSLGGPVEASLKF-SYTPLIKNKHAPTFYIVHLEAIIVAGKQLAVPPTAFA 371
Query: 351 KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKY-KMGKGIEDLFDTCYDLSAYKTVVVP 409
+ +DSG ++T+ P VY+ALR+AFR M Y + + +L DTCYD + + V VP
Sbjct: 372 TGAV-MDSGAVVTQLPPQVYAALRAAFRSAMAAYGPLAAPVRNL-DTCYDFTRFPDVKVP 429
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
K+++ F GG L+L+ +++++ CL FA P + + +GNVQQ+ YEV YDV G
Sbjct: 430 KVSLVFAGGATLDLE-PASIILDG----CLAFAATPGEESVGFIGNVQQQTYEVLYDVGG 484
Query: 470 RRLGFGPGNC 479
++GF G C
Sbjct: 485 GKVGFRRGAC 494
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 117/356 (32%), Positives = 185/356 (51%), Gaps = 22/356 (6%)
Query: 135 VVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
V +G ++++DT S +TW QC PC C Q+ P FDP+ S +++ +PCNS++C
Sbjct: 127 VATVGLGGGEATVIVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDA 186
Query: 195 LLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLG 252
L ++ C Y ++Y DGS G A D++++ +G F+ G
Sbjct: 187 LQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDG------FVFG 240
Query: 253 CTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL-HSPYGSTGYITFGKPDTV 308
C +N G G SG+MGL R +S+IS+T + F YCL S+G + G +V
Sbjct: 241 CGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKESESSGSLVLGDDTSV 300
Query: 309 --NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFP 366
N + YT +V+ P Q FY + LTGI++GG+ + A +DSGTIIT
Sbjct: 301 YRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVI-----VDSGTIITSLV 355
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVR 426
VY+A+++ F + +Y G + DTC++L+ ++ V +P + F G V++E+D
Sbjct: 356 PSVYNAVKAEFLSQFAEYPQAPGF-SILDTCFNLTGFREVQIPSLKFVFEGNVEVEVDSS 414
Query: 427 GTL--VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G L V QVCL A L S+ + ++GN QQ+ V +D G ++GF C+
Sbjct: 415 GVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETCD 470
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 144/410 (35%), Positives = 212/410 (51%), Gaps = 40/410 (9%)
Query: 89 LRRDQQRLHLKNSRRLQKAIP----DNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQ 143
L+RD +R+ K+ L IP + + F+ +G+ + EY+ + +G P +
Sbjct: 96 LQRDSRRV--KSIATLAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPAR 153
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
YV ++LDTGS I W QC PC C Q DP FDP KSKT++ IPC+S C+ L
Sbjct: 154 YVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRL-------D 206
Query: 204 QDKCSS--KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ 261
C++ K C Y ++Y DGS G ++T+ +T + G LGC +N G
Sbjct: 207 SAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKG------VALGCGHDNEGLF 260
Query: 262 NGASGIMGLDRGPVSIISKTNISY---FFYCL--HSPYGSTGYITFGKPDTVNKKFVKYT 316
GA+G++GL +G +S +T + F YCL S + FG + + ++T
Sbjct: 261 VGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFG--NAAVSRIARFT 318
Query: 317 PIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS------TEIDSGTIITRFPAPVY 370
P+++ P+ FY++ L GISVGG R+P + KL IDSGT +TR P Y
Sbjct: 319 PLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAY 378
Query: 371 SALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV 430
A+R AFR K K LFDTC+DLS V VP + +HF G D+ L L+
Sbjct: 379 IAMRDAFRVGAKTLKRAPDFS-LFDTCFDLSNMNEVKVPTVVLHFR-GADVSLPATNYLI 436
Query: 431 -VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
V++ + C FA + ++GN+QQ+G+ V YD+A R+GF PG C
Sbjct: 437 PVDTNGKFCFAFAGTMGGLS--IIGNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 131/343 (38%), Positives = 188/343 (54%), Gaps = 27/343 (7%)
Query: 147 LLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDK 206
++LDTGS +TW QC+PC C QQ DP FDPS S +++ + C+S C+ L N
Sbjct: 1 MVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRN---- 56
Query: 207 CSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASG 266
++ C Y++AY DGS G +AT+ +T+ + G A +GC +N G GA+G
Sbjct: 57 -ATGACLYEVAYGDGSYTVGDFATETLTLGDSTPVGNVA-----IGCGHDNEGLFVGAAG 110
Query: 267 IMGLDRGPVSIISKTNISYFFYCL---HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPE 323
++ L GP+S S+ + S F YCL SP ST + FG D + P+V +P
Sbjct: 111 LLALGGGPLSFPSQISASTFSYCLVDRDSPAAST--LQFG--DGAAEAGTVTAPLVRSPR 166
Query: 324 QSEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITRFPAPVYSALRSAF 377
S FY++ L+GISVGG+ L + AS F +T +DSGT +TR + Y+ALR AF
Sbjct: 167 TSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAF 226
Query: 378 RKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQ 436
+ G+ LFDTCYDLS +V VP +++ F GG L L + L+ V+
Sbjct: 227 VQGAPSLPRTSGVS-LFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGT 285
Query: 437 VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL FA P++ ++GNVQQ+G V +D A +GF P C
Sbjct: 286 YCLAFA--PTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 135/375 (36%), Positives = 198/375 (52%), Gaps = 31/375 (8%)
Query: 115 TKAFTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPF 173
T+ F P +G + EY+ V IG+P V ++LDTGS ++W QC PC C +Q DP
Sbjct: 133 TEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPI 192
Query: 174 FDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRM 233
F+P+ S +F+ + C + CK L +C + C Y+++Y DGS G + T+ +
Sbjct: 193 FEPTSSASFTSLSCETEQCKSL-------DVSECRNGTCLYEVSYGDGSYTVGDFVTETV 245
Query: 234 TIQEVN-GNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL-H 291
T+ + GN +GC NN G GA+G++GL G +S S+ N S F YCL
Sbjct: 246 TLGSTSLGN-------IAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVD 298
Query: 292 SPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTK 351
ST + F P T + P+ P F+++ LTG+SVGG LP+ + F +
Sbjct: 299 RDSDSTSTLDFNSPITPD---AVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSF-Q 354
Query: 352 LSTE------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT 405
+S + +DSGT +TR VY+ LR AF K + +G+ LFDTCYDLS+
Sbjct: 355 MSEDGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVA-LFDTCYDLSSKSR 413
Query: 406 VVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVH 464
V VP ++ HF G +L L + L+ V+S C FA P+D +LGN QQ+G V
Sbjct: 414 VEVPTVSFHFANGNELPLPAKNYLIPVDSEGTFCFAFA--PTDSTLSILGNAQQQGTRVG 471
Query: 465 YDVAGRRLGFGPGNC 479
+D+A +GF P C
Sbjct: 472 FDLANSLVGFSPNKC 486
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 135/375 (36%), Positives = 198/375 (52%), Gaps = 31/375 (8%)
Query: 115 TKAFTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPF 173
T+ F P +G + EY+ V IG+P V ++LDTGS ++W QC PC C +Q DP
Sbjct: 133 TEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPX 192
Query: 174 FDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRM 233
F+P+ S +F+ + C + CK L +C + C Y+++Y DGS G + T+ +
Sbjct: 193 FEPTSSASFTSLSCETEQCKSL-------DVSECRNGTCLYEVSYGDGSYTVGDFVTETV 245
Query: 234 TIQEVN-GNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL-H 291
T+ + GN +GC NN G GA+G++GL G +S S+ N S F YCL
Sbjct: 246 TLGSTSLGN-------IAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVD 298
Query: 292 SPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTK 351
ST + F P T + P+ P F+++ LTG+SVGG LP+ + F +
Sbjct: 299 RDSDSTSTLDFNSPITPD---AVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSF-Q 354
Query: 352 LSTE------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT 405
+S + +DSGT +TR VY+ LR AF K + +G+ LFDTCYDLS+
Sbjct: 355 MSEDGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVA-LFDTCYDLSSKSR 413
Query: 406 VVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVH 464
V VP ++ HF G +L L + L+ V+S C FA P+D +LGN QQ+G V
Sbjct: 414 VEVPTVSFHFANGNELPLPAKNYLIPVDSEGTFCFAFA--PTDSTLSILGNAQQQGTRVG 471
Query: 465 YDVAGRRLGFGPGNC 479
+D+A +GF P C
Sbjct: 472 FDLANSLVGFSPNKC 486
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 148/441 (33%), Positives = 225/441 (51%), Gaps = 44/441 (9%)
Query: 71 CSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKA-IPDNFK------KTKAFT---- 119
C + GK R + +LE R + +++++A + DN + K KA T
Sbjct: 54 CFSRSLGKGRESTTLEMKHRELCSGKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTSSTT 113
Query: 120 --------FPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQR 170
P +GI + + Y + V +G +SL++DTGS +TW QC+PC C Q+
Sbjct: 114 EQSVSETQIPLTSGIKLESLNYIVTVELGGKN--MSLIVDTGSDLTWVQCQPCRSCYNQQ 171
Query: 171 DPFFDPSKSKTFSKIPCNSTTCKILLEWF----PPNGQDKCSSKECPYDIAYVDGSGETG 226
P +DPS S ++ + CNS+TC+ L+ P G + C Y ++Y DGS G
Sbjct: 172 GPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRG 231
Query: 227 FWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY- 285
A++ + + G F+ GC NN G G+SG+MGL R VS++S+T ++
Sbjct: 232 DLASESILL------GDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFN 285
Query: 286 --FFYCLHS-PYGSTGYITFGKPDTV--NKKFVKYTPIVTTPEQSEFYHITLTGISVGGE 340
F YCL S G++G ++FG +V N V YTP+V P+ FY + LTG S+GG
Sbjct: 286 GVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG- 344
Query: 341 RLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL 400
+ LK+S F + IDSGT+ITR P +Y A++ F K+ + G + DTC++L
Sbjct: 345 -VELKSSSFGR-GILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGY-SILDTCFNL 401
Query: 401 SAYKTVVVPKITIHFLGGVDLELDVRGT--LVVESVRQVCLGFALLPSDPNSILLGNVQQ 458
++Y+ + +P I + F G +LE+DV G V VCL A L + ++GN QQ
Sbjct: 402 TSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQ 461
Query: 459 RGYEVHYDVAGRRLGFGPGNC 479
+ V YD RLG NC
Sbjct: 462 KNQRVIYDTTQERLGIVGENC 482
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 148/441 (33%), Positives = 225/441 (51%), Gaps = 44/441 (9%)
Query: 71 CSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKA-IPDNFK------KTKAFT---- 119
C + GK R + +LE R + +++++A + DN + K KA T
Sbjct: 54 CFSRSLGKGRESTTLEMKHRELCSGKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTSSTT 113
Query: 120 --------FPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQR 170
P +GI + + Y + V +G +SL++DTGS +TW QC+PC C Q+
Sbjct: 114 EQSVSETQIPLTSGIKLESLNYIVTVELGGKN--MSLIVDTGSDLTWVQCQPCRSCYNQQ 171
Query: 171 DPFFDPSKSKTFSKIPCNSTTCKILLEWF----PPNGQDKCSSKECPYDIAYVDGSGETG 226
P +DPS S ++ + CNS+TC+ L+ P G + C Y ++Y DGS G
Sbjct: 172 GPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRG 231
Query: 227 FWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY- 285
A++ + + G F+ GC NN G G+SG+MGL R VS++S+T ++
Sbjct: 232 DLASESILL------GDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFN 285
Query: 286 --FFYCLHS-PYGSTGYITFGKPDTV--NKKFVKYTPIVTTPEQSEFYHITLTGISVGGE 340
F YCL S G++G ++FG +V N V YTP+V P+ FY + LTG S+GG
Sbjct: 286 GVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG- 344
Query: 341 RLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL 400
+ LK+S F + IDSGT+ITR P +Y A++ F K+ + G + DTC++L
Sbjct: 345 -VELKSSSFGR-GILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGY-SILDTCFNL 401
Query: 401 SAYKTVVVPKITIHFLGGVDLELDVRGT--LVVESVRQVCLGFALLPSDPNSILLGNVQQ 458
++Y+ + +P I + F G +LE+DV G V VCL A L + ++GN QQ
Sbjct: 402 TSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQ 461
Query: 459 RGYEVHYDVAGRRLGFGPGNC 479
+ V YD RLG NC
Sbjct: 462 KNQRVIYDSTQERLGIVGENC 482
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 142/412 (34%), Positives = 201/412 (48%), Gaps = 36/412 (8%)
Query: 87 EILRRDQQRLHLKNSRRLQKAIP-DNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYV 145
++L+R +R H + SR + +A P G E+ + VAIG P
Sbjct: 57 QLLQRAARRSHHRMSRLVARATGVKAVAGGGDLQVPVHAG---NGEFLMDVAIGTPALSY 113
Query: 146 SLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
+ ++DTGS + WTQCKPC+ C +Q P FDPS S T++ +PC+S C L P
Sbjct: 114 AAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSALCSDL-----PTSTC 168
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTGDQ-NG 263
+SK C Y Y D S G A++ T+ + + P GC D N GD
Sbjct: 169 TSASK-CGYTYTYGDASSTQGVLASETFTLGKEK-----KKLPGVAFGCGDTNEGDGFTQ 222
Query: 264 ASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGY--ITFGKPDTVNKKF-----VKYT 316
+G++GL RGP+S++S+ + F YCL S G + G + V+ T
Sbjct: 223 GAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDGDGKSPLLLGGSAAAISESAATAPVQTT 282
Query: 317 PIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFPAPVYS 371
P+V P Q FY+++LTG++VG R+ L AS F +DSGT IT Y
Sbjct: 283 PLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIVDSGTSITYLELQGYR 342
Query: 372 ALRSAFRKRMKKYKMGKGIEDLFDTCYDLSA--YKTVVVPKITIHFLGGVDLELDVRGTL 429
AL+ AF +M + G E D C+ A V VPK+ +HF GG DL+L +
Sbjct: 343 ALKKAFVAQMALPTV-DGSEIGLDLCFQGPAKGVDEVQVPKLVLHFDGGADLDLPAENYM 401
Query: 430 VVESVR-QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
V++S +CL A PS SI +GN QQ+ ++ YDVAG L F P CN
Sbjct: 402 VLDSASGALCLTVA--PSRGLSI-IGNFQQQNFQFVYDVAGDTLSFAPVQCN 450
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 122/359 (33%), Positives = 188/359 (52%), Gaps = 21/359 (5%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKI 185
V Y + +G P ++++DTGS +TW QC PC+ C +Q P FDP S T++ +
Sbjct: 129 VGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYASV 188
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C+++ C L+ N +S C Y +Y D S G +TD ++
Sbjct: 189 RCSASQCD-ELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSFGST------- 240
Query: 246 RYP-FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYIT 301
RYP F GC +N G ++G++GL R +S++ + S F YCL + STGY++
Sbjct: 241 RYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTA-ASTGYLS 299
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTI 361
G +T + YTP+ ++ + Y ITL+G+SVGG L + S ++ L T IDSGT+
Sbjct: 300 IGPYNT--GHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTV 357
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
ITR P V++AL A + M + + DTC++ A + + VP + + F GG +
Sbjct: 358 ITRLPTAVHTALSKAVAQAMAGAQRAPAFS-ILDTCFEGQASQ-LRVPTVAMAFAGGASM 415
Query: 422 ELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+L R L+ CL FA P+D +I +GN QQ+ + V YDVA R+GF G C+
Sbjct: 416 KLTTRNVLIDVDDSTTCLAFA--PTDSTAI-IGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 121/359 (33%), Positives = 189/359 (52%), Gaps = 21/359 (5%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKI 185
V Y + +G P ++++DTGS +TW QC PC+ C +Q P FDP S T++ +
Sbjct: 129 VGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYTSV 188
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C+++ C L+ N +S C Y +Y D S G+ +TD ++ +
Sbjct: 189 RCSASQCD-ELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSFGSTS------ 241
Query: 246 RYP-FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYIT 301
YP F GC +N G ++G++GL R +S++ + S F YCL + STGY++
Sbjct: 242 -YPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTA-ASTGYLS 299
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTI 361
G +T + YTP+ ++ + Y ITL+G+SVGG L + S ++ L T IDSGT+
Sbjct: 300 IGPYNT--GHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTV 357
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
ITR P V++AL A + M + + DTC++ A + + VP + + F GG +
Sbjct: 358 ITRLPTAVHTALSKAVAQAMAGAQRAPAFS-ILDTCFEGQASQ-LRVPTVVMAFAGGASM 415
Query: 422 ELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+L R L+ CL FA P+D +I +GN QQ+ + V YDVA R+GF G C+
Sbjct: 416 KLTTRNVLIDVDDSTTCLAFA--PTDSTAI-IGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 132/401 (32%), Positives = 189/401 (47%), Gaps = 46/401 (11%)
Query: 88 ILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSL 147
++ RD R+ R + P + + P + EY++ V +G P L
Sbjct: 88 LVARDNARVEHLEKRLVASTSPYLPEDLVSEVVPGVDD--GSGEYFVRVGVGSPPTDQYL 145
Query: 148 LLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKC 207
++D+GS + W QC+PC C Q DP FDP+ S +FS + C S C+ L
Sbjct: 146 VVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICRTLSGT---GCGGGG 202
Query: 208 SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGI 267
+ +C Y + Y DGS G A + +T+ G A +GC N+G GA+G+
Sbjct: 203 DAGKCDYSVTYGDGSYTKGELALETLTL------GGTAVQGVAIGCGHRNSGLFVGAAGL 256
Query: 268 MGLDRGPVSIISKTNIS---YFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQ 324
+GL G +S++ + + F YCL S G+ G +
Sbjct: 257 LGLGWGAMSLVGQLGGAAGGVFSYCLAS-RGAGGAGSL---------------------A 294
Query: 325 SEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITRFPAPVYSALRSAFR 378
S FY++ LTGI VGGERLPL+ S F +L+ + +D+GT +TR P Y+ALR AF
Sbjct: 295 SSFYYVGLTGIGVGGERLPLQDSLF-QLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFD 353
Query: 379 KRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVC 438
M + L DTCYDLS Y +V VP ++ +F G L L R LV C
Sbjct: 354 GAMGALPRSPAVS-LLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFC 412
Query: 439 LGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L FA PS +LGN+QQ G ++ D A +GFGP C
Sbjct: 413 LAFA--PSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 451
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 198 bits (503), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 143/395 (36%), Positives = 202/395 (51%), Gaps = 30/395 (7%)
Query: 99 KNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITW 157
NS Q +P ++ F P +G+ + + EY+I V++G P + + L++DTGS I W
Sbjct: 8 SNSHDRQTKVP-----SQDFQAPVISGLSLGSGEYFIRVSVGTPPRGMYLVMDTGSDILW 62
Query: 158 TQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIA 217
QC PC+ C Q D FDP KS T+S + CNS C L C +C Y +
Sbjct: 63 LQCAPCVSCYHQCDEVFDPYKSSTYSTLGCNSRQCLNL-------DVGGCVGNKCLYQVD 115
Query: 218 YVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSI 277
Y DGS TG +ATD +++ +G G LGC +N G GA+G++GL +GP+S
Sbjct: 116 YGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSF 175
Query: 278 ---ISKTNISYFFYCL---HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHIT 331
I+ N F YCL + + FG V V++TP + S FY++
Sbjct: 176 PNQINSENGGRFSYCLTGRDTDSTERSSLIFGDA-AVPPAGVRFTPQASNLRVSTFYYLK 234
Query: 332 LTGISVGGERLPLKASYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKM 386
+TGISVGG L + S F S IDSGT +TR Y++LR AFR +
Sbjct: 235 MTGISVGGSILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVL 294
Query: 387 GKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLP 445
LFDTCY+LS +V VP +T+HF GG DL+L LV V++ CL FA
Sbjct: 295 TTEFS-LFDTCYNLSDLSSVDVPTVTLHFQGGADLKLPASNYLVPVDNSSTFCLAFA--- 350
Query: 446 SDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
++GN+QQ+G+ V YD ++GF P C+
Sbjct: 351 GTTGPSIIGNIQQQGFRVIYDNLHNQVGFVPSQCD 385
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 197 bits (501), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 138/363 (38%), Positives = 193/363 (53%), Gaps = 33/363 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ + +G P +YV ++LDTGS I W QC PC C Q DP FDP KS++F+ I C S
Sbjct: 125 EYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIACRSP 184
Query: 191 TCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
C L C++++ C Y ++Y DGS G ++T+ +T + AR
Sbjct: 185 LCHRL-------DSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTR----VARVA 233
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL--HSPYGSTGYITFG 303
LGC +N G GA+G++GL RG +S S+T + F YCL S + FG
Sbjct: 234 --LGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKPSSMVFG 291
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLP-LKASYFTKLSTE-----ID 357
D+ + ++TP+V+ P+ FY++ L GISVGG R+P + AS F T ID
Sbjct: 292 --DSAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIID 349
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG 417
SGT +TR P Y A R AFR K LFDTC+DLS V VP + +HF
Sbjct: 350 SGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFS-LFDTCFDLSGKTEVKVPTVVLHFR- 407
Query: 418 GVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
G D+ L L+ V++ CL FA + ++GN+QQ+G+ V YD+AG R+GF P
Sbjct: 408 GADVSLPASNYLIPVDTSGNFCLAFAGTMGGLS--IIGNIQQQGFRVVYDLAGSRVGFAP 465
Query: 477 GNC 479
C
Sbjct: 466 HGC 468
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 130/358 (36%), Positives = 196/358 (54%), Gaps = 28/358 (7%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ V +G+P + + ++LDTGS +TW QC+PC C Q DP +DPS S +++ + C+S
Sbjct: 162 EYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPSVSTSYATVGCDSP 221
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C+ L N S+ C Y++AY DGS G +AT+ +T+ + A
Sbjct: 222 RCRDLDAAACRN-----STGSCLYEVAYGDGSYTVGDFATETLTLGDSAPVSNVA----- 271
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL---HSPYGSTGYITFGKPDT 307
+GC +N G GA+G++ L GP+S S+ + + F YCL SP ST + FG
Sbjct: 272 IGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPSSST--LQFGD--- 326
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTII 362
+++ P++ +P + FY++ L+GISVGGE L + +S F +DSGT +
Sbjct: 327 -SEQPAVTAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSGTAV 385
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
TR + Y ALR AF + + G+ LFDTCYDL+ +V VP + + F GG +L+
Sbjct: 386 TRLQSGAYGALREAFVQGTQSLPRASGVS-LFDTCYDLAGRSSVQVPAVALWFEGGGELK 444
Query: 423 LDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L + L+ V++ CL FA S P SI +GNVQQ+G V +D A +GF C
Sbjct: 445 LPAKNYLIPVDAAGTYCLAFAGT-SGPVSI-IGNVQQQGVRVSFDTAKNTVGFTADKC 500
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 147/420 (35%), Positives = 212/420 (50%), Gaps = 37/420 (8%)
Query: 79 SRNTPSLEEILRRDQQR---LHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYI 134
SR+ + I R Q L SR Q +P ++ F P +G+ + + EY+I
Sbjct: 6 SRDNLRVASIHGRINQTVNGLTRSRSRDRQTKVP-----SQDFQAPVVSGLSLGSGEYFI 60
Query: 135 VVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
+++G P + + L++DTGS I W QC PC++C Q D FDP KS T+S + C++ C
Sbjct: 61 RISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTRQCLN 120
Query: 195 LLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
L C + +C Y + Y DGS TG + TD +++ +G G LGC
Sbjct: 121 L-------DIGTCQANKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCG 173
Query: 255 DNNTGDQNGASGIMGLDRGPVSI---ISKTNISYFFYCL-----HSPYGSTGYITFGKPD 306
+N G GA+G++GL +GP+S + N F YCL S GS+ + FG+
Sbjct: 174 HDNEGYFVGAAGLLGLGKGPLSFPNQVDPQNGGRFSYCLTDRETDSTEGSS--LVFGEA- 230
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS-----TEIDSGTI 361
V ++TP + FY++ +TGISVGG L + S F S IDSGT
Sbjct: 231 AVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSGTS 290
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
+TR Y++LR AFR G LFDTCYDLS +V VP +T+HF GG DL
Sbjct: 291 VTRLQNAAYASLRDAFRAGTSDLAPTAGFS-LFDTCYDLSGLASVDVPTVTLHFQGGTDL 349
Query: 422 ELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+L L+ V++ CL FA ++GN+QQ+G+ V YD ++GF P CN
Sbjct: 350 KLPASNYLIPVDNSNTFCLAFA---GTTGPSIIGNIQQQGFRVIYDNLHNQVGFVPSQCN 406
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 142/410 (34%), Positives = 211/410 (51%), Gaps = 40/410 (9%)
Query: 89 LRRDQQRLHLKNSRRLQKAIP----DNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQ 143
L+RD +R+ ++ L IP + + F+ +G+ + EY+ + +G P +
Sbjct: 96 LQRDSRRV--RSIATLAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPAR 153
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
YV ++LDTGS I W QC PC C Q DP FDP KSKT++ IPC+S C+ L
Sbjct: 154 YVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRL-------D 206
Query: 204 QDKCSS--KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ 261
C++ K C Y ++Y DGS G ++T+ +T + G LGC +N G
Sbjct: 207 SAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKG------VALGCGHDNEGLF 260
Query: 262 NGASGIMGLDRGPVSIISKTNISY---FFYCL--HSPYGSTGYITFGKPDTVNKKFVKYT 316
GA+G++GL +G +S +T + F YCL S + FG + + ++T
Sbjct: 261 VGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFG--NAAVSRIARFT 318
Query: 317 PIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS------TEIDSGTIITRFPAPVY 370
P+++ P+ FY++ L GISVGG R+P + KL IDSGT +TR P Y
Sbjct: 319 PLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAY 378
Query: 371 SALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV 430
A+R AFR K K LFDTC+DLS V VP + +HF D+ L L+
Sbjct: 379 IAMRDAFRVGAKTLKRAPNFS-LFDTCFDLSNMNEVKVPTVVLHFR-RADVSLPATNYLI 436
Query: 431 -VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
V++ + C FA + ++GN+QQ+G+ V YD+A R+GF PG C
Sbjct: 437 PVDTNGKFCFAFAGTMGGLS--IIGNIQQQGFRVVYDLASSRVGFAPGGC 484
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 131/373 (35%), Positives = 182/373 (48%), Gaps = 40/373 (10%)
Query: 128 AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPC 187
A+ EY+ V +G P L++DTGS + W QCKPC+HC +Q P +DP S T+++ PC
Sbjct: 95 ASGEYFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLSPLYDPRGSSTYAQTPC 154
Query: 188 NSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY 247
+ C+ P D ++ C Y I Y D S +G ATDR+ G
Sbjct: 155 SPPQCRN------PQTCDG-TTGGCGYRIVYGDASSTSGNLATDRLVFSNDTSVGNVT-- 205
Query: 248 PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNIS---YFFYCLHS---PYGSTGYIT 301
LGC +N G A+G++G+ RG S ++ S YF YCL S+ Y+
Sbjct: 206 ---LGCGHDNEGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSSSYLV 262
Query: 302 FGK--PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE---- 355
FG+ P+ + F TP+ + P + Y++ + G SVGGE P+ LS +
Sbjct: 263 FGRTAPEPPSSVF---TPLRSNPRRPSLYYVDMVGFSVGGE--PVTGFSNASLSLDPATG 317
Query: 356 -----IDSGTIITRFPAPVYSALRSAFRKRMKKY---KMGKGIEDLFDTCYDLSAYKTVV 407
+DSGT ITRF Y ALR AF R K K+G+GI +FD CYDL
Sbjct: 318 RGGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGIS-VFDACYDLRGVAVAD 376
Query: 408 VPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYD 466
P + +HF GG D+ L LV ES R C D S+ +GNV Q+ + V +D
Sbjct: 377 APGVVLHFAGGADVALPPENYLVPEESGRYHCFALEAAGHDGLSV-IGNVLQQRFRVVFD 435
Query: 467 VAGRRLGFGPGNC 479
V R+GF P C
Sbjct: 436 VENERVGFEPNGC 448
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 126/375 (33%), Positives = 182/375 (48%), Gaps = 39/375 (10%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ +V +G P L++DTGS + W QC PC C QR FDP +S T+ ++PC+S
Sbjct: 85 EYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSP 144
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE---VNGNGYFARY 247
C+ L FP + C Y +AY DGS TG ATD++ VN
Sbjct: 145 QCRAL--RFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAFANDTYVNN------- 195
Query: 248 PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLH---SPYGSTGYIT 301
LGC +N G + A+G++G+ RG +SI ++ +Y F YCL S + Y+
Sbjct: 196 -VTLGCGRDNEGLFDSAAGLLGVARGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLV 254
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------ 355
FG+ T +T +++ P + Y++ + G SVGGER+ ++ L T
Sbjct: 255 FGR--TPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGV 312
Query: 356 -IDSGTIITRFPAPVYSAL--RSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKIT 412
+DSGT I+RF Y+AL R R + G +FD CYDL P I
Sbjct: 313 VVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIV 372
Query: 413 IHFLGGVDLE-------LDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHY 465
+HF GG D+ L V G + + CLGF +D ++GNVQQ+G+ V +
Sbjct: 373 LHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEA--ADDGLSVIGNVQQQGFRVVF 430
Query: 466 DVAGRRLGFGPGNCN 480
DV R+GF P C
Sbjct: 431 DVEKERIGFAPKGCT 445
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 126/375 (33%), Positives = 182/375 (48%), Gaps = 39/375 (10%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ +V +G P L++DTGS + W QC PC C QR FDP +S T+ ++PC+S
Sbjct: 85 EYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSP 144
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE---VNGNGYFARY 247
C+ L FP + C Y +AY DGS TG ATD++ VN
Sbjct: 145 QCRAL--RFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFANDTYVNN------- 195
Query: 248 PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLH---SPYGSTGYIT 301
LGC +N G + A+G++G+ RG +SI ++ +Y F YCL S + Y+
Sbjct: 196 -VTLGCGRDNEGLFDSAAGLLGVGRGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLV 254
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------ 355
FG+ T +T +++ P + Y++ + G SVGGER+ ++ L T
Sbjct: 255 FGR--TPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGV 312
Query: 356 -IDSGTIITRFPAPVYSAL--RSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKIT 412
+DSGT I+RF Y+AL R R + G +FD CYDL P I
Sbjct: 313 VVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIV 372
Query: 413 IHFLGGVDLE-------LDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHY 465
+HF GG D+ L V G + + CLGF +D ++GNVQQ+G+ V +
Sbjct: 373 LHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEA--ADDGLSVIGNVQQQGFRVVF 430
Query: 466 DVAGRRLGFGPGNCN 480
DV R+GF P C
Sbjct: 431 DVEKERIGFAPKGCT 445
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 129/366 (35%), Positives = 188/366 (51%), Gaps = 29/366 (7%)
Query: 132 YYIVVAIGKP-KQYVSLLLDTGSGITWTQCKPC--IHCSQQRDPFFDPSKSKTFSKIPCN 188
Y +A+G + +++++DTGS +TW QC+PC C QRDP FDP+ S TF+ +PC
Sbjct: 180 YVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPCG 239
Query: 189 STTCKILLEWFPPNGQDKC------SSKECPYDIAYVDGSGETGFWATDRM---TIQEVN 239
S C L+ C S + C Y ++Y DGS G A D + T +++
Sbjct: 240 SPACAASLK-DATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGLGTTTKLD 298
Query: 240 GNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS 296
G F+ GC +N G G +G+MGL R +S++S+T + F YCL + S
Sbjct: 299 G--------FVFGCGLSNRGLFGGTAGLMGLGRTDLSLVSQTAARFGGVFSYCLPATTTS 350
Query: 297 TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEI 356
TG ++ G + + + YT ++ P Q FY I +TG +VG L A F + +
Sbjct: 351 TGSLSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVG-GGAALTAPGFGAGNVLV 409
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFL 416
DSGT+ITR VY A+R+ F +R + Y G + D CYDL+ V VP +T+
Sbjct: 410 DSGTVITRLAPSVYKAVRAEFARRFE-YPAAPGFS-ILDACYDLTGRDEVNVPLLTLTLE 467
Query: 417 GGVDLELDVRGTLVV--ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
GG + +D G L V + QVCL A LP + + ++GN QQR V YD G RLGF
Sbjct: 468 GGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVVYDTVGSRLGF 527
Query: 475 GPGNCN 480
+C
Sbjct: 528 ADEDCT 533
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 129/371 (34%), Positives = 180/371 (48%), Gaps = 35/371 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + V IG P L+ DTGS + W QC PC C Q DP FDP+ S +FS +PCNS
Sbjct: 122 EYLVRVGIGSPPLEQHLVADTGSDVIWVQCSPCSDCYAQGDPLFDPANSASFSPVPCNSG 181
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQ---EVNGNGYFARY 247
C+ + + EC Y ++Y D S G A + +T+ EV G
Sbjct: 182 VCRAAARYS--SSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLDGGTEVQG------- 232
Query: 248 PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISK---TNISYFFYCLH----SPYGSTGYI 300
+GC N G A+G++GL GP+S++ + F YCL +G +
Sbjct: 233 -VAMGCGHENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGSGSGSL 291
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE----- 355
G+ D V + P+V P+ FY++ + G+ V GERL L+ F
Sbjct: 292 VLGREDAAPTGAV-WVPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDGGGGVV 350
Query: 356 IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHF 415
+D+GT +TR PA Y+ALR AF ++ LFDTCYDLS Y +V VP + ++F
Sbjct: 351 MDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSLFDTCYDLSGYASVRVPTVALYF 410
Query: 416 LG------GVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVA 468
G L L R LV V+ CL FA + S P+ +LGN+QQ+G E+ D A
Sbjct: 411 GGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPS--ILGNIQQQGIEITVDSA 468
Query: 469 GRRLGFGPGNC 479
+GFGP C
Sbjct: 469 SGYVGFGPATC 479
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 139/363 (38%), Positives = 193/363 (53%), Gaps = 33/363 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ + +G P +Y+ ++LDTGS + W QCKPC C Q D FDPSKSK+F+ IPC S
Sbjct: 129 EYFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFAGIPCYSP 188
Query: 191 TCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
C+ L CS K C Y ++Y DGS G ++T+ +T + A
Sbjct: 189 LCRRL-------DSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTFRRA------AVPR 235
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGST--GYITFG 303
+GC +N G GA+G++GL RG +S ++T + F YCL S I FG
Sbjct: 236 VAIGCGHDNEGLFVGAAGLLGLGRGGLSFPTQTGTRFNNKFSYCLTDRTASAKPSSIVFG 295
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLP-LKASYFTKLSTE-----ID 357
D+ + ++TP+V P+ FY++ L GISVGG + + AS+F ST ID
Sbjct: 296 --DSAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIID 353
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG 417
SGT +TR P Y +LR AFR K LFDTCYDLS V VP + +HF G
Sbjct: 354 SGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFS-LFDTCYDLSGLSEVKVPTVVLHFRG 412
Query: 418 GVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
D+ L LV V++ C FA S + ++GN+QQ+G+ V +D+AG R+GF P
Sbjct: 413 A-DVSLPAANYLVPVDNSGSFCFAFAGTMSGLS--IIGNIQQQGFRVVFDLAGSRVGFAP 469
Query: 477 GNC 479
C
Sbjct: 470 RGC 472
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 121/361 (33%), Positives = 189/361 (52%), Gaps = 26/361 (7%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKI 185
V Y + +G P ++++DTGS +TW QC PC+ C +Q P +DP S T++ +
Sbjct: 129 VGVGNYVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATV 188
Query: 186 PCNSTTC-KILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGY 243
PC+++ C ++ P+ CS + C Y +Y D S G+ + D ++ G+G
Sbjct: 189 PCSASQCDELQAATLNPS---ACSVRNVCIYQASYGDSSFSVGYLSRDTVSF----GSG- 240
Query: 244 FARYP-FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGY 299
YP F GC +N G ++G++GL R +S++ + S F YCL +P STGY
Sbjct: 241 --SYPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTP-ASTGY 297
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
++ G YTP+ ++ + Y +TL+G+SVGG L + + ++ L T IDSG
Sbjct: 298 LSIGP---YTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEYSSLPTIIDSG 354
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
T+ITR P VY+AL A M + + DTC+ A + + VP + + F GG
Sbjct: 355 TVITRLPTAVYTALSKAVAAAMVGVQSAPAFS-ILDTCFQGQASQ-LRVPAVAMAFAGGA 412
Query: 420 DLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L+L + L+ CL FA P+D +I +GN QQ+ + V YDVA R+GF G C
Sbjct: 413 TLKLATQNVLIDVDDSTTCLAFA--PTDSTTI-IGNTQQQTFSVVYDVAQSRIGFAAGGC 469
Query: 480 N 480
+
Sbjct: 470 S 470
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 134/364 (36%), Positives = 187/364 (51%), Gaps = 35/364 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ + +G P +YV ++LDTGS I W QC PCI C Q DP FDP+KS++F+ IPC S
Sbjct: 144 EYFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKSRSFANIPCGSP 203
Query: 191 TCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
C+ L +P CS+K+ C Y ++Y DGS G ++T+ +T +
Sbjct: 204 LCRRL--DYP-----GCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRGTRVG------R 250
Query: 249 FLLGCTDNNTG----DQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST--GYITF 302
+LGC +N G G P I + N S F YCL S+ I F
Sbjct: 251 VVLGCGHDNEGLFVGAAGLLGLGRGRLSFPSQIGRRFN-SKFSYCLGDRSASSRPSSIVF 309
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLP-LKASYFTKLSTE-----I 356
G D+ + ++TP+++ P+ FY++ L GISVGG R+ + AS F ST I
Sbjct: 310 G--DSAISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGNGGVII 367
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFL 416
DSGT +TR Y ALR AF K LFDTC+DLS V VP + +HF
Sbjct: 368 DSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFS-LFDTCFDLSGKTEVKVPTVVLHFR 426
Query: 417 GGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
G D+ L L+ V++ C FA S + ++GN+QQ+G+ V YD+A R+GF
Sbjct: 427 -GADVPLPASNYLIPVDNSGSFCFAFAGTASGLS--IIGNIQQQGFRVVYDLATSRVGFA 483
Query: 476 PGNC 479
P C
Sbjct: 484 PRGC 487
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 123/364 (33%), Positives = 180/364 (49%), Gaps = 30/364 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
E+ + ++IG P + ++DTGS + WTQCKPC+ C Q P FDPS S T+S +PC+S+
Sbjct: 117 EFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYSTLPCSSS 176
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C L P ++K+C Y Y D S G A + T+ + G
Sbjct: 177 LCSDL-----PTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKTKLPG------VA 225
Query: 251 LGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST------GYITFG 303
GC D N GD +G++GL RGP+S++S+ + F YCL S ++ G +
Sbjct: 226 FGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLGKFSYCLTSLDDTSKSPLLLGSLAAI 285
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS-----TEIDS 358
DT + ++ TP++ P Q FY++TL ++VG R+PL S F +DS
Sbjct: 286 STDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVIVDS 345
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYD--LSAYKTVVVPKITIHFL 416
GT IT Y L+ AF +M K + G D C+ S V VPK+ +HF
Sbjct: 346 GTSITYLELQGYRPLKKAFAAQM-KLPVADGSAVGLDLCFKAPASGVDDVEVPKLVLHFD 404
Query: 417 GGVDLELDVRGTLVVESVR-QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
GG DL+L +V++S +CL ++ S SI +GN QQ+ + YDV L F
Sbjct: 405 GGADLDLPAENYMVLDSASGALCL--TVMGSRGLSI-IGNFQQQNIQFVYDVDKDTLSFA 461
Query: 476 PGNC 479
P C
Sbjct: 462 PVQC 465
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 120/345 (34%), Positives = 178/345 (51%), Gaps = 28/345 (8%)
Query: 146 SLLLDTGSGITWTQCKPC--IHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
++++D+GS + W QC+PC + C QRDP FDP+ S T++ +PC+S C L P
Sbjct: 82 TVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARL----GPYR 137
Query: 204 QDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD--Q 261
+ ++ +C + I Y +G+ TG +++D +T+ Y FL GC + G
Sbjct: 138 RGCLANSQCQFGITYANGATATGTYSSDDLTLGP-----YDVVRGFLFGCAHADQGSTFS 192
Query: 262 NGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKP---DTVNKKFVKY 315
+G + L G S + +T Y F YC+ S G+I FG P + FV
Sbjct: 193 YDVAGTLALGGGSQSFVQQTASQYSRVFSYCVPPSTSSFGFIMFGVPPQRAALVPTFVS- 251
Query: 316 TPIVTTPEQS-EFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALR 374
TP++++ S FY + L I V G LP+ + F+ S+ IDS T+I+R P Y ALR
Sbjct: 252 TPLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVFSA-SSVIDSATVISRIPPTAYQALR 310
Query: 375 SAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESV 434
+AFR M Y+ + + DTCYD S +++ +P I + F GG + LD G L+
Sbjct: 311 AAFRSAMTMYRPAPPVS-ILDTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILL---- 365
Query: 435 RQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
Q CL FA SD +GNVQQR EV YDV G+ + F C
Sbjct: 366 -QGCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 194 bits (493), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 129/373 (34%), Positives = 183/373 (49%), Gaps = 26/373 (6%)
Query: 119 TFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDP 176
T P TG + E+ + V G P Q +L +DTGS ++W QC PC HC +Q DP FDP
Sbjct: 147 TIPDSTGTSLDTLEFVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDP 206
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTI 235
+KS T+S +PC C KCS S C Y + Y DGS G + + +++
Sbjct: 207 TKSATYSAVPCGHPQCAA--------AGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSL 258
Query: 236 QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS 292
FA GC N G+ G G++GL RG +S+ S+ ++ F YCL S
Sbjct: 259 SSTRDLPGFA-----FGCGQTNLGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPS 313
Query: 293 PYGSTGYITFGK--PDTVNKKF-VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF 349
+ GY+T G P N V+YT ++ + Y + + I +GG LP+ + F
Sbjct: 314 YDTTHGYLTMGSTTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVF 373
Query: 350 TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVP 409
T+ T DSGTI+T P Y++LR F+ M +YK D FDTCYD + + + +P
Sbjct: 374 TRDGTLFDSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAY-DPFDTCYDFTGHNAIFMP 432
Query: 410 KITIHFLGGVDLELDVRGTLVV---ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYD 466
+ F G +L L+ + CL F PS ++GN QQRG EV YD
Sbjct: 433 AVAFKFSDGAVFDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYD 492
Query: 467 VAGRRLGFGPGNC 479
VA ++GFG C
Sbjct: 493 VAAEKIGFGQFTC 505
>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
Length = 452
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 145/460 (31%), Positives = 209/460 (45%), Gaps = 61/460 (13%)
Query: 35 VSVSSLIPPTVCNRTRTALPQGPGKVS--LEVLGRYGPCSKLNQGKSRNTPSLEEILRRD 92
VS +S +P + C+ PQ S L + R+GPC+ ++ S PS+ + LR D
Sbjct: 39 VSAASFVPSSTCSSPDPVPPQRRNGTSAVLRLTHRHGPCAP-SRASSLAAPSVADTLRAD 97
Query: 93 QQRLHLKNSRRLQKAIP---DNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLL 148
Q+R RR+ P D+ A T PA G + Y + ++G P ++
Sbjct: 98 QRRAEYIL-RRVSGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLGTPGVAQTME 156
Query: 149 LDTGSGITWTQCKPCI---HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
+DTGS ++W QCKPC C Q+DP FDP++S +++ +PC C
Sbjct: 157 VDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCA------------ 204
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGAS 265
G G A V G F GC +G NG
Sbjct: 205 ---------------GLGIYAASACSAAQCGAVQG--------FFFGCGHAQSGLFNGVD 241
Query: 266 GIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITF--GKPDTVNKKFVKYTPIVT 320
G++GL R S++ +T +Y F YCL + + GY+T G P F T ++
Sbjct: 242 GLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGF-STTQLLP 300
Query: 321 TPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKR 380
+P +Y + LTGISVGG++L + AS F + T++TR P Y+ALRSAFR
Sbjct: 301 SPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTG-TVVTRLPPTAYAALRSAFRSG 359
Query: 381 MKKYKMGKGIED-LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCL 439
M Y + + DTCY+ + Y TV +P + + F G + L G L CL
Sbjct: 360 MASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGILSFG-----CL 414
Query: 440 GFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
FA SD +LGNVQQR +EV D G +GF P +C
Sbjct: 415 AFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 452
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 119/365 (32%), Positives = 191/365 (52%), Gaps = 29/365 (7%)
Query: 135 VVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
V +G ++++DT S +TW QC PC C Q+ P FDPS S +++ +PC+S +C
Sbjct: 144 VATVGLGGGEATVIVDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDA 203
Query: 195 LLEWFPPN---GQDKCSS---KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
L + G C + C Y ++Y DGS G A DR+++ +G
Sbjct: 204 LQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSLAGEVIDG------ 257
Query: 249 FLLGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISY---FFYC--LHSPYGSTGYITF 302
F+ GC +N G G SG+MGL R +S++S+T + F YC L ++G +
Sbjct: 258 FVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESDASGSLVL 317
Query: 303 GKPDTV--NKKFVKYTPIVTTPE---QSEFYHITLTGISVGGERLPLKASYFTKLSTEID 357
G + N V YT +V+ + Q FY + LTGI+VGG+ ++++ F+ + +D
Sbjct: 318 GDDPSAYRNSTPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQE--VESTGFSARAI-VD 374
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG 417
SGT+IT VY+A+R+ F ++ +Y G + DTC++++ K V VP +T+ F G
Sbjct: 375 SGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFS-ILDTCFNMTGLKEVQVPSLTLVFDG 433
Query: 418 GVDLELDVRGTL--VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
G ++E+D G L V QVCL A L S+ + ++GN QQ+ V +D + ++GF
Sbjct: 434 GAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDETSIIGNYQQKNLRVVFDTSASQVGFA 493
Query: 476 PGNCN 480
C
Sbjct: 494 QETCG 498
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 115/338 (34%), Positives = 172/338 (50%), Gaps = 10/338 (2%)
Query: 147 LLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
++LDTGS ++W QC+PC ++C Q DP +DPS SKT+ K+ C S C L +
Sbjct: 1 MILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGAS 265
+ S C Y +Y D S G+ + D +T+ F GC +N G A+
Sbjct: 61 ETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQ-----TLPQFTYGCGQDNQGLFGRAA 115
Query: 266 GIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTP 322
GI+GL R +S++++ + Y F YCL + + F +++ K+TP++T
Sbjct: 116 GIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDS 175
Query: 323 EQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMK 382
+ Y + LT I+V G L L A+ + ++ T IDSGT+ITR P +Y+ALR AF K M
Sbjct: 176 KNPSLYFLRLTAITVSGRPLDLAAAMY-RVPTLIDSGTVITRLPMSMYAALRQAFVKIMS 234
Query: 383 KYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFA 442
+ DTC+ S VP+I + F GG DL L L+ CL FA
Sbjct: 235 TKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSILIEADKGITCLAFA 294
Query: 443 LLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
++GN QQ+ Y + YDV+ R+GF PG+C+
Sbjct: 295 GSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSCH 332
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 140/445 (31%), Positives = 220/445 (49%), Gaps = 31/445 (6%)
Query: 54 PQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFK 113
P+ G +SLE++ R + + + L E L+RD+QR+ S+ +
Sbjct: 50 PRDGGTLSLELIHRNSLLREAKEKLHTHEQLLLETLQRDEQRVRWIESKAQLAGKKKDEA 109
Query: 114 KTKAFTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP 172
+ P +G++ + EY++ + +G P + + +++DTGS + W QC+PC C +Q DP
Sbjct: 110 SSTDLNGPVTSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADP 169
Query: 173 FFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDR 232
FDP S +F +IPC S CK LE +G +S+ C Y +AY DGS G +++D
Sbjct: 170 IFDPRNSSSFQRIPCLSPLCKA-LEIHSCSGSRGATSR-CSYQVAYGDGSFSVGDFSSDL 227
Query: 233 MTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISK--------TNIS 284
T+ G G A GC +N G GA+G++GL G +S S+ + +
Sbjct: 228 FTL----GTGSKA-MSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTAN 282
Query: 285 YFFYCL---HSPYG-STGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE 340
F YCL +P S+ + FG + +P++ P+ FY+ + G+SVGG
Sbjct: 283 SFSYCLVDRSNPMTRSSSSLIFGAAAIPST--AALSPLLKNPKLDTFYYAAMIGVSVGGA 340
Query: 341 RLP-----LKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFD 395
+LP L+ S IDSGT +TRFP VY+ +R AFR LFD
Sbjct: 341 QLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAPRYS-LFD 399
Query: 396 TCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLG 454
TCY+ S +V VP + +HF G DL+L L+ + + CL FA P+ ++G
Sbjct: 400 TCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFA--PTSMELGIIG 457
Query: 455 NVQQRGYEVHYDVAGRRLGFGPGNC 479
N+QQ+ + + +D+ L F P C
Sbjct: 458 NIQQQSFRIGFDLQKSHLAFAPQQC 482
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 130/359 (36%), Positives = 188/359 (52%), Gaps = 31/359 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ V +G+P + ++LDTGS I W QC+PC C QQ DP FDP S +F+ +PC S
Sbjct: 154 EYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQ 213
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C+ L C + +C Y ++Y DGS G + T+ +T GN
Sbjct: 214 QCQAL-------ETSGCRASKCLYQVSYGDGSFTVGEFVTETLTF----GNSGMIN-DVA 261
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL----HSPYGSTGYITFGKPD 306
+GC +N G G++G++GL GP+S+ S+ S F YCL S + + D
Sbjct: 262 VGCGHDNEGLFVGSAGLLGLGGGPLSLTSQMKASSFSYCLVDRDSSSSSDLEFNSAAPSD 321
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTI 361
+VN P++ + + FY++ LTG+SVGG+ L + + F + +DSGT
Sbjct: 322 SVN------APLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTA 375
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
ITR Y+ LR AF R K G LFDTCYDLS+ V +P ++ F GG L
Sbjct: 376 ITRLQTQAYNTLRDAFVSRTPYLKKTNGFA-LFDTCYDLSSQSRVTIPTVSFEFAGGKSL 434
Query: 422 ELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+L + L+ V+SV C FA P+ + ++GNVQQ+G VHYD+A +GF P C
Sbjct: 435 QLPPKNYLIPVDSVGTFCFAFA--PTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 137/416 (32%), Positives = 202/416 (48%), Gaps = 36/416 (8%)
Query: 87 EILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQYV 145
E+L+ QR + +R + A K P +G+ + EY+ + +G P
Sbjct: 83 ELLKHRLQRDKRRAARISEAAGAGGGNGRKGVAAPVVSGLAQGSGEYFTKIGVGTPATQA 142
Query: 146 SLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
++LDTGS + W QC PC C +Q P FDP +S ++ + C + C+ L +G
Sbjct: 143 LMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRL-----DSGGC 197
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGAS 265
C Y +AY DGS G + T+ +T G AR LGC +N G A+
Sbjct: 198 DLRRGACMYQVAYGDGSVTAGDFVTETLTFA---GGARVAR--VALGCGHDNEGLFVAAA 252
Query: 266 GIMGLDRGPVSIISKTNISY---FFYCL----HSPYGS------TGYITFGKPDTVNKKF 312
G++GL RG +S ++ + Y F YCL S G+ + ++FG +V
Sbjct: 253 GLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGA-GSVGASS 311
Query: 313 VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-------IDSGTIITRF 365
+TP+V P FY++ L GISVGG R+P A +L +DSGT +TR
Sbjct: 312 ASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRL 371
Query: 366 PAPVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELD 424
YSALR AFR ++ G LFDTCYDL + V VP +++HF GG + L
Sbjct: 372 ARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALP 431
Query: 425 VRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L+ V+S C FA +D ++GN+QQ+G+ V +D G+R+GF P C
Sbjct: 432 PENYLIPVDSRGTFC--FAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 485
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 128/371 (34%), Positives = 189/371 (50%), Gaps = 28/371 (7%)
Query: 119 TFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI---HCSQQRDPFF 174
T PA G + Y + ++G P ++ +DTGS ++W QCKPC C Q+DP F
Sbjct: 34 TVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLF 93
Query: 175 DPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMT 234
DP++S +++ +PC C L + CS+ +C Y ++Y DGS TG +++D +T
Sbjct: 94 DPAQSSSYAAVPCGGPVCAGLGIYA----ASACSAAQCGYVVSYGDGSNTTGVYSSDTLT 149
Query: 235 IQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLH 291
+ + A F GC +G NG G++GL R S++ +T +Y F YCL
Sbjct: 150 LSASS-----AVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLP 204
Query: 292 SPYGSTGYITFG--KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF 349
+ + GY+T G P F T ++ +P +Y + LTGISVGG++L + AS F
Sbjct: 205 TKPSTAGYLTLGVGGPSGAAPGF-STTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAF 263
Query: 350 TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED-LFDTCYDLSAYKTVVV 408
+ T++TR P Y+ALRSAFR M Y + + DTCY+ + Y TV +
Sbjct: 264 AGGTVVDTG-TVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTL 322
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVA 468
P + + F G + L G L CL FA SD +LGNVQQR +EV D
Sbjct: 323 PNVALTFGSGATVTLGADGILSFG-----CLAFAPSGSDGGMAILGNVQQRSFEVRID-- 375
Query: 469 GRRLGFGPGNC 479
G +GF P +C
Sbjct: 376 GTSVGFKPSSC 386
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 119/366 (32%), Positives = 178/366 (48%), Gaps = 29/366 (7%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + V IG P +Y S ++DTGS + WTQC PC+ C +Q P+F+P+KS +++ +PC+S
Sbjct: 84 EYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSA 143
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C L C C Y Y D + G A + T + R F
Sbjct: 144 MCNALYS-------PLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSF- 195
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL---HSPYGST----GYITFG 303
GC + N G SG++G RG +S++S+ F YCL SP S Y T
Sbjct: 196 -GCGNMNAGTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPATSRLYFGAYATLN 254
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------ID 357
+T + V+ TP + P Y + +TGISV G+ LP+ S F T+ ID
Sbjct: 255 STNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIID 314
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL--SAYKTVVVPKITIHF 415
SGT +T P Y+ ++ AF + + D FDTC+ + V +P++ +HF
Sbjct: 315 SGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRRMVTLPEMVLHF 374
Query: 416 LGGVDLELDVRGTLVVE-SVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
G D+EL + +V++ +CL A+LPSD SI +G+ Q + + + YD+ L F
Sbjct: 375 -DGADMELPLENYMVMDGGTGNLCL--AMLPSDDGSI-IGSFQHQNFHMLYDLENSLLSF 430
Query: 475 GPGNCN 480
P CN
Sbjct: 431 VPAPCN 436
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 119/366 (32%), Positives = 178/366 (48%), Gaps = 29/366 (7%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + V IG P +Y S ++DTGS + WTQC PC+ C +Q P+F+P+KS +++ +PC+S
Sbjct: 87 EYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSA 146
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C L C C Y Y D + G A + T + R F
Sbjct: 147 MCNALYS-------PLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSF- 198
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL---HSPYGST----GYITFG 303
GC + N G SG++G RG +S++S+ F YCL SP S Y T
Sbjct: 199 -GCGNMNAGTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPATSRLYFGAYATLN 257
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------ID 357
+T + V+ TP + P Y + +TGISV G+ LP+ S F T+ ID
Sbjct: 258 STNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIID 317
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL--SAYKTVVVPKITIHF 415
SGT +T P Y+ ++ AF + + D FDTC+ + V +P++ +HF
Sbjct: 318 SGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRRMVTLPEMVLHF 377
Query: 416 LGGVDLELDVRGTLVVE-SVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
G D+EL + +V++ +CL A+LPSD SI +G+ Q + + + YD+ L F
Sbjct: 378 -DGADMELPLENYMVMDGGTGNLCL--AMLPSDDGSI-IGSFQHQNFHMLYDLENSLLSF 433
Query: 475 GPGNCN 480
P CN
Sbjct: 434 VPAPCN 439
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 192 bits (488), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 140/411 (34%), Positives = 208/411 (50%), Gaps = 33/411 (8%)
Query: 80 RNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV-AADEYYIVVAI 138
R++ ++ ++ R ++ +S L+ D+ K + P +G + EY+ V I
Sbjct: 96 RDSARVKSLVTRLDLAINSISSSDLKPLETDSEFKPEDLQSPIISGTSQGSGEYFSRVGI 155
Query: 139 GKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEW 198
GKP L+LDTGS + W QC PC C QQ DP F+P+ S +FS + CN+ C+ L
Sbjct: 156 GKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPASSASFSTLSCNTRQCRSL--- 212
Query: 199 FPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNT 258
+C + C Y+++Y DGS G + T+ +T+ G +GC NN
Sbjct: 213 ----DVSECRNDTCLYEVSYGDGSYTVGDFVTETITL------GSAPVDNVAIGCGHNNE 262
Query: 259 GDQNGASGIMGLDRGPVSIISKTNISYFFYCL-HSPYGSTGYITFGK---PDTVNKKFVK 314
G GA+G++GL G +S S+ N + F YCL S + F P+ V+
Sbjct: 263 GLFVGAAGLLGLGGGSLSFPSQINATSFSYCLVDRDSESASTLEFNSTLPPNAVSA---- 318
Query: 315 YTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRFPAPV 369
P++ FY++ LTG+SVGGE + + S F + +DSGT ITR V
Sbjct: 319 --PLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITRLQTDV 376
Query: 370 YSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTL 429
Y++LR AF KR + GI LFDTCYDLS+ V VP ++ HF G +L L + L
Sbjct: 377 YNSLRDAFVKRTRDLPSTNGIA-LFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAKNYL 435
Query: 430 V-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
V ++S C FA P+ + ++GNVQQ+G V YD+ +GF P C
Sbjct: 436 VPLDSEGTFCFAFA--PTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 129/363 (35%), Positives = 182/363 (50%), Gaps = 33/363 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ + +G P +YV ++LDTGS + W QC PC C Q DP FDP+KS+T++ IPC +
Sbjct: 128 EYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTYAGIPCGAP 187
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C+ L P +K +K C Y ++Y DGS G ++T+ +T +
Sbjct: 188 LCRRLDS---PGCNNK--NKVCQYQVSYGDGSFTFGDFSTETLTFRRTRVT------RVA 236
Query: 251 LGCTDNNTG----DQNGASGIMGLDRGPVSIISKTNISYFFYCL--HSPYGSTGYITFGK 304
LGC +N G G PV + N F YCL S + FG
Sbjct: 237 LGCGHDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFN-QKFSYCLVDRSASAKPSSVVFG- 294
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLP-LKASYFTKLSTE------ID 357
D+ + ++TP++ P+ FY++ L GISVGG + L AS F +L ID
Sbjct: 295 -DSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLF-RLDAAGNGGVIID 352
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG 417
SGT +TR P Y ALR AFR K LFDTC+DLS V VP + +HF G
Sbjct: 353 SGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFS-LFDTCFDLSGLTEVKVPTVVLHFRG 411
Query: 418 GVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
D+ L L+ V++ C FA S + ++GN+QQ+G+ V +D+AG R+GF P
Sbjct: 412 A-DVSLPATNYLIPVDNSGSFCFAFAGTMSGLS--IIGNIQQQGFRVSFDLAGSRVGFAP 468
Query: 477 GNC 479
C
Sbjct: 469 RGC 471
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 141/480 (29%), Positives = 218/480 (45%), Gaps = 53/480 (11%)
Query: 34 IVSVSSLIPPTVCNRTRTA---LPQGPGKVSLEVLGRYGPCS----KLNQGKSRNTPSLE 86
+++ S++ P T C+ + A +P P + YGPCS N + S+
Sbjct: 35 VIATSTMKPKTFCSGHKVAPGDVPS-PNSTWAPLHHLYGPCSPAPSSANSTAADVAASMA 93
Query: 87 EILRRDQQRLHLKNSRRLQKAIPD-----------NFKKTKAFTFPAKTGIVAADEYYIV 135
+++ DQ+R +RL A D ++K + G V +
Sbjct: 94 DMVDDDQRRADYIQ-KRLTGATDDKQPMAFSSRTSQYEKNGQYATNGGLGSVPHLKSLST 152
Query: 136 VAI------GKPKQYVSLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPC 187
A G ++++D+GS ++W QCKPC C +QRDP FDP+ S T++ +PC
Sbjct: 153 TATTNSAPDGTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPC 212
Query: 188 NSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
S C L + + CS+ +C + I Y DGS TG ++ D +T+ Y
Sbjct: 213 TSAACAQLGPY-----RRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGP-----YDVI 262
Query: 247 YPFLLGCTDNNTGD--QNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYIT 301
F GC + G +G + L G S++ +T Y F YCL S G++
Sbjct: 263 RGFRFGCAHADRGSAFDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLV 322
Query: 302 FGKPDTVNKKFVKY--TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
G P + + TP++++ FY + L I V G L + + F+ S+ IDS
Sbjct: 323 LGVPPERAQLIPSFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSA-SSVIDSS 381
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
TII+R P Y ALR+AFR M Y+ + + DTCYD + +++ +P I + F GG
Sbjct: 382 TIISRLPPTAYQALRAAFRSAMTMYRAAPPVS-ILDTCYDFTGVRSITLPSIALVFDGGA 440
Query: 420 DLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ LD G L+ CL FA SD +GNVQQ+ EV YDV + + F C
Sbjct: 441 TVNLDAAGILLGS-----CLAFAPTASDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495
>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
Length = 486
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 158/494 (31%), Positives = 238/494 (48%), Gaps = 63/494 (12%)
Query: 24 ANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGR-YGPCSKLNQGKS--- 79
A D +L + +V VS L P +P P S L R GPCS +G +
Sbjct: 18 AADEELELT-VVDVSLLQEPRASCSGHRVMPPHPYNNSWVPLFRPLGPCSPSFKGAAAAA 76
Query: 80 -RNTPSLEEILRRDQQRLH-----LKNSRRLQKAIPDNFKK-----------TKAFTFPA 122
R PSL ++LR+D+ R+H + S R +A +FK+ A +
Sbjct: 77 ARTKPSLADVLRQDRLRVHHIHRRVSGSSRGARASKGSFKEPVSVEETQLHHQAAISVEV 136
Query: 123 KTGIVAADE----YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSK 178
T +++ + G V+++LDT + W +C PC +Q D +DP++
Sbjct: 137 GTSQTSSEPSSGIHPAAATDGSSSPPVTVVLDTAGDVPWMRCVPCTF-AQCAD--YDPTR 193
Query: 179 SKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYV-DGSGETGFWATDRMTIQ- 236
S T+S PCNS+ CK L + NG D ++ +C Y + D +G +++D +TI
Sbjct: 194 SSTYSAFPCNSSACKQLGRY--ANGCD--ANGQCQYMVVTAGDSFTTSGTYSSDVLTINS 249
Query: 237 --EVNGNGYFARYPFLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISY---FFYCL 290
V G F GC+ N G +N A GIM L RG S++++T+ +Y F YCL
Sbjct: 250 GDRVEG--------FRFGCSQNEQGSFENQADGIMALGRGVQSLMAQTSSTYGDAFSYCL 301
Query: 291 HSPYGSTGYITFGKPDTVNKKFVKYTPIV-----TTPEQSEFYHITLTGISVGGERLPLK 345
+ G+ G P + +FV TP++ + + Y L I+V G+ L +
Sbjct: 302 PPTETTKGFFQIGVPIGASYRFVT-TPMLKERGGASAAAATLYRALLLAITVDGKELNVP 360
Query: 346 ASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT 405
A F T +DS TIITR P Y ALR+AFR RM+ Y++ E+L DTCYDL+ +
Sbjct: 361 AEVFAA-GTVMDSRTIITRLPVTAYGALRAAFRNRMR-YRVAPPQEEL-DTCYDLTGVRY 417
Query: 406 VVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHY 465
+P+I + F G +E+D G L+ CL FA D + +LGNVQQ+ +V +
Sbjct: 418 PRLPRIALVFDGNAVVEMDRSGILL-----NGCLAFASNDDDSSPSILGNVQQQTIQVLH 472
Query: 466 DVAGRRLGFGPGNC 479
DV G R+GF C
Sbjct: 473 DVGGGRIGFRSAAC 486
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 152/482 (31%), Positives = 225/482 (46%), Gaps = 46/482 (9%)
Query: 22 AYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKL--NQGKS 79
A A+++D +V+ SSL P C R + PQ V L +GPCS L + S
Sbjct: 22 AAAHEHD--EYTLVAKSSLKPKATCTGYRVSPPQNITWVPLNA--PHGPCSPLPGSAAPS 77
Query: 80 RNTPSLEEILRRDQQRLHLKNSRRLQKAIP---DNFKKTKAF-------------TFPAK 123
L + LR D L ++ K +P ++F+ + +
Sbjct: 78 LAALLLHDQLRVDGIERRLSDNPHDSKLVPAGGEDFQTNGNLLQVNYGNSGQPMSSEAQQ 137
Query: 124 TGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSKSKT 181
+G+V A P +++LD+ S + W QC PC C Q D F+DPS+S +
Sbjct: 138 SGVVNASAAGGGSRSKLPGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPS 197
Query: 182 FSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGN 241
+ C+S TC L + + C++ +C Y + Y DGS +G + D +T+ N
Sbjct: 198 SAPFSCSSPTCTALGPY-----ANGCANNQCQYLVRYPDGSSTSGAYIADLLTLDAGN-- 250
Query: 242 GYFARYPFLLGCTDNNTGDQNG-ASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGST 297
A F GC+ G + A+GIM L GP S++S+T Y F YC+ + +
Sbjct: 251 ---AVSGFKFGCSHAEQGSFDARAAGIMALGGGPESLLSQTASRYGNAFSYCIPATASDS 307
Query: 298 GYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEID 357
G+ T G P + ++V TP+V + + FY + L I+VGG+RL + + F S +D
Sbjct: 308 GFFTLGVPRRASSRYV-VTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSV-LD 365
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG 417
S T ITR P Y ALRSAFR M Y+ + DTCYD + + +PKI++ F
Sbjct: 366 SRTAITRLPPTAYQALRSAFRSSMTMYRSAP-PKGYLDTCYDFTGVVNIRLPKISLVFDR 424
Query: 418 GVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
L LD G L + CL F D +LG+VQQ+ EV YDV G +GF G
Sbjct: 425 NAVLPLDPSGILFND-----CLAFTSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQG 479
Query: 478 NC 479
C
Sbjct: 480 AC 481
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 191 bits (485), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 128/359 (35%), Positives = 186/359 (51%), Gaps = 31/359 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ V +G+P + ++LDTGS I W QC+PC C QQ DP FDP S +F+ +PC S
Sbjct: 154 EYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQ 213
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C+ L C + +C Y ++Y DGS G + + +T GN
Sbjct: 214 QCQAL-------ETSGCRASKCLYQVSYGDGSFTVGEFVIETLTF----GNSGMINN-VA 261
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL----HSPYGSTGYITFGKPD 306
+GC +N G G++G++GL G +S+ S+ S F YCL S + + D
Sbjct: 262 VGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMKASSFSYCLVDRDSSSSSDLEFNSAAPSD 321
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTI 361
+VN P++ + + FY++ LTG+SVGG+ L + + F + +DSGT
Sbjct: 322 SVN------APLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTA 375
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
ITR Y+ LR AF R K G LFDTCYDLS+ V +P ++ F GG L
Sbjct: 376 ITRLQTQAYNTLRDAFVSRTPYLKKTNGFA-LFDTCYDLSSQSRVTIPTVSFEFAGGKSL 434
Query: 422 ELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+L + L+ V+SV C FA P+ + ++GNVQQ+G VHYD+A +GF P C
Sbjct: 435 QLPPKNYLIPVDSVGTFCFAFA--PTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 133/390 (34%), Positives = 196/390 (50%), Gaps = 35/390 (8%)
Query: 104 LQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC 163
+Q+ +P +++ FP G E+ + + +G P Q +++DTGS +TW Q +PC
Sbjct: 1 MQETLPGQ-TDNESYEFPESAGY---GEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPC 56
Query: 164 IHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGS 222
C +Q DP FDPSKS T++KI C+S+ C LL G CS + C Y Y DGS
Sbjct: 57 RACFEQADPIFDPSKSSTYNKIACSSSACADLL------GTQTCSAAANCIYAYGYGDGS 110
Query: 223 GETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTG--DQNGASGIMGLDRGPVSIISK 280
G+++ + +T + G G + NTG G GI+GL +GPVS+ S+
Sbjct: 111 VTRGYFSKETITATDTAGE------EVKFGASVYNTGTFGDTGGEGILGLGQGPVSMPSQ 164
Query: 281 TNI---SYFFYCL---HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTG 334
+ F YCL S T + FG V V+YTPIV + +Y+I + G
Sbjct: 165 LGSVLGNKFSYCLVDWLSAGSETSTMYFGDA-AVPSGEVQYTPIVPNADHPTYYYIAVQG 223
Query: 335 ISVGGERLPLKASYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKG 389
ISVGG L + S + S T IDSGT IT V++AL +A+ ++ +Y
Sbjct: 224 ISVGGSLLDIDQSVYEIDSGGSGGTIIDSGTTITYLQQEVFNALVAAYTSQV-RYPTTTS 282
Query: 390 IEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPN 449
L D C++ + V P +TIH L GV LEL T + +CL FA P
Sbjct: 283 ATGL-DLCFNTRGTGSPVFPAMTIH-LDGVHLELPTANTFISLETNIICLAFASALDFPI 340
Query: 450 SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+I GN+QQ+ +++ YD+ R+GF P +C
Sbjct: 341 AI-FGNIQQQNFDIVYDLDNMRIGFAPADC 369
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 140/438 (31%), Positives = 218/438 (49%), Gaps = 49/438 (11%)
Query: 68 YGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRL--------QKAIPDNFKKT---- 115
Y P + G RN L RD+ RL L S R+ + ++ + K T
Sbjct: 10 YRPANATVHGLVRNR------LHRDELRL-LSISSRISLGVAGIPKSSLTNPLKNTNPFL 62
Query: 116 -KAFTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPF 173
+ F P ++G+ + EY++ + +G P + V+++ DTGS + W QC PC C Q DP
Sbjct: 63 QQDFETPLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPL 122
Query: 174 FDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRM 233
F+PS S TF I C S+ C+ LL C +C Y ++Y DGS G ++T+ +
Sbjct: 123 FNPSFSSTFQSITCGSSLCQQLLIR-------GCRRNQCLYQVSYGDGSFTVGEFSTETL 175
Query: 234 TIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL 290
+ G A +GC NN G GA+G++GL +G +S S+ Y F YCL
Sbjct: 176 SF------GSNAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCL 229
Query: 291 HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT 350
+ STG + + ++T ++T P+ FY++ + GI VGG + + A +
Sbjct: 230 PTRE-STGSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLS 288
Query: 351 KLSTE------IDSGTIITRFPAPVYSALRSAFRKRM-KKYKMGKGIEDLFDTCYDLSAY 403
S+ +DSGT +TR Y+ +R AFR M KM G LFDTCYDLS
Sbjct: 289 LDSSTGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFS-LFDTCYDLSGR 347
Query: 404 KTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYE 462
++++P ++ F GG + L + +V V++ CL FA P+ N ++GN+QQ+ +
Sbjct: 348 SSIMLPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFA--PNSENFSIIGNIQQQSFR 405
Query: 463 VHYDVAGRRLGFGPGNCN 480
+ +D G R+G G CN
Sbjct: 406 MSFDSTGNRVGIGANQCN 423
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 144/415 (34%), Positives = 211/415 (50%), Gaps = 44/415 (10%)
Query: 90 RRDQQRLHLKNSR---RLQK-----AIPDNFKK---TKAFTFPAKTGIV-AADEYYIVVA 137
R ++ HL+ R R++K A N K T F+ +G+ + EY+ +
Sbjct: 75 RTPEELFHLRLQRDAIRVKKLSSLGATSRNLSKPGGTTGFSSSVISGLAQGSGEYFTRIG 134
Query: 138 IGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLE 197
+G P +YV ++LDTGS I W QC PC +C Q DP F+P KS +F+K+ C + C+ L
Sbjct: 135 VGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRL-- 192
Query: 198 WFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDN 256
C+ ++ C Y ++Y DGS TG + T+ +T + LGC +
Sbjct: 193 -----ESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE------QVALGCGHD 241
Query: 257 NTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL--HSPYGSTGYITFGKPDTVNKK 311
N G GA+G++GL RG +S S+ ++ F YCL S + FG ++ +
Sbjct: 242 NEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFG--NSAVSR 299
Query: 312 FVKYTPIVTTPEQSEFYHITLTGISVGGERLP-LKASYFTKLSTE-----IDSGTIITRF 365
++TP++T P FY++ L GISVGG + + AS+F T ID GT +TR
Sbjct: 300 TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRL 359
Query: 366 PAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDV 425
P Y ALR AFR K LFDTCYDLS TV VP + +HF G D+ L
Sbjct: 360 NKPAYIALRDAFRAGASSLKSAPEFS-LFDTCYDLSGKTTVKVPTVVLHFRGA-DVSLPA 417
Query: 426 RGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L+ V+ + C FA S + ++GN+QQ+G+ V YD+A R+GF P C
Sbjct: 418 SNYLIPVDGSGRFCFAFAGTTSGLS--IIGNIQQQGFRVVYDLASSRVGFSPRGC 470
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 133/435 (30%), Positives = 209/435 (48%), Gaps = 44/435 (10%)
Query: 72 SKLNQGKSRNTPS----LEEILRRDQQRLHLKNSR-----RLQKAIPDNFKKTKAFTFPA 122
S L +G + T S LEE LRR+ R+ R +L+K +++ T
Sbjct: 80 SLLFKGAANATASYERRLEEKLRREAARVRALEQRIERKLKLKKDPAGSYENVAGVTAEF 139
Query: 123 KTGIVA-----ADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPS 177
+ +V+ + EY+ + IG P + ++LDTGS + W QC+PC C Q DP F+PS
Sbjct: 140 GSEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPS 199
Query: 178 KSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE 237
S +FS + C+S C L + C C Y+++Y DGS G +AT+ +T
Sbjct: 200 SSVSFSTVGCDSAVCSQL-------DANDCHGGGCLYEVSYGDGSYTVGSYATETLTF-- 250
Query: 238 VNGNGYFARYPFLLGCTDNNTG----DQNGASGIMGLDRGPVSIISKTNISYFFYCLHSP 293
G + +GC +N G G P + ++T ++ + +
Sbjct: 251 ----GTTSIQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRD 306
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGG---ERLPLKASYFT 350
S+G + FG P++V + +TP+V P FY++++ ISVGG + +P +A
Sbjct: 307 SESSGTLEFG-PESVPIGSI-FTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRID 364
Query: 351 KLSTE----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV 406
+ + IDSGT +TR Y ALR AF + GI +FDTCYDLSA ++V
Sbjct: 365 ETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGIS-IFDTCYDLSALQSV 423
Query: 407 VVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHY 465
+P + HF G L + L+ ++S+ C FA P+D N ++GN+QQ+G V +
Sbjct: 424 SIPAVGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFA--PADSNLSIMGNIQQQGIRVSF 481
Query: 466 DVAGRRLGFGPGNCN 480
D A +GF C
Sbjct: 482 DSANSLVGFAIDQCQ 496
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 131/419 (31%), Positives = 199/419 (47%), Gaps = 38/419 (9%)
Query: 86 EEILRRDQQRLHLKNSRRLQKAIPDN-------FKKTKAFTFPAKTGIVAADEYYIVVAI 138
++L R QR L+ + + KA + + F P + + EY +A+
Sbjct: 85 AQLLARRLQRDVLRAAWIISKAAANGTPPPVAGLSSARGFVAPVVSRAPTSGEYIAKIAV 144
Query: 139 GKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEW 198
G P L LDT S +TW QC+PC C Q P FDP S ++ ++ N+ C+ L
Sbjct: 145 GTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYREMSFNAADCQALGR- 203
Query: 199 FPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL-LGCTDNN 257
+G C Y + Y DGS G + + +T R P + +GC +N
Sbjct: 204 ---SGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTFAG------GVRLPRISIGCGHDN 254
Query: 258 TGDQNG-ASGIMGLDRGPVSIISKTNIS-YFFYC----LHSPYGSTGYITFGKPDTVNKK 311
G A+GI+GL RG +S ++ + + F YC L P + +TFG
Sbjct: 255 KGLFGAPAAGILGLGRGLMSFPNQIDHNGTFSYCLVDFLSGPGSLSSTLTFGAGAVDTSP 314
Query: 312 FVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKAS-------YFTKLSTEIDSGTIITR 364
V +TP V FY++ LTGISVGG R+P Y + +DSGT +TR
Sbjct: 315 PVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDPYTGRGGVIVDSGTAVTR 374
Query: 365 FPAPVYSALRSAFRK---RMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
P Y+A R AFR + + +G G FDTCY + VP +++HF G V++
Sbjct: 375 LARPAYTAFRDAFRAVAVDLGQVSIG-GPSGFFDTCYTVGGRGMKKVPTVSMHFAGSVEV 433
Query: 422 ELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+L + L+ V+S+ VC FA D + ++GN+QQ+G+ + YD+ G R+GF P +C
Sbjct: 434 KLQPKNYLIPVDSMGTVCFAFAAT-GDHSVSIIGNIQQQGFRIVYDIGG-RVGFAPNSC 490
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 139/414 (33%), Positives = 204/414 (49%), Gaps = 46/414 (11%)
Query: 89 LRRDQQRLHLKNSRRLQKAIPDNFK----------KTKAFTFPAKTGIV-AADEYYIVVA 137
L RD R + + RLQ A+ D K K + + P +G + EY+ V
Sbjct: 108 LHRDTVRFN-SLTARLQLALEDISKSDLKPLETEIKPEDLSTPVTSGTSQGSGEYFTRVG 166
Query: 138 IGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLE 197
+G P + ++LDTGS I W QC+PC C QQ DP FDP+ S T++ + C S C L
Sbjct: 167 VGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQSQQCSSL-- 224
Query: 198 WFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNN 257
C S +C Y + Y DGS G +AT+ ++ GN + LGC +N
Sbjct: 225 -----EMSSCRSGQCLYQVNYGDGSYTFGDFATESVSF----GNSGSVKN-VALGCGHDN 274
Query: 258 TGDQNGASGIMGLDRGPVSIISKTNISYFFYCL-HSPYGSTGYITFGKP----DTVNKKF 312
G GA+G++GL GP+S+ ++ + F YCL + + + F D+V
Sbjct: 275 EGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCLVNRDSAGSSTLDFNSAQLGVDSVTAPL 334
Query: 313 VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITRFP 366
+K I T FY++ L+G+SVGG+ + + S F +L +D GT ITR
Sbjct: 335 MKNRKIDT------FYYVGLSGMSVGGQMVSIPESTF-RLDESGNGGIIVDCGTAITRLQ 387
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVR 426
Y+ LR AF + + K+ + LFDTCYDLS +V VP ++ HF G L
Sbjct: 388 TQAYNPLRDAFVRMTQNLKLTSAVA-LFDTCYDLSGQASVRVPTVSFHFADGKSWNLPAA 446
Query: 427 GTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L+ V+S C FA P+ + ++GNVQQ+G V +D+A R+GF P C
Sbjct: 447 NYLIPVDSAGTYCFAFA--PTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 142/413 (34%), Positives = 205/413 (49%), Gaps = 44/413 (10%)
Query: 89 LRRDQQRLHLKNSRRLQKAIPDNFKKT-----------KAFTFPAKTGIV-AADEYYIVV 136
L RD R+ N++ LQ A+ K + F+ P +G + EY++ V
Sbjct: 106 LARDSARVKAINTK-LQLAVSGTDKSDLVPMDTEILHPQDFSTPVTSGTSQGSGEYFLRV 164
Query: 137 AIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILL 196
IG+P + +++DTGS + W QCKPC C QQ DP FDP+ S +FS++ C + C+ L
Sbjct: 165 GIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDPIFDPASSSSFSRLGCQTPQCRNLD 224
Query: 197 EWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDN 256
+ C + C Y ++Y DGS G +AT+ ++ A +GC +
Sbjct: 225 VF-------ACRNDSCLYQVSYGDGSYTVGDFATETVSFGNSGSVDKVA-----IGCGHD 272
Query: 257 NTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL---HSPYGSTGYITFGKP-DTVNKKF 312
N G GA+G++GL GP+S+ S+ S F YCL S ST KP D+V
Sbjct: 273 NEGLFVGAAGLIGLGGGPLSLTSQIKASSFSYCLVNRDSVDSSTLEFNSAKPSDSVT--- 329
Query: 313 VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFPA 367
PI + FY++ +TG+SVGGE+L + S F K +D GT +TR
Sbjct: 330 ---APIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAVTRLQT 386
Query: 368 PVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRG 427
Y+ALR F K K G LFDTCY+LS+ +V VP + F GG L L
Sbjct: 387 QAYNALRDTFVKLTKDLPSTSGFA-LFDTCYNLSSRTSVRVPTVAFLFDGGKSLPLPPSN 445
Query: 428 TLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L+ V+S CL FA P+ + ++GNVQQ+G V YD+A ++ F C
Sbjct: 446 YLIPVDSAGTFCLAFA--PTTASLSIIGNVQQQGTRVTYDLANSQVSFSSRKC 496
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 140/438 (31%), Positives = 218/438 (49%), Gaps = 49/438 (11%)
Query: 68 YGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRL--------QKAIPDNFKKT---- 115
Y P + G RN L RD+ RL L S R+ + ++ + K T
Sbjct: 10 YRPANATVHGLVRNR------LHRDELRL-LSISSRISLGVAGIPKSSLTNPLKNTNPFL 62
Query: 116 -KAFTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPF 173
+ F P ++G+ + EY++ + +G P + V+++ DTGS + W QC PC C Q DP
Sbjct: 63 QQDFETPLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPL 122
Query: 174 FDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRM 233
F+PS S TF I C S+ C+ LL C +C Y ++Y DGS G ++T+ +
Sbjct: 123 FNPSFSSTFQSITCGSSLCQQLLIR-------GCRRNQCLYQVSYGDGSFTVGEFSTETL 175
Query: 234 TIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL 290
+ G A +GC NN G GA+G++GL +G +S S+ Y F YCL
Sbjct: 176 SF------GSNAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCL 229
Query: 291 HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT 350
+ STG + + ++T ++T P+ FY++ + GI VGG + + A +
Sbjct: 230 PTRE-STGSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLS 288
Query: 351 KLSTE------IDSGTIITRFPAPVYSALRSAFRKRM-KKYKMGKGIEDLFDTCYDLSAY 403
S+ +DSGT +TR Y+ +R AFR M KM G LFDTCYDLS
Sbjct: 289 LDSSTGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFS-LFDTCYDLSGR 347
Query: 404 KTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYE 462
++++P ++ F GG + L + +V V++ CL FA P+ N ++GN+QQ+ +
Sbjct: 348 SSIMLPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFA--PNSENFSIIGNIQQQSFR 405
Query: 463 VHYDVAGRRLGFGPGNCN 480
+ +D G R+G G CN
Sbjct: 406 MSFDSTGNRVGIGANQCN 423
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 120/375 (32%), Positives = 181/375 (48%), Gaps = 38/375 (10%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ V+ +G P +++DTGS + W QC PC HC +Q P +DP S T +IPC S
Sbjct: 87 EYFAVINVGDPPTRALVVIDTGSDLIWLQCVPCRHCYRQVTPLYDPRSSSTHRRIPCASP 146
Query: 191 TCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
C+ +L + C ++ C Y + Y DGS +G ATDR+ + +
Sbjct: 147 RCRDVLRY------PGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDT-----HVHN 195
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYC----LHSPYGSTGYIT 301
LGC +N G A+G++G+ RG +S ++ +Y F YC L + Y+
Sbjct: 196 VTLGCGHDNVGLLESAAGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYLV 255
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------ 355
FG+ T +TP+ T P + Y++ + G SVGGER+ ++ L+
Sbjct: 256 FGR--TPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGRGGI 313
Query: 356 -IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE---DLFDTCYDL----SAYKTVV 407
+DSGT I+RF Y+A+R AF + + +FD CYDL + V
Sbjct: 314 VVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVR 373
Query: 408 VPKITIHFLGGVDLELDVRGTL--VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHY 465
VP I +HF GG D+ L L V R+ L +D +LGNVQQ+G+ + +
Sbjct: 374 VPSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGLVF 433
Query: 466 DVAGRRLGFGPGNCN 480
DV R+GF P C+
Sbjct: 434 DVERGRIGFTPNGCS 448
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 130/403 (32%), Positives = 193/403 (47%), Gaps = 23/403 (5%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQ 143
+ D R+ SR K + A + P +G V Y + +G P
Sbjct: 64 FSAFITHDAARIAGLASRLATK----DKDWVAASSVPLASGASVGVGNYITRLGLGTPTT 119
Query: 144 YVSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
+++D+GS +TW QC PC + C Q P +DP S T++ +PC++ C L+ N
Sbjct: 120 TYVMVVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYAAVPCSAPQCA-ELQAATLN 178
Query: 203 GQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQN 262
S C Y +Y DGS G+ + D +++ +G F F GC +N G
Sbjct: 179 PSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSS---SGSFPG--FYYGCGQDNVGLFG 233
Query: 263 GASGIMGLDRGPVSIISKTNISY---FFYCL-HSPYGSTGYITFGK-PDTVNKKFVKYTP 317
A+G++GL R +S++S+ S F YCL S S GY++FG D N YT
Sbjct: 234 RAAGLIGLARNKLSLLSQLAPSVGNSFAYCLPTSAAASAGYLSFGSNSDNKNPGKYSYTS 293
Query: 318 IVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAF 377
+V++ + Y ++L G+SV G L + +S + L T IDSGT+ITR P PVY+AL A
Sbjct: 294 MVSSSLDASLYFVSLAGMSVAGSPLAVPSSEYGSLPTIIDSGTVITRLPTPVYTALSKAV 353
Query: 378 RKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV 437
+ + TC+ K + VP + + F GG L L LV +
Sbjct: 354 GAALAAPSAPA--YSILQTCFKGQVAK-LPVPAVNMAFAGGATLRLTPGNVLVDVNETTT 410
Query: 438 CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
CL FA P+D +I +GN QQ+ + V YDV G R+GF G C+
Sbjct: 411 CLAFA--PTDSTAI-IGNTQQQTFSVVYDVKGSRIGFAAGGCS 450
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 114/340 (33%), Positives = 177/340 (52%), Gaps = 23/340 (6%)
Query: 147 LLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDK 206
LL+DTGS ITW QC PC C +Q+D F P+ S T+ +PCNST C+ L +
Sbjct: 3 LLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSF-----SHS 57
Query: 207 CSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTGDQNGAS 265
C + C Y ++Y D S G +A + +T++ + + P F GC N G NGA+
Sbjct: 58 CLNSSCNYMVSYGDKSTTRGDFALETLTLR--SDDTILVSVPNFAFGCGHANKGLFNGAA 115
Query: 266 GIMGLDRGPVSIISKTNISY---FFYCLHSPYGS--TGYITFGKPDTVNKKFVKYTPIVT 320
G+MGL + + ++T++++ F YCL S + +G + FG+ ++ V++TP+V
Sbjct: 116 GLMGLGKSSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYD-VRFTPLVD 174
Query: 321 TPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKR 380
+ Y +++TGI+VG E LP+ A+ +DSGT+I+RF Y LR AF +
Sbjct: 175 SSSGPSQYFVSMTGINVGDELLPISATVM------VDSGTVISRFEQSAYERLRDAFTQI 228
Query: 381 MKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLG 440
+ + + FDTC+ +S + +P IT+HF D EL + ++ V +
Sbjct: 229 LPGLQTAVSVAP-FDTCFRVSTVDDINIPLITLHFRD--DAELRLSPVHILYPVDDGVMC 285
Query: 441 FALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
FA PS +LGN QQ+ YD+ RLG CN
Sbjct: 286 FAFAPSSSGRSVLGNFQQQNLRFVYDIPKSRLGISAFECN 325
>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 439
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 153/457 (33%), Positives = 218/457 (47%), Gaps = 77/457 (16%)
Query: 37 VSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRL 96
VSSL+P C + QG L + +YGPCS G S+ PS +EI RD+ R+
Sbjct: 46 VSSLLPKNKCLASARGGSQG-----LPITQKYGPCS--GSGHSQ-PPSPQEIFGRDESRV 97
Query: 97 HLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGIT 156
NS+ Q A P+N K P + + VA G P Q +L+LDTGS IT
Sbjct: 98 SFINSKFNQYA-PENLKDHT----PNNKLFDEDGNFLVDVAFGTPPQNFTLILDTGSSIT 152
Query: 157 WTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDI 216
WTQCK C + E Y++
Sbjct: 153 WTQCKAC---------------------------------------------TVENNYNM 167
Query: 217 AYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-QNGASGIMGLDRGPV 275
Y D S G + D MT++ + F ++ F G NN GD +G G++GL +G +
Sbjct: 168 TYGDDSTSVGNYGCDTMTLEPSD---VFQKFQFGRG--RNNKGDFGSGVDGMLGLGQGQL 222
Query: 276 SIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTP---EQSEFYH 329
S +S+T + F YCL S G + FG+ T +K+T +V P ++S +Y
Sbjct: 223 STVSQTASKFNKVFSYCLPEE-DSIGSLLFGEKATSQSSSLKFTSLVNGPGTLQESGYYF 281
Query: 330 ITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKG 389
+ L+ ISVG ERL + +S F T IDS T+ITR P YSAL++AF+K M KY + G
Sbjct: 282 VNLSDISVGNERLNIPSSVFASPGTIIDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNG 341
Query: 390 IE---DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFA---L 443
D+ DTCY+LS K V++P+I +HF GG D+ L+ + ++CL FA
Sbjct: 342 RRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTNIVWGSDESRLCLAFAGNSK 401
Query: 444 LPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+P ++GN QQ V YD+ G R+GF C+
Sbjct: 402 STMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNGCS 438
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 156/482 (32%), Positives = 222/482 (46%), Gaps = 52/482 (10%)
Query: 27 NDLSHSYIVSVSSLIPPTVCNRTRTALP-QGPGKVSLEVLGRYGPCSKLNQGKSRNTP-- 83
++ ++ Y V+ SS P VC R + P G G V L +GPCS S + P
Sbjct: 33 DEANYYYFVAASS--PNPVCQGHRVSPPLSGGGWVPLSR--PHGPCSS-----SMDAPPS 83
Query: 84 SLEEILRRDQQRLHLKNSRRLQKAIP-----------DNFKKTKAFTFPAKTGIVAADEY 132
S+ E LR DQ R R+L+ +P + K T TG+ A E
Sbjct: 84 SVAETLRWDQHRAGYIQ-RKLEDQVPITRSVITQVSHQGVVQPKVGTQGQGTGVQPAGEP 142
Query: 133 YIVVAIGKPKQYV-SLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPCNS 189
G ++++DT S + W QC PC HC Q D +DPSKS + + PC+S
Sbjct: 143 VGDAPTGGSGGVAQTMVIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSS 202
Query: 190 TTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C+ L + NG + +C Y + Y DGS G + +D +T+ + + F
Sbjct: 203 PACRNLGPYA--NGCTP-AGDQCQYRVQYPDGSASAGTYISDVLTLNPAKPASAISEFRF 259
Query: 250 LLGCTDN--NTGD-QNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFG 303
GC+ G N SGIM L RG S+ ++T +Y F YCL +G+ G
Sbjct: 260 --GCSHALLQPGSFSNKTSGIMALGRGAQSLPTQTKATYGDVFSYCLPPTPVHSGFFILG 317
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIIT 363
P ++ TP++ + Y + L I V G+RLP+ + F + +DS TI+T
Sbjct: 318 VPRVAASRYA-VTPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFAAGAV-MDSRTIVT 375
Query: 364 RFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT-----VVVPKITIHFLG- 417
R P Y ALR+AF M+ Y+ E L DTCYD S V +PKIT+ F G
Sbjct: 376 RLPPTAYMALRAAFVAEMRAYRAAAPKEHL-DTCYDFSGAAPGGGGGVKLPKITLVFDGP 434
Query: 418 GVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
+ELD G L+ CL FA D + ++GNVQQ+ EV Y+V G +GF G
Sbjct: 435 NGAVELDPSGVLL-----DGCLAFAPNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRG 489
Query: 478 NC 479
C
Sbjct: 490 AC 491
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 129/388 (33%), Positives = 193/388 (49%), Gaps = 30/388 (7%)
Query: 118 FTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPS 177
FT P T A EYY+ + +G P V L++DTGS ++W QC PC C P F+P
Sbjct: 125 FTSPVVTLGQAGLEYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPR 184
Query: 178 KSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE 237
S +F K+PC S+TC + + P S + C + I Y DGS +G A + +
Sbjct: 185 HSSSFFKLPCASSTCTNVYQGVKPFCSP--SGRTCLFSIQYGDGSLSSGLLAMETIAGNT 242
Query: 238 VN-GNGYFARYP-FLLGCTD-NNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLH 291
N G+G + LGC D + G GASG++G+DR P+S S+ + Y F +C
Sbjct: 243 PNFGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFP 302
Query: 292 ---SPYGSTGYITFGKPDTVNKKFVKYTPIVTTP----EQSEFYHITLTGISVGGERLPL 344
+ S+G + FG+ D ++ +++YTP+V P ++Y++ L GISV RLPL
Sbjct: 303 DKIAHLNSSGLVFFGESDIIS-PYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPL 361
Query: 345 KASYF--TKLS----TEIDSGTIITRFPAPVYSALRSAFRKR---MKKYKMGKGIEDLFD 395
F K++ T IDSGT T P + A+R F R + K G ++
Sbjct: 362 SHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYN 421
Query: 396 TCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSI 451
+A ++ ++P IT+HF GG+D+ L L+ E +CL F L+ D
Sbjct: 422 ITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAF-LMSGDIPFN 480
Query: 452 LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++GN QQ+ V YD+ RLG P C
Sbjct: 481 IIGNYQQQNLWVEYDLEKLRLGIAPAQC 508
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 129/388 (33%), Positives = 193/388 (49%), Gaps = 30/388 (7%)
Query: 118 FTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPS 177
FT P T A EYY+ + +G P V L++DTGS ++W QC PC C P F+P
Sbjct: 124 FTSPVVTLGQAGLEYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPR 183
Query: 178 KSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE 237
S +F K+PC S+TC + + P S + C + I Y DGS +G A + +
Sbjct: 184 HSSSFFKLPCASSTCTNVYQGVKPFCSP--SGRTCLFSIQYGDGSLSSGLLAMETIAGNT 241
Query: 238 VN-GNGYFARYP-FLLGCTD-NNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLH 291
N G+G + LGC D + G GASG++G+DR P+S S+ + Y F +C
Sbjct: 242 PNFGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFP 301
Query: 292 ---SPYGSTGYITFGKPDTVNKKFVKYTPIVTTP----EQSEFYHITLTGISVGGERLPL 344
+ S+G + FG+ D ++ +++YTP+V P ++Y++ L GISV RLPL
Sbjct: 302 DKIAHLNSSGLVFFGESDIIS-PYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPL 360
Query: 345 KASYF--TKLS----TEIDSGTIITRFPAPVYSALRSAFRKR---MKKYKMGKGIEDLFD 395
F K++ T IDSGT T P + A+R F R + K G ++
Sbjct: 361 SHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYN 420
Query: 396 TCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSI 451
+A ++ ++P IT+HF GG+D+ L L+ E +CL F + P +I
Sbjct: 421 ITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDIPFNI 480
Query: 452 LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+GN QQ+ V YD+ RLG P C
Sbjct: 481 -IGNYQQQNLWVEYDLEKLRLGIAPAQC 507
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 136/411 (33%), Positives = 204/411 (49%), Gaps = 33/411 (8%)
Query: 80 RNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV-AADEYYIVVAI 138
R++ ++ I R +H ++ L+ D+ + + P +G + EY+ V I
Sbjct: 91 RDSARVKSINTRLDLAIHGLSTSDLKPLDTDSQFRAEDLQGPIISGTSQGSGEYFSRVGI 150
Query: 139 GKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEW 198
GKP V ++LDTGS + W QC PC C Q DP F+P+ S ++S + C++ C+ L
Sbjct: 151 GKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASSTSYSPLSCDTKQCQSL--- 207
Query: 199 FPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNT 258
+C + C Y+++Y DGS G + T+ +T+ + + +GC NN
Sbjct: 208 ----DVSECRNNTCLYEVSYGDGSYTVGDFVTETITLGSASVDN------VAIGCGHNNE 257
Query: 259 GDQNGASGIMGLDRGPVSIISKTNISYFFYCL-HSPYGSTGYITFGKPDTVNKKFVKY-- 315
G GA+G++GL G +S S+ N S F YCL S + F N + +
Sbjct: 258 GLFIGAAGLLGLGGGKLSFPSQINASSFSYCLVDRDSDSASTLEF------NSALLPHAI 311
Query: 316 -TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRFPAPV 369
P++ E FY++ +TG+SVGGE L + S F + IDSGT +TR
Sbjct: 312 TAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVTRLQTAA 371
Query: 370 YSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTL 429
Y+ALR AF K K + + LFDTCYDLS +V VP +T H GG L L L
Sbjct: 372 YNALRDAFVKGTKDLPVTSEVA-LFDTCYDLSRKTSVEVPTVTFHLAGGKVLPLPATNYL 430
Query: 430 V-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ V+S C FA P+ ++GNVQQ+G V +D+A +GF P C
Sbjct: 431 IPVDSDGTFCFAFA--PTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 139/369 (37%), Positives = 189/369 (51%), Gaps = 39/369 (10%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY++ + +G P V ++LDTGS + W QC PC C Q D FDP KSKTF+ +PC S
Sbjct: 134 EYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSR 193
Query: 191 TCKILLEWFPPNGQDKC---SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY 247
C+ L + +C SK C Y ++Y DGS G ++T+ +T AR
Sbjct: 194 LCRRL------DDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHG-------ARV 240
Query: 248 PFL-LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL------HSPYGST 297
+ LGC +N G GA+G++GL RG +S S+T Y F YCL S
Sbjct: 241 DHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPP 300
Query: 298 GYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-- 355
I FG + K +TP++T P+ FY++ L GISVGG R+P + KL
Sbjct: 301 STIVFG--NAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGN 358
Query: 356 ----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
IDSGT +TR P Y ALR AFR K K LFDTC+DLS TV VP +
Sbjct: 359 GGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYS-LFDTCFDLSGMTTVKVPTV 417
Query: 412 TIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR 470
HF GG ++ L L+ V + + C FA + ++GN+QQ+G+ V YD+ G
Sbjct: 418 VFHF-GGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLS--IIGNIQQQGFRVAYDLVGS 474
Query: 471 RLGFGPGNC 479
R+GF C
Sbjct: 475 RVGFLSRAC 483
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 135/410 (32%), Positives = 202/410 (49%), Gaps = 34/410 (8%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIP----DNFKKTKAFTFPAKTGIVAAD-EYYIVVAIG 139
+ ++RD R+ RRL P D+ K F +G+ A EY++ + +G
Sbjct: 92 FNDRMKRDAIRVATL-VRRLSHGAPAAVKDSRYKVANFATDVISGMEAGSGEYFVRIGVG 150
Query: 140 KPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWF 199
P + +++D+GS I W QCKPC C QQ DP FDP+ S +F+ + C S C L
Sbjct: 151 SPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADSSSFAGVSCGSDVCDRLEN-- 208
Query: 200 PPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTG 259
C++ C Y+++Y DGS G A + +T+ +V +GC N G
Sbjct: 209 -----TGCNAGRCRYEVSYGDGSYTKGTLALETLTVGQV------MIRDVAIGCGHTNQG 257
Query: 260 DQNGASGIMGLDRGPVSIISK---TNISYFFYCLHS-PYGSTGYITFGKPDTVNKKFVKY 315
GA+G++GL G +S I + F YCL S GSTG + FG+ +
Sbjct: 258 MFIGAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGALEFGRGAL--PVGATW 315
Query: 316 TPIVTTPEQSEFYHITLTGISVGGERLPLKASYF--TKLSTE---IDSGTIITRFPAPVY 370
++ P FY+I L GI VGG R+ + F T+ T +D+GT +TRFP Y
Sbjct: 316 ISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTGTAVTRFPTAAY 375
Query: 371 SALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV 430
A R +F + G+ +FDTCYDL+ +++V VP ++ +F G L L R L+
Sbjct: 376 VAFRDSFTAQTSNLPRAPGVS-IFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPARNFLI 434
Query: 431 -VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
V+ CL FA PS ++GN+QQ G ++ +D A +GFGP C
Sbjct: 435 PVDGGGTFCLAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC 482
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 122/361 (33%), Positives = 180/361 (49%), Gaps = 33/361 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ + +G P + + ++LDTGS + W QC PC C QQ DP FDP+ S TF + C+
Sbjct: 163 EYFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSDPIFDPTSSSTFKSLTCSDP 222
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C L C S +C Y ++Y DGS G +ATD +T E A
Sbjct: 223 KCASL-------DVSACRSNKCLYQVSYGDGSFTVGNYATDTVTFGESGKVNDVA----- 270
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL---HSPYGST---GYITFGK 304
LGC +N G GA+G++GL G +S+ ++ F YCL S S+ + G
Sbjct: 271 LGCGHDNEGLFTGAAGLLGLGGGALSMTNQIKAKSFSYCLVDRDSAKSSSLDFNSVQIGA 330
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSG 359
D P++ + FY++ L+G SVGG+++ + +S F ++ +D G
Sbjct: 331 GDAT-------APLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAGGVILDCG 383
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
T +TR Y++LR AF K +K G LFDTCYD S+ TV VP +T HF GG
Sbjct: 384 TAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSLSTVKVPTVTFHFTGGK 443
Query: 420 DLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGN 478
L L + L+ ++ C FA P+ + ++GNVQQ+G + YD+A +G
Sbjct: 444 SLNLPAKNYLIPIDDAGTFCFAFA--PTSSSLSIIGNVQQQGTRITYDLANNLIGLSANK 501
Query: 479 C 479
C
Sbjct: 502 C 502
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 187 bits (476), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 136/423 (32%), Positives = 203/423 (47%), Gaps = 30/423 (7%)
Query: 69 GPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKT------KAFTFPA 122
GPCS L S + P +L D R+ +R +K+ P + T + P
Sbjct: 52 GPCSPL----SADIP-FSAVLTHDAARIASFAARLAKKSSPSSASATTQAAGSSLASVPL 106
Query: 123 KTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSK 180
G V Y + +G P + +++DTGS +TW QC PC + C +Q P FDP S
Sbjct: 107 TPGTSVGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSS 166
Query: 181 TFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNG 240
+++ + C+S C L N S C Y +Y D S G+ + D ++
Sbjct: 167 SYAAVSCSSPQCDGL-STATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSF----- 220
Query: 241 NGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKT--NISYFF-YCLHSPYGST 297
G + F GC +N G ++G+MGL R +S++ + + Y F YCL S S+
Sbjct: 221 -GANSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPST-SSS 278
Query: 298 GYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEID 357
GY++ G + N YTP+V+ Y I+L+G++V G+ L + +S +T L T ID
Sbjct: 279 GYLSIG---SYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPTIID 335
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG 417
SGT+ITR P VY+AL A MK + DTC++ A K VP +++ F G
Sbjct: 336 SGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKLRAVPAVSMAFSG 395
Query: 418 GVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
G L+L LV CL FA P+ +I +GN QQ+ + V YDV R+GF
Sbjct: 396 GATLKLSAGNLLVDVDGATTCLAFA--PARSAAI-IGNTQQQTFSVVYDVKSNRIGFAAA 452
Query: 478 NCN 480
C+
Sbjct: 453 GCS 455
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 187 bits (476), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 129/359 (35%), Positives = 189/359 (52%), Gaps = 32/359 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ V IGKP + V ++LDTGS + W QC PC C Q +P F+PS S ++ + C++
Sbjct: 147 EYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTP 206
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C L +C + C Y+++Y DGS G +AT+ +TI G
Sbjct: 207 QCNAL-------EVSECRNATCLYEVSYGDGSYTVGDFATETLTI------GSTLVQNVA 253
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL-HSPYGSTGYITFG---KPD 306
+GC +N G GA+G++GL G +++ S+ N + F YCL S + FG PD
Sbjct: 254 VGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSASTVDFGTSLSPD 313
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTI 361
V P++ + FY++ LTGISVGGE L + S F + IDSGT
Sbjct: 314 AV------VAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTA 367
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
+TR +Y++LR +F K + G+ +FDTCY+LSA TV VP + HF GG L
Sbjct: 368 VTRLQTEIYNSLRDSFVKGTLDLEKAAGVA-MFDTCYNLSAKTTVEVPTVAFHFPGGKML 426
Query: 422 ELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L + ++ V+SV CL FA P+ + ++GNVQQ+G V +D+A +GF C
Sbjct: 427 ALPAKNYMIPVDSVGTFCLAFA--PTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 187 bits (475), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 150/419 (35%), Positives = 207/419 (49%), Gaps = 50/419 (11%)
Query: 89 LRRDQQRLH-------LKNSRRLQKAIPDNFKKTKAFTFPAKTGIV-AADEYYIVVAIGK 140
L+RD R+ + R K P + F+ +G+ + EY++ + +G
Sbjct: 90 LQRDSLRVKSITSLAAVSTGRNATKRTP---RSAGGFSGAVISGLSQGSGEYFMRLGVGT 146
Query: 141 PKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFP 200
P V ++LDTGS + W QC PC C Q D FDP KSKTF+ +PC S C+ L
Sbjct: 147 PATNVYMVLDTGSDVVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCGSRLCRRL----- 201
Query: 201 PNGQDKC---SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL-LGCTDN 256
+ +C SK C Y ++Y DGS G ++T+ +T AR + LGC +
Sbjct: 202 -DDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHG-------ARVDHVPLGCGHD 253
Query: 257 NTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL------HSPYGSTGYITFGKPDT 307
N G GA+G++GL RG +S S+T Y F YCL S I FG D
Sbjct: 254 NEGLFVGAAGLLGLGRGGLSFPSQTKSRYNGKFSYCLVDRTSSGSSSKPPSTIVFGN-DA 312
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTI 361
V K V +TP++T P+ FY++ L GISVGG R+P + KL IDSGT
Sbjct: 313 VPKTSV-FTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTS 371
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
+TR Y ALR AFR K K LFDTC+DLS TV VP + HF GG ++
Sbjct: 372 VTRLTQSAYVALRDAFRLGATKLKRAPSYS-LFDTCFDLSGMTTVKVPTVVFHF-GGGEV 429
Query: 422 ELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L L+ V + + C FA + ++GN+QQ+G+ V YD+ G R+GF C
Sbjct: 430 SLPASNYLIPVNTEGRFCFAFAGTMGSLS--IIGNIQQQGFRVAYDLVGSRVGFLSRAC 486
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 187 bits (475), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 135/379 (35%), Positives = 197/379 (51%), Gaps = 33/379 (8%)
Query: 115 TKAFTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPF 173
T F+ +G+ + EY+ + +G P +YV ++LDTGS I W QC PC +C Q DP
Sbjct: 24 TTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPV 83
Query: 174 FDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDR 232
F+P KS +F+K+ C + C+ L C+ ++ C Y ++Y DGS TG + T+
Sbjct: 84 FNPVKSGSFAKVLCRTPLCRRLES-------PGCNQRQTCLYQVSYGDGSYTTGEFVTET 136
Query: 233 MTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYC 289
+T + LGC +N G GA+G++GL RG +S S+ ++ F YC
Sbjct: 137 LTFRRTKVE------QVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYC 190
Query: 290 L--HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLP-LKA 346
L S + FG ++ + ++TP++T P FY++ L GISVGG + + A
Sbjct: 191 LVDRSASSKPSSVVFG--NSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITA 248
Query: 347 SYFTKLSTE-----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS 401
S+F T ID GT +TR P Y ALR AFR K LFDTCYDLS
Sbjct: 249 SHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFS-LFDTCYDLS 307
Query: 402 AYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRG 460
TV VP + +HF G D+ L L+ V+ + C FA S + ++GN+QQ+G
Sbjct: 308 GKTTVKVPTVVLHFR-GADVSLPASNYLIPVDGSGRFCFAFAGTTSGLS--IIGNIQQQG 364
Query: 461 YEVHYDVAGRRLGFGPGNC 479
+ V YD+A R+GF P C
Sbjct: 365 FRVVYDLASSRVGFSPRGC 383
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 506
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 151/493 (30%), Positives = 230/493 (46%), Gaps = 50/493 (10%)
Query: 23 YANDNDLS-HSYIVSVSSLI---PPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGK 78
+A + +LS H +V+ SSL VC R + P G + + PCS G+
Sbjct: 28 HAAEAELSNHHVVVAASSLELANASPVCQGHRVS-PSSSGGSWAPLSHLHSPCSPAAGGR 86
Query: 79 SRNTP--SLEEILRRDQQRL-HLK-----NSRRLQKAIPDNFKKTKAFTFPA------KT 124
P +L L+ D+ R H++ N+ + A + + T+ + PA K+
Sbjct: 87 DSAPPPKTLSATLQWDEHRAGHIQRKLSGNAAPMDDAGEETPQSTQVTSSPAANVNVGKS 146
Query: 125 GIVAADEYYIVVAIGKPKQY-------VSLLLDTGSGITWTQCKPCIH--CSQQRDPFFD 175
+A E IV A P S+++DT S + W QC PC C Q D +D
Sbjct: 147 STDSAFEQGIVPAATGPGGQKKLPGVAQSMVVDTASDVPWVQCAPCPQPQCYAQSDVLYD 206
Query: 176 PSKSKTFSKIPCNSTTCKILLEWFPPNG-QDKCSSKECPYDIAYVDGSGETGFWATDRMT 234
P+KS + PC+S C+ L + NG ++ C Y + Y DGSG +G + +D +T
Sbjct: 207 PTKSILSAPFPCSSPQCRSLGRYA--NGCTGAGNTGTCQYRVLYPDGSGTSGTYVSDLLT 264
Query: 235 IQEVNGNGYFARYPFLLGCTDN--NTGD-QNGASGIMGLDRGPVSIISKTNISY-----F 286
+ N + A F GC+ G N +G M L RG S+ S+T ++ F
Sbjct: 265 L---NADPKGAVSKFQFGCSHALLRPGSFNNKTAGFMALGRGAQSLSSQTKGTFSKGNVF 321
Query: 287 FYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKA 346
YCL G+++ G P ++ TP++ + Y + L GI V G+RLP+
Sbjct: 322 SYCLPPTGSHKGFLSLGVPQHAASRYA-VTPMLKSKMAPMIYMVRLIGIDVAGQRLPVPP 380
Query: 347 SYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV 406
+ F + +DS TIITR P Y ALR+AFR +M+ Y+ + DTCYD + V
Sbjct: 381 AVFAA-NAAMDSRTIITRLPPTAYMALRAAFRAQMRAYR-AVAPKGQLDTCYDFTGVPMV 438
Query: 407 VVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYD 466
+PK+T+ F +ELD G ++ CL FA +D ++GNVQQ+ EV Y+
Sbjct: 439 RLPKVTLVFDRNAAVELDPSGVML-----DSCLAFAPNANDFMPGIIGNVQQQTLEVLYN 493
Query: 467 VAGRRLGFGPGNC 479
V G +GF C
Sbjct: 494 VDGASVGFRRAAC 506
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 137/363 (37%), Positives = 192/363 (52%), Gaps = 34/363 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ + +G P +YV ++LDTGS + W QC PC C Q DP FDP KS +FS I C S
Sbjct: 146 EYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSP 205
Query: 191 TCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP- 248
C L P C+S++ C Y +AY DGS G ++T+ +T + R P
Sbjct: 206 LC---LRLDSPG----CNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGT-------RVPK 251
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL--HSPYGSTGYITFG 303
LGC +N G GA+G++GL RG +S ++T + + F YCL S + FG
Sbjct: 252 VALGCGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLRFGRKFSYCLVDRSASSKPSSVVFG 311
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------ID 357
+ V++ V +TP++T P+ FY++ LTGISVGG R+ + KL T ID
Sbjct: 312 Q-SAVSRTAV-FTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTAGNGGVIID 369
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG 417
SGT +TR Y +LR AFR K LFDTC+DLS V VP + +HF
Sbjct: 370 SGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYS-LFDTCFDLSGKTEVKVPTVVMHFR- 427
Query: 418 GVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
G D+ L L+ V++ C FA S + ++GN+QQ+G+ V +DVA R+GF
Sbjct: 428 GADVSLPATNYLIPVDTNGVFCFAFAGTMSGLS--IIGNIQQQGFRVVFDVAASRIGFAA 485
Query: 477 GNC 479
C
Sbjct: 486 RGC 488
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 141/453 (31%), Positives = 221/453 (48%), Gaps = 47/453 (10%)
Query: 33 YIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRD 92
+ + ++SL+P + C P G G L + YGPCS+L Q KS PS ++I +D
Sbjct: 40 HTLDINSLLPKSNCTA-----PVGGGSQGLPITYSYGPCSQLGQKKS---PSRQQIFLQD 91
Query: 93 QQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIV-VAIGKPKQYVSLLLDT 151
+ R+ N++ + + +++K P + D ++V V G P+Q +L++DT
Sbjct: 92 RSRVRSINAKIFGQY---STQESKDGWSPESMDTLNEDGLFLVNVGFGTPQQKFNLIIDT 148
Query: 152 GSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE 211
GS TW QC C + F+PS S ++S C +T +
Sbjct: 149 GSDTTWIQCNSCSLGNCHNKKTFNPSLSSSYSNRSCIPST-------------------D 189
Query: 212 CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLD 271
Y + Y D S G + D +T++ F ++ F GC D+ G+ ASG++GL
Sbjct: 190 TNYTMKYEDNSYSKGVFVCDEVTLKP----DVFPKFQF--GCGDSGGGEFGTASGVLGLA 243
Query: 272 RGP-VSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEF 327
+G S+IS+T + F YC + G + FG+ +K+T ++ P +
Sbjct: 244 KGEQYSLISQTASKFKKKFSYCFPPKEHTLGSLLFGEKAISASPSLKFTQLLNPPSGLGY 303
Query: 328 YHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYK-- 385
+ + L GISV +RL + +S F T IDSGT+ITR P Y ALR+AF++ M
Sbjct: 304 F-VELIGISVAKKRLNVSSSLFASPGTIIDSGTVITRLPTAAYEALRTAFQQEMLHCPSI 362
Query: 386 MGKGIEDLFDTCYDLSAY--KTVVVPKITIHFLGGVDLELDVRGTLVVE-SVRQVCLGFA 442
E L DTCY+L + + +P+I +HF+G VD+ L G L + Q CL FA
Sbjct: 363 SPPPQEKLLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQACLAFA 422
Query: 443 LLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
+ + ++GN QQ +V YD+ G RLGFG
Sbjct: 423 RKSNPSHVTIIGNRQQVSLKVVYDIEGGRLGFG 455
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 129/403 (32%), Positives = 197/403 (48%), Gaps = 34/403 (8%)
Query: 89 LRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSL 147
++RD +R+ R L P +AF +G+ + EY++ + +G P + +
Sbjct: 93 MQRDTKRVAALR-RHLAAGKPT--YAEEAFGSDVVSGMEQGSGEYFVRIGVGSPPRNQYV 149
Query: 148 LLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKC 207
++D+GS I W QC+PC C Q DP F+P+ S +++ + C ST C + C
Sbjct: 150 VIDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSYAGVSCASTVCSHV-------DNAGC 202
Query: 208 SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGI 267
C Y+++Y DGS G A + +T G +GC +N G GA+G+
Sbjct: 203 HEGRCRYEVSYGDGSYTKGTLALETLTF------GRTLIRNVAIGCGHHNQGMFVGAAGL 256
Query: 268 MGLDRGPVSIISKTNISY---FFYCLHS-PYGSTGYITFGKPDTVNKKFVKYTPIVTTPE 323
+GL GP+S + + F YCL S S+G + FG+ + P++ P
Sbjct: 257 LGLGSGPMSFVGQLGGQAGGTFSYCLVSRGIQSSGLLQFGREAV--PVGAAWVPLIHNPR 314
Query: 324 QSEFYHITLTGISVGGERLPLKASYFTKLS------TEIDSGTIITRFPAPVYSALRSAF 377
FY++ L+G+ VGG R+P+ F KLS +D+GT +TR P Y A R AF
Sbjct: 315 AQSFYYVGLSGLGVGGLRVPISEDVF-KLSELGDGGVVMDTGTAVTRLPTAAYEAFRDAF 373
Query: 378 RKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQ 436
+ G+ +FDTCYDL + +V VP ++ +F GG L L R L+ V+ V
Sbjct: 374 IAQTTNLPRASGVS-IFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGS 432
Query: 437 VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
C FA PS ++GN+QQ G E+ D A +GFGP C
Sbjct: 433 FCFAFA--PSSSGLSIIGNIQQEGIEISVDGANGFVGFGPNVC 473
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 121/345 (35%), Positives = 175/345 (50%), Gaps = 24/345 (6%)
Query: 141 PKQYVSLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEW 198
P +++LD+ S + W QC PC C Q D F+DPS+S T + C+S TC L +
Sbjct: 25 PGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPY 84
Query: 199 FPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNT 258
+ C++ +C Y + Y DGS +G + D +T+ N A F GC+
Sbjct: 85 -----ANGCANNQCQYLVRYPDGSSTSGAYIADLLTLDAGN-----AVSGFKFGCSHAEQ 134
Query: 259 GDQNG-ASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVK 314
G + A+GIM L GP S++S+T Y F YC+ + +G+ T G P + ++V
Sbjct: 135 GSFDARAAGIMALGGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGVPRRASSRYV- 193
Query: 315 YTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALR 374
TP+V + + FY + L I+VGG+RL + + F S +DS T ITR P Y ALR
Sbjct: 194 VTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSV-LDSRTAITRLPPTAYQALR 252
Query: 375 SAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESV 434
+AFR M Y+ + DTCYD + + +PKI++ F L LD G L +
Sbjct: 253 AAFRSSMTMYRSAP-PKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILFND-- 309
Query: 435 RQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL F D +LG+VQQ+ EV YDV G +GF G C
Sbjct: 310 ---CLAFTSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 121/373 (32%), Positives = 181/373 (48%), Gaps = 36/373 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ V+ +G P + +++DTGS + W QC PC C +Q P +DP SKT +IPC S
Sbjct: 91 EYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDPRNSKTHRRIPCASP 150
Query: 191 TCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
C+ +L + C ++ C Y + Y DGS +G ATD + + + +
Sbjct: 151 QCRGVLRY------PGCDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDT-----RVHN 199
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL----HSPYGSTGYIT 301
LGC +N G A+G++G RG +S ++ +Y F YCL S+ Y+
Sbjct: 200 VTLGCGHDNEGLLASAAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNSSSYLV 259
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------ 355
FG+ T +TP+ T P + Y++ + G SVGGER+ ++ L+
Sbjct: 260 FGR--TPELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATGRGGV 317
Query: 356 -IDSGTIITRFPAPVYSALRSAFRKRMKKYKMG--KGIEDLFDTCYDLSAY---KTVVVP 409
+DSGT I+RF Y+A+R AF M + +FDTCYD+ V VP
Sbjct: 318 VVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNGPGTGVRVP 377
Query: 410 KITIHFLGGVDLELDVRGTL--VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
I +HF D+ L L VV R+ L +D +LGNVQQ+G+ V +DV
Sbjct: 378 SIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGVVFDV 437
Query: 468 AGRRLGFGPGNCN 480
R+GF P C+
Sbjct: 438 ERGRIGFTPNGCS 450
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 120/357 (33%), Positives = 186/357 (52%), Gaps = 31/357 (8%)
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQ 204
+++++DTGS +TW QCKPC C QRDP FDPS S +++ +PCN++ C+ L+
Sbjct: 176 LTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKA-ATGVP 234
Query: 205 DKCS----------SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
C+ S+ C Y +AY DGS G ATD + + + +G F+ GC
Sbjct: 235 GSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDG------FVFGCG 288
Query: 255 DNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYG--STGYITFGKPDTV- 308
+N G G +G+MGL R +S++S+T + F YCL + + G ++ G +
Sbjct: 289 LSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSY 348
Query: 309 -NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPA 367
N V YT ++ P Q FY + +TG SV + A+ + +DSGT+ITR
Sbjct: 349 RNATPVSYTRMIADPAQPPFYFMNVTGASV--GGAAVAAAGLGAANVLLDSGTVITRLAP 406
Query: 368 PVYSALRSAFRKRM--KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDV 425
VY A+R+ F ++ ++Y L D CY+L+ + V VP +T+ GG D+ +D
Sbjct: 407 SVYRAVRAEFARQFGAERYPAAPPFS-LLDACYNLTGHDEVKVPLLTLRLEGGADMTVDA 465
Query: 426 RGTLVV--ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G L + + QVCL A L + + ++GN QQ+ V YD G RLGF +C+
Sbjct: 466 AGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 522
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 120/357 (33%), Positives = 186/357 (52%), Gaps = 31/357 (8%)
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQ 204
+++++DTGS +TW QCKPC C QRDP FDPS S +++ +PCN++ C+ L+
Sbjct: 177 LTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKA-ATGVP 235
Query: 205 DKCS----------SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
C+ S+ C Y +AY DGS G ATD + + + +G F+ GC
Sbjct: 236 GSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDG------FVFGCG 289
Query: 255 DNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYG--STGYITFGKPDTV- 308
+N G G +G+MGL R +S++S+T + F YCL + + G ++ G +
Sbjct: 290 LSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSY 349
Query: 309 -NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPA 367
N V YT ++ P Q FY + +TG SV + A+ + +DSGT+ITR
Sbjct: 350 RNATPVSYTRMIADPAQPPFYFMNVTGASV--GGAAVAAAGLGAANVLLDSGTVITRLAP 407
Query: 368 PVYSALRSAFRKRM--KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDV 425
VY A+R+ F ++ ++Y L D CY+L+ + V VP +T+ GG D+ +D
Sbjct: 408 SVYRAVRAEFARQFGAERYPAAPPFS-LLDACYNLTGHDEVKVPLLTLRLEGGADMTVDA 466
Query: 426 RGTLVV--ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G L + + QVCL A L + + ++GN QQ+ V YD G RLGF +C+
Sbjct: 467 AGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 523
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 137/427 (32%), Positives = 199/427 (46%), Gaps = 48/427 (11%)
Query: 87 EILRRDQQRLHLKNSRRLQKAI------------PDNFKKTKAFTFPAKTGIVAADEYYI 134
++L+R +R H + SR + +A + K P G E+ +
Sbjct: 62 QLLQRAARRSHHRMSRLVARATGAASTSSSKAAAAGDGSGGKDLQVPVHAG---NGEFLM 118
Query: 135 VVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
+++G P + ++DTGS + WTQCKPC+ C Q P FDP+ S T++ +PC+S C
Sbjct: 119 DLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAASSTYAALPCSSALCAD 178
Query: 195 LLEWFPPNGQDKCSSKECP-YDIAYVDGSGETGFWATDRMTI--QEVNGNGYFARYPFLL 251
L + S+ Y Y D S G AT+ T+ Q+V G +
Sbjct: 179 LPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLARQKVPGVAF-------- 230
Query: 252 GCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTG------YITFGK 304
GC D N GD +G++GL RGP+S++S+ I F YCL S + G G
Sbjct: 231 GCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDRFSYCLTSLDDAAGRSPLLLGSAAGI 290
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSG 359
+ + TP+V P Q FY+++LTG++VG RL L +S F +DSG
Sbjct: 291 SASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDGTGGVIVDSG 350
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT-----VVVPKITIH 414
T IT Y ALR AF M + E D C+ A V VPK+ +H
Sbjct: 351 TSITYLELRAYRALRKAFVAHMSLPTV-DASEIGLDLCFQGPAGAVDQDVQVQVPKLVLH 409
Query: 415 FLGGVDLELDVRGTLVVESVR-QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
F GG DL+L +V++S +CL ++ S SI +GN QQ+ ++ YDVAG L
Sbjct: 410 FDGGADLDLPAENYMVLDSASGALCL--TVMASRGLSI-IGNFQQQNFQFVYDVAGDTLS 466
Query: 474 FGPGNCN 480
F P CN
Sbjct: 467 FAPAECN 473
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 130/413 (31%), Positives = 212/413 (51%), Gaps = 45/413 (10%)
Query: 89 LRRDQQRLHLKNSRRLQKAIPD-----------NFKKTKAFTFPAKTGIV-AADEYYIVV 136
L RD R++ N++ LQ A+ + + + P +G + EY+ V
Sbjct: 103 LARDTARVNSLNTK-LQLALSSLNRSDLYPTETELLRPEDLSTPVSSGTAQGSGEYFSRV 161
Query: 137 AIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILL 196
+G+P + ++LDTGS + W QCKPC C QQ DP FDP+ S +++ + C++ C+ L
Sbjct: 162 GVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDPTASSSYNPLTCDAQQCQDL- 220
Query: 197 EWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDN 256
C + +C Y ++Y DGS G + T+ ++ G G R +GC +
Sbjct: 221 ------EMSACRNGKCLYQVSYGDGSFTVGEYVTETVSF----GAGSVNR--VAIGCGHD 268
Query: 257 NTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL-HSPYGSTGYITFGKP---DTVNKKF 312
N G G++G++GL GP+S+ S+ + F YCL G + + F P D+V
Sbjct: 269 NEGLFVGSAGLLGLGGGPLSLTSQIKATSFSYCLVDRDSGKSSTLEFNSPRPGDSV---- 324
Query: 313 VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRFPA 367
P++ + + FY++ LTG+SVGGE + + F + +DSGT ITR
Sbjct: 325 --VAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAITRLRT 382
Query: 368 PVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRG 427
Y+++R AF+++ + +G+ LFDTCYDLS+ ++V VP ++ HF G L +
Sbjct: 383 QAYNSVRDAFKRKTSNLRPAEGVA-LFDTCYDLSSLQSVRVPTVSFHFSGDRAWALPAKN 441
Query: 428 TLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L+ V+ C FA P+ + ++GNVQQ+G V +D+A +GF P C
Sbjct: 442 YLIPVDGAGTYCFAFA--PTTSSMSIIGNVQQQGTRVSFDLANSLVGFSPNKC 492
>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 452
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 140/460 (30%), Positives = 203/460 (44%), Gaps = 61/460 (13%)
Query: 35 VSVSSLIPPTVCNRTRTALPQGPGKVS--LEVLGRYGPCSKLNQGKSRNTPSLEEILRRD 92
VS +S +P + C+ P S L + R+GPC+ ++ S PS+ + LR D
Sbjct: 39 VSAASFVPSSTCSSPDRVPPHRRNGTSAVLRLTHRHGPCAP-SRASSLAAPSVADTLRAD 97
Query: 93 QQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD----EYYIVVAIGKPKQYVSLL 148
Q+R RR+ P + A D Y + ++G P ++
Sbjct: 98 QRRAEYIL-RRVSGRAPQLWDSKAAAAVATVPASWGYDIGTLNYVVTASLGTPGVAQTME 156
Query: 149 LDTGSGITWTQCKPCI---HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
+DTGS ++W QCKPC C Q+DP FDP++S +++ +PC C
Sbjct: 157 VDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCA------------ 204
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGAS 265
G G A V G F GC +G NG
Sbjct: 205 ---------------GLGIYAASACSAAQCGAVQG--------FFFGCGHAQSGLFNGVD 241
Query: 266 GIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITF--GKPDTVNKKFVKYTPIVT 320
G++GL R S++ +T +Y F YCL + + GY+T G P F T ++
Sbjct: 242 GLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGF-STTQLLP 300
Query: 321 TPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKR 380
+P +Y + LTGISVGG++L + AS F + T++TR P Y+ALRSAFR
Sbjct: 301 SPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTG-TVVTRLPPTAYAALRSAFRSG 359
Query: 381 MKKYKMGKGIED-LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCL 439
M Y + + DTCY+ + Y TV +P + + F G + L G L CL
Sbjct: 360 MASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGILSFG-----CL 414
Query: 440 GFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
FA SD +LGNVQQR +EV D G +GF P +C
Sbjct: 415 AFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 452
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 132/381 (34%), Positives = 190/381 (49%), Gaps = 38/381 (9%)
Query: 117 AFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDP 176
FT P A EY V +G P++ S+++DTGS +TW QC PC C Q D F P
Sbjct: 1 GFTAPVA---AARGEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLP 57
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQ 236
+ S +F+K+ C S C L FP C+ C Y +Y DGS TG + D +T+
Sbjct: 58 NTSTSFTKLACGSALCNGLP--FP-----MCNQTTCVYWYSYGDGSLTTGDFVYDTITMD 110
Query: 237 EVNGNGYFARYP-FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL-- 290
+NG + P F GC +N G GA GI+GL +GP+S S+ Y F YCL
Sbjct: 111 GINGQK--QQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVD 168
Query: 291 -HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF 349
+P T + FG VKY PI+ P+ +Y++ L GISVG L + ++ F
Sbjct: 169 WLAPPTQTSPLLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVF 228
Query: 350 TKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--FDTCYDLSA 402
S T DSGT +T+ Y + +A Y + I+D+ D C LS
Sbjct: 229 DIDSVGGAGTIFDSGTTVTQLAEAAYKEVLAAMNASTMAYS--RKIDDISRLDLC--LSG 284
Query: 403 Y---KTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQ 458
+ + VP +T HF GG D+ L + +ES + C + S P+ ++G+VQQ
Sbjct: 285 FPKDQLPTVPAMTFHFEGG-DMVLPPSNYFIYLESSQSYCFA---MTSSPDVNIIGSVQQ 340
Query: 459 RGYEVHYDVAGRRLGFGPGNC 479
+ ++V+YD AGR+LGF P +C
Sbjct: 341 QNFQVYYDTAGRKLGFVPKDC 361
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 124/356 (34%), Positives = 190/356 (53%), Gaps = 25/356 (7%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ V IG P ++V +++DTGS + W QC PC C QQ DP F+PS S +++ + C +
Sbjct: 154 EYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCETH 213
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
CK L +C + C Y+++Y DGS G +AT+ +T+ +G +
Sbjct: 214 QCKSL-------DVSECRNDSCLYEVSYGDGSYTVGDFATETITL-----DGSASLNNVA 261
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL-HSPYGSTGYITFGKPDTVN 309
+GC +N G GA+G++GL G +S S+ N S F YCL + S + F P +
Sbjct: 262 IGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCLVNRDTDSASTLEFNSP--IP 319
Query: 310 KKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITR 364
V P++ + FY++ +TGI VGG+ L + S F + +DSGT +TR
Sbjct: 320 SHSVT-APLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIVDSGTAVTR 378
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELD 424
+ VY++LR +F + + G+ LFDTCYDLS+ +V VP ++ HF G L L
Sbjct: 379 LQSDVYNSLRDSFVRGTQHLPSTSGVA-LFDTCYDLSSRSSVEVPTVSFHFPDGKYLALP 437
Query: 425 VRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ L+ V+S C FA P+ ++GNVQQ+G V YD++ +GF P C
Sbjct: 438 AKNYLIPVDSAGTFCFAFA--PTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNGC 491
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 127/359 (35%), Positives = 188/359 (52%), Gaps = 32/359 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ V IG P + V ++LDTGS + W QC PC C Q +P F+PS S ++ + C++
Sbjct: 150 EYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTP 209
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C L +C + C Y+++Y DGS G +AT+ +TI G
Sbjct: 210 QCNAL-------EVSECRNATCLYEVSYGDGSYTVGDFATETLTI------GSTLVQNVA 256
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL-HSPYGSTGYITFGK---PD 306
+GC +N G GA+G++GL G +++ S+ N + F YCL S + FG PD
Sbjct: 257 VGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSASTVEFGTSLPPD 316
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTI 361
V P++ + FY++ LTGISVGGE L + S F + IDSGT
Sbjct: 317 AV------VAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTA 370
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
+TR +Y++LR +F K + G+ +FDTCY+LSA T+ VP + HF GG L
Sbjct: 371 VTRLQTGIYNSLRDSFLKGTSDLEKAAGVA-MFDTCYNLSAKTTIEVPTVAFHFPGGKML 429
Query: 422 ELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L + ++ V+SV CL FA P+ + ++GNVQQ+G V +D+A +GF C
Sbjct: 430 ALPAKNYMIPVDSVGTFCLAFA--PTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 125/383 (32%), Positives = 190/383 (49%), Gaps = 34/383 (8%)
Query: 110 DNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQ 168
D +T+ T P +G + EY+ + +G P + + L+LDTGS + W QC+PC C Q
Sbjct: 139 DTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQ 198
Query: 169 QRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFW 228
Q DP F+P+ S T+ + C++ C +L C S +C Y ++Y DGS G
Sbjct: 199 QSDPVFNPTSSSTYKSLTCSAPQCSLL-------ETSACRSNKCLYQVSYGDGSFTVGEL 251
Query: 229 ATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFY 288
ATD +T A LGC +N G GA+G++GL G +SI ++ + F Y
Sbjct: 252 ATDTVTFGNSGKINNVA-----LGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSY 306
Query: 289 CLHSPYGSTGYITFGKPDTVNKKFVKY------TPIVTTPEQSEFYHITLTGISVGGERL 342
CL GK +++ V+ P++ + FY++ L+G SVGGE++
Sbjct: 307 CLVDRDS-------GKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKV 359
Query: 343 PLKASYFTKLSTE-----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTC 397
L + F ++ +D GT +TR Y++LR AF K K G LFDTC
Sbjct: 360 VLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTC 419
Query: 398 YDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNV 456
YD S+ TV VP + HF GG L+L + L+ V+ C FA P+ + ++GNV
Sbjct: 420 YDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFA--PTSSSLSIIGNV 477
Query: 457 QQRGYEVHYDVAGRRLGFGPGNC 479
QQ+G + YD++ +G C
Sbjct: 478 QQQGTRITYDLSKNVIGLSGNKC 500
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 129/379 (34%), Positives = 191/379 (50%), Gaps = 35/379 (9%)
Query: 114 KTKAFTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP 172
K + + P +G + EY+ V +G P + ++LDTGS I W QC+PC C QQ DP
Sbjct: 1 KPEDLSTPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDP 60
Query: 173 FFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDR 232
FDP+ S T++ + C S C L C S +C Y + Y DGS G +AT+
Sbjct: 61 IFDPTASSTYAPVTCQSQQCSSL-------EMSSCRSGQCLYQVNYGDGSYTFGDFATES 113
Query: 233 MTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL-H 291
++ GN + LGC +N G GA+G++GL GP+S+ ++ + F YCL +
Sbjct: 114 VSF----GNSGSVK-NVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCLVN 168
Query: 292 SPYGSTGYITFGKP----DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKAS 347
+ + F D+V +K I T FY++ L+G+SVGG+ + + S
Sbjct: 169 RDSAGSSTLDFNSAQLGVDSVTAPLMKNRKIDT------FYYVGLSGMSVGGQMVSIPES 222
Query: 348 YFTKLSTE------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS 401
F +L +D GT ITR Y+ LR AF + + K+ + LFDTCYDLS
Sbjct: 223 TF-RLDESGNGGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVA-LFDTCYDLS 280
Query: 402 AYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRG 460
+V VP ++ HF G L L+ V+S C FA P+ + ++GNVQQ+G
Sbjct: 281 GQASVRVPTVSFHFADGKSWNLPAANYLIPVDSAGTYCFAFA--PTTSSLSIIGNVQQQG 338
Query: 461 YEVHYDVAGRRLGFGPGNC 479
V +D+A R+GF P C
Sbjct: 339 TRVTFDLANNRMGFSPNKC 357
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 134/362 (37%), Positives = 191/362 (52%), Gaps = 31/362 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ + +G P +YV ++LDTGS + W QC PC C Q D FDP+KS+T++ IPC +
Sbjct: 117 EYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYAGIPCGAP 176
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C+ L P +K +K C Y ++Y DGS G ++T+ +T + N A
Sbjct: 177 LCRRLDS---PGCSNK--NKVCQYQVSYGDGSFTFGDFSTETLTFRR-NRVTRVA----- 225
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL--HSPYGSTGYITFGKP 305
LGC +N G GA+G++GL RG +S +T + F YCL S + FG
Sbjct: 226 LGCGHDNEGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSYCLVDRSASAKPSSVIFG-- 283
Query: 306 DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLP-LKASYFTKLSTE------IDS 358
D+ + +TP++ P+ FY++ L GISVGG + L AS F +L IDS
Sbjct: 284 DSAVSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLF-RLDAAGNGGVIIDS 342
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGG 418
GT +TR P Y ALR AFR K LFDTC+DLS V VP + +HF G
Sbjct: 343 GTSVTRLTRPAYIALRDAFRIGASHLKRAPEFS-LFDTCFDLSGLTEVKVPTVVLHFRGA 401
Query: 419 VDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
D+ L L+ V++ C FA S + ++GN+QQ+G+ + YD+ G R+GF P
Sbjct: 402 -DVSLPATNYLIPVDNSGSFCFAFAGTMSGLS--IIGNIQQQGFRISYDLTGSRVGFAPR 458
Query: 478 NC 479
C
Sbjct: 459 GC 460
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 143/472 (30%), Positives = 223/472 (47%), Gaps = 38/472 (8%)
Query: 19 NNGAYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGK 78
NN +Y L+ ++ + +IP V +G K ++V+ R +L+ G
Sbjct: 35 NNSSYPTFQHLNVKETIAGTRIIPLEVSEDHE----EGGEKWMMKVVHR----DQLSFGN 86
Query: 79 SRN-TPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVA 137
S + L+ L+RD +R+ RRL +++ T + EY++ +
Sbjct: 87 SDDHRHRLDGRLKRDAKRVA-SLIRRLSSGGGGSYRVDDFGTDVISGMEQGSGEYFVRIG 145
Query: 138 IGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLE 197
+G P + +++D+GS I W QC+PC C Q DP FDP+ S +F+ + C+S+ C L
Sbjct: 146 VGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCSSSVCDRLE- 204
Query: 198 WFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNN 257
C + C Y+++Y DGS G A + +T G +GC N
Sbjct: 205 ------NAGCHAGRCRYEVSYGDGSYTKGTLALETLTF------GRTMVRSVAIGCGHRN 252
Query: 258 TGDQNGASGIMGLDRGPVSIISK---TNISYFFYCLHS-PYGSTGYITFGKPDTVNKKFV 313
G GA+G++GL G +S + + F YCL S S+G + FG+
Sbjct: 253 RGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTDSSGSLVFGREAL--PAGA 310
Query: 314 KYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF--TKL---STEIDSGTIITRFPAP 368
+ P+V P FY+I L G+ VGG R+P+ F T+L +D+GT +TR P
Sbjct: 311 AWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTL 370
Query: 369 VYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGT 428
Y A R AF + G+ +FDTCYDL + +V VP ++ +F GG L L R
Sbjct: 371 AYQAFRDAFLAQTANLPRATGVA-IFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNF 429
Query: 429 LV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L+ ++ C FA PS +LGN+QQ G ++ +D A +GFGP C
Sbjct: 430 LIPMDDAGTFCFAFA--PSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 479
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 136/372 (36%), Positives = 191/372 (51%), Gaps = 39/372 (10%)
Query: 128 AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPC 187
+ EY++ + +G P + ++LDTGS + W QC PC C Q DP F+P+KSKTF+ +PC
Sbjct: 132 GSGEYFMRLGVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPC 191
Query: 188 NSTTCKILLEWFPPNGQDKC---SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF 244
S C+ L + +C SK C Y ++Y DGS G ++T+ +T
Sbjct: 192 GSRLCRRL------DDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHG------- 238
Query: 245 ARYPFL-LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL------HSPY 294
AR + LGC +N G GA+G++GL RG +S S+T Y F YCL S
Sbjct: 239 ARVDHVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSS 298
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLST 354
I FG + K +TP++T P+ FY++ L GISVGG R+P + KL
Sbjct: 299 KPPSTIVFG--NGAVPKTAVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDA 356
Query: 355 E------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVV 408
IDSGT +TR Y ALR AFR + K LFDTC+DLS TV V
Sbjct: 357 TGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATRLKRAPSYS-LFDTCFDLSGMTTVKV 415
Query: 409 PKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
P + HF GG ++ L L+ V + + C FA + ++GN+QQ+G+ V YD+
Sbjct: 416 PTVVFHFTGG-EVSLPASNYLIPVNNQGRFCFAFAGTMGSLS--IIGNIQQQGFRVAYDL 472
Query: 468 AGRRLGFGPGNC 479
G R+GF C
Sbjct: 473 VGSRVGFLSRAC 484
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 125/383 (32%), Positives = 190/383 (49%), Gaps = 34/383 (8%)
Query: 110 DNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQ 168
D +T+ T P +G + EY+ + +G P + + L+LDTGS + W QC+PC C Q
Sbjct: 139 DTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQ 198
Query: 169 QRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFW 228
Q DP F+P+ S T+ + C++ C +L C S +C Y ++Y DGS G
Sbjct: 199 QSDPVFNPTSSSTYKSLTCSAPQCSLL-------ETSACRSNKCLYQVSYGDGSFTVGEL 251
Query: 229 ATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFY 288
ATD +T A LGC +N G GA+G++GL G +SI ++ + F Y
Sbjct: 252 ATDTVTFGNSGKINNVA-----LGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSY 306
Query: 289 CLHSPYGSTGYITFGKPDTVNKKFVKY------TPIVTTPEQSEFYHITLTGISVGGERL 342
CL GK +++ V+ P++ + FY++ L+G SVGGE++
Sbjct: 307 CLVDRDS-------GKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKV 359
Query: 343 PLKASYFTKLSTE-----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTC 397
L + F ++ +D GT +TR Y++LR AF K K G LFDTC
Sbjct: 360 VLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTC 419
Query: 398 YDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNV 456
YD S+ TV VP + HF GG L+L + L+ V+ C FA P+ + ++GNV
Sbjct: 420 YDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFA--PTSSSLSIIGNV 477
Query: 457 QQRGYEVHYDVAGRRLGFGPGNC 479
QQ+G + YD++ +G C
Sbjct: 478 QQQGTRITYDLSKNVIGLSGNKC 500
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 121/372 (32%), Positives = 188/372 (50%), Gaps = 24/372 (6%)
Query: 116 KAFTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFF 174
+A T P +G+ + EY+ + +G P + + L+LDTGS + W QC+PC C QQ DP F
Sbjct: 145 EALTTPVVSGVSQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCSDCYQQSDPVF 204
Query: 175 DPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMT 234
+P+ S T+ + C++ C +L C S +C Y ++Y DGS G ATD +T
Sbjct: 205 NPTSSSTYKSLTCSAPQCSLL-------ETSACRSNKCLYQVSYGDGSFTVGELATDTVT 257
Query: 235 IQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL-HSP 293
A LGC +N G GA+G++GL G +SI ++ + F YCL
Sbjct: 258 FGNSGKINDVA-----LGCGHDNEGLFTGAAGLLGLGGGALSITNQMKATSFSYCLVDRD 312
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
G + + F + P++ + FY++ L+G SVGG+++ + + F +
Sbjct: 313 SGKSSSLDFNSVQLGSGD--ATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDA 370
Query: 354 TE-----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVV 408
+ +D GT +TR Y++LR AF K K G LFDTCYD S+ +V V
Sbjct: 371 SGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFSSLSSVKV 430
Query: 409 PKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
P + HF GG L+L + L+ V+ C FA P+ + ++GNVQQ+G + YD+
Sbjct: 431 PTVAFHFTGGKSLDLPAKNYLIPVDDNGTFCFAFA--PTSSSLSIIGNVQQQGTRITYDL 488
Query: 468 AGRRLGFGPGNC 479
A + +G C
Sbjct: 489 ANKIIGLSGNKC 500
>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 324
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 121/340 (35%), Positives = 176/340 (51%), Gaps = 27/340 (7%)
Query: 149 LDTGSGITWTQCKPCI---HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
+DTGS ++W QCKPC C Q+DP FDP++S +++ +PC C L +
Sbjct: 3 VDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYA----AS 58
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGAS 265
CS+ +C Y ++Y DGS TG +++D +T+ + A F GC +G NG
Sbjct: 59 ACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS-----AVQGFFFGCGHAQSGLFNGVD 113
Query: 266 GIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFG--KPDTVNKKFVKYTPIVT 320
G++GL R S++ +T +Y F YCL + + GY+T G P F T ++
Sbjct: 114 GLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGF-STTQLLP 172
Query: 321 TPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKR 380
+P +Y + LTGISVGG++L + AS F + T++TR P Y+ALRSAFR
Sbjct: 173 SPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTG-TVVTRLPPTAYAALRSAFRSG 231
Query: 381 MKKYKMGKGIED-LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCL 439
M Y + + DTCY+ + Y TV +P + + F G + L G L CL
Sbjct: 232 MASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGILSFG-----CL 286
Query: 440 GFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
FA SD +LGNVQQR +EV D G +GF P +C
Sbjct: 287 AFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 324
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 125/368 (33%), Positives = 185/368 (50%), Gaps = 37/368 (10%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
E+ + V+IG P S ++DTGS + WTQCKPC+ C +Q P FDPS S T++ +PC+S
Sbjct: 104 EFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSA 163
Query: 191 TCKILLEWFPPNGQDKC-SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP- 248
+C L P KC S+ +C Y Y D S G AT+ T+ + ++ P
Sbjct: 164 SCSDL-----PT--SKCTSASKCGYTYTYGDSSSTQGVLATETFTLAK-------SKLPG 209
Query: 249 FLLGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST------GYIT 301
+ GC D N GD + +G++GL RGP+S++S+ + F YCL S + G +
Sbjct: 210 VVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTNNSPLLLGSLA 269
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL-----STEI 356
+ V+ TP++ P Q FY+++L I+VG R+ L +S F +
Sbjct: 270 GISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIV 329
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYDLSA--YKTVVVPKITI 413
DSGT IT Y AL+ AF +M G G+ D C+ A V VP++
Sbjct: 330 DSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVG--LDLCFRAPAKGVDQVEVPRLVF 387
Query: 414 HFLGGVDLELDVRGTLVVESVR-QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRL 472
HF GG DL+L +V++ +CL ++ S SI +GN QQ+ ++ YDV L
Sbjct: 388 HFDGGADLDLPAENYMVLDGGSGALCL--TVMGSRGLSI-IGNFQQQNFQFVYDVGHDTL 444
Query: 473 GFGPGNCN 480
F P CN
Sbjct: 445 SFAPVQCN 452
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 184 bits (467), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 125/368 (33%), Positives = 185/368 (50%), Gaps = 37/368 (10%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
E+ + V+IG P S ++DTGS + WTQCKPC+ C +Q P FDPS S T++ +PC+S
Sbjct: 94 EFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSA 153
Query: 191 TCKILLEWFPPNGQDKC-SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP- 248
+C L P KC S+ +C Y Y D S G AT+ T+ + ++ P
Sbjct: 154 SCSDL-----PT--SKCTSASKCGYTYTYGDSSSTQGVLATETFTLAK-------SKLPG 199
Query: 249 FLLGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST------GYIT 301
+ GC D N GD + +G++GL RGP+S++S+ + F YCL S + G +
Sbjct: 200 VVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTNNSPLLLGSLA 259
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----I 356
+ V+ TP++ P Q FY+++L I+VG R+ L +S F +
Sbjct: 260 GISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIV 319
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYDLSA--YKTVVVPKITI 413
DSGT IT Y AL+ AF +M G G+ D C+ A V VP++
Sbjct: 320 DSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVG--LDLCFRAPAKGVDQVEVPRLVF 377
Query: 414 HFLGGVDLELDVRGTLVVESVR-QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRL 472
HF GG DL+L +V++ +CL ++ S SI +GN QQ+ ++ YDV L
Sbjct: 378 HFDGGADLDLPAENYMVLDGGSGALCL--TVMGSRGLSI-IGNFQQQNFQFVYDVGHDTL 434
Query: 473 GFGPGNCN 480
F P CN
Sbjct: 435 SFAPVQCN 442
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 184 bits (467), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 129/416 (31%), Positives = 198/416 (47%), Gaps = 41/416 (9%)
Query: 84 SLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQ 143
S+ + R D RL +S+ T + P +G + Y + +G P Q
Sbjct: 39 SIIALAREDDARLLFLSSKA---------ASTGVSSAPVASG-QSPPSYVVRAGLGSPAQ 88
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
+ L LDT + TW C PC C F P+ S +++ +PC+ST C +L + P
Sbjct: 89 PILLALDTSADATWAHCSPCGTCPSSGS-LFAPANSTSYAPLPCSSTMCTVL-QGQPCPA 146
Query: 204 QDKCSSKE----CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTG 259
QD S C + + D S + A+D + + G Y F GC +G
Sbjct: 147 QDPYDSSAPLPMCAFTKPFADASFQASL-ASDWLHL----GKDAIPNYAF--GCVSAVSG 199
Query: 260 DQNG--ASGIMGLDRGPVSIISKTNISY---FFYCLHS--PYGSTGYITFGKPDTVNKKF 312
G++GL RGP++++S+ Y F YCL S Y +G + G +
Sbjct: 200 PTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPSYKSYYFSGSLRLGAAG--QPRG 257
Query: 313 VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-----TKLSTEIDSGTIITRFPA 367
V+YTP++ P +S Y++ +TG+SVG + + A F T T +DSGT+ITR+
Sbjct: 258 VRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPATGAGTVVDSGTVITRWTP 317
Query: 368 PVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRG 427
PVY+ALR FR+ + G FDTC++ V P +T+H GG+DL L +
Sbjct: 318 PVYAALREEFRRHVAA-PSGYTSLGAFDTCFNTDEVAAGVAPAVTVHMDGGLDLALPMEN 376
Query: 428 TLVVESVRQV-CLGFALLPSDPNSI--LLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
TL+ S + CL A P + N++ +L N+QQ+ V +DVA R+GF +CN
Sbjct: 377 TLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRVVFDVANSRVGFARESCN 432
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 126/360 (35%), Positives = 186/360 (51%), Gaps = 33/360 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ V +G P + ++LDTGS I W QC+PC C QQ DP F P+ S ++S + C+S
Sbjct: 158 EYFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAASSSYSPLTCDSQ 217
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C L C + +C Y + Y DGS G + T+ M+ G+G
Sbjct: 218 QCNSL-------QMSSCRNGQCRYQVNYGDGSFTFGDFVTETMSF---GGSGTVNS--IA 265
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL-HSPYGSTGYITFGKP---D 306
LGC +N G GA+G++GL GP+S+ S+ + F YCL + ++ + F D
Sbjct: 266 LGCGHDNEGLFVGAAGLLGLGGGPLSLTSQLKATSFSYCLVNRDSAASSTLDFNSAPVGD 325
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGT 360
+V +K + I T FY++ L+G+SVGGE L + F KL +D GT
Sbjct: 326 SVIAPLLKSSKIDT------FYYVGLSGMSVGGELLRIPQEVF-KLDDSGDGGVIVDCGT 378
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVD 420
ITR + Y++LR +F + + G+ LFDTCYDLS +V VP ++ HF GG
Sbjct: 379 AITRLQSEAYNSLRDSFVSMSRHLRSTSGVA-LFDTCYDLSGQSSVKVPTVSFHFDGGKS 437
Query: 421 LELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+L L+ V+S C FA P+ + ++GNVQQ+G V +D+A R+GF C
Sbjct: 438 WDLPAANYLIPVDSAGTYCFAFA--PTTSSLSIIGNVQQQGTRVSFDLANNRVGFSTNKC 495
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 124/367 (33%), Positives = 183/367 (49%), Gaps = 35/367 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
E+ + V+IG P S ++DTGS + WTQCKPC+ C +Q P FDPS S T++ +PC+S
Sbjct: 73 EFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSA 132
Query: 191 TCKILLEWFPPNGQDKC-SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP- 248
+C L P KC S+ +C Y Y D S G AT+ T+ + ++ P
Sbjct: 133 SCSDL-----PT--SKCTSASKCGYTYTYGDSSSTQGVLATETFTLAK-------SKLPG 178
Query: 249 FLLGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST------GYIT 301
+ GC D N GD + +G++GL RGP+S++S+ + F YCL S + G +
Sbjct: 179 VVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTNNSPLLLGSLA 238
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----I 356
+ V+ TP++ P Q FY+++L I+VG R+ L +S F +
Sbjct: 239 GISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIV 298
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYDLSA--YKTVVVPKITI 413
DSGT IT Y AL+ AF +M G G+ D C+ A V VP++
Sbjct: 299 DSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVG--LDLCFRAPAKGVDQVEVPRLVF 356
Query: 414 HFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
HF GG DL+L +V++ L ++ S SI +GN QQ+ ++ YDV L
Sbjct: 357 HFDGGADLDLPAENYMVLDGGSGA-LCLTVMGSRGLSI-IGNFQQQNFQFVYDVGHDTLS 414
Query: 474 FGPGNCN 480
F P CN
Sbjct: 415 FAPVQCN 421
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 133/396 (33%), Positives = 194/396 (48%), Gaps = 32/396 (8%)
Query: 94 QRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD-EYYIVVAIGKPKQYVSLLLDTG 152
QR + RLQ+ KT +F + + A + E+ + +AIG P + S ++DTG
Sbjct: 62 QRAVKRGRLRLQRL----SAKTASFEPSVEAPVHAGNGEFLMNLAIGTPAETYSAIMDTG 117
Query: 153 SGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKEC 212
S + WTQCKPC C Q P FDP KS +FSK+PC+S C L CS C
Sbjct: 118 SDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVAL-------PISSCSDG-C 169
Query: 213 PYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ-NGASGIMGLD 271
Y +Y D S G AT+ T G+ ++ F GC ++N G + +G++GL
Sbjct: 170 EYRYSYGDHSSTQGVLATETFTF----GDASVSKIGF--GCGEDNRGRAYSQGAGLVGLG 223
Query: 272 RGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHIT 331
RGP+S+IS+ + F YCL S S G T K TP++ P + FY+++
Sbjct: 224 RGPLSLISQLGVPKFSYCLTSIDDSKGISTLLVGSEATVKSAIPTPLIQNPSRPSFYYLS 283
Query: 332 LTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRFPAPVYSALRSAFRKRMKKYKM 386
L GISVG LP++ S F+ IDSGT IT ++AL+ F +MK
Sbjct: 284 LEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDSAFAALKKEFISQMKLDVD 343
Query: 387 GKGIEDLFDTCYDLSAYKTVV-VPKITIHFLGGVDLELDVRGTLVVES-VRQVCLGFALL 444
G +L + C+ L + V VP++ HF GVDL+L ++ +S +R +CL
Sbjct: 344 ASGSTEL-ELCFTLPPDGSPVDVPQLVFHF-EGVDLKLPKENYIIEDSALRVICLTMG-- 399
Query: 445 PSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
S + GN QQ+ V +D+ + F P CN
Sbjct: 400 -SSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQCN 434
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 154/491 (31%), Positives = 222/491 (45%), Gaps = 40/491 (8%)
Query: 9 LLFIWLLRSSNNGAYANDNDLSHSY-IVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGR 67
L+ I LL S++ A S Y +V+ S L P ++C+ + A P G + +
Sbjct: 5 LVVILLLSISSSVASHGAGAGSQRYHVVATSHLEPESLCSGLKVA-PSADGTW-VPLHRP 62
Query: 68 YGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV 127
+GPCS + G++ PSL E+LR DQ R R+ D K ++T
Sbjct: 63 FGPCSP-SAGRA-PAPSLLEMLRWDQVRTEYVR-RKASGGAEDVLNPAKPRVLMSQTDFA 119
Query: 128 AADEYYI---------VVAIGKPK--QYVSLLLDTGSGITWTQCKPCI--HCSQQRDPFF 174
+ + + A G P ++ +DT + W QC PC C QRDP F
Sbjct: 120 VRSPFGVGSGSGSSAWIDADGDPTVVSQQTMAIDTTVDVPWIQCAPCPIPQCYPQRDPLF 179
Query: 175 DPSKSKTFSKIPCNSTTCKILLEWFPPNG-QDKCSSKECPYDIAYVDGSGETGFWATDRM 233
DP+ S T + + C S C+ L + NG ++ ++ EC Y I Y D G + TD +
Sbjct: 180 DPTTSSTAAAVRCRSPACRSLGPYG--NGCSNRSANAECRYLIEYSDDRATAGTYMTDTL 237
Query: 234 TIQEVNGNGYFARYPFLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISY---FFYC 289
TI +G A F GC+ G + +G M L G S++++T S F YC
Sbjct: 238 TI-----SGTTAVRNFRFGCSHAVRGRFSDLTAGTMSLGGGAQSLLAQTARSLGNAFSYC 292
Query: 290 LHSPYGSTGYITFGKPDTVNKKFV-KYTPIVTTPEQSEFYHITLTGISVGGERLPLKASY 348
+ S G+++ G P T N V TP+V + Y + L GI V G RL +
Sbjct: 293 VPQASAS-GFLSIGGPATTNSTTVFATTPLVRSAINPSLYLVRLQGIVVAGRRLGIPPVA 351
Query: 349 FTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVV 408
F+ +DS +IT+ P Y ALR AFR M+ Y G DTCYD V V
Sbjct: 352 FSA-GAVMDSSAVITQLPPTAYRALRRAFRNAMRAYPR-SGATGTLDTCYDFLGLTNVRV 409
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVA 468
P +++ F GG + LD ++ CL F SD +GNVQQ+ +EV YDVA
Sbjct: 410 PAVSLVFGGGAVVVLDPPAVMIGG-----CLAFTATSSDLALGFIGNVQQQTHEVLYDVA 464
Query: 469 GRRLGFGPGNC 479
+GF G C
Sbjct: 465 AGGVGFRRGAC 475
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 122/374 (32%), Positives = 181/374 (48%), Gaps = 34/374 (9%)
Query: 126 IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKI 185
+ + EY + + IG P +Y S +LDTGS + WTQC PC+ C Q PFFDP++S +++K+
Sbjct: 83 LASEGEYLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKL 142
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
PCNS C L ++P C C Y Y D + G + + T +
Sbjct: 143 PCNSPMCNAL--YYP-----LCYRNVCVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVP 195
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL---HSPYGSTGYITF 302
R F GC + N G SG++G RGP+S++S+ F YCL SP S Y F
Sbjct: 196 RIAF--GCGNLNAGSLFNGSGMVGFGRGPLSLVSQLGSPRFSYCLTSFMSPVPSRLY--F 251
Query: 303 GKPDTVNK------KFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE- 355
G T+N + V+ TP + P Y++ +TGISVGGE LP+ S F +
Sbjct: 252 GAYATLNSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADG 311
Query: 356 -----IDSGTIITRFPAPVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYDL--SAYKTVV 407
IDSG+ IT Y + AF ++ + D+ DTC+ K V
Sbjct: 312 TGGVIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSLADVLDTCFVWPPPPRKIVT 371
Query: 408 VPKITIHFLGGVDLELDVRGTLVVE-SVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYD 466
+P++ HF G ++EL + ++++ +CL A+ SD SI +G+ Q + + V YD
Sbjct: 372 MPELAFHF-EGANMELPLENYMLIDGDTGNLCL--AIAASDDGSI-IGSFQHQNFHVLYD 427
Query: 467 VAGRRLGFGPGNCN 480
L F P CN
Sbjct: 428 NENSLLSFTPATCN 441
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 131/405 (32%), Positives = 192/405 (47%), Gaps = 40/405 (9%)
Query: 87 EILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD-EYYIVVAIGKPKQYV 145
E+L R +R SRRLQ+ + +T + A D EY + ++IG P Q
Sbjct: 58 ELLERAVER----GSRRLQR-----LEAMLNGPSGVETPVYAGDGEYLMNLSIGTPAQPF 108
Query: 146 SLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
S ++DTGS + WTQC+PC C Q P F+P S +FS +PC+S C+ L
Sbjct: 109 SAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALQS-------P 161
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTG-DQNGA 264
CS+ C Y Y DGS G T+ +T G + GC +NN G Q
Sbjct: 162 TCSNNSCQYTYGYGDGSETQGSMGTETLTF------GSVSIPNITFGCGENNQGFGQGNG 215
Query: 265 SGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKY--TPIVTTP 322
+G++G+ RGP+S+ S+ +++ F YC+ +P GS+ T N T ++ +
Sbjct: 216 AGLVGMGRGPLSLPSQLDVTKFSYCM-TPIGSSNSSTLLLGSLANSVTAGSPNTTLIQSS 274
Query: 323 EQSEFYHITLTGISVGGERLPLKASYFTKLSTE-------IDSGTIITRFPAPVYSALRS 375
+ FY+ITL G+SVG LP+ S F KL++ IDSGT +T F Y A+R
Sbjct: 275 QIPTFYYITLNGLSVGSTPLPIDPSVF-KLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQ 333
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYDL-SAYKTVVVPKITIHFLGGVDLELDVRGTLVVESV 434
AF +M + G FD C+ + S + +P +HF GG DL L + S
Sbjct: 334 AFISQM-NLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGG-DLVLPSENYFISPSN 391
Query: 435 RQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+CL A+ S + GN+QQ+ V YD + F C
Sbjct: 392 GLICL--AMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSFLSAQC 434
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 133/396 (33%), Positives = 194/396 (48%), Gaps = 32/396 (8%)
Query: 94 QRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD-EYYIVVAIGKPKQYVSLLLDTG 152
QR + RLQ+ KT +F + + A + E+ + +AIG P + S ++DTG
Sbjct: 62 QRAVKRGRLRLQRL----SAKTASFEPSVEAPVHAGNGEFLMNLAIGTPAETYSAIMDTG 117
Query: 153 SGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKEC 212
S + WTQCKPC C Q P FDP KS +FSK+PC+S C L CS C
Sbjct: 118 SDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVAL-------PISSCSDG-C 169
Query: 213 PYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ-NGASGIMGLD 271
Y +Y D S G AT+ T G+ ++ F GC ++N G + +G++GL
Sbjct: 170 EYRYSYGDHSSTQGVLATETFTF----GDASVSKIGF--GCGEDNRGRAYSQGAGLVGLG 223
Query: 272 RGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHIT 331
RGP+S+IS+ + F YCL S S G T K TP++ P + FY+++
Sbjct: 224 RGPLSLISQLGVPKFSYCLTSIDDSKGISTLLVGSEATVKSAIPTPLIQNPSRPSFYYLS 283
Query: 332 LTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRFPAPVYSALRSAFRKRMKKYKM 386
L GISVG LP++ S F+ IDSGT IT ++AL+ F +MK
Sbjct: 284 LEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDNAFAALKKEFISQMKLDVD 343
Query: 387 GKGIEDLFDTCYDLSAYKT-VVVPKITIHFLGGVDLELDVRGTLVVES-VRQVCLGFALL 444
G +L + C+ L + V VP++ HF GVDL+L ++ +S +R +CL
Sbjct: 344 ASGSTEL-ELCFTLPPDGSPVEVPQLVFHF-EGVDLKLPKENYIIEDSALRVICLTMG-- 399
Query: 445 PSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
S + GN QQ+ V +D+ + F P CN
Sbjct: 400 -SSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQCN 434
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 134/406 (33%), Positives = 193/406 (47%), Gaps = 42/406 (10%)
Query: 87 EILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD-EYYIVVAIGKPKQYV 145
E+L R +R SRRLQ+ + +T + A D EY + ++IG P Q
Sbjct: 58 ELLERAVER----GSRRLQR-----LEAMLNGPSGVETPVYAGDGEYLMNLSIGTPAQPF 108
Query: 146 SLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
S ++DTGS + WTQC+PC C Q P F+P S +FS +PC+S C+ L
Sbjct: 109 SAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALQS-------P 161
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTG-DQNGA 264
CS+ C Y Y DGS G T+ +T G + GC +NN G Q
Sbjct: 162 TCSNNSCQYTYGYGDGSETQGSMGTETLTF------GSVSIPNITFGCGENNQGFGQGNG 215
Query: 265 SGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQ 324
+G++G+ RGP+S+ S+ +++ F YC+ +P GS+ T N +P T E
Sbjct: 216 AGLVGMGRGPLSLPSQLDVTKFSYCM-TPIGSSTSSTLLLGSLAN-SVTAGSPNTTLIES 273
Query: 325 SE---FYHITLTGISVGGERLPLKASYFTKLSTE-------IDSGTIITRFPAPVYSALR 374
S+ FY+ITL G+SVG LP+ S F KL++ IDSGT +T F Y A+R
Sbjct: 274 SQIPTFYYITLNGLSVGSTPLPIDPSVF-KLNSNNGTGGIIIDSGTTLTYFADNAYQAVR 332
Query: 375 SAFRKRMKKYKMGKGIEDLFDTCYDL-SAYKTVVVPKITIHFLGGVDLELDVRGTLVVES 433
AF +M + G FD C+ + S + +P +HF GG DL L + S
Sbjct: 333 QAFISQM-NLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGG-DLVLPSENYFISPS 390
Query: 434 VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+CL A+ S + GN+QQ+ V YD + F C
Sbjct: 391 NGLICL--AMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSFLFAQC 434
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 131/421 (31%), Positives = 208/421 (49%), Gaps = 46/421 (10%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKA--------------FTFPAKTGI-VAA 129
+++ L+RD R+ NSR L+ A+ + K++ F P +G+ +
Sbjct: 85 MQQRLKRDAARVAAINSR-LELAV-NGIKRSSLKPDSSSSFTMAESDFQSPVVSGMDQGS 142
Query: 130 DEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNS 189
EY+ + +G P++ ++LDTGS +TW QC+PC C QQ DP ++P+ S ++ + C +
Sbjct: 143 GEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSDPIYNPALSSSYKLVGCQA 202
Query: 190 TTCKIL-LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
C+ L + NG C Y ++Y DGS G +AT+ +T+ G
Sbjct: 203 NLCQQLDVSGCSRNG-------SCLYQVSYGDGSYTQGNFATETLTL------GGAPLQN 249
Query: 249 FLLGCTDNNTG---DQNGASGIMGLDRGPVSIISKTNISYFFYCL-HSPYGSTGYITFGK 304
+GC +N G G G+ G S ++ N F YCL S+ + FG+
Sbjct: 250 VAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSYCLVDRDSESSSTLQFGR 309
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSG 359
N + P++ FY+++L+GISVGG+ L + S F ++ +DSG
Sbjct: 310 AAVPNGAVL--APMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGVIVDSG 367
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
T +TR Y +LR AFR K G+ LFDTCYDLS+ ++V VP + HF GG
Sbjct: 368 TAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVS-LFDTCYDLSSKESVDVPTVVFHFSGGG 426
Query: 420 DLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGN 478
+ L + LV V+S+ C FA P+ + ++GN+QQ+G V +D A ++GF
Sbjct: 427 SMSLPAKNYLVPVDSMGTFCFAFA--PTSSSLSIVGNIQQQGIRVSFDRANNQVGFAVNK 484
Query: 479 C 479
C
Sbjct: 485 C 485
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 130/433 (30%), Positives = 211/433 (48%), Gaps = 31/433 (7%)
Query: 58 GKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKA 117
GK L+++ R + N+ ++ + ++RD++R+ RRL + +
Sbjct: 69 GKWKLKLVHR-DKITAFNKSSYDHSHNFHARIQRDKKRVATL-IRRLSPRDATSSYSVEE 126
Query: 118 FTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDP 176
F +G+ + EY+I + +G P + +++D+GS I W QC+PC C Q DP FDP
Sbjct: 127 FGAEVVSGMNQGSGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDP 186
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQ 236
+ S +F +PC+S+ C+ + C + C Y++ Y DGS G A + +T
Sbjct: 187 ADSASFMGVPCSSSVCERIE-------NAGCHAGGCRYEVMYGDGSYTKGTLALETLTF- 238
Query: 237 EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISK---TNISYFFYCLHS- 292
G +GC N G GA+G++GL G +S++ + F YCL S
Sbjct: 239 -----GRTVVRNVAIGCGHRNRGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 293
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-- 350
S G + FG+ + P++ P FY+I L+G+ VGG ++P+ F
Sbjct: 294 GTDSAGSLEFGR--GAMPVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLN 351
Query: 351 ---KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV 407
+D+GT +TR P Y A R AF + G+ +FDTCY+L+ + +V
Sbjct: 352 EMGNGGVVMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGVS-IFDTCYNLNGFVSVR 410
Query: 408 VPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYD 466
VP ++ +F GG L L R L+ V+ V C FA PS + ++GN+QQ G ++ +D
Sbjct: 411 VPTVSFYFAGGPILTLPARNFLIPVDDVGTFCFAFAASPSGLS--IIGNIQQEGIQISFD 468
Query: 467 VAGRRLGFGPGNC 479
A +GFGP C
Sbjct: 469 GANGFVGFGPNVC 481
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 133/414 (32%), Positives = 207/414 (50%), Gaps = 31/414 (7%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQ 143
L E L+RD++R+ S+ + + P +G++ + EY++ + +G P +
Sbjct: 6 LLETLQRDERRVRWIESKAKLAGKKKDEASSTDLNGPVTSGLLYGSGEYFVRLGLGTPAR 65
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
+ +++DTGS + W QC+PC C +Q DP FDP S +F +IPC S CK LE +G
Sbjct: 66 SLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPLCKA-LEVHSCSG 124
Query: 204 QDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNG 263
+S+ C Y +AY DGS G +++D T+ G G A GC +N G G
Sbjct: 125 SRGATSR-CSYQVAYGDGSFSVGDFSSDLFTL----GTGSKA-MSVAFGCGFDNEGLFAG 178
Query: 264 ASGIMGLDRGPVSIISK--------TNISYFFYCL---HSPYG-STGYITFGKPDTVNKK 311
A+G++GL G +S S+ + + F YCL +P S+ + FG +
Sbjct: 179 AAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIFGVAAIPST- 237
Query: 312 FVKYTPIVTTPEQSEFYHITLTGISVGGERLP-----LKASYFTKLSTEIDSGTIITRFP 366
+P++ P+ FY+ + G+SVGG +LP L+ S IDSGT +TRFP
Sbjct: 238 -AALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRFP 296
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVR 426
VY+ +R AFR LFDTCY+ S +V VP + +HF G DL+L
Sbjct: 297 TSVYATIRDAFRNATINLPSAPRYS-LFDTCYNFSGKASVDVPALVLHFENGADLQLPPT 355
Query: 427 GTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L+ + + CL FA P+ ++GN+QQ+ + + +D+ L F P C
Sbjct: 356 NYLIPINTAGSFCLAFA--PTSMELGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 407
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 127/366 (34%), Positives = 183/366 (50%), Gaps = 37/366 (10%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
E+ + ++IG P + ++DTGS + WTQCKPC+ C Q P FDPS S T++ +PC+ST
Sbjct: 101 EFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYAALPCSST 160
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-F 249
C L P+ KC+S +C Y Y D S G A + T+ + + P
Sbjct: 161 LCSDL-----PS--SKCTSAKCGYTYTYGDSSSTQGVLAAETFTLAKT-------KLPDV 206
Query: 250 LLGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISYFFYCLHS-PYGSTGYITFGKPDT 307
GC D N GD +G++GL RGP+S++S+ ++ F YCL S S + G T
Sbjct: 207 AFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLNKFSYCLTSLDDTSKSPLLLGSLAT 266
Query: 308 V-----NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL-----STEID 357
+ V+ TP++ P Q FY++ L G++VG + L +S F +D
Sbjct: 267 ISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIVD 326
Query: 358 SGTIITRFPAPVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYD--LSAYKTVVVPKITIH 414
SGT IT Y AL+ AF +MK G GI DTC++ S V VPK+ H
Sbjct: 327 SGTSITYLELQGYRALKKAFAAQMKLPAADGSGIG--LDTCFEAPASGVDQVEVPKLVFH 384
Query: 415 FLGGVDLELDVRGTLVVESVR-QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
L G DL+L +V++S +CL ++ S SI +GN QQ+ + YDV L
Sbjct: 385 -LDGADLDLPAENYMVLDSGSGALCL--TVMGSRGLSI-IGNFQQQNIQFVYDVGENTLS 440
Query: 474 FGPGNC 479
F P C
Sbjct: 441 FAPVQC 446
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 145/486 (29%), Positives = 218/486 (44%), Gaps = 51/486 (10%)
Query: 24 ANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTP 83
A +++++ +++ S L P +VC+ + P V L YGPCS + K R P
Sbjct: 30 AGVDEVNYIVVLTSSWLKPNSVCSSLMSPHPNVTNWVPLS--RPYGPCSS-SPAKGRAAP 86
Query: 84 S-LEEILRRDQQRLHLKNSRRL--------------------QKAIPDNFKKTKAFTFPA 122
S ++ +L DQ R R Q++I + + PA
Sbjct: 87 STVDGMLWSDQHRADYIQWRLSGSVAGVLQPADDVPVSTNYEQQSIEGDLNYGTYYPAPA 146
Query: 123 KTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH--CSQQRDPFFDPSKSK 180
A + G P +++LDT S +TW QC PC C Q+D +DP+KS
Sbjct: 147 PMSSKAMNPAATGGGGGGPGVTQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSS 206
Query: 181 TFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNG 240
+ CNS TC L P ++ +C Y + Y DG+ G + +D +TI
Sbjct: 207 SSGVFSCNSPTCTQLG----PYANGCTNNNQCQYRVRYPDGTSTAGTYISDLLTITPAT- 261
Query: 241 NGYFARYPFLLGCTDNNTGD---QNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPY 294
A F GC+ G + A+GIM L GP S++S+T +Y F +C P
Sbjct: 262 ----AVRSFQFGCSHGVQGSFSFGSSAAGIMALGGGPESLVSQTAATYGRVFSHCFPPPT 317
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPE-QSEFYHITLTGISVGGERLPLKASYFTKLS 353
G+ T G P ++V TP++ P FY + L I+V G+R+ + + F
Sbjct: 318 -RRGFFTLGVPRVAAWRYV-LTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFAA-G 374
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITI 413
+DS T ITR P Y ALR AFR RM Y+ L DTCYD++ ++ +P+IT+
Sbjct: 375 AALDSRTAITRLPPTAYQALRQAFRDRMAMYQPAPPKGPL-DTCYDMAGVRSFALPRITL 433
Query: 414 HFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
F +ELD G L Q CL F P+D ++GN+Q + EV Y++ +G
Sbjct: 434 VFDKNAAVELDPSGVLF-----QGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVG 488
Query: 474 FGPGNC 479
F C
Sbjct: 489 FRHAAC 494
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 122/362 (33%), Positives = 184/362 (50%), Gaps = 30/362 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY++ V IG P + L++DTGS + W QC PC C +Q D FDP S +F ++ C++
Sbjct: 13 EYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTP 72
Query: 191 TCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
CK+L C+S + C Y ++Y DGS G A+D ++ + P
Sbjct: 73 QCKLL-------DVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRGRTS------P 119
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHS---PYGSTGYITFGKP 305
+ GC +N G GA+G++GL G +S S+ + F YCL S ++ + FG
Sbjct: 120 VVFGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLFGDS 179
Query: 306 DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-------IDS 358
YT ++ P+ FY+ L+GIS+GG L + ++ F KLS+ IDS
Sbjct: 180 ALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAF-KLSSSTGRGGVIIDS 238
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGG 418
GT +TR P Y+ +R AFR +K LFDTCYD SA +V +P ++ HF GG
Sbjct: 239 GTSVTRLPTYAYTVMRDAFRSATQKLPRAADFS-LFDTCYDFSALTSVTIPTVSFHFEGG 297
Query: 419 VDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
++L LV V++ C F+ D + ++GN+QQ+ V D+ R+GF P
Sbjct: 298 ASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLS--IIGNIQQQTMRVAIDLDSSRVGFAPR 355
Query: 478 NC 479
C
Sbjct: 356 QC 357
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 118/375 (31%), Positives = 183/375 (48%), Gaps = 29/375 (7%)
Query: 122 AKTGIVAAD-EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSK 180
A+ ++A+D EY + + IG P ++ S +LDTGS + WTQC PC+ C Q P+FDP+ S
Sbjct: 81 ARILVLASDGEYLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSS 140
Query: 181 TFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNG 240
T+ + C++ C L ++P C K C Y Y D + G A + T +
Sbjct: 141 TYRSLGCSAPACNAL--YYP-----LCYQKTCVYQYFYGDSASTAGVLANETFTFGTNDT 193
Query: 241 NGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLH---SPYGST 297
R F GC + N G SG++G RG +S++S+ F YCL SP S
Sbjct: 194 RVTLPRISF--GCGNLNAGSLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPVRSR 251
Query: 298 GYI-TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE- 355
Y + ++ N V+ TP + P Y + +TGISVGG RLP+ + T+
Sbjct: 252 LYFGAYATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDG 311
Query: 356 -----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED--LFDTCYDL--SAYKTV 406
IDSGT IT P Y A+R AF + + + + DTC+ ++V
Sbjct: 312 TGGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSV 371
Query: 407 VVPKITIHFLGGVDLELDVRGTLVVE-SVRQVCLGFALLPSDPNSILLGNVQQRGYEVHY 465
+P++ +HF G D EL ++ ++V+ S +CL A + + ++G+ Q + + V Y
Sbjct: 372 TLPQLVLHF-DGADWELPLQNYMLVDPSTGGLCLAMA---TSSDGSIIGSYQHQNFNVLY 427
Query: 466 DVAGRRLGFGPGNCN 480
D+ L F P CN
Sbjct: 428 DLENSLLSFVPAPCN 442
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 125/395 (31%), Positives = 191/395 (48%), Gaps = 37/395 (9%)
Query: 112 FKKTKAFTFPAKTGIVAADE----YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCS 167
F +KA T + VA+ + Y + +G P Q + L LDT + TW C PC C
Sbjct: 57 FLSSKAATAGVSSAPVASGQAPPSYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCP 116
Query: 168 QQRDPFFDPSKSKTFSKIPCNSTTCKILLEWF---PPNGQDKCSSKE----CPYDIAYVD 220
F P+ S +++ +PC+S+ C + P G D C + + D
Sbjct: 117 SSS--LFAPANSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFAD 174
Query: 221 GSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGA--SGIMGLDRGPVSII 278
S + A+D + + G Y F GC + TG G++GL RGP++++
Sbjct: 175 ASFQAAL-ASDTLRL----GKDAIPNYTF--GCVSSVTGPTTNMPRQGLLGLGRGPMALL 227
Query: 279 SKTNISY---FFYCLHS--PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLT 333
S+ Y F YCL S Y +G + G + V+YTP++ P +S Y++ +T
Sbjct: 228 SQAGSLYNGVFSYCLPSYRSYYFSGSLRLGA-GGGQPRSVRYTPMLRNPHRSSLYYVNVT 286
Query: 334 GISVGGERLPLKASYF-----TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGK 388
G+SVG + + A F T T +DSGT+ITR+ APVY+ALR FR+++ G
Sbjct: 287 GLSVGRAWVKVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAA-PSGY 345
Query: 389 GIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSD 447
FDTC++ P +T+H GGVDL L + TL+ S + CL A P +
Sbjct: 346 TSLGAFDTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQN 405
Query: 448 PNSI--LLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
NS+ ++ N+QQ+ V +DVA R+GF +CN
Sbjct: 406 VNSVVNVIANLQQQNIRVVFDVANSRIGFAKESCN 440
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 127/390 (32%), Positives = 193/390 (49%), Gaps = 23/390 (5%)
Query: 98 LKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGIT 156
L + R +KA + + + P G VA Y + +G P +++DTGS +T
Sbjct: 96 LLHGHRKKKAGGVGGSQASSSSVPLTPGASVAVGNYVTRLGLGTPATSYVMVVDTGSSLT 155
Query: 157 WTQCKPC-IHCSQQRDPFFDPSKSKTFSKIPCNSTTC-KILLEWFPPNGQDKCS-SKECP 213
W QC PC + C +Q P FDP S T++ + C+S+ C ++ P+ CS S C
Sbjct: 156 WLQCSPCSVSCHRQAGPVFDPRASGTYAAVQCSSSECGELQAATLNPS---ACSVSNVCI 212
Query: 214 YDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRG 273
Y +Y D S G+ + D ++ G+G F F GC +N G ++G++GL +
Sbjct: 213 YQASYGDSSYSVGYLSKDTVSF----GSGSFPG--FYYGCGQDNEGLFGRSAGLIGLAKN 266
Query: 274 PVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHI 330
+S++ + S F YCL + + GY++ G + N YTP+ ++ + Y +
Sbjct: 267 KLSLLYQLAPSLGYAFSYCLPTSSAAAGYLSIG---SYNPGQYSYTPMASSSLDASLYFV 323
Query: 331 TLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGI 390
TL+GISV G L + S + L T IDSGT+ITR P VY+AL A M
Sbjct: 324 TLSGISVAGAPLAVPPSEYRSLPTIIDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPT 383
Query: 391 EDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNS 450
+ DTC+ SA + VP++ + F GG L L L+ CL FA P+ +
Sbjct: 384 YSILDTCFRGSA-AGLRVPRVDMAFAGGATLALSPGNVLIDVDDSTTCLAFA--PTGGTA 440
Query: 451 ILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
I +GN QQ+ + V YDVA R+GF G C+
Sbjct: 441 I-IGNTQQQTFSVVYDVAQSRIGFAAGGCS 469
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 125/395 (31%), Positives = 191/395 (48%), Gaps = 37/395 (9%)
Query: 112 FKKTKAFTFPAKTGIVAADE----YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCS 167
F +KA T + VA+ + Y + +G P Q + L LDT + TW C PC C
Sbjct: 55 FLSSKAATAGVSSAPVASGQAPPSYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCP 114
Query: 168 QQRDPFFDPSKSKTFSKIPCNSTTCKILLEWF---PPNGQDKCSSKE----CPYDIAYVD 220
F P+ S +++ +PC+S+ C + P G D C + + D
Sbjct: 115 SSS--LFAPANSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFAD 172
Query: 221 GSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGA--SGIMGLDRGPVSII 278
S + A+D + + G Y F GC + TG G++GL RGP++++
Sbjct: 173 ASFQAAL-ASDTLRL----GKDAIPNYTF--GCVSSVTGPTTNMPRQGLLGLGRGPMALL 225
Query: 279 SKTNISY---FFYCLHS--PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLT 333
S+ Y F YCL S Y +G + G + V+YTP++ P +S Y++ +T
Sbjct: 226 SQAGSLYNGVFSYCLPSYRSYYFSGSLRLGA-GGGQPRSVRYTPMLRNPHRSSLYYVNVT 284
Query: 334 GISVGGERLPLKASYF-----TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGK 388
G+SVG + + A F T T +DSGT+ITR+ APVY+ALR FR+++ G
Sbjct: 285 GLSVGHAWVKVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAA-PSGY 343
Query: 389 GIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSD 447
FDTC++ P +T+H GGVDL L + TL+ S + CL A P +
Sbjct: 344 TSLGAFDTCFNTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQN 403
Query: 448 PNSI--LLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
NS+ ++ N+QQ+ V +DVA R+GF +CN
Sbjct: 404 VNSVVNVIANLQQQNIRVVFDVANSRVGFAKESCN 438
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 129/421 (30%), Positives = 189/421 (44%), Gaps = 36/421 (8%)
Query: 87 EILRRDQQRLHLKNSRRLQKAIPDN-------FKKTKAFTFPAKTGIVAADEYYIVVAIG 139
E+L R QR L+ + + KA + + P + + EY +A+G
Sbjct: 82 ELLARRLQRDELRAAWIISKAAANGTPPPVVGLSTGRGLVAPVVSRAPTSGEYMAKIAVG 141
Query: 140 KPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWF 199
P L LDT S +TW QC+PC C Q P FDP S ++ ++ ++ C+ L
Sbjct: 142 TPAVQALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQALGR-- 199
Query: 200 PPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTG 259
+G C Y + Y DG G T D + G Y +GC +N G
Sbjct: 200 --SGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGGVRQAY-LSIGCGHDNKG 256
Query: 260 DQNG-ASGIMGLDRGPVSIISKTNI----SYFFYCL----HSPYGSTGYITFGKPDTVNK 310
A+GI+GL RG +SI + + F YCL P + +TFG
Sbjct: 257 LFGAPAAGILGLGRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLTFGAGAVDTS 316
Query: 311 KFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKAS-------YFTKLSTEIDSGTIIT 363
+TP V FY++ L G+SVGG R+P Y + +DSGT +T
Sbjct: 317 PPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGRGGVILDSGTTVT 376
Query: 364 RFPAPVYSALRSAFRKRMKKYKMGK----GIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
R P Y + R +G+ G LFDTCY + V VP +++HF GGV
Sbjct: 377 RLARPAY--VAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVGGRAGVKVPAVSMHFAGGV 434
Query: 420 DLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGN 478
++ L + L+ V+S VC FA D + ++GN+ Q+G+ V YD+AG+R+GF P N
Sbjct: 435 EVSLQPKNYLIPVDSRGTVCFAFAGT-GDRSVSVIGNILQQGFRVVYDLAGQRVGFAPNN 493
Query: 479 C 479
C
Sbjct: 494 C 494
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 122/362 (33%), Positives = 183/362 (50%), Gaps = 30/362 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY++ V IG P + L++DTGS + W QC PC C +Q D FDP S +F ++ C++
Sbjct: 13 EYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTP 72
Query: 191 TCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
CK+L C+S + C Y ++Y DGS G A+D + + P
Sbjct: 73 QCKLL-------DVKACASTDNRCLYQVSYGDGSFTVGDLASDSFLVSRGRTS------P 119
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHS---PYGSTGYITFGKP 305
+ GC +N G GA+G++GL G +S S+ + F YCL S ++ + FG
Sbjct: 120 VVFGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLFGDS 179
Query: 306 DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-------IDS 358
YT ++ P+ FY+ L+GIS+GG L + ++ F KLS+ IDS
Sbjct: 180 ALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAF-KLSSSTGRGGVIIDS 238
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGG 418
GT +TR P Y+ +R AFR +K LFDTCYD SA +V +P ++ HF GG
Sbjct: 239 GTSVTRLPTYAYTVMRDAFRSATQKLPRAADFS-LFDTCYDFSALTSVTIPTVSFHFEGG 297
Query: 419 VDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
++L LV V++ C F+ D + ++GN+QQ+ V D+ R+GF P
Sbjct: 298 ASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLS--IIGNIQQQTMRVAIDLDSSRVGFAPR 355
Query: 478 NC 479
C
Sbjct: 356 QC 357
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 129/413 (31%), Positives = 192/413 (46%), Gaps = 56/413 (13%)
Query: 87 EILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD-EYYIVVAIGKPKQYV 145
++L R +R SRRLQ+ + +T + A D EY + ++IG P Q
Sbjct: 58 QLLERAIER----GSRRLQR-----LEAMLNGPSGVETSVYAGDGEYLMNLSIGTPAQPF 108
Query: 146 SLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
S ++DTGS + WTQC+PC C Q P F+P S +FS +PC+S C+ L
Sbjct: 109 SAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQAL-------SSP 161
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTG-DQNGA 264
CS+ C Y Y DGS G T+ +T G + GC +NN G Q
Sbjct: 162 TCSNNFCQYTYGYGDGSETQGSMGTETLTF------GSVSIPNITFGCGENNQGFGQGNG 215
Query: 265 SGIMGLDRGPVSIISKTNISYFFYCLHSPYGST-----------GYITFGKPDTVNKKFV 313
+G++G+ RGP+S+ S+ +++ F YC+ +P GS+ +T G P+T
Sbjct: 216 AGLVGMGRGPLSLPSQLDVTKFSYCM-TPIGSSTPSNLLLGSLANSVTAGSPNTT----- 269
Query: 314 KYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITRFPA 367
++ + + FY+ITL G+SVG RLP+ S F S IDSGT +T F
Sbjct: 270 ----LIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVN 325
Query: 368 PVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL-SAYKTVVVPKITIHFLGGVDLELDVR 426
Y ++R F ++ + G FD C+ S + +P +HF GG DLEL
Sbjct: 326 NAYQSVRQEFISQI-NLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGG-DLELPSE 383
Query: 427 GTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ S +CL A+ S + GN+QQ+ V YD + F C
Sbjct: 384 NYFISPSNGLICL--AMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 132/396 (33%), Positives = 193/396 (48%), Gaps = 32/396 (8%)
Query: 94 QRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD-EYYIVVAIGKPKQYVSLLLDTG 152
QR + RLQ+ KT +F + + A + E+ + +AIG P + S ++DTG
Sbjct: 62 QRAMKRGKLRLQRLS----AKTASFESSVEAPVHAGNGEFLMKLAIGTPAETYSAIMDTG 117
Query: 153 SGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKEC 212
S + WTQCKPC C Q P FDP KS +FSK+PC+S C L P + CS C
Sbjct: 118 SDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSDLCAAL----PIS---SCSDG-C 169
Query: 213 PYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ-NGASGIMGLD 271
Y +Y D S G AT+ G+ ++ F GC ++N G + +G++GL
Sbjct: 170 EYLYSYGDYSSTQGVLATETFAF----GDASVSKIGF--GCGEDNDGSGFSQGAGLVGLG 223
Query: 272 RGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHIT 331
RGP+S+IS+ F YCL S S G + K TP++ P Q FY+++
Sbjct: 224 RGPLSLISQLGEPKFSYCLTSMDDSKGISSLLVGSEATMKNAITTPLIQNPSQPSFYYLS 283
Query: 332 LTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRFPAPVYSALRSAFRKRMKKYKM 386
L GISVG LP++ S F+ + IDSGT IT ++AL+ F ++K
Sbjct: 284 LEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVD 343
Query: 387 GKGIEDLFDTCYDLSA-YKTVVVPKITIHFLGGVDLELDVRGTLVVES-VRQVCLGFALL 444
G L D C+ L TV VP++ HF G DL+L ++ +S + +CL +
Sbjct: 344 ESGSTGL-DLCFTLPPDASTVDVPQLVFHF-EGADLKLPAENYIIADSGLGVICL---TM 398
Query: 445 PSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
S + GN QQ+ V +D+ + F P CN
Sbjct: 399 GSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQCN 434
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 119/363 (32%), Positives = 185/363 (50%), Gaps = 27/363 (7%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY V +G P++ S+++DTGS +TW QC PC C Q D F P+ S +F+K+ C +
Sbjct: 2 EYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGTE 61
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-F 249
C L +P C+ C Y +Y DGS TG + D +T+ +NG + P F
Sbjct: 62 LCNGLP--YP-----MCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQK--QQVPNF 112
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGSTGYITFG 303
GC +N G GA GI+GL +GP+S S+ + F YCL +P T + FG
Sbjct: 113 AFGCGHDNEGSFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLLFG 172
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDS 358
VKY ++T P+ +Y++ L GISVGG+ L + ++ F + T DS
Sbjct: 173 DAAVPTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDS 232
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCY-DLSAYKTVVVPKITIHFLG 417
GT +T+ V+ + +A Y D C + + VP +T HF G
Sbjct: 233 GTTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSSGLDLCLGGFAEGQLPTVPSMTFHFEG 292
Query: 418 GVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
G D+EL + +ES + C F+++ S P+ ++G++QQ+ ++V+YD GR++GF P
Sbjct: 293 G-DMELPPSNYFIFLESSQSYC--FSMV-SSPDVTIIGSIQQQNFQVYYDTVGRKIGFVP 348
Query: 477 GNC 479
+C
Sbjct: 349 KSC 351
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 137/432 (31%), Positives = 204/432 (47%), Gaps = 40/432 (9%)
Query: 61 SLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTF 120
+++++ R P S + + + LRR R+H + P K
Sbjct: 33 TVDLIHRDSPLSPFYNSEETDLQRINNALRRSISRVHHFDPIAAASVSP----KAAESDV 88
Query: 121 PAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSK 180
+ G EY + +++G P + + DTGS + WTQCKPC C +Q DP FDP SK
Sbjct: 89 TSNRG-----EYLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSK 143
Query: 181 TFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNG 240
T+ C++ C +L Q CS C Y +Y D S G A+D +T+ G
Sbjct: 144 TYRDFSCDARQCSLL-------DQSTCSGNICQYQYSYGDRSYTMGNVASDTITLDSTTG 196
Query: 241 NGYFARYP-FLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISY---FFYC---LHS 292
+ +P ++GC N G + SGI+GL GP+S+IS+ S F YC L S
Sbjct: 197 SP--VSFPKTVIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSS 254
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF--T 350
G++ + FG V+ V+ TP++++ S FY +TL +SVG ER+ S
Sbjct: 255 RAGNSSKLNFGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTG 314
Query: 351 KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED---LFDTCYDLSAYKTVV 407
+ + IDSGT +T P +S L +A +++ G+ ED CY SA +
Sbjct: 315 EGNIIIDSGTTLTIVPDDFFSNLSTAVGNQVE----GRRAEDPSGFLSVCY--SATSDLK 368
Query: 408 VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
VP IT HF G D++L T V S VCL FA S + + GNV Q + V Y++
Sbjct: 369 VPAITAHFT-GADVKLKPINTFVQVSDDVVCLAFASTTSGIS--IYGNVAQMNFLVEYNI 425
Query: 468 AGRRLGFGPGNC 479
G+ L F P +C
Sbjct: 426 QGKSLSFKPTDC 437
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 133/418 (31%), Positives = 202/418 (48%), Gaps = 42/418 (10%)
Query: 85 LEEILRRDQQRLH-----LKNSRRLQK---AIPDNFKKTKA-FTFPAKTGIV-AADEYYI 134
LEE LRRD +R+ ++ RL K +N + A F +G+ + EY+
Sbjct: 140 LEETLRRDARRVRGLEQRIEKRLRLNKDPAGSHENVAEVAAEFGGEVVSGMAQGSGEYFT 199
Query: 135 VVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
+ +G P + ++LDTGS + W QC+PC C Q DP F+PS S +FS + CNS C
Sbjct: 200 RIGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQVDPIFNPSLSASFSTLGCNSAVCSY 259
Query: 195 LLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
L + C C Y ++Y DGS G +AT+ +T G + +GC
Sbjct: 260 LDAY-------NCHGGGCLYKVSYGDGSYTIGSFATEMLTF------GTTSVRNVAIGCG 306
Query: 255 DNNTG----DQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYG-STGYITFGKPDTVN 309
+N G GL P + ++T + F YCL + S+G + FG P++V
Sbjct: 307 HDNAGLFVGAAGLLGLGAGLLSFPSQLGTQTGRA-FSYCLVDRFSESSGTLEFG-PESVP 364
Query: 310 KKFVKYTPIVTTPEQSEFYHITLTGISVGG---ERLPLKASYFTKLSTE----IDSGTII 362
+ TP++T P FY++ L ISVGG + +P + S +DSGT +
Sbjct: 365 LGSI-LTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRGGFIVDSGTAV 423
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
TR PVY A+R AF ++ +G+ +FDTCYDLS V VP + HF G L
Sbjct: 424 TRLQTPVYDAVRDAFVAGTRQLPKAEGVS-IFDTCYDLSGLPLVNVPTVVFHFSNGASLI 482
Query: 423 LDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L + ++ ++ + C FA P+ + ++GN+QQ+G V +D A +GF C
Sbjct: 483 LPAKNYMIPMDFMGTFCFAFA--PATSDLSIMGNIQQQGIRVSFDTANSLVGFALRQC 538
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 128/368 (34%), Positives = 185/368 (50%), Gaps = 29/368 (7%)
Query: 130 DEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNS 189
+EY + +A+G P++ V+L LDTGS + WTQC PC C Q P DP+ S T++ +PC +
Sbjct: 82 NEYLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCGA 141
Query: 190 TTCKILLEWFPPNG-QDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNG---YFA 245
C+ L F G + + + C Y Y D S G ATDR T + G+G +
Sbjct: 142 ARCRALP--FTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTR 199
Query: 246 RYPFLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGS-TGYITFG 303
R F GC N G Q+ +GI G RG S+ S+ N++ F YC S + S + +T G
Sbjct: 200 RLTF--GCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFESKSSLVTLG 257
Query: 304 KPDT-----VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDS 358
+ V+ TPI+ P Q Y ++L GISVG RLP+ + F ST IDS
Sbjct: 258 GSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFR--STIIDS 315
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE-DLFDTCYDL---SAYKTVVVPKITIH 414
G IT P VY A+++ F ++ G+E D C+ L + ++ VP +T+H
Sbjct: 316 GASITTLPEEVYEAVKAEFAAQVGLPP--SGVEGSALDLCFALPVTALWRRPAVPSLTLH 373
Query: 415 FLGGVDLELDVRGTLVVESV--RQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRL 472
L G D EL R V E + R +C+ P + ++GN QQ+ V YD+ RL
Sbjct: 374 -LEGADWELP-RSNYVFEDLGARVMCIVLDAAPGE--QTVIGNFQQQNTHVVYDLENDRL 429
Query: 473 GFGPGNCN 480
F P C+
Sbjct: 430 SFAPARCD 437
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 135/463 (29%), Positives = 210/463 (45%), Gaps = 53/463 (11%)
Query: 34 IVSVSSLIPPTVCNRTRTA---LPQGPGKVSLEVLGRYGPCS----KLNQGKSRNTPSLE 86
+++ S++ P T C+ + A +P P + YGPCS N + S+
Sbjct: 35 VIATSTMKPKTFCSGHKVAPGDVPS-PNSTWAPLHHLYGPCSPAPSSANSTAADVAASMA 93
Query: 87 EILRRDQQRLHLKNSRRLQKAIPD-----------NFKKTKAFTFPAKTGIVAADEYYIV 135
+++ DQ+R +RL A D ++K + G V +
Sbjct: 94 DMVDDDQRRADYIQ-KRLTGATDDKQPMAFSSRTSQYEKNGQYATNGGLGSVPHLKSLST 152
Query: 136 VAI------GKPKQYVSLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPC 187
A G ++++D+GS ++W QCKPC C +QRDP FDP+ S T++ +PC
Sbjct: 153 TATTNSAPDGTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPC 212
Query: 188 NSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
S C L + + CS+ +C + I Y DGS TG ++ D +T+ Y
Sbjct: 213 TSAACAQLGPY-----RRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGP-----YDVI 262
Query: 247 YPFLLGCTDNNTGD--QNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYIT 301
F GC + G +G + L G S++ +T Y F YCL S G++
Sbjct: 263 RGFRFGCAHADRGSAFDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLV 322
Query: 302 FGKPDTVNKKFVKY--TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
G P + + TP++++ FY + L I V G L + + F+ S+ IDS
Sbjct: 323 LGVPPERAQLIPSFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSA-SSVIDSS 381
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
TII+R P Y ALR+AFR M Y+ + + DTCYD + +++ +P I + F GG
Sbjct: 382 TIISRLPPTAYQALRAAFRSAMTMYRAAPPVS-ILDTCYDFTGVRSITLPSIALVFDGGA 440
Query: 420 DLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYE 462
+ LD G L+ CL FA SD +GNVQQ+ E
Sbjct: 441 TVNLDAAGILLGS-----CLAFAPTASDRMPGFIGNVQQKTLE 478
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 86/280 (30%), Positives = 128/280 (45%), Gaps = 42/280 (15%)
Query: 205 DKCSSK-ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNG 263
+ CS+ +C + I Y DGS TG ++ D +T+
Sbjct: 478 EGCSANAQCQFGINYGDGSTATGTYSFDDLTL---------------------------- 509
Query: 264 ASGIMGLDRGPVSIISKTNISYFF-YCLHSPYGSTGYITFGKPD---TVNKKFVKYTPIV 319
G +DR + + + T F YC+ S G+IT G P + FV +
Sbjct: 510 --GPYDVDRQGLPLRTATQYGRVFSYCIPPSPSSLGFITLGVPPQRAALVPTFVSTPLLS 567
Query: 320 TTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRK 379
++ FY + L I V G LP+ + F+ S+ I S T+I+R P Y ALR+AFR+
Sbjct: 568 SSSMPPTFYRVLLRAIIVAGRPLPVPPTVFST-SSVIASTTVISRLPPTAYQALRAAFRR 626
Query: 380 RMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCL 439
M Y+ + + DTCYD + +++ +P I + F GG + LD G L+ Q CL
Sbjct: 627 AMTMYRTAPPVS-ILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-----QGCL 680
Query: 440 GFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
FA +D +GNVQQR EV YDV G+ + F C
Sbjct: 681 AFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 720
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 136/434 (31%), Positives = 204/434 (47%), Gaps = 47/434 (10%)
Query: 74 LNQGKSRNTPSL-EEILRRDQQRLH-LKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD- 130
++ GK + P L +RR + R L R + N ++T A P + + D
Sbjct: 38 VDAGKQLSRPELIRRAMRRSKARAAALSAVRNRARFSGKNEQQTPAGVLPVRP---SGDL 94
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + +AIG P Q VS LLDTGS + WTQC PC C Q DP F P +S ++ + C T
Sbjct: 95 EYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAGT 154
Query: 191 TCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTI-QEVNGNGYFARYP 248
C +L C + C Y Y DG+ G +AT+R T G P
Sbjct: 155 LCSDIL-------HHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVP 207
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHS---------PYGSTGY 299
GC N G N SGI+G R P+S++S+ +I F YCL S +GS
Sbjct: 208 LGFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTSYASRRQSTLLFGSLSD 267
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLST 354
+G D + V+ TP++ +P+ FY++ TG++VG RL + S F
Sbjct: 268 GVYG--DATGR--VQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGV 323
Query: 355 EIDSGTIITRFPAPVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYDL-------SAYKTV 406
+DSGT +T PA V + + AFR++++ + G ED C+ + S+ +
Sbjct: 324 IVDSGTALTLLPAAVLAEVVRAFRQQLRLPFANGGNPED--GVCFLVPAAWRRSSSTSQM 381
Query: 407 VVPKITIHFLGGVDLELDVRGTLVVESVR-QVCLGFALLPSDPNSILLGNVQQRGYEVHY 465
VP++ +HF G DL+L R ++ + R ++CL A D ++I GN+ Q+ V Y
Sbjct: 382 PVPRMVLHF-QGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTI--GNLVQQDMRVLY 438
Query: 466 DVAGRRLGFGPGNC 479
D+ L P C
Sbjct: 439 DLEAETLSIAPARC 452
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 143/424 (33%), Positives = 205/424 (48%), Gaps = 49/424 (11%)
Query: 80 RNTPSLEEILR---RDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD-EYYIV 135
+N E + R R + RLH N+ L A + KA +VA + E+ +
Sbjct: 62 KNLTRFERLRRGVARGKNRLHRLNAMVLAAANATVGDQVKA-------PVVAGNGEFLMK 114
Query: 136 VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
+AIG P + S ++DTGS + WTQCKPC C Q P FDP +S +F KI C+S C L
Sbjct: 115 LAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGAL 174
Query: 196 LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL-LGCT 254
P CSS C Y Y D S G A + T + + P L GC
Sbjct: 175 -----PT--STCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQI--SIPGLGFGCG 225
Query: 255 DNNTGDQ-NGASGIMGLDRGPVSIISKTNISYFFYCLHSP---------YGSTGYITFGK 304
++N GD + +G++GL RGP+S++S+ F YCL + GS IT
Sbjct: 226 NDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQKFAYCLTAIDDSKPSSLLLGSLANIT--- 282
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------IDS 358
P T +K +K TP++ P Q FY+++L GISVGG +L + S F +L + IDS
Sbjct: 283 PKT-SKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTF-ELHDDGSGGVIIDS 340
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSA-YKTVVVPKITIHFLG 417
GT IT +++L++ F +M G L D C++L A V VPK+T HF
Sbjct: 341 GTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGL-DLCFNLPAGTNQVEVPKLTFHF-K 398
Query: 418 GVDLELDVRGTLVVES-VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
G DLEL ++ +S +CL S + GN+QQ+ + V +D+ L F P
Sbjct: 399 GADLELPGENYMIGDSKAGLLCLAIG---SSRGMSIFGNLQQQNFMVVHDLQEETLSFLP 455
Query: 477 GNCN 480
C+
Sbjct: 456 TQCD 459
>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 462
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 145/455 (31%), Positives = 225/455 (49%), Gaps = 49/455 (10%)
Query: 33 YIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRD 92
+ + ++SL+P + C + P G G L + YGPCS+L Q KS PS ++I +D
Sbjct: 40 HTLDINSLLPKSNC-----SAPVGGGSQGLPITYSYGPCSQLGQKKS---PSRQQIFLQD 91
Query: 93 QQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIV-VAIGKPKQYVSLLLDT 151
+ R+ N+R L + + +++K P + D +++V V GKP+Q ++L++DT
Sbjct: 92 RSRVRSINARILGQY---STEESKDGGSPESMHSLNEDGFFLVNVGFGKPQQNLNLIIDT 148
Query: 152 GSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS 209
GS TW +C C +C ++ P F+PS S ++S C +T
Sbjct: 149 GSDTTWIRCNSCSLGNCHNKKIPTFNPSLSSSYSNRSCIPST------------------ 190
Query: 210 KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMG 269
+ Y + Y D S G + D +T++ F + F GC D+ GD ASG++G
Sbjct: 191 -KTNYTMNYEDNSYSKGVFVCDEVTLKP----DVFPK--FQFGCGDSGGGDFGSASGVLG 243
Query: 270 LDRGP-VSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQS 325
L +G S+IS+T + F YC + G + FG+ +K+T ++ P
Sbjct: 244 LAQGEQYSLISQTASKFKKKFSYCFPHNENTRGSLLFGEKAISASPSLKFTRLLN-PSSG 302
Query: 326 EFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYK 385
Y + L GISV +RL + +S F T IDSGT+IT P Y ALR+AF++ M
Sbjct: 303 SVYFVELIGISVAKKRLNVSSSLFASPGTIIDSGTVITHLPTAAYEALRTAFQQEMLHCP 362
Query: 386 MGK--GIEDLFDTCYDLSAY--KTVVVPKITIHFLGGVDLELDVRGTLVVE-SVRQVCLG 440
E DTCY+L + + +P+I +HF+G VD+ L G L + Q CL
Sbjct: 363 SVSPPPQEKPLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQACLA 422
Query: 441 FALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
FA + ++GN QQ +V YD+ G RLGFG
Sbjct: 423 FARKSHPSHVTIIGNRQQVSLKVVYDIEGGRLGFG 457
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 136/442 (30%), Positives = 198/442 (44%), Gaps = 49/442 (11%)
Query: 68 YGPCSKLNQGKSRNTPS-LEEILRRDQQRLHLKNSRRL--------------------QK 106
YGPCS + K R PS ++ +L DQ R R Q+
Sbjct: 47 YGPCSS-SPAKGRAAPSTVDGMLWSDQHRADYIQWRLSGSVAGVLQPADDVPVSTNYEQQ 105
Query: 107 AIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH- 165
+I + + PA A + G P +++LDT S +TW QC PC
Sbjct: 106 SIEGDLNYGTYYPAPAPMSSKAMNPAATGGGGGGPGVTQTMVLDTASDVTWVQCSPCPTP 165
Query: 166 -CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGE 224
C Q+D +DP+KS + CNS TC L P ++ +C Y + Y DG+
Sbjct: 166 PCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLG----PYANGCTNNNQCQYRVRYPDGTST 221
Query: 225 TGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD---QNGASGIMGLDRGPVSIISKT 281
G + +D +TI A F GC+ G + A+GIM L GP S++S+T
Sbjct: 222 AGTYISDLLTITPAT-----AVRSFQFGCSHGVQGSFSFGSSAAGIMALGGGPESLVSQT 276
Query: 282 NISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPE-QSEFYHITLTGISV 337
+Y F +C P G+ T G P ++V TP++ P FY + L I+V
Sbjct: 277 AATYGRVFSHCFPPPT-RRGFFTLGVPRVAAWRYV-LTPMLKNPAIPPTFYMVRLEAIAV 334
Query: 338 GGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTC 397
G+R+ + + F +DS T ITR P Y ALR AFR RM Y+ L DTC
Sbjct: 335 AGQRIAVPPTVFAA-GAALDSRTAITRLPPTAYQALRQAFRDRMAMYQPAPPKGPL-DTC 392
Query: 398 YDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQ 457
YD++ ++ +P+IT+ F +ELD G L Q CL F P+D ++GN+Q
Sbjct: 393 YDMAGVRSFALPRITLVFDKNAAVELDPSGVLF-----QGCLAFTAGPNDQVPGIIGNIQ 447
Query: 458 QRGYEVHYDVAGRRLGFGPGNC 479
+ EV Y++ +GF C
Sbjct: 448 LQTLEVLYNIPAALVGFRHAAC 469
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 116/362 (32%), Positives = 179/362 (49%), Gaps = 30/362 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+ + IG P + ++LDTGS + W QC+PC C Q DP F+PS S +FS + C+S
Sbjct: 7 EYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSA 66
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C L + C C Y+++Y DGS G +AT+ +T G +
Sbjct: 67 VCSQL-------DANDCHGGGCLYEVSYGDGSYTVGSYATETLTF------GTTSIQNVA 113
Query: 251 LGCTDNNTG----DQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPD 306
+GC +N G G P + ++T ++ + + S+G + FG P+
Sbjct: 114 IGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDSESSGTLEFG-PE 172
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGG---ERLPLKASYFTKLSTE----IDSG 359
+V + +TP+V P FY++++ ISVGG + +P +A + + IDSG
Sbjct: 173 SVPIGSI-FTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSG 231
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
T +TR Y ALR AF + GI +FDTCYDLSA ++V +P + HF G
Sbjct: 232 TAVTRLQTSAYDALRDAFIAGTQHLPRADGIS-IFDTCYDLSALQSVSIPAVGFHFSNGA 290
Query: 420 DLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGN 478
L + L+ ++S+ C FA P+D N ++GN+QQ+G V +D A +GF
Sbjct: 291 GFILPAKNCLIPMDSMGTFCFAFA--PADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQ 348
Query: 479 CN 480
C
Sbjct: 349 CQ 350
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 129/424 (30%), Positives = 206/424 (48%), Gaps = 37/424 (8%)
Query: 74 LNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAF------TFPAKTGIV 127
L+ G+ R P L L + +L +++AI ++ ++ + +T +
Sbjct: 31 LHHGQKRPQPGLRVDLEQVDSGKNLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVY 90
Query: 128 AAD-EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
A D EY + VAIG P S ++DTGS + WTQC+PC C Q P F+P S +FS +P
Sbjct: 91 AGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLP 150
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
C S C+ L + C++ EC Y Y DGS G+ AT+ T + +
Sbjct: 151 CESQYCQDLPS-------ETCNNNECQYTYGYGDGSTTQGYMATETFTFET-------SS 196
Query: 247 YP-FLLGCTDNNTG-DQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST--GYITF 302
P GC ++N G Q +G++G+ GP+S+ S+ + F YC+ S YGS+ +
Sbjct: 197 VPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTS-YGSSSPSTLAL 255
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------I 356
G + + T ++ + +Y+ITL GI+VGG+ L + +S F +L + I
Sbjct: 256 GSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTF-QLQDDGTGGMII 314
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL-SAYKTVVVPKITIHF 415
DSGT +T P Y+A+ AF ++ + + L TC+ S TV VP+I++ F
Sbjct: 315 DSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGL-STCFQQPSDGSTVQVPEISMQF 373
Query: 416 LGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
GGV L L + L+ + +CL S + GN+QQ+ +V YD+ + F
Sbjct: 374 DGGV-LNLGEQNILISPAEGVICLAMG-SSSQLGISIFGNIQQQETQVLYDLQNLAVSFV 431
Query: 476 PGNC 479
P C
Sbjct: 432 PTQC 435
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 126/390 (32%), Positives = 194/390 (49%), Gaps = 39/390 (10%)
Query: 113 KKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD 171
++ F P +G+ + EY+ V +G P ++LDTGS + W QC PC HC Q
Sbjct: 108 RRRGGFAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSG 167
Query: 172 PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATD 231
FDP +S++++ + C + C+ L G D+ C Y +AY DGS G +A++
Sbjct: 168 RVFDPRRSRSYAAVDCVAPICRRLDSA----GCDR-RRNSCLYQVAYGDGSVTAGDFASE 222
Query: 232 RMTIQEVNGNGYFARYPFL----LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY-- 285
+T FAR + +GC +N G ASG++GL RG +S S+ S+
Sbjct: 223 TLT---------FARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGR 273
Query: 286 -FFYCLHSPYGS-------TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISV 337
F YCL S + +TFG +TP+ P + FY++ L G SV
Sbjct: 274 SFSYCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSV 333
Query: 338 GGERLPLKASYFTKLSTE-------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGI 390
GG R+ + +L+ +DSGT +TR PVY A+R AFR ++ G
Sbjct: 334 GGARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGG 393
Query: 391 EDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPN 449
LFDTCY+LS + V VP +++H GG + L L+ V++ C FA+ +D
Sbjct: 394 FSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFC--FAMAGTDGG 451
Query: 450 SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++GN+QQ+G+ V +D +R+GF P +C
Sbjct: 452 VSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 481
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 126/390 (32%), Positives = 194/390 (49%), Gaps = 39/390 (10%)
Query: 113 KKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD 171
++ F P +G+ + EY+ V +G P ++LDTGS + W QC PC HC Q
Sbjct: 102 RRRGGFAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSG 161
Query: 172 PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATD 231
FDP +S++++ + C + C+ L G D+ C Y +AY DGS G +A++
Sbjct: 162 RVFDPRRSRSYAAVDCVAPICRRLDSA----GCDR-RRNSCLYQVAYGDGSVTAGDFASE 216
Query: 232 RMTIQEVNGNGYFARYPFL----LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY-- 285
+T FAR + +GC +N G ASG++GL RG +S S+ S+
Sbjct: 217 TLT---------FARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGR 267
Query: 286 -FFYCLHSPYGS-------TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISV 337
F YCL S + +TFG +TP+ P + FY++ L G SV
Sbjct: 268 SFSYCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSV 327
Query: 338 GGERLPLKASYFTKLSTE-------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGI 390
GG R+ + +L+ +DSGT +TR PVY A+R AFR ++ G
Sbjct: 328 GGARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGG 387
Query: 391 EDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPN 449
LFDTCY+LS + V VP +++H GG + L L+ V++ C FA+ +D
Sbjct: 388 FSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFC--FAMAGTDGG 445
Query: 450 SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++GN+QQ+G+ V +D +R+GF P +C
Sbjct: 446 VSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 142/424 (33%), Positives = 204/424 (48%), Gaps = 49/424 (11%)
Query: 80 RNTPSLEEILR---RDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD-EYYIV 135
+N E + R R + RLH N+ L A + KA +VA + E+ +
Sbjct: 317 KNLTRFERLRRGVARGKNRLHRLNAMVLAAANATVGDQVKA-------PVVAGNGEFLMK 369
Query: 136 VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
+AIG P + S ++DTGS + WTQCKPC C Q P FDP +S +F KI C+S C L
Sbjct: 370 LAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGAL 429
Query: 196 LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL-LGCT 254
CSS C Y Y D S G A + T + + P L GC
Sbjct: 430 -------PTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQI--SIPGLGFGCG 480
Query: 255 DNNTGDQ-NGASGIMGLDRGPVSIISKTNISYFFYCLHSP---------YGSTGYITFGK 304
++N GD + +G++GL RGP+S++S+ F YCL + GS IT
Sbjct: 481 NDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQKFAYCLTAIDDSKPSSLLLGSLANIT--- 537
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------IDS 358
P T +K +K TP++ P Q FY+++L GISVGG +L + S F +L + IDS
Sbjct: 538 PKT-SKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTF-ELHDDGSGGVIIDS 595
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSA-YKTVVVPKITIHFLG 417
GT IT +++L++ F +M G L D C++L A V VPK+T HF
Sbjct: 596 GTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGL-DLCFNLPAGTNQVEVPKLTFHF-K 653
Query: 418 GVDLELDVRGTLVVES-VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
G DLEL ++ +S +CL S + GN+QQ+ + V +D+ L F P
Sbjct: 654 GADLELPGENYMIGDSKAGLLCLAIG---SSRGMSIFGNLQQQNFMVVHDLQEETLSFLP 710
Query: 477 GNCN 480
C+
Sbjct: 711 TQCD 714
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 126/396 (31%), Positives = 186/396 (46%), Gaps = 26/396 (6%)
Query: 94 QRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGS 153
QR H + + K PD F ++ F P K G EY + + +G P Q +++DTGS
Sbjct: 5 QRSHERVAFYTLKLSPDAFG-SQEFQSPVKAG---NGEYLMTLTLGSPPQSFDVIVDTGS 60
Query: 154 GITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECP 213
+ W QC PC C QQ P FDPSKS++F K C C + P C++ C
Sbjct: 61 DLNWVQCLPCRVCYQQPGPKFDPSKSRSFRKAACTDNLCNV--SALP---LKACAANVCQ 115
Query: 214 YDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRG 273
Y Y D S G A + +++ NG G + F GC N G GA+G++GL +G
Sbjct: 116 YQYTYGDQSNTNGDLAFETISLN--NGAGTQSVPNFAFGCGTQNLGTFAGAAGLVGLGQG 173
Query: 274 PVSI---ISKTNISYFFYCLHSPYG-STGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYH 329
P+S+ +S T + F YCL S S +TFG ++YT IV +Y+
Sbjct: 174 PLSLNSQLSHTFANKFSYCLVSLNSLSASPLTFGSIAAAAN--IQYTSIVVNARHPTYYY 231
Query: 330 ITLTGISVGGERLPLKASYFT------KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKK 383
+ L I VGG+ L L S F + T IDSGT IT P YSA+ A+ +
Sbjct: 232 VQLNSIEVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNY 291
Query: 384 YKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFAL 443
++ L D C++++ VP + F G D ++ V+ L A+
Sbjct: 292 PRLDGSAYGL-DLCFNIAGVSNPSVPDMVFKFQ-GADFQMRGENLFVLVDTSATTLCLAM 349
Query: 444 LPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
S SI +GN+QQ+ + V YD+ +++GF +C
Sbjct: 350 GGSQGFSI-IGNIQQQNHLVVYDLEAKKIGFATADC 384
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 122/360 (33%), Positives = 177/360 (49%), Gaps = 35/360 (9%)
Query: 138 IGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLE 197
IG P S ++DTGS + WTQCKPC+ C +Q P FDPS S T++ +PC+S +C L
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPT 232
Query: 198 WFPPNGQDKC-SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDN 256
KC S+ +C Y Y D S G AT+ T+ + G + GC D
Sbjct: 233 -------SKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPG------VVFGCGDT 279
Query: 257 NTGDQ-NGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST------GYITFGKPDTVN 309
N GD + +G++GL RGP+S++S+ + F YCL S + G + +
Sbjct: 280 NEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTNNSPLLLGSLAGISEASAA 339
Query: 310 KKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITR 364
V+ TP++ P Q FY+++L I+VG R+ L +S F +DSGT IT
Sbjct: 340 ASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITY 399
Query: 365 FPAPVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYDLSA--YKTVVVPKITIHFLGGVDL 421
Y AL+ AF +M G G+ D C+ A V VP++ HF GG DL
Sbjct: 400 LEVQGYRALKKAFAAQMALPAADGSGVG--LDLCFRAPAKGVDQVEVPRLVFHFDGGADL 457
Query: 422 ELDVRGTLVVESVR-QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+L +V++ +CL ++ S SI +GN QQ+ ++ YDV L F P CN
Sbjct: 458 DLPAENYMVLDGGSGALCL--TVMGSRGLSI-IGNFQQQNFQFVYDVGHDTLSFAPVQCN 514
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 174/362 (48%), Gaps = 28/362 (7%)
Query: 128 AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPC 187
+ EY++ + +G P + +++D+GS I W QCKPC C Q DP FDP+ S +F + C
Sbjct: 39 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSC 98
Query: 188 NSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY 247
+S C + C+S C Y+++Y DGS G A + +T+ G
Sbjct: 99 SSAVCDQV-------DNAGCNSGRCRYEVSYGDGSSTKGTLALETLTL------GRTVVQ 145
Query: 248 PFLLGCTDNNTG---DQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY-GSTGYITFG 303
+GC N G G G+ G V +S+ + F YCL S S G++ FG
Sbjct: 146 NVAIGCGHMNQGMFVGAAGLLGLGGGSMSFVGQLSRERGNAFSYCLVSRVTNSNGFLEFG 205
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF--TKLSTE---IDS 358
+ P++ P +Y+I L+G+ VG ++P+ F T+L +D+
Sbjct: 206 S--EAMPVGAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGNGGVVMDT 263
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGG 418
GT +TRFP Y A R AF + G+ +FDTCY+L + +V VP ++ +F GG
Sbjct: 264 GTAVTRFPTVAYEAFRDAFIDQTGNLPRASGVS-IFDTCYNLFGFLSVRVPTVSFYFSGG 322
Query: 419 VDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
L L L+ V+ C FA PS +LGN+QQ G ++ D A +GFGP
Sbjct: 323 PILTLPANNFLIPVDDAGTFCFAFA--PSPSGLSILGNIQQEGIQISVDGANEFVGFGPN 380
Query: 478 NC 479
C
Sbjct: 381 VC 382
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 122/369 (33%), Positives = 182/369 (49%), Gaps = 30/369 (8%)
Query: 129 ADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPC 187
A Y++++++G P ++DTGS +TWTQC PC C Q P +DP++S TFSK+PC
Sbjct: 93 AGAYHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFSKLPC 152
Query: 188 NSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVN----GNGY 243
S C+ L P+ C++ C YD Y G G+ A D + I + + +
Sbjct: 153 ASPLCQAL-----PSAFRACNATGCVYDYRYAVGF-TAGYLAADTLAIGDGDGDGDASSS 206
Query: 244 FARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGY-ITF 302
FA F GC+ N GD +GASGI+GL R +S++S+ + F YCL S + I F
Sbjct: 207 FAGVAF--GCSTANGGDMDGASGIVGLGRSALSLLSQIGVGRFSYCLRSDADAGASPILF 264
Query: 303 GKPDTVNKKFVKYTPIVTTP----EQSEFYHITLTGISVGGERLPLKASYFTKLSTE--- 355
G V V+ T ++ P ++ +Y++ LTGI+VG LP+ +S F +
Sbjct: 265 GALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGG 324
Query: 356 --IDSGTIITRFPAPVYSALRSAFRKRMKK-YKMGKGIEDLFDTCYDLSAYKTVVVPKIT 412
+DSGT T Y+ LR AF + G + FD C++ A T VP++
Sbjct: 325 VIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFEAGAADT-PVPRLV 383
Query: 413 IHFLGGVDLELDVRGTL--VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR 470
F GG + + + V E R CL +LP+ S+ +GNV Q V YD+ G
Sbjct: 384 FRFAGGAEYAVPRQSYFDAVDEGGRVACL--LVLPTRGVSV-IGNVMQMDLHVLYDLDGA 440
Query: 471 RLGFGPGNC 479
F P +C
Sbjct: 441 TFSFAPADC 449
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 128/378 (33%), Positives = 181/378 (47%), Gaps = 33/378 (8%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
+ +EY + +A+G P + V+L LDTGS + WTQC PC C Q P DP+ S T++ +P
Sbjct: 87 IVTNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALP 146
Query: 187 CNSTTCKILLEWFPPNGQDKCSS-----KECPYDIAYVDGSGETGFWATDRMTIQEVNGN 241
C + C+ L F G SS + C Y Y D S G ATDR T NG+
Sbjct: 147 CGAPRCRALP--FTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGD 204
Query: 242 GYFARYP---FLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGS- 296
G +R P GC N G Q+ +GI G RG S+ S+ N++ F YC S + S
Sbjct: 205 GD-SRLPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNVTTFSYCFTSMFESK 263
Query: 297 TGYITFGKPDTVNKKF---------VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKAS 347
+ +T G + V+ TP++ P Q Y ++L GISVG RL + +
Sbjct: 264 SSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPEA 323
Query: 348 YFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL---SAYK 404
ST IDSG IT P VY A+++ F ++ G D C+ L + ++
Sbjct: 324 KLR--STIIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEGSALDLCFALPVTALWR 381
Query: 405 TVVVPKITIHFLGGVDLELDVRGTLVVE--SVRQVCLGFALLPSDPNSILLGNVQQRGYE 462
VP +T+H L G D EL RG V E + R +C+ P D ++GN QQ+
Sbjct: 382 RPPVPSLTLH-LDGADWELP-RGNYVFEDLAARVMCVVLDAAPGD--QTVIGNFQQQNTH 437
Query: 463 VHYDVAGRRLGFGPGNCN 480
V YD+ L F P C+
Sbjct: 438 VVYDLENDWLSFAPARCD 455
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 141/426 (33%), Positives = 200/426 (46%), Gaps = 49/426 (11%)
Query: 80 RNTPSLEEILRRDQQRLHLKNSRRLQK----AIPDNFKKTKAFTFPAKTGIVAADEYYIV 135
+N +++I R + H N RL A+ N T P G + E+ +
Sbjct: 57 KNLTKIQKIQRGINRGFHRLN--RLGAVAVLAVASNPDDTNNIKAPTHGG---SGEFLME 111
Query: 136 VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
++IG P + ++DTGS + WTQCKPC C Q P FDP KS ++SK+ C+S C L
Sbjct: 112 LSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNAL 171
Query: 196 LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVN---GNGYFARYPFLLG 252
P C Y Y D S G AT+ T ++ N G G+ G
Sbjct: 172 -----PRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDENSISGIGF--------G 218
Query: 253 CTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISYFFYCL----HSPYGSTGYITFGKPDT 307
C N GD + SG++GL RGP+S+IS+ + F YCL S S+ +I
Sbjct: 219 CGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGI 278
Query: 308 VNK-------KFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE----- 355
VNK + K ++ P+Q FY++ L GI+VG +RL ++ S F +LS +
Sbjct: 279 VNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTF-ELSEDGTGGM 337
Query: 356 -IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL-SAYKTVVVPKITI 413
IDSGT IT + L+ F RM G L D C+ L +A K + VPK+
Sbjct: 338 IIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGL-DLCFKLPNAAKNIAVPKLIF 396
Query: 414 HFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
HF G DLEL +V +S V L A+ S+ SI GNVQQ+ + V +D+ +
Sbjct: 397 HF-KGADLELPGENYMVADSSTGV-LCLAMGSSNGMSI-FGNVQQQNFNVLHDLEKETVT 453
Query: 474 FGPGNC 479
F P C
Sbjct: 454 FVPTEC 459
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 125/390 (32%), Positives = 194/390 (49%), Gaps = 39/390 (10%)
Query: 113 KKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD 171
++ F P +G+ + EY+ V +G P ++LDTGS + W QC PC HC Q
Sbjct: 102 RRRGGFAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSG 161
Query: 172 PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATD 231
FDP +S++++ + C + C+ L G D+ C Y +AY DGS G +A++
Sbjct: 162 RVFDPRRSRSYAAVDCVAPICRRLDSA----GCDR-RRNSCLYQVAYGDGSVTAGDFASE 216
Query: 232 RMTIQEVNGNGYFARYPFL----LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY-- 285
+T FAR + +GC +N G ASG++GL RG +S ++ S+
Sbjct: 217 TLT---------FARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPTQIARSFGR 267
Query: 286 -FFYCLHSPYGS-------TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISV 337
F YCL S + +TFG +TP+ P + FY++ L G SV
Sbjct: 268 SFSYCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSV 327
Query: 338 GGERLPLKASYFTKLSTE-------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGI 390
GG R+ + +L+ +DSGT +TR PVY A+R AFR ++ G
Sbjct: 328 GGARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGG 387
Query: 391 EDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPN 449
LFDTCY+LS + V VP +++H GG + L L+ V++ C FA+ +D
Sbjct: 388 FSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFC--FAMAGTDGG 445
Query: 450 SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++GN+QQ+G+ V +D +R+GF P +C
Sbjct: 446 VSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 177 bits (450), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 129/424 (30%), Positives = 208/424 (49%), Gaps = 38/424 (8%)
Query: 74 LNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFT--FPAKTGI----- 126
L+ G+ R P L +L + ++L +++AI ++ ++ + +GI
Sbjct: 31 LHHGQKRPQPGLRVVLEQVDSGMNLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVY 90
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
+ EY + VAIG P +S ++DTGS + WTQC+PC C Q P F+P S +FS +P
Sbjct: 91 AGSGEYLMNVAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLP 150
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
C S C+ L P+ + C +C Y Y DGS G+ AT+ T + +
Sbjct: 151 CESQYCQDL-----PS--ESC-YNDCQYTYGYGDGSSTQGYMATETFTFET-------SS 195
Query: 247 YP-FLLGCTDNNTG-DQNGASGIMGLDRGPVSIISKTNISYFFYCL-HSPYGSTGYITFG 303
P GC ++N G Q +G++G+ GP+S+ S+ + F YC+ S S + G
Sbjct: 196 VPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSSGSSSPSTLALG 255
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------ID 357
+ + T ++ + +Y+ITL GI+VGG+ L + +S F +L + ID
Sbjct: 256 SAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTF-QLQDDGTGGMIID 314
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL-SAYKTVVVPKITIHFL 416
SGT +T P Y+A+ AF ++ + + L TC+ L S TV VP+I++ F
Sbjct: 315 SGTTLTYLPQDAYNAVAQAFTDQINLSPVDESSSGL-STCFQLPSDGSTVQVPEISMQFD 373
Query: 417 GGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSI-LLGNVQQRGYEVHYDVAGRRLGFG 475
GGV L L L+ + +CL A+ S I + GN+QQ+ +V YD+ + F
Sbjct: 374 GGV-LNLGEENVLISPAEGVICL--AMGSSSQQGISIFGNIQQQETQVLYDLQNLAVSFV 430
Query: 476 PGNC 479
P C
Sbjct: 431 PTQC 434
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 177 bits (449), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 135/439 (30%), Positives = 208/439 (47%), Gaps = 39/439 (8%)
Query: 61 SLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLH--LKNSRRLQKAIPDNFKKTKAF 118
SL V+ G CS S ++ E ++ D R +K K + +
Sbjct: 53 SLSVMHIQGKCSPFRLLNSSWWTAVSESIKGDTARYRAMVKGGWSAGKTM---VNPQEDA 109
Query: 119 TFPAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPS 177
P +G +++ Y I + G P Q +LDTGS I W C PC CS ++ P F+PS
Sbjct: 110 DIPLASGQAISSSNYIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSSKQQP-FEPS 168
Query: 178 KSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTI-- 235
KS T++ + C S C++L + CS + Y D S +++ +++
Sbjct: 169 KSSTYNYLTCASQQCQLLRVCTKSDNSVNCSLTQ-----RYGDQSEVDEILSSETLSVGS 223
Query: 236 QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS 292
Q+V F+ GC++ G ++G R P+S +S+T Y F YCL S
Sbjct: 224 QQVEN--------FVFGCSNAARGLIQRTPSLVGFGRNPLSFVSQTATLYDSTFSYCLPS 275
Query: 293 PYGS--TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF- 349
+ S TG + GK + ++ + +K+TP+++ FY++ L GISVG E + + A
Sbjct: 276 LFSSAFTGSLLLGK-EALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLS 334
Query: 350 ----TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT 405
T T IDSGT+ITR P Y+A+R +FR ++ M DLFDTCY+ +
Sbjct: 335 LDESTGRGTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTMASPT-DLFDTCYNRPS-GD 392
Query: 406 VVVPKITIHFLGGVDLELDVRGTLV--VESVRQVCLGFALLPSDPNSIL--LGNVQQRGY 461
V P IT+HF +DL L + L + +CL F L P + +L GN QQ+
Sbjct: 393 VEFPLITLHFDDNLDLTLPLDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKL 452
Query: 462 EVHYDVAGRRLGFGPGNCN 480
+ +DVA RLG NC+
Sbjct: 453 RIVHDVAESRLGIASENCD 471
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 145/461 (31%), Positives = 227/461 (49%), Gaps = 39/461 (8%)
Query: 29 LSHSYIVSVSSLIPPTVCNRTRTALPQGPGK----VSLEVLGRYGPCSKLNQGKSRNTPS 84
L+ +++ V+ + P C + L + GK VS ++ Y CS
Sbjct: 17 LAITFMCGVAEIAPGLNCRSSDKILNRKVGKRSHSVSFPLIHIYSECSPFRPPNRTWESL 76
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQY 144
+ E +R D RL R K + K+ P ++G + EY I V G PKQ
Sbjct: 77 MSEKIRGDANRL------RFLKRTSRSSKQDANANVPVRSG---SGEYIIQVDFGTPKQS 127
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQ 204
+ L+DTGS + W CK C C P FDP+KS ++ C+S C+ + +G
Sbjct: 128 MYTLIDTGSDVAWIPCKQCQGC-HSTAPIFDPAKSSSYKPFACDSQPCQEI------SGN 180
Query: 205 DKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGA 264
+SK C ++++Y DG+ G A+D +T+ G+ Y + F GC ++ + D + +
Sbjct: 181 CGGNSK-CQFEVSYGDGTQVDGTLASDAITL----GSQYLPNFSF--GCAESLSEDTSPS 233
Query: 265 SGIMGLDRGPVSIISKTNISYFF-----YCLHSPYGSTGYITFGKPDTVNKKFVKYTPIV 319
G+MGL G +S++++ + F YCL S S+G + GK V+ +K+T ++
Sbjct: 234 PGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGSLVLGKEAAVSSSSLKFTTLI 293
Query: 320 TTPEQSEFYHITLTGISVGGERLPLKASYFTK-LSTEIDSGTIITRFPAPVYSALRSAFR 378
P FY +TL ISVG R+ + + T IDSGT IT Y+ALR AFR
Sbjct: 294 KDPSIPTFYFVTLKAISVGNTRISVPGTNIASGGGTIIDSGTTITHLVPSAYTALRDAFR 353
Query: 379 KRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVC 438
+++ + +ED+ DTCYDLS+ +V VP IT+H VDL L L+ + C
Sbjct: 354 QQLSSLQ-PTPVEDM-DTCYDLSS-SSVDVPTITLHLDRNVDLVLPKENILITQESGLAC 410
Query: 439 LGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L F+ +D SI +GNVQQ+ + + +DV ++GF C
Sbjct: 411 LAFS--STDSRSI-IGNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 491
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 142/437 (32%), Positives = 201/437 (45%), Gaps = 45/437 (10%)
Query: 68 YGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKA--------FT 119
YGPCS PSL E+LR DQ R R+ + D + + F
Sbjct: 75 YGPCSP----SEGTPPSLVEMLRWDQARTDYVR-RKATGEVDDVLEPDRPHVDMMQMDFM 129
Query: 120 FPAKTGIVAADEYYIVVAIGKPKQYV----SLLLDTGSGITWTQCKPCI--HCSQQRDPF 173
GI + Y V+ + ++ +DT + W QC PC+ C QR+ F
Sbjct: 130 LRGTFGIGSGSGYGAVIDGDDDDDPMILSQTMAIDTTEDVPWIQCLPCLIPQCYPQRNAF 189
Query: 174 FDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDK-CSSKECPYDIAYVDGSGETGFWATDR 232
FDP +S T + + C S C+ L + NG K S+ +C Y I Y D G + TD
Sbjct: 190 FDPRRSSTGAPVRCGSRACRTLGGY--ANGCSKPNSTGDCLYRIEYSDHRLTLGTYMTDT 247
Query: 233 MTIQEVNGNGYFARYPFLLGCTDNNTGDQNG-ASGIMGLDRGPVSIISKTNISY---FFY 288
+TI + F + F GC+ G + ASG M L GP S++S+T +Y F Y
Sbjct: 248 LTISP---STTFLNFRF--GCSHAVRGKFSAQASGTMSLGGGPQSLLSQTARAYGNAFSY 302
Query: 289 CLHSPYGSTGYITFGKP----DTVNKKFVKYTPIVTTPE--QSEFYHITLTGISVGGERL 342
C+ P + G+++ G P D TP+V + Y + L GI V G RL
Sbjct: 303 CVPGP-SAAGFLSIGGPVNGDDGGGSGAFATTPLVRSANVINPTIYVVRLQGIEVAGRRL 361
Query: 343 PLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSA 402
+ F+ T +DS +IT+ P Y ALR AFR M+ YK +L DTC+D
Sbjct: 362 NVPPVVFSG-GTVMDSSAVITQLPPTAYRALRLAFRNAMRAYKTRAPTGNL-DTCFDFVG 419
Query: 403 YKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYE 462
V VP +++ F GG +EL + L+ CL FA + +D +GNVQQ+ +E
Sbjct: 420 VSKVTVPTVSLVFDGGAVIELGLLSVLL-----DSCLAFAPMAADFALGFIGNVQQQTHE 474
Query: 463 VHYDVAGRRLGFGPGNC 479
V YDVAG +GF G C
Sbjct: 475 VLYDVAGGAVGFRHGAC 491
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 123/371 (33%), Positives = 179/371 (48%), Gaps = 36/371 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + +A+G P Q ++ LLDTGS + WTQC C C +Q DP F P S ++ + C
Sbjct: 97 EYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQ 156
Query: 191 TCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C +L C + C Y +Y DG+ G++AT+R T + +G P
Sbjct: 157 LCGDIL-------HHSCVRPDTCTYRYSYGDGTTTLGYYATERFTF--ASSSGETQSVPL 207
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST--GYITFGKPDT 307
GC N G N ASGI+G R P+S++S+ +I F YCL +PY S+ + FG
Sbjct: 208 GFGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFSYCL-TPYASSRKSTLQFGSLAD 266
Query: 308 VN-----KKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEID 357
V V+ TPI+ + + FY++ TG++VG RL + AS F ID
Sbjct: 267 VGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIID 326
Query: 358 SGTIITRFPAPVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYDLSAY--------KTVVV 408
SGT +T FPA V + + AFR +++ + G +D C+ A + V V
Sbjct: 327 SGTALTLFPAAVLAEVVRAFRSQLRLPFANGSSPDD--GVCFAAPAVAAGGGRMARQVAV 384
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVA 468
P++ HF G DL+L R V+E R+ L L S + +GN Q+ V YD+
Sbjct: 385 PRMVFHFQ-GADLDLP-RENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLE 442
Query: 469 GRRLGFGPGNC 479
L F P C
Sbjct: 443 RETLSFAPVEC 453
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 135/393 (34%), Positives = 188/393 (47%), Gaps = 46/393 (11%)
Query: 109 PDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQ 168
PD+ KA P G + E+ + ++IG P S ++DTGS + WTQCKPC C
Sbjct: 90 PDDTNNIKA---PTHGG---SGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFD 143
Query: 169 QRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFW 228
Q P FDP KS ++SK+ C+S C L P C Y Y D S G
Sbjct: 144 QPTPIFDPEKSSSYSKVGCSSGLCNAL-----PRSNCNEDKDACEYLYTYGDYSSTRGLL 198
Query: 229 ATDRMTIQEVN---GNGYFARYPFLLGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNIS 284
AT+ T ++ N G G+ GC N GD + SG++GL RGP+S+IS+ +
Sbjct: 199 ATETFTFEDENSISGIGF--------GCGVENEGDGFSQGSGLVGLGRGPLSLISQLKET 250
Query: 285 YFFYCL----HSPYGSTGYITFGKPDTVNK-------KFVKYTPIVTTPEQSEFYHITLT 333
F YCL S S+ +I VNK + K ++ P+Q FY++ L
Sbjct: 251 KFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQ 310
Query: 334 GISVGGERLPLKASYFTKLSTE------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMG 387
GI+VG +RL ++ S F +L+ + IDSGT IT + L+ F RM
Sbjct: 311 GITVGAKRLSVEKSTF-ELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDD 369
Query: 388 KGIEDLFDTCYDL-SAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPS 446
G L D C+ L A K + VPK+ HF G DLEL +V +S V L A+ S
Sbjct: 370 SGSTGL-DLCFKLPDAAKNIAVPKMIFHF-KGADLELPGENYMVADSSTGV-LCLAMGSS 426
Query: 447 DPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ SI GNVQQ+ + V +D+ + F P C
Sbjct: 427 NGMSI-FGNVQQQNFNVLHDLEKETVSFVPTEC 458
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 109/327 (33%), Positives = 164/327 (50%), Gaps = 27/327 (8%)
Query: 146 SLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
++++D+GS ++W QCKPC C +QRDP FDP+ S T++ +PC S C L +
Sbjct: 78 TVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPY----- 132
Query: 204 QDKCSSK-ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-- 260
+ CS+ +C + I Y DGS TG ++ D +T+ Y F GC + G
Sbjct: 133 RRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGP-----YDVIRGFRFGCAHADRGSAF 187
Query: 261 QNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKY-- 315
+G + L G S++ +T Y F YCL S G++ G P + +
Sbjct: 188 DYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVS 247
Query: 316 TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRS 375
TP++++ FY + L I V G L + + F+ S+ IDS TII+R P Y ALR+
Sbjct: 248 TPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSA-SSVIDSSTIISRLPPTAYQALRA 306
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVR 435
AFR M Y+ + + DTCYD + +++ +P I + F GG + LD G L+
Sbjct: 307 AFRSAMTMYRAAPPVS-ILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLGS--- 362
Query: 436 QVCLGFALLPSDPNSILLGNVQQRGYE 462
CL FA SD +GNVQQ+ E
Sbjct: 363 --CLAFAPTASDRMPGFIGNVQQKTLE 387
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 86/280 (30%), Positives = 128/280 (45%), Gaps = 42/280 (15%)
Query: 205 DKCSSK-ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNG 263
+ CS+ +C + I Y DGS TG ++ D +T+
Sbjct: 387 EGCSANAQCQFGINYGDGSTATGTYSFDDLTL---------------------------- 418
Query: 264 ASGIMGLDRGPVSIISKTNISYFF-YCLHSPYGSTGYITFGKP---DTVNKKFVKYTPIV 319
G +DR + + + T F YC+ S G+IT G P + FV +
Sbjct: 419 --GPYDVDRQGLPLRTATQYGRVFSYCIPPSPSSLGFITLGVPPQRAALVPTFVSTPLLS 476
Query: 320 TTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRK 379
++ FY + L I V G LP+ + F+ S+ I S T+I+R P Y ALR+AFR+
Sbjct: 477 SSSMPPTFYRVLLRAIIVAGRPLPVPPTVFST-SSVIASTTVISRLPPTAYQALRAAFRR 535
Query: 380 RMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCL 439
M Y+ + + DTCYD + +++ +P I + F GG + LD G L+ Q CL
Sbjct: 536 AMTMYRTAPPVS-ILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-----QGCL 589
Query: 440 GFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
FA +D +GNVQQR EV YDV G+ + F C
Sbjct: 590 AFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 629
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 119/362 (32%), Positives = 182/362 (50%), Gaps = 28/362 (7%)
Query: 128 AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPC 187
+ EY++ + +G P + +++D+GS I W QCKPC C Q DP FDP+ S +F + C
Sbjct: 39 GSGEYFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSC 98
Query: 188 NSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY 247
+S C + C+S C Y+++Y DGS G A + +T G
Sbjct: 99 SSAVCDRVE-------NAGCNSGRCRYEVSYGDGSYTKGTLALETLTF------GRTVVR 145
Query: 248 PFLLGCTDNNTGDQNGASGIMGLDRGPVSI---ISKTNISYFFYCLHSPYGST-GYITFG 303
+GC +N G GA+G++GL G +S +S + F YCL S +T G++ FG
Sbjct: 146 NVAIGCGHSNRGMFVGAAGLLGLGGGSMSFMGQLSGQTGNAFSYCLVSRGTNTNGFLEFG 205
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF--TKLSTE---IDS 358
+ P+V P FY+I L G+ VG R+P+ F +L + +D+
Sbjct: 206 S--EAMPVGAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDT 263
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGG 418
GT +TRFP Y A R+AF ++ + G+ +FDTCY+L + +V VP ++ +F GG
Sbjct: 264 GTAVTRFPTVAYEAFRNAFIEQTQNLPRASGVS-IFDTCYNLFGFLSVRVPTVSFYFSGG 322
Query: 419 VDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
L + L+ V+ C FA PS +LGN+QQ G ++ D A +GFGP
Sbjct: 323 PILTIPANNFLIPVDDAGTFCFAFA--PSPSGLSILGNIQQEGIQISVDEANEFVGFGPN 380
Query: 478 NC 479
C
Sbjct: 381 IC 382
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 120/361 (33%), Positives = 176/361 (48%), Gaps = 38/361 (10%)
Query: 128 AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPC 187
+ EY++ + IG P Y +++D+GS I W QC+PC C Q DP F+P+ S +F + C
Sbjct: 125 GSGEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSASFIGVAC 184
Query: 188 NSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY 247
+S C L + C C Y +AY DGS G A + +TI G
Sbjct: 185 SSNVCNQL------DDDVACRKGRCGYQVAYGDGSYTKGTLALETITI------GRTVIQ 232
Query: 248 PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNIS---YFFYCLHSPYGSTGYITFGK 304
+GC N G GA+G++GL GP+S + + F YCL S G +
Sbjct: 233 DTAIGCGHWNEGMFVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVSRAMPVGAM---- 288
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF--TKLSTE---IDSG 359
+ P++ P FY+++L+G++VGG R+P+ F T + T +D+G
Sbjct: 289 ----------WVPLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGGVVMDTG 338
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
T ITR P Y+A R AF + G+ +FDTCYDL+ + TV VP ++ +F GG
Sbjct: 339 TAITRLPTVAYNAFRDAFIAQTTNLPRAPGVS-IFDTCYDLNGFVTVRVPTVSFYFSGGQ 397
Query: 420 DLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGN 478
L R L+ + V C FA PS ++GN+QQ G +V D +GFGP
Sbjct: 398 ILTFPARNFLIPADDVGTFCFAFA--PSPSGLSIIGNIQQEGIQVSIDGTNGFVGFGPNV 455
Query: 479 C 479
C
Sbjct: 456 C 456
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 108/282 (38%), Positives = 154/282 (54%), Gaps = 19/282 (6%)
Query: 207 CSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASG 266
CS C Y + Y DGS GF+A D +T+ + A F GC + N G A+G
Sbjct: 16 CSGGHCLYGVQYGDGSYTIGFFAMDTLTLSSHD-----AIKGFRFGCGERNEGLFGEAAG 70
Query: 267 IMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDT--VNKKFVKYTPIV-- 319
++GL RG S+ +T Y F +C + TGY+ FG + V+ K + TP++
Sbjct: 71 LLGLGRGKTSLPVQTYDKYGGVFAHCFPARSSGTGYLEFGPGSSPAVSAK-LSTTPMLID 129
Query: 320 TTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRK 379
T P FY++ +TGI VGG+ LP+ S F T +DSGT+ITR P YS+LRSAF
Sbjct: 130 TGPT---FYYVGMTGIRVGGKLLPIPQSVFAAAGTIVDSGTVITRLPPAAYSSLRSAFAA 186
Query: 380 RM--KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV 437
M + YK + L DTCYDL+ V +P +++ F GGV L++D G + SV Q
Sbjct: 187 SMAARGYKRAPALS-LLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASGIIYAASVSQA 245
Query: 438 CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CLGFA + + ++GN Q + + V YD+A + +GF PG C
Sbjct: 246 CLGFAGNEAADDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 124/355 (34%), Positives = 178/355 (50%), Gaps = 35/355 (9%)
Query: 147 LLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDK 206
++LDTGS + W QC PC C +Q P FDP +S ++ + C + C+ L +G
Sbjct: 1 MVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRL-----DSGGCD 55
Query: 207 CSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASG 266
C Y +AY DGS G + T+ +T G AR LGC +N G A+G
Sbjct: 56 LRRGACMYQVAYGDGSVTAGDFVTETLTFA---GGARVAR--VALGCGHDNEGLFVAAAG 110
Query: 267 IMGLDRGPVSIISKTNISY---FFYCL----HSPYGS------TGYITFGKPDTVNKKFV 313
++GL RG +S ++ + Y F YCL S G+ + ++FG +V
Sbjct: 111 LLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGA-GSVGASSA 169
Query: 314 KYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-------IDSGTIITRFP 366
+TP+V P FY++ L GISVGG R+P A +L +DSGT +TR
Sbjct: 170 SFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLA 229
Query: 367 APVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDV 425
YSALR AFR ++ G LFDTCYDL + V VP +++HF GG + L
Sbjct: 230 RASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPP 289
Query: 426 RGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L+ V+S C FA +D ++GN+QQ+G+ V +D G+R+GF P C
Sbjct: 290 ENYLIPVDSRGTFC--FAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 125/404 (30%), Positives = 189/404 (46%), Gaps = 46/404 (11%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQY 144
L+ L+RD +R+ RRL +++ T + EY++ + +G P +
Sbjct: 155 LDGRLKRDAKRVA-SLIRRLSSGGGGSYRVDDFGTDVISGMEQGSGEYFVRIGVGSPPRS 213
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQ 204
+++D+GS I W QC+PC C Q DP FDP+ S +F+ + C+S+ C L
Sbjct: 214 QYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCSSSVCDRLEN------- 266
Query: 205 DKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGA 264
C + C Y+++Y DGS G A + +T G +GC N G GA
Sbjct: 267 AGCHAGRCRYEVSYGDGSYTKGTLALETLTF------GRTMVRSVAIGCGHRNRGMFVGA 320
Query: 265 SGIMGLDRGPVSIISK---TNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTT 321
+G++GL G +S + + F YCL S + P+V
Sbjct: 321 AGLLGLGGGSMSFVGQLGGQTGGAFSYCLVS--------------------AAWVPLVRN 360
Query: 322 PEQSEFYHITLTGISVGGERLPLKASYF--TKL---STEIDSGTIITRFPAPVYSALRSA 376
P FY+I L G+ VGG R+P+ F T+L +D+GT +TR P Y A R A
Sbjct: 361 PRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDA 420
Query: 377 FRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVR 435
F + G+ +FDTCYDL + +V VP ++ +F GG L L R L+ ++
Sbjct: 421 FLAQTANLPRATGVA-IFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAG 479
Query: 436 QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
C FA PS +LGN+QQ G ++ +D A +GFGP C
Sbjct: 480 TFCFAFA--PSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 521
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 176 bits (445), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 122/371 (32%), Positives = 174/371 (46%), Gaps = 20/371 (5%)
Query: 125 GIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQ-RDPFFDPSKSKTFS 183
G + +EY + V++G P + V+L LDTGS + WTQC PC+ C +Q P DP+ S T +
Sbjct: 83 GGIVTNEYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLDPAASSTHA 142
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGY 243
+PC++ C+ L F G + C Y Y D S G ATD T + G
Sbjct: 143 ALPCDAPLCRALP--FTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGG 200
Query: 244 FARYPFLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYG--STGYI 300
A GC N G Q +GI G RG S+ S+ N++ F YC S + S+ +
Sbjct: 201 LAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFDTKSSSVV 260
Query: 301 TFGKP--------DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL 352
T G + V+ T ++ P Q Y + L GISVGG R+ + S +
Sbjct: 261 TLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRL-RS 319
Query: 353 STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL---SAYKTVVVP 409
ST IDSG IT P VY A+++ F ++ G L D C+ L + ++ VP
Sbjct: 320 STIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAAL-DLCFALPVAALWRRPAVP 378
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
+T+H GG D EL RG V E L L + +++GN QQ+ V YD+
Sbjct: 379 ALTLHLDGGADWELP-RGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQQQNTHVVYDLEN 437
Query: 470 RRLGFGPGNCN 480
L F P C+
Sbjct: 438 DVLSFAPARCD 448
>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 523
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 122/372 (32%), Positives = 182/372 (48%), Gaps = 42/372 (11%)
Query: 129 ADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCN 188
A EY ++ G P Q + DT G++ +CKPC+ + DP F+PS+S +F+ IPC
Sbjct: 173 ALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPC-DPAFEPSRSSSFAAIPCG 231
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
S C + +C+ CP+ I + + + G D +T+ + FA +
Sbjct: 232 SPECAV-----------ECTGASCPFTIQFGNVTVANGTLVRDTLTLPP---SATFAGFT 277
Query: 249 FLLGC----TDNNTGDQNGASGIMGLDRGPVSIISK-------TNISYFFYCL--HSPYG 295
F GC D +T D GA G++ L R S+ S+ T+ + F YCL S
Sbjct: 278 F--GCIEVGADADTFD--GAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATS 333
Query: 296 STGYITFG--KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
S G+++ G +P+ +KY P+ + P Y + L GISVGGE LP+ + F
Sbjct: 334 SRGFLSIGASRPEYSGGD-IKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVFAAHG 392
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITI 413
T +++ T T Y+ALR AFRK M Y + DTCY+L+ ++ VP + +
Sbjct: 393 TLLEAATEFTFLAPAAYAALRDAFRKDMAPYPAAPPFR-VLDTCYNLTGLASLAVPAVAL 451
Query: 414 HFLGGVDLELDVRGTLVVESVRQV-----CLGFALLPSDPNSI-LLGNVQQRGYEVHYDV 467
F GG +LELDVR + V CL FA P + ++G + QR EV YD+
Sbjct: 452 RFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDL 511
Query: 468 AGRRLGFGPGNC 479
G R+GF PG C
Sbjct: 512 RGGRVGFIPGRC 523
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 121/368 (32%), Positives = 173/368 (47%), Gaps = 30/368 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + +A+G P Q VS LLDTGS + WTQC PC C Q DP F P S ++ + C
Sbjct: 103 EYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAGE 162
Query: 191 TCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY-- 247
C +L C + C Y +Y DG+ G +AT+R T + G +
Sbjct: 163 LCNDIL-------HHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSA 215
Query: 248 PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST-------GYI 300
P GC N G N SGI+G R P+S++S+ I F YCL +PY S G +
Sbjct: 216 PLGFGCGTMNKGSLNNGSGIVGFGRAPLSLVSQLAIRRFSYCL-TPYASGRKSTLLFGSL 274
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTE 355
G D V+ T ++ + + FY++ TG++VG RL + S F
Sbjct: 275 RGGVYDAATAT-VQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAI 333
Query: 356 IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFD-TCYDLSAYKT---VVVPKI 411
+DSGT +T FPAPV + + AFR +++ G D C+ +A + VVP++
Sbjct: 334 VDSGTALTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGVCFAAAASRVPRPAVVPRM 393
Query: 412 TIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRR 471
H L G DL+L R V++ R+ L L S + +GN Q+ V YD+
Sbjct: 394 VFH-LQGADLDLPRR-NYVLDDQRKGNLCLLLADSGDSGTTIGNFVQQDMRVLYDLEADT 451
Query: 472 LGFGPGNC 479
L F P C
Sbjct: 452 LSFAPAQC 459
>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
Length = 435
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 122/372 (32%), Positives = 181/372 (48%), Gaps = 42/372 (11%)
Query: 129 ADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCN 188
A EY ++ G P Q + DT G++ +CKPC+ DP F+PS+S +F+ IPC
Sbjct: 85 ALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVG-GAPCDPAFEPSRSSSFAAIPCG 143
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
S C + +C+ CP+ I + + + G D +T+ + FA +
Sbjct: 144 SPECAV-----------ECTGASCPFTIQFGNVTVANGTLVRDTLTLPP---SATFAGFT 189
Query: 249 FLLGC----TDNNTGDQNGASGIMGLDRGPVSIISK-------TNISYFFYCL--HSPYG 295
F GC D +T D GA G++ L R S+ S+ T+ + F YCL S
Sbjct: 190 F--GCIEVGADADTFD--GAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATS 245
Query: 296 STGYITFG--KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
S G+++ G +P+ +KY P+ + P Y + L GISVGGE LP+ + F
Sbjct: 246 SRGFLSIGASRPEYSGGD-IKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVFAAHG 304
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITI 413
T +++ T T Y+ALR AFRK M Y + DTCY+L+ ++ VP + +
Sbjct: 305 TLLEAATEFTFLAPAAYAALRDAFRKDMAPYPAAPPFR-VLDTCYNLTGLASLAVPAVAL 363
Query: 414 HFLGGVDLELDVRGTLVVESVRQV-----CLGFALLPSDPNSI-LLGNVQQRGYEVHYDV 467
F GG +LELDVR + V CL FA P + ++G + QR EV YD+
Sbjct: 364 RFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDL 423
Query: 468 AGRRLGFGPGNC 479
G R+GF PG C
Sbjct: 424 RGGRVGFIPGRC 435
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 122/371 (32%), Positives = 178/371 (47%), Gaps = 36/371 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + +A+G P Q ++ LLDTGS + WTQC C C +Q DP F P S ++ + C
Sbjct: 97 EYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQ 156
Query: 191 TCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C +L C + C Y +Y DG+ G++AT+R T + +G P
Sbjct: 157 LCGDIL-------HHSCVRPDTCTYRYSYGDGTTTLGYYATERFTF--ASSSGETQSVPL 207
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST--GYITFGKPDT 307
GC N G N ASGI+G R P+S++S+ +I F YCL +PY S+ + FG
Sbjct: 208 GFGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFSYCL-TPYASSRKSTLQFGSLAD 266
Query: 308 VN-----KKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEID 357
V V+ TPI+ + + FY++ TG++VG RL + AS F ID
Sbjct: 267 VGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIID 326
Query: 358 SGTIITRFPAPVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYDLSAY--------KTVVV 408
SGT +T FP V + + AFR +++ + G +D C+ A + V V
Sbjct: 327 SGTALTLFPVAVLAEVVRAFRSQLRLPFANGSSPDD--GVCFAAPAVAAGGGRMARQVAV 384
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVA 468
P++ HF G DL+L R V+E R+ L L S + +GN Q+ V YD+
Sbjct: 385 PRMVFHFQ-GADLDLP-RENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLE 442
Query: 469 GRRLGFGPGNC 479
L F P C
Sbjct: 443 RETLSFAPVEC 453
>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
Length = 435
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 121/372 (32%), Positives = 181/372 (48%), Gaps = 42/372 (11%)
Query: 129 ADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCN 188
A EY ++ G P Q + DT G++ +CKPC+ DP F+PS+S +F+ IPC
Sbjct: 85 ALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVG-GAPCDPAFEPSRSSSFAAIPCG 143
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
S C + +C+ CP+ I + + + G D +T+ + FA +
Sbjct: 144 SPECAV-----------ECTGASCPFTIQFGNVTVANGTLVRDTLTLPP---SATFAGFT 189
Query: 249 FLLGC----TDNNTGDQNGASGIMGLDRGPVSIISK-------TNISYFFYCL--HSPYG 295
F GC D +T D GA G++ L R S+ S+ T+ + F YCL S
Sbjct: 190 F--GCIEVGADADTFD--GAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATS 245
Query: 296 STGYITFG--KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
S G+++ G +P+ +KY P+ + P Y + L GISVGGE LP+ + F
Sbjct: 246 SRGFLSIGASRPEYSGGD-IKYAPMSSNPNHPNSYFVELVGISVGGEDLPVPPAVFAAHG 304
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITI 413
T +++ T T Y+ALR AFR+ M Y + DTCY+L+ ++ VP + +
Sbjct: 305 TLLEAATEFTFLAPAAYAALRDAFRRDMAPYPAAPPFR-VLDTCYNLTGLASLAVPTVAL 363
Query: 414 HFLGGVDLELDVRGTLVVESVRQV-----CLGFALLPSDPNSI-LLGNVQQRGYEVHYDV 467
F GG +LELDVR + V CL FA P + ++G + QR EV YD+
Sbjct: 364 RFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDL 423
Query: 468 AGRRLGFGPGNC 479
G R+GF PG C
Sbjct: 424 RGGRVGFIPGRC 435
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 174 bits (442), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 129/374 (34%), Positives = 189/374 (50%), Gaps = 31/374 (8%)
Query: 118 FTFPAKTGIVAAD-EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDP 176
F P +GI +Y+ + +G P + V ++ DTGS ++W QC PC C +Q+DP F+P
Sbjct: 66 FASPLISGIAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNP 125
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVDGSGETGFWATDRMTI 235
S S +F + C S+ C L CS K EC Y ++Y DGS G ++T+ ++
Sbjct: 126 SLSSSFKPLACASSICGKL-------KIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSF 178
Query: 236 QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL-H 291
G A +GC NN G +GA+G++GL RGP+S S+T SY F YCL
Sbjct: 179 ------GEHAVRSVAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPR 232
Query: 292 SPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTK 351
+ FG P V +K ++T ++ +Y++ L I V G + + F
Sbjct: 233 RESAIAASLVFG-PSAVPEK-ARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAM 290
Query: 352 LS-----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV 406
S +DSGT I+R P Y+ALR AFR + + GI LFDTCYDLS+ KT
Sbjct: 291 GSRGTGGVIVDSGTAISRLTTPAYTALRDAFRS-LVTFPSAPGIS-LFDTCYDLSSMKTA 348
Query: 407 VVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHY 465
+P + + F GG + L G LV V+ CL FA P + ++GNVQQ+ + +
Sbjct: 349 TLPAVVLDFDGGASMPLPADGILVNVDDEGTYCLAFA--PEEEAFSIIGNVQQQTFRISI 406
Query: 466 DVAGRRLGFGPGNC 479
D ++G P C
Sbjct: 407 DNQKEQMGIAPDQC 420
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 174 bits (441), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 123/372 (33%), Positives = 174/372 (46%), Gaps = 35/372 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY I +AIG P Q VS LLDTGS + WTQC PC C Q DP F P+ S ++ + C+
Sbjct: 102 EYLIDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSGQ 161
Query: 191 TCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C +L C + C Y Y DG+ G +AT+R T +G P
Sbjct: 162 LCNDILHH-------SCQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKL--SVPL 212
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST--GYITFG---- 303
GC N G N SGI+G R P+S++S+ +I F YCL +PY ST + FG
Sbjct: 213 GFGCGTMNVGSLNNGSGIVGFGRDPLSLVSQLSIRRFSYCL-TPYTSTRKSTLMFGSLSD 271
Query: 304 ---KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTE 355
+ D V+ T ++ + + FY++ TG++VG RL + S F
Sbjct: 272 GVFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSGGVI 331
Query: 356 IDSGTIITRFPAPVYSALRSAFRKRMK-KYKMGKGIED-------LFDTCYDLSAYKTVV 407
+DSGT +T FPA V + + AFR +++ + +D + SA V
Sbjct: 332 VDSGTALTLFPAAVLTEVLRAFRAQLRLPFTSSSSPDDGVCFATPMAAGGRRASAATVVS 391
Query: 408 VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
VP++ HF G DLEL R V++ R+ L L S + +GN Q+ V YD+
Sbjct: 392 VPRMAFHFQ-GADLELPRR-NYVLDDPRRGSLCILLADSGDSGATIGNFVQQDMRVLYDL 449
Query: 468 AGRRLGFGPGNC 479
L F P C
Sbjct: 450 EAETLSFAPAQC 461
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 148/478 (30%), Positives = 211/478 (44%), Gaps = 53/478 (11%)
Query: 31 HSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGR-YGPCSKLNQGKSRNTPSLEEIL 89
H +V SSL+ P A+P G + L R YGPCS S L ++L
Sbjct: 18 HYIVVETSSLLKPKAICSGLKAMPSSNG--TWVALHRPYGPCSPSPTTTSPPL--LVDML 73
Query: 90 RRDQQRLHLKNSRRLQKAIPD---------------NFKKTKAFTFPAKTGIVAADEYYI 134
R D +LH RR A D ++K +F ++
Sbjct: 74 RWD--KLHTDAIRRKATAGGDVVLEPDKPIVDVQQSDYKMQASFGIGTGGRSGSSSSSSS 131
Query: 135 VV----AIGKPKQYVSLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPCN 188
+ AI P + +DT + W QC PC C Q++ FDP +S+T + +PC
Sbjct: 132 RISRPSAIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCG 191
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
S C L + CS+ +C Y + Y DG +G + D +T+ N
Sbjct: 192 SAACGELGRY-----GAGCSNNQCQYFVDYGDGRATSGTYMVDALTL-----NPSTVVMN 241
Query: 249 FLLGCTDNNTGDQNGA-SGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGK 304
F GC+ G+ + + SG M L G S++S+T ++ F YC+ P S+G+++ G
Sbjct: 242 FRFGCSHAVRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCVPDP-SSSGFLSLGG 300
Query: 305 PDTVNK--KFVKYTPIVTTPEQ-SEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTI 361
P +F + TP+V P Y + L GI VGG RL + F +DS I
Sbjct: 301 PADGGGAGRFAR-TPLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAG-GAVMDSSVI 358
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
IT+ P Y ALR AFR M Y G DTCYD + +V VP +++ F GG +
Sbjct: 359 ITQLPPTAYRALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVV 418
Query: 422 ELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
LD G +V + CL F P D +GNVQQ+ +EV YDV G +GF G C
Sbjct: 419 RLDAMGVMV-----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 145/461 (31%), Positives = 224/461 (48%), Gaps = 39/461 (8%)
Query: 29 LSHSYIVSVSSLIPPTVCNRTRTALPQGPGK----VSLEVLGRYGPCSKLNQGKSRNTPS 84
L+ +++ V+ + P C + L + GK VS ++ Y CS
Sbjct: 17 LAITFMCGVAEIAPGLNCRSSDKILNRKVGKRSHSVSFPLIHIYSECSPFRPPNRTWESL 76
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQY 144
+ E +R D RL R K + K+ P ++G + EY I V G PKQ
Sbjct: 77 MSEKIRGDANRL------RFLKRTSRSSKEDANANVPVRSG---SGEYIIQVDFGTPKQS 127
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQ 204
+ L+DTGS + W CK C C P FDP+KS ++ C+S C+ + +G
Sbjct: 128 MYTLIDTGSDVAWIPCKQCQGC-HSTAPIFDPAKSSSYKPFACDSQPCQEI------SGN 180
Query: 205 DKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGA 264
+SK C +++ Y DG+ G A+D +T+ G+ Y + F GC ++ + D +
Sbjct: 181 CGGNSK-CQFEVLYGDGTQVDGTLASDAITL----GSQYLPNFSF--GCAESLSEDTYSS 233
Query: 265 SGIMGLDRGPVSIISKTNISYFF-----YCLHSPYGSTGYITFGKPDTVNKKFVKYTPIV 319
G+MGL G +S++++ + F YCL S S+G + GK V+ +K+T ++
Sbjct: 234 PGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGSLVLGKEAAVSSSSLKFTTLI 293
Query: 320 TTPEQSEFYHITLTGISVGGERLPLKASYFTK-LSTEIDSGTIITRFPAPVYSALRSAFR 378
P FY +TL ISVG R+ + A+ T IDSGT IT Y LR AFR
Sbjct: 294 KDPSFPTFYFVTLKAISVGNTRISVPATNIASGGGTIIDSGTTITYLVPSAYKDLRDAFR 353
Query: 379 KRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVC 438
+++ + +ED+ DTCYDLS+ +V VP IT+H VDL L L+ + C
Sbjct: 354 QQLSSLQ-PTPVEDM-DTCYDLSS-SSVDVPTITLHLDRNVDLVLPKENILITQESGLSC 410
Query: 439 LGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L F+ +D SI +GNVQQ+ + + +DV ++GF C
Sbjct: 411 LAFS--STDSRSI-IGNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 120/328 (36%), Positives = 171/328 (52%), Gaps = 40/328 (12%)
Query: 155 ITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPY 214
ITWTQCKPC+ C + FDPS S T+S C +T Y
Sbjct: 98 ITWTQCKPCVRCLKDSHRHFDPSASLTYSLGSCIPSTVGNT------------------Y 139
Query: 215 DIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-QNGASGIMGLDRG 273
++ Y D S G + D MT++ + F ++ F GC NN GD +GA G++GL +G
Sbjct: 140 NMTYGDKSTSVGNYGCDTMTLEPSD---VFPKFQF--GCGRNNEGDFGSGADGMLGLGQG 194
Query: 274 PVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTP-----EQS 325
+S +S+T + F YCL S G + FG+ T ++ +K+T +V P E+S
Sbjct: 195 QLSTVSQTASKFKKVFSYCLPEE-DSIGSLLFGEKAT-SQSSLKFTSLVNGPGTSGLEES 252
Query: 326 EFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYK 385
+Y + L ISVG +RL + +S F T IDSGT+IT P YSAL +AF+K M KY
Sbjct: 253 GYYFVKLLDISVGNKRLNVPSSVFASPGTIIDSGTVITCLPQRAYSALTAAFKKAMAKYP 312
Query: 386 MGKGIE---DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFA 442
+ G D+ DTCY+LS K V++P+I +HF G D+ L+ + + ++CL FA
Sbjct: 313 LSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRVIWGNDASRLCLAFA 372
Query: 443 -LLPSDPNSIL--LGNVQQRGYEVHYDV 467
S NS L +GN QQ V YD+
Sbjct: 373 GNSKSTMNSELTIIGNRQQVSLTVLYDI 400
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 120/365 (32%), Positives = 176/365 (48%), Gaps = 31/365 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + +++G P + + DTGS + WTQCKPC +C QQ P FDPSKS T+ + C+S
Sbjct: 82 EYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPSKSTTYKNVACSSP 141
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGY-FARYPF 249
C + +G EC Y IAY D S G A D +T+Q +G F R
Sbjct: 142 VCS-----YSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVAFPRT-- 194
Query: 250 LLGCTDNNTGDQNG-ASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS-----TGYI 300
++GC +N G N SGI+GL RGP S++++ + F YCL P G+ + +
Sbjct: 195 VIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCL-IPIGTGSTNDSTKL 253
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGER--LPLKASYFTKLSTE--- 355
FG V+ TPI ++ + FY + L +SVG + P AS KL E
Sbjct: 254 NFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGAS---KLGGESNI 310
Query: 356 -IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
IDSGT +T P+ + ++ SA + M + + D C+ + +P +T+H
Sbjct: 311 IIDSGTTLTYLPSALLNSFGSAISQSM-SLPHAQDPSEFLDYCF-ATTTDDYEMPPVTMH 368
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
F G D+ L V S +CL F P D N + GN+ Q + V YD+ + F
Sbjct: 369 F-EGADVPLQRENLFVRLSDDTICLAFGSFPDD-NIFIYGNIAQSNFLVGYDIKNLAVSF 426
Query: 475 GPGNC 479
P +C
Sbjct: 427 QPAHC 431
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 121/372 (32%), Positives = 175/372 (47%), Gaps = 39/372 (10%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + +AIG P Q VS LLDTGS + WTQC PC C Q DP F P +S ++ + C
Sbjct: 101 EYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGESASYEPMRCAGQ 160
Query: 191 TCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C +L C + C Y Y DG+ G +AT+R T G+ P
Sbjct: 161 LCSDIL-------HHGCEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGGD-RLMTVPL 212
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGS--TGYITFGK--- 304
GC N G N SGI+G R P+S++S+ +I F YCL S YGS + FG
Sbjct: 213 GFGCGSMNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTS-YGSGRKSTLLFGSLSG 271
Query: 305 ---PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEI 356
D V+ TP++ + + FY++ L G++VG RL + S F +
Sbjct: 272 GVYGDATGP--VQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIV 329
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYDL-------SAYKTVVV 408
DSGT +T P V + + AFR++++ + G ED C+ + S+ V V
Sbjct: 330 DSGTALTLLPGAVLAEVVRAFRQQLRLPFANGGNPED--GVCFLVPAAWRRSSSTSQVPV 387
Query: 409 PKITIHFLGGVDLELDV-RGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
P++ HF D +LD+ R V++ R+ L L S + +GN+ Q+ V YD+
Sbjct: 388 PRMVFHF---QDADLDLPRRNYVLDDHRKGRLCLLLADSGDDGSTIGNLVQQDMRVLYDL 444
Query: 468 AGRRLGFGPGNC 479
L F P C
Sbjct: 445 EAETLSFAPAQC 456
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 128/366 (34%), Positives = 177/366 (48%), Gaps = 40/366 (10%)
Query: 136 VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
++IG P S ++DTGS + WTQCKPC C Q P FDP KS ++SK+ C+S C L
Sbjct: 3 LSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNAL 62
Query: 196 LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVN---GNGYFARYPFLLG 252
P C Y Y D S G AT+ T ++ N G G+ G
Sbjct: 63 -----PRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGF--------G 109
Query: 253 CTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISYFFYCL----HSPYGSTGYITFGKPDT 307
C N GD + SG++GL RGP+S+IS+ + F YCL S S+ +I
Sbjct: 110 CGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGI 169
Query: 308 VNK-------KFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE----- 355
VNK + K ++ P+Q FY++ L GI+VG +RL ++ S F +L+ +
Sbjct: 170 VNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTF-ELAEDGTGGM 228
Query: 356 -IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL-SAYKTVVVPKITI 413
IDSGT IT + L+ F RM G L D C+ L A K + VPK+
Sbjct: 229 IIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGL-DLCFKLPDAAKNIAVPKMIF 287
Query: 414 HFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
HF G DLEL +V +S V L A+ S+ SI GNVQQ+ + V +D+ +
Sbjct: 288 HF-KGADLELPGENYMVADSSTGV-LCLAMGSSNGMSI-FGNVQQQNFNVLHDLEKETVS 344
Query: 474 FGPGNC 479
F P C
Sbjct: 345 FVPTEC 350
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 113/343 (32%), Positives = 170/343 (49%), Gaps = 27/343 (7%)
Query: 147 LLLDTGSGITWTQCKPC--IHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQ 204
+LLDT S + W QC PC C Q D +DPSKS++ C+S TC+ L +
Sbjct: 184 MLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGPYANGCSS 243
Query: 205 DKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTG--DQ 261
S+ +C Y + Y DGS +G D++++ + + P F GC+ G +
Sbjct: 244 SSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTS------QVPKFEFGCSHAARGSFSR 297
Query: 262 NGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPI 318
+ +GIM L RG S++S+T+ Y F YC G+ G P + ++ TP+
Sbjct: 298 SKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPPTASHKGFFVLGVPRRSSSRYA-VTPM 356
Query: 319 VTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFR 378
+ TP Y + L I+V G+RL + + F +DS T+ITR P Y ALRSAFR
Sbjct: 357 LKTP---MLYQVRLEAIAVAGQRLDVPPTVFAA-GAALDSRTVITRLPPTAYQALRSAFR 412
Query: 379 KRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHF-LGGVDLELDVRGTLVVESVRQV 437
+M Y+ L DTCYD + ++++P I++ F G ++LD G L
Sbjct: 413 DKMSMYRPAAANGQL-DTCYDFTGVSSIMLPTISLVFDRTGAGVQLDPSGVLFGS----- 466
Query: 438 CLGFALLPSDPNSI-LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL FA D + ++G +Q + EV Y+VAG +GF G C
Sbjct: 467 CLAFASTAGDDRATGIIGFLQLQTIEVLYNVAGGSVGFRRGAC 509
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 172 bits (435), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 129/415 (31%), Positives = 193/415 (46%), Gaps = 42/415 (10%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD-EYYIVVAIGKPKQ 143
L LRR R+ S L P + A+ ++A+D EY + + IG P +
Sbjct: 50 LSRALRRSSARVATLQS--LAALAPGDAITA------ARILVLASDGEYLMEMGIGTPTR 101
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
Y S +LDTGS + WTQC PC+ C Q P+FDP++S T+ + C S C L ++P
Sbjct: 102 YYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNAL--YYP--- 156
Query: 204 QDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNG 263
C K C Y Y D + G A + T F GC + N G
Sbjct: 157 --LCYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISF--GCGNLNAGSLAN 212
Query: 264 ASGIMGLDRGPVSIISKTNISYFFYCLH---SPYGSTGYITFGKPDTVN-----KKFVKY 315
SG++G RG +S++S+ F YCL SP S Y FG T+N + V+
Sbjct: 213 GSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPVPSRLY--FGVYATLNSTNASSEPVQS 270
Query: 316 TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITRFPAPV 369
TP V P Y + +TGISVGG LP+ + F T+ IDSGT IT P
Sbjct: 271 TPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPA 330
Query: 370 YSALRSAFRKRMKKYKMGKGIEDLFDTCYDL--SAYKTVVVPKITIHFLGGVDLELDVRG 427
Y A+R+AF ++ + + DTC+ ++V +P++ +HF G D EL ++
Sbjct: 331 YDAVRAAFASQITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHF-DGADWELPLQN 389
Query: 428 TLVVESVR--QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
++V+ +CL A S + ++G+ Q + + V YD+ + F P C+
Sbjct: 390 YMLVDPSTGGGLCLAMA---SSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPCH 441
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 132/362 (36%), Positives = 184/362 (50%), Gaps = 34/362 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
E+ + +AIG P + S ++DTGS + WTQCKPC C Q P FDP KS +FSK+ C+S
Sbjct: 99 EFLMNLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQ 158
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
CK L Q C S C Y Y D S G AT+ T +V+ P +
Sbjct: 159 LCKAL-------PQSSC-SDSCEYLYTYGDYSSTQGTMATETFTFGKVS-------IPNV 203
Query: 251 -LGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGS-TGYITFGKPDT 307
GC ++N GD SG++GL RGP+S++S+ + F YCL S + T + G +
Sbjct: 204 GFGCGEDNEGDGFTQGSGLVGLGRGPLSLVSQLKEAKFSYCLTSIDDTKTSTLLMGSLAS 263
Query: 308 VN--KKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSG 359
VN ++ TP++ P Q FY+++L GISVGG RLP+K S F +L + IDSG
Sbjct: 264 VNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTF-QLQDDGTGGLIIDSG 322
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL-SAYKTVVVPKITIHFLGG 418
T IT + ++ F +M G L + CY+L S + VPK+ +HF G
Sbjct: 323 TTITYLEESAFDLVKKEFTSQMGLPVDNSGATGL-ELCYNLPSDTSELEVPKLVLHFT-G 380
Query: 419 VDLELDVRGTLVVESVRQV-CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
DLEL ++ +S V CL S + GNVQQ+ V +D+ L F P
Sbjct: 381 ADLELPGENYMIADSSMGVICLAMG---SSGGMSIFGNVQQQNMFVSHDLEKETLSFLPT 437
Query: 478 NC 479
NC
Sbjct: 438 NC 439
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 129/415 (31%), Positives = 193/415 (46%), Gaps = 42/415 (10%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD-EYYIVVAIGKPKQ 143
L LRR R+ S L P + A+ ++A+D EY + + IG P +
Sbjct: 50 LSRALRRSSARVATLQS--LAALAPGDAITA------ARILVLASDGEYLMEMGIGTPTR 101
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
Y S +LDTGS + WTQC PC+ C Q P+FDP++S T+ + C S C L ++P
Sbjct: 102 YYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNAL--YYP--- 156
Query: 204 QDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNG 263
C K C Y Y D + G A + T F GC + N G
Sbjct: 157 --LCYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISF--GCGNLNAGLLAN 212
Query: 264 ASGIMGLDRGPVSIISKTNISYFFYCLH---SPYGSTGYITFGKPDTVN-----KKFVKY 315
SG++G RG +S++S+ F YCL SP S Y FG T+N + V+
Sbjct: 213 GSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPVPSRLY--FGVYATLNSTNASSEPVQS 270
Query: 316 TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITRFPAPV 369
TP V P Y + +TGISVGG LP+ + F T+ IDSGT IT P
Sbjct: 271 TPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPA 330
Query: 370 YSALRSAFRKRMKKYKMGKGIEDLFDTCYDL--SAYKTVVVPKITIHFLGGVDLELDVRG 427
Y A+R+AF ++ + + DTC+ ++V +P++ +HF G D EL ++
Sbjct: 331 YDAVRAAFASQITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHF-DGADWELPLQN 389
Query: 428 TLVVESVR--QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
++V+ +CL A S + ++G+ Q + + V YD+ + F P C+
Sbjct: 390 YMLVDPSTGGGLCLAMA---SSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPCH 441
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 171/338 (50%), Gaps = 30/338 (8%)
Query: 55 QGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQK--AIPDNF 112
Q G V + + +GP S L + S ++L D R+ NSR +K P +
Sbjct: 35 QSGGVVQMTIHHVHGPGSSL---APQPPVSFSDVLAWDDARVKTLNSRLTRKDTRFPKSV 91
Query: 113 KKTKAFTFPAKTGI-------VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI- 164
K FP + + + YY+ V G P +Y S+++DTGS ++W QCKPC+
Sbjct: 92 LTKKDIRFPKSVSVPLNPGASIGSGNYYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVV 151
Query: 165 HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGE 224
+C Q DP FDPS SKT+ + C S+ C L++ N + SS C Y +Y D S
Sbjct: 152 YCHVQADPLFDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYS 211
Query: 225 TGFWATDRMTI---QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKT 281
G+ + D +T+ Q + G F+ GC ++ G A+GI+GL R +S++ +
Sbjct: 212 MGYLSQDLLTLAPSQTLPG--------FVYGCGQDSDGLFGRAAGILGLGRNKLSMLGQV 263
Query: 282 NISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVG 338
+ + F YCL + G G+++ GK + K+TP+ T P Y + LT I+VG
Sbjct: 264 SSKFGYAFSYCLPT-RGGGGFLSIGKASLAGSAY-KFTPMTTDPGNPSLYFLRLTAITVG 321
Query: 339 GERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSA 376
G L + A+ + ++ T IDSGT+ITR P VY+ + A
Sbjct: 322 GRALGVAAAQY-RVPTIIDSGTVITRLPMSVYTPFQQA 358
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 119/352 (33%), Positives = 169/352 (48%), Gaps = 27/352 (7%)
Query: 137 AIGKPKQYVSLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
AI P + +DT + W QC PC C Q++ FDP +S+T + +PC S C
Sbjct: 154 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 213
Query: 195 LLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
L + CS+ +C Y + Y DG +G + D +T+ N F GC+
Sbjct: 214 LGRY-----GAGCSNNQCQYFVDYGDGRATSGTYMVDALTL-----NPSTVVMNFRFGCS 263
Query: 255 DNNTGDQNGA-SGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNK 310
G+ + + SG M L G S++S+T ++ F YC+ P S+G+++ G P
Sbjct: 264 HAVRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCVPDP-SSSGFLSLGGPADGGG 322
Query: 311 --KFVKYTPIVTTPEQ-SEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPA 367
+F + TP+V P Y + L GI VGG RL + F +DS IIT+ P
Sbjct: 323 AGRFAR-TPLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAG-GAVMDSSVIITQLPP 380
Query: 368 PVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRG 427
Y ALR AFR M Y G DTCYD + +V VP +++ F GG + LD G
Sbjct: 381 TAYRALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMG 440
Query: 428 TLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+V + CL F P D +GNVQQ+ +EV YDV G +GF G C
Sbjct: 441 VMV-----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 133/426 (31%), Positives = 196/426 (46%), Gaps = 39/426 (9%)
Query: 87 EILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVA--------ADEYYIVVAI 138
E+L R QR L+ + + A + G+VA + +Y +A+
Sbjct: 88 ELLARRLQRDELRAAWIISTAAANGTPPPDVVGLSTGRGLVAPVVSRAPTSGDYIAKIAV 147
Query: 139 GKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEW 198
G P L LDT S +TW QC+PC C Q P FDP S ++ ++ ++ C+ L
Sbjct: 148 GTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQALGR- 206
Query: 199 FPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL-LGCTDNN 257
+G C Y + Y DG G + ++E R +L +GC +N
Sbjct: 207 ---SGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSIGCGHDN 263
Query: 258 TGDQNG-ASGIMGLDRGPVSIISKTNI----SYFFYCL----HSPYGSTGYITFGKPDTV 308
G A+GI+GL RG +SI + + F YCL P + +TFG
Sbjct: 264 KGLFGAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLTFGAGAVD 323
Query: 309 NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLP------LKASYFTKLSTEI-DSGTI 361
+TP V FY++ L G+SVGG R+P L+ +T I DSGT
Sbjct: 324 TSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGHGGVILDSGTT 383
Query: 362 ITRFPAPVYSALRSAFRKR---MKKYKMGKGIEDLFDTCYDLSA----YKTVVVPKITIH 414
+TR P Y+A R AFR + + G G LFDTCY + V VP +++H
Sbjct: 384 VTRLARPAYTAFRDAFRAAATGLGQVSTG-GPSGLFDTCYTVGGRAGLRHCVKVPAVSMH 442
Query: 415 FLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
F GGV+L L + L+ V+S VC FA D + ++GN+ Q+G+ V YD+ G+R+G
Sbjct: 443 FAGGVELSLQPKNYLITVDSRGTVCFAFAGT-GDRSVSVIGNILQQGFRVVYDIGGQRVG 501
Query: 474 FGPGNC 479
F P +C
Sbjct: 502 FAPNSC 507
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 114/354 (32%), Positives = 175/354 (49%), Gaps = 20/354 (5%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTFSKIPCNS 189
Y + +G P + +++DTGS +TW QC PC + C +Q P FDP S +++ + C++
Sbjct: 136 NYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCST 195
Query: 190 TTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C L N SS C Y +Y D S G+ + D ++ G + F
Sbjct: 196 PQCNDL-STATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSF------GSNSVPNF 248
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKT--NISYFF-YCLHSPYGSTGYITFGKPD 306
GC +N G ++G+MGL R +S++ + + Y F YCL S + +
Sbjct: 249 YYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPS----SSSSGYLSIG 304
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFP 366
+ N YTP+V++ Y I L+G++V G+ L + +S ++ L T IDSGT+ITR P
Sbjct: 305 SYNPGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEYSSLPTIIDSGTVITRLP 364
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVR 426
VY AL A MK K + DTC+ + ++ VP +++ F GG L+L +
Sbjct: 365 TTVYDALSKAVAGAMKGTKRADAYS-ILDTCF-VGQASSLRVPAVSMAFSGGAALKLSAQ 422
Query: 427 GTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
LV CL FA P+ +I +GN QQ+ + V YDV R+GF G C
Sbjct: 423 NLLVDVDSSTTCLAFA--PARSAAI-IGNTQQQTFSVVYDVKSNRIGFAAGGCT 473
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 125/378 (33%), Positives = 185/378 (48%), Gaps = 41/378 (10%)
Query: 128 AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPC 187
+A Y + ++IG P S+L DTGS + WTQC PC C+ + P F P+ S TFSK+PC
Sbjct: 86 SAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPC 145
Query: 188 NSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFAR 246
S+ C+ L + C++ C Y Y G G T G+ AT+ + + G F
Sbjct: 146 ASSLCQFLTSPY-----LTCNATGCVYYYPY--GMGFTAGYLATETLHV----GGASFPG 194
Query: 247 YPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGY-ITFGKP 305
F GC+ N G N +SGI+GL R P+S++S+ + F YCL S + I FG
Sbjct: 195 VAF--GCSTEN-GVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCLRSDADAGDSPILFGSL 251
Query: 306 DTVNKKFVKYTPIVTTPEQ--SEFYHITLTGISVGGERLPLKASYFTKL---------ST 354
V V+ TP++ PE S +Y++ LTGI+VG LP+ ++ F T
Sbjct: 252 AKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLVGGT 311
Query: 355 EIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL---FDTCYDLSAY---KTVVV 408
+DSGT +T Y+ ++ AF +M + + FD C+D +A V V
Sbjct: 312 IVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGVPV 371
Query: 409 PKITIHFLGGVDLELDVR---GTLVVESVRQV---CLGFALLPSDPNSI-LLGNVQQRGY 461
P + + F GG + + R G + V+S + CL L S+ SI ++GNV Q
Sbjct: 372 PTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECL-LVLPASEKLSISIIGNVMQMDL 430
Query: 462 EVHYDVAGRRLGFGPGNC 479
V YD+ G F P +C
Sbjct: 431 HVLYDLDGGMFSFAPADC 448
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 127/371 (34%), Positives = 188/371 (50%), Gaps = 31/371 (8%)
Query: 121 PAKTGIVAAD-EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKS 179
P +GI +Y+ + +G P + V ++ DTGS ++W QC PC C +Q+DP F+PS S
Sbjct: 2 PLISGIAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLS 61
Query: 180 KTFSKIPCNSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVDGSGETGFWATDRMTIQEV 238
+F + C S+ C L CS K +C Y ++Y DGS G ++T+ ++ E
Sbjct: 62 SSFKPLACASSICGKL-------KIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFGE- 113
Query: 239 NGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL-HSPY 294
A +GC NN G +GA+G++GL RGP+S S+T SY F YCL
Sbjct: 114 -----HAVRSVAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRES 168
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS- 353
+ FG P V +K ++T ++ +Y++ L I V G + + F S
Sbjct: 169 AIAASLVFG-PSAVPEK-ARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSR 226
Query: 354 ----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVP 409
+DSGT I+R P Y+ALR AFR + + GI LFDTCYDLS+ KT +P
Sbjct: 227 GTGGVIVDSGTAISRLTTPAYTALRDAFRS-LVTFPSAPGIS-LFDTCYDLSSMKTATLP 284
Query: 410 KITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVA 468
+ + F GG + L G LV V+ CL FA P + ++GNVQQ+ + + D
Sbjct: 285 AVVLDFDGGASMPLPADGILVNVDDEGTYCLAFA--PEEEAFSIIGNVQQQTFRISIDNQ 342
Query: 469 GRRLGFGPGNC 479
++G P C
Sbjct: 343 KEQMGIAPDQC 353
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 114/404 (28%), Positives = 181/404 (44%), Gaps = 32/404 (7%)
Query: 95 RLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSG 154
R L S + +P + + P +G +Y +++G P + S++ DTGS
Sbjct: 6 RSKLAASSLITSEVPYPPSVSTDYESPVASG---GGDYVTTISLGTPAKVFSVIADTGSD 62
Query: 155 ITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPY 214
+ W QCKPC C Q+DP FDP S +++ + C T C L K S +C Y
Sbjct: 63 LIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTLCDSLPR--------KSCSPDCDY 114
Query: 215 DIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGP 274
Y DGSG G +++ +T+ G A+ GC N G N ASG++GL RG
Sbjct: 115 SYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKN-IAFGCGHLNRGSFNDASGLVGLGRGN 173
Query: 275 VSIISKTNISY---FFYCL---HSPYGSTGYITFGKPDTVN----KKFVKYTPIVTTPEQ 324
+S +S+ + F YCL T + FG + + K +TP++ P
Sbjct: 174 LSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAM 233
Query: 325 SEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRK 379
FY++ L IS+ G L + A F DSGT +T P Y + A R
Sbjct: 234 ESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRS 293
Query: 380 RMKKYKMGKGIEDLFDTCYDLSAYKT---VVVPKITIHFLGGVDLELDVRGTLVVESVRQ 436
++ K+ G D CYD+S K + +P + HF G D +L V + +
Sbjct: 294 KISFPKI-DGSSAGLDLCYDVSGSKASYKMKIPAMVFHFE-GADYQLPVENYFIAANDAG 351
Query: 437 VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ A++ S+ + + GN+ Q+ + V YD+ ++G+ P C+
Sbjct: 352 TIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQCD 395
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 128/413 (30%), Positives = 191/413 (46%), Gaps = 37/413 (8%)
Query: 84 SLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQ 143
S+ + R D RL +S+ + T P+ Y + +G P Q
Sbjct: 40 SIIALARADDARLLFLSSKAASSGGVTSAPVASGQTPPS---------YVVRAGLGTPVQ 90
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
+ L LDT + TW+ C PC C F P+ S +++ +PC S C + P
Sbjct: 91 QLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDWCPLFEGQPCPAN 148
Query: 204 QDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQN 262
QD + C + + D S + +D + + G A Y F GC G
Sbjct: 149 QDASAPLPACAFSKPFADTSFQASL-GSDTLRL----GKDAIAGYAF--GCVGAVAGPTT 201
Query: 263 G--ASGIMGLDRGPVSIISKTNISY---FFYCLHS--PYGSTGYITFGKPDTVNKKFVKY 315
G++GL RGP+S++S+T +Y F YCL S Y +G + G + V+Y
Sbjct: 202 NLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRSYYFSGSLRLGAAG--QPRNVRY 259
Query: 316 TPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-----TKLSTEIDSGTIITRFPAPVY 370
TP++T P + Y++ +TG+SVG + + A F T T IDSGT+ITR+ APVY
Sbjct: 260 TPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVY 319
Query: 371 SALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV 430
+ALR FR+++ G FDTC++ P +T+H GGVDL L + TL+
Sbjct: 320 AALREEFRRQVAA-PSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLI 378
Query: 431 VESVRQV-CLGFALLP--SDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
S + CL A P + ++ N+QQ+ V DVAG R+GF CN
Sbjct: 379 HSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPCN 431
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 115/374 (30%), Positives = 182/374 (48%), Gaps = 28/374 (7%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY++ + +G P ++V L+LDTGS ++W QC PC C +Q + P S T+ I C
Sbjct: 170 EYFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTYRNISCYDP 229
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEV--NGNGYFAR-Y 247
C+ L+ P K ++ CPY Y DGS TG +A++ T+ NG F +
Sbjct: 230 RCQ-LVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQVV 288
Query: 248 PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGY---IT 301
+ GC N G GASG++GL RGP+S S+ Y F YCL + +T +
Sbjct: 289 DVMFGCGHWNKGFFYGASGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNTSVSSKLI 348
Query: 302 FGK-PDTVNKKFVKYTPIVT---TPEQSEFYHITLTGISVGGERLPLKASYFTKLS---- 353
FG+ + +N + +T ++ TP+++ FY++ + I VGGE L + + S
Sbjct: 349 FGEDKELLNNHNLNFTTLLAGEETPDET-FYYLQIKSIMVGGEVLDISEQTWHWSSEGAA 407
Query: 354 ------TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS-AYKTV 406
T IDSG+ +T FP Y ++ AF K++K ++ + + CY++S A V
Sbjct: 408 ADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAAD-DFVMSPCYNVSGAMMQV 466
Query: 407 VVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHY 465
+P IHF G E +CL P+ + ++GN+ Q+ + + Y
Sbjct: 467 ELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPNHSHLTIIGNLLQQNFHILY 526
Query: 466 DVAGRRLGFGPGNC 479
DV RLG+ P C
Sbjct: 527 DVKRSRLGYSPRRC 540
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 128/408 (31%), Positives = 198/408 (48%), Gaps = 31/408 (7%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAK--TGI-VAADEYYIVVAIGKP 141
L +RRD R+ R K IP + + + F + +G+ + EY++ + +G P
Sbjct: 81 LHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSP 140
Query: 142 KQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPP 201
+ +++D+GS + W QC+PC C +Q DP FDP+KS +++ + C S+ C +
Sbjct: 141 PRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIE----- 195
Query: 202 NGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ 261
C S C Y++ Y DGS G A + +T + +GC N G
Sbjct: 196 --NSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKT------VVRNVAMGCGHRNRGMF 247
Query: 262 NGASGIMGLDRGPVSII---SKTNISYFFYCLHS-PYGSTGYITFGKPDTVNKKFVKYTP 317
GA+G++G+ G +S + S F YCL S STG + FG+ + P
Sbjct: 248 IGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGR--EALPVGASWVP 305
Query: 318 IVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRFPAPVYSA 372
+V P FY++ L G+ VGG R+PL F T +D+GT +TR P Y A
Sbjct: 306 LVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVA 365
Query: 373 LRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-V 431
R F+ + G+ +FDTCYDLS + +V VP ++ +F G L L R L+ V
Sbjct: 366 FRDGFKSQTANLPRASGVS-IFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPV 424
Query: 432 ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ C FA P+ + ++GN+QQ G +V +D A +GFGP C
Sbjct: 425 DDSGTYCFAFAASPTGLS--IIGNIQQEGIQVSFDGANGFVGFGPNVC 470
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 142/417 (34%), Positives = 200/417 (47%), Gaps = 39/417 (9%)
Query: 80 RNTPSLEEI---LRRDQQRLHLKNSRRLQ-KAIPDNFKKTKAFTFPAKTGIVAADEYYIV 135
+N LE + ++R + RL N+ L + PD+ + +A P G EY I
Sbjct: 58 KNLTKLERVQHGIKRGKSRLQKLNAMVLAASSTPDSEDQLEA---PIHAG---NGEYLIE 111
Query: 136 VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
+AIG P +LDTGS + WTQCKPC C +Q P FDP KS +FSK+ C S+ C L
Sbjct: 112 LAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCGSSLCSAL 171
Query: 196 LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTD 255
+G C Y +Y D S G AT+ T + F GC +
Sbjct: 172 PSSTCSDG--------CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGF--GCGE 221
Query: 256 NNTGDQ-NGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST--GYITFGKPDTV-NKK 311
+N GD ASG++GL RGP+S++S+ F YCL +P T + G V + K
Sbjct: 222 DNEGDGFEQASGLVGLGRGPLSLVSQLKEQRFSYCL-TPIDDTKESVLLLGSLGKVKDAK 280
Query: 312 FVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFP 366
V TP++ P Q FY+++L ISVG RL ++ S F IDSGT IT
Sbjct: 281 EVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQ 340
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV-VPKITIHFLGGVDLELDV 425
Y AL+ F + K + K D C+ L + T V +PK+ HF GG DLEL
Sbjct: 341 QKAYEALKKEFISQT-KLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFKGG-DLELPA 398
Query: 426 RGTLVVESVRQVCLGFALLPSDPNS--ILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
++ +S LG A L +S + GNVQQ+ V++D+ + F P +C+
Sbjct: 399 ENYMIGDSN----LGVACLAMGASSGMSIFGNVQQQNILVNHDLEKETISFVPTSCD 451
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 140/416 (33%), Positives = 199/416 (47%), Gaps = 38/416 (9%)
Query: 80 RNTPSLEEI---LRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVV 136
+N LE + ++R + RL N+ L + D+ + +A P G EY + +
Sbjct: 59 KNLTKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQLEA---PIHAG---NGEYLMEL 112
Query: 137 AIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILL 196
AIG P +LDTGS + WTQCKPC C +Q P FDP KS +FSK+ C S+ C +
Sbjct: 113 AIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSSLCSAV- 171
Query: 197 EWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDN 256
S C Y +Y D S G AT+ T + F GC ++
Sbjct: 172 -------PSSTCSDGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGF--GCGED 222
Query: 257 NTGDQ-NGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST--GYITFGKPDTV-NKKF 312
N GD ASG++GL RGP+S++S+ F YCL +P T + G V + K
Sbjct: 223 NEGDGFEQASGLVGLGRGPLSLVSQLKEPRFSYCL-TPMDDTKESILLLGSLGKVKDAKE 281
Query: 313 VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFPA 367
V TP++ P Q FY+++L GISVG RL ++ S F IDSGT IT
Sbjct: 282 VVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYIEQ 341
Query: 368 PVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV-VPKITIHFLGGVDLELDVR 426
+ AL+ F + K + K D C+ L + T V +PKI HF GG DLEL
Sbjct: 342 KAFEALKKEFISQT-KLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKGG-DLELPAE 399
Query: 427 GTLVVESVRQVCLGFALLPSDPNS--ILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
++ +S LG A L +S + GNVQQ+ V++D+ + F P +C+
Sbjct: 400 NYMIGDS----NLGVACLAMGASSGMSIFGNVQQQNILVNHDLEKETISFVPTSCD 451
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 115/379 (30%), Positives = 179/379 (47%), Gaps = 28/379 (7%)
Query: 118 FTFPAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDP 176
F P +G + + +Y++ +G P Q SL++D+GS + W QC PC+ C Q P + P
Sbjct: 50 FQSPVVSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLYAP 109
Query: 177 SKSKTFSKIPCNSTTCKIL--LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMT 234
S S TF+ +PC S C ++ E FP D C Y+ Y D S G +A + T
Sbjct: 110 SNSSTFNPVPCLSPECLLIPATEGFP---CDFHYPGACAYEYRYADTSLSKGVFAYESAT 166
Query: 235 IQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLH 291
+ +V + GC +N G A G++GL +GP+S S+ +Y F YCL
Sbjct: 167 VDDVRID------KVAFGCGRDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLV 220
Query: 292 S---PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKAS- 347
+ P + ++ FG +++TPIV+ Y++ + + VGGE LP+ S
Sbjct: 221 NYLDPTSVSSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSA 280
Query: 348 ----YFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAY 403
+ + DSGT +T + P Y + +AF K + +Y ++ L D C D++
Sbjct: 281 WSLDFLGNGGSIFDSGTTVTYWLPPAYRNILAAFDKNV-RYPRAASVQGL-DLCVDVTGV 338
Query: 404 KTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSI-LLGNVQQRGY 461
P TI LGG + +G V+ V CL A LPS +GN+ Q+ +
Sbjct: 339 DQPSFPSFTI-VLGGGAVFQPQQGNYFVDVAPNVQCLAMAGLPSSVGGFNTIGNLLQQNF 397
Query: 462 EVHYDVAGRRLGFGPGNCN 480
V YD R+GF P C+
Sbjct: 398 LVQYDREENRIGFAPAKCS 416
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 129/431 (29%), Positives = 201/431 (46%), Gaps = 41/431 (9%)
Query: 63 EVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPA 122
E++ R P S L + + + +RR R+H +F++T A P
Sbjct: 34 ELVHRDSPKSPLYNSQQTHLQRWNKAMRRSVSRVH-------------HFQRTAATVSPK 80
Query: 123 KTG---IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKS 179
+ I EY + +++G P + + DTGS + WTQC PC C +Q P FDP S
Sbjct: 81 EVESEIIANGGEYLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPKSS 140
Query: 180 KTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEV 238
KT+ + C++ C+ L E CSS++ C Y Y D S G A D +T+
Sbjct: 141 KTYRDLSCDTRQCQNLGE------SSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPST 194
Query: 239 NGNG-YFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL---- 290
NG YF + G +N T D+ SGI+GL GP+S+IS+ S F YCL
Sbjct: 195 NGGPVYFPKTVIGCGRRNNGTFDKKD-SGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFS 253
Query: 291 HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT 350
G++ + FG+ V+ V+ TP+++ + FY++TL +SVG +++ S F
Sbjct: 254 SESAGNSSKLHFGRNAVVSGSGVQSTPLISKNPDT-FYYLTLEAMSVGDKKIEFGGSSFG 312
Query: 351 KLSTE--IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVV 408
IDSGT +T FP ++ +A + + + L CY + + V
Sbjct: 313 GSEGNIIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPT--PDLKV 370
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVA 468
P IT HF G D+ L T ++ S +CL F S + + GNV Q + + YD+
Sbjct: 371 PVITAHF-NGADVVLQTLNTFILISDDVLCLAFN---STQSGAIFGNVAQMNFLIGYDIQ 426
Query: 469 GRRLGFGPGNC 479
G+ + F P +C
Sbjct: 427 GKSVSFKPTDC 437
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 128/413 (30%), Positives = 190/413 (46%), Gaps = 37/413 (8%)
Query: 84 SLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQ 143
S+ + R D RL +S+ + T P+ Y + +G P Q
Sbjct: 40 SIIALARADDARLLFLSSKAASSGGVTSAPVASGQTPPS---------YVVRAGLGTPVQ 90
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
+ L LDT + TW+ C PC C F P+ S +++ +PC S C + P
Sbjct: 91 QLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDWCPLFEGQPCPAN 148
Query: 204 QDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQN 262
QD + C + + D S + +D + + G A Y F GC G
Sbjct: 149 QDASAPLPACAFSKPFADTSFQASL-GSDTLRL----GKDAIAGYAF--GCVGAVAGPTT 201
Query: 263 G--ASGIMGLDRGPVSIISKTNISY---FFYCLHS--PYGSTGYITFGKPDTVNKKFVKY 315
G++GL RGP+S++S+T Y F YCL S Y +G + G + V+Y
Sbjct: 202 NLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAG--QPRNVRY 259
Query: 316 TPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-----TKLSTEIDSGTIITRFPAPVY 370
TP++T P + Y++ +TG+SVG + + A F T T IDSGT+ITR+ APVY
Sbjct: 260 TPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVY 319
Query: 371 SALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV 430
+ALR FR+++ G FDTC++ P +T+H GGVDL L + TL+
Sbjct: 320 AALREEFRRQVAA-PSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLI 378
Query: 431 VESVRQV-CLGFALLP--SDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
S + CL A P + ++ N+QQ+ V DVAG R+GF CN
Sbjct: 379 HSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPCN 431
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 127/414 (30%), Positives = 185/414 (44%), Gaps = 44/414 (10%)
Query: 86 EEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYV 145
E +RRD R+ S + +F + G+ Y + +++G P
Sbjct: 44 SEAVRRDSHRIAFL-SDATAAGKATTTNSSVSFQALLENGV---GGYNMNISVGTPLLTF 99
Query: 146 SLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
S++ DTGS + WTQC PC C QQ P F P+ S TFSK+PC S+ C+ L PN
Sbjct: 100 SVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQFL-----PNSIR 154
Query: 206 KCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGA 264
C++ C Y+ Y GSG T G+ AT+ + + G+ F F GC+ N G N
Sbjct: 155 TCNATGCVYNYKY--GSGYTAGYLATETLKV----GDASFPSVAF--GCSTEN-GVGNST 205
Query: 265 SGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGY-ITFGKPDTVNKKFVKYTPIVTTPE 323
SGI GL RG +S+I + + F YCL S + I FG + V+ TP V P
Sbjct: 206 SGIAGLGRGALSLIPQLGVGRFSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPA 265
Query: 324 -QSEFYHITLTGISVGGERLPLKASYF------TKLSTEIDSGTIITRFPAPVYSALRSA 376
+Y++ LTGI+VG LP+ S F T +DSGT +T Y ++ A
Sbjct: 266 VHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQA 325
Query: 377 FRKRMKKYKMGKGIEDLFDTCYDLS--AYKTVVVPKITIHFLGGVD---------LELDV 425
F + G L D C+ + + VP + + F GG + +E D
Sbjct: 326 FLSQTADVTTVNGTRGL-DLCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDS 384
Query: 426 RGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+G++ V CL D ++GNV Q + YD+ G F P +C
Sbjct: 385 QGSVTV-----ACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 433
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 167 bits (424), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 176/366 (48%), Gaps = 28/366 (7%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
Y + +G P Q + L LDT + TW+ C PC C F P+ S +++ +PC S
Sbjct: 78 SYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASD 135
Query: 191 TCKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C + P QD + C + + D S + +D + + G A Y F
Sbjct: 136 WCPLFEGQPCPANQDASAPLPACAFSKPFADTSFQASL-GSDTLRL----GKDAIAGYAF 190
Query: 250 LLGCTDNNTGDQNG--ASGIMGLDRGPVSIISKTNISY---FFYCLHS--PYGSTGYITF 302
GC G G++GL RGP+S++S+T Y F YCL S Y +G +
Sbjct: 191 --GCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRL 248
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-----TKLSTEID 357
G + V+YTP++T P + Y++ +TG+SVG + + A F T T ID
Sbjct: 249 GAAG--QPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVID 306
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG 417
SGT+ITR+ APVY+ALR FR+++ G FDTC++ P +T+H G
Sbjct: 307 SGTVITRWTAPVYAALREEFRRQVAA-PSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDG 365
Query: 418 GVDLELDVRGTLVVESVRQV-CLGFALLP--SDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
GVDL L + TL+ S + CL A P + ++ N+QQ+ V DVAG R+GF
Sbjct: 366 GVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGF 425
Query: 475 GPGNCN 480
CN
Sbjct: 426 AREPCN 431
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 167 bits (424), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 134/416 (32%), Positives = 199/416 (47%), Gaps = 37/416 (8%)
Query: 87 EILRRDQQRLHL-----KNSRRLQKAIPDNFKKTKAFTFP---AKTGIVAAD-EYYIVVA 137
E++ RD R +R+ A+ + + F AK I D EY I +
Sbjct: 32 EMIHRDSSRSPFFRPTETQFQRVANAVHRSVNRANHFHKAHKAAKATITQNDGEYLISYS 91
Query: 138 IGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLE 197
+G P + ++DTGS + W QCKPC C Q FDPSKS T+ +P +STTC+ + +
Sbjct: 92 VGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKILPFSSTTCQSVED 151
Query: 198 WFPPNGQDKCSS---KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
CSS K C Y I Y DGS G + + +T+ NG+ R ++GC
Sbjct: 152 -------TSCSSDNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRT-VIGCG 203
Query: 255 DNNTGDQNG-ASGIMGLDRGPVSIISK-----TNISY-FFYCLHSPYGSTGYITFGKPDT 307
NNT G +SGI+GL GPVS+I++ ++I F YCL S + + FG
Sbjct: 204 RNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSNISSKLNFGDAAV 263
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF---TKLSTEIDSGTIITR 364
V+ TPIV T + FY++TL SVG R+ +S F K + IDSGT +T
Sbjct: 264 VSGDGTVSTPIV-THDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGEKGNIIIDSGTTLTL 322
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELD 424
P +YS L SA ++ ++ ++ L CY S + + P I HF G D++L+
Sbjct: 323 LPNDIYSKLESAVADLVELDRVKDPLKQL-SLCYR-STFDELNAPVIMAHF-SGADVKLN 379
Query: 425 VRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
T + CL F P + GN+ Q+ + V YD+ + + F P +C+
Sbjct: 380 AVNTFIEVEQGVTCLAFISSKIGP---IFGNMAQQNFLVGYDLQKKIVSFKPTDCS 432
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 167 bits (424), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 128/418 (30%), Positives = 203/418 (48%), Gaps = 37/418 (8%)
Query: 73 KLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADE 131
+L+ R+T + ILRR ++ + +S D+ + F +G+ + E
Sbjct: 80 RLHARMRRDTDRVSAILRRISGKVVVASS--------DSRYEVNDFGSDVVSGMDQGSGE 131
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y++ + +G P + +++D+GS + W QC+PC C +Q DP FDP+KS +++ + C S+
Sbjct: 132 YFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSV 191
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
C + C S C Y++ Y DGS G A + +T + +
Sbjct: 192 CDRIE-------NSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKT------VVRNVAM 238
Query: 252 GCTDNNTGDQNGASGIMGLDRGPVSII---SKTNISYFFYCLHS-PYGSTGYITFGKPDT 307
GC N G GA+G++G+ G +S + S F YCL S STG + FG+
Sbjct: 239 GCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGR--E 296
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTII 362
+ P+V P FY++ L G+ VGG R+PL F T +D+GT +
Sbjct: 297 ALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAV 356
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
TR P Y+A R F+ + G+ +FDTCYDLS + +V VP ++ +F G L
Sbjct: 357 TRLPTGAYAAFRDGFKSQTANLPRASGVS-IFDTCYDLSGFVSVRVPTVSFYFTEGPVLT 415
Query: 423 LDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L R L+ V+ C FA P+ + ++GN+QQ G +V +D A +GFGP C
Sbjct: 416 LPARNFLMPVDDSGTYCFAFAASPTGLS--IIGNIQQEGIQVSFDGANGFVGFGPNVC 471
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 125/373 (33%), Positives = 189/373 (50%), Gaps = 25/373 (6%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
+ + EY++ V IG P ++ SL+LDTGS + W QC PC C +Q P++DP +S +F I
Sbjct: 85 LGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKESSSFRNIG 144
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVN--GNGYF 244
C+ C ++ PP K ++ CPY Y D S TG +AT+ T+ + G F
Sbjct: 145 CHDPRCHLVSSPDPPLPC-KAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEF 203
Query: 245 ARYP-FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGST 297
R + GC N G +GASG++GL RGP+S S+ Y F YCL +S +
Sbjct: 204 KRVENVMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVS 263
Query: 298 GYITFGK-PDTVNKKFVKYTPIVTTPEQ--SEFYHITLTGISVGGERLPLKASYFTKLS- 353
+ FG+ D +N + +T +V E FY++ + I VGGE L + S + S
Sbjct: 264 SKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPESTWNMTSD 323
Query: 354 ----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED--LFDTCYDLSAYKTVV 407
T +DSGT ++ F P Y ++ AF K++K Y + ++D + D CY++S + +
Sbjct: 324 GVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPI---VQDFPILDPCYNVSGVEKID 380
Query: 408 VPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYD 466
+P I F G V + ++ VCL P SI +GN QQ+ + V YD
Sbjct: 381 LPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRSALSI-IGNYQQQNFHVLYD 439
Query: 467 VAGRRLGFGPGNC 479
RLG+ P NC
Sbjct: 440 TKKSRLGYAPMNC 452
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 173/358 (48%), Gaps = 20/358 (5%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTFSKI 185
V Y + +G P +++DTGS +TW QC PC + C +Q P F+P S T++ +
Sbjct: 117 VGVGNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASV 176
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C++ C L N SS C Y +Y D S G+ + D ++ G +
Sbjct: 177 GCSAQQCSDLPSA-TLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSF------GSTS 229
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITF 302
F GC +N G ++G++GL R +S++ + S F YCL S + +
Sbjct: 230 LPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPS----SSSSGY 285
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTII 362
+ N YTP+V++ Y I L+G++V G L + +S ++ L T IDSGT+I
Sbjct: 286 LSLGSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVI 345
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
TR P VYSAL A MK + DTC+ A + V P +T+ F GG L+
Sbjct: 346 TRLPTSVYSALSKAVAAAMKGTSRASAYS-ILDTCFKGQASR-VSAPAVTMSFAGGAALK 403
Query: 423 LDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L + LV CL FA P+ +I +GN QQ+ + V YDV R+GF G C+
Sbjct: 404 LSAQNLLVDVDDSTTCLAFA--PARSAAI-IGNTQQQTFSVVYDVKSSRIGFAAGGCS 458
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 115/366 (31%), Positives = 173/366 (47%), Gaps = 28/366 (7%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
+ + EY+ + IG P++ L LDTGS +TW QC PC C Q DP +DPS S ++ ++
Sbjct: 7 LGSGEYFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVY 66
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
C S C+ L C C Y + Y D S +G + + N A
Sbjct: 67 CGSALCQAL-------DYSACQGMGCSYRVVYGDSSASSGDLGIESFYLGP---NSSTAM 116
Query: 247 YPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS----TGY 299
GC +N+G G +G++G+ G +S S+ S F YCL Y +
Sbjct: 117 RNIAFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSP 176
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE---- 355
+ FG+ T ++TP++ P + FY+ LTGISVGG LP+ + F
Sbjct: 177 LIFGR--TAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGA 234
Query: 356 -IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
+DSGT +TR P Y+ LR A+R + G+ L DTC++ TV +P + +H
Sbjct: 235 ILDSGTSVTRVVPPAYAVLRDAYRAASRNLPPAPGVY-LLDTCFNFQGLPTVQIPSLVLH 293
Query: 415 FLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
F GVD+ L L+ V+ CL FA PS ++GNVQQ+ + + +D+ +
Sbjct: 294 FDNGVDMVLPGGNILIPVDRSGTFCLAFA--PSSMPISVIGNVQQQTFRIGFDLQRSLIA 351
Query: 474 FGPGNC 479
P C
Sbjct: 352 IAPREC 357
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 130/403 (32%), Positives = 195/403 (48%), Gaps = 34/403 (8%)
Query: 89 LRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSL 147
++RD +R RRL P +AF +G+ + EY++ + +G P + +
Sbjct: 95 MQRDTKRA-ASLLRRLAAGKPT--YAAEAFGSDVVSGMEQGSGEYFVRIGVGSPPRNQYV 151
Query: 148 LLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKC 207
++D+GS I W QC+PC C Q DP F+P+ S +FS + C ST C + C
Sbjct: 152 VMDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSFSGVSCASTVCSHV-------DNAAC 204
Query: 208 SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGI 267
C Y+++Y DGS G A + +T G +GC +N G GA+G+
Sbjct: 205 HEGRCRYEVSYGDGSYTKGTLALETITF------GRTLIRNVAIGCGHHNQGMFVGAAGL 258
Query: 268 MGLDRGPVSIISK---TNISYFFYCLHS-PYGSTGYITFGKPDTVNKKFVKYTPIVTTPE 323
+GL GP+S + + F YCL S S+G + FG+ + P++ P
Sbjct: 259 LGLGGGPMSFVGQLGGQTGGAFSYCLVSRGIESSGLLEFGR--EAMPVGAAWVPLIHNPR 316
Query: 324 QSEFYHITLTGISVGGERLPLKASYFTKLS------TEIDSGTIITRFPAPVYSALRSAF 377
FY+I L+G+ VGG R+ + F KLS +D+GT +TR P Y A R F
Sbjct: 317 AQSFYYIGLSGLGVGGLRVSISEDVF-KLSELGDGGVVMDTGTAVTRLPTVAYEAFRDGF 375
Query: 378 RKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQ 436
+ G+ +FDTCYDL + +V VP ++ +F GG L L R L+ V+ V
Sbjct: 376 IAQTTNLPRASGVS-IFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGT 434
Query: 437 VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
C FA PS ++GN+QQ G ++ D A +GFGP C
Sbjct: 435 FCFAFA--PSSSGLSIIGNIQQEGIQISVDGANGFVGFGPNVC 475
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 122/417 (29%), Positives = 194/417 (46%), Gaps = 40/417 (9%)
Query: 85 LEEILRRDQQRL-----HLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVA-----ADEYYI 134
L+E LRR+ R+ ++ + L K + ++ +V+ + EY+
Sbjct: 100 LKEKLRREAVRVRGLERQIERTLTLNKDPVNRYENVAEVDADFGGEVVSGMEQGSGEYFT 159
Query: 135 VVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
+ +G P + ++LDTGS + W QC+PC C Q DP F+PS S +FS + C+S C
Sbjct: 160 RIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCDSAVCSQ 219
Query: 195 LLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
L + C S C Y+ +Y DGS TG +AT+ +T G + +GC
Sbjct: 220 LDAY-------DCHSGGCLYEASYGDGSYSTGSFATETLTF------GTTSVANVAIGCG 266
Query: 255 DNNTG----DQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNK 310
N G G P I ++T ++ + + S+G + FG P +V
Sbjct: 267 HKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSYCLVDRESDSSGPLQFG-PKSVPV 325
Query: 311 KFVKYTPIVTTPEQSEFYHITLTGISVGG---ERLPLKASYFTKLSTE----IDSGTIIT 363
+ +TP+ P FY++++T ISVGG + +P + + S IDSGT++T
Sbjct: 326 GSI-FTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSGHGGFIIDSGTVVT 384
Query: 364 RFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLEL 423
R Y A+R AF + + +FDTCYDLS + V VP + HF G L L
Sbjct: 385 RLVTSAYDAVRDAFVAGTGQLPRTDAVS-IFDTCYDLSGLQFVSVPTVGFHFSNGASLIL 443
Query: 424 DVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ L+ +++V C FA P+ + ++GN QQ+ V +D A +GF C
Sbjct: 444 PAKNYLIPMDTVGTFCFAFA--PAASSVSIMGNTQQQHIRVSFDSANSLVGFAFDQC 498
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 139/426 (32%), Positives = 202/426 (47%), Gaps = 47/426 (11%)
Query: 70 PCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAA 129
P S + G +T + ++R Q RL +LQ ++ + KA P G
Sbjct: 65 PLSPFSPGNISSTERFKRAIKRSQDRL-----EKLQMSV----DEVKAVEAPVYAG---N 112
Query: 130 DEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNS 189
E+ + +AIG P S +LDTGS +TWTQCKPC C Q P +DPS+S T+SK+PC+S
Sbjct: 113 GEFLMKMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTPIYDPSQSSTYSKVPCSS 172
Query: 190 TTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
+ C+ L + CS C Y +Y D S G + + T+ P
Sbjct: 173 SMCQALPMY-------SCSGANCEYLYSYGDQSSTQGILSYESFTLTS-------QSLPH 218
Query: 250 L-LGC-TDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS---TGYIT 301
+ GC +N G + G++G RGP+S+IS+ S F YCL S S T +
Sbjct: 219 IAFGCGQENEGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVSITDSPSKTSPLF 278
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------ 355
GK ++N K V TP+V + + FY+++L GISVGG+ L + F L +
Sbjct: 279 IGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTF-DLQLDGTGGVI 337
Query: 356 IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYD-LSAYKTVVVPKITIH 414
IDSGT +T Y ++ A + ++ G D C++ S T P IT H
Sbjct: 338 IDSGTTVTYLEQSGYDVVKKAVISSINLPQV-DGSNIGLDLCFEPQSGSSTSHFPTITFH 396
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
F G D L + +S CL A+LPS+ SI GN+QQ+ Y++ YD L F
Sbjct: 397 F-EGADFNLPKENYIYTDSSGIACL--AMLPSNGMSI-FGNIQQQNYQILYDNERNVLSF 452
Query: 475 GPGNCN 480
P C+
Sbjct: 453 APTVCD 458
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 130/435 (29%), Positives = 204/435 (46%), Gaps = 46/435 (10%)
Query: 62 LEVLGRYGPCSKLNQGKSRN-TPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTF 120
L V+ YG CS NQ K+ + ++ + +D R+ +S KA +
Sbjct: 35 LSVIHVYGQCSPFNQHKAGSWVNTVINMASKDPARVTYLSSL---------VASPKATSV 85
Query: 121 PAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSK 178
P +G ++ Y + V +G P Q + ++LDT W C C CS P F P+
Sbjct: 86 PIASGQQVLNIGNYVVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGCS---SPTFSPNT 142
Query: 179 SKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEV 238
S T++ + C+ C + P + C ++ Y G++ F A M Q+
Sbjct: 143 SSTYASLQCSVPQCTQVRGLSCPT----TGTAACFFNQTY---GGDSSFSA---MLSQDS 192
Query: 239 NGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS--P 293
G + GC + +G G++GL RGP+S++S++ Y F YC S
Sbjct: 193 LGLAVDTLPSYSFGCVNAVSGSTLPPQGLLGLGRGPMSLLSQSGSLYSGVFSYCFPSFKS 252
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF---- 349
Y +G + G K ++ TP++ P + Y++ LTG+SVG +P+
Sbjct: 253 YYFSGSLRLGPLG--QPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDP 310
Query: 350 -TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVV 408
T T IDSGT+ITRF PVY+A+R FRK++K G FDTC+ +A +
Sbjct: 311 NTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKGPFATIGA---FDTCF--AATNEDIA 365
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRGYEVHY 465
P +T HF G+DL+L + TL+ S + CL A P++ NS+L + N+QQ+ + +
Sbjct: 366 PPVTFHFT-GMDLKLPLENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMF 424
Query: 466 DVAGRRLGFGPGNCN 480
DV RLG CN
Sbjct: 425 DVTNSRLGIARELCN 439
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 125/381 (32%), Positives = 179/381 (46%), Gaps = 34/381 (8%)
Query: 124 TGIVAADEYYIVVAIGKPK-QYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTF 182
T + ++ EY I IG P+ Q V+L +DTGS + WTQC PC C Q P FDPS S TF
Sbjct: 79 TAVPSSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPFPLFDPSVSSTF 138
Query: 183 SKIPCNSTTCKILLEWFPPNGQ--DKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEV 238
+ C C+ P +G C+ K C Y +Y D S G+ D T
Sbjct: 139 RAVACPDPICR------PSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMSP 192
Query: 239 NGNGY--FARYPFLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISYFFYCLHS--- 292
NG G A GC D NTG + SGI G RGP+S+ S+ + F YCL S
Sbjct: 193 NGEGAPPVAVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSLPSQLRVGRFSYCLTSHDE 252
Query: 293 -PYGSTGYITFGKPDTVNKKF----VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKAS 347
T + G P + + TPI+ +P FY+++L GI+VG RLP+ +S
Sbjct: 253 TESNKTSAVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSS 312
Query: 348 YFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRKR--MKKYKMGKGIEDLFDTCYDL 400
F T IDSGT +T FPA V+ L++ F + + +Y + +L C+
Sbjct: 313 VFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLPLPRYDNTSEVGNLL--CFQR 370
Query: 401 -SAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQR 459
K V VPK+ H L D++L R + E + + ++ + +L+GN QQ+
Sbjct: 371 PKGGKQVPVPKLIFH-LASADMDLP-RENYIPEDTDSGVMCLMINGAEVDMVLIGNFQQQ 428
Query: 460 GYEVHYDVAGRRLGFGPGNCN 480
+ YDV +L F C+
Sbjct: 429 NMHIVYDVENSKLLFASAQCD 449
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 113/404 (27%), Positives = 179/404 (44%), Gaps = 32/404 (7%)
Query: 95 RLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSG 154
R L S + +P + + P +G +Y +++G P + S++ DTGS
Sbjct: 6 RSKLAASSLITSEVPYPPSVSTDYESPVASG---GGDYVTTISLGTPAKVFSVIADTGSD 62
Query: 155 ITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPY 214
+ W QCKPC C Q+DP FDP S +++ + C T C L K S C Y
Sbjct: 63 LIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTLCDSLPR--------KSCSPNCDY 114
Query: 215 DIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGP 274
Y DGSG G +++ +T+ G A+ GC N G N ASG++GL RG
Sbjct: 115 SYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKN-IAFGCGHLNRGSFNDASGLVGLGRGN 173
Query: 275 VSIISKTNISY---FFYCL---HSPYGSTGYITFGKPDTVN----KKFVKYTPIVTTPEQ 324
+S +S+ + F YCL T + FG + + K +TP++ P
Sbjct: 174 LSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAM 233
Query: 325 SEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRK 379
FY++ L IS+ G L + A F DSGT +T P Y + A R
Sbjct: 234 ESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRS 293
Query: 380 RMKKYKMGKGIEDLFDTCYDLSAYKTVV---VPKITIHFLGGVDLELDVRGTLVVESVRQ 436
++ ++ G D CYD+S K +P + HF G D +L V + +
Sbjct: 294 KVSFPEI-DGSSAGLDLCYDVSGSKASYKKKIPAMVFHFE-GADHQLPVENYFIAANDAG 351
Query: 437 VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ A++ S+ + + GN+ Q+ + V YD+ ++G+ P C+
Sbjct: 352 TIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQCD 395
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 121/428 (28%), Positives = 183/428 (42%), Gaps = 40/428 (9%)
Query: 72 SKLNQGKSRNTPSL-EEILRRDQQRLHLKNSRRLQKA-IPDNFKKTKAFTFPAKTGIVAA 129
+ ++ G S P L + R + R+ S + A + D + ++
Sbjct: 33 THVDAGTSYTKPQLLSRAIARSKARVAALQSAAVSPAPVADPITAARVLV------TASS 86
Query: 130 DEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNS 189
EY + +AIG P Y + ++DTGS + WTQC PC+ C+ Q P+FD +S T+ +PC S
Sbjct: 87 GEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCAAQPTPYFDVKRSATYRALPCRS 146
Query: 190 TTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
+ C L C K C Y Y D + G A + T + A
Sbjct: 147 SRCAAL-------SSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAAN-I 198
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTG-------YITF 302
GC N G+ +SG++G RGP+S++S+ S F YCL S T +
Sbjct: 199 SFGCGSLNAGELANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSPTPSRLYFGVFANL 258
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEID 357
+T + V+ TP V P Y +++ GIS+G +RLP+ F ID
Sbjct: 259 NSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTGGVIID 318
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL-FDTCYDLSAYK--TVVVPKITIH 414
SGT IT Y A+R + M D+ DTC+ TV VP H
Sbjct: 319 SGTSITWLQQDAYEAVRRGLASTIPLPAMND--TDIGLDTCFQWPPPPNVTVTVPDFVFH 376
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSI--LLGNVQQRGYEVHYDVAGRRL 472
F G ++ L +++ S G+ L P S+ ++GN QQ+ + YD+A L
Sbjct: 377 F-DGANMTLPPENYMLIASTT----GYLCLAMAPTSVGTIIGNYQQQNLHLLYDIANSFL 431
Query: 473 GFGPGNCN 480
F P C+
Sbjct: 432 SFVPAPCD 439
>gi|110740049|dbj|BAF01928.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
Length = 183
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 81/184 (44%), Positives = 115/184 (62%), Gaps = 3/184 (1%)
Query: 297 TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEI 356
TG++TFG + VK+TPI T + + FY + + I+VGG++LP+ ++ F+ I
Sbjct: 3 TGHLTFGSAGI--SRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALI 60
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFL 416
DSGT+ITR P Y+ALRS+F+ +M KY G+ + DTC+DLS +KTV +PK+ F
Sbjct: 61 DSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVS-ILDTCFDLSGFKTVTIPKVAFSFS 119
Query: 417 GGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
GG +EL +G V + QVCL FA D N+ + GNVQQ+ EV YD AG R+GF P
Sbjct: 120 GGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAP 179
Query: 477 GNCN 480
C+
Sbjct: 180 NGCS 183
>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
Length = 393
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 126/418 (30%), Positives = 182/418 (43%), Gaps = 75/418 (17%)
Query: 58 GKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQR---LHLKNSRRLQKAIPDNFKK 114
G S+ + RYGPCS + P+ EE+LRRDQ R + K S A ++ +
Sbjct: 29 GTSSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQS 88
Query: 115 TKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH---CSQQR 170
+K + P G + EY I V +G P +++DTGS ++W QC+PC C
Sbjct: 89 SK-VSVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHA 147
Query: 171 DPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWAT 230
FDP+ S T++ C++ C L + NG D + C Y + Y DGS TG
Sbjct: 148 GALFDPAASSTYAAFNCSAAACAQLGDSGEANGCD--AKSRCQYIVKYGDGSNTTGTG-- 203
Query: 231 DRMTIQEVNGNGYFARYPFLLGCT--DNNTGDQNGASGIMGLDRGPVSIISKTNISYFFY 288
F GC+ + G + G++GL S++S+T
Sbjct: 204 ------------------FQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAAR---- 241
Query: 289 CLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASY 348
+KK Y Y L I+VGG++L L S
Sbjct: 242 --------------------SKKVPTY------------YFAALEDIAVGGKKLGLSPSV 269
Query: 349 FTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVV 408
F S +DSGT+ITR P Y+AL SAFR M +Y + + + DTC++ + V +
Sbjct: 270 FAAGSL-VDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPL-GILDTCFNFTGLDKVSI 327
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYD 466
P + + F GG ++LD G V CL FA D +GNVQQR +EV YD
Sbjct: 328 PTVALVFAGGAVVDLDAHGI-----VSGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYD 380
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 114/349 (32%), Positives = 171/349 (48%), Gaps = 20/349 (5%)
Query: 136 VAIGKPKQYVSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
+ +G P +++DTGS +TW QC PC + C +Q P F+P S T++ + C++ C
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSD 60
Query: 195 LLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
L N SS C Y +Y D S G+ + D ++ G + F GC
Sbjct: 61 LPSA-TLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSF------GSTSLPNFYYGCG 113
Query: 255 DNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKK 311
+N G ++G++GL R +S++ + S F YCL S + + + N
Sbjct: 114 QDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPS----SSSSGYLSLGSYNPG 169
Query: 312 FVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYS 371
YTP+V++ Y I L+G++V G L + +S ++ L T IDSGT+ITR P VYS
Sbjct: 170 QYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTSVYS 229
Query: 372 ALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVV 431
AL A MK + DTC+ A + V P +T+ F GG L+L + LV
Sbjct: 230 ALSKAVAAAMKGTSRASAYS-ILDTCFKGQASR-VSAPAVTMSFAGGAALKLSAQNLLVD 287
Query: 432 ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
CL FA P+ +I +GN QQ+ + V YDV R+GF G C+
Sbjct: 288 VDDSTTCLAFA--PARSAAI-IGNTQQQTFSVVYDVKSSRIGFAAGGCS 333
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 165 bits (417), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 126/413 (30%), Positives = 183/413 (44%), Gaps = 43/413 (10%)
Query: 86 EEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYV 145
E +RRD R+ S + +F + G+ Y + +++G P
Sbjct: 44 SEAVRRDSHRIAFL-SDATAAGKATTTNSSVSFQALLENGV---GGYNMNISVGTPLLTF 99
Query: 146 SLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
++ DTGS + WTQC PC C QQ P F P+ S TFSK+PC S+ C+ L PN
Sbjct: 100 PVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQFL-----PNSIR 154
Query: 206 KCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGA 264
C++ C Y+ Y GSG T G+ AT+ + + G+ F F GC+ N G N
Sbjct: 155 TCNATGCVYNYKY--GSGYTAGYLATETLKV----GDASFPSVAF--GCSTEN-GVGNST 205
Query: 265 SGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGY-ITFGKPDTVNKKFVKYTPIVTTPE 323
SGI GL RG +S+I + + F YCL S + I FG + V+ TP V P
Sbjct: 206 SGIAGLGRGALSLIPQLGVGRFSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPA 265
Query: 324 -QSEFYHITLTGISVGGERLPLKASYF------TKLSTEIDSGTIITRFPAPVYSALRSA 376
+Y++ LTGI+VG LP+ S F T +DSGT +T Y ++ A
Sbjct: 266 VHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQA 325
Query: 377 FRKRMKKYKMGKGIEDLFDTCY-DLSAYKTVVVPKITIHFLGGVD---------LELDVR 426
F + G L D C+ + VP + + F GG + +E D +
Sbjct: 326 FLSQTANVTTVNGTRGL-DLCFKSTGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDSQ 384
Query: 427 GTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
G++ V CL D ++GNV Q + YD+ G F P +C
Sbjct: 385 GSVTV-----ACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFSPADC 432
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 131/416 (31%), Positives = 195/416 (46%), Gaps = 40/416 (9%)
Query: 87 EILRRDQQR-----------LHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADE--YY 133
EI+ RD R + N+ R ++F + ++ ++ + D+ Y
Sbjct: 30 EIIHRDSSRSPFYRATETQFQRVTNAVRRSMNRANHFNQISVYSNAVESPVTLLDDGDYL 89
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+ ++G P V ++DT S I W QC+ C C P FDPS SKT+ +PC+STTCK
Sbjct: 90 MSYSLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPCSSTTCK 149
Query: 194 ILLEWFPPNGQDKCSSKE---CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-F 249
+ CSS E C + + Y DGS G + +T+ N F +P
Sbjct: 150 SV-------QGTSCSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDP--FVHFPRT 200
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPD 306
++GC NT + GI+GL GPVS++ + + S F YCL + + FG
Sbjct: 201 VIGCI-RNTNVSFDSIGIVGLGGGPVSLVPQLSSSISKKFSYCLAPISDRSSKLKFGDAA 259
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT---KLSTEIDSGTIIT 363
V+ T IV + +FY++TL SVG R+ ++S K + IDSGT T
Sbjct: 260 MVSGDGTVSTRIV-FKDWKKFYYLTLEAFSVGNNRIEFRSSSSRSSGKGNIIIDSGTTFT 318
Query: 364 RFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLEL 423
P VYS L SA +K + ++ F CY S Y V VP IT HF G D++L
Sbjct: 319 VLPDDVYSKLESAVADVVKLERAEDPLKQ-FSLCYK-STYDKVDVPVITAHF-SGADVKL 375
Query: 424 DVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ T +V S R VCL F S + + GN+ Q+ + V YD+ + + F P +C
Sbjct: 376 NALNTFIVASHRVVCLAFL---SSQSGAIFGNLAQQNFLVGYDLQRKIVSFKPTDC 428
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 129/369 (34%), Positives = 185/369 (50%), Gaps = 26/369 (7%)
Query: 121 PAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSK 180
P T I A EY I ++G P V +LDTGS I W QC+PC C +Q P FD SKS+
Sbjct: 78 PETTVISALGEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDSSKSQ 137
Query: 181 TFSKIPCNSTTCKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQEVN 239
T+ +PC S TC+ + F CSS K C Y I YVDGS G + + +T+ N
Sbjct: 138 TYKTLPCPSNTCQSVQGTF-------CSSRKHCLYSIHYVDGSQSLGDLSVETLTLGSTN 190
Query: 240 GNGYFARYP-FLLGCTD-NNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPY 294
G+ ++P ++GC N G + SGI+GL RGP+S+I++ + S F YCL P
Sbjct: 191 GSP--VQFPGTVIGCGRYNAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYCL-VPG 247
Query: 295 GSTGY--ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKA-SYFTK 351
ST + FG V+ + TP+ + FY +TL SVG R+ + K
Sbjct: 248 LSTASSKLNFGNAAVVSGRGTVSTPLF-SKNGLVFYFLTLEAFSVGRNRIEFGSPGSGGK 306
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT-VVVPK 410
+ IDSGT +T P VYS L +A K + ++ + + CY ++ K VP
Sbjct: 307 GNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRV-RDPNQVLGLCYKVTPDKLDASVPV 365
Query: 411 ITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR 470
IT HF G D+ L+ T V + VC FA P++ ++ GN+ Q+ V YD+
Sbjct: 366 ITAHF-SGADVTLNAINTFVQVADDVVC--FAFQPTETGAV-FGNLAQQNLLVGYDLQMN 421
Query: 471 RLGFGPGNC 479
+ F +C
Sbjct: 422 TVSFKHTDC 430
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 119/411 (28%), Positives = 172/411 (41%), Gaps = 34/411 (8%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQY 144
L + R + R+ S + + D + ++ EY + +AIG P Y
Sbjct: 48 LSRAIARSKARVAALQSAAVLPPVVDPITAARVLV------TASSGEYLVDLAIGTPPLY 101
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQ 204
+ ++DTGS + WTQC PC+ C+ Q P+FD KS T+ +PC S+ C L
Sbjct: 102 YTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASL-------SS 154
Query: 205 DKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGA 264
C K C Y Y D + G A + T N A GC N GD +
Sbjct: 155 PSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATN-IAFGCGSLNAGDLANS 213
Query: 265 SGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTG-------YITFGKPDTVNKKFVKYTP 317
SG++G RGP+S++S+ S F YCL S +T Y +T + V+ TP
Sbjct: 214 SGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTP 273
Query: 318 IVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFPAPVYSA 372
V P Y ++L IS+G + LP+ F IDSGT IT Y A
Sbjct: 274 FVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEA 333
Query: 373 LRSAFRKRMKKYKMGKGIEDL-FDTCYDL--SAYKTVVVPKITIHFLGGVDLELDVRGTL 429
+R + M D+ DTC+ TV VP + HF L L
Sbjct: 334 VRRGLVSAIPLPAMND--TDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYML 391
Query: 430 VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ + +CL A P+ +I +GN QQ+ + YD+ L F P C+
Sbjct: 392 IASTTGYLCLVMA--PTGVGTI-IGNYQQQNLHLLYDIGNSFLSFVPAPCD 439
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 139/440 (31%), Positives = 199/440 (45%), Gaps = 53/440 (12%)
Query: 54 PQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFK 113
PQG L V PCS Q NT S E L +D+ RL +S + ++P
Sbjct: 27 PQG-HPSDLRVFHVNSPCSPFKQ---PNTVSWESTLLKDKARLQYLSSLAKKPSVP---- 78
Query: 114 KTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPF 173
+ IV + Y + IG P Q + + LDT + W C C+ C+
Sbjct: 79 ------IASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGCASSV--L 130
Query: 174 FDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATDR 232
FDPSKS + + C++ CK PN C++ K C +++ Y GS D
Sbjct: 131 FDPSKSSSSRNLQCDAPQCKQA-----PN--PTCTAGKSCGFNMTY-GGSTIEASLTQDT 182
Query: 233 MTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTN---ISYFFYC 289
+T+ N Y F GC TG A G+MGL RGP+S+IS+T +S F YC
Sbjct: 183 LTL----ANDVIKSYTF--GCISKATGTSLPAQGLMGLGRGPLSLISQTQNLYMSTFSYC 236
Query: 290 LHSPYGS--TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLK 345
L + S +G + G +K TP++ P +S Y++ L GI VG + +P
Sbjct: 237 LPNSKSSNFSGSLRLGP--KYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTS 294
Query: 346 ASYF---TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSA 402
A F T T DSGT+ TR P Y A+R+ FR+R+K FDTCY S
Sbjct: 295 ALAFDASTGAGTIFDSGTVFTRLVEPAYVAVRNEFRRRIKNANATS--LGGFDTCYSGS- 351
Query: 403 YKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQR 459
VV P +T F G+++ L L+ S CL A P++ NS+L + ++QQ+
Sbjct: 352 ---VVYPSVTFMF-AGMNVTLPPDNLLIHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQ 407
Query: 460 GYEVHYDVAGRRLGFGPGNC 479
+ V D+ RLG C
Sbjct: 408 NHRVLIDLPNSRLGISRETC 427
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 119/369 (32%), Positives = 183/369 (49%), Gaps = 18/369 (4%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
+ A EY++ V +G P ++ L++DTGS +TW QCKPC C Q P FDPS+S +F IP
Sbjct: 82 LGAGEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIP 141
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
CN+ C +++ + K S K C Y Y D S +G A + +++ +
Sbjct: 142 CNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEI 201
Query: 247 YPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNIS----YFFYCL---HSPYGSTGY 299
++GC +N G GA G++GL +G +S S+ S F YCL + +
Sbjct: 202 RDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSA 261
Query: 300 ITFGKPDTVNKKF--VKYTPIVTTPEQSE-FYHITLTGISVGGERLPLKASYFTKLS--- 353
I+FG +++ F +K+TP V T E FY++ + GI + E LP+ A F +
Sbjct: 262 ISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIATNGS 321
Query: 354 --TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
T IDSGT +T Y A+ SAF R+ + D+ CY+ + V P +
Sbjct: 322 GGTIIDSGTTLTYLNRDAYRAVESAFLARISYPRADPF--DILGICYNATGRAAVPFPAL 379
Query: 412 TIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRR 471
+I F G +L+L + ++ A+LP+D SI +GN QQ+ YDV R
Sbjct: 380 SIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSI-IGNFQQQNIHFLYDVQHAR 438
Query: 472 LGFGPGNCN 480
LGF +C+
Sbjct: 439 LGFANTDCS 447
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 120/405 (29%), Positives = 184/405 (45%), Gaps = 29/405 (7%)
Query: 89 LRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSL 147
+ RD+ RL + R ++ T +G+ + + EY+ + IG P++ L
Sbjct: 1 MERDEARLRWIHHRIQSSDHRHRRGRSLLQTAQVSSGLSLGSGEYFARMGIGSPQRSYYL 60
Query: 148 LLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKC 207
LDTGS +TW QC PC C Q DP +DPS S ++ ++ C S C+ L C
Sbjct: 61 ELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQAL-------DYSAC 113
Query: 208 SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGI 267
C Y + Y D S +G + + N A GC +N+G G +G+
Sbjct: 114 QGMGCSYRVVYGDSSASSGDLGIESFYLGP---NSSTAMRNIAFGCGHSNSGLFRGEAGL 170
Query: 268 MGLDRGPVSIISKTNISY---FFYCLHSPYGS----TGYITFGKPDTVNKKFVKYTPIVT 320
+G+ G +S S+ S F YCL Y + + FG+ T ++TP++
Sbjct: 171 LGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGR--TAIPFAARFTPLLK 228
Query: 321 TPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRFPAPVYSALRS 375
P FY+ LTGISVGG LP+ + F +DSGT +TR Y+ LR
Sbjct: 229 NPRIDTFYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRD 288
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESV 434
A+R + G+ L DTC++ TV +P + +HF VD+ L L+ V+
Sbjct: 289 AYRAASRNLPPAPGVY-LLDTCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRS 347
Query: 435 RQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL FA PS ++GNVQQ+ + + +D+ + P C
Sbjct: 348 GTFCLAFA--PSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 390
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 132/450 (29%), Positives = 210/450 (46%), Gaps = 45/450 (10%)
Query: 70 PCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAA 129
P + + R+ ++ + R +R + + RL+K+ + K + + PA++ A
Sbjct: 115 PKESITESAVRDLARIQTLHTRITERKNQDTTSRLKKSNVERKKPMEEVSSPAESPESYA 174
Query: 130 D-------------------EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQR 170
D EY+I V IG P ++ SL+LDTGS + W QC PC C +Q
Sbjct: 175 DYFSGQLMATLESGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQN 234
Query: 171 DPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWAT 230
P++DP S +F I CN C+++ PP K ++ CPY Y D S TG +A
Sbjct: 235 GPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPC-KFETQSCPYFYWYGDSSNTTGDFAL 293
Query: 231 DRMTIQ---EVNGNGYFARYP-FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY- 285
+ T+ G F R + GC N G +GA+G++GL RGP+S S+ Y
Sbjct: 294 ETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYG 353
Query: 286 --FFYCL---HSPYGSTGYITFGK-PDTVNKKFVKYTPIVTTPEQ--SEFYHITLTGISV 337
F YCL S + + FG+ D + + +T ++ E FY++ + I V
Sbjct: 354 HSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFV 413
Query: 338 GGERLPLKASYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED 392
GGE+L + + + T IDSGT ++ F P Y ++ AF +++K YK+ +ED
Sbjct: 414 GGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKL---VED 470
Query: 393 --LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPN 449
+ CY++S + P+ I F G V + ++ + VCL P
Sbjct: 471 FPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSAL 530
Query: 450 SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
SI +GN QQ+ + + YD RLG+ P C
Sbjct: 531 SI-IGNYQQQNFHILYDTKNSRLGYAPMRC 559
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 132/450 (29%), Positives = 210/450 (46%), Gaps = 45/450 (10%)
Query: 70 PCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAA 129
P + + R+ ++ + R +R + + RL+K+ + K + + PA++ A
Sbjct: 115 PKESITESAVRDLARIQTLHTRITERKNQDTTSRLKKSNVERKKPMEEVSSPAESPESYA 174
Query: 130 D-------------------EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQR 170
D EY+I V IG P ++ SL+LDTGS + W QC PC C +Q
Sbjct: 175 DYFSGQLMATLESGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQN 234
Query: 171 DPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWAT 230
P++DP S +F I CN C+++ PP K ++ CPY Y D S TG +A
Sbjct: 235 GPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPC-KFETQSCPYFYWYGDSSNTTGDFAL 293
Query: 231 DRMTIQ---EVNGNGYFARYP-FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY- 285
+ T+ G F R + GC N G +GA+G++GL RGP+S S+ Y
Sbjct: 294 ETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYG 353
Query: 286 --FFYCL---HSPYGSTGYITFGK-PDTVNKKFVKYTPIVTTPEQ--SEFYHITLTGISV 337
F YCL S + + FG+ D + + +T ++ E FY++ + I V
Sbjct: 354 HSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFV 413
Query: 338 GGERLPLKASYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED 392
GGE+L + + + T IDSGT ++ F P Y ++ AF +++K YK+ +ED
Sbjct: 414 GGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKL---VED 470
Query: 393 --LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPN 449
+ CY++S + P+ I F G V + ++ + VCL P
Sbjct: 471 FPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSAL 530
Query: 450 SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
SI +GN QQ+ + + YD RLG+ P C
Sbjct: 531 SI-IGNYQQQNFHILYDTKNSRLGYAPMRC 559
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 135/416 (32%), Positives = 196/416 (47%), Gaps = 44/416 (10%)
Query: 80 RNTPSLEEI---LRRDQQRLHLKNSRRLQKA--IPDNFKKTKAFTFPAKTGIVAADEYYI 134
+N LE I ++R + RL +RLQ + + + +A P E+ +
Sbjct: 51 KNLTKLERIRHGVKRGRNRL-----QRLQAMALVASSSSEIEAPVLPGN------GEFLM 99
Query: 135 VVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
+AIG P + S +LDTGS + WTQCKPC C Q P FDP KS +FSK+ C+S C+
Sbjct: 100 KLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCEA 159
Query: 195 LLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
L + NG C Y +Y D S G A++ +T G F G
Sbjct: 160 LPQSSCNNG--------CEYLYSYGDYSSTQGILASETLTF----GKASVPNVAFGCGAD 207
Query: 255 DNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGS-TGYITFGKPDTVN--KK 311
+ +G GA G++GL RGP+S++S+ F YCL + + T + G +VN
Sbjct: 208 NEGSGFSQGA-GLVGLGRGPLSLVSQLKEPKFSYCLTTVDDTKTSTLLMGSLASVNASSS 266
Query: 312 FVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRFP 366
+K TP++ +P FY+++L GISVG RLP+K S F+ IDSGT IT
Sbjct: 267 AIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLE 326
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL-SAYKTVVVPKITIHFLGGVDLELDV 425
++ + F ++ G L D C+ L S + VPK+ HF G DLEL
Sbjct: 327 ESAFNLVAKEFTAKINLPVDSSGSTGL-DVCFTLPSGSTNIEVPKLVFHF-DGADLELPA 384
Query: 426 RGTLVVESVRQV-CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
++ +S V CL S + GNVQQ+ V +D+ L F P C+
Sbjct: 385 ENYMIGDSSMGVACLAMG---SSSGMSIFGNVQQQNMLVLHDLEKETLSFLPTQCD 437
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 135/432 (31%), Positives = 209/432 (48%), Gaps = 38/432 (8%)
Query: 61 SLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTF 120
++E++ R P S + T + +RR R+H + K + FT
Sbjct: 30 TVELINRDSPKSPFYNPRETPTQRIVSAVRRSMSRVHHFSPT----------KNSDIFTD 79
Query: 121 PAKTGIVAAD-EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKS 179
A++ +++ EY + ++G P + + DTGS + WTQCKPC C +Q P FDP S
Sbjct: 80 TAQSEMISNQGEYLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPLFDPKSS 139
Query: 180 KTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVN 239
T+ I C++ C +L E +G+ +K C Y +Y D S +G A D +T+ +
Sbjct: 140 STYRDISCSTKQCDLLKEGASCSGE---GNKTCHYSYSYGDRSFTSGNVAADTITLGSTS 196
Query: 240 GNGYFARYPFLLGCTDNNTGD-QNGASGIMGLDRGPVSIISK---TNISYFFYC---LHS 292
G ++GC NN G SGI+GL GP+S+IS+ T F YC L S
Sbjct: 197 GRPVLLPKA-IIGCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLVPLSS 255
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF--T 350
++ + FG V+ V+ TP+++ + FY +TL +SVG ER+ S F +
Sbjct: 256 NATNSSKLNFGSNGIVSGGGVQSTPLISK-DPDTFYFLTLEAVSVGSERIKFPGSSFGTS 314
Query: 351 KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED---LFDTCYDLSAYKTVV 407
+ + IDSGT +T FP +S L SA + + G +ED + CY + A +
Sbjct: 315 EGNIIIDSGTTLTLFPEDFFSELSSAVQDAVA----GTPVEDPSGILSLCYSIDA--DLK 368
Query: 408 VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
P IT HF G D++L+ T V V L FA P + +I GN+ Q + V YD+
Sbjct: 369 FPSITAHF-DGADVKLNPLNTFV--QVSDTVLCFAFNPINSGAI-FGNLAQMNFLVGYDL 424
Query: 468 AGRRLGFGPGNC 479
G+ + F P +C
Sbjct: 425 EGKTVSFKPTDC 436
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 135/423 (31%), Positives = 191/423 (45%), Gaps = 37/423 (8%)
Query: 76 QGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTG--IVAADEYY 133
+G +RN E+LRR R + +++L P T P +G +V EY
Sbjct: 42 RGFTRN-----ELLRRMVLRSRARAAKQL---CPSRSGTPVRVTAPVASGSHVVGYTEYL 93
Query: 134 IVVAIGKPK-QYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTC 192
I IG P+ Q V+L +DTGS + WTQC+PC C Q P FD S S T + C C
Sbjct: 94 IHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDTVHGVLCTDPIC 153
Query: 193 KILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLG 252
+ L C C Y + Y D S G A D T + G G + G
Sbjct: 154 RALRP-------HACFLGGCTYQVNYGDNSVTIGQLAKDSFTF-DGKGGGKVTVPDLVFG 205
Query: 253 CTDNNTGD-QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITF-GKPDTVNK 310
C NTG+ + +GI G RGP+S+ + +S F YC + + S F G
Sbjct: 206 CGQYNTGNFHSNETGIAGFGRGPLSLPRQLGVSSFSYCFTTIFESKSTPVFLGGAPADGL 265
Query: 311 KFVKYTPIVTT---PEQSEFYHITLTGISVGGERLPLKASYFTKLS-----TEIDSGTII 362
+ PI++T P E+Y+++L GI+VG RL + S F + T IDSGT I
Sbjct: 266 RAHATGPILSTPFLPNHPEYYYLSLKGITVGKTRLAVPESAFVVKADGSGGTIIDSGTAI 325
Query: 363 TRFPAPVYSALRSAFRKRM-----KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG 417
T FP V+ +L AF ++ G+ F T A K V VPK+T+H L
Sbjct: 326 TAFPRAVFRSLWEAFVAQVPLPHTSYNDTGEPTLQCFSTESVPDASK-VPVPKMTLH-LE 383
Query: 418 GVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
G D EL R + E L +L D + ++GN QQ+ + +D+AG +L P
Sbjct: 384 GADWELP-RENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNMHIVHDLAGNKLVIEPA 442
Query: 478 NCN 480
C+
Sbjct: 443 QCD 445
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 126/413 (30%), Positives = 197/413 (47%), Gaps = 29/413 (7%)
Query: 80 RNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIG 139
R++P + + H ++ R ++F K + P T I Y + ++G
Sbjct: 35 RDSPKSPYYKPTENKYQHFVDAARRSINRANHFFKDSDTSTPESTVIPDRGGYLMTYSVG 94
Query: 140 KPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWF 199
P + + DTGS I W QC+PC C Q P F+PSKS ++ IPC+S C + +
Sbjct: 95 TPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCSSKLCHSVRDT- 153
Query: 200 PPNGQDKCSSK-ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNN 257
CS + C Y I+Y D S G + D ++++ +G+ +P ++GC +N
Sbjct: 154 ------SCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSP--VSFPKIVIGCGTDN 205
Query: 258 TGDQNGA-SGIMGLDRGPVSIISKTNISY---FFYC----LHSPYGSTGYITFGKPDTVN 309
G GA SGI+GL GPVS+I++ S F YC L+ ++ ++FG V+
Sbjct: 206 AGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFGDAAVVS 265
Query: 310 KKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF---TKLSTEIDSGTIITRFP 366
V TP++ + FY +TL SVG +R+ S + + IDSGT +T P
Sbjct: 266 GDGVVSTPLIK--KDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGTTLTLIP 323
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVR 426
+ VY+ L SA +K ++ + F CY L + + P IT+HF G D+EL
Sbjct: 324 SDVYTNLESAVVDLVKLDRVDDPNQQ-FSLCYSLKSNE-YDFPIITVHF-KGADVELHSI 380
Query: 427 GTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
T V + VC FA PS + GN+ Q+ V YD+ + + F P +C
Sbjct: 381 STFVPITDGIVC--FAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTVSFKPTDC 431
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 133/399 (33%), Positives = 193/399 (48%), Gaps = 36/399 (9%)
Query: 95 RLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGS 153
L LK ++ + I + T + T P +G A EY+ + +G+P Q + DTGS
Sbjct: 147 ELSLKGGKQFGRRI-NGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGS 205
Query: 154 GITWTQCKPCI---HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSK 210
++W QC+PC C +Q P FDP S ++S + C+S C +L E C +
Sbjct: 206 DVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEA-------ACDAN 258
Query: 211 ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGL 270
C Y++ Y DGS G AT+ + + N P +GC +N G GA G++GL
Sbjct: 259 SCIYEVEYGDGSFTVGELATETFSFRHSNS---IPNLP--IGCGHDNEGLFVGADGLIGL 313
Query: 271 DRGPVSIISKTNISYFFYC---LHSPYGSTGYITFGKP-DTVNKKFVKYTPIVTTPEQSE 326
G +S+ S+ + F YC L S ST +P D++ +P+V
Sbjct: 314 GGGAISLSSQLEATSFSYCLVDLDSESSSTLDFNADQPSDSLT------SPLVKNDRFPT 367
Query: 327 FYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRFPAPVYSALRSAFRKRM 381
F ++ + G+SVGG+ LP+ +S F + +DSGT IT P+ VY LR AF
Sbjct: 368 FRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLT 427
Query: 382 KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLG 440
K G+ FDTCYDLS+ V VP I G L+L + L+ V+S CL
Sbjct: 428 KNLPPAPGVSP-FDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVDSAGTFCLA 486
Query: 441 FALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
F LPS ++GNVQQ+G V YD+A +GF C
Sbjct: 487 F--LPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 112/375 (29%), Positives = 170/375 (45%), Gaps = 35/375 (9%)
Query: 113 KKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP 172
T++ + P + Y + + +G P + ++DTGS ITWTQC PC+HC +Q P
Sbjct: 46 SNTQSGSSPYANTVFDNSVYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAP 105
Query: 173 FFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDR 232
FDPSKS TF + +C CPY++ Y D + G AT+
Sbjct: 106 IFDPSKSSTFK--------------------EKRCDGHSCPYEVDYFDHTYTMGTLATET 145
Query: 233 MTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYC 289
+T+ +G F ++GC NN+ + SG++GL+ GP S+I++ Y YC
Sbjct: 146 ITLHSTSGEP-FVMPETIIGCGHNNSWFKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYC 204
Query: 290 LHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF 349
T I FG V V T + T + FY++ L +SVG R+ + F
Sbjct: 205 FSGQ--GTSKINFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTF 262
Query: 350 TKLSTE--IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGI-EDLFDTCYDLSAYKTV 406
L IDSGT +T FP + +R A + + D+ CY+
Sbjct: 263 HALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDML--CYNSDTID-- 318
Query: 407 VVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDP-NSILLGNVQQRGYEVHY 465
+ P IT+HF GGVDL LD + + +ES A++ + P + GN Q + V Y
Sbjct: 319 IFPVITMHFSGGVDLVLD-KYNMYMESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGY 377
Query: 466 DVAGRRLGFGPGNCN 480
D + + F P NC+
Sbjct: 378 DSSSLLVSFSPTNCS 392
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 126/385 (32%), Positives = 192/385 (49%), Gaps = 39/385 (10%)
Query: 114 KTKAFTFPAKTGIVAADEYYIVVA-IGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP 172
K+K + P +G Y+V A +G P Q + ++LDT + W C C CS
Sbjct: 86 KSKPTSVPVASGNQLHIGNYVVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTS 145
Query: 173 FFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDR 232
F + S T+S + C++T C P+ + S C ++ +Y S + D
Sbjct: 146 FNT-NSSSTYSTVSCSTTQCTQARGLTCPSSTPQPS--ICSFNQSYGGDSSFSANLVQDT 202
Query: 233 MTIQ-EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFY 288
+T+ +V N F GC ++ +G+ G+MGL RGP+S++S+T Y F Y
Sbjct: 203 LTLSPDVIPN-------FSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSY 255
Query: 289 CLHSP-----YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLP 343
CL S GS G+P K ++YTP++ P + Y++ LTG+SVG ++P
Sbjct: 256 CLPSFRSFYFSGSLKLGLLGQP-----KSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVP 310
Query: 344 LKASYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCY 398
+ Y T S T IDSGT+ITRF PVY A+R FRK++ G FDTC+
Sbjct: 311 VDPVYLTFDSNSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNGSFSTLGA---FDTCF 367
Query: 399 DLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGN 455
SA V PKIT+H + +DL+L + TL+ S + CL A + + N++L + N
Sbjct: 368 --SADNENVTPKITLH-MTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIAN 424
Query: 456 VQQRGYEVHYDVAGRRLGFGPGNCN 480
+QQ+ + +DV R+G P CN
Sbjct: 425 LQQQNLRILFDVPNSRIGIAPEPCN 449
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 127/423 (30%), Positives = 198/423 (46%), Gaps = 43/423 (10%)
Query: 86 EEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQY 144
E +RRD RL + A T + + + + A Y + +++G P
Sbjct: 44 SEAVRRDGHRLAFLSYAATAAAGKATTTGTNSSSVNVQAQLENGAGAYNMNISLGTPPLD 103
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRD--PFFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
+++DTGS + W QC PC C + P P++S TFS++PCN + C ++ P +
Sbjct: 104 FPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNGSFC----QYLPTS 159
Query: 203 GQDKC--SSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTG 259
+ + ++ C Y+ Y GSG T G+ AT+ +T+ G+G F + F GC+ N
Sbjct: 160 SRPRTCNATAACAYNYTY--GSGYTAGYLATETLTV----GDGTFPKVAF--GCSTENGV 211
Query: 260 DQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGY--ITFGK-PDTVNKKFVKYT 316
D +SGI+GL RGP+S++S+ + F YCL S G I FG + V+ T
Sbjct: 212 DN--SSGIVGLGRGPLSLVSQLAVGRFSYCLRSDMADGGASPILFGSLAKLTERSVVQST 269
Query: 317 PIVTTP--EQSEFYHITLTGISVGGERLPLKASYF----TKL--STEIDSGTIITRFPAP 368
P++ P ++S Y++ LTGI+V LP+ S F T L T +DSGT +T
Sbjct: 270 PLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKD 329
Query: 369 VYSALRSAFRKRMKKYKM---GKGIEDLFDTCYDLSA---YKTVVVPKITIHFLGGVDLE 422
Y+ ++ AF+ +M G D CY SA K V VP++ + F GG
Sbjct: 330 GYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYN 389
Query: 423 LDVRGTLV-VESVRQ-----VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
+ V+ VE+ Q CL D ++GN+ Q + YD+ G F P
Sbjct: 390 VPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAP 449
Query: 477 GNC 479
+C
Sbjct: 450 ADC 452
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 131/417 (31%), Positives = 192/417 (46%), Gaps = 32/417 (7%)
Query: 84 SLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAK-TGIVAADEYYIVVAIGKPK 142
S E+LRR R +++R L + A P T V EY + +AIG P
Sbjct: 68 STRELLRRMAARSKARSARLLSG------RAASARMDPGSYTDGVPDTEYLVHMAIGTPP 121
Query: 143 QYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
Q V L+LDTGS +TWTQC PC+ C +Q P F+PS+S TFS +PC+ C+ L W
Sbjct: 122 QPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRD-LTW-SSC 179
Query: 203 GQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL-LGCTDNNTGD- 260
G+ + C Y AY D S TG +D + + A P L GC N G
Sbjct: 180 GEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIF 239
Query: 261 QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITF-GKPDTV-------NKKF 312
+ +GI G RG +S+ ++ + F YC + GS F G P +
Sbjct: 240 VSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGV 299
Query: 313 VKYTPIVT-TPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFP 366
V+ T ++ Q + Y+I+L G++VG RLP+ S F T +DSGT +T P
Sbjct: 300 VQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLP 359
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGG-VDLELDV 425
VY+ + AF + K + L C+ + VP + +HF G +DL +
Sbjct: 360 EAVYNLVCDAFVAQ-TKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPREN 418
Query: 426 RGTLVVES--VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ E+ +R CL + + + ++GN QQ+ V YD+A L F P CN
Sbjct: 419 YMFEIEEAGGIRLTCLA---INAGEDLSVIGNFQQQNMHVLYDLANDMLSFVPARCN 472
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 114/399 (28%), Positives = 178/399 (44%), Gaps = 26/399 (6%)
Query: 97 HLKNSRRLQKAIPDNFKKTKAFTFPAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGI 155
H N + I F P +G + + +Y++ +G P Q SL++D+GS +
Sbjct: 28 HTANPPVITAVIAGPPSHDYGFQSPVVSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDL 87
Query: 156 TWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL--LEWFPPNGQDKCSSKECP 213
W QC PC C Q P + PS S TFS +PC S+ C ++ E FP D C
Sbjct: 88 LWVQCSPCRQCYAQDSPLYVPSNSSTFSPVPCLSSDCLLIPATEGFP---CDFRYPGACA 144
Query: 214 YDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRG 273
Y+ Y D S G +A + T+ V + GC +N G A G++GL +G
Sbjct: 145 YEYLYADTSSSKGVFAYESATVDGVRID------KVAFGCGSDNQGSFAAAGGVLGLGQG 198
Query: 274 PVSIISKTNISY---FFYCLHS---PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEF 327
P+S S+ +Y F YCL + P + + FG ++YTPIV+ P+
Sbjct: 199 PLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSSLIFGDELISTIHDMQYTPIVSNPKSPTL 258
Query: 328 YHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRKRMK 382
Y++ + ++VGG+ LP+ S + + DSGT +T + YS + +AF +
Sbjct: 259 YYVQIEKVTVGGKSLPISDSAWEIDLLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGV- 317
Query: 383 KYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFA 442
Y + ++ L D C +L+ P TI F G + + V + CL A
Sbjct: 318 HYPRAESVQGL-DLCVELTGVDQPSFPSFTIEFDDGAVFQPEAENYFVDVAPNVRCLAMA 376
Query: 443 LLPSDPNSI-LLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L S +GN+ Q+ + V YD +GF P C+
Sbjct: 377 GLASPLGGFNTIGNLLQQNFFVQYDREENLIGFAPAKCS 415
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 112/372 (30%), Positives = 174/372 (46%), Gaps = 26/372 (6%)
Query: 126 IVAAD--EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFS 183
+VAA EY + +AIG P + ++DTGS + WTQC PC+ C+ Q P+F P++S T+
Sbjct: 84 LVAASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYR 143
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGY 243
+PC S C L +P Q C Y Y D + G A++ T N +
Sbjct: 144 LVPCRSPLCAAL--PYPACFQRSV----CVYQYYYGDEASTAGVLASETFTFGAANSSKV 197
Query: 244 FARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYG-STGYITF 302
GC + N+G +SG++GL RGP+S++S+ S F YCL S + F
Sbjct: 198 MVS-DVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPEPSRLNF 256
Query: 303 GKPDTVN-------KKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT----- 350
G T+N V+ TP+V Y ++L GIS+G +RLP+ F
Sbjct: 257 GVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDG 316
Query: 351 KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV--V 408
IDSGT +T Y A+R ++ E +TC+ +V V
Sbjct: 317 TGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTV 376
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVA 468
P + +HF GG ++ + ++++ L A++ S ++ ++GN QQ+ + YD+A
Sbjct: 377 PDMELHFDGGANMTVPPENYMLIDGATGF-LCLAMIRSG-DATIIGNYQQQNMHILYDIA 434
Query: 469 GRRLGFGPGNCN 480
L F P CN
Sbjct: 435 NSLLSFVPAPCN 446
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 131/417 (31%), Positives = 192/417 (46%), Gaps = 32/417 (7%)
Query: 84 SLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAK-TGIVAADEYYIVVAIGKPK 142
S E+LRR R +++R L + A P T V EY + +AIG P
Sbjct: 42 STRELLRRMAARSKARSARLLSG------RAASARMDPGSYTDGVPDTEYLVHMAIGTPP 95
Query: 143 QYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
Q V L+LDTGS +TWTQC PC+ C +Q P F+PS+S TFS +PC+ C+ L W
Sbjct: 96 QPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRD-LTW-SSC 153
Query: 203 GQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL-LGCTDNNTGD- 260
G+ + C Y AY D S TG +D + + A P L GC N G
Sbjct: 154 GEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIF 213
Query: 261 QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITF-GKPDTV-------NKKF 312
+ +GI G RG +S+ ++ + F YC + GS F G P +
Sbjct: 214 VSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGV 273
Query: 313 VKYTPIVT-TPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFP 366
V+ T ++ Q + Y+I+L G++VG RLP+ S F T +DSGT +T P
Sbjct: 274 VQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLP 333
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGG-VDLELDV 425
VY+ + AF + K + L C+ + VP + +HF G +DL +
Sbjct: 334 EAVYNLVCDAFVAQ-TKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPREN 392
Query: 426 RGTLVVES--VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ E+ +R CL + + + ++GN QQ+ V YD+A L F P CN
Sbjct: 393 YMFEIEEAGGIRLTCLA---INAGEDLSVIGNFQQQNMHVLYDLANDMLSFVPARCN 446
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 138/445 (31%), Positives = 206/445 (46%), Gaps = 60/445 (13%)
Query: 61 SLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQR-----------LHLKNSRRLQKAIP 109
SL +L + C ++ + N E++ RD + H+ N+ R
Sbjct: 5 SLLILFYFSLCFIISLSHALNNGFSVELIHRDSSKSPLYQPTQNKYQHIVNAARRSINRA 64
Query: 110 DNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQ 169
++F KT P T I EY + ++G P + + DTGS I W QC+PC C Q
Sbjct: 65 NHFYKTALTNTPQSTVIPDHGEYLMTYSVGTPPFKLYGIADTGSDIVWLQCEPCKECYNQ 124
Query: 170 RDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWA 229
P F PSKS T+ IPC+S CK SG+ G +
Sbjct: 125 TTPKFKPSKSSTYKNIPCSSDLCK----------------------------SGQQGNLS 156
Query: 230 TDRMTIQEVNGNGYFARYP-FLLGC-TDNNTGDQNGASGIMGLDRGPVSIISKTNISY-- 285
D +T++ + G+ +P ++GC TDN + +SGI+GL GP S+I++ S
Sbjct: 157 VDTLTLE--SSTGHPISFPKTVIGCGTDNTVSFEGASSGIVGLGGGPASLITQLGSSIDA 214
Query: 286 -FFYCLH-SPYGS--TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGER 341
F YCL +P S T + FG V+ V TPIV + FY++TL SVG +R
Sbjct: 215 KFSYCLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKK-DPIVFYYLTLEAFSVGNKR 273
Query: 342 LPLKASYF--TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYD 399
+ + S + + IDSGT +T P VY+ L SA + +K ++ LF+ CY
Sbjct: 274 IEFEGSSNGGHEGNIIIDSGTTLTVIPTDVYNNLESAVLELVKLKRVNDPTR-LFNLCYS 332
Query: 400 LSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGF----ALLPSDPNSILLGN 455
+++ P IT HF G D++L T V + VCL F A +PSD SI GN
Sbjct: 333 VTS-DGYDFPIITTHF-KGADVKLHPISTFVDVADGIVCLAFATTSAFIPSDVVSI-FGN 389
Query: 456 VQQRGYEVHYDVAGRRLGFGPGNCN 480
+ Q+ V YD+ + + F P +C+
Sbjct: 390 LAQQNLLVGYDLQQKIVSFKPTDCS 414
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 119/369 (32%), Positives = 182/369 (49%), Gaps = 18/369 (4%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
+ A EY++ V +G P ++ L++DTGS +TW QCKPC C Q P FDPS+S +F IP
Sbjct: 166 LGAGEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIP 225
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
CN+ C +++ + K S K C Y Y D S +G A + +++ +
Sbjct: 226 CNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEI 285
Query: 247 YPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNIS----YFFYCL---HSPYGSTGY 299
++GC +N G GA G++GL +G +S S+ S F YCL + +
Sbjct: 286 RDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSA 345
Query: 300 ITFGKPDTVNKKF--VKYTPIVTTPEQSE-FYHITLTGISVGGERLPLKASYFTKL---- 352
I+FG +++ F +++TP V T E FY++ + GI + E LP+ A F
Sbjct: 346 ISFGAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGS 405
Query: 353 -STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
T IDSGT +T Y A+ SAF R+ Y D+ CY+ + V P +
Sbjct: 406 GGTIIDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPF-DILGICYNATGRTAVPFPTL 463
Query: 412 TIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRR 471
+I F G +L+L + ++ A+LP+D SI +GN QQ+ YDV R
Sbjct: 464 SIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSI-IGNFQQQNIHFLYDVQHAR 522
Query: 472 LGFGPGNCN 480
LGF +C+
Sbjct: 523 LGFANTDCS 531
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 112/372 (30%), Positives = 174/372 (46%), Gaps = 26/372 (6%)
Query: 126 IVAAD--EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFS 183
+VAA EY + +AIG P + ++DTGS + WTQC PC+ C+ Q P+F P++S T+
Sbjct: 84 LVAASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYR 143
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGY 243
+PC S C L +P Q C Y Y D + G A++ T N +
Sbjct: 144 LVPCRSPLCAAL--PYPACFQRSV----CVYQYYYGDEASTAGVLASETFTFGAANSSKV 197
Query: 244 FARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYG-STGYITF 302
GC + N+G +SG++GL RGP+S++S+ S F YCL S + F
Sbjct: 198 MVS-DVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPEPSRLNF 256
Query: 303 GKPDTVN-------KKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT----- 350
G T+N V+ TP+V Y ++L GIS+G +RLP+ F
Sbjct: 257 GVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDG 316
Query: 351 KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV--V 408
IDSGT +T Y A+R ++ E +TC+ +V V
Sbjct: 317 TGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTV 376
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVA 468
P + +HF GG ++ + ++++ L A++ S ++ ++GN QQ+ + YD+A
Sbjct: 377 PDMELHFDGGANMTVPPENYMLIDGATGF-LCLAMIRSG-DATIIGNYQQQNMHILYDIA 434
Query: 469 GRRLGFGPGNCN 480
L F P CN
Sbjct: 435 NSLLSFVPAPCN 446
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 133/399 (33%), Positives = 193/399 (48%), Gaps = 36/399 (9%)
Query: 95 RLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGS 153
L LK ++ + I + T + T P +G A EY+ + +G+P Q + DTGS
Sbjct: 147 ELSLKGGKQFGRRI-NGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGS 205
Query: 154 GITWTQCKPCI---HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSK 210
++W QC+PC C +Q P FDP S ++S + C+S C +L E C +
Sbjct: 206 DVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEA-------ACDAN 258
Query: 211 ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGL 270
C Y++ Y DGS G AT+ + + N P +GC +N G GA+G++GL
Sbjct: 259 SCIYEVEYGDGSFTVGELATETFSFRHSNS---IPNLP--IGCGHDNEGLFVGAAGLIGL 313
Query: 271 DRGPVSIISKTNISYFFYC---LHSPYGSTGYITFGKP-DTVNKKFVKYTPIVTTPEQSE 326
G +S+ S+ + F YC L S ST +P D++ +P+V
Sbjct: 314 GGGAISLSSQLEATSFSYCLVDLDSESSSTLDFNADQPSDSLT------SPLVKNDRFPT 367
Query: 327 FYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRFPAPVYSALRSAFRKRM 381
F ++ + G+SVGG+ LP+ +S F + +DSGT IT P+ VY LR AF
Sbjct: 368 FRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLT 427
Query: 382 KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLG 440
K G+ FDTCYDLS+ V VP I G L+L + L V+S CL
Sbjct: 428 KNLPPAPGVSP-FDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLA 486
Query: 441 FALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
F LPS ++GNVQQ+G V YD+A +GF C
Sbjct: 487 F--LPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 161 bits (407), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 132/415 (31%), Positives = 200/415 (48%), Gaps = 46/415 (11%)
Query: 89 LRRDQQRLHLKNSRRLQKAIP--DNFKKT-------KAFTFPAKTGIV--AADEYYIVVA 137
L RD R+ N R L++++ +F ++ + T P +G + EY +
Sbjct: 95 LTRDAARVQFLN-RNLERSLNGGTHFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIG 153
Query: 138 IGKPKQYVSLLLDTGSGITWTQCKPCIH---CSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
+G+P + L+ DTGS +TW QC+PC C +Q DP FDP S ++S + CNS CK+
Sbjct: 154 VGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKL 213
Query: 195 LLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
L + C+S C Y + Y DGS TG AT+ ++ N P +GC
Sbjct: 214 L-------DKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNS---IPNLP--IGCG 261
Query: 255 DNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYC---LHSPYGSTGYITFGKP-DTVNK 310
+N G G +G++GL G +S+ S+ S F YC L S ST P D++
Sbjct: 262 HDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCLVNLDSDSSSTLEFNSNMPSDSLT- 320
Query: 311 KFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRF 365
+P+V + ++ + GISVGG+ LP+ + F + +DSGTII+R
Sbjct: 321 -----SPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRL 375
Query: 366 PAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDV 425
P+ VY +LR AF K GI +FDTCY+ S V VP I G L L
Sbjct: 376 PSDVYESLREAFVKLTSSLSPAPGIS-VFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPA 434
Query: 426 RGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
R L+ +++ CL F + + + ++G+ QQ+G V YD+ +GF C
Sbjct: 435 RNYLIMLDTAGTYCLAF--IKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 487
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 161 bits (407), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 121/370 (32%), Positives = 188/370 (50%), Gaps = 20/370 (5%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
+ + EY++ V IG P ++ SL+LDTGS + W QC PCI C +Q P++DP +S +F I
Sbjct: 187 LGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKESSSFENIT 246
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
C+ CK++ PP K ++ CPY Y D S TG +A + T+ NG +
Sbjct: 247 CHDPRCKLVSSPDPPKPC-KDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQ 305
Query: 247 YP---FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGST 297
+ GC N G +GA+G++GL RGP+S S+ Y F YCL +S +
Sbjct: 306 KHVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQSIYGHSFSYCLVDRNSDTSVS 365
Query: 298 GYITFGK-PDTVNKKFVKYTPIVTTPEQS--EFYHITLTGISVGGE--RLPLKASYFTKL 352
+ FG+ + ++ + +T V E S FY++ + I V GE ++P + + +K
Sbjct: 366 SKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLSKE 425
Query: 353 ---STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVP 409
T IDSGT +T F P Y ++ AF K++K Y++ +G L CY++S + + +P
Sbjct: 426 GGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGFPPL-KPCYNVSGIEKMELP 484
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
I F G + V + VCL P SI +GN QQ+ + + YD+
Sbjct: 485 DFGILFSDGAMWDFPVENYFIQIEPDLVCLAILGTPKSALSI-IGNYQQQNFHILYDMKK 543
Query: 470 RRLGFGPGNC 479
RLG+ P C
Sbjct: 544 SRLGYAPMKC 553
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 161 bits (407), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 129/418 (30%), Positives = 198/418 (47%), Gaps = 52/418 (12%)
Query: 89 LRRDQQRLHLKNSRRLQKAIP--DNFKKT-------KAFTFPAKTGIV--AADEYYIVVA 137
L RD R+ N R L++++ +F ++ + T P +G + EY +
Sbjct: 95 LTRDAARVQFLN-RNLERSLNGGTHFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIG 153
Query: 138 IGKPKQYVSLLLDTGSGITWTQCKPCIH---CSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
+G+P + L+ DTGS +TW QC+PC C +Q DP FDP S ++S + CNS CK+
Sbjct: 154 VGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKL 213
Query: 195 LLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
L + C+S C Y + Y DGS TG AT+ ++ N P +GC
Sbjct: 214 L-------DKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNS---IPNLP--IGCG 261
Query: 255 DNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVK 314
+N G G +G++GL G +S+ S+ S F YCL + + +F
Sbjct: 262 HDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL---------VNLDSDSSSTLEFNS 312
Query: 315 Y-------TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTII 362
Y +P+V + ++ + GISVGG+ LP+ + F + +DSGTII
Sbjct: 313 YMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTII 372
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
+R P+ VY +LR AF K GI +FDTCY+ S V VP I G L
Sbjct: 373 SRLPSDVYESLREAFVKLTSSLSPAPGIS-VFDTCYNFSGQSNVEVPTIAFVLSEGTSLR 431
Query: 423 LDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L R L+ +++ CL F + + + ++G+ QQ+G V YD+ +GF C
Sbjct: 432 LPARNYLIMLDTAGTYCLAF--IKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFSTNKC 487
>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 100/285 (35%), Positives = 153/285 (53%), Gaps = 19/285 (6%)
Query: 3 ILFKAFLLFIWLLRSSNNGAYANDNDL----SHSYIVSVSSLIPPTVCNRTRTALPQGPG 58
I FLL+ LL S A+ S + V ++SL+P +VC+ + P+G
Sbjct: 8 IFLLKFLLYSALLSSKRGLAFQGRKTALSTPSTLHNVHITSLMPSSVCSPS----PKGDD 63
Query: 59 K-VSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKA 117
K SLEV+ ++GPCSKL+Q K R +PS ++L +D+ R++ SR + K
Sbjct: 64 KRASLEVIHKHGPCSKLSQDKGR-SPSRTQMLDQDESRVNSIRSRLAKNPADGGKLKGSK 122
Query: 118 FTFPAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFD 175
T P+K+G + Y + V +G PK+ ++ + DTGS +TWTQC+PC +C Q++P F+
Sbjct: 123 VTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFN 182
Query: 176 PSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTI 235
PSKS +++ I C+S TC L CS+ C Y I Y D S GF+A D++ +
Sbjct: 183 PSKSTSYTNISCSSPTCDELKSG--TGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLAL 240
Query: 236 QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISK 280
+ FL GC NN G G +G++GL R +S++SK
Sbjct: 241 TSTD-----VFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLMSK 280
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 42/87 (48%), Positives = 57/87 (65%)
Query: 393 LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSIL 452
+ DTCYD S Y TV VPKI ++F G +++LD G + ++ QVCL FA + +
Sbjct: 289 ILDTCYDFSQYDTVDVPKINLYFSDGAEMDLDPSGIFYILNISQVCLAFAGNSDATDIAI 348
Query: 453 LGNVQQRGYEVHYDVAGRRLGFGPGNC 479
LGNVQQ+ ++V YDVAG R+GF PG C
Sbjct: 349 LGNVQQKTFDVVYDVAGGRIGFAPGGC 375
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 127/423 (30%), Positives = 199/423 (47%), Gaps = 43/423 (10%)
Query: 86 EEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQY 144
E +RRD RL + A T + + + + A Y + +++G P
Sbjct: 44 SEAVRRDGHRLAFLSYAATAAAGKATTTGTNSSSVNVQAQLENGAGAYNMNISLGTPPLD 103
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRD--PFFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
+++DTGS + W QC PC C + P P++S TFS++PCN + C ++ P +
Sbjct: 104 FPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNGSFC----QYLPTS 159
Query: 203 GQDKC--SSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTG 259
+ + ++ C Y+ Y GSG T G+ AT+ +T+ G+G F + F GC+ N
Sbjct: 160 SRPRTCNATAACAYNYTY--GSGYTAGYLATETLTV----GDGTFPKVAF--GCSTENGV 211
Query: 260 DQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGY--ITFGKPDTVNK-KFVKYT 316
D +SGI+GL RGP+S++S+ + F YCL S G I FG + + V+ T
Sbjct: 212 DN--SSGIVGLGRGPLSLVSQLAVGRFSYCLRSDMADGGASPILFGSLAKLTEGSVVQST 269
Query: 317 PIVTTP--EQSEFYHITLTGISVGGERLPLKASYF----TKL--STEIDSGTIITRFPAP 368
P++ P ++S Y++ LTGI+V LP+ S F T L T +DSGT +T
Sbjct: 270 PLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKD 329
Query: 369 VYSALRSAFRKRMKKYKM---GKGIEDLFDTCYDLSA---YKTVVVPKITIHFLGGVDLE 422
Y+ ++ AF+ +M G D CY SA K V VP++ + F GG
Sbjct: 330 GYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYN 389
Query: 423 LDVRGTLV-VESVRQ-----VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
+ V+ VE+ Q CL D ++GN+ Q + YD+ G F P
Sbjct: 390 VPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAP 449
Query: 477 GNC 479
+C
Sbjct: 450 ADC 452
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 141/449 (31%), Positives = 209/449 (46%), Gaps = 54/449 (12%)
Query: 50 RTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIP 109
AL +G G S++++ R P S L + RR R+
Sbjct: 23 EVALARG-GGFSVDLIHRDSPHSPFFDPSKTQAERLTDAFRRSVSRV------------- 68
Query: 110 DNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQ 168
F+ T + ++ IV +A EY + + IG P V ++DTGS +TWTQC+PC HC +
Sbjct: 69 GRFRPTAMTSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYK 128
Query: 169 QRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETG 226
Q P FDP S T+ C ++ C L G+D+ SKE C + +Y DGS G
Sbjct: 129 QVVPLFDPKNSSTYRDSSCGTSFCLAL-------GKDRSCSKEKKCTFRYSYADGSFTGG 181
Query: 227 FWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTGD-QNGASGIMGLDRGPVSIIS--KTN 282
A++ +T+ G +P F GC ++ G +SGI+GL G +S+IS K+
Sbjct: 182 NLASETLTVDSTAGKP--VSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKST 239
Query: 283 ISYFF-YCL---HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVG 338
I+ F YCL + + I FG V+ TP+V + FY++TL GISVG
Sbjct: 240 INGLFSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSPDT-FYYLTLEGISVG 298
Query: 339 GERLPLKASYFTKLSTE-----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED- 392
+RLP K Y K E +DSGT T P YS L + +K GK + D
Sbjct: 299 KKRLPYKG-YSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIK----GKRVRDP 353
Query: 393 --LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNS 450
+F CY+ +A + P IT HF ++EL T + VC F + P+
Sbjct: 354 NGIFSLCYNTTA--EINAPIITAHF-KDANVELQPLNTFMRMQEDLVC--FTVAPTSDIG 408
Query: 451 ILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ LGN+ Q + V +D+ +R+ F +C
Sbjct: 409 V-LGNLAQVNFLVGFDLRKKRVSFKAADC 436
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 125/417 (29%), Positives = 196/417 (47%), Gaps = 35/417 (8%)
Query: 85 LEEILRRDQQRLHLKN-SRRLQKAIPDNFKKTKAFTFPAKT--GIVAADEYYIVVAIGKP 141
+ + LRRD R ++ R + + ++ +T T A+T + EY + +AIG P
Sbjct: 65 VRDALRRDMHRQRSRSFGRDRDRELAESDGRT---TVSARTRKDLPNGGEYLMTLAIGTP 121
Query: 142 KQYVSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFP 200
+ + DTGS + WTQC PC C +Q P ++P+ S TFS +PCNS+
Sbjct: 122 PLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNSSLSMCAGALAG 181
Query: 201 PNGQDKCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYP-FLLGCTDNNT 258
C+ C Y+ Y G+G T G ++ T + AR P GC++ ++
Sbjct: 182 AAPPPGCA---CMYNQTY--GTGWTAGVQGSETFTFGSSAADQ--ARVPGVAFGCSNASS 234
Query: 259 GDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY---GSTGYITFGKPDTVNKKFVKY 315
D NG++G++GL RG +S++S+ F YCL +P+ ST + G +N V+
Sbjct: 235 SDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCL-TPFQDTNSTSTLLLGPSAALNGTGVRS 293
Query: 316 TPIVTTPEQ---SEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFPA 367
TP V +P + S +Y++ LTGIS+G + LP+ F+ IDSGT IT
Sbjct: 294 TPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTITSLAN 353
Query: 368 PVYSALRSAFRKRMKKYKMGKGIEDL-FDTCYDLSAYKT---VVVPKITIHFLGGVDLEL 423
Y +R+A + + G + D C+ L A + V+P +T+HF G D+ L
Sbjct: 354 AAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHF-DGADMVL 412
Query: 424 DVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
++ S CL +D GN QQ+ + YDV L F P C+
Sbjct: 413 PADSYMISGS-GVWCLAMR-NQTDGAMSTFGNYQQQNMHILYDVREETLSFAPAKCS 467
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 121/375 (32%), Positives = 180/375 (48%), Gaps = 34/375 (9%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
V + EY + V +G P + +++DTGS + W QC PC+ C QR P FDP S ++ +
Sbjct: 145 VGSGEYLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMASTSYRNVT 204
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE---CPYDIAYVDGSGETGFWATDRMTI-------Q 236
C T C ++ PP C S CPY Y D S TG A + T+ +
Sbjct: 205 CGDTRCGLVS---PPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASSSR 261
Query: 237 EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSP 293
V+G +LGC N G +GA+G++GL RGP+S S+ Y F YCL
Sbjct: 262 RVDG--------VVLGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHAFSYCLVDH 313
Query: 294 YGSTGY-ITFGKPDT-VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-- 349
+ G I FG + ++ + YT + ++ FY++ L GI VGGE L + ++ +
Sbjct: 314 GSAVGSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGV 373
Query: 350 ----TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT 405
T IDSGT ++ FP P Y A+R AF RM K + CY++S +
Sbjct: 374 SKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVER 433
Query: 406 VVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVH 464
V VP+ ++ F G + + +++ +CL P SI +GN QQ+ + V
Sbjct: 434 VEVPEFSLLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSAMSI-IGNYQQQNFHVL 492
Query: 465 YDVAGRRLGFGPGNC 479
YD+ RLGF P C
Sbjct: 493 YDLHHNRLGFAPRRC 507
>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 421
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 147/454 (32%), Positives = 207/454 (45%), Gaps = 91/454 (20%)
Query: 37 VSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRL 96
VSSL+P C+ + QG L + +YGPCS G S+ PS +EI RD+ R+
Sbjct: 46 VSSLLPKNKCSASARGGSQG-----LPITQKYGPCS--GSGHSQ-PPSPQEIFGRDESRV 97
Query: 97 HLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIV-VAIGKPKQYVSLLLDTGSGI 155
NS+ + N K + D ++V VA G P Q L+LDTGS I
Sbjct: 98 SFINSK-CNQYTSGNLKN-----HAHNNNLFDEDGNFLVDVAFGTPPQNFMLILDTGSSI 151
Query: 156 TWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYD 215
TWTQCK C++C Q +F+ S S T+S C T E Y+
Sbjct: 152 TWTQCKACVNCLQDSHRYFNWSASSTYSSGSCIPGTV------------------ENNYN 193
Query: 216 IAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-QNGASGIMGLDRGP 274
+ Y D S G + D MT++ + F ++ F GC NN GD +G G++GL +G
Sbjct: 194 MTYGDDSTSVGNYGCDTMTLEPSD---VFQKFQF--GCGRNNKGDFGSGVDGMLGLGQGQ 248
Query: 275 VSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTP---EQSEFY 328
+S +S+T + F YCL S G + FG+ T +K+T +V P ++S +Y
Sbjct: 249 LSTVSQTASKFNKVFSYCLPEE-DSIGSLLFGEKATSQSSSLKFTSLVNGPGTLQESGYY 307
Query: 329 HITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGK 388
+ L+ ISVG ERL + +S F T IDS T+ITR P YSAL++AF+K M KY +
Sbjct: 308 FVNLSDISVGNERLNIPSSVFASPGTIIDSRTVITRLPQRAYSALKAAFKKAMAKYPLSN 367
Query: 389 GIE---DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLP 445
G D+ DTCY+ P++TI
Sbjct: 368 GRRKKGDILDTCYNXXX---XXXPELTI-------------------------------- 392
Query: 446 SDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+GN QQ V YD+ G R+GF C
Sbjct: 393 -------IGNRQQLSLTVLYDIQGGRIGFRSNGC 419
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 135/415 (32%), Positives = 187/415 (45%), Gaps = 36/415 (8%)
Query: 87 EILRRDQQRLHLKN-----SRRLQKAIPDNFKKTKAFTFPAKTGIVAAD------EYYIV 135
+++ RD + N S+RL+ AI + + FT T D EY +
Sbjct: 34 DLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEKDNTPQPQIDLTSNSGEYLMN 93
Query: 136 VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
V+IG P + + DTGS + WTQC PC C Q DP FDP S T+ + C+S+ C L
Sbjct: 94 VSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTAL 153
Query: 196 LEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGC 253
Q CS+ + C Y ++Y D S G A D +T+ + + ++GC
Sbjct: 154 ------ENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKN-IIIGC 206
Query: 254 TDNNTGDQN-GASGIMGLDRGPVSIISKTNISY---FFYC---LHSPYGSTGYITFGKPD 306
NN G N SGI+GL GPVS+I + S F YC L S T I FG
Sbjct: 207 GHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNA 266
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE--IDSGTIITR 364
V+ V TP++ Q FY++TL ISVG +++ S IDSGT +T
Sbjct: 267 IVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTL 326
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELD 424
P YS L A + K + + CY SA + VP IT+HF G D++LD
Sbjct: 327 LPTEFYSELEDAVASSIDAEKK-QDPQSGLSLCY--SATGDLKVPVITMHF-DGADVKLD 382
Query: 425 VRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
V S VC F P+ + GNV Q + V YD + + F P +C
Sbjct: 383 SSNAFVQVSEDLVCFAFR---GSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDC 434
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 135/415 (32%), Positives = 187/415 (45%), Gaps = 36/415 (8%)
Query: 87 EILRRDQQRLHLKN-----SRRLQKAIPDNFKKTKAFTFPAKTGIVAAD------EYYIV 135
+++ RD + N S+RL+ AI + + FT T D EY +
Sbjct: 34 DLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEKDNTPQPQIDLTSNSGEYLMN 93
Query: 136 VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
V+IG P + + DTGS + WTQC PC C Q DP FDP S T+ + C+S+ C L
Sbjct: 94 VSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTAL 153
Query: 196 LEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGC 253
Q CS+ + C Y ++Y D S G A D +T+ + + ++GC
Sbjct: 154 ------ENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKN-IIIGC 206
Query: 254 TDNNTGDQN-GASGIMGLDRGPVSIISKTNISY---FFYC---LHSPYGSTGYITFGKPD 306
NN G N SGI+GL GPVS+I + S F YC L S T I FG
Sbjct: 207 GHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNA 266
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE--IDSGTIITR 364
V+ V TP++ Q FY++TL ISVG +++ S IDSGT +T
Sbjct: 267 IVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTL 326
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELD 424
P YS L A + K + + CY SA + VP IT+HF G D++LD
Sbjct: 327 LPTEFYSELEDAVASSIDAEKK-QDPQSGLSLCY--SATGDLKVPVITMHF-DGADVKLD 382
Query: 425 VRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
V S VC F P+ + GNV Q + V YD + + F P +C
Sbjct: 383 SSNAFVQVSEDLVCFAFR---GSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDC 434
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 135/440 (30%), Positives = 197/440 (44%), Gaps = 63/440 (14%)
Query: 61 SLEVLGRYGPCSKLNQGKS-RNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFT 119
+L+V Y PCS K + S+ ++ +DQ RL +S +K++
Sbjct: 33 NLQVFHVYSPCSPFWPSKPLKWEESVLQMQAKDQARLQFLSSLVARKSV----------- 81
Query: 120 FPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPS 177
P +G IV + Y + IG P Q + L +DT + W C C+ CS F+
Sbjct: 82 VPIASGRQIVQSPTYIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGCSST---VFNNV 138
Query: 178 KSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE 237
KS TF + C + CK + PN KC C +++ Y S + D +T+
Sbjct: 139 KSTTFKTVGCEAPQCKQV-----PN--SKCGGSACAFNMTYGSSSIAANL-SQDVVTL-- 188
Query: 238 VNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSP- 293
Y F GC TG G++GL RGP+S++S+T Y F YCL S
Sbjct: 189 --ATDSIPSYTF--GCLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPSFR 244
Query: 294 ----YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKAS 347
GS G+P K +K TP++ P +S Y++ L I VG +P A
Sbjct: 245 SLNFSGSLRLGPVGQP-----KRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSAL 299
Query: 348 YF---TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--FDTCYDLSA 402
F T T DSGT+ TR AP Y+A+R AFRKR+ + L FDTCY
Sbjct: 300 AFNPTTGAGTIFDSGTVFTRLVAPAYTAVRDAFRKRVGNAT----VTSLGGFDTCYT--- 352
Query: 403 YKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQR 459
+V P IT F G+++ L L+ + + CL A P + NS+L + N+QQ+
Sbjct: 353 -SPIVAPTITFMF-SGMNVTLPPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQ 410
Query: 460 GYEVHYDVAGRRLGFGPGNC 479
+ + +DV RLG C
Sbjct: 411 NHRILFDVPNSRLGVAREPC 430
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 130/417 (31%), Positives = 191/417 (45%), Gaps = 32/417 (7%)
Query: 84 SLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAK-TGIVAADEYYIVVAIGKPK 142
S E+L R R +++R L + A P T V EY + +AIG P
Sbjct: 68 STRELLHRMAARSKARSARLLSG------RAASARVDPGSYTDGVPDTEYLVHMAIGTPP 121
Query: 143 QYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
Q V L+LDTGS +TWTQC PC+ C +Q P F+PS+S TFS +PC+ C+ L W
Sbjct: 122 QPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRD-LTW-SSC 179
Query: 203 GQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL-LGCTDNNTGD- 260
G+ + C Y AY D S TG +D + + A P L GC N G
Sbjct: 180 GEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIF 239
Query: 261 QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITF-GKPDTV-------NKKF 312
+ +GI G RG +S+ ++ + F YC + GS F G P +
Sbjct: 240 VSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGV 299
Query: 313 VKYTPIVT-TPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFP 366
V+ T ++ Q + Y+I+L G++VG RLP+ S F T +DSGT +T P
Sbjct: 300 VQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLP 359
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGG-VDLELDV 425
VY+ + AF + K + L C+ + VP + +HF G +DL +
Sbjct: 360 EAVYNLVCDAFVAQ-TKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPREN 418
Query: 426 RGTLVVES--VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ E+ +R CL + + + ++GN QQ+ V YD+A L F P CN
Sbjct: 419 YMFEIEEAGGIRLTCLA---INAGEDLSVIGNFQQQNMHVLYDLANDMLSFVPARCN 472
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 116/363 (31%), Positives = 180/363 (49%), Gaps = 31/363 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + +++G P Q S ++DTGS + W QC PC C +Q DP F P S ++S C +
Sbjct: 7 EYVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNASCTDS 66
Query: 191 TCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C L + CS + C Y +Y DGS G +A + +T+ NG+ AR F
Sbjct: 67 LCDAL-------PRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTL---NGS-TLARIGF 115
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL--HSPYGSTGYITFGK 304
GC N G GA G++GL +GP+S+ S+ N S+ F YCL S G+ ITFG
Sbjct: 116 --GCGHNQEGTFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTFSPITFG- 172
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEI-----DSG 359
+ +TP++ + +Y++ + ISVG R+P S F + + DSG
Sbjct: 173 -NAAENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSG 231
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAY--KTVVVPKITIHFLG 417
T IT + + + + R+++ Y + CYD+S+ ++ +P +T+H L
Sbjct: 232 TTITYWRLAAFIPILAELRRQI-SYPEADPTPYGLNLCYDISSVSASSLTLPSMTVH-LT 289
Query: 418 GVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
VD E+ V V+ + A+ SD SI +GNVQQ+ + DVA R+GF
Sbjct: 290 NVDFEIPVSNLWVLVDNFGETVCTAMSTSDQFSI-IGNVQQQNNLIVTDVANSRVGFLAT 348
Query: 478 NCN 480
+C+
Sbjct: 349 DCS 351
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 168/358 (46%), Gaps = 38/358 (10%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + + IG P + +LDTGS WTQC PC+HC Q P FDPSKS TF +I C++
Sbjct: 64 EYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDT- 122
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
CPY++ Y S G T+ +TI +G F +
Sbjct: 123 -----------------HDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQP-FVMPETI 164
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDT 307
+GC NN+G + G +G++GLDRGP S+I++ Y YC T I FG
Sbjct: 165 IGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGK--GTSKINFGANAI 222
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL--STEIDSGTIITRF 365
V V T + + FY++ L +SVG R+ + F L + IDSG+ +T F
Sbjct: 223 VAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGSTLTYF 282
Query: 366 PAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV-VVPKITIHFLGGVDLELD 424
P + +R A + + + + D+ CY KT+ + P IT+HF GG DL LD
Sbjct: 283 PESYCNLVRKAVEQVVTAVRFPR--SDIL--CY---YSKTIDIFPVITMHFSGGADLVLD 335
Query: 425 VRGTLVVESVRQV-CLGFALLPSDP-NSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
V + V CL A++ + P + GN Q + V YD + + F P NC+
Sbjct: 336 KYNMYVASNTGGVFCL--AIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 391
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 129/420 (30%), Positives = 194/420 (46%), Gaps = 36/420 (8%)
Query: 80 RNTPSLEEILRRDQQR-----------LHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVA 128
RN+ S E ++ RD + H+ N+ R + K P T V
Sbjct: 25 RNSFSFE-LIHRDSSKSPLYKPAQNKFQHVVNAARRSINRANRLFKDSLSNTPESTVYVN 83
Query: 129 ADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCN 188
EY + ++G P V ++DTGS I W QCKPC C +Q P F+PSKS ++ IPC+
Sbjct: 84 GGEYLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTPIFNPSKSSSYKNIPCS 143
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
S C+ + + N Q+ C Y I + D S G + + +T+ G+ +P
Sbjct: 144 SNLCQS-VRYTSCNKQNSCE-----YTINFSDQSYSQGELSVETLTLDSTTGHS--VSFP 195
Query: 249 -FLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISY---FFYCLHS---PYGSTGYI 300
++GC NN G Q SGI+GL GPVS+ ++ S F YCL T +
Sbjct: 196 KTVIGCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSIGGKFSYCLLPLLVDSNKTSKL 255
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEI-DSG 359
FG V+ V TP V Q+ FY++TL SVG +R+ + ++ I DSG
Sbjct: 256 NFGDAAVVSGDGVVSTPFVKKDPQA-FYYLTLEAFSVGNKRIEFEVLDDSEEGNIILDSG 314
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
T +T P+ VY+ L SA + +K ++ L + CY +++ P IT HF G
Sbjct: 315 TTLTLLPSHVYTNLESAVAQLVKLDRVDDP-NQLLNLCYSITS-DQYDFPIITAHF-KGA 371
Query: 420 DLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
D++L+ T + VCL F + P + GN+ Q V YD+ + F P +C
Sbjct: 372 DIKLNPISTFAHVADGVVCLAFTSSQTGP---IFGNLAQLNLLVGYDLQQNIVSFKPSDC 428
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 126/413 (30%), Positives = 195/413 (47%), Gaps = 29/413 (7%)
Query: 80 RNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIG 139
R++P + + H ++ R ++F K + P T I Y + ++G
Sbjct: 35 RDSPKSPYYKPTENKYQHFVDAARRSINRANHFFKDSDTSTPESTVIPDRGGYLMTYSVG 94
Query: 140 KPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWF 199
P + + DTGS I W QC+PC C Q P F+PSKS ++ IPC S C + +
Sbjct: 95 TPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCLSKLCHSVRDT- 153
Query: 200 PPNGQDKCSSK-ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNN 257
CS + C Y I+Y D S G + D ++++ +G+ +P ++GC +N
Sbjct: 154 ------SCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSP--VSFPKTVIGCGTDN 205
Query: 258 TGDQNGA-SGIMGLDRGPVSIISKTNISY---FFYC----LHSPYGSTGYITFGKPDTVN 309
G GA SGI+GL GPVS+I++ S F YC L+ ++ ++FG V+
Sbjct: 206 AGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFGDAAVVS 265
Query: 310 KKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF---TKLSTEIDSGTIITRFP 366
V TP++ + FY +TL SVG +R+ S + + IDSGT +T P
Sbjct: 266 GDGVVSTPLIK--KDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGTTLTLIP 323
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVR 426
+ VY+ L SA +K ++ + F CY L + + P IT HF G D+EL
Sbjct: 324 SDVYTNLESAVVDLVKLDRVDDPNQQ-FSLCYSLKSNE-YDFPIITAHF-KGADIELHSI 380
Query: 427 GTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
T V + VC FA PS + GN+ Q+ V YD+ + + F P +C
Sbjct: 381 STFVPITDGIVC--FAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTVSFKPTDC 431
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 128/405 (31%), Positives = 195/405 (48%), Gaps = 40/405 (9%)
Query: 96 LHL--KNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVA-IGKPKQYVSLLLDTG 152
LH+ +S RL K K + P +G Y+V A +G P Q + ++LDT
Sbjct: 65 LHMASSDSHRLTYLSSLVAGKPKPTSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTS 124
Query: 153 SGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKEC 212
+ W C C CS F + S T+S + C++ C P+ + S C
Sbjct: 125 NDAVWLPCSGCSGCSNASTSFNT-NSSSTYSTVSCSTAQCTQARGLTCPSSSPQPS--VC 181
Query: 213 PYDIAYVDGSGETGFWATDRMTIQ-EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLD 271
++ +Y S + D +T+ +V N F GC ++ +G+ G+MGL
Sbjct: 182 SFNQSYGGDSSFSASLVQDTLTLAPDVIPN-------FSFGCINSASGNSLPPQGLMGLG 234
Query: 272 RGPVSIISKTNISY---FFYCLHSP-----YGSTGYITFGKPDTVNKKFVKYTPIVTTPE 323
RGP+S++S+T Y F YCL S GS G+P K ++YTP++ P
Sbjct: 235 RGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQP-----KSIRYTPLLRNPR 289
Query: 324 QSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAFR 378
+ Y++ LTG+SVG ++P+ Y T T IDSGT+ITRF PVY A+R FR
Sbjct: 290 RPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFR 349
Query: 379 KRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV- 437
K++ FDTC+ SA V PKIT+H + +DL+L + TL+ S +
Sbjct: 350 KQVNVSSFST--LGAFDTCF--SADNENVAPKITLH-MTSLDLKLPMENTLIHSSAGTLT 404
Query: 438 CLGFALLPSDPNSIL--LGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
CL A + + N++L + N+QQ+ + +DV R+G P CN
Sbjct: 405 CLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPCN 449
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 168/358 (46%), Gaps = 38/358 (10%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + + IG P + +LDTGS WTQC PC+HC Q P FDPSKS TF +I C++
Sbjct: 58 EYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDT- 116
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
CPY++ Y S G T+ +TI +G F +
Sbjct: 117 -----------------HDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQP-FVMPETI 158
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDT 307
+GC NN+G + G +G++GLDRGP S+I++ Y YC T I FG
Sbjct: 159 IGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGK--GTSKINFGANAI 216
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL--STEIDSGTIITRF 365
V V T + + FY++ L +SVG R+ + F L + IDSG+ +T F
Sbjct: 217 VAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGSTLTYF 276
Query: 366 PAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV-VVPKITIHFLGGVDLELD 424
P + +R A + + + + D+ CY KT+ + P IT+HF GG DL LD
Sbjct: 277 PESYCNLVRKAVEQVVTAVRFPR--SDIL--CY---YSKTIDIFPVITMHFSGGADLVLD 329
Query: 425 VRGTLVVESVRQV-CLGFALLPSDP-NSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
V + V CL A++ + P + GN Q + V YD + + F P NC+
Sbjct: 330 KYNMYVASNTGGVFCL--AIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 385
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 126/432 (29%), Positives = 192/432 (44%), Gaps = 41/432 (9%)
Query: 62 LEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFP 121
L V+ Y CS P +E + K+ RL+ +KT A
Sbjct: 35 LSVIPIYSKCSPF-------VPPKQESWVNTVITMASKDPERLKYLSTLADQKTTAVPIA 87
Query: 122 AKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKT 181
++ Y + V +G P Q + ++LDT + W C C CS F P+ S T
Sbjct: 88 PGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTT---FLPNASTT 144
Query: 182 FSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGN 241
+ C+ C + + P S C ++ +Y S T D +T+
Sbjct: 145 LGSLDCSGAQCSQVRGFSCP----ATGSSACLFNQSYGGDSSLTATLVQDAITLANDVIP 200
Query: 242 GYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS--PYGS 296
G F GC + +G G++GL RGP+S+IS+ Y F YCL S Y
Sbjct: 201 G------FTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYF 254
Query: 297 TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-----TK 351
+G + G K ++ TP++ P + Y++ LTG+SVG ++P+ + T
Sbjct: 255 SGSLKLGP--VGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTG 312
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
T IDSGT+ITRF PVY A+R FRK++ G FDTC+ +A P I
Sbjct: 313 AGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSLGA---FDTCF--AATNEAEAPAI 367
Query: 412 TIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRGYEVHYDVA 468
T+HF G++L L + +L+ S + CL A P++ NS+L + N+QQ+ + +D
Sbjct: 368 TLHF-EGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTT 426
Query: 469 GRRLGFGPGNCN 480
RLG CN
Sbjct: 427 NSRLGIARELCN 438
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 112/356 (31%), Positives = 163/356 (45%), Gaps = 35/356 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + + +G P + ++DTGS ITWTQC PC+HC +Q P FDPSKS TF
Sbjct: 380 YLMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSSTFK-------- 431
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
+ +C CPY++ Y D + G ATD +TI +G F ++
Sbjct: 432 ------------EKRCHDHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEP-FVMAETII 478
Query: 252 GCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTV 308
GC NN+ + G +GL+ GP+S+I++ Y YC T I FG V
Sbjct: 479 GCGRNNSWFRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAG--NGTSKINFGTNAIV 536
Query: 309 NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE--IDSGTIITRFP 366
V T + T + FY++ L +SVG R+ + F L IDSGT +T FP
Sbjct: 537 GGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSGTTLTYFP 596
Query: 367 APVYSALRSAFRKRMKKYKMGKGI-EDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDV 425
+ +R A + DL CY + T + P IT+HF GG DL LD
Sbjct: 597 ESYCNLVRQAVEHVVPAVPAADPTGNDLL--CY--YSNTTEIFPVITMHFSGGADLVLD- 651
Query: 426 RGTLVVESVRQVCLGFALLPSDP-NSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ + +ES A++ ++P + GN Q + V YD + + F P NC+
Sbjct: 652 KYNMFMESYSGGLFCLAIICNNPTQEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 707
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 105/342 (30%), Positives = 159/342 (46%), Gaps = 53/342 (15%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + + IG P V +LDTGS + WTQC PC+HC Q+ P FDPSKS TF + CN+
Sbjct: 64 EYLMKLQIGTPPFEVEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCNT- 122
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
CPY + Y D S G AT+ +TI +G F +
Sbjct: 123 -----------------PDHSCPYKLVYDDKSYTQGTLATETVTIHSTSGVP-FVMPETI 164
Query: 251 LGCTDNNTGD--QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTV 308
+GC+ NN+G + +SGI+GL RG +S+IS+ +Y
Sbjct: 165 IGCSRNNSGSGFRPSSSGIVGLSRGSLSLISQMGGAY----------------------P 202
Query: 309 NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE--IDSGTIITRFP 366
V T T ++ ++Y + L +SVG R+ + F L+ IDSGT +T FP
Sbjct: 203 GDGVVSTTMFAKTAKRGQYY-LNLDAVSVGDTRIETVGTPFHALNGNIVIDSGTPLTYFP 261
Query: 367 APVYSALRSAFRKRMKKYK-MGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDV 425
+ +R A + + + + D+ CY + + + P IT+HF GG DL LD
Sbjct: 262 VSYCNLVRKAVERVVTADRVVDPSRNDML--CYYSNTIE--IFPVITVHFSGGADLVLD- 316
Query: 426 RGTLVVESVRQVCLGFALLPSDPNSI-LLGNVQQRGYEVHYD 466
+ + +E R A++ ++P + + GN Q + V YD
Sbjct: 317 KYNMYMELNRGGVFCLAIICNNPTQVAIFGNRAQNNFLVGYD 358
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 123/385 (31%), Positives = 188/385 (48%), Gaps = 38/385 (9%)
Query: 114 KTKAFTFPAKTGIVAADEYYIVVA-IGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP 172
K K + P +G Y+V A +G P Q + ++LDT + W C C CS
Sbjct: 11 KPKPTSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTS 70
Query: 173 FFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDR 232
F + S T+S + C++ C P+ + S C ++ +Y S + D
Sbjct: 71 FNT-NSSSTYSTVSCSTAQCTQARGLTCPSSSPQPS--VCSFNQSYGGDSSFSASLVQDT 127
Query: 233 MTIQ-EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFY 288
+T+ +V N F GC ++ +G+ G+MGL RGP+S++S+T Y F Y
Sbjct: 128 LTLAPDVIPN-------FSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSY 180
Query: 289 CLHSP-----YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLP 343
CL S GS G+P K ++YTP++ P + Y++ LTG+SVG ++P
Sbjct: 181 CLPSFRSFYFSGSLKLGLLGQP-----KSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVP 235
Query: 344 LKASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCY 398
+ Y T T IDSGT+ITRF PVY A+R FRK++ FDTC+
Sbjct: 236 VDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFST--LGAFDTCF 293
Query: 399 DLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGN 455
SA V PKIT+H + +DL+L + TL+ S + CL A + + N++L + N
Sbjct: 294 --SADNENVAPKITLH-MTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIAN 350
Query: 456 VQQRGYEVHYDVAGRRLGFGPGNCN 480
+QQ+ + +DV R+G P CN
Sbjct: 351 LQQQNLRILFDVPNSRIGIAPEPCN 375
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 112/358 (31%), Positives = 174/358 (48%), Gaps = 19/358 (5%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTFSKI 185
V Y + +G P + +++DTGS +TW QC PC + C +Q P F+P S +++ +
Sbjct: 116 VGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASV 175
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C++ C L N +S C Y +Y D S G+ + D ++ G +
Sbjct: 176 SCSAPQCDALTTATL-NPSTCSTSNVCIYQASYGDSSFSVGYLSKDTVSF------GSTS 228
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITF 302
F GC +N G ++G++GL R +S++ + S F YCL + S+ +
Sbjct: 229 VPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPT---SSSSSGY 285
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTII 362
+ N YTP+ + Y I +TGI+V G+ L + AS ++ L T IDSGT+I
Sbjct: 286 LSIGSYNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYSSLPTIIDSGTVI 345
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
TR P VYSAL A MK + DTC+ A + + VP++++ F GG L+
Sbjct: 346 TRLPTDVYSALSKAVAGAMKGTPRASAFS-ILDTCFQGQASR-LRVPQVSMAFAGGAALK 403
Query: 423 LDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L LV CL FA S + ++GN QQ+ + V YDV ++GF G C+
Sbjct: 404 LKATNLLVDVDSATTCLAFAPARS---AAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 458
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 114/371 (30%), Positives = 181/371 (48%), Gaps = 45/371 (12%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQC-KPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
Y + +AIG P ++ +LDTGS + WTQC PC C Q P + P++S T++ + C S
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 191 TCKILLE-WFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTI---QEVNGNGYF 244
C+ L W +CS + C Y +Y DG+ G AT+ T+ V G +
Sbjct: 152 MCQALQSPW------SRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAF- 204
Query: 245 ARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGY--ITF 302
GC N G + +SG++G+ RGP+S++S+ ++ F YC +P+ +T +
Sbjct: 205 -------GCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCF-TPFNATAASPLFL 256
Query: 303 GKPDTVNKKFVKYTPIVTTP-----EQSEFYHITLTGISVGGERLPLKASYFTKLS---- 353
G ++ K TP V +P +S +Y+++L GI+VG LP+ + F +L+
Sbjct: 257 GSSARLSSA-AKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVF-RLTPMGD 314
Query: 354 --TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
IDSGT T + AL A R+ + + G C+ ++ + V VP++
Sbjct: 315 GGVIIDSGTTFTALEESAFVALARALASRV-RLPLASGAHLGLSLCFAAASPEAVEVPRL 373
Query: 412 TIHFLGGVDLELDVRGTLVVE--SVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
+HF G D+EL R + VVE S CLG S +LG++QQ+ + YD+
Sbjct: 374 VLHF-DGADMELR-RESYVVEDRSAGVACLGMV---SARGMSVLGSMQQQNTHILYDLER 428
Query: 470 RRLGFGPGNCN 480
L F P C
Sbjct: 429 GILSFEPAKCG 439
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 143/504 (28%), Positives = 236/504 (46%), Gaps = 51/504 (10%)
Query: 19 NNGAYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQ-GPGKVSLEVLGRYGPCSKLNQG 77
N+ +++N L HS V ++ + A P P K S+++ ++ SK +
Sbjct: 61 NDCSFSNSEQLGHS----VPTMTSGEETDEESEAFPAPKPHKNSVKLHLKHRSGSKGAEP 116
Query: 78 KS-------RNTPSLEEILRR---DQQRLHLKNSRRLQKAIPDN-----FKKTKAFTFPA 122
K+ R+ ++ + RR ++ + + +RLQK P F + T P
Sbjct: 117 KNSVIDSTVRDLTRIQNLHRRVIENRNQNTISRLQRLQKEQPKQSFKPVFAPAASSTSPV 176
Query: 123 KTGIVA---------ADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPF 173
+VA + EY++ V +G P ++ SL+LDTGS + W QC PCI C +Q P+
Sbjct: 177 SGQLVATLESGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPY 236
Query: 174 FDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRM 233
+DP S +F I C+ C+++ PPN K ++ CPY Y DGS TG +A +
Sbjct: 237 YDPKDSSSFRNISCHDPRCQLVSSPDPPN-PCKAENQSCPYFYWYGDGSNTTGDFALETF 295
Query: 234 TIQEVNGNGYFAR---YPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FF 287
T+ NG + GC N G +GA+G++GL +GP+S S+ Y F
Sbjct: 296 TVNLTTPNGKSELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFS 355
Query: 288 YCL---HSPYGSTGYITFGK-PDTVNKKFVKYTPIVTTPEQS--EFYHITLTGISVGGER 341
YCL +S + + FG+ + ++ + +T + S FY++ + + V E
Sbjct: 356 YCLVDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEV 415
Query: 342 LPLKASYFTKLSTE------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFD 395
L + + LS+E IDSGT +T F P Y ++ AF +++K Y++ +G+ L
Sbjct: 416 LKIPEETW-HLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPL-K 473
Query: 396 TCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGN 455
CY++S + + +P I F G V + VCL P SI +GN
Sbjct: 474 PCYNVSGIEKMELPDFGILFADGAVWNFPVENYFIQIDPDVVCLAILGNPRSALSI-IGN 532
Query: 456 VQQRGYEVHYDVAGRRLGFGPGNC 479
QQ+ + + YD+ RLG+ P C
Sbjct: 533 YQQQNFHILYDMKKSRLGYAPMKC 556
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 125/376 (33%), Positives = 170/376 (45%), Gaps = 42/376 (11%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
V EY + +AIG P Q V L LDTGS + WTQC+PC C Q P+FDPS S T S
Sbjct: 77 VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTS 136
Query: 187 CNSTTCKIL------LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNG 240
C+ST C+ L F PN + C Y +Y D S TGF D+ T V
Sbjct: 137 CDSTLCQGLPVASCGSPKFWPN-------QTCVYTYSYGDKSVTTGFLEVDKFTF--VGA 187
Query: 241 NGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYI 300
F G +N N +GI G RGP+S+ S+ + F +C + G
Sbjct: 188 GASVPGVAFGCGLFNNGVFKSN-ETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGL---- 242
Query: 301 TFGKPDTV-----------NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF 349
KP TV + V+ TP++ P FY+++L GI+VG RLP+ S F
Sbjct: 243 ---KPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEF 299
Query: 350 TKLS----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT 405
T + T IDSGT +T P VY +R AF ++K + D + C
Sbjct: 300 TLKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPY-FCLSAPLRAK 358
Query: 406 VVVPKITIHFLGG-VDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVH 464
VPK+ +HF G +DL + VE L A++ +GN QQ+ V
Sbjct: 359 PYVPKLVLHFEGATMDLPRE-NYVFEVEDAGSSILCLAIIEGG-EVTTIGNFQQQNMHVL 416
Query: 465 YDVAGRRLGFGPGNCN 480
YD+ +L F P C+
Sbjct: 417 YDLQNSKLSFVPAQCD 432
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 123/372 (33%), Positives = 185/372 (49%), Gaps = 23/372 (6%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
+ + EY++ V IG P ++ SL+LDTGS + W QC PC C Q P++DP +S +F I
Sbjct: 187 LGSGEYFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPYYDPKESSSFKNIG 246
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVN--GNGYF 244
C+ C ++ PP K ++ CPY Y D S TG +A + T+ + G F
Sbjct: 247 CHDPRCHLVSSPDPPQ-PCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEF 305
Query: 245 ARYP-FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGST 297
R + GC N G +GA+G++GL RGP+S S+ Y F YCL +S +
Sbjct: 306 KRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVS 365
Query: 298 GYITFGK-PDTVNKKFVKYTPIVTTPEQ--SEFYHITLTGISVGGERLPLKASYFTKLST 354
+ FG+ D +N V +T +V E FY++ + I VGGE L + + LS
Sbjct: 366 SKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPEETW-HLSP 424
Query: 355 E------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVV 408
E +DSGT ++ F P Y ++ AF K++K Y + K + D CY++S + + +
Sbjct: 425 EGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFP-ILDPCYNVSGVEKMEL 483
Query: 409 PKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
P+ I F G V + +E VCL P SI +GN QQ+ + + YD
Sbjct: 484 PEFRILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSALSI-IGNYQQQNFHILYDT 542
Query: 468 AGRRLGFGPGNC 479
RLG+ P C
Sbjct: 543 KKSRLGYAPMKC 554
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 158 bits (400), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 114/371 (30%), Positives = 181/371 (48%), Gaps = 45/371 (12%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQC-KPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
Y + +AIG P ++ +LDTGS + WTQC PC C Q P + P++S T++ + C S
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 191 TCKILLE-WFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTI---QEVNGNGYF 244
C+ L W +CS + C Y +Y DG+ G AT+ T+ V G +
Sbjct: 152 MCQALQSPW------SRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAF- 204
Query: 245 ARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGY--ITF 302
GC N G + +SG++G+ RGP+S++S+ ++ F YC +P+ +T +
Sbjct: 205 -------GCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCF-TPFNATAASPLFL 256
Query: 303 GKPDTVNKKFVKYTPIVTTP-----EQSEFYHITLTGISVGGERLPLKASYFTKLS---- 353
G ++ K TP V +P +S +Y+++L GI+VG LP+ + F +L+
Sbjct: 257 GSSARLSSA-AKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVF-RLTPMGD 314
Query: 354 --TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
IDSGT T + AL A R+ + + G C+ ++ + V VP++
Sbjct: 315 GGVIIDSGTTFTALEERAFVALARALASRV-RLPLASGAHLGLSLCFAAASPEAVEVPRL 373
Query: 412 TIHFLGGVDLELDVRGTLVVE--SVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
+HF G D+EL R + VVE S CLG S +LG++QQ+ + YD+
Sbjct: 374 VLHF-DGADMELR-RESYVVEDRSAGVACLGMV---SARGMSVLGSMQQQNTHILYDLER 428
Query: 470 RRLGFGPGNCN 480
L F P C
Sbjct: 429 GILSFEPAKCG 439
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 121/369 (32%), Positives = 176/369 (47%), Gaps = 39/369 (10%)
Query: 125 GIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSK 184
GIV + Y + IG P Q + + LDT + W C C+ CS FDPSKS +
Sbjct: 81 GIVQSPTYIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGCSSSV--LFDPSKSSSSRT 138
Query: 185 IPCNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGY 243
+ C + CK PN C+ SK C +++ Y GS + D +T+
Sbjct: 139 LQCEAPQCKQ-----APN--PSCTVSKSCGFNMTY-GGSAIEAYLTQDTLTL----ATDV 186
Query: 244 FARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS--TG 298
Y F GC + +G A G+MGL RGP+S+IS++ Y F YCL + S +G
Sbjct: 187 IPNYTF--GCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSG 244
Query: 299 YITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYF---TKLS 353
+ G + + +K TP++ P +S Y++ L GI VG + +P A F T
Sbjct: 245 SLRLGPKNQPIR--IKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAG 302
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITI 413
T DSGT+ TR P Y A+R+ FR+R+K FDTCY S VV P +T
Sbjct: 303 TIFDSGTVYTRLVEPAYVAMRNEFRRRVKNANATS--LGGFDTCYSGS----VVFPSVTF 356
Query: 414 HFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRGYEVHYDVAGR 470
F G+++ L L+ S + CL A P++ NS+L + ++QQ+ + V DV
Sbjct: 357 MF-AGMNVTLPPDNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNS 415
Query: 471 RLGFGPGNC 479
RLG C
Sbjct: 416 RLGISRETC 424
>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
Length = 450
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 113/356 (31%), Positives = 172/356 (48%), Gaps = 49/356 (13%)
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQ 204
+++++DTGS +TW QCKPC C QRDP FDPS S +++ +PCN++ C+ L+
Sbjct: 122 LTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKA-ATGVP 180
Query: 205 DKCS----------SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
C+ S+ C Y +AY DGS G ATD + + + +G F+ GC
Sbjct: 181 GSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDG------FVFGCG 234
Query: 255 DNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST----GYITFGKPDTV-- 308
+N G R P S S SP G++ G ++ G +
Sbjct: 235 LSNRG-----------LRRPGSAASSPTA--------SPPGTSGDAAGSLSLGGDTSSYR 275
Query: 309 NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAP 368
N V YT ++ P Q FY + +TG SV + A+ + +DSGT+ITR
Sbjct: 276 NATPVSYTRMIADPAQPPFYFMNVTGASV--GGAAVAAAGLGAANVLLDSGTVITRLAPS 333
Query: 369 VYSALRSAFRKRM--KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVR 426
VY A+R+ F ++ ++Y L D CY+L+ + V VP +T+ G D+ +D
Sbjct: 334 VYRAVRAEFARQFGAERYPAAPPFS-LLDACYNLTGHDEVKVPLLTLRLEAGADMTVDAA 392
Query: 427 GTLVV--ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G L + + QVCL A L + + ++GN QQ+ V YD G RLGF +C+
Sbjct: 393 GMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 448
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 123/374 (32%), Positives = 172/374 (45%), Gaps = 35/374 (9%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
V EY + +AIG P Q V L LDTGS + WTQCKPC+ C Q P+FD S+S T + +P
Sbjct: 30 VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNALLP 89
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
C ST CK L + + + C Y +Y D S G A D+ T V G
Sbjct: 90 CESTQCK-LDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTF--VAGTSLPG- 145
Query: 247 YPFLLGCTDNNTGDQN-GASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKP 305
GC NNTG N +GI G RGP+S+ S+ + F +C + IT P
Sbjct: 146 --VTFGCGLNNTGVFNSNETGIAGFGRGPLSLPSQLKVGNFSHCFTT-------ITGAIP 196
Query: 306 DTV-----------NKKFVKYTPIVTTPEQSE---FYHITLTGISVGGERLPLKASYFTK 351
TV + V+ TP++ + Y+++L GI+VG RLP+ S F
Sbjct: 197 STVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL 256
Query: 352 LS----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV 407
+ T IDSGT IT P VY +R F ++ K + G TC+ +
Sbjct: 257 TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI-KLPVVPGNATGHYTCFSAPSQAKPD 315
Query: 408 VPKITIHFLGG-VDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYD 466
VPK+ +HF G +DL + V + + A+ D +I +GN QQ+ V YD
Sbjct: 316 VPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTI-IGNFQQQNMHVLYD 374
Query: 467 VAGRRLGFGPGNCN 480
+ L F C+
Sbjct: 375 LQNNMLSFVAAQCD 388
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 126/429 (29%), Positives = 195/429 (45%), Gaps = 45/429 (10%)
Query: 63 EVLGRYGPCSKLNQGKSRNTPS--LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTF 120
E++ R P S L S+ T L + R ++R +L K I + + F+
Sbjct: 21 ELIHREHPSSPLRSNTSKTTTEIFLAAVKRGAERR------AQLSKHI---LAEGRLFST 71
Query: 121 PAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSK 180
P +G EY I ++ G P Q S+++DTGS + WTQC PC C+ FDP KS
Sbjct: 72 PVASG---NGEYLIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSS 128
Query: 181 TFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNG 240
T+ + C S C L + + C YD Y DGS +G +T+ +T+
Sbjct: 129 TYDTVSCASNFCSSL--------PFQSCTTSCKYDYMYGDGSSTSGALSTETVTVGT--- 177
Query: 241 NGYFARYP-FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTN---ISYFFYCLHSPYGS 296
P GC N G GA+GI+GL +GP+S+IS+ + F YCL P GS
Sbjct: 178 ----GTIPNVAFGCGHTNLGSFAGAAGIVGLGQGPLSLISQASSITSKKFSYCL-VPLGS 232
Query: 297 TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE- 355
T D+ V YT ++T FY+ LTGISV G+ + F+ ++
Sbjct: 233 TKTSPMLIGDSAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQ 292
Query: 356 ----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
+DSGT +T ++AL +A + + + G D C+ + P +
Sbjct: 293 GGFILDSGTTLTYLETGAFNALVAALKAEV-PFPEADGSLYGLDYCFSTAGVANPTYPTM 351
Query: 412 TIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR 470
T HF G D EL V +++ +CL A + ++GN+QQ+ + + +D+ +
Sbjct: 352 TFHF-KGADYELPPENVFVALDTGGSICLAMA---ASTGFSIMGNIQQQNHLIVHDLVNQ 407
Query: 471 RLGFGPGNC 479
R+GF NC
Sbjct: 408 RVGFKEANC 416
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 178/366 (48%), Gaps = 19/366 (5%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
V + EY + V +G P + +++DTGS + W QC PC+ C +Q P FDP+ S ++ +
Sbjct: 144 VGSGEYLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFDPAASISYRNVT 203
Query: 187 CNSTTCKILLEWFPP--NGQDKC---SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGN 241
C C+++ PP + +C S CPY Y D S TG A + T+ + +
Sbjct: 204 CGDDRCRLVS---PPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVN-LTQS 259
Query: 242 GYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY----FFYCLHSPYGST 297
G GC N G +GA+G++GL RGP+S S+ Y F YCL +
Sbjct: 260 GTRRVDGVAFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRGVYGGHAFSYCLVEHGSAA 319
Query: 298 GY-ITFGKPDT-VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE 355
G I FG D + + YT T + FY++ L I VGGE + + + + T
Sbjct: 320 GSKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLSAGGTI 379
Query: 356 IDSGTIITRFPAPVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
IDSGT ++ FP P Y A+R AF RM Y + G + CY++S + V VP++++
Sbjct: 380 IDSGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFP-VLSPCYNVSGAEKVEVPELSLV 438
Query: 415 FLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
F G E + +E +CL P SI +GN QQ+ + V YD+ RLG
Sbjct: 439 FADGAAWEFPAENYFIRLEPEGIMCLAVLGTPRSGMSI-IGNYQQQNFHVLYDLEHNRLG 497
Query: 474 FGPGNC 479
F P C
Sbjct: 498 FAPRRC 503
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 124/376 (32%), Positives = 169/376 (44%), Gaps = 42/376 (11%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
V EY + +AIG P Q V L LDTGS + WTQC+PC C Q P+FDPS S T S
Sbjct: 77 VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTS 136
Query: 187 CNSTTCKIL------LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNG 240
C+ST C+ L F PN + C Y +Y D S TGF D+ T V
Sbjct: 137 CDSTLCQGLPVASCGSPKFWPN-------QTCVYTYSYGDKSVTTGFLEVDKFTF--VGA 187
Query: 241 NGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYI 300
F G +N N +GI G RGP+S+ S+ + F +C + G
Sbjct: 188 GASVPGVAFGCGLFNNGVFKSN-ETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGL---- 242
Query: 301 TFGKPDTV-----------NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF 349
KP TV + V+ TP++ P FY+++L GI+VG RLP+ S F
Sbjct: 243 ---KPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEF 299
Query: 350 TKLS----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT 405
+ T IDSGT +T P VY +R AF ++K + D + C
Sbjct: 300 ALKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPY-FCLSAPLRAK 358
Query: 406 VVVPKITIHFLGG-VDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVH 464
VPK+ +HF G +DL + VE L A++ +GN QQ+ V
Sbjct: 359 PYVPKLVLHFEGATMDLPRE-NYVFEVEDAGSSILCLAIIEGG-EVTTIGNFQQQNMHVL 416
Query: 465 YDVAGRRLGFGPGNCN 480
YD+ +L F P C+
Sbjct: 417 YDLQNSKLSFVPAQCD 432
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 118/359 (32%), Positives = 180/359 (50%), Gaps = 28/359 (7%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY++ + +G P + +++D+GS I W QC+PC C QQ DP FDP+ S T++ I C+S+
Sbjct: 136 EYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGISCDSS 195
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C L C+ C Y+++Y DGS G A + +T G
Sbjct: 196 VCDRL-------DNAGCNDGRCRYEVSYGDGSYTRGTLALETLTF------GRVLIRNIA 242
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISK---TNISYFFYCLHS-PYGSTGYITFGKPD 306
+GC N G GA+G++GL G +S + + F YCL S STG + FG+
Sbjct: 243 IGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGR-- 300
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF--TKL---STEIDSGTI 361
+ P++ P FY++ L+G+ VGG R+P+ F T L +D+GT
Sbjct: 301 GAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTA 360
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
+TR PAP Y A R F + + +FDTCY+L+ + +V VP ++ +F GG L
Sbjct: 361 VTRLPAPAYEAFRDTFIGQTANLPRSDRVS-IFDTCYNLNGFVSVRVPTVSFYFSGGPIL 419
Query: 422 ELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L R L+ V+ C FA S + ++GN+QQ G ++ D + +GFGP C
Sbjct: 420 TLPARNFLIPVDGEGTFCFAFAASASGLS--IIGNIQQEGIQISIDGSNGFVGFGPTIC 476
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 130/417 (31%), Positives = 190/417 (45%), Gaps = 41/417 (9%)
Query: 87 EILRRDQQRLHLKNSRRLQ-KAIPDNFKKTKAFTFPAKTGIVAA------DEYYIVVAIG 139
E++ RD + + N + D +++ + T V A EY + +++G
Sbjct: 33 ELIHRDSPKSPMYNPLENHYHRVADTLRRSISHNTGLVTNTVEAPIYNNRGEYLMKLSVG 92
Query: 140 KPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWF 199
P + + DTGS I WTQC+PC +C QQ P F+PSKS T+ K+ C+S C E
Sbjct: 93 TPPFPIIAVADTGSDIIWTQCEPCTNCYQQDLPMFNPSKSTTYRKVSCSSPVCSFTGE-- 150
Query: 200 PPNGQDKCSSK-ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNN 257
+ CS K +C Y I+Y D S G +A D +T+ +G +P +GC +N
Sbjct: 151 ----DNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGR--VVAFPRTAIGCGHDN 204
Query: 258 TG--DQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS----TGYITFGKPDTV 308
G D N SGI+GL GP S+I + + F YCL +P G+ + + FG V
Sbjct: 205 AGSFDAN-VSGIVGLGLGPASLIKQMGSAVGGKFSYCL-TPIGNDDGGSNKLNFGSNANV 262
Query: 309 NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT---KLSTEIDSGTIITRF 365
+ TPI + + FY + L +SVG + K + IDSGT +T
Sbjct: 263 SGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIIIDSGTTLTLL 322
Query: 366 PAPVYSALRSAFRKRMKKYKM---GKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
P +Y A + + + +E F+T D YK VP I +HF G +L
Sbjct: 323 PVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTTD--DYK---VPFIAMHF-EGANLR 376
Query: 423 LDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L L+ S +CL FA + SI GN+ Q + V YDV L F P NC
Sbjct: 377 LQRENVLIRVSDNVICLAFAGAQDNDISI-YGNIAQINFLVGYDVTNMSLSFKPMNC 432
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 120/372 (32%), Positives = 181/372 (48%), Gaps = 21/372 (5%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
+ + EY++ V +G P ++ SL+LDTGS + W QC PC C Q + F+DP S +F I
Sbjct: 157 LGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFYDPKTSASFKNIT 216
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
CN C ++ PP Q K ++ CPY Y D S TG +A + T+ G +
Sbjct: 217 CNDPRCSLISSPEPP-VQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSE 275
Query: 247 YP---FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGST 297
Y + GC N G +GASG++GL RGP+S S+ Y F YCL +S +
Sbjct: 276 YKVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVS 335
Query: 298 GYITFGK-PDTVNKKFVKYTPIVTTPEQS--EFYHITLTGISVGGERLPLKASYFT---- 350
+ FG+ D +N + +T V E S FY+I + I VGGE L + +
Sbjct: 336 SKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPD 395
Query: 351 -KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYK--TVV 407
T IDSGT ++ F P Y +++ F ++MK+ + + D C+++S + +
Sbjct: 396 GAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEENNIH 455
Query: 408 VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
+P++ I F G + + S VCL P SI +GN QQ+ + + YD
Sbjct: 456 LPELGIAFADGAVWNFPAENSFIWLSEDLVCLAILGTPKSTFSI-IGNYQQQNFHILYDT 514
Query: 468 AGRRLGFGPGNC 479
RLGF P C
Sbjct: 515 KMSRLGFTPTKC 526
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 125/418 (29%), Positives = 193/418 (46%), Gaps = 42/418 (10%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKT--GIVAADEYYIVVAIGKPK 142
+ + LRRD +H + SR L ++ T A+T + EY + ++IG P
Sbjct: 49 VRDALRRD---MHRQQSRSL---FGRELAESDGTTVSARTRKDLPNGGEYLMTLSIGTPP 102
Query: 143 QYVSLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPCNST---TCKILLE 197
+ DTGS + WTQC PC C Q P ++P+ S TF +PCNS+ +L
Sbjct: 103 LSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCNSSLSMCAGVLAG 162
Query: 198 WFPPNGQDKCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYP-FLLGCTD 255
PP G C+ C Y+ Y G+G T G ++ T + AR P GC++
Sbjct: 163 KAPPPG---CA---CMYNQTY--GTGWTAGVQGSETFTFGSAAADQ--ARVPGIAFGCSN 212
Query: 256 NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY---GSTGYITFGKPDTVNKKF 312
++ D NG++G++GL RG +S++S+ F YCL +P+ ST + G +N
Sbjct: 213 ASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCL-TPFQDTNSTSTLLLGPSAALNGTG 271
Query: 313 VKYTPIVTTPEQ---SEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITR 364
V+ TP V +P + S +Y++ LTGIS+G + L + F+ + IDSGT IT
Sbjct: 272 VRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSGTTITS 331
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV--VVPKITIHFLGGVDLE 422
Y +R+A + + + D CY L + +P +T+HF G D+
Sbjct: 332 LVNAAYQQVRAAVQSLVTLPAIDGSDSTGLDLCYALPTPTSAPPAMPSMTLHF-DGADMV 390
Query: 423 LDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L ++ S CL +D GN QQ+ + YDV L F P C+
Sbjct: 391 LPADSYMISGS-GVWCLAMR-NQTDGAMSTFGNYQQQNMHILYDVRNEMLSFAPAKCS 446
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 129/415 (31%), Positives = 178/415 (42%), Gaps = 40/415 (9%)
Query: 86 EEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYV 145
E+L R RL S R A D P G V EY + +AIG P Q V
Sbjct: 378 REVLHRMAARLLFSASGRAASARVD--------PGPYANG-VPDTEYLVHLAIGTPPQPV 428
Query: 146 SLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
L+LDTGS + WTQC+PC C + DPS S TF +PC+S C L W G+
Sbjct: 429 QLILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSSPVCDN-LTW-SSCGKH 486
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-QNGA 264
++ C Y AY DGS TG + T +G G GC N G +
Sbjct: 487 NWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFGCGLFNNGIFTSNE 546
Query: 265 SGIMGLDRGPVSIISKTNISYFFYCLHSPYGST-GYITFGKPDTVNKK---FVKYTPIVT 320
+GI G RG +S+ S+ + F +C + GS + G P + V+ TP+V
Sbjct: 547 TGIAGFGRGALSLPSQLKVDNFSHCFTAITGSEPSSVLLGLPANLYSDADGAVQSTPLVQ 606
Query: 321 TPEQSEFYHITLTGISVGGERLPLKASYFTKLS-----TEIDSGTIITRFPAPVYSALRS 375
Y+++L GI+VG RLP+ S F T IDSGT +T P Y +
Sbjct: 607 NFSSLRAYYLSLKGITVGSTRLPIPESTFALKQDGTGGTIIDSGTGMTTLPQDAYKLVHD 666
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV--VPKITIHFLGG-VDL-------ELDV 425
AF +++ L C+ S + VPK+ +HF G +DL E +
Sbjct: 667 AFTAQVRLPVDNATSSSLSRLCFSFSVPRRAKPDVPKLVLHFEGATLDLPRENYMFEFED 726
Query: 426 RGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G V CL + + + ++GN QQ+ V YD+ L F P CN
Sbjct: 727 AGGSV------TCLA---INAGDDLTIIGNYQQQNLHVLYDLVRNMLSFVPAQCN 772
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 125/418 (29%), Positives = 195/418 (46%), Gaps = 34/418 (8%)
Query: 85 LEEILRRDQQRLHLKN-SRRLQKAIPDNFKKTKAFTFPAKT--GIVAADEYYIVVAIGKP 141
+ + LRRD R ++ R + + ++ +T T A+T + EY + +AIG P
Sbjct: 65 VRDALRRDMHRQRSRSFGRDRDRELAESDGRTST-TVSARTRKDLPNGGEYLMTLAIGTP 123
Query: 142 KQYVSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFP 200
+ + DTGS + WTQC PC C +Q P ++P+ S TFS +PCNS+
Sbjct: 124 PLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNSSLSMCAGALAG 183
Query: 201 PNGQDKCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYP-FLLGCTDNNT 258
C+ C Y Y G+G T G ++ T + AR P GC++ ++
Sbjct: 184 AAPPPGCA---CMYYQTY--GTGWTAGVQGSETFTFGSSAADQ--ARVPGVAFGCSNASS 236
Query: 259 GDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY---GSTGYITFGKPDTVNKKFVKY 315
D NG++G++GL RG +S++S+ F YCL +P+ ST + G +N V+
Sbjct: 237 SDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCL-TPFQDTNSTSTLLLGPSAALNGTGVRS 295
Query: 316 TPIVTTPEQ---SEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFPA 367
TP V +P + S +Y++ LTGIS+G + LP+ F+ IDSGT IT
Sbjct: 296 TPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTITSLAN 355
Query: 368 PVYSALRSAFRKRMKKYKMGKGIEDL--FDTCYDLSAYKT---VVVPKITIHFLGGVDLE 422
Y +R+A + ++ D D C+ L A + V+P +T+HF G D+
Sbjct: 356 AAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHF-DGADMV 414
Query: 423 LDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L ++ S CL +D GN QQ+ + YDV L F P C+
Sbjct: 415 LPADSYMISGS-GVWCLAMR-NQTDGAMSTFGNYQQQNMHILYDVREETLSFAPAKCS 470
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 122/401 (30%), Positives = 179/401 (44%), Gaps = 34/401 (8%)
Query: 110 DNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHC-SQ 168
D + + T A GIV +EY + +++G P + V+L LDTGS + WTQC PC++C Q
Sbjct: 73 DRPVRARVRTAGAGGGIVT-NEYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQ 131
Query: 169 QRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFW 228
P DP+ S T + + C++ C+ L G + C Y Y D S G
Sbjct: 132 GAIPVLDPAASSTHAAVRCDAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKL 191
Query: 229 ATDRMTI---QEVNGNGYFARYPFLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNIS 284
A+DR T +G G R GC N G Q +GI G RG S+ S+ ++
Sbjct: 192 ASDRFTFGPGDNADGGGVSERR-LTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVT 250
Query: 285 YFFYCLHSPYGST-GYITFG-KPDTVN-KKFVKYTPIVTTPEQSEFYHITLTGISVGGER 341
F YC S + ST +T G P ++ V+ TP++ P Q Y ++L I+VG R
Sbjct: 251 SFSYCFTSMFESTSSLVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATR 310
Query: 342 LPL--KASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYD 399
+P+ + + S IDSG IT P VY A+++ F ++ + D C+
Sbjct: 311 IPIPERRQRLREASAIIDSGASITTLPEDVYEAVKAEFVAQV-GLPVSAVEGSALDLCFA 369
Query: 400 LSAYKT-----------------VVVPKITIHFLGGVDLELDVRGTLVVE--SVRQVCLG 440
L + V VP++ H GG D EL R V E R +CL
Sbjct: 370 LPSAAAPKSAFGWRWRGRGRAMPVRVPRLVFHLGGGADWELP-RENYVFEDYGARVMCLV 428
Query: 441 F-ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
A ++++GN QQ+ V YD+ L F P C
Sbjct: 429 LDAATGGGDQTVVIGNYQQQNTHVVYDLENDVLSFAPARCE 469
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 132/421 (31%), Positives = 197/421 (46%), Gaps = 49/421 (11%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQ 143
+ + LRRD +H N+R+L + + T A T I A EY + +AIG P
Sbjct: 47 VRDALRRD---MHRHNARQLAAS------SSNGTTVSAPTQISPTAGEYLMTLAIGTPPV 97
Query: 144 YVSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTFSKIPCNS--TTCKILLE-WF 199
+ DTGS + WTQC PC C QQ P ++PS S TF+ +PCNS + C L
Sbjct: 98 SYQAIADTGSDLIWTQCAPCSSQCFQQPTPLYNPSSSTTFAVLPCNSSLSMCAAALAGTT 157
Query: 200 PPNGQDKCSSKECPYDIAYVDGSGETGFW-ATDRMTIQEVNGNGYFARYPFLLGCTDNNT 258
PP G C+ C Y++ Y GSG T + ++ T GC++ +
Sbjct: 158 PPPG---CT---CMYNMTY--GSGWTSVYQGSETFTFGSSTPANQTGVPGIAFGCSNASG 209
Query: 259 G-DQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY---GSTGYITFGKPDTVNKK-FV 313
G + + ASG++GL RG +S++S+ + F YCL +PY ST + G ++N V
Sbjct: 210 GFNTSSASGLVGLGRGSLSLVSQLGVPKFSYCL-TPYQDTNSTSTLLLGPSASLNDTGGV 268
Query: 314 KYTPIVTTPE---QSEFYHITLTGISVGGERLPLKASYFTKLSTE--------IDSGTII 362
TP V +P S +Y++ LTGIS+G L + T LS + IDSGT I
Sbjct: 269 SSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPT---TALSLKADGTGGFIIDSGTTI 325
Query: 363 TRFPAPVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYDLSAYKTV--VVPKITIHFLGGV 419
T Y +R+A + G D C++L + + +P +T+HF G
Sbjct: 326 TLLGNTAYQQVRAAVVSLVTLPTTDGGSAATGLDLCFELPSSTSAPPTMPSMTLHF-DGA 384
Query: 420 DLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
D+ L ++++S CL +D +LGN QQ+ + YDV L F P C
Sbjct: 385 DMVLPADSYMMLDS-NLWCLAMQ-NQTDGGVSILGNYQQQNMHILYDVGQETLTFAPAKC 442
Query: 480 N 480
+
Sbjct: 443 S 443
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 115/364 (31%), Positives = 181/364 (49%), Gaps = 33/364 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 82 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 139
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 140 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPGFS 191
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ ++ F YCL S G +TGY
Sbjct: 192 FGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGY 251
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + LT ISV GERL L S F++ DSG
Sbjct: 252 FSLGKVAT--RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVFDSG 309
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + K G E+ CYD+ + +P I++HF G
Sbjct: 310 SELSYIPDRALSVLSQRIRELL--LKRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 367
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
+L G V SV++ CL FA P++ SI +G++ Q EV YD+ + +G GP
Sbjct: 368 RFDLGSHGVFVERSVQEQDVWCLAFA--PTESVSI-IGSLMQTSKEVVYDLKRQLIGIGP 424
Query: 477 -GNC 479
G C
Sbjct: 425 SGAC 428
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 125/432 (28%), Positives = 192/432 (44%), Gaps = 41/432 (9%)
Query: 62 LEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFP 121
L V+ Y CS P +E + K+ RL+ +KT A
Sbjct: 35 LSVIPIYSKCSPF-------VPPKQESWVNTVITMASKDPERLKYLSTLADQKTTAVPIA 87
Query: 122 AKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKT 181
++ Y + V +G P Q + ++LDT + W PC C+ F P+ S T
Sbjct: 88 PGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWV---PCSGCTGFSSTTFLPNASTT 144
Query: 182 FSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGN 241
+ C+ C + + P S C ++ +Y S T D +T+
Sbjct: 145 LGSLDCSGAQCSQVRGFSCP----ATGSSACLFNQSYGGDSSLTATLVQDAITLANDVIP 200
Query: 242 GYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS--PYGS 296
G F GC + +G G++GL RGP+S+IS+ Y F YCL S Y
Sbjct: 201 G------FTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYF 254
Query: 297 TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-----TK 351
+G + G K ++ TP++ P + Y++ LTG+SVG ++P+ + T
Sbjct: 255 SGSLKLGP--VGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTG 312
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
T IDSGT+ITRF PVY A+R FRK++ G FDTC+ +A P I
Sbjct: 313 AGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSLGA---FDTCF--AATNEAEAPAI 367
Query: 412 TIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRGYEVHYDVA 468
T+HF G++L L + +L+ S + CL A P++ NS+L + N+QQ+ + +D
Sbjct: 368 TLHF-EGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTT 426
Query: 469 GRRLGFGPGNCN 480
RLG CN
Sbjct: 427 NSRLGIARELCN 438
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 130/417 (31%), Positives = 189/417 (45%), Gaps = 41/417 (9%)
Query: 87 EILRRDQQRLHLKNSRRLQ-KAIPDNFKKTKAFTFPAKTGIVAA------DEYYIVVAIG 139
E++ RD + + N + D +++ + T V A EY + +++G
Sbjct: 33 ELIHRDSPKSPMYNPLENHYHRVADTLRRSISHNTGLVTNTVEAPIYNNRGEYLMKLSVG 92
Query: 140 KPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWF 199
P + + DTGS I WTQC PC +C QQ P F+PSKS T+ K+ C+S C E
Sbjct: 93 TPPFPIIAVADTGSDIIWTQCVPCTNCYQQDLPMFNPSKSTTYRKVSCSSPVCSFTGE-- 150
Query: 200 PPNGQDKCSSK-ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNN 257
+ CS K +C Y I+Y D S G +A D +T+ +G +P +GC +N
Sbjct: 151 ----DNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGR--VVAFPRTAIGCGHDN 204
Query: 258 TG--DQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS----TGYITFGKPDTV 308
G D N SGI+GL GP S+I + + F YCL +P G+ + + FG V
Sbjct: 205 AGSFDAN-VSGIVGLGLGPASLIKQMGSAVGGKFSYCL-TPIGNDDGGSNKLNFGSNANV 262
Query: 309 NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT---KLSTEIDSGTIITRF 365
+ TPI + + FY + L +SVG + K + IDSGT +T
Sbjct: 263 SGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIIIDSGTTLTLL 322
Query: 366 PAPVYSALRSAFRKRMKKYKM---GKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
P +Y A + + + +E F+T D YK VP I +HF G +L
Sbjct: 323 PVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTTD--DYK---VPFIAMHF-EGANLR 376
Query: 423 LDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L L+ S +CL FA + SI GN+ Q + V YDV L F P NC
Sbjct: 377 LQRENVLIRVSDNVICLAFAGAQDNDISI-YGNIAQINFLVGYDVTNMSLSFKPMNC 432
>gi|54290725|dbj|BAD62395.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 500
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 133/479 (27%), Positives = 208/479 (43%), Gaps = 70/479 (14%)
Query: 42 PPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNS 101
PP C+ + G L VL R PCS LN G ++T S ++ R +RL
Sbjct: 51 PPVSCSPIPSGASNG---KKLPVLHRLNPCSPLNAGGKQSTTSSVDVSHRAGRRL----- 102
Query: 102 RRLQKAIPDN---------FKKTKAFTFPA----KTGIVAADEYYIVVAIGKPKQYVSLL 148
R L A+ + T P + G +Y +VV G P Q +++
Sbjct: 103 RSLFAAVQSGDDAAPAPAPAAASGGVTIPTTGTPEPGAPGFHDYTVVVGYGTPAQQLAMA 162
Query: 149 LDTGSGITWTQCKPC---IHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
DTG GI+ +C C C FDPS+S TF+ +PC S C+ +G
Sbjct: 163 FDTGLGISLVRCAACRPGAPCDGLAS--FDPSRSSTFAPVPCGSPDCR--------SGCS 212
Query: 206 KCSSKECPY-DIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGA 264
S+ CP ++ G+ A D +T+ + F GC + ++G+ GA
Sbjct: 213 SGSTPSCPLTSFPFLSGA-----VAQDVLTLTPSA-----SVDDFTFGCVEGSSGEPLGA 262
Query: 265 SGIMGLDRGPVSIISKTNISY---FFYCLH-SPYGSTGYITFGKPDTVNKKFVKYT---P 317
+G++ L R S+ S+ F YCL S S G++ G+ D + + + T P
Sbjct: 263 AGLLDLSRDSRSVASRLAADAGGTFSYCLPLSTTSSHGFLAIGEADVPHNRTARVTAVAP 322
Query: 318 IVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEI-DSGTIITRFPAPVYSALRSA 376
+V P Y I L G+S+GG +P+ T + + D+ T +Y+ LR A
Sbjct: 323 LVYDPAFPNHYVIDLAGVSLGGRDIPIPPHAATASAAMVLDTALPYTYMKPSMYAPLRDA 382
Query: 377 FRKRMKKYKMGKGIEDLFDTCYDLSAYK-TVVVPKITIHFLGGVDLELDVRGTLVVESV- 434
FR+ M +Y + DL DTCY+ + + V++P + + F G L + +
Sbjct: 383 FRRAMARYPRAPAMGDL-DTCYNFTGVRHEVLIPLVHLTFRGIGGGGGGQVLGLGADQMF 441
Query: 435 ---------RQVCLGFALLPSD-----PNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL FA LPSD P ++++G + Q EV +DV G ++GF PG+C
Sbjct: 442 YMSEPGNFFSVTCLAFAALPSDGDAEAPLAMVMGTLAQSSMEVVHDVPGGKIGFIPGSC 500
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 129/416 (31%), Positives = 194/416 (46%), Gaps = 46/416 (11%)
Query: 89 LRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLL 148
LRRD +H N+R+L A + A T + T A EY + +AIG P +
Sbjct: 57 LRRD---MHRHNARKLALAA-SSGATVSAPTQDSPT----AGEYLMALAIGTPPLPYQAI 108
Query: 149 LDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNST-----TCKILLEWFPPN 202
DTGS + WTQC PC C +Q P ++PS S TF+ +PCNS+ PP
Sbjct: 109 ADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPP 168
Query: 203 GQDKCSSKECPYDIAYVDGSGETG-FWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTG- 259
G C+ C Y++ Y GSG T F ++ T AR P GC+ ++G
Sbjct: 169 G---CA---CTYNVTY--GSGWTSVFQGSETFTFGSTPAG--HARVPGIAFGCSTASSGF 218
Query: 260 DQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY---GSTGYITFGKPDTVNKKF-VKY 315
+ + ASG++GL RG +S++S+ + F YCL +PY ST + G ++N V
Sbjct: 219 NASSASGLVGLGRGRLSLVSQLGVPKFSYCL-TPYQDTNSTSTLLLGPSASLNGTAGVSS 277
Query: 316 TPIVTTPEQS---EFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITRFP 366
TP V +P + FY++ LTGIS+G L + F+ L+ + IDSGT IT
Sbjct: 278 TPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFS-LNADGTGGLIIDSGTTITLLG 336
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV--VVPKITIHFLGGVDLELD 424
Y +R+A + + D C+ L + + +P +T+HF G D+ L
Sbjct: 337 NTAYQQVRAAVVSLVTLPTTDGSADTGLDLCFMLPSSTSAPPAMPSMTLHF-NGADMVLP 395
Query: 425 VRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
++ + CL +D +LGN QQ+ + YD+ L F P C+
Sbjct: 396 ADSYMMSDDSGLWCLAMQNQ-TDGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 450
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 154 bits (390), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 119/405 (29%), Positives = 192/405 (47%), Gaps = 47/405 (11%)
Query: 98 LKNSRRLQKAIPDNFKKTKAFTFPAKT-GIVA--------ADEYYIVVAIGKPKQYVSLL 148
L + RL A + ++ A A T G V + EY + V+IG P +
Sbjct: 49 LSHYDRLANAFRRSLSRSAALLNRAATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGI 108
Query: 149 LDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCS 208
DTGS +TW QC PC+ C QQ P F+P KS +FS +PCN+ TC + + C
Sbjct: 109 ADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDD-------GHCG 161
Query: 209 SKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGI 267
+ C Y Y D + G +++TI + ++GC ++G ASG+
Sbjct: 162 VQGVCDYSYTYGDRTYSKGDLGFEKITIGS-------SSVKSVIGCGHASSGGFGFASGV 214
Query: 268 MGLDRGPVSIISKTNISY-----FFYCLHSPYG-STGYITFGKPDTVNKKFVKYTPIVTT 321
+GL G +S++S+ + + F YCL + + G I FG+ V+ V TP++ +
Sbjct: 215 IGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGENAVVSGPGVVSTPLI-S 273
Query: 322 PEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRM 381
+Y+ITL IS+G ER ++ + + IDSGT +T P +Y + S+ K +
Sbjct: 274 KNTVTYYYITLEAISIGNER---HMAFAKQGNVIIDSGTTLTILPKELYDGVVSSLLKVV 330
Query: 382 KKYKMGKGIEDLFDTCYD--LSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-- 437
K ++ K D C+D ++A ++ +P IT HF GG ++ L L + + R+V
Sbjct: 331 KAKRV-KDPHGSLDLCFDDGINAAASLGIPVITAHFSGGANVNL-----LPINTFRKVAD 384
Query: 438 ---CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL ++GN+ Q + + YD+ +RL F P C
Sbjct: 385 NVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEAKRLSFKPTVC 429
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 154 bits (390), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 120/378 (31%), Positives = 177/378 (46%), Gaps = 29/378 (7%)
Query: 115 TKAFTF----PAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQR 170
TK F+ P T EY I ++G P V +DTGS I W QC+PC C Q
Sbjct: 68 TKEFSLNKNQPVSTLTPELGEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQT 127
Query: 171 DPFFDPSKSKTFSKIPCNSTTCKILLEWFP--PNGQDKCSSKECPYDIAYVDGSGETGFW 228
P F+PSKS ++ IPC S+TCK + NG D C Y I Y + G
Sbjct: 128 SPIFNPSKSSSYKNIPCTSSTCKDTNDTHISCSNGGDVCE-----YSITYGGDAKSQGDL 182
Query: 229 ATDRMTIQEVNGNGYFARYP-FLLGCTDNNT-GDQNGASGIMGLDRGPVSIISKTNISY- 285
+ D +T+ +G+ +P ++GC N D + +SG++G+ RGP+S+I + S
Sbjct: 183 SNDSLTLDSTSGSSVL--FPNIVIGCGHINVLQDNSQSSGVVGMGRGPMSLIKQVGSSSV 240
Query: 286 ---FFYCL---HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGG 339
F YCL +S S+ + FG+ V+ + V TP+V Q +Y +TL SVG
Sbjct: 241 GSKFSYCLIPYNSDSNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGN 300
Query: 340 ERLPL-KASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCY 398
R+ + S + + IDSGT +T P S L S + +K ++ L CY
Sbjct: 301 NRIEYGERSNASTQNILIDSGTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHL-SLCY 359
Query: 399 DLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQ 458
+ + K + VP IT HF G D++L+ GT +C GF S + GN+ Q
Sbjct: 360 NTTG-KQLNVPDITAHF-NGADVKLNSNGTFFPFEDGIMCFGFI---SSNGLEIFGNIAQ 414
Query: 459 RGYEVHYDVAGRRLGFGP 476
+ YD+ + F P
Sbjct: 415 NNLLIDYDLEKEIISFKP 432
>gi|413936472|gb|AFW71023.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
Length = 289
Score = 154 bits (390), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 99/274 (36%), Positives = 145/274 (52%), Gaps = 19/274 (6%)
Query: 208 SSKECPYDIAYVDGSGETGFWATDRMTIQ--EVNGNGYFARYPFLLGCTDNNTGDQNGAS 265
S K+C + I+Y DG+ G ++ D++T+ + N YF GC +
Sbjct: 33 SGKQCGFAISYADGTSTVGAYSQDKLTLAPGAIVQNFYF-------GCGHGKHAVRGLFD 85
Query: 266 GIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQS 325
G++GL R S+ ++ F YCL S G++ G N +TP+ T P Q
Sbjct: 86 GVLGLGRLRESLGARYG-GVFSYCLPSVSSKPGFLALGAGK--NPSGFVFTPMGTVPGQP 142
Query: 326 EFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYK 385
F +TL GI+VGG++L L+ S F+ +DSGT+IT + Y ALRSAFRK M+ Y+
Sbjct: 143 TFSTVTLAGINVGGKKLDLRPSAFSG-GMIVDSGTVITGLQSTAYRALRSAFRKAMEAYR 201
Query: 386 MGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLP 445
+ + DTCY+L+ YK VVVPKI + F GG + LDV ++V CL FA
Sbjct: 202 LLPNGD--LDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGILVNG----CLAFAESG 255
Query: 446 SDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
D ++ +LGNV QR +EV +D + + GF C
Sbjct: 256 PDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 289
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 154 bits (390), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 129/410 (31%), Positives = 184/410 (44%), Gaps = 37/410 (9%)
Query: 86 EEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI----VAADEYYIVVAIGKP 141
E++RR R + R L + + T P G V EY + +AIG P
Sbjct: 51 RELMRRMALRSKARAPRLL----------SSSATAPVSPGAYDDGVPMTEYLLHLAIGTP 100
Query: 142 KQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPP 201
Q V L LDTGS + WTQC+PC C Q P++D S+S TF+ C+ST CK+
Sbjct: 101 PQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVTMC 160
Query: 202 NGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTGD 260
Q + + C Y +Y D S GF D T+ V G A P + GC NNTG
Sbjct: 161 VNQ---TVQTCAYSYSYGDKSATIGFL--DVETVSFVAG----ASVPGVVFGCGLNNTGI 211
Query: 261 -QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYG-STGYITFGKPDTVNKK---FVKY 315
++ +GI G RGP+S+ S+ + F +C + G + F P + K V+
Sbjct: 212 FRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQT 271
Query: 316 TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS----TEIDSGTIITRFPAPVYS 371
TP++ P FY+++L GI+VG RLP+ S F + T IDSGT T P VY
Sbjct: 272 TPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYR 331
Query: 372 ALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAY-KTVVVPKITIHFLGGVDLELDVRGTLV 430
+ F + K + E C+ K VPK+ +HF G + L R V
Sbjct: 332 LVHDEFAAHV-KLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFEGAT-MHLP-RENYV 388
Query: 431 VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
E+ L + ++GN QQ+ V YD+ +L F C+
Sbjct: 389 FEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKCD 438
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 126/415 (30%), Positives = 190/415 (45%), Gaps = 41/415 (9%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQY 144
+ + LRRD R H + +R L + + P + + EY + +AIG P
Sbjct: 48 VRDALRRDMHR-HARFTRELASS------GDRTVAAPTRKDLPNGGEYIMTLAIGTPPLS 100
Query: 145 VSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTFSKIPCNSTT--CKILLEWFPP 201
+ DTGS + WTQC PC C +Q ++PS S TF +PCNS+ C L PP
Sbjct: 101 YPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCNSSVSMCAALAGPSPP 160
Query: 202 NGQDKCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTG 259
G CS C Y+ Y G+G T G + + T + R P GC++ ++
Sbjct: 161 PG---CS---CMYNQTY--GTGWTAGIQSVETFTFGSTPADQ--TRVPGIAFGCSNASSD 210
Query: 260 DQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY---GSTGYITFGKPDTVNKKFVKYT 316
D NG++G++GL RG +S++S+ F YCL +P+ ST + G +N V T
Sbjct: 211 DWNGSAGLVGLGRGSMSLVSQLGAGMFSYCL-TPFQDANSTSTLLLGPSAALNGTGVLTT 269
Query: 317 PIVTTPEQ---SEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITRFPA 367
P V +P + S +Y++ LTGIS+G L + + F L T+ IDSGT IT
Sbjct: 270 PFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAF-ALRTDGTGGLIIDSGTTITSLVD 328
Query: 368 PVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV--VVPKITIHFLGGVDLELDV 425
Y +R+A + D C+ L++ + +P +T HF G D+ L V
Sbjct: 329 AAYQQVRAAIESLVTLPVADGSDSTGLDLCFALTSETSTPPSMPSMTFHF-DGADMVLPV 387
Query: 426 RGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+++ S CL S GN QQ+ + YD+ L F P C+
Sbjct: 388 DNYMILGS-GVWCLAMRNQTVGAMST-FGNYQQQNVHLLYDIHEETLSFAPAKCS 440
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 136/448 (30%), Positives = 212/448 (47%), Gaps = 41/448 (9%)
Query: 70 PCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTF--------- 120
P S++ Q R T S+ ++ +D R+ ++R + N K K T
Sbjct: 80 PQSRIKQETKRTTHSVVDLQIQDLTRIKTLHARFNKSKKQKNEKVRKKITSDISLVGAPE 139
Query: 121 --PAK------TGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD 171
P K +G+ + + EY++ V +G P ++ SL+LDTGS + W QC PC C Q
Sbjct: 140 VSPGKLIATLESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNG 199
Query: 172 PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATD 231
F+DP S +F I CN C ++ PP Q + ++ CPY Y D S TG +A +
Sbjct: 200 MFYDPKTSASFKNITCNDPRCSLISSPDPP-VQCESDNQSCPYFYWYGDRSNTTGDFAVE 258
Query: 232 RMTIQEVNGNGYFARYP---FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY--- 285
T+ G + Y + GC N G +GASG++GL RGP+S S+ Y
Sbjct: 259 TFTVNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHS 318
Query: 286 FFYCL---HSPYGSTGYITFGK-PDTVNKKFVKYTPIVTTPEQS--EFYHITLTGISVGG 339
F YCL +S + + FG+ D +N + +T V E S FY+I + I VGG
Sbjct: 319 FSYCLVDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGG 378
Query: 340 ERLPLKASYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRMKK-YKMGKGIEDL 393
+ L + + S T IDSGT ++ F P Y +++ F ++MK+ Y + + +
Sbjct: 379 KALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFP-V 437
Query: 394 FDTCYDLSAYK--TVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSI 451
D C+++S + + +P++ I F+ G + + S VCL P SI
Sbjct: 438 LDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTPKSTFSI 497
Query: 452 LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+GN QQ+ + + YD RLGF P C
Sbjct: 498 -IGNYQQQNFHILYDTKRSRLGFTPTKC 524
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 122/393 (31%), Positives = 184/393 (46%), Gaps = 33/393 (8%)
Query: 102 RRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCK 161
R + +A ++F K P T I EY + ++G P + ++DTGS I W QC+
Sbjct: 59 RSINRA--NHFYKYSLANIPQSTVIPDIGEYLMTYSVGTPPFKLYGIVDTGSDIVWLQCE 116
Query: 162 PCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYVD 220
PC C Q P F+PSKS ++ IPC S C+ + + C+ K C Y Y D
Sbjct: 117 PCQECYNQTTPMFNPSKSSSYKNIPCPSKLCQSMEDT-------SCNDKNYCEYSTYYGD 169
Query: 221 GSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTGDQNGA-SGIMGLDRGPVSII 278
S G + D +T++ NG +P ++GC NN GA SGI+G GP S I
Sbjct: 170 NSHSGGDLSVDTLTLEST--NGLTVSFPNIVIGCGTNNILSYEGASSGIVGFGSGPASFI 227
Query: 279 SKTNISY---FFYCLHSPY-------GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFY 328
++ S F YCL + +T + FG TV+ V TPI+ ++ FY
Sbjct: 228 TQLGSSTGGKFSYCLTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPET-FY 286
Query: 329 HITLTGISVGGERLPLKA--SYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKM 386
++TL SVG R+ + + + + IDSGT +T YS L SA +K ++
Sbjct: 287 YLTLEAFSVGNRRVEIGGVPNGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERV 346
Query: 387 GKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPS 446
+ L + CY + A + P IT+HF G D++L T V + CL F S
Sbjct: 347 DDPTQTL-NLCYSVKA-EGYDFPIITMHF-KGADVDLHPISTFVSVADGVFCLAFE---S 400
Query: 447 DPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ + GN+ Q+ V YD+ + + F P +C
Sbjct: 401 SQDHAIFGNLAQQNLMVGYDLQQKIVSFKPSDC 433
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 126/406 (31%), Positives = 185/406 (45%), Gaps = 33/406 (8%)
Query: 90 RRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI----VAADEYYIVVAIGKPKQYV 145
R +R+ L++ R + + + + T P G V EY + +AIG P Q V
Sbjct: 51 RELMRRMALRSKARAPRLL------SSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPV 104
Query: 146 SLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
L LDTGS + WTQC+PC C Q P++D S+S TF+ C+ST CK+ Q
Sbjct: 105 QLTLDTGSDLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQ- 163
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTGD-QNG 263
+ + C + +Y D S GF D T+ V G A P + GC NNTG ++
Sbjct: 164 --TVQTCAFSYSYGDKSATIGFL--DVETVSFVAG----ASVPGVVFGCGLNNTGIFRSN 215
Query: 264 ASGIMGLDRGPVSIISKTNISYFFYCLHSPYG-STGYITFGKPDTVNKK---FVKYTPIV 319
+GI G RGP+S+ S+ + F +C + G + F P + K V+ TP++
Sbjct: 216 ETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLI 275
Query: 320 TTPEQSEFYHITLTGISVGGERLPLKASYFTKLS----TEIDSGTIITRFPAPVYSALRS 375
P FY+++L GI+VG RLP+ S F + T IDSGT T P VY +
Sbjct: 276 KNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHD 335
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYDLSAY-KTVVVPKITIHFLGGVDLELDVRGTLVVESV 434
F + K + E C+ K VPK+ +HF G + L R V E+
Sbjct: 336 EFAAHV-KLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFEGAT-MHLP-RENYVFEAK 392
Query: 435 RQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L + ++GN QQ+ V YD+ +L F C+
Sbjct: 393 DGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKCD 438
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 115/371 (30%), Positives = 170/371 (45%), Gaps = 43/371 (11%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + +AIGKP L DTGS +TWTQC+PC C Q P +DPS S TFS +PC+S
Sbjct: 70 EYLMELAIGKPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPLPCSSA 129
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
TC P ++ S C Y AY DG+ G T+ +T+ + F
Sbjct: 130 TC------LPIWSRNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAF- 182
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYC--------LHSPY--GSTGYI 300
GC +N GD ++G +GL RG +S++++ + F YC L SP+ G+ +
Sbjct: 183 -GCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLTDFFNSALDSPFLLGTLAEL 241
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE----- 355
G P TV TP++ +P+ Y ++L GIS+G RLP+ F
Sbjct: 242 APG-PSTVQS-----TPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMI 295
Query: 356 IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKG-----IEDLFDTCYDLSAYKTVVVPK 410
+DSGT T S FR+ + + G L C+ A + +P
Sbjct: 296 VDSGTTFTIL-------AESGFREVVGRVARVLGQPPVNASSLDAPCFPAPAGEPPYMPD 348
Query: 411 ITIHFLGGVDLELDVRGTLVV-ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
+ +HF GG D+ L + E CL A + S+ LGN QQ+ ++ +D
Sbjct: 349 LVLHFAGGADMRLYRDNYMSYNEEDSSFCLNIAGTTPESTSV-LGNFQQQNIQMLFDTTV 407
Query: 470 RRLGFGPGNCN 480
+L F P +C+
Sbjct: 408 GQLSFLPTDCS 418
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 127/435 (29%), Positives = 199/435 (45%), Gaps = 46/435 (10%)
Query: 62 LEVLGRYGPCSKLNQGKSRN-TPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTF 120
L V+ YG CS KS + ++ ++ +D R+ +S QK T A
Sbjct: 32 LSVIPIYGKCSPFTAPKSESWMNTVIDMASKDPARIRYLSSLTAQK--------TVAAPI 83
Query: 121 PAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSK 180
+ ++ Y + V +G P Q + ++LDT + W C CI CS F S
Sbjct: 84 ASGQQVLNVGNYVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGCSSTTT--FSAQNSS 141
Query: 181 TFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNG 240
TF+ + C+ C P + +C ++ Y G++ F AT +Q+
Sbjct: 142 TFATLDCSKPECTQARGLSCPTTGN----VDCLFNQTY---GGDSTFSAT---LVQDSLH 191
Query: 241 NGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS--PYG 295
G F GC + +G G+MGL RGP+S+IS++ Y F YCL S Y
Sbjct: 192 LGPNVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSYY 251
Query: 296 STGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-----T 350
+G + G K ++ TP++ P + Y++ LTGISVG +P+ T
Sbjct: 252 FSGSLKLGP--VGQPKAIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNT 309
Query: 351 KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--FDTCYDLSAYKTVVV 408
T IDSGT+ITRF +Y+A+R FRK ++G L FDTC+ + V
Sbjct: 310 GAGTIIDSGTVITRFVPAIYTAVRDEFRK-----QVGGSFSPLGAFDTCF--ATNNEVSA 362
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLP--SDPNSILLGNVQQRGYEVHY 465
P IT+H L G+DL+L + +L+ S + CL A P + ++ N+QQ+ + + +
Sbjct: 363 PAITLH-LSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRILF 421
Query: 466 DVAGRRLGFGPGNCN 480
D+ +LG CN
Sbjct: 422 DINNSKLGIARELCN 436
>gi|125602787|gb|EAZ42112.1| hypothetical protein OsJ_26672 [Oryza sativa Japonica Group]
Length = 477
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 108/350 (30%), Positives = 164/350 (46%), Gaps = 64/350 (18%)
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQ 204
+++++DTGS +TW QCKPC C QRDP FDPS S +++ +PCN++ C+ L+
Sbjct: 176 LTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKA-ATGVP 234
Query: 205 DKCS----------SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
C+ S+ C Y +AY DGS G ATD + + + +G F+ GC
Sbjct: 235 GSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDG------FVFGCG 288
Query: 255 DNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVK 314
+N G G +G+MGL P G+ + G P
Sbjct: 289 LSNRGLFGGTAGLMGL---------------------GPDGALAGLPDGAP--------- 318
Query: 315 YTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALR 374
FY + +TG SV + A+ + +DSGT+ITR VY A+R
Sbjct: 319 ----------PPFYFMNVTGASV--GGAAVAAAGLGAANVLLDSGTVITRLAPSVYRAVR 366
Query: 375 SAFRKRM--KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVV- 431
+ F ++ ++Y L D CY+L+ + V VP +T+ GG D+ +D G L +
Sbjct: 367 AEFARQFGAERYPAAPPFS-LLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAAGMLFMA 425
Query: 432 -ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ QVCL A L + + ++GN QQ+ V YD G RLGF +C+
Sbjct: 426 RKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 475
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 119/369 (32%), Positives = 175/369 (47%), Gaps = 39/369 (10%)
Query: 125 GIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSK 184
IV + Y + IG P Q + + LDT + W C C+ CS FDPSKS +
Sbjct: 81 AIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSV--LFDPSKSSSSRT 138
Query: 185 IPCNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGY 243
+ C + CK PN C+ SK C +++ Y GS + D +T+ +
Sbjct: 139 LQCEAPQCKQA-----PN--PSCTVSKSCGFNMTY-GGSTIEAYLTQDTLTL----ASDV 186
Query: 244 FARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS--TG 298
Y F GC + +G A G+MGL RGP+S+IS++ Y F YCL + S +G
Sbjct: 187 IPNYTF--GCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSG 244
Query: 299 YITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYF---TKLS 353
+ G + + +K TP++ P +S Y++ L GI VG + +P A F T
Sbjct: 245 SLRLGPKNQPIR--IKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAG 302
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITI 413
T DSGT+ TR P Y A+R+ FR+R+K FDTCY +VV P +T
Sbjct: 303 TIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATS--LGGFDTCYS----GSVVFPSVTF 356
Query: 414 HFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRGYEVHYDVAGR 470
F G+++ L L+ S + CL A P + NS+L + ++QQ+ + V DV
Sbjct: 357 MF-AGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNS 415
Query: 471 RLGFGPGNC 479
RLG C
Sbjct: 416 RLGISRETC 424
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 120/365 (32%), Positives = 170/365 (46%), Gaps = 23/365 (6%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
V EY + +AIG P Q V L LDTGS + WTQC+PC C Q P++D S+S TF+
Sbjct: 30 VPMTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPS 89
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
C+ST CK+ Q + + C Y +Y D S GF D T+ V G A
Sbjct: 90 CDSTQCKLDPSVTMCVNQ---TVQTCAYSYSYGDKSATIGFL--DVETVSFVAG----AS 140
Query: 247 YP-FLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYG-STGYITFG 303
P + GC NNTG ++ +GI G RGP+S+ S+ + F +C + G + F
Sbjct: 141 VPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVSGRKPSTVLFD 200
Query: 304 KPDTVNKK---FVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS----TEI 356
P + K V+ TP++ P FY+++L GI+VG RLP+ S F + T I
Sbjct: 201 LPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTII 260
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAY-KTVVVPKITIHF 415
DSGT T P VY + F + K + E C+ K VPK+ +HF
Sbjct: 261 DSGTAFTSLPPRVYRLVHDEFAAHV-KLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHF 319
Query: 416 LGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
G + L R V E+ L + ++GN QQ+ V YD+ +L F
Sbjct: 320 EGAT-MHLP-RENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFV 377
Query: 476 PGNCN 480
C+
Sbjct: 378 RAKCD 382
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 106/306 (34%), Positives = 148/306 (48%), Gaps = 26/306 (8%)
Query: 122 AKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKT 181
A G +A +EY + +A+G P + V+L LDTGS + WTQC PC C Q P DP+ S T
Sbjct: 76 AAAGGIATNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAASST 135
Query: 182 FSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE---V 238
++ +PC + C+ L C + C Y Y D S G ATDR T +
Sbjct: 136 YAALPCGAPRCRAL-------PFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGDNGRR 188
Query: 239 NGNGYF-ARYPFLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGS 296
NG+G A GC N G Q+ +GI G RG S+ S+ N + F YC S + S
Sbjct: 189 NGDGSLPATRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNATSFSYCFTSMFDS 248
Query: 297 TGYITF--GKPDTV----NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT 350
I G P + + V+ TP+ P Q Y ++L GISVG RLP+ + F
Sbjct: 249 KSSIVTLGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFR 308
Query: 351 KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE-DLFDTCYDL---SAYKTV 406
ST IDSG IT P VY A+++ F ++ G+E D C+ L + ++
Sbjct: 309 --STIIDSGASITTLPEEVYEAVKAEFAAQVGLPP--SGVEGSALDVCFALPVSALWRRP 364
Query: 407 VVPKIT 412
VP +T
Sbjct: 365 AVPSLT 370
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 120/369 (32%), Positives = 175/369 (47%), Gaps = 39/369 (10%)
Query: 125 GIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSK 184
IV + Y + IG P Q + + LDT + W C C+ CS FDPSKS +
Sbjct: 81 AIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSV--LFDPSKSSSSRT 138
Query: 185 IPCNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGY 243
+ C + CK PN C+ SK C +++ Y GS + D +T+ +
Sbjct: 139 LQCEAPQCKQA-----PN--PSCTVSKSCGFNMTY-GGSTIEAYLTQDTLTL----ASDV 186
Query: 244 FARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS--TG 298
Y F GC + +G A G+MGL RGP+S+IS++ Y F YCL + S +G
Sbjct: 187 IPNYTF--GCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSG 244
Query: 299 YITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYF---TKLS 353
+ G + + +K TP++ P +S Y++ L GI VG + +P A F T
Sbjct: 245 SLRLGPKNQPIR--IKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAG 302
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITI 413
T DSGT+ TR P Y A+R+ FR+R+K FDTCY S VV P +T
Sbjct: 303 TIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATS--LGGFDTCYSGS----VVFPSVTF 356
Query: 414 HFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRGYEVHYDVAGR 470
F G+++ L L+ S + CL A P + NS+L + ++QQ+ + V DV
Sbjct: 357 MF-AGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNS 415
Query: 471 RLGFGPGNC 479
RLG C
Sbjct: 416 RLGISRETC 424
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 136/426 (31%), Positives = 197/426 (46%), Gaps = 46/426 (10%)
Query: 86 EEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTG--IVAADEYYIVVAIGKPK- 142
E+LRR R + + A A T P G V + EY I + IG P+
Sbjct: 52 HELLRRMVARSKARLASLRSSAC------DTALTAPVDHGGSDVGSSEYLIHLGIGTPRP 105
Query: 143 QYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
Q V L LDTGS + WTQC C C Q P F S S TFS++PC+ C + + P +
Sbjct: 106 QRVVLHLDTGSDLVWTQCA-CTVCFDQPVPVFRASVSHTFSRVPCSDPLCGHAV-YLPLS 163
Query: 203 GQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL-LGCTDNNTG 259
G C++++ C Y Y+D S TG A D T + + A P + GC N G
Sbjct: 164 G---CAARDRSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYG 220
Query: 260 D-QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGS--TGYITFGKPDTVNKKF---V 313
SGI G GP+S+ S+ + F YC + S + I G+P+ + +
Sbjct: 221 LFTPNQSGIAGFGTGPLSLPSQLKVRRFSYCFTAMEESRVSPVILGGEPENIEAHATGPI 280
Query: 314 KYTPIVTTPE-----QSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIIT 363
+ TP P FY ++L G++VG RLP AS F T IDSGT IT
Sbjct: 281 QSTPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAIT 340
Query: 364 RFPAPVYSALRSAFRKRMKKYKMGKGIEDLFD-TCYDLSAYKTV-VVPKITIHFLGGVDL 421
FP V+ +LR AF ++ + KG D + C+ + A K VPK+ +H L G D
Sbjct: 341 FFPQAVFRSLREAFVAQV-PLPVAKGYTDPDNLLCFSVPAKKKAPAVPKLILH-LEGADW 398
Query: 422 ELDVRGTLVVE-------SVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
EL R V++ + R++C+ L + N ++GN QQ+ + YD+ ++ F
Sbjct: 399 ELP-RENYVLDNDDDGSGAGRKLCV-VILSAGNSNGTIIGNFQQQNMHIVYDLESNKMVF 456
Query: 475 GPGNCN 480
P C+
Sbjct: 457 APARCD 462
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 116/368 (31%), Positives = 180/368 (48%), Gaps = 22/368 (5%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY+I + +G P ++V L+LDTGS ++W QC PC C +Q P ++P++S ++ I C
Sbjct: 169 EYFIDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSYRNISCYDP 228
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEV--NGNGYFAR-Y 247
C+ L+ P K ++ CPY Y DGS TG +A + T+ NG F
Sbjct: 229 RCQ-LVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVV 287
Query: 248 PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGY---IT 301
+ GC N G +GA G++GL RGP+S S+ Y F YCL + +T +
Sbjct: 288 DVMFGCGHWNKGFFHGAGGLLGLGRGPLSFPSQLQSIYGHSFSYCLTDLFSNTSVSSKLI 347
Query: 302 FGKPDTV----NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL--PLKASYFTKL--- 352
FG+ + N F K TP+ + FY++ + I VGGE L P K +++
Sbjct: 348 FGEDKELLNHHNLNFTKLLAGEETPDDT-FYYLQIKSIVVGGEVLDIPEKTWHWSSEGVG 406
Query: 353 STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKIT 412
T IDSG+ +T FP Y ++ AF K++K ++ + + CY++S V +P
Sbjct: 407 GTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAAD-DFIMSPCYNVSGAMQVELPDYG 465
Query: 413 IHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRR 471
IHF G E +CL P+ + ++GN+ Q+ + + YDV R
Sbjct: 466 IHFADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQQNFHILYDVKRSR 525
Query: 472 LGFGPGNC 479
LG+ P C
Sbjct: 526 LGYSPRRC 533
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 123/355 (34%), Positives = 173/355 (48%), Gaps = 34/355 (9%)
Query: 138 IGKPKQYVSLLLDTGSGITWTQCKPCI---HCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
+G+P+Q +LDTGS +TW QC PC C +Q P FDP S +++ + C+S C++
Sbjct: 3 VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62
Query: 195 LLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
L E C+ C Y + Y DGS G AT+ +T N + +GC
Sbjct: 63 LDEA-------GCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNIS-----IGCG 110
Query: 255 DNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL---HSPYGSTGYITFGKP-DTVNK 310
+N G GA G++GL G +SI S+ S F YCL SP ST P D++
Sbjct: 111 HDNEGLFVGADGLIGLGGGAISISSQLKASSFSYCLVDIDSPSFSTLDFNTDPPSDSL-- 168
Query: 311 KFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRF 365
+P+V F ++ + G+SVGG+ LP+ +S F + +DSGT IT+
Sbjct: 169 ----ISPLVKNDRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQL 224
Query: 366 PAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDV 425
P+ VY LR AF I FDTCYDLS+ V VP I G L+L
Sbjct: 225 PSDVYEVLREAFLGLTTNLPPAPEISP-FDTCYDLSSQSNVEVPTIAFILPGENSLQLPA 283
Query: 426 RGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ L+ V+S CL F + + P SI +GN QQ+G V YD+ +GF C
Sbjct: 284 KNCLIQVDSAGTFCLAF-VSATFPLSI-IGNFQQQGIRVSYDLTNSLVGFSTNKC 336
>gi|242095592|ref|XP_002438286.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
gi|241916509|gb|EER89653.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
Length = 495
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 134/460 (29%), Positives = 201/460 (43%), Gaps = 63/460 (13%)
Query: 62 LEVLGRYGPCSKLNQG--KSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTK--- 116
L ++ R PCS + G + + PSL+EIL RD RL + + A
Sbjct: 54 LPLVHRLSPCSPVTGGGAQKKGKPSLQEILHRDGLRLQYLSQVQAATAAAAPAAAPAPSA 113
Query: 117 -----AFTFPAKTGIVAAD----EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCS 167
+ PA I+++ EY ++ G P Q + L D SG++ +CKPC S
Sbjct: 114 TTPASGLSVPATQNIISSLPGVFEYTVLAGYGTPAQQLPLFFDV-SGMSNMRCKPCFSGS 172
Query: 168 QQR------DPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSK-ECPYDI---A 217
D FDPS S +F + C S C G CS+ C + +
Sbjct: 173 SGGETTTTCDVAFDPSMSSSFRSVLCGSPDC----------GGHSCSAGGSCTFTLQNST 222
Query: 218 YVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT--DNNTGDQNGASGIMGLDRGPV 275
+V G+G T M ++ + F F +GC DN+ A G + L
Sbjct: 223 FVFGNG------TIVMDTLTLSPSATFEN--FAVGCMQLDNDLFTDGVAVGNIDLSLSRH 274
Query: 276 SIISKT------NISYFFYCLHSPYGSTGYITFGKP--DTVNKKFVKYTPIVTTPEQSEF 327
S+ ++ ++ F YCL + + G++T D + VKY P+VT P F
Sbjct: 275 SLATRVLNSSPPGMAAFSYCLPADTDTHGFLTIAPALSDYSDHAGVKYVPLVTNPTGPNF 334
Query: 328 YHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMG 387
Y++ L I++ GE LP+ + FT T IDS + T P+Y+ALR FRK M +Y+
Sbjct: 335 YYVDLVAIAINGEDLPIPPALFTGNGTMIDSQSAFTYLNPPIYAALRDEFRKAMLQYQPV 394
Query: 388 KGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTL------VVESVRQVCLGF 441
L DTCY+ + + + +P IT+ F G ++LD R + + + CL F
Sbjct: 395 PAFGGL-DTCYNFTLAENIYLPDITLRFSNGETMDLDDRQFMYFFREHLTDGFPFGCLAF 453
Query: 442 ALLPSDPNSI--LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
A P D N LG+ QR E+ YDV G + F P C
Sbjct: 454 AAAP-DQNFPWNYLGSQVQRTKEIVYDVRGGMVAFVPSRC 492
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 133/431 (30%), Positives = 196/431 (45%), Gaps = 44/431 (10%)
Query: 78 KSRNTPSLEEILRRDQQRLHLKNSR-----RLQKAIPDNFKKTKAFT---------FPAK 123
+S+N E++ D R N R R+ + + K+ P
Sbjct: 21 ESQNRGFSVELIHPDSSRSPFYNIRETQLQRISNVVTHSIKRAHYLNHVFSLSHNDLPKP 80
Query: 124 TGIVAADEYYIV-VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTF 182
T I A YY++ +IG P + ++DTGS W QCKPC C Q P F+PSKS T+
Sbjct: 81 TIIPYAGSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFNPSKSSTY 140
Query: 183 SKIPCNSTTCKILLEWFPPNGQDKCSS---KECPYDIAYVDGSGETGFWATDRMTIQEVN 239
I C+S CK + +CSS ++C Y+I Y+D SG G + D +T+ +
Sbjct: 141 KNIRCSSPICK-------RGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSND 193
Query: 240 GNGYFARYP-FLLGCTDNNTGDQNG-ASGIMGLDRGPVSIISKTNISY---FFYCLHSPY 294
G+ +P ++GC N+ G ASGI+G RG SI+S+ S F YCL S +
Sbjct: 194 GSP--ISFPKIVIGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLF 251
Query: 295 GS---TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-- 349
+ + FG V+ V TP++ + ++ L SVG + LK S
Sbjct: 252 SKANISSKLYFGDMAVVSGHGVVSTPLIQSFYVGNYF-TNLEAFSVGDHIIKLKDSSLIP 310
Query: 350 -TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVV 408
+ + IDSG+ IT+ P VYS L +A M K K K CY + K V
Sbjct: 311 DNEGNAVIDSGSTITQLPNDVYSQLETAVIS-MVKLKRVKDPTQQLSLCYK-TTLKKYEV 368
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVA 468
P IT HF G D++L+ T + + +C FA S ++ GN+ Q+ + V YD
Sbjct: 369 PIITAHFRGA-DVKLNAFNTFIQMNHEVMC--FAFNSSAFPWVVYGNIAQQNFLVGYDTL 425
Query: 469 GRRLGFGPGNC 479
+ F P NC
Sbjct: 426 KNIISFKPTNC 436
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 115/377 (30%), Positives = 170/377 (45%), Gaps = 40/377 (10%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + +AIG P L DTGS +TWTQCKPC C Q P +D + S +FS +PC S
Sbjct: 94 EYLMELAIGTPPVPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSPVPCASA 153
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-- 248
TC L W ++ C Y AY DG+ G T+ +T G+ A P
Sbjct: 154 TC--LPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFA---GSSPGAPGPGV 208
Query: 249 ----FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYC--------LHSPYGS 296
GC +N G ++G +GL RG +S++++ + F YC L SP
Sbjct: 209 SVGGVAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLGSPVLF 268
Query: 297 TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----K 351
P T+ V+ TP+V P Y+++L GIS+G RLP+ F
Sbjct: 269 GSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRDDGS 328
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYK--MGKGI---EDLFDTCYDLSAYKTV 406
+DSGTI T + SAFR + + + + L C+ +A +
Sbjct: 329 GGMIVDSGTIFTVL-------VESAFRVVVNHVAGVLNQPVVNASSLDSPCFPATAGEQQ 381
Query: 407 V--VPKITIHFLGGVDLELDVRGTLVV-ESVRQVCLGFALLPSDPNSILLGNVQQRGYEV 463
+ +P + +HF GG D+ L + + CL A PS SI LGN QQ+ ++
Sbjct: 382 LPDMPDMLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSI-LGNFQQQNIQM 440
Query: 464 HYDVAGRRLGFGPGNCN 480
+D+ +L F P +C+
Sbjct: 441 LFDITVGQLSFVPTDCS 457
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 114/388 (29%), Positives = 177/388 (45%), Gaps = 28/388 (7%)
Query: 116 KAFTFPAKTGIVAAD-EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP-- 172
+F P +G + +Y++ + IG P Q + L+ DTGS + W +C PC +CS R P
Sbjct: 69 NSFRSPVISGASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCS-HRSPGS 127
Query: 173 FFDPSKSKTFSKIPCNSTTCKILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATD 231
F S T+S I C S C+++ P P + + S C Y Y D S TGF++ +
Sbjct: 128 AFFARHSTTYSAIHCYSPQCQLVPHPHPNPCNRTRLHS-PCRYQYTYADSSTTTGFFSKE 186
Query: 232 RMTIQEVNG-----NGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY- 285
+T+ G NG F + GA G+MGL R P+S S+ +
Sbjct: 187 ALTLNTSTGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFG 246
Query: 286 --FFYCLH----SPYGSTGYITFGKPDTV---NKKFVKYTPIVTTPEQSEFYHITLTGIS 336
F YCL SP T ++T G V K + +TP++ P FY+I + G+
Sbjct: 247 SKFSYCLMDYTLSP-PPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVY 305
Query: 337 VGGERLPLKASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE 391
V G +LP+ S ++ T IDSGT +T P Y+ + AF+KR+K +
Sbjct: 306 VNGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTP 365
Query: 392 DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSI 451
FD C ++S +P+++ + GG R + + CL + D
Sbjct: 366 G-FDLCMNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFS 424
Query: 452 LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+LGN+ Q+G+ + +D RLGF C
Sbjct: 425 VLGNLMQQGFLLEFDRDKSRLGFTRRGC 452
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 113/364 (31%), Positives = 179/364 (49%), Gaps = 33/364 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 82 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 139
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 140 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPSFT 191
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ + F YCL S G +TGY
Sbjct: 192 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGY 251
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + L ISV GERL L S F++ DSG
Sbjct: 252 FSLGKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSG 309
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 310 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 367
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
+L G V SV++ CL FA P++ SI +G++ Q EV YD+ + +G GP
Sbjct: 368 RFDLGSHGVFVERSVQEQDVWCLAFA--PTESVSI-IGSLMQTSKEVVYDLKRQLIGIGP 424
Query: 477 -GNC 479
G C
Sbjct: 425 SGAC 428
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 151 bits (382), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 114/366 (31%), Positives = 169/366 (46%), Gaps = 14/366 (3%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
V + EY + + +G P + +++DTGS + W QC PC+ C +QR P FDP+ S ++ +
Sbjct: 147 VGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASLSYRNVT 206
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
C C ++ P + S CPY Y D S TG A + T+
Sbjct: 207 CGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRV 266
Query: 247 YPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGY-ITF 302
+ GC +N G +GA+G++GL RG +S S+ Y F YCL S G I F
Sbjct: 267 DDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGSSVGSKIVF 326
Query: 303 GKPDT-VNKKFVKYT--PIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLST 354
G D + + YT FY++ L G+ VGGE+L + S + T
Sbjct: 327 GDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGT 386
Query: 355 EIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
IDSGT ++ F P Y +R AF +RM K + CY++S + V VP+ ++
Sbjct: 387 IIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPEFSLL 446
Query: 415 FLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
F G + V ++ +CL P SI +GN QQ+ + V YD+ RLG
Sbjct: 447 FADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSI-IGNFQQQNFHVLYDLQNNRLG 505
Query: 474 FGPGNC 479
F P C
Sbjct: 506 FAPRRC 511
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 114/366 (31%), Positives = 169/366 (46%), Gaps = 14/366 (3%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
V + EY + + +G P + +++DTGS + W QC PC+ C +QR P FDP+ S ++ +
Sbjct: 147 VGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSLSYRNVT 206
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
C C ++ P + S CPY Y D S TG A + T+
Sbjct: 207 CGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRV 266
Query: 247 YPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGY-ITF 302
+ GC +N G +GA+G++GL RG +S S+ Y F YCL S G I F
Sbjct: 267 DDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGSSVGSKIVF 326
Query: 303 GKPDT-VNKKFVKYT--PIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLST 354
G D + + YT FY++ L G+ VGGE+L + S + T
Sbjct: 327 GDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGT 386
Query: 355 EIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
IDSGT ++ F P Y +R AF +RM K + CY++S + V VP+ ++
Sbjct: 387 IIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPEFSLL 446
Query: 415 FLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
F G + V ++ +CL P SI +GN QQ+ + V YD+ RLG
Sbjct: 447 FADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSI-IGNFQQQNFHVLYDLQNNRLG 505
Query: 474 FGPGNC 479
F P C
Sbjct: 506 FAPRRC 511
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 128/417 (30%), Positives = 191/417 (45%), Gaps = 48/417 (11%)
Query: 89 LRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKT-GIVAADEYYIVVAIGKPKQYVSL 147
LRRD +H N+R+L A + T A T A EY + +AIG P
Sbjct: 55 LRRD---MHRHNARKLALA------ASSGATVSAPTQNSPTAGEYLMALAIGTPPLPYQA 105
Query: 148 LLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNST-----TCKILLEWFPP 201
+ DTGS + WTQC PC C +Q P ++PS S TF+ +PCNS+ PP
Sbjct: 106 IADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPP 165
Query: 202 NGQDKCSSKECPYDIAYVDGSGETG-FWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTG 259
G C+ C Y++ Y GSG T F ++ T +R P GC+ ++G
Sbjct: 166 PG---CA---CTYNVTY--GSGWTSVFQGSETFTFGSTPAGQ--SRVPGIAFGCSTASSG 215
Query: 260 -DQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY---GSTGYITFGKPDTVNKKF-VK 314
+ + ASG++GL RG +S++S+ + F YCL +PY ST + G ++N V
Sbjct: 216 FNASSASGLVGLGRGRLSLVSQLGVPKFSYCL-TPYQDTNSTSTLLLGPSASLNGTAGVS 274
Query: 315 YTPIVTTPEQS---EFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITRF 365
TP V +P + FY++ LTGIS+G L + F L+ + IDSGT IT
Sbjct: 275 STPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAF-LLNADGTGGLIIDSGTTITLL 333
Query: 366 PAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV--VVPKITIHFLGGVDLEL 423
Y +R+A + D C+ L + + +P +T+HF G D+ L
Sbjct: 334 GNTAYQQVRAAVVSLVTLPTTDGSAATGLDLCFMLPSSTSAPPAMPSMTLHF-NGADMVL 392
Query: 424 DVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
++ + CL +D +LGN QQ+ + YD+ L F P C+
Sbjct: 393 PADSYMMSDDSGLWCLAMQNQ-TDGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 448
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 126/401 (31%), Positives = 186/401 (46%), Gaps = 26/401 (6%)
Query: 94 QRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD-EYYIVVAIGKPKQYVSLLLDTG 152
QR+ R + +A N K A T A++ + A+ EY + ++G P + ++DTG
Sbjct: 58 QRVANAMRRSINRANHFNKKSFVASTNTAESTVKASQGEYLMSYSVGTPPFEILGVVDTG 117
Query: 153 SGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKEC 212
SGITW QC+ C C +Q P FDPSKSKT+ +PC+S C+ ++ P DK C
Sbjct: 118 SGITWMQCQRCEDCYEQTTPIFDPSKSKTYKTLPCSSNMCQSVIST-PSCSSDKIG---C 173
Query: 213 PYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTGDQNGASGIMGLD 271
Y I Y DGS G + + +T+ NG+ ++P ++GC NN G G +
Sbjct: 174 KYTIKYGDGSHSQGDLSVETLTLGSTNGSS--VQFPNTVIGCGHNNKGTFQGEGSGVVGL 231
Query: 272 RGPVSIISKTNISY----FFYCL---HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQ 324
G + S F YCL S S+ + FG V+ TP+V+
Sbjct: 232 GGGPVSLISQLSSSIGGKFSYCLAPMFSQSNSSSKLNFGDAAVVSGLGAVSTPLVSKTGS 291
Query: 325 SEFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITRFPAPVYSALRSAFR 378
FY++TL SVG +R+ + S+ IDSGT +T P YS L SA
Sbjct: 292 EVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGNIIIDSGTTLTLLPQEDYSNLESAVA 351
Query: 379 KRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVC 438
++ ++ + CY + + VP IT HF G D+EL+ T V + VC
Sbjct: 352 DAIQANRVSDP-SNFLSLCYQTTPSGQLDVPVITAHF-KGADVELNPISTFVQVAEGVVC 409
Query: 439 LGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
FA S+ SI GN+ Q V YD+ + + F P +C
Sbjct: 410 --FAFHSSEVVSI-FGNLAQLNLLVGYDLMEQTVSFKPTDC 447
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 132/441 (29%), Positives = 203/441 (46%), Gaps = 51/441 (11%)
Query: 58 GKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKA 117
G S++++ R P S T L + R R+ R Q A+ + +++
Sbjct: 30 GGFSVDLIHRDSPHSPFFDPSKTRTERLTDAFHRSASRV----GRFRQSAMTSDGIQSRL 85
Query: 118 FTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPS 177
+ +A EY + ++IG P V ++DTGS +TWTQC+PC HC +Q PFFDP
Sbjct: 86 --------VPSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFFDPK 137
Query: 178 KSKTFSKIPCNSTTCKILLEWFPPNGQDKC--SSKECPYDIAYVDGSGETGFWATDRMTI 235
S T+ C ++ C L G D+ + K+C + +Y DGS G A + +T+
Sbjct: 138 NSSTYRDSSCGTSFCLAL-------GNDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTV 190
Query: 236 QEVNGNGYFARYP-FLLGCTDNNTG--DQNGASGIMGLDRGPVSIIS--KTNIS-YFFYC 289
G +P F GC + G D++ +SGI+GL +S+IS K+ I+ F YC
Sbjct: 191 ASTAGKP--VSFPGFAFGCVHRSGGIFDEH-SSGIVGLGVAELSMISQLKSTINGRFSYC 247
Query: 290 LHSPYGSTGY---ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKA 346
L + + I FG+ V+ TP+V + +Y ITL G SVG +RL K
Sbjct: 248 LLPVFTDSSMSSRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKG 307
Query: 347 SYFTKLSTE-----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED---LFDTCY 398
+ K E +DSGT T P Y L + +K GK + D + CY
Sbjct: 308 -FSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIK----GKRVRDPNGISSLCY 362
Query: 399 DLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQ 458
+ + + P IT HF ++EL T + VC F +LP+ I LGN+ Q
Sbjct: 363 N-TTVDQIDAPIITAHF-KDANVELQPWNTFLRMQEDLVC--FTVLPTSDIGI-LGNLAQ 417
Query: 459 RGYEVHYDVAGRRLGFGPGNC 479
+ V +D+ +R+ F +C
Sbjct: 418 VNFLVGFDLRKKRVSFKAADC 438
>gi|357125298|ref|XP_003564331.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 524
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 111/346 (32%), Positives = 164/346 (47%), Gaps = 49/346 (14%)
Query: 170 RDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG--------------QDKCSSKECPYD 215
R+ FDP+KS + + +PC S C+ L + NG + S+ +C Y
Sbjct: 192 RNALFDPTKSFSAAAVPCGSRACRALGNY--GNGCSNNSRRNKKKNKSKSNNSTGDCNYR 249
Query: 216 IAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNG-ASGIMGLDRGP 274
+AY DG +G + TD +TI G + F GC+ G +G SG M L G
Sbjct: 250 VAYSDGRVSSGTYMTDILTISP--GTSFLN---FRFGCSHGVRGSFSGETSGTMSLGGGR 304
Query: 275 VSIISKTNISY---FFYCLHSPYGSTGYITFGKP-DTVNKKFVKYTPIVTTPEQSE---- 326
S++S+T +Y F YC+ P S G+++ G + + + VTTP
Sbjct: 305 QSLLSQTARAYGNAFSYCVPKPSAS-GFLSLGGAINDGDSDSDSPSSFVTTPLMRNARIV 363
Query: 327 ---FYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKK 383
+Y + L GI V G RL + F+ T +DS ++T+ P Y ALR AFR M+
Sbjct: 364 NPTYYVVRLQGIDVAGRRLNVPPVVFSG-GTLMDSSAVVTQLPPTAYRALRLAFRNAMRG 422
Query: 384 YKMG----------KGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVES 433
Y+M G E + DTCYD V VP +++ F GG ++LD +++E
Sbjct: 423 YRMNTRNGSTSSTPAGGEMILDTCYDFEGLDNVTVPTVSLVFFGGAVVDLDPTTAVMMEG 482
Query: 434 VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL F P+D + +GNVQQ+ +EV YDV R +GF G C
Sbjct: 483 ----CLAFVPTPADFDLGFIGNVQQQTHEVLYDVGARNVGFRRGAC 524
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 111/367 (30%), Positives = 167/367 (45%), Gaps = 33/367 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + +AIG P L DTGS +TWTQC+PC C Q P +DPS S TFS +PC+S
Sbjct: 65 EYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSA 124
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
TC L W N + S C Y +Y DG+ G T+ +TI +
Sbjct: 125 TC--LPTWRSRNCSNP--SSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVA 180
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST-------GYITFG 303
GC +N GD ++G +GL RG +S++++ + F YCL + ST G +
Sbjct: 181 FGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLTDFFNSTMDSPFFLGTLAEL 240
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS-----TEIDS 358
P V+ TP++ +P Y + L GIS+G RLP+ F + +DS
Sbjct: 241 AP---GPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDS 297
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT----VVVPKITIH 414
GT T +S FR+ + + G + + D + + +P + +H
Sbjct: 298 GTTFTIL-------AKSGFREVVDRVAQLLGQPPVNASSLDSPCFPSPDGEPFMPDLVLH 350
Query: 415 FLGGVDLELDVRGTLVV-ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
F GG D+ L + E CL PS + LGN QQ+ ++ +D+ +L
Sbjct: 351 FAGGADMRLHRDNYMSYNEDDSSFCLNIVGSPSTWSR--LGNFQQQNIQMLFDMTVGQLS 408
Query: 474 FGPGNCN 480
F P +C+
Sbjct: 409 FLPTDCS 415
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 118/361 (32%), Positives = 173/361 (47%), Gaps = 30/361 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
E+ + +AIG P + S ++DTGS + WTQCKPC C Q P FDP KS +FSK+ C+S
Sbjct: 96 EFLMKLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSK 155
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C+ L Q CS C Y Y D S G A++ +T G +
Sbjct: 156 LCEAL-------PQSTCSDG-CEYLYGYGDYSSTQGMLASETLTF------GKVSVPEVA 201
Query: 251 LGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGS-TGYITFGKPDTV 308
GC ++N G + SG++GL RGP+S++S+ F YCL S + + G +V
Sbjct: 202 FGCGEDNEGSGFSQGSGLVGLGRGPLSLVSQLKEPKFSYCLTSVDDTKASTLLMGSLASV 261
Query: 309 --NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTI 361
+ +K TP++ Q FY+++L GISVG LP+K S F+ IDSGT
Sbjct: 262 KASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTT 321
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT-VVVPKITIHFLGGVD 420
IT + + F ++ G L + C+ L + T + VPK+ HF G D
Sbjct: 322 ITYLEQSAFDLVAKEFTSQINLPVDNSGSTGL-EVCFTLPSGSTDIEVPKLVFHF-DGAD 379
Query: 421 LELDVRGTLVVESVRQV-CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
LEL ++ ++ V CL S + GN+QQ+ V +D+ L F P C
Sbjct: 380 LELPAENYMIADASMGVACLAMG---SSSGMSIFGNIQQQNMLVLHDLEKETLSFLPTQC 436
Query: 480 N 480
+
Sbjct: 437 D 437
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 125/409 (30%), Positives = 190/409 (46%), Gaps = 43/409 (10%)
Query: 96 LHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGI 155
+H N+R+L A + A T + T A EY + +AIG P + DTGS +
Sbjct: 1 MHRHNARKLALAA-SSGATVSAPTQDSPT----AGEYLMALAIGTPPLPYQAIADTGSDL 55
Query: 156 TWTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNST-----TCKILLEWFPPNGQDKCSS 209
WTQC PC C +Q P ++PS S TF+ +PCNS+ PP G C+
Sbjct: 56 IWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPG---CA- 111
Query: 210 KECPYDIAYVDGSGETG-FWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTG-DQNGASG 266
C Y++ Y GSG T F ++ T AR P GC+ ++G + + ASG
Sbjct: 112 --CTYNVTY--GSGWTSVFQGSETFTFGSTPAG--HARVPGIAFGCSTASSGFNASSASG 165
Query: 267 IMGLDRGPVSIISKTNISYFFYCLHSPY---GSTGYITFGKPDTVNKKF-VKYTPIVTTP 322
++GL RG +S++S+ + F YCL +PY ST + G ++N V TP V +P
Sbjct: 166 LVGLGRGRLSLVSQLGVPKFSYCL-TPYQDTNSTSTLLLGPSASLNGTAGVSSTPFVASP 224
Query: 323 EQS---EFYHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITRFPAPVYSAL 373
+ FY++ LTGIS+G L + F+ L+ + IDSGT IT Y +
Sbjct: 225 STAPMNTFYYLNLTGISLGTTALSIPPDAFS-LNADGTGGLIIDSGTTITLLGNTAYQQV 283
Query: 374 RSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV--VVPKITIHFLGGVDLELDVRGTLVV 431
R+A + + D C+ L + + +P +T+HF G D+ L ++
Sbjct: 284 RAAVVSLVTLPTTDGSADTGLDLCFMLPSSTSAPPAMPSMTLHF-NGADMVLPADSYMMS 342
Query: 432 ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ CL +D +LGN QQ+ + YD+ L F P C+
Sbjct: 343 DDSGLWCLAMQNQ-TDGEVNILGNYQQQNMHILYDIGQETLSFAPAKCS 390
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 129/422 (30%), Positives = 191/422 (45%), Gaps = 44/422 (10%)
Query: 89 LRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLL 148
LRRD R H + +R Q A P + + EY + ++IG P +
Sbjct: 46 LRRDMHR-HARFARE-QLAPSSAAAAGLTVGAPTQKDLRNGGEYIMTLSIGTPPLSYRAI 103
Query: 149 LDTGSGITWTQCKPC--------IHCSQQRDPFFDPSKSKTFSKIPCNS--TTCKILLEW 198
DTGS + WTQC PC C +Q ++PS S TF +PCNS + C +
Sbjct: 104 ADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGP 163
Query: 199 FPPNGQDKCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYP-FLLGCTDN 256
PP G C+ C Y+ Y G+G T G + + T + R P GC++
Sbjct: 164 SPPPG---CA---CMYNQTY--GTGWTAGVQSVETFTFGS-SSTPPAVRVPNIAFGCSNA 214
Query: 257 NTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY---GSTGYITFGKPDTVNKKF- 312
++ D NG++G++GL RG +S++S+ F YCL +P+ ST + G K
Sbjct: 215 SSNDWNGSAGLVGLGRGSMSLVSQLGAGAFSYCL-TPFQDANSTSTLLLGPSAAAALKGT 273
Query: 313 --VKYTPIVTTPEQ---SEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTII 362
V+ TP V P + S +Y++ LTGISVG L + F+ + IDSGT I
Sbjct: 274 GPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALAIPPDAFSLRADGTGGLIIDSGTTI 333
Query: 363 TRFPAPVYSALRSAFRKRM-KKYKMGKGIEDL--FDTCYDLSAYK-TVVVPKITIHFLGG 418
T Y +R+A R + + + G + D C+ L A +P +T+HF GG
Sbjct: 334 TTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDLCFALKASTPPPAMPSMTLHFEGG 393
Query: 419 VDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGN 478
D+ L V +++ S CL S ++GN QQ+ V YDV L F P
Sbjct: 394 ADMVLPVENYMILGS-GVWCLAMRNQTVGAMS-MVGNYQQQNIHVLYDVRKETLSFAPAV 451
Query: 479 CN 480
C+
Sbjct: 452 CS 453
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 121/391 (30%), Positives = 175/391 (44%), Gaps = 37/391 (9%)
Query: 121 PAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCK----PCIHCSQQ---RDP 172
P ++G + +Y + +A G P Q V L+ DTGS + W QC P C ++ R P
Sbjct: 42 PMESGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRP 101
Query: 173 FFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE---CPYDIAYVDGSGETGFWA 229
F SKS T S +PC++ C LL P CS C Y Y DGS TGF A
Sbjct: 102 AFVASKSATLSVVPCSAAQC--LLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLA 159
Query: 230 TDRMTIQEVNGNGYFARYPFLLGC-TDNNTGDQNGASGIMGLDRGPVSIISKTNISY--- 285
D TI G R GC T N G +G G++GL +G +S +++ +
Sbjct: 160 RDTATISNGTSGGAAVR-GVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQT 218
Query: 286 FFYCLHSPYG-----STGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE 340
F YCL G S+ ++ G+P+ + YTP+V+ P FY++ + I VG
Sbjct: 219 FSYCLLDLEGGRRGRSSSFLFLGRPE--RRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNR 276
Query: 341 RLPLKASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRK--RMKKYKMGKGIEDL 393
LP+ S + T IDSG+ +T Y L SAF + +
Sbjct: 277 VLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQG 336
Query: 394 FDTCYDLSAYKTVV-----VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDP 448
+ CY++S+ ++ P++TI F G+ LEL LV + CL S
Sbjct: 337 LELCYNVSSSSSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPF 396
Query: 449 NSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+LGN+ Q+GY V +D A R+GF C
Sbjct: 397 AFNVLGNLMQQGYHVEFDRASARIGFARTEC 427
>gi|55296886|dbj|BAD68338.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|55296941|dbj|BAD68392.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
Length = 424
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 133/472 (28%), Positives = 188/472 (39%), Gaps = 88/472 (18%)
Query: 31 HSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGR-YGPCSKLNQGKSRNTPSLEEIL 89
H +V SSL+ P A+P G + L R YGPCS S L ++L
Sbjct: 18 HYIVVETSSLLKPKAICSGLKAMPSSNG--TWVALHRPYGPCSPSPTTTSPPL--LVDML 73
Query: 90 RRDQQRLHLKNSRRLQKAIPD---------------NFKKTKAFTFPAKTGIVAADEYYI 134
R D +LH RR A D ++K +F ++
Sbjct: 74 RWD--KLHTDAIRRKATAGGDVVLEPDKPIVDVQQSDYKMQASFGIGTGGRSGSSSSSSS 131
Query: 135 VV----AIGKPKQYVSLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPCN 188
+ AI P + +DT + W QC PC C Q++ FDP +S+T + +PC
Sbjct: 132 RISRPSAIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCG 191
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
S C L + CS+ +C Y + Y DG +G + D +T+
Sbjct: 192 SAACGELGRY-----GAGCSNNQCQYFVDYGDGRATSGTYMVDALTLN------------ 234
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTV 308
P +++ +++ F C H+ G+ T G
Sbjct: 235 -------------------------PSTVV----MNFRFGCSHAVRGNFSASTSGT---- 261
Query: 309 NKKFVKYTPIVTTPEQ-SEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPA 367
TP+V P Y + L GI VGG RL + F +DS IIT+ P
Sbjct: 262 ---MFARTPLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAG-GAVMDSSVIITQLPP 317
Query: 368 PVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRG 427
Y ALR AFR M Y G DTCYD + +V VP +++ F GG + LD G
Sbjct: 318 TAYRALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMG 377
Query: 428 TLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+V + CL F P D +GNVQQ+ +EV YDV G +GF G C
Sbjct: 378 VMV-----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 424
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 124/411 (30%), Positives = 178/411 (43%), Gaps = 35/411 (8%)
Query: 86 EEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFP-AKTGIVAADEYYIVVAIGKPK-Q 143
E+LRR + +++ R P + + T P + EY I ++IG P+ Q
Sbjct: 49 RELLRR----MVVRSRARAANLCPYSGATARPATAPVGRANTDVNSEYLIHLSIGAPRSQ 104
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
V L LDTGS + WTQC+PC C Q P FD + S T + C+ C +
Sbjct: 105 PVVLTLDTGSDVVWTQCEPCAECFTQPLPRFDTAASNTVRSVACSDPLCNA-------HS 157
Query: 204 QDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-QN 262
+ C C Y Y DGS G + D T + G G GC N G
Sbjct: 158 EHGCFLHGCTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAGRFLQ 217
Query: 263 GASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITF--GKPDTVNKKFVKYTPIVT 320
+GI G RGP+S+ S+ + F YC + + + F G D K PI++
Sbjct: 218 TETGIAGFGRGPLSLPSQLKVRQFSYCFTTRFEAKSSPVFLGGAGDL---KAHATGPILS 274
Query: 321 TP--------EQSEFYHITLTGISVGGERLPL-KASYFTKLSTEIDSGTIITRFPAPVYS 371
TP + Y ++ G++VG RLP+ + +T IDSGT IT FP V+
Sbjct: 275 TPFVRSLPPGTDNSHYVLSFKGVTVGKTRLPVPEIKADGSGATFIDSGTDITTFPDAVFR 334
Query: 372 ALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVV 431
L+SAF + ED D C+ KT +PK+ H L G D +L R V
Sbjct: 335 QLKSAFIAQAALPVNKTADED--DICFSWDGKKTAAMPKLVFH-LEGADWDLP-RENYVT 390
Query: 432 ESVR--QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
E QVC+ + + L+GN QQ+ + YD+A +L P C+
Sbjct: 391 EDRESGQVCVAVS-TSGQMDRTLIGNFQQQNTHIVYDLAAGKLLLVPAQCD 440
>gi|115466078|ref|NP_001056638.1| Os06g0121500 [Oryza sativa Japonica Group]
gi|113594678|dbj|BAF18552.1| Os06g0121500 [Oryza sativa Japonica Group]
Length = 442
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 133/472 (28%), Positives = 188/472 (39%), Gaps = 88/472 (18%)
Query: 31 HSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGR-YGPCSKLNQGKSRNTPSLEEIL 89
H +V SSL+ P A+P G + L R YGPCS S L ++L
Sbjct: 36 HYIVVETSSLLKPKAICSGLKAMPSSNG--TWVALHRPYGPCSPSPTTTSPPL--LVDML 91
Query: 90 RRDQQRLHLKNSRRLQKAIPD---------------NFKKTKAFTFPAKTGIVAADEYYI 134
R D +LH RR A D ++K +F ++
Sbjct: 92 RWD--KLHTDAIRRKATAGGDVVLEPDKPIVDVQQSDYKMQASFGIGTGGRSGSSSSSSS 149
Query: 135 VV----AIGKPKQYVSLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPCN 188
+ AI P + +DT + W QC PC C Q++ FDP +S+T + +PC
Sbjct: 150 RISRPSAIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCG 209
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
S C L + CS+ +C Y + Y DG +G + D +T+
Sbjct: 210 SAACGELGRY-----GAGCSNNQCQYFVDYGDGRATSGTYMVDALTLN------------ 252
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTV 308
P +++ +++ F C H+ G+ T G
Sbjct: 253 -------------------------PSTVV----MNFRFGCSHAVRGNFSASTSGT---- 279
Query: 309 NKKFVKYTPIVTTPEQ-SEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPA 367
TP+V P Y + L GI VGG RL + F +DS IIT+ P
Sbjct: 280 ---MFARTPLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAG-GAVMDSSVIITQLPP 335
Query: 368 PVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRG 427
Y ALR AFR M Y G DTCYD + +V VP +++ F GG + LD G
Sbjct: 336 TAYRALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMG 395
Query: 428 TLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+V + CL F P D +GNVQQ+ +EV YDV G +GF G C
Sbjct: 396 VMV-----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 442
>gi|55296937|dbj|BAD68388.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|218197467|gb|EEC79894.1| hypothetical protein OsI_21421 [Oryza sativa Indica Group]
Length = 424
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 133/472 (28%), Positives = 188/472 (39%), Gaps = 88/472 (18%)
Query: 31 HSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGR-YGPCSKLNQGKSRNTPSLEEIL 89
H +V SSL+ P A+P G + L R YGPCS S L ++L
Sbjct: 18 HYIVVETSSLLKPKAICSGLKAMPSSNG--TWVALHRPYGPCSPSPTTTSPPL--LVDML 73
Query: 90 RRDQQRLHLKNSRRLQKAIPD---------------NFKKTKAFTFPAKTGIVAADEYYI 134
R D +LH RR A D ++K +F ++
Sbjct: 74 RWD--KLHTDAIRRKATAGGDVVLEPDKPIVDVQQSDYKMQASFGIGTGGRSGSSSSSSS 131
Query: 135 VV----AIGKPKQYVSLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPCN 188
+ AI P + +DT + W QC PC C Q++ FDP +S+T + +PC
Sbjct: 132 RISRPSAIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCG 191
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
S C L + CS+ +C Y + Y DG +G + D +T+
Sbjct: 192 SAACGELGRY-----GAGCSNNQCQYFVDYGDGRATSGTYMVDALTLN------------ 234
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTV 308
P +++ +++ F C H+ G+ T G
Sbjct: 235 -------------------------PSTVV----MNFRFGCSHAVRGNFSASTSGT---- 261
Query: 309 NKKFVKYTPIVTTPEQ-SEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPA 367
TP+V P Y + L GI VGG RL + F +DS IIT+ P
Sbjct: 262 ---MFARTPLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAG-GAVMDSSVIITQLPP 317
Query: 368 PVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRG 427
Y ALR AFR M Y G DTCYD + +V VP +++ F GG + LD G
Sbjct: 318 TAYRALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMG 377
Query: 428 TLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+V + CL F P D +GNVQQ+ +EV YDV G +GF G C
Sbjct: 378 VMV-----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 424
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 128/419 (30%), Positives = 195/419 (46%), Gaps = 46/419 (10%)
Query: 87 EILRRDQQRLHLKN-----SRRLQKAIPDNFKKTKAFT----------FPAKTGIVAADE 131
+++ RD + N S+R++ AI +F + FT P E
Sbjct: 34 DLIHRDSPKSPFYNPAETPSQRIRNAIHRSFNRVSHFTDLSEMDASLNSPQTDITPCGGE 93
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + +++G P + + DTGS + WTQCKPC C Q DP FDP S T+ + C+S+
Sbjct: 94 YLMNLSLGTPPSPIMAVADTGSNLIWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSSQ 153
Query: 192 CKILLEWFPPNGQDKCSS--KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C L Q CS+ K C Y ++Y DGS G +A D +T+ + N
Sbjct: 154 CTAL------ENQASCSTEDKTCSYLVSYADGSYTMGKFAVDTLTLGSTD-NRPVQLKNI 206
Query: 250 LLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKP 305
++GC NN +N +SG++GL G VS+I + S F YCL T I FG
Sbjct: 207 IIGCGQNNAVTFRNKSSGVVGLGGGAVSLIKQLGDSIDGKFSYCLVPENDQTSKINFGTN 266
Query: 306 DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRF 365
V+ TP+V + FY++TL ISVG + + S K + IDSGT +T
Sbjct: 267 AVVSGPGTVSTPLVVK-SRDTFYYLTLKSISVGSKNMQTPDSNI-KGNMVIDSGTTLTLL 324
Query: 366 PAPVYSALRSAFRK-----RMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVD 420
P Y + +A + K ++G + CY+ +A + +P IT+HF G D
Sbjct: 325 PVKYYIEIENAVASLINADKSKDERIGSSL------CYNATA--DLNIPVITMHF-EGAD 375
Query: 421 LELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++L + + VCL F + S + + GNV Q+ + V YD A + + F P +C
Sbjct: 376 VKLYPYNSFFKVTEDLVCLAFGM--SFYRNGIYGNVAQKNFLVGYDTASKTMSFKPTDC 432
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 114/373 (30%), Positives = 179/373 (47%), Gaps = 25/373 (6%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
+ + EY++ V +G P ++ SL+LDTGS + W QC PC C QQ F+DP S ++ I
Sbjct: 150 LGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKASASYKNIT 209
Query: 187 CNSTTCKILLEWFPPNGQDKCSS--KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF 244
CN C ++ PP+ C S + CPY Y D S TG +A + T+ G
Sbjct: 210 CNDPRCNLVS---PPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSS 266
Query: 245 ARY---PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYG 295
Y + GC N G +GA+G++GL RGP+S S+ Y F YCL +S
Sbjct: 267 ELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTN 326
Query: 296 STGYITFGK-PDTVNKKFVKYTPIVTTPEQ--SEFYHITLTGISVGGERLPLKASYFTKL 352
+ + FG+ D ++ + +T V E FY++ + I V GE L + +
Sbjct: 327 VSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNIS 386
Query: 353 S-----TEIDSGTIITRFPAPVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYDLSAYKTV 406
S T IDSGT ++ F P Y +++ ++ K KY + + + D C+++S ++
Sbjct: 387 SDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFP-ILDPCFNVSGIDSI 445
Query: 407 VVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYD 466
+P++ I F G + + + VCL P SI +GN QQ+ + + YD
Sbjct: 446 QLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAILGTPKSAFSI-IGNYQQQNFHILYD 504
Query: 467 VAGRRLGFGPGNC 479
RLG+ P C
Sbjct: 505 TKRSRLGYAPTKC 517
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 97/277 (35%), Positives = 137/277 (49%), Gaps = 17/277 (6%)
Query: 212 CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLD 271
C Y I Y DGS G +++ G F+ GC NN G G SG+MGL
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKF------GTILVKDFIFGCGRNNKGLFGGVSGLMGLG 186
Query: 272 RGPVSIISKTNISY---FFYCLHS-PYGSTGYITFGKPDTV--NKKFVKYTPIVTTPEQS 325
R +S+IS+T+ + F YCL S +G + G +V N + Y ++ P+
Sbjct: 187 RSDLSLISQTSGIFGGVFSYCLPSTERKGSGSLILGGNSSVYRNSSPISYAKMIENPQLY 246
Query: 326 EFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYK 385
FY I LTGIS+GG + L+A +DSGT+ITR P +Y AL++ F K+ +
Sbjct: 247 NFYFINLTGISIGG--VALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFP 304
Query: 386 MGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGT--LVVESVRQVCLGFAL 443
+ DTC++LSAY+ V +P I +HF G +L +DV G V QVCL A
Sbjct: 305 PAPAF-SILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALAS 363
Query: 444 LPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L +LGN QQ+ V YD ++GF C+
Sbjct: 364 LEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 400
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 118/423 (27%), Positives = 192/423 (45%), Gaps = 42/423 (9%)
Query: 83 PSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD-EYYIVVAIGKP 141
PS + L D +RLH + RR K IP F K+ P +G + +Y++ + IG+P
Sbjct: 43 PSPTQALALDTRRLHFLSLRR--KPIP--FVKS-----PVVSGAASGSGQYFVDLRIGQP 93
Query: 142 KQYVSLLLDTGSGITWTQCKPCIHCSQQRDP-FFDPSKSKTFSKIPCNSTTCKILLEWFP 200
Q + L+ DTGS + W +C C +CS F P S TFS C C+++ +
Sbjct: 94 PQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPK--- 150
Query: 201 PNGQDKCSSKE----CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDN 256
P+ C+ C Y+ Y DGS +G +A + +++ +G + GC
Sbjct: 151 PDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKS-VAFGCGFR 209
Query: 257 NTGDQ------NGASGIMGLDRGPVSIISKTNISY---FFYCLH----SPYGSTGYITFG 303
+G NGA+G+MGL RGP+S S+ + F YCL SP ++ I
Sbjct: 210 ISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGN 269
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDS 358
D ++K F +TP++T P FY++ L + V G +L + S + T +DS
Sbjct: 270 GGDGISKLF--FTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDS 327
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYK--TVVVPKITIHFL 416
GT + P Y ++ +A R+R+ K + + FD C ++S ++P++ F
Sbjct: 328 GTTLAFLAEPAYRSVIAAVRRRV-KLPIADALTPGFDLCVNVSGVTKPEKILPRLKFEFS 386
Query: 417 GGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
GG R + + CL + ++GN+ Q+G+ +D RLGF
Sbjct: 387 GGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSR 446
Query: 477 GNC 479
C
Sbjct: 447 RGC 449
>gi|21668075|gb|AAM74221.1|AF518565_1 putative chloroplast nucleoid DNA-binding protein [Brassica
oleracea]
Length = 165
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 70/166 (42%), Positives = 103/166 (62%), Gaps = 1/166 (0%)
Query: 315 YTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALR 374
+TPI T + + FY + + GISVGG++L + + F+ IDSGT+I+R P Y+ALR
Sbjct: 1 FTPISTITDGTSFYGLDIVGISVGGQKLAIPQTVFSTPGALIDSGTVISRLPPKAYAALR 60
Query: 375 SAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESV 434
AF+ +M +YK + + DTC+DL+ +KTV +P ++ +F GG +EL +G L +
Sbjct: 61 GAFKAKMSQYKNTSAVS-ILDTCFDLTGFKTVTIPTVSFYFNGGAVVELGSKGVLYAFKM 119
Query: 435 RQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
QVCL FA D N+ + GNVQQ+ EV YD A R+GF P C+
Sbjct: 120 SQVCLAFAGNSDDNNAAIFGNVQQQTLEVVYDGAAGRVGFAPNGCS 165
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 115/372 (30%), Positives = 187/372 (50%), Gaps = 23/372 (6%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
+ + EY+I V +G P ++ SL+LDTGS + W QC PC C +Q P +DP +S ++ I
Sbjct: 176 LGSGEYFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHYDPGQSSSYRNIG 235
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNG--YF 244
C+ + C ++ PP K ++ CPY Y D S TG +A + T+ +G
Sbjct: 236 CHDSRCHLVSSPDPPQ-PCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPEL 294
Query: 245 ARYP-FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGST 297
R + GC N G +GA+G++GL RGP+S S+ Y F YCL +S +
Sbjct: 295 RRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDANVS 354
Query: 298 GYITFGK-PDTVNKKFVKYTPIVTTPEQ--SEFYHITLTGISVGGERLPLKASYFTKLS- 353
+ FG+ D ++ + +T +V E FY++ + I VGGE + + + +
Sbjct: 355 SKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATD 414
Query: 354 ----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVP 409
T IDSGT ++ F P Y ++ AF ++K Y + K + + CY+++ + +P
Sbjct: 415 GSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFP-VLEPCYNVTGVEQPDLP 473
Query: 410 KITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSI-LLGNVQQRGYEVHYDV 467
I F G V + +E VCL A+L + P+++ ++GN QQ+ + + YD
Sbjct: 474 DFGIVFSDGAVWNFPVENYFIEIEPREVVCL--AILGTPPSALSIIGNYQQQNFHILYDT 531
Query: 468 AGRRLGFGPGNC 479
RLGF P C
Sbjct: 532 KKSRLGFAPTKC 543
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 114/371 (30%), Positives = 183/371 (49%), Gaps = 22/371 (5%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
+ + EY++ V +G P ++ SL+LDTGS + W QC PCI C +Q P++DP S +F I
Sbjct: 192 LGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNIS 251
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
C+ C+++ PP K ++ CPY Y DGS TG +A + T+ NG
Sbjct: 252 CHDPRCQLVSAPDPPKPC-KAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSEL 310
Query: 247 YP---FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGST 297
+ GC N G +GA+G++GL +GP+S S+ Y F YCL +S +
Sbjct: 311 KHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVS 370
Query: 298 GYITFGK-PDTVNKKFVKYTPIVTTPEQS--EFYHITLTGISVGGERLPLKASYFTKLST 354
+ FG+ + ++ + +T + S FY++ + + V E L + + LS+
Sbjct: 371 SKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETW-HLSS 429
Query: 355 E------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVV 408
E IDSGT +T F P Y ++ AF +++K Y++ +G+ L CY++S + + +
Sbjct: 430 EGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPL-KPCYNVSGIEKMEL 488
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVA 468
P I F V + VCL P SI +GN QQ+ + + YD+
Sbjct: 489 PDFGILFADEAVWNFPVENYFIWIDPEVVCLAILGNPRSALSI-IGNYQQQNFHILYDMK 547
Query: 469 GRRLGFGPGNC 479
RLG+ P C
Sbjct: 548 KSRLGYAPMKC 558
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 114/371 (30%), Positives = 177/371 (47%), Gaps = 21/371 (5%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
+ + EY++ V +G P ++ SL+LDTGS + W QC PC C QQ F+DP S ++ I
Sbjct: 165 LGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKASASYKNIT 224
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
CN C ++ PP K ++ CPY Y D S TG +A + T+ G
Sbjct: 225 CNDQRCNLVSSPDPP-MPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSEL 283
Query: 247 Y---PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGST 297
Y + GC N G +GA+G++GL RGP+S S+ Y F YCL +S +
Sbjct: 284 YNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVS 343
Query: 298 GYITFGK-PDTVNKKFVKYTPIVTTPEQ--SEFYHITLTGISVGGERLPLKASYFTKLS- 353
+ FG+ D ++ + +T V E FY++ + I V GE L + + S
Sbjct: 344 SKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSD 403
Query: 354 ----TEIDSGTIITRFPAPVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYDLSAYKTVVV 408
T IDSGT ++ F P Y +++ ++ K KY + + + D C+++S V +
Sbjct: 404 GAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFP-ILDPCFNVSGIHNVQL 462
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVA 468
P++ I F G + + + VCL P SI +GN QQ+ + + YD
Sbjct: 463 PELGIAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSI-IGNYQQQNFHILYDTK 521
Query: 469 GRRLGFGPGNC 479
RLG+ P C
Sbjct: 522 RSRLGYAPTKC 532
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 116/385 (30%), Positives = 177/385 (45%), Gaps = 51/385 (13%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCK------PCIHCS-----QQRDPFFDPSKSK 180
Y ++ ++G P Q VSL+LDTGS + WT C C +C+ + P + +KS
Sbjct: 74 YSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSS 133
Query: 181 TFSKIPCNSTTCKILLEWFPPNGQDKCSSKECP-YDIAYVDGSGETGFWATDRMTIQEVN 239
T +PC S C W + + ++K CP Y + Y GS TG +D + + ++N
Sbjct: 134 TVQSLPCRSPKC----NWVFGSDLNCSTTKRCPYYGLEYGLGS-TTGQLVSDVLGLSKLN 188
Query: 240 GNGYFARYP-FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHS------ 292
R P FL GC+ GI G RG SI ++ ++ F YCL S
Sbjct: 189 ------RIPDFLFGCS---LVSNRQPEGIAGFGRGLASIPAQLGLTKFSYCLVSHRFDDT 239
Query: 293 PYGSTGYITFGKPDT-VNKKFVKYTPIVTTPE---QSEFYHITLTGISVGGERLPLKASY 348
P + G+ V Y P +P SE+Y+I+L+ I VGG+ +P+ Y
Sbjct: 240 PQSGDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRY 299
Query: 349 FTKLSTE------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--FDTCYDL 400
S E +DSG+ T ++ + K M KYK K IED CY++
Sbjct: 300 LVP-SKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNI 358
Query: 401 SAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNS-----ILLGN 455
+ V VPK+T F GG +++L + + + VC+ P +P S I+LGN
Sbjct: 359 TGQSEVDVPKLTFSFKGGANMDLPLTDYFSLVTDGVVCMTVLTDPDEPGSTTGPAIILGN 418
Query: 456 VQQRGYEVHYDVAGRRLGFGPGNCN 480
QQ+ + + YD+ +R GF P C+
Sbjct: 419 YQQQNFYIEYDLKKQRFGFKPQQCD 443
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 107/371 (28%), Positives = 167/371 (45%), Gaps = 38/371 (10%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + +AIG P L DTGS +TWTQC+PC C Q P +DPS S TFS +PC+S
Sbjct: 76 EYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSA 135
Query: 191 TCKILLEWFPPNGQDKCS--SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
TC +L CS S C Y +Y DG+ G T+ +T+ +
Sbjct: 136 TCLPVLR------SRNCSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSD 189
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST-------GYIT 301
GC +N GD ++G +GL RG +S++++ + F YCL + ST G +
Sbjct: 190 VAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLTDFFNSTLDSPFLLGTLA 249
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----I 356
P V+ TP++ +P Y ++L GI++G RLP+ F + +
Sbjct: 250 ELAP---GPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLHANSTGGMVV 306
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKG-----IEDLFDTCYDLSAYKTVV--VP 409
DSGT + P S FR + G L C+ A + + +P
Sbjct: 307 DSGTTFSILP-------ESGFRVVVDHVAQVLGQPPVNASSLDSPCFPAPAGERQLPFMP 359
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
+ +HF GG D+ L R + + ++ + +LGN QQ+ ++ +D+
Sbjct: 360 DLVLHFAGGADMRLH-RDNYMSYNQEDSSFCLNIVGTTSTWSMLGNFQQQNIQMLFDMTV 418
Query: 470 RRLGFGPGNCN 480
+L F P +C+
Sbjct: 419 GQLSFLPTDCS 429
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 111/365 (30%), Positives = 176/365 (48%), Gaps = 38/365 (10%)
Query: 126 IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKI 185
I++ Y +G P Q + + +D + W C C C+ P F P++S T+ +
Sbjct: 77 ILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAAS-SPSFSPTQSSTYRTV 135
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
PC S C + P G C +++ Y + + D + ++ N
Sbjct: 136 PCGSPQCAQVPSPSCPAGVGS----SCGFNLTYAASTFQ-AVLGQDSLALE----NNVVV 186
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITF 302
Y F GC +G+ G++G RGP+S +S+T +Y F YCL + Y S+ +
Sbjct: 187 SYTF--GCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPN-YRSSNFSGT 243
Query: 303 GKPDTVNK-KFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYFTKLS---TEI 356
K + + K +K TP++ P + Y++ + GI VG + ++P A F ++ T I
Sbjct: 244 LKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTII 303
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMK---KYKMGKGIEDLFDTCYDLSAYKTVVVPKITI 413
D+GT+ TR APVY+A+R AFR R++ +G FDTCY++ TV VP +T
Sbjct: 304 DAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGG-----FDTCYNV----TVSVPTVTF 354
Query: 414 HFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDP-NSIL--LGNVQQRGYEVHYDVAG 469
F G V + L ++ S V CL A PSD N+ L L ++QQ+ V +DVA
Sbjct: 355 MFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVAN 414
Query: 470 RRLGF 474
R+GF
Sbjct: 415 GRVGF 419
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 111/365 (30%), Positives = 176/365 (48%), Gaps = 38/365 (10%)
Query: 126 IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKI 185
I++ Y +G P Q + + +D + W C C C+ P F P++S T+ +
Sbjct: 96 ILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAAS-SPSFSPTQSSTYRTV 154
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
PC S C + P G C +++ Y + + D + ++ N
Sbjct: 155 PCGSPQCAQVPSPSCPAGVGS----SCGFNLTYAASTFQ-AVLGQDSLALE----NNVVV 205
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITF 302
Y F GC +G+ G++G RGP+S +S+T +Y F YCL + Y S+ +
Sbjct: 206 SYTF--GCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPN-YRSSNFSGT 262
Query: 303 GKPDTVNK-KFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYFTKLS---TEI 356
K + + K +K TP++ P + Y++ + GI VG + ++P A F ++ T I
Sbjct: 263 LKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTII 322
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMK---KYKMGKGIEDLFDTCYDLSAYKTVVVPKITI 413
D+GT+ TR APVY+A+R AFR R++ +G FDTCY++ TV VP +T
Sbjct: 323 DAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGG-----FDTCYNV----TVSVPTVTF 373
Query: 414 HFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDP-NSIL--LGNVQQRGYEVHYDVAG 469
F G V + L ++ S V CL A PSD N+ L L ++QQ+ V +DVA
Sbjct: 374 MFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVAN 433
Query: 470 RRLGF 474
R+GF
Sbjct: 434 GRVGF 438
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 118/423 (27%), Positives = 192/423 (45%), Gaps = 42/423 (9%)
Query: 83 PSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD-EYYIVVAIGKP 141
PS + L D +RLH + RR K +P F K+ P +G + +Y++ + IG+P
Sbjct: 42 PSPTQALALDTRRLHFLSLRR--KPVP--FVKS-----PVVSGASSGSGQYFVDLRIGQP 92
Query: 142 KQYVSLLLDTGSGITWTQCKPCIHCSQQRDP-FFDPSKSKTFSKIPCNSTTCKILLEWFP 200
Q + L+ DTGS + W +C C +CS F P S TFS C C+++ +
Sbjct: 93 PQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPK--- 149
Query: 201 PNGQDKCSSKE----CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDN 256
P +C+ CPY+ Y DGS +G +A + +++ +G + GC
Sbjct: 150 PGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKS-VAFGCGFR 208
Query: 257 NTGDQ------NGASGIMGLDRGPVSIISKTNISY---FFYCLH----SPYGSTGYITFG 303
+G NGA+G+MGL RGP+S S+ + F YCL SP ++ I
Sbjct: 209 ISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGD 268
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDS 358
D V+K F +TP++T P FY++ L + V G +L + S + T +DS
Sbjct: 269 GGDAVSKLF--FTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVMDS 326
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYK--TVVVPKITIHFL 416
GT + P Y + +A ++R+K + + FD C ++S ++P++ F
Sbjct: 327 GTTLAFLADPAYRLVIAAVKQRIKLPNADE-LTPGFDLCVNVSGVTKPEKILPRLKFEFS 385
Query: 417 GGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
GG R + + CL + ++GN+ Q+G+ +D RLGF
Sbjct: 386 GGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSR 445
Query: 477 GNC 479
C
Sbjct: 446 RGC 448
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 107/346 (30%), Positives = 147/346 (42%), Gaps = 26/346 (7%)
Query: 149 LDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCS 208
+DTGS + WTQC PC+ C+ Q P+FD KS T+ +PC S+ C L C
Sbjct: 1 MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASL-------SSPSCF 53
Query: 209 SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIM 268
K C Y Y D + G A + T N A GC N GD +SG++
Sbjct: 54 KKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATN-IAFGCGSLNAGDLANSSGMV 112
Query: 269 GLDRGPVSIISKTNISYFFYCLHSPYGSTG-------YITFGKPDTVNKKFVKYTPIVTT 321
G RGP+S++S+ S F YCL S +T Y +T + V+ TP V
Sbjct: 113 GFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVIN 172
Query: 322 PEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFPAPVYSALRSA 376
P Y ++L IS+G + LP+ F IDSGT IT Y A+R
Sbjct: 173 PALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRG 232
Query: 377 FRKRMKKYKMGKGIEDLFDTCYDL--SAYKTVVVPKITIHFLGGVDLELDVRGTLVVESV 434
+ M L DTC+ TV VP + HF L L+ +
Sbjct: 233 LVSAIPLPAMNDTDIGL-DTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLIASTT 291
Query: 435 RQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+CL A P+ +I +GN QQ+ + YD+ L F P C+
Sbjct: 292 GYLCLVMA--PTGVGTI-IGNYQQQNLHLLYDIGNSFLSFVPAPCD 334
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 184/371 (49%), Gaps = 21/371 (5%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
+ + EY++ V +G P ++ SL+LDTGS + W QC PC C +Q P++DP S +F I
Sbjct: 190 LGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKDSSSFKNIT 249
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNG---Y 243
C+ C+++ PP K ++ CPY Y D S TG +A + T+ G
Sbjct: 250 CHDPRCQLVSSPDPPQ-PCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPEL 308
Query: 244 FARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGST 297
+ GC N G +GA+G++GL RGP+S ++ Y F YCL +S +
Sbjct: 309 KIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSLYGHSFSYCLVDRNSNSSVS 368
Query: 298 GYITFGK-PDTVNKKFVKYTPIVTTPEQ--SEFYHITLTGISVGGE--RLPLKASYFTKL 352
+ FG+ + ++ + +T V E FY++ + I VGGE ++P + + +
Sbjct: 369 SKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHLSAQ 428
Query: 353 ---STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVP 409
T IDSGT +T F P Y ++ AF +++K + + + L CY++S + + +P
Sbjct: 429 GGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPL-KPCYNVSGVEKMELP 487
Query: 410 KITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVA 468
+ I F G + V + +E VCL P SI +GN QQ+ + + YD+
Sbjct: 488 EFAILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSALSI-IGNYQQQNFHILYDLK 546
Query: 469 GRRLGFGPGNC 479
RLG+ P C
Sbjct: 547 KSRLGYAPMKC 557
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 124/417 (29%), Positives = 184/417 (44%), Gaps = 49/417 (11%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQY 144
L L+RD +R ++ A P+N TG + EY + +G P +
Sbjct: 86 LARRLQRDMRRAAWIITKAATPADPENGTVV--------TGAPTSGEYIAKITVGTPYEN 137
Query: 145 VS-----LLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWF 199
S L D GS +TW QC PC C Q P ++ KS + S + C + C+ L
Sbjct: 138 DSSFEALLSPDMGSDVTWLQCMPCFRCYHQPGPVYNRLKSSSASDVGCYAPACRAL---- 193
Query: 200 PPNGQDKCSS--KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDN 256
C EC Y + Y DGS G + + +T R P +GC +
Sbjct: 194 --GSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTFPP------GVRVPGVAIGCGSD 245
Query: 257 NTGDQNG-ASGIMGLDRGPVSIISKTNISY---FFYCL--HSPYGSTGYITFGKPDTV-- 308
N G A+GI+GL RG +S S+ Y F YCL G + +TFG +
Sbjct: 246 NQGLFPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAGQGTGGRSSTLTFGSGASATT 305
Query: 309 -NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-------IDSGT 360
+TP++T FY++ L GISVGG R+ +L +DSGT
Sbjct: 306 TTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTESDLRLDPSTGHGGVIVDSGT 365
Query: 361 IITRFPAPVYSALRSAFRKRMKK---YKMGKGIEDLFDTCYDLSAYKTV-VVPKITIHFL 416
+TR P Y+A R AFR K + G FDTCY + + VP +++HF
Sbjct: 366 AVTRLSGPAYAAFRDAFRVAAVKELGWPSPGGPFAFFDTCYSSVRGRVMKKVPAVSMHFA 425
Query: 417 GGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSI-LLGNVQQRGYEVHYDVAGRRL 472
GGV+++L + L+ + + FA S + ++GN+Q +G+ V YDV G+R+
Sbjct: 426 GGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRGVSIIGNIQLQGFRVVYDVDGQRV 482
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 126/418 (30%), Positives = 190/418 (45%), Gaps = 49/418 (11%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQY 144
+ L RD +H N+R+L + D + P V E+ + +AIG P
Sbjct: 47 VRAALHRD---MHRHNARKLAASSSDG-----TVSAPVSPTTVPG-EFLMTLAIGTPPLP 97
Query: 145 VSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
+ DTGS + WTQC PC C QQ P ++PS S TFS +PCNS+ L P
Sbjct: 98 FLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCNSS-----LGLCAP-- 150
Query: 204 QDKCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTG-DQ 261
C+ C Y++ Y GSG T F T+ T GC++ ++G +
Sbjct: 151 --ACA---CMYNMTY--GSGWTYVFQGTETFTFGSSTPADQVRVPGIAFGCSNASSGFNA 203
Query: 262 NGASGIMGLDRGPVSIISKTNISYFFYCLHSPY---GSTGYITFGKPDTVNKK-FVKYTP 317
+ ASG++GL RG +S++S+ F YCL +PY ST + G ++N V TP
Sbjct: 204 SSASGLVGLGRGSLSLVSQLGAPKFSYCL-TPYQDTNSTSTLLLGPSASLNDTGVVSSTP 262
Query: 318 IVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRFPAPVYSA 372
V +P S +Y++ LTGIS+G LP+ + F+ + IDSGT IT Y
Sbjct: 263 FVASPS-SIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLIIDSGTTITMLGNTAYQQ 321
Query: 373 LRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV--VVPKITIHFLGGVDLELDVRGTLV 430
+R+A + D C++L + + +P +T+HF G D+ L ++
Sbjct: 322 VRAAVLSLVTLPTTDGSAATGLDLCFELPSSTSAPPSMPSMTLHF-DGADMVLPADNYMM 380
Query: 431 VESVRQV-----CLGFALLPSDPNSI---LLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
S CL +D + + +LGN QQ+ + YDV L F P C+
Sbjct: 381 SLSDPDSDSSLWCLAMQNQ-TDTDGVVVSILGNYQQQNMHILYDVGKETLSFAPAKCS 437
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 140/419 (33%), Positives = 200/419 (47%), Gaps = 42/419 (10%)
Query: 87 EILRRDQQRLHL-----------KNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD-EYYI 134
E++ RD R L N+ R ++FKK T A++ +VA+ EY +
Sbjct: 34 EMIHRDSSRSPLYRPTETPFQRVANAVRRSINRGNHFKKAFVSTDSAESTVVASQGEYLM 93
Query: 135 VVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
++G P V ++DTGS I W QC+PC C +Q P FDPSKSKT+ +PC+S TC+
Sbjct: 94 RYSVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTPIFDPSKSKTYKTLPCSSNTCES 153
Query: 195 LLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLG 252
L CSS C Y I Y DGS G + + +T+ +G+ +P ++G
Sbjct: 154 LR-------NTACSSDNVCEYSIDYGDGSHSDGDLSVETLTLGSTDGSS--VHFPKTVIG 204
Query: 253 CTDNNTGD-QNGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGSTGYITFGKP 305
C NN G Q SGI+GL GPVS+IS+ + S F YCL S S+ + FG
Sbjct: 205 CGHNNGGTFQEEGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFGDA 264
Query: 306 DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGT 360
V+ + TP+ Q FY +TL SVG R+ S + + IDSGT
Sbjct: 265 AVVSGRGTVSTPLDPLNGQV-FYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSGT 323
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVD 420
+T P Y L SA + K + + L CY ++ + +P IT HF G D
Sbjct: 324 TLTLLPQEDYLNLESAVSDVI-KLERARDPSKLLSLCYKTTS-DELDLPVITAHF-KGAD 380
Query: 421 LELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+EL+ T V VC FA + S +I GN+ Q+ V YD+ + + F P +C
Sbjct: 381 VELNPISTFVPVEKGVVC--FAFISSKIGAI-FGNLAQQNLLVGYDLVKKTVSFKPTDC 436
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 100/364 (27%), Positives = 166/364 (45%), Gaps = 28/364 (7%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
+ + V +G P Q ++LD GS + WTQC ++Q +P FD ++S +FS +PC+S
Sbjct: 107 HSLTVGVGTPPQPSKVILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKL 166
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
C+ C+ ++C Y+ Y + TG AT+ T G +
Sbjct: 167 CEA-----GTFTNKTCTDRKCAYENDYGIMTA-TGVLATETFTF----GAHHGVSANLTF 216
Query: 252 GCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL-------HSPYGSTGYITFGK 304
GC G ASGI+GL GP+S++ + I+ F YCL SP GK
Sbjct: 217 GCGKLANGTIAEASGILGLSPGPLSMLKQLAITKFSYCLTPFADRKTSPVMFGAMADLGK 276
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSG 359
T K V+ P++ P + +Y++ + G+SVG +RL + T +DS
Sbjct: 277 YKTTGK--VQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSA 334
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS---AYKTVVVPKITIHFL 416
T + P ++ L+ A + +K + ++D + C++L + + V VP + +HF
Sbjct: 335 TTLAYLVEPAFTELKKAVMEGIKLPVANRSVDD-YPVCFELPRGMSMEGVQVPPLVLHFD 393
Query: 417 GGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
G ++ L S +CL P + ++GNVQQ+ V YDV R+ + P
Sbjct: 394 GDAEMSLPRDNYFQEPSPGMMCLAVMQAPFEGAPNVIGNVQQQNMHVLYDVGNRKFSYAP 453
Query: 477 GNCN 480
C+
Sbjct: 454 TKCD 457
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 126/423 (29%), Positives = 192/423 (45%), Gaps = 51/423 (12%)
Query: 87 EILRRDQQRLHLKNS-----RRLQKAIPDNFKKTKAFT-------FPAKTGIVAADEYYI 134
+++ RD + NS +R++ AI + + T F+ P EY +
Sbjct: 29 DLIHRDSPKSPFYNSAETSSQRMRNAIRRSARSTLQFSNDDASPNSPQSFITSNRGEYLM 88
Query: 135 VVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
++IG P + + DTGS + WTQC PC C QQ P FDP +S T+ K+ C+S+ C+
Sbjct: 89 NISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSSQCRA 148
Query: 195 LLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP---- 248
L + CS+ E C Y I Y D S G A D +T+ G R P
Sbjct: 149 LEDA-------SCSTDENTCSYTITYGDNSYTKGDVAVDTVTM------GSSGRRPVSLR 195
Query: 249 -FLLGCTDNNTGDQNGASGIMGLDRGP----VSIISKTNISYFFYCL---HSPYGSTGYI 300
++GC NTG + A + G VS + K+ F YCL S G T I
Sbjct: 196 NMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKI 255
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF--TKLSTEIDS 358
FG V+ V T +V + + +Y + L ISVG +++ ++ F + + IDS
Sbjct: 256 NFGTNGIVSGDGVVSTSMV-KKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNIVIDS 314
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCY-DLSAYKTVVVPKITIHFLG 417
GT +T P+ Y L S +K ++ + + + CY D S++K VP IT+HF G
Sbjct: 315 GTTLTLLPSNFYYELESVVASTIKAERV-QDPDGILSLCYRDSSSFK---VPDITVHFKG 370
Query: 418 GVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
G D++L T V S C FA ++ + GN+ Q + V YD + F
Sbjct: 371 G-DVKLGNLNTFVAVSEDVSCFAFA---ANEQLTIFGNLAQMNFLVGYDTVSGTVSFKKT 426
Query: 478 NCN 480
+C+
Sbjct: 427 DCS 429
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 110/366 (30%), Positives = 167/366 (45%), Gaps = 28/366 (7%)
Query: 125 GIVAADEYYIV-VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFS 183
GI ++V + +G P Q ++ D + TW QC+PCI C Q D FDPS+S +++
Sbjct: 179 GITTGTSNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFDPSQSSSYT 238
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVDGSGETGFWATDRMTIQEVNGNG 242
+ C + C +L PN CS C Y+I Y DG+ G + ++ + +G
Sbjct: 239 LLSCETKHCNLL-----PNS--SCSDDGYCRYNITYKDGTNTEGVLINETVSFES---SG 288
Query: 243 YFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL-HSPYG-STGYI 300
+ R LGC++ N G G+ G GL RG +S S+ N S YCL S G S+ +
Sbjct: 289 WVDRVS--LGCSNKNQGPFVGSDGTFGLGRGSLSFPSRINASSMSYCLVESKDGYSSSTL 346
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTE 355
F P K ++ P+ Y++ L GI VGGE++ + S FT
Sbjct: 347 EFNSPPCSGSVKAK---LLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMI 403
Query: 356 IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHF 415
+ S ++IT Y+ +R AF + + + K FDTCY+LS+ TV +P +
Sbjct: 404 VSSSSLITMLENDTYNVVRDAFVAKTQHLERLKAFLQ-FDTCYNLSSNNTVELPILEFEV 462
Query: 416 LGGVDLELDVRGTL-VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
G L L V+ C FA PS + +LG +QQ G V +D+ +
Sbjct: 463 NDGKSWLLPKESYLYAVDKNGTFCFAFA--PSKGSFSILGTLQQYGTRVTFDLVNSFVYL 520
Query: 475 GPGNCN 480
CN
Sbjct: 521 HTLCCN 526
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 132/419 (31%), Positives = 188/419 (44%), Gaps = 41/419 (9%)
Query: 87 EILRRDQQRLHLKN-----SRRLQKAIPDNFKKTKAFT----------FPAKTGIVAADE 131
+++ RD + N S+RL+ AI + + FT P + E
Sbjct: 34 DLIHRDSPKSPFYNPTETSSQRLRNAIHRSVSRVFHFTDISQKDASDNAPQIDLTSNSGE 93
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + +++G P + + DTGS + WTQCKPC C Q DP FDP S T+ + C+S+
Sbjct: 94 YLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSSQ 153
Query: 192 CKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C L Q CS+++ C Y +Y D S G A D +T+ + +
Sbjct: 154 CTAL------ENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKN-I 206
Query: 250 LLGCTDNNTGDQN-GASGIMGLDRGPVSIISKTNISY---FFYC---LHSPYGSTGYITF 302
++GC NN G N SGI+GL G VS+I++ S F YC L S T I F
Sbjct: 207 IIGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINF 266
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL--PLKASYFTKLSTEIDSGT 360
G V+ V TP++ Q FY++TL ISVG + + P S + + IDSGT
Sbjct: 267 GTNAVVSGTGVVSTPLI-AKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGNIIIDSGT 325
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVD 420
+T P YS L A + K + + CY SA + VP IT+HF G D
Sbjct: 326 TLTLLPTEFYSELEDAVASSIDAEKK-QDPQTGLSLCY--SATGDLKVPAITMHF-DGAD 381
Query: 421 LELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ L V S VC F P+ + GNV Q + V YD + + F P +C
Sbjct: 382 VNLKPSNCFVQISEDLVCFAFR---GSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDC 437
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 116/371 (31%), Positives = 171/371 (46%), Gaps = 34/371 (9%)
Query: 138 IGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLE 197
IG P + V LL+DT S +TW Q C +CS + P F+P S +F PC S+ C L
Sbjct: 5 IGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVC---LG 61
Query: 198 WFPPNGQDKC--SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTD 255
Q C S+ C + +AY+DGS G A + ++Q +G + GC
Sbjct: 62 RSKLGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAAS-TLGDVIFGCAS 120
Query: 256 NNTGDQ-NGASGIMGLDRGPVSI------ISKTNIS-YFFYCL---HSPYGSTGYITFGK 304
+ + +SG +GL+RG S SK+ +S F YC S+G I FG
Sbjct: 121 KDLQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVIIFGD 180
Query: 305 PDTVNKKFVKYTPIVTTPEQS---EFYHITLTGISVGGERLPLKASYFT-----KLSTEI 356
F +Y + P + +FY++ L GISVGGE L + S F T
Sbjct: 181 SGIPAHHF-QYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGTYF 239
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSA--YKTVVVPKITIH 414
DSGT ++ P ++AL AF +R+ G + + CYD++A + P +T+H
Sbjct: 240 DSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDARLPTAPLVTLH 299
Query: 415 FLGGVDLELDVRGTLV----VESVRQVCLGF--ALLPSDPNSILLGNVQQRGYEVHYDVA 468
F VD+EL V V +CL F A + ++GN QQ+ Y + +D+
Sbjct: 300 FKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHDLE 359
Query: 469 GRRLGFGPGNC 479
R+GF P NC
Sbjct: 360 RSRIGFAPANC 370
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 105/374 (28%), Positives = 169/374 (45%), Gaps = 33/374 (8%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
+ + +Y++ ++G P+Q L++DTGS + + QC PC C +Q P + PS S TF+ +P
Sbjct: 29 LGSGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQPSNSSTFTPVP 88
Query: 187 CNSTTCKILLEWFPPNGQDKCSSK--------ECPYDIAYVDGSGETGFWATDRMTIQEV 238
C+S C ++ P CSS C Y+ Y D S G +A + T+ +
Sbjct: 89 CDSAECLLI----PAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETATVGGI 144
Query: 239 NGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLH---S 292
N GC + N G A G++GL +G +S S+ ++ F YCL S
Sbjct: 145 RVNH------VAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAYCLTSYLS 198
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL 352
P + FG +++TP+V+ P Y++ + I GGE L + S +
Sbjct: 199 PTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKID 258
Query: 353 S-----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV 407
S T DSGT +T + Y+ + +AF K + + + L C ++S +
Sbjct: 259 SVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGL-PLCVNVSGIDHPI 317
Query: 408 VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSI-LLGNVQQRGYEVHYD 466
P TI F G + + S CL A+L S + ++GN+ Q+ Y V YD
Sbjct: 318 YPSFTIEFDQGATYRPNQGNYFIEVSPNIDCL--AMLESSSDGFNVIGNIIQQNYLVQYD 375
Query: 467 VAGRRLGFGPGNCN 480
R+GF NC+
Sbjct: 376 REEHRIGFAHANCD 389
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 121/391 (30%), Positives = 174/391 (44%), Gaps = 37/391 (9%)
Query: 121 PAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCK----PCIHCSQQ---RDP 172
P ++G + +Y + +A G P Q V L+ DTGS + W QC P C ++ R P
Sbjct: 41 PMESGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRP 100
Query: 173 FFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE---CPYDIAYVDGSGETGFWA 229
F SKS T S +PC++ C LL P CS C Y Y DGS TGF A
Sbjct: 101 AFVASKSATLSVVPCSAAQC--LLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLA 158
Query: 230 TDRMTIQEVNGNGYFARYPFLLGC-TDNNTGDQNGASGIMGLDRGPVSIISKTNISY--- 285
D TI G R GC T N G +G G++GL +G +S +++ +
Sbjct: 159 RDTATISNGTSGGAAVR-GVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQT 217
Query: 286 FFYCLHSPYG-----STGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE 340
F YCL G S+ ++ G+P+ + YTP+V+ P FY++ + I VG
Sbjct: 218 FSYCLLDLEGGRRGRSSSFLFLGRPE--RRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNR 275
Query: 341 RLPLKASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRK--RMKKYKMGKGIEDL 393
LP+ S + T IDSG+ +T Y L SAF + +
Sbjct: 276 VLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQG 335
Query: 394 FDTCYDLSAYKTVV-----VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDP 448
+ CY++S+ + P++TI F G+ LEL LV + CL S
Sbjct: 336 LELCYNVSSSSSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPF 395
Query: 449 NSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+LGN+ Q+GY V +D A R+GF C
Sbjct: 396 AFNVLGNLMQQGYHVEFDRASARIGFARTEC 426
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 114/358 (31%), Positives = 175/358 (48%), Gaps = 18/358 (5%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKI 185
V Y + +G P + +++DTGS +TW QC PC+ C +Q P F+P S +++ +
Sbjct: 124 VGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSV 183
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C++ C L N +S C Y +Y D S G+ + D ++ G +
Sbjct: 184 SCSAQQCSDLTTATL-NPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF------GSTS 236
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITF 302
F GC +N G ++G++GL R +S++ + S F YCL P S+ +
Sbjct: 237 VPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCL--PTSSSSSSGY 294
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTII 362
+ N YTP+ ++ Y I +TGI V G+ L + +S ++ L T IDSGT+I
Sbjct: 295 LSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVI 354
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
TR P VYSAL A MK + DTC+ A + + VP++T+ F GG L+
Sbjct: 355 TRLPTGVYSALSKAVAGAMKGTPRASAFS-ILDTCFQGQAAR-LRVPEVTMAFAGGAALK 412
Query: 423 LDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L R LV CL FA S + ++GN QQ+ + V YDV ++GF G C+
Sbjct: 413 LAARNLLVDVDSATTCLAFAPARS---AAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 149/300 (49%), Gaps = 22/300 (7%)
Query: 55 QGPGKVSLEVLGR-YGPCSKLN-QGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNF 112
Q G + LE+ R Y K+N K N +L+++ R Q RL+K + +
Sbjct: 72 QEKGAIMLEMKDRSYCSKKKVNWHRKLHNQLTLDDLHVRSMQN-------RLRKMVSSHS 124
Query: 113 KKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP 172
+ P +G+ YIV + Q +++++DTGS +TW QC+PC+ C Q+ P
Sbjct: 125 VEVSQIQIPLASGVNFQTLNYIV-TMELGGQDMTVIIDTGSDLTWVQCEPCMSCYNQQGP 183
Query: 173 FFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDR 232
F PS S ++ IPCNS+TC+ L G + + C Y + Y DGS G +
Sbjct: 184 VFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEH 243
Query: 233 MTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYC 289
++ G + F+ GC NN G G SG+MGL R +S+IS+TN ++ F YC
Sbjct: 244 LSF------GGISVSNFVFGCGKNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYC 297
Query: 290 L-HSPYGSTGYITFGKPDTVNKKF--VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKA 346
L + G++G + G +V K + YT +V P+ S FY + LTGI VG L+A
Sbjct: 298 LPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVGVWLFKLQA 357
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 120/414 (28%), Positives = 176/414 (42%), Gaps = 60/414 (14%)
Query: 86 EEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYV 145
E +RRD R+ S + +F + G+ Y + +++G P
Sbjct: 44 SEAVRRDSHRIAFL-SDATAAGKATTTNSSVSFQALLENGV---GGYNMNISVGTPLLTF 99
Query: 146 SLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
S++ DTGS + WTQC PC C QQ P F P+ S TFSK+PC S+ C+ L PN
Sbjct: 100 SVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQFL-----PNSIR 154
Query: 206 KCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGA 264
C++ C Y+ Y GSG T G+ AT+ + + G+ F F GC+ N
Sbjct: 155 TCNATGCVYNYKY--GSGYTAGYLATETLKV----GDASFPSVAF--GCSTEN------- 199
Query: 265 SGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGY-ITFGKPDTVNKKFVKYTPIVTTPE 323
G+ LD G + F YCL S + I FG + V+ TP V P
Sbjct: 200 -GLGQLDLG---------VGRFSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPA 249
Query: 324 -QSEFYHITLTGISVGGERLPLKASYF------TKLSTEIDSGTIITRFPAPVYSALRSA 376
+Y++ LTGI+VG LP+ S F T +DSGT +T Y ++ A
Sbjct: 250 VHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQA 309
Query: 377 FRKRMKKYKMGKGIEDLFDTCYDLS--AYKTVVVPKITIHFLGGVD---------LELDV 425
F + G L D C+ + + VP + + F GG + +E D
Sbjct: 310 FLSQTADVTTVNGTRGL-DLCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDS 368
Query: 426 RGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+G++ V CL D ++GNV Q + YD+ G F P +C
Sbjct: 369 QGSVTV-----ACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 417
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 122/379 (32%), Positives = 169/379 (44%), Gaps = 44/379 (11%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
V EY + +AIG P Q V L LDTGS + WTQC+PC C Q P+FDPS S T S
Sbjct: 30 VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTS 89
Query: 187 CNSTTCKIL------LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNG 240
C+ST C+ L F PN + C Y +Y D S TGF D+ T V
Sbjct: 90 CDSTLCQGLPVASCGSPKFWPN-------QTCVYTYSYGDKSVTTGFLEVDKFTF--VGA 140
Query: 241 NGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYI 300
F G +N N +GI G RGP+S+ S+ + F +C + I
Sbjct: 141 GASVPGVAFGCGLFNNGVFKSN-ETGIAGFGRGPLSLPSQLKVGNFSHCFTT-------I 192
Query: 301 TFGKPDTV-----------NKKFVKYTPIVTTPEQSE---FYHITLTGISVGGERLPLKA 346
T P TV + V+ TP++ + Y+++L GI+VG RLP+
Sbjct: 193 TGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPE 252
Query: 347 SYFTKLS----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSA 402
S F + T IDSGT IT P VY +R F ++ K + G TC+ +
Sbjct: 253 SAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI-KLPVVPGNATGHYTCFSAPS 311
Query: 403 YKTVVVPKITIHFLGG-VDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGY 461
VPK+ +HF G +DL + V + + A+ D +I +GN QQ+
Sbjct: 312 QAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTI-IGNFQQQNM 370
Query: 462 EVHYDVAGRRLGFGPGNCN 480
V YD+ L F C+
Sbjct: 371 HVLYDLQNNMLSFVAAQCD 389
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 130/453 (28%), Positives = 210/453 (46%), Gaps = 52/453 (11%)
Query: 47 NRTRTALPQGPGKV--SLEVLGRYGPCSKLNQGKSRNTPS----LEEILRRDQQRLHLKN 100
+ +R++ P P +L+V +GPCS L G + PS L + RD RL +
Sbjct: 29 SHSRSSCPATPPDAGNTLQVSHAFGPCSPLGPGTA--APSWAGFLADQASRDASRLLYLD 86
Query: 101 SRRLQKAIPDNFKKTKAFTFPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWT 158
S ++ + +A+ P +G ++ Y + ++G P Q + L +DT + +W
Sbjct: 87 SLAVRG-------RARAYA-PIASGRQLLQTPTYVVRASLGTPPQQLLLAVDTSNDASWI 138
Query: 159 QCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAY 218
C C C FDP+ S ++ +PC S C PN K C + + Y
Sbjct: 139 PCAGCAGCPTSSAAPFDPASSASYRTVPCGSPLCAQA-----PNAACPPGGKACGFSLTY 193
Query: 219 VDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSII 278
D S + + D + V GN A + GC TG G++GL RGP+S +
Sbjct: 194 ADSSLQAAL-SQDSL---AVAGNAVKA---YTFGCLQRATGTAAPPQGLLGLGRGPLSFL 246
Query: 279 SKTNISY---FFYCLHS--PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLT 333
S+T Y F YCL S +G + G+ + +K TP++ P +S Y++ +T
Sbjct: 247 SQTKDMYEATFSYCLPSFKSLNFSGTLRLGR--NGQPQRIKTTPLLANPHRSSLYYVNMT 304
Query: 334 GISVGGERLPLKA-SYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED 392
GI VG + +P+ A T T +DSGT+ TR AP Y A+R R+R +G +
Sbjct: 305 GIRVGRKVVPIPAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRR-----VGAPVSS 359
Query: 393 L--FDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPN 449
L FDTC++ +A V P +T+ F G+ + L ++ + + CL A P N
Sbjct: 360 LGGFDTCFNTTA---VAWPPVTLLF-DGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVN 415
Query: 450 SIL--LGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
++L + ++QQ+ + V +DV R+GF C
Sbjct: 416 TVLNVIASMQQQNHRVLFDVPNGRVGFARERCT 448
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 118/371 (31%), Positives = 169/371 (45%), Gaps = 74/371 (19%)
Query: 128 AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPC 187
+A Y + ++IG P S+L DTGS + WTQC PC C+ + P F P+ S TFSK+PC
Sbjct: 86 SAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPC 145
Query: 188 NSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFAR 246
S+ C+ L + C++ C Y Y G G T G+ AT+ + + G F
Sbjct: 146 ASSLCQFLTSPY-----RTCNATGCVYYYPY--GMGFTAGYLATETLHV----GGASFPG 194
Query: 247 YPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY-GSTGYITFGKP 305
F GC+ N G N +SGI+GL R P+S++S+ ++ F YCL S I FG
Sbjct: 195 VTF--GCSTEN-GVGNSSSGIVGLGRSPLSLVSQVGVARFSYCLRSNADAGDSPILFGSL 251
Query: 306 DTVNKKFVKYTPIVTTPEQ--SEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIIT 363
V V+ TP++ PE S +Y++ LTGI+VG LP+ + T ++ T
Sbjct: 252 AKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPMAMANLTTVNG--------T 303
Query: 364 RFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYD---LSAYKTVVVPKITIHFLGGVD 420
RF FD C+D V VP + + F GG +
Sbjct: 304 RFG---------------------------FDLCFDATAAGGGGGVPVPTLVLRFAGGAE 336
Query: 421 -----------LELDVRGTLVVESVRQVCLGFALLPSDPNSI-LLGNVQQRGYEVHYDVA 468
+E+D +G VE CL L S+ SI ++GNV Q V YD+
Sbjct: 337 YAVRRRSYFGVVEVDSQGRAAVE-----CL-LVLPASEKLSISIIGNVMQMDLHVLYDLD 390
Query: 469 GRRLGFGPGNC 479
G F P +C
Sbjct: 391 GGMFSFAPADC 401
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 114/358 (31%), Positives = 175/358 (48%), Gaps = 18/358 (5%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKI 185
V Y + +G P + +++DTGS +TW QC PC+ C +Q P F+P S +++ +
Sbjct: 122 VGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASV 181
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C++ C L N +S C Y +Y D S G+ + D ++ G +
Sbjct: 182 SCSAQQCSDLTTATL-NPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF------GSTS 234
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITF 302
F GC +N G ++G++GL R +S++ + S F YCL P S+ +
Sbjct: 235 VPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCL--PTSSSSSSGY 292
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTII 362
+ N YTP+ ++ Y I +TGI V G+ L + +S ++ L T IDSGT+I
Sbjct: 293 LSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVI 352
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
TR P VYSAL A MK + DTC+ A + + VP++T+ F GG L+
Sbjct: 353 TRLPTGVYSALSKAVAGAMKGTPRASAFS-ILDTCFQGQAAR-LRVPEVTMAFAGGAALK 410
Query: 423 LDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L R LV CL FA S + ++GN QQ+ + V YDV ++GF G C+
Sbjct: 411 LAARNLLVDVDSATTCLAFAPARS---AAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 465
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 129/430 (30%), Positives = 189/430 (43%), Gaps = 48/430 (11%)
Query: 87 EILRRDQ--QRLHLKN---SRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKP 141
+++ RD LH N S RLQ + + + + EY + ++IG P
Sbjct: 30 DLIHRDSPLSPLHTPNLTFSDRLQASFLRAISRQSRHVDFQTDLLPSGGEYMMNLSIGTP 89
Query: 142 KQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPP 201
+ + DTGS +TW Q KPC C Q+ P FDPS S TF K+PC + C L E
Sbjct: 90 PFPILAIADTGSDLTWLQSKPCDQCYPQKGPIFDPSNSTTFHKLPCTTAPCNALDE---- 145
Query: 202 NGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD- 260
+ + C Y +Y D S TG+ A+D +T+ GN GC N G+
Sbjct: 146 SARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTV----GNASVQIRNVAFGCGTRNGGNF 201
Query: 261 ---QNGASGIMGLDRGPVSIISKTNISYFFYCL----------HSPYGSTGYITFGKPDT 307
+G G+ G + VS + T F YCL S +T I FG
Sbjct: 202 DEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQPSDSPATSRIVFGDNPV 261
Query: 308 VNKKFVKYTPIVTTP----EQSEFYHITLTGISVGGERL------PLKASY--FTKLSTE 355
+ TTP E S +Y++T+ I+VG ++L ASY +K S E
Sbjct: 262 FSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKTASYDSGSKSSVE 321
Query: 356 -----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPK 410
IDSGT +T Y AL +A + +K ++ +F C+ S + V +P
Sbjct: 322 EGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVKNSMFSLCFK-SGKEEVELPL 380
Query: 411 ITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR 470
+ +HF GG D+EL T V VC F +LP++ I GN+ Q + V YD+ R
Sbjct: 381 MKVHFRGGADVELKPVNTFVRAEEGLVC--FTMLPTNDVGI-YGNLAQMNFVVGYDLGKR 437
Query: 471 RLGFGPGNCN 480
+ F P +C+
Sbjct: 438 TVSFLPADCS 447
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 113/365 (30%), Positives = 164/365 (44%), Gaps = 26/365 (7%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + +AIG P L DTGS +TWTQC+PC C Q P +D + S +FS +PC S
Sbjct: 92 EYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSPVPCASA 151
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
TC L W N SS C Y AY DG+ G T+ +T G +
Sbjct: 152 TC--LPIWSSRNC--TASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPG---VSVGGIA 204
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLH--------SPYGSTGYITF 302
GC +N G ++G +GL RG +S++++ + F YCL SP
Sbjct: 205 FGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLGSPVLFGALAEL 264
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTII 362
P T V+ TP+V +P +Y+++L GIS+G RLP+ F L + G I+
Sbjct: 265 AAPST--GAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTF-DLRDDGSGGMIV 321
Query: 363 ---TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDT-CYDLSA--YKTVVVPKITIHFL 416
T F V SA R + D+ C+ + + +P + +HF
Sbjct: 322 DSGTTFTFLVESAFRVVVDHVAGVLRQPVVNASSLDSPCFPAATGEQQLPAMPDMVLHFA 381
Query: 417 GGVDLELDVRGTLVV-ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
GG D+ L + + CL A PS SI LGN QQ+ ++ +D+ +L F
Sbjct: 382 GGADMRLHRDNYMSFNQEESSFCLNIAGSPSADVSI-LGNFQQQNIQMLFDITVGQLSFM 440
Query: 476 PGNCN 480
P +C
Sbjct: 441 PTDCG 445
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 117/360 (32%), Positives = 177/360 (49%), Gaps = 22/360 (6%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKI 185
V Y + +G P + +++DTGS +TW QC PC+ C +Q P F+P S +++ +
Sbjct: 124 VGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSV 183
Query: 186 PCNSTTCKILL-EWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGY 243
C++ C L P CS S C Y +Y D S G+ + D ++ G
Sbjct: 184 SCSAQQCSDLTTATLSPA---SCSTSNVCIYQASYGDSSFSVGYLSKDTVSF------GS 234
Query: 244 FARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYI 300
+ F GC +N G ++G++GL R +S++ + S F YCL P S+
Sbjct: 235 TSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCL--PTSSSSSS 292
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGT 360
+ + N YTP+ ++ Y I +TGI V G+ L + +S ++ L T IDSGT
Sbjct: 293 GYLSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGT 352
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVD 420
+ITR P VYSAL A MK + DTC+ A + + VP++T+ F GG
Sbjct: 353 VITRLPTGVYSALSKAVAGAMKGTPRASAFS-ILDTCFQGQAAR-LRVPEVTMAFAGGAA 410
Query: 421 LELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L+L R LV CL FA P+ +I +GN QQ+ + V YDV ++GF G C+
Sbjct: 411 LKLAARNLLVDVDSATTCLAFA--PARSAAI-IGNTQQQTFSVVYDVKNSKIGFAAGGCS 467
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 120/365 (32%), Positives = 180/365 (49%), Gaps = 36/365 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
E+ + + IG P V + DTGS +TWTQC PC C Q P F+P +S ++ K+ C S
Sbjct: 89 EFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASD 148
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
TC+ L + G D + C Y +Y D S G A+D++TI G F +
Sbjct: 149 TCRSLESYH--CGPDL---QSCSYGYSYGDRSFTYGDLASDQITI------GSFKLPKTV 197
Query: 251 LGCTDNNTGDQNGAS-GIMGLDRGPVSIISKTNI-----SYFFYCLHSPYGS---TGYIT 301
+GC N G G + GI+GL G +S++S+ F YCL + + + TG I+
Sbjct: 198 IGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTIS 257
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE----ID 357
FG+ V+ + V TP+V + FY +TL ISVG +R A+ + ++ ID
Sbjct: 258 FGRKAVVSGRQVVSTPLVPRSPDT-FYFLTLEAISVGKKRFK-AANGISAMTNHGNIIID 315
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED---LFDTCYDLSAYKTVVVPKITIH 414
SGT +T P +Y + S + +K K ++D + + CY + +P IT H
Sbjct: 316 SGTTLTLLPRSLYYGVFSTLARVIK----AKRVDDPSGILELCYSAGQVDDLNIPIITAH 371
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
F GG D++L T + CL FA P+ +I GN+ Q +EV YD+ +RL F
Sbjct: 372 FAGGADVKLLPVNTFAPVADNVTCLTFA--PATQVAI-FGNLAQINFEVGYDLGNKRLSF 428
Query: 475 GPGNC 479
P C
Sbjct: 429 EPKLC 433
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 115/361 (31%), Positives = 165/361 (45%), Gaps = 26/361 (7%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
Y + V+IG P + + DTGS +TWT C PC C +QR+P FDP KS ++ I C+S
Sbjct: 24 HYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDSK 83
Query: 191 TCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C L CS K C Y AY + G A + +T+ G +
Sbjct: 84 LCHKLDTGV-------CSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLK-GI 135
Query: 250 LLGCTDNNTGDQNGAS-GIMGLDRGPVSIISKTNISY----FFYCL---HSPYGSTGYIT 301
+ GC NNTG N GI+GL GPVS IS+ S+ F CL H+ + ++
Sbjct: 136 VFGCGHNNTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSSKMS 195
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKAS---YFTKLSTEIDS 358
GK V+ K V TP+V +++ ++ +TL GISVG L S K + +DS
Sbjct: 196 LGKGSEVSGKGVVSTPLVAKQDKTPYF-VTLLGISVGNTYLHFNGSSSQSVEKGNVFLDS 254
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGG 418
GT T P +Y L + R + + ++ CY + P +T HF GG
Sbjct: 255 GTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCY--RTKNNLRGPVLTAHFEGG 312
Query: 419 VDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGN 478
D++L T V CLGF SD + GN Q Y + +D+ + + F P +
Sbjct: 313 -DVKLLPTQTFVSPKDGVFCLGFTNTSSDGG--VYGNFAQSNYLIGFDLDRQVVSFKPMD 369
Query: 479 C 479
C
Sbjct: 370 C 370
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 128/452 (28%), Positives = 196/452 (43%), Gaps = 52/452 (11%)
Query: 54 PQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFK 113
P + LE++ R+ G +++ ++RD+ R N R + D+ +
Sbjct: 27 PVAVNSMRLELVHRHHERFAGGGGDVDRVEAVKGFVKRDKLRRQRMNQRWGVVSNYDSRR 86
Query: 114 KT-KAFTFPAKTGI-------VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH 165
K + T PA+ + A EY+ V +G P Q L++DTGS TW C
Sbjct: 87 KGFEMTTTPAEVEMPMHSGRDDALGEYFAEVKVGSPGQRFWLVVDTGSEFTWLNC----- 141
Query: 166 CSQQRDPFFDPSKSKTFSKIPCNSTTCKI-LLEWFPPNGQDKCSSKECPYDIAYVDGSGE 224
SK+F + C S CK+ L E F + K S C YDI+Y DGS
Sbjct: 142 -------------SKSFEAVTCASRKCKVDLSELFSLSVCPK-PSDPCLYDISYADGSSA 187
Query: 225 TGFWATDRMTIQEVNG-NGYFARYPFLLGCTD---NNTGDQNGASGIMGLDRGPVSIISK 280
GF+ TD +T+ NG G +GCT N GI+GL S I K
Sbjct: 188 KGFFGTDSITVGLTNGKQGKLNN--LTIGCTKSMLNGVNFNEETGGILGLGFAKDSFIDK 245
Query: 281 TNISY---FFYCL--HSPYGS-TGYITFGKPDTVNKKF---VKYTPIVTTPEQSEFYHIT 331
Y F YCL H + S + +T G N K ++ T ++ P FY +
Sbjct: 246 AANKYGAKFSYCLVDHLSHRSVSSNLTIGGHH--NAKLLGEIRRTELILFPP---FYGVN 300
Query: 332 LTGISVGGERL---PLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGK 388
+ GIS+GG+ L P + + T IDSGT +T P Y A+ A K + K K
Sbjct: 301 VVGISIGGQMLKIPPQVWDFNAEGGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVT 360
Query: 389 GIE-DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSD 447
G + D + C+D + VVP++ HF GG E V+ ++ + C+G +
Sbjct: 361 GEDFDALEFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGIVPIDGI 420
Query: 448 PNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ ++GN+ Q+ + +D++ +GF P C
Sbjct: 421 GGASVIGNIMQQNHLWEFDLSTNTVGFAPSTC 452
>gi|242092874|ref|XP_002436927.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
gi|241915150|gb|EER88294.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
Length = 484
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 118/438 (26%), Positives = 186/438 (42%), Gaps = 36/438 (8%)
Query: 61 SLEVLGRYGPCSKLNQGKSRN--TPSLEEILRRDQQRLHL----KNSRRLQKAIPDNFKK 114
+L V+ R PCS L + + PS+ +IL RD R N A
Sbjct: 64 TLPVVHRLSPCSPLGAARIQQLEKPSVADILHRDALRFRSLFRDHNHGSAAPAPTSPGAD 123
Query: 115 TKAFTFPAKTGIV----AADEYYIVVAIGKPKQYVSLLLDTGS-GITWTQCKPCIHCSQQ 169
+ P++ + A EY++ G P Q ++ DT + G T QCKPC +
Sbjct: 124 GGGLSIPSRGDPIQELPGAFEYHVTAGFGTPVQQFTVGFDTTTTGATQLQCKPCA-ADEP 182
Query: 170 RDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWA 229
FDPS S + + +PC S C CS C ++ + +
Sbjct: 183 CHHAFDPSASSSIAHVPCGSPDCPF---------NKGCSGHSCTLSVSINNTLLGNATFF 233
Query: 230 TDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNIS----- 284
TD++T+ N F C + + ++GI+ L R S+ S+ S
Sbjct: 234 TDKLTLTPWN-----IVDDFRFVCLEAGFRPDDDSTGILDLSRNSHSLASRAAPSSPDAV 288
Query: 285 YFFYCLHSPYGSTGYITFG--KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL 342
F YCL S G+++ G KP+ + +K V YTP+ + Y + L G+ +GG L
Sbjct: 289 AFSYCLPSYPSDVGFLSLGATKPELLGRK-VSYTPLRSNRHNGNLYVVELVGLGLGGVDL 347
Query: 343 PLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSA 402
P+ + T ++ T T VY+ALR FRK M +Y + L DTCY+ +A
Sbjct: 348 PVPRAAIAGGGTILELHTTFTYLKPKVYAALRDEFRKSMSQYPVAPPQGSL-DTCYNFTA 406
Query: 403 YKTVVVPKITIHFLGGVDLELDVRGTLVV-ESVRQVCLGFALLPSDPNSILLGNVQQRGY 461
+ VP +T+ F GG + +L + + E +G + ++G++ Q
Sbjct: 407 LSSYSVPAVTLKFDGGAEFDLWIDEMMYFPEPGSYFSVGCLAFVAQDGGAVIGSMAQMST 466
Query: 462 EVHYDVAGRRLGFGPGNC 479
EV YDV G ++GF P C
Sbjct: 467 EVVYDVRGGKVGFVPYRC 484
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 129/453 (28%), Positives = 210/453 (46%), Gaps = 52/453 (11%)
Query: 47 NRTRTALPQGPGKV--SLEVLGRYGPCSKLNQGKSRNTPS----LEEILRRDQQRLHLKN 100
+ +R++ P P +L+V +GPCS L G + PS L + RD RL +
Sbjct: 29 SHSRSSCPATPPDAGNTLQVSHAFGPCSPLGPGTA--APSWAGFLADQASRDASRLLYLD 86
Query: 101 SRRLQKAIPDNFKKTKAFTFPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWT 158
S ++ + +A+ P +G ++ Y + ++G P Q + L +DT + +W
Sbjct: 87 SLAVRG-------RARAYA-PIASGRQLLQTLTYVVRASLGTPPQQLLLAVDTSNDASWI 138
Query: 159 QCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAY 218
C C C FDP+ S ++ +PC S C PN K C + + Y
Sbjct: 139 PCAGCAGCPTSSAAPFDPAASASYRTVPCGSPLCAQA-----PNAACPPGGKACGFSLTY 193
Query: 219 VDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSII 278
D S + + D + V GN A + GC TG G++GL RGP+S +
Sbjct: 194 ADSSLQAAL-SQDSL---AVAGNAVKA---YTFGCLQRATGTAAPPQGLLGLGRGPLSFL 246
Query: 279 SKTNISY---FFYCLHS--PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLT 333
S+T Y F YCL S +G + G+ + +K TP++ P +S Y++ +T
Sbjct: 247 SQTKDMYEATFSYCLPSFKSLNFSGTLRLGR--NGQPQRIKTTPLLANPHRSSLYYVNMT 304
Query: 334 GISVGGERLPLKA-SYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED 392
G+ VG + +P+ A T T +DSGT+ TR AP Y A+R R+R +G +
Sbjct: 305 GVRVGRKVVPIPAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRR-----VGAPVSS 359
Query: 393 L--FDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPN 449
L FDTC++ +A V P +T+ F G+ + L ++ + + CL A P N
Sbjct: 360 LGGFDTCFNTTA---VAWPPMTLLF-DGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVN 415
Query: 450 SIL--LGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
++L + ++QQ+ + V +DV R+GF C
Sbjct: 416 TVLNVIASMQQQNHRVLFDVPNGRVGFARERCT 448
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 119/406 (29%), Positives = 190/406 (46%), Gaps = 24/406 (5%)
Query: 94 QRLHLKNSRRLQKAIPDNFKKTKAFT----FPAKTGI-VAADEYYIVVAIGKPKQYVSLL 148
+ +H + +R +P + +A + ++G+ V + EY I V +G P + ++
Sbjct: 106 ETMHRRAARSGVARMPASSSPRRALSERMVATVESGVAVGSGEYLIDVYVGTPPRRFRMI 165
Query: 149 LDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCS 208
+DTGS + W QC PC+ C +QR P FDP+ S ++ + C C ++ P + +
Sbjct: 166 MDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDQRCGLVAPPEAPRACRRPA 225
Query: 209 SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIM 268
CPY Y D S TG A + T+ + GC N G +GA+G++
Sbjct: 226 EDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLL 285
Query: 269 GLDRGPVSIISKTNISY---FFYCLHSPYGSTGY-ITFGKPDTV-NKKFVKYTPIVTTPE 323
GL RGP+S S+ Y F YCL G + FG+ V +KYT T
Sbjct: 286 GLGRGPLSFASQLRAVYGHTFSYCLVEHGSDAGSKVVFGEDYLVLAHPQLKYTAFAPTSS 345
Query: 324 QSE-FYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAF 377
++ FY++ L G+ VGG+ L + + + T IDSGT ++ F P Y +R AF
Sbjct: 346 PADTFYYVKLKGVLVGGDLLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAF 405
Query: 378 RKRMKK-YKMGKGIED--LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VES 433
M + Y + I D + + CY++S + VP++++ F G + V ++
Sbjct: 406 VDLMSRLYPL---IPDFPVLNPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFVRLDP 462
Query: 434 VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+CL P SI +GN QQ+ + V YD+ RLGF P C
Sbjct: 463 DGIMCLAVRGTPRTGMSI-IGNFQQQNFHVVYDLQNNRLGFAPRRC 507
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 120/405 (29%), Positives = 183/405 (45%), Gaps = 32/405 (7%)
Query: 87 EILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIV-VAIGKPKQYV 145
++R++ H+ RRL + T ++ I A +Y++ ++IG P +
Sbjct: 32 NLIRKNSSHAHVLPLRRLMEL------SAMEKTLTPQSPIYAYLGHYLMELSIGTPPFKI 85
Query: 146 SLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
+ DTGS +TWT C PC +C +QR+P FDP KS T+ I C+S C L
Sbjct: 86 YGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISCDSKLCHKL-------DTG 138
Query: 206 KCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGA 264
CS K C Y AY + G A + +T+ G + + GC NNTG N
Sbjct: 139 VCSPQKRCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLK-GIVFGCGHNNTGGFNDH 197
Query: 265 S-GIMGLDRGPVSIISKTNISY----FFYCL---HSPYGSTGYITFGKPDTVNKKFVKYT 316
GI+GL GPVS+IS+ S+ F CL H+ + ++FGK V+ K V T
Sbjct: 198 EMGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSSKMSFGKGSKVSGKGVVST 257
Query: 317 PIVTTPEQSEFYHITLTGISVGGERLPLKASY--FTKLSTEIDSGTIITRFPAPVYSALR 374
P+V +++ ++ +TL GISV L S K + +DSGT T P +Y +
Sbjct: 258 PLVAKQDKTPYF-VTLLGISVENTYLHFNGSSQNVEKGNMFLDSGTPPTILPTQLYDQVV 316
Query: 375 SAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESV 434
+ R + + + CY + P +T HF G D++L T +
Sbjct: 317 AQVRSEVAMKPVTDDPDLGPQLCY--RTKNNLRGPVLTAHF-EGADVKLSPTQTFISPKD 373
Query: 435 RQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CLGF SD + GN Q Y + +D+ + + F P +C
Sbjct: 374 GVFCLGFTNTSSDGG--VYGNFAQSNYLIGFDLDRQVVSFKPKDC 416
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 126/382 (32%), Positives = 170/382 (44%), Gaps = 34/382 (8%)
Query: 117 AFTFPAKT--GIVAA----DEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQR 170
F+FP IV + D Y I IG P + ++DT + W QC PC C
Sbjct: 68 VFSFPPNKVPNIVVSPFMGDGYIISFLIGTPPFQLYGVMDTANDNIWFQCNPCKPCFNTT 127
Query: 171 DPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS---KECPYDIAYVDGSGETGF 227
P FDPSKS T+ IPC+S CK + CSS K C Y Y + G
Sbjct: 128 SPMFDPSKSSTYKTIPCSSPKCKNV-------ENTHCSSDDKKVCEYSFTYGGEAYSQGD 180
Query: 228 WATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNG-ASGIMGLDRGPVSIISKTNISY- 285
+ D +T+ N + + ++GC N G G SG +GL RGP+S IS+ N S
Sbjct: 181 LSIDTLTLNS-NNDTPISFKNIVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSIG 239
Query: 286 --FFYC---LHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE 340
F YC L S G +G + FG V+ TPI T E Y TL +SVG
Sbjct: 240 GKFSYCLVPLFSNEGISGKLHFGDKSVVSGVGTVSTPI-TAGEIG--YSTTLNALSVGDH 296
Query: 341 RLPLKASYFTKL---STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTC 397
+ + S +T IDSGT +T P VYS L S M K + K F C
Sbjct: 297 IIKFENSTSKNDNLGNTIIDSGTTLTILPENVYSRLESIVTS-MVKLERAKSPNQQFKLC 355
Query: 398 YDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQ 457
Y + K + VP IT HF G D+ L+ T VC F + + P +I +GN+
Sbjct: 356 YK-ATLKNLDVPIITAHF-NGADVHLNSLNTFYPIDHEVVCFAFVSVGNFPGTI-IGNIA 412
Query: 458 QRGYEVHYDVAGRRLGFGPGNC 479
Q+ + V +D+ + F P +C
Sbjct: 413 QQNFLVGFDLQKNIISFKPTDC 434
>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
Length = 408
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 110/363 (30%), Positives = 163/363 (44%), Gaps = 45/363 (12%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
Y + +G P Q + L LDT + TW+ C PC C F P+ S +++ +PC S
Sbjct: 78 SYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASD 135
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C + P + + D+ + + T R + G+ AR P
Sbjct: 136 WCPLFRRPAVPGEPGRVGAAA---DVRLLQAASRT-----PRSGVLAATRCGW-ARTPS- 185
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS--PYGSTGYITFGKP 305
GP+S++S+T Y F YCL S Y +G + G
Sbjct: 186 -----------------PATRSGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAA 228
Query: 306 DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-----TKLSTEIDSGT 360
+ V+YTP++T P + Y++ +TG+SVG + A F T T IDSGT
Sbjct: 229 G--QPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRALVKAPAGSFAFDPSTGAGTVIDSGT 286
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVD 420
+ITR+ APVY+ALR FR+++ G FDTC++ P +T+H GGVD
Sbjct: 287 VITRWTAPVYAALRDEFRRQVAA-PSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMGGGVD 345
Query: 421 LELDVRGTLVVESVRQV-CLGFALLP--SDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
L L + TL+ S + CL A P + ++ N+QQ+ V DVAG R+GF
Sbjct: 346 LTLPMENTLIHSSATPLACLAMAEAPQNVNSVVNVVANLQQQNVRVVVDVAGSRVGFARE 405
Query: 478 NCN 480
CN
Sbjct: 406 PCN 408
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 113/358 (31%), Positives = 174/358 (48%), Gaps = 18/358 (5%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKI 185
V Y + +G P + +++DTGS +TW QC PC+ C +Q P F+P S +++ +
Sbjct: 122 VGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASV 181
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C++ C L N +S C Y +Y D S G+ + D ++ G +
Sbjct: 182 SCSAQQCSDLTTATL-NPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF------GSTS 234
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITF 302
F GC +N G ++G++GL R +S++ + S F YCL P S+ +
Sbjct: 235 VPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCL--PTSSSSSSGY 292
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTII 362
+ N YTP+ ++ Y I +TGI V G+ L + +S ++ L T IDSGT+I
Sbjct: 293 LSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVI 352
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
TR P VYSAL A MK + DTC+ A + + VP++T+ F GG L+
Sbjct: 353 TRLPTGVYSALSKAVAGAMKGTPRASAFS-ILDTCFQGQAAR-LRVPEVTMAFAGGAALK 410
Query: 423 LDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L R LV CL FA S + ++GN QQ+ + V YDV ++GF C+
Sbjct: 411 LAARNLLVDVDSATTCLAFAPARS---AAIIGNTQQQTFSVVYDVKNSKIGFAAAGCS 465
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 130/406 (32%), Positives = 195/406 (48%), Gaps = 50/406 (12%)
Query: 103 RLQKAI------PDNFKKTKAFTFPAKTGIVAAD-EYYIVVAIGKPKQYVSLLLDTGSGI 155
RLQKA ++F+ T ++ +++ + EY + +++G P + + DTGS +
Sbjct: 59 RLQKAFHRSISRANHFRANGVSTNSIQSPVISNNGEYLMNISLGTPPVSMHGIADTGSDL 118
Query: 156 TWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE-CPY 214
W QCKPC C +Q +P FDP+KSKT+ + C +C L GQ CS C Y
Sbjct: 119 LWRQCKPCDSCYEQIEPIFDPAKSKTYQILSCEGKSCSNL------GGQGGCSDDNTCIY 172
Query: 215 DIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-QNGASGIMGLDRG 273
+Y DGS +G A D +TI G + + GC NN G + SG++GL G
Sbjct: 173 SYSYGDGSHTSGDLAVDTLTIGSTTGRP-VSVPKVVFGCGHNNGGTFELHGSGLVGLGGG 231
Query: 274 PVSIISKTNI---SYFFYCLHSPYGS----TGYITFGKPDTVNKKFVKYTPIVTTPEQSE 326
P+S+IS+ F YCL P G+ + + FG V+ TP+ + +
Sbjct: 232 PLSMISQLRPLIGGRFSYCL-VPLGNDPSVSSKMHFGSRGIVSGAGAVSTPLASR-QPDT 289
Query: 327 FYHITLTGISVGGERLPLKASYFTKLSTE----------IDSGTIITRFPAPVYSALRSA 376
FY++TL +SVG ++L K F+K+ + IDSGT +T P Y L S
Sbjct: 290 FYYLTLESMSVGSKKLAYKG--FSKVGSPLADADEGNIIIDSGTTLTLLPQDFYGTLESN 347
Query: 377 FRKRMKKYKMGKGIED---LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVES 433
+ GK + D +F CY S + +P IT HF+ G DLEL T V
Sbjct: 348 VVSAIG----GKPVRDPNNVFSLCY--SNLSGLRIPTITAHFV-GADLELKPLNTFV--Q 398
Query: 434 VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
V++ FA++P +I GN+ Q + V YD+ R + F P +C
Sbjct: 399 VQEDLFCFAMIPVSDLAI-FGNLAQMNFLVGYDLKSRTVSFKPTDC 443
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 115/373 (30%), Positives = 173/373 (46%), Gaps = 40/373 (10%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
+ + V+IG P Q +L+LDTGS + WTQCK + P +DP+KS +F+ PC+
Sbjct: 88 HHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDGR 147
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYPF 249
C+ CS +C Y Y GS T G A++ T G
Sbjct: 148 LCET-----GSFNTKNCSRNKCIYTYNY--GSATTKGELASETFTF----GEHRRVSVSL 196
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY---GSTGYITFGKPD 306
GC +G GASGI+G+ +S++S+ I F YCL +P+ +T +I FG
Sbjct: 197 DFGCGKLTSGSLPGASGILGISPDRLSLVSQLQIPRFSYCL-TPFLDRNTTSHIFFGAMA 255
Query: 307 TVNKKF----VKYTPIVTTPEQSE-FYHITLTGISVGGERLPLKASYFT-----KLSTEI 356
++K ++ T +VT P+ S +Y++ L GISVG +RL + S F T +
Sbjct: 256 DLSKYRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFV 315
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMK---KYKMGKGIEDLFDTCYDL-----SAYKTVV- 407
DSG P+ V AL+ A + +K G E ++ C+ L A +T V
Sbjct: 316 DSGDTTGMLPSVVMEALKEAMVEAVKLPVVNATDHGYE--YELCFQLPRNGGGAVETAVQ 373
Query: 408 VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
VP + HF GG + L +V S ++CL ++ S ++GN QQ+ V +DV
Sbjct: 374 VPPLVYHFDGGAAMLLRRDSYMVEVSAGRMCL---VISSGARGAIIGNYQQQNMHVLFDV 430
Query: 468 AGRRLGFGPGNCN 480
F P CN
Sbjct: 431 ENHEFSFAPTQCN 443
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 177/386 (45%), Gaps = 46/386 (11%)
Query: 128 AAD----EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQ-----------QRDP 172
AAD +Y++ +G P Q L+ DTGS +TW CK HC +
Sbjct: 75 AADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCK--YHCRSRNCSNRKARRIRHKR 132
Query: 173 FFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECP-------YDIAYVDGSGET 225
F + S +F IPC + CKI L D S CP YD Y DGS
Sbjct: 133 VFHANLSSSFKTIPCLTDMCKIEL-------MDLFSLTNCPTPLTPCGYDYRYSDGSTAL 185
Query: 226 GFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNIS 284
GF+A + +T++ G + L+GC+++ G A G+MGL S K
Sbjct: 186 GFFANETVTVELKEGRK-MKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEK 244
Query: 285 Y---FFYCL--H-SPYGSTGYITFGKPDTVNKKF--VKYTPIVTTPEQSEFYHITLTGIS 336
+ F YCL H S + Y+TFG + + YT +V S FY + + GIS
Sbjct: 245 FGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNS-FYAVNMMGIS 303
Query: 337 VGGERLPLKASYFT---KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL 393
+GG L + + + T +DSG+ +T P Y + +A R + K++ +
Sbjct: 304 IGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGP 363
Query: 394 FDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILL 453
+ C++ + ++ +VP++ HF G + E V+ ++ + CLGF + + P + ++
Sbjct: 364 LEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSV-AWPGTSVV 422
Query: 454 GNVQQRGYEVHYDVAGRRLGFGPGNC 479
GN+ Q+ + +D+ ++LGF P +C
Sbjct: 423 GNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 137/425 (32%), Positives = 199/425 (46%), Gaps = 50/425 (11%)
Query: 87 EILRRDQ-------------QRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD-EY 132
EI+ RD QR+ R + +A N A T A++ ++A+ EY
Sbjct: 35 EIIHRDSSRSPYYRPTETQFQRVANALRRSINRANHFNKPNLVASTNTAESTVIASQGEY 94
Query: 133 YIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTC 192
+ ++G P + ++DTGS I W QC+PC C Q P FDPS+SKT+ +PC+S C
Sbjct: 95 LMSYSVGTPPFQILGIVDTGSDIIWLQCQPCEDCYNQTTPIFDPSQSKTYKTLPCSSNIC 154
Query: 193 KILLEWFPPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-F 249
+ + CSS EC Y I Y D S G + + +T+ +G+ ++P
Sbjct: 155 QSV------QSAASCSSNNDECEYTITYGDNSHSQGDLSVETLTLGSTDGSS--VQFPKT 206
Query: 250 LLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISY---FFYC---LHSPYGSTGYITF 302
++GC NN G Q SGI+GL GPVS+IS+ + S F YC L S S+ + F
Sbjct: 207 VIGCGHNNKGTFQREGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPLFSQSNSSSKLNF 266
Query: 303 GKPDTVNKKFVKYTPIVTTPEQS-EFYHITLTGISVGGERLPLKASYFTKLSTE----ID 357
G V+ + TPIV P+ FY +TL SVG R+ +S F E ID
Sbjct: 267 GDEAVVSGRGTVSTPIV--PKNGLGFYFLTLEAFSVGDNRIEFGSSSFESSGGEGNIIID 324
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED---LFDTCYDLSAYKTVVVPKITIH 414
SGT +T P Y L SA ++ + +ED CY ++ + VP IT H
Sbjct: 325 SGTTLTILPEDDYLNLESAVADAIELER----VEDPSKFLRLCYRTTSSDELNVPVITAH 380
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
F G D+EL+ T + VC F P + GN+ Q+ V YD+ + + F
Sbjct: 381 F-KGADVELNPISTFIEVDEGVVCFAFRSSKIGP---IFGNLAQQNLLVGYDLVKQTVSF 436
Query: 475 GPGNC 479
P +C
Sbjct: 437 KPTDC 441
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 112/393 (28%), Positives = 184/393 (46%), Gaps = 35/393 (8%)
Query: 98 LKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITW 157
L + RL A + ++ A A T + I IG P + DTGS +TW
Sbjct: 49 LSHYDRLANAFRRSLSRSAALLNRAATSGAVGLQSSI---IGTPPVDYLGIADTGSDLTW 105
Query: 158 TQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE-CPYDI 216
QC PC+ C QQ P F+P KS +FS +PCN+ TC + + C + C Y
Sbjct: 106 AQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDD-------GHCGVQGVCDYSY 158
Query: 217 AYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVS 276
Y D + G +++TI + ++GC ++G ASG++GL G +S
Sbjct: 159 TYGDRTYSKGDLGFEKITIGS-------SSVKSVIGCGHASSGGFGFASGVIGLGGGQLS 211
Query: 277 IISKTNISY-----FFYCLHSPYG-STGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHI 330
++S+ + + F YCL + + G I FG+ V+ V TP++ + +Y+I
Sbjct: 212 LVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLI-SKNTVTYYYI 270
Query: 331 TLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGI 390
TL IS+G ER ++ + + IDSGT ++ P +Y + S+ K +K ++ K
Sbjct: 271 TLEAISIGNER---HMAFAKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRV-KDP 326
Query: 391 EDLFDTCYD--LSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDP 448
+ +D C+D ++ + +P IT F GG ++ L T + CL L P+ P
Sbjct: 327 GNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCL--TLTPASP 384
Query: 449 NSI--LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++GN+ + + YD+ +RL F P C
Sbjct: 385 TDEFGIIGNLALANFLIGYDLEAKRLSFKPTVC 417
>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
Length = 321
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 105/336 (31%), Positives = 165/336 (49%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPGFT 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ ++ F YCL S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + LT ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S LR R+ + K G E+ CYD+ + +P I++HF G
Sbjct: 229 SELSYIPDRALSVLRQRIRELL--LKRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L G V SV++ CL FA P+ SI+
Sbjct: 287 RFDLGSHGVFVERSVQEQDVWCLAFA--PTKSVSII 320
>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
Length = 389
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 94/263 (35%), Positives = 131/263 (49%), Gaps = 17/263 (6%)
Query: 212 CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLD 271
C Y I Y DGS G +++ G F+ GC NN G G SG+MGL
Sbjct: 76 CNYAINYGDGSFTRGELGHEKLKF------GTILVKDFIFGCGRNNKGLFGGVSGLMGLG 129
Query: 272 RGPVSIISKTNISY---FFYCLHS-PYGSTGYITFGKPDTV--NKKFVKYTPIVTTPEQS 325
R +S+IS+T+ + F YCL S +G + G +V N + Y ++ P+
Sbjct: 130 RSDLSLISQTSGIFGGVFSYCLPSTERKGSGSLILGGNSSVYRNSSPISYAKMIENPQLY 189
Query: 326 EFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYK 385
FY I LTGIS+GG + L+A +DSGT+ITR P +Y AL++ F K+ +
Sbjct: 190 NFYFINLTGISIGG--VALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFP 247
Query: 386 MGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGT--LVVESVRQVCLGFAL 443
+ DTC++LSAY+ V +P I +HF G +L +DV G V QVCL A
Sbjct: 248 PAPAFS-ILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALAS 306
Query: 444 LPSDPNSILLGNVQQRGYEVHYD 466
L +LGN QQ+ V YD
Sbjct: 307 LEYQDEVAILGNYQQKNLRVIYD 329
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 180/380 (47%), Gaps = 32/380 (8%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKP---CIHCSQQRDPFFDPSKSKTFS 183
+ + +Y++ + +G P + L++DTGS +TW QC P + S P++D S S ++
Sbjct: 54 IGSGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYR 113
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCS---SKECPYDIAYVDGSGETGFWATDRMTIQEVNG 240
+IPC C+ L P CS C Y Y D S TG A + ++++
Sbjct: 114 EIPCTDDECQFL----PAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKR 169
Query: 241 NGYFARYP---------FLLGCTDNNTGDQN-GASGIMGLDRGPVSIISKTNIS----YF 286
+G A LGC+ + G GASG++GL +GP+S+ ++T + F
Sbjct: 170 SGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIF 229
Query: 287 FYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKA 346
YCL + +F + + + +TPIV P FY++ +TG++V G+ + A
Sbjct: 230 SYCLVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIA 289
Query: 347 SYFTKLS------TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL 400
S + T DSGT ++ P YS + A + + + I + F+ CY++
Sbjct: 290 SSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPR-AQEIPEGFELCYNV 348
Query: 401 SAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRG 460
+ + + PK+ + F GG +EL +V+ + C+ + + S +LGN+ Q+
Sbjct: 349 TRMEKGM-PKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLLQQD 407
Query: 461 YEVHYDVAGRRLGFGPGNCN 480
+ + YD+A R+GF C+
Sbjct: 408 HHIEYDLAKARIGFKWSPCH 427
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 142 bits (357), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 103/380 (27%), Positives = 179/380 (47%), Gaps = 32/380 (8%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKP---CIHCSQQRDPFFDPSKSKTFS 183
+ + +Y++ + +G P + L++DTGS +TW QC P + S P++D S S ++
Sbjct: 22 IGSGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYR 81
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCSSKE---CPYDIAYVDGSGETGFWATDRMTIQEVNG 240
+IPC C L P CS K C Y Y D S TG A + ++++
Sbjct: 82 EIPCTDDECLFL----PAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRKR 137
Query: 241 NGYFARYP---------FLLGCTDNNTGDQN-GASGIMGLDRGPVSIISKTNIS----YF 286
+G A LGC+ + G GASG++GL +GP+S+ ++T + F
Sbjct: 138 SGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIF 197
Query: 287 FYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKA 346
YCL + +F + + +TPIV P FY++ +TG++V G+ + A
Sbjct: 198 SYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIA 257
Query: 347 SYFTKLS------TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL 400
S + T DSGT ++ P YS + A + + + I + F+ CY++
Sbjct: 258 SSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPR-AQEIPEGFELCYNV 316
Query: 401 SAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRG 460
+ + + PK+ + F GG +EL +V+ + C+ + + S +LGN+ Q+
Sbjct: 317 TRMEKGM-PKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLLQQD 375
Query: 461 YEVHYDVAGRRLGFGPGNCN 480
+ + YD+A R+GF C+
Sbjct: 376 HHIEYDLAKARIGFKWSPCH 395
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 130/414 (31%), Positives = 180/414 (43%), Gaps = 58/414 (14%)
Query: 87 EILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI----VAADEYYIVVAIGKPK 142
E+LRR QR + + L D + ++ + P G EY + +A G P
Sbjct: 41 ELLRRMAQRSKARATHLLSAQ--DQSGRGRSASAPVNPGAYDDGFPFTEYLVHLAAGTPP 98
Query: 143 QYVSLLLDTGSGITWTQCK--PCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFP 200
Q V L LDTGS ITWTQCK P C Q P FDPS S +F+ +PC+S C E P
Sbjct: 99 QEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSSPAC----ETTP 154
Query: 201 P-NGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL-GCTDNNT 258
P G + +S+ C Y I+Y DGS G + T G G A P L+ GC N
Sbjct: 155 PCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLVFGCGHANR 214
Query: 259 GD-QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGS-TGYITFGKPDTVNKKFVKYT 316
G + +GI G RG +S+ S+ + F +C + GS T + G P
Sbjct: 215 GVFTSNETGIAGFGRGSLSLPSQLKVGNFSHCFTTITGSKTSAVLLGLPG---------- 264
Query: 317 PIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-IDSGTIITRFPAPVYSALRS 375
V P S G R + SY + + +SGT IT P Y A+R
Sbjct: 265 --VAPPSASPL-----------GRR---RGSYRCRSTPRSSNSGTSITSLPPRTYRAVRE 308
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYD--LSAYKTVVVPKITIHFLGG-VDLELDVRGTLVVE 432
F ++K + D F TC+ L K VP + +HF G + L + VV+
Sbjct: 309 EFAAQVKLPVVPGNATDPF-TCFSAPLRGPKP-DVPTMALHFEGATMRLPQENYVFEVVD 366
Query: 433 ------SVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
S R +CL + I+LGN+QQ+ V YD+ +L F P C+
Sbjct: 367 DDDAGNSSRIICLAVI----EGGEIILGNIQQQNMHVLYDLQNSKLSFVPAQCD 416
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 104/336 (30%), Positives = 165/336 (49%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + + +DTGS TW C+ C C F S+S T +K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTTWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPSFT 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ ++ F YCL S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + L ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 229 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L RG V SV++ CL FA P++ SI+
Sbjct: 287 RFDLGSRGVFVERSVQEQDVWCLAFA--PTESVSII 320
>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 323
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 102/336 (30%), Positives = 163/336 (48%), Gaps = 29/336 (8%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + L +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPGFS 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ ++ F YCL S G +TGY
Sbjct: 111 FGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ G + V+YT +V + +E + + LT ISV GERL L S F++ DSG
Sbjct: 171 FSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSG 230
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 231 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 288
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L G V SV++ CL FA P++ SI+
Sbjct: 289 RFDLGSHGVFVERSVQEQDVWCLAFA--PTESVSII 322
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 143/313 (45%), Gaps = 39/313 (12%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
V EY + +AIG P Q V L LDTGS + WTQC+PC C Q P+FDPS S T S
Sbjct: 77 VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTS 136
Query: 187 CNSTTCKIL------LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNG 240
C+ST C+ L F PN + C Y +Y D S TGF D+ T V
Sbjct: 137 CDSTLCQGLPVASCGSPKFWPN-------QTCVYTYSYGDKSVTTGFLEVDKFTF--VGA 187
Query: 241 NGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYI 300
F G +N N +GI G RGP+S+ S+ + F +C + G
Sbjct: 188 GASVPGVAFGCGLFNNGVFKSN-ETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGL---- 242
Query: 301 TFGKPDTV-----------NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF 349
KP TV + V+ TP++ P FY+++L GI+VG RLP+ S F
Sbjct: 243 ---KPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEF 299
Query: 350 TKLS----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT 405
+ T IDSGT +T P VY +R AF ++K + D + C
Sbjct: 300 ALKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPY-FCLSAPLRAK 358
Query: 406 VVVPKITIHFLGG 418
VPK+ +HF G
Sbjct: 359 PYVPKLVLHFEGA 371
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 141 bits (355), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 96/327 (29%), Positives = 138/327 (42%), Gaps = 29/327 (8%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQY 144
L + R + R+ S + + D + ++ EY + +AIG P Y
Sbjct: 48 LSRAIARSKARVAALQSAAVLPPVVDPITAARVLV------TASSGEYLVDLAIGTPPLY 101
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQ 204
+ ++DTGS + WTQC PC+ C+ Q P+FD KS T+ +PC S+ C L
Sbjct: 102 YTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASL-------SS 154
Query: 205 DKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGA 264
C K C Y Y D + G A + T N A GC N GD +
Sbjct: 155 PSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATN-IAFGCGSLNAGDLANS 213
Query: 265 SGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTG-------YITFGKPDTVNKKFVKYTP 317
SG++G RGP+S++S+ S F YCL S +T Y +T + V+ TP
Sbjct: 214 SGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTP 273
Query: 318 IVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFPAPVYSA 372
V P Y ++L IS+G + LP+ F IDSGT IT Y A
Sbjct: 274 FVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEA 333
Query: 373 LRSAFRKRMKKYKMGKGIEDL-FDTCY 398
+R + M D+ DTC+
Sbjct: 334 VRRGLVSAIPLTAMND--TDIGLDTCF 358
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 141 bits (355), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 113/388 (29%), Positives = 178/388 (45%), Gaps = 47/388 (12%)
Query: 128 AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC-IHCS-QQRDPFFDPSKSKTFSKI 185
+ +Y++ + +G P Q + L+ DTGS +TW +C C +CS F S TFS
Sbjct: 79 GSGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPT 138
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGN---- 241
C S+ C+++ + P C Y+ Y DGS +GF++ + T+ +G
Sbjct: 139 HCFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKL 198
Query: 242 -------GYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL- 290
G+ A P L+G + NGASG+MGL RGP+S S+ + F YCL
Sbjct: 199 KSIAFGCGFHASGPSLIGSS------FNGASGVMGLGRGPISFASQLGRRFGRSFSYCLL 252
Query: 291 ---HSPYGSTGYITFGKPDTV-----NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL 342
SP T Y+ G D V NK + +TP++ PE FY+I++ G+ V G +L
Sbjct: 253 DYTLSP-PPTSYLMIG--DVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKL 309
Query: 343 PLKASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRKRMK---KYKMGKGIEDLF 394
+ S ++ T IDSGT +T P Y + SAF++ +K G F
Sbjct: 310 HIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGASTRSGF 369
Query: 395 DTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNS---I 451
D C +++ P++++ G R + S CL A+ P + S
Sbjct: 370 DLCVNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDISEGIKCL--AIQPVEAESGRFS 427
Query: 452 LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++GN+ Q+G+ + +D RLGF C
Sbjct: 428 VIGNLMQQGFLLEFDRGKSRLGFSRRGC 455
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 141 bits (355), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 121/413 (29%), Positives = 187/413 (45%), Gaps = 38/413 (9%)
Query: 76 QGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIV 135
+ ++ TPS EI +R H + +R + + + + F P +G EY I
Sbjct: 43 RSETLKTPS--EIFIAAVKRGHERRARLAKHVLAGD----QLFETPVASG---NGEYLID 93
Query: 136 VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
++ G P Q + ++DTGS + W QC PC C + FDPSKS ++ + C S C+ L
Sbjct: 94 ISYGNPPQKSTAIVDTGSDLNWVQCLPCKSCYETLSAKFDPSKSASYKTLGCGSNFCQDL 153
Query: 196 LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTD 255
+ + C YD Y DGS +G +TD +TI G G F GC +
Sbjct: 154 --------PFQSCAASCQYDYMYGDGSSTSGALSTDDVTI----GTGKIPNVAF--GCGN 199
Query: 256 NNTGDQNGASGIMGLDRGPVSIISK---TNISYFFYCLHSPYGSTGYITFGKPDTVNKKF 312
+N G GA G++GL +GP+S++S+ T F YCL P GST D+
Sbjct: 200 SNLGTFAGAGGLVGLGKGPLSLVSQLGGTATKKFSYCL-VPLGSTKTSPLYIGDSTLAGG 258
Query: 313 VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRFPA 367
V YTP++T FY+ L GISV G+ + A+ F +T +DSGT +T
Sbjct: 259 VAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLTYLDV 318
Query: 368 PVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRG 427
++ + +A + + Y G + C+ + P + HF G D+ L
Sbjct: 319 DAFNPMVAALKAAL-PYPEADGSFYGLEYCFSTAGVANPTYPTVVFHF-NGADVALAPDN 376
Query: 428 TLVVESVR-QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
T + CL A S + GN+QQ + + +D+ +R+GF NC
Sbjct: 377 TFIALDFEGTTCLAMA---SSTGFSIFGNIQQLNHVIVHDLVNKRIGFKSANC 426
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 129/420 (30%), Positives = 193/420 (45%), Gaps = 39/420 (9%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQY 144
+ + LRRD R + R L + + + P + + EY + +AIG P Q
Sbjct: 47 VRDALRRDMHR-RARFGRELASSS-SSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQS 104
Query: 145 VSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTFSKIPCNST--TCKI---LLEW 198
+ DTGS + WTQC PC C +Q P ++PS S TF +PC+S C L
Sbjct: 105 YPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAGA 164
Query: 199 FPPNGQDKCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYP-FLLGCTDN 256
PP G C+ C Y+ Y G+G T G ++ T + R P GC++
Sbjct: 165 TPPPG---CA---CRYNQTY--GTGWTSGLQGSETFTFGSSPADQ--VRVPGIAFGCSNA 214
Query: 257 NTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST---GYITFG---KPDTVNK 310
++ D NG++G++GL RG +S++S+ F YCL +P+ T + G +N
Sbjct: 215 SSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL-TPFQDTKSKSTLLLGPAAAAAALNG 273
Query: 311 KFVKYTPIVTTPEQ---SEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTII 362
V+ TP V +P + S +Y++ LTGISVG LP+ F + IDSGT I
Sbjct: 274 TGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRADGTGGLIIDSGTTI 333
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL--SAYKTVVVPKITIHFLGGVD 420
T Y +R+A R +K D C+ L S+ +P +T+HF GG D
Sbjct: 334 TSLVDAAYKRVRAAVRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGAD 393
Query: 421 LELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ L V ++++ CL +D LGN QQ+ + YDV L F P C+
Sbjct: 394 MVLPVENYMILDG-GMWCLAMRSQ-TDGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 451
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 104/336 (30%), Positives = 165/336 (49%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + L +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPSFS 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ ++ F YCL S G +TGY
Sbjct: 111 FGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + LT ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 229 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L G V SV++ CL FA P++ SI+
Sbjct: 287 RFDLGSHGVFVERSVQEQDVWCLAFA--PTESVSII 320
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 131/444 (29%), Positives = 201/444 (45%), Gaps = 62/444 (13%)
Query: 60 VSLEVLGRYGPCSKLNQGKSRNTPS----LEEILRRDQQRLHLKNSRRLQKAIPDNFKKT 115
+L+V +GPCS L G + PS L + RD RL +S +
Sbjct: 42 ATLQVSHAFGPCSPL--GNAAAAPSWAGFLADQSSRDASRLLYLDSLAVAG--------- 90
Query: 116 KAFTFPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPF 173
+A+ P +G ++ Y + +G P Q + L +DT + W C C C
Sbjct: 91 RAYA-PIASGRQLLQTPTYVVRARLGTPPQQLLLAVDTSNDAAWIPCSGCAGCPTTTP-- 147
Query: 174 FDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRM 233
F+P+ SK++ +PC S C PN ++K C + + Y D S E + D +
Sbjct: 148 FNPAASKSYRAVPCGSPACSRA-----PNPSCSLNTKSCGFSLTYADSSLEAAL-SQDSL 201
Query: 234 TIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL 290
+ N Y F GC TG G++GL RGP+S +S+T Y F YCL
Sbjct: 202 AV----ANDVVKSYTF--GCLQKATGTATPPQGLLGLGRGPLSFLSQTKDMYEGTFSYCL 255
Query: 291 HS--PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASY 348
S +G + G+ + +K TP++ P +S Y++++TGI VG + +P+ +
Sbjct: 256 PSFKSLNFSGTLRLGRKGQPLR--IKTTPLLVNPHRSSLYYVSMTGIRVGKKVVPIPPAA 313
Query: 349 F-----TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--FDTCYDLS 401
T T +DSGT+ TR AP Y A+R R+R++ G + L FDTCY+
Sbjct: 314 LAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRIR----GAPLSSLGGFDTCYN-- 367
Query: 402 AYKTVVVPKITIHFLG-GVDLELDVRGTLVVESV--RQVCLGFALLPSDPNSIL--LGNV 456
TV P +T F G V L D LV+ S CL A P N++L + ++
Sbjct: 368 --TTVKWPPVTFMFTGMQVTLPAD---NLVIHSTYGTTSCLAMAAAPDGVNTVLNVIASM 422
Query: 457 QQRGYEVHYDVAGRRLGFGPGNCN 480
QQ+ + + +DV R+GF C
Sbjct: 423 QQQNHRILFDVPNGRVGFAREQCT 446
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 116/368 (31%), Positives = 175/368 (47%), Gaps = 38/368 (10%)
Query: 118 FTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDP 176
F+ P +G+ + EY+ V +G P L+LDTGS + W QC PC C Q FDP
Sbjct: 127 FSAPVVSGLAQGSGEYFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQCYAQSGRVFDP 186
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQ 236
+S++++ + C + C+ L G C Y +AY DGS G AT+ T+
Sbjct: 187 RRSRSYAAVRCGAPPCRGLDAGG--GGGCDRRRGTCLYQVAYGDGSVTAGDLATE--TLW 242
Query: 237 EVNGNGYFARYPFL-LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS 292
G AR P + +GC +N G A+G++GL RG +S+ ++T Y F YC
Sbjct: 243 FARG----ARVPRVAVGCGHDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFSYCFQ- 297
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL 352
GS + + I+ T Q H+ + GER +
Sbjct: 298 --GSD---------------LDHRTIIRTVHQ----HVGGARVRGVGERSLRLDPSTGRG 336
Query: 353 STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKIT 412
+DSGT +TR PVY A+R AFR ++ G LFDTCYDL + V VP ++
Sbjct: 337 GVILDSGTSVTRLARPVYVAVREAFRAAAGGLRLAPGGFSLFDTCYDLRGRRVVKVPTVS 396
Query: 413 IHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRR 471
+H GG ++ L L+ V++ CL AL +D ++GN+QQ+G+ V +D +R
Sbjct: 397 VHLAGGAEVALPPENYLIPVDTRGTFCL--ALAGTDGGVSIVGNIQQQGFRVVFDGDRQR 454
Query: 472 LGFGPGNC 479
+ P +C
Sbjct: 455 VALVPKSC 462
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 176/386 (45%), Gaps = 46/386 (11%)
Query: 128 AAD----EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQ-----------QRDP 172
AAD +Y + +G P Q L+ DTGS +TW CK HC +
Sbjct: 75 AADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCK--YHCRSRNCSNRKARRIRHKR 132
Query: 173 FFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECP-------YDIAYVDGSGET 225
F + S +F IPC + CKI L D S CP YD Y DGS
Sbjct: 133 VFHANLSSSFKTIPCLTDMCKIEL-------MDLFSLTNCPTPLTPCGYDYRYSDGSTAL 185
Query: 226 GFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNIS 284
GF+A + +T++ G + L+GC+++ G A G+MGL S K
Sbjct: 186 GFFANETVTVELKEGRK-MKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEK 244
Query: 285 Y---FFYCL--H-SPYGSTGYITFGKPDTVNKKF--VKYTPIVTTPEQSEFYHITLTGIS 336
+ F YCL H S + Y+TFG + + YT +V S FY + + GIS
Sbjct: 245 FGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNS-FYAVNMMGIS 303
Query: 337 VGGERLPLKASYFT---KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL 393
+GG L + + + T +DSG+ +T P Y + +A R + K++ +
Sbjct: 304 IGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGP 363
Query: 394 FDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILL 453
+ C++ + ++ +VP++ HF G + E V+ ++ + CLGF + + P + ++
Sbjct: 364 LEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSV-AWPGTSVV 422
Query: 454 GNVQQRGYEVHYDVAGRRLGFGPGNC 479
GN+ Q+ + +D+ ++LGF P +C
Sbjct: 423 GNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|413944378|gb|AFW77027.1| hypothetical protein ZEAMMB73_570500 [Zea mays]
Length = 484
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 118/446 (26%), Positives = 187/446 (41%), Gaps = 46/446 (10%)
Query: 61 SLEVLGRYGPCSKLNQGKSRNTP---SLEEILRRDQQRLH--LKNSRRLQKAIPDNFKKT 115
++ V+ R PCS L P S+ ++L RD RL L +
Sbjct: 58 AVPVVHRLSPCSPLAGAARNQQPERRSVADVLHRDALRLRSLLHREEDNHRTPAPAAPPG 117
Query: 116 KAFTFPAK----TGIVAADEYYIVVAIGKPKQYVSLLLDTGS-GITWTQCKPCIHCSQQR 170
+ P++ + A EY++V G P Q + + DT + G T QC PC
Sbjct: 118 GGVSIPSRGEPIEELPGAFEYHVVAGFGTPMQKLPVGFDTTTTGATLLQCTPC---GSGA 174
Query: 171 DPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVD---GSGETG 226
D FDPS S + S++PC S C CS + C +++ + G+
Sbjct: 175 DHAFDPSASSSVSQVPCGSPDCPF----------HGCSGRPSCTLSVSFNNTLLGNATFF 224
Query: 227 FWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNIS-- 284
+ + R+ L G ++G++GI+ L R S+ S+ S
Sbjct: 225 TDTLTLTPSSSATVDKF--RFACLEGIAPGPA--EDGSAGILDLSRNSHSLPSRLVASSP 280
Query: 285 ----YFFYCLHSPYGSTGYITFG--KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVG 338
F YCL + G+++ G KP+ + +K V YTP+ +P Y + L G+ +G
Sbjct: 281 PHAVAFSYCLPASTADVGFLSLGATKPELLGRK-VSYTPLRGSPSNGNLYVVDLVGLGLG 339
Query: 339 GERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCY 398
G LP+ + T ++ T T VY LR +FRK M +Y + L DTCY
Sbjct: 340 GPDLPIPPAAIAGDDTILELHTTFTYLKPQVYKVLRDSFRKSMSEYPAAPPLGSL-DTCY 398
Query: 399 DLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV----CLGFALLPSDPN-SILL 453
+ + VP +T+ F GG D++L + + CL F D + ++
Sbjct: 399 NFTGLDAFSVPAVTLKFAGGADVDLWMDEMMYFTDPDNHFSIGCLAFVAQDDDCDGGTVI 458
Query: 454 GNVQQRGYEVHYDVAGRRLGFGPGNC 479
G++ Q EV YDV G ++GF P C
Sbjct: 459 GSMAQMSTEVVYDVRGGKVGFVPYRC 484
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 157/358 (43%), Gaps = 39/358 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + + +G P + +DTGS + WTQC PC +C Q P FDPS S TF
Sbjct: 61 YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFK-------- 112
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
+ +C+ C Y I Y D + G AT+ +TI +G F +
Sbjct: 113 ------------EKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEP-FVMPETTI 159
Query: 252 GCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTV 308
GC N++ + SG++GL GP S+I++ Y YC S T I FG V
Sbjct: 160 GCGHNSSWFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQ--GTSKINFGTNAIV 217
Query: 309 NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE--IDSGTIITRFP 366
V T + T + Y++ L +SVG + + F L IDSGT +T FP
Sbjct: 218 AGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYFP 277
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVV---PKITIHFLGGVDLEL 423
+ +R A + + T D+ Y T + P IT+HF GG DL L
Sbjct: 278 VSYCNLVREAVDHYVTAVRTAD------PTGNDMLCYYTDTIDIFPVITMHFSGGADLVL 331
Query: 424 DVRGTLVVESVRQVCLGFALLPSD-PNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
D + + +E++ + A++ ++ P + GN Q + V YD + + F P NC+
Sbjct: 332 D-KYNMYIETITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388
>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 104/336 (30%), Positives = 165/336 (49%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPGFS 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ ++ F YCL S G +TGY
Sbjct: 111 FGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + LT ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + K G E+ CYD+ + +P I++HF G
Sbjct: 229 SELSYIPDRALSVLSQRIRELL--LKRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L G V SV++ CL FA P++ SI+
Sbjct: 287 RFDLGSHGVFVERSVQEQDVWCLAFA--PTESVSII 320
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 106/361 (29%), Positives = 178/361 (49%), Gaps = 24/361 (6%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
++ + Y + + IG P Q +L+ DT S +TWTQC ++Q +P FDP+KS +F+ +
Sbjct: 86 ISDEGYTVTIGIGTPPQLHTLIADTASDLTWTQCNLFNDTAKQVEPLFDPAKSSSFAFVT 145
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
C+S C E P G +CS+K C Y YV G A + T+ + N + +
Sbjct: 146 CSSKLCT---EDNP--GTKRCSNKTCRYVYPYVSVEA-AGVLAYESFTLSDNNQHICMS- 198
Query: 247 YPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGS--TGYITFGK 304
F GC G+ GASGI+G+ +S++S+ I F YCL +PY + + FG
Sbjct: 199 --FGFGCGALTDGNLLGASGILGMSPAILSMVSQLAIPKFSYCL-TPYTDRKSSPLFFGA 255
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT--KLSTEIDSGTII 362
+ ++ PI + + +Y++ L G+S+G RL + A+ F + T +D G +
Sbjct: 256 WADLG-RYKTTGPIQKS--LTFYYYVPLVGLSLGTRRLDVPAATFALKQGGTVVDLGCTV 312
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS---AYKTVVVPKITIHFLGGV 419
+ P ++AL+ A + + ++D + C+ L A V P + ++F GG
Sbjct: 313 GQLAEPAFTALKEAVLHTLNLPLTNRTVKD-YKVCFALPSGVAMGAVQTPPLVLYFDGGA 371
Query: 420 DLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
D+ L + +CL AL+P SI +GNVQQ+ + + +DV + F P C
Sbjct: 372 DMVLPRDNYFQEPTAGLMCL--ALVPGGGMSI-IGNVQQQNFHLLFDVHDSKFLFAPTIC 428
Query: 480 N 480
+
Sbjct: 429 D 429
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 125/456 (27%), Positives = 194/456 (42%), Gaps = 75/456 (16%)
Query: 84 SLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPK 142
SL ++ R D++R+ +SR ++A + AF P +G +Y++ +G P
Sbjct: 42 SLADLARMDRERMAFISSRGRRRAA----ETASAFAMPLSSGAYTGTGQYFVRFRVGTPA 97
Query: 143 QYVSLLLDTGSGITWTQCK----------------PCIHCSQQRDPFFDPSKSKTFSKIP 186
Q L+ DTGS +TW +C P + R F P KS+T++ IP
Sbjct: 98 QPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRR-TFRPDKSRTWAPIP 156
Query: 187 CNSTTCKILLEWFPPNGQDKCS--SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF 244
C+S TC+ L P C+ + C YD Y DGS G D TI +G
Sbjct: 157 CSSATCRESL----PFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATIAL---SGRA 209
Query: 245 ARYPFL----LGCTDNNTGDQNGAS-GIMGLDRGPVSIISKTNISY---FFYCL--H-SP 293
AR L LGCT + G AS G++ L +S S+ + F YCL H +P
Sbjct: 210 ARKAKLRGVVLGCTTSYNGQSFLASDGVLSLGYSNISFASRAASRFGGRFSYCLVDHLAP 269
Query: 294 YGSTGYITFGKPDTVNKK-----------------------FVKYTPIVTTPEQSEFYHI 330
+T Y+TFG + + + TP+V FY +
Sbjct: 270 RNATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAV 329
Query: 331 TLTGISVGGERLPLKASYFTKLS---TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMG 387
T+ G+SV GE L + + + +DSGT +T P Y A+ +A KR+ +
Sbjct: 330 TVKGVSVAGELLKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAVVAALSKRLA--GLP 387
Query: 388 KGIEDLFDTCYDLSAYK----TVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFAL 443
+ D FD CY+ ++ +P + +HF G LE + ++ + C+G
Sbjct: 388 RVTMDPFDYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVIDAAPGVKCIGLQE 447
Query: 444 LPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
P P ++GN+ Q+ + YD+ RRL F C
Sbjct: 448 GPW-PGLSVIGNILQQEHLWEYDLKNRRLRFKRSRC 482
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 111/370 (30%), Positives = 170/370 (45%), Gaps = 32/370 (8%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQC-KPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
Y + AIG P +S +LDTGS + WTQC PC C Q P + P++S T++ + C S
Sbjct: 100 YLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVSCGSR 159
Query: 191 TCKILLEWFPPNGQDKCSSKE------CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF 244
C L P + +S C Y +Y DGS G AT+ T G G
Sbjct: 160 LCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTF----GAGTT 215
Query: 245 ARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST---GYIT 301
+ GC +N G + +SG++G+ RGP+S++S+ ++ F YC +P+ T +
Sbjct: 216 V-HDLAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQLGVTKFSYCF-TPFNDTTTSSPLF 273
Query: 302 FGKPDTVN--KKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE---- 355
G +++ K + P + P +S +Y+++L GI+VG LP+ + F ++
Sbjct: 274 LGSSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFRLTASGRGGL 333
Query: 356 -IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL---SAYKTVVVPKI 411
IDSGT T + L A + G C+ + V VP++
Sbjct: 334 IIDSGTTFTALEERAFVVLARA-VAARVALPLASGAHLGLSVCFAAPQGRGPEAVDVPRL 392
Query: 412 TIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR 470
+HF G D+EL +V + V V CLG S +LG++QQ+ V YDV
Sbjct: 393 VLHF-DGADMELPRSSAVVEDRVAGVACLGIV---SARGMSVLGSMQQQNMHVRYDVGRD 448
Query: 471 RLGFGPGNCN 480
L F P NC
Sbjct: 449 VLSFEPANCG 458
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 104/379 (27%), Positives = 172/379 (45%), Gaps = 42/379 (11%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQ-----------QRDPFFDPSKS 179
+Y + +G P Q L+ DTGS +TW CK HC + F + S
Sbjct: 11 QYSVAFKVGTPSQKFMLVADTGSDLTWMSCK--YHCRSRNCSNRKARRIRHKRVFHANLS 68
Query: 180 KTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECP-------YDIAYVDGSGETGFWATDR 232
+F IPC + CKI L D S CP YD Y DGS GF+A +
Sbjct: 69 SSFKTIPCLTDMCKIEL-------MDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANET 121
Query: 233 MTIQEVNGNGYFARYPFLLGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISY---FFY 288
+T++ G + L+GC+++ G A G+MGL S K + F Y
Sbjct: 122 VTVELKEGRK-MKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSY 180
Query: 289 CL---HSPYGSTGYITFGKPDTVNKKF--VKYTPIVTTPEQSEFYHITLTGISVGGERLP 343
CL S + Y+TFG + + YT +V S FY + + GIS+GG L
Sbjct: 181 CLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNS-FYAVNMMGISIGGAMLK 239
Query: 344 LKASYFT---KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL 400
+ + + T +DSG+ +T P Y + +A R + K++ + + C++
Sbjct: 240 IPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNS 299
Query: 401 SAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRG 460
+ ++ +VP++ HF G + E V+ ++ + CLGF + + P + ++GN+ Q+
Sbjct: 300 TGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSV-AWPGTSVVGNIMQQN 358
Query: 461 YEVHYDVAGRRLGFGPGNC 479
+ +D+ ++LGF P +C
Sbjct: 359 HLWEFDLGLKKLGFAPSSC 377
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 127/441 (28%), Positives = 199/441 (45%), Gaps = 54/441 (12%)
Query: 61 SLEVLGRYGPCSKLNQGKSRNTPS----LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTK 116
+L+V +GPCS L G + PS L + RD RL +S + K +
Sbjct: 43 TLQVSHAFGPCSPLGPGTT--APSWAGFLADQASRDASRLLYLDSLAARG-------KAR 93
Query: 117 AFTFPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFF 174
A+ P +G ++ Y + +G P Q + L +DT + W C C C P F
Sbjct: 94 AYA-PIASGRQLLQTPTYVVRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPF 152
Query: 175 DPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMT 234
DP+ S ++ +PC S C PN K C + + Y D S + + D +
Sbjct: 153 DPAASTSYRSVPCGSPLCAQA-----PNAACPPGGKACGFSLTYADSSLQAAL-SQDSL- 205
Query: 235 IQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLH 291
V G+ A + GC TG G++GL RGP+S +S+T Y F YCL
Sbjct: 206 --AVAGD---AVKTYTFGCLQKATGTAAPPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLP 260
Query: 292 S--PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF 349
S +G + G+ +K TP++ P +S Y++ +TGI VG + +P+
Sbjct: 261 SFKSLNFSGTLRLGR--NGQPPRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPAL 318
Query: 350 -----TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--FDTCYDLSA 402
T T +DSGT+ TR AP Y A+R R+R +G + L FDTC++ +A
Sbjct: 319 AFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRR-----VGAPVSSLGGFDTCFNTTA 373
Query: 403 YKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQR 459
V P +T+ F G+ + L ++ + + CL A P N++L + ++QQ+
Sbjct: 374 ---VAWPPVTLLF-DGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQ 429
Query: 460 GYEVHYDVAGRRLGFGPGNCN 480
+ V +DV R+GF C
Sbjct: 430 NHRVLFDVPNGRVGFARERCT 450
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 129/420 (30%), Positives = 193/420 (45%), Gaps = 39/420 (9%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQY 144
+ + LRRD R + R L + + + P + + EY + +AIG P Q
Sbjct: 47 VRDALRRDMHR-RARFGRELASSS-SSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQS 104
Query: 145 VSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTFSKIPCNST--TCKI---LLEW 198
+ DTGS + WTQC PC C +Q P ++PS S TF +PC+S C L
Sbjct: 105 YPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAGA 164
Query: 199 FPPNGQDKCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYP-FLLGCTDN 256
PP G C+ C Y+ Y G+G T G ++ T + R P GC++
Sbjct: 165 TPPPG---CA---CRYNQTY--GTGWTSGLQGSETFTFGSSPADQ--VRVPGIAFGCSNA 214
Query: 257 NTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST---GYITFG---KPDTVNK 310
++ D NG++G++GL RG +S++S+ F YCL +P+ T + G +N
Sbjct: 215 SSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL-TPFQDTKSKSTLLLGPAAAAAALNG 273
Query: 311 KFVKYTPIVTTPEQ---SEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTII 362
V+ TP V +P + S +Y++ LTGISVG LP+ F + IDSGT I
Sbjct: 274 TGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGGLIIDSGTTI 333
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL--SAYKTVVVPKITIHFLGGVD 420
T Y +R+A R +K D C+ L S+ +P +T+HF GG D
Sbjct: 334 TSLVDAAYKRVRAAVRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGAD 393
Query: 421 LELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ L V ++++ CL +D LGN QQ+ + YDV L F P C+
Sbjct: 394 MVLPVENYMILDG-GMWCLAMRSQ-TDGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 451
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 100/370 (27%), Positives = 170/370 (45%), Gaps = 34/370 (9%)
Query: 126 IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQ-QRDPFFDPSKSKTFSK 184
I+ Y +G P Q + + +D + W C C+ C+ P FDP++S T+
Sbjct: 94 ILRTPSYVARARLGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASSPSFDPTQSSTYRP 153
Query: 185 IPCNSTTCKILLEWFPPNGQDKCSS---KECPYDIAYVDGSGETGFWATDRMTIQEVNGN 241
+ C + C + P C + C ++++Y S D +++ + NG
Sbjct: 154 VRCGAPQCAQV-----PPATPSCPAGPGASCAFNLSYAS-STLHAVLGQDALSLSDSNGA 207
Query: 242 GYFARYPFLLGCTDNNTGDQNGA--SGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS 296
+ + GC TG G++G RGP+S +S+T +Y F YCL S S
Sbjct: 208 AVPDDH-YTFGCLRVVTGSGGSVPPQGLVGFGRGPLSFLSQTKATYGSIFSYCLPSYKSS 266
Query: 297 --TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT---- 350
+G + G + +K TP+++ P + Y++ + G+ V G+ +P+ AS
Sbjct: 267 NFSGTLRLGPAG--QPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAA 324
Query: 351 --KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVV 408
+ T +D+GT+ TR P Y+ALR+AFR+ + FDTCY ++ K+ V
Sbjct: 325 TGRGGTIVDAGTMFTRLSPPAYAALRNAFRRGVSAPAAPA--LGGFDTCYYVNGTKS--V 380
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSI---LLGNVQQRGYEVH 464
P + F GG + L ++ + V CL A PSD + +L ++QQ+ + V
Sbjct: 381 PAVAFVFAGGARVTLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVV 440
Query: 465 YDVAGRRLGF 474
+DV R+GF
Sbjct: 441 FDVGNGRVGF 450
>gi|414869114|tpg|DAA47671.1| TPA: hypothetical protein ZEAMMB73_872184 [Zea mays]
Length = 492
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 106/364 (29%), Positives = 169/364 (46%), Gaps = 34/364 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
+Y + V G P+Q + LDT G++ CKPC S DP FD S+S TF+ +PC+S
Sbjct: 148 DYTVNVGYGTPEQQFPMFLDTIFGVSLVLCKPCAPGSTSCDPAFDTSQSTTFTHVPCDSP 207
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C P+ + + CP+++ +V+G+ ++ D +T+ A F
Sbjct: 208 DC--------PSTANCSAGSVCPFNLFFVEGT-----FSQDVLTVAP-----SVAVQDFT 249
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISK---TNISYFFYCLHSPYGSTGYITFGKPDT 307
C D D G + L R S+ S+ + + F YC+ S G+++ G T
Sbjct: 250 FVCLDAGASDGMPEVGTLDLSRDRNSLPSRLAGSASAAFSYCMPQYPDSPGFLSLGDDAT 309
Query: 308 V-NKKFVKYTPIVTT--PEQSEFYHITLTGISVGGERLPLKASYF-TKLSTEIDSGTIIT 363
V + P++++ P+ + Y I + G+S+G LP+ + F ST +++GT T
Sbjct: 310 VRGDNCTAHAPLLSSDDPDLANMYFIDVVGMSLGDVDLPIPSGTFGNNASTIVEAGTTFT 369
Query: 364 RFPAPVYSALRSAFRKRMKKYKMG-KGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
Y+ LR AFR+ M +Y G D FDTCY+ + + + VP + F G L
Sbjct: 370 MLAPDAYTPLRDAFRQAMAQYNRSVPGFYD-FDTCYNFTGLQELTVPLVEFKFGNGDSLL 428
Query: 423 LDVRGTLVVESVRQ-----VCLGFALL--PSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
+D L + + CL F+ L D S ++G EV YDVAG +GF
Sbjct: 429 IDGDQMLYYDIPSEGPFTVTCLAFSTLDVDDDDVSAVIGAYSLATTEVVYDVAGGTVGFI 488
Query: 476 PGNC 479
P +C
Sbjct: 489 PESC 492
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 111/373 (29%), Positives = 170/373 (45%), Gaps = 34/373 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + + +G P + + ++DTGS + W QCKPC C Q DP +DPS S TF+K C++++
Sbjct: 4 YTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCSTSS 63
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C+ L P +G S+K C Y Y D S G +A + +T++ G+ +P F
Sbjct: 64 CQSL----PASGCSS-SAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSS--KAFPNFQ 116
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGSTGYITFGK 304
GC N+G GA+GI+GL +G +S+ ++ + F YCL T + FG
Sbjct: 117 FGCGRLNSGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFGS 176
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS----------- 353
+ + TPI+ +S +Y + L GISVGG++L L LS
Sbjct: 177 SASTGSGAIS-TPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRA 235
Query: 354 -------TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV 406
T DSGT +T VYS ++SAF + + FD CYD+S K
Sbjct: 236 LEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTV-DASSSGFDLCYDVSKSKNF 294
Query: 407 VVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYD 466
P +T+ F G ++V++ V ++GN+ Q+ Y V YD
Sbjct: 295 KFPALTLAFKGTKFSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQQNYHVVYD 354
Query: 467 VAGRRLGFGPGNC 479
+ P C
Sbjct: 355 RGTSTISMSPAQC 367
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 91/238 (38%), Positives = 127/238 (53%), Gaps = 25/238 (10%)
Query: 59 KVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAF 118
K SL V+ +G CS L+ K +EILRRD+ R+ +S+ L K I D K K+
Sbjct: 62 KSSLRVVHMHGACSHLSSNKDARLDH-DEILRRDEARVESIHSK-LSKNIADEVSKAKST 119
Query: 119 TFPAKTGIVAADEYYIV-VAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDP 176
PAK GI+ YIV + IG PK +SL+ DTGS +TWTQC+PC+ C Q++P F+P
Sbjct: 120 KLPAKNGIILGSPNYIVTIGIGTPKHDISLMFDTGSDLTWTQCEPCLGSCYSQKEPKFNP 179
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTI- 235
S S ++ + C+S C + CS+ C Y I Y DGS GF A ++ T+
Sbjct: 180 SSSSSYHNVSCSSPMC---------GNPESCSASNCLYGIGYGDGSVTVGFLAKEKFTLT 230
Query: 236 -QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYC 289
+V + YF GC +NN G G++GI+GL G S +T +Y F YC
Sbjct: 231 NSDVLDDIYF-------GCGENNKGVFIGSAGILGLGPGKFSFPLQTTTTYNNIFSYC 281
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 114/370 (30%), Positives = 160/370 (43%), Gaps = 56/370 (15%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
V EY + +AIG P Q V L LDTGS + WTQC+PC C Q P+FDPS S T S
Sbjct: 84 VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTS 143
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
C+ST C+ L +A + S + F G G F
Sbjct: 144 CDSTLCQGL-------------------PVASLPRSDKFTFVGAGASVPGVAFGCGLF-- 182
Query: 247 YPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPD 306
NN ++ +GI G RGP+S+ S+ + F +C + IT P
Sbjct: 183 ---------NNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTT-------ITGAIPS 226
Query: 307 TV-----------NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS-- 353
TV + V+ TP++ P FY+++L GI+VG RLP+ S F +
Sbjct: 227 TVLLDLPADLFSNGQGAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGT 286
Query: 354 --TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
T IDSGT +T P VY +R AF ++K + D + C VPK+
Sbjct: 287 GGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPY-FCLSAPLRAKPYVPKL 345
Query: 412 TIHFLGG-VDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR 470
+HF G +DL + VE L A++ +GN QQ+ V YD+
Sbjct: 346 VLHFEGATMDLPRE-NYVFEVEDAGSSILCLAIIEGG-EVTTIGNFQQQNMHVLYDLQNS 403
Query: 471 RLGFGPGNCN 480
+L F P C+
Sbjct: 404 KLSFVPAQCD 413
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 129/420 (30%), Positives = 193/420 (45%), Gaps = 39/420 (9%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQY 144
+ + LRRD R + R L + + + P + + EY + +AIG P Q
Sbjct: 52 VRDALRRDMHR-RARFGRELASSS-SSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQS 109
Query: 145 VSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTFSKIPCNST--TCKI---LLEW 198
+ DTGS + WTQC PC C +Q P ++PS S TF +PC+S C L
Sbjct: 110 YPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAGA 169
Query: 199 FPPNGQDKCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYP-FLLGCTDN 256
PP G C+ C Y+ Y G+G T G ++ T + R P GC++
Sbjct: 170 TPPPG---CA---CRYNQTY--GTGWTSGLQGSETFTFGSSPADQ--VRVPGIAFGCSNA 219
Query: 257 NTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST---GYITFG---KPDTVNK 310
++ D NG++G++GL RG +S++S+ F YCL +P+ T + G +N
Sbjct: 220 SSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL-TPFQDTKSKSTLLLGPAAAAAALNG 278
Query: 311 KFVKYTPIVTTPEQ---SEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTII 362
V+ TP V +P + S +Y++ LTGISVG LP+ F + IDSGT I
Sbjct: 279 TGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGGLIIDSGTTI 338
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL--SAYKTVVVPKITIHFLGGVD 420
T Y +R+A R +K D C+ L S+ +P +T+HF GG D
Sbjct: 339 TSLVDAAYKRVRAAVRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGAD 398
Query: 421 LELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ L V ++++ CL +D LGN QQ+ + YDV L F P C+
Sbjct: 399 MVLPVENYMILDG-GMWCLAMRSQ-TDGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 456
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 118/376 (31%), Positives = 179/376 (47%), Gaps = 42/376 (11%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY +A+G P L +DTGS ITW QC+PC C Q P FDP S ++ ++ ++
Sbjct: 133 EYMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGPVFDPRHSTSYREMGYDAP 192
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYV-DGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C+ L +G C Y + Y DGS G + + +T + P
Sbjct: 193 DCQALGR----SGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAG------GVQVPH 242
Query: 250 L-LGCTDNNTGD-QNGASGIMGLDRGPVSIISKT-----NISYFFYC-----LHSPYGS- 296
+ +GC +N G A+GI+GL RG +S S+ N++ F YC L SP S
Sbjct: 243 MSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLSSPGRSV 302
Query: 297 TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGG--------ERLPLKASY 348
+ +T G +TP V + FY++ L G+SVGG + L L Y
Sbjct: 303 SSTLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLKLD-PY 361
Query: 349 FTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGK----GIEDLFDTCYDLSAYK 404
+ +DSGT +TR Y + R +G+ G FDTCY + +
Sbjct: 362 TGRGGVILDSGTAVTRLARRAY--IAFRDAFRAAAVDLGQVSIGGPSGFFDTCYTMGG-R 418
Query: 405 TVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEV 463
+ VP +++HF GGV+L L + L+ V+S+ VC FA D + ++GN+QQ+G+ V
Sbjct: 419 AMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGT-GDRSVSIIGNIQQQGFRV 477
Query: 464 HYDVAGRRLGFGPGNC 479
Y++ G R+GF P +C
Sbjct: 478 VYNIGGGRVGFAPNSC 493
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 119/409 (29%), Positives = 180/409 (44%), Gaps = 56/409 (13%)
Query: 89 LRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLL 148
+ +DQ RL +S +K++ + G++ + Y + +G P Q + +
Sbjct: 1 MAKDQARLQFLSSLVAKKSV---------VPIASGRGVIQSPSYIVKAKVGTPPQTLLMA 51
Query: 149 LDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCS 208
LD W CK C+ CS F+ KS TF + C + CK + PN C
Sbjct: 52 LDNSYDAAWIPCKGCVGCSST---VFNTVKSTTFKTLGCGAPQCKQV-----PN--PICG 101
Query: 209 SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIM 268
C ++ Y GS T ++ Y+A GC TG G++
Sbjct: 102 GSTCTWNTTY--GSSTILSNLTRDTIALSMDPVPYYA-----FGCIQKATGSSVPPQGLL 154
Query: 269 GLDRGPVSIISKTNISY---FFYCLHS-----PYGSTGYITFGKPDTVNKKFVKYTPIVT 320
G RGP+S +S+T Y F YCL S GS G+P +K TP++
Sbjct: 155 GFGRGPLSFLSQTQNLYKSTFSYCLPSFRTLNFSGSLRLGPVGQPPR-----IKTTPLLK 209
Query: 321 TPEQSEFYHITLTGISVGGE--RLPLKASYF---TKLSTEIDSGTIITRFPAPVYSALRS 375
P +S Y++ L GI VG + +P A F T T DSGT+ TR AP Y A+R+
Sbjct: 210 NPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFDSGTVFTRLVAPAYIAVRN 269
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVR 435
FRKR+ + FDTCY + +V P IT F G+++ + L++ S
Sbjct: 270 EFRKRVGNATVSS--LGGFDTCYSVP----IVPPTITFMF-SGMNVTMPPE-NLLIHSTA 321
Query: 436 QV--CLGFALLPSDPNSIL--LGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
V CL A P + NS+L + ++QQ+ + + +DV RLG C+
Sbjct: 322 GVTSCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRLGVAREQCS 370
>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 103/336 (30%), Positives = 164/336 (48%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPSFT 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ + F YCL S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + L ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 229 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L RG V SV++ CL FA P++ SI+
Sbjct: 287 RFDLGRRGVFVERSVQEQDVWCLAFA--PTESVSII 320
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 132/447 (29%), Positives = 204/447 (45%), Gaps = 48/447 (10%)
Query: 57 PGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTK 116
P +S+E++ R P S L K+ T L R R SRRL +
Sbjct: 23 PKNLSVELIHRDSPLSPLYNPKNTVTDRLNAAFLRSISR-----SRRLNNILSQT----- 72
Query: 117 AFTFPAKTGIVAAD-EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFD 175
++G++ AD E+++ + IG P V + DTGS +TW QCKPC C ++ P FD
Sbjct: 73 ----DLQSGLIGADGEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFD 128
Query: 176 PSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTI 235
KS T+ PC+S C L G D+ S C Y +Y D S G AT+ ++I
Sbjct: 129 KKKSSTYKSEPCDSRNCHALSS--SERGCDE-SKNVCKYRYSYGDQSFSKGDVATETISI 185
Query: 236 QEVNGNGYFARYP-FLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISY---FFYCL 290
+G+ +P + GC NN G SGI+GL G +S+IS+ S F YCL
Sbjct: 186 DSASGSP--VSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCL 243
Query: 291 HSPYGS---TGYITFGKPDTVNKKFVKYTPIVTTP----EQSEFYHITLTGISVGGERLP 343
+ T I G +++ K + +++TP E +Y++TL ISVG +++P
Sbjct: 244 SHKSATTNGTSVINLGT-NSIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIP 302
Query: 344 LKASYF----------TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL 393
S + T + IDSGT +T + + +A + + K + L
Sbjct: 303 YTGSSYNPNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKRVSDPQGL 362
Query: 394 FDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILL 453
C+ S + +P+IT+HF G D+ L V S VCL +++P+ +I
Sbjct: 363 LSHCFK-SGSAEIGLPEITVHFTGA-DVRLSPINAFVKVSEDMVCL--SMVPTTEVAI-Y 417
Query: 454 GNVQQRGYEVHYDVAGRRLGFGPGNCN 480
GN Q + V YD+ R + F +C+
Sbjct: 418 GNFAQMDFLVGYDLETRTVSFQRMDCS 444
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 104/379 (27%), Positives = 176/379 (46%), Gaps = 45/379 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQC-------KPCIHCSQQRDPFFDPSKSKTFSK 184
+ + V IG P Q +L++DTGS + WTQC + S+QR+P ++P +S +F+
Sbjct: 84 HSLTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFAY 143
Query: 185 IPCNSTTCKILLEWFPPNGQ---DKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNG 240
+PC+ C+ GQ C+ + C YD Y GS E G VN
Sbjct: 144 LPCSDRLCQ--------EGQFSYKNCARNNRCMYDELY--GSAEAGGVLASETFTFGVNA 193
Query: 241 NGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGS--TG 298
P GC + GD GASG+MGL G +S++S+ ++ F YCL +P+ T
Sbjct: 194 K---VSLPLGFGCGALSAGDLVGASGLMGLSPGIMSLVSQLSVPRFSYCL-TPFAERKTS 249
Query: 299 YITFGKPDTVNK----KFVKYTPIVTTPE-QSEFYHITLTGISVGGERLPLKASYFTKL- 352
+ FG + + V+ T I+ P ++ +Y++ L G+S+G +RL + A+ +
Sbjct: 250 PLLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSLGMIK 309
Query: 353 -----STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE---DLFDTCYDLS--- 401
T +DSG+ ++ + A++ A + + + + G + D ++ C+ L
Sbjct: 310 PDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAV-RLPVANGTDEDYDDYELCFALPTGV 368
Query: 402 AYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGY 461
A + V P + +HF GG + L +CL P ++GNVQQ+
Sbjct: 369 AMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRAGLMCLAVGTSPDGFGVSIIGNVQQQNM 428
Query: 462 EVHYDVAGRRLGFGPGNCN 480
V +DV ++ F P C+
Sbjct: 429 HVLFDVRNQKFSFAPTKCD 447
>gi|125595873|gb|EAZ35653.1| hypothetical protein OsJ_19940 [Oryza sativa Japonica Group]
Length = 468
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 108/351 (30%), Positives = 148/351 (42%), Gaps = 44/351 (12%)
Query: 137 AIGKPKQYVSLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
AI P + +DT + W QC PC C Q++ FDP +S+T + +PC S C
Sbjct: 154 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 213
Query: 195 LLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
L G G W + + C
Sbjct: 214 L---------------------------GRYGRWLLQQPVPVLRRLRRRQGQP-RGRTCH 245
Query: 255 DNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNK- 310
SG M L G S++S+T ++ F YC+ P S+G+++ G P
Sbjct: 246 AVRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCVPDP-SSSGFLSLGGPADGGGA 304
Query: 311 -KFVKYTPIVTTPEQ-SEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAP 368
+F + TP+V P Y + L GI VGG RL + F +DS IIT+ P
Sbjct: 305 GRFAR-TPLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAG-GAVMDSSVIITQLPPT 362
Query: 369 VYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGT 428
Y ALR AFR M Y G DTCYD + +V VP +++ F GG + LD G
Sbjct: 363 AYRALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGV 422
Query: 429 LVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+V + CL F P D +GNVQQ+ +EV YDV G +GF G C
Sbjct: 423 MV-----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 468
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 157/358 (43%), Gaps = 39/358 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + + +G P + +DTGS + WTQC PC +C Q P FDPS S TF
Sbjct: 61 YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFK-------- 112
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
+ +C+ C Y I Y D + G AT+ +TI +G F +
Sbjct: 113 ------------EKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEP-FVMPETTI 159
Query: 252 GCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTV 308
GC N++ + SG++GL GP S+I++ Y YC S T I FG V
Sbjct: 160 GCGHNSSWFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQ--GTSKINFGTNAIV 217
Query: 309 NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE--IDSGTIITRFP 366
V T + T + Y++ L +SVG + + F L IDSGT +T FP
Sbjct: 218 AGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYFP 277
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVV---PKITIHFLGGVDLEL 423
+ +R A + + T D+ Y T + P IT+HF GG DL L
Sbjct: 278 VSYCNLVREAVDHYVTAVRTAD------PTGNDMLCYYTDTIDIFPVITMHFSGGADLVL 331
Query: 424 DVRGTLVVESVRQVCLGFALLPSD-PNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
D + + +E++ + A++ ++ P + GN Q + V YD + + F P NC+
Sbjct: 332 D-KYNMYIETITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNCS 388
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 112/364 (30%), Positives = 175/364 (48%), Gaps = 32/364 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + ++IG P V + DTGS + WTQC PC+ C +Q++P FDPSKS +F ++ C S
Sbjct: 90 EYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQ 149
Query: 191 TCKILLEWFPPNGQDKCS--SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
C++L CS K C + Y DGS G AT+ +T+ +G +
Sbjct: 150 QCRLL-------DTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPX-SIXN 201
Query: 249 FLLGCTDNNTGDQN-GASGIMGLDRGPVSIISKTNISY-----FFYCLHSPYGS----TG 298
+ GC NN+G N G+ G P+S+ S+ + F CL P+ + T
Sbjct: 202 IVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCL-VPFRTDPSITS 260
Query: 299 YITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKAS--YFTKLSTEI 356
I FG V+ V TP+VT + +Y +TL GISVG + P +S TK + I
Sbjct: 261 KIIFGPEAEVSGSXVVSTPLVTK-DDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFI 319
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYKTVVVPKITIHF 415
D+GT T P Y+ L ++ + + DL CY + + P +T HF
Sbjct: 320 DAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQD--PDLQPQLCY--RSATLIDGPILTAHF 375
Query: 416 LGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
G D++L T + S ++ FA+ P D ++ + GN Q + + +D+ G+++ F
Sbjct: 376 -DGADVQLKPLNTFI--SPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFK 432
Query: 476 PGNC 479
+C
Sbjct: 433 AVDC 436
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 105/355 (29%), Positives = 159/355 (44%), Gaps = 34/355 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + +IG P Q +S L DTGS + W +C C C Q P + P+KS +FSK+PC+ +
Sbjct: 82 YDMTFSIGTPPQELSALADTGSDLIWAKCGACTRCVPQGSPSYYPNKSSSFSKLPCSGSL 141
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSG----ETGFWATDRMTI--QEVNGNGYFA 245
C L P+ Q EC Y +Y S G+ ++ T+ V G G+
Sbjct: 142 CSDL-----PSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTLGSDAVPGIGF-- 194
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKP 305
GCT + G SG++GL RGP+S++S+ N+ F YCL S T + FG
Sbjct: 195 ------GCTTMSEGGYGSGSGLVGLGRGPLSLVSQLNVGAFSYCLTSDAAKTSPLLFGS- 247
Query: 306 DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRF 365
+ V+ TP++ T + +Y + L IS+G S DSGT +
Sbjct: 248 GALTGAGVQSTPLLRT--STYYYTVNLESISIGAATTAGTGSS----GIIFDSGTTVAFL 301
Query: 366 PAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDV 425
P Y+ + A + M G D ++ C+ S V P + +HF GG D++L
Sbjct: 302 AEPAYTLAKEAVLSQTTNLTMASG-RDGYEVCFQTSG---AVFPSMVLHFDGG-DMDLPT 356
Query: 426 RGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
C ++ P+ ++GN+ Q Y + YDV L F P NC+
Sbjct: 357 ENYFGAVDDSVSCW---IVQKSPSLSIVGNIMQMNYHIRYDVEKSMLSFQPANCD 408
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 127/440 (28%), Positives = 200/440 (45%), Gaps = 52/440 (11%)
Query: 60 VSLEVLGRYGPCSKLNQGKSRNTPS----LEEILRRDQQRLHLKNSRRLQKAIPDNFKKT 115
+L+V +GPCS L G PS L + RD RL +S + K
Sbjct: 41 ATLQVSHAFGPCSPL--GAESAAPSWAGFLADQAARDASRLLYLDSLAV---------KG 89
Query: 116 KAFTFPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPF 173
+A+ P +G ++ Y + +G P Q + L +DT + W C C C
Sbjct: 90 RAYA-PIASGRQLLQTPTYVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSP-- 146
Query: 174 FDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRM 233
F+P+ S ++ +PC S C + PN ++K C + ++Y D S + + D +
Sbjct: 147 FNPAASASYRPVPCGSPQCVLA-----PNPSCSPNAKSCGFSLSYADSSLQAAL-SQDTL 200
Query: 234 TIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL 290
V G+ A + GC TG G++GL RGP+S +S+T Y F YCL
Sbjct: 201 ---AVAGDVVKA---YTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCL 254
Query: 291 HS--PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASY 348
S +G + G+ + +K TP++ P +S Y++ +TGI VG + + + AS
Sbjct: 255 PSFKSLNFSGTLRLGR--NGQPRRIKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASA 312
Query: 349 F-----TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAY 403
T T +DSGT+ TR APVY ALR R+R+ FDTCY+
Sbjct: 313 LAFDPATGAGTVLDSGTMFTRLVAPVYLALRDEVRRRVGAGAAAVSSLGGFDTCYN---- 368
Query: 404 KTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRG 460
TV P +T+ F G+ + L ++ + CL A P N++L + ++QQ+
Sbjct: 369 TTVAWPPVTLLF-DGMQVTLPEENVVIHTTYGTTSCLAMAAAPDGVNTVLNVIASMQQQN 427
Query: 461 YEVHYDVAGRRLGFGPGNCN 480
+ V +DV R+GF +C
Sbjct: 428 HRVLFDVPNGRVGFARESCT 447
>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 102/336 (30%), Positives = 164/336 (48%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVTSVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPSFT 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ ++ F YCL S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + L ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 229 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L RG V SV++ CL FA P++ SI+
Sbjct: 287 RFDLGSRGVFVERSVQEQDVWCLAFA--PTESVSII 320
>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 102/336 (30%), Positives = 164/336 (48%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPSFT 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ + F YCL S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + L ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 229 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L +G V SV++ CL FA P++ SI+
Sbjct: 287 RFDLGSKGVFVERSVQEQDVWCLAFA--PTESVSII 320
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 107/360 (29%), Positives = 173/360 (48%), Gaps = 32/360 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + V+IG P + DTGS + W QC PC+ C +Q P FDP KS +FS +PCNS
Sbjct: 91 EYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNSQ 150
Query: 191 TCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
CK + C ++ C Y Y D + G +++TI +
Sbjct: 151 NCKAI-------DDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITIGS-------SSVKS 196
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY-----FFYCLHSPYG-STGYITFG 303
++GC + G ASG++GL G +S++S+ + + F YCL + + G I FG
Sbjct: 197 VIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFG 256
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIIT 363
+ V+ V TP++ + +Y++TL IS+G ER A + + IDSGT ++
Sbjct: 257 QNAVVSGPGVVSTPLI-SKNPVTYYYVTLEAISIGNERHMASAK---QGNVIIDSGTTLS 312
Query: 364 RFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYD--LSAYKTVVVPKITIHFLGGVDL 421
P +Y + S+ K +K ++ K + +D C+D ++ + +P IT F GG ++
Sbjct: 313 FLPKELYDGVVSSLLKVVKAKRV-KDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANV 371
Query: 422 ELDVRGTLVVESVRQVCLGFALLPSDPNSI--LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L T + CL L P+ P ++GN+ + + YD+ +RL F P C
Sbjct: 372 NLLPVNTFQKVANNVNCL--TLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVC 429
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 102/336 (30%), Positives = 164/336 (48%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSASWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPSFT 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ ++ F YCL S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + L ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 229 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L G V SV++ CL FA P++ SI+
Sbjct: 287 RFDLGSHGVFVERSVQEQDVWCLAFA--PTESVSII 320
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 110/372 (29%), Positives = 175/372 (47%), Gaps = 34/372 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EYY + +G P Q L++DTGS +TW QC PC C+ D +D ++S ++ + CN++
Sbjct: 99 EYYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASYRPVTCNNS 158
Query: 191 TCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
L C+ +C + Y DGS G +TD + ++ V G F
Sbjct: 159 Q---LCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDF 215
Query: 250 LLGCTDNNTG-DQNGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGSTGYITF 302
GC + GASGI+GL+ G +++ + + F +C S STG + F
Sbjct: 216 AFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFF 275
Query: 303 GKPDTVNKKFVKYTPIVTTPE--QSEFYHITLTGISVGGERLPLKASYFTKLSTEI-DSG 359
G + +++ V+YT + T Q +FYH+ L G+S+ L + + S I DSG
Sbjct: 276 GNAELPHEQ-VQYTSVALTNSELQRKFYHVALKGVSINSHEL----VFLPRGSVVILDSG 330
Query: 360 TIITRFPAPVYSALRSAFRKRMK---KYKMGKGIEDLFDTCYDLS------AYKTVVVPK 410
+ + F P +S LR AF K K+ G DL TC+ +S ++T +P
Sbjct: 331 SSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDL-GTCFKVSNDDIDELHRT--LPS 387
Query: 411 ITIHFLGGVDLELDVRGTL--VVESVRQVCLGFALLPSDPNSI-LLGNVQQRGYEVHYDV 467
+++ F GV + + G L V V + FA PN + ++GN QQ+ V YD+
Sbjct: 388 LSLVFEDGVTIGIPSIGVLLPVARFQNHVKMCFAFEDGGPNPVNVIGNYQQQNLWVEYDI 447
Query: 468 AGRRLGFGPGNC 479
R+GF +C
Sbjct: 448 QRSRVGFARASC 459
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 120/382 (31%), Positives = 186/382 (48%), Gaps = 44/382 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
+ + + IG ++ +S ++DTGS QC S+ R P FDP+ S+++ ++PC S
Sbjct: 100 FSMQLGIGSLQKNLSAIIDTGSEAVLVQCG-----SRSR-PVFDPAASQSYRQVPCISQL 153
Query: 192 CKILLEWFPPNGQDKC--SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY-P 248
C + + C SS C Y ++Y D TG ++ D + + N +G ++
Sbjct: 154 CLAVQQQTSNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRD 213
Query: 249 FLLGCTDNNTG--DQNGASGIMGLDRGPVSIISKTNI----SYFFYCLHS-PYG--STGY 299
GC + G G+ GI+G +RG +S+ S+ S F YC S P+ +TG
Sbjct: 214 VAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGV 273
Query: 300 ITFGKPDTVNKKFVKYTPIV---TTPEQSEFYHITLTGISVGGERLPLKASYFTKLS--- 353
I G ++K V YTP++ TP +S+ Y++ LT ISV G+ L + S F KL
Sbjct: 274 IFLGDSG-LSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAF-KLDPST 331
Query: 354 ----TEIDSGTIITRFPAPVYSALRSAF----RKRMKKYKMGKGIEDLFDTCYDLSAYKT 405
T +DSGT TR Y+A R+AF R ++K K+G FD CY++SA +
Sbjct: 332 GDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRK-KVGAAAG--FDDCYNISAGSS 388
Query: 406 V-VVPKITIHFLGGVDLELDVRGTLVVESV--RQVCLGFALLPSDPNSI----LLGNVQQ 458
+ VP++ + V LEL V S +V + A+L S + +LGN QQ
Sbjct: 389 LPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQ 448
Query: 459 RGYEVHYDVAGRRLGFGPGNCN 480
Y V YD R+GF +C+
Sbjct: 449 SNYLVEYDNERSRVGFERADCS 470
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 112/364 (30%), Positives = 175/364 (48%), Gaps = 32/364 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + ++IG P V + DTGS + WTQC PC+ C +Q++P FDPSKS +F ++ C S
Sbjct: 90 EYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQ 149
Query: 191 TCKILLEWFPPNGQDKCS--SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
C++L CS K C + Y DGS G AT+ +T+ +G +
Sbjct: 150 QCRLL-------DTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPT-SILN 201
Query: 249 FLLGCTDNNTGDQN-GASGIMGLDRGPVSIISKTNISY-----FFYCLHSPYGS----TG 298
+ GC NN+G N G+ G P+S+ S+ + F CL P+ + T
Sbjct: 202 IVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCL-VPFRTDPSITS 260
Query: 299 YITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKAS--YFTKLSTEI 356
I FG V+ V TP+VT + +Y +TL GISVG + P +S TK + I
Sbjct: 261 KIIFGPEAEVSGSDVVSTPLVTK-DDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFI 319
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYKTVVVPKITIHF 415
D+GT T P Y+ L ++ + + DL CY + + P +T HF
Sbjct: 320 DAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQD--PDLQPQLCY--RSATLIDGPILTAHF 375
Query: 416 LGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
G D++L T + S ++ FA+ P D ++ + GN Q + + +D+ G+++ F
Sbjct: 376 -DGADVQLKPLNTFI--SPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFK 432
Query: 476 PGNC 479
+C
Sbjct: 433 AVDC 436
>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 102/336 (30%), Positives = 164/336 (48%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSASWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPSFT 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ ++ F YCL S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + L ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 229 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L G V SV++ CL FA P++ SI+
Sbjct: 287 RFDLGSHGVFVERSVQEQDVWCLAFA--PTESVSII 320
>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 102/336 (30%), Positives = 164/336 (48%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y V +G P + + +DTGS I+W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSISWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPSFT 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ ++ F YCL S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + L ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 229 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L G V SV++ CL FA P++ SI+
Sbjct: 287 RFDLGSSGVFVERSVQEQDVWCLAFA--PTESVSII 320
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 132/447 (29%), Positives = 203/447 (45%), Gaps = 48/447 (10%)
Query: 57 PGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTK 116
P S+E++ R P S + + T L R R SRR +
Sbjct: 23 PKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSR-----SRRFNHQLSQT----- 72
Query: 117 AFTFPAKTGIVAAD-EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFD 175
++G++ AD E+++ + IG P V + DTGS +TW QCKPC C ++ P FD
Sbjct: 73 ----DLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFD 128
Query: 176 PSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTI 235
KS T+ PC+S C+ L G D+ S+ C Y +Y D S G AT+ ++I
Sbjct: 129 KKKSSTYKSEPCDSRNCQALSST--ERGCDE-SNNICKYRYSYGDQSFSKGDVATETVSI 185
Query: 236 QEVNGNGYFARYP-FLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISY---FFYCL 290
+G+ +P + GC NN G SGI+GL G +S+IS+ S F YCL
Sbjct: 186 DSASGSP--VSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCL 243
Query: 291 HSPYGS---TGYITFGKPDTVNKKFVKYTPIVTTP----EQSEFYHITLTGISVGGERLP 343
+ T I G +++ K + +V+TP E +Y++TL ISVG +++P
Sbjct: 244 SHKSATTNGTSVINLGT-NSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIP 302
Query: 344 LKASYF----------TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL 393
S + T + IDSGT +T A + SA + + K + L
Sbjct: 303 YTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGL 362
Query: 394 FDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILL 453
C+ S + +P+IT+HF G D+ L V S VCL +++P+ +I
Sbjct: 363 LSHCFK-SGSAEIGLPEITVHFTGA-DVRLSPINAFVKLSEDMVCL--SMVPTTEVAI-Y 417
Query: 454 GNVQQRGYEVHYDVAGRRLGFGPGNCN 480
GN Q + V YD+ R + F +C+
Sbjct: 418 GNFAQMDFLVGYDLETRTVSFQHMDCS 444
>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 102/336 (30%), Positives = 164/336 (48%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPSFT 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ ++ F YCL S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + L ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 229 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L G V SV++ CL FA P++ SI+
Sbjct: 287 RFDLGSHGVFVERSVQEQDVWCLAFA--PTESVSII 320
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 131/442 (29%), Positives = 203/442 (45%), Gaps = 55/442 (12%)
Query: 72 SKLNQGKSRNTPSLEEILRRDQQRLHLKNS-----RRLQKAI------PDNFKKTKAFTF 120
+K +Q +++ + + RD R N +RLQKA ++F+ +A
Sbjct: 22 AKHSQAEAKVDGFTTDFISRDSPRSPFYNPSETKYQRLQKAFRRSILRGNHFRAIRASPN 81
Query: 121 PAKTGIVAADEYYIV-VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKS 179
++ +++ Y++ +++G P + + DTGS + W QC PC C +Q +P FDP KS
Sbjct: 82 DIQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKS 141
Query: 180 KTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYV--DGSGETGFWATDRMTIQE 237
KT+ + CN+ C+ L + + C+S D +Y D S ET TI
Sbjct: 142 KTYKTLGCNNDFCQDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSET-------FTIGS 194
Query: 238 VNGNGYFARYPFL-LGCTDNNTGDQN----GASGIMGLDRGPVSIISKTNISYFFYC--- 289
G+ A +P L GC +N G N G G+ G V +S F YC
Sbjct: 195 TEGDP--ASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVP 252
Query: 290 LHSPYGSTGYITFGKPDTVNKKFVKYTPIVT-TPEQSEFYHITLTGISVGGERLPLKASY 348
L S ++ I FGK V+ TP++ TP+ FY++TL G+S+G E++ K
Sbjct: 253 LSSDSTASSKINFGKSAVVSGSGTVSTPLIKGTPD--TFYYLTLEGMSLGSEKVAFKGFS 310
Query: 349 FTKLSTE--------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED---LFDTC 397
K S IDSGT +T P Y+ + SA K + G+ D F C
Sbjct: 311 KNKSSPAAAEESNIIIDSGTTLTLLPRDFYTDMESALTKVIG----GQTTTDPRGTFSLC 366
Query: 398 YDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQ 457
Y S K + +P IT HF+ G D++L T V VC F+++PS N + GN+
Sbjct: 367 Y--SGVKKLEIPTITAHFI-GADVQLPPLNTFVQAQEDLVC--FSMIPSS-NLAIFGNLS 420
Query: 458 QRGYEVHYDVAGRRLGFGPGNC 479
Q + V YD+ ++ F P +C
Sbjct: 421 QMNFLVGYDLKNNKVSFKPTDC 442
>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 101/336 (30%), Positives = 164/336 (48%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPSFT 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ ++ F YCL S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + L ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 229 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L + G V SV++ CL FA P++ SI+
Sbjct: 287 RFDLGIHGVFVERSVQEQDVWCLAFA--PTESVSII 320
>gi|242094480|ref|XP_002437730.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
gi|241915953|gb|EER89097.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
Length = 507
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 117/431 (27%), Positives = 183/431 (42%), Gaps = 54/431 (12%)
Query: 80 RNTPSLEEILRRDQQR---LHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAA-DEYYIV 135
R PSL ++LR+DQ R +H++ + + + +K P ++ ++ D+ I
Sbjct: 35 RPPPSLADLLRQDQLRVDHIHMRLLSSSSQGVRVSKQKQGPVKEPVRSEVIHLHDQPVIQ 94
Query: 136 VAIGKPKQYV--------------------SLLLDTGSGITWTQCKPCIHCSQQRDPF-- 173
V IG ++ +++LDT S + W QC P +
Sbjct: 95 VTIGSERKGASGGSGGSGDQQQSQAAGVVQTVVLDTASDVPWVQCHPLASSATTDSSSSS 154
Query: 174 FDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGET---GFWAT 230
+DP++S T+ + CNS C L + + C + +C Y + + G + +
Sbjct: 155 YDPARSSTYYALACNSAACTELGRLY----RGACVNNQCQYRVPIPSSPASSSSSGTYGS 210
Query: 231 DRMTIQEVNGNGYFARYPFLLGCTDNNTGD------QNGASGIMGLDRGPVSIISKTNIS 284
D + + +G A F GC+ N +GIM L GP S++S+
Sbjct: 211 DLLKLTADPADG--ASMSFKFGCSHGEAKQGGEGSIDNATAGIMALGGGPESLVSQNAAM 268
Query: 285 Y---FFYCL---HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVG 338
Y F YC+ S + G D TP++ Y + L I+V
Sbjct: 269 YGSAFSYCIPATESRRPGFFVLGGGVGDLSGAGGYAVTPMLRYARVPTLYRVRLLAIAVD 328
Query: 339 GERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCY 398
G++L + S F S +DS T ITR P Y ALR AFR RM Y+ +L DTCY
Sbjct: 329 GQQLNVTPSVFASGSV-LDSRTAITRLPPTAYQALREAFRSRMAMYREAPPQGNL-DTCY 386
Query: 399 DLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQ 458
D + V+VP++ + G + LD +G L + CL F D +LGNVQQ
Sbjct: 387 DFAGAFLVMVPRVALLLDGNAVVALDRQGILFHD-----CLVFTSNTDDRMPGILGNVQQ 441
Query: 459 RGYEVHYDVAG 469
+ EV Y+V G
Sbjct: 442 QTMEVLYNVGG 452
>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 102/336 (30%), Positives = 163/336 (48%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPSFT 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ + F YCL S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + L ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 229 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L G V SV++ CL FA P++ SI+
Sbjct: 287 RFDLGSHGVFVERSVQEQDVWCLAFA--PTESVSII 320
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 100/336 (29%), Positives = 162/336 (48%), Gaps = 29/336 (8%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPGFT 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ G +S++ +++ ++ F YCL S G +TGY
Sbjct: 111 FGCNMDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ G + V+YT +V + +E + + LT ISV GERL L S F++ DSG
Sbjct: 171 FSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSG 230
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 231 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 288
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L G V SV++ CL FA P++ SI+
Sbjct: 289 RFDLGSHGVFVERSVQEQDVWCLAFA--PTESVSII 322
>gi|297811183|ref|XP_002873475.1| hypothetical protein ARALYDRAFT_909036 [Arabidopsis lyrata subsp.
lyrata]
gi|297319312|gb|EFH49734.1| hypothetical protein ARALYDRAFT_909036 [Arabidopsis lyrata subsp.
lyrata]
Length = 292
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 94/279 (33%), Positives = 139/279 (49%), Gaps = 51/279 (18%)
Query: 204 QDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ-N 262
Q CS C Y + Y D S GF A ++ T+ + +F F GC +NNTGD
Sbjct: 63 QGSCSDSTCGYSVGYGDTSTSQGFVAKEKFTLMSSD---FFDGVNF--GCGENNTGDYYE 117
Query: 263 GASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTP 322
G +G++G G H +GSTG K VK+TP+ ++P
Sbjct: 118 GVAGLLGNTSG-----------------HLTFGSTGI----------SKSVKFTPVSSSP 150
Query: 323 EQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMK 382
+ +FY++ + GI+V ++L + + I+S T Y+AL+SAF+++M
Sbjct: 151 SK-DFYYLNIEGITVCDKQLEIPS---------IESST------PRAYAALKSAFKEKMS 194
Query: 383 KYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVR-QVCLGF 441
KY + + DTCYD + KTV + KI F GG +ELD +G L S R ++CL F
Sbjct: 195 KYTITSSGDSELDTCYDFTGLKTVTITKIAFSFSGGTVVELDPKGILYSSSERSKLCLAF 254
Query: 442 ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
A P D N + G+VQQ+ +V YD G R+GF P C+
Sbjct: 255 AEYP-DDNVAIFGSVQQQTLQVVYDGVGGRVGFAPNGCS 292
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 100/336 (29%), Positives = 162/336 (48%), Gaps = 29/336 (8%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPGFT 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ G +S++ +++ ++ F YCL S G +TGY
Sbjct: 111 FGCNMDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ G + V+YT +V + +E + + LT ISV GERL L S F++ DSG
Sbjct: 171 FSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSG 230
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 231 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 288
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L G V SV++ CL FA P++ SI+
Sbjct: 289 RFDLGRHGVFVERSVQEQDVWCLAFA--PTESVSII 322
>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 102/336 (30%), Positives = 163/336 (48%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPSFT 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ + F YCL S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + L ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 229 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L G V SV++ CL FA P++ SI+
Sbjct: 287 RFDLGSHGVFVERSVQEQDVWCLAFA--PTESVSII 320
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 121/410 (29%), Positives = 185/410 (45%), Gaps = 33/410 (8%)
Query: 82 TPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKP 141
TPS E ++ R ++ RRL+ + D+ + T P + EY + IG P
Sbjct: 49 TPS--ERIKNTVLRSFARSKRRLRLSQNDD-RSPGTITIPDE----PITEYLMRFYIGTP 101
Query: 142 KQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPP 201
+ DTGS + W QC PC C Q P FDP KS TF +PC+S C +L P
Sbjct: 102 PVERFAIADTGSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKTVPCDSQPCTLL-----P 156
Query: 202 NGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT--DNN 257
Q C K +C Y Y D + +G + + N F + F GCT +N+
Sbjct: 157 PSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKLTF--GCTFSNND 214
Query: 258 TGDQNGAS-GIMGLDRGPVSIISKT------NISYFFYCLHSPYGSTGYITFGKPDTVNK 310
T D++ + G++GL GP+S+IS+ SY F L S ST + FG V +
Sbjct: 215 TVDESKRNMGLVGLGVGPLSLISQLGYQIGRKFSYCFPPLSS--NSTSKMRFGNDAIVKQ 272
Query: 311 -KFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPV 369
K V TP++ +Y++ L G+S+G +++ S T + IDSGT T
Sbjct: 273 IKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTSESQ-TDGNILIDSGTSFTILKQSF 331
Query: 370 YSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTL 429
Y+ A K + + K +++ C++ + K P + F G + +D
Sbjct: 332 YNKF-VALVKEVYGVEAVKIPPLVYNFCFE-NKGKRKRFPDVVFLFTGA-KVRVDASNLF 388
Query: 430 VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
E +C+ AL SD + + GN Q GY+V YD+ G + F P +C
Sbjct: 389 EAEDNNLLCM-VALPTSDEDDSIFGNHAQIGYQVEYDLQGGMVSFAPADC 437
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 122/412 (29%), Positives = 190/412 (46%), Gaps = 47/412 (11%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQY 144
++ ++R Q+RL +LQ N + K P T + + EY I +AIG P
Sbjct: 1 MKRAIQRSQERL-----EKLQITSAVNTHQMKDIETPV-TPDIGSGEYLIQMAIGTPALS 54
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQ 204
+S ++DTGS + WT+C PC CS +DPS S T+SK+ C S+ C+ PP+
Sbjct: 55 LSAIMDTGSDLVWTKCNPCTDCSTSS--IYDPSSSSTYSKVLCQSSLCQ------PPSIF 106
Query: 205 DKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL-LGCTDNNTG-DQN 262
+ +C Y Y D S +G + + +I P + GC +N G D+
Sbjct: 107 SCNNDGDCEYVYPYGDRSSTSGILSDETFSISS-------QSLPNITFGCGHDNQGFDKV 159
Query: 263 GASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS--TGYITFGKPDTVNKKFVKYTP 317
G G++G RG +S++S+ S F YCL S S T + G ++ V TP
Sbjct: 160 G--GLVGFGRGSLSLVSQLGPSMGNKFSYCLVSRTDSSKTSPLFIGNTASLEATTVGSTP 217
Query: 318 IVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRFPAPVYSA 372
+V + + Y+++L GISVGG+ L + F S IDSGT +T Y A
Sbjct: 218 LVQS-SSTNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDA 276
Query: 373 LRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVE 432
++ A + + + D C++ P +T HF G D ++ L +
Sbjct: 277 VKEAMVSSINLPQA----DGQLDLCFNQQGSSNPGFPSMTFHF-KGADYDVPKENYLFPD 331
Query: 433 SVRQ-VCLGFALLPSDP---NSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
S VCL A++P++ N + GNVQQ+ Y++ YD L F P C+
Sbjct: 332 STSDIVCL--AMMPTNSNLGNMAIFGNVQQQNYQILYDNENNVLSFAPTACD 381
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 112/408 (27%), Positives = 178/408 (43%), Gaps = 27/408 (6%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQ 143
+E+++ DQ+R L + +R T +GI +Y+ + +G P +
Sbjct: 67 IEDVIGADQKRHSLISRKR---------NSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAK 117
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
+++DTGS +TW C+ R F +SK+F + C + TCK+ L
Sbjct: 118 KFRVVVDTGSELTWVNCRYRARGKDNRR-VFRADESKSFKTVGCLTQTCKVDLMNLFSLT 176
Query: 204 QDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTGDQ- 261
S C YD Y DGS G +A + +T+ NG AR P L+GC+ + TG
Sbjct: 177 TCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGR--MARLPGHLIGCSSSFTGQSF 234
Query: 262 NGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGSTGYITFGKPDTVNKKFVKY 315
GA G++GL S S Y F YCL S + Y+ FG + F +
Sbjct: 235 QGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRT 294
Query: 316 TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS---TEIDSGTIITRFPAPVYSA 372
TP+ T FY I + GIS+G + L + + + S T +DSGT +T Y
Sbjct: 295 TPLDLT-RIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQ 353
Query: 373 LRSAFRKRMKKYKMGKGIEDLFDTCYDL-SAYKTVVVPKITIHFLGGVDLELDVRGTLVV 431
+ + + + + K K + C+ S + +P++T H GG E + LV
Sbjct: 354 VVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVD 413
Query: 432 ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ CLGF + P + ++GN+ Q+ Y +D+ L F P C
Sbjct: 414 AAPGVKCLGF-VSAGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSAC 460
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 112/409 (27%), Positives = 178/409 (43%), Gaps = 27/409 (6%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQ 143
+E+++ DQ+R L + +R T +GI +Y+ + +G P +
Sbjct: 45 IEDVIGADQKRHSLISRKR---------NSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAK 95
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
+++DTGS +TW C+ R F +SK+F + C + TCK+ L
Sbjct: 96 KFRVVVDTGSELTWVNCRYRARGKDNRR-VFRADESKSFKTVGCLTQTCKVDLMNLFSLT 154
Query: 204 QDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTGDQ- 261
S C YD Y DGS G +A + +T+ NG AR P L+GC+ + TG
Sbjct: 155 TCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGR--MARLPGHLIGCSSSFTGQSF 212
Query: 262 NGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGSTGYITFGKPDTVNKKFVKY 315
GA G++GL S S Y F YCL S + Y+ FG + F +
Sbjct: 213 QGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRT 272
Query: 316 TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS---TEIDSGTIITRFPAPVYSA 372
TP+ T FY I + GIS+G + L + + + S T +DSGT +T Y
Sbjct: 273 TPLDLT-RIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQ 331
Query: 373 LRSAFRKRMKKYKMGKGIEDLFDTCYDL-SAYKTVVVPKITIHFLGGVDLELDVRGTLVV 431
+ + + + + K K + C+ S + +P++T H GG E + LV
Sbjct: 332 VVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVD 391
Query: 432 ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ CLGF + P + ++GN+ Q+ Y +D+ L F P C
Sbjct: 392 AAPGVKCLGF-VSAGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSACT 439
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 113/374 (30%), Positives = 172/374 (45%), Gaps = 26/374 (6%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
V + EY + V +G P + +++DTGS + W QC PC+ C Q P FDP+ S ++ +
Sbjct: 146 VGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFDQVGPVFDPAASSSYRNVT 205
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
C C ++ PP + CPY Y D S TG A + T+
Sbjct: 206 CGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRV 265
Query: 247 YPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS--TGYIT 301
+ GC N G +GA+G++GL RGP+S S+ Y F YCL +GS +
Sbjct: 266 DDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVD-HGSDVASKVV 324
Query: 302 FGKPDTVNKKF----VKYTPIVTTPEQSE-FYHITLTGISVGGERLPLKASYF------- 349
FG+ D + + YT ++ FY++ L G+ VGGE L + + +
Sbjct: 325 FGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWGVGEGEG 384
Query: 350 TKLSTEIDSGTIITRFPAPVYSALRSAFRKRM-KKYKMGKGIED--LFDTCYDLSAYKTV 406
T IDSGT ++ F P Y +R AF RM + Y + I D + CY++S
Sbjct: 385 GSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPL---IPDFPVLSPCYNVSGVDRP 441
Query: 407 VVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHY 465
VP++++ F G + + ++ +CL P SI +GN QQ+ + V Y
Sbjct: 442 EVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSI-IGNFQQQNFHVVY 500
Query: 466 DVAGRRLGFGPGNC 479
D+ RLGF P C
Sbjct: 501 DLKNNRLGFAPRRC 514
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 131/438 (29%), Positives = 191/438 (43%), Gaps = 54/438 (12%)
Query: 61 SLEVLGRYGPCSKLNQGKSRN-TPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFT 119
+LEV + PCS K + S+ ++ +DQ RL S ++I
Sbjct: 34 TLEVFHVFSPCSPFRPSKPLSWAESVLQLQAKDQARLQFLASMVAGRSI----------- 82
Query: 120 FPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPS 177
P +G I+ + Y + IG P Q + L +DT + W C C C+ F P
Sbjct: 83 VPIASGRQIIQSPTYIVRAKIGTPPQTLLLAIDTSNDAAWIPCTACDGCTST---LFAPE 139
Query: 178 KSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE 237
KS TF + C S C + C + C +++ Y S D +T+
Sbjct: 140 KSTTFKNVSCGSPECNKV-------PSPSCGTSACTFNLTY-GSSSIAANVVQDTVTLAT 191
Query: 238 VNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS-- 292
GY GC TG G++GL RGP+S++S+T Y F YCL S
Sbjct: 192 DPIPGY------TFGCVAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFK 245
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYF- 349
+G + G +KYTP++ P +S Y++ L I VG + +P A F
Sbjct: 246 SLNFSGSLRLGP--VAQPIRIKYTPLLKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFN 303
Query: 350 --TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMK-KYKMGKGIEDL--FDTCYDLSAYK 404
T T DSGT+ TR APVY+A+R FR+R+ K + L FDTCY +
Sbjct: 304 AATGAGTVFDSGTVFTRLVAPVYTAVRDEFRRRVAMAAKANLTVTSLGGFDTCYTVP--- 360
Query: 405 TVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRGY 461
+V P IT F G+++ L L+ + CL A P + NS+L + N+QQ+ +
Sbjct: 361 -IVAPTITFMF-SGMNVTLPQDNILIHSTAGSTSCLAMASAPDNVNSVLNVIANMQQQNH 418
Query: 462 EVHYDVAGRRLGFGPGNC 479
V YDV RLG C
Sbjct: 419 RVLYDVPNSRLGVARELC 436
>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/336 (30%), Positives = 164/336 (48%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPGFS 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ G +S++ +++ ++ F YCL S G +TGY
Sbjct: 111 FGCNMDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + LT ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 229 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L G V SV++ CL FA P++ SI+
Sbjct: 287 RFDLGSHGVFVERSVQEQDVWCLAFA--PTESVSII 320
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 174/371 (46%), Gaps = 32/371 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EYY + +G P Q L++DTGS +TW +C PC C+ D +D ++S ++ + CN++
Sbjct: 99 EYYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKPVTCNNS 158
Query: 191 TCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
L C+ +C + Y DGS G +TD + ++ V G F
Sbjct: 159 Q---LCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDF 215
Query: 250 LLGCTDNNTG-DQNGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGSTGYITF 302
GC + GASGI+GL+ G +++ + + F +C S STG + F
Sbjct: 216 AFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFF 275
Query: 303 GKPDTVNKKFVKYTPIVTTPE--QSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGT 360
G + +++ V+YT + T Q +FYH+ L G+S+ L L + +DSG+
Sbjct: 276 GNAELPHEQ-VQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPRGSVVI---LDSGS 331
Query: 361 IITRFPAPVYSALRSAFRKRMK---KYKMGKGIEDLFDTCYDLS------AYKTVVVPKI 411
+ F P +S LR AF K K+ G DL TC+ +S ++T +P +
Sbjct: 332 SFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDL-GTCFKVSNDDIDELHRT--LPSL 388
Query: 412 TIHFLGGVDLELDVRGTL--VVESVRQVCLGFALLPSDPNSI-LLGNVQQRGYEVHYDVA 468
++ F GV + + G L V V + FA PN + ++GN QQ+ V YD+
Sbjct: 389 SLVFEDGVTIGIPSIGVLLPVARYQNHVKMCFAFEDGGPNPVNVIGNYQQQNLWVEYDIQ 448
Query: 469 GRRLGFGPGNC 479
R+GF +C
Sbjct: 449 RSRVGFARASC 459
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 132/491 (26%), Positives = 194/491 (39%), Gaps = 74/491 (15%)
Query: 53 LPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRD-------QQRLHLKNSRRLQ 105
LP + LE++ R+ G +++ + RD QR + N R +
Sbjct: 26 LPVAVNSMRLELVHRHHERFSGGGGDVDQVEAVKGFVNRDGLRRQRMNQRWGVSNYDRRR 85
Query: 106 KAIPDNFKKTKAFTFPAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQC---- 160
K + T P + G A EY+ V +G P Q L DTGS TW C
Sbjct: 86 KGL--ETTTTTEVEMPMRAGRDDALGEYFTEVKVGSPGQRFWLAADTGSEFTWFNCVMRN 143
Query: 161 -------------------------------KPCIHCSQQRDP---FFDPSKSKTFSKIP 186
+ + +P F P +SK+F +
Sbjct: 144 ATTTATTKKTRKNKTKKKHHHHSKRNRTRTTRRTKKKKAKSNPCKGVFCPHRSKSFQAVT 203
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNG-NGYFA 245
C S CKI L S C YDI+Y DGS GF+ TD +T+ NG G
Sbjct: 204 CASQKCKIDLSQLFSLSLCPKPSDPCLYDISYADGSSAKGFFGTDTITVDLKNGKEGKLN 263
Query: 246 RYPFLLGCT---DNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGS 296
+GCT +N GI+GL S I K Y F YCL S
Sbjct: 264 N--LTIGCTKSMENGVNFNEDTGGILGLGFAKDSFIDKAAYEYGAKFSYCLVDHLSHRNV 321
Query: 297 TGYITFGKPDTVNKKF---VKYTPIVTTPEQSEFYHITLTGISVGGERL---PLKASYFT 350
+ Y+T G N K +K T ++ P FY + + GIS+GG+ L P + +
Sbjct: 322 SSYLTIGGHH--NAKLLGEIKRTELILFP---PFYGVNVVGISIGGQMLKIPPQVWDFNS 376
Query: 351 KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--FDTCYDLSAYKTVVV 408
+ T IDSGT +T P Y + A K + K K G ED D C+D + VV
Sbjct: 377 QGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVKRVTG-EDFGALDFCFDAEGFDDSVV 435
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVA 468
P++ HF GG E V+ ++ + C+G + + ++GN+ Q+ + +D++
Sbjct: 436 PRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGIVPIDGIGGASVIGNIMQQNHLWEFDLS 495
Query: 469 GRRLGFGPGNC 479
+GF P C
Sbjct: 496 TNTIGFAPSIC 506
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 175/381 (45%), Gaps = 48/381 (12%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + + IG P+ Y S +DT S + W QC+PC+ C +Q DP F+P S +++ +PC+S
Sbjct: 87 EYLVKLGIGTPQHYFSAAIDTASDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCSSD 146
Query: 191 TCKILLEWFPPNGQ--DKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
TC L +G D+ + C Y+ Y + G A D++ V GN + A
Sbjct: 147 TCSQL------DGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLA---VGGNVFHA--- 194
Query: 249 FLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST-GYITFGK-- 304
+LGC+D++ G ASG++GL RGP+S++S+ ++ F YCL P T G + G
Sbjct: 195 VVLGCSDSSVGGPPPQASGLVGLARGPLSLLSQLSVRRFMYCLPPPMSRTPGKLVLGAGA 254
Query: 305 -PDTVNKKFVKYTPIVTTPEQ-SEFYHITLTGISVGGE-----RLPLK------------ 345
D V + T +++ + +Y++ G++VG + R P
Sbjct: 255 GADAVRNVSDRVTVTMSSSTRYPSYYYLNFDGLAVGDQTPGTIRRPTSPPATGGGVGGGG 314
Query: 346 ---ASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS- 401
S +D + I+ A +Y L + ++ + D C+ L
Sbjct: 315 GDGGSGANAYGMIVDVASTISFLEASLYDELADDLEEEIRLPRATPSTRLGLDLCFILPE 374
Query: 402 --AYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQR 459
V VP +++ F G LEL+ R L +E R +CL ++ +LGN QQ+
Sbjct: 375 GVGIDRVYVPTVSMSF-DGRWLELE-RDRLFLEDGRMMCL---MIGRTSGVSILGNYQQQ 429
Query: 460 GYEVHYDVAGRRLGFGPGNCN 480
V Y++ ++ F +C+
Sbjct: 430 NMHVLYNLRRGKITFAKASCD 450
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 111/369 (30%), Positives = 166/369 (44%), Gaps = 57/369 (15%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + + +G P + +DTGS I WTQC PC +C Q P FDPSKS TF
Sbjct: 421 YLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFR-------- 472
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
+ +C+ C Y+I Y D + G AT+ +TI +G F +
Sbjct: 473 ------------EQRCNGNSCHYEIIYADKTYSKGILATETVTIPSTSGEP-FVMAETKI 519
Query: 252 GCTDNNTGDQ-----NGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS-----TG 298
GC +NT Q + +SGI+GL+ GP+S+IS+ ++ Y YC S T
Sbjct: 520 GCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCFSGQGTSKINFGTN 579
Query: 299 YITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE--- 355
I G F+K + + FY++ L +SV L A+ T E
Sbjct: 580 AIVAGDGTVAADMFIK--------KDNPFYYLNLDAVSVEDN---LIATLGTPFHAEDGN 628
Query: 356 --IDSGTIITRFPAPVYSALRSAFRKRMKKYKM-GKGIEDLFDTCYDLSAYKTVVVPKIT 412
IDSGT +T FP + +R A + + K+ G ++L CY + P IT
Sbjct: 629 IFIDSGTTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDNLL--CYYSDTID--IFPVIT 684
Query: 413 IHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNS-ILLGNVQQRGYEVHYDVAGRR 471
+HF GG DL LD + + +E++ A+ +DP+ + GN Q + V YD +
Sbjct: 685 MHFSGGADLVLD-KYNMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGYDPSSNV 743
Query: 472 LGFGPGNCN 480
+ F P NC+
Sbjct: 744 ISFSPTNCS 752
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 105/361 (29%), Positives = 159/361 (44%), Gaps = 57/361 (15%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + + +G P ++ +DTGS + WTQC PC C Q DP FDPSKS TF+
Sbjct: 82 YLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFN-------- 133
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
+ +C K C Y+I Y D + G AT+ +TI +G F +
Sbjct: 134 ------------EQRCHGKSCHYEIIYEDNTYSKGILATETVTIHSTSGEP-FVMAETTI 180
Query: 252 GC----TD-NNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS-----TG 298
GC TD +N+G + +SGI+GL+ GP S+IS+ ++ Y YC S T
Sbjct: 181 GCGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLISYCFSGQGTSKINFGTN 240
Query: 299 YITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE--I 356
I G F+K + + FY++ L +SV R+ + F I
Sbjct: 241 AIVAGDGTVAADMFIK--------KDNPFYYLNLDAVSVEDNRIETLGTPFHAEDGNIVI 292
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKM----GKGIEDLFDTCYDLSAYKTVVVPKIT 412
DSG+ +T FP + +R A + + ++ G + F D + P IT
Sbjct: 293 DSGSTVTYFPVSYCNLVRKAVEQVVTAVRVPDPSGNDMLCYFSETID-------IFPVIT 345
Query: 413 IHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDP-NSILLGNVQQRGYEVHYDVAGRR 471
+HF GG DL LD + + +ES A++ + P + GN Q + V YD +
Sbjct: 346 MHFSGGADLVLD-KYNMYMESNSGGLFCLAIICNSPTQEAIFGNRAQNNFLVGYDSSSLL 404
Query: 472 L 472
L
Sbjct: 405 L 405
>gi|147794033|emb|CAN68918.1| hypothetical protein VITISV_035156 [Vitis vinifera]
Length = 398
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 128/452 (28%), Positives = 183/452 (40%), Gaps = 108/452 (23%)
Query: 37 VSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRL 96
VSSL+P C + QG L + +YGPCS G S+ PS +EI RD+ R+
Sbjct: 46 VSSLLPKNKCLASARGGSQG-----LPITQKYGPCS--GSGHSQ-PPSPQEIXGRDESRV 97
Query: 97 HLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIV-VAIGKPKQYVSLLLDTGSGI 155
NS+ + N K + D ++V VA G P Q L+LDTGS I
Sbjct: 98 SFINSK-CNQYTSGNLKN-----HAHNNNLFDEDGNFLVDVAFGTPPQXFXLILDTGSSI 151
Query: 156 TWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYD 215
TWTQCK C++C Q +FB S S T+S C T E Y+
Sbjct: 152 TWTQCKACVNCLQDSXRYFBXSASSTYSXGSCIPXTV------------------ENNYN 193
Query: 216 IAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-QNGASGIMGLDRGP 274
+ Y D S G + MT++ + F ++ F G NN GD +GA G++GL +G
Sbjct: 194 MTYGDDSTSVGNYGCXTMTLEPSD---VFQKFQF--GXGRNNKGDFGSGADGMLGLGQGQ 248
Query: 275 VSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHIT 331
+S +S+T + F YCL S G + FG+ T +K+T +V P
Sbjct: 249 LSTVSQTASKFXKVFSYCLPEE-DSIGSLLFGEKATSQSSSLKFTSLVNGP--------- 298
Query: 332 LTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE 391
G L YF KL
Sbjct: 299 ------GTSGLXESGYYFVKL--------------------------------------- 313
Query: 392 DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFA---LLPSDP 448
D D V++P+I +HF GG D+ L+ + ++CL FA +P
Sbjct: 314 --LDISVD------VLLPEIVLHFGGGADVRLNGTNIVWGSDASRLCLAFAGNSKSTMNP 365
Query: 449 NSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
++GN QQ V YD+ G R+GF C+
Sbjct: 366 ELTIIGNRQQLSLTVLYDIQGGRIGFRSNGCS 397
>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/336 (30%), Positives = 163/336 (48%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPGFS 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ ++ F YCL S G +TGY
Sbjct: 111 FGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + L ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARKKNTELFFVDLIAISVDGERLGLSPSVFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + K G E+ CYD+ + +P I++HF
Sbjct: 229 SELSYIPDRALSVLSQRIRELL--LKRGAAEEESERNCYDMRSVDEGDMPAISLHFDDAA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L G V SV++ CL FA P++ SI+
Sbjct: 287 RFDLGSHGVFVERSVQEQDVWCLAFA--PTESVSII 320
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 129/431 (29%), Positives = 187/431 (43%), Gaps = 61/431 (14%)
Query: 61 SLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTF 120
+LE++ R S Q + +RR R+ ++F K +
Sbjct: 30 TLELIHRDSSKSPFYQPTQNKYERIANAVRRSINRV-------------NHFYKYSLTST 76
Query: 121 PAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSK 180
P T EY + +IG P V +DTGS + W QC+PC C Q P FDPS S
Sbjct: 77 PQSTVNSDKGEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITPIFDPSLSS 136
Query: 181 TFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNG 240
++ IPC S TC S + D+ G+ + + +T+
Sbjct: 137 SYQNIPCLSDTCH--------------SMRTTSCDVR--------GYLSVETLTLDST-- 172
Query: 241 NGYFARYP-FLLGCTDNNTGDQNG-ASGIMGLDRGPVSIISKTNISY---FFYCLHSPY- 294
GY +P ++GC NTG +G +SGI+GL GP+S+ S+ S F YCL P+
Sbjct: 173 TGYSVSFPKTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCL-GPWL 231
Query: 295 -GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF--TK 351
ST + FG V TPIV QS +Y +TL SVG + + + +
Sbjct: 232 PNSTSKLNFGDAAIVYGDGAMTTPIVKKDAQSGYY-LTLEAFSVGNKLIEFGGPTYGGNE 290
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED---LFDTCYDLSAYKTVVV 408
+ IDSGT T P VY SA + +Y + +ED F CY++ AY
Sbjct: 291 GNILIDSGTTFTFLPYDVYYRFESA----VAEYINLEHVEDPNGTFKLCYNV-AYHGFEA 345
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVA 468
P IT HF G D++L T + S CL F +PS + + GNV Q+ V Y++
Sbjct: 346 PLITAHF-KGADIKLYYISTFIKVSDGIACLAF--IPSQ--TAIFGNVAQQNLLVGYNLV 400
Query: 469 GRRLGFGPGNC 479
+ F P +C
Sbjct: 401 QNTVTFKPVDC 411
>gi|242086418|ref|XP_002443634.1| hypothetical protein SORBIDRAFT_08g022645 [Sorghum bicolor]
gi|241944327|gb|EES17472.1| hypothetical protein SORBIDRAFT_08g022645 [Sorghum bicolor]
Length = 486
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 130/463 (28%), Positives = 194/463 (41%), Gaps = 66/463 (14%)
Query: 37 VSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRL 96
S L P T C+ T L L ++ R P S L+ S T ++L RD +
Sbjct: 54 ASRLPPATTCSSMATGLDNN----KLPIVHRQSPWSPLHGLPSLTT---ADVLHRDTSLV 106
Query: 97 HLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD------------EYYIVVAIGKPKQY 144
+ Q ++ T A + PA I+ A+ +Y ++V+ G P+Q
Sbjct: 107 RRRRRFSSQSSV--VAAPTPALS-PAAATIIPANGSSDPSTLPGALDYIVLVSYGSPEQQ 163
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQ 204
+ L T G + +CKPC S +P FD +S TF+ +PC+S C +
Sbjct: 164 FPVFLGTNVGTSLLRCKPCASGSDDCNPAFDTLQSSTFAHVPCSSPDCPV---------- 213
Query: 205 DKCSSKECP-YDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQN- 262
CSS CP YD+ G G +ATD +T+ + A + F C D + +
Sbjct: 214 -NCSSSVCPFYDLYGTVG----GTFATDVLTLAPSS----MAVHDFRFVCMDVESPSPDL 264
Query: 263 GASGIMGLDR---------GPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTV---NK 310
+G + L R S I+ T S F YCL S G+++ G TV +
Sbjct: 265 PEAGSIDLSRHRNSLPSQLSSSSGIAPTAAS-FSYCLPQSRNSQGFLSLGGDATVVGDDD 323
Query: 311 KFVKYTPIV--TTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAP 368
+ P+V P+ + Y I L G+S+GGE LP+ + F ST +D G T
Sbjct: 324 NLTVHAPMVWNNDPDLASMYFIDLVGMSLGGEDLPIPSGTFGNASTNLDVGATFTMLAPE 383
Query: 369 VYSALRSAFRKRMKKY--KMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVR 426
Y+ LR AFRK M +Y + D FDTC++ + +VVP + + F G L +D
Sbjct: 384 AYTTLRDAFRKEMSQYNNRSSPAGFDGFDTCFNFTGLNELVVPLVQLKFSNGESLMIDGD 443
Query: 427 GTL-----VVESVRQVCLGFALLP-SDPNSILLGNVQQRGYEV 463
L CL F+ L D S ++G EV
Sbjct: 444 QMLYYHDPAAGPFTMACLAFSSLDVGDSFSAVIGTYTLASTEV 486
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 109/393 (27%), Positives = 178/393 (45%), Gaps = 30/393 (7%)
Query: 98 LKNSRRLQKA--IPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGI 155
L++ RLQ+ D K ++ P K EY + IG P ++DTGS +
Sbjct: 59 LRSMSRLQRVSHFLDENKLPESLLIPDK------GEYLMRFYIGSPPVERLAMVDTGSSL 112
Query: 156 TWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYD 215
W QC PC +C Q P F+P KS T+ C+S C +L P+ +D +C Y
Sbjct: 113 IWLQCSPCHNCFPQETPLFEPLKSSTYKYATCDSQPCTLL----QPSQRDCGKLGQCIYG 168
Query: 216 IAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGC-TDNN--TGDQNGASGIMGLDR 272
I Y D S G T+ ++ G + + GC DNN N GI GL
Sbjct: 169 IMYGDKSFSVGILGTETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGA 228
Query: 273 GPVSIISKTNISY---FFYCLHSPYGSTGY--ITFGKPDTVNKKFVKYTPIVTTPEQSEF 327
GP+S++S+ F YCL PY ST + FG + V TP++ P +
Sbjct: 229 GPLSLVSQLGAQIGHKFSYCLL-PYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTY 287
Query: 328 YHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMG 387
Y + L +++G + + ++ T + IDSGT +T Y+ ++ ++ + K+
Sbjct: 288 YFLNLEAVTIGQK---VVSTGQTDGNIVIDSGTPLTYLENTFYNNFVASLQETL-GVKLL 343
Query: 388 KGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSD 447
+ + TC+ A + +P I F G + L + L+ + + L A++PS
Sbjct: 344 QDLPSPLKTCFPNRA--NLAIPDIAFQFTGA-SVALRPKNVLIPLTDSNI-LCLAVVPSS 399
Query: 448 PNSI-LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
I L G++ Q ++V YD+ G+++ F P +C
Sbjct: 400 GIGISLFGSIAQYDFQVEYDLEGKKVSFAPTDC 432
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 116/374 (31%), Positives = 177/374 (47%), Gaps = 26/374 (6%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
V + EY + V +G P + +++DTGS + W QC PC+ C +QR P FDP+ S ++ +
Sbjct: 141 VGSAEYLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNLT 200
Query: 187 CNSTTCKILLEWFPPNGQD--KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF 244
C C + P + + CPY Y D S TG A + T+ + G
Sbjct: 201 CGDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVN-LTAPGAS 259
Query: 245 ARYP-FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY----FFYCLHSPYGS--T 297
+R + GC N G +GA+G++GL RGP+S S+ Y F YCL +GS
Sbjct: 260 SRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGGHTFSYCLVD-HGSDVA 318
Query: 298 GYITFGKPDTV------NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLK-----A 346
+ FG+ D + K+ + P ++P + FY++ LTG+ VGGE L + A
Sbjct: 319 SKVVFGEDDALALAAHPRLKYTAFAP-ASSPADT-FYYVRLTGVLVGGELLNISSDTWDA 376
Query: 347 SYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV 406
S T IDSGT ++ F P Y +R AF RM + CY++S +
Sbjct: 377 SEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLSPCYNVSGVERP 436
Query: 407 VVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHY 465
VP++++ F G + + ++ +CL P SI +GN QQ+ + V Y
Sbjct: 437 EVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSI-IGNFQQQNFHVAY 495
Query: 466 DVAGRRLGFGPGNC 479
D+ RLGF P C
Sbjct: 496 DLHNNRLGFAPRRC 509
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 108/393 (27%), Positives = 169/393 (43%), Gaps = 27/393 (6%)
Query: 98 LKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITW 157
L++ +L +A + + K + I EY + IG P + DT S + W
Sbjct: 59 LRSIYQLNRASHSDLNEKKTL---ERVRIPNHGEYLMRFYIGTPPVERLAIADTASDLIW 115
Query: 158 TQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIA 217
QC PC C Q P F+P KS TF+ + C+S C ++ P C Y
Sbjct: 116 VQCSPCETCFPQDTPLFEPHKSSTFANLSCDSQPCTSSNIYYCP-----LVGNLCLYTNT 170
Query: 218 YVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ---NGASGIMGLDRGP 274
Y DGS G T+ + G+ + GC NN N +GI+GL GP
Sbjct: 171 YGDGSSTKGVLCTESIHF----GSQTVTFPKTIFGCGSNNDFMHQISNKVTGIVGLGAGP 226
Query: 275 VSIISKT--NISY-FFYCLHSPYGSTGYI--TFGKPDTVNKKFVKYTPIVTTPEQSEFYH 329
+S++S+ I + F YCL P+ ST I FG T+ V TP++ P +Y
Sbjct: 227 LSLVSQLGDQIGHKFSYCL-LPFTSTSTIKLKFGNDTTITGNGVVSTPLIIDPHYPSYYF 285
Query: 330 ITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKG 389
+ L GI++G + L ++ + T + ID GT++T Y + R+ + +
Sbjct: 286 LHLVGITIGQKMLQVRTTDHTNGNIIIDLGTVLTYLEVNFYHNFVTLLREALGISETKDD 345
Query: 390 IEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPS--D 447
I FD C+ A + PKI F G + + +CL A+LP
Sbjct: 346 IPYPFDFCFPNQA--NITFPKIVFQFTGAKVFLSPKNLFFRFDDLNMICL--AVLPDFYA 401
Query: 448 PNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ GN+ Q ++V YD G+++ F P +C+
Sbjct: 402 KGFSVFGNLAQVDFQVEYDRKGKKVSFAPADCS 434
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 114/418 (27%), Positives = 194/418 (46%), Gaps = 42/418 (10%)
Query: 93 QQRLHLKNSRRLQKAIPDNFKKTKAFT------FPAKTGIVAAD-EYYIVVAIGKPK-QY 144
+Q L N+RR + + + KAF P +G + +Y++ + IG P+ Q
Sbjct: 73 RQLLQSDNARRQMISSLRHGTRRKAFEVSHTAQIPIHSGADSGQSQYFVSIRIGTPRPQK 132
Query: 145 VSLLLDTGSGITWTQC----KPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFP 200
L+ DTGS +TW C K C + F + S +F IPC+S CKI L
Sbjct: 133 FILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRANDSSSFRTIPCSSDDCKIEL---- 188
Query: 201 PNGQDKCSSKECP-------YDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGC 253
QD S ECP +D Y++G G +A + +T+ +N + + L+GC
Sbjct: 189 ---QDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTVG-LNDHKKIRLFDVLIGC 244
Query: 254 TDNNTGDQNGASGIMGLDRGPVSI---ISKTNISYFFYCLHSPYGSTG---YITFGKPDT 307
T++ G+MGL S+ +++ + F YCL S+ +++FG
Sbjct: 245 TESFNETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLVDHLSSSNHKNFLSFGDIPE 304
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE---IDSGTIITR 364
+ +++T ++ + FY + ++GISVGG L + + + +DSGT +T
Sbjct: 305 MKLPKMQHTELLLG-YINAFYPVNVSGISVGGSMLSISSDIWNVTGVGGMIVDSGTSLTM 363
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKGIE--DLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
Y + A + K+K IE +L + C++ + VP++ IHF G +
Sbjct: 364 LAGEAYDKVVDALKPIFDKHKKVVPIELPELNNFCFEDKGFDRAAVPRLLIHFADGAIFK 423
Query: 423 LDVRGTLVVESVRQVCLGFALLPSD-PNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
V+ ++ + CLG ++ +D P S +LGNV Q+ + YD+ +LGFGP +C
Sbjct: 424 PPVKSYIIDVAEGIKCLG--IIKADFPGSSILGNVMQQNHLWEYDLGRGKLGFGPSSC 479
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 101/336 (30%), Positives = 163/336 (48%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPSFT 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ GP+S++ +++ ++ F YCL S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + L ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 229 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L G V SV++ CL FA P++ SI+
Sbjct: 287 RFDLGRHGVFVERSVQEQDVWCLAFA--PTESVSII 320
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 127/437 (29%), Positives = 196/437 (44%), Gaps = 53/437 (12%)
Query: 61 SLEVLGRYGPCSKLNQGKSRN-TPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFT 119
+L+V + PCS K + S+ ++ +DQ R+ +S +++I
Sbjct: 35 TLQVFHVFSPCSPFRPSKPMSWEESVLKLQAKDQARMQYLSSLVARRSI----------- 83
Query: 120 FPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPS 177
P +G I + Y + IG P Q + L +DT + +W C C+ CS F P+
Sbjct: 84 VPIASGRQITQSPTYIVKAKIGTPAQTLLLAMDTSNDASWVPCTACVGCSTTTP--FAPA 141
Query: 178 KSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE 237
KS TF K+ C ++ CK + C C ++ Y S D +T+
Sbjct: 142 KSTTFKKVGCGASQCKQVRN-------PTCDGSACAFNFTY-GTSSVAASLVQDTVTLAT 193
Query: 238 VNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS-- 292
Y GC TG G++GL RGP+S++++T Y F YCL S
Sbjct: 194 DPVPAY------AFGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPSFK 247
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYF- 349
+G + G K +K+TP++ P +S Y++ L I VG +P +A F
Sbjct: 248 TLNFSGSLRLGP--VAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFN 305
Query: 350 --TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--FDTCYDLSAYKT 405
T T DSGT+ TR P Y+A+R+ FR+R+ +K + L FDTCY
Sbjct: 306 ANTGAGTVFDSGTVFTRLVEPAYNAVRNEFRRRIAVHK-KLTVTSLGGFDTCYT----AP 360
Query: 406 VVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRGYE 462
+V P IT F G+++ L L+ + V CL A P + NS+L + N+QQ+ +
Sbjct: 361 IVAPTITFMF-SGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHR 419
Query: 463 VHYDVAGRRLGFGPGNC 479
V +DV RLG C
Sbjct: 420 VLFDVPNSRLGVARELC 436
>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 102/336 (30%), Positives = 164/336 (48%), Gaps = 31/336 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I V +G P + + +DTGS +W C+ C C F S+S T +K+ C ++
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQ-SRSTTCAKVSCGTSM 58
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
C LL P+ QD + +CP+ ++Y DGS G D +T +V + P F
Sbjct: 59 C--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPGFS 110
Query: 251 LGCTDNNTGDQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGY 299
GC ++ G G++G+ G +S++ +++ ++ F YCL S G +TGY
Sbjct: 111 FGCNMDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGY 170
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
+ GK T + V+YT +V + +E + + LT ISV GERL L S F++ DSG
Sbjct: 171 FSLGKVAT--RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSG 228
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
+ ++ P S L R+ + + G E+ CYD+ + +P I++HF G
Sbjct: 229 SELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 286
Query: 420 DLELDVRGTLVVESVRQV---CLGFALLPSDPNSIL 452
+L G V SV++ CL FA P++ SI+
Sbjct: 287 RFDLGRGGVFVERSVQEQDVWCLAFA--PTESVSII 320
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 116/388 (29%), Positives = 182/388 (46%), Gaps = 42/388 (10%)
Query: 113 KKTKAFTFPAKTGIVAADE---YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQ 169
++ T + +VA D + + ++G+P + +DTGS + W QC+PC C +Q
Sbjct: 69 RRAAFITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQ 128
Query: 170 RDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFW 228
P FDPSKS T+ + +S C P + Q K + +C Y+ +Y DGS +G
Sbjct: 129 STPIFDPSKSSTYVDLSYDSPIC-------PNSPQKKYNHLNQCIYNASYADGSTSSGNL 181
Query: 229 ATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNG-ASGIMGLDRGPVSIISKTNISYFF 287
AT+ + E + G + GC +N G +G SGI+GL G SI+S+ S F
Sbjct: 182 ATEDIVF-ETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRLG-SRFS 239
Query: 288 YC---LHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPL 344
YC L P+ + + G D V K TP T + FY++TL GISVG RL +
Sbjct: 240 YCIGDLFDPHYTHNQLVLG--DGV-KMEGSSTPFHTF---NGFYYVTLEGISVGETRLDI 293
Query: 345 KASYFTKLSTE-----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYD 399
F + + +DSGT T + L + ++ ++ G + ++ T
Sbjct: 294 NPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVR----GHFQQVIYRTIPG 349
Query: 400 LSAYKTVV------VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSI-- 451
YK V P++ HF G DL LD V ++ CL A+L S+ +I
Sbjct: 350 WLCYKGRVNEDLRGFPELAFHFAEGADLVLDANSLFVQKNQDVFCL--AVLESNLKNIGS 407
Query: 452 LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++G + Q+ Y V YD+ G+R+ F +C
Sbjct: 408 VIGIMAQQHYNVAYDLIGKRVYFQRTDC 435
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 101/374 (27%), Positives = 169/374 (45%), Gaps = 40/374 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCK----PCIHCSQQRDPFFDPSKSKTFSKIPC 187
+ + V IG P Q L++DTGS + WTQCK + P +DP +S TF+ +PC
Sbjct: 91 HSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAFLPC 150
Query: 188 NSTTCKILLEWFPPNGQ---DKCSSK-ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGY 243
+ C+ GQ C+SK C Y+ Y + G A++ T G
Sbjct: 151 SDRLCQ--------EGQFSFKNCTSKNRCVYEDVY-GSAAAVGVLASETFTF----GARR 197
Query: 244 FARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGS--TGYIT 301
GC + G GA+GI+GL +S+I++ I F YCL +P+ T +
Sbjct: 198 AVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYCL-TPFADKKTSPLL 256
Query: 302 FGKPDTVNK----KFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL----- 352
FG +++ + ++ T IV+ P ++ +Y++ L GIS+G +RL + A+
Sbjct: 257 FGAMADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPDGGG 316
Query: 353 STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL------SAYKTV 406
T +DSG+ + + A++ A ++ + +ED ++ C+ L +A + V
Sbjct: 317 GTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVED-YELCFVLPRRTAAAAMEAV 375
Query: 407 VVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYD 466
VP + +HF GG + L +CL ++GNVQQ+ V +D
Sbjct: 376 QVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFD 435
Query: 467 VAGRRLGFGPGNCN 480
V + F P C+
Sbjct: 436 VQHHKFSFAPTQCD 449
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 170/364 (46%), Gaps = 35/364 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
+ + ++G+P ++DTGS I W +C PC C+QQ P DPSKS T++ +PC +T
Sbjct: 99 FLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLPCTNTM 158
Query: 192 CKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C + P+ C+ +C Y+++Y G G AT+++ + G A +
Sbjct: 159 CH-----YAPSAY--CNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSD-EGVNAVPSVV 210
Query: 251 LGCTDNNTGDQNGA--SGIMGLDRGPVSIISKTNISYFFYCLHS---PYGSTGYITFGKP 305
GC+ N GD +G+ GL +G S +++ S F YCL + P+ + FG+
Sbjct: 211 FGCSHEN-GDYKDRRFTGVFGLGKGITSFVTRMG-SKFSYCLGNIADPHYGYNQLVFGE- 267
Query: 306 DTVNKKFVKY-TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE----IDSGT 360
F Y TP+ + Y++TL GISVG +RL + ++ F+ E IDSGT
Sbjct: 268 ---KANFEGYSTPLKVV---NGHYYVTLEGISVGEKRLDIDSTAFSMKGNEKSALIDSGT 321
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV-VPKITIHFLGGV 419
+T + AL + R+ + M CY + + ++ P +T HF GG
Sbjct: 322 ALTWLAESAFRALDNEVRQLLDGVLMPFWRGSF--ACYKGTVSQDLIGFPVVTFHFSGGA 379
Query: 420 DLELDVRGTLVVESVRQVCLGFALLPSDPNSI----LLGNVQQRGYEVHYDVAGRRLGFG 475
DL+LD + +C+ + N ++G + Q+ Y + YD+ +L F
Sbjct: 380 DLDLDTESMFYQATPDILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNKLFFQ 439
Query: 476 PGNC 479
+C
Sbjct: 440 RIDC 443
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 134/280 (47%), Gaps = 26/280 (9%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVV-----AIG 139
L +L D+ R + RR K ++ + P +GI Y+ + G
Sbjct: 45 LRRLLAADESRANSFQPRR-NKDRASASTQSASAEVPLTSGIRLQTLNYVTTISLGGSSG 103
Query: 140 KPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWF 199
P +++++DTGS +TW QCKPC C QRDP FDP+ S T++ + CN++ C L
Sbjct: 104 SPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACADSLRAA 163
Query: 200 PPN----GQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTD 255
G S++C Y +AY DGS G ATD + + + G F+ GC
Sbjct: 164 TGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGASLGG------FVFGCGL 217
Query: 256 NNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYG--STGYITFGKPDTVNK 310
+N G G +G+MGL R +S++S+T Y F YCL + ++G ++ G D
Sbjct: 218 SNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGDDAAS 277
Query: 311 KF-----VKYTPIVTTPEQSEFYHITLTGISVGGERLPLK 345
+ V YT ++ P Q FY + +TG +VGG L +
Sbjct: 278 SYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQ 317
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 168/365 (46%), Gaps = 34/365 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
+ + ++IG P L +DT S + W QC+PCI+C Q P FDPS+S T C ++
Sbjct: 85 FLVNISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQSLPIFDPSRSYTHRNESCRTS- 143
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEV-NGNGYFARYPFL 250
++ P+ + ++ C Y + Y+DG+G G A + + + + + A + +
Sbjct: 144 -----QYSMPSLRFNAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALHDVV 198
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISK--TNISYFFYCLHSPYGSTGYITFGKPDTV 308
GC +N G+ +GI+GL G S++ + T SY F L P + G D
Sbjct: 199 FGCGHDNYGEPLVGTGILGLGYGEFSLVHRFGTKFSYCFGSLDDPSYPHNVLVLG--DDG 256
Query: 309 NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS------TEIDSGTII 362
TP+ + FY++T+ ISV G LP+ F + T ID+G +
Sbjct: 257 ANILGDTTPLEI---YNGFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTGNSL 313
Query: 363 TRFPAPVYSALRSA----FRKRMKKYKMGKGIEDLFDT-CYDLSAYKTVV---VPKITIH 414
T Y L++ F R + + +D+F CY+ + + +V P +T H
Sbjct: 314 TSLVEEAYKPLKNKIEDYFEGRFTAADVNQ--DDMFKVECYNGNLERDLVESGFPIVTFH 371
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
F G +L LDV+ + S CL A+ P + NSI G Q+ Y + YD+ +++ F
Sbjct: 372 FSDGAELSLDVKSVFMKLSPNVFCL--AVTPGNMNSI--GATAQQSYNIGYDLEAKKISF 427
Query: 475 GPGNC 479
+C
Sbjct: 428 ERIDC 432
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 116/388 (29%), Positives = 182/388 (46%), Gaps = 42/388 (10%)
Query: 113 KKTKAFTFPAKTGIVAADE---YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQ 169
++ T + +VA D + + ++G+P + +DTGS + W QC+PC C +Q
Sbjct: 37 RRAAFITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQ 96
Query: 170 RDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFW 228
P FDPSKS T+ + +S C P + Q K + +C Y+ +Y DGS +G
Sbjct: 97 STPIFDPSKSSTYVDLSYDSPIC-------PNSPQKKYNHLNQCIYNASYADGSTSSGNL 149
Query: 229 ATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNG-ASGIMGLDRGPVSIISKTNISYFF 287
AT+ + E + G + GC +N G +G SGI+GL G SI+S+ S F
Sbjct: 150 ATEDIVF-ETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRLG-SRFS 207
Query: 288 YC---LHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPL 344
YC L P+ + + G D V K TP T + FY++TL GISVG RL +
Sbjct: 208 YCIGDLFDPHYTHNQLVLG--DGV-KMEGSSTPFHTF---NGFYYVTLEGISVGETRLDI 261
Query: 345 KASYFTKLSTE-----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYD 399
F + + +DSGT T + L + ++ ++ G + ++ T
Sbjct: 262 NPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVR----GHFQQVIYRTIPG 317
Query: 400 LSAYKTVV------VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSI-- 451
YK V P++ HF G DL LD V ++ CL A+L S+ +I
Sbjct: 318 WLCYKGRVNEDLRGFPELAFHFAEGADLVLDANSLFVQKNQDVFCL--AVLESNLKNIGS 375
Query: 452 LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++G + Q+ Y V YD+ G+R+ F +C
Sbjct: 376 VIGIMAQQHYNVAYDLIGKRVYFQRTDC 403
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 131/439 (29%), Positives = 192/439 (43%), Gaps = 51/439 (11%)
Query: 86 EEILRRDQQRLHLKNSRRLQKAIPDNFKK-----TKAFTFPAKTGIVA-AD---EYYIVV 136
E+LRR R + SR + + + + A T P G V AD EY I +
Sbjct: 45 RELLRRLATRSRARASRLYSSSSSSSSARPAGAGSHAVTAPLARGTVGDADIDSEYLIHL 104
Query: 137 AIGKPK-QYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
+IG P+ Q V+L LDTGS + WTQC C C Q P FD S+T +PC+ C
Sbjct: 105 SIGTPRPQRVALTLDTGSDLVWTQCA-CHVCFAQPFPTFDALASQTTLAVPCSDPICTS- 162
Query: 196 LEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL--- 250
+P +G C+ + C Y Y D S +G D T + GN + +
Sbjct: 163 -GKYPLSG---CTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGVAVP 218
Query: 251 ---LGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITF--GK 304
GC N G ++ SGI G RGP+S+ S+ ++ F +C + + F G
Sbjct: 219 NVRFGCGQYNKGIFKSNESGIAGFSRGPMSLPSQLKVARFSHCFTAIADARTSPVFLGGA 278
Query: 305 PDTVNKKFVKYTPIVTTP---EQSEFYHITLTGISVGGERLPLKASYFTKLSTE------ 355
P N P+ +TP Y++TL GI+VG RLPL A F T
Sbjct: 279 PGPDNLGAHATGPVQSTPFANSNGSLYYLTLKGITVGKTRLPLNALAFAGKGTGSGSGGT 338
Query: 356 -IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDT-CYDLS-------AYKTV 406
IDSGT I P P+Y +LR+AF R+K + D T C++ +
Sbjct: 339 IIDSGTGIRTLPGPMYRSLRAAFVARVKLPVANESAADAESTLCFEAARSASLPPEAPAP 398
Query: 407 VVPKITIHFLGGVDLELDVRGTL--VVESVRQVCLGFALL---PSDPNSILLGNVQQRGY 461
+PK+ +H + G D +L + ++E G L+ D + ++GN QQ+
Sbjct: 399 ALPKVVLH-VAGADWDLPRESYVLDLLEDEDGSGSGLCLVMNSAGDSDLTIIGNFQQQNM 457
Query: 462 EVHYDVAGRRLGFGPGNCN 480
V YD+ +L F P C+
Sbjct: 458 HVAYDLEKNKLVFVPARCD 476
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 107/365 (29%), Positives = 162/365 (44%), Gaps = 35/365 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + ++IG P + DTGS + W QC PC C +Q++P FDP S +++ I C +
Sbjct: 59 EYLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNPMFDPRSSSSYTNITCGTE 118
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
+C L Q K C Y +Y D S G A + +T+ G A +
Sbjct: 119 SCNKLDSSLCSTDQ-----KTCNYTYSYADNSITQGVLAQETLTLTSTTGEP-VAFQGII 172
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNIS------YFFYCLHSPYGS----TGYI 300
GC NN+G + G++GL RGP+S+IS+ S F CL P+ + T +
Sbjct: 173 FGCGHNNSGFNDREMGLIGLGRGPLSLISQIGSSLGAGGNMFSQCL-VPFNTDPSITSQM 231
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPL----KASYFTKLSTEI 356
FGK V TP+++ + Y TL GISV LP TK + I
Sbjct: 232 NFGKGSEVLGNGTVSTPLIS--KDGTGYFATLLGISVEDINLPFSNGSSLGTITKGNILI 289
Query: 357 DSGTIITRFPAPVYSALRSAFRKR--MKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
DSGT IT P Y L R + ++ +++ D ++ CY + P +TIH
Sbjct: 290 DSGTTITYLPEEFYHRLIEQVRNKVALEPFRI-----DGYELCYQTPT--NLNGPTLTIH 342
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
F GG D+ L + C FA+ ++ + GN Q Y + +D+ + + F
Sbjct: 343 FEGG-DVLLTPAQMFIPVQDDNFC--FAVFDTNEEYVTYGNYAQSNYLIGFDLERQVVSF 399
Query: 475 GPGNC 479
+C
Sbjct: 400 KATDC 404
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 117/374 (31%), Positives = 178/374 (47%), Gaps = 38/374 (10%)
Query: 136 VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
+ IG ++ +S ++DTGS QC S+ R P FDP+ S+++ ++PC S C +
Sbjct: 3 LGIGSLQKNLSAIIDTGSEAVLVQCG-----SRSR-PVFDPAASQSYRQVPCISQLCLAV 56
Query: 196 LEWFPPNGQDKC--SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY-PFLLG 252
+ C SS C Y ++Y D TG ++ D + + N + ++ G
Sbjct: 57 QQQTSNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVAFG 116
Query: 253 CTDNNTG--DQNGASGIMGLDRGPVSIISKTNI----SYFFYCLHS-PYG--STGYITFG 303
C + G G+ GI+G +RG +S+ S+ S F YC S P+ +TG I G
Sbjct: 117 CAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFLG 176
Query: 304 KPDTVNKKFVKYTPIV---TTPEQSEFYHITLTGISVGGERLPLKASYFTKLS------- 353
++K V YTP++ TP +S+ Y++ LT ISV G+ L + S F KL
Sbjct: 177 D-SGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAF-KLDPSTGDGG 234
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGK-GIEDLFDTCYDLSAYKTVV-VPKI 411
T +DSGT TR Y+A R+AF + K G FD CY++SA ++ VP++
Sbjct: 235 TVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEV 294
Query: 412 TIHFLGGVDLELDVRGTLVVESV--RQVCLGFALLPSDPNSI----LLGNVQQRGYEVHY 465
+ V LEL V S +V + A+L S + +LGN QQ Y V Y
Sbjct: 295 RLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEY 354
Query: 466 DVAGRRLGFGPGNC 479
D R+GF +C
Sbjct: 355 DNERSRVGFERADC 368
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 125/460 (27%), Positives = 207/460 (45%), Gaps = 49/460 (10%)
Query: 40 LIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLK 99
+I + ++T G + ++ R P S L K+ R Q H +
Sbjct: 13 VIFVALISKTSLTASMNNGSFTASLIHRDSPISPLYNPKNT-------YFDRLQSSFH-R 64
Query: 100 NSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQ 159
+ R + P++ K + G EY++ ++IG P V ++ DTGS + W Q
Sbjct: 65 SISRANRFTPNSVSAAKTLEYDIIPG---GGEYFMRISIGTPPIEVLVIADTGSDLIWVQ 121
Query: 160 CKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS----KECPYD 215
C+PC C +Q+ P F+P +S T+ ++ C + C L + CS+ K C Y
Sbjct: 122 CQPCQECYKQKSPIFNPKQSSTYRRVLCETRYCNAL-----NSDMRACSAHGFFKACGYS 176
Query: 216 IAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-QNGASGIMGLDRGP 274
+Y D S G+ AT+R I N + GC ++N G+ SGI+GL G
Sbjct: 177 YSYGDHSFTMGYLATERFIIGSTNN----SIQELAFGCGNSNGGNFDEVGSGIVGLGGGS 232
Query: 275 VSIISK--TNI-SYFFYC----LHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEF 327
+S+IS+ T I + F YC L S G I FG ++ + + + E F
Sbjct: 233 LSLISQLGTKIDNKFSYCLVPILEKSNFSLGKIVFGDNSFISGSDTYVSTPLVSKEPETF 292
Query: 328 YHITLTGISVGGERLPLKASY----FTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKK 383
Y++TL ISVG ERL + S K + IDSGT +T + +Y+ L K ++
Sbjct: 293 YYLTLEAISVGNERLAYENSRNDGNVEKGNIIIDSGTTLTFLDSKLYNKLELVLEKAVE- 351
Query: 384 YKMGKGIED---LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLG 440
G+ + D +F C+ + +P IT+HF D +++++ + L
Sbjct: 352 ---GERVSDPNGIFSICFRDKI--GIELPIITVHF---TDADVELKPINTFAKAEEDLLC 403
Query: 441 FALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
F ++PS+ +I GN+ Q + V YD+ + F P +C+
Sbjct: 404 FTMIPSNGIAI-FGNLAQMNFLVGYDLDKNCVSFMPTDCS 442
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/314 (31%), Positives = 150/314 (47%), Gaps = 26/314 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
+Y + +IG+P + +DTGS + W +C PC C+ P +DP++S++ K+PC+S
Sbjct: 86 KYIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGKLPCSSQ 145
Query: 191 TCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQE--VNGNGYFAR 246
C+ L + D+CS C Y AY G +G +T + E G+GY A
Sbjct: 146 LCQALGRGRIIS--DQCSDDPPLCGYHYAY----GHSGDHSTQGVLGTETFTFGDGYVAN 199
Query: 247 YPFLLGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKP 305
G +D G Q G +G++GL RG +S++S+ F YCL + I FG
Sbjct: 200 N-VSFGRSDTIDGSQFGGTAGLVGLGRGHLSLVSQLGAGRFAYCLAADPNVYSTILFGSL 258
Query: 306 DTVNKKF--VKYTPIVTT--PEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----I 356
++ V TP+VT P++ Y++ L GISVGG RLP+K F S
Sbjct: 259 AALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFF 318
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV-VPKITIHF 415
DSG I T Y +R A +++ G DTC+ + + V +P + +HF
Sbjct: 319 DSGAIDTSLKDAAYQVVRQAITSEIQRLGYDAGD----DTCFVAANQQAVAQMPPLVLHF 374
Query: 416 LGGVDLELDVRGTL 429
G D+ L+ R L
Sbjct: 375 DDGADMSLNGRNYL 388
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 115/378 (30%), Positives = 179/378 (47%), Gaps = 42/378 (11%)
Query: 123 KTGIVAADE---YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKS 179
+ +VA D + + ++G+P + +DTGS + W QC+PC C +Q P FDPSKS
Sbjct: 47 QANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKS 106
Query: 180 KTFSKIPCNSTTCKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQEV 238
T+ + +S C P + Q K + +C Y+ +Y DGS +G AT+ + E
Sbjct: 107 STYVDLSYDSPIC-------PNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVF-ET 158
Query: 239 NGNGYFARYPFLLGCTDNNTGDQNG-ASGIMGLDRGPVSIISKTNISYFFYC---LHSPY 294
+ G + GC +N G +G SGI+GL G SI+S+ S F YC L P+
Sbjct: 159 SDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRLG-SRFSYCIGDLFDPH 217
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLST 354
+ + G D V K TP T + FY++TL GISVG RL + F + +
Sbjct: 218 YTHNQLVLG--DGV-KMEGSSTPFHTF---NGFYYVTLEGISVGETRLDINPEVFQRTES 271
Query: 355 E-----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV-- 407
+DSGT T + L + ++ ++ G + ++ T YK V
Sbjct: 272 GQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVR----GHFQQVIYRTIPGWLCYKGRVNE 327
Query: 408 ----VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSI--LLGNVQQRGY 461
P++ HF G DL LD V ++ CL A+L S+ +I ++G + Q+ Y
Sbjct: 328 DLRGFPELAFHFAEGADLVLDANSLFVQKNQDVFCL--AVLESNLKNIGSVIGIMAQQHY 385
Query: 462 EVHYDVAGRRLGFGPGNC 479
V YD+ G+R+ F +C
Sbjct: 386 NVAYDLIGKRVYFQRTDC 403
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 110/374 (29%), Positives = 178/374 (47%), Gaps = 38/374 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + ++G P Q + L +DT + W C C C P F+P+ S TF +PC +
Sbjct: 94 YLVRASLGTPPQRLLLAVDTSNDAAWVPCAGCHGCPTTA-PSFNPASSATFRPVPCGAPP 152
Query: 192 CKILLEWFPPNGQDKCSSK-ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C P+ SK C + ++Y D S + + D + + NG G Y F
Sbjct: 153 CSQAPN---PSCTSLAKSKNSCGFSLSYGDSSLDATL-SQDNLAV-TANG-GVIKGYTF- 205
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS----TGYITFG 303
GC + G A G++GL RGP+ +++T Y F YCL S Y S +G +T G
Sbjct: 206 -GCLTKSNGSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAANFSGSLTLG 264
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-----TKLSTEIDS 358
+ + +K TP++ +P + Y++ +TG+ +G + +P+ S T T +DS
Sbjct: 265 RKGQPAPEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALAFDAATGAGTVLDS 324
Query: 359 GTIITRFPAPVYSALRSAFRKRMK-------KYKMGKGIEDL--FDTCYDLSAYKTVVVP 409
GT+ R P Y+A+R R+R+ + L FDTCY++S TV P
Sbjct: 325 GTMFARLAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSLGGFDTCYNVS---TVAWP 381
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDP-NSIL--LGNVQQRGYEVHY 465
+T+ F GG+++ L ++ + CL A P+D N+ L +G++QQ+ + V +
Sbjct: 382 AVTLVFGGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAALNVIGSLQQQNHRVLF 441
Query: 466 DVAGRRLGFGPGNC 479
DV R+GF C
Sbjct: 442 DVPNARVGFARERC 455
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 130/435 (29%), Positives = 194/435 (44%), Gaps = 51/435 (11%)
Query: 61 SLEVLGRYGPCSKLNQGKSRN-TPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFT 119
+L+V + CS K + S+ + +DQ R+ +S +K++
Sbjct: 34 TLKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQARMQYFSSLVARKSV---------VP 84
Query: 120 FPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKS 179
+ I+ + Y + G P Q + L LDT S W C C+ CS + F P KS
Sbjct: 85 IASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCSTSKP--FAPIKS 142
Query: 180 KTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVN 239
+F + C S CK + PN C C ++ Y S D +T+
Sbjct: 143 TSFRNVSCGSPHCKQV-----PN--PTCGGSACAFNFTY-GSSSIAASVVQDTLTLAADP 194
Query: 240 GNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS--PY 294
GY GC + TG G++GL RGP+S++S++ Y F YCL S
Sbjct: 195 IPGY------TFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSI 248
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYF--- 349
+G + G K +KYTP++ P +S Y++ L I VG + +P A F
Sbjct: 249 NFSGSLRLGP--VYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPT 306
Query: 350 TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVP 409
T T DSGT+ TR PVY+A+R+ FR+R+ K+ FDTCY++ +VVP
Sbjct: 307 TGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGP-KLPVTTLGGFDTCYNVP----IVVP 361
Query: 410 KITIHFLG-GVDLELDVRGTLVVESV--RQVCLGFALLPSDPNSIL--LGNVQQRGYEVH 464
IT F G V L D +V+ S CL A P + NS+L + N+QQ+ + V
Sbjct: 362 TITFLFSGMNVALPPD---NIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVL 418
Query: 465 YDVAGRRLGFGPGNC 479
+DV R+G C
Sbjct: 419 FDVPNSRIGIARELC 433
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 130/438 (29%), Positives = 193/438 (44%), Gaps = 54/438 (12%)
Query: 61 SLEVLGRYGPCSKLNQGKSRN-TPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFT 119
+LEV + PCS K + S+ ++ +DQ RL S +++
Sbjct: 35 TLEVFHVFSPCSPFRPPKPLSWAESVLQLQAKDQARLQFLASMVAGRSV----------- 83
Query: 120 FPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPS 177
P +G I+ + Y + IG P Q + L +DT + W C C C+ F P
Sbjct: 84 VPIASGRQIIQSPTYIVRAKIGSPPQTLLLAMDTSNDAAWIPCTACDGCTST---LFAPE 140
Query: 178 KSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE 237
KS TF + C S C + PN C + C +++ Y S D +T+
Sbjct: 141 KSTTFKNVSCGSPQCNQV-----PN--PSCGTSACTFNLTY-GSSSIAANVVQDTVTL-- 190
Query: 238 VNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS-- 292
Y F GC TG G++GL RGP+S++S+T Y F YCL S
Sbjct: 191 --ATDPIPDYTF--GCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFK 246
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYF- 349
+G + G +KYTP++ P +S Y++ L I VG + +P +A F
Sbjct: 247 SLNFSGSLRLGP--VAQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFN 304
Query: 350 --TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMK-KYKMGKGIEDL--FDTCYDLSAYK 404
T T DSGT+ TR AP Y+A+R F++R+ K + L FDTCY +
Sbjct: 305 AATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTVP--- 361
Query: 405 TVVVPKITIHFLGGVDLELDVRGTLVVESV-RQVCLGFALLPSDPNSIL--LGNVQQRGY 461
+V P IT F G+++ L L+ + CL A P + NS+L + N+QQ+ +
Sbjct: 362 -IVAPTITFMF-SGMNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNH 419
Query: 462 EVHYDVAGRRLGFGPGNC 479
V YDV RLG C
Sbjct: 420 RVLYDVPNSRLGVARELC 437
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 130/435 (29%), Positives = 194/435 (44%), Gaps = 51/435 (11%)
Query: 61 SLEVLGRYGPCSKLNQGKSRN-TPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFT 119
+L+V + CS K + S+ + +DQ R+ +S +K++
Sbjct: 34 TLKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQARMQYFSSLVARKSV---------VP 84
Query: 120 FPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKS 179
+ I+ + Y + G P Q + L LDT S W C C+ CS + F P KS
Sbjct: 85 IASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCSTSKP--FAPIKS 142
Query: 180 KTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVN 239
+F + C S CK + PN C C ++ Y S D +T+
Sbjct: 143 TSFRNVSCGSPHCKQV-----PN--PTCGGSACAFNFTY-GSSSIAASVVQDTLTLATDP 194
Query: 240 GNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS--PY 294
GY GC + TG G++GL RGP+S++S++ Y F YCL S
Sbjct: 195 IPGY------TFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSI 248
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYF--- 349
+G + G K +KYTP++ P +S Y++ L I VG + +P A F
Sbjct: 249 NFSGSLRLGP--VYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPT 306
Query: 350 TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVP 409
T T DSGT+ TR PVY+A+R+ FR+R+ K+ FDTCY++ +VVP
Sbjct: 307 TGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGP-KLPVTTLGGFDTCYNVP----IVVP 361
Query: 410 KITIHFLG-GVDLELDVRGTLVVESV--RQVCLGFALLPSDPNSIL--LGNVQQRGYEVH 464
IT F G V L D +V+ S CL A P + NS+L + N+QQ+ + V
Sbjct: 362 TITFLFSGMNVTLPPD---NIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVL 418
Query: 465 YDVAGRRLGFGPGNC 479
+DV R+G C
Sbjct: 419 FDVPNSRIGIARELC 433
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 122/423 (28%), Positives = 182/423 (43%), Gaps = 43/423 (10%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFK-----KTKAFTFPAKTGIV-AADEYYIVVAI 138
L LRRD++R ++ A + + F P +G+ + EY+ + +
Sbjct: 94 LAHRLRRDKRRASRISAAAGGAAAANGTRVGGGGGGSGFVAPVVSGLAQGSGEYFTKIGV 153
Query: 139 GKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEW 198
G P ++LDTGS + W QC PC C Q FDP S ++ + C + C+ L
Sbjct: 154 GTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAPLCRRL--- 210
Query: 199 FPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL-LGCTDNN 257
+G K C Y +AY DGS G +AT+ +T AR P + LGC +N
Sbjct: 211 --DSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAS------GARVPRVALGCGHDN 262
Query: 258 TGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL-------HSPYGSTGYITFGKPDT 307
G A+G++GL RG +S S+ + + F YCL S + +TFG
Sbjct: 263 EGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTVTFGSGAR 322
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTII--TRF 365
P P+ + G P + G +I +
Sbjct: 323 GALGRRVLHPDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVRPPPDPSTGRGGVIVDSGR 382
Query: 366 PAPVYS--------ALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG 417
P+P ++ A RS R ++ G LFDTCYDLS K V VP +++HF G
Sbjct: 383 PSPAWARAGRTPPCATRS--RAAAAGLRLSPGGFSLFDTCYDLSGLKVVKVPTVSMHFAG 440
Query: 418 GVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
G + L L+ V+S C FA +D ++GN+QQ+G+ V +D G+RLGF P
Sbjct: 441 GAEAALPPENYLIPVDSRGTFCFAFA--GTDGGVSIIGNIQQQGFRVVFDGDGQRLGFVP 498
Query: 477 GNC 479
C
Sbjct: 499 KGC 501
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 111/363 (30%), Positives = 156/363 (42%), Gaps = 44/363 (12%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + + +G P + +DTGS + WTQC PC +C Q P FDPSKS TF
Sbjct: 61 YLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSSTFK-------- 112
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
+ +C CPY+I Y D S TG AT+ +TIQ +G F +
Sbjct: 113 ------------EKRCHGNSCPYEIIYADESYSTGILATETVTIQSTSGEP-FVMAETSI 159
Query: 252 GCTDNNT-----GDQNGASGIMGLDRGPVSIISKTNI---SYFFYCLHSPYGSTGYITFG 303
GC NN+ G +SGI+GL+ GP S+IS+ ++ YC S T I FG
Sbjct: 160 GCGLNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCFSSQ--GTSKINFG 217
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE--IDSGTI 361
V + +Q FY++ L +SVG +R+ + F IDSGT
Sbjct: 218 TNAVVAGDGTVAADMFIKKDQ-PFYYLNLDAVSVGDKRIETLGTPFHAQDGNIFIDSGTT 276
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGK---GIEDLFDTCYDLSAYKTVVVPKITIHFLGG 418
T P Y L E+L CY+ + + P IT+HF GG
Sbjct: 277 YTYLPTS-YCNLVREAVAASVVAANQVPDPSSENLL--CYNWDTME--IFPVITLHFAGG 331
Query: 419 VDLELDVRGTLVVESVRQVCLGFALLPSDPNS-ILLGNVQQRGYEVHYDVAGRRLGFGPG 477
DL LD + + VE++ A+ DP+ + GN V YD + + F P
Sbjct: 332 ADLVLD-KYNMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPT 390
Query: 478 NCN 480
NC+
Sbjct: 391 NCS 393
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 107/391 (27%), Positives = 178/391 (45%), Gaps = 35/391 (8%)
Query: 114 KTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP 172
+ AF P +G +Y++ +G P Q L+ DTGS +TW +C+ S P
Sbjct: 91 EASAFAMPLTSGAYTGTGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASP 150
Query: 173 F-----FDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS-----KECPYDIAYVDGS 222
F P+ SK+++ IPC+S TCK + P CS+ C YD Y D S
Sbjct: 151 LASPRVFRPANSKSWAPIPCSSDTCKSYV----PFSLANCSAGTTPPAPCGYDYRYKDKS 206
Query: 223 GETGFWATDRMTIQEVNGNGYFARYPF---LLGCTDNNTGDQ-NGASGIMGLDRGPVSII 278
G TD TI ++G+G + +LGCT + G + G++ L +S
Sbjct: 207 SARGVVGTDAATI-ALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFA 265
Query: 279 SKTNISY---FFYCL--H-SPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITL 332
S+ + F YCL H +P +T Y+TFG + TP++ + + FY +T+
Sbjct: 266 SRAAARFGGRFSYCLVDHLAPRNATSYLTFGPVGAAHSP--SRTPLLLDAQVAPFYAVTV 323
Query: 333 TGISVGGERLPLKASYF---TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKG 389
+SV G+ L + A + +DSGT +T P Y A+ +A K++ ++ +
Sbjct: 324 DAVSVAGKALNIPAEVWDVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLA--RVPRV 381
Query: 390 IEDLFDTCYDLSA-YKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDP 448
D F+ CY+ +A + VP++ + F G L + ++ + C+G P
Sbjct: 382 TMDPFEYCYNWTATRRPPAVPRLEVRFAGSARLRPPTKSYVIDAAPGVKCIGLQ-EGVWP 440
Query: 449 NSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++GN+ Q+ + +D+A R L F C
Sbjct: 441 GVSVIGNILQQEHLWEFDLANRWLRFQESRC 471
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 99/358 (27%), Positives = 161/358 (44%), Gaps = 32/358 (8%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + +IG P Q ++ L DTGS + WT+C + + P+ S TF+++PC+
Sbjct: 100 YDMEFSIGTPPQKLTALADTGSDLIWTKCDAGGGAAWGGSSSYHPNASSTFTRLPCSDRL 159
Query: 192 CKILLEWFPPNGQDKCSS--KECPYDIAYVDGSG---ETGFWATDRMTI--QEVNGNGYF 244
C L + +C++ EC Y AY G GF ++ T+ V G G+
Sbjct: 160 CAALRSY----SLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTLGGDAVPGVGF- 214
Query: 245 ARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGK 304
GCT GD +G++GL RGP+S++S+ + F YCL + + FG
Sbjct: 215 -------GCTTALEGDYGEGAGLVGLGRGPLSLVSQLDAGTFMYCLTADASKASPLLFGA 267
Query: 305 PDTVN--KKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTII 362
T+ V+ T ++ + + FY + L I++G A DSGT +
Sbjct: 268 LATMTGAGAGVQSTGLLAS---TTFYAVNLRSITIGSAT---TAGVGGPGGVVFDSGTTL 321
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
T P Y+ ++AF + +G F+ CY+ ++P + +HF GG D+
Sbjct: 322 TYLAEPAYTEAKAAFLSQTTSLTPVEGRYG-FEACYE-KPDSARLIPAMVLHFDGGADMA 379
Query: 423 LDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L V +V VC ++ P+ ++GN+ Q Y V +DV L F P NC+
Sbjct: 380 LPVANYVVEVDDGVVCW---VVQRSPSLSIIGNIMQMNYLVLHDVRKSVLSFQPANCD 434
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 100/379 (26%), Positives = 170/379 (44%), Gaps = 33/379 (8%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQR-DPFFDPSKSKTFSKI 185
+ +Y++ + +G P Q + L+ DTGS + W +C C +C++ F S TFS
Sbjct: 84 TGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHSTTFSPN 143
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKE----CPYDIAYVDGSGETGFWATDRMTIQEVNGN 241
C + C+++ P +C+ C Y+ +Y DGS +GF++ + T+ +G
Sbjct: 144 HCYDSACQLV----PLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGR 199
Query: 242 GYFARYPFLLGCTDNNTGDQ------NGASGIMGLDRGPVSIISKTNISY---FFYCLH- 291
+ GC +G NGA G+MGL RGP+S+ S+ + F YCL
Sbjct: 200 EAKLKG-IAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMD 258
Query: 292 ---SPYGSTGYITFGKPD---TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLK 345
SP T Y+ G K+ +++TP+ P FY+I + +SV G +LP+
Sbjct: 259 HDISP-SPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPIN 317
Query: 346 ASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL 400
S + T +DSGT +T P P Y + + ++R++ + FD C ++
Sbjct: 318 PSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPG-FDLCVNV 376
Query: 401 SAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRG 460
S + +PK++ G R V CL + + ++GN+ Q+G
Sbjct: 377 SEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGFSVIGNLMQQG 436
Query: 461 YEVHYDVAGRRLGFGPGNC 479
+ + +D RLGF C
Sbjct: 437 FLLEFDKDRTRLGFSRHGC 455
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 116/428 (27%), Positives = 189/428 (44%), Gaps = 54/428 (12%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFK-KTKAFTFPAKTGIVAADEYYIVVAIGKPKQ 143
LEE+ RRD R H + RRL + + P G+ Y+ V +G P +
Sbjct: 47 LEELRRRDAAR-HRVSRRRLLGGVAGVVDFPVEGSANPYMVGL-----YFTRVKLGNPAK 100
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKILLEW 198
+ +DTGS I W C PC C F+P S T S+I C+ C +
Sbjct: 101 EFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQ- 159
Query: 199 FPPNGQDKC-----SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR--YPFLL 251
G+ C S C Y Y DGSG +G++ +D M + V GN A +
Sbjct: 160 ---TGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVF 216
Query: 252 GCTDNNTGDQNGA----SGIMGLDRGPVSIISKTNI-----SYFFYCLHSPYGSTGYITF 302
GC+++ +GD A GI G + +S+IS+ N F +CL G +
Sbjct: 217 GCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVL 276
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE---IDSG 359
G+ + + + YTP+V P Q Y++ L I+V G++LP+ +S FT +T+ +DSG
Sbjct: 277 GE---IVEPGLVYTPLV--PSQPH-YNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSG 330
Query: 360 TIITRFPAPVYSALRSAFRKRMK---KYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFL 416
T + Y SA + + + KG + C+ S+ P +T++F+
Sbjct: 331 TTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ-----CFITSSSVDSSFPTVTLYFM 385
Query: 417 GGVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRL 472
GGV + + L+ V++ C+G+ +I LG++ + YD+A R+
Sbjct: 386 GGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITI-LGDLVLKDKIFVYDLANMRM 444
Query: 473 GFGPGNCN 480
G+ +C+
Sbjct: 445 GWADYDCS 452
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 116/428 (27%), Positives = 189/428 (44%), Gaps = 54/428 (12%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFK-KTKAFTFPAKTGIVAADEYYIVVAIGKPKQ 143
LEE+ RRD R H + RRL + + P G+ Y+ V +G P +
Sbjct: 49 LEELRRRDAAR-HRVSRRRLLGGVAGVVDFPVEGSANPYMVGL-----YFTRVKLGNPAK 102
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKILLEW 198
+ +DTGS I W C PC C F+P S T S+I C+ C +
Sbjct: 103 EFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQ- 161
Query: 199 FPPNGQDKC-----SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR--YPFLL 251
G+ C S C Y Y DGSG +G++ +D M + V GN A +
Sbjct: 162 ---TGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVF 218
Query: 252 GCTDNNTGDQNGA----SGIMGLDRGPVSIISKTNI-----SYFFYCLHSPYGSTGYITF 302
GC+++ +GD A GI G + +S+IS+ N F +CL G +
Sbjct: 219 GCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVL 278
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE---IDSG 359
G+ + + + YTP+V P Q Y++ L I+V G++LP+ +S FT +T+ +DSG
Sbjct: 279 GE---IVEPGLVYTPLV--PSQPH-YNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSG 332
Query: 360 TIITRFPAPVYSALRSAFRKRMK---KYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFL 416
T + Y SA + + + KG + C+ S+ P +T++F+
Sbjct: 333 TTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ-----CFITSSSVDSSFPTVTLYFM 387
Query: 417 GGVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRL 472
GGV + + L+ V++ C+G+ +I LG++ + YD+A R+
Sbjct: 388 GGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITI-LGDLVLKDKIFVYDLANMRM 446
Query: 473 GFGPGNCN 480
G+ +C+
Sbjct: 447 GWADYDCS 454
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 120/406 (29%), Positives = 188/406 (46%), Gaps = 48/406 (11%)
Query: 102 RRLQKAI------PDNFKKTKAFTFPAKTGIVAADEYYIV-VAIGKPKQYVSLLLDTGSG 154
+RLQKA ++F+ +A ++ +++ Y++ +++G P + + DTGS
Sbjct: 57 QRLQKAFRRSILRGNHFRAMRASPNDIQSDVISGGGAYLMNISLGTPPVPMLGIADTGSD 116
Query: 155 ITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE-CP 213
+ W QC PC +C +Q +P FDP +S+T+ + C++ C+ L + Q C C
Sbjct: 117 LIWRQCLPCPNCYEQVEPLFDPKESETYKTLDCDNEFCQDLGQ------QGSCDDDNTCT 170
Query: 214 YDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTGDQN----GASGIM 268
Y +Y D S G ++D +TI G+ A +P GC +N G N G G+
Sbjct: 171 YSYSYGDRSYTRGDLSSDTLTIGSTEGDP--ASFPGIAFGCGHDNGGTFNEKDGGLIGLG 228
Query: 269 GLDRGPVSIISKTNISYFFYC---LHSPYGSTGYITFGKPDTVNKKFVKYTPIVT-TPEQ 324
G V +S F YC L S + I FGK V+ TP++ TP+
Sbjct: 229 GGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVSSKINFGKSGVVSGSGTVSTPLIKGTPDT 288
Query: 325 SEFYHITLTGISVGGERLPLKASYFTKLSTE--------IDSGTIITRFPAPVYSALRSA 376
FY++TL G+SVG E + K K S IDSGT +T P Y+ + SA
Sbjct: 289 --FYYLTLEGLSVGSETVAFKGFSENKSSPAAVEEGNIIIDSGTTLTLLPQDFYTDVESA 346
Query: 377 FRKRMKKYKMGKGIED---LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVES 433
+ G+ D +F CY S+ + +P IT HF G D++L T V
Sbjct: 347 LTNAIG----GQTTTDPNGIFSLCY--SSVNNLEIPTITAHFT-GADVQLPPLNTFVQVQ 399
Query: 434 VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
VC F+++PS N + GN+ Q + V YD+ ++ F +C
Sbjct: 400 EDLVC--FSMIPSS-NLAIFGNLAQINFLVGYDLKNNKVSFKQTDC 442
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 102/349 (29%), Positives = 157/349 (44%), Gaps = 31/349 (8%)
Query: 99 KNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWT 158
K+ RL+ +KT A ++ Y + V +G P Q + ++LDT + W
Sbjct: 12 KDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWV 71
Query: 159 QCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAY 218
C C CS F P+ S T + C+ C + + P S C ++ +Y
Sbjct: 72 PCSGCTGCSSTT---FLPNASTTLGSLDCSEAQCSQVRGFSCP----ATGSSACLFNQSY 124
Query: 219 VDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSII 278
S D +T+ G F GC + +G G++GL RGP+S+I
Sbjct: 125 GGDSSLAATLVQDAITLANDVIPG------FTFGCINAVSGGSIPPQGLLGLGRGPISLI 178
Query: 279 SKTNISY---FFYCLHS--PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLT 333
S+ Y F YCL S Y +G + G K ++ TP++ P + Y++ LT
Sbjct: 179 SQAGAMYSGVFSYCLPSFKSYYFSGSLKLGP--VGQPKSIRTTPLLRNPHRPSLYYVNLT 236
Query: 334 GISVGGERLPLKASYF-----TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGK 388
G+SVG ++P+ + T T IDSGT+ITRF PVY A+R FRK++
Sbjct: 237 GVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSL 296
Query: 389 GIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV 437
G FDTC+ +A P +T+HF G++L L + +L+ S V
Sbjct: 297 GA---FDTCF--AATNEAEAPAVTLHF-EGLNLVLPMENSLIHSSSGSV 339
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 111/433 (25%), Positives = 181/433 (41%), Gaps = 50/433 (11%)
Query: 86 EEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYV 145
E+LRR QR + + + +P + + K A + A EY + + +G P+
Sbjct: 44 HELLRRAIQRSRDRLASIAPRLLPTS-SRNKVVVAEAPV-LSAGGEYLVKLGLGTPQHCF 101
Query: 146 SLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
+ +DT S + WTQC+PC+ C +Q DP F+P S +++ +PCNS TC L D
Sbjct: 102 TAAIDTASDLIWTQCQPCVKCYKQLDPVFNPVASTSYAVVPCNSDTCDELDTHRCARDGD 161
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-QNGA 264
C Y +Y + G A DR+ I G+ F F GC+ ++ G
Sbjct: 162 SDDEDACQYTYSYGGNATTRGILAVDRLAI----GDDVFRGVVF--GCSSSSVGGPPPQV 215
Query: 265 SGIMGLDRGPVSIISKTNISYFFYCLHSPYG-STGYITFGKPDTV---NKKFVKYTPIVT 320
SG++GL RG +S++S+ ++ F YCL P S G + G N P+ T
Sbjct: 216 SGVVGLGRGALSLVSQLSVRRFMYCLPPPVSRSAGRLVLGADAAATVRNASERVVVPMST 275
Query: 321 TPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE------------------------- 355
+Y++ L GIS+G + ++ +T
Sbjct: 276 GSRYPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGTAAGAPASPVSGSGDGDGSGTGPD 335
Query: 356 -----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSA---YKTVV 407
ID + IT +Y + + ++ + G G + D C+ L V
Sbjct: 336 AYGMIIDIASTITFLEESLYEEMVDDLEEEIRLPR-GSGSDLGLDLCFILPEGVPMSRVY 394
Query: 408 VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
P +++ F GV L LD + + VE + + +D SI LGN QQ+ +V Y++
Sbjct: 395 APPVSLAF-EGVWLRLD-KEQMFVEDRASGMMCLMVGKTDGVSI-LGNYQQQNMQVMYNL 451
Query: 468 AGRRLGFGPGNCN 480
R+ F C
Sbjct: 452 RRGRITFIKTACE 464
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 126/441 (28%), Positives = 199/441 (45%), Gaps = 59/441 (13%)
Query: 78 KSRNTPSLEEILRRDQQRLHLKNSR-----RLQKAIPDNFKKTKAF--TFPAKTGIVAAD 130
++RN ++ RD L N R RL+ + + + F + +V +D
Sbjct: 26 EARNAGFSANLIHRDSSVSPLYNPRDTYFDRLRNSFHRSISRANRFKPNSISARALVQSD 85
Query: 131 ------EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSK 184
EY + ++IG P+ + + DTGS + W QC+PC C +Q P FDP +S ++
Sbjct: 86 IVPGGGEYLMRISIGNPQVEILAIADTGSDLIWVQCQPCEMCYKQNSPIFDPRRSSSYRN 145
Query: 185 IPCNSTTCKILLEWFPPNGQDK-CSS----KECPYDIAYVDGSGETGFWATDRMTIQEVN 239
+ C + C L +G+ + C + K C Y +Y D S G A +R I N
Sbjct: 146 VLCGNEFCNKL------DGEARSCDARGFVKTCGYTYSYGDQSFSDGHLAIERFGIGSTN 199
Query: 240 GN-----GYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLH 291
N YF F G + T D+ G+ I G +S++S+ F YCL
Sbjct: 200 SNTSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLG-GGSMSLVSQLGPKLSGKFSYCLV 258
Query: 292 SPYGSTGY---ITFGKPDTVNKKFVKYTPIVTTP----EQSEFYHITLTGISVGGERLP- 343
+ Y I FG + +N Y +V+TP + +Y++TL ISV +RLP
Sbjct: 259 PTSEQSNYTSKINFG--NDINISGSNYN-VVSTPLLPKKPETYYYLTLEAISVENKRLPY 315
Query: 344 --LKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED---LFDTCY 398
L K + IDSGT +T + ++ L SA + +K G+ + D LF+ C+
Sbjct: 316 TNLWNGEVEKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVK----GERVSDPHGLFNICF 371
Query: 399 DLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQ 458
K + +P IT HF G D+EL T V + L F ++PS+ +I GN+ Q
Sbjct: 372 --KDEKAIELPIITAHFTGA-DVELQPVNTFA--KVEEDLLCFTMIPSNDIAI-FGNLAQ 425
Query: 459 RGYEVHYDVAGRRLGFGPGNC 479
+ V YD+ + + F P +C
Sbjct: 426 MNFLVGYDLEKKAVSFLPTDC 446
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 102/365 (27%), Positives = 163/365 (44%), Gaps = 30/365 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHC--SQQRDPFFDPSKSKTFSKIPCN 188
EY + ++IG P Q + ++DTGS + W +C C HC + F S ++ K+PCN
Sbjct: 4 EYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCN 63
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY- 247
ST C + G + C Y Y DGS +G +DR++ + +G G R
Sbjct: 64 STHCSGM----SSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRS-HGAGEDHRSF 118
Query: 248 --PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISK--TNISY-FFYCL---HSPYGSTGY 299
FL GC GD N G++GL + S+I + + Y F YCL SP + +
Sbjct: 119 FDGFLFGCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSF 178
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSE-FYHITLTGISVGGERLPL---------KASYF 349
+ G + V TPI+ + Y++ L I++GG + + F
Sbjct: 179 LFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGPF 238
Query: 350 TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVP 409
T IDSGT T PVY A+R + +++ +G D C++ S + P
Sbjct: 239 LANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAG--LDLCFNSSGDTSYGFP 296
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
+T +F V L L V S VCL ++ S + ++GN+QQ+ + + YD+
Sbjct: 297 SVTFYFANQVQLVLPFENIFQVTSRDVVCL--SMDSSGGDLSIIGNMQQQNFHILYDLVA 354
Query: 470 RRLGF 474
++ F
Sbjct: 355 SQISF 359
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 116/404 (28%), Positives = 178/404 (44%), Gaps = 34/404 (8%)
Query: 92 DQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAAD--EYYIVVAIGKPKQYVSLLL 149
D RL SR + + N KTKA + + + EY++ ++IG P V ++
Sbjct: 55 DFDRLRNAFSRSISRV---NVFKTKAVDINSFQNDLVPNGGEYFMKMSIGTPLVEVIVIA 111
Query: 150 DTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCS- 208
DTGS +TW QC PC C +Q+ P FDPS+S ++ + C S C L + C+
Sbjct: 112 DTGSDLTWVQCLPCDPCYRQKSPLFDPSRSSSYRHMLCGSRFCNAL-----DVSEQACTM 166
Query: 209 -SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQN----- 262
+ C Y +Y D S G AT++ TI + P + GC N G +
Sbjct: 167 DTNICEYHYSYGDKSYTNGNLATEKFTIGSTSSRPVHLS-PIVFGCGTGNGGTFDELGSG 225
Query: 263 ---GASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIV 319
G + L SII K SY L T I FG ++ V TP+V
Sbjct: 226 IVGLGGGALSLVSQLSSII-KGKFSYCLVPLSEQSNVTSKIKFGTDSVISGPQVVSTPLV 284
Query: 320 TTPEQSEFYHITLTGISVGGERLP----LKASYFTKLSTEIDSGTIITRFPAPVYSALRS 375
+ + +Y++TL ISVG +RLP L K + IDSGT +T + ++ L
Sbjct: 285 SK-QPDTYYYVTLEAISVGNKRLPYTNGLLNGNVEKGNVIIDSGTTLTFLDSEFFTELER 343
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVR 435
+ +K ++ LF C+ + + +P I +HF D++L T V
Sbjct: 344 VLEETVKAERVSDP-RGLFSVCF--RSAGDIDLPVIAVHF-NDADVKLQPLNTFVKADED 399
Query: 436 QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+C F ++ S+ I GN+ Q + V YD+ R + F P +C
Sbjct: 400 LLC--FTMISSNQIGI-FGNLAQMDFLVGYDLEKRTVSFKPTDC 440
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 109/381 (28%), Positives = 172/381 (45%), Gaps = 47/381 (12%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI---HCSQQRDPFFDPSKSKTFS 183
+A +Y IG P Q + L+DTGS + WTQC C++Q P+++ S+S TF+
Sbjct: 79 LATRQYIAEYLIGDPPQRAAALIDTGSNLIWTQCGTTCGLKACAKQDLPYYNLSRSSTFA 138
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNG 242
+PC + + NG C C + +Y GS G T+ T Q
Sbjct: 139 AVPCADSA-----KLCAANGVHLCGLDGSCTFAASYGAGS-VFGSLGTEAFTFQSGA--- 189
Query: 243 YFARYPFLLGC---TDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY----G 295
A+ F GC T G NGASG++GL RG +S++S+T + F YCL +PY G
Sbjct: 190 --AKLGF--GCVSLTRITKGALNGASGLIGLGRGRLSLVSQTGATKFSYCL-TPYLRNHG 244
Query: 296 STGYITFGKPDTVN--KKFVKYTPIVTTPEQ---SEFYHITLTGISVGGERLPLKASYFT 350
++ ++ G +++ V P V +PE S FY++ L GISVG +LP+ ++ F
Sbjct: 245 ASSHLFVGASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLVGISVGETKLPIPSAAFE 304
Query: 351 KLSTE---------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS 401
ID+G+ +T YSAL +++ + + + D C
Sbjct: 305 LRRVAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLNRSLVQPPADTGLDLCV--- 361
Query: 402 AYKTV--VVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQR 459
A + V VVP + HF GG D+ + C+ L+ ++GN QQ+
Sbjct: 362 ARQDVDKVVPVLVFHFGGGADMAVSAGSYWGPVDKSTACM---LIEEGGYETVIGNFQQQ 418
Query: 460 GYEVHYDVAGRRLGFGPGNCN 480
+ YD+ L F +C+
Sbjct: 419 DVHLLYDIGKGELSFQTADCS 439
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 103/365 (28%), Positives = 163/365 (44%), Gaps = 30/365 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHC--SQQRDPFFDPSKSKTFSKIPCN 188
EY + ++IG P Q + ++DTGS + W +C C HC + F S ++ K+PCN
Sbjct: 4 EYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCN 63
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY- 247
ST C + G + C Y Y DGS +G +DR++ + +G G R
Sbjct: 64 STHCSGM----SSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRS-HGAGEDHRSF 118
Query: 248 --PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISK--TNISY-FFYCL---HSPYGSTGY 299
FL GC GD N G++GL + S+I + + Y F YCL SP + +
Sbjct: 119 FDGFLFGCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSF 178
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSE-FYHITLTGISVGGERLPL---------KASYF 349
+ G + V TPI+ + Y++ L I+VGG + + F
Sbjct: 179 LFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSVGPF 238
Query: 350 TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVP 409
T IDSGT T PVY A+R + +++ +G D C++ S + P
Sbjct: 239 LANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAG--LDLCFNSSGDTSYGFP 296
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
+T +F V L L V S VCL ++ S + ++GN+QQ+ + + YD+
Sbjct: 297 SVTFYFANQVQLVLPFENIFQVTSRDVVCL--SMDSSGGDLSIIGNMQQQNFHILYDLVA 354
Query: 470 RRLGF 474
++ F
Sbjct: 355 SQISF 359
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 124/428 (28%), Positives = 190/428 (44%), Gaps = 70/428 (16%)
Query: 91 RDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLD 150
R Q H+ R + + K T F + A+ + IG P Q ++++LD
Sbjct: 35 RIQNNHHISTRRLFSNS---SSKTTGKLLFHHNVTLTAS------LTIGTPPQNITMVLD 85
Query: 151 TGSGITWTQCKPCIHCSQQRDP----FFDPSKSKTFSKIPCNSTTCKILL-EWFPPNGQD 205
TGS ++W +CK ++P F+P SKT++KIPC+S TCK + P D
Sbjct: 86 TGSELSWLRCK--------KEPNFTSIFNPLASKTYTKIPCSSQTCKTRTSDLTLPVTCD 137
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTD----NNTGDQ 261
+K C + I+Y D S G A + G R + GC D +NT +
Sbjct: 138 P--AKLCHFIISYADASSVEGHLAFETFRF------GSLTRPATVFGCMDSGSSSNTEED 189
Query: 262 NGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIV-- 319
+G+MG++RG +S +++ F YC+ S STG++ G+ K + YTP+V
Sbjct: 190 AKTTGLMGMNRGSLSFVNQMGFRKFSYCI-SGLDSTGFLLLGEARYSWLKPLNYTPLVQI 248
Query: 320 TTPE---QSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFPAPVYS 371
+TP Y + L GI V + LPL S F T +DSGT T PVYS
Sbjct: 249 STPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYS 308
Query: 372 ALRSAF--------RKRMKKYKMGKGIEDLFDTCYDLSAYKTVV--VPKITIHFLGGVDL 421
ALR F R + + +G DL CY + + + + +P + + F G
Sbjct: 309 ALRKEFLLQTAGVLRVLNEPQYVFQGAMDL---CYLIDSTSSTLPNLPVVKLMFRGA--- 362
Query: 422 ELDVRGTLVVESVRQVCLG------FALLPSDP---NSILLGNVQQRGYEVHYDVAGRRL 472
E+ V G ++ V G F SD +S L+G+ QQ+ + YD+ R+
Sbjct: 363 EMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDELGISSFLIGHHQQQNVWMEYDLENSRI 422
Query: 473 GFGPGNCN 480
GF C+
Sbjct: 423 GFAELRCD 430
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 120/376 (31%), Positives = 184/376 (48%), Gaps = 34/376 (9%)
Query: 121 PAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSK 180
P T I +Y + ++G P ++DTGS I W QC+PC C Q P F+PSKS
Sbjct: 76 PESTVISYEGDYIMSYSVGTPPIKSYGIVDTGSDIVWLQCEPCEQCYNQTTPKFNPSKSS 135
Query: 181 TFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNG 240
++ I C+S C+ + + + DK K C Y I Y + S G + + +T++ G
Sbjct: 136 SYKNISCSSKLCQSVRD---TSCNDK---KNCEYSINYGNQSHSQGDLSLETLTLESTTG 189
Query: 241 NGYFARYP-FLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISY---FFYCL----- 290
+P ++GC NN G + +SG++GL GP S+I++ S F YCL
Sbjct: 190 RP--VSFPKTVIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSIGGKFSYCLVRMSI 247
Query: 291 ---HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKAS 347
+ GS+ + FG V+ V TPIV + S FY++T+ SVG +R+ S
Sbjct: 248 TLKNMSMGSSK-LNFGDVAIVSGHNVLSTPIV-KKDHSFFYYLTIEAFSVGDKRVEFAGS 305
Query: 348 YFTKLSTE----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAY 403
+K E IDS TI+T P+ VY+ L SA + ++ + F CY++S+
Sbjct: 306 --SKGVEEGNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDPNQQ-FSLCYNVSSD 362
Query: 404 KTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEV 463
+ P +T HF G D+ L T VE R V L FA PS+ +I G+ Q+ + V
Sbjct: 363 EEYDFPYMTAHF-KGADILLYATNTF-VEVARDV-LCFAFAPSNGGAI-FGSFSQQDFMV 418
Query: 464 HYDVAGRRLGFGPGNC 479
YD+ + + F +C
Sbjct: 419 GYDLQQKTVSFKSVDC 434
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 127/447 (28%), Positives = 187/447 (41%), Gaps = 52/447 (11%)
Query: 57 PGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTK 116
P SLE++ RY S G N E I R + L R AI +
Sbjct: 25 PDGFSLEIVHRYSRESPFYPG---NITDYERITRL----VELSKIRAHNLAI----TTSS 73
Query: 117 AFTFPAKTGIVAADE--YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFF 174
F+ A ++ D+ Y + V IG P + L+ DTGSG+ WTQC+PC +Q P F
Sbjct: 74 GFSPEAFRLRISQDDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPPIF 133
Query: 175 DPSKSKTFSKIPCNSTTCKILLEWFPPNGQD--KCSSKECPYDIAYVDGSGETGFWATDR 232
+ + S+T+ +PC C N Q+ +C +C Y IAY GS G A D
Sbjct: 134 NSTASRTYRDLPCQHQFCT--------NNQNVFQCRDDKCVYRIAYAGGSATAGVAAQDI 185
Query: 233 MTIQEVNGNGYFARYPFLLGCTDNNTG-----DQNGASGIMGLDRGPVSIISKTNI---S 284
+ E + R PF GC+ +N GI+GL+ PVS++ + N +
Sbjct: 186 LQSAEND------RIPFYFGCSRDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKN 239
Query: 285 YFFYCLH-----SPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGG 339
F YCL+ SP +T + FG +++ TP V +P Y + L +SV G
Sbjct: 240 RFSYCLNLFDLSSPSHATSLLRFGNDIRKSRRKYLSTPFV-SPRGMPNYFLNLIDVSVAG 298
Query: 340 ERLPLKASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGK-GIEDL 393
R+ + F T IDSGT +T Y + +AF+ ++ + I+
Sbjct: 299 NRMQIPPGTFALKPDGTGGTIIDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNIQLS 358
Query: 394 FDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNS-IL 452
CY + P + HF G L V+ C+ AL P P +
Sbjct: 359 GYICYKQQGHTFHNYPSMAFHFQGADFFVEPEYVYLTVQDRGAFCV--ALQPISPQQRTI 416
Query: 453 LGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+G + Q + YD A R+L F P NC
Sbjct: 417 IGALNQANTQFIYDAANRQLLFTPENC 443
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 122/412 (29%), Positives = 186/412 (45%), Gaps = 53/412 (12%)
Query: 102 RRLQKA-IPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQC 160
RL KA +P+ ++T A+ P I YY+ + IG P + L +DTGS +TW QC
Sbjct: 2 ERLSKASVPETAQRTAAY--PIGGNIYPDGLYYMAMRIGNPAKLYYLDMDTGSDLTWLQC 59
Query: 161 -KPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS--KECPYDIA 217
PC C+ +DP +++ + C TC + GQ CS ++C Y++
Sbjct: 60 DAPCRSCAVGPHGLYDPKRARV---VDCRRPTCAQVQR----GGQFTCSGDVRQCDYEVD 112
Query: 218 YVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGA----SGIMGLDRG 273
YVDGS G D +T+ NG + R ++GC + G A G++GL
Sbjct: 113 YVDGSSTMGILVEDTITLVLTNGTRFQTR--AVIGCGYDQQGTLAKAPAVTDGVIGLSSS 170
Query: 274 PVSIISKTNI-----SYFFYCLHSPYGSTGYITFGKPDTVNKKF-VKYTPIVTTPEQSEF 327
+S+ S+ + +CL GY+ FG DT+ + +TP++ P E
Sbjct: 171 KISLPSQLAAKGIANNVIGHCLAGGSNGGGYLFFG--DTLVPALGMTWTPMIGRP-LVEG 227
Query: 328 YHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMG 387
Y L I GGE L L+ + DSGT T Y+A+ SA ++ ++ +
Sbjct: 228 YQARLRSIKYGGEVLELEGTTDDVGGAMFDSGTSFTYLVPNAYTAVLSAVVRQAQRSGLE 287
Query: 388 KGIEDL-----------FDTCYDLSAY-KTVVVPKITIHFLG------GVDLELDVRGTL 429
+ D F++ D+SAY KTV T+ F G G LEL G L
Sbjct: 288 RIKTDTTLPFCWRGPSPFESVADVSAYFKTV-----TLDFGGSTWWSSGKLLELSPEGYL 342
Query: 430 VVESVRQVCLGF--ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+V + VCLG A + S + +LG++ RGY V YD ++G+ NC
Sbjct: 343 IVSTQGNVCLGVLDASVASLEVTNILGDISMRGYLVVYDNMREQIGWVRRNC 394
>gi|242086416|ref|XP_002443633.1| hypothetical protein SORBIDRAFT_08g022640 [Sorghum bicolor]
gi|241944326|gb|EES17471.1| hypothetical protein SORBIDRAFT_08g022640 [Sorghum bicolor]
Length = 503
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 126/452 (27%), Positives = 194/452 (42%), Gaps = 53/452 (11%)
Query: 58 GKVSLEVLGRYGPCSKLNQGKSRNTPSL--EEILRRDQQRLHLKNSRRLQKAIPD----- 110
G L ++ + PCS L+ PSL + L D + + S + P
Sbjct: 75 GNNKLPIVHQQSPCSPLH-----GLPSLTAADGLHHDASLIRRRFSSKSSPVAPPASSLA 129
Query: 111 -NFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGS-GITWTQCKPCIHCSQ 168
T + P + + +Y ++V+ G P+Q +LLDT S G++ +CKPC S
Sbjct: 130 VTIIPTNGSSDPTRKPVTL--QYSVLVSYGTPEQQFPVLLDTSSIGMSLLRCKPCASGSD 187
Query: 169 QRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN-GQDKCSSKECPYDIAY--VDGSGET 225
FD S+S TF+ + C S C P N D CP D Y +DG+
Sbjct: 188 DCHLAFDTSRSSTFAHVLCGSPDC-------PTNCSGDGDGDSFCPLDSTYSIIDGA--- 237
Query: 226 GFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQN-GASGIMGLDRG------PVSII 278
+A D +T+ + A F C D + D + +G + L R +S
Sbjct: 238 --FAEDVLTLAPSSK----AIENFRFVCLDVDEPDDDLPVAGTLDLSRDRNSLPSQLSSS 291
Query: 279 SKTNISYFFYCLHSPYGSTGYITFGKPDTV-NKKFVKYTPIVTT---PEQSEFYHITLTG 334
+ F YCL S GY++ TV + K + P+V+ PE + Y I L G
Sbjct: 292 PGQATAAFSYCLPKSPSSQGYLSLAVDATVRHDKVTAHAPLVSNGGDPELASMYFIDLVG 351
Query: 335 ISVGGERLPL-KASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL 393
+S+G + +P+ A F +D GT T+ VY LR +FRK+M + D
Sbjct: 352 MSLGVDDIPIPPAGSFGNNGVNLDLGTTFTKLTPEVYMTLRDSFRKQMSQNNHSLLGFDG 411
Query: 394 FDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTL-----VVESVRQVCLGFALLPS-D 447
FDTC++L+ + + +P + F G L +D+ L CL F+ L + D
Sbjct: 412 FDTCFNLTGVRDLAMPLLWFKFSNGERLLIDLDQMLYYDDPAAAPFTMACLAFSSLDAGD 471
Query: 448 PNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
S ++G EV YDVAG ++GF P +C
Sbjct: 472 SFSAVIGTHTLASTEVIYDVAGGKVGFIPRSC 503
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 110/361 (30%), Positives = 173/361 (47%), Gaps = 30/361 (8%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC--IHCSQQRDPFFDPSKSKTFSKIPCNS 189
Y + + IG P + DTGS +TW QC PC C Q P +DP S TF+ +PC+S
Sbjct: 96 YLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLPCDS 155
Query: 190 TTCKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
C L P Q CS +C Y Y D S G ++D + + + + Y ++
Sbjct: 156 QPCTQL-----PYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLLQLH-YNSKIC 209
Query: 249 FLLGCTDNNTGDQNG-ASGIMGLDRGPVSIISKTNISY---FFYCLHSPY--GSTGYITF 302
F G + T D++G +GI+GL GP+S++S+ F YCL P+ S + F
Sbjct: 210 FGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLL-PFSSNSNSKLKF 268
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTII 362
G+ V V TP++ P+ FY++ L GI+VG + + T + IDSG+ +
Sbjct: 269 GEAAIVQGNGVVSTPLIIKPDL-PFYYLNLEGITVGAKTVKTGQ---TDGNIIIDSGSTL 324
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVV--PKITIHFLGGVD 420
T Y+ S ++ + + + I FD C+ YK + P + HF GG D
Sbjct: 325 TYLEESFYNEFVSLVKETV-AVEEDQYIPYPFDFCF---TYKEGMSTPPDVVFHFTGG-D 379
Query: 421 LELDVRGTLVVESVRQVCLGFALLPSDPNSI-LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ L TLV+ +C ++PS + I + GN+ Q + V YD+ G ++ F P +C
Sbjct: 380 VVLKPMNTLVLIEDNLICS--TVVPSHFDGIAIFGNLGQIDFHVGYDIQGGKVSFAPTDC 437
Query: 480 N 480
+
Sbjct: 438 S 438
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 110/376 (29%), Positives = 172/376 (45%), Gaps = 55/376 (14%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + +AIG P L DTGS +TWTQCKPC C Q P +D + S +FS +PC+S
Sbjct: 82 EYLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSPLPCSSA 141
Query: 191 TCKILLEWFPPNGQDKCS--SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
TC L W +CS S C Y AY DG+ ++ + I V G +
Sbjct: 142 TC--LPIW-----SSRCSTPSATCRYRYAYDDGA-----YSPECAGI-SVGGIAF----- 183
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGS--TGYITFG--- 303
GC +N G ++G +GL RG +S++++ + F YCL + + + + FG
Sbjct: 184 ---GCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLSSPVFFGSLA 240
Query: 304 ----KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE---- 355
+ + V+ TP+V +P Y+++L GIS+G RLP+ F +
Sbjct: 241 ELAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDDGSGG 300
Query: 356 --IDSGTIITRFPAPVYSALRSAFRKRMKKYK--MGKGI---EDLFDTCYDLSA---YKT 405
+DSGTI T + + FR + +G+ + L C+ A +
Sbjct: 301 MIVDSGTIFTIL-------VETGFRVVVDHVAGVLGQPVVNASSLDRPCFPAPAAGVQEL 353
Query: 406 VVVPKITIHFLGGVDLELDVRGTLVV-ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVH 464
+P + +HF GG D+ L + E CL S S+ LGN QQ+ ++
Sbjct: 354 PDMPDMVLHFAGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGSV-LGNFQQQNIQML 412
Query: 465 YDVAGRRLGFGPGNCN 480
+D+ +L F P +C+
Sbjct: 413 FDITVGQLSFMPTDCS 428
>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 437
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 125/436 (28%), Positives = 201/436 (46%), Gaps = 49/436 (11%)
Query: 62 LEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFP 121
L ++ Y CS P +E L + K+ RL+ + T A
Sbjct: 34 LSIIPIYSKCSPF-------IPPKQEPLVNTVIDMASKDPARLKYLSSLAAQMTTAVPIA 86
Query: 122 AKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKT 181
++ Y + V +G P Q++ ++LDT + W C C CS + S T
Sbjct: 87 PGQQVLNIGNYVVRVKLGTPGQFMFMVLDTSNDAAWVPCSGCTGCSSTTF---STNTSST 143
Query: 182 FSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWAT---DRM-TIQE 237
+ + C+ C + + P S C ++ +Y G++ F AT D + + +
Sbjct: 144 YGSLDCSMAQCTQVRGFSCP----ATGSSSCVFNQSY---GGDSSFSATLVEDSLRLVND 196
Query: 238 VNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS-- 292
V N F GC ++ +G G++GL RGP+S+I+++ Y F YCL S
Sbjct: 197 VIPN-------FAFGCINSISGGSVPPQGLLGLGRGPLSLIAQSGSLYSGLFSYCLPSFK 249
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF--- 349
Y +G + G K ++YTP++ P + Y++ LTG+SVG +P+
Sbjct: 250 SYYFSGSLKLGPAG--QPKSIRYTPLLRNPHRPSLYYVNLTGVSVGRTLVPIAPELLAFN 307
Query: 350 --TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV 407
T T IDSGT+ITRF P+Y+A+R FRK++ G FDTC+ +A V
Sbjct: 308 PNTGAGTIIDSGTVITRFVQPIYTAIRDEFRKQVAGPFSSLGA---FDTCF--AATNEAV 362
Query: 408 VPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRGYEVH 464
P +T+HF G++L L + +L+ S + CL A P++ NS+L + N+QQ+ +
Sbjct: 363 APAVTLHFT-GLNLVLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRLL 421
Query: 465 YDVAGRRLGFGPGNCN 480
+DV RLG CN
Sbjct: 422 FDVPNSRLGIARELCN 437
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 105/364 (28%), Positives = 167/364 (45%), Gaps = 35/364 (9%)
Query: 126 IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKI 185
+++ Y +G P Q + + +D + W C R P FDP++S T+ +
Sbjct: 101 LLSIPSYVARARLGTPAQALLVAIDPSNDAAWVPCA--ACAGCARAPSFDPTRSSTYRPV 158
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE-VNGNGYF 244
C + C P G C ++++Y + + D + + + V+
Sbjct: 159 RCGAPQCSQAPAPSCPGGL----GSSCAFNLSYAASTFQ-ALLGQDALALHDDVDA---V 210
Query: 245 ARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS--TGY 299
A Y F GC TG G++G RGP+S S+T Y F YCL S S +G
Sbjct: 211 AAYTF--GCLHVVTGGSVPPQGLVGFGRGPLSFPSQTKDVYGSVFSYCLPSYKSSNFSGT 268
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-----TKLST 354
+ G K +K TP+++ P + Y++ + GI VGG +P+ AS + T
Sbjct: 269 LRLGP--AGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVPVPASALAFDPTSGRGT 326
Query: 355 EIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
+D+GT+ TR APVY+A+R FR R++ G FDTCY++ T+ VP +T
Sbjct: 327 IVDAGTMFTRLSAPVYAAVRDVFRSRVRAPVAGP--LGGFDTCYNV----TISVPTVTFS 380
Query: 415 FLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSI---LLGNVQQRGYEVHYDVAGR 470
F G V + L ++ S + CL A P D +L ++QQ+ + V +DVA
Sbjct: 381 FDGRVSVTLPEENVVIRSSSGGIACLAMAAGPPDGVDAALNVLASMQQQNHRVLFDVANG 440
Query: 471 RLGF 474
R+GF
Sbjct: 441 RVGF 444
>gi|242086414|ref|XP_002443632.1| hypothetical protein SORBIDRAFT_08g022630 [Sorghum bicolor]
gi|241944325|gb|EES17470.1| hypothetical protein SORBIDRAFT_08g022630 [Sorghum bicolor]
Length = 556
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 112/376 (29%), Positives = 166/376 (44%), Gaps = 42/376 (11%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGS-GITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNS 189
+Y ++V+ G P+Q + LDT S G + +CKPC S DP FD S S TF+ + C S
Sbjct: 196 DYSVLVSYGTPEQQFPVFLDTSSVGASMIRCKPCASGSVDCDPAFDTSLSSTFNHVLCGS 255
Query: 190 TTCKILLEWFPPN-GQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
C P N D CP D Y S G + D +T+ A
Sbjct: 256 PDC-------PTNCSGDGDGDSFCPLDGTY---SVINGTFVEDVLTLAPST-----AIND 300
Query: 249 FLLGCTDNNTGD-QNGASGIMGLDRG-----------PVSIISKTNISYFFYCLHSPYGS 296
F C D + D A G + L R S + + F YCL S
Sbjct: 301 FKFVCLDVHKPDVLQTAVGTLDLSRDRNSLPSQLSSSSSSSGQASAAAAFSYCLPKSSSS 360
Query: 297 TGYITFGKPDTV-NKKFVKYTPIVTT--PEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
G+++ G TV + + +V++ PE + Y I L GIS+G E L + A F S
Sbjct: 361 QGFLSLGINATVKDDNATAHATLVSSGNPELASMYFIDLVGISLGDEDLSIPAGTFGNRS 420
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL---FDTCYDLSAYKTVVVPK 410
T +D GT T Y+ALR +F+++M +Y D+ FDTC++ + +V+P
Sbjct: 421 TNLDVGTTFTILAPDAYTALRESFKRQMSQYNFSSSPTDIAGGFDTCFNFTDLNDLVIPN 480
Query: 411 ITIHFLGGVDLELDVRGTLVVES------VRQVCLGFALLPS-DPNSILLGNVQQRGYEV 463
+ + F G L +D L + CL F+ L + D + ++G+ EV
Sbjct: 481 VQLKFSNGDMLVIDADQMLYYDDDTDAAPFTMACLAFSSLDAGDSFAAVIGSYTLATTEV 540
Query: 464 HYDVAGRRLGFGPGNC 479
YDVAG ++GF P +C
Sbjct: 541 VYDVAGGQVGFIPWSC 556
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 101/349 (28%), Positives = 156/349 (44%), Gaps = 31/349 (8%)
Query: 99 KNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWT 158
K+ RL+ +KT A ++ Y + V +G P Q + ++LDT + W
Sbjct: 12 KDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWV 71
Query: 159 QCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAY 218
C C CS F P+ S T + C+ C + + P S C ++ +Y
Sbjct: 72 PCSGCTGCSSTT---FLPNASTTLGSLDCSEAQCSQVRGFSCP----ATGSSACLFNQSY 124
Query: 219 VDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSII 278
S D +T+ G F GC + +G G++GL RGP+S+I
Sbjct: 125 GGDSSLAATLVQDAITLANDVIPG------FTFGCINAVSGGSIPPQGLLGLGRGPISLI 178
Query: 279 SKTNISY---FFYCLHS--PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLT 333
S+ Y F YCL S Y +G + G K ++ TP++ P + Y++ LT
Sbjct: 179 SQAGAMYSGVFSYCLPSFKSYYFSGSLKLGP--VGQPKSIRTTPLLRNPHRPSLYYVNLT 236
Query: 334 GISVGGERLPLKASYF-----TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGK 388
G+SVG ++P+ + T T IDSGT+ITRF PVY A+R FRK++
Sbjct: 237 GVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSL 296
Query: 389 GIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV 437
G FDTC+ + P +T+HF G++L L + +L+ S V
Sbjct: 297 GA---FDTCF--AETNEAEAPAVTLHF-EGLNLVLPMENSLIHSSSGSV 339
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 106/364 (29%), Positives = 162/364 (44%), Gaps = 60/364 (16%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQC-KPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
Y + +AIG P ++ +LDTGS + WTQC PC C Q P + P++S T++ + C S
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 191 TCKILLE-WFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTI---QEVNGNGYF 244
C+ L W +CS + C Y +Y DG+ G AT+ T+ V G +
Sbjct: 152 MCQALQSPW------SRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAF- 204
Query: 245 ARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGK 304
GC N G + +SG++G+ RGP+S++S+ ++ P S
Sbjct: 205 -------GCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVT-------RPRRSC------- 243
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS------TEIDS 358
P T+P L GI+VG LP+ + F +L+ IDS
Sbjct: 244 -RARAAARGGGAPTTTSP---------LEGITVGDTLLPIDPAVF-RLTPMGDGGVIIDS 292
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGG 418
GT T + AL A R+ + + G C+ ++ + V VP++ +HF G
Sbjct: 293 GTTFTALEERAFVALARALASRV-RLPLASGAHLGLSLCFAAASPEAVEVPRLVLHF-DG 350
Query: 419 VDLELDVRGTLVVE--SVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
D+EL R + VVE S CLG S +LG++QQ+ + YD+ L F P
Sbjct: 351 ADMELR-RESYVVEDRSAGVACLGMV---SARGMSVLGSMQQQNTHILYDLERGILSFEP 406
Query: 477 GNCN 480
C
Sbjct: 407 AKCG 410
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 130/427 (30%), Positives = 194/427 (45%), Gaps = 52/427 (12%)
Query: 61 SLEVLGRYGPCSKLNQGKSRN-TPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFT 119
+L+V+ + PCS K + S+ ++ +D RL +S +K+I
Sbjct: 30 TLQVIHVFSPCSPFRPSKPLSWEESVLQMQAKDTTRLQFLDSLVARKSI----------- 78
Query: 120 FPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPS 177
P +G I+ + Y + IG P Q + L +DT + W C C C+ F P
Sbjct: 79 VPIASGRQIIQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCAST---LFAPE 135
Query: 178 KSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE 237
KS TF + C + CK + PN SS+ +++ Y S D +T+
Sbjct: 136 KSTTFKNVSCAAPECKQV-----PNPGCGVSSRN--FNLTYGSSSIAANL-VQDTITL-- 185
Query: 238 VNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS-- 292
Y F GC TG G++GL RGP+S++S+T Y F YCL S
Sbjct: 186 --ATDPVPSYTF--GCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFK 241
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYF- 349
+G + G K +KYTP++ P +S Y++ L I VG + +P A F
Sbjct: 242 SLNFSGSLRLGP--VAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFN 299
Query: 350 --TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV 407
T T DSGT+ TR APVY A+R FR+R+ K+ FDTCY++ +V
Sbjct: 300 PTTGAGTIFDSGTVFTRLVAPVYVAVRDEFRRRVGP-KLTVTSLGGFDTCYNVP----IV 354
Query: 408 VPKITIHFLGGVDLELDVRGTLVVESV-RQVCLGFALLPSDPNSIL--LGNVQQRGYEVH 464
VP IT F G+++ L L+ + CL A P + NS+L + N+QQ+ + V
Sbjct: 355 VPTITFIFT-GMNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVL 413
Query: 465 YDVAGRR 471
YDV R
Sbjct: 414 YDVPNSR 420
>gi|194701538|gb|ACF84853.1| unknown [Zea mays]
gi|194703714|gb|ACF85941.1| unknown [Zea mays]
Length = 208
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 80/215 (37%), Positives = 114/215 (53%), Gaps = 10/215 (4%)
Query: 268 MGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQ 324
MGL G S++S+T + F YCL S+G++T G TP++ + +
Sbjct: 1 MGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQV 60
Query: 325 SEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKY 384
FY + L I VGG +L + AS F+ T +DSGT+ITR P YSAL SAF+ MK+Y
Sbjct: 61 PTFYGVRLQAIRVGGRQLSIPASVFSA-GTVMDSGTVITRLPPTAYSALSSAFKAGMKQY 119
Query: 385 KMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALL 444
+ + DTC+D S +V +P + + F GG + LD G ++ CL FA
Sbjct: 120 PPAQ-PSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIILSN-----CLAFAGN 173
Query: 445 PSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
D + ++GNVQQR +EV YDV +GF G C
Sbjct: 174 SDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 208
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 119/440 (27%), Positives = 192/440 (43%), Gaps = 34/440 (7%)
Query: 73 KLNQGKSRNTPSLEEILRRDQQR---LHLKNSRRLQKAIPDNFKKTKAFT----FPAKTG 125
+ +G SL ++ +D R ++ + +R +P + +A + ++G
Sbjct: 84 RAAEGGRTREESLLDLAEKDAVRIETMYRRAARSGGGRMPASSSPRRALSERMVATVESG 143
Query: 126 I-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSK 184
+ V + EY + V +G P + +++DTGS + W QC PC+ C +QR P FDP+ S ++
Sbjct: 144 VAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRN 203
Query: 185 IPCNSTTCKILLEWFPPNGQDKCSSKE-----CPYDIAYVDGSGETGFWATDRMTIQEVN 239
+ C C + P + + CPY Y D S TG A + T+
Sbjct: 204 VTCGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTA 263
Query: 240 GNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS 296
+ GC N G +GA+G++GL RGP+S S+ Y F YCL
Sbjct: 264 PGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVDHGSD 323
Query: 297 TGY-ITFGKPDT----VNKKFVKYTPI----VTTPEQSEFYHITLTGISVGGERLPLKAS 347
G + FG+ D +KYT ++ FY++ L G+ VGGE L + +
Sbjct: 324 VGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGELLNISSD 383
Query: 348 YFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSA 402
+ T IDSGT ++ F P Y +R AF RM + + CY++S
Sbjct: 384 TWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLVPEFPVLSPCYNVSG 443
Query: 403 YKTVVVPKITIHFLGGVDLELDVRGTLVV---ESVRQVCLGFALLPSDPNSILLGNVQQR 459
+ VP++++ F G + + + +CL P SI +GN QQ+
Sbjct: 444 VERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVLGTPRTGMSI-IGNFQQQ 502
Query: 460 GYEVHYDVAGRRLGFGPGNC 479
+ V YD+ RLGF P C
Sbjct: 503 NFHVVYDLQNNRLGFAPRRC 522
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 112/382 (29%), Positives = 176/382 (46%), Gaps = 55/382 (14%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCK--PCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
+ + +G P Q V+++LDTGS ++W CK P +H FDP +S ++S IPC S T
Sbjct: 65 VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS------VFDPLRSSSYSPIPCTSPT 118
Query: 192 CKILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C+ F P DK K C I+Y D S G A+D I GN F
Sbjct: 119 CRTRTRDFSIPVSCDK--KKLCHAIISYADASSIEGNLASDTFHI----GNSAIPATIF- 171
Query: 251 LGCTD----NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPD 306
GC D +N+ + + +G++G++RG +S +++ + F YC+ S S+G + FG+
Sbjct: 172 -GCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQKFSYCI-SGQDSSGILLFGESS 229
Query: 307 TVNKKFVKYTPIVTTPEQSEF-----YHITLTGISVGGERLPLKASYFT-----KLSTEI 356
K +KYTP+V + Y + L GI V L L S + T +
Sbjct: 230 FSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMV 289
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED-------LFDTCYDLSAYKTVV-- 407
DSGT T PVY+AL++ F ++ K K +ED D CY + + +
Sbjct: 290 DSGTQFTFLLGPVYTALKNEFVRQTKASL--KVLEDPNFVFQGAMDLCYRVPLTRRTLPP 347
Query: 408 VPKITIHFLGGVDLELDVRGTLVVESVRQVCLG------FALLPSD---PNSILLGNVQQ 458
+P +T+ F G E+ V ++ V V G F S+ S ++G+ Q
Sbjct: 348 LPTVTLMFRGA---EMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQ 404
Query: 459 RGYEVHYDVAGRRLGFGPGNCN 480
+ + +D+A R+GF C+
Sbjct: 405 QNVWMEFDLAKSRVGFAEVRCD 426
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 113/364 (31%), Positives = 167/364 (45%), Gaps = 42/364 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + +++G P + + DTGS + W Q +PC CS FDP +S TF ++ C+S
Sbjct: 55 YVMDISVGTPGKRFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQSSTFREMDCSSQL 112
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYP-F 249
C L P S C Y Y GSGET G +A D +++ +G ++P F
Sbjct: 113 CTELPGSCEPG------SSACSYSYEY--GSGETEGEFARDTISLGTTSGGS--QKFPSF 162
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNI---SYFFYCLH--SPYGSTGYITFGK 304
+GC N+G +G G++GL +GPVS+ S+ + S F YCL + + + FG
Sbjct: 163 AVGCGMVNSG-FDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGP 221
Query: 305 PDTVNKKFVKYTPIVTTPEQS--EFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTII 362
++ ++ T I T P + +Y +T+ GI+V G+ + +T IDSGT +
Sbjct: 222 SAALHGTGIQSTKI-TPPSDTYPTYYLLTVNGIAVAGQTMGSPG------TTIIDSGTTL 274
Query: 363 TRFPAPVYSALRSAFRK-----RMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG 417
T P+ VY + S R+ MG D CYD S+ + P +TI G
Sbjct: 275 TYVPSGVYGRVLSRMESMVTLPRVDGSSMG------LDLCYDRSSNRNYKFPALTIRLAG 328
Query: 418 GVDLELDVRGTLVV-ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
LVV +S VCL P SI +GNV Q+GY + YD L F
Sbjct: 329 ATMTPPSSNYFLVVDDSGDTVCLAMGSAGGLPVSI-IGNVMQQGYHILYDRGSSELSFVQ 387
Query: 477 GNCN 480
C
Sbjct: 388 AKCE 391
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 115/398 (28%), Positives = 169/398 (42%), Gaps = 25/398 (6%)
Query: 96 LHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSG 154
+ N R K IP N + A+T + V +Y + ++IG P +DTGS
Sbjct: 22 IEAHNGRFTVKLIPRNSSQVLFNRITAQTPVSVHHYDYLMELSIGTPPVKTYAQVDTGSD 81
Query: 155 ITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPY 214
+ W QC PC +C +Q +P FDP S T+S I S +C L Q+ C+ Y
Sbjct: 82 LIWLQCIPCTNCYKQLNPMFDPQSSSTYSNIAYGSESCSKLYSTSCSPDQNNCN-----Y 136
Query: 215 DIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGAS-GIMGLDRG 273
+Y D S G A + +T+ G A + GC NN G N GI+GL RG
Sbjct: 137 TYSYEDDSITEGVLAQETLTLTSTTGKP-VALKGVIFGCGHNNNGVFNDKEMGIIGLGRG 195
Query: 274 PVSIISKTNISY----FFYCL---HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSE 326
P+S++S+ S+ F CL H+ T ++FGK V V TP+V+
Sbjct: 196 PLSLVSQIGSSFGGKMFSQCLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVSKNTHQA 255
Query: 327 FYHITLTGISVGGERLPLKASY----FTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMK 382
FY +TL GISV LP TK + IDSGT T P Y L R ++
Sbjct: 256 FYFVTLLGISVEDINLPFNDGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVA 315
Query: 383 KYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFA 442
+ + CY + +T HF G ++ + T + V+ FA
Sbjct: 316 LDPIPIDPTLGYQLCYRTPT--NLKGTTLTAHFEGA---DVLLTPTQIFIPVQDGIFCFA 370
Query: 443 LLPSDPNSI-LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ N + GN Q Y + +D+ + + F +C
Sbjct: 371 FTSTFSNEYGIYGNHAQSNYLIGFDLEKQLVSFKATDC 408
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 101/381 (26%), Positives = 174/381 (45%), Gaps = 44/381 (11%)
Query: 126 IVAADE-YYIVVAIGKPKQYVSLLLDTGSGITWTQCK----PCIHCSQQRDPFFDPSKSK 180
I+ +D+ + + V I +P++ L++DTGS + WTQCK P +DP +S
Sbjct: 9 ILLSDQGHSLTVGIVQPRK---LIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESS 65
Query: 181 TFSKIPCNSTTCKILLEWFPPNGQ---DKCSSK-ECPYDIAYVDGSGETGFWATDRMTIQ 236
TF+ +PC+ C+ GQ C+SK C Y+ Y + G A++ T
Sbjct: 66 TFAFLPCSDRLCQ--------EGQFSFKNCTSKNRCVYEDVY-GSAAAVGVLASETFTF- 115
Query: 237 EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGS 296
G GC + G GA+GI+GL +S+I++ I F YCL +P+
Sbjct: 116 ---GARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYCL-TPFAD 171
Query: 297 --TGYITFGKPDTVNK----KFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT 350
T + FG +++ + ++ T IV+ P ++ +Y++ L GIS+G +RL + A+
Sbjct: 172 KKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLA 231
Query: 351 KL-----STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL----- 400
T +DSG+ + + A++ A ++ + +ED ++ C+ L
Sbjct: 232 MRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVED-YELCFVLPRRTA 290
Query: 401 -SAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQR 459
+A + V VP + +HF GG + L +CL ++GNVQQ+
Sbjct: 291 AAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQ 350
Query: 460 GYEVHYDVAGRRLGFGPGNCN 480
V +DV + F P C+
Sbjct: 351 NMHVLFDVQHHKFSFAPTQCD 371
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 107/361 (29%), Positives = 169/361 (46%), Gaps = 34/361 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + +G P Q + L +DT + W C C C F+P+ S ++ +PC S
Sbjct: 54 YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSP--FNPAASASYRPVPCGSPQ 111
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
C + PN ++K C + ++Y D S + + D + V G+ A +
Sbjct: 112 CVLA-----PNPSCSPNAKSCGFSLSYADSSLQAAL-SQDTL---AVAGDVVKA---YTF 159
Query: 252 GCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS--PYGSTGYITFGKPD 306
GC TG G++GL RGP+S +S+T Y F YCL S +G + G+
Sbjct: 160 GCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLGR-- 217
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-----TKLSTEIDSGTI 361
+ +K TP++ P +S Y++ +TGI VG + + + AS T T +DSGT+
Sbjct: 218 NGQPRRIKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTM 277
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
TR APVY ALR R+R+ FDTCY+ TV P +T+ F G+ +
Sbjct: 278 FTRLVAPVYLALRDEVRRRVGAGAAAVSSLGGFDTCYN----TTVAWPPVTLLF-DGMQV 332
Query: 422 ELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRGYEVHYDVAGRRLGFGPGN 478
L ++ + CL A P N++L + ++QQ+ + V +DV R+GF +
Sbjct: 333 TLPEENVVIHTTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARES 392
Query: 479 C 479
C
Sbjct: 393 C 393
>gi|115448347|ref|NP_001047953.1| Os02g0720500 [Oryza sativa Japonica Group]
gi|113537484|dbj|BAF09867.1| Os02g0720500, partial [Oryza sativa Japonica Group]
Length = 172
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 73/182 (40%), Positives = 97/182 (53%), Gaps = 11/182 (6%)
Query: 299 YITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDS 358
YI+ G P + TP++T +Y + L GISVGG+ L + AS F +D+
Sbjct: 1 YISLGGPSSTAG--FSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFAS-GAVVDT 57
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKG-IEDLFDTCYDLSAYKTVVVPKITIHFLG 417
GT++TR P YSALRSAFR M Y + DTCYD + Y TV +P I+I F G
Sbjct: 58 GTVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGG 117
Query: 418 GVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
G ++L G L CL FA D + +LGNVQQR +EV +D G +GF P
Sbjct: 118 GAAMDLGTSGILT-----SGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPA 170
Query: 478 NC 479
+C
Sbjct: 171 SC 172
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 109/375 (29%), Positives = 174/375 (46%), Gaps = 43/375 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
YY V +G P + ++ +DTGS + W C C C + + FFDP S + S +
Sbjct: 84 YYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVS 143
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGN--GY 243
C+ C + + CS C Y Y DGSG +GF+ +D M+ V +
Sbjct: 144 CSDRRCYSNFQ-----TESGCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAI 198
Query: 244 FARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPY 294
+ PF+ GC++ TGD + GI GL +G +S+IS+ + F +CL
Sbjct: 199 NSSAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDK 258
Query: 295 GSTGYITFG---KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTK 351
G + G +PDTV YTP+V P Q Y++ L I+V G+ LP+ S FT
Sbjct: 259 SGGGIMVLGQIKRPDTV------YTPLV--PSQPH-YNVNLQSIAVNGQILPIDPSVFTI 309
Query: 352 LS---TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVV 408
+ T ID+GT + P YS A + +Y G+ I C++++A V
Sbjct: 310 ATGDGTIIDTGTTLAYLPDEAYSPFIQAIANAVSQY--GRPITYESYQCFEITAGDVDVF 367
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQV---CLGFALLPSDPNSILLGNVQQRGYEVHY 465
P++++ F GG + L L + S C+GF + S +LG++ + V Y
Sbjct: 368 PEVSLSFAGGASMVLRPHAYLQIFSSSGSSIWCIGFQRM-SHRRITILGDLVLKDKVVVY 426
Query: 466 DVAGRRLGFGPGNCN 480
D+ +R+G+ +C+
Sbjct: 427 DLVRQRIGWAEYDCS 441
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 114/381 (29%), Positives = 177/381 (46%), Gaps = 55/381 (14%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCK--PCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
+ + +G P Q V+++LDTGS ++W CK P +H FDP +S ++S IPC S T
Sbjct: 58 VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS------VFDPLRSSSYSPIPCTSPT 111
Query: 192 CKILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C+ F P DK K C I+Y D S G A+D I GN F
Sbjct: 112 CRTRTRDFSIPVSCDK--KKLCHAIISYADASSIEGNLASDTFHI----GNSAIPATIF- 164
Query: 251 LGCTD----NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPD 306
GC D +N+ + + +G++G++RG +S +++ + F YC+ S S+G + FG+
Sbjct: 165 -GCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQKFSYCI-SGQDSSGILLFGESS 222
Query: 307 TVNKKFVKYTPIV--TTPE---QSEFYHITLTGISVGGERLPLKASYFT-----KLSTEI 356
K +KYTP+V +TP Y + L GI V L L S + T +
Sbjct: 223 FSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMV 282
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED-------LFDTCYDLSAYKTVV-- 407
DSGT T PVY+AL++ F ++ K K +ED D CY + + +
Sbjct: 283 DSGTQFTFLLGPVYTALKNEFVRQTKASL--KVLEDPNFVFQGAMDLCYRVPLTRRTLPP 340
Query: 408 VPKITIHFLGGVDLELDVRGTLVVESVRQVCLG------FALLPSD---PNSILLGNVQQ 458
+P +T+ F G E+ V ++ V V G F S+ S ++G+ Q
Sbjct: 341 LPTVTLMFRGA---EMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQ 397
Query: 459 RGYEVHYDVAGRRLGFGPGNC 479
+ + +D+A R+GF C
Sbjct: 398 QNVWMEFDLAKSRVGFAEVRC 418
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 108/369 (29%), Positives = 161/369 (43%), Gaps = 32/369 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH--CSQQRDPFFDPSKSKTFSKIPCN 188
+Y IG P Q L+DTGS + WTQC C+ C++Q P+++ S S TF+ +PC
Sbjct: 89 QYVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCA 148
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
+ C + S Y V G+ T +A T + G F R
Sbjct: 149 ARICAANDDIIHFCDLAAGCSVIAGYGAGVVAGTLGTEAFAFQSGTAELAFGCVTFTRI- 207
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY----GSTGYITFGK 304
G +GASG++GL RG +S++S+T + F YCL +PY G+TG++ G
Sbjct: 208 --------VQGALHGASGLIGLGRGRLSLVSQTGATKFSYCL-TPYFHNNGATGHLFVGA 258
Query: 305 PDTVNKKF-VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-------- 355
++ V T V P+ S FY++ L G++VG RLP+ A+ F
Sbjct: 259 SASLGGHGDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGV 318
Query: 356 -IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV--VVPKIT 412
IDSG+ T Y AL S R+ + + D A + V VVP +
Sbjct: 319 IIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPDA--DDGALCVARRDVGRVVPAVV 376
Query: 413 IHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRR 471
HF GG D+ + V+ + P S+ +GN QQ+ V YD+A
Sbjct: 377 FHFRGGADMAVPAESYWAPVDKAAACMAIASAGPYRRQSV-IGNYQQQNMRVLYDLANGD 435
Query: 472 LGFGPGNCN 480
F P +C+
Sbjct: 436 FSFQPADCS 444
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 125/411 (30%), Positives = 185/411 (45%), Gaps = 51/411 (12%)
Query: 84 SLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTG--IVAADEYYIVVAIGKP 141
S+ ++ +D RL +S +K++ P +G I+ + Y + IG P
Sbjct: 39 SVLQMQAKDTTRLQFLDSLVARKSV-----------VPIASGRQIIQSPTYIVRAKIGTP 87
Query: 142 KQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPP 201
Q + L +DT + W C C C+ F P KS TF + C + CK + P
Sbjct: 88 PQTLLLAMDTSNDAAWIPCTACDGCAST---LFAPEKSTTFKNVSCAAPECKQV-----P 139
Query: 202 NGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ 261
N C C +++ Y S D +T+ Y F GC TG
Sbjct: 140 N--PGCGVSSCNFNLTY-GSSSIAANLVQDTITL----ATDPVPSYTF--GCVSKTTGTS 190
Query: 262 NGASGIMGLDRGPVSIISKTNISY---FFYCLHS--PYGSTGYITFGKPDTVNKKFVKYT 316
G++GL RGP+S++S+T Y F YCL S +G + G K +KYT
Sbjct: 191 APPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGP--VAQPKRIKYT 248
Query: 317 PIVTTPEQSEFYHITLTGISVGGE--RLPLKASYF---TKLSTEIDSGTIITRFPAPVYS 371
P++ P +S Y++ L I VG + +P A F T T DSGT+ TR APVY
Sbjct: 249 PLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFTRLVAPVYV 308
Query: 372 ALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVV 431
A+R FR+R+ K+ FDTCY++ +VVP IT F G+++ L L+
Sbjct: 309 AVRDEFRRRVGP-KLTVTSLGGFDTCYNVP----IVVPTITFIFT-GMNVTLPQDNILIH 362
Query: 432 ESV-RQVCLGFALLPSDPNSIL--LGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ CL A P + NS+L + N+QQ+ + V YDV R+G C
Sbjct: 363 STAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSRVGVARELC 413
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 108/375 (28%), Positives = 175/375 (46%), Gaps = 43/375 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
YY V +G P + ++ +DTGS + W C C C + + FFDP S + S +
Sbjct: 84 YYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVS 143
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGN--GY 243
C+ C + + CS C Y Y DGSG +G++ +D M+ V +
Sbjct: 144 CSDRRCYSNFQ-----TESGCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAI 198
Query: 244 FARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPY 294
+ PF+ GC++ +GD + GI GL +G +S+IS+ + F +CL
Sbjct: 199 NSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDK 258
Query: 295 GSTGYITFG---KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTK 351
G + G +PDTV YTP+V P Q Y++ L I+V G+ LP+ S FT
Sbjct: 259 SGGGIMVLGQIKRPDTV------YTPLV--PSQPH-YNVNLQSIAVNGQILPIDPSVFTI 309
Query: 352 LS---TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVV 408
+ T ID+GT + P YS A + +Y G+ I C++++A V
Sbjct: 310 ATGDGTIIDTGTTLAYLPDEAYSPFIQAVANAVSQY--GRPITYESYQCFEITAGDVDVF 367
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQV---CLGFALLPSDPNSILLGNVQQRGYEVHY 465
P++++ F GG + L R L + S C+GF + S +LG++ + V Y
Sbjct: 368 PQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRM-SHRRITILGDLVLKDKVVVY 426
Query: 466 DVAGRRLGFGPGNCN 480
D+ +R+G+ +C+
Sbjct: 427 DLVRQRIGWAEYDCS 441
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 129/441 (29%), Positives = 191/441 (43%), Gaps = 75/441 (17%)
Query: 50 RTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIP 109
AL +G G S++++ R P S L + RR R+
Sbjct: 23 EVALARG-GGFSVDLIHRDSPHSPFFDPSKTQAERLTDAFRRSVSRV------------- 68
Query: 110 DNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQ 168
F+ T + ++ IV +A EY + + IG P V ++DTGS +TWTQC+PC HC +
Sbjct: 69 GRFRPTAMTSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYK 128
Query: 169 QRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETG 226
Q P FDP S T+ C ++ C L G+D+ SKE C + +Y DGS G
Sbjct: 129 QVVPLFDPKNSSTYRDSSCGTSFCLAL-------GKDRSCSKEKKCTFRYSYADGSFTGG 181
Query: 227 FWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTGD-QNGASGIMGLDRGPVSIIS--KTN 282
A++ +T+ G +P F GC ++ G +SGI+GL G +S+IS K+
Sbjct: 182 NLASETLTVDSTAGKP--VSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKST 239
Query: 283 ISYFF-YCL---HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVG 338
I+ F YCL + + I FG V+ TP+
Sbjct: 240 INGLFSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPL-------------------- 279
Query: 339 GERLPLKASYFTKLSTE-----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED- 392
RLP K Y K E +DSGT T P YS L + +K GK + D
Sbjct: 280 --RLPYKG-YSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIK----GKRVRDP 332
Query: 393 --LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNS 450
+F CY+ +A + P IT HF ++EL T + VC F + P+
Sbjct: 333 NGIFSLCYNTTA--EINAPIITAHF-KDANVELQPLNTFMRMQEDLVC--FTVAPTSDIG 387
Query: 451 ILLGNVQQRGYEVHYDVAGRR 471
+ LGN+ Q + V +D+ +R
Sbjct: 388 V-LGNLAQVNFLVGFDLRKKR 407
>gi|125555046|gb|EAZ00652.1| hypothetical protein OsI_22673 [Oryza sativa Indica Group]
Length = 340
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 90/288 (31%), Positives = 136/288 (47%), Gaps = 37/288 (12%)
Query: 161 KPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVD 220
PC+ D FDPS+S +F+ IPC S C + +C+ CP+ I + +
Sbjct: 21 APCVG-GAPCDVAFDPSRSSSFAAIPCGSPECAV-----------ECTGASCPFTIQFGN 68
Query: 221 GSGETGFWATDRMTIQEVNGNGYFARYPFLLGC----TDNNTGDQNGASGIMGLDRGPVS 276
+ G D +T+ + FA + F GC D +T D GA G++ L R S
Sbjct: 69 VTVANGTLVRDTLTLSP---SATFAGFTF--GCIEVGADADTFD--GAVGLIDLSRSSHS 121
Query: 277 IISKT--------NISYFFYCLHSPYG--STGYITFG--KPDTVNKKFVKYTPIVTTPEQ 324
+ S+ + F YCL S S G+++ G +P+ +KY P+ + P
Sbjct: 122 LASRVISNGATTTTTAAFSYCLPSLSSTRSRGFLSIGASRPEYSGGD-IKYAPMSSNPNH 180
Query: 325 SEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKY 384
Y + L GISVGGE LP+ + T +++ T T Y+ALR AFR M +Y
Sbjct: 181 PNSYFVDLVGISVGGEDLPVPPAVLAAHGTLLEAATEFTFLAPAAYAALRDAFRNDMAQY 240
Query: 385 KMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVE 432
+ DTCY+L+ ++ VP + + F GG +LELDVR T+ E
Sbjct: 241 PAAPPFR-VLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQTMYFE 287
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 119/442 (26%), Positives = 185/442 (41%), Gaps = 52/442 (11%)
Query: 84 SLEEILRRDQQRLHL------KNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVV 136
SL ++ R D+QR+ + +R AF P +G +Y++
Sbjct: 42 SLADLARSDRQRMAFIASHGRRRTRETAAGSSSASSAAAAFAMPLTSGAYTGIGQYFVRF 101
Query: 137 AIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP---------FFDPSKSKTFSKIPC 187
+G P Q L+ DTGS +TW +C+ + P F P S+T++ I C
Sbjct: 102 RVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTWAPISC 161
Query: 188 NSTTCKILLEWFPPNGQDKCSS--KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
S TC L P C + C YD Y DGS G T+ TI A
Sbjct: 162 ASDTCTKSL----PFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGREERKA 217
Query: 246 RYP-FLLGCTDNNTGDQNGAS-GIMGLDRGPVSIISKTNISY---FFYCL--H-SPYGST 297
+ +LGC+ + TG AS G++ L +S S + F YCL H SP +T
Sbjct: 218 KLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFSYCLVDHLSPRNAT 277
Query: 298 GYITFGKPDTVNK------------KFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLK 345
Y+TFG V+ + TP++ FY ++L ISV GE L +
Sbjct: 278 SYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAGEFLKIP 337
Query: 346 ASYFTKLS---TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSA 402
+ + + +DSGT +T P Y A+ +A K + + + D F+ CY+ ++
Sbjct: 338 RAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLA--GLPRVTMDPFEYCYNWTS 395
Query: 403 YK----TVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQ 458
V VPK+ +HF G LE + ++ + C+G P P ++GN+ Q
Sbjct: 396 PSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPW-PGISVIGNILQ 454
Query: 459 RGYEVHYDVAGRRLGFGPGNCN 480
+ + +D+ RRL F C
Sbjct: 455 QEHLWEFDIKNRRLKFQRSRCT 476
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 107/356 (30%), Positives = 156/356 (43%), Gaps = 44/356 (12%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + +IG P + L+DTG+ W QCKPC C Q P F PSKS T+ IPC S
Sbjct: 90 YVMSYSIGTPPFQLYSLIDTGNDNIWFQCKPCKPCLNQTSPMFHPSKSSTYKTIPCTSPI 149
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
CK + + + D +T+ NG + ++
Sbjct: 150 CK----------------------------NADGHYLGVDTLTLNSNNGTP-ISFKNIVI 180
Query: 252 GCTDNNTGDQNG-ASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGSTGYITFGK 304
GC N G G SG +GL RGP+S IS+ N S F YCL S + + FG
Sbjct: 181 GCGHRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCLVPLFSKENVSSKLHFGD 240
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITR 364
TV+ TPI ++ Y ++L SVG + L+ S + ++ IDSGT +T
Sbjct: 241 KSTVSGLGTVSTPI----KEENGYFVSLEAFSVGDHIIKLENSD-NRGNSIIDSGTTMTI 295
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV-VPKITIHFLGGVDLEL 423
P VYS L S M K K K F+ CY ++ + V IT HF G ++ L
Sbjct: 296 LPKDVYSRLESVVLD-MVKLKRVKDPSQQFNLCYQTTSTTLLTKVLIITAHF-SGSEVHL 353
Query: 424 DVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ T + +C F + + + GNV Q+ + V +D+ + + F P +C
Sbjct: 354 NALNTFYPITDEVICFAFVSGGNFSSLAIFGNVVQQNFLVGFDLNKKTISFKPTDC 409
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 107/377 (28%), Positives = 170/377 (45%), Gaps = 42/377 (11%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+ +A+G P Q V+++LDTGS ++W C P ++ F P S TF+ +PC S C+
Sbjct: 87 VSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCASAQCR 146
Query: 194 ILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGC 253
PP D SS+ C ++Y DGS G ATD + G+G R F GC
Sbjct: 147 SRDLPSPP-ACDGASSR-CSVSLSYADGSSSDGALATDVFAV----GSGPPLRAAF--GC 198
Query: 254 TD---NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNK 310
+++ D ++G++G++RG +S +S+ + F YC+ S G + G D
Sbjct: 199 MSSAFDSSPDGVASAGLLGMNRGALSFVSQASTRRFSYCI-SDRDDAGVLLLGHSDLPTF 257
Query: 311 KFVKYTPIVTTPEQSEF-----YHITLTGISVGGERLPLKASYFT-----KLSTEIDSGT 360
+ YTP+ + Y + L GI VGG+ LP+ AS T +DSGT
Sbjct: 258 LPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDSGT 317
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGK-----GIEDLFDTCYDLSAYK---TVVVPKIT 412
T YSAL++ F ++ + ++ FDTC+ + + T +P +T
Sbjct: 318 QFTFLLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARLPGVT 377
Query: 413 IHFLGGVDLELDVRGTLVVESVRQV--------CLGFALLPSDP-NSILLGNVQQRGYEV 463
+ F G E+ V G ++ V CL F P + ++G+ Q V
Sbjct: 378 LLFNGA---EMAVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPIMAYVIGHHHQMNVWV 434
Query: 464 HYDVAGRRLGFGPGNCN 480
YD+ R+G P C+
Sbjct: 435 EYDLERGRVGLAPVRCD 451
>gi|298204765|emb|CBI25263.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 87/271 (32%), Positives = 120/271 (44%), Gaps = 48/271 (17%)
Query: 212 CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLD 271
C Y I Y DGS G +++ G F+ GC NN G G SG+MGL
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKF------GTILVKDFIFGCGRNNKGLFGGVSGLMGLG 186
Query: 272 RGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHIT 331
R +S+IS+T+ P+ FY I
Sbjct: 187 RSDLSLISQTS-------------------------------------ENPQLYNFYFIN 209
Query: 332 LTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE 391
LTGIS+GG + L+A +DSGT+ITR P +Y AL++ F K+ +
Sbjct: 210 LTGISIGG--VALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFPPAPAF- 266
Query: 392 DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGT--LVVESVRQVCLGFALLPSDPN 449
+ DTC++LSAY+ V +P I +HF G +L +DV G V QVCL A L
Sbjct: 267 SILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALASLEYQDE 326
Query: 450 SILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+LGN QQ+ V YD ++GF C+
Sbjct: 327 VAILGNYQQKNLRVIYDTKETKVGFALETCS 357
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 119/460 (25%), Positives = 207/460 (45%), Gaps = 43/460 (9%)
Query: 31 HSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILR 90
H +++++ L+ + + +++ + S++++ R+ P S L + T ++
Sbjct: 2 HHFVLTLFFLVSTMLVDASKSLM-----GFSIDLIPRHSPISPLYNSQMTQTELVKSAAL 56
Query: 91 RDQQRLHLKNSRRLQKAIPDNF-KKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLL 149
R R S+R+ NF + P T I EY + ++G P +
Sbjct: 57 RSITR-----SKRV------NFIGQISPPLSPIITPIPDHGEYLMRFSLGTPSVERLAIF 105
Query: 150 DTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS 209
DTGS ++W QC PC C Q P FDP++S T+ +PC S C + FP N ++ SS
Sbjct: 106 DTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCESQPCTL----FPQNQRECGSS 161
Query: 210 KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGC---TDNNTGDQNGAS 265
K+C Y Y S G D ++ A +P + GC ++ A+
Sbjct: 162 KQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSVFGCAFYSNFTFKISTKAN 221
Query: 266 GIMGLDRGPVSIISKT--NISY-FFYCLHSPYG--STGYITFGKPDTVNKKFVKYTPIVT 320
G +GL GP+S+ S+ I + F YC+ P+ STG + FG N+ V TP +
Sbjct: 222 GFVGLGPGPLSLASQLGDQIGHKFSYCM-VPFSSTSTGKLKFGSMAPTNE--VVSTPFMI 278
Query: 321 TPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKR 380
P +Y + L GI+VG +++ L + IDS I+T +Y+ S+ ++
Sbjct: 279 NPSYPSYYVLNLEGITVGQKKV-LTGQIGGNII--IDSVPILTHLEQGIYTDFISSVKEA 335
Query: 381 MKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLG 440
+ ++ + F+ C + + P+ HF G D+ L + + VC+
Sbjct: 336 I-NVEVAEDAPTPFEYC--VRNPTNLNFPEFVFHFTGA-DVVLGPKNMFIALDNNLVCM- 390
Query: 441 FALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
++PS SI GN Q ++V YD+ +++ F P NC+
Sbjct: 391 -TVVPSKGISI-FGNWAQVNFQVEYDLGEKKVSFAPTNCS 428
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 120/425 (28%), Positives = 185/425 (43%), Gaps = 36/425 (8%)
Query: 70 PCSKLNQGKSRNTPSLEEILRRDQQRL-HLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVA 128
P S +T +E + R + RL +L +L + DN + T +
Sbjct: 18 PLSPFYNHTMTDTARIEATVHRSRSRLNYLYYINKLSENALDNDVSL------SPTLVNE 71
Query: 129 ADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC-IHCSQQRDPF---FDPSKSKTFSK 184
EY + IG P V LDT +G+ W QC C C ++ F SKS T+
Sbjct: 72 GGEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLSSKSFTYEM 131
Query: 185 IPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF 244
PC S C L + N DK C Y + Y D +G ++D +G
Sbjct: 132 EPCGSNFCNSLTGFQTCNSSDKW----CKYRLVYGDNKATSGILSSDSFGFD--TSDGML 185
Query: 245 ARYPFL-LGCTDNN-TGDQNGASGIMGLDRGPVSIISKTNISYFFYCL--HSPYGSTGYI 300
FL GC++ TGD+ +G +GL++ P+S+IS+ I F YCL + GST +
Sbjct: 186 VDVGFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQLGIKKFSYCLVPFNNLGSTSKM 245
Query: 301 TFGK-PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKA---SYFTKLSTEI 356
FG P T + TP++ S+ Y++ + GIS+G + Y + I
Sbjct: 246 YFGSLPVTSGGQ----TPLLY--PNSDAYYVKVLGISIGNDEPHFDGVFDVYEVRDGWII 299
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL-SAYKTVVVPKITIHF 415
D+G + + +L + F + ++ F+ C++L +A P +T+HF
Sbjct: 300 DTGITYSSLETDAFDSLLAKFLTLKDFPQRKDDPKERFELCFELQNANDLESFPDVTVHF 359
Query: 416 LGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
G DL L+V T V +E CL ALL S +LGN Q + Y V YD+ + + F
Sbjct: 360 -DGADLILNVESTFVKIEDDGIFCL--ALLRSGSPVSILGNFQLQNYHVGYDLEAQVISF 416
Query: 475 GPGNC 479
P +C
Sbjct: 417 APVDC 421
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 113/436 (25%), Positives = 190/436 (43%), Gaps = 41/436 (9%)
Query: 69 GPCSKLNQGKSRNTPSLEEILRRDQQR-----LHLKNSRRLQKAIPDNFKKTKAFTFPAK 123
G ++L+ + S+ R D++R L + R ++ + + A + P
Sbjct: 22 GKSARLDLFPAAPGASVTARARGDRRRHAYISAQLPSRRGGRQRVAAEVASSSAVSLPMS 81
Query: 124 TGIVAA-DEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTF 182
+G A +Y++ V +G P Q +L+ DTGS +TW +C + F P SK++
Sbjct: 82 SGAYAGTGQYFVKVLVGTPAQEFTLVADTGSELTWVKCA---GGASPPGLVFRPEASKSW 138
Query: 183 SKIPCNSTTCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGS-GETGFWATDRMTIQEVN 239
+ +PC+S TCK+ + P CSS C YD Y +GS G G TD TI +
Sbjct: 139 APVPCSSDTCKLDV----PFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATI-ALP 193
Query: 240 GNGYFARYPFLLGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISY---FFYCL--H-S 292
G +LGC+ + G G++ L +S S+ + F YCL H +
Sbjct: 194 GGKVAQLQDVVLGCSSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCLVDHLA 253
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL 352
P +TGY+ FG P V + T + P FY + + + V G+ L + A +
Sbjct: 254 PRNATGYLAFG-PGQVPRTPATQTKLFLDPAM-PFYGVKVDAVHVAGQALDIPAEVWDPK 311
Query: 353 STEI--DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL----FDTCYDLSAYKTV 406
S + DSGT +T P Y A+ +A K + G+ + F+ CY+ +A +
Sbjct: 312 SGGVILDSGTTLTVLATPAYKAVVAALTKLL------AGVPKVDFPPFEHCYNWTAPRPG 365
Query: 407 V--VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVH 464
+PK+ + F G LE + ++ C+G P ++GN+ Q+ +
Sbjct: 366 APEIPKLAVQFTGCARLEPPAKSYVIDVKPGVKCIGLQ-EGEWPGVSVIGNIMQQEHLWE 424
Query: 465 YDVAGRRLGFGPGNCN 480
+D+ + F P C
Sbjct: 425 FDLKNMEVRFMPSTCT 440
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 124/449 (27%), Positives = 189/449 (42%), Gaps = 77/449 (17%)
Query: 73 KLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEY 132
+L ++ ++EE +RR +R H RRL T P G +Y
Sbjct: 26 ELTHVDAKEHYTVEERVRRATERTH----RRL--------ASMGGVTAPIHWG--GQSQY 71
Query: 133 YIVVAIGKPKQYVSLLLDTGSGITWTQCKPC-IHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
IG P Q ++DTGS + WTQC C C +Q P++DPS+S+ + CN
Sbjct: 72 IAEYLIGDPPQRAEAIIDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVGCNDAA 131
Query: 192 CKILLEWFPPNGQDKCSS--KECPYDIAYVDGSGE-TGFWATDRMTIQEVNGNGYFARYP 248
C + + +C S K C Y G+G G AT+ +T Q
Sbjct: 132 CAL-------GSETQCLSDNKTCAVVTGY--GAGNIAGTLATENLTFQS-------ETVS 175
Query: 249 FLLGC---TDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST---GYITF 302
+ GC T + G NGASGI+GL RG +S+ S+ + F YCL + T ++
Sbjct: 176 LVFGCIVVTKLSPGSLNGASGIIGLGRGKLSLPSQLGDTRFSYCLTPYFEDTIEPSHMVV 235
Query: 303 GKPDTVNKKFVKYTPIVTTP--------EQSEFYHITLTGISVGGERLPLKASYF----- 349
G + TP+ T P S FY++ LTGI+ G +L + ++ F
Sbjct: 236 GASAGLINGSASSTPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQV 295
Query: 350 ---TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL-----FDTCYDLS 401
T IDSG +T Y ALR+ +++ ++ L FD C L
Sbjct: 296 APGMWTGTFIDSGAPLTSLVDVAYQALRAELARQLGAAL----VQPLAGTTGFDLCVALK 351
Query: 402 AYKTVVVPKITIHFLGGVDLELDVRGTLV-----VESVRQVCLGFA-----LLPSDPNSI 451
+ +VP + +HF GG D+ V+S + F+ LP + ++
Sbjct: 352 DAER-LVPPLVLHFGGGSGTGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETTV 410
Query: 452 LLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+GN Q+ V YD+AG L F P +C+
Sbjct: 411 -IGNYMQQNMHVLYDLAGGVLSFQPADCS 438
>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
Length = 519
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 127/486 (26%), Positives = 197/486 (40%), Gaps = 93/486 (19%)
Query: 80 RNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKT-------------KAFTFPAKTGI 126
R+ +E+ R DQ+R S ++A K +AF P +G
Sbjct: 41 RDEAPWDEVARMDQERTAFICSHARRRATEAGDAKHKAKAKAKGAPAADEAFAMPLSSGA 100
Query: 127 -VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH-------------------- 165
+Y++ +G P + L+ DTGS +TW +C H
Sbjct: 101 YTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGYAAPASNDSSTS 160
Query: 166 -------CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS--KECPYDI 216
S F P +S+T++ IPC+S TC L P C + C YD
Sbjct: 161 SLSAAAASSSSHARVFRPDRSRTWAPIPCSSDTCTASL----PFSLAACPTPGSPCAYDY 216
Query: 217 AYVDGSGETGFWATDRMTIQEVNGNGYFARYP------FLLGCTDNNTGDQNGAS-GIMG 269
Y DGS G TD TI ++G G + +LGCT + TGD AS G++
Sbjct: 217 RYKDGSAARGTVGTDSATI-ALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLASDGVLS 275
Query: 270 LDRGPVSIISKTNISY---FFYCL--H-SPYGSTGYITFGKPDTVNKK------------ 311
L +S S+ + F YCL H +P +T Y+TFG V+
Sbjct: 276 LGYSNISFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSSPPSKTACAGGGS 335
Query: 312 ---------FVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYFTKLSTEI-DSG 359
+ TP++ FY +T+ GISV GE R+P K I DSG
Sbjct: 336 PAAAPPGPGGARQTPLLLDHRMRPFYAVTVNGISVDGELLRIPRLVWDVAKGGGAILDSG 395
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYK-----TVVVPKITIH 414
T +T +P Y A+ +A K++ + + D FD CY+ ++ TV +P++ +H
Sbjct: 396 TSLTVLVSPAYRAVVAALNKKLA--GLPRVTMDPFDYCYNWTSPSTGEDLTVAMPELAVH 453
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
F G L+ + ++ + C+G P ++GN+ Q+ + +D+ RRL F
Sbjct: 454 FAGSARLQPPAKSYVIDAAPGVKCIGLQ-EGEWPGVSVIGNILQQEHLWEFDLKNRRLRF 512
Query: 475 GPGNCN 480
C
Sbjct: 513 KRSRCT 518
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 128/422 (30%), Positives = 199/422 (47%), Gaps = 39/422 (9%)
Query: 87 EILRRDQQRLHLKN-----SRRLQKAIPDNFKKTKAFTFPA--KTGIVA-ADEYYIVVAI 138
E++ RD L N S RL A + +++ FT ++G+++ EY++ ++I
Sbjct: 32 ELIHRDSPHSPLYNPHHTVSDRLNAAFLRSISRSRRFTTKTDLQSGLISNGGEYFMSISI 91
Query: 139 GKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEW 198
G P V + DTGS +TW QCKPC C +Q P FD KS T+ C+S TC+ L E
Sbjct: 92 GTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKKSSTYKTESCDSKTCQALSEH 151
Query: 199 FPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNN 257
G D+ S C Y +Y D S G AT+ TI + +G +P + GC NN
Sbjct: 152 --EEGCDE-SKDICKYRYSYGDNSFTKGDVATE--TISIDSSSGSSVSFPGTVFGCGYNN 206
Query: 258 TGD-QNGASGIMGLDRGPVSIISKTNISY---FFYCL-HSPYGSTG--YITFGK---PDT 307
G + SGI+GL GP+S++S+ S F YCL H+ + G I G P
Sbjct: 207 GGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAATTNGTSVINLGTNSIPSN 266
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPL---------KASYFTKLSTEIDS 358
+K T + + +Y +TL ++VG +LP K+S T + IDS
Sbjct: 267 PSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSKRTG-NIIIDS 325
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGG 418
GT +T + Y +A + + K + L C+ S K + +P IT+HF
Sbjct: 326 GTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFK-SGDKEIGLPAITMHFT-N 383
Query: 419 VDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGN 478
D++L V + VCL +++P+ +I GN+ Q + V YD+ + + F +
Sbjct: 384 ADVKLSPINAFVKLNEDTVCL--SMIPTTEVAI-YGNMVQMDFLVGYDLETKTVSFQRMD 440
Query: 479 CN 480
C+
Sbjct: 441 CS 442
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 115/389 (29%), Positives = 178/389 (45%), Gaps = 56/389 (14%)
Query: 113 KKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCK---PCIHCSQ 168
++ F P +G+ EY+ V +G P ++LDTGS + W + P + +
Sbjct: 102 RRRGGFAAPLLSGLPQGTGEYFAQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVR 161
Query: 169 QRDPFFDPSKSKTFSKIP---CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGET 225
Q S + P C + C+ L G D+ C Y +AY DGS
Sbjct: 162 Q-----GSSTGAAPAPTPRWNCVAPICRRLDS----AGCDR-RRNSCLYQVAYGDGSVTA 211
Query: 226 GFWATDRMTIQEVNGNGYFARYP----FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKT 281
G +A++ +T FAR +GC +N G ASG++GL RG +S S+
Sbjct: 212 GDFASETLT---------FARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQI 262
Query: 282 NISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVG 338
S+ F YCL T +++ TP + FY++ L G SVG
Sbjct: 263 ARSFGRSFSYCLVD-------------RTSSRRARPSRRWGGTPRMATFYYVHLLGFSVG 309
Query: 339 GERLPLKASYFTKLSTE-------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE 391
G R+ + +L+ +DSGT +TR PVY A+R AFR ++ G
Sbjct: 310 GARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGF 369
Query: 392 DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNS 450
LFDTCY+LS + V VP +++H GG + L L+ V++ C FA+ +D
Sbjct: 370 SLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFC--FAMAGTDGGV 427
Query: 451 ILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++GN+QQ+G+ V +D +R+GF P +C
Sbjct: 428 SIIGNIQQQGFRVVFDGDAQRVGFVPKSC 456
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 112/364 (30%), Positives = 166/364 (45%), Gaps = 42/364 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + +++G P + + DTGS + W Q +PC CS FDP +S TF ++ C+S
Sbjct: 55 YVMDISVGTPGKRFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQSSTFREMDCSSQL 112
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYP-F 249
C L P S C Y Y GSGET G +A D +++ + ++P F
Sbjct: 113 CAELPGSCEPG------SSTCSYSYEY--GSGETEGEFARDTISLGTTSDGS--QKFPSF 162
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNI---SYFFYCLH--SPYGSTGYITFGK 304
+GC N+G +G G++GL +GPVS+ S+ + S F YCL + + + FG
Sbjct: 163 AVGCGMVNSG-FDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGP 221
Query: 305 PDTVNKKFVKYTPIVTTPEQS--EFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTII 362
++ ++ T I T P + +Y +T+ GI+V G+ + +T IDSGT +
Sbjct: 222 SAALHGTGIQSTKI-TPPSDTYPTYYLLTVNGIAVAGQTMGSPG------TTIIDSGTTL 274
Query: 363 TRFPAPVYSALRSAFRK-----RMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG 417
T P+ VY + S R+ MG D CYD S+ + P +TI G
Sbjct: 275 TYVPSGVYGRVLSRMESMVTLPRVDGSSMG------LDLCYDRSSNRNYKFPALTIRLAG 328
Query: 418 GVDLELDVRGTLVV-ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
LVV +S VCL P SI +GNV Q+GY + YD L F
Sbjct: 329 ATMTPPSSNYFLVVDDSGDTVCLAMGSASGLPVSI-IGNVMQQGYHILYDRGSSELSFVQ 387
Query: 477 GNCN 480
C
Sbjct: 388 AKCE 391
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 117/379 (30%), Positives = 185/379 (48%), Gaps = 32/379 (8%)
Query: 123 KTGIVA-ADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKT 181
++G+++ EY++ ++IG P + DTGS +TW QCKPC C +Q P FD KS T
Sbjct: 75 QSGLISNGGEYFMSISIGTPPSKFLAIADTGSDLTWVQCKPCQQCYKQNTPLFDKKKSST 134
Query: 182 FSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGN 241
+ C+S TC L E G D+ S C Y +Y D S G AT+ ++I +G+
Sbjct: 135 YKTESCDSITCNALSEH--EEGCDE-SRNACKYRYSYGDESFTKGEVATETISIDSSSGS 191
Query: 242 GYFARYP-FLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISY---FFYCL-HSPYG 295
+P GC NN G + SGI+GL GP+S++S+ S F YCL H+
Sbjct: 192 P--VSFPGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTSAT 249
Query: 296 STG--YITFGKPDTVNKKFVKYTPIVTTP----EQSEFYHITLTGISVGGERLPLKASYF 349
+ G I G +++ K K + I+TTP + +Y +TL I+VG +LP
Sbjct: 250 TNGTSVINLGT-NSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKTKLPYTGGGG 308
Query: 350 TKLSTE--------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS 401
L+ + IDSGT +T + Y + + + K + + C+ S
Sbjct: 309 YSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRVSDPQGILTHCFK-S 367
Query: 402 AYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGY 461
K + +P IT+HF G D++L + V S VCL +++P+ +I GN+ Q +
Sbjct: 368 GDKEIGLPTITMHFT-GADVKLSPINSFVKLSEDIVCL--SMIPTTEVAI-YGNMVQMDF 423
Query: 462 EVHYDVAGRRLGFGPGNCN 480
V YD+ + + F +C+
Sbjct: 424 LVGYDLETKTVSFQRMDCS 442
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 114/427 (26%), Positives = 191/427 (44%), Gaps = 59/427 (13%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKA--IPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPK 142
L ++ RD+ L++ R LQ + + D F F P + G+ YY V +G P
Sbjct: 40 LSQLRARDE----LRHRRMLQSSSGVVD-FSVQGTFD-PFQVGL-----YYTKVQLGTPP 88
Query: 143 QYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKILLE 197
++ +DTGS + W C C C Q FFDP S T S I C+ C
Sbjct: 89 VEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCN---- 144
Query: 198 WFPPNGQDK----CSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF--ARYPF 249
NG+ CSS+ +C Y Y DGSG +G++ +D M + + + P
Sbjct: 145 ----NGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTNSTAPV 200
Query: 250 LLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPYGSTGYI 300
+ GC++ TGD GI G + +S+IS+ + F +CL G +
Sbjct: 201 VFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGDSSGGGIL 260
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS---TEID 357
G+ + + + YT +V P Q Y++ L ISV G+ L + +S F + T +D
Sbjct: 261 VLGE---IVEPNIVYTSLV--PAQPH-YNLNLQSISVNGQTLQIDSSVFATSNSRGTIVD 314
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG 417
SGT + Y SA + + + + + CY +++ T V P+++++F G
Sbjct: 315 SGTTLAYLAEEAYDPFVSAITAAIPQSV--RTVVSRGNQCYLITSSVTDVFPQVSLNFAG 372
Query: 418 GVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
G + L + L+ + C+GF + +I LG++ + V YD+AG+R+G
Sbjct: 373 GASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITI-LGDLVLKDKIVVYDLAGQRIG 431
Query: 474 FGPGNCN 480
+ +C+
Sbjct: 432 WANYDCS 438
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 112/429 (26%), Positives = 192/429 (44%), Gaps = 51/429 (11%)
Query: 79 SRNTPSLEEILRRDQQRLHLKNSRRLQKA--IPDNFKKTKAFTFPAKTGIVAADEYYIVV 136
+ +T L ++ RD L++ R LQ + + D F F P + G+ YY V
Sbjct: 31 TNHTVELSQLRARDA----LRHRRMLQSSNGVVD-FSVQGTFD-PFQVGL-----YYTKV 79
Query: 137 AIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTT 191
+G P ++ +DTGS + W C C C Q FFDP S T S I C+
Sbjct: 80 QLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQR 139
Query: 192 CKILLEWFPPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF--ARY 247
C ++ + CSS+ +C Y Y DGSG +G++ +D M + + +
Sbjct: 140 CNNGIQ----SSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNSTA 195
Query: 248 PFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPYGSTG 298
P + GC++ TGD GI G + +S+IS+ + F +CL G
Sbjct: 196 PVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGGG 255
Query: 299 YITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS---TE 355
+ G+ + + + YT +V P Q Y++ L I+V G+ L + +S F + T
Sbjct: 256 ILVLGE---IVEPNIVYTSLV--PAQPH-YNLNLQSIAVNGQTLQIDSSVFATSNSRGTI 309
Query: 356 IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHF 415
+DSGT + Y SA + + + + CY +++ T V P+++++F
Sbjct: 310 VDSGTTLAYLAEEAYDPFVSAITASIPQSV--HTVVSRGNQCYLITSSVTEVFPQVSLNF 367
Query: 416 LGGVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRR 471
GG + L + L+ + C+GF + +I LG++ + V YD+AG+R
Sbjct: 368 AGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITI-LGDLVLKDKIVVYDLAGQR 426
Query: 472 LGFGPGNCN 480
+G+ +C+
Sbjct: 427 IGWANYDCS 435
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 108/406 (26%), Positives = 183/406 (45%), Gaps = 42/406 (10%)
Query: 95 RLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAA-DEYYIVVAIGKPKQYVSLLLDTGS 153
R SRR+ + + A + P +G + +Y++ + +G P Q +L+ DTGS
Sbjct: 82 RSRQGGSRRVAAEV----ASSSAVSLPMSSGAYSGTGQYFVKLRVGTPVQEFTLVADTGS 137
Query: 154 GITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS--KE 211
+TW +C + F P S++++ IPC+S TCK+ + + N CSS
Sbjct: 138 DLTWVKCA----GASPPGRVFRPKTSRSWAPIPCSSDTCKLDVPFTLAN----CSSPASP 189
Query: 212 CPYDIAYVDGS-GETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ-NGASGIMG 269
C YD Y +GS G G T+ TI + G +LGC+ ++ G A G++
Sbjct: 190 CTYDYRYKEGSAGARGIVGTESATI-ALPGGKVAQLKDVVLGCSSSHDGQSFRSADGVLS 248
Query: 270 LDRGPVSIISKTNISY---FFYCL--H-SPYGSTGYITFGKPDTVNKKFVKYTPIVTTPE 323
L +S ++ + F YCL H +P +TGY+ FG P V + T + PE
Sbjct: 249 LGNAKISFATQAAARFGGSFSYCLVDHLAPRNATGYLAFG-PGQVPRTPATQTKLFLDPE 307
Query: 324 QSEFYHITLTGISVGGERLPLKASYFTKLSTEI--DSGTIITRFPAPVYSALRSAFRKRM 381
FY + + I V G+ L + A + S + DSG +T AP Y A+ +A K +
Sbjct: 308 M-PFYGVKVDAIHVAGKALDIPAEVWDAKSGGVILDSGNTLTVLAAPAYKAVVAALSKHL 366
Query: 382 KKYKMGKGIEDL----FDTCYDLSAYK---TVVVPKITIHFLGGVDLELDVRGTLVVESV 434
G+ + F+ CY+ +A + ++PK+ + F G LE + ++
Sbjct: 367 ------DGVPKVSFPPFEHCYNWTARRPGAPEIIPKLAVQFAGSARLEPPAKSYVIDVKP 420
Query: 435 RQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
C+G P ++GN+ Q+ + +D+ ++ F NC
Sbjct: 421 GVKCIGVQ-EGEWPGLSVIGNIMQQEHLWEFDLKNMQVRFKQSNCT 465
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 114/430 (26%), Positives = 184/430 (42%), Gaps = 40/430 (9%)
Query: 77 GKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIV 135
G S + + +++ R R L +SRR ++A AF P +G +Y++
Sbjct: 48 GASLSDRARDDLHRHAYIRSQLASSRRGRRAAEVG---ASAFAMPLSSGAYTGTGQYFVR 104
Query: 136 VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP----FFDPSKSKTFSKIPCNSTT 191
+G P Q L+ DTGS +TW +C+ + F + SK+++ I C+S T
Sbjct: 105 FRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPARVFRTAASKSWAPIACSSDT 164
Query: 192 CKILLEWFPPNGQDKCSS--KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP- 248
C + P CSS C YD Y DGS G TD TI +G+G
Sbjct: 165 CTSYV----PFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSATIALSSGSGRGGGDSS 220
Query: 249 ---------FLLGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISY---FFYCL--H-S 292
+LGC G + G++ L +S S+ + F YCL H +
Sbjct: 221 GGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLA 280
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-- 350
P +T Y+TFG T TP++ + FY +T+ + V GE L + A +
Sbjct: 281 PRNATSYLTFGPGATAP---AAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDVD 337
Query: 351 -KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVP 409
+DSGT +T P Y A+ +A K + + + D F+ CY+ + + +P
Sbjct: 338 RNGGAILDSGTSLTILATPAYRAVVTALSKHLA--GLPRVTMDPFEYCYNWTDAGALEIP 395
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
K+ +HF G LE + ++ + C+G S P ++GN+ Q+ + +D+
Sbjct: 396 KMEVHFAGSARLEPPAKSYVIDAAPGVKCIGVQ-EGSWPGVSVIGNILQQEHLWEFDLRD 454
Query: 470 RRLGFGPGNC 479
R L F C
Sbjct: 455 RWLRFKHTRC 464
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 98/380 (25%), Positives = 168/380 (44%), Gaps = 33/380 (8%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQ-RDPFFDPSKSKTFSKI 185
+ +Y++ + +G P Q + L+ DTGS + W +C C +CS F P S +FS
Sbjct: 83 TGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPF 142
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKE----CPYDIAYVDGSGETGFWATDRMTIQEVNGN 241
C C++L P C+ C + +Y DGS +GF++ + T++ ++G+
Sbjct: 143 HCFDPHCRLL----PHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGS 198
Query: 242 GYFARYPFLLGCTDNNTGDQ------NGASGIMGLDRGPVSIISKTNISY---FFYCLH- 291
+ GC +G NGA G+MGL RG +S S+ + F YCL
Sbjct: 199 EIHLKG-LSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLMD 257
Query: 292 ---SPYGSTGYITFGKPDTV---NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLK 345
SP ++ + G ++ N + YTP+ P FY+IT+ I++ G +LP+
Sbjct: 258 YTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPIN 317
Query: 346 ASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL 400
+ + T +DSGT +T Y + + R+R+K + + FD C +
Sbjct: 318 PAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAE-LTPGFDLCVNA 376
Query: 401 SA-YKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQR 459
S + +P++ GG R + +CL + S ++GN+ Q+
Sbjct: 377 SGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQ 436
Query: 460 GYEVHYDVAGRRLGFGPGNC 479
G+ + +D RLGF C
Sbjct: 437 GFLLEFDKEESRLGFTRRGC 456
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 109/354 (30%), Positives = 163/354 (46%), Gaps = 68/354 (19%)
Query: 165 HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGE 224
C+ + P F P+ S TFSK+PC S+ C+ L + C++ C Y Y G G
Sbjct: 87 ECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYL-----TCNATGCVYYYPY--GMGF 139
Query: 225 T-GFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNI 283
T G+ AT+ + + G F F GC+ N G N +SGI+GL R P+S++S+ +
Sbjct: 140 TAGYLATETLHV----GGASFPGVAF--GCSTEN-GVGNSSSGIVGLGRSPLSLVSQVGV 192
Query: 284 SYFFYCLHSP---------YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQ--SEFYHITL 332
F YCL S +GS +T GK I+ PE S +Y++ L
Sbjct: 193 GRFSYCLRSDADAGDSPILFGSLAKVTGGKSSPA---------ILENPEMPSSSYYYVNL 243
Query: 333 TGISVGGERLPLKASY--FTKLS-------TEIDSGTIITRFPAPVYSALRSAFRKRMKK 383
TGI+VG LP+ ++ FT+ + T +DSGT +T Y+ ++ AF +M
Sbjct: 244 TGITVGATDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMAT 303
Query: 384 YKMGKGIEDL---FDTCYDLSAY---KTVVVPKITIHFLGGVD-----------LELDVR 426
+ + FD C+D +A V VP + + F GG + +E+D +
Sbjct: 304 ANLTTTVNGTRFGFDLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQ 363
Query: 427 GTLVVESVRQVCLGFALLPSDPNSI-LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
G VE CL L S+ SI ++GNV Q V YD+ G F P +C
Sbjct: 364 GRAAVE-----CL-LVLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADC 411
>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 128/440 (29%), Positives = 204/440 (46%), Gaps = 47/440 (10%)
Query: 57 PGKVSLEVLGRYGPCSKLNQGKSRNTPS-LEEILRRDQQRLHLKNSRRLQKAIPDNFKKT 115
P L V+ Y CS K+ + + + +D R+ ++ QK +
Sbjct: 31 PDNSDLNVIPIYSKCSPFKPPKADTWDNRIINMASKDPVRVKYLSTLVSQKTVS------ 84
Query: 116 KAFTFPAKTGIVAADEYYIV-VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFF 174
T P +G Y+V V +G P Q + ++LDT + + C C CS D F
Sbjct: 85 ---TAPIASGQAFNIGNYVVRVKLGTPGQLLFMVLDTSTDEAFVPCSGCTGCS---DTTF 138
Query: 175 DPSKSKTFSKIPCNSTTC-KILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRM 233
P S ++ + C+ C ++ P G CS ++ +Y GS + D +
Sbjct: 139 SPKASTSYGPLDCSVPQCGQVRGLSCPATGTGACS-----FNQSYA-GSSFSATLVQDAL 192
Query: 234 TIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL 290
+ Y F GC + TG A G++GL RGP+S++S++ +Y F YCL
Sbjct: 193 RL----ATDVIPYYSF--GCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCL 246
Query: 291 HS--PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASY 348
S Y +G + G K ++ TP++ +P + Y++ TGISVG +P + Y
Sbjct: 247 PSFKSYYFSGSLKLGP--VGQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEY 304
Query: 349 F-----TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAY 403
T T IDSGT+ITRF PVY+A+R FRK++ FDTC+ + Y
Sbjct: 305 LGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFRKQVGGTTFTS--IGAFDTCF-VKTY 361
Query: 404 KTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRG 460
+T + P IT+HF G+DL+L + +L+ S + CL A P + NS+L + N QQ+
Sbjct: 362 ET-LAPPITLHF-EGLDLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQN 419
Query: 461 YEVHYDVAGRRLGFGPGNCN 480
+ +D+ ++G CN
Sbjct: 420 LRILFDIVNNKVGIAREVCN 439
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 105/387 (27%), Positives = 170/387 (43%), Gaps = 45/387 (11%)
Query: 119 TFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC-IHCS-QQRDPFFDP 176
T P + +Y + +G P + ++++DTGS +T+ C C C +D FDP
Sbjct: 65 TMPLHGAVKDYGYFYATLYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGPNHQDAAFDP 124
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQ 236
S T S+I C S C P G CS+++C Y +Y + S +G D + +
Sbjct: 125 EASSTASRISCTSPKCSC---GSPRCG---CSTQQCTYTRSYAEQSSSSGILLEDVLALH 178
Query: 237 EVNGNGYFARYPFLLGCTDNNTGD--QNGASGIMGLDRGPVSIIS---KTNI--SYFFYC 289
+ P + GC TG+ + A G+ GL S+++ K + F C
Sbjct: 179 D-----GLPGAPIIFGCETRETGEIFRQRADGLFGLGNSDASVVNQLVKAGVIDDVFSLC 233
Query: 290 LHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF 349
G G + G + ++YTP++T+ +Y++ + ++V G+ LP+ S F
Sbjct: 234 FGMVEGD-GALLLGDAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSLF 292
Query: 350 TK-LSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE-------DLFDTCY--- 398
+ T +DSGT T P+PV+ AF ++KY + G++ D C+
Sbjct: 293 DQGYGTVLDSGTTFTYMPSPVF----KAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQA 348
Query: 399 ----DLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVR--QVCLGFALLPSDPNSIL 452
DL A + V P + + F G L L L V + + CLG + + L
Sbjct: 349 PSHDDLEALSS-VFPSMEVQFDQGTSLVLGPLNYLFVHTFNSGKYCLG--VFDNGRAGTL 405
Query: 453 LGNVQQRGYEVHYDVAGRRLGFGPGNC 479
LG + R V YD A +R+GFGP C
Sbjct: 406 LGGITFRNVLVRYDRANQRVGFGPALC 432
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 115/404 (28%), Positives = 174/404 (43%), Gaps = 51/404 (12%)
Query: 118 FTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPF---- 173
T A TGI +Y++ +G P Q L+ DTGS +TW +C+P + +
Sbjct: 84 LTSAAYTGI---GQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSAS 140
Query: 174 -------FDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS--KECPYDIAYVDGSGE 224
F P KSKT++ IPC S TC L P C + C YD Y DGS
Sbjct: 141 ASSPRRAFRPEKSKTWAPIPCASDTCSKSL----PFSLSTCPTPGSPCAYDYRYKDGSAA 196
Query: 225 TGFWATDRMTIQ-------EVNGNGYFARYPFLLGCTDNNTGDQNGAS-GIMGLDRGPVS 276
G T+ TI N +LGCT + TG AS G++ L VS
Sbjct: 197 RGTVGTESATIALSSSSSSSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVS 256
Query: 277 IISKTNISY---FFYCL--H-SPYGSTGYITFGKPDTVNKKF-------VKYTPIVTTPE 323
S + F YCL H SP +T Y+TFG ++ + TP+V
Sbjct: 257 FASHAASRFGGRFSYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSR 316
Query: 324 QSEFYHITLTGISVGGERLPLKASYFT---KLSTEIDSGTIITRFPAPVYSALRSAFRKR 380
FY +++ ISV GE L + + +DSGT +T P Y A+ +A K+
Sbjct: 317 MRPFYDVSIKAISVDGELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKK 376
Query: 381 MKKYKMGKGIEDLFDTCYDLSAY----KTVVVPKITIHFLGGVDLELDVRGTLVVESVRQ 436
+ ++ + D F+ CY+ ++ + +PK+ +HF G LE + ++ +
Sbjct: 377 LARFP--RVAMDPFEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGV 434
Query: 437 VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
C+G P P ++GN+ Q+ + +D+ RRL F C
Sbjct: 435 KCIGVQEGPW-PGISVIGNILQQEHLWEFDLKNRRLRFKRSRCT 477
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 114/395 (28%), Positives = 164/395 (41%), Gaps = 63/395 (15%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I + +G P Q +LDTGS + W C CS P DP+K TF IP NS+T
Sbjct: 88 YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTF--IPKNSST 145
Query: 192 CKILL-------EWFPPNGQDKCS----------SKECPYDIAYVDGSGETGFWATDRM- 233
K+L F P+ + +C S CP I GF D +
Sbjct: 146 AKLLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGATAGFLLLDNLN 205
Query: 234 ----TIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYC 289
T+ + FL+GC+ + SGI G RG S+ S+ N+ F YC
Sbjct: 206 FPGKTVPQ-----------FLVGCSILSIRQ---PSGIAGFGRGQESLPSQMNLKRFSYC 251
Query: 290 LHS------PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQS----EFYHITLTGISVGG 339
L S P S + + YTP + P + E+Y++TL + VGG
Sbjct: 252 LVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGG 311
Query: 340 ERLPLKASYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRM-KKYKMGKGIEDL 393
+ + + S T +DSG+ T PVY+ + F +++ KKY + +E
Sbjct: 312 VDVKIPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQ 371
Query: 394 --FDTCYDLSAYKTVVVPKITIHFLGGVDLE------LDVRGTLVVESVRQVCLGFALLP 445
C+++S KT+ P+ T F GG + G V V G A P
Sbjct: 372 SGLSPCFNISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQP 431
Query: 446 SDPN-SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+I+LGN QQ+ + V YD+ R GFGP NC
Sbjct: 432 KTAGPAIILGNYQQQNFYVEYDLENERFGFGPRNC 466
>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 440
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 113/364 (31%), Positives = 177/364 (48%), Gaps = 36/364 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
Y + V +G P Q + ++LDT + + C C CS D F P S ++ + C+
Sbjct: 99 NYVVRVKLGTPGQLLFMVLDTSTDEAFVPCSGCTGCS---DTTFSPKASTSYGPLDCSVP 155
Query: 191 TC-KILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C ++ P G CS ++ +Y GS + D + + Y F
Sbjct: 156 QCGQVRGLSCPATGTGACS-----FNQSYA-GSSFSATLVQDSLRL----ATDVIPNYSF 205
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS--PYGSTGYITFGK 304
GC + TG A G++GL RGP+S++S++ +Y F YCL S Y +G + G
Sbjct: 206 --GCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCLPSFKSYYFSGSLKLGP 263
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-----TKLSTEIDSG 359
K ++ TP++ +P + Y++ TGISVG +P + Y T T IDSG
Sbjct: 264 --VGQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDSG 321
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
T+ITRF PVY+A+R FRK++ FDTC+ + Y+T + P IT+HF G+
Sbjct: 322 TVITRFVEPVYNAVREEFRKQVGGTTFTS--IGAFDTCF-VKTYET-LAPPITLHF-EGL 376
Query: 420 DLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRGYEVHYDVAGRRLGFGP 476
DL+L + +L+ S + CL A P + NS+L + N QQ+ + +D ++G
Sbjct: 377 DLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILFDTVNNKVGIAR 436
Query: 477 GNCN 480
CN
Sbjct: 437 EVCN 440
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 106/380 (27%), Positives = 177/380 (46%), Gaps = 51/380 (13%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+ + +G P Q VS++LDTGS ++W +C +Q FDP++S ++S +PC+S TC
Sbjct: 87 VSLTVGTPPQNVSMVLDTGSELSWLRCNK----TQTFQTTFDPNRSSSYSPVPCSSLTCT 142
Query: 194 ILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLG 252
FP P D S++ C ++Y D S G A+D I + G + G
Sbjct: 143 DRTRDFPIPASCD--SNQLCHAILSYADASSSEGNLASDTFYIGNSDMPGT------IFG 194
Query: 253 CTDN----NTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTV 308
C D+ NT + + +G+MG++RG +S +S+ + F YC+ S +G + G +
Sbjct: 195 CMDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMDFPKFSYCI-SDSDFSGVLLLGDANFS 253
Query: 309 NKKFVKYTPIVTTPEQSEF-----YHITLTGISVGGERLPLKASYFT-----KLSTEIDS 358
+ YTP++ + Y + L GI V + LPL S F T +DS
Sbjct: 254 WLMPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDS 313
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL-------FDTCYDLSAYKTVV--VP 409
GT T PVYSALR+ F + ++ + +ED D CY + +T + +P
Sbjct: 314 GTQFTFLLGPVYSALRNEFLNQTS--QILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLP 371
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQVCLG------FALLPSDPNSI---LLGNVQQRG 460
+++ F G E+ V G ++ V G F SD ++ ++G+ Q+
Sbjct: 372 TVSLMFRGA---EMKVSGDRLLYRVPGEVRGSDSVYCFTFGNSDLLAVEAYVIGHHHQQN 428
Query: 461 YEVHYDVAGRRLGFGPGNCN 480
+ +D+ R+GF C+
Sbjct: 429 VWMEFDLEKSRIGFAQVQCD 448
>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
vinifera]
Length = 451
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 127/440 (28%), Positives = 191/440 (43%), Gaps = 56/440 (12%)
Query: 61 SLEVLGRYGPCSKLNQGKSRN-TPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFT 119
+L+VL Y PCS + + S+ ++ +D+ RL +S +K++
Sbjct: 38 TLQVLHVYSPCSPFRPKEPLSWEESVLQMQAKDKARLQFLSSLVARKSV----------- 86
Query: 120 FPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPS 177
P +G IV Y + IG P Q + + +DT S + W C C+ CS F+
Sbjct: 87 VPIASGRQIVQNPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGCSST---LFNSP 143
Query: 178 KSKTFSKIPCNSTTCKILLEWFPP-------NGQDKCSSKECPYDIAYVDGSGETGFWAT 230
S T+ + C + CK +L P + C C +++ Y GS +
Sbjct: 144 ASTTYKSLGCQAAQCKQVLHLLSPLLTSPSVVPKPTCGGGVCSFNLTY-GGSSLAANLSQ 202
Query: 231 DRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FF 287
D +T+ GY GC TG A G++GL RGP+S++S+T Y F
Sbjct: 203 DTITLATDAVPGYS------FGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFS 256
Query: 288 YCLHS--PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLK 345
YCL S +G + G K +KYTP++ P + Y + L + VG + +
Sbjct: 257 YCLPSFKSLNFSGSLRLGP--VGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVP 314
Query: 346 ASYF-----TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL 400
F T T DSGT+ TR P Y A+R AFR R+ + + FDTCY +
Sbjct: 315 PGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGG-FDTCYTV 373
Query: 401 SAYKTVVVPKITIHFLG-GVDLELDVRGTLVVESV--RQVCLGFALLPSDPNSIL--LGN 455
+ P IT F G V L D L++ S CL A P + NS+L + N
Sbjct: 374 P----IAAPTITFMFTGMNVTLPPD---NLLIHSTAGSTTCLAMAAAPDNVNSVLNVIAN 426
Query: 456 VQQRGYEVHYDVAGRRLGFG 475
+QQ+ + + YDV RLG
Sbjct: 427 LQQQNHRLLYDVPNSRLGVA 446
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 100/378 (26%), Positives = 171/378 (45%), Gaps = 45/378 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ V +G P + + +DTGS I W C PC C FF+P S T SKIP
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE---CPYDIAYVDGSGETGFWATDRMTIQEVNGNGY 243
C+ C L+ + C + + C Y Y DGSG +G++ +D M V GN
Sbjct: 151 CSDDRCTAALQ----TSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQ 206
Query: 244 FAR--YPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNI-----SYFFYCLHS 292
A + GC+++ +GD GI G + +S++S+ N F +CL
Sbjct: 207 TANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKG 266
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL 352
G + G+ + + + YTP+V P Q Y++ L I V G++LP+ +S FT
Sbjct: 267 SDNGGGILVLGE---IVEPGLVYTPLV--PSQPH-YNLNLESIVVNGQKLPIDSSLFTTS 320
Query: 353 STE---IDSGTIITRFPAPVYSALRSAFRKRMK---KYKMGKGIEDLFDTCYDLSAYKTV 406
+T+ +DSGT + Y +A + + + KG + C+ S+
Sbjct: 321 NTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKG-----NQCFVTSSSVDS 375
Query: 407 VVPKITIHFLGGVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYE 462
P ++++F+GGV + + L+ +++ C+G+ +I LG++ +
Sbjct: 376 SFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITI-LGDLVLKDKI 434
Query: 463 VHYDVAGRRLGFGPGNCN 480
YD+A R+G+ +C+
Sbjct: 435 FVYDLANMRMGWTDYDCS 452
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 132/274 (48%), Gaps = 28/274 (10%)
Query: 55 QGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQK--AIPDNF 112
Q G V + + +GP S L + S ++L D R+ NSR +K P +
Sbjct: 35 QSGGVVQMTIHHVHGPGSSL---APQPPVSFSDVLAWDDARVKTLNSRLTRKDTRFPKSV 91
Query: 113 KKTKAFTFPAKTGI-------VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI- 164
K FP + + + YY+ V G P +Y S+++DTGS ++W QCKPC+
Sbjct: 92 LTKKDIRFPKSVSVPLNPGASIGSGNYYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVV 151
Query: 165 HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGE 224
+C Q DP FDPS SKT+ + C S+ C L++ N + SS C Y +Y D S
Sbjct: 152 YCHVQADPLFDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYS 211
Query: 225 TGFWATDRMTI---QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKT 281
G+ + D +T+ Q + G F+ GC ++ G A+GI+GL R +S++ +
Sbjct: 212 MGYLSQDLLTLAPSQTLPG--------FVYGCGQDSDGLFGRAAGILGLGRNKLSMLGQV 263
Query: 282 NISY---FFYCLHSPYGSTGYITFGKPDTVNKKF 312
+ + F YCL + G G+++ GK +
Sbjct: 264 SSKFGYAFSYCLPT-RGGGGFLSIGKASLAGSAY 296
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 128/457 (28%), Positives = 192/457 (42%), Gaps = 64/457 (14%)
Query: 46 CNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLE----EILRRDQQRLHLKNS 101
C+ T+T Q G +L + PCS KS + S E + L +DQ RL +S
Sbjct: 41 CDLTKT---QDQGS-TLRIFHIDSPCSPF---KSSSPLSWEARVLQTLAQDQARLQYLSS 93
Query: 102 RRLQKAIPDNFKKTKAFTFPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQ 159
+++ P +G ++ + Y + IG P Q + L +DT S + W
Sbjct: 94 LVAGRSV-----------VPIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIP 142
Query: 160 CKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYV 219
C C+ C + F P+KS +F + C++ CK + PN C ++ C +++ Y
Sbjct: 143 CSGCVGCPS--NTAFSPAKSTSFKNVSCSAPQCKQV-----PN--PTCGARACSFNLTYG 193
Query: 220 DGSGETGFWA-TDRMTIQEVNGNGYFARYPFLLGCTDNNTG-----DQNGASGIMGLDRG 273
S T R+ + F GC + G G G+
Sbjct: 194 SSSIAANLSQDTIRLAADPIKA--------FTFGCVNKVAGGGTIPPPQGLLGLGRGPLS 245
Query: 274 PVSIISKTNISYFFYCLHSPYGST--GYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHIT 331
+S S F YCL S T G + G T + VKYT ++ P +S Y++
Sbjct: 246 LMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGP--TSQPQRVKYTQLLRNPRRSSLYYVN 303
Query: 332 LTGISVGGE--RLPLKASYF---TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKM 386
L I VG + LP A F T T DSGT+ TR PVY A+R+ FRKR+K
Sbjct: 304 LVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTA 363
Query: 387 GKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLP 445
FDTCY V VP IT F GV++ + ++ + CL A P
Sbjct: 364 VVTSLGGFDTCYS----GQVKVPTITFMF-KGVNMTMPADNLMLHSTAGSTSCLAMAAAP 418
Query: 446 SDPNSI--LLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ NS+ ++ ++QQ+ + V DV RLG C+
Sbjct: 419 ENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 455
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 127/439 (28%), Positives = 191/439 (43%), Gaps = 59/439 (13%)
Query: 61 SLEVLGRYGPCSKLNQGKSRN-TPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFT 119
+++V Y P S K + S+ ++L DQ RL +S +K+
Sbjct: 27 TVKVFHVYSPQSPFRPSKPVSWEDSVLQMLAEDQARLQFLSSLVGRKSW----------- 75
Query: 120 FPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPS 177
P +G IV + Y + +G P Q + LDT + W C C+ CS F+
Sbjct: 76 VPIASGRQIVQSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSST---VFNSV 132
Query: 178 KSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE 237
S TF + C++ CK + PN C C ++ Y GS D + +
Sbjct: 133 TSTTFKTLGCDAPQCKQV-----PN--PTCGGSTCTWNTTY-GGSTILSNLTRDTIALST 184
Query: 238 VNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS-- 292
GY GC TG G++GL RGP+S +S+T Y F YCL S
Sbjct: 185 DIVPGY------TFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFR 238
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF--- 349
+G + G + +K TP++ P +S Y++ L GI VG + + + AS
Sbjct: 239 TLNFSGTLRLGPAGQPLR--IKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFN 296
Query: 350 --TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL---FDTCYDLSAYK 404
T T DSGT+ TR APVY+A+R FRKR +G I FDTCY
Sbjct: 297 PTTGAGTIFDSGTVFTRLVAPVYTAVRDEFRKR-----VGNAIVSSLGGFDTCYT----G 347
Query: 405 TVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRGY 461
+V P +T F G+++ L L+ + CL A P + NS+L + N+QQ+ +
Sbjct: 348 PIVAPTMTFMF-SGMNVTLPTDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNH 406
Query: 462 EVHYDVAGRRLGFGPGNCN 480
+ +DV R+G C+
Sbjct: 407 RILFDVPNSRIGVAREPCS 425
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 100/378 (26%), Positives = 171/378 (45%), Gaps = 45/378 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ V +G P + + +DTGS I W C PC C FF+P S T SKIP
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE---CPYDIAYVDGSGETGFWATDRMTIQEVNGNGY 243
C+ C L+ + C + + C Y Y DGSG +G++ +D M V GN
Sbjct: 151 CSDDRCTAALQ----TSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQ 206
Query: 244 FAR--YPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNI-----SYFFYCLHS 292
A + GC+++ +GD GI G + +S++S+ N F +CL
Sbjct: 207 TANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKG 266
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL 352
G + G+ + + + YTP+V P Q Y++ L I V G++LP+ +S FT
Sbjct: 267 SDNGGGILVLGE---IVEPGLVYTPLV--PSQPH-YNLNLESIVVNGQKLPIDSSLFTTS 320
Query: 353 STE---IDSGTIITRFPAPVYSALRSAFRKRMK---KYKMGKGIEDLFDTCYDLSAYKTV 406
+T+ +DSGT + Y +A + + + KG + C+ S+
Sbjct: 321 NTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKG-----NQCFVTSSSVDS 375
Query: 407 VVPKITIHFLGGVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYE 462
P ++++F+GGV + + L+ +++ C+G+ +I LG++ +
Sbjct: 376 SFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITI-LGDLVLKDKI 434
Query: 463 VHYDVAGRRLGFGPGNCN 480
YD+A R+G+ +C+
Sbjct: 435 FVYDLANMRMGWTDYDCS 452
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 116/412 (28%), Positives = 175/412 (42%), Gaps = 53/412 (12%)
Query: 87 EILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTG--IVAADEYYIVVAIGKPKQY 144
+ L +DQ RL +S +++ P +G ++ + Y + V IG P Q
Sbjct: 63 QTLAQDQARLQYLSSLVAGRSV-----------VPIASGRQMLQSTTYIVKVLIGTPAQP 111
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQ 204
+ L +DT S + W C C+ C + F P+KS +F + C++ CK + PN
Sbjct: 112 LLLAMDTSSDVAWIPCSGCVGCPS--NTAFSPAKSTSFKNVSCSAPQCKQV-----PN-- 162
Query: 205 DKCSSKECPYDIAYVDGSGETGFWA-TDRMTIQEVNGNGYFARYPFLLGCTDNNTG---- 259
C ++ C +++ Y S T R+ + F GC + G
Sbjct: 163 PACGARACSFNLTYGSSSIAANLSQDTIRLAADPIKA--------FTFGCVNKVAGGGTI 214
Query: 260 -DQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST--GYITFGKPDTVNKKFVKYT 316
G G+ +S S F YCL S T G + G T + VKYT
Sbjct: 215 PPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLTFSGSLRLGP--TSQPQRVKYT 272
Query: 317 PIVTTPEQSEFYHITLTGISVGGE--RLPLKASYF---TKLSTEIDSGTIITRFPAPVYS 371
++ P +S Y++ L I VG + LP A F T T DSGT+ TR PVY
Sbjct: 273 QLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYE 332
Query: 372 ALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVV 431
A+R+ FRKR+K FDTCY V VP IT F GV++ + ++
Sbjct: 333 AVRNEFRKRVKPPTAVVTSLGGFDTCYS----GQVKVPTITFMF-KGVNMTMPADNLMLH 387
Query: 432 ESVRQV-CLGFALLPSDPNSI--LLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ CL A P + NS+ ++ ++QQ+ + V DV RLG C+
Sbjct: 388 STAGSTSCLAMASAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 439
>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
Length = 423
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 104/362 (28%), Positives = 163/362 (45%), Gaps = 54/362 (14%)
Query: 126 IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKI 185
I++ Y +G P Q + + +D + W C C C+ P F P++S T+ +
Sbjct: 96 ILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAAS-SPSFSPTQSSTYRTV 154
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
PC S C + P G C +++ Y + + D + ++ N
Sbjct: 155 PCGSPQCAQVPSPSCPAGVGS----SCGFNLTYAASTFQ-AVLGQDSLALE----NNVVV 205
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGL-DRGPVSIISKTNISYFFYCLHSPYGSTGYITFGK 304
Y F GC G+ A+G L R + +++ G G I G+
Sbjct: 206 SYTF--GCLRVVNGNSRAAAGAHRLRPRAALLLVAD-------------QGHLGPI--GQ 248
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYFTKLS---TEIDSG 359
P K +K TP++ P + Y++ + GI VG + ++P A F ++ T ID+G
Sbjct: 249 P-----KRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAG 303
Query: 360 TIITRFPAPVYSALRSAFRKRMK---KYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFL 416
T+ TR APVY+A+R AFR R++ +G FDTCY++ TV VP +T F
Sbjct: 304 TMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGG-----FDTCYNV----TVSVPTVTFMFA 354
Query: 417 GGVDLELDVRGTLVVESVRQV-CLGFALLPSD-PNSIL--LGNVQQRGYEVHYDVAGRRL 472
G V + L ++ S V CL A PSD N+ L L ++QQ+ V +DVA R+
Sbjct: 355 GAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRV 414
Query: 473 GF 474
GF
Sbjct: 415 GF 416
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 109/432 (25%), Positives = 184/432 (42%), Gaps = 59/432 (13%)
Query: 86 EEILRRDQQRLHLKNS--RRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQ 143
+E++RR QR + R D K A P G EY + + G P+
Sbjct: 47 QELIRRAVQRSLDRPGIVARSGGGAADEAGKAVASEAPLVPG---GGEYLVKLGTGTPQH 103
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
+ S +DT S + W QC+PC+ C +Q DP F+P S +++ +PC S TC L +G
Sbjct: 104 FFSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQL------DG 157
Query: 204 QDKCSSKE---CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD 260
+C + C Y Y G A D++ I G F + + GC+D++ G
Sbjct: 158 H-RCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAI----GGDVF--HAVVFGCSDSSVGG 210
Query: 261 QNG-ASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST-GYITFGK-PDTVNKKFVKYTP 317
ASG++GL RGP+S++S+ ++ F YCL P T G + G D V + T
Sbjct: 211 PAAQASGLVGLGRGPLSLVSQLSVHRFMYCLPPPMSRTSGKLVLGAGADAVRNMSDRVTV 270
Query: 318 IVTTPEQ-SEFYHITLTGISVGGERLPLKASYFTKLSTE--------------------- 355
+++ + +Y++ L G++V G++ P T +
Sbjct: 271 TMSSSTRYPSYYYLNLDGLAV-GDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANA 329
Query: 356 ----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS---AYKTVVV 408
+D + I+ +Y L + ++ + + D C+ L V V
Sbjct: 330 YGMIVDVASTISFLETSLYDELADDLEEEIRLPRATPSLRLGLDLCFILPEGVGMDRVYV 389
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVA 468
P +++ F G LELD R L V R +CL ++ +LGN Q + V +++
Sbjct: 390 PTVSLSF-DGRWLELD-RDRLFVTDGRMMCL---MIGRTSGVSILGNFQLQNMRVLFNLR 444
Query: 469 GRRLGFGPGNCN 480
++ F +C+
Sbjct: 445 RGKITFAKASCD 456
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 100/378 (26%), Positives = 171/378 (45%), Gaps = 45/378 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ V +G P + + +DTGS I W C PC C FF+P S T SKIP
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE---CPYDIAYVDGSGETGFWATDRMTIQEVNGNGY 243
C+ C L+ + C + + C Y Y DGSG +G++ +D M V GN
Sbjct: 177 CSDDRCTAALQ----TSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQ 232
Query: 244 FAR--YPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNI-----SYFFYCLHS 292
A + GC+++ +GD GI G + +S++S+ N F +CL
Sbjct: 233 TANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKG 292
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL 352
G + G+ + + + YTP+V P Q Y++ L I V G++LP+ +S FT
Sbjct: 293 SDNGGGILVLGE---IVEPGLVYTPLV--PSQPH-YNLNLESIVVNGQKLPIDSSLFTTS 346
Query: 353 STE---IDSGTIITRFPAPVYSALRSAFRKRMK---KYKMGKGIEDLFDTCYDLSAYKTV 406
+T+ +DSGT + Y +A + + + KG + C+ S+
Sbjct: 347 NTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKG-----NQCFVTSSSVDS 401
Query: 407 VVPKITIHFLGGVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYE 462
P ++++F+GGV + + L+ +++ C+G+ +I LG++ +
Sbjct: 402 SFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITI-LGDLVLKDKI 460
Query: 463 VHYDVAGRRLGFGPGNCN 480
YD+A R+G+ +C+
Sbjct: 461 FVYDLANMRMGWTDYDCS 478
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 104/373 (27%), Positives = 168/373 (45%), Gaps = 45/373 (12%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP-----FFDPSKSKTFSKIPCN 188
I + IG P Q ++LDTGS ++W QC +++ P FDPS S +FS +PC+
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCH------RKKLPPKPKTSFDPSLSSSFSTLPCS 127
Query: 189 STTCKILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY 247
CK + F P D S++ C Y Y DG+ G +++T
Sbjct: 128 HPLCKPRIPDFTLPTSCD--SNRLCHYSYFYADGTFAEGNLVKEKITFSNTE-----ITP 180
Query: 248 PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGK--- 304
P +LGC ++ D+ GI+G++RG +S +S+ IS F YC+ G+ G
Sbjct: 181 PLILGCATESSDDR----GILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFYL 236
Query: 305 PDTVNKKFVKYTPIVTTPEQSEF-------YHITLTGISVGGERLPLKASYFTKLS---- 353
D N KY ++T PE Y + + GI G ++L + S F +
Sbjct: 237 GDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSG 296
Query: 354 -TEIDSGTIITRFPAPVYSALRSAFRKRM-KKYKMGKGIEDLFDTCYDLS-AYKTVVVPK 410
T +DSG+ T Y +R+ R+ ++ K G D C+D + A ++
Sbjct: 297 QTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVAMIPRLIGD 356
Query: 411 ITIHFLGGVDLELDVRGTLVVESVRQVCLGF---ALLPSDPNSILLGNVQQRGYEVHYDV 467
+ F GV++ + LV C+G ++L + N ++GNV Q+ V +DV
Sbjct: 357 LVFVFTRGVEILVPKERVLVNVGGGIHCVGIGRSSMLGAASN--IIGNVHQQNLWVEFDV 414
Query: 468 AGRRLGFGPGNCN 480
RR+GF +C+
Sbjct: 415 TNRRVGFAKADCS 427
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 105/358 (29%), Positives = 159/358 (44%), Gaps = 29/358 (8%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCK-PCI-HCSQQRDPFFDPSKSKTFSKIPCNS 189
Y + ++G P Q ++ L DTGS + W +C C C Q P + P+ S TF+K+PC+
Sbjct: 91 YDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPCSD 150
Query: 190 TTCKIL----LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C +L + W G EC Y +Y G G+ T +E G A
Sbjct: 151 RLCSLLRSDSVAWCAAAG------AECDYRYSY--GLGDDDHHYTQGFLARETFTLGADA 202
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKP 305
GCT + G SG++GL RGP+S++S+ N S F YCL S + FG
Sbjct: 203 VPSVRFGCTTASEGGYGSGSGLVGLGRGPLSLVSQLNASTFMYCLTSDASKASPLLFGSL 262
Query: 306 DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRF 365
++ V+ T ++ + + FY + L IS+G P DSGT +T
Sbjct: 263 ASLTGAQVQSTGLLAS---TTFYAVNLRSISIGSATTPGVGE---PEGVVFDSGTTLTYL 316
Query: 366 PAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSA---YKTVVVPKITIHFLGGVDLE 422
P YS ++AF + ++ D F+ C+ A VP + +HF G D+
Sbjct: 317 AEPAYSEAKAAFLSQTSLDQVED--TDGFEACFQKPANGRLSNAAVPTMVLHF-DGADMA 373
Query: 423 LDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L V +V VC ++ P+ ++GN+ Q Y V +DV L F P NC+
Sbjct: 374 LPVANYVVEVEDGVVCW---IVQRSPSLSIIGNIMQVNYLVLHDVHRSVLSFQPANCD 428
>gi|295830689|gb|ADG39013.1| AT5G10770-like protein [Neslia paniculata]
Length = 159
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 67/158 (42%), Positives = 97/158 (61%), Gaps = 6/158 (3%)
Query: 275 VSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHIT 331
+S S+T +Y F YCL S TG++TFG + VK+TPI T + + FY ++
Sbjct: 1 LSFPSQTATAYNKIFSYCLPSSASYTGHLTFGSAGI--SRSVKFTPISTITDGTSFYGLS 58
Query: 332 LTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE 391
+ I+VGG++LP+ ++ F+ IDSGT+ITR P Y+ALRS F+ +M KY G+
Sbjct: 59 IVAITVGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSEFKAKMSKYPTTSGVS 118
Query: 392 DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTL 429
L DTC+DLS +KTV +PK+ F GG +EL +G L
Sbjct: 119 IL-DTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIL 155
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 127/452 (28%), Positives = 192/452 (42%), Gaps = 84/452 (18%)
Query: 73 KLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEY 132
+L ++ S EE +RR +R H + + + + P ++ ++ +Y
Sbjct: 27 ELTHVDAKQNCSTEERMRRATERTHRRLASMGEASAPVHWAES---------------QY 71
Query: 133 YIVVAIGKPKQYVSLLLDTGSGITWTQCKPC--IHCSQQRDPFFDPSKSKTFSKIPCNST 190
IG P Q ++DTGS + WTQC C C Q F+DPS+S+T + CN T
Sbjct: 72 IAEYLIGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTARPVACNDT 131
Query: 191 TCKILLEWFPPNGQDKCS--SKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARY 247
C + + +C+ +K C AY G+G G T+ T Q + N A
Sbjct: 132 ACAL-------GSETRCARDNKACAVLTAY--GAGVIGGVLGTEAFTFQPQSENVSLA-- 180
Query: 248 PFLLGC---TDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY---------- 294
GC T G +GASGI+GL RG +S++S+ + F YCL +PY
Sbjct: 181 ---FGCIAATRLTPGSLDGASGIIGLGRGNLSLVSQLGDNKFSYCL-TPYFSQSTNTSRL 236
Query: 295 ---GSTGYITFGKPDTVNKKFVKYTPIVTTPEQ---SEFYHITLTGISVGGERLPLKASY 348
S G + G P T P + P+ S FY++ LTGI+VG +L + +
Sbjct: 237 FVGASAGLSSGGAPAT-------SVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAA 289
Query: 349 F------TKL--STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKM--GKGIEDLFDTCY 398
F T L T IDSG+ T Y ALR +++ + G E L D C
Sbjct: 290 FDLRQVATGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGL-DLCA 348
Query: 399 DLSAYKTV--VVPKITIHF-LGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSIL--- 452
+ A+ V +VP + +HF GG D+ + C+ PNS L
Sbjct: 349 AV-AHGDVGKLVPPLVLHFGSGGGDVAVPPENYWGPVDDSTACM-VVFSSGGPNSTLPMN 406
Query: 453 ----LGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+GN Q+ + YD+ L F P +C+
Sbjct: 407 ETTIIGNYMQQDMHLLYDLEKGMLSFQPADCS 438
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 121/452 (26%), Positives = 196/452 (43%), Gaps = 53/452 (11%)
Query: 43 PTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRN-TPSLEEILRRDQQRLHLKNS 101
P+ CN P +L+V + PCS K + ++ ++ +DQ RL +S
Sbjct: 28 PSNCN------PAADRSSTLQVFHIFSPCSPFRPSKPLSWADNVLQMQAKDQARLQFLSS 81
Query: 102 RRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCK 161
+++ + ++ + + + IG P Q + L LDT + W C
Sbjct: 82 LVARRSF---------VPIASARQLIQSPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCS 132
Query: 162 PCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDG 221
CI C F KS +F +PC S C + PN CS C +++ Y
Sbjct: 133 GCIGCPSTT--VFSSDKSSSFRPLPCQSPQCNQV-----PN--PSCSGSACGFNLTY-GS 182
Query: 222 SGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKT 281
S D +T+ + Y GC TG G++GL RGP+S++ ++
Sbjct: 183 STVAADLVQDNLTLATDSVPSY------TFGCIRKATGSSVPPQGLLGLGRGPLSLLGQS 236
Query: 282 NISY---FFYCLHS--PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGIS 336
Y F YCL S +G + G +KYTP++ P +S Y++ L I
Sbjct: 237 QSLYQSTFSYCLPSFKSVNFSGSLRLGP--VAQPIRIKYTPLLRNPRRSSLYYVNLISIR 294
Query: 337 VGGE--RLPLKASYF---TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE 391
VG + +P A F T T IDSGT TR AP Y+A+R FR+R+ + +
Sbjct: 295 VGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLG 354
Query: 392 DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESV-RQVCLGFALLPSDPNS 450
FDTCY + ++ P IT F G+++ L L+ + CL A P + NS
Sbjct: 355 G-FDTCYTVP----IISPTITFMF-AGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNS 408
Query: 451 IL--LGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+L + ++QQ+ + + +D+ R+G +C+
Sbjct: 409 VLNVIASMQQQNHRILFDIPNSRVGVARESCS 440
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 127/439 (28%), Positives = 191/439 (43%), Gaps = 59/439 (13%)
Query: 61 SLEVLGRYGPCSKLNQGKSRN-TPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFT 119
+++V Y P S K + S+ ++L DQ RL +S +K+
Sbjct: 27 TVKVFHVYSPQSPFRPSKPVSWEDSVLQMLAEDQARLQFLSSLVGRKSW----------- 75
Query: 120 FPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPS 177
P +G IV + Y + +G P Q + LDT + W C C+ CS F+
Sbjct: 76 VPIASGRQIVQSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSST---VFNSV 132
Query: 178 KSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE 237
S TF + C++ CK + PN C C ++ Y GS D + +
Sbjct: 133 TSTTFKTLGCDAPQCKQV-----PN--PTCGGSTCTWNTTY-GGSTILSNLTRDTIALST 184
Query: 238 VNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS-- 292
GY GC TG G++GL RGP+S +S+T Y F YCL S
Sbjct: 185 DIVPGY------TFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFR 238
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF--- 349
+G + G + +K TP++ P +S Y++ L GI VG + + + AS
Sbjct: 239 TLNFSGTLRLGPAGQPLR--IKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFN 296
Query: 350 --TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL---FDTCYDLSAYK 404
T T DSGT+ TR APVY+A+R FRKR +G I FDTCY
Sbjct: 297 PTTGAGTIFDSGTVFTRLVAPVYTAVRDEFRKR-----VGNAIVSSLGGFDTCYT----G 347
Query: 405 TVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRGY 461
+V P +T F G+++ L L+ + CL A P + NS+L + N+QQ+ +
Sbjct: 348 PIVAPTMTFMF-SGMNVTLPPDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNH 406
Query: 462 EVHYDVAGRRLGFGPGNCN 480
+ +DV R+G C+
Sbjct: 407 RILFDVPNSRIGVAREPCS 425
>gi|125555054|gb|EAZ00660.1| hypothetical protein OsI_22681 [Oryza sativa Indica Group]
Length = 337
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 101/362 (27%), Positives = 163/362 (45%), Gaps = 54/362 (14%)
Query: 147 LLLDTGSGITWTQCKPC---IHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
+ DTG GI+ +C C C FDPS+S TF+ +PC S C+ +G
Sbjct: 1 MAFDTGLGISLARCAACRPGAPCDGLAS--FDPSRSSTFAPVPCGSPDCR--------SG 50
Query: 204 QDKCSSKECPY-DIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQN 262
S+ CP ++ G+ A D +T+ F GC + ++G+
Sbjct: 51 CSSGSTPSCPLTSFPFLSGA-----VAQDVLTLTPSASVDDFT-----FGCVEGSSGEPL 100
Query: 263 GASGIMGLDRGPVSIISKTNISY---FFYCLH-SPYGSTGYITFGKPDTVNKKFVKYT-- 316
GA+G++ L R S+ S+ F YCL S S G++ G+ D + + + T
Sbjct: 101 GAAGLLDLSRDSRSLASRLAAGAGGTFSYCLPLSTTSSHGFLVIGEADVPHNRSARVTAV 160
Query: 317 -PIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRS 375
P+V P Y I L G+S+GG +P+ + +D+ T +Y+ LR
Sbjct: 161 APLVYDPAFPNHYVIDLAGVSLGGRDIPIP----PHAAMVLDTALPYTYMKPSMYAPLRD 216
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYDLSAYK-TVVVPKITIHFLGGVDLELDVRGTLVVESV 434
AFR+ M +Y + DL DTCY+ + + V++P + + F G L + +
Sbjct: 217 AFRRAMARYPRAPAMGDL-DTCYNFTGVRHEVLIPLVHLTFRGISGGGGGEGQVLGLGAD 275
Query: 435 RQV------------CLGFALLPSD-----PNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
+ + CL FA LPSD P ++++G + Q EV +DV G ++GF PG
Sbjct: 276 QMLYMSEPGNFFSVTCLAFAALPSDGDAAAPLAMVMGTLAQSSMEVVHDVQGGKIGFIPG 335
Query: 478 NC 479
+C
Sbjct: 336 SC 337
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 104/373 (27%), Positives = 168/373 (45%), Gaps = 45/373 (12%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP-----FFDPSKSKTFSKIPCN 188
I + IG P Q ++LDTGS ++W QC +++ P FDPS S +FS +PC+
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCH------RKKLPPKPKTSFDPSLSSSFSTLPCS 127
Query: 189 STTCKILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY 247
CK + F P D S++ C Y Y DG+ G +++T
Sbjct: 128 HPLCKPRIPDFTLPTSCD--SNRLCHYSYFYADGTFAEGNLVKEKITFSNTE-----ITP 180
Query: 248 PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGK--- 304
P +LGC ++ D+ GI+G++RG +S +S+ IS F YC+ G+ G
Sbjct: 181 PLILGCATESSDDR----GILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFYL 236
Query: 305 PDTVNKKFVKYTPIVTTPEQSEF-------YHITLTGISVGGERLPLKASYFTKLS---- 353
D N KY ++T PE Y + + GI G ++L + S F +
Sbjct: 237 GDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSG 296
Query: 354 -TEIDSGTIITRFPAPVYSALRSAFRKRM-KKYKMGKGIEDLFDTCYDLS-AYKTVVVPK 410
T +DSG+ T Y +R+ R+ ++ K G D C+D + A ++
Sbjct: 297 QTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVAMIPRLIGD 356
Query: 411 ITIHFLGGVDLELDVRGTLVVESVRQVCLGF---ALLPSDPNSILLGNVQQRGYEVHYDV 467
+ F GV++ + LV C+G ++L + N ++GNV Q+ V +DV
Sbjct: 357 LVFVFTRGVEIFVPKERVLVNVGGGIHCVGIGRSSMLGAASN--IIGNVHQQNLWVEFDV 414
Query: 468 AGRRLGFGPGNCN 480
RR+GF +C+
Sbjct: 415 TNRRVGFAKADCS 427
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 128/457 (28%), Positives = 192/457 (42%), Gaps = 64/457 (14%)
Query: 46 CNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLE----EILRRDQQRLHLKNS 101
C+ T+T Q G +L + PCS KS + S E + L +DQ RL +S
Sbjct: 25 CDLTKT---QDQGS-TLRIFHIDSPCSPF---KSSSPLSWEARVLQTLAQDQARLQYLSS 77
Query: 102 RRLQKAIPDNFKKTKAFTFPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQ 159
+++ P +G ++ + Y + IG P Q + L +DT S + W
Sbjct: 78 LVAGRSV-----------VPIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIP 126
Query: 160 CKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYV 219
C C+ C + F P+KS +F + C++ CK + PN C ++ C +++ Y
Sbjct: 127 CSGCVGCPS--NTAFSPAKSTSFKNVSCSAPQCKQV-----PN--PTCGARACSFNLTYG 177
Query: 220 DGSGETGFWA-TDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-----QNGASGIMGLDRG 273
S T R+ + F GC + G G G+
Sbjct: 178 SSSIAANLSQDTIRLAADPIKA--------FTFGCVNKVAGGGTIPPPQGLLGLGRGPLS 229
Query: 274 PVSIISKTNISYFFYCLHSPYGST--GYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHIT 331
+S S F YCL S T G + G T + VKYT ++ P +S Y++
Sbjct: 230 LMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGP--TSQPQRVKYTQLLRNPRRSSLYYVN 287
Query: 332 LTGISVGGE--RLPLKASYF---TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKM 386
L I VG + LP A F T T DSGT+ TR PVY A+R+ FRKR+K
Sbjct: 288 LVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVKPTTA 347
Query: 387 GKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLP 445
FDTCY V VP IT F GV++ + ++ + CL A P
Sbjct: 348 VVTSLGGFDTCYS----GQVKVPTITFMF-KGVNMTMPADNLMLHSTAGSTSCLAMAAAP 402
Query: 446 SDPNSI--LLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ NS+ ++ ++QQ+ + V DV RLG C+
Sbjct: 403 ENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 439
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 161/360 (44%), Gaps = 34/360 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
+ + ++IG P L +DT S + W QC PCI+C Q P FDPS+S T C ++
Sbjct: 85 FLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLPIFDPSRSYTHRNETCRTS- 143
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEV-NGNGYFARYPFL 250
++ P+ + +++ C Y + YVD +G G A + + + + + A + +
Sbjct: 144 -----QYSMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALHDVV 198
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISK--TNISYFFYCLHSPYGSTGYITFGKPDTV 308
GC +N G+ +GI+GL G S++ + SY F L P + G D
Sbjct: 199 FGCGHDNYGEPLVGTGILGLGYGEFSLVHRFGKKFSYCFGSLDDPSYPHNVLVLG--DDG 256
Query: 309 NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS------TEIDSGTII 362
TP+ + FY++T+ ISV G LP+ F + T ID+G +
Sbjct: 257 ANILGDTTPLEI---HNGFYYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNSL 313
Query: 363 TRFPAPVYSALRS----AFRKRMKKYKMGKGIEDLFDT-CYDLSAYKTVV---VPKITIH 414
T Y L++ F R + + +D+ CY+ + + +V P +T H
Sbjct: 314 TSLVEEAYKPLKNRIEDIFEGRFTAADVSQ--DDMIKMECYNGNFERDLVESGFPIVTFH 371
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
F G +L LDV+ + S CL A+ P + NSI G Q+ Y + YD+ + F
Sbjct: 372 FSEGAELSLDVKSLFMKLSPNVFCL--AVTPGNLNSI--GATAQQSYNIGYDLEAMEVSF 427
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 113/408 (27%), Positives = 175/408 (42%), Gaps = 46/408 (11%)
Query: 102 RRLQKAIPDNFKKTKAFTFPAKTGIVA-----ADEYYIVVAIGKPKQYVSLLLDTGSGIT 156
R+++AI + + A T G+ A +Y +G P Q L+DTGS +
Sbjct: 51 ERVRRAIALSRQINLASTRAEGGGVSAPVHWATRQYIAEYMVGDPPQRAEALIDTGSSLI 110
Query: 157 WTQCKPCIH--CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCS-SKECP 213
WTQC C+ C +Q P+F+ S S +F+ +PC C N C+ C
Sbjct: 111 WTQCTACLRKVCVRQDLPYFNASSSGSFAPVPCQDKACA-------GNYLHFCALDGTCT 163
Query: 214 YDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRG 273
+ + Y G G GF TD T Q F F + +GASG++GL RG
Sbjct: 164 FRVTYGAG-GIIGFLGTDAFTFQSGGATLAFGCVSFTRFAAPDVL---HGASGLIGLGRG 219
Query: 274 PVSIISKTNISYFFYCLHSPY----GSTGYITFGKPDTVN--KKFVKYTPIVTTPEQ--- 324
+S+ S+T F YCL +PY G++ ++ G +++ V V +P+
Sbjct: 220 RLSLASQTGAKRFSYCL-TPYFHNNGASSHLFVGAAASLSGGGGAVMSMAFVESPKDYPY 278
Query: 325 SEFYHITLTGISVGGERLPLKASYFTKLSTE---------IDSGTIITRFPAPVYSALRS 375
S FY++ L GI+VG +L + ++ F E IDSG+ T Y L
Sbjct: 279 STFYYLPLVGITVGETKLAIPSTAFDLQEVEEGFWEGGVIIDSGSPFTSLVEDAYEPLMG 338
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYDLSAYK---TVVVPKITIHFLGGVDLELDVRGTLVVE 432
+++ + ED D L + VVP + +HF GG D+ L
Sbjct: 339 ELARQLNGSLVPPPGED--DGGMALCVARGDLDRVVPTLVLHFSGGADMALPPENYWAPL 396
Query: 433 SVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
C+ A++ SI +GN QQ+ + +DV G RL F +C+
Sbjct: 397 EKSTACM--AIVRGYLQSI-IGNFQQQNMHILFDVGGGRLSFQNADCS 441
>gi|295830681|gb|ADG39009.1| AT5G10770-like protein [Capsella grandiflora]
gi|295830683|gb|ADG39010.1| AT5G10770-like protein [Capsella grandiflora]
gi|295830685|gb|ADG39011.1| AT5G10770-like protein [Capsella grandiflora]
gi|295830687|gb|ADG39012.1| AT5G10770-like protein [Capsella grandiflora]
Length = 159
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 96/156 (61%), Gaps = 6/156 (3%)
Query: 275 VSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHIT 331
+S S+T +Y F YCL S TG++TFG + VK+TPI T + + FY +
Sbjct: 1 LSFPSQTATAYNKIFSYCLPSSASYTGHLTFGSAGI--SRSVKFTPIATISDGNSFYGLN 58
Query: 332 LTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE 391
+ GI+VGG++L + ++ F+ IDSGT+ITR P Y+ALRS+F+ +M KY G+
Sbjct: 59 IVGITVGGQKLAIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAQMSKYPTASGVS 118
Query: 392 DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRG 427
L DTC+DLS +KTV +PK+ F GG +EL +G
Sbjct: 119 IL-DTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKG 153
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 103/380 (27%), Positives = 171/380 (45%), Gaps = 47/380 (12%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ V +G P + + +DTGS I W C PC C F+P S T S+I
Sbjct: 5 YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 64
Query: 187 CNSTTCKILLEWFPPNGQDKC-----SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGN 241
C+ C + G+ C S C Y Y DGSG +G++ +D M + V GN
Sbjct: 65 CSDDRCTAGFQ----TGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGN 120
Query: 242 GYFAR--YPFLLGCTDNNTGDQNGA----SGIMGLDRGPVSIISKTNI-----SYFFYCL 290
A + GC+++ +GD A GI G + +S+IS+ N F +CL
Sbjct: 121 EQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL 180
Query: 291 HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT 350
G + G+ + + + YTP+V P Q Y++ L I+V G++LP+ +S FT
Sbjct: 181 KGSDNGGGILVLGE---IVEPGLVYTPLV--PSQPH-YNLNLESIAVNGQKLPIDSSLFT 234
Query: 351 KLSTE---IDSGTIITRFPAPVYSALRSAFRKRMK---KYKMGKGIEDLFDTCYDLSAYK 404
+T+ +DSGT + Y SA + + + KG + C+ S+
Sbjct: 235 TSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ-----CFITSSSV 289
Query: 405 TVVVPKITIHFLGGVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRG 460
P +T++F+GGV + + L+ V++ C+G+ +I LG++ +
Sbjct: 290 DSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITI-LGDLVLKD 348
Query: 461 YEVHYDVAGRRLGFGPGNCN 480
YD+A R+G+ +C+
Sbjct: 349 KIFVYDLANMRMGWADYDCS 368
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 115/406 (28%), Positives = 179/406 (44%), Gaps = 35/406 (8%)
Query: 95 RLHLKNSRRLQKAIPDNFKKTKAFTF---PAKTGIVAA---------DEYYIVVAIGKPK 142
+L KNS +NF K K +F P K+ + + +Y + + +G P
Sbjct: 33 KLIHKNSPNSPFYKSNNFHKNKLRSFYQVPKKSFVQKSPYTRVTSNNGDYLMKLTLGSPP 92
Query: 143 QYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
+ L+DTGS + W QC PC C +Q+ P F+P +SKT+S IPC S C
Sbjct: 93 VDIYGLVDTGSDLVWAQCTPCGGCYRQKSPMFEPLRSKTYSPIPCESEQCSFF------- 145
Query: 203 GQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQN 262
G K C Y +Y D S G A + +T +G+ + GC +N+G N
Sbjct: 146 GYSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVG-DIIFGCGHSNSGTFN 204
Query: 263 -GASGIMGLDRGPVSIISKTNISY----FFYCL---HSPYGSTGYITFGKPDTVNKKFVK 314
GI+G+ GP+S++S+ Y F CL H+ ++G I FG+ V+ + V
Sbjct: 205 ENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTINFGEESDVSGEGVV 264
Query: 315 YTPIVTTPEQSEFYHITLTGISVGGERLPLKASY-FTKLSTEIDSGTIITRFPAPVYSAL 373
TP+ + Q+ Y +TL GISVG + +S +K + IDSGT T P Y L
Sbjct: 265 TTPLASEEGQTS-YLVTLEGISVGDTFVRFNSSETLSKGNIMIDSGTPATYIPQEFYERL 323
Query: 374 RSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVES 433
+ + + + CY + + P +T HF G D++L T +
Sbjct: 324 VEELKVQSSLLPIEDDPDLGTQLCY--RSETNLEGPILTAHF-EGADVQLLPIQTFIPPK 380
Query: 434 VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
C FA+ S + GN Q + +D+ + + F P +C
Sbjct: 381 DGVFC--FAMAGSTDGDYIFGNFAQSNILMGFDLDRKTISFKPTDC 424
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 113/432 (26%), Positives = 180/432 (41%), Gaps = 54/432 (12%)
Query: 86 EEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQY 144
E+LRR QR + + + A + KA A+T I+ A EY + + IG P
Sbjct: 45 HELLRRAIQRSRYRLAG-IGMARGEAASARKAVV--AETPIMPAGGEYLVKLGIGTPPYK 101
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQ 204
+ +DT S + WTQC+PC C Q DP F+P S T++ +PC+S TC L +
Sbjct: 102 FTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDD 161
Query: 205 DKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ--N 262
D + C Y Y + G A D++ I E G GC+ ++TG
Sbjct: 162 D----ESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRG------VAFGCSTSSTGGAPPP 211
Query: 263 GASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST-GYITFGKPDTVNKKFVK--YTPIV 319
ASG++GL RGP+S++S+ ++ F YCL P G + G + P+
Sbjct: 212 QASGVVGLGRGPLSLVSQLSVRRFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMR 271
Query: 320 TTPEQSEFYHITLTGISVGGERLPLKASYF----------------------------TK 351
P +Y++ L G+ +G + L + +
Sbjct: 272 RDPRYPSYYYLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANR 331
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCY---DLSAYKTVVV 408
ID + IT A +Y L + ++ + G G D C+ D A+ V V
Sbjct: 332 YGMIIDIASTITFLEASLYDELVNDLEVEIRLPR-GTGSSLGLDLCFILPDGVAFDRVYV 390
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSI-LLGNVQQRGYEVHYDV 467
P + + F G L LD + L E + + ++ S+ +LGN QQ+ +V Y++
Sbjct: 391 PAVALAF-DGRWLRLD-KARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNL 448
Query: 468 AGRRLGFGPGNC 479
R+ F C
Sbjct: 449 RRGRVTFVQSPC 460
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 113/432 (26%), Positives = 180/432 (41%), Gaps = 54/432 (12%)
Query: 86 EEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQY 144
E+LRR QR + + + A + KA A+T I+ A EY + + IG P
Sbjct: 45 HELLRRAIQRSRYRLAG-IGMARGEAASARKAVV--AETPIMPAGGEYLVKLGIGTPPYK 101
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQ 204
+ +DT S + WTQC+PC C Q DP F+P S T++ +PC+S TC L +
Sbjct: 102 FTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDD 161
Query: 205 DKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ--N 262
D + C Y Y + G A D++ I E G GC+ ++TG
Sbjct: 162 D----ESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRG------VAFGCSTSSTGGAPPP 211
Query: 263 GASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST-GYITFGKPDTVNKKFVK--YTPIV 319
ASG++GL RGP+S++S+ ++ F YCL P G + G + P+
Sbjct: 212 QASGVVGLGRGPLSLVSQLSVRRFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMR 271
Query: 320 TTPEQSEFYHITLTGISVGGERLPLKASYF----------------------------TK 351
P +Y++ L G+ +G + L + +
Sbjct: 272 RDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANR 331
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCY---DLSAYKTVVV 408
ID + IT A +Y L + ++ + G G D C+ D A+ V V
Sbjct: 332 YGMIIDIASTITFLEASLYDELVNDLEVEIRLPR-GTGSSLGLDLCFILPDGVAFDRVYV 390
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSI-LLGNVQQRGYEVHYDV 467
P + + F G L LD + L E + + ++ S+ +LGN QQ+ +V Y++
Sbjct: 391 PAVALAF-DGRWLRLD-KARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNL 448
Query: 468 AGRRLGFGPGNC 479
R+ F C
Sbjct: 449 RRGRVTFVQSPC 460
>gi|295830679|gb|ADG39008.1| AT5G10770-like protein [Capsella grandiflora]
Length = 159
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 96/156 (61%), Gaps = 6/156 (3%)
Query: 275 VSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHIT 331
+S S+T +Y F YCL S TG++TFG + VK+TPI T + + FY +
Sbjct: 1 LSFPSQTATAYNKIFSYCLPSSASYTGHLTFGSAGI--SRSVKFTPIXTISDGNSFYGLN 58
Query: 332 LTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE 391
+ GI+VGG++L + ++ F+ IDSGT+ITR P Y+ALRS+F+ +M KY G+
Sbjct: 59 IVGITVGGQKLAIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAQMSKYPTASGVS 118
Query: 392 DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRG 427
L DTC+DLS +KTV +PK+ F GG +EL +G
Sbjct: 119 IL-DTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKG 153
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 102/368 (27%), Positives = 164/368 (44%), Gaps = 38/368 (10%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP----FFDPSKSKTFSKIP 186
EY + V +G P + + DTGS + W C D F P++S T+S++
Sbjct: 102 EYLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSSTYSQLS 161
Query: 187 CNSTTCKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C S C+ L Q C + EC Y +Y DGS G +T+ + + G G
Sbjct: 162 CQSNACQAL-------SQASCDADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQV- 213
Query: 246 RYPFL-LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFF-----YCLHSPY--GST 297
R P + GC+ + G + G++GL G S++S+ + YCL Y S+
Sbjct: 214 RVPRVNFGCSTASAGTFR-SDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDANSS 272
Query: 298 GYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEID 357
+ FG V++ TP+V + S +Y + L ++VGG+ + S +D
Sbjct: 273 STLNFGSRAVVSEPGAASTPLVPSDVDS-YYTVALESVAVGGQEVATHDSRII-----VD 326
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL---SAYKTVVVPKITIH 414
SGT +T + L + +R+K ++ + E L CYD+ S +P +T+
Sbjct: 327 SGTTLTFLDPALLGPLVTELERRIKLQRV-QPPEQLLQLCYDVQGKSETDNFGIPDVTLR 385
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLP---SDPNSILLGNVQQRGYEVHYDVAGRR 471
F GG + L T + +CL L+P S P SI LGN+ Q+ + V YD+ R
Sbjct: 386 FGGGAAVTLRPENTFSLLQEGTLCL--VLVPVSESQPVSI-LGNIAQQNFHVGYDLDART 442
Query: 472 LGFGPGNC 479
+ F +C
Sbjct: 443 VTFAAADC 450
>gi|345292859|gb|AEN82921.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292861|gb|AEN82922.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292863|gb|AEN82923.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292865|gb|AEN82924.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292867|gb|AEN82925.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292869|gb|AEN82926.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292871|gb|AEN82927.1| AT5G10770-like protein, partial [Capsella rubella]
gi|345292873|gb|AEN82928.1| AT5G10770-like protein, partial [Capsella rubella]
Length = 161
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 96/156 (61%), Gaps = 6/156 (3%)
Query: 275 VSIISKTNISY---FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHIT 331
+S S+T +Y F YCL S TG++TFG + VK+TPI T + + FY +
Sbjct: 1 LSFPSQTATAYNKIFSYCLPSSASYTGHLTFGSAGI--SRSVKFTPISTISDGNSFYGLN 58
Query: 332 LTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE 391
+ GI+VGG++L + ++ F+ IDSGT+ITR P Y+ALRS+F+ +M KY G+
Sbjct: 59 IVGITVGGQKLAIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAQMSKYPTASGVS 118
Query: 392 DLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRG 427
L DTC+DLS +KTV +PK+ F GG +EL +G
Sbjct: 119 IL-DTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKG 153
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 109/387 (28%), Positives = 163/387 (42%), Gaps = 46/387 (11%)
Query: 119 TFPAKTGIVAADEY------YIV-VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD 171
T PA G VA Y Y+ IG P Q VS ++D + WTQC PC C +Q
Sbjct: 37 TPPAAGGAVAVPIYLSSQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDL 96
Query: 172 PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWA-T 230
P FDP+KS TF +PC S C+ + P C+S C Y+ +G+TG A T
Sbjct: 97 PLFDPTKSSTFRGLPCGSHLCESI-----PESSRNCTSDVCIYEAP--TKAGDTGGMAGT 149
Query: 231 DRMTIQEVNGNGYFARYPFLLGC---TDNNTGDQNGASGIMGLDRGPVSIISKTNISYFF 287
D I A+ GC TD G SGI+GL R P S++++ N++ F
Sbjct: 150 DTFAIGA-------AKETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFS 202
Query: 288 YCLHSP------YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGER 341
YCL G+T G ++ +K + + + +Y + L GI GG
Sbjct: 203 YCLAGKSSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGA- 261
Query: 342 LPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS 401
PL+A+ + + +D+ + + Y AL+ A + + + YDL
Sbjct: 262 -PLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKP-----YDLC 315
Query: 402 AYKTVV--VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFA------LLPSDPNSILL 453
K V P++ F GG L + L+ VCL L + +L
Sbjct: 316 FSKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGTVCLTIGSSASLNLTGELEGASIL 375
Query: 454 GNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G++QQ V +D+ L F P +C+
Sbjct: 376 GSLQQENVHVLFDLKEETLSFKPADCS 402
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 108/378 (28%), Positives = 171/378 (45%), Gaps = 48/378 (12%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+ + +G P Q V+++LDTGS ++W CK S F+P S ++S IPC+S C+
Sbjct: 42 VSLTVGSPPQQVTMVLDTGSELSWLHCKK----SPNLTSVFNPLSSSSYSPIPCSSPVCR 97
Query: 194 ILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGC 253
PN K C ++Y D S G A+D I G A L GC
Sbjct: 98 TRTRDL-PNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRI------GSSALPGTLFGC 150
Query: 254 TD----NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVN 309
D +N+ + +G+MG++RG +S +++ + F YC+ S S+G + FG
Sbjct: 151 MDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCI-SGRDSSGVLLFGDSHLSW 209
Query: 310 KKFVKYTPIVTTPEQSEF-----YHITLTGISVGGERLPLKASYFT-----KLSTEIDSG 359
+ YTP+V + Y + L GI VG + LPL S F T +DSG
Sbjct: 210 LGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSG 269
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKG-----IEDLFDTCYDLSA-YKTVVVPKITI 413
T T PVY+ALR+ F ++ K G + D CY + A K +P +++
Sbjct: 270 TQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVPAGGKLPELPAVSL 329
Query: 414 HFLGGVDLELDVRGTLVVESVRQV--------CLGFA---LLPSDPNSILLGNVQQRGYE 462
F G E+ V G +++ V + CL F LL + + ++G+ Q+
Sbjct: 330 MFRGA---EMVVGGEVLLYKVPGMMKGKEWVYCLTFGNSDLLGIE--AFVIGHHHQQNVW 384
Query: 463 VHYDVAGRRLGFGPGNCN 480
+ +D+ R+GF C+
Sbjct: 385 MEFDLVKSRVGFVETRCD 402
>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 469
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 112/393 (28%), Positives = 171/393 (43%), Gaps = 59/393 (15%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + ++ G P Q +S ++DTGS + W C C++ P DP+K TF IP S++
Sbjct: 90 YSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTF--IPKLSSS 147
Query: 192 CKILLEWFPPNG-------QDKC---------SSKECP-YDIAYVDGSGETGFWATDRMT 234
KI+ P G + +C +K CP Y I Y G+ +
Sbjct: 148 AKIVGCLNPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVF 207
Query: 235 IQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL---- 290
+ + F++GC+ SGI G RGP S+ + + F YCL
Sbjct: 208 AERTEPD-------FVVGCS---ILSSRQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHR 257
Query: 291 --HSPYGSTGYITFGKPDTVNKKF--VKYTPIVTTPEQS-----EFYHITLTGISVGGER 341
SP S + G PD+ + K + YTP P S E+Y++TL I VG +R
Sbjct: 258 FDDSPKSSKMTLYVG-PDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKR 316
Query: 342 LPLKASYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--F 394
+ + S+ S T +DSG+ T PV+ A+ + F ++M Y +E L
Sbjct: 317 VKVPYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGL 376
Query: 395 DTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTL-VVESVRQVCLGF-------ALLPS 446
C++LS +V +P + F GG +EL V +V + +CL + L S
Sbjct: 377 KPCFNLSGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSS 436
Query: 447 DPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
P SI+LGN Q + + YD+ R GF C
Sbjct: 437 GP-SIILGNYQSQNFYTEYDLENERFGFRRQRC 468
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 120/442 (27%), Positives = 188/442 (42%), Gaps = 55/442 (12%)
Query: 84 SLEEILRRDQQRLHLKNS---RRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIG 139
SL ++ R D+QR+ S RR ++ + AF P +G +Y++ +G
Sbjct: 44 SLADLARSDRQRMAFIASHGRRRARETAAGS--SAAAFEMPLTSGAYTGIGQYFVRFRVG 101
Query: 140 KPKQYVSLLLDTGSGITWTQC-KPCIHCSQQRDPF---FDPSKSKTFSKIPCNSTTCKIL 195
P Q L+ DTGS +TW +C +P + S+ F P S+T++ I C S TC
Sbjct: 102 TPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPISCASDTCTKS 161
Query: 196 LEWFPPNGQDKCSS--KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP----F 249
L P C + C YD Y DGS G T+ TI ++G G R
Sbjct: 162 L----PFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATI-ALSGRGREERKAKLKGL 216
Query: 250 LLGCTDNNTGDQNGAS-GIMGLDRGPVSIISKTNISY---FFYCL--H-SPYGSTGYITF 302
+LGCT + TG S G++ L VS S + F YCL H SP +T Y+TF
Sbjct: 217 VLGCTSSYTGPSFEVSDGVLSLGYSDVSFASHAASRFAGRFSYCLVDHLSPRNATSYLTF 276
Query: 303 G--------------------KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL 342
G + + TP++ FY + + +SV G+ L
Sbjct: 277 GPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVAVKAVSVAGQFL 336
Query: 343 PLKASYF---TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYD 399
+ + + +DSGT +T P Y A+ +A + + + + D F+ CY+
Sbjct: 337 KIPRAVWDVDAGGGVILDSGTSLTVLAKPAYRAVVAALSEGLA--GLPRVTMDPFEYCYN 394
Query: 400 L-SAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQ 458
S V +PK+ +HF G LE + ++ + C+G P P ++GN+ Q
Sbjct: 395 WTSPSGDVTLPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPW-PGISVIGNILQ 453
Query: 459 RGYEVHYDVAGRRLGFGPGNCN 480
+ + +D+ RRL F C
Sbjct: 454 QEHLWEFDIKNRRLKFQRSRCT 475
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 116/438 (26%), Positives = 192/438 (43%), Gaps = 56/438 (12%)
Query: 73 KLNQGKSRNTP-SLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADE 131
KL +G N L ++ RD+ R H + + L I +F F P G+
Sbjct: 30 KLERGIPANHEMELSQLKARDKAR-HGRLLQSLGGVI--DFPVDGTFD-PFVVGL----- 80
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
YY + +G P + + +DTGS + W C C C Q FFDP S T + +
Sbjct: 81 YYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPVS 140
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF 244
C+ C W + CS + C Y Y DGSG +GF+ +D + + G+
Sbjct: 141 CSDQRCS----WGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLV 196
Query: 245 --ARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSP 293
+ P + GC+ + TGD GI G + +S+IS+ F +CL
Sbjct: 197 PNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKGE 256
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
G G + G+ N F TP+V P Q Y++ L ISV G+ LP+ S F+ +
Sbjct: 257 NGGGGILVLGEIVEPNMVF---TPLV--PSQPH-YNVNLLSISVNGQALPINPSVFSTSN 310
Query: 354 ---TEIDSGTIITRFP----APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV 406
T ID+GT + P A+ +A + ++ + KG + CY ++
Sbjct: 311 GQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV-VSKG-----NQCYVIATSVAD 364
Query: 407 VVPKITIHFLGGVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYE 462
+ P ++++F GG + L+ + L+ V C+GF + + +I LG++ +
Sbjct: 365 IFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITI-LGDLVLKDKI 423
Query: 463 VHYDVAGRRLGFGPGNCN 480
YD+ G+R+G+ +C+
Sbjct: 424 FVYDLVGQRIGWANYDCS 441
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 103/378 (27%), Positives = 169/378 (44%), Gaps = 46/378 (12%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
YY + +G P + + +DTGS + W C C C Q FFDP S T S I
Sbjct: 81 YYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPIS 140
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF 244
C+ C W + CS + C Y Y DGSG +GF+ +D + + G+
Sbjct: 141 CSDQRCS----WGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLV 196
Query: 245 --ARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSP 293
+ P + GC+ + TGD GI G + +S+IS+ F +CL
Sbjct: 197 PNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGE 256
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
G G + G+ N F TP+V P Q Y++ L ISV G+ LP+ S F+ +
Sbjct: 257 NGGGGILVLGEIVEPNMVF---TPLV--PSQPH-YNVNLLSISVNGQALPINPSVFSTSN 310
Query: 354 ---TEIDSGTIITRFP----APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV 406
T ID+GT + P A+ +A + ++ + KG + CY ++
Sbjct: 311 GQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV-VSKG-----NQCYVITTSVGD 364
Query: 407 VVPKITIHFLGGVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYE 462
+ P ++++F GG + L+ + L+ V C+GF + + +I LG++ +
Sbjct: 365 IFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITI-LGDLVLKDKI 423
Query: 463 VHYDVAGRRLGFGPGNCN 480
YD+ G+R+G+ +C+
Sbjct: 424 FVYDLVGQRIGWANYDCS 441
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 115/433 (26%), Positives = 180/433 (41%), Gaps = 57/433 (13%)
Query: 79 SRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAI 138
S N P + LR + R + R K T F + + +
Sbjct: 24 SSNQPPIVLALRTQKHRTPISTPRLFSTTS----KTTDKLLFHHNVTLTVS------LTA 73
Query: 139 GKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEW 198
G P Q ++++LDTGS ++W CK + F+P SKT++KIPC+S TC+
Sbjct: 74 GTPLQNITMVLDTGSELSWLHCKK----EPNFNSIFNPLASKTYTKIPCSSPTCETRTRD 129
Query: 199 FP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTD-- 255
P P D +K C + I+Y D S G A + + V G + GC D
Sbjct: 130 LPLPVSCDP--AKLCHFIISYADASSVEGNLAFETFRVGSVTGPAT------VFGCMDSG 181
Query: 256 --NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFV 313
+N+ + +G+MG++RG +S +++ F YC+ S S+G + G+ K +
Sbjct: 182 FSSNSEEDAKTTGLMGMNRGSLSFVNQMGFRKFSYCI-SDRDSSGVLLLGEASFSWLKPL 240
Query: 314 KYTPIVTTPEQSEF-----YHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIIT 363
YTP+V + Y + L GI V + L L S F T +DSGT T
Sbjct: 241 NYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTMVDSGTQFT 300
Query: 364 RFPAPVYSALRSAFRKRMKKYKMGKG-----IEDLFDTCYDLSAYKTVV--VPKITIHFL 416
PVYSAL+ F + K + D CY + + + +P + + F
Sbjct: 301 FLLGPVYSALKQEFLLQTKGVLRVLNEPRYVFQGAMDLCYLIEPTRAALPNLPVVNLMFR 360
Query: 417 GGVDLELDVRGTLVVESVRQVCLG------FALLPSDP---NSILLGNVQQRGYEVHYDV 467
G E+ V G ++ V G F SD S ++G+ QQ+ + YD+
Sbjct: 361 GA---EMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDSLGIESFVIGHHQQQNVWMEYDL 417
Query: 468 AGRRLGFGPGNCN 480
R+GF C+
Sbjct: 418 EKSRIGFAEVRCD 430
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 106/378 (28%), Positives = 168/378 (44%), Gaps = 44/378 (11%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKP-CIHCSQQRDPF-FDPSKSKTFSKIPCNSTT 191
+ +A+G P Q V+++LDTGS ++W C P R F P S TF+ +PC+S
Sbjct: 68 VSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCDSAQ 127
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
C+ PP +SK+C ++Y DGS G AT+ T+ G G R F
Sbjct: 128 CRSRDLPSPPACDG--ASKQCRVSLSYADGSSSDGALATEVFTV----GQGPPLRAAF-- 179
Query: 252 GCTD---NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTV 308
GC + + D +G++G++RG +S +S+ + F YC+ S G + G D
Sbjct: 180 GCMATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSYCI-SDRDDAGVLLLGHSDLP 238
Query: 309 NKKFVKYTPIVTTPEQSEF-----YHITLTGISVGGERLPLKASYFT-----KLSTEIDS 358
+ YTP+ + Y + L GI VGG+ LP+ AS T +DS
Sbjct: 239 FLP-LNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDS 297
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGK-----GIEDLFDTCYDLSAYKT--VVVPKI 411
GT T YSAL++ F ++ K + ++ FDTC+ + + +P +
Sbjct: 298 GTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAV 357
Query: 412 TIHFLGGVDLELDVRGTLVVESVRQV--------CLGFALLPSDP-NSILLGNVQQRGYE 462
T+ F G ++ V G ++ V CL F P + ++G+ Q
Sbjct: 358 TLLFNGA---QMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVW 414
Query: 463 VHYDVAGRRLGFGPGNCN 480
V YD+ R+G P C+
Sbjct: 415 VEYDLERGRVGLAPIRCD 432
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 109/384 (28%), Positives = 169/384 (44%), Gaps = 50/384 (13%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKP---CIHCS-QQRDPFFDPSKSKTFSKIPC 187
Y I ++ G P Q +S ++DTGS W C C +CS R F P S + I C
Sbjct: 77 YSISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNCSFTSRISPFLPKHSSSSKIIGC 136
Query: 188 NSTTCKILLEWFPP---------NGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEV 238
+ C W N CS PY I Y GSG TG A + +
Sbjct: 137 KNPKC----SWIHQTDLRCTDCDNNSRNCSQICPPYLILY--GSGTTGGVALS----ETL 186
Query: 239 NGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHS-----P 293
+ +G FL+GC+ +GI G RGP S+ S+ ++ F YCL S
Sbjct: 187 HLHGLIVPN-FLVGCS---VFSSRQPAGIAGFGRGPSSLPSQLGLTKFSYCLLSHKFDDT 242
Query: 294 YGSTGYITFGKPDTVNK-KFVKYTPIVTTPEQSE------FYHITLTGISVGGERLPLKA 346
S+ + + D+ K + YTP+V P+ + +Y+++L IS+GG + +
Sbjct: 243 QESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPY 302
Query: 347 SYFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--FDTCYD 399
Y + T IDSGT T + L + F ++K Y+ +E L C++
Sbjct: 303 KYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGLKPCFN 362
Query: 400 LSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNS---ILLGN 455
+S K + +P++ +HF GG D+EL + R+V C ++ S ++LGN
Sbjct: 363 VSGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTVVTDGAEKASGPGMILGN 422
Query: 456 VQQRGYEVHYDVAGRRLGFGPGNC 479
Q + + V YD+ RLGF +C
Sbjct: 423 FQMQNFYVEYDLQNERLGFKKESC 446
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 103/378 (27%), Positives = 169/378 (44%), Gaps = 46/378 (12%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
YY + +G P + + +DTGS + W C C C Q FFDP S T S I
Sbjct: 81 YYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPIS 140
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF 244
C+ C W + CS + C Y Y DGSG +GF+ +D + + G+
Sbjct: 141 CSDQRC----SWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLV 196
Query: 245 --ARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSP 293
+ P + GC+ + TGD GI G + +S+IS+ F +CL
Sbjct: 197 PNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGE 256
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
G G + G+ N F TP+V P Q Y++ L ISV G+ LP+ S F+ +
Sbjct: 257 NGGGGILVLGEIVEPNMVF---TPLV--PSQPH-YNVNLLSISVNGQALPINPSVFSTSN 310
Query: 354 ---TEIDSGTIITRFP----APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV 406
T ID+GT + P A+ +A + ++ + KG + CY ++
Sbjct: 311 GQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV-VSKG-----NQCYVITTSVGD 364
Query: 407 VVPKITIHFLGGVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYE 462
+ P ++++F GG + L+ + L+ V C+GF + + +I LG++ +
Sbjct: 365 IFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITI-LGDLVLKDKI 423
Query: 463 VHYDVAGRRLGFGPGNCN 480
YD+ G+R+G+ +C+
Sbjct: 424 FVYDLVGQRIGWANYDCS 441
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 162/362 (44%), Gaps = 27/362 (7%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
+Y + + IG P +S +DTGS + W QC PC+ C Q +P FDP KS T++ I C+S
Sbjct: 63 QYLMELYIGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQINPMFDPLKSSTYTNISCDSP 122
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C + P + K C Y Y D S G A + +T+ G + L
Sbjct: 123 LC------YKPYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKP-ISLQGIL 175
Query: 251 LGCTDNNTGDQNGAS-GIMGLDRGPVSIISKTNISY----FFYCLHSPYGS----TGYIT 301
GC NNTG+ N G++GL GP S++S+ + F CL P+ + + ++
Sbjct: 176 FGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCL-VPFLTDITISSQMS 234
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTI 361
FGK V + V TP+V + Y++TL GISV LP+ ++ K + +DSGT
Sbjct: 235 FGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNST-IEKGNMLVDSGTP 293
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
P +Y + + ++ + CY + P +T HF G +L
Sbjct: 294 PNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCY--RTQTNLKGPTLTYHF-EGANL 350
Query: 422 ELDVRGTLV---VESVRQVCLGFA-LLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
L T + E+ CL SDP + GN Q Y + +D+ + + F P
Sbjct: 351 LLTPIQTFIPPTPETKGVFCLAITNCANSDPG--IYGNFAQTNYLIGFDLDRQIVSFKPT 408
Query: 478 NC 479
+C
Sbjct: 409 DC 410
>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 444
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 121/435 (27%), Positives = 189/435 (43%), Gaps = 50/435 (11%)
Query: 61 SLEVLGRYGPCSKLNQGKSRN-TPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFT 119
+L+V + PCS K + S+ ++ +DQ R+ ++ +++I
Sbjct: 43 TLQVFHVFSPCSPFRPSKPMSWEESVLQLQAKDQARMQYLSNLVARRSI----------- 91
Query: 120 FPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPS 177
P +G I + Y + G P Q + L +DT + W C C+ CS F P
Sbjct: 92 VPIASGRQITQSPTYIVRAKFGTPAQTLLLAMDTSNDAAWVPCTACVGCSTTTP--FAPP 149
Query: 178 KSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE 237
KS TF K+ C ++ CK + C C ++ Y S D +T+
Sbjct: 150 KSTTFKKVGCGASQCKQVRN-------PTCDGSACAFNFTY-GTSSVAASLVQDTVTLAT 201
Query: 238 VNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPY 294
Y GC TG G++GL RGP+S++++T Y F YCL S +
Sbjct: 202 DPVPAY------TFGCIQKATGSSLPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPS-F 254
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYF--- 349
+ + V + + P P +S Y++ L I VG +P +A F
Sbjct: 255 KTLNFSGHXDLXPVAQPRDQVYPSFKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNPX 314
Query: 350 TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--FDTCYDLSAYKTVV 407
T T DSGT+ TR P Y+A+R+ FR+R+ +K + L FDTCY + +V
Sbjct: 315 TGAGTVFDSGTVFTRLVEPAYTAVRNEFRRRVSVHK-KLTVTSLGGFDTCYTVP----IV 369
Query: 408 VPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRGYEVH 464
P IT F G+++ L L+ + V CL A P + NS+L + N+QQ+ + V
Sbjct: 370 APTITFMF-SGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVL 428
Query: 465 YDVAGRRLGFGPGNC 479
+DV RLG C
Sbjct: 429 FDVPNSRLGVARELC 443
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 112/406 (27%), Positives = 172/406 (42%), Gaps = 50/406 (12%)
Query: 118 FTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP---- 172
F P +G +Y++ +G P Q L+ DTGS +TW +C+ S
Sbjct: 95 FAMPLSSGAYTGTGQYFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPA 154
Query: 173 -----------FFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSK--ECPYDIAYV 219
F P SKT+S IPC+S TCK + P CSS C YD Y
Sbjct: 155 AAPSPAVAPPRVFRPGDSKTWSPIPCSSETCKSTI----PFSLANCSSSTAACSYDYRYN 210
Query: 220 DGSGETGFWATDRMTIQ-------EVNGNGYFARYPFLLGCTDNNTGDQNGAS-GIMGLD 271
D S G TD T+ G+ +LGCT + G AS G++ L
Sbjct: 211 DNSAARGVVGTDSATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASDGVLSLG 270
Query: 272 RGPVSIISKTNISY---FFYCL--H-SPYGSTGYITFGK-PDTVNKKFV---KYTPIVTT 321
+S S+ + F YCL H +P +T Y+TFG PD + TP++
Sbjct: 271 YSNISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLD 330
Query: 322 PEQSEFYHITLTGISVGGERLPLKASYF---TKLSTEIDSGTIITRFPAPVYSALRSAFR 378
FY + + +SV G L + A + + T IDSGT +T P Y A+ +A
Sbjct: 331 ARVRPFYAVAVDSVSVDGVALDIPAEVWDVGSNGGTIIDSGTSLTVLATPAYKAVVAALS 390
Query: 379 KRMKKYKMGKGIEDLFDTCYDLSAY----KTVVVPKITIHFLGGVDLELDVRGTLVVESV 434
+++ + + D FD CY+ +A + VPK+ + F G LE + ++ +
Sbjct: 391 EQLA--GLPRVAMDPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVIDAAP 448
Query: 435 RQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
C+G + P ++GN+ Q+ + +D+ R L F +C
Sbjct: 449 GVKCIGVQ-EGAWPGVSVIGNILQQEHLWEFDLNNRWLRFRQTSCT 493
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 109/387 (28%), Positives = 163/387 (42%), Gaps = 46/387 (11%)
Query: 119 TFPAKTGIVAADEY------YIV-VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD 171
T PA G VA Y Y+ IG P Q VS ++D + WTQC PC C +Q
Sbjct: 37 TPPAAGGAVAVPIYLSSQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDL 96
Query: 172 PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWA-T 230
P FDP+KS TF +PC S C+ + P C+S C Y+ +G+TG A T
Sbjct: 97 PLFDPTKSSTFRGLPCGSHLCESI-----PESSRNCTSDVCIYEAP--TKAGDTGGKAGT 149
Query: 231 DRMTIQEVNGNGYFARYPFLLGC---TDNNTGDQNGASGIMGLDRGPVSIISKTNISYFF 287
D I A+ GC TD G SGI+GL R P S++++ N++ F
Sbjct: 150 DTFAIGA-------AKETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFS 202
Query: 288 YCLHSP------YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGER 341
YCL G+T G ++ +K + + + +Y + L GI GG
Sbjct: 203 YCLAGKSSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGA- 261
Query: 342 LPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS 401
PL+A+ + + +D+ + + Y AL+ A + + + YDL
Sbjct: 262 -PLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKP-----YDLC 315
Query: 402 AYKTVV--VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFA------LLPSDPNSILL 453
K V P++ F GG L + L+ VCL L + +L
Sbjct: 316 FPKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGTVCLTIGSSASLNLTGELEGASIL 375
Query: 454 GNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G++QQ V +D+ L F P +C+
Sbjct: 376 GSLQQENVHVLFDLKEETLSFKPADCS 402
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 118/396 (29%), Positives = 172/396 (43%), Gaps = 65/396 (16%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I + +G P Q +LDTGS + W C CS P D +K TF IP NS+T
Sbjct: 92 YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTF--IPKNSST 149
Query: 192 CKILLEWFPPNG-------QDKCS---------SKECP-YDIAYVDGSGETGFWATDRM- 233
K+L P G Q +C S CP Y I Y GS GF D +
Sbjct: 150 AKLLGCRNPKCGYIFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLGS-TAGFLLLDNLN 208
Query: 234 ----TIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYC 289
T+ + FL+GC+ + SGI G RG S+ S+ N+ F YC
Sbjct: 209 FPGKTVPQ-----------FLVGCSILSIRQ---PSGIAGFGRGQESLPSQMNLKRFSYC 254
Query: 290 LHS------PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQS-----EFYHITLTGISVG 338
L S P S + + YTP + P + E+Y++TL + VG
Sbjct: 255 LVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVG 314
Query: 339 GERLPLKASYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRMKK-YKMGKGIED 392
G+ + + ++ S T +DSG+ T PVY+ + F K+++K Y + E
Sbjct: 315 GKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAET 374
Query: 393 L--FDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTL-VVESVRQVCL-----GFALL 444
C+++S KTV P++T F GG + ++ +V VCL G A
Sbjct: 375 QSGLSPCFNISGVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGGAGP 434
Query: 445 PSDPN-SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
P +I+LGN QQ+ + + YD+ R GFGP +C
Sbjct: 435 PKTTGPAIILGNYQQQNFYIEYDLENERFGFGPRSC 470
>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
vinifera]
Length = 437
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 125/433 (28%), Positives = 189/433 (43%), Gaps = 56/433 (12%)
Query: 61 SLEVLGRYGPCSKLNQGKSRN-TPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFT 119
+L+VL Y PCS + + S+ ++ +D+ RL +S +K++
Sbjct: 38 TLQVLHVYSPCSPFRPKEPLSWEESVLQMQAKDKARLQFLSSLVARKSV----------- 86
Query: 120 FPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPS 177
P +G IV Y + IG P Q + + +DT S + W C C+ CS F+
Sbjct: 87 VPIASGRQIVQNPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGCSST---LFNSP 143
Query: 178 KSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE 237
S T+ + C + CK + + C C +++ Y GS + D +T+
Sbjct: 144 ASTTYKSLGCQAAQCKQV-------PKPTCGGGVCSFNLTY-GGSSLAANLSQDTITLAT 195
Query: 238 VNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS-- 292
GY GC TG A G++GL RGP+S++S+T Y F YCL S
Sbjct: 196 DAVPGYS------FGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFK 249
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF--- 349
+G + G K +KYTP++ P + Y + L + VG + + F
Sbjct: 250 SLNFSGSLRLGP--VGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFN 307
Query: 350 --TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV 407
T T DSGT+ TR P Y A+R AFR R+ + + FDTCY + +
Sbjct: 308 PSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGG-FDTCYTVP----IA 362
Query: 408 VPKITIHFLG-GVDLELDVRGTLVVESV--RQVCLGFALLPSDPNSIL--LGNVQQRGYE 462
P IT F G V L D L++ S CL A P + NS+L + N+QQ+ +
Sbjct: 363 APTITFMFTGMNVTLPPD---NLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHR 419
Query: 463 VHYDVAGRRLGFG 475
+ YDV RLG
Sbjct: 420 LLYDVPNSRLGVA 432
>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
Length = 609
Score = 122 bits (305), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 112/393 (28%), Positives = 172/393 (43%), Gaps = 59/393 (15%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + ++ G P Q +S ++DTGS + W C C++ P DP+K TF IP S++
Sbjct: 90 YSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTF--IPKLSSS 147
Query: 192 CKILLEWFPPNG-------QDKC---------SSKECP-YDIAYVDGSGETGFWATDRMT 234
KI+ P G + +C +K CP Y I Y G+ +
Sbjct: 148 AKIVGCLNPKCGFVMDSEVRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVF 207
Query: 235 IQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL---- 290
+ + F++GC+ ++ SGI G RGP S+ + + F YCL
Sbjct: 208 AERTEPD-------FVVGCSILSS---RQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHR 257
Query: 291 --HSPYGSTGYITFGKPDTVNKKF--VKYTPIVTTPEQS-----EFYHITLTGISVGGER 341
SP S + G PD+ + K + YTP P S E+Y++TL I VG +R
Sbjct: 258 FDDSPKSSKMTLYVG-PDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKR 316
Query: 342 LPLKASYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--F 394
+ S+ S T +DSG+ T PV+ A+ + F ++M Y +E L
Sbjct: 317 VKXPYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGL 376
Query: 395 DTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTL-VVESVRQVCLGF-------ALLPS 446
C++LS +V +P + F GG +EL V +V + +CL + L S
Sbjct: 377 KPCFNLSGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSS 436
Query: 447 DPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
P SI+LGN Q + + YD+ R GF C
Sbjct: 437 GP-SIILGNYQSQNFYTEYDLENERFGFRRQRC 468
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 122 bits (305), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 106/378 (28%), Positives = 167/378 (44%), Gaps = 44/378 (11%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKP-CIHCSQQRDPF-FDPSKSKTFSKIPCNSTT 191
+ +A+G P Q V+++LDTGS ++W C P R F P S TF+ +PC S
Sbjct: 67 VSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCGSAQ 126
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
C+ PP +SK+C ++Y DGS G AT+ T+ G G R F
Sbjct: 127 CRSRDLPSPPACDG--ASKQCRVSLSYADGSSSDGALATEVFTV----GQGPPLRAAF-- 178
Query: 252 GCTD---NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTV 308
GC + + D +G++G++RG +S +S+ + F YC+ S G + G D
Sbjct: 179 GCMATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSYCI-SDRDDAGVLLLGHSDLP 237
Query: 309 NKKFVKYTPIVTTPEQSEF-----YHITLTGISVGGERLPLKASYFT-----KLSTEIDS 358
+ YTP+ + Y + L GI VGG+ LP+ AS T +DS
Sbjct: 238 FLP-LNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDS 296
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGK-----GIEDLFDTCYDLSAYKT--VVVPKI 411
GT T YSAL++ F ++ K + ++ FDTC+ + + +P +
Sbjct: 297 GTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAV 356
Query: 412 TIHFLGGVDLELDVRGTLVVESVRQV--------CLGFALLPSDP-NSILLGNVQQRGYE 462
T+ F G ++ V G ++ V CL F P + ++G+ Q
Sbjct: 357 TLLFNGA---QMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVW 413
Query: 463 VHYDVAGRRLGFGPGNCN 480
V YD+ R+G P C+
Sbjct: 414 VEYDLERGRVGLAPIRCD 431
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 110/405 (27%), Positives = 176/405 (43%), Gaps = 26/405 (6%)
Query: 89 LRRDQQRLHLKNSRRLQKA--IPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVS 146
++R + RL + +R + A P +T P K G + +Y + IG P +S
Sbjct: 55 VQRSRSRLSMLAARAVSNAGAAPGESAQT-----PLKKG---SGDYAMSFGIGTPATGLS 106
Query: 147 LLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN-GQD 205
DTGS + WT+C C CS + P + P+ S + + + C TC L N
Sbjct: 107 GEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGG 166
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGAS 265
S C Y AY + + MT G+ A GCT + G S
Sbjct: 167 GSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGS 226
Query: 266 GIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTV---NKKFVKYTPIVTTP 322
G++GL RG +S++++ N+ F Y L S + I+FG V N TP++T P
Sbjct: 227 GLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNP 286
Query: 323 --EQSEFYHITLTGISVGGERLPLKASYFT------KLSTEIDSGTIITRFPAPVYSALR 374
+ FY++ LTGISVGG+ + + + F+ DSGT +T P P Y+ +R
Sbjct: 287 VVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVR 346
Query: 375 SAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTL--VVE 432
+M K D C+ T P + +HF GG D++L L +
Sbjct: 347 DELLSQMGFQKPPPAANDDDLICF-TGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQG 405
Query: 433 SVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR-RLGFGP 476
+ ++++ S ++GN+ Q + V +D++G R+ F P
Sbjct: 406 QNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDLSGNARMLFQP 450
>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 103/366 (28%), Positives = 158/366 (43%), Gaps = 43/366 (11%)
Query: 143 QYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
Q L LD G G++W QC PC HC Q P FDP+KS TFS IP ++T W P
Sbjct: 109 QNYQLALDMGGGLSWMQCLPCRHCLLQMSPVFDPTKSPTFSNIPAHNTV------WCRPP 162
Query: 203 GQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTG--D 260
Q ++ C +DIAY D + +G+ A D + N + + + GC +
Sbjct: 163 YQ-PLANGACGFDIAYRDNTHASGYLARDTFSFPAGN-DDFVPLSAIVFGCAHQTEHFKN 220
Query: 261 QNGASGIMGLDRGPVS--------IISKTNISYFFYCLHSPYGST-GYITFGK------P 305
Q +GI+GL GP + + F YC P S Y+ FG P
Sbjct: 221 QRAVAGILGLGMGPAGKPPTAFTKQVLPAHGGRFSYCPFVPGMSMYSYLRFGSDIPSHPP 280
Query: 306 DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLP-LKASYFTKLS-----TEIDSG 359
V++ + TP++ SE Y + L G+SVG RL + + F + + +D G
Sbjct: 281 PNVHR---QSTPVLAPAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNAHGAGGCVVDIG 337
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
T +T F Y + A R+ +++ + + +TC A V+P +T+HF G
Sbjct: 338 TRMTAFIHSAYVHIDHAVRQHLQR-RGAHIVVVRGNTCVQQPAPHHDVLPSMTLHFENGA 396
Query: 420 DLEL---DVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR--RLGF 474
L + V VV C GF S + ++G QQ + +D+ + F
Sbjct: 397 WLRVMPEHVFMPFVVGGHHYQCFGFV---SSTDLTVIGARQQVNHRFIFDLHDTIPIMSF 453
Query: 475 GPGNCN 480
P +C+
Sbjct: 454 NPEDCH 459
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 110/405 (27%), Positives = 176/405 (43%), Gaps = 26/405 (6%)
Query: 89 LRRDQQRLHLKNSRRLQKA--IPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVS 146
++R + RL + +R + A P +T P K G + +Y + IG P +S
Sbjct: 55 VQRSRSRLSMLAARAVSNAGAAPGESAQT-----PLKKG---SGDYAMSFGIGTPATGLS 106
Query: 147 LLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN-GQD 205
DTGS + WT+C C CS + P + P+ S + + + C TC L N
Sbjct: 107 GEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGG 166
Query: 206 KCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGAS 265
S C Y AY + + MT G+ A GCT + G S
Sbjct: 167 GSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGS 226
Query: 266 GIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTV---NKKFVKYTPIVTTP 322
G++GL RG +S++++ N+ F Y L S + I+FG V N TP++T P
Sbjct: 227 GLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNP 286
Query: 323 --EQSEFYHITLTGISVGGERLPLKASYFT------KLSTEIDSGTIITRFPAPVYSALR 374
+ FY++ LTGISVGG+ + + + F+ DSGT +T P P Y+ +R
Sbjct: 287 VVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVR 346
Query: 375 SAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTL--VVE 432
+M K D C+ T P + +HF GG D++L L +
Sbjct: 347 DELLSQMGFQKPPPAANDDDLICF-TGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQG 405
Query: 433 SVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR-RLGFGP 476
+ ++++ S ++GN+ Q + V +D++G R+ F P
Sbjct: 406 QNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDLSGNARMLFQP 450
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 100/380 (26%), Positives = 169/380 (44%), Gaps = 49/380 (12%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ V +G P + + +DTGS I W C PC C FF+P S T S+IP
Sbjct: 89 YFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRIP 148
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE-----CPYDIAYVDGSGETGFWATDRMTIQEVNGN 241
C+ C L+ G+ C S + C Y Y DGSG +GF+ +D M V GN
Sbjct: 149 CSDDRCTAALQ----TGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGN 204
Query: 242 GYFAR--YPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISK-----TNISYFFYCL 290
A + GC+++ +GD GI G + +S++S+ + F +CL
Sbjct: 205 EQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCL 264
Query: 291 HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT 350
G + G+ + + + +TP+V P Q Y++ L I+V G++LP+ +S F
Sbjct: 265 KGSDNGGGILVLGE---IVEPGLVFTPLV--PSQPH-YNLNLESIAVSGQKLPIDSSLFA 318
Query: 351 KLSTE---IDSGTIITRFPAPVYSALRSAFRKR---MKKYKMGKGIEDLFDTCYDLSAYK 404
+T+ +DSGT + Y +A + + KGI+ C+ ++
Sbjct: 319 TSNTQGTIVDSGTTLVYLVDGAYDPFINAIAAAVSPSVRSVVSKGIQ-----CFVTTSSV 373
Query: 405 TVVVPKITIHFLGGVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRG 460
P T++F GGV + + L+ V++ C+G+ +LG++ +
Sbjct: 374 DSSFPTATLYFKGGVSMTVKPENYLLQQGSVDNNVLWCIGWQ---RSQGITILGDLVLKD 430
Query: 461 YEVHYDVAGRRLGFGPGNCN 480
YD+A R+G+ +C+
Sbjct: 431 KIFVYDLANMRMGWADYDCS 450
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 107/384 (27%), Positives = 177/384 (46%), Gaps = 59/384 (15%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+ + +G P Q VS+++DTGS ++W C + FDP++S ++ IPC+S TC
Sbjct: 33 VSLTVGTPPQNVSMVIDTGSELSWLHCNKTLSYPTT----FDPTRSTSYQTIPCSSPTCT 88
Query: 194 ILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLG 252
+ FP P D S+ C ++Y D S G A+D I + +G + G
Sbjct: 89 NRTQDFPIPASCD--SNNLCHATLSYADASSSDGNLASDVFHIGSSDISG------LVFG 140
Query: 253 CTD----NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTV 308
C D +N+ + + ++G+MG++RG +S +S+ F YC+ S +G + G+ +
Sbjct: 141 CMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGFPKFSYCI-SGTDFSGLLLLGESNLT 199
Query: 309 NKKFVKYTPIVTTPEQSEF-----YHITLTGISVGGERLPLKASYFT-----KLSTEIDS 358
+ YTP++ + Y + L GI V + LP+ S F T +DS
Sbjct: 200 WSVPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEPDHTGAGQTMVDS 259
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED-------LFDTCY--DLSAYKTVVVP 409
GT T PVY+ALRSAF + + + +ED D CY LS ++P
Sbjct: 260 GTQFTFLLGPVYNALRSAFLNQTS--SVLRVLEDPDFVFQGAMDLCYLVPLSQRVLPLLP 317
Query: 410 KITIHFLGGVDLELDVRGTLVV----------ESVRQVCLGFA---LLPSDPNSILLGNV 456
+T+ F G E+ V G V+ +SV CL F LL + + ++G+
Sbjct: 318 TVTLVFRGA---EMTVSGDRVLYRVPGELRGNDSVH--CLSFGNSDLLGVE--AYVIGHH 370
Query: 457 QQRGYEVHYDVAGRRLGFGPGNCN 480
Q+ + +D+ R+G C+
Sbjct: 371 HQQNVWMEFDLEKSRIGLAQVRCD 394
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 102/369 (27%), Positives = 161/369 (43%), Gaps = 40/369 (10%)
Query: 136 VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
+ IG P Q ++LDTGS ++W QC FDPS S TFS +PC CK
Sbjct: 101 LPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVCKPR 160
Query: 196 LEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCT 254
+ F P D+ ++ C Y Y DG+ G ++ T P +LGC
Sbjct: 161 IPDFTLPTSCDQ--NRLCHYSYFYADGTYAEGNLVREKFTFSRS-----LFTPPLILGCA 213
Query: 255 DNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYI---TFGKPDTVNKK 311
+T + GI+G++RG +S S++ I+ F YC+ + GY +F N
Sbjct: 214 TESTDPR----GILGMNRGRLSFASQSKITKFSYCVPTRVTRPGYTPTGSFYLGHNPNSN 269
Query: 312 FVKYTPIVTTPEQSEF-------YHITLTGISVGGERLPLKASYFTKLS-----TEIDSG 359
+Y ++T Y + L GI +GG +L + + F + T +DSG
Sbjct: 270 TFRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLDSG 329
Query: 360 TIITRFPAPVYSALRS----AFRKRMKKYKMGKGIEDLFDTCYDLSAYKT-VVVPKITIH 414
+ T Y +R+ A RMKK + G+ D+ C+D +A + ++ +
Sbjct: 330 SEFTYLVNEAYDKVRAEVVRAVGPRMKKGYVYGGVADM---CFDGNAIEIGRLIGDMVFE 386
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLPSD---PNSILLGNVQQRGYEVHYDVAGRR 471
F GV + + L C+G A SD S ++GN Q+ V +D+ RR
Sbjct: 387 FEKGVQIVVPKERVLATVEGGVHCIGIA--NSDKLGAASNIIGNFHQQNLWVEFDLVNRR 444
Query: 472 LGFGPGNCN 480
+GFG +C+
Sbjct: 445 MGFGTADCS 453
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 104/369 (28%), Positives = 164/369 (44%), Gaps = 36/369 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP--FFDPSKSKTFSKIPCN 188
EY + V +G P + + DTGS + W C D F PS+S T+S + C
Sbjct: 99 EYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRSTTYSLLSCQ 158
Query: 189 STTCKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF-AR 246
S C+ L Q C + EC Y AY DGS G +T+ + G G R
Sbjct: 159 SAACQAL-------SQASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVR 211
Query: 247 YPFL-LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY-----FFYCLHSPYG---ST 297
P + GC+ + G + G++GL G +S++S+ + F YCL PY S+
Sbjct: 212 VPRVSFGCSTGSAGSFR-SDGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAANSS 270
Query: 298 GYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEID 357
++FG V+ TP+V + E +Y + L ++V G+ + S +D
Sbjct: 271 STLSFGARAVVSDPGAASTPLVPS-EVDSYYTVALESVAVAGQDVASANSSRII----VD 325
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL---SAYKTVVVPKITIH 414
SGT +T + L + +R++ + + E L CYD+ S + +P +T+
Sbjct: 326 SGTTLTFLDPALLRPLVAELERRIRLPR-AQPPEQLLQLCYDVQGKSQAEDFGIPDVTLR 384
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLP---SDPNSILLGNVQQRGYEVHYDVAGRR 471
F GG + L T + +CL L+P S P SI LGN+ Q+ + V YD+ R
Sbjct: 385 FGGGASVTLRPENTFSLLEEGTLCL--VLVPVSESQPVSI-LGNIAQQNFHVGYDLDART 441
Query: 472 LGFGPGNCN 480
+ F +C
Sbjct: 442 VTFAAVDCT 450
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 107/362 (29%), Positives = 161/362 (44%), Gaps = 28/362 (7%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
++ + + IG P ++ L+DTGS + W QC PC+ C +Q P FDP KS T++ I C+S
Sbjct: 67 QHLMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPLKSSTYNNISCDSP 126
Query: 191 TCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C L CS K C Y Y D S G A D T G + F
Sbjct: 127 LCHKL-------DTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKP-VSLSRF 178
Query: 250 LLGCTDNNTGDQNGAS-GIMGLDRGPVSIISKTNISY----FFYCLHSPYGS----TGYI 300
L GC NNTG N G++GL GP S+IS+ + F CL P+ + + +
Sbjct: 179 LFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCL-VPFLTDIKISSRM 237
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGT 360
+FGK V V TP+V E+ Y +TL GISV P+ ++ K + +DSGT
Sbjct: 238 SFGKGSQVLGNGVVTTPLVPR-EKDTSYFVTLLGISVEDTYFPMNST-IGKANMLVDSGT 295
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVD 420
P +Y + + R ++ + CY + P +T HF+G
Sbjct: 296 PPILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCY--RTQTNLKGPTLTFHFVGANV 353
Query: 421 LELDVRGTL--VVESVRQVCLG-FALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
L ++ + ++ CL + SDP + GN Q Y + +D+ + + F P
Sbjct: 354 LLTPIQTFIPPTPQTKGIFCLAIYNRTNSDPG--VYGNFAQSNYLIGFDLDRQVVSFKPT 411
Query: 478 NC 479
+C
Sbjct: 412 DC 413
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 117/448 (26%), Positives = 183/448 (40%), Gaps = 36/448 (8%)
Query: 44 TVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRR 103
+V + T + P +++++ R P S + + R RL+ R
Sbjct: 13 SVSSLFSTEANESPSGFTVDLIHRDSPLSPFYNPSLTPSQRIINAALRSISRLN-----R 67
Query: 104 LQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC 163
+ + N K P I+ EY + IG P DTGS + W QC PC
Sbjct: 68 VSNLLDQNNK------LPQSVLILHNGEYLMRFYIGTPPVERLATADTGSDLIWVQCSPC 121
Query: 164 IHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDG-S 222
C Q P F P KS TF C S C +LL P + S EC Y Y D S
Sbjct: 122 ASCFPQSTPLFQPLKSSTFMPTTCRSQPCTLLL----PEQKGCGKSGECIYTYKYGDQYS 177
Query: 223 GETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNN---TGDQNGASGIMGLDRGPVSIIS 279
G +T+ + G A GC N +GIMGL GP+S++S
Sbjct: 178 FSEGLLSTETLRFDSQGGVQTVAFPNSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVS 237
Query: 280 KT--NISY-FFYCLHSPYGSTGY--ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTG 334
+ I + F YCL P GST + FG + + V TP++ P +Y + L
Sbjct: 238 QIGDQIGHKFSYCLL-PLGSTSTSKLKFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEA 296
Query: 335 ISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF 394
++V + +P + T + IDSGT++T Y ++ ++ + + ++D+
Sbjct: 297 VTVAQKTVPTGS---TDGNVIIDSGTLLTYLGESFYYNFAASLQESLAV----ELVQDVL 349
Query: 395 DTCYDLSAYK-TVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSI-L 452
Y+ V P+I F G ++ E VCL A PS + I +
Sbjct: 350 SPLPFCFPYRDNFVFPEIAFQFTGARVSLKPANLFVMTEDRNTVCLMIA--PSSVSGISI 407
Query: 453 LGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G+ Q ++V YD+ G+++ F P +C+
Sbjct: 408 FGSFSQIDFQVEYDLEGKKVSFQPTDCS 435
>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
Length = 437
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 129/440 (29%), Positives = 203/440 (46%), Gaps = 48/440 (10%)
Query: 57 PGKVSLEVLGRYGPCSKLNQGKSRNTPS-LEEILRRDQQRLHLKNSRRLQKAIPDNFKKT 115
P L V+ YG CS N K+ + + + + +D R+ ++ QK
Sbjct: 30 PDDSDLNVIPMYGKCSPFNPPKADSWDNRVINMASKDPARMSYLSTLVAQK--------- 80
Query: 116 KAFTFPAKTG-IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFF 174
A + P +G Y + V IG P Q + ++LDT + + CI CS F
Sbjct: 81 TATSAPIASGQTFNIGNYVVRVKIGTPGQLLFMVLDTSTDEAFVPSSGCIGCSATT---F 137
Query: 175 DPSKSKTFSKIPCNSTTC-KILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRM 233
P+ S +F + C+ C ++ P G CS ++ +Y GS + D +
Sbjct: 138 YPNVSTSFVPLDCSVPQCGQVRGLSCPATGSGACS-----FNQSYA-GSTFSATLVQDSL 191
Query: 234 TIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL 290
+ Y F G + +G A G++GL RGP+S++S++ Y F YCL
Sbjct: 192 RL----ATDVIPSYSF--GSINAISGSSVPAQGLLGLGRGPLSLLSQSGAIYSGVFSYCL 245
Query: 291 HS--PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASY 348
S Y +G + G K ++ TP++ P + Y++ LT ISVG +PL +
Sbjct: 246 PSFKSYYFSGSLKLGP--VGQPKSIRTTPLLHNPHRPSLYYVNLTAISVGRVYVPLPSEL 303
Query: 349 F-----TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAY 403
T T IDSGT+ITRF P+Y+A+R FRK++ G FDTC+ + Y
Sbjct: 304 LAFNPSTGAGTIIDSGTVITRFVEPIYNAVRDEFRKQVTGPFSSLGA---FDTCF-VKNY 359
Query: 404 KTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRG 460
+T + P IT+HF +DL+L + +L+ S + CL A PS+ NS+L + N QQ+
Sbjct: 360 ET-LAPAITLHFT-DLDLKLPLENSLIHSSSGSLACLAMAAAPSNVNSVLNVIANFQQQN 417
Query: 461 YEVHYDVAGRRLGFGPGNCN 480
V +D ++G CN
Sbjct: 418 LRVLFDTVNNKVGIARELCN 437
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 108/393 (27%), Positives = 174/393 (44%), Gaps = 47/393 (11%)
Query: 115 TKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCK--PCIHCSQQRDP 172
+ +TF ++ I + + + IG P Q L+LDTGS ++W QC
Sbjct: 65 SSPYTF--RSNIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTT 122
Query: 173 FFDPSKSKTFSKIPCNSTTCKILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATD 231
FDPS S +FS +PC+ CK + F P D S++ C Y Y DG+ G +
Sbjct: 123 SFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCD--SNRLCHYSYFYADGTFAEGNLVKE 180
Query: 232 RMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLH 291
+ T P +LGC +T ++ GI+G++ G +S IS+ IS F YC+
Sbjct: 181 KFTFSNSQ-----TTPPLILGCAKESTDEK----GILGMNLGRLSFISQAKISKFSYCIP 231
Query: 292 SP-----YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEF-------YHITLTGISVGG 339
+ STG G D N + KY ++T P+ Y + L GI +G
Sbjct: 232 TRSNRPGLASTGSFYLG--DNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQ 289
Query: 340 ERLPLKASYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRK----RMKK-YKMGKG 389
+RL + S F + T +DSG+ T Y ++ + R+KK Y G
Sbjct: 290 KRLNIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGST 349
Query: 390 IEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGF---ALLPS 446
+ FD + + + ++ + F GV++ ++ + LV C+G ++L +
Sbjct: 350 ADMCFDGNHSMEIGR--LIGDLVFEFGRGVEILVEKQSLLVNVGGGIHCVGIGRSSMLGA 407
Query: 447 DPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
N ++GNV Q+ V +DV RR+GF C
Sbjct: 408 ASN--IIGNVHQQNLWVEFDVTNRRVGFSKAEC 438
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 109/372 (29%), Positives = 163/372 (43%), Gaps = 47/372 (12%)
Query: 149 LDTGSGITWTQCK---PCIHCSQQR--DPFFDPSKSKTFSKIPCNSTTCKIL------LE 197
+DTGS + W C CI+C + + F P S + + C + CK L L
Sbjct: 1 MDTGSDLVWVPCTRNYSCINCPEDSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNTELL 60
Query: 198 WFPPNGQDKCSSKECP-YDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDN 256
G K S+ CP Y I Y GS G T+ + + NG G A F +GC+
Sbjct: 61 CQSCAGSLKNCSETCPPYGIQYGRGS-TAGLLLTETLNLPLENGEGARAITHFAVGCSIV 119
Query: 257 NTGDQNGASGIMGLDRGPVSIISK----TNISYFFYCLHS----PYGSTGYITFGKPDTV 308
++ SGI G RG +S+ S+ F YCL S + G
Sbjct: 120 SS---QQPSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSLMVLGDKALP 176
Query: 309 NKKFVKYTPIVT---TPEQSEF---YHITLTGISVGGERLPLKASYFTKLSTE------I 356
N + YTP +T P S++ Y+I L G+S+GG+RL S + T+ I
Sbjct: 177 NNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDTKGNGGTII 236
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--FDTCYDLSAYKTVVVPKITIH 414
DSGT T F ++ + + F ++ + G+ +ED CYD++ + +V+P+ H
Sbjct: 237 DSGTTFTVFSDEIFKHIAAGFASQIGYRRAGE-VEDKTGMGLCYDVTGLENIVLPEFAFH 295
Query: 415 FLGGVDLELDVRGTL-VVESVRQVCL------GFALLPSDPNSILLGNVQQRGYEVHYDV 467
F GG D+ L V S +CL G + S P +++LGN QQ+ + + YD
Sbjct: 296 FKGGSDMVLPVANYFSYFSSFDSICLTMISSRGLLEVDSGP-AVILGNDQQQDFYLLYDR 354
Query: 468 AGRRLGFGPGNC 479
RLGF C
Sbjct: 355 EKNRLGFTQQTC 366
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 114/422 (27%), Positives = 192/422 (45%), Gaps = 52/422 (12%)
Query: 89 LRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTF-PAKTGIVAADEYYIVVAIGKPKQYVSL 147
L + ++R +++ R LQ + TF P G+ YY + +G P + +
Sbjct: 13 LSKLKERDRVRHGRMLQSSGVGVVDFPVQGTFDPFLVGL-----YYTRLQLGTPPRDFYV 67
Query: 148 LLDTGSGITWTQCKPCIHC---SQQRDP--FFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
+DTGS + W C C C S P FFDP S T S I C+ C + L+ +
Sbjct: 68 QIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASLISCSDQRCSLGLQ----S 123
Query: 203 GQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF--ARYPFLLGCTDNNT 258
CS++ C Y+ Y DGSG +G++ +D + V G + P + GC+ T
Sbjct: 124 SDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMNNSSAPIVFGCSALQT 183
Query: 259 GD----QNGASGIMGLDRGPVSIISK---TNIS--YFFYCLHSPYGSTGYITFGKPDTVN 309
GD GI G + +S++S+ IS F +CL G + G+ +
Sbjct: 184 GDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDDSGGGILVLGE---IV 240
Query: 310 KKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE---IDSGTIITRFP 366
+ + YTP+V P Q Y++ + ISV G+ L + S F S++ IDSGT +
Sbjct: 241 EPNIVYTPLV--PSQPH-YNLNMQSISVNGQTLAIDPSVFGTSSSQGTIIDSGTTLAYLA 297
Query: 367 APVY----SALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
Y SA+ S ++ Y + KG + CY +S+ + P+++++F GG +
Sbjct: 298 EAAYDPFISAITSIVSPSVRPY-LSKG-----NHCYLISSSINDIFPQVSLNFAGGASMI 351
Query: 423 LDVRGTLVVES----VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGN 478
L + L+ +S C+GF + +I LG++ + YD+A +R+G+ +
Sbjct: 352 LIPQDYLIQQSSIGGAALWCIGFQKIQGQGITI-LGDLVLKDKIFVYDIANQRIGWANYD 410
Query: 479 CN 480
C+
Sbjct: 411 CS 412
>gi|413953789|gb|AFW86438.1| hypothetical protein ZEAMMB73_078928 [Zea mays]
Length = 155
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 70/161 (43%), Positives = 95/161 (59%), Gaps = 9/161 (5%)
Query: 320 TTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRK 379
T P Q F +TL GI+VGG++L L+ S F+ +D GT+IT + Y ALRSAFRK
Sbjct: 3 TVPGQPTFSTVTLAGINVGGKKLDLRPSAFSG-GMIVDCGTVITGLQSTAYRALRSAFRK 61
Query: 380 RMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDV-RGTLVVESVRQVC 438
M+ Y++ + DTCY+L+ YK VVVPKI + F GG + LDV G+LV C
Sbjct: 62 AMEAYRLLPNGD--LDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGSLV-----NGC 114
Query: 439 LGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L FA D ++ +LGNV QR +EV +D + + GF C
Sbjct: 115 LAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 155
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 145/361 (40%), Gaps = 28/361 (7%)
Query: 129 ADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCN 188
A Y IG P Q S ++D + WTQCK C C +Q P FDP+ S T+ PC
Sbjct: 48 AMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCG 107
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
+ C+ + P+ CS C Y A + G TD + A+
Sbjct: 108 TPLCESI-----PSDSRNCSGNVCAYQ-ASTNAGDTGGKVGTDTFAVGT-------AKAS 154
Query: 249 FLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISYFFYCL--HSPYGSTGYITFGKP 305
GC + D G SGI+GL R P S++++T ++ F YCL H ++
Sbjct: 155 LAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGRNSALFLGSSA 214
Query: 306 DTVNKKFVKYTPIVTTP----EQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTI 361
TP V + S +Y + L G+ G +PL S T L +D+ +
Sbjct: 215 KLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVL---LDTFSP 271
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
I+ Y A++ A + M +E FD C+ S + P + F GG +
Sbjct: 272 ISFLVDGAYQAVKKAVTAAVGAPPMATPVEP-FDLCFPKSG-ASGAAPDLVFTFRGGAAM 329
Query: 422 ELDVRGTLVVESVRQVCLGF---ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGN 478
+ L+ VCL A L S LLG++QQ +D+ L F P +
Sbjct: 330 TVPATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPAD 389
Query: 479 C 479
C
Sbjct: 390 C 390
>gi|357118734|ref|XP_003561105.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 404
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 81/236 (34%), Positives = 123/236 (52%), Gaps = 15/236 (6%)
Query: 251 LGCTDNNTGDQNG-ASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKPD 306
GC+ + G +G SG M L G S+ S+T +Y F YC+ P S G+++ G
Sbjct: 177 FGCSHSVRGRFSGQTSGTMSLGGGRQSLRSQTASAYGDAFSYCVPQPSAS-GFLSLGGAI 235
Query: 307 TVNKKFVKY--TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITR 364
+ + TP+V T + FY + L GI V G RL + + F+ T +DS ++T+
Sbjct: 236 GSSGSGSGFASTPLVATANPT-FYVVRLQGIDVAGRRLNVPPAVFSA-GTLMDSSAVVTQ 293
Query: 365 FPAPVYSALRSAFRKRMKKYK-MGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLEL 423
P Y ALR AFR M++Y+ + G + + DTCYD V VP +++ F GG + L
Sbjct: 294 LPPTAYRALRRAFRNAMRRYRRVPAGGKQILDTCYDFEGLGNVTVPAVSLVFSGGAVVRL 353
Query: 424 DVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ ++ + CL F P+D + +GNVQQ+ +EV YDV R +GF G C
Sbjct: 354 EPMAVMM-----EGCLAFVPTPADSDLGFIGNVQQQTHEVLYDVGARNVGFRRGAC 404
>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
Length = 469
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 116/396 (29%), Positives = 174/396 (43%), Gaps = 38/396 (9%)
Query: 104 LQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKP-KQYVSLLLDTGSGITWTQCKP 162
L+K + + K + + AA I + +G P Q VS L+D S W QC P
Sbjct: 60 LKKQLRNRGNKQQQQQLGGEAASGAAPPLVINITVGTPVAQTVSGLVDITSYFVWAQCAP 119
Query: 163 CIHCSQQRDP---FFDPSKSKTFSKIPCNS--------TTCKILLEWFPPNGQDKCSSKE 211
C + P F P+ S TFS +PC+S TC +C S
Sbjct: 120 CAAAAGCLPPPATAFRPNGSATFSPLPCSSDMCLPVLRETCGRAGAAANATAGARCDS-- 177
Query: 212 CPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGL 270
Y + Y + T G+ ATD T G A + GC+D + GD GASG++G+
Sbjct: 178 --YSLTYGGSAANTSGYLATDTFTF------GATAVPGVVFGCSDASYGDFAGASGVIGI 229
Query: 271 DRGPVSIISKTNISYFFYCLHSPYG-----STGYITFGKPDTVNKKFVKYTPIVTTPEQS 325
RG +S+IS+ F Y L +P + I FG K + TP++++
Sbjct: 230 GRGNLSLISQLQFGKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYP 289
Query: 326 EFYHITLTGISVGGERL-PLKASYFTKLSTE-----IDSGTIITRFPAPVYSALRSAFRK 379
+FY++ LTG+ V G RL + A F + + S T +T Y +R+A
Sbjct: 290 DFYYVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVAS 349
Query: 380 RMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-C 438
R+ + D CY+ S+ V VPK+T+ F GG D++L +++ + C
Sbjct: 350 RIGLPAVNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLEC 409
Query: 439 LGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
L +LPS S+ LG + Q G + YDV RL F
Sbjct: 410 L--TMLPSQGGSV-LGTLLQTGTNMIYDVDAGRLTF 442
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 101/374 (27%), Positives = 171/374 (45%), Gaps = 38/374 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHC---SQQRDP--FFDPSKSKTFSKIP 186
YY V +G P + + +DTGS + W C C C S + P FFDP S T S +
Sbjct: 83 YYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASLVS 142
Query: 187 CNSTTCKILLEWFPPNGQDKC--SSKECPYDIAYVDGSGETGFWATDRMTIQEV--NGNG 242
C+ C + ++ + C S +C Y Y DGSG +G++ D + + V +
Sbjct: 143 CSDQICALGVQ----SSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVT 198
Query: 243 YFARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNI-----SYFFYCLHSP 293
+ + GC+ + TGD GI G + +S+IS+ + F +CL
Sbjct: 199 SNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGD 258
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
G + G+ + + V YTP+V P Q Y++ L ISV G+ LP+ + F S
Sbjct: 259 DSGGGILVLGE---IVEPNVVYTPLV--PSQPH-YNLNLQSISVNGQVLPISPAVFATSS 312
Query: 354 TE---IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPK 410
++ IDSGT + Y+A A + + ++ + CY S+ + + P+
Sbjct: 313 SQGTIIDSGTTLAYLAEEAYNAFVVAVTNIVSQSTQSVVLKG--NRCYVTSSSVSDIFPQ 370
Query: 411 ITIHFLGGVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYD 466
++++F GG L L + L+ V C+GF +P +I LG++ + YD
Sbjct: 371 VSLNFAGGASLVLGAQDYLIQQNSVGGTTVWCIGFQKIPGQGITI-LGDLVLKDKIFIYD 429
Query: 467 VAGRRLGFGPGNCN 480
+A +R+G+ +C+
Sbjct: 430 LANQRIGWTNYDCS 443
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 103/373 (27%), Positives = 165/373 (44%), Gaps = 41/373 (10%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCK--PCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
+ + IG P Q L+LDTGS ++W QC FDPS S +FS +PC+
Sbjct: 83 LSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPL 142
Query: 192 CKILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
CK + F P D S++ C Y Y DG+ G ++ T P +
Sbjct: 143 CKPRIPDFTLPTSCD--SNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTP-----PLI 195
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSP-----YGSTGYITFGKP 305
LGC +T GI+G++ G +S IS+ IS F YC+ + STG G
Sbjct: 196 LGCAKEST----DVKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLG-- 249
Query: 306 DTVNKKFVKYTPIVTTPEQSEF-------YHITLTGISVGGERLPLKASYFTKLS----- 353
+ N + KY ++T P+ Y + L GI +G +RL + +S F +
Sbjct: 250 ENPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGGSGQ 309
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRM-KKYKMGKGIEDLFDTCYDLSAYKTV--VVPK 410
T +DSG+ T Y ++ + + + K G D C+D + + ++
Sbjct: 310 TMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRLIGD 369
Query: 411 ITIHFLGGVDLELDVRGTLVVESVRQVCLGF---ALLPSDPNSILLGNVQQRGYEVHYDV 467
+ F GV++ ++ + LV C+G ++L + N ++GNV Q+ V +DV
Sbjct: 370 LVFEFGRGVEILVEKQRLLVNVGGGIHCVGIGRSSMLGAASN--IIGNVHQQNLWVEFDV 427
Query: 468 AGRRLGFGPGNCN 480
A RR+GF C+
Sbjct: 428 ANRRVGFSKAECS 440
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 145/361 (40%), Gaps = 28/361 (7%)
Query: 129 ADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCN 188
A Y IG P Q S ++D + WTQCK C C +Q P FDP+ S T+ PC
Sbjct: 48 AMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCG 107
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
+ C+ + P+ CS C Y A + G TD + A+
Sbjct: 108 TPLCESI-----PSDSRNCSGNVCAYQ-ASTNAGDTGGKVGTDTFAVGT-------AKAS 154
Query: 249 FLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISYFFYCL--HSPYGSTGYITFGKP 305
GC + D G SGI+GL R P S++++T ++ F YCL H ++
Sbjct: 155 LAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGKNSALFLGSSA 214
Query: 306 DTVNKKFVKYTPIVTTP----EQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTI 361
TP V + S +Y + L G+ G +PL S T L +D+ +
Sbjct: 215 KLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVL---LDTFSP 271
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
I+ Y A++ A + M +E FD C+ S + P + F GG +
Sbjct: 272 ISFLVDGAYQAVKKAVTVAVGAPPMATPVEP-FDLCFPKSG-ASGAAPDLVFTFRGGAAM 329
Query: 422 ELDVRGTLVVESVRQVCLGF---ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGN 478
+ L+ VCL A L S LLG++QQ +D+ L F P +
Sbjct: 330 TVAASNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPAD 389
Query: 479 C 479
C
Sbjct: 390 C 390
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 116/434 (26%), Positives = 191/434 (44%), Gaps = 51/434 (11%)
Query: 80 RNTPSLEEI-LRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTF-PAKTGIVAAD---EYYI 134
R P+ ++ L + ++R +++SR LQ + TF P G YY
Sbjct: 33 RGVPASHKLKLSQLKERDRVRHSRMLQSSGGGVVDFPVQGTFDPFLVGFYFGSFCRLYYT 92
Query: 135 VVAIGKPKQYVSLLLDTGSGITWTQCKPCIHC---SQQRDP--FFDPSKSKTFSKIPCNS 189
+ +G P + + +DTGS + W C C C S P FFDP S T S I C+
Sbjct: 93 RLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLISCSD 152
Query: 190 TTCKILLEWFPPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF--A 245
C + L+ + C+++ +C Y Y DGSG +G++ +D + + G +
Sbjct: 153 QRCSLGLQ----SSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNS 208
Query: 246 RYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPYGS 296
P + GC+ TGD GI G + +S+IS+ F +CL
Sbjct: 209 SAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDDSG 268
Query: 297 TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE- 355
G + G+ + + + YTP+V P Q Y++ L I V G+ L + S F S +
Sbjct: 269 GGILVLGE---IVEPNIVYTPLV--PSQPH-YNLNLQSIYVNGQTLAIDPSVFATSSNQG 322
Query: 356 --IDSGTIITRFPAPVY----SALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVP 409
IDSGT + Y SA+ S + Y + KG + CY S+ V P
Sbjct: 323 TIIDSGTTLAYLTEAAYDPFISAITSTVSPSVSPY-LSKG-----NQCYLTSSSINDVFP 376
Query: 410 KITIHFLGGVDLELDVRGTLVVES----VRQVCLGFALLPSDPNSILLGNVQQRGYEVHY 465
+++++F GG + L + L+ +S C+GF + +I LG++ + Y
Sbjct: 377 QVSLNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQGQEITI-LGDLVLKDKIFVY 435
Query: 466 DVAGRRLGFGPGNC 479
D+AG+R+G+ +C
Sbjct: 436 DIAGQRIGWANYDC 449
>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
Length = 469
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 116/396 (29%), Positives = 174/396 (43%), Gaps = 38/396 (9%)
Query: 104 LQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKP-KQYVSLLLDTGSGITWTQCKP 162
L+K + + K + + AA I + +G P Q VS L+D S W QC P
Sbjct: 60 LKKQLRNRGNKQQQQQLGGEAASGAAPPLVINITVGTPVAQTVSGLVDITSYFVWAQCAP 119
Query: 163 CIHCSQQRDP---FFDPSKSKTFSKIPCNS--------TTCKILLEWFPPNGQDKCSSKE 211
C + P F P+ S TFS +PC+S TC +C S
Sbjct: 120 CAAAAGCLPPPATAFRPNGSATFSPLPCSSDMCLPVLRETCGRAGAAANATAGARCDS-- 177
Query: 212 CPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGL 270
Y + Y + T G+ ATD T G A + GC+D + GD GASG++G+
Sbjct: 178 --YSLTYGGSAANTSGYLATDTFTF------GATAVPGVVFGCSDASYGDFAGASGVIGI 229
Query: 271 DRGPVSIISKTNISYFFYCLHSPYG-----STGYITFGKPDTVNKKFVKYTPIVTTPEQS 325
RG +S+IS+ F Y L +P + I FG K + TP++++
Sbjct: 230 GRGNLSLISQLQFGKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGQSTPLLSSTLYP 289
Query: 326 EFYHITLTGISVGGERL-PLKASYFTKLSTE-----IDSGTIITRFPAPVYSALRSAFRK 379
+FY++ LTG+ V G RL + A F + + S T +T Y +R+A
Sbjct: 290 DFYYVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVAS 349
Query: 380 RMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-C 438
R+ + D CY+ S+ V VPK+T+ F GG D++L +++ + C
Sbjct: 350 RIGLPAVNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLEC 409
Query: 439 LGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
L +LPS S+ LG + Q G + YDV RL F
Sbjct: 410 L--TMLPSQGGSV-LGTLLQTGTNMIYDVDAGRLTF 442
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 88/307 (28%), Positives = 141/307 (45%), Gaps = 30/307 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQR--DPFFDPSKSKTFSKIPCNS 189
+++ ++G+P ++DTGS + W QC PC HCS P F+P+ S TF + C+
Sbjct: 68 FFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVECSCDD 127
Query: 190 TTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C+ + PNG CSS +C Y+ Y+ G+G G A +R+T NGN + P
Sbjct: 128 RFCR-----YAPNGH--CSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQ-PI 179
Query: 250 LLGCTDNNTGDQ--NGASGIMGLDRGPVSIISKTNISYFFYCL----HSPYGSTGYITFG 303
GC N G+Q + +GI+GL P S+ + S F YC+ + YG +
Sbjct: 180 AFGCGHEN-GEQLESEFTGILGLGAKPTSLAVQLG-SKFSYCIGDLANKNYGYNQLVLGE 237
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE----IDSG 359
D + TPI E Y++ L GISVG ++L ++ F + + +D+G
Sbjct: 238 DADILGDP----TPIEFETENG-IYYMNLEGISVGDKQLNIEPVVFKRRGSRTGVILDTG 292
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV-VPKITIHFLGG 418
T+ T Y L + + + D CY + ++ P +T HF GG
Sbjct: 293 TLYTWLADIAYRELYNEIKSILDPKLERFWFRDFL--CYHGRVNEELIGFPVVTFHFAGG 350
Query: 419 VDLELDV 425
+L ++
Sbjct: 351 AELAMEA 357
>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 451
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 121/428 (28%), Positives = 196/428 (45%), Gaps = 40/428 (9%)
Query: 68 YGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV 127
YG CS + + ++ +D +R+ +S + + ++ P +G
Sbjct: 49 YGNCSPFKNYSTSWENIIIDMASKDPERVVYLSS------LDASLRRKPISAAPIASGQA 102
Query: 128 AADEYYIV-VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFS-KI 185
Y+V V +G P Q ++LDT + W C C CS ++ P S T+ +
Sbjct: 103 FGIGSYVVRVKLGSPNQLFFMVLDTSTDEAWVPCTGCTGCSSSST-YYSPQASTTYGGAV 161
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C + C P SK C ++ +Y GS + D + + G
Sbjct: 162 ACYAPRCAQARGALP---CPYTGSKACTFNQSYA-GSTFSATLVQDSLRL----GIDTLP 213
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGS--TGYI 300
Y F GC ++ +G A G++GL RGP+S+ S+++ Y F YCL S S +G +
Sbjct: 214 SYAF--GCVNSASGWTLPAQGLLGLGRGPLSLPSQSSKLYSGIFSYCLPSFQSSYFSGSL 271
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-----KLSTE 355
G T + ++ TP++ P + Y++ LTG++VG ++PL Y T
Sbjct: 272 KLGP--TGQPRRIRTTPLLQNPRRPSLYYVNLTGVTVGRVKVPLPIEYLAFDPNKGSGTI 329
Query: 356 IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHF 415
+DSGT+ITRF PVYSA+R FR ++K +G FDTC+ + Y+ + P I + F
Sbjct: 330 LDSGTVITRFVGPVYSAIRDEFRNQVKGPFFSRG---GFDTCF-VKTYEN-LTPLIKLRF 384
Query: 416 LGGVDLELDVRGTLVVESV-RQVCLGFALLPSDPNSIL--LGNVQQRGYEVHYDVAGRRL 472
G+D+ L TL+ + CL A P++ NS+L + N QQ+ V +D R+
Sbjct: 385 T-GLDVTLPYENTLIHTAYGGMACLAMAAAPNNVNSVLNVIANYQQQNLRVLFDTVNNRV 443
Query: 473 GFGPGNCN 480
G CN
Sbjct: 444 GIARELCN 451
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 146/361 (40%), Gaps = 28/361 (7%)
Query: 129 ADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCN 188
A Y IG P Q S ++D + WTQCK C C +Q P FDP+ S T+ PC
Sbjct: 48 AMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPCG 107
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
+ C+ + P+ CS C Y+ A + G TD + A+
Sbjct: 108 TPLCESI-----PSDVRNCSGNVCAYE-ASTNAGDTGGKVGTDTFAVGT-------AKAS 154
Query: 249 FLLGCTDNNTGD-QNGASGIMGLDRGPVSIISKTNISYFFYCL--HSPYGSTGYITFGKP 305
GC + D G SGI+GL R P S++++T ++ F YCL H ++
Sbjct: 155 LAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGKNSALFLGSSA 214
Query: 306 DTVNKKFVKYTPIVTTP----EQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTI 361
TP V + S +Y + L G+ G +PL S T L +D+ +
Sbjct: 215 KLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVL---LDTFSP 271
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
I+ Y A++ A + M +E FD C+ S + P + F GG +
Sbjct: 272 ISFLVDGAYQAVKKAVTVAVGAPPMATPVEP-FDLCFPKSG-ASGAAPDLVFTFRGGAAM 329
Query: 422 ELDVRGTLVVESVRQVCLGF---ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGN 478
+ L+ VCL A L S LLG++QQ +D+ L F P +
Sbjct: 330 TVPATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPAD 389
Query: 479 C 479
C
Sbjct: 390 C 390
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 102/374 (27%), Positives = 166/374 (44%), Gaps = 46/374 (12%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+ +A+G P Q +S++LDTGS ++W CK S F+P S T+S +PC+S C+
Sbjct: 67 VTLAVGDPPQNISMVLDTGSELSWLHCKK----SPNLGSVFNPVSSSTYSPVPCSSPICR 122
Query: 194 ILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGC 253
P + C I+Y D + G A + I V G L GC
Sbjct: 123 TRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGT------LFGC 176
Query: 254 TD----NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVN 309
D +N+ + ++G+MG++RG +S +++ S F YC+ S S+G++ G
Sbjct: 177 MDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI-SGSDSSGFLLLGDASYSW 235
Query: 310 KKFVKYTPIV--TTPE---QSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSG 359
++YTP+V +TP Y + L GI VG + L L S F T +DSG
Sbjct: 236 LGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSG 295
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL-----FDTCYDLSAYKT---VVVPKI 411
T T PVY+AL++ F + K D D CY + + +P +
Sbjct: 296 TQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMV 355
Query: 412 TIHFLGGVDLELDVRGTLVVESV-------RQVCLGFALLPSD---PNSILLGNVQQRGY 461
++ F G E+ V G ++ V ++ F SD + ++G+ Q+
Sbjct: 356 SLMFRGA---EMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNV 412
Query: 462 EVHYDVAGRRLGFG 475
+ +D+A R+GF
Sbjct: 413 WMEFDLAKSRVGFA 426
>gi|388515789|gb|AFK45956.1| unknown [Medicago truncatula]
Length = 225
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 85/228 (37%), Positives = 121/228 (53%), Gaps = 17/228 (7%)
Query: 263 GASGIMGLDRGPVSIISKTNISY---FFYCLHS-PYGSTGYITFGKPDT-VNKKFVKYTP 317
GA+G++GL GP+S + + F YCL S S+G + FG+ V +V
Sbjct: 4 GAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGTESSGSLEFGRESVPVGASWVS--- 60
Query: 318 IVTTPEQSEFYHITLTGISVGGERLPLKASYF-----TKLSTEIDSGTIITRFPAPVYSA 372
++ P FY+I L+G+ VGG R+P+ F + +D+GT +TR PA Y+A
Sbjct: 61 LIHNPRAPSFYYIGLSGLGVGGLRVPISEDIFRLNELGEGGVVMDTGTAVTRLPAAAYNA 120
Query: 373 LRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-V 431
R AF + G+ +FDTCYDL+ + TV VP I+ +FLGG L L R L+ V
Sbjct: 121 FRDAFVAQTTNLPKTSGVS-IFDTCYDLNGFVTVRVPTISFYFLGGPILTLPARNFLIPV 179
Query: 432 ESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+SV C FA PS ++GN+QQ G E+ D A +GFGP C
Sbjct: 180 DSVGTFCFAFA--PSSSGLSIIGNIQQEGIEISVDGANGYIGFGPNIC 225
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 158/356 (44%), Gaps = 27/356 (7%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + ++G P Q +S L DTGS + W +C C C+ + + P+KS +FSK+PC+S
Sbjct: 81 YDMTFSMGTPPQTLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPCSSAL 140
Query: 192 CKIL-LEWFPPNGQDKCSSKECPYDIAYVDGSG----ETGFWATDRMTI--QEVNGNGYF 244
C+ L + G + C Y +Y S G+ ++ T+ V G G+
Sbjct: 141 CRTLESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLGSDAVQGIGF- 199
Query: 245 ARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGK 304
GCT + G SG++GL RG +S++ + + F YCL S ++ + FG
Sbjct: 200 -------GCTTMSEGGYGSGSGLVGLGRGKLSLVRQLKVGAFSYCLTSDPSTSSPLLFGA 252
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITR 364
+ V+ TP+V + S FY + L IS+G + P + DSGT +T
Sbjct: 253 -GALTGPGVQSTPLVNL-KTSTFYTVNLDSISIGAAKTPGTGRH----GIIFDSGTTLTF 306
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELD 424
P Y+ + + G D ++ C+ S V P + +HF GG D+ L
Sbjct: 307 LAEPAYTLAEAGLLSQTTNLTRVPGT-DGYEVCFQTSG--GAVFPSMVLHFDGG-DMALK 362
Query: 425 VRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ C PS+ + ++GN+ Q Y + YD+ L F P NC+
Sbjct: 363 TENYFGAVNDSVSCWLVQKSPSEMS--IVGNIMQMDYHIRYDLDKSVLSFQPTNCD 416
>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 92/363 (25%), Positives = 148/363 (40%), Gaps = 42/363 (11%)
Query: 143 QYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
Q L++DTGS T+ CK C C + ++D +S F ++ C + L E
Sbjct: 49 QTYDLIVDTGSARTYVPCKGCARCGEHAHGYYDYDRSMEFERLDCGEASDATLCE---ET 105
Query: 203 GQDKCSSK-ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD- 260
+ C S C Y ++Y +GS G+ DR+ + E + A GC + T
Sbjct: 106 MKGTCQSDGRCSYVVSYAEGSSSRGYVVRDRVRLGEGTLSAMLA-----FGCEEAETNAI 160
Query: 261 -QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPYGSTGYITFGKPD-TVNKKFV 313
+ A G+ G RG ++ ++ + F +C+ + G +T G+ D + +
Sbjct: 161 YEQKADGLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGANGGVLTLGRFDFGADAPAL 220
Query: 314 KYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSAL 373
TP+V P F+++ + +G + SY T L DSGT T P V+ +
Sbjct: 221 ARTPLVADPANPAFHNVRTSSWKLGDSLIEHLNSYTTTL----DSGTTFTFVPRSVWVS- 275
Query: 374 RSAFRKRMKKYKMGKGIEDLF-------DTCYDLSAYKTVVV----------PKITIHFL 416
F+ R+ G+E + D CY +SA + P +TI +
Sbjct: 276 ---FKTRLDTQATQAGLEIVAGPDPQYDDVCYGVSAAAMNMTLSQSTVSEWFPPLTIAYE 332
Query: 417 GGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
GGV L L L + + N ILLG + R + +DVA R+G P
Sbjct: 333 GGVSLTLGPENYLFAHETNSAAFCVGIFANPNNQILLGQITMRDTLMEFDVANSRVGMAP 392
Query: 477 GNC 479
NC
Sbjct: 393 ANC 395
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 104/376 (27%), Positives = 168/376 (44%), Gaps = 50/376 (13%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+ +A+G P Q +S++LDTGS ++W CK S F+P S T+S +PC+S C+
Sbjct: 63 VTLAVGSPPQNISMVLDTGSELSWLHCKK----SPNLGSVFNPVSSSTYSPVPCSSPICR 118
Query: 194 ILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGC 253
P + C I+Y D + G A D I V G L GC
Sbjct: 119 TRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIGSVTRPGT------LFGC 172
Query: 254 TD----NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVN 309
D +++ + ++G+MG++RG +S +++ S F YC+ S S+G + G
Sbjct: 173 MDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI-SGSDSSGILLLGDASYSW 231
Query: 310 KKFVKYTPIV--TTPE---QSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSG 359
++YTP+V TTP Y + L GI VG + L L S F T +DSG
Sbjct: 232 LGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSG 291
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIED-------LFDTCYDLSAYKT---VVVP 409
T T PVY+AL++ F + K + ++D D CY + + +P
Sbjct: 292 TQFTFLMGPVYTALKNEFIAQTKSVL--RIVDDPNFVFQGTMDLCYRVGSSTRPNFTGLP 349
Query: 410 KITIHFLGGVDLELDVRGTLVVESV-------RQVCLGFALLPSD---PNSILLGNVQQR 459
I++ F G E+ V G ++ V ++ F SD + ++G+ Q+
Sbjct: 350 VISLMFRGA---EMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQ 406
Query: 460 GYEVHYDVAGRRLGFG 475
+ +D+A R+GF
Sbjct: 407 NVWMEFDLAKSRVGFA 422
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 118 bits (296), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 106/350 (30%), Positives = 161/350 (46%), Gaps = 51/350 (14%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+ + +G P Q V+++LDTGS ++W CK S F+P S ++S IPC+S C+
Sbjct: 1002 VSLTVGSPPQQVTMVLDTGSELSWLHCKK----SPNLTSVFNPLSSSSYSPIPCSSPICR 1057
Query: 194 ILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLG 252
P C K+ C ++Y D S G A+D I G A L G
Sbjct: 1058 TRTRDLP--NPVTCDPKKLCHAIVSYADASSLEGNLASDNFRI------GSSALPGTLFG 1109
Query: 253 CTD----NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTV 308
C D +N+ + +G+MG++RG +S +++ + F YC+ S S+G + FG
Sbjct: 1110 CMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCI-SGRDSSGVLLFGDLHLS 1168
Query: 309 NKKFVKYTPIV--TTPE---QSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDS 358
+ YTP+V +TP Y + L GI VG + LPL S F T +DS
Sbjct: 1169 WLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDS 1228
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKG-----IEDLFDTCYDLSA-YKTVVVPKIT 412
GT T PVY+ALR+ F ++ K G + D CY ++A K +P ++
Sbjct: 1229 GTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAAGGKLPTLPSVS 1288
Query: 413 IHFLGGVDLELDVRGTLVVESVRQV--------CLGFALLPSDPNSILLG 454
+ F G E+ V G +++ V ++ CL F NS LLG
Sbjct: 1289 LMFRGA---EMVVGGEVLLYRVPEMMKGNEWVYCLTFG------NSDLLG 1329
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 116/421 (27%), Positives = 171/421 (40%), Gaps = 55/421 (13%)
Query: 91 RDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYY------------IVVAI 138
+D+ + LKNS + K+ A AAD+ Y + +I
Sbjct: 57 KDRAKRMLKNSNARFDYVQAISKRNSAVVDYDGGDTSAADDAYEASLLSELCTFLVNFSI 116
Query: 139 GKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEW 198
G+P ++DTGS +TW QC+PCI+C QQ+ P ++PS S T+ T +
Sbjct: 117 GQPPVPQYAVMDTGSSLTWIQCEPCINCHQQKGPLYNPSSSSTYVSCSDFDRTDT---TF 173
Query: 199 FPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNT 258
+G D C Y Y D + G +A +++ E +G + + GC NNT
Sbjct: 174 TATHGSD------CNYSQTYADKTTTRGTYAREQLLF-ETPDDGITIMHDVIFGCGHNNT 226
Query: 259 ---GDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKY 315
G ASG+ GL SIISK F YC+ G+ G +G +K
Sbjct: 227 QLPGPTGYASGVFGLGDSGSSIISKLGFG-FSYCI----GNIGDPLYGFHRLTLGNKLKI 281
Query: 316 TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-------IDSGTIITRFPAP 368
T Y+ITL GIS+G ERL + F ++ IDSG ++ P
Sbjct: 282 EGYSTPLVPRGLYYITLVGISIGQERLDIDPIVFQRVDLNGISSRIVIDSGATLSYIPRQ 341
Query: 369 VYSALR----SAFRKRMKKYKMGKGIEDLFDTCY------DLSAYKTVVVPKITIHFLGG 418
Y+ +R S + +Y+ I CY DL + P T H G
Sbjct: 342 AYNVVRDKVSSILSGFLSRYRY---IARHLSLCYIGKLNQDLQGF-----PDATFHLADG 393
Query: 419 VDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGN 478
DL V G + +CL SD + L+G + Q+ Y V YD+ ++L F
Sbjct: 394 ADLVFQVEGLFFQYTDNVLCLALVPTESDEETCLIGLLAQQYYNVAYDLKQQKLYFQRIE 453
Query: 479 C 479
C
Sbjct: 454 C 454
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 100/371 (26%), Positives = 164/371 (44%), Gaps = 27/371 (7%)
Query: 120 FPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKS 179
P I EY + + IG P + DTGS + W QC PC +C Q P F+P KS
Sbjct: 80 LPESLLIPENGEYLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEPLKS 139
Query: 180 KTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVN 239
TF C+S C + PP+ + +C Y +Y D S G T+ ++
Sbjct: 140 STFKAATCDSQPCTSV----PPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGS-T 194
Query: 240 GNGYFARYP-FLLGCTDNN-----TGDQNGASGIMGLDRGPVSIISKTNISY-FFYCLHS 292
G+ +P + GC N T D+ +G + I Y F YCL
Sbjct: 195 GDAQTVSFPSSIFGCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGYKFSYCLL- 253
Query: 293 PYG--STGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT 350
P+ ST + FG V V TP++ P FY + L +++G + +P T
Sbjct: 254 PFSSNSTSKLKFGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVPTGR---T 310
Query: 351 KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPK 410
+ IDSGT++T Y+ ++ ++ + + + + F C+ Y+ + +P
Sbjct: 311 DGNIIIDSGTVLTYLEQTFYNNFVASLQEVL-SVESAQDLPFPFKFCF---PYRDMTIPV 366
Query: 411 ITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSI-LLGNVQQRGYEVHYDVA 468
I F G + L + L+ ++ +CL A++PS + I + GNV Q ++V YD+
Sbjct: 367 IAFQFTGA-SVALQPKNLLIKLQDRNMLCL--AVVPSSLSGISIFGNVAQFDFQVVYDLE 423
Query: 469 GRRLGFGPGNC 479
G+++ F P +C
Sbjct: 424 GKKVSFAPTDC 434
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 97/391 (24%), Positives = 160/391 (40%), Gaps = 43/391 (10%)
Query: 119 TFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPC-IHCS-QQRDPFFDP 176
T P + +Y + +G P + ++++DTGS IT+ C C +C +D FDP
Sbjct: 49 TLPLHGAVKDYGYFYATLHLGTPARQFAVIVDTGSTITYVPCASCGRNCGPHHKDAAFDP 108
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVDGSGETGFWATDRMTI 235
+ S + + I C+S C + PP G CS K EC Y Y + S G +D++ +
Sbjct: 109 ASSSSSAVIGCDSDKC---ICGRPPCG---CSEKRECTYQRTYAEQSSSAGLLVSDQLQL 162
Query: 236 QEVNGNGYFARYPFLLGCTDNNTGD--QNGASGIMGLDRGPVSIISKTNIS-----YFFY 288
++ + GC TG+ A GI+GL VS++++ S F
Sbjct: 163 RD-------GAVEVVFGCETKETGEIYNQEADGILGLGNSEVSLVNQLAGSGVIDDVFAL 215
Query: 289 CLHSPYGSTGYITFGKPDTVNKKF-VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKAS 347
C S G G + G D ++YT ++++ +Y + L + VGG++LP+K
Sbjct: 216 CFGSVEGD-GALMLGDVDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKPE 274
Query: 348 -YFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMG---------KGIEDLFDTC 397
Y T +DSGT T P+ + + A ++ + K D C
Sbjct: 275 RYEEGYGTVLDSGTTFTYLPSEAFQLFKEAVSAYALEHGLNSVKGPDPKEKSFAQFHDIC 334
Query: 398 YDLSAYK--------TVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPN 449
+ + + V P + F GV L L + + + + +
Sbjct: 335 FGGAPHAGHADQSKLEKVFPVFELQFADGVRLRTGPLNYLFMHTGEMGAYCLGVFDNGAS 394
Query: 450 SILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
LLG + R V YD RR+GFG +C
Sbjct: 395 GTLLGGISFRNILVQYDRRNRRVGFGAASCQ 425
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 118 bits (295), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 172/372 (46%), Gaps = 42/372 (11%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+ + IG P Q ++LDTGS ++W QC + FDPS S +FS +PCN CK
Sbjct: 79 VSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVLPCNHPLCK 138
Query: 194 ILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLG 252
+ F P D ++ C Y Y DG+ G +++T P +LG
Sbjct: 139 PRIPDFTLPTSCDL--NRLCHYSYFYADGTLAEGNLVREKITFSTSQSTP-----PLILG 191
Query: 253 CTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGK---PDTVN 309
C ++ + D+ GI+G++ G +S S+ I+ F YC+ + G+ G + N
Sbjct: 192 CAEDASDDK----GILGMNLGRLSFASQAKITKFSYCVPTRQVRPGFTPTGSFYLGENPN 247
Query: 310 KKFVKYTPIVT------TPEQSEFYH-ITLTGISVGGERLPLKASYFTKL-----STEID 357
+Y ++T P H + L GI +G ++L + S F + ID
Sbjct: 248 SAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQSMID 307
Query: 358 SGTIITRFPAPVYSALRSAFRK----RMKKYKMGKGIEDLFDTCYDLSAYKT-VVVPKIT 412
SG+ T Y+ +R + R+KK + G+ D+ C+D +A + ++ +
Sbjct: 308 SGSEFTYLVDVAYNKVREEVVRLAGPRLKKGYVYSGVSDM---CFDGNAMEIGRLIGNMV 364
Query: 413 IHFLGGVDLELDVRGTLVVESVRQV-CLGFA---LLPSDPNSILLGNVQQRGYEVHYDVA 468
F GV++ ++ +G ++ + V C+G +L + N ++GN Q+ V +D+A
Sbjct: 365 FEFDKGVEIVIE-KGRVLADVGGGVHCVGIGRSEMLGAASN--IIGNFHQQNLWVEFDIA 421
Query: 469 GRRLGFGPGNCN 480
RR+GFG +C+
Sbjct: 422 NRRVGFGKADCS 433
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 162/373 (43%), Gaps = 38/373 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ V +G P ++ +DTGS I W C C +C FFD S T +
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 164
Query: 187 CNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C+ C + + +CS + +C Y Y DGSG +G++ TD + G A
Sbjct: 165 CSDPICSSVFQ----TTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVA 220
Query: 246 R--YPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPY 294
P + GC+ +GD GI G +G +S++S+ + F +CL
Sbjct: 221 NSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG 280
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLST 354
G G+ + + Y+P+V P Q Y++ L I V G+ LPL A+ F +T
Sbjct: 281 SGGGVFVLGE---ILVPGMVYSPLV--PSQPH-YNLNLLSIGVNGQMLPLDAAVFEASNT 334
Query: 355 E---IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
+D+GT +T Y +A + ++ I + CY +S + + P +
Sbjct: 335 RGTIVDTGTTLTYLVKEAYDLFLNAISNSVS--QLVTPIISNGEQCYLVSTSISDMFPSV 392
Query: 412 TIHFLGGVDLELDVRGTL----VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
+++F GG + L + L + + C+GF P + +LG++ + YD+
Sbjct: 393 SLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQT--ILGDLVLKDKVFVYDL 450
Query: 468 AGRRLGFGPGNCN 480
A +R+G+ +C+
Sbjct: 451 ARQRIGWASYDCS 463
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 103/425 (24%), Positives = 184/425 (43%), Gaps = 54/425 (12%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQY 144
LE + RD+ R + R LQ + F+ + Y+ V +G P +
Sbjct: 44 LEALRARDRAR----HGRILQGVV----GGVVDFSVQGTSDPYFVGLYFTKVKLGSPAKE 95
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKILLEWF 199
+ +DTGS I W C C +C FFD + S T + + C C ++
Sbjct: 96 FYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCGDPICSYAVQ-- 153
Query: 200 PPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTI------QEVNGNGYFARYPFLL 251
+CSS+ +C Y Y DGSG TG++ +D M Q V N + +
Sbjct: 154 --TATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVAN---SSSTIIF 208
Query: 252 GCTDNNTGD----QNGASGIMGLDRGPVSIISKTNI-----SYFFYCLHSPYGSTGYITF 302
GC+ +GD GI G G +S+IS+ + F +CL G +
Sbjct: 209 GCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGENGGGVLVL 268
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE---IDSG 359
G+ + + + Y+P+V P Q Y++ L I+V G+ LP+ ++ F + + +DSG
Sbjct: 269 GE---ILEPSIVYSPLV--PSQPH-YNLNLQSIAVNGQLLPIDSNVFATTNNQGTIVDSG 322
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
T + Y+ A + ++ K I + CY +S + P+++++F+GG
Sbjct: 323 TTLAYLVQEAYNPFVKAITAAVSQFS--KPIISKGNQCYLVSNSVGDIFPQVSLNFMGGA 380
Query: 420 DLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
+ L+ L+ ++ C+GF + + +LG++ + YD+A +R+G+
Sbjct: 381 SMVLNPEHYLMHYGFLDGAAMWCIGFQKV--EQGFTILGDLVLKDKIFVYDLANQRIGWA 438
Query: 476 PGNCN 480
+C+
Sbjct: 439 DYDCS 443
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 104/377 (27%), Positives = 168/377 (44%), Gaps = 44/377 (11%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+ +A+G P Q V+++LDTGS ++W C + D F P S TF+ +PC S C
Sbjct: 63 VSLAVGTPPQNVTMVLDTGSELSWLLCATGRAAAAAAD-SFRPRASATFAAVPCGSARCS 121
Query: 194 ILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGC 253
PP+ +S+ C ++Y DGS G ATD + G+ R F GC
Sbjct: 122 SRDLPAPPSCD--AASRRCRVSLSYADGSASDGALATDVFAV----GDAPPLRSAF--GC 173
Query: 254 TD---NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNK 310
+++ D +G++G++RG +S +++ + F YC+ S G + G D +
Sbjct: 174 MSAAYDSSPDAVATAGLLGMNRGALSFVTQASTRRFSYCI-SDRDDAGVLLLGHSD-LPF 231
Query: 311 KFVKYTPIVT-TPEQSEF----YHITLTGISVGGERLPLKASYFT-----KLSTEIDSGT 360
+ YTP+ TP F Y + L GI VGG+ LP+ S T +DSGT
Sbjct: 232 LPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQTMVDSGT 291
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGK-----GIEDLFDTCYDLSAYK---TVVVPKIT 412
T YSA+++ F K+ K ++ FDTC+ + + + +P +T
Sbjct: 292 QFTFLLGDAYSAVKAEFLKQTKPLLPALEDPSFAFQEAFDTCFRVPKGRPPPSARLPPVT 351
Query: 413 IHFLGGVDLELDVRGTLVVESVRQV--------CLGFALLPSDP-NSILLGNVQQRGYEV 463
+ F G ++ V G ++ V CL F P + ++G+ Q V
Sbjct: 352 LLFNGA---QMSVAGDRLLYKVPGERRGADGVWCLTFGNADMVPLTAYVIGHHHQMNLWV 408
Query: 464 HYDVAGRRLGFGPGNCN 480
YD+ R+G P C+
Sbjct: 409 EYDLERGRVGLAPVKCD 425
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 81/227 (35%), Positives = 109/227 (48%), Gaps = 20/227 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
+Y + ++IG P + DTGS + W QC PC +C +Q +P FD S TFS I C S
Sbjct: 58 DYLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLNPMFDSQSSSTFSNIACGSE 117
Query: 191 TCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
+C L CS + C Y+ +YVDGS G A + +T+ G A
Sbjct: 118 SCSKLYS-------TSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEP-VAFKG 169
Query: 249 FLLGCTDNNTGDQNGAS-GIMGLDRGPVSIISKTNIS----YFFYCLHSPYGSTGYI--- 300
+ GC NN G N GI+GL RGP+S++S+ S F CL P+ + I
Sbjct: 170 VIFGCGHNNNGAFNDKEMGIIGLGRGPLSLVSQIGSSLGGNMFSQCL-VPFNTNPSISSP 228
Query: 301 -TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKA 346
+FGK V V TP+V+ FY +TL GISV LP A
Sbjct: 229 MSFGKGSEVLGNGVVSTPLVSKTTYQSFYFVTLLGISVEDINLPFNA 275
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 102/368 (27%), Positives = 154/368 (41%), Gaps = 43/368 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + IG P Q +L++DTGS +T+ C C C + +DP F P S T+ + CN +
Sbjct: 88 YTTRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQCNPSC 147
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
N D+ K+C Y+ Y + S +G A D ++ +
Sbjct: 148 ----------NCDDE--GKQCTYERRYAEMSSSSGLLAEDVLSF---GNESELTPQRAIF 192
Query: 252 GCTDNNTGD--QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVN 309
GC TG+ A GIMGL RGP+S++ + I G++ + +G D V
Sbjct: 193 GCETVETGELFSQRADGIMGLGRGPLSVVDQLVIKEVV-------GNSFSLCYGGMDVVG 245
Query: 310 KKFV--------KYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGT 360
V + P +S +Y+I L + V G+RL L F K T +DSGT
Sbjct: 246 GAMVLGNIPPPPDMVFAHSDPYRSAYYNIELKELHVAGKRLKLNPRVFDGKHGTVLDSGT 305
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCY-----DLSAYKTVVVPKITIH 414
P + A + A K +K K G + + D C+ D+S + P++ +
Sbjct: 306 TYAYLPEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSK-IFPEVNMV 364
Query: 415 FLGGVDLELDVRGTLVVES--VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRL 472
F G L L L + CLG DP + LLG + R V YD ++
Sbjct: 365 FGNGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTT-LLGGIVVRNTLVTYDRDNDKI 423
Query: 473 GFGPGNCN 480
GF NC+
Sbjct: 424 GFWKTNCS 431
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 162/373 (43%), Gaps = 38/373 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ V +G P ++ +DTGS I W C C +C FFD S T +
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 159
Query: 187 CNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C+ C + + +CS + +C Y Y DGSG +G++ TD + G A
Sbjct: 160 CSDPICSSVFQ----TTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVA 215
Query: 246 R--YPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPY 294
P + GC+ +GD GI G +G +S++S+ + F +CL
Sbjct: 216 NSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG 275
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLST 354
G G+ + + Y+P+V P Q Y++ L I V G+ LPL A+ F +T
Sbjct: 276 SGGGVFVLGE---ILVPGMVYSPLV--PSQPH-YNLNLLSIGVNGQMLPLDAAVFEASNT 329
Query: 355 E---IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
+D+GT +T Y +A + ++ I + CY +S + + P +
Sbjct: 330 RGTIVDTGTTLTYLVKEAYDLFLNAISNSVS--QLVTPIISNGEQCYLVSTSISDMFPSV 387
Query: 412 TIHFLGGVDLELDVRGTL----VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
+++F GG + L + L + + C+GF P + +LG++ + YD+
Sbjct: 388 SLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQT--ILGDLVLKDKVFVYDL 445
Query: 468 AGRRLGFGPGNCN 480
A +R+G+ +C+
Sbjct: 446 ARQRIGWASYDCS 458
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 105/383 (27%), Positives = 167/383 (43%), Gaps = 49/383 (12%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPF------FDPSKSKTFSKIPC 187
+ +A+G P Q V+++LDTGS ++W C S F P S TF+ +PC
Sbjct: 65 VSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPC 124
Query: 188 NSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY 247
ST C PP+ +S++C ++Y DGS G ATD + E R
Sbjct: 125 GSTQCSSRDLPAPPSCDG--ASRQCHVSLSYADGSASDGALATDVFAVGEAPP----LRS 178
Query: 248 PFLLGCTD---NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGK 304
F GC +++ D +G++G++RG +S +++ + F YC+ S G + G
Sbjct: 179 AF--GCMSTAYDSSPDGVATAGLLGMNRGTLSFVTQASTRRFSYCI-SDRDDAGVLLLGH 235
Query: 305 PDTVNKKFVKYTPIVTTPEQSEF-----YHITLTGISVGGERLPLKASYFT-----KLST 354
D + + YTP+ + Y + L GI VGG+ LP+ AS T
Sbjct: 236 SD-LPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQT 294
Query: 355 EIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGK-----GIEDLFDTCYDLSAYK---TV 406
+DSGT T YSAL++ F K+ K ++ DTC+ + A + +
Sbjct: 295 MVDSGTQFTFLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQEALDTCFRVPAGRPPPSA 354
Query: 407 VVPKITIHFLGGVDLELDVRGTLVVESVRQV--------CLGFALLPSDP-NSILLGNVQ 457
+P +T+ F G E+ V G ++ V CL F P + ++G+
Sbjct: 355 RLPPVTLLFNGA---EMSVAGDRLLYKVPGEHRGADGVWCLTFGNADMVPLTAYVIGHHH 411
Query: 458 QRGYEVHYDVAGRRLGFGPGNCN 480
Q V YD+ R+G P C+
Sbjct: 412 QMNLWVEYDLERGRVGLAPVKCD 434
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 103/422 (24%), Positives = 182/422 (43%), Gaps = 48/422 (11%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQY 144
LE + RD+ R + R LQ + F+ + Y+ V +G P +
Sbjct: 44 LEALRARDRAR----HGRILQGVV----GGVVDFSVQGTSDPYFVGLYFTKVKLGSPAKD 95
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKILLEWF 199
+ +DTGS I W C C +C FFD + S T + + C C ++
Sbjct: 96 FYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCADPICSYAVQ-- 153
Query: 200 PPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEV-NGNGYFARYP--FLLGCT 254
CSS+ +C Y Y DGSG TG++ +D M V G A + GC+
Sbjct: 154 --TATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVANSSSTIVFGCS 211
Query: 255 DNNTGD----QNGASGIMGLDRGPVSIISKTNI-----SYFFYCLHSPYGSTGYITFGKP 305
+GD GI G G +S+IS+ + F +CL G + G+
Sbjct: 212 TYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGENGGGVLVLGE- 270
Query: 306 DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE---IDSGTII 362
+ + + Y+P+V + Y++ L I+V G+ LP+ ++ F + + +DSGT +
Sbjct: 271 --ILEPSIVYSPLVPSLPH---YNLNLQSIAVNGQLLPIDSNVFATTNNQGTIVDSGTTL 325
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
Y+ A + ++ K I + CY +S + P+++++F+GG +
Sbjct: 326 AYLVQEAYNPFVDAITAAVSQFS--KPIISKGNQCYLVSNSVGDIFPQVSLNFMGGASMV 383
Query: 423 LDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGN 478
L+ L+ ++S C+GF + + +LG++ + YD+A +R+G+ N
Sbjct: 384 LNPEHYLMHYGFLDSAAMWCIGFQKV--ERGFTILGDLVLKDKIFVYDLANQRIGWADYN 441
Query: 479 CN 480
C+
Sbjct: 442 CS 443
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/372 (25%), Positives = 161/372 (43%), Gaps = 38/372 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ V +G P ++ +DTGS I W C C +C FFD S T +
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 159
Query: 187 CNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C+ C + + +CS + +C Y Y DGSG +G++ TD + G A
Sbjct: 160 CSDPICSSVFQ----TTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVA 215
Query: 246 R--YPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPY 294
P + GC+ +GD GI G +G +S++S+ + F +CL
Sbjct: 216 NSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG 275
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLST 354
G G+ + + Y+P+V P Q Y++ L I V G+ LPL A+ F +T
Sbjct: 276 SGGGVFVLGE---ILVPGMVYSPLV--PSQPH-YNLNLLSIGVNGQMLPLDAAVFEASNT 329
Query: 355 E---IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
+D+GT +T Y +A + ++ I + CY +S + + P +
Sbjct: 330 RGTIVDTGTTLTYLVKEAYDLFLNAISNSVS--QLVTPIISNGEQCYLVSTSISDMFPSV 387
Query: 412 TIHFLGGVDLELDVRGTL----VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
+++F GG + L + L + + C+GF P + +LG++ + YD+
Sbjct: 388 SLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEE--QTILGDLVLKDKVFVYDL 445
Query: 468 AGRRLGFGPGNC 479
A +R+G+ +C
Sbjct: 446 ARQRIGWASYDC 457
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 118/431 (27%), Positives = 182/431 (42%), Gaps = 76/431 (17%)
Query: 116 KAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCK------------- 161
+AF P +G +Y++ +G P + L+ DTGS +TW +C+
Sbjct: 38 EAFAMPLSSGAYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAP 97
Query: 162 ---------------PCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDK 206
+ F P +S+T++ IPC+S TC L P
Sbjct: 98 GYNYGYGAPASNDSSSVSAAASSPARVFRPDRSRTWAPIPCSSDTCTASL----PFSLAA 153
Query: 207 CSS--KECPYDIAYVDGSGETGFWATDRMTIQEVNGN--GYFARYPFL----LGCTDNNT 258
C + C Y+ Y DGS G TD TI ++G G R L LGCT + T
Sbjct: 154 CPTPGSPCAYEYRYKDGSAARGTVGTDSATI-ALSGRRAGKKQRRAKLRGVVLGCTTSYT 212
Query: 259 GDQNGAS-GIMGLDRGPVSIISKTNISY---FFYCL--H-SPYGSTGYITFGKPDTVNKK 311
G+ AS G++ L VS S+ + F YCL H +P +T Y+TFG V+
Sbjct: 213 GESFLASDGVLSLGYSNVSFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSA 272
Query: 312 F--------------VKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYFTKLSTE 355
+ TP++ FY + + G+SV GE R+P K
Sbjct: 273 SASRTACAGSAAAPGARQTPLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKGGGA 332
Query: 356 I-DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT-----VVVP 409
I DSGT +T +P Y A+ +A K++ + + D FD CY+ ++ T V VP
Sbjct: 333 ILDSGTSLTVLVSPAYRAVVAALGKKL--VGLPRVAMDPFDYCYNWTSPLTGEDLAVAVP 390
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSD-PNSILLGNVQQRGYEVHYDVA 468
+ +HF G L+ + ++ + C+G L D P ++GN+ Q+ + +D+
Sbjct: 391 ALAVHFAGSARLQPPPKSYVIDAAPGVKCIG--LQEGDWPGVSVIGNILQQEHLWEFDLK 448
Query: 469 GRRLGFGPGNC 479
RRL F C
Sbjct: 449 NRRLRFKRSRC 459
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 107/368 (29%), Positives = 166/368 (45%), Gaps = 37/368 (10%)
Query: 126 IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKI 185
++ + + + IG P Q + L LDT + W C CI C F KS +F +
Sbjct: 20 LIQSPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPSTT--VFSSDKSSSFRPL 77
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
PC S C + PN CS C +++ Y S D +T+ + Y
Sbjct: 78 PCQSPQCNQV-----PN--PSCSGSACGFNLTY-GSSTVAADLVQDNLTLATDSVPSY-- 127
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHS--PYGSTGYI 300
GC TG G++GL RGP+S++ ++ Y F YCL S +G +
Sbjct: 128 ----TFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSL 183
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYF---TKLSTE 355
G +KYTP++ P +S Y++ L I VG + +P A F T T
Sbjct: 184 RLGP--VAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTV 241
Query: 356 IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHF 415
IDSGT TR AP Y+A+R FR+R+ + + FDTCY + ++ P IT F
Sbjct: 242 IDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGG-FDTCYTVP----IISPTITFMF 296
Query: 416 LGGVDLELDVRGTLV-VESVRQVCLGFALLPSDPNSIL--LGNVQQRGYEVHYDVAGRRL 472
G+++ L L+ S CL A P + NS+L + ++QQ+ + + +D+ R+
Sbjct: 297 -AGMNVTLPPDNFLIHSTSGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRV 355
Query: 473 GFGPGNCN 480
G +C+
Sbjct: 356 GVARESCS 363
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 160/360 (44%), Gaps = 36/360 (10%)
Query: 136 VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
++IG P LL+DTGS +TW QC PC C Q PFF PS+S T+ C S
Sbjct: 92 ISIGDPPVPQLLLIDTGSDLTWIQCLPC-KCYPQTIPFFHPSRSSTYRNASCESAP---- 146
Query: 196 LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTD 255
P +D+ + C Y + Y D S G A +++T Q + G ++ + GC
Sbjct: 147 -HAMPQIFRDE-KTGNCRYHLRYRDFSNTRGILAKEKLTFQ-TSDEGLISKPNIVFGCGQ 203
Query: 256 NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHS---PYGSTGYITFGKPDTVNKKF 312
+N+G SG++GL G SI+++ S F YC S P ++ G N
Sbjct: 204 DNSGFTQ-YSGVLGLGPGTFSIVTRNFGSKFSYCFGSLIDPTYPHNFLILG-----NGAR 257
Query: 313 VKYTPIVTTPEQ--SEFYHITLTGISVGGERLPLKASYF----TKLSTEIDSGTIITRFP 366
++ P TP Q + Y++ L IS+G + L ++ F +K T ID+G T
Sbjct: 258 IEGDP---TPLQIFQDRYYLDLQAISLGEKLLDIEPGIFQRYRSKGGTVIDTGCSPTILA 314
Query: 367 APVYSALRSAFRKRMKK-YKMGKGIEDLFDTCYD----LSAYKTVVVPKITIHFLGGVDL 421
Y L + + + K E + CY+ L Y P +T HF GG +L
Sbjct: 315 REAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYG---FPVVTFHFAGGAEL 371
Query: 422 ELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
LDV V ES CL + D S+ +G + Q+ Y V Y++ ++ F +C
Sbjct: 372 ALDVESLFVSSESGDSFCLAMTMNTFDDMSV-IGAMAQQNYNVGYNLRTMKVYFQRTDCE 430
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 101/370 (27%), Positives = 168/370 (45%), Gaps = 31/370 (8%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HC---SQQRDPFFDPSKSKTF 182
+ +++++ +++G P + + +DTGS I+W QC+ CI HC Q+ P F+ S S T+
Sbjct: 18 IRKNQFFMGISLGTPAVFNLVTIDTGSTISWVQCQYCIVHCYTQDQRAGPTFNTSSSSTY 77
Query: 183 SKIPCNSTTCKIL-LEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVN 239
++ C++ C + + P+G C +E C Y + Y G G+ + DR+T+
Sbjct: 78 RRVGCSAQVCHDMHVSQNIPSG---CVEEEDSCIYSLRYASGEYSAGYLSQDRLTL---- 130
Query: 240 GNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISK----TNISYFFYCLHSPYG 295
N Y + F+ GC +N + + A GI+G S ++ TN S F YC S
Sbjct: 131 ANSYSIQ-KFIFGCGSDNRYNGHSA-GIIGFGNKSYSFFNQIAQLTNYSAFSYCFPSNQE 188
Query: 296 STGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE 355
+ G+++ G P + + T + Y + + V G RL + +T T
Sbjct: 189 NEGFLSIG-PYVRDSNKLILTQLFDYGAHLPVYALQQFDMMVNGMRLQVDPPVYTTRMTV 247
Query: 356 IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCY--DLSAYKTVVVPKITI 413
+DSGT+ T +PV+ AL A K M +G D + C+ + + +P + I
Sbjct: 248 VDSGTVETFVLSPVFRALDRALTKAMVAEGYVRG-SDSKEICFHSNGDSVDWSKLPVVEI 306
Query: 414 HFLGGVDLELDVRGTLVVE-SVRQVCLGFALLPSD---PNSILLGNVQQRGYEVHYDVAG 469
F + L+L E S +C F P D P +LGN R + V +D+
Sbjct: 307 KFSRSI-LKLPAENVFYYETSDGSICSTFQ--PDDAGVPGVQILGNRATRSFRVVFDIQQ 363
Query: 470 RRLGFGPGNC 479
R GF G C
Sbjct: 364 RNFGFEAGAC 373
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 103/392 (26%), Positives = 172/392 (43%), Gaps = 36/392 (9%)
Query: 114 KTKAFTFPAKTGI-VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQC-------KPCIH 165
++ AF P +G +Y++ + +G P Q L+ DTGS +TW +C
Sbjct: 85 ESSAFAMPLTSGAYTGTGQYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAA 144
Query: 166 CSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS--KECPYDIAYVDGSG 223
QR F P+ SK++S +PC+S TCK + P CSS C YD Y D S
Sbjct: 145 SPPQR--VFRPAGSKSWSPLPCDSDTCKS----YVPFSLANCSSPPDPCSYDYRYKDNSS 198
Query: 224 ETGFWATDRMTIQEVNGNGYFAR--YPFLLGCTDNNTGDQ-NGASGIMGLDRGPVSIISK 280
G D T+ +G +LGCT + G + G++ L +S S+
Sbjct: 199 ARGVVGLDSATVSLSGNDGTRKAKLQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASR 258
Query: 281 TNISY---FFYCL--H-SPYGSTGYITFGK--PDTVNKKFVKYTPIVTTPEQSE--FYHI 330
+ F YCL H +P +T ++TFG + + TP+V + FY +
Sbjct: 259 AASRFGGRFSYCLVDHLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFV 318
Query: 331 TLTGISVGGER---LPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMG 387
++ ++V GER LP + +DSGT +T P Y A+ A K+ +
Sbjct: 319 SVDAVTVAGERLEILPDVWDFRKNGGAILDSGTSLTILATPAYDAVVKAISKQFA--GVP 376
Query: 388 KGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSD 447
+ D F+ CY+ + + +P++ + F G L + ++ + C+G + +
Sbjct: 377 RVNMDPFEYCYNWTGV-SAEIPRMELRFAGAATLAPPGKSYVIDTAPGVKCIG-VVEGAW 434
Query: 448 PNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
P ++GN+ Q+ + +D+A R L F C
Sbjct: 435 PGVSVIGNILQQEHLWEFDLANRWLRFKQSRC 466
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 162/371 (43%), Gaps = 57/371 (15%)
Query: 127 VAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP 186
+ + EY++ V +G P ++ SL+LDTGS + W QC PC C QQ D
Sbjct: 165 LGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQND--------------- 209
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
++ CPY Y D S TG +A + T+ G
Sbjct: 210 ----------------------NQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSEL 247
Query: 247 Y---PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGST 297
Y + GC N G +GA+G++GL RGP+S S+ Y F YCL +S +
Sbjct: 248 YNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVS 307
Query: 298 GYITFGK-PDTVNKKFVKYTPIVTTPEQ--SEFYHITLTGISVGGERLPLKASYFTKLS- 353
+ FG+ D ++ + +T V E FY++ + I V GE L + + S
Sbjct: 308 SKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSD 367
Query: 354 ----TEIDSGTIITRFPAPVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYDLSAYKTVVV 408
T IDSGT ++ F P Y +++ ++ K KY + + + D C+++S V +
Sbjct: 368 GAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFP-ILDPCFNVSGIHNVQL 426
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVA 468
P++ I F G + + + VCL P SI +GN QQ+ + + YD
Sbjct: 427 PELGIAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSI-IGNYQQQNFHILYDTK 485
Query: 469 GRRLGFGPGNC 479
RLG+ P C
Sbjct: 486 RSRLGYAPTKC 496
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 104/392 (26%), Positives = 177/392 (45%), Gaps = 41/392 (10%)
Query: 114 KTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPF 173
+TK ++ ++ + + + IG P Q ++LDTGS ++W QC +
Sbjct: 62 QTKQPSYNYRSSFKYSMALIVSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTT 121
Query: 174 -FDPSKSKTFSKIPCNSTTCKILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATD 231
FDPS S +FS +PCN CK + F P D+ ++ C Y Y DG+ G +
Sbjct: 122 SFDPSLSSSFSVLPCNHPLCKPRIPDFTLPTTCDQ--NRLCHYSYFYADGTYAEGSLVRE 179
Query: 232 RMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL- 290
++T P +LGC + +T ++ GI+G++ G S S+ IS F YC+
Sbjct: 180 KITFSSSQSTP-----PLILGCAEASTDEK----GILGMNLGRRSFASQAKISKFSYCVP 230
Query: 291 ----HSPYGSTGYITFGK-PDTVNKKFVK---YTPIVTTPEQSEF-YHITLTGISVGGER 341
+ STG G P++ +++ +TP +P Y I + GI +G R
Sbjct: 231 TRQARAGLSSTGSFYLGNNPNSGRFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNAR 290
Query: 342 LPLKASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRK----RMKKYKMGKGIED 392
L + A+ F T IDSG+ T Y+ +R + ++KK + G+ D
Sbjct: 291 LNISATLFRPDPSGAGQTIIDSGSEFTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVSD 350
Query: 393 LFDTCYDLSAYKT-VVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFA---LLPSDP 448
+ C+D + + ++ + F GV++ +D L C+G +L +
Sbjct: 351 M---CFDGNPMEIGRLIGNMVFEFEKGVEIVIDKWRVLADVGGGVHCIGIGRSEMLGAAS 407
Query: 449 NSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
N ++GN Q+ V YD+A RR+G G +C+
Sbjct: 408 N--IIGNFHQQNLWVEYDLANRRIGLGKADCS 437
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 95/371 (25%), Positives = 168/371 (45%), Gaps = 40/371 (10%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+ + IG P Q ++LDTGS ++W QC + FDPS S +FS +PCN CK
Sbjct: 84 VSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLPCNHPLCK 143
Query: 194 ILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLG 252
+ F P D+ ++ C Y Y DG+ G +++T + P +LG
Sbjct: 144 PRIPDFTLPTSCDQ--NRLCHYSYFYADGTLAEGNLVREKITFSRSQ-----STPPLILG 196
Query: 253 CTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGK---PDTVN 309
C + + + A GI+G++ G +S S+ ++ F YC+ + G+ G + N
Sbjct: 197 CAE----ESSDAKGILGMNLGRLSFASQAKLTKFSYCVPTRQVRPGFTPTGSFYLGENPN 252
Query: 310 KKFVKYTPIVTTPEQSEF-------YHITLTGISVGGERLPLKASYFT-----KLSTEID 357
+Y ++T + Y + + GI +G ++L + S F T ID
Sbjct: 253 SGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAGQTMID 312
Query: 358 SGTIITRFPAPVYSALRSAFRK----RMKKYKMGKGIEDLFDTCYDLSAYKT-VVVPKIT 412
SG+ T Y+ +R + R+KK + G+ D+ C++ +A + ++ +
Sbjct: 313 SGSEFTYLVDEAYNKVREEVVRLVGARLKKGYVYGGVSDM---CFNGNAIEIGRLIGNMV 369
Query: 413 IHFLGGVDLELDVRGTLVVESVRQVCLGFA---LLPSDPNSILLGNVQQRGYEVHYDVAG 469
F GV++ ++ L C+G +L + N ++GN Q+ V +D+A
Sbjct: 370 FEFDKGVEIVVEKERVLADVGGGVHCVGIGRSEMLGAASN--IIGNFHQQNIWVEFDLAN 427
Query: 470 RRLGFGPGNCN 480
RR+GFG +C+
Sbjct: 428 RRVGFGKADCS 438
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 121/461 (26%), Positives = 202/461 (43%), Gaps = 70/461 (15%)
Query: 60 VSLEVLGRYGPCSKLNQGKSRNTPSLEEILR---------RDQQRLHLKNSRRLQKAIPD 110
+++E++ + P S L G N P E+IL+ Q + N + + +
Sbjct: 14 LTMELIHKDSPQSPLYPG---NLPPGEQILQPAACPFAGLHHQTSMMSTNKAVMNRMMSP 70
Query: 111 NFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH----C 166
F F A+ G+ + E K Y +DTG+ ++W QC+ C + C
Sbjct: 71 LTSYGDPFLFLAQVGVGSFQEKSHRTHF---KTYY-FQIDTGNELSWIQCEGCQNKGNMC 126
Query: 167 SQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETG 226
+DP + S+SK++ + CN + + PN +C C Y++ Y GS +G
Sbjct: 127 FPHKDPPYTSSQSKSYKPVSCNQHS------FCEPN---QCKEGLCAYNVTYGPGSYTSG 177
Query: 227 FWATDRMTIQEVNGNGYFARYPFLLGCTDNNTG-------DQNGASGIMGLDRGPVSIIS 279
A + T +G + A GC+ ++ D+N SG++G+ GP S ++
Sbjct: 178 NLANETFTFYSNHGK-HTALKSISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLA 236
Query: 280 KT-NISY--FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGIS 336
+ +IS+ F YC+ + Y+ FGK V K ++ T I+ + S YH+ L GIS
Sbjct: 237 QLGSISHGKFSYCITANNTHNTYLRFGK-HVVKSKNLQTTKIMQV-KPSAAYHVNLLGIS 294
Query: 337 VGGERLPLKASYFTKLSTE--------IDSGTIITRFPAPVYSALRSAF------RKRMK 382
V G +L + T L+ ID+GT+ T P++ L +A + +K
Sbjct: 295 VNGVKLNITK---TDLAVRKDGSRGCIIDAGTLATLLVKPIFDTLHTALSNHLSSNQNLK 351
Query: 383 KYKMGKGIEDLFDTCYD-LSAYKTVVVPKITIHFLGGVDLELDVRGTLVV---ESVRQVC 438
++ + K +DL CY+ LS +P +T H L DLE+ + E C
Sbjct: 352 RWVIHKLHKDL---CYEQLSDAGRKNLPVVTFH-LENADLEVKPEAIFLFREFEGKNVFC 407
Query: 439 LGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L + SD + ++G QQ + YD R L FGP +C
Sbjct: 408 LS---MLSDDSKTIIGAYQQMKQKFVYDTKARVLSFGPEDC 445
>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 126/440 (28%), Positives = 195/440 (44%), Gaps = 47/440 (10%)
Query: 57 PGKVSLEVLGRYGPCSKLNQGKSRNTPS-LEEILRRDQQRLHLKNSRRLQKAIPDN-FKK 114
P L V+ YG CS N K+ + + + + +D R+ +S QK +
Sbjct: 30 PDDSDLNVIPMYGKCSPFNPQKTDSWDNRVLNMASKDPARMSYLSSLVAQKTVSSAPIAS 89
Query: 115 TKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFF 174
+AF Y + V IG P Q + ++LDT + + CI CS F
Sbjct: 90 GQAFNI---------GNYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGCSATT---F 137
Query: 175 DPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMT 234
P+ S ++ + C+ C + P S C ++ +Y GS + D +
Sbjct: 138 SPNASTSYVPLECSVPQCSQVRGLSCP----ATGSGACSFNKSYA-GSTYSATLVQDSLR 192
Query: 235 IQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLH 291
+ Y F G + +G A G++GL RGP+S++S+T Y F YCL
Sbjct: 193 L----ATDVIPSYSF--GSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLP 246
Query: 292 S--PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF 349
S Y +G + G K ++ TP++ P + Y + LTGI+VG +P
Sbjct: 247 SFKSYYFSGSLKLGP--VGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELL 304
Query: 350 -----TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYK 404
T T IDSGT+ITRF PVY+A+R FRK++ G FDTC+ + Y+
Sbjct: 305 AFDVNTGSGTIIDSGTVITRFVEPVYNAVRDEFRKQVTGPFSSLGA---FDTCF-VKNYE 360
Query: 405 TVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSILL---GNVQQRG 460
T + P IT+HF +DL+L + +L+ S + CL A P + N +L N QQ+
Sbjct: 361 T-LAPAITLHFT-DLDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQN 418
Query: 461 YEVHYDVAGRRLGFGPGNCN 480
V +D ++G CN
Sbjct: 419 LRVLFDTVNNKVGIARELCN 438
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 109/423 (25%), Positives = 190/423 (44%), Gaps = 52/423 (12%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQY 144
L+E+ RD+ R + R LQ ++ + P + G+ Y+ V +G P +
Sbjct: 45 LDELKARDRVR----HGRFLQSSVGVVDFPVEGTYDPYRVGL-----YFTRVLLGSPPKE 95
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKILLEWF 199
+ +DTGS + W C C C Q FFDP S T S I C+ C + ++
Sbjct: 96 FYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLISCSDQRCSLGVQ-- 153
Query: 200 PPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF-ARYPFLLGCTDN 256
+ CSS+ +C Y Y DGSG +G++ +D + + G+ + + GC+ +
Sbjct: 154 --SSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSSASIVFGCSIS 211
Query: 257 NTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPYGSTGYITFGKPDT 307
TGD GI G + +S+IS+ + F +CL G G + G+
Sbjct: 212 QTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGGGILVLGE--- 268
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF---TKLSTEIDSGTIITR 364
+ ++ + Y+P+V P Q Y++ L ISV G+ L + F T T +DSGT +
Sbjct: 269 IVEEDIVYSPLV--PSQPH-YNLNLQSISVNGKSLAIDPEVFATSTNRGTIVDSGTTLAY 325
Query: 365 FPAPVYSALRSAFRKRMKKYK---MGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
Y SA + + + + KG + CY +++ + P ++++F GGV +
Sbjct: 326 LAEEAYDPFVSAITEAVSQSVRPLLSKGTQ-----CYLITSSVKGIFPTVSLNFAGGVSM 380
Query: 422 ELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
L L+ + C+GF + +I LG++ + YD+AG+R+G+
Sbjct: 381 NLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITI-LGDLVLKDKIFVYDLAGQRIGWANY 439
Query: 478 NCN 480
+C+
Sbjct: 440 DCS 442
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 101/374 (27%), Positives = 165/374 (44%), Gaps = 46/374 (12%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+ +A+G P Q +S++LDTGS ++W CK S F+P S T+S +PC+S C+
Sbjct: 67 VTLAVGDPPQNISMVLDTGSELSWLHCKK----SPNLGSVFNPVSSSTYSPVPCSSPICR 122
Query: 194 ILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGC 253
P + C I+Y D + G A + I V G L GC
Sbjct: 123 TRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGT------LFGC 176
Query: 254 TD----NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVN 309
D +N+ + ++G+MG++RG +S +++ S F YC+ S S+ ++ G
Sbjct: 177 MDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI-SGSDSSVFLLLGDASYSW 235
Query: 310 KKFVKYTPIV--TTPE---QSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSG 359
++YTP+V +TP Y + L GI VG + L L S F T +DSG
Sbjct: 236 LGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSG 295
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL-----FDTCYDLSAYKT---VVVPKI 411
T T PVY+AL++ F + K D D CY + + +P +
Sbjct: 296 TQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMV 355
Query: 412 TIHFLGGVDLELDVRGTLVVESV-------RQVCLGFALLPSD---PNSILLGNVQQRGY 461
++ F G E+ V G ++ V ++ F SD + ++G+ Q+
Sbjct: 356 SLMFRGA---EMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNV 412
Query: 462 EVHYDVAGRRLGFG 475
+ +D+A R+GF
Sbjct: 413 WMEFDLAKSRVGFA 426
>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 530
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 101/366 (27%), Positives = 164/366 (44%), Gaps = 39/366 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP--------FFDPSKSKTFS 183
+Y VVA+G P + LDTGS + W C CI C+ P + P KS T
Sbjct: 99 HYAVVALGTPNVTFLVALDTGSDLFWVPCD-CIKCAPLASPDYGDLKFDMYSPRKSSTSR 157
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYV-DGSGETGFWATDRMTIQEVNGNG 242
K+PC+S+ C P +S CPY I Y+ + + G D + + +G
Sbjct: 158 KVPCSSSLCD-------PQADCSAASNSCPYSIQYLSENTSSKGVLVEDVLYLTTESGQS 210
Query: 243 YFARYPFLLGCTDNNTGDQNGAS---GIMGL---DRGPVSIISKTNISYFFYCLHSPYGS 296
+ P GC +G G++ G++GL + S+++ I+ + +
Sbjct: 211 KITQAPITFGCGQVQSGSFLGSAAPNGLLGLGMDSKSVPSLLASKGIAANSFSMCFGEDG 270
Query: 297 TGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEI 356
G I FG DT + ++ TP+ +Q+ +Y+I++TG VGG+ S+ TK S +
Sbjct: 271 HGRINFG--DTGSSDQLE-TPL-NIYKQNPYYNISITGAMVGGK------SFDTKFSAVV 320
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFL 416
DSGT T P+Y+ + S F ++K+ + F+ CY +SA V P I++
Sbjct: 321 DSGTSFTALSDPMYTEITSTFNAQVKESRKHLDASMPFEYCYSISAQGAVNPPNISLTAK 380
Query: 417 GGVDLELDVRG---TLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
GG V G T+ S R + A++ S+ L+G G ++ +D LG
Sbjct: 381 GGSIFP--VNGPIITITDTSSRPIAYCLAIMKSE-GVNLIGENFMSGLKIVFDRERLVLG 437
Query: 474 FGPGNC 479
+ NC
Sbjct: 438 WKTFNC 443
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 99/379 (26%), Positives = 168/379 (44%), Gaps = 43/379 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ V +G P ++ + +DTGS + W C+PC C ++ +DP +S T S +
Sbjct: 2 YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 61
Query: 187 CNSTTCKILLEWFPPNGQDKCS--SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF 244
C+ C + + +CS + C Y +Y DGS G++ D M ++ NG
Sbjct: 62 CSDPLCVRGRRF----AEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLA 117
Query: 245 -ARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSI----ISKTNISYFF-YCLHSPY 294
L GC+ TGD Q GI+G + +S+ ++ NI F +CL
Sbjct: 118 NTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLE--- 174
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLST 354
G + + + YTP+V S Y++ L GISV RLP+ A F+ +
Sbjct: 175 GEKRGGGILVIGGIAEPGMTYTPLV---PDSVHYNVVLRGISVNSNRLPIDAEDFSSTND 231
Query: 355 E---IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
+DSGT + FP+ Y+ A R+ + ++ + C+ +S + + P +
Sbjct: 232 TGVIMDSGTTLAYFPSGAYNVFVQAIREATSATPV--RVQGMDTQCFLVSGRLSDLFPNV 289
Query: 412 TIHFLGGV-----DLELDVRGTLVVESVRQVCLGF-----ALLPSDPNSI-LLGNVQQRG 460
T++F GG D L GT + C+G+ + P D + + +LG++ +
Sbjct: 290 TLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKD 349
Query: 461 YEVHYDVAGRRLGFGPGNC 479
V YD+ R+G+ NC
Sbjct: 350 KLVVYDLDNSRIGWMSYNC 368
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 102/370 (27%), Positives = 153/370 (41%), Gaps = 43/370 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQR---DPFFDPSKSKTFSKIPCN 188
Y V IG P Q +L++DTGS +T+ C C HC + DP F P S ++ + CN
Sbjct: 99 YTSRVFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSSYQTVSCN 158
Query: 189 STTCKILLEWFPPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNG-YFA 245
S C + C ++ +C Y+ Y + S G D + GNG
Sbjct: 159 SPDCITKM----------CDARVHQCKYERVYAEMSSSKGVLGKDLLGF----GNGSRLQ 204
Query: 246 RYPFLLGCTDNNTGD--QNGASGIMGLDRGPVSIISK-----TNISYFFYCLHSPYGSTG 298
+P L GC TGD A GIMGL RGP+SI+ + F C G
Sbjct: 205 PHPLLFGCETAETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGGG 264
Query: 299 YITFGK-PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEI 356
+ G P F K + P +S +Y++ L+ I V G L + + F +L T +
Sbjct: 265 SMVLGAIPPPPAMVFAK-----SDPNRSNYYNLELSEIQVQGVSLNVPSEVFNGRLGTVL 319
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYKTVVV----PKI 411
DSGT P + A + A +++ + G + + D C+ + + + P +
Sbjct: 320 DSGTTYAYLPDKAFDAFKDAITQQLGSLQAVPGPDPSYPDVCFAGAGSDSKALGKHFPPV 379
Query: 412 TIHFLGGVDLELDVRGTLVVESV--RQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
F G + L L + CLGF + + LLG + R V YD A
Sbjct: 380 DFVFSGNQKVFLAPENYLFKHTKVPGAYCLGF--FKNQDATTLLGGIVVRNTLVTYDRAN 437
Query: 470 RRLGFGPGNC 479
++GF NC
Sbjct: 438 HQIGFFKTNC 447
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 109/393 (27%), Positives = 170/393 (43%), Gaps = 59/393 (15%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + +A G P Q +S + DTGS + W C CS+ P+ DP+ F +P S++
Sbjct: 132 YSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKF--VPKLSSS 189
Query: 192 CKIL------LEW-FPPNGQDKC---------SSKECP-YDIAYVDGSGET-GFWATDRM 233
K++ W F PN + +C S CP Y + Y GSG T G ++ +
Sbjct: 190 VKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQY--GSGATAGILLSETL 247
Query: 234 TIQEVNGNGYFARYP-FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL-- 290
++ R P FL+GC+ + + +GI G RGP S+ S+ + F +CL
Sbjct: 248 DLEN-------KRVPDFLVGCSVMSV---HQPAGIAGFGRGPESLPSQMRLKRFSHCLVS 297
Query: 291 ----HSPYGSTGYITFG-KPDTVNKKFVKYTPIVTTPEQS-----EFYHITLTGISVGGE 340
SP S + G + D K Y P P S E+Y+++L I +GG+
Sbjct: 298 RGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGK 357
Query: 341 RLPLKASYFTKLSTE-----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL-- 393
+ Y ST IDSG+ T P++ A+ K++ KY K +E
Sbjct: 358 PVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQSG 417
Query: 394 FDTCYDL-SAYKTVVVPKITIHFLGGVDLELDVRGTL-VVESVRQVCLGF-----ALLPS 446
C+++ ++ P + + F GG L L L +V VCL +
Sbjct: 418 LRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAVVGGG 477
Query: 447 DPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+I+LG QQ+ V YD+A +R+GF C
Sbjct: 478 GGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 510
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 113/446 (25%), Positives = 188/446 (42%), Gaps = 66/446 (14%)
Query: 75 NQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYI 134
+G+S L +R + L + RL+ F+ + T P
Sbjct: 18 GEGRSPAGTVLPLQVRVQEVELEAPAANRLR------FRHNVSLTVP------------- 58
Query: 135 VVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
VA+G P Q V+++LDTGS ++W C + P F+ S S ++ +PC ST C+
Sbjct: 59 -VAVGTPPQNVTMVLDTGSELSWLLCNGSY--APPLTPAFNASGSSSYGAVPCPSTACEW 115
Query: 195 LLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQ----EVNGNGYFA---R 246
P P D S C ++Y D S G ATD + V YF
Sbjct: 116 RGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPPVAVGAYFGCITS 175
Query: 247 YPFLLGCTDNNTGD--QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGK 304
Y N TG A+G++G++RG +S +++T F YC+ +P G + G
Sbjct: 176 YSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCI-APGEGPGVLLLGD 234
Query: 305 PDTVNKKFVKYTPIVTTPEQSEF-----YHITLTGISVGGERLPLKASYFT-----KLST 354
V YTP++ + + Y + L GI VG LP+ S T T
Sbjct: 235 DGGVAPPL-NYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTGAGQT 293
Query: 355 EIDSGTIITRFPAPVYSALRSAF--RKRMKKYKMGKG---IEDLFDTCY----DLSAYKT 405
+DSGT T A Y+AL++ F + R+ +G+ + FD C+ A +
Sbjct: 294 MVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPGFVFQGAFDACFRGPEARVAAAS 353
Query: 406 VVVPKITIHFLGGVDLEL-----------DVRGTLVVESVRQVCLGFALLPSDPNSILLG 454
++P++ + L G ++ + + RG E+V + G + + + ++ ++G
Sbjct: 354 GLLPEVGL-VLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDM-AGMSAYVIG 411
Query: 455 NVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ Q+ V YD+ R+GF P C+
Sbjct: 412 HHHQQNVWVEYDLQNGRVGFAPARCD 437
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 100/376 (26%), Positives = 169/376 (44%), Gaps = 50/376 (13%)
Query: 141 PKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPF--FDPSKSKTFSKIPCNSTTCKILL-E 197
P Q +S+++DTGS ++W +C S +P FDP++S ++S IPC+S TC+ +
Sbjct: 82 PPQNISMVIDTGSELSWLRCNR----SSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137
Query: 198 WFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNN 257
+ P D S K C ++Y D S G A + + + GC +
Sbjct: 138 FLIPASCD--SDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNL-----IFGCMGSV 190
Query: 258 TG----DQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFV 313
+G + +G++G++RG +S IS+ F YC+ G++ G + +
Sbjct: 191 SGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPL 250
Query: 314 KYTPI--VTTPE---QSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIIT 363
YTP+ ++TP Y + LTGI V G+ LP+ S T +DSGT T
Sbjct: 251 NYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFT 310
Query: 364 RFPAPVYSALRSAFRKR----MKKYKMGKGI-EDLFDTCYDLSAYKTVV-----VPKITI 413
PVY+ALRS F R + Y+ + + D CY +S + +P +++
Sbjct: 311 FLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSL 370
Query: 414 HFLGGVDLELDVRGTLVVESVRQVCLG------FALLPSD---PNSILLGNVQQRGYEVH 464
F G E+ V G ++ V + +G F SD + ++G+ Q+ +
Sbjct: 371 VFEGA---EIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIE 427
Query: 465 YDVAGRRLGFGPGNCN 480
+D+ R+G P C+
Sbjct: 428 FDLQRSRIGLAPVECD 443
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 109/423 (25%), Positives = 190/423 (44%), Gaps = 52/423 (12%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQY 144
L+E+ RD+ R + R LQ ++ + P + G+ Y+ V +G P +
Sbjct: 30 LDELKARDRVR----HGRFLQSSVGVVDFPVEGTYDPYRVGL-----YFTRVLLGSPPKE 80
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKILLEWF 199
+ +DTGS + W C C C Q FFDP S T S I C+ C + ++
Sbjct: 81 FYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLISCSDQRCSLGVQ-- 138
Query: 200 PPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF-ARYPFLLGCTDN 256
+ CSS+ +C Y Y DGSG +G++ +D + + G+ + + GC+ +
Sbjct: 139 --SSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSSASIVFGCSIS 196
Query: 257 NTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPYGSTGYITFGKPDT 307
TGD GI G + +S+IS+ + F +CL G G + G+
Sbjct: 197 QTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGGGILVLGE--- 253
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF---TKLSTEIDSGTIITR 364
+ ++ + Y+P+V P Q Y++ L ISV G+ L + F T T +DSGT +
Sbjct: 254 IVEEDIVYSPLV--PSQPH-YNLNLQSISVNGKSLAIDPEVFATSTNRGTIVDSGTTLAY 310
Query: 365 FPAPVYSALRSAFRKRMKKYK---MGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
Y SA + + + + KG + CY +++ + P ++++F GGV +
Sbjct: 311 LAEEAYDPFVSAITEAVSQSVRPLLSKGTQ-----CYLITSSVKGIFPTVSLNFAGGVSM 365
Query: 422 ELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
L L+ + C+GF + +I LG++ + YD+AG+R+G+
Sbjct: 366 NLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITI-LGDLVLKDKIFVYDLAGQRIGWANY 424
Query: 478 NCN 480
+C+
Sbjct: 425 DCS 427
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 104/359 (28%), Positives = 161/359 (44%), Gaps = 22/359 (6%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
+Y + + +G P V L+DTGS + W QC PC C +Q+ P F+P +S T++ IPC+S
Sbjct: 49 DYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSPMFEPLRSNTYTPIPCDSE 108
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C L G K C Y AY D S G A + +T +G +
Sbjct: 109 ECNSLF------GHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVG-DIV 161
Query: 251 LGCTDNNTGDQN-GASGIMGLDRGPVSIISKTNISY----FFYCL---HSPYGSTGYITF 302
GC +N+G N GI+GL GP+S++S+ Y F CL H+ + G I+F
Sbjct: 162 FGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPHTLGTISF 221
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKAS-YFTKLSTEIDSGTI 361
G V+ + V TP+V+ Q+ Y +TL GISVG + +S +K + IDSGT
Sbjct: 222 GDASDVSGEGVAATPLVSEEGQTP-YLVTLEGISVGDTFVSFNSSEMLSKGNIMIDSGTP 280
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
T P Y L + + + + CY + + P + HF G D+
Sbjct: 281 ATYLPQEFYDRLVKELKVQSNMLPIDDDPDLGTQLCY--RSETNLEGPILIAHF-EGADV 337
Query: 422 ELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+L T + C FA+ + + GN Q + +D+ + + F +C+
Sbjct: 338 QLMPIQTFIPPKDGVFC--FAMAGTTDGEYIFGNFAQSNVLIGFDLDRKTVSFKATDCS 394
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 103/419 (24%), Positives = 177/419 (42%), Gaps = 49/419 (11%)
Query: 89 LRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLL 148
L + + R HL+++R LQ + F+ + Y+ V +G P + ++
Sbjct: 42 LAQLRARDHLRHARLLQGFV----GGVVDFSVQGSSDPYLVGLYFTRVKLGTPPREFNVQ 97
Query: 149 LDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
+DTGS + W C C +C Q +FD + S T +PC+ C ++
Sbjct: 98 IDTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLVPCSHPICTSQIQ----TT 153
Query: 204 QDKC--SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR--YPFLLGCTDNNTG 259
+C S +C Y Y DGSG +G++ +D V G A + GC+ +G
Sbjct: 154 ATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIANSSAAIVFGCSTYQSG 213
Query: 260 D----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPYGSTGYITFGKPDTVNK 310
D GI G +G +S+IS+ + F +CL G + G+ + +
Sbjct: 214 DLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLKGEDSGGGILVLGE---ILE 270
Query: 311 KFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS---TEIDSGTIITRFPA 367
+ Y+P+V P Q Y++ L I+V G+ LP+ + F S T ID+GT +
Sbjct: 271 PGIVYSPLV--PSQPH-YNLDLQSIAVSGQLLPIDPAAFATSSNRGTIIDTGTTLAYLVE 327
Query: 368 PVYSALRSAFRKRMKKYK---MGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELD 424
Y SA + + + KG + CY +S + V P ++ +F GG + L
Sbjct: 328 EAYDPFVSAITAAVSQLATPTINKG-----NQCYLVSNSVSEVFPPVSFNFAGGATMLLK 382
Query: 425 VRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L+ C+GF + +LG++ + YD+A +R+G+ +C
Sbjct: 383 PEEYLMYLTNYAGAALWCIGFQKIQGGIT--ILGDLVLKDKIFVYDLAHQRIGWANYDC 439
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 99/379 (26%), Positives = 168/379 (44%), Gaps = 43/379 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ V +G P ++ + +DTGS + W C+PC C ++ +DP +S T S +
Sbjct: 29 YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 88
Query: 187 CNSTTCKILLEWFPPNGQDKCS--SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF 244
C+ C + + +CS + C Y +Y DGS G++ D M ++ NG
Sbjct: 89 CSDPLCVRGRRF----AEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLA 144
Query: 245 -ARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSI----ISKTNISYFF-YCLHSPY 294
L GC+ TGD Q GI+G + +S+ ++ NI F +CL
Sbjct: 145 NTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLE--- 201
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLST 354
G + + + YTP+V S Y++ L GISV RLP+ A F+ +
Sbjct: 202 GEKRGGGILVIGGIAEPGMTYTPLV---PDSVHYNVVLRGISVNSNRLPIDAEDFSSTND 258
Query: 355 E---IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
+DSGT + FP+ Y+ A R+ + ++ + C+ +S + + P +
Sbjct: 259 TGVIMDSGTTLAYFPSGAYNVFVQAIREATSATPV--RVQGMDTQCFLVSGRLSDLFPNV 316
Query: 412 TIHFLGGV-----DLELDVRGTLVVESVRQVCLGF-----ALLPSDPNSI-LLGNVQQRG 460
T++F GG D L GT + C+G+ + P D + + +LG++ +
Sbjct: 317 TLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKD 376
Query: 461 YEVHYDVAGRRLGFGPGNC 479
V YD+ R+G+ NC
Sbjct: 377 KLVVYDLDNSRIGWMSYNC 395
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 95/360 (26%), Positives = 148/360 (41%), Gaps = 35/360 (9%)
Query: 129 ADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCN 188
A Y IG P Q VS LD S + WT C F+P +S T + +PC
Sbjct: 97 AGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP--------FNPVRSTTVADVPCT 148
Query: 189 STTCKILLEWFPPN---GQDKCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYF 244
C + F P S EC Y Y G+ T G T+ T + +G
Sbjct: 149 DDAC----QQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTFGDTRIDG-- 202
Query: 245 ARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL--HSPYGSTGYITF 302
+ GC N GD +G SG++GL RG +S++S+ + F Y + +I F
Sbjct: 203 ----VVFGCGLQNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAPDDSVDTQSFILF 258
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGT-- 360
G T T ++ + Y++ L GI V G+ L + + F + + G
Sbjct: 259 GDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFL 318
Query: 361 ----IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFL 416
++T Y LR A ++ + G D CY + VP + + F
Sbjct: 319 SITDLVTVLEEAAYKPLRQAVASKIGLPAV-NGSALGLDLCYTGESLAKAKVPSMALVFA 377
Query: 417 GGVDLELDVRGTLVVESVRQV-CLGFALLPSDP-NSILLGNVQQRGYEVHYDVAGRRLGF 474
GG +EL++ ++S + CL +LPS + +LG++ Q G + YD+ G +L F
Sbjct: 378 GGAVMELELGNYFYMDSTTGLACL--TILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVF 435
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 173/383 (45%), Gaps = 54/383 (14%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPF-----FDPSKSKTFSKIPCN 188
+ + +G P Q V++++DTGS ++W +HC+ ++ F+P S ++S IPC+
Sbjct: 75 VSLTVGTPPQNVTMVIDTGSELSW------LHCNTSQNSSSSSSTFNPVWSSSYSPIPCS 128
Query: 189 STTCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY 247
S+TC FP + C S + C ++Y D S G ATD I G
Sbjct: 129 SSTCTDQTRDFPI--RPSCDSNQFCHATLSYADASSSEGNLATDTFYI------GSSGIP 180
Query: 248 PFLLGCTD----NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFG 303
+ GC D +N+ + + +G+MG++RG +S +S+ F YC+ S Y +G + G
Sbjct: 181 NVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCI-SEYDFSGLLLLG 239
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEF-----YHITLTGISVGGERLPLKASYFT-----KLS 353
+ + YTP++ + Y + L GI V + LP+ S F
Sbjct: 240 DANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQ 299
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKR----MKKYKMGKGI-EDLFDTCYDLSAYKTVV- 407
T +DSGT T P Y+ALR F + ++ Y+ + + D CY + +T +
Sbjct: 300 TMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLP 359
Query: 408 -VPKITIHFLGGVDLELDVRGTLVVESVRQVCLG------FALLPSD---PNSILLGNVQ 457
+P +T+ F G E+ V G ++ V G F SD + ++G++
Sbjct: 360 PLPSVTLVFRGA---EMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLH 416
Query: 458 QRGYEVHYDVAGRRLGFGPGNCN 480
Q+ + +D+ R+G C+
Sbjct: 417 QQNVWMEFDLKKSRIGLAEIRCD 439
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 109/395 (27%), Positives = 172/395 (43%), Gaps = 46/395 (11%)
Query: 114 KTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPF 173
KT+ T P K I + IG P Q V+++LDTGS ++W CK + +
Sbjct: 41 KTQTQTPPRKLAFQHNVTLTISLTIGSPPQNVTMVLDTGSELSWLHCKKLPNLNST---- 96
Query: 174 FDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRM 233
F+P S +++ PCNS+ C ++K C ++Y D S G A +
Sbjct: 97 FNPLLSSSYTPTPCNSSVCMTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETF 156
Query: 234 TIQEVNGNGYFARYPFLLGCTDNN--TGDQN---GASGIMGLDRGPVSIISKTNISYFFY 288
++ G L GC D+ T D N +G+MG++RG +S++++ + F Y
Sbjct: 157 SLAGAAQPGT------LFGCMDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQMVLPKFSY 210
Query: 289 CLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEF-----YHITLTGISVGGERLP 343
C+ S + G + G + ++YTP+VT S + Y + L GI V + L
Sbjct: 211 CI-SGEDAFGVLLLGDGPSAPSP-LQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQ 268
Query: 344 LKASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED------ 392
L S F T +DSGT T PVY++L+ F ++ K + IED
Sbjct: 269 LPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYNSLKDEFLEQTK--GVLTRIEDPNFVFE 326
Query: 393 -LFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESV---RQVCLGFALLPSDP 448
D CY A VP +T+ F G E+ V G ++ V R F SD
Sbjct: 327 GAMDLCYHAPA-SLAAVPAVTLVFSGA---EMRVSGERLLYRVSKGRDWVYCFTFGNSDL 382
Query: 449 NSI---LLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
I ++G+ Q+ + +D+ R+GF C+
Sbjct: 383 LGIEAYVIGHHHQQNVWMEFDLVKSRVGFTETTCD 417
>gi|194702702|gb|ACF85435.1| unknown [Zea mays]
gi|414885969|tpg|DAA61983.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 163
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 64/159 (40%), Positives = 87/159 (54%), Gaps = 3/159 (1%)
Query: 323 EQSEFYHITLTGISVGGERLPLKASYF-TKLSTEIDSGTIITRFPAPVYSALRSAFRKRM 381
+ FY++ LTGI+V G + + S F T T IDSGT + P Y+ALRS+ R M
Sbjct: 5 QHPSFYYLNLTGITVAGRAIKVPPSVFATAAGTIIDSGTAFSCLPPSAYAALRSSVRSAM 64
Query: 382 KKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVES-VRQVCLG 440
+YK +FDTCYDL+ ++TV +P + + F G + L G L S V Q CL
Sbjct: 65 GRYKRAPS-STIFDTCYDLTGHETVRIPSVALVFADGATVHLHPSGVLYTWSNVSQTCLA 123
Query: 441 FALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
F P D + +LGN QQR V YDV +++GFG C
Sbjct: 124 FLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANGC 162
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 101/362 (27%), Positives = 160/362 (44%), Gaps = 67/362 (18%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
EY + ++IG P V + DTGS + WTQC PC+ C +Q++P FDPSKS +F ++ C S
Sbjct: 23 EYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQ 82
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C++L + P I + +
Sbjct: 83 QCRLL---------------DTPTSILNI------------------------------V 97
Query: 251 LGCTDNNTGDQN-GASGIMGLDRGPVSIISKTNISY-----FFYCLHSPYGS----TGYI 300
GC NN+G N G+ G P+S+ S+ + F CL P+ + T I
Sbjct: 98 FGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCL-VPFRTDPSITSKI 156
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKAS--YFTKLSTEIDS 358
FG V+ V TP+VT + +Y +TL GISVG + P +S TK + ID+
Sbjct: 157 IFGPEAEVSGSDVVSTPLVTK-DDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDA 215
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYKTVVVPKITIHFLG 417
GT T P Y+ L ++ + + DL CY + + P +T HF
Sbjct: 216 GTPPTLLPRDFYNRLVQGVKEAIPMEPVQD--PDLQPQLCY--RSATLIDGPILTAHF-D 270
Query: 418 GVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
G D++L T + S ++ FA+ P D ++ + GN Q + + +D+ G+++ F
Sbjct: 271 GADVQLKPLNTFI--SPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAV 328
Query: 478 NC 479
+C
Sbjct: 329 DC 330
>gi|297605070|ref|NP_001056627.2| Os06g0118000 [Oryza sativa Japonica Group]
gi|55296430|dbj|BAD68553.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|215692556|dbj|BAG87976.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255676664|dbj|BAF18541.2| Os06g0118000 [Oryza sativa Japonica Group]
Length = 175
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/165 (39%), Positives = 90/165 (54%), Gaps = 8/165 (4%)
Query: 316 TPIVTTPEQSE-FYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALR 374
TP++++ S FY + L I V G LP+ + F+ S+ IDS T+I+R P Y ALR
Sbjct: 18 TPLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVFSA-SSVIDSATVISRIPPTAYQALR 76
Query: 375 SAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESV 434
+AFR M Y+ + + DTCYD S +++ +P I + F GG + LD G L+
Sbjct: 77 AAFRSAMTMYRPAPPVS-ILDTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILL---- 131
Query: 435 RQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
Q CL FA SD +GNVQQR EV YDV G+ + F C
Sbjct: 132 -QGCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 175
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 111/390 (28%), Positives = 164/390 (42%), Gaps = 53/390 (13%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKP---CIHC------SQQRDPFFDPSKSKTF 182
Y + ++ G P Q +S ++DTGS I W C C HC R F P +S +
Sbjct: 67 YSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSS 126
Query: 183 SKIPCNSTTCKILLEWFPPNGQDKCSSKEC------PYDIAYVDGSGETGFWATDRMTIQ 236
+ C + C + QD CS K C PY I Y GSG TG A +
Sbjct: 127 KLLGCKNPKCSWIHHSNINCDQD-CSIKSCLNQTCPPYMIFY--GSGTTGGVA-----LS 178
Query: 237 EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHS---- 292
E ++ FL+GC+ + +GI G RG S+ S+ + F YCL S
Sbjct: 179 ETLHLHSLSKPNFLVGCS---VFSSHQPAGIAGFGRGLSSLPSQLGLGKFSYCLLSHRFD 235
Query: 293 ---PYGSTGYITFGKPDTVNK-KFVKYTPIVTTPEQ------SEFYHITLTGISVGGERL 342
S+ + + D+ K + YTP V P+ S +Y++ L I+VGG +
Sbjct: 236 DDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGHHV 295
Query: 343 PLKASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDT- 396
+ Y + IDSGT T + L F +++K Y+ K IED
Sbjct: 296 KVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIGLR 355
Query: 397 -CYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALL-PSDPNSI--- 451
C+++S KTV P++ ++F GG D+ L V CL + P +
Sbjct: 356 PCFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVGGEVACLTVVTDGVAGPERVGGP 415
Query: 452 --LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+LGN Q + + V YD+ RLGF C
Sbjct: 416 GMILGNFQMQNFYVEYDLRNERLGFKQEKC 445
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 88/307 (28%), Positives = 139/307 (45%), Gaps = 29/307 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQR--DPFFDPSKSKTFSKIPCNS 189
+ + ++G+P ++DTGS + W QC+PC HCS P F+P+ S TF + C+
Sbjct: 96 FLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDD 155
Query: 190 TTCKILLEWFPPNGQDKC-SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
C+ + PNG C SS +C Y+ Y+ G+G G A +R+T NGN + P
Sbjct: 156 RFCR-----YAPNGH--CGSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQ-P 207
Query: 249 FLLGC-TDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL----HSPYGSTGYITFG 303
GC +N ++ +GI+GL P S+ + S F YC+ + YG +
Sbjct: 208 IAFGCGYENGEQLESHFTGILGLGAKPTSLAVQLG-SKFSYCIGDLANKNYGYNQLVLGE 266
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE----IDSG 359
D + TPI E S Y++ L GISVG +L ++ F + +DSG
Sbjct: 267 DADILGDP----TPIEFETENS-IYYMNLEGISVGDTQLNIEPVVFKRRGPRTGVILDSG 321
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV-VPKITIHFLGG 418
T+ T Y L + + + D CY + ++ P +T HF GG
Sbjct: 322 TLYTWLADIAYRELYNEIKSILDPKLERFWFRDFL--CYHGRVSEELIGFPVVTFHFAGG 379
Query: 419 VDLELDV 425
+L ++
Sbjct: 380 AELAMEA 386
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 102/416 (24%), Positives = 182/416 (43%), Gaps = 48/416 (11%)
Query: 91 RDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLD 150
RD+ R H + R + + D + + P G+ YY V +G P + ++ +D
Sbjct: 45 RDRAR-HARMLRGVAGGVVD--FSVQGTSDPNSVGL-----YYTKVKMGTPPKEFNVQID 96
Query: 151 TGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQD 205
TGS I W C C +C Q FFD S T + IPC+ C ++
Sbjct: 97 TGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSDPICTSRVQ----GAAA 152
Query: 206 KCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGN--GYFARYPFLLGCTDNNTGD- 260
+CS + +C Y Y DGSG +G++ +D M + G + + GC+ + +GD
Sbjct: 153 ECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAVNSSATIVFGCSISQSGDL 212
Query: 261 ---QNGASGIMGLDRGPVSIISKTNI-----SYFFYCLHSPYGSTGYITFGKPDTVNKKF 312
GI G GP+S++S+ + F +CL G + G+ + +
Sbjct: 213 TKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKGDGDGGGVLVLGE---ILEPS 269
Query: 313 VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT----KLSTEIDSGTIITRFPAP 368
+ Y+P+V P Q Y++ L I+V G+ LP+ + F+ + T +D GT +
Sbjct: 270 IVYSPLV--PSQPH-YNLNLQSIAVNGQLLPINPAVFSISNNRGGTIVDCGTTLAYLIQE 326
Query: 369 VYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGT 428
Y L +A + + + + CY +S + P ++++F GG + L
Sbjct: 327 AYDPLVTAINTAVS--QSARQTNSKGNQCYLVSTSIGDIFPSVSLNFEGGASMVLKPEQY 384
Query: 429 LV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L+ ++ C+GF + +LG++ + V YD+A +R+G+ +C+
Sbjct: 385 LMHNGYLDGAEMWCIGFQKFQEGAS--ILGDLVLKDKIVVYDIAQQRIGWANYDCS 438
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 106/392 (27%), Positives = 159/392 (40%), Gaps = 60/392 (15%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQ--------QRDPFFDPSKSKTFS 183
Y I + G P Q ++DTGS + W C CS+ P F P +S + +
Sbjct: 92 YSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSN 151
Query: 184 KIPCNSTTCKILLEWFPPNGQDKC-----SSKEC-----PYDIAYVDGSGETGFWATDRM 233
I C + C L F P Q KC +++ C PY I Y G +T +
Sbjct: 152 LIGCKNHKCSWL---FGPKVQSKCQECDPTTQNCTQSCPPYVIQY-------GLGSTAGL 201
Query: 234 TIQEVNGNGYFARYP-FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHS 292
+ E + P FL+GC+ + GI G R P S+ S+ + F YCL S
Sbjct: 202 LLSETLDFPHKKTIPGFLVGCSLFSI---RQPEGIAGFGRSPESLPSQLGLKKFSYCLVS 258
Query: 293 ------PYGSTGYITFGK-PDTVNKKFVKYTPIVTTPEQS--EFYHITLTGISVGGERLP 343
P S + G D + YTP P + ++Y++ L I +G +
Sbjct: 259 HAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTHVK 318
Query: 344 LKASYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--FDT 396
+ + S T +DSGT T PVY + F K++ Y + +++
Sbjct: 319 VPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRP 378
Query: 397 CYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNS------ 450
C+++S K+V VP+ HF GG + L + +CL SD S
Sbjct: 379 CFNISGEKSVSVPEFIFHFKGGAKMALPLANYFSFVDSGVICLTIV---SDNMSGSGIGG 435
Query: 451 ---ILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
I+LGN QQR + V +D+ R GF NC
Sbjct: 436 GPAIILGNYQQRNFHVEFDLKNERFGFKQQNC 467
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 113/423 (26%), Positives = 186/423 (43%), Gaps = 53/423 (12%)
Query: 87 EILR-RDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYV 145
E+LR RDQ R H + R + + D FT + Y+ V +G P +
Sbjct: 48 EVLRARDQAR-HGRLLRGVVGGVVD-------FTVYGTSDPYLVGLYFTKVKLGSPPREF 99
Query: 146 SLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKILLEWFP 200
++ +DTGS I W C C C + FFDPS S T S + C+ C L++
Sbjct: 100 NVQIDTGSDILWVTCNSCNDCPRTSGLGIELSFFDPSSSSTTSLVSCSHPICTSLVQ--- 156
Query: 201 PNGQDKCS--SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR--YPFLLGCTDN 256
+CS S +C Y Y DGSG TG++ +D + V G+ A + GC+
Sbjct: 157 -TTAAECSPQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTVLGDSLIANSSASIVFGCSTY 215
Query: 257 NTGD----QNGASGIMGLDRGPVSIISKTNI-----SYFFYCLHSPYGSTGYITFGKPDT 307
+GD GI G + +S++S+ + F +CL G + G+
Sbjct: 216 QSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCLKGEGDGGGKLVLGE--- 272
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT---KLSTEIDSGTIITR 364
+ + + Y+P+V P QS Y++ L ISV G+ LP+ + F T +DSGT +T
Sbjct: 273 ILEPNIIYSPLV--PSQSH-YNLNLQSISVNGQLLPIDPAVFATSNNQGTIVDSGTTLTY 329
Query: 365 FPAPVYSALRSAFRKRMKKYK---MGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
Y SA + + KG + CY +S + P ++++F GG +
Sbjct: 330 LVETAYDPFVSAITATVSSSTTPVLSKG-----NQCYLVSTSVDEIFPPVSLNFAGGASM 384
Query: 422 ELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
L L+ + C+GF + ++P +LG++ + YD+A +R+G+
Sbjct: 385 VLKPGEYLMHLGFSDGAAMWCIGFQKV-AEPGITILGDLVLKDKIFVYDLAHQRIGWANY 443
Query: 478 NCN 480
+C+
Sbjct: 444 DCS 446
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 170/376 (45%), Gaps = 50/376 (13%)
Query: 141 PKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPF--FDPSKSKTFSKIPCNSTTCKILL-E 197
P Q +S+++DTGS ++W +C S +P FDP++S ++S IPC+S TC+ +
Sbjct: 82 PPQNISMVIDTGSELSWLRCNR----SSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137
Query: 198 WFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNN 257
+ P D S K C ++Y D S G A + + + GC +
Sbjct: 138 FLIPASCD--SDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNL-----IFGCMGSV 190
Query: 258 TG----DQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFV 313
+G + +G++G++RG +S IS+ F YC+ G++ G + +
Sbjct: 191 SGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPL 250
Query: 314 KYTPI--VTTPE---QSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIIT 363
YTP+ ++TP Y + LTGI V G+ LP+ S T +DSGT T
Sbjct: 251 NYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTMVDSGTQFT 310
Query: 364 RFPAPVYSALRSAFRKR----MKKYKMGKGI-EDLFDTCYDLSAYKTVV-----VPKITI 413
PVY+ALRS F + + Y+ + + + D CY +S ++ +P +++
Sbjct: 311 FLLGPVYTALRSDFLNQTNGILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLPTVSL 370
Query: 414 HFLGGVDLELDVRGTLVVESVRQVCLG------FALLPSD---PNSILLGNVQQRGYEVH 464
F G E+ V G ++ V + G F SD + ++G+ Q+ +
Sbjct: 371 VFEGA---EIAVSGQPLLYRVPHLTAGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIE 427
Query: 465 YDVAGRRLGFGPGNCN 480
+D+ R+G P C+
Sbjct: 428 FDLQRSRIGLAPVQCD 443
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 113/446 (25%), Positives = 187/446 (41%), Gaps = 66/446 (14%)
Query: 75 NQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYI 134
+G+S L +R + L + RL+ F+ + T P
Sbjct: 18 GEGRSPAGTVLPLQVRVQEVELEAPAANRLR------FRHNVSLTVP------------- 58
Query: 135 VVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKI 194
VA+G P Q V+++LDTGS ++W C + P F+ S S ++ +PC ST C+
Sbjct: 59 -VAVGTPPQNVTMVLDTGSELSWLLCNGSY--APPLTPAFNASGSSSYGAVPCPSTACEW 115
Query: 195 LLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQ----EVNGNGYFA---R 246
P P D S C ++Y D S G ATD + V YF
Sbjct: 116 RGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPPVAVGAYFGCITS 175
Query: 247 YPFLLGCTDNNTGD--QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGK 304
Y N TG A+G++G++RG +S +++T F YC+ +P G + G
Sbjct: 176 YSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCI-APGEGPGVLLLGD 234
Query: 305 PDTVNKKFVKYTPIVTTPEQSEF-----YHITLTGISVGGERLPLKASYFT-----KLST 354
V YTP++ + + Y + L GI VG LP+ S T T
Sbjct: 235 DGGVAPPL-NYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTGAGQT 293
Query: 355 EIDSGTIITRFPAPVYSALRSAF--RKRMKKYKMGKG---IEDLFDTCY----DLSAYKT 405
+DSGT T A Y+AL++ F + R+ +G+ + FD C+ A +
Sbjct: 294 MVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPGFVFQGAFDACFRGPEARVAAAS 353
Query: 406 VVVPKITIHFLGGVDLEL-----------DVRGTLVVESVRQVCLGFALLPSDPNSILLG 454
++P + + L G ++ + + RG E+V + G + + + ++ ++G
Sbjct: 354 GLLPVVGL-VLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDM-AGMSAYVIG 411
Query: 455 NVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ Q+ V YD+ R+GF P C+
Sbjct: 412 HHHQQNVWVEYDLQNGRVGFAPARCD 437
>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
Length = 372
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 119/403 (29%), Positives = 175/403 (43%), Gaps = 61/403 (15%)
Query: 91 RDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTG--IVAADEYYIVVAIGKPKQYVSLL 148
+D+ RL +S +K++ P +G IV Y + IG P Q + +
Sbjct: 4 KDKARLQFLSSLVARKSV-----------VPIASGRQIVQNPTYIVRAKIGTPAQTMLMA 52
Query: 149 LDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCS 208
+DT S + W C C+ CS F+ S T+ + C + CK + + C
Sbjct: 53 MDTSSDVAWIPCNGCLGCSST---LFNSPASTTYKSLGCQAAQCKQV-------PKPTCG 102
Query: 209 SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIM 268
C +++ Y GS + D +T+ GY GC TG A G++
Sbjct: 103 GGVCSFNLTY-GGSSLAANLSQDTITLATDAVPGYS------FGCIQKATGGSLPAQGLL 155
Query: 269 GLDRGPVSIISKTNISY---FFYCLHS-----PYGSTGYITFGKPDTVNKKFVKYTPIVT 320
GL RGP+S++S+T Y F YCL S GS G+P K +KYTP++
Sbjct: 156 GLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQP-----KRIKYTPLLK 210
Query: 321 TPEQSEFYHITLTGISVGGERLPLKASYF-----TKLSTEIDSGTIITRFPAPVYSALRS 375
P + Y + L + VG + + F T T DSGT+ TR P Y A+R
Sbjct: 211 NPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRD 270
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLG-GVDLELDVRGTLVVESV 434
AFR R+ + + FDTCY + + P IT F G V L D L++ S
Sbjct: 271 AFRNRVGRNLTVTSLGG-FDTCYTVP----IAAPTITFMFTGMNVTLPPD---NLLIHST 322
Query: 435 --RQVCLGFALLPSDPNSIL--LGNVQQRGYEVHYDVAGRRLG 473
CL A P + NS+L + N+QQ+ + + YDV RLG
Sbjct: 323 AGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLG 365
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 149/357 (41%), Gaps = 33/357 (9%)
Query: 129 ADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCN 188
A Y IG P Q VS LD S + WT C F+P +S T + +PC
Sbjct: 97 AGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP--------FNPVRSTTVADVPCT 148
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARY 247
C + F P +S EC Y Y G+ T G T+ T + +G
Sbjct: 149 DDAC----QQFAPQTCGAGAS-ECAYTYMYGGGAANTTGLLGTEAFTFGDTRIDG----- 198
Query: 248 PFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL--HSPYGSTGYITFGKP 305
+ GC N GD +G SG++GL RG +S++S+ + F Y + +I FG
Sbjct: 199 -VVFGCGLKNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAPDDSVDTQSFILFGDD 257
Query: 306 DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGT----- 360
T T ++ + Y++ L GI V G+ L + + F + + G
Sbjct: 258 ATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSIT 317
Query: 361 -IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
++T Y LR A ++ + L D CY + VP + + F GG
Sbjct: 318 DLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGL-DLCYTGESLAKAKVPSMALVFAGGA 376
Query: 420 DLELDVRGTLVVESVRQV-CLGFALLPSDP-NSILLGNVQQRGYEVHYDVAGRRLGF 474
+EL++ ++S + CL +LPS + +LG++ Q G + YD+ G +L F
Sbjct: 377 VMELELGNYFYMDSTTGLACL--TILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVF 431
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 115/411 (27%), Positives = 180/411 (43%), Gaps = 47/411 (11%)
Query: 97 HLKNSRRLQKAIPDNFKK---TKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGS 153
H KNS ++ FK+ TK ++ ++ + + + IG P Q ++LDTGS
Sbjct: 41 HSKNSL-FSSSLASQFKQNPNTKTTSYNYRSSFKYSMALIVSLPIGTPPQTQQMVLDTGS 99
Query: 154 GITWTQCKPCIHCSQQRDP-FFDPSKSKTFSKIPCNSTTCKILL-EWFPPNGQDKCSSKE 211
++W QCK + P FDP S +FS +PCN + CK + ++ P D+ ++
Sbjct: 100 QLSWIQCK----VPPKTPPTAFDPLLSSSFSVLPCNHSLCKPRVPDYTLPTSCDQ--NRL 153
Query: 212 CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLD 271
C Y Y DG+ G ++ T P +LGC +++ Q GI+G++
Sbjct: 154 CHYSYFYADGTYAEGNLVREKFTFSSSQ-----TTPPLILGCATDSSDTQ----GILGMN 204
Query: 272 RGPVSIISKTNISYFFYCL---HSPYGS--TGYITFGKPDTVNKKFVKYTPIVTTPEQSE 326
G +S S IS F YC+ S GS TG G P+ + F KY ++T +
Sbjct: 205 LGRLSFSSLAKISKFSYCVPPRRSQSGSSPTGSFYLG-PNPSSAGF-KYVNLMTYRQSQR 262
Query: 327 F-------YHITLTGISVGGERLPLKASYFTKL-----STEIDSGTIITRFPAPVYSALR 374
Y + + GI + G++L + S F T IDSGT T YS ++
Sbjct: 263 MPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPSGAGQTLIDSGTWFTFLVDEAYSKVK 322
Query: 375 SAFRKRM-KKYKMGKGIEDLFDTCYDLSAYKT-VVVPKITIHFLGGVDLELDVRGTLVVE 432
K K K G D C+D A ++ + F GV++ ++ L
Sbjct: 323 EEIVKLAGPKLKKGYVYGGSLDMCFDGDAMVIGRMIGNMAFEFENGVEIVVEREKMLADV 382
Query: 433 SVRQVCLGFA---LLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
CLG LL N ++GN Q+ V +D+ GRR+GFG +C+
Sbjct: 383 GGGVQCLGIGRSDLLGVASN--IIGNFHQQDLWVEFDLVGRRVGFGRTDCS 431
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 104/377 (27%), Positives = 162/377 (42%), Gaps = 42/377 (11%)
Query: 128 AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKP-CI--HCSQQRDPFFDPSKSKTFSK 184
A +Y IG P Q L+DTGS + WTQC C+ C++Q P+++ S+S TF
Sbjct: 82 ATRQYIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQSSTFVP 141
Query: 185 IPCNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNG 242
+PC + NG C C + +Y G+G G T+ + +G
Sbjct: 142 VPCADKA-----GFCAANGVHLCGLDGSCTFIASY--GAGRVIGSLGTESFAFE--SGTT 192
Query: 243 YFARYPFLLGC---TDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGY 299
A GC T +G N ASG++GL RG +S++S+ + F YCL + S+G
Sbjct: 193 SLA-----FGCVSLTRITSGALNDASGLIGLGRGRLSLVSQIGATRFSYCLTPYFHSSGA 247
Query: 300 IT--FGKPDTVNKKFVKYTPIVTTPEQ---SEFYHITLTGISVGGERLPLKASYFTKL-- 352
+ F P V +P+ S FY++ L GI+VG RLP S +L
Sbjct: 248 SSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTTFQLRQ 307
Query: 353 --------STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL-FDTCYDLSAY 403
ID+G+ +T+ + Y AL+ ++ + ED + C +
Sbjct: 308 LFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLELCVAREGF 367
Query: 404 KTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEV 463
+ VVP + HF GG D+ + C+ +L +SI +GN QQ+ +
Sbjct: 368 QK-VVPALVFHFGGGADMAVPAASYWAPVDKAAACM--MILEGGYDSI-IGNFQQQDMHL 423
Query: 464 HYDVAGRRLGFGPGNCN 480
YD+ R F +C
Sbjct: 424 LYDLRRGRFSFQTADCT 440
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 104/402 (25%), Positives = 165/402 (41%), Gaps = 44/402 (10%)
Query: 103 RLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQC-K 161
R +KA + + FP + Y + + IG+P + L LDTGS +TW QC
Sbjct: 28 RWRKAADRFTRAASSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDA 87
Query: 162 PCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYVD 220
PC+HC + P + PS IPCN CK L NG +C + E C Y++ Y D
Sbjct: 88 PCVHCLEAPHPLYQPSN----DLIPCNDPLCKALHF----NGNHRCETPEQCDYEVEYAD 139
Query: 221 GSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGA---SGIMGLDRGPVSI 277
G G D ++ G R LGC + +G G++GL RG VSI
Sbjct: 140 GGSSLGVLVRDVFSLNYTKGLRLTPR--LALGCGYDQIPGASGHHPLDGVLGLGRGKVSI 197
Query: 278 ISKTNISYFF-----YCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITL 332
+S+ + + +CL S G G + FG D + V +TP+ E S+ Y +
Sbjct: 198 LSQLHSQGYVKNVVGHCLSSLGG--GILFFGN-DLYDSSRVSWTPMAR--ENSKHYSPAM 252
Query: 333 TG-ISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE 391
G + GG LK L T DSG+ T F + Y A+ ++ + + + +
Sbjct: 253 GGELLFGGRTTGLK-----NLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARD 307
Query: 392 DL-----------FDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLG 440
D F + ++ Y + + E+ L++ VCLG
Sbjct: 308 DHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLG 367
Query: 441 F--ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
N L+G++ + + YD + +G+ P +C+
Sbjct: 368 ILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWIPADCD 409
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 98/362 (27%), Positives = 148/362 (40%), Gaps = 39/362 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHC-----SQQRDPFFDPSKSKTFSKIP 186
Y + ++G P Q V+ +LD S W QC C C + P F S T ++
Sbjct: 97 YVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIREVR 156
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGE----TGFWATDRMTIQEVNGNG 242
C + C+ L+ CS+ + P +YV G G G A D V +G
Sbjct: 157 CANRGCQRLVP-------QTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADG 209
Query: 243 YFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL--HSPYGSTGYI 300
+ GC GD G++GL RG +S++S+ I F Y L +I
Sbjct: 210 ------VIFGCAVATEGD---IGGVIGLGRGELSLVSQLQIGRFSYYLAPDDAVDVGSFI 260
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGT 360
F TP+V Y++ L GI V GE L + F L + G
Sbjct: 261 LFLDDAKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTF-DLQADGSGGV 319
Query: 361 I------ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
+ +T A Y +R A ++ + G E D CY + T VP + +
Sbjct: 320 VLSITIPVTFLDAGAYKVVRQAMASKI-GLRAADGSELGLDLCYTSESLATAKVPSMALV 378
Query: 415 FLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDP-NSILLGNVQQRGYEVHYDVAGRRL 472
F GG +EL++ ++S + CL +LPS + LLG++ Q G + YD++G RL
Sbjct: 379 FAGGAVMELEMGNYFYMDSTTGLECL--TILPSPAGDGSLLGSLIQVGTHMIYDISGSRL 436
Query: 473 GF 474
F
Sbjct: 437 VF 438
>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
gi|194689376|gb|ACF78772.1| unknown [Zea mays]
gi|224031455|gb|ACN34803.1| unknown [Zea mays]
gi|238011528|gb|ACR36799.1| unknown [Zea mays]
gi|238015454|gb|ACR38762.1| unknown [Zea mays]
Length = 304
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 94/318 (29%), Positives = 145/318 (45%), Gaps = 41/318 (12%)
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTI-QEVNGNGYF 244
C T C +L C + C Y Y DG+ G +AT+R T G
Sbjct: 3 CAGTLCSDIL-------HHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTT 55
Query: 245 ARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHS---------PYG 295
P GC N G N SGI+G R P+S++S+ +I F YCL S +G
Sbjct: 56 TTVPLGFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTSYASRRQSTLLFG 115
Query: 296 STGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT----- 350
S +G D + V+ TP++ +P+ FY++ TG++VG RL + S F
Sbjct: 116 SLSDGVYG--DATGR--VQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDG 171
Query: 351 KLSTEIDSGTIITRFPAPVYSALRSAFRKRMK-KYKMGKGIEDLFDTCYDL-------SA 402
+DSGT +T PA V + + AFR++++ + G ED C+ + S+
Sbjct: 172 SGGVIVDSGTALTLLPAAVLAEVVRAFRQQLRLPFANGGNPED--GVCFLVPAAWRRSSS 229
Query: 403 YKTVVVPKITIHFLGGVDLELDVRGTLVVESVR-QVCLGFALLPSDPNSILLGNVQQRGY 461
+ VP++ +HF G DL+L R ++ + R ++CL A D ++I GN+ Q+
Sbjct: 230 TSQMPVPRMVLHF-QGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTI--GNLVQQDM 286
Query: 462 EVHYDVAGRRLGFGPGNC 479
V YD+ L P C
Sbjct: 287 RVLYDLEAETLSIAPARC 304
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 105/366 (28%), Positives = 154/366 (42%), Gaps = 39/366 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + IG P Q +L++DTGS +T+ C C HC + +DP F P S+T+ + C
Sbjct: 89 YTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKCT--- 145
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
P+ + +C YD Y + S +G D ++ ++ A +
Sbjct: 146 ---------PDCNCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLSE---LAPQRAVF 193
Query: 252 GCTDNNTGD--QNGASGIMGLDRGPVSI----ISKTNISYFFYCLHS--PYGSTGYITFG 303
GC ++ TGD A GIMGL RG +SI + K IS F + G I G
Sbjct: 194 GCENDETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMILGG 253
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGTII 362
+ F + P++S +Y+I L + V G++L L F K T +DSGT
Sbjct: 254 ISPPEDMVFTH-----SDPDRSPYYNINLKEMHVAGKKLQLNPKVFDGKHGTVLDSGTTY 308
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCY-----DLSAYKTVVVPKITIHFL 416
P + A + A K K G + + D C+ D+S P + + F
Sbjct: 309 AYLPETAFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAK-SFPVVDMVFE 367
Query: 417 GGVDLELDVRGTLVVES-VRQV-CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
G L L L S VR CLG DP + LLG + R V YD ++GF
Sbjct: 368 NGHKLSLSPENYLFRHSKVRGAYCLGVFSNGRDPTT-LLGGIFVRNTLVMYDRENSKIGF 426
Query: 475 GPGNCN 480
NC+
Sbjct: 427 WKTNCS 432
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 108/384 (28%), Positives = 156/384 (40%), Gaps = 41/384 (10%)
Query: 120 FPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFF 174
P TG+ YY + IG P + + +DTGS I W C C C ++ +
Sbjct: 82 LPTDTGL-----YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLY 136
Query: 175 DPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMT 234
DP S T SK+ C+ C P +S C Y + Y DGS TG++ +D +
Sbjct: 137 DPKDSSTGSKVSCDQGFCAATYGGLLPG---CTTSLPCEYSVTYGDGSSTTGYFVSDLLQ 193
Query: 235 IQEVNGNGYF--ARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS---- 284
+V+G+G A GC GD GI+G + S++S+ + +
Sbjct: 194 FDQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVK 253
Query: 285 -YFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLP 343
F +CL + G F + V K VK TP+V P Y++ L I VGG L
Sbjct: 254 KIFAHCLDTINGGG---IFAIGNVVQPK-VKTTPLV--PNMPH-YNVNLKSIDVGGTALK 306
Query: 344 LKASYF---TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL 400
L + F K T IDSGT +T P VY + A + K E L C+
Sbjct: 307 LPSHMFDTGEKKGTIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHNVQEFL---CFQY 363
Query: 401 SAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGF---ALLPSDPNS-ILLGNV 456
PKIT HF + L + C+GF L D +LLG++
Sbjct: 364 VGRVDDDFPKITFHFENDLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDL 423
Query: 457 QQRGYEVHYDVAGRRLGFGPGNCN 480
V YD+ + +G+ NC+
Sbjct: 424 VLSNKLVVYDLENQVIGWTEYNCS 447
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 108/399 (27%), Positives = 164/399 (41%), Gaps = 66/399 (16%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKP---CIHCS-----QQRDPFFDPSKSKTFS 183
Y + +++G P Q V L++DTGS + W C C C+ + P F P S +
Sbjct: 84 YSMSLSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSK 143
Query: 184 KIPCNSTTCKILLEW-FPPNGQDKC-----SSKEC-----PYDIAYVDGSGETGFWATDR 232
I C + C W F + Q KC ++ C PY I Y G +T
Sbjct: 144 LIGCKNPKC----AWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQY-------GLGSTAG 192
Query: 233 MTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL-- 290
+ + E FL GC+ +T GI G R S+ + + F YCL
Sbjct: 193 LLLSETINFPNKTISDFLAGCSLLST---RQPEGIAGFGRSQESLPLQLGLKKFSYCLVS 249
Query: 291 ----HSPYGSTGYITFGKPDTVNKKF--VKYTPI------VTTPEQSEFYHITLTGISVG 338
SP S + G P T + K + YTP + P E+Y++ L I VG
Sbjct: 250 RRFDDSPVSSDLILDMG-PSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVG 308
Query: 339 GERLPLKASYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL 393
+ + S+ S T +DSG+ T V+ L F K+M Y + ++ L
Sbjct: 309 KTHVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQKL 368
Query: 394 --FDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCL-----------G 440
C+D+S K+VV+P +T F GG ++L + + VCL G
Sbjct: 369 TGLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAFVDMGVVCLTIVSDNAAALGG 428
Query: 441 FALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ S +I+LGN QQ+ + + YD+ R GF +C
Sbjct: 429 DGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSC 467
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 108/369 (29%), Positives = 163/369 (44%), Gaps = 60/369 (16%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
YY + +G P + SL++DTGS +TW +C PC P S TF ++ N T
Sbjct: 3 YYSTITLGSPPKDFSLVMDTGSDLTWVRCDPC-----------SPDCSSTFDRLASN--T 49
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FL 250
K L C+ Y Y DGS G + D + + + +P F+
Sbjct: 50 YKAL----------TCADD---YSYGYGDGSFTQGDLSVDTLKMAGAASD-ELEEFPGFV 95
Query: 251 LGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL----------HSP--YG 295
GC G +G GI+ L G +S S+ Y F YCL SP +G
Sbjct: 96 FGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFG 155
Query: 296 STGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS-- 353
+ +P + + ++YTPI E S +Y + L GISVG +RL L S F
Sbjct: 156 EAA-VELKEPGSGKLQELQYTPI---GESSIYYTVRLDGISVGNQRLDLSPSAFLNGQDK 211
Query: 354 -TEIDSGTIITRFPAPVYSALRSAFRKRMK--KYKMGKGIEDLFDTCYDLSAYKTVVVPK 410
T DSGT +T P V +++ + + ++ KG+ D C+ + +P
Sbjct: 212 PTIFDSGTTLTMLPPGVCDSIKQSLASMVSGAEFVAIKGL----DACFRVPPSSGQGLPD 267
Query: 411 ITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR 470
IT HF GG D + V++ CL F +P++ SI GN+QQ+ + V +D+ R
Sbjct: 268 ITFHFNGGADF-VTRPSNYVIDLGSLQCLIF--VPTNEVSI-FGNLQQQDFFVLHDMDNR 323
Query: 471 RLGFGPGNC 479
R+GF +C
Sbjct: 324 RIGFKETDC 332
>gi|222615721|gb|EEE51853.1| hypothetical protein OsJ_33366 [Oryza sativa Japonica Group]
Length = 315
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 83/272 (30%), Positives = 134/272 (49%), Gaps = 27/272 (9%)
Query: 201 PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTG 259
P+ QD + +CP+ ++Y DGS G D +T +V + P F GC ++ G
Sbjct: 9 PHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPGFSFGCNMDSFG 62
Query: 260 DQN--GASGIMGLDRGPVSIISKTNISY--FFYCL---HSPYG----STGYITFGKPDTV 308
G++G+ GP+S++ +++ ++ F YCL S G +TGY + GK T
Sbjct: 63 ANEFGNVDGLLGMGAGPMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGYFSLGKVAT- 121
Query: 309 NKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAP 368
+ V+YT +V + +E + + LT ISV GERL L S F++ DSG+ ++ P
Sbjct: 122 -RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVFDSGSELSYIPDR 180
Query: 369 VYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGT 428
S L R+ + K G E+ CYD+ + +P I++HF G +L G
Sbjct: 181 ALSVLSQRIRELL--LKRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARFDLGSHGV 238
Query: 429 LVVESVRQ---VCLGFALLPSDPNSILLGNVQ 457
V SV++ CL FA P++ SI+ +Q
Sbjct: 239 FVERSVQEQDVWCLAFA--PNESVSIIGSLIQ 268
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 103/373 (27%), Positives = 168/373 (45%), Gaps = 48/373 (12%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
I + IG P Q ++LDTGS ++W QC H Q FDPS S TFS +PC CK
Sbjct: 77 INLPIGTPPQTQPMVLDTGSQLSWIQC----HKKQPPTASFDPSLSSTFSILPCTHPLCK 132
Query: 194 ILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLG 252
+ F P D+ ++ C Y Y DG+ G ++ T + P +LG
Sbjct: 133 PRIPDFTLPTSCDQ--NRLCHYSYFYADGTYAEGNLVREKFTFSRS-----VSTPPLILG 185
Query: 253 CTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYI---TFGKPDTVN 309
C +T + GI+G++ G +S ++ I+ F YC+ G+ +F + +
Sbjct: 186 CATESTDPR----GILGMNLGRLSFAKQSKITKFSYCVPPRQTRPGFTPTGSFYLGNNPS 241
Query: 310 KKFVKYTPIVTTPEQSE------FYHITLTGISVGGERLPLKASYFTKLS-----TEIDS 358
K KY ++T+ Q Y I + GI + G++L + + F + T IDS
Sbjct: 242 SKGFKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAVFRADAGGSGQTMIDS 301
Query: 359 GTIITRFPAPVYSALRS----AFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV----VVPK 410
G+ T + Y +R+ A R+KK + G+ D+ C+D + K V ++ +
Sbjct: 302 GSEFTYLVSEAYDKVRAQVVRAVGPRLKKGYVYGGVADM---CFD--SVKAVEIGRLIGE 356
Query: 411 ITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSD---PNSILLGNVQQRGYEVHYDV 467
+ F GV++ + L C+G SD S ++GN Q+ V +D+
Sbjct: 357 MVFEFERGVEVVIPKERVLADVGGGVHCVGIG--SSDKLGAASNIIGNFHQQNLWVEFDL 414
Query: 468 AGRRLGFGPGNCN 480
RR+GFG +C+
Sbjct: 415 VRRRVGFGKADCS 427
>gi|222634868|gb|EEE65000.1| hypothetical protein OsJ_19937 [Oryza sativa Japonica Group]
Length = 402
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 128/472 (27%), Positives = 173/472 (36%), Gaps = 110/472 (23%)
Query: 31 HSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGR-YGPCSKLNQGKSRNTPSLEEIL 89
H +V SSL+ P A+P G + L R YGPCS S L ++L
Sbjct: 18 HYIVVETSSLLKPKAICSGLKAMPSSNG--TWVALHRPYGPCSPSPTTTSPPL--LVDML 73
Query: 90 RRDQQRLHLKNSRRLQKAIPD---------------NFKKTKAFTFPAKTGIVAADEYYI 134
R D +LH RR A D +++ +F ++
Sbjct: 74 RWD--KLHTDAIRRKATAGGDVVLEPDKPIVDVQQSDYQMQASFGIGTGGRSGSSSSSSS 131
Query: 135 VV----AIGKPKQYVSLLLDTGSGITWTQCKPCI--HCSQQRDPFFDPSKSKTFSKIPCN 188
+ AI P + +DT + W QC PC C Q++ FDP +S+T + +P
Sbjct: 132 RISRPSAIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVP-- 189
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
C S C GE G + N YF Y
Sbjct: 190 ------------------CGSAAC----------GELGRYGAGCSN----NQCQYFVDY- 216
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYF-FYCLHSPYGSTGYITFGKPDT 307
GD SG P ++ T + F F C H+ G+ T G
Sbjct: 217 ----------GDGRATSGRTWWT--PSTLNPSTVVMNFRFGCSHAVRGNFSASTSGT--- 261
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPA 367
GI VGG RL + F + +DS IIT+ P
Sbjct: 262 -------------------------MGIEVGGRRLNVPPVVFAGGAV-MDSSVIITQLPP 295
Query: 368 PVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRG 427
Y ALR AFR M Y G DTCYD + +V VP +++ F GG + LD G
Sbjct: 296 TAYRALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMG 355
Query: 428 TLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+V + CL F P D +GNVQQ+ +EV YDV G +GF G C
Sbjct: 356 VMV-----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 402
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 150/366 (40%), Gaps = 39/366 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCN-ST 190
Y + IG P Q +L++D+GS +T+ C C C +DP F P S T+S + CN
Sbjct: 91 YTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYSPVKCNVDC 150
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
TC +C Y+ Y + S +G D M+ + +
Sbjct: 151 TC-------------DNERSQCTYERQYAEMSSSSGVLGEDIMSFGK---ESELKPQRAV 194
Query: 251 LGCTDNNTGD--QNGASGIMGLDRGPVSI----ISKTNISYFFYCLHS--PYGSTGYITF 302
GC + TGD A GIMGL RG +SI + K IS F + G +
Sbjct: 195 FGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLG 254
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-TKLSTEIDSGTI 361
G P + F P+ +S +Y+I L I V G+ L L F +K T +DSGT
Sbjct: 255 GMPAPPDMVFSHSNPV-----RSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTT 309
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYK----TVVVPKITIHFL 416
P + A + A ++ K +G + + D C+ + + V P + + F
Sbjct: 310 YAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFG 369
Query: 417 GGVDLELDVRGTLVVESVRQ--VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
G L L L S + CLG DP + LLG + R V YD ++GF
Sbjct: 370 NGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTT-LLGGIVVRNTLVTYDRHNEKIGF 428
Query: 475 GPGNCN 480
NC+
Sbjct: 429 WKTNCS 434
>gi|194707292|gb|ACF87730.1| unknown [Zea mays]
Length = 216
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 79/219 (36%), Positives = 115/219 (52%), Gaps = 16/219 (7%)
Query: 275 VSIISKTNISY---FFYCLHS--PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYH 329
+S++S+T Y F YCL S Y +G + G + V+YTP++T P + Y+
Sbjct: 1 MSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAG--QPRNVRYTPLLTNPHRPSLYY 58
Query: 330 ITLTGISVGGERLPLKASYF-----TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKY 384
+ +TG+SVG + + A F T T IDSGT+ITR+ APVY+ALR FR+++
Sbjct: 59 VNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAA- 117
Query: 385 KMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFAL 443
G FDTC++ P +T+H GGVDL L + TL+ S + CL A
Sbjct: 118 PSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAE 177
Query: 444 LPS--DPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
P + ++ N+QQ+ V DVAG R+GF CN
Sbjct: 178 APQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPCN 216
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 98/362 (27%), Positives = 149/362 (41%), Gaps = 39/362 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHC-----SQQRDPFFDPSKSKTFSKIP 186
Y + ++G P Q V+ +LD S W QC C C + P F S T ++
Sbjct: 97 YVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIREVR 156
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGE----TGFWATDRMTIQEVNGNG 242
C + C+ L+ CS+ + P +YV G G G A D V +G
Sbjct: 157 CANRGCQRLVP-------QTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADG 209
Query: 243 YFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL--HSPYGSTGYI 300
+ GC GD G++GL RG +S +S+ I F Y L +I
Sbjct: 210 ------VIFGCAVATEGD---IGGVIGLGRGELSPVSQLQIGRFSYYLAPDDAVDVGSFI 260
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGT 360
F TP+V + Y++ L GI V GE L + F L + G
Sbjct: 261 LFLDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTF-DLQADGSGGV 319
Query: 361 I------ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
+ +T A Y +R A ++ + + G E D CY + T VP + +
Sbjct: 320 VLSITIPVTFLDAGAYKVVRQAMASKI-ELRAADGSELGLDLCYTSESLATAKVPSMALV 378
Query: 415 FLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDP-NSILLGNVQQRGYEVHYDVAGRRL 472
F GG +EL++ ++S + CL +LPS + LLG++ Q G + YD++G RL
Sbjct: 379 FAGGAVMELEMGNYFYMDSTTGLECL--TILPSPAGDGSLLGSLIQVGTHMIYDISGSRL 436
Query: 473 GF 474
F
Sbjct: 437 VF 438
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 101/373 (27%), Positives = 161/373 (43%), Gaps = 38/373 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ + IG P + + +DTGS I W C C C ++ + +DP S++ +
Sbjct: 90 YFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVT 149
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF- 244
C+ C P+ C+S C Y I+Y DGS GF+ TD + +V+G+G
Sbjct: 150 CDQQFCVANYGGVLPS----CTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTT 205
Query: 245 -ARYPFLLGCTDNNTGDQNGAS----GIMGLDRGPVSIISKTNIS-----YFFYCLHSPY 294
A GC GD ++ GI+G + S++S+ + F +CL +
Sbjct: 206 PANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVN 265
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF---TK 351
G F + V K VK TP+V+ Y++ L GI VGG L L + F
Sbjct: 266 GGG---IFAIGNVVQPK-VKTTPLVSDMPH---YNVILKGIDVGGTALGLPTNIFDSGNS 318
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
T IDSGT + P VY AL + + + + + ++D +C+ S P++
Sbjct: 319 KGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISV-QTLQDF--SCFQYSGSVDDGFPEV 375
Query: 412 TIHFLGGVDLELDVRGTLVVESVRQVCLGF----ALLPSDPNSILLGNVQQRGYEVHYDV 467
T HF G V L + L C+GF + +LLG++ V YD+
Sbjct: 376 TFHFEGDVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYDL 435
Query: 468 AGRRLGFGPGNCN 480
+ +G+ NC+
Sbjct: 436 ENQAIGWADYNCS 448
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 100/371 (26%), Positives = 158/371 (42%), Gaps = 39/371 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ + +G P + + +DTGS I W CKPC C + + FD + S T K+
Sbjct: 74 YFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKKVG 133
Query: 187 CNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGN---G 242
C+ C + + D C + C Y I Y D S G + D++T+++V G+ G
Sbjct: 134 CDDDFCSFISQ------SDSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDLQTG 187
Query: 243 YFARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSP 293
+ + GC + +G + G+MG + S++S+ + F +CL +
Sbjct: 188 PLGQ-EVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNV 246
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
G G G V+ VK TP+V P Q Y++ L G+ V G L L S
Sbjct: 247 KGG-GIFAVG---VVDSPKVKTTPMV--PNQMH-YNVMLMGMDVDGTALDLPPSIMRNGG 299
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITI 413
T +DSGT + FP +Y +L R + K+ +ED F C+ S V P ++
Sbjct: 300 TIVDSGTTLAYFPKVLYDSLIETILAR-QPVKL-HIVEDTFQ-CFSFSENVDVAFPPVSF 356
Query: 414 HFLGGVDLELDVRGTLVVESVRQVCLGF---ALLPSDPNS-ILLGNVQQRGYEVHYDVAG 469
F V L + L C G+ L + ILLG++ V YD+
Sbjct: 357 EFEDSVKLTVYPHDYLFTLEKELYCFGWQAGGLTTGERTEVILLGDLVLSNKLVVYDLEN 416
Query: 470 RRLGFGPGNCN 480
+G+ NC+
Sbjct: 417 EVIGWADHNCS 427
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 79/265 (29%), Positives = 122/265 (46%), Gaps = 19/265 (7%)
Query: 86 EEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIV-AADEYYIVVAIGKPKQY 144
E+LRR QR + + + A + KA A+T I+ A EY + + IG P
Sbjct: 45 HELLRRAIQRSRYRLAG-IGMARGEAASARKAVV--AETPIMPAGGEYLVKLGIGTPPYK 101
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQ 204
+ +DT S + WTQC+PC C Q DP F+P S T++ +PC+S TC L +
Sbjct: 102 FTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDD 161
Query: 205 DKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ--N 262
D + C Y Y + G A D++ I E G GC+ ++TG
Sbjct: 162 D----ESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRG------VAFGCSTSSTGGAPPP 211
Query: 263 GASGIMGLDRGPVSIISKTNISYFFYCLHSPYGST-GYITFGKPDTVNKKFVK--YTPIV 319
ASG++GL RGP+S++S+ ++ F YCL P G + G + P+
Sbjct: 212 QASGVVGLGRGPLSLVSQLSVRRFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMR 271
Query: 320 TTPEQSEFYHITLTGISVGGERLPL 344
P +Y++ L G+ +G + L
Sbjct: 272 RDPRYPSYYYLNLDGLLIGDRTMSL 296
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 113/437 (25%), Positives = 172/437 (39%), Gaps = 65/437 (14%)
Query: 84 SLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQ 143
SL + R H S + NF K FP G Y I + G P Q
Sbjct: 46 SLNHLASLSLSRAHHIKSPK------TNFSLIKTPLFPRSYG-----GYSISLNFGTPPQ 94
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQ--------QRDPFFDPSKSKTFSKIPCNSTTCKIL 195
++DTGS + W C CS+ P F P S + I C + C ++
Sbjct: 95 TTKFVMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMI 154
Query: 196 LEWFPPNGQDKC-----SSKEC-----PYDIAYVDGSGET-GFWATDRMTIQEVNGNGYF 244
F P Q KC +++ C PY I Y GSG T G ++ +
Sbjct: 155 ---FGPEIQSKCQECDSTAQNCTQTCPPYVIQY--GSGSTAGLLLSETLDFPNKK----- 204
Query: 245 ARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHS------PYGSTG 298
FL+GC+ + GI G R P S+ S+ + F YCL S P S
Sbjct: 205 TIPDFLVGCSIFSIKQ---PEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPTSSDL 261
Query: 299 YITFGKPDTVNKKF-VKYTPIVTTPEQS--EFYHITLTGISVGGERLPLKASYFTKLS-- 353
+ G V K + +TP + P + ++Y++ L I +G + + + +
Sbjct: 262 VLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDG 321
Query: 354 ---TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--FDTCYDLSAYKTVVV 408
T +DSGT T PVY + F K+M Y + I++L CY++S K++ V
Sbjct: 322 NGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNLTGLRPCYNISGEKSLSV 381
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGFA------LLPSDPNSILLGNVQQRGYE 462
P + F GG + L + + +CL +I+LGN QQR +
Sbjct: 382 PDLIFQFKGGAKMALPLSNYFSIVDSGVICLTIVSDNVAGPGLGGGPAIILGNYQQRNFY 441
Query: 463 VHYDVAGRRLGFGPGNC 479
V +D+ + GF +C
Sbjct: 442 VEFDLENEKFGFKQQSC 458
>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
Length = 452
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 101/387 (26%), Positives = 162/387 (41%), Gaps = 48/387 (12%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQ---QRDPFFDPSKSKTFSKIPCNST 190
+ VA+G P Q V+++LDTGS ++W +C S Q F+ S S T++ C+S
Sbjct: 62 VPVAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSP 121
Query: 191 TCKILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C+ P P S C ++Y D S G A D + G
Sbjct: 122 ECQWRGRDLPVPPFCAGPPSXSCRVSLSYADASSADGILAADTFLL------GGAPPVXA 175
Query: 250 LLGC-------TDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITF 302
L GC T N+ D A+G++G++RG +S +++T F YC+ +P G +
Sbjct: 176 LFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATLRFAYCI-APGDGPGLLVL 234
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEF-----YHITLTGISVGGERLPLKASYFT-----KL 352
G + YTP++ + Y + L GI VG LP+ S
Sbjct: 235 GGDGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTGAG 294
Query: 353 STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL-----FDTCYDLS----AY 403
T +DSGT T A Y+ L+ F + G D FD C+ S A
Sbjct: 295 QTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAFDACFRASEARVAA 354
Query: 404 KTVVVPKITIHF------LGGVDLELDV----RGTLVVESVRQVCLGFALLPSDPNSILL 453
+ ++P++ + +GG L V RG E+V + G + + + ++ ++
Sbjct: 355 ASXMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDM-AGMSAYVI 413
Query: 454 GNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G+ Q+ V YD+ R+GF P C+
Sbjct: 414 GHHHQQNVWVEYDLQNGRVGFAPARCD 440
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 174/380 (45%), Gaps = 51/380 (13%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+ + +G P Q V+++LDTGS ++W CK +Q + F+P SKT+SK+PC S TCK
Sbjct: 71 VSLTVGSPPQNVTMVLDTGSELSWLHCKK----TQFLNSVFNPLSSKTYSKVPCLSPTCK 126
Query: 194 I-LLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLG 252
+ P D ++K C ++Y D + G A + + G + + G
Sbjct: 127 TRTRDLTIPVSCD--ATKLCHVIVSYADATSIEGNLAFETFRL------GSLTKPATIFG 178
Query: 253 CTD----NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTV 308
C D +N+ + + +G++G++RG +S +++ F YC+ S + S G + G
Sbjct: 179 CMDSGFSSNSEEDSKTTGLIGMNRGSLSFVNQMGYPKFSYCI-SGFDSAGVLLLGNASFP 237
Query: 309 NKKFVKYTPIV--TTPE---QSEFYHITLTGISVGGERLPLKASYFT-----KLSTEIDS 358
K + YTP+V +TP Y + L GI V + L L S F T +DS
Sbjct: 238 WLKPLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVDS 297
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED-------LFDTCYDLSAYKTVV--VP 409
GT T PVY+AL++ F + + + K + D D CY L + + + +P
Sbjct: 298 GTQFTFLLGPVYTALKNEFLSQTR--GILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNLP 355
Query: 410 KITIHFLGGVDLELDVRGTLVVESV------RQVCLGFALLPSD---PNSILLGNVQQRG 460
+++ F G E+ V G ++ V R F SD + ++G+ Q+
Sbjct: 356 VVSLMFQGA---EMSVSGERLLYRVPGEVRGRDSVWCFTFGNSDLLGVEAFVIGHHHQQN 412
Query: 461 YEVHYDVAGRRLGFGPGNCN 480
+ +D+ R+G C+
Sbjct: 413 VWMEFDLEKSRIGLADVRCD 432
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 162/373 (43%), Gaps = 38/373 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ + IG P + + +DTGS I W C C C ++ + +DP S++ +
Sbjct: 90 YFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVT 149
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF- 244
C+ C P+ C+S C Y I+Y DGS GF+ TD + +V+G+G
Sbjct: 150 CDQQFCVANYGGVLPS----CTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTT 205
Query: 245 -ARYPFLLGCTDNNTGDQNGAS----GIMGLDRGPVSIISKTNIS-----YFFYCLHSPY 294
A GC GD ++ GI+G + S++S+ + F +CL +
Sbjct: 206 PANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVN 265
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF---TK 351
G F + V K VK TP+V P+ Y++ L GI VGG L L + F
Sbjct: 266 GGG---IFAIGNVVQPK-VKTTPLV--PDMPH-YNVILKGIDVGGTALGLPTNIFDSGNS 318
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
T IDSGT + P VY AL + + + + + ++D +C+ S P++
Sbjct: 319 KGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISV-QTLQDF--SCFQYSGSVDDGFPEV 375
Query: 412 TIHFLGGVDLELDVRGTLVVESVRQVCLGF----ALLPSDPNSILLGNVQQRGYEVHYDV 467
T HF G V L + L C+GF + +LLG++ V YD+
Sbjct: 376 TFHFEGDVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYDL 435
Query: 468 AGRRLGFGPGNCN 480
+ +G+ NC+
Sbjct: 436 ENQAIGWADYNCS 448
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 100/374 (26%), Positives = 159/374 (42%), Gaps = 42/374 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ + +G P + + +DTGS I W C PC C + D +D S T +
Sbjct: 77 YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASSTSKNVG 136
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C C +++ + C +K+ C Y + Y DGS G + D +T+ +V GN A
Sbjct: 137 CEDAFCSFIMQ------SETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTA 190
Query: 246 --RYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPY 294
+ GC N +G ++ GIMG + S+IS+ F +CL +
Sbjct: 191 PLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMN 250
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL---PLKASYFTK 351
G G G+ V VK TP+V P Q Y++ L G+ V GE + P AS
Sbjct: 251 GG-GIFAIGE---VESPVVKTTPLV--PNQVH-YNVILKGMDVDGEPIDLPPSLASTNGD 303
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAF-RKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPK 410
T IDSGT + P +Y++L K+ K M +++ F C+ ++ P
Sbjct: 304 GGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHM---VQETF-ACFSFTSNTDKAFPV 359
Query: 411 ITIHFLGGVDLELDVRGTLVVESVRQVCLGF---ALLPSD-PNSILLGNVQQRGYEVHYD 466
+ +HF + L + L C G+ + D + ILLG++ V YD
Sbjct: 360 VNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYD 419
Query: 467 VAGRRLGFGPGNCN 480
+ +G+ NC+
Sbjct: 420 LENEVIGWADHNCS 433
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 99/364 (27%), Positives = 154/364 (42%), Gaps = 35/364 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + IG P Q +L++DTGS +T+ C C C + +DP F P S T+ + CN +
Sbjct: 77 YTTRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKCNPSC 136
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
N D+ K+C Y+ Y + S +G A D ++ +
Sbjct: 137 ----------NCDDE--GKQCTYERRYAEMSSSSGVIAEDVVSF---GNESELKPQRAVF 181
Query: 252 GCTDNNTGD--QNGASGIMGLDRGPVSIISKTNI-----SYFFYCLHSPYGSTGYITFGK 304
GC + TGD A GIMGL RG +S++ + F C G + G+
Sbjct: 182 GCENVETGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMVLGQ 241
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-TKLSTEIDSGTIIT 363
+ + P +S +Y+I L + V G+ L LK F K T +DSGT
Sbjct: 242 ISPPPNMVFSH----SNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKHGTVLDSGTTYA 297
Query: 364 RFPAPVYSALRSAFRKRMKKYKMGKGIE-DLFDTCYDLS----AYKTVVVPKITIHFLGG 418
FP + AL+ A K ++ K G + + D C+ + ++ + V P++ + F G
Sbjct: 298 YFPEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVFGSG 357
Query: 419 VDLELDVRGTLVVES--VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
L L L + CLG +D + LLG + R V YD ++GF
Sbjct: 358 QKLSLSPENYLFRHTKVSGAYCLGIFQNGNDLTT-LLGGIVVRNTLVTYDRENDKIGFWK 416
Query: 477 GNCN 480
NC+
Sbjct: 417 TNCS 420
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 110/405 (27%), Positives = 164/405 (40%), Gaps = 51/405 (12%)
Query: 98 LKNSRR-LQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGIT 156
L +SRR LQ++ T P ++ Y + IG P Q +L++DTGS +T
Sbjct: 60 LSHSRRHLQRS---ESHSTATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVDTGSTLT 116
Query: 157 WTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSK--ECPY 214
+ C C C + +DP F P S T+ + C S C C S+ C Y
Sbjct: 117 YVPCSTCEQCGKHQDPNFQPDWSSTYQPLKC-SMEC-------------TCDSEMMHCVY 162
Query: 215 DIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD--QNGASGIMGLDR 272
D Y + S +G D I + GC + TGD A GIMGL R
Sbjct: 163 DRQYAEMSSSSGVLGED---IVSFGKQSELKPQRTVFGCENVETGDIYSQRADGIMGLGR 219
Query: 273 GPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFV--KYTPIV------TTPEQ 324
G +SI+ + G++ + +G D V +P + P +
Sbjct: 220 GDLSIVDQL-------VEKGVIGNSFSLCYGGMDVGGGAMVLGGISPPAGMVFTHSDPAR 272
Query: 325 SEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKK 383
S +Y+I L I + G++LP+ F K T +DSGT P P + A + A K +
Sbjct: 273 SAYYNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNS 332
Query: 384 YKMGKGIEDLF-DTCY-----DLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQ- 436
K+ +G + + D C+ D+S P + + F G L L L S
Sbjct: 333 LKLIQGPDRNYNDICFSGVGSDVSQLSK-TFPAVDLVFSNGNRLSLSPENYLFQHSKAHG 391
Query: 437 -VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
CLG +D + LLG + R V YD ++GF NC+
Sbjct: 392 AYCLGIFQNEND-QTTLLGGIIVRNTLVMYDREHLKIGFWKTNCS 435
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 159/358 (44%), Gaps = 40/358 (11%)
Query: 149 LDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
+DTGS I W C C +C Q FFD S T + IPC+ C ++
Sbjct: 85 IDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSDLICTSGVQ----GA 140
Query: 204 QDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGN--GYFARYPFLLGCTDNNTG 259
+CS + +C Y Y DGSG +G++ +D M + G + + GC+ + +G
Sbjct: 141 AAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFNLIMGQPPAVNSTATIVFGCSISQSG 200
Query: 260 D----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPYGSTGYITFGKPDTVNK 310
D GI G GP+S++S+ + F +CL G + G+ + +
Sbjct: 201 DLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKGDGNGGGILVLGE---ILE 257
Query: 311 KFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT----KLSTEIDSGTIITRFP 366
+ Y+P+V P Q Y++ L I+V G+ LP+ + F+ + T +D GT +
Sbjct: 258 PSIVYSPLV--PSQPH-YNLNLQSIAVNGQPLPINPAVFSISNNRGGTIVDCGTTLAYLI 314
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVR 426
Y L +A + + + + CY +S + P ++++F GG + L
Sbjct: 315 QEAYDPLVTAINTAVSQS--ARQTNSKGNQCYLVSTSIGDIFPLVSLNFEGGASMVLKPE 372
Query: 427 GTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L+ ++ C+GF L + +LG++ + V YD+A +R+G+ +C+
Sbjct: 373 QYLMHNGYLDGAEMWCVGFQKLQEGAS--ILGDLVLKDKIVVYDIAQQRIGWANYDCS 428
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 154/366 (42%), Gaps = 39/366 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + IG P Q +L++DTGS +T+ C C C + +DP F P S T+ + CN
Sbjct: 13 YTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKCN-ID 71
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
C N D+ ++C Y+ Y + S +G D ++ ++ A +
Sbjct: 72 C---------NCDDE--KQQCVYERQYAEMSTSSGVLGEDIISFGNLSA---LAPQRAVF 117
Query: 252 GCTDNNTGD--QNGASGIMGLDRGPVSIISK------TNISYFFYCLHSPYGSTGYITFG 303
GC + TGD A GIMG+ RG +SI+ N S+ G + G
Sbjct: 118 GCENMETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAMVLGG 177
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGTII 362
N F + P+ +S +Y+I L I V G+ LPL + F K T +DSGT
Sbjct: 178 ISPPSNMVFSQSDPV-----RSPYYNIDLKEIHVAGKPLPLNPTVFDGKHGTILDSGTTY 232
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCY-----DLSAYKTVVVPKITIHFL 416
P + + + A K + K +G + + D C+ D+S + P + + F
Sbjct: 233 AYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSS-SFPAVEMVFG 291
Query: 417 GGVDLELDVRGTLVVESVRQ--VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
G L L L S CLG DP + LLG + R V YD ++GF
Sbjct: 292 NGQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTT-LLGGIVVRNTLVLYDRENSKIGF 350
Query: 475 GPGNCN 480
NC+
Sbjct: 351 WKTNCS 356
>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
Length = 454
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 102/387 (26%), Positives = 163/387 (42%), Gaps = 48/387 (12%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQ---QRDPFFDPSKSKTFSKIPCNST 190
+ VA+G P Q V+++LDTGS ++W +C S Q F+ S S T++ C+S
Sbjct: 64 VPVAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSP 123
Query: 191 TCKILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C+ P P S C ++Y D S G A D + G R
Sbjct: 124 ECQWRGRDLPVPPFCAGPPSNSCRVSLSYADASSADGILAADTFLL----GGAPPVRA-- 177
Query: 250 LLGC-------TDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITF 302
L GC T N+ D A+G++G++RG +S +++T F YC+ +P G +
Sbjct: 178 LFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATLRFAYCI-APGDGPGLLVL 236
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEF-----YHITLTGISVGGERLPLKASYFT-----KL 352
G + YTP++ + Y + L GI VG LP+ S
Sbjct: 237 GGDGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTGAG 296
Query: 353 STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL-----FDTCYDLS----AY 403
T +DSGT T A Y+ L+ F + G D FD C+ S A
Sbjct: 297 QTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAFDACFRASEARVAA 356
Query: 404 KTVVVPKITIHF------LGGVDLELDV----RGTLVVESVRQVCLGFALLPSDPNSILL 453
+ ++P++ + +GG L V RG E+V + G + + + ++ ++
Sbjct: 357 ASQMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDM-AGMSAYVI 415
Query: 454 GNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G+ Q+ V YD+ R+GF P C+
Sbjct: 416 GHHHQQNVWVEYDLQNGRVGFAPARCD 442
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 108/413 (26%), Positives = 167/413 (40%), Gaps = 53/413 (12%)
Query: 83 PSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPK 142
P +E+ RR RLH Q +P+ K +++ Y + IG P
Sbjct: 44 PRVEDFRRR---RLH-------QSQLPNAHMKLY-------DDLLSNGYYTTRLWIGTPP 86
Query: 143 QYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
Q +L++DTGS +T+ C C C + +DP F P S ++ + CN P+
Sbjct: 87 QEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN------------PD 134
Query: 203 GQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-- 260
K C Y+ Y + S +G + D ++ + + GC + TGD
Sbjct: 135 CNCDDEGKLCVYERRYAEMSSSSGVLSEDLISF---GNESQLSPQRAVFGCENEETGDLF 191
Query: 261 QNGASGIMGLDRGPVSIISK-----TNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKY 315
A GIMGL RG +S++ + F C G + GK +
Sbjct: 192 SQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKISPPPGMVFSH 251
Query: 316 TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGTIITRFPAPVYSALR 374
+ P +S +Y+I L + V G+ L L F K T +DSGT FP + A++
Sbjct: 252 ----SDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYFPKEAFIAIK 307
Query: 375 SAFRKRMKKYKMGKGIEDLF-DTCYDLSAYKTVVV----PKITIHFLGGVDLELDVRGTL 429
A K + K G + + D C+ + + P+I + F G L L L
Sbjct: 308 DAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYL 367
Query: 430 VVES-VRQV-CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ VR CLG + P ++ LLG + R V YD +LGF NC+
Sbjct: 368 FRHTKVRGAYCLG--IFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCS 418
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 110/405 (27%), Positives = 164/405 (40%), Gaps = 51/405 (12%)
Query: 98 LKNSRR-LQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGIT 156
L +SRR LQ++ T P ++ Y + IG P Q +L++DTGS +T
Sbjct: 60 LSHSRRHLQRS---ESHSTATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVDTGSTLT 116
Query: 157 WTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSK--ECPY 214
+ C C C + +DP F P S T+ + C S C C S+ C Y
Sbjct: 117 YVPCSTCEQCGKHQDPNFQPDWSSTYQPLKC-SMEC-------------TCDSEMMHCVY 162
Query: 215 DIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD--QNGASGIMGLDR 272
D Y + S +G D I + GC + TGD A GIMGL R
Sbjct: 163 DRQYAEMSSSSGVLGED---IVSFGKQSELKPQRTVFGCENVETGDIYSQRADGIMGLGR 219
Query: 273 GPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVNKKFV--KYTPIV------TTPEQ 324
G +SI+ + G++ + +G D V +P + P +
Sbjct: 220 GDLSIVDQL-------VEKGVIGNSFSLCYGGMDVGGGAMVLGGISPPAGMVFTHSDPAR 272
Query: 325 SEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKK 383
S +Y+I L I + G++LP+ F K T +DSGT P P + A + A K +
Sbjct: 273 SAYYNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNS 332
Query: 384 YKMGKGIEDLF-DTCY-----DLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQ- 436
K+ +G + + D C+ D+S P + + F G L L L S
Sbjct: 333 LKLIQGPDRNYNDICFSGVGSDVSQLSK-TFPAVDLVFSNGNRLSLSPENYLFQHSKAHG 391
Query: 437 -VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
CLG +D + LLG + R V YD ++GF NC+
Sbjct: 392 AYCLGIFQNEND-QTTLLGGIIVRNTLVMYDREHLKIGFWKTNCS 435
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 101/374 (27%), Positives = 160/374 (42%), Gaps = 42/374 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ + +G P + + +DTGS I W C PC C + D +D S T +
Sbjct: 74 YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVG 133
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C C +++ + C +K+ C Y + Y DGS G + D +T+++V GN A
Sbjct: 134 CEDDFCSFIMQ------SETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTA 187
Query: 246 --RYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISK-----TNISYFFYCLHSPY 294
+ GC N +G + GIMG + SIIS+ + F +CL +
Sbjct: 188 PLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMN 247
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL---PLKASYFTK 351
G G G+ V VK TPIV P Q Y++ L G+ V G+ + P AS
Sbjct: 248 GG-GIFAVGE---VESPVVKTTPIV--PNQVH-YNVILKGMDVDGDPIDLPPSLASTNGD 300
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAF-RKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPK 410
T IDSGT + P +Y++L K+ K M +++ F C+ ++ P
Sbjct: 301 GGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHM---VQETF-ACFSFTSNTDKAFPV 356
Query: 411 ITIHFLGGVDLELDVRGTLVVESVRQVCLGF---ALLPSD-PNSILLGNVQQRGYEVHYD 466
+ +HF + L + L C G+ + D + ILLG++ V YD
Sbjct: 357 VNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYD 416
Query: 467 VAGRRLGFGPGNCN 480
+ +G+ NC+
Sbjct: 417 LENEVIGWADHNCS 430
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 108/385 (28%), Positives = 155/385 (40%), Gaps = 43/385 (11%)
Query: 120 FPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFF 174
P TG+ YY V +G P + + +DTGS I W C C C + +
Sbjct: 81 LPTDTGL-----YYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLY 135
Query: 175 DPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVDGSGETGFWATDRM 233
DP S T S + C+ C P KCS+ C Y + Y DGS G + D +
Sbjct: 136 DPKASSTGSTVMCDQGFCADTFGGRLP----KCSANVPCEYSVTYGDGSSTVGSFVNDAL 191
Query: 234 TIQEVNGNGYF--ARYPFLLGCTDNNTGDQNGAS----GIMGLDRGPVSIISKTNIS--- 284
+V G+G A + GC GD +S GI+G S++S+ +
Sbjct: 192 QFDQVTGDGQTQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKV 251
Query: 285 --YFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL 342
F +CL + G F D V K VK TP+V Y++ L I VGG L
Sbjct: 252 KKIFAHCLDTIKGGG---IFAIGDVVQPK-VKTTPLVADKPH---YNVNLKTIDVGGTTL 304
Query: 343 PLKASYF---TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYD 399
L A F K T IDSGT +T P V+ + A + + ++D C++
Sbjct: 305 ELPADIFKPGEKRGTIIDSGTTLTYLPELVFKKVMLAVFNKHQDITF-HDVQDFL--CFE 361
Query: 400 LSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGF---ALLPSDPNSI-LLGN 455
S P +T HF + L + C+GF AL D I L+G+
Sbjct: 362 YSGSVDDGFPTLTFHFEDDLALHVYPHEYFFPNGNDVYCVGFQNGALQSKDGKDIVLMGD 421
Query: 456 VQQRGYEVHYDVAGRRLGFGPGNCN 480
+ V YD+ R +G+ NC+
Sbjct: 422 LVLSNKLVVYDLENRVIGWTDYNCS 446
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 101/374 (27%), Positives = 160/374 (42%), Gaps = 42/374 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ + +G P + + +DTGS I W C PC C + D +D S T +
Sbjct: 78 YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVG 137
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C C +++ + C +K+ C Y + Y DGS G + D +T+++V GN A
Sbjct: 138 CEDDFCSFIMQ------SETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTA 191
Query: 246 --RYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISK-----TNISYFFYCLHSPY 294
+ GC N +G + GIMG + SIIS+ + F +CL +
Sbjct: 192 PLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMN 251
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL---PLKASYFTK 351
G G G+ V VK TPIV P Q Y++ L G+ V G+ + P AS
Sbjct: 252 GG-GIFAVGE---VESPVVKTTPIV--PNQVH-YNVILKGMDVDGDPIDLPPSLASTNGD 304
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAF-RKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPK 410
T IDSGT + P +Y++L K+ K M +++ F C+ ++ P
Sbjct: 305 GGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHM---VQETF-ACFSFTSNTDKAFPV 360
Query: 411 ITIHFLGGVDLELDVRGTLVVESVRQVCLGF---ALLPSD-PNSILLGNVQQRGYEVHYD 466
+ +HF + L + L C G+ + D + ILLG++ V YD
Sbjct: 361 VNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYD 420
Query: 467 VAGRRLGFGPGNCN 480
+ +G+ NC+
Sbjct: 421 LENEVIGWADHNCS 434
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 92/373 (24%), Positives = 161/373 (43%), Gaps = 38/373 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ V +G P ++ +DTGS I W C C +C FFD S T +
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSVT 159
Query: 187 CNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
C+ C + + +CS + +C Y Y DGSG +G++ TD + G A
Sbjct: 160 CSDPICSSVFQ----TTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVA 215
Query: 246 R--YPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPY 294
P + GC+ +GD GI G +G +S++S+ + F +CL
Sbjct: 216 NSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG 275
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLST 354
G G+ + + Y+P++ P Q Y++ L I V G+ LP+ A+ F +T
Sbjct: 276 SGGGVFVLGE---ILVPGMVYSPLL--PSQPH-YNLNLLSIGVNGQILPIDAAVFEASNT 329
Query: 355 E---IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
+D+GT +T Y +A + ++ I + CY +S + + P +
Sbjct: 330 RGTIVDTGTTLTYLVKEAYDPFLNAISNSVS--QLVTLIISNGEQCYLVSTSISDMFPPV 387
Query: 412 TIHFLGGVDLELDVRGTL----VVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
+++F GG + L + L + C+GF P + +LG++ + YD+
Sbjct: 388 SLNFAGGASMMLRPQDYLFHYGFYDGASMWCIGFQKAPEE--QTILGDLVLKDKVFVYDL 445
Query: 468 AGRRLGFGPGNCN 480
A +R+G+ +C+
Sbjct: 446 ARQRIGWANYDCS 458
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 108/413 (26%), Positives = 167/413 (40%), Gaps = 53/413 (12%)
Query: 83 PSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPK 142
P +E+ RR RLH Q +P+ K +++ Y + IG P
Sbjct: 44 PRVEDFRRR---RLH-------QSQLPNAHMKLY-------DDLLSNGYYTTRLWIGTPP 86
Query: 143 QYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
Q +L++DTGS +T+ C C C + +DP F P S ++ + CN P+
Sbjct: 87 QEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN------------PD 134
Query: 203 GQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-- 260
K C Y+ Y + S +G + D ++ + + GC + TGD
Sbjct: 135 CNCDDEGKLCVYERRYAEMSSSSGVLSEDLISF---GNESQLSPQRAVFGCENEETGDLF 191
Query: 261 QNGASGIMGLDRGPVSIISK-----TNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKY 315
A GIMGL RG +S++ + F C G + GK +
Sbjct: 192 SQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKISPPPGMVFSH 251
Query: 316 TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGTIITRFPAPVYSALR 374
+ P +S +Y+I L + V G+ L L F K T +DSGT FP + A++
Sbjct: 252 ----SDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYFPKEAFIAIK 307
Query: 375 SAFRKRMKKYKMGKGIEDLF-DTCYDLSAYKTVVV----PKITIHFLGGVDLELDVRGTL 429
A K + K G + + D C+ + + P+I + F G L L L
Sbjct: 308 DAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYL 367
Query: 430 VVES-VRQV-CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ VR CLG + P ++ LLG + R V YD +LGF NC+
Sbjct: 368 FRHTKVRGAYCLG--IFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCS 418
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 164/381 (43%), Gaps = 75/381 (19%)
Query: 137 AIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT-CKIL 195
+IG+P ++DTGS +TW C PC CSQQ P FDPSKS T+S + C+ C ++
Sbjct: 98 SIGEPPIPQLAVMDTGSSLTWVMCHPCSSCSQQSVPIFDPSKSSTYSNLSCSECNKCDVV 157
Query: 196 LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL-GC- 253
NG ECPY + YV G +A +++T++ ++ + + P L+ GC
Sbjct: 158 ------NG-------ECPYSVEYVGSGSSQGIYAREQLTLETIDES--IIKVPSLIFGCG 202
Query: 254 ----TDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVN 309
+N G +G+ GL G S++ F YC+ G N
Sbjct: 203 RKFSISSNGYPYQGINGVFGLGSGRFSLLPSFG-KKFSYCI------------GNLRNTN 249
Query: 310 KKFVKYTPIVTTPEQSE---------FYHITLTGISVGGERLPLKASYFTKLSTEIDSGT 360
KF + Q + Y++ L IS+GG +L + + F + T+ +SG
Sbjct: 250 YKFNRLVLGDKANMQGDSTTLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGV 309
Query: 361 II---------TRFPAPVYS-ALRSAFRKRMKKYKMGKGIEDLFDTCY------DLSAYK 404
II T++ V S + + + + K + + CY DLS +
Sbjct: 310 IIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDK--HNPYTLCYSGVVSQDLSGF- 366
Query: 405 TVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLP-----SDPNSI-LLGNVQQ 458
P +T HF G L+LDV + + + C+ A+LP D S +G + Q
Sbjct: 367 ----PLVTFHFAEGAVLDLDVTSMFIQTTENEFCM--AMLPGNYFGDDYESFSSIGMLAQ 420
Query: 459 RGYEVHYDVAGRRLGFGPGNC 479
+ Y V YD+ R+ F +C
Sbjct: 421 QNYNVGYDLNRMRVYFQRIDC 441
>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 529
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 95/366 (25%), Positives = 161/366 (43%), Gaps = 39/366 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP--------FFDPSKSKTFS 183
+Y VVA+G P + LDTGS + W C C+ C+ P + P KS T
Sbjct: 108 HYAVVALGTPNVTFLVALDTGSDLFWVPCD-CLKCAPLSSPDYGNLKFDVYSPRKSSTSR 166
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCS--SKECPYDIAYV-DGSGETGFWATDRMTIQEVNG 240
K+PC+S C + Q +CS S CPY I Y+ D + G D M + +G
Sbjct: 167 KVPCSSNMCDL---------QTECSAASNSCPYKIEYLSDNTSSKGVLVEDVMYLATESG 217
Query: 241 NGYFARYPFLLGCTDNNTGDQNGAS---GIMGL---DRGPVSIISKTNISYFFYCLHSPY 294
+ + P GC TG G++ G++GL + S+++ ++ + +
Sbjct: 218 HSKITQAPITFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASQGVAANSFSMCFGE 277
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLST 354
G I FG + ++ TP+ + + +Y+I++ G GG+ ++ TK S
Sbjct: 278 DGHGRINFGDTGSADQL---ETPL-NIYKHNPYYNISIVGAMAGGK------TFSTKFSA 327
Query: 355 EIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
+DSGT T P+Y+ + SAF K++K+ + F+ CY +S+ V P I++
Sbjct: 328 VVDSGTSFTALSDPMYTEITSAFDKQVKEKRNPADSSLPFEYCYTISSKGAVSPPNISLT 387
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSI-LLGNVQQRGYEVHYDVAGRRLG 473
GG + + + + +G+ L + L+G G +V +D LG
Sbjct: 388 AKGGSVFPVK-DPIITITDISSSPVGYCLAIMKSEGVNLIGENFMSGLKVVFDRERLVLG 446
Query: 474 FGPGNC 479
+ NC
Sbjct: 447 WKSFNC 452
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 80/265 (30%), Positives = 121/265 (45%), Gaps = 26/265 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ V +G P + + +DTGS I W C PC C FF+P S T SKIP
Sbjct: 91 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
C+ C L+ Q +S C Y Y DGSG +G++ +D M V GN A
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSP-CGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTAN 209
Query: 247 --YPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNI-----SYFFYCLHSPYG 295
+ GC+++ +GD GI G + +S++S+ N F +CL
Sbjct: 210 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDN 269
Query: 296 STGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE 355
G + G+ + + + YTP+V P Q Y++ L I V G++LP+ +S FT +T+
Sbjct: 270 GGGILVLGE---IVEPGLVYTPLV--PSQPH-YNLNLESIVVNGQKLPIDSSLFTTSNTQ 323
Query: 356 ---IDSGTIITRFPAPVYSALRSAF 377
+DSGT + Y +A
Sbjct: 324 GTIVDSGTTLAYLADGAYDPFVNAI 348
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 100/360 (27%), Positives = 160/360 (44%), Gaps = 39/360 (10%)
Query: 136 VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
++IG+P +++DTGS I W C PC +C FDPS S TFS + CK
Sbjct: 105 LSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFSPL------CKT- 157
Query: 196 LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTD 255
P G C P+ I+YVD S +G + D + + E G ++GC
Sbjct: 158 -----PCGFKGCKCDPIPFTISYVDNSSASGTFGRD-ILVFETTDEGTSQISDVIIGCGH 211
Query: 256 NNTGDQN-GASGIMGLDRGPVSIISKTNISYFFYC---LHSPYGSTGYITFGKPDTVNKK 311
N + + G +GI+GL+ GP S+ ++ F YC L PY + + G+ +
Sbjct: 212 NIGFNSDPGYNGILGLNNGPNSLATQIG-RKFSYCIGNLADPYYNYNQLRLGEGADLEG- 269
Query: 312 FVKYTPIVTTPEQ--SEFYHITLTGISVGGERLPLKASYFTKL-----STEIDSGTIITR 364
+TP + FY++T+ GISVG +RL + F +DSGT IT
Sbjct: 270 -------YSTPFEVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITY 322
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKGIEDL-FDTC-YDLSAYKTVVVPKITIHFLGGVDLE 422
+ L + R +K E+ + C Y + + V P +T HF+ G DL
Sbjct: 323 LVDSAHKLLYNEVRNLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVVTFHFVDGADLA 382
Query: 423 LDVRGTLVVESVRQVCLGF---ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
LD G+ + C+ ++L + + ++G + Q+ Y V YD+ + + F +C
Sbjct: 383 LDT-GSFFSQRDDIFCMTVSPASILNTTISPSVIGLLAQQSYNVGYDLVNQFVYFQRIDC 441
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 159/363 (43%), Gaps = 46/363 (12%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQ-RDPFFDPSKSKTFSKIPCNST 190
+ + ++G+P ++DTGS + W QC PC CSQQ P FDPS S T+ + C +
Sbjct: 102 FLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISSTYDSLSCKNI 161
Query: 191 TCKILLEWFPPNGQDKC-SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C+ + P+G+ C SS +C Y+ YV+G G AT+++ + G A
Sbjct: 162 ICR-----YAPSGE--CDSSSQCVYNQTYVEGLPSVGVIATEQLIFGS-SDEGRNAVNNV 213
Query: 250 LLGCTDNNTGDQNGA-SGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTV 308
L GC+ N ++ +G+ GL G S++++ S F YC+ + PD
Sbjct: 214 LFGCSHRNGNYKDRRFTGVFGLGSGITSVVNQMG-SKFSYCIGN---------IADPDYS 263
Query: 309 NKKFVKYTPI----VTTPEQ--SEFYHITLTGISVGGERLPLKASYFTKLSTE----IDS 358
+ V + +TP Y + L GISVG RL + S F + + IDS
Sbjct: 264 YNQLVLSEGVNMEGYSTPLDVVDGHYQVILEGISVGETRLVIDPSAFKRTEKQRRVIIDS 323
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV-VPKITIHFLG 417
GT T Y AL R + ++ E CY + +V P +T HF
Sbjct: 324 GTAPTWLAENEYRALEREVRNLLDRFLTPFMRESFL--CYKGKVGQDLVGFPAVTFHFAE 381
Query: 418 GVDLELDVRGTLVVESVRQVCLGFALLPSDPNSI-LLGNVQQRGYEVHYDVAGRRLGFGP 476
G DL +D +RQ ++ D ++G + Q+ Y V YD+ +L F
Sbjct: 382 GADLVVDTE-------MRQA----SVYGKDFKDFSVIGLMAQQYYNVAYDLNKHKLFFQR 430
Query: 477 GNC 479
+C
Sbjct: 431 IDC 433
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 152/372 (40%), Gaps = 36/372 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
YY + IG P + + +DTGS I W C C C ++ +DP S T SK+
Sbjct: 4 YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 63
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF-- 244
C+ C P +S C Y + Y DGS TG++ +D + +V+G+G
Sbjct: 64 CDQGFCAATYGGLLPGCT---TSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRP 120
Query: 245 ARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPYG 295
A GC GD GI+G + S++S+ + + F +CL + G
Sbjct: 121 ANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTING 180
Query: 296 STGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF---TKL 352
F + V K VK TP+V P Y++ L I VGG L L + F K
Sbjct: 181 GG---IFAIGNVVQPK-VKTTPLV--PNMPH-YNVNLKSIDVGGTALKLPSHMFDTGEKK 233
Query: 353 STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKIT 412
T IDSGT +T P VY + A + K E L C+ PKIT
Sbjct: 234 GTIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHNVQEFL---CFQYVGRVDDDFPKIT 290
Query: 413 IHFLGGVDLELDVRGTLVVESVRQVCLGF---ALLPSDPNS-ILLGNVQQRGYEVHYDVA 468
HF + L + C+GF L D +LLG++ V YD+
Sbjct: 291 FHFENDLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDLE 350
Query: 469 GRRLGFGPGNCN 480
+ +G+ NC+
Sbjct: 351 NQVIGWTEYNCS 362
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 161/387 (41%), Gaps = 57/387 (14%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCK---PCIHCS-----QQRDPFFDPSKSKTFSKI 185
I ++ G P Q +S L+DTGS + W C C +CS ++ P FDP S + +
Sbjct: 80 ISLSFGTPPQKLSFLVDTGSDVVWAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKIL 139
Query: 186 PCNSTTCKILLEWFP--------PNGQDKCSSKECPYDIAYVDGSGETGFWATD----RM 233
C + C + +FP NG K S CPY Y G+ F + R
Sbjct: 140 DCRNPKC--VSTYFPYVHLGCPRCNGNSKHCSYACPYSTQYGTGASSGYFLLENLKFPRK 197
Query: 234 TIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHS- 292
TI+ FLLGCT + + + + G R S+ + + F YCL+S
Sbjct: 198 TIRN-----------FLLGCT-TSAARELSSDALAGFGRSMFSLPIQMGVKKFAYCLNSH 245
Query: 293 PYGST---GYITFGKPDTVNKKFVKYTPIVTTPEQSEF-YHITLTGISVGGERLPLKASY 348
Y T G + D K + YTP + +P S F YH+ + I +G + L + + Y
Sbjct: 246 DYDDTRNSGKLILDYRDGKTKG-LSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSKY 304
Query: 349 FTKLSTEIDSGTIITR-------FPAPVYSALRSAFRKRMKKYKMGKGIEDL--FDTCYD 399
++ SG II PV+ + + +K+M KY+ E CY+
Sbjct: 305 LAP-GSDGRSGVIIDSGYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEAETQTGLTPCYN 363
Query: 400 LSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCL------GFALLPSDPN-SIL 452
+ +K++ +P + F GG ++ + + + + G L P+ SI+
Sbjct: 364 FTGHKSIKIPPLIYQFRGGANMVVPGKNYFGISPQESLACFLMDTNGTNALEITPDPSII 423
Query: 453 LGNVQQRGYEVHYDVAGRRLGFGPGNC 479
LGN Q Y V YD+ R GF C
Sbjct: 424 LGNSQHVDYYVEYDLKNDRFGFRRQTC 450
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 109/360 (30%), Positives = 164/360 (45%), Gaps = 45/360 (12%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCK-PCIHCSQQRDPFFDPSKSKTFSKIPCNS 189
EY + + + P + L DTGS + W +CK P H S +++++PC++
Sbjct: 75 EYLMALDVSTPPVRMLALADTGSSLVWLKCKLPAAHTPA----------SSSYARLPCDA 124
Query: 190 TTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
CK L + + C Y A+ DGS G D T + R F
Sbjct: 125 FACKALGDAASCRATGS-GNNICVYRYAFADGSCTAGPVTVDAFT--------FSTRLDF 175
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIIS----KTNISY-FFYCLHSPY----GSTGYI 300
GC G G++GL GP+S++S KT ++ F YCL PY + +
Sbjct: 176 --GCATRTEGLSVPDDGLVGLANGPISLVSQLSAKTPFAHKFSYCL-VPYSSSETVSSSL 232
Query: 301 TFGKPDTVNKK-FVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSG 359
FG V+ TP+V +S FY I L I V G+ +PL+ + TKL +DSG
Sbjct: 233 NFGSHAIVSSSPGAATTPLVAGRNKS-FYTIALDSIKVAGKPVPLQTTT-TKLI--VDSG 288
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL--SAYKTV--VVPKITIHF 415
T++T P V L +A +K ++ K E L+ CYD+ A + V +P +T+
Sbjct: 289 TMLTYLPKAVLDPLVAALTAAIKLPRV-KSPETLYAVCYDVRRRAPEDVGKSIPDVTLVL 347
Query: 416 LGGVDLELDVRGTLVVESV-RQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
GG ++ L T VVE+ VCL AL+ S +LGNV Q+ V +D+ R + F
Sbjct: 348 GGGGEVRLPWGNTFVVENKGTTVCL--ALVESHLPEFILGNVAQQNLHVGFDLERRTVSF 405
>gi|224030719|gb|ACN34435.1| unknown [Zea mays]
Length = 216
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 78/219 (35%), Positives = 115/219 (52%), Gaps = 16/219 (7%)
Query: 275 VSIISKTNISY---FFYCLHS--PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYH 329
+S++S+T Y F YCL S Y +G + G + V++TP++T P + Y+
Sbjct: 1 MSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAG--QPRNVRHTPLLTNPHRPSLYY 58
Query: 330 ITLTGISVGGERLPLKASYF-----TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKY 384
+ +TG+SVG + + A F T T IDSGT+ITR+ APVY+ALR FR+++
Sbjct: 59 VNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAA- 117
Query: 385 KMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFAL 443
G FDTC++ P +T+H GGVDL L + TL+ S + CL A
Sbjct: 118 PSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAE 177
Query: 444 LPS--DPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
P + ++ N+QQ+ V DVAG R+GF CN
Sbjct: 178 APQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPCN 216
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 107/365 (29%), Positives = 157/365 (43%), Gaps = 50/365 (13%)
Query: 138 IGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSK-IPCNSTTCKILL 196
+G P V L L+ G+ + W P C +Q P+F+P TFS+ +P S C
Sbjct: 1 MGTPPNPVKLKLENGNELIWNHSNPSPECFEQAFPYFEP---LTFSRGLPFAS--CGSPK 55
Query: 197 EWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTD 255
W PN + C Y +Y D S TGF D+ T G A P GC
Sbjct: 56 FW--PN-------QTCVYTYSYGDKSVTTGFLEVDKFTFV-----GAGASVPGVAFGCGL 101
Query: 256 NNTGD-QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTV------ 308
N G ++ +GI G RGP+S+ S+ + F +C + IT P TV
Sbjct: 102 FNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTT-------ITGAIPSTVLLDLPA 154
Query: 309 -----NKKFVKYTPIVTTPEQSE---FYHITLTGISVGGERLPLKASYFTKLS----TEI 356
+ V+ TP++ + Y+++L GI+VG RLP+ S F + T I
Sbjct: 155 DLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTII 214
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFL 416
DSGT IT P VY +R F ++ K + G TC+ + VPK+ +HF
Sbjct: 215 DSGTSITSLPPQVYQVVRDEFAAQI-KLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFE 273
Query: 417 GG-VDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
G +DL + V + + A+ D +I +GN QQ+ V YD+ L F
Sbjct: 274 GATMDLPRENYVFEVPDDAGNSIICLAINKGDETTI-IGNFQQQNMHVLYDLQNNMLSFV 332
Query: 476 PGNCN 480
C+
Sbjct: 333 AAQCD 337
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 104/363 (28%), Positives = 165/363 (45%), Gaps = 42/363 (11%)
Query: 136 VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
++IG P LL+DTGS +TW C PC C Q PFF PS+S T+ C S
Sbjct: 82 ISIGNPPVPQLLLIDTGSDLTWIHCLPC-KCYPQTIPFFHPSRSSTYRNASCVSAP---- 136
Query: 196 LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTD 255
P +D+ + C Y + Y D S G A +++T E + +G ++ + GC
Sbjct: 137 -HAMPQIFRDE-KTGNCQYHLRYRDFSNTRGILAEEKLTF-ETSDDGLISKQNIVFGCGQ 193
Query: 256 NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYC---LHSPYGSTGYITFGKPDTVNKKF 312
+N+G SG++GL G SI+++ S F YC L +P + G N
Sbjct: 194 DNSGFTK-YSGVLGLGPGTFSIVTRNFGSKFSYCFGSLTNPTYPHNILILG-----NGAK 247
Query: 313 VKYTPIVTTPEQ--SEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVY 370
++ P TP Q + Y++ L IS G + L ++ F + ++ GT+I +P
Sbjct: 248 IEGDP---TPLQIFQDRYYLDLQAISFGEKLLDIEPGTFQRYRSQ--GGTVIDTGCSPTI 302
Query: 371 SALRSAFRKRMKK--YKMGKGIEDLFD------TCYD----LSAYKTVVVPKITIHFLGG 418
A R A+ ++ + +G+ + + D CY+ L Y P +T HF GG
Sbjct: 303 LA-REAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYG---FPVVTFHFAGG 358
Query: 419 VDLELDVRGTLV-VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
+L LDV V ES CL + D S+ +G + Q+ Y V Y++ ++ F
Sbjct: 359 AELALDVESLFVSSESGDSFCLAMTMNTFDDMSV-IGAMAQQNYNVGYNLRTMKVYFQRT 417
Query: 478 NCN 480
+C
Sbjct: 418 DCE 420
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 90/327 (27%), Positives = 145/327 (44%), Gaps = 41/327 (12%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
YY + +G P + + +DTGS + W C C C Q FFDP S T S I
Sbjct: 81 YYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPIS 140
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF 244
C+ C W + CS + C Y Y DGSG +GF+ +D + + G+
Sbjct: 141 CSDQRCS----WGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLV 196
Query: 245 --ARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSP 293
+ P + GC+ + TGD GI G + +S+IS+ F +CL
Sbjct: 197 PNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGE 256
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
G G + G+ N F TP+V P Q Y++ L ISV G+ LP+ S F+ +
Sbjct: 257 NGGGGILVLGEIVEPNMVF---TPLV--PSQPH-YNVNLLSISVNGQALPINPSVFSTSN 310
Query: 354 ---TEIDSGTIITRFP----APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV 406
T ID+GT + P A+ +A + ++ + KG + CY ++
Sbjct: 311 GQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV-VSKG-----NQCYVITTSVGD 364
Query: 407 VVPKITIHFLGGVDLELDVRGTLVVES 433
+ P ++++F GG + L+ + L+ ++
Sbjct: 365 IFPPVSLNFAGGASMFLNPQDYLIQQN 391
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 89/338 (26%), Positives = 149/338 (44%), Gaps = 43/338 (12%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
YY V +G P ++ +DTGS + W C C C Q FFDP S T S I
Sbjct: 25 YYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIA 84
Query: 187 CNSTTCKILLEWFPPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF 244
C+ C ++ + CSS+ +C Y Y DGSG +G++ +D M + +
Sbjct: 85 CSDQRCNNGIQ----SSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVT 140
Query: 245 --ARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSP 293
+ P + GC++ TGD GI G + +S+IS+ + F +CL
Sbjct: 141 TNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGD 200
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
G + G+ + + + YT +V P Q Y++ L I+V G+ L + +S F +
Sbjct: 201 SSGGGILVLGE---IVEPNIVYTSLV--PAQPH-YNLNLQSIAVNGQTLQIDSSVFATSN 254
Query: 354 ---TEIDSGTIITRFPAPVYSALRSAFRKRMKKY---KMGKGIEDLFDTCYDLSAYKTVV 407
T +DSGT + Y SA + + + +G + CY +++ T V
Sbjct: 255 SRGTIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTAVSRG-----NQCYLITSSVTEV 309
Query: 408 VPKITIHFLGGVDLELDVRGTLV----VESVRQVCLGF 441
P+++++F GG + L + L+ + C+GF
Sbjct: 310 FPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGF 347
>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 492
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 119/465 (25%), Positives = 180/465 (38%), Gaps = 85/465 (18%)
Query: 81 NTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIG- 139
N S+ ++R R ++ R +P + ++ + + P G +Y + +++G
Sbjct: 37 NGTSIHHLIRSSSLRSAARHGRHRTHHLPSS-RRHRQLSLPLAPG----SDYTLSLSVGP 91
Query: 140 -KPKQYVSLLLDTGSGITWTQCKP--CIHC---------SQQRDPFFDPSKSKTFSKIPC 187
VSL LDTGS + W C P C+ C + +P P+ S+ +IPC
Sbjct: 92 LSTANPVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPPGNNNSSNPLPPPTDSR---RIPC 148
Query: 188 NSTTCKILLEWFPPNGQDKCSSKECPYD-----------------IAYVDGSGETGFWAT 230
S C PP D C++ CP D AY DGS
Sbjct: 149 ASPFCSAAHSSAPP--ADLCAAARCPLDDIETGSCAASHACPPLYYAYGDGS------LV 200
Query: 231 DRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY----F 286
R+ V A F C G+ G+ G RGP+S+ ++ + F
Sbjct: 201 ARLRRGRVGIAASVAVENFTFACAHTALGEP---VGVAGFGRGPLSLPAQLAPAALSGRF 257
Query: 287 FYCL--HS-----PYGSTGYITFGKP--DTVNKKFVKYTPIVTTPEQSEFYHITLTGISV 337
YCL HS P + I P D ++ + YTP++ P+ FY + L +SV
Sbjct: 258 SYCLVAHSFRADRPIRPSPLILGRSPGEDPASETGIVYTPLLHNPKHPYFYSVALEAVSV 317
Query: 338 GGERLPLKASY-----FTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGK---- 388
GG R+P + +DSGT T P Y+ + F + M + +
Sbjct: 318 GGTRIPARPELGRVGRAGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEAA 377
Query: 389 ----GIEDLFDTCYDLSAYK---TVVVPKITIHFLGGVDLELDVR----GTLVVESVRQV 437
G+ + +D SA + VP + +HF G + L R G E R
Sbjct: 378 EDQTGLAPCYYYDHDASAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMGFRSEERRRVG 437
Query: 438 CLGFALLPSDPN---SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
CL D + LGN QQ+G+EV YDV R+GF C
Sbjct: 438 CLMLMNGGEDDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
Length = 557
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 101/391 (25%), Positives = 165/391 (42%), Gaps = 52/391 (13%)
Query: 120 FPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCK-PCIHCSQQRDPFFDPSK 178
P K + +YY + +G P + L +DTGS +TW QC PC +C++ P + P+K
Sbjct: 175 LPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPTK 234
Query: 179 SKTFSKIPCNSTTCKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQE 237
K +P C+ L Q+ C + K+C Y+I Y D S G A D M +
Sbjct: 235 EKI---VPPRDLLCQEL-----QGNQNYCETCKQCDYEIEYADQSSSMGVLARDDMHL-- 284
Query: 238 VNGNGYFARYPFLLGCTDNNTGDQNGA----SGIMGLDRGPVSIISK-------TNISYF 286
+ NG + F+ GC + G + GI+GL +S+ S+ +NI F
Sbjct: 285 IATNGGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNI--F 342
Query: 287 FYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKA 346
+C+ G GY+ G D V + + +T I + P+ YH + G ++L ++
Sbjct: 343 GHCITREQGGGGYMFLGD-DYVPRWGITWTSIRSGPD--NLYHTEAHHVKYGDQQLRMRE 399
Query: 347 SYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDT----CYD--- 399
+ DSG+ T P +Y L +A KY ++D D C+
Sbjct: 400 QAGNTVQVIFDSGSSYTYLPDEIYENLVAAI-----KYASPGFVQDSSDRTLPLCWKADF 454
Query: 400 ----LSAYKTVVVPKITIHF-----LGGVDLELDVRGTLVVESVRQVCLGF--ALLPSDP 448
L K P + +HF + L++ VCLG +
Sbjct: 455 PVRYLEDVKQFFKP-LNLHFGKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHG 513
Query: 449 NSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++I++G+V RG V YD R++G+ +C
Sbjct: 514 STIIVGDVSLRGKLVVYDNQRRQIGWTNSDC 544
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 86/304 (28%), Positives = 134/304 (44%), Gaps = 37/304 (12%)
Query: 96 LHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGI 155
L + RRL++ +P+ +F I A YY +++G P Q + +DTGS +
Sbjct: 9 LRKHDQRRLRRMLPE----VVSFPISGDNDIFAMGLYYTRISLGTPPQQFYVDVDTGSNV 64
Query: 156 TWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSK 210
W +C PC C D FDP KS T I C C +L N + +CS +
Sbjct: 65 AWVKCAPCTGCEHSGDVPVPMSTFDPRKSTTKISISCTDAECGVL------NKKLQCSPE 118
Query: 211 E--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR---YPFLLGCTDNNTGDQNGAS 265
CPY + Y DGS G++ D T +V + A+ + GC TG +
Sbjct: 119 RLSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTGSWS-VD 177
Query: 266 GIMGLDRGPVSI---ISKTNISY--FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVT 320
G++G VS+ +++ NIS F +CL G + G T+ + + YTP+V
Sbjct: 178 GLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVSGRGSLVIG---TIREPDLVYTPMVF 234
Query: 321 TPEQSEFYHITLTGISVGGERLPLKASYFTKLS--TEIDSGTIITRFPAPVYSALR---S 375
+ Y++ L I + G + AS+ + + IDSGT +T P Y R S
Sbjct: 235 GEDH---YNVQLLNIGISGRNVTTPASFDLEYTGGVIIDSGTTLTYLVQPAYDEFRRGVS 291
Query: 376 AFRK 379
F++
Sbjct: 292 VFKQ 295
>gi|388520263|gb|AFK48193.1| unknown [Lotus japonicus]
Length = 157
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/155 (41%), Positives = 90/155 (58%), Gaps = 3/155 (1%)
Query: 327 FYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRM-KKYK 385
Y + LT I+VGG+ L L AS + K+ T IDSGT+ITR P PVY+AL+++F + M KKY
Sbjct: 5 LYGLDLTAITVGGKPLGLAASSY-KVPTIIDSGTVITRLPMPVYTALKNSFVRIMSKKYA 63
Query: 386 MGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLP 445
GI + DTC+ + + VP+I + F GG DL L TL+ CL A
Sbjct: 64 QAPGIS-ILDTCFKGNVKEMSEVPEIQMIFGGGADLPLKAHNTLIELDKGVTCLAIAGSS 122
Query: 446 SDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ ++GN QQ+ ++V YDVA ++GF G C
Sbjct: 123 ENNPIAIIGNYQQQTFKVAYDVANSKIGFAAGGCQ 157
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 110/414 (26%), Positives = 177/414 (42%), Gaps = 53/414 (12%)
Query: 91 RDQQRLHLKNSR--------RLQKAIPDNFKKTKAFTFPAKTG-IVAADEYYIVVAIGKP 141
+D+ L +++S R++ ++ N KA P+ TG + A+ ++IG+P
Sbjct: 57 KDRMELDIQHSAARLANIQARIEGSLVSN-NDYKARVSPSLTGRTIMAN-----ISIGQP 110
Query: 142 KQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPP 201
+++DTGS I W C PC +C FDPSKS TFS + CK P
Sbjct: 111 PIPQLVVMDTGSDILWVMCTPCTNCDNDLGLLFDPSKSSTFSPL------CKT------P 158
Query: 202 NGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQ 261
+ C P+ + Y D S +G + D + E G L GC N D
Sbjct: 159 CDFEGCRCDPIPFTVTYADNSTASGTFGRDTVVF-ETTDEGTSRISDVLFGCGHNIGHDT 217
Query: 262 N-GASGIMGLDRGPVSIISKTNISYFFYC---LHSPYGSTGYITFGKPDTVNKKFVKYTP 317
+ G +GI+GL+ GP S+++K F YC L PY + + G+ +
Sbjct: 218 DPGHNGILGLNNGPDSLVTKLG-QKFSYCIGNLADPYYNYHQLILGEGADLEG------- 269
Query: 318 IVTTPEQ--SEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRFPAPVY 370
+TP + + FY++T+ GISVG +RL + F ID+G+ IT V+
Sbjct: 270 -YSTPFEVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTITFLVDSVH 328
Query: 371 SALRSAFRKRMK-KYKMGKGIEDLFDTCYDLSAYKTVV-VPKITIHFLGGVDLELDVRGT 428
L R + ++ + + C+ S + +V P +T HF G DL LD
Sbjct: 329 KLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVVTFHFSDGADLALDSGSF 388
Query: 429 LVVESVRQVCLGFALLPS---DPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ C+ + S L+G + Q+ Y V YD+ + + F +C
Sbjct: 389 FNQLNDNVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDLVNQFVYFQRIDC 442
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 110/401 (27%), Positives = 160/401 (39%), Gaps = 42/401 (10%)
Query: 97 HLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGIT 156
H R LQ + ++ + F ++ Y + IG P Q +L++DTGS +T
Sbjct: 61 HFNPRRHLQGSQSEHHPNARMRLF---DDLLRNGYYTTRLWIGTPPQRFALIVDTGSTVT 117
Query: 157 WTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS--KECPY 214
+ C C HC +DP F P S+T+ + C W Q C K+C Y
Sbjct: 118 YVPCSTCKHCGSHQDPKFRPEASETYQPVKCT---------W-----QCNCDDDRKQCTY 163
Query: 215 DIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD--QNGASGIMGLDR 272
+ Y + S +G D + + + GC ++ TGD A GIMGL R
Sbjct: 164 ERRYAEMSTSSGVLGED---VVSFGNQSELSPQRAIFGCENDETGDIYNQRADGIMGLGR 220
Query: 273 GPVSI----ISKTNIS-YFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEF 327
G +SI + K IS F C G + G + + P +S +
Sbjct: 221 GDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLGGISPPADMVFTH----SDPVRSPY 276
Query: 328 YHITLTGISVGGERLPLKASYFT-KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKM 386
Y+I L I V G+RL L F K T +DSGT P + A + A K K
Sbjct: 277 YNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKR 336
Query: 387 GKGIEDLF-DTCYDLSAYKTVVV----PKITIHFLGGVDLELDVRGTLVVES-VR-QVCL 439
G + + D C+ + + P + + F G L L L S VR CL
Sbjct: 337 ISGPDPHYNDICFSGAEINVSQLSKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCL 396
Query: 440 GFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G +DP + LLG + R V YD ++GF NC+
Sbjct: 397 GVFSNGNDPTT-LLGGIVVRNTLVMYDREHSKIGFWKTNCS 436
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 103/382 (26%), Positives = 163/382 (42%), Gaps = 61/382 (15%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-------------PFFDPSK 178
YY + +G P Q+++ ++DTGS I W +CK C CS +++ +DP
Sbjct: 88 YYAQIGVGHPVQFLNAIVDTGSDILWFKCKLCQGCSSKKNVIVCSSIIMQGPITLYDPEL 147
Query: 179 SKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEV 238
S T S C+ C G + ++ C YDI+Y D S TG + D + +
Sbjct: 148 SITASPATCSDPLCS-------EGGSCRGNNNSCAYDISYEDTSSSTGIYFRDVVHL--- 197
Query: 239 NGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSI---ISKTNISY--FFYCLHSP 293
G+ LGC + +G GIMG R VS+ ++ SY F++CL
Sbjct: 198 -GHKASLNTTMFLGCATSISGLWP-VDGIMGFGRSKVSVPNQLAAQAGSYNIFYHCLSGE 255
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
G + GK D + + YTP++ Y++ L +SV + LP++AS F +
Sbjct: 256 KEGGGILVLGKNDEFPE--MVYTPMLA---NDIVYNVKLVSLSVNSKALPIEASEFEYNA 310
Query: 354 TE------IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCY-DLSAYKTV 406
T IDSGT FP+ + A K +E C+ +S +V
Sbjct: 311 TVGNGGTIIDSGTSSATFPSKALALFVKAVSKFTTAIPTAP-LESSGSPCFISISDRNSV 369
Query: 407 VV--PKITIHFLGGVDLELDVRGTLVV------------ESVRQVCLGFALLPSDPNSIL 452
V P +T+ F GG +EL L + VR VC+ +++ NS +
Sbjct: 370 EVDFPNVTLKFDGGATMELTAHNYLEAVVSRKLSESTHFQGVRLVCISWSV----GNSTI 425
Query: 453 LGNVQQRGYEVHYDVAGRRLGF 474
LG+ + V YD+ R+G+
Sbjct: 426 LGDAILKDKVVVYDMEKSRIGW 447
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 102/408 (25%), Positives = 165/408 (40%), Gaps = 46/408 (11%)
Query: 94 QRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGS 153
+ L + RRL++ +P+ AF YY + +G P Q + +DTGS
Sbjct: 14 RTLREHDQRRLRRILPE----VVAFPISGDDDTFTTGLYYTRIYLGTPPQQFYVHVDTGS 69
Query: 154 GITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCS 208
+ W C PC +C + + FDP KS + + I C C + N + +
Sbjct: 70 DVAWVNCVPCTNCKRASNVALPISIFDPEKSTSKTSISCTDEEC-----YLASNSKCSFN 124
Query: 209 SKECPYDIAYVDGSGETGFWATDRMTIQEV---NGNGYFARYPFLLGCTDNNTGDQNGAS 265
S CPY Y DGS G+ D ++ +V N GC N TG
Sbjct: 125 SMSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLTFGCGSNQTGTWL-TD 183
Query: 266 GIMGLDRGPVSI---ISKTNISY--FFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVT 320
G++G + VS+ +SK N+S F +CL +G + G + + + YTPIV
Sbjct: 184 GLVGFGQAEVSLPSQLSKQNVSVNIFAHCLQGDNKGSGTLVIGH---IREPGLVYTPIV- 239
Query: 321 TPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEI--DSGTIITRFPAPVYSALRSAFR 378
P+QS Y++ L I V G + ++ S + DSGT +T P Y ++ R
Sbjct: 240 -PKQSH-YNVELLNIGVSGTNVTTPTAFDLSNSGGVIMDSGTTLTYLVQPAYDQFQAKVR 297
Query: 379 KRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVC 438
M+ + + F T + Y P +T++F GG + L L E +
Sbjct: 298 DCMRSGVLPVAFQ-FFCT---IEGY----FPNVTLYFAGGAAMLLSPSSYLYKEMLTTGL 349
Query: 439 LGFALLPSDPNSI-------LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ + S+ + G+ + V YD R+G+ +C
Sbjct: 350 SAYCFSWLESTSVYGYLSYTIFGDNVLKDQLVVYDNVNNRIGWKNFDC 397
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 160/371 (43%), Gaps = 39/371 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQ-----RDPFFDPSKSKTFSKIP 186
Y+ + +G P + + +DTGS I W CKPC C + R FD + S T K+
Sbjct: 74 YFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVG 133
Query: 187 CNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGN---G 242
C+ C + + D C + C Y I Y D S G + D +T+++V G+ G
Sbjct: 134 CDDDFCSFISQ------SDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTG 187
Query: 243 YFARYPFLLGCTDNNTGD-QNGAS---GIMGLDRGPVSIISKTNIS-----YFFYCLHSP 293
+ + GC + +G NG S G+MG + S++S+ + F +CL +
Sbjct: 188 PLGQ-EVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNV 246
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
G G G V+ VK TP+V P Q Y++ L G+ V G L L S
Sbjct: 247 KGG-GIFAVG---VVDSPKVKTTPMV--PNQMH-YNVMLMGMDVDGTSLDLPRSIVRNGG 299
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITI 413
T +DSGT + FP +Y +L R + K+ +E+ F C+ S P ++
Sbjct: 300 TIVDSGTTLAYFPKVLYDSLIETILAR-QPVKL-HIVEETFQ-CFSFSTNVDEAFPPVSF 356
Query: 414 HFLGGVDLELDVRGTLVVESVRQVCLGFAL--LPSDPNS--ILLGNVQQRGYEVHYDVAG 469
F V L + L C G+ L +D S ILLG++ V YD+
Sbjct: 357 EFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDN 416
Query: 470 RRLGFGPGNCN 480
+G+ NC+
Sbjct: 417 EVIGWADHNCS 427
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 115/420 (27%), Positives = 174/420 (41%), Gaps = 48/420 (11%)
Query: 90 RRDQQRLHLKN-----SRRLQKAIPDNFK-KTKAFTFPAKTGIVAADEYYIVVAIGKPKQ 143
R D HL N +RR +++ P +TG+ Y+ + IG P +
Sbjct: 38 RHDGSGKHLANLRAHDARRHGRSLAAAVDLPLGGNGLPTETGL-----YFTQIGIGTPAK 92
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKILLEW 198
+ +DTGS I W C C C ++ +DPS S + + + C C
Sbjct: 93 SYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGTGVTCGQDFCVATHGG 152
Query: 199 FPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGY--FARYPFLLGCTDN 256
P+ + C Y I+Y DGS TGF+ TD + +V+GN A GC
Sbjct: 153 VIPS---CVPAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQTTLANTSITFGCGAK 209
Query: 257 NTGDQNGAS----GIMGLDRGPVSIISKTNIS-----YFFYCLHSPYGSTGYITFGKPDT 307
GD +S GI+G + S++S+ + F +CL + G F D
Sbjct: 210 IGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCLDTINGGG---IFAIGDV 266
Query: 308 VNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF---TKLSTEIDSGTIITR 364
V K V TP+V Y++ L I VGG +L L + F T IDSGT +
Sbjct: 267 VQPK-VSTTPLVPGMPH---YNVNLEAIDVGGVKLQLPTNIFDIGESKGTIIDSGTTLAY 322
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELD 424
P VY+A+ S + + K +D C+ S P IT HF GG+ L +
Sbjct: 323 LPGVVYNAIMSKVFAQYGDMPL-KNDQDF--QCFRYSGSVDDGFPIITFHFEGGLPLNIH 379
Query: 425 VRGTLVVESVRQVCLGF---ALLPSD-PNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L ++ C+GF L D + +LLG++ V YD+ + +G+ NC+
Sbjct: 380 PHDYL-FQNGELYCMGFQTGGLQTKDGKDMVLLGDLAFSNRLVLYDLENQVIGWTDYNCS 438
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 108/413 (26%), Positives = 166/413 (40%), Gaps = 53/413 (12%)
Query: 83 PSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPK 142
P +E+ RR RLH Q +P+ K +++ Y + IG P
Sbjct: 48 PRVEDFRRR---RLH-------QSQLPNAHMKLY-------DDLLSNGYYTTRLWIGTPP 90
Query: 143 QYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPN 202
Q +L++DTGS +T+ C C C + +DP F P S ++ + CN P+
Sbjct: 91 QEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYKALKCN------------PD 138
Query: 203 GQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-- 260
K C Y+ Y + S +G + D ++ + GC + TGD
Sbjct: 139 CNCDDEGKLCVYERRYAEMSSSSGVLSEDLISF---GNESQLTPQRAVFGCENVETGDLF 195
Query: 261 QNGASGIMGLDRGPVSIISK-----TNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKY 315
A GIMGL RG +S++ + F C G + GK +
Sbjct: 196 SQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKISPPAGMVFSH 255
Query: 316 TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGTIITRFPAPVYSALR 374
+ P +S +Y+I L + V G+ L L F K T +DSGT FP + A++
Sbjct: 256 ----SDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYFPKEAFIAIK 311
Query: 375 SAFRKRMKKYKMGKGIEDLF-DTCYDLSAYKTVVV----PKITIHFLGGVDLELDVRGTL 429
A K + K G + + D C+ + + P+I + F G L L L
Sbjct: 312 DAIIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIDMEFGNGQKLILSPENYL 371
Query: 430 VVES-VRQV-CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ VR CLG + P ++ LLG + R V YD +LGF NC+
Sbjct: 372 FRHTKVRGAYCLG--IFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCS 422
>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
Length = 434
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 124/436 (28%), Positives = 193/436 (44%), Gaps = 47/436 (10%)
Query: 57 PGKVSLEVLGRYGPCSKLNQGKSRNTPS-LEEILRRDQQRLHLKNSRRLQKAIPDN-FKK 114
P L V+ YG CS N K+ + + + + +D R+ +S QK +
Sbjct: 30 PDDSDLNVIPMYGKCSPFNPQKTDSWDNRVLNMASKDPARMSYLSSLVAQKTVSSAPIAS 89
Query: 115 TKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFF 174
+AF Y + V IG P Q + ++LDT + + CI CS F
Sbjct: 90 GQAFNI---------GNYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGCSATT---F 137
Query: 175 DPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMT 234
P+ S ++ + C+ C + P S C ++ +Y GS + D +
Sbjct: 138 SPNASTSYVPLECSVPQCSQVRGLSCP----ATGSGACSFNKSYA-GSTYSATLVQDSLR 192
Query: 235 IQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLH 291
+ Y F G + +G A G++GL RGP+S++S+T Y F YCL
Sbjct: 193 L----ATDVIPSYSF--GSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLP 246
Query: 292 S--PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF 349
S Y +G + G K ++ TP++ P + Y + LTGI+VG +P
Sbjct: 247 SFKSYYFSGSLKLGP--VGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELL 304
Query: 350 -----TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYK 404
T T IDSGT+ITRF PVY+A+R FRK++ G FDTC+ + Y+
Sbjct: 305 AFDVNTGSGTIIDSGTVITRFVEPVYNAVRDEFRKQVTGPFSSLGA---FDTCF-VKNYE 360
Query: 405 TVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSILL---GNVQQRG 460
T + P IT+HF +DL+L + +L+ S + CL A P + N +L N QQ+
Sbjct: 361 T-LAPAITLHFT-DLDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQN 418
Query: 461 YEVHYDVAGRRLGFGP 476
V +D + + P
Sbjct: 419 LRVLFDTVNNKGWYCP 434
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 100/367 (27%), Positives = 147/367 (40%), Gaps = 42/367 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + IG P Q +L++DTGS +T+ C C HC + +DP F P +S T+ + CN
Sbjct: 88 YTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQDPRFQPDESSTYHPVKCN-MD 146
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
C C Y+ Y + S +G D I +
Sbjct: 147 CNC-----------DHDGVNCVYERRYAEMSSSSGVLGED---IISFGNQSEVVPQRAVF 192
Query: 252 GCTDNNTGD--QNGASGIMGLDRGPVSIISK---TNI--SYFFYCLHSPYGSTGYITFG- 303
GC + TGD A GIMGL RG +SI+ + N+ F C + G + G
Sbjct: 193 GCENVETGDLYSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGGGAMVLGG 252
Query: 304 ---KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSG 359
PD V + + P +S +Y+I L I V G+ L L S F K T +DSG
Sbjct: 253 IPPPPDMVFSR--------SDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDSG 304
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCY-----DLSAYKTVVVPKITI 413
T P + A R A K+ K G + + D C+ D+S P++ +
Sbjct: 305 TTYAYLPEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQLSK-AFPEVDM 363
Query: 414 HFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
F G L L L + + + ++ LLG + R V YD ++G
Sbjct: 364 VFSNGQKLSLTPENYLFQHTKVHGAYCLGIFRNGDSTTLLGGIIVRNTLVTYDRENEKIG 423
Query: 474 FGPGNCN 480
F NC+
Sbjct: 424 FWKTNCS 430
>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
Length = 513
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 96/366 (26%), Positives = 167/366 (45%), Gaps = 40/366 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP--------FFDPSKSKTFS 183
+Y VVA+G P + LDTGS + W C C+ C+ + P + P++S T
Sbjct: 99 HYAVVALGTPNVTFLVALDTGSDLFWVPCD-CLKCAPLQSPNYGSLKFDVYSPAQSTTSR 157
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCSSK--ECPYDIAYV-DGSGETGFWATDRMTIQEVNG 240
K+PC+S C + Q+ C SK CPY I Y+ D + +G D + + +
Sbjct: 158 KVPCSSNLCDL---------QNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSA 208
Query: 241 NGYFARYPFLLGCTDNNTGDQNGAS---GIMGL---DRGPVSIISKTNISYFFYCLHSPY 294
P + GC TG G++ G++GL + S+++ ++ + +
Sbjct: 209 QSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGD 268
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLST 354
G I FG + ++ K TP+ +Q+ +Y+IT+TGI+VG + + T+ S
Sbjct: 269 DGHGRINFGDTGSSDQ---KETPL-NVYKQNPYYNITITGITVGSKSIS------TEFSA 318
Query: 355 EIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
+DSGT T P+Y+ + S+F +++ + F+ CY +SA +V P +++
Sbjct: 319 IVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSA-NGIVHPNVSLT 377
Query: 415 FLGGVDLEL-DVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
GG + D T+ + V A++ S+ L+G G +V +D LG
Sbjct: 378 AKGGSIFPVNDPIITITDNAFNPVGYCLAIMKSE-GVNLIGENFMSGLKVVFDRERMVLG 436
Query: 474 FGPGNC 479
+ NC
Sbjct: 437 WKNFNC 442
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 111/415 (26%), Positives = 174/415 (41%), Gaps = 33/415 (7%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPKQ 143
+E+I+ DQ+R L + +R K +GI +Y+ V +G P +
Sbjct: 49 IEDIIGADQKRHSLISRKRKFKG---------GVKMDLGSGIDYGTAQYFTEVRVGTPAK 99
Query: 144 YVSLLLDTGSGITWTQC--KPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPP 201
+++DTGS +TW C + + F +SK+F + C + TCK+ L
Sbjct: 100 KFRVVVDTGSELTWVNCRYRGRGKGKVKNRRVFRAEESKSFKTVGCFTQTCKVDLMNLFS 159
Query: 202 NGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGC-TDNNTGD 260
S C YD Y DGS G +A + +T+ NG R L+GC + +
Sbjct: 160 LSTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRKARLR-GLLVGCSSSFSGQS 218
Query: 261 QNGASGIMGLDRGPVSIISKTNISYF----FYCL---HSPYGSTGYITFGKPDTVNKKFV 313
GA G++GL S S T S F YCL S + Y+ FG + +
Sbjct: 219 FQGADGVLGLAFSDFSFTS-TATSLFGAKLSYCLVDHLSNKNISNYLIFGY--SSSSTST 275
Query: 314 KYTPIVTTPEQ----SEFYHITLTGISVGGERLPLKASYF---TKLSTEIDSGTIITRFP 366
K P TTP FY I + GIS+G + L + + T T +DSGT +T
Sbjct: 276 KTAPGRTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQVWDATTGGGTILDSGTSLTLLA 335
Query: 367 APVYSALRSAFRKRMKKYKMGKGIEDLFDTCY-DLSAYKTVVVPKITIHFLGGVDLELDV 425
Y + + + + + K K + C+ S + +P++T H GG E
Sbjct: 336 EAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGFNESKLPQLTFHLKGGARFEPHR 395
Query: 426 RGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ LV + CLGF + P + ++GN+ Q+ Y +D+ L F P C
Sbjct: 396 KSYLVDAAPGVKCLGF-MSAGTPATNVVGNIMQQNYLWEFDLMASTLSFAPSTCT 449
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 105/411 (25%), Positives = 168/411 (40%), Gaps = 50/411 (12%)
Query: 99 KNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWT 158
K R++ A + P K + +YY + IG P + L +DTGS +TW
Sbjct: 154 KARNRMEVAKAATARTNSTALLPIKGNVFPDGQYYTSIFIGNPPRPYFLDVDTGSDLTWI 213
Query: 159 QCK-PCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS-KECPYDI 216
QC PC +C++ P + P+K K +P C+ L Q+ C + K+C Y+I
Sbjct: 214 QCDAPCTNCAKGPHPLYKPAKEKI---VPPRDLLCQEL-----QGNQNYCETCKQCDYEI 265
Query: 217 AYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD----QNGASGIMGLDR 272
Y D S G A D M + + NG + F+ GC + G GI+GL
Sbjct: 266 EYADQSSSMGVLARDDMHM--IATNGGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSS 323
Query: 273 GPVSIISKTN-----ISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEF 327
+S S+ + F +C+ G GY+ G D V + V +T I + P+
Sbjct: 324 AAISFPSQLASHGIIANVFGHCITREQGGGGYMFLGD-DYVPRWGVTWTSIRSGPD--NL 380
Query: 328 YHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMG 387
YH + G ++L + + DSG+ T P +Y L +A KY
Sbjct: 381 YHTQAHHVKYGDQQLRRPEQAGSTVQVIFDSGSSYTYLPNEIYENLVAAI-----KYASP 435
Query: 388 KGIEDLFDT----CYD-------LSAYKTVVVPKITIHF-----LGGVDLELDVRGTLVV 431
++D D C+ L K P + +HF + L++
Sbjct: 436 GFVQDTSDRTLPLCWKADFPVRYLEDVKQFFEP-LNLHFGKKWLFMSKTFTISPEDYLII 494
Query: 432 ESVRQVCLGFALLPSDPN---SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
VCLG L ++ N +I++G+V RG V YD +++G+ +C
Sbjct: 495 SDKGNVCLGL-LNGTEINHGSTIIVGDVSLRGKLVVYDNQRKQIGWADSDC 544
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 105/367 (28%), Positives = 153/367 (41%), Gaps = 41/367 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCN-ST 190
Y + IG P Q +L++D+GS +T+ C C C +DP F P S ++S + CN
Sbjct: 89 YTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCNVDC 148
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
TC DK K+C Y+ Y + S +G D I +
Sbjct: 149 TCD----------SDK---KQCTYERQYAEMSSSSGVLGED---IVSFGRESELKPQRAV 192
Query: 251 LGCTDNNTGD--QNGASGIMGLDRGPVSI----ISKTNISYFFYCLHS--PYGSTGYITF 302
GC ++ TGD A GIMGL RG +SI + K IS F + G +
Sbjct: 193 FGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLG 252
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-TKLSTEIDSGTI 361
G P + F P+ +S +Y+I L I V G+ L + + F +K T +DSGT
Sbjct: 253 GVPAPSDMVFSHSDPL-----RSPYYNIELKEIHVAGKALRVDSRVFNSKHGTVLDSGTT 307
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYKTV-----VVPKITIHF 415
P + A + A ++ K +G + + D C+ A + V V P + + F
Sbjct: 308 YAYLPEQAFVAFKDAVTSKVHSLKKIRGPDPNYKDICF-AGAGRNVSKLHEVFPDVDMVF 366
Query: 416 LGGVDLELDVRGTLVVESVRQ--VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
G L L L S CLG DP + LLG + R V YD ++G
Sbjct: 367 GNGQKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTT-LLGGIIVRNTLVTYDRHNEKIG 425
Query: 474 FGPGNCN 480
F NC+
Sbjct: 426 FWKTNCS 432
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 100/377 (26%), Positives = 167/377 (44%), Gaps = 50/377 (13%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+ + +G P Q V+++LDTGS ++W CK + + F+P S +++ PCNS+ C
Sbjct: 62 VSLTVGSPPQNVTMVLDTGSELSWLHCKKLPNLNST----FNPLLSSSYTPTPCNSSICT 117
Query: 194 ILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGC 253
++K C ++Y D S G A + ++ G L GC
Sbjct: 118 TRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGT------LFGC 171
Query: 254 TD-----NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTV 308
D ++ + + +G+MG++RG +S++++ ++ F YC+ S + G + G T
Sbjct: 172 MDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQMSLPKFSYCI-SGEDALGVLLLGD-GTD 229
Query: 309 NKKFVKYTPIVTTPEQSEF-----YHITLTGISVGGERLPLKASYFT-----KLSTEIDS 358
++YTP+VT S + Y + L GI V + L L S F T +DS
Sbjct: 230 APSPLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDS 289
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED-------LFDTCYDLSAYKTVVVPKI 411
GT T VYS+L+ F ++ K + IED D CY A VP +
Sbjct: 290 GTQFTFLLGSVYSSLKDEFLEQTK--GVLTRIEDPNFVFEGAMDLCYHAPA-SFAAVPAV 346
Query: 412 TIHFLGGVDLELDVRGTLVVESVRQ-----VCLGFALLPSDPNSI---LLGNVQQRGYEV 463
T+ F G E+ V G ++ V + C F SD I ++G+ Q+ +
Sbjct: 347 TLVFSGA---EMRVSGERLLYRVSKGSDWVYCFTFG--NSDLLGIEAYVIGHHHQQNVWM 401
Query: 464 HYDVAGRRLGFGPGNCN 480
+D+ R+GF C+
Sbjct: 402 EFDLLKSRVGFTQTTCD 418
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 101/373 (27%), Positives = 151/373 (40%), Gaps = 61/373 (16%)
Query: 138 IGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLE 197
IG P Q S ++D + WTQC C C +Q P F P+ S TF PC + CK +
Sbjct: 73 IGTPPQPASAIIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRPEPCGTDACKSI-- 130
Query: 198 WFPPNGQDKCSSKECPYD--IAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTD 255
CSS C Y+ I G G ATD I + F GC
Sbjct: 131 -----PTSNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIGTATASLGF-------GCVV 178
Query: 256 NNTGD-QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY----------GSTGYITFGK 304
+ D G SG++GL R P S++S+ NI+ F YCL +P+ GS+ + G
Sbjct: 179 ASGIDTMGGPSGLIGLGRAPSSLVSQMNITKFSYCL-TPHDSGKNSRLLLGSSAKLAGGG 237
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITR 364
++ FVK +P + S++Y I L GI G + L S T++ +
Sbjct: 238 -NSTTTPFVKTSP---GDDMSQYYPIQLDGIKAGDAAIALPPS----------GNTVLVQ 283
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKGIEDL------FDTCYDLSAYKTVVVPKITIHFLGG 418
AP+ + SA++ K+ G FD C+ + P + F G
Sbjct: 284 TLAPMSFLVDSAYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQG 343
Query: 419 VDL--------ELDV---RGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
+DV +GT+ + + L L D N +LG++QQ D+
Sbjct: 344 AAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTAL--DENLNILGSLQQENTHFLLDL 401
Query: 468 AGRRLGFGPGNCN 480
+ L F P +C+
Sbjct: 402 EKKTLSFEPADCS 414
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 106/367 (28%), Positives = 153/367 (41%), Gaps = 41/367 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + IG P Q +L++DTGS +T+ C C HC +DP F P S+T+ + C
Sbjct: 93 YTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHCGSHQDPKFRPEDSETYQPVKCT--- 149
Query: 192 CKILLEWFPPNGQDKCSS--KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
W Q C + K+C Y+ Y + S +G D ++ +
Sbjct: 150 ------W-----QCNCDNDRKQCTYERRYAEMSTSSGALGEDVVSF---GNQTELSPQRA 195
Query: 250 LLGCTDNNTGD--QNGASGIMGLDRGPVSI----ISKTNIS-YFFYCLHSPYGSTGYITF 302
+ GC ++ TGD A GIMGL RG +SI + K IS F C G +
Sbjct: 196 IFGCENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVL 255
Query: 303 GK-PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGT 360
G + F + P+ +S +Y+I L I V G+RL L F K T +DSGT
Sbjct: 256 GGISPPADMVFTRSDPV-----RSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGT 310
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYKTVVV----PKITIHF 415
P + A + A K K G + + D C+ + + P + + F
Sbjct: 311 TYAYLPESAFLAFKHAIMKETHSLKRISGPDPRYNDICFSGAEIDVSQISKSFPVVEMVF 370
Query: 416 LGGVDLELDVRGTLVVES-VR-QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
G L L L S VR CLG +DP + LLG + R V YD ++G
Sbjct: 371 GNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTT-LLGGIVVRNTLVMYDREHTKIG 429
Query: 474 FGPGNCN 480
F NC+
Sbjct: 430 FWKTNCS 436
>gi|147776519|emb|CAN74010.1| hypothetical protein VITISV_003547 [Vitis vinifera]
Length = 429
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 71/197 (36%), Positives = 105/197 (53%), Gaps = 22/197 (11%)
Query: 292 SPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-- 349
+PY S G+P K ++ TP++ P + Y++ LTG+SVG +P+
Sbjct: 247 APYASD---PLGQP-----KNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAF 298
Query: 350 ---TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV 406
T T IDSGT+ITRF PVY+A+R FRK++K G FDTC+ +A
Sbjct: 299 DPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKGPFATIGA---FDTCF--AATNED 353
Query: 407 VVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL--LGNVQQRGYEV 463
+ P +T HF G+DL+L + TL+ S + CL A P++ NS+L + N+QQ+ +
Sbjct: 354 IAPPVTFHFT-GMDLKLPLENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRI 412
Query: 464 HYDVAGRRLGFGPGNCN 480
+DV RLG CN
Sbjct: 413 MFDVTNSRLGIARELCN 429
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 104/365 (28%), Positives = 151/365 (41%), Gaps = 39/365 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + IG P Q +L++DTGS +T+ C C C + +DP F P S T+ + C +
Sbjct: 84 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC-TID 142
Query: 192 CKILLEWFPPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C C S +C Y+ Y + S +G D ++ A
Sbjct: 143 C-------------NCDSDRMQCVYERQYAEMSTSSGVLGEDLISF---GNQSELAPQRA 186
Query: 250 LLGCTDNNTGD--QNGASGIMGLDRGPVSI----ISKTNIS-YFFYCLHSPYGSTGYITF 302
+ GC + TGD A GIMGL RG +SI + K IS F C G +
Sbjct: 187 VFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVGGGAMVL 246
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGTI 361
G + Y + P +S +Y+I L I V G+RLPL A+ F K T +DSGT
Sbjct: 247 GGISPPSDMAFAY----SDPVRSPYYNIDLKEIHVAGKRLPLNANVFDGKHGTVLDSGTT 302
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYKTVVV----PKITIHFL 416
P + A + A K ++ K G + + D C+ + + P + + F
Sbjct: 303 YAYLPEAAFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMVFE 362
Query: 417 GGVDLELDVRGTLVVES-VRQV-CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
G L + S VR CLG +D + LLG + R V YD ++GF
Sbjct: 363 NGQKYTLSPENYMFRHSKVRGAYCLGVFQNGND-QTTLLGGIIVRNTLVVYDREQTKIGF 421
Query: 475 GPGNC 479
NC
Sbjct: 422 WKTNC 426
>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
Group]
Length = 476
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 96/366 (26%), Positives = 167/366 (45%), Gaps = 40/366 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP--------FFDPSKSKTFS 183
+Y VVA+G P + LDTGS + W C C+ C+ + P + P++S T
Sbjct: 62 HYAVVALGTPNVTFLVALDTGSDLFWVPCD-CLKCAPFQSPNYGSLKFDVYSPAQSTTSR 120
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCSSK--ECPYDIAYV-DGSGETGFWATDRMTIQEVNG 240
K+PC+S C + Q+ C SK CPY I Y+ D + +G D + + +
Sbjct: 121 KVPCSSNLCDL---------QNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSA 171
Query: 241 NGYFARYPFLLGCTDNNTGDQNGAS---GIMGL---DRGPVSIISKTNISYFFYCLHSPY 294
P + GC TG G++ G++GL + S+++ ++ + +
Sbjct: 172 QSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGD 231
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLST 354
G I FG + ++ K TP+ +Q+ +Y+IT+TGI+VG + + T+ S
Sbjct: 232 DGHGRINFGDTGSSDQ---KETPL-NVYKQNPYYNITITGITVGSKSIS------TEFSA 281
Query: 355 EIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
+DSGT T P+Y+ + S+F +++ + F+ CY +SA +V P +++
Sbjct: 282 IVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSA-NGIVHPNVSLT 340
Query: 415 FLGGVDLEL-DVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
GG + D T+ + V A++ S+ L+G G +V +D LG
Sbjct: 341 AKGGSIFPVNDPIITITDNAFNPVGYCLAIMKSE-GVNLIGENFMSGLKVVFDRERMVLG 399
Query: 474 FGPGNC 479
+ NC
Sbjct: 400 WKNFNC 405
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 109/428 (25%), Positives = 172/428 (40%), Gaps = 83/428 (19%)
Query: 103 RLQKAIPD-----NFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITW 157
RLQ A P F+ + T P VA+G P Q V+++LDTGS ++W
Sbjct: 43 RLQAASPPPANRLRFRHNVSLTVP--------------VAVGTPPQNVTMVLDTGSELSW 88
Query: 158 TQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIA 217
C H D FD S S +++ +PC+S C L P + C S C ++
Sbjct: 89 LLCNGSRH-----DAPFDASASSSYAPVPCSSPACTWLGRDLPV--RPFCDSSACRVSLS 141
Query: 218 YVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGC----TDNNTGDQNGASGIMGLDRG 273
Y D S G A D + + P L GC + + + +G++G++RG
Sbjct: 142 YADASSADGLLAADTFLLGS-------SPMPALFGCITSYSSSTDPSETPPTGLLGMNRG 194
Query: 274 PVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTVN------KKFVKYTPIVTTPEQSEF 327
+S +++T F YC+ + G G + G DT ++ + YTP+V + +
Sbjct: 195 GLSFVTQTATRRFAYCIAAGQGP-GILLLGGNDTETPLTSPPQQQLNYTPLVEISQPLPY 253
Query: 328 -----YHITLTGISVGGERLPLKASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAF 377
Y + L GI VG L + T T +DSGT T Y+AL++ F
Sbjct: 254 FDRAAYTVQLEGIRVGSALLAIPKHLLTPDHTGAGQTMVDSGTRFTFLLPDAYAALKAEF 313
Query: 378 RKRMKKYKMGKGIEDL----------FDTCYD------LSAYKTVVVPKI-------TIH 414
++ + + G+ L FD C+ +A ++P++ +
Sbjct: 314 ANQLTR-SLDGGLAPLGEPGFVFQGAFDACFRGTEARVSAAAAGGLLPEVGLVLRGAEVV 372
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSI---LLGNVQQRGYEVHYDVAGRR 471
G L V G E CL F SD + ++G+ Q+ V YD+ R
Sbjct: 373 VAGAEKLLYRVPGERRGEGEGVWCLTFG--SSDMAGVSAYVIGHHHQQDVWVEYDLRNAR 430
Query: 472 LGFGPGNC 479
LGF C
Sbjct: 431 LGFAAARC 438
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 151/370 (40%), Gaps = 48/370 (12%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y V IG P SL++DTGS +T+ C C HC +DP F P+ S ++ + C S
Sbjct: 35 YTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSCTHCGNHQDPRFSPALSSSYKPLECGS-- 92
Query: 192 CKILLEWFPPNGQDKCSSKEC----PYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARY 247
+CS+ C Y Y + S +G D + + G
Sbjct: 93 --------------ECSTGFCDGSRKYQRQYAEKSTSSGVLGKDVIGFSNSSDLG---GQ 135
Query: 248 PFLLGCTDNNTGD--QNGASGIMGLDRGPVSIISK-----TNISYFFYCLHSPYGSTGYI 300
+ GC TGD A GI+GL RGP+SII + F C G +
Sbjct: 136 RLVFGCETAETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAM 195
Query: 301 TFG--KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEID 357
G +P K + +T + P +S +Y++ L GI VGG L LK F K T +D
Sbjct: 196 ILGGFQP----PKDMVFT--ASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGKYGTVLD 249
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYK----TVVVPKIT 412
SGT FP + A +SA ++++ K G ++ F D CY + + P +
Sbjct: 250 SGTTYAYFPGAAFQAFKSAVKEQVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVD 309
Query: 413 IHFLGGVDLELDVRGTLVVES--VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR 470
F G + L L + CLG DP + LLG + R V Y+
Sbjct: 310 FVFGDGQSVTLSPENYLFRHTKISGAYCLG-VFENGDPTT-LLGGIIVRNMLVTYNRGKA 367
Query: 471 RLGFGPGNCN 480
+GF CN
Sbjct: 368 SIGFLKTKCN 377
>gi|326526699|dbj|BAK00738.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 182
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/167 (37%), Positives = 90/167 (53%), Gaps = 5/167 (2%)
Query: 314 KYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSAL 373
YTP+V++ Y I L+G++V G+ L + +S ++ L T IDSGT+ITR P VY AL
Sbjct: 21 SYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEYSSLPTIIDSGTVITRLPTTVYDAL 80
Query: 374 RSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVES 433
A MK K + DTC+ + ++ VP +++ F GG L+L + LV
Sbjct: 81 SKAVAGAMKGTKRADAYS-ILDTCF-VGQASSLRVPAVSMAFSGGAALKLSAQNLLVDVD 138
Query: 434 VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
CL FA P+ +I +GN QQ+ + V YDV R+GF G C
Sbjct: 139 SSTTCLAFA--PARSAAI-IGNTQQQTFSVVYDVKSNRIGFAAGGCT 182
>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
Length = 490
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 97/366 (26%), Positives = 167/366 (45%), Gaps = 40/366 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP--------FFDPSKSKTFS 183
+Y VVA+G P + LDTGS + W C C+ C+ + P + P++S T
Sbjct: 76 HYAVVALGTPNVTFLVALDTGSDLFWVPCD-CLKCAPFQSPNYGSLKFDVYSPAQSTTSR 134
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCSSK--ECPYDIAYV-DGSGETGFWATDRMTIQEVNG 240
K+PC+S C + Q+ C SK CPY I Y+ D + +G D + + +
Sbjct: 135 KVPCSSNLCDL---------QNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSA 185
Query: 241 NGYFARYPFLLGCTDNNTGDQNGAS---GIMGL---DRGPVSIISKTNISYFFYCLHSPY 294
P + GC TG G++ G++GL + S+++ ++ + +
Sbjct: 186 QSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGD 245
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLST 354
G I FG + ++ K TP+ +Q+ +Y+IT+TGI+VG + S T+ S
Sbjct: 246 DGHGRINFGDTGSSDQ---KETPL-NVYKQNPYYNITITGITVGSK------SISTEFSA 295
Query: 355 EIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
+DSGT T P+Y+ + S+F +++ + F+ CY +SA +V P +++
Sbjct: 296 IVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSA-NGIVHPNVSLT 354
Query: 415 FLGGVDLEL-DVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
GG + D T+ + V A++ S+ L+G G +V +D LG
Sbjct: 355 AKGGSIFPVNDPIITITDNAFNPVGYCLAIMKSE-GVNLIGENFMSGLKVVFDRERMVLG 413
Query: 474 FGPGNC 479
+ NC
Sbjct: 414 WKNFNC 419
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 108/387 (27%), Positives = 161/387 (41%), Gaps = 47/387 (12%)
Query: 120 FPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFF 174
P TG+ YY + IG P + + +DTGS I W C C C +
Sbjct: 78 LPTATGL-----YYTQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQY 132
Query: 175 DPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKC---SSKECPYDIAYVDGSGETGFWATD 231
DP+ S T + C+ C PNG +S C + IAY DGS TGF+ +D
Sbjct: 133 DPAGSGT--TVGCDQEFCVA----NSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSD 186
Query: 232 RMTIQEVNGNGYF--ARYPFLLGCTDNNTGDQNGAS----GIMGLDRGPVSIISKTNIS- 284
+ +V+GNG + GC GD +S GI+G + S++S+ +
Sbjct: 187 SVQYNQVSGNGQTTPSNASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAAR 246
Query: 285 ----YFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE 340
F +CL + +G F + V K VK TP+V + Y++ L GISVGG
Sbjct: 247 KVRKIFAHCLDTVHGGG---IFAIGNVVQPK-VKTTPLV---QNVTHYNVNLQGISVGGA 299
Query: 341 RLPLKASYF---TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTC 397
L L +S F T IDSGT + P VY L +A + + + +D C
Sbjct: 300 TLQLPSSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLTAVFDKYQDLAL-HNYQDF--VC 356
Query: 398 YDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGF----ALLPSDPNSILL 453
+ S P +T F G + L + L C+GF + +LL
Sbjct: 357 FQFSGSIDDGFPVVTFSFEGEITLNVYPHDYLFQNENDLYCMGFLDGGVQTKDGKDMVLL 416
Query: 454 GNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G++ V YD+ + +G+ NC+
Sbjct: 417 GDLVLSNKLVVYDLEKQVIGWADYNCS 443
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 98/340 (28%), Positives = 144/340 (42%), Gaps = 40/340 (11%)
Query: 119 TFPAKTGIVAADEY------YIV-VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD 171
T PA G VA Y Y+ IG P Q VS ++D + WTQC PC C +Q
Sbjct: 37 TPPAAGGAVAVPIYLSSQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDL 96
Query: 172 PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWA-T 230
P FDP+KS TF +PC S C+ + P C+S C Y+ +G+TG A T
Sbjct: 97 PLFDPTKSSTFRGLPCGSHLCESI-----PESSRNCTSDVCIYEAP--TKAGDTGGKAGT 149
Query: 231 DRMTIQEVNGNGYFARYPFLLGC---TDNNTGDQNGASGIMGLDRGPVSIISKTNISYFF 287
D I A+ GC TD G SGI+GL R P S++++ N++ F
Sbjct: 150 DTFAIGA-------AKETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFS 202
Query: 288 YCLHSP------YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGER 341
YCL G+T G ++ +K + + + +Y + L GI GG
Sbjct: 203 YCLAGKSSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGA- 261
Query: 342 LPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS 401
PL+A+ + + +D+ + + Y AL+ A + + + YDL
Sbjct: 262 -PLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKP-----YDLC 315
Query: 402 AYKTVV--VPKITIHFLGGVDLELDVRGTLVVESVRQVCL 439
K V P++ F GG L + L+ VCL
Sbjct: 316 FPKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGTVCL 355
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 159/385 (41%), Gaps = 43/385 (11%)
Query: 120 FPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFF 174
P TG+ YY + IG P + + +DTGS I W C C C ++ D +
Sbjct: 76 LPTDTGL-----YYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRLY 130
Query: 175 DPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVDGSGETGFWATDRM 233
DP S + S + C+ C P C+ C Y + Y DGS TG++ +D +
Sbjct: 131 DPKGSSSGSTVSCDQKFCAATYGGKLPG----CAKNIPCEYSVMYGDGSSTTGYFVSDSL 186
Query: 234 TIQEVNGNGY--FARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS--- 284
+V+G+G A + GC GD GI+G + S++S+ +
Sbjct: 187 QYNQVSGDGQTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEV 246
Query: 285 --YFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL 342
F +CL + G F D V K VK TP+V P+ Y++ L I+VGG L
Sbjct: 247 KKIFSHCLDTIKGGG---IFAIGDVVQPK-VKSTPLV--PDMPH-YNVNLESINVGGTTL 299
Query: 343 PLKASYF---TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYD 399
L + F K T IDSGT +T P VY + +A + ++D Y
Sbjct: 300 QLPSHMFETGEKKGTIIDSGTTLTYLPELVYKDVLAAVFAKHPDTTF-HSVQDFLCIQYF 358
Query: 400 LSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGF---ALLPSD-PNSILLGN 455
S PKIT HF + L + C GF L D + +LLG+
Sbjct: 359 QSVDDG--FPKITFHFEDDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQSKDGKDMVLLGD 416
Query: 456 VQQRGYEVHYDVAGRRLGFGPGNCN 480
+ V YD+ + +G+ NC+
Sbjct: 417 LVLSNKVVVYDLENQVVGWTDYNCS 441
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 104/385 (27%), Positives = 172/385 (44%), Gaps = 55/385 (14%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHC---SQQRDP--FFDPSKSKTFSKIP 186
Y+ V +G P + + +DTGS + W C C C S + P FFDP S T + +
Sbjct: 84 YFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAALVS 143
Query: 187 CNSTTCKILLEWFPPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEV---NG- 240
C+ C ++ + CSS+ +C Y Y DGSG +G++ D M + + +G
Sbjct: 144 CSDQRCTAGIQ----SSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGE 199
Query: 241 -----NGYFARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YF 286
Y + F+ C+ TGD GI G + +S+IS+ F
Sbjct: 200 LSQICQTYDSSVSFM--CSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVF 257
Query: 287 FYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKA 346
+CL G + G+ + + + YTP+V P Q Y++ L ISV G+ L +
Sbjct: 258 SHCLKGDDSGGGVLVLGE---IVEPNIVYTPLV--PSQPH-YNLYLQSISVAGQTLAIDP 311
Query: 347 SYFTKLSTE---IDSGTIITRFPA----PVYSALRSAFRKRMKKYKMGKGIEDLFDTCYD 399
S F S + +DSGT + P SA+ S + Y + KG + CY
Sbjct: 312 SVFGASSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTY-LSKG-----NQCYL 365
Query: 400 LSAYKTVVVPKITIHFLGGVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGN 455
+++ V P+++++F GG L L+ + L+ V C+GF P +I LG+
Sbjct: 366 VTSSVNDVFPQVSLNFAGGASLILNPQDYLLQQNSVGGAAVWCVGFQKTPGQQITI-LGD 424
Query: 456 VQQRGYEVHYDVAGRRLGFGPGNCN 480
+ + YD+A +R+G+ +C+
Sbjct: 425 LVLKDKIFVYDIANQRVGWTNYDCS 449
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 112/418 (26%), Positives = 178/418 (42%), Gaps = 60/418 (14%)
Query: 91 RDQQRLHLKNSR--------RLQKAIPDNFKKTKAFTFPAKTG-IVAADEYYIVVAIGKP 141
+D+ L +++S R++ ++ N + KA P+ TG + A+ ++IG+P
Sbjct: 57 KDRMELDIQHSAARFAYIQARIEGSLVSN-NEYKARVSPSLTGRTIMAN-----ISIGQP 110
Query: 142 KQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFS---KIPCNSTTCKILLEW 198
+++DTGS I W C PC +C FDPS S TFS K PC+ C
Sbjct: 111 PIPQLVVMDTGSDILWVMCTPCTNCDNHLGLLFDPSMSSTFSPLCKTPCDFKGCS----- 165
Query: 199 FPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNN 257
+C P+ + Y D S +G + D + + + +R P L GC N
Sbjct: 166 -------RC--DPIPFTVTYADNSTASGMFGRDTVVFETTDEGT--SRIPDVLFGCGHNI 214
Query: 258 TGDQN-GASGIMGLDRGPVSIISKTNISYFFYC---LHSPYGSTGYITFGKPDTVNKKFV 313
D + G +GI+GL+ GP S+ +K F YC L PY + + G+ +
Sbjct: 215 GQDTDPGHNGILGLNNGPDSLATKIG-QKFSYCIGDLADPYYNYHQLILGEGADLEGYST 273
Query: 314 KYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRFPAP 368
+ + FY++T+ GISVG +RL + F ID+G+ IT
Sbjct: 274 PFE------VHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTITFLVDS 327
Query: 369 VYSALRSAFRKRMKKYKMGKGIEDL-FDTCYDLSAYKTVV-VPKITIHFLGGVDLELDVR 426
V+ L R + IE + C+ S + +V P +T HF G DL LD
Sbjct: 328 VHRLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFHFADGADLALDSG 387
Query: 427 GTLVVESVRQVCLGFAL-----LPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+ C+ L S P+ L+G + Q+ Y V YD+ + + F +C
Sbjct: 388 SFFNQLNDNVFCMTVGPVSSLNLKSKPS--LIGLLAQQSYSVGYDLVNQFVYFQRIDC 443
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 151/364 (41%), Gaps = 37/364 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + IG P Q +L++DTGS +T+ C C C + +DP F P S T+ + C +
Sbjct: 112 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC-TID 170
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
C +C Y+ Y + S +G D ++ A +
Sbjct: 171 CNC-----------DGDRMQCVYERQYAEMSTSSGVLGEDVISF---GNQSELAPQRAVF 216
Query: 252 GCTDNNTGD--QNGASGIMGLDRGPVSI----ISKTNIS-YFFYCLHSPYGSTGYITFGK 304
GC + TGD A GIMGL RG +SI + K IS F C G + G
Sbjct: 217 GCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMVLGG 276
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGTIIT 363
+ Y + P++S +Y+I L + V G+RLPL A+ F K T +DSGT
Sbjct: 277 ISPPSDMTFAY----SDPDRSPYYNIDLKEMHVAGKRLPLNANVFDGKHGTVLDSGTTYA 332
Query: 364 RFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCY-----DLSAYKTVVVPKITIHFLG 417
P + A + A K ++ K G + + D C+ D+S P + + F
Sbjct: 333 YLPEAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSK-SFPVVDMVFGN 391
Query: 418 GVDLELDVRGTLVVES-VRQV-CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
G L + S VR CLG +D + LLG + R V YD ++GF
Sbjct: 392 GHKYSLSPENYMFRHSKVRGAYCLGIFQNGND-QTTLLGGIIVRNTLVMYDREQTKIGFW 450
Query: 476 PGNC 479
NC
Sbjct: 451 KTNC 454
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 106/370 (28%), Positives = 154/370 (41%), Gaps = 47/370 (12%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCN-ST 190
Y + IG P Q +L++D+GS +T+ C C C +DP F P S ++S + CN
Sbjct: 88 YTTRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDLSSSYSPVKCNVDC 147
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
TC DK K+C Y+ Y + S +G D I +
Sbjct: 148 TCD----------SDK---KQCTYERQYAEMSSSSGVLGED---IVSFGRESELKPQHAI 191
Query: 251 LGCTDNNTGD--QNGASGIMGLDRGPVSI----ISKTNIS-YFFYCLHSPYGSTGYITFG 303
GC ++ TGD A GIMGL RG +SI + K IS F C G + G
Sbjct: 192 FGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLG 251
Query: 304 ----KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-TKLSTEIDS 358
PD + + P +S +Y+I L I V G+ L +++ F +K T +DS
Sbjct: 252 GMLAPPDMIFSN--------SDPLRSPYYNIELKEIHVAGKALRVESRIFNSKHGTVLDS 303
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYKTV-----VVPKIT 412
GT P + A + A ++ K +G + + D C+ A + V V P +
Sbjct: 304 GTTYAYLPEQAFVAFKEAVTSKVHSLKKIRGPDPSYKDICF-AGAGRNVSKLHEVFPDVD 362
Query: 413 IHFLGGVDLELDVRGTLVVESVRQ--VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGR 470
+ F G L L L S CLG DP + LLG + R V YD
Sbjct: 363 MVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTT-LLGGIIVRNTLVTYDRHNE 421
Query: 471 RLGFGPGNCN 480
++GF NC+
Sbjct: 422 KIGFWKTNCS 431
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 100/376 (26%), Positives = 166/376 (44%), Gaps = 35/376 (9%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQR-DPFFDPSKSKTFSKIPCNS 189
+Y++ +G P Q L+ DTGS +TW +C + F + S++++ I C+S
Sbjct: 111 QYFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRAAASRSWAPIACSS 170
Query: 190 TTCKILLEWFPPNGQDKCSS--KECPYDIAYVDGSGETGFWATDRMTI-----QEVNGNG 242
TC + P CSS C YD Y DGS G TD TI + +G G
Sbjct: 171 DTCTS----YVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESRDGGG 226
Query: 243 YFARYP-FLLGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISY---FFYCL--H-SPY 294
A+ +LGCT + G + G++ L +S S+ + F YCL H +P
Sbjct: 227 RRAKLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPR 286
Query: 295 GSTGYITFGKPD--------TVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKA 346
+T Y+TFG P + + TP++ S FY + + + V GE L + A
Sbjct: 287 NATSYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAGEALDIPA 346
Query: 347 SYFTKL---STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAY 403
+ +DSGT +T P Y A+ +A +R+ + + D F+ CY+ +A
Sbjct: 347 DVWDVARGGGAILDSGTSLTVLATPAYRAVVAALSERLA--GLPRVSMDPFEYCYNWTA- 403
Query: 404 KTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEV 463
+ +P + + F G L+ + +V + C+G + P ++GN+ Q+ +
Sbjct: 404 AALEIPGLEVRFAGSARLQPPAKSYVVDAAPGVKCIGVQ-EGAWPGVSVIGNILQQDHLW 462
Query: 464 HYDVAGRRLGFGPGNC 479
+D+ R L F C
Sbjct: 463 EFDLRDRWLRFKHTRC 478
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 113/413 (27%), Positives = 183/413 (44%), Gaps = 37/413 (8%)
Query: 91 RDQQRLHLKNSRR-LQKAIPDNFKKTK---AFTFPAKTGIVAADEYYIVVAIGKPKQYVS 146
+ ++L L S+ LQ + N ++ + +FP K YY + +G P Q +
Sbjct: 38 KQNEKLGLGMSKHHLQHLVEHNDRRGRFLQGISFPLKGNYSDLGLYYTEIGLGNPVQKLK 97
Query: 147 LLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDK 206
+++DTGS I W +C PC C ++D P S ++ ++++ + Q
Sbjct: 98 VIVDTGSDILWVKCSPCRSCLSKQDII--PPLS-IYNLSASSTSSVSSCSDPLCTGEQAV 154
Query: 207 C----SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQN 262
C S+ C Y I+Y D S G + D M GN + F GC N TG
Sbjct: 155 CSRSGSNSACAYGISYQDKSTSIGAYVKDDMHYVLQGGNATTSHIFF--GCAINITGSWP 212
Query: 263 GASGIMGLDR----GPVSIISKTNISYFF-YCLHSPYGSTGYITFG-KPDTVNKKFVKYT 316
A GIMG + P I ++ N+S F +CL G + FG +P+T F T
Sbjct: 213 -ADGIMGFGQISKTVPNQIATQRNMSRVFSHCLGGEKHGGGILEFGEEPNTTEMVF---T 268
Query: 317 PIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEI-DSGTII---TRFPAPVYSA 372
P++ Y++ L ISV + LP+ + F+ +S ++G II T F A
Sbjct: 269 PLLNVTTH---YNVDLLSISVNSKVLPIDSKEFSYVSNSTNETGVIIDSGTSFALLATKA 325
Query: 373 LRSAFR--KRMKKYKMGKGIEDLFDTCYDLSAYKTVVV--PKITIHFLGGVDLELDVRGT 428
R F K + K+G +E L C+ L + TV P +T+ F GG ++L
Sbjct: 326 NRILFSEIKNLTTAKLGPKLEGL--QCFYLKSGLTVETSFPNVTLTFSGGSTMKLKPDNY 383
Query: 429 LVVESVRQVCLGFALLPSDPNSI-LLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
LV+ +++ G+ S + + + G + + V YDV RR+G+ NC+
Sbjct: 384 LVMVELKKKRNGYCYAWSSADGLTIFGEIVLKDKLVFYDVENRRIGWKGQNCS 436
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 118/461 (25%), Positives = 174/461 (37%), Gaps = 82/461 (17%)
Query: 84 SLEEILRRDQQRLHLKNSRRLQKA--IPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKP 141
++EE +RR +R H RRL A + KT +Y IG P
Sbjct: 37 TMEERVRRATERTH---HRRLLHASTAAAAGGVAAPLRWSGKT------QYIASYGIGDP 87
Query: 142 KQYVSLLLDTGSGITWTQCKPC----------IHCSQQRDPFFDPSKSKTFSKIPCN--- 188
Q ++DTGS + WTQC C C Q P+++ S S+T +PC+
Sbjct: 88 PQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDD 147
Query: 189 STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
C + E C +Y G G TD T +
Sbjct: 148 GALCGVAPETAGCARGGGSGDDACVVAASYGAGVA-LGVLGTDAFTFPSSS------SVT 200
Query: 249 FLLGCTDN---NTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY----GSTGYIT 301
GC + G NGASGI+GL RG +S++S+ N + F YCL +PY S ++
Sbjct: 201 LAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCL-TPYFRDTVSPSHLF 259
Query: 302 FGKPDTVNKKF-----------VKYTPIVTTPEQ---SEFYHITLTGISVGGERLPLKAS 347
G + V P P+ S FY++ L G++ G + L A
Sbjct: 260 VGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGNATVALPAG 319
Query: 348 YFTKLSTE---------IDSGTIITRFPAPVYSALRSAFRKRMK--------KYKMGKGI 390
F IDSG+ TR P + AL ++++ K+G +
Sbjct: 320 AFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGAL 379
Query: 391 EDLFDTCYDLSAYKTVVVPKITIHFLGGV--DLEL---------DVRGTLVVESVRQVCL 439
E + D + VP + + F GV EL V + +V
Sbjct: 380 ELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEASTWCMAVVSSAS 439
Query: 440 GFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G A LP++ +I +GN Q+ V YD+A L F P NC+
Sbjct: 440 GNATLPTNETTI-IGNFMQQDMRVLYDLANGLLSFQPANCS 479
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 107/392 (27%), Positives = 163/392 (41%), Gaps = 56/392 (14%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCS--------QQRDPFFDPSKSKTFS 183
Y + ++ G P Q + + DTGS + W C CS P F P S +
Sbjct: 90 YSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSK 149
Query: 184 KIPCNSTTCKILLEWFPPNGQDK--------CSSKECPYDIAYVDGSGETGFWATDRMTI 235
I C S C+ L + PN Q + C+ PY + Y GS G T+++
Sbjct: 150 IIGCQSPKCQFL---YGPNVQCRGCDPNTRNCTVGCPPYILQYGLGS-TAGVLITEKLDF 205
Query: 236 QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHS-PY 294
++ F++GC+ +T +GI G RGPVS+ S+ N+ F +CL S +
Sbjct: 206 PDLTVPD------FVVGCSIIST---RQPAGIAGFGRGPVSLPSQMNLKRFSHCLVSRRF 256
Query: 295 GSTGYITFGKPDT-------VNKKFVKYTPIVTTPEQS-----EFYHITLTGISVGGERL 342
T T DT + YTP P S E+Y++ L I VG + +
Sbjct: 257 DDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHV 316
Query: 343 PLKASYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--FD 395
+ Y + + +DSG+ T PV+ + F +M Y K +E
Sbjct: 317 KIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLG 376
Query: 396 TCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFA----LLPSDPN- 449
C+++S V VP++ F GG LEL + V + VCL + PS
Sbjct: 377 PCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGGTG 436
Query: 450 -SILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+I+LG+ QQ+ Y V YD+ R GF C+
Sbjct: 437 PAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 115/430 (26%), Positives = 179/430 (41%), Gaps = 58/430 (13%)
Query: 80 RNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIG 139
R SL I D R R+ A+ N P TG+ Y+ + +G
Sbjct: 30 RRQASLTGIKAHDSSR-----RGRILSAVDFNLGGNG---LPTVTGL-----YFTKIGLG 76
Query: 140 KPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKI 194
P + + +DTGS I W C C C ++ D +DP +SKT + C C
Sbjct: 77 SPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSS 136
Query: 195 LLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA--RYPFLL 251
E C ++ CPY I+Y DGS TG++ D +T VNGN + A +
Sbjct: 137 TYEGRILG----CKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQNSSIIF 192
Query: 252 GCTDNNTG-----DQNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPYGSTGYIT 301
GC +G + GI+G + S++S+ S F +CL + G G +
Sbjct: 193 GCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGG-GIFS 251
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS---TEIDS 358
G+ V + VK TP+V P + Y++ L I V G+ L L + F + T IDS
Sbjct: 252 IGE---VVEPKVKTTPLV--PNMAH-YNVILKNIEVDGDILQLPSDTFDSENGKGTVIDS 305
Query: 359 GTIITRFPAPVYSALRS---AFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHF 415
GT + P VY L S A + R+K Y +E+ + +C+ + P + +HF
Sbjct: 306 GTTLAYLPRIVYDQLMSKVLAKQPRLKVYL----VEEQY-SCFQYTGNVDSGFPIVKLHF 360
Query: 416 LGGVDLELDVRGTLV-VESVRQVCLGFALLPSD----PNSILLGNVQQRGYEVHYDVAGR 470
+ L + L + C+G+ S+ + LLG+ V YD+
Sbjct: 361 EDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLENM 420
Query: 471 RLGFGPGNCN 480
+G+ NC+
Sbjct: 421 TIGWTDYNCS 430
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 99/367 (26%), Positives = 147/367 (40%), Gaps = 41/367 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + IG P Q +L++DTGS +T+ C C C + +DP FDP S T+ I CN
Sbjct: 83 YTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCNIDC 142
Query: 192 CKILLEWFPPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C S +C Y+ Y + S +G D ++
Sbjct: 143 I--------------CDSDGVQCVYERQYAEMSTSSGVLGEDVISF---GNQSELIPQRA 185
Query: 250 LLGCTDNNTGD--QNGASGIMGLDRGPVSIISK------TNISYFFYCLHSPYGSTGYIT 301
+ GC + TGD A GIMGL G +S++ + N S F C G +
Sbjct: 186 VFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDS-FSLCYGGMDIGGGAMV 244
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGT 360
G + Y + P +S +Y++ L I V G++LPL + F + +DSGT
Sbjct: 245 LGGISPPSDMIFTY----SDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGT 300
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYKTVVV----PKITIHF 415
PA +SA + A + K G + F D C+ + + P + + F
Sbjct: 301 TYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVF 360
Query: 416 LGGVDLELDVRGTLVVESVRQ--VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
G L L S CLG +D + LLG + R V YD A ++G
Sbjct: 361 ENGQKLSLTPENYFFRHSKVHGAYCLGIFENGND-QTTLLGGIVVRNTLVMYDRANSKIG 419
Query: 474 FGPGNCN 480
F NC+
Sbjct: 420 FWKTNCS 426
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 99/367 (26%), Positives = 147/367 (40%), Gaps = 41/367 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + IG P Q +L++DTGS +T+ C C C + +DP FDP S T+ I CN
Sbjct: 83 YTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCNIDC 142
Query: 192 CKILLEWFPPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
C S +C Y+ Y + S +G D ++
Sbjct: 143 I--------------CDSDGVQCVYERQYAEMSTSSGVLGEDVISF---GNQSELIPQRA 185
Query: 250 LLGCTDNNTGD--QNGASGIMGLDRGPVSIISK------TNISYFFYCLHSPYGSTGYIT 301
+ GC + TGD A GIMGL G +S++ + N S F C G +
Sbjct: 186 VFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDS-FSLCYGGMDIGGGAMV 244
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGT 360
G + Y + P +S +Y++ L I V G++LPL + F + +DSGT
Sbjct: 245 LGGISPPSDMIFTY----SDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGT 300
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYKTVVV----PKITIHF 415
PA +SA + A + K G + F D C+ + + P + + F
Sbjct: 301 TYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVF 360
Query: 416 LGGVDLELDVRGTLVVESVRQ--VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
G L L S CLG +D + LLG + R V YD A ++G
Sbjct: 361 ENGQKLSLTPENYFFRHSKVHGAYCLGIFENGND-QTTLLGGIVVRNTLVMYDRANSKIG 419
Query: 474 FGPGNCN 480
F NC+
Sbjct: 420 FWKTNCS 426
>gi|297740190|emb|CBI30372.3| unnamed protein product [Vitis vinifera]
Length = 445
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 96/336 (28%), Positives = 148/336 (44%), Gaps = 41/336 (12%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + ++ G P Q +S ++DTGS + W C C++ P DP+K TF IP S++
Sbjct: 106 YSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTF--IPKLSSS 163
Query: 192 CKILLEWFPPNG------QDKCSSKECP-YDIAYVDGSGETGFWATDRMTIQEVNGNGYF 244
KI+ P G +K CP Y I Y G+ + + +
Sbjct: 164 AKIVGCLNPKCGFVMDSENSANCTKACPTYAIQYGLGTTVGLLLLESLVFAERTEPD--- 220
Query: 245 ARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL------HSPYGSTG 298
F++GC+ SGI G RGP S+ + + F YCL SP S
Sbjct: 221 ----FVVGCS---ILSSRQPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKM 273
Query: 299 YITFGKPDTVNKKF--VKYTPIVTTPEQS-----EFYHITLTGISVGGERLPLKASYFTK 351
+ G PD+ + K + YTP P S E+Y++TL I VG +R+ + S+
Sbjct: 274 TLYVG-PDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVA 332
Query: 352 LS-----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--FDTCYDLSAYK 404
S T +DSG+ T PV+ A+ + F ++M Y +E L C++LS
Sbjct: 333 GSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPCFNLSGVG 392
Query: 405 TVVVPKITIHFLGGVDLELDVRGTL-VVESVRQVCL 439
+V +P + F GG +EL V +V + +CL
Sbjct: 393 SVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCL 428
>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
sativa Japonica Group]
Length = 732
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 97/366 (26%), Positives = 167/366 (45%), Gaps = 40/366 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP--------FFDPSKSKTFS 183
+Y VVA+G P + LDTGS + W C C+ C+ + P + P++S T
Sbjct: 99 HYAVVALGTPNVTFLVALDTGSDLFWVPCD-CLKCAPFQSPNYGSLKFDVYSPAQSTTSR 157
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCSSK--ECPYDIAYV-DGSGETGFWATDRMTIQEVNG 240
K+PC+S C + Q+ C SK CPY I Y+ D + +G D + + +
Sbjct: 158 KVPCSSNLCDL---------QNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSA 208
Query: 241 NGYFARYPFLLGCTDNNTGDQNGAS---GIMGL---DRGPVSIISKTNISYFFYCLHSPY 294
P + GC TG G++ G++GL + S+++ ++ + +
Sbjct: 209 QSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGD 268
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLST 354
G I FG + ++ K TP+ +Q+ +Y+IT+TGI+VG + S T+ S
Sbjct: 269 DGHGRINFGDTGSSDQ---KETPL-NVYKQNPYYNITITGITVGSK------SISTEFSA 318
Query: 355 EIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
+DSGT T P+Y+ + S+F +++ + F+ CY +SA +V P +++
Sbjct: 319 IVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSA-NGIVHPNVSLT 377
Query: 415 FLGGVDLEL-DVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLG 473
GG + D T+ + V A++ S+ L+G G +V +D LG
Sbjct: 378 AKGGSIFPVNDPIITITDNAFNPVGYCLAIMKSE-GVNLIGENFMSGLKVVFDRERMVLG 436
Query: 474 FGPGNC 479
+ NC
Sbjct: 437 WKNFNC 442
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 113/391 (28%), Positives = 177/391 (45%), Gaps = 57/391 (14%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y I ++ G P Q + L++DTGS + W PC H R+ F S + IP +S++
Sbjct: 90 YSIPLSFGTPPQTLPLIMDTGSDLVWF---PCTHRYVCRNCSFSTSNPSSNIFIPKSSSS 146
Query: 192 CKILLEWFPPNG-------QDKC-----SSKEC-----PYDIAYVDGSGETGFWATDRMT 234
K+L P G Q +C +S C PY + Y GSG TG
Sbjct: 147 SKVLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFY--GSGITGGIMLSETL 204
Query: 235 IQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHS-- 292
++ G G F++GC+ +T +GI G RGP S+ S+ + F YCL S
Sbjct: 205 --DLPGKGVPN---FIVGCSVLSTSQ---PAGISGFGRGPPSLPSQLGLKKFSYCLLSRR 256
Query: 293 ---PYGSTGYITFGKPDTVNKKF-VKYTPIVTTPEQ------SEFYHITLTGISVGGERL 342
S+ + G+ D+ K + YTP V P+ S +Y++ L I+VGG+ +
Sbjct: 257 YDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHV 316
Query: 343 PLKASYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRM--KKYKMGKGIEDLFD 395
+ Y + T IDSGT T ++ + + F K++ K+ +GI L
Sbjct: 317 KIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGL-R 375
Query: 396 TCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFALLP------SDP 448
C+++S T P++T+ F GG ++EL + + + VCL S
Sbjct: 376 PCFNISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGG 435
Query: 449 NSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
+I+LGN QQ+ + V YD+ RLGF +C
Sbjct: 436 PAIILGNFQQQNFYVEYDLRNERLGFRQQSC 466
>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 99/387 (25%), Positives = 163/387 (42%), Gaps = 52/387 (13%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCK---PCIHCS-----QQRDPFFDPSKSKTFS 183
+ I ++ G P Q +S L+DTGS + W C C +CS ++ P F+P S +
Sbjct: 87 HSIPLSFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFSDAEPKKVPIFNPKLSSSSK 146
Query: 184 KIPCNSTTC------KILLEWFPPNGQDKCSSKEC-PYDIAYVDGSGETGFWATDRMTIQ 236
+ C + C + L P NG K S C PY + Y G+ F ++
Sbjct: 147 ILGCRNPKCVNTSSPDVHLGCPPCNGNSKNCSHACPPYSLQYGTGASSGDFL------LE 200
Query: 237 EVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSP--- 293
+N G + FL+GCT + G+ A+ + G R S+ + + F YCL+S
Sbjct: 201 NLNFPGK-TIHEFLVGCTTSAVGEVTSAA-LAGFGRSMFSLPMQMGVKKFAYCLNSHDYD 258
Query: 294 ---YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSE-FYHITLTGISVGGERLPLKASYF 349
S + + +T K + Y P + P +Y++ + I +G + L + + Y
Sbjct: 259 DTRNSSKLILDYSDGET---KGLSYAPFLKNPPDFPIYYYLGVKDIKIGNKLLRIPSKYL 315
Query: 350 TKLST-----EIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDT--CYDLSA 402
S IDSG PV+ + + +KRM KY+ E CY+ +
Sbjct: 316 APGSDGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRMSKYRRSLEAEAEIGVTPCYNFTG 375
Query: 403 YKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPN----------SIL 452
K++ +P + F GG + + + V+ + ++ L L +D SI+
Sbjct: 376 QKSIKIPDLIYQFRGGATMVVPGKNYFVL--IPEISLACFPLTTDAGTNTLEFTPGPSII 433
Query: 453 LGNVQQRGYEVHYDVAGRRLGFGPGNC 479
LGN Q Y V +D+ RLGF C
Sbjct: 434 LGNSQHVDYYVEFDLKNERLGFRQQTC 460
>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 103/374 (27%), Positives = 162/374 (43%), Gaps = 41/374 (10%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP------- 186
+ + IG P Q L+LDTGS ++W QC ++ P P + +
Sbjct: 68 VSLPIGTPPQPTDLVLDTGSQLSWIQCHD-KKIKKRLPPLPKPKTTSFDPSLSSSFSLLP 126
Query: 187 CNSTTCKILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
CN CK + F P D+ ++ C Y Y DG+ G ++ T + +
Sbjct: 127 CNHPICKPRIPDFTLPTSCDQ--NRLCHYSYFYADGTLAEGNLVREKFTFSKS-----LS 179
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKP 305
P +LGC +T ++ GI+G++RG +S IS+ IS F YC+ S GS F
Sbjct: 180 TPPVILGCAQASTENR----GILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLG 235
Query: 306 DTVNKKFVKYTPIVTTPEQSE-------FYHITLTGISVGGERLPLKASYFTKLS----- 353
D N KY ++T PE Y + + I + G+RL + + F +
Sbjct: 236 DNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNVPPAAFKPDAGGSGQ 295
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKG--IEDLFDTCYDLSAYKTV--VVP 409
T IDSG+ +T Y ++ R+ M KG D+ D C+D V +
Sbjct: 296 TMIDSGSDLTYLVDEAYEKVKEEV-VRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIG 354
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQ--VCLGFALLPS-DPNSILLGNVQQRGYEVHYD 466
I+ F GV++ + RG V+ V + C+G S ++G V Q+ V YD
Sbjct: 355 GISFEFDNGVEIFVG-RGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYD 413
Query: 467 VAGRRLGFGPGNCN 480
+A +R+GFG C+
Sbjct: 414 LANKRVGFGGAECS 427
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 99/390 (25%), Positives = 161/390 (41%), Gaps = 50/390 (12%)
Query: 120 FPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCK-PCIHCSQQRDPFFDPSK 178
P K + +YY + +G P + L +DTGS +TW QC PC +C++ P + P+K
Sbjct: 182 LPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAK 241
Query: 179 SKTFSKIPCNSTTCKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQE 237
K +P C+ L Q+ C++ K+C Y+I Y D S G A D M +
Sbjct: 242 EKI---VPPRDLLCQEL-----QGDQNYCATCKQCDYEIEYADRSSSMGVLAKDDMHMIA 293
Query: 238 VNGNGYFARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISK-------TNISYF 286
NG + F+ GC + G GI+GL +S+ S+ +N+ F
Sbjct: 294 TNGGR--EKLDFVFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNV--F 349
Query: 287 FYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKA 346
+C+ GY+ G D V + + + PI P+ YH ++ G ++L +
Sbjct: 350 GHCITKEPNGGGYMFLGD-DYVPRWGMTWAPIRGGPD--NLYHTEAQKVNYGDQQLRMHG 406
Query: 347 SYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL--SAYK 404
+ + DSG+ T P +Y L +A KY ++D DT L A
Sbjct: 407 QAGSSIQVIFDSGSSYTYLPDEIYKKLVTAI-----KYDYPSFVQDTSDTTLPLCWKADF 461
Query: 405 TVVVPKITIHFLGGVDLELDVR-------------GTLVVESVRQVCLGF--ALLPSDPN 449
V + F ++L R L++ VCLG +
Sbjct: 462 DVRYLEDVKQFFKPLNLHFGNRWFVIPRTFTILPDDYLIISDKGNVCLGLLNGAEIDHAS 521
Query: 450 SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++++G+V RG V YD R++G+ C
Sbjct: 522 TLIVGDVSLRGKLVVYDNERRQIGWADSEC 551
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 103/423 (24%), Positives = 183/423 (43%), Gaps = 60/423 (14%)
Query: 92 DQQRLHLKNSRRLQKAIPDNF-------KKTKAFTFPAKTGIVAADE---YYIVVAIGKP 141
DQ S+R Q + + F K+ K+ A++ ++ + + + ++IG P
Sbjct: 54 DQNETVEDRSKREQTSSIERFDFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSP 113
Query: 142 KQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPP 201
+++DTGS + W QC PCI+C QQ +FDP KS +F + C FP
Sbjct: 114 PVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCG----------FPG 163
Query: 202 ----NGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNN 257
NG + Y + Y+ G G A + + + ++ G + GC N
Sbjct: 164 YNYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLD-EGKIKKSNITFGCGHMN 222
Query: 258 --TGDQNGASGIMGLDRGPVSIISKTNISYFFYC---LHSPYGSTGYITFGKPDTVNKKF 312
T + + +G+ GL P ++ + F YC +++P + ++ G+ +
Sbjct: 223 IKTNNDDAYNGVFGLGAYPHITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGD- 281
Query: 313 VKYTPIVTTPEQSEF--YHITLTGISVGGERLPLKASYFTKLSTE------IDSGTIITR 364
+TP Q F Y++TL ISVG + L + + F K+S++ IDSG T+
Sbjct: 282 -------STPLQIHFGHYYVTLQSISVGSKTLKIDPNAF-KISSDGSGGVLIDSGMTYTK 333
Query: 365 FP----APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV-VPKITIHFLGGV 419
+Y + + +++ + E L C+ + +V P +T HF GG
Sbjct: 334 LANGGFELLYDEIVDLMKGLLERIPTQRKFEGL---CFKGVVSRDLVGFPAVTFHFAGGA 390
Query: 420 DLELDVRGTLVVESVRQVCLGFALLPSDP---NSILLGNVQQRGYEVHYDVAGRRLGFGP 476
DL L+ + CL A+LPS+ N ++G + Q+ Y V +D+ ++ F
Sbjct: 391 DLVLESGSLFRQHGGDRFCL--AILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRR 448
Query: 477 GNC 479
+C
Sbjct: 449 IDC 451
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 120/450 (26%), Positives = 182/450 (40%), Gaps = 69/450 (15%)
Query: 73 KLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEY 132
+L ++ + +E +RR +R H RRL + + + +Y
Sbjct: 36 ELTHVDAKQNCTTKERMRRATERTH----RRLASMAGGGGEASAPIHWNET-------QY 84
Query: 133 YIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH--CSQQRDPFFDPSKSKTFSKIPCNST 190
IG P Q + ++DTGS + WTQC C C Q F+DPS+S+T + CN T
Sbjct: 85 IAEYLIGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVACNDT 144
Query: 191 TCKILLEWFPPNGQDKCS--SKECPYDIAYVDGSGET-GFWATDRMTIQEVNGNGYFARY 247
C + + +C+ K C AY G+G GF T+ T +G
Sbjct: 145 ACLL-------GSETRCARDGKACAVLTAY--GAGAIGGFLGTEVFTFG--HGQSSENNV 193
Query: 248 PFLLGCTDNN---TGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY------GSTG 298
GC + G +GASGI+GL RG +S+ S+ + F YCL +PY ST
Sbjct: 194 SLAFGCITASRLTPGSLDGASGIIGLGRGKLSLPSQLGDNKFSYCL-TPYFSDAANTSTL 252
Query: 299 YITFGKPDTVNKKFVKYTPIVTTPEQ---SEFYHITLTGISVGGERLPLKASYFTKLS-- 353
++ + P + P+ FY++ LTGI+VG +L + A+ F
Sbjct: 253 FVGASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVA 312
Query: 354 ------TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKM--GKGIEDLFDTCYDLSAYKT 405
T IDSG+ T Y ALR +++ + G E L D C A
Sbjct: 313 PAKWGGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGL-DLCVGGVAPGD 371
Query: 406 V--VVPKITIHFLGGVDLELDVRGTLVVESV------RQVCLGFALLPSDPNSIL----- 452
+VP + +HF G DV + E+ C+ PNS L
Sbjct: 372 AGKLVPPLVLHFGSGGGGGGDV--VVPPENYWGPVDDSTACM-VVFSSGGPNSTLPLNET 428
Query: 453 --LGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+GN Q+ + YD+ L F P +C+
Sbjct: 429 TIIGNYMQQDMHLLYDLGQGVLSFQPADCS 458
>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
Length = 469
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 109/420 (25%), Positives = 172/420 (40%), Gaps = 32/420 (7%)
Query: 84 SLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGI-VAADEYYIVVAIGKPK 142
SL E R D +R S+ + AF P +G +Y++ +G P
Sbjct: 56 SLGERARDDARRHAYIRSQLASRRRRAADVGASAFAMPLSSGAYTGTGQYFVRFRVGTPA 115
Query: 143 QYVSLLLDTGSGITWTQCKPCIHCSQQRDPF--FDPSKSKTFSKIPCNSTTCKILLEWFP 200
Q L+ DTGS +TW +C+ P F S+S++++ + C+S TC +
Sbjct: 116 QPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACSSDTCTSYV---- 171
Query: 201 PNGQDKCSS--KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP---------F 249
P CSS C YD Y DGS G TD TI
Sbjct: 172 PFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGGRRAKLQGV 231
Query: 250 LLGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISY---FFYCL--H-SPYGSTGYITF 302
+LGCT G + G++ L +S S+ + F YCL H +P ++ Y+TF
Sbjct: 232 VLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNASSYLTF 291
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT---KLSTEIDSG 359
G TP+V S FY + + + V GE L + A + +DSG
Sbjct: 292 GPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPADVWDVGRGGGAILDSG 351
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
T +T P Y A+ +A R+ + + D F+ CY+ +A +PK+ + F G
Sbjct: 352 TSLTVLATPAYRAVVAALGGRLA--ALPRVAMDPFEYCYNWTA-GAPEIPKLEVSFAGSA 408
Query: 420 DLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
LE + ++ + C+G + P ++GN+ Q+ + +D+ R L F C
Sbjct: 409 RLEPPAKSYVIDAAPGVKCIGVQ-EGAWPGVSVIGNILQQEHLWEFDLRDRWLRFKHTRC 467
>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
gi|238008190|gb|ACR35130.1| unknown [Zea mays]
gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
Length = 269
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 77/274 (28%), Positives = 123/274 (44%), Gaps = 28/274 (10%)
Query: 225 TGFWATDRMTI---QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKT 281
TG AT+ T Q + N F GC G GASGIMG+ GP+S++ +
Sbjct: 4 TGVLATETFTFGAHQNFSANLTF-------GCGKLTNGTIAGASGIMGVSPGPLSVLKQL 56
Query: 282 NISYFFYCLH-------SPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTG 334
+I+ F YCL SP GK T K V+ P++ P + +Y++ + G
Sbjct: 57 SITKFSYCLTPFTDHKTSPVMFGAMADLGKYKTTGK--VQTIPLLKNPVEDIYYYVPMVG 114
Query: 335 ISVGGERLPLKASYFT-----KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKG 389
IS+G +RL + + T +DS T + P + L+ A + MK +
Sbjct: 115 ISIGSKRLDVPEAILALRPDGTGGTVLDSATTLAYLVEPAFKELKKAVMEGMKLPAANRS 174
Query: 390 IEDLFDTCYDLS---AYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPS 446
I+D + C++L + + V VP + +HF G ++ L S +CL P
Sbjct: 175 IDD-YPVCFELPRGMSMEGVQVPPLVLHFAGDAEMSLPRDSYFQEPSPGMMCLAVMQAPF 233
Query: 447 DPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ ++GNVQQ+ V YD+ R+ + P C+
Sbjct: 234 EGAPNVIGNVQQQNMHVLYDLGNRKFSYAPTKCD 267
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 169/382 (44%), Gaps = 48/382 (12%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP---FFDPSKSKTFSKIPC 187
EY + + +G P V + DTGS + W +CK + + P +F PS S T+ ++ C
Sbjct: 109 EYLMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVPSASSTYGRVGC 168
Query: 188 NSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGNG---- 242
++ C+ L + CS C Y +Y DGS +G +T+ T + +
Sbjct: 169 DTKACRAL------SSAASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADSSKTNS 222
Query: 243 --------------YFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY--- 285
A+ F GC+ TG A G++GL GPVS+ S+ +
Sbjct: 223 HGNNNNNSSSHGQVEIAKLDF--GCSTTTTGTFR-ADGLVGLGGGPVSLASQLGATTSLG 279
Query: 286 --FFYCLHSPYGST---GYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE 340
F YCL +PY +T + FG V++ TP++T E +Y I L I+V G
Sbjct: 280 RKFSYCL-APYANTNASSALNFGSRAVVSEPGAASTPLITG-EVETYYTIALDSINVAGT 337
Query: 341 RLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL 400
+ P A+ + +DSGT +T + + + L +R+K + + E + D CYD+
Sbjct: 338 KRPTTAA---QAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPR-AESPEKILDLCYDI 393
Query: 401 SAYK---TVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQ 457
S + + +P +T+ GG ++ L T VV +CL + +LGN+
Sbjct: 394 SGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLALVATSERQSVSILGNIA 453
Query: 458 QRGYEVHYDVAGRRLGFGPGNC 479
Q+ V YD+ + F +C
Sbjct: 454 QQNLHVGYDLEKGTVTFAAADC 475
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 105/409 (25%), Positives = 162/409 (39%), Gaps = 43/409 (10%)
Query: 96 LHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGI 155
L + + RR + + P TG+ Y+ + +G P + + +DTGS I
Sbjct: 53 LRVHDGRRHGRLLAAADLPLGGLGLPTDTGL-----YFTEIKLGTPPKRYYVQVDTGSDI 107
Query: 156 TWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSK 210
W C C C ++ F+DP S + S + C+ C P C++
Sbjct: 108 LWVNCISCEKCPRKSGLGLDLTFYDPKASSSGSTVSCDQGFCAATYGGKLPG----CTAN 163
Query: 211 -ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF--ARYPFLLGCTDNNTGD----QNG 263
C Y + Y DGS TGF+ TD + +V G+G GC GD
Sbjct: 164 VPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQTQPGNATVTFGCGAQQGGDLGSSNQA 223
Query: 264 ASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPI 318
GI+G + S++S+ + F +CL + G G G V + VK TP+
Sbjct: 224 LDGILGFGQANTSMLSQLAAAGKVKKIFAHCLDTIKGG-GIFAIGN---VVQPKVKTTPL 279
Query: 319 VTTPEQSEFYHITLTGISVGGERLPLKASYF---TKLSTEIDSGTIITRFPAPVYSALRS 375
V Y++ L I VGG L L A F + T IDSGT +T P V+ + +
Sbjct: 280 VA---DMPHYNVNLKSIDVGGTTLQLPAHVFETGERKGTIIDSGTTLTYLPELVFKEVMA 336
Query: 376 AFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVR 435
A + + ++D C+ P IT HF + L +
Sbjct: 337 AIFNKHQDIVF-HNVQDFM--CFQYPGSVDDGFPTITFHFEDDLALHVYPHEYFFPNGND 393
Query: 436 QVCLGF---ALLPSDPNSI-LLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
C+GF AL D I L+G++ V YD+ + +G+ NC+
Sbjct: 394 MYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVIYDLENQVIGWTDYNCS 442
>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
Length = 493
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 118/457 (25%), Positives = 183/457 (40%), Gaps = 96/457 (21%)
Query: 89 LRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLL 148
+ Q+ HL+N ++ + T +FT + P Q+VSL
Sbjct: 57 FQHQHQKRHLRNRHQVSLPLSPGSDYTLSFTLNSN-----------------PPQHVSLY 99
Query: 149 LDTGSGITWTQCKP--CIHCSQQRDPFFD----PSKSKTFSKIPCNSTTCKILLEWFPPN 202
LDTGS + W CKP CI C + + P S T + C S+ C P +
Sbjct: 100 LDTGSDLVWFPCKPFECILCEGKAENTTASTPPPRLSSTARSVHCKSSACSAAHSNLPTS 159
Query: 203 GQDKCSSKECPYD----------------IAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
D C+ +CP + AY DGS + +I+ +
Sbjct: 160 --DLCAIADCPLESIETSDCHSFSCPSFYYAYGDGSLVARLYHD---SIKLPLATPSLSL 214
Query: 247 YPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNI------SYFFYCLHSPYGSTGYI 300
+ F GC + G+ G RG +S+ ++ + F YCL S ++ +
Sbjct: 215 HNFTFGCAHTALAE---PVGVAGFGRGVLSLPAQLASFAPQLGNRFSYCLVSHSFNSDRL 271
Query: 301 TFGKP----------DTVNKKFVK--YTPIVTTPEQSEFYHITLTGISVGGERLPLKASY 348
P VNK V+ YT ++ P+ FY + L GIS+G +++P +
Sbjct: 272 RLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGKKKIP-APEF 330
Query: 349 FTKLSTE------IDSGTIITRFPAPVYSALRSAFRKRMKK-YKMGKGIEDL--FDTCYD 399
++ E +DSGT T PA +Y+++ + F R+ + Y+ K +ED CY
Sbjct: 331 LKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVEDKTGLGPCY- 389
Query: 400 LSAYKTVV-VPKITIHFLGGVDLELDVR----------GTLVVESVRQVCLGF------A 442
Y TVV +P + +HF+G + + G V R CL A
Sbjct: 390 --YYDTVVNIPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCLMLMNGGEEA 447
Query: 443 LLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L P + LGN QQ G+EV YD+ RR+GF C
Sbjct: 448 ELTGGPGAT-LGNYQQHGFEVVYDLEQRRVGFARRKC 483
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 116/426 (27%), Positives = 173/426 (40%), Gaps = 52/426 (12%)
Query: 81 NTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGK 140
N P EE L L + RRL A+ P TG+ Y+ + IG
Sbjct: 50 NGPGGEEHL----AALRKHDGRRLLTAVDLPLGGNG---IPTDTGL-----YFTQIGIGT 97
Query: 141 PKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKIL 195
P + + +DTGS I W C C C ++ +DP+ S + + C C
Sbjct: 98 PSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTASASSKTVTCGQEFCATA 157
Query: 196 LEW-FPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGY--FARYPFLL 251
PP+ C++ C Y I Y DGS TGF+ D + +V+G+G A
Sbjct: 158 TNGGVPPS----CAANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGDGQTNLANASVTF 213
Query: 252 GC---TDNNTGDQNGA-SGIMGLDRGPVSIISKTNIS-----YFFYCLHSPYGSTGYITF 302
GC G N A GI+G + S++S+ + F +CL + G G
Sbjct: 214 GCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCLDTVNGG-GIFAI 272
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT----KLSTEIDS 358
G V + VK TP+V Y++ L I VGG L L + F T IDS
Sbjct: 273 GN---VVQPKVKTTPLVPGMPH---YNVVLKTIDVGGSTLQLPTNIFDIGGGSRGTIIDS 326
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGG 418
GT + P VY A+ SA + K ++D C+ S P++T HF G
Sbjct: 327 GTTLAYLPEVVYKAVLSAVFSNHPDVTL-KNVQDFL--CFQYSGSVDNGFPEVTFHFDGD 383
Query: 419 VDLELDVRGTLVVESVRQVCLGF---ALLPSD-PNSILLGNVQQRGYEVHYDVAGRRLGF 474
+ L + L + C+GF + D + +LLG++ V YD+ + +G+
Sbjct: 384 LPLVVYPHDYLFQNTEDVYCVGFQSGGVQSKDGKDMVLLGDLALSNKLVVYDLENQVIGW 443
Query: 475 GPGNCN 480
NC+
Sbjct: 444 TNYNCS 449
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 102/365 (27%), Positives = 151/365 (41%), Gaps = 39/365 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
Y + IG P Q +L++DTGS +T+ C C C + +DP F P S T+ + C +
Sbjct: 81 YTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQPVKC-TLD 139
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
C D+ +C Y+ Y + S +G D + A +
Sbjct: 140 CNC--------DNDR---MQCVYERQYAEMSTSSGVLGED---VVSFGNQSELAPQRAVF 185
Query: 252 GCTDNNTGD--QNGASGIMGLDRGPVSI----ISKTNISYFFYCLHS--PYGSTGYITFG 303
GC + TGD A GIMGL RG +SI + K +S F + G + G
Sbjct: 186 GCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGG 245
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGTII 362
+ F + P+ +S +Y+I L I V G+RLPL S F K + +DSGT
Sbjct: 246 ISPPSDMVFAQSDPV-----RSPYYNIDLKEIHVAGKRLPLNPSVFDGKHGSVLDSGTTY 300
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCY-----DLSAYKTVVVPKITIHFL 416
P + A + A K ++ + G + + D C+ D+S P + + F
Sbjct: 301 AYLPEEAFLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSK-TFPVVDMIFG 359
Query: 417 GGVDLELDVRGTLVVES-VRQV-CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
G L + S VR CLG DP + LLG + R V YD ++GF
Sbjct: 360 NGHKYSLSPENYMFRHSKVRGAYCLGIFQNGKDPTT-LLGGIVVRNTLVLYDREQTKIGF 418
Query: 475 GPGNC 479
NC
Sbjct: 419 WKTNC 423
>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
Length = 538
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 111/435 (25%), Positives = 177/435 (40%), Gaps = 68/435 (15%)
Query: 100 NSRRLQ----KAIPDNFKKTKAFTFPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGS 153
+SRR Q +P+ T F P ++ I Y + V G P +L+LDT +
Sbjct: 89 SSRRRQAKESSKLPEVMSATSMFELPMRSALNIAHVGMYLVSVRFGTPALPYNLVLDTAN 148
Query: 154 GITWTQCK--------------------PCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+TW C+ +R ++ P+KS ++ +I C+ C
Sbjct: 149 DLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKEARRKNWYRPAKSSSWRRIRCSQKECA 208
Query: 194 ILLEWFPPNG-QDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLL 251
+L P N Q ++ C Y DG+ G + ++ T+ +G A+ P +L
Sbjct: 209 LL----PYNTCQSPSKAESCSYYQQMQDGTLTMGIYGKEKATV--TVSDGRMAKLPGLIL 262
Query: 252 GCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGSTGYITFGK 304
GC+ G + G++ L G +S + F +CL +S ++ Y+TFG
Sbjct: 263 GCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFGQRFSFCLLSANSSRDASSYLTFGP 322
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPL-----KASYFTKLSTEIDSG 359
V T IV + Y +TGI VGGERL + A +D+
Sbjct: 323 NPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERLDIPQEIWDAEKVVGGGVILDTS 382
Query: 360 TIITRFPAPVYSALRSAFRKRM----KKYKMGKGIEDLFDTCY---------DLSAYKTV 406
T +T Y+A+ SA + + + Y++ D F+ CY DL+ V
Sbjct: 383 TSVTSLVPEAYAAVTSALDRHLSHLPRVYEL-----DGFEYCYRWTFAGDGVDLT--HNV 435
Query: 407 VVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSILLGNVQQRGYEVHY 465
VP++T+ GG LE + + ++ E V V CL F LP I LGNV + Y
Sbjct: 436 TVPRLTVEMAGGARLEPEAKSVVMPEVVPGVACLAFRKLPRGGPGI-LGNVLMQEYIWEI 494
Query: 466 DVAGRRLGFGPGNCN 480
D ++ F CN
Sbjct: 495 DHGKGKMRFRKDKCN 509
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 166/370 (44%), Gaps = 58/370 (15%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI-HCSQQRDPFFDPSKSKTFSKIPC-NS 189
YY + +G P + SL++DTGS +TW +C PC CS FD S T+ + C +
Sbjct: 124 YYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSST----FDRLASNTYKALTCADD 179
Query: 190 TTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPF 249
+LL + + S D + G+ A+D + +E G F
Sbjct: 180 LRLPVLLRLW----RRLFHSGRSLRDTLKMAGA------ASDEL--EEFPG--------F 219
Query: 250 LLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCL----------HSP--Y 294
+ GC G +G GI+ L G +S S+ Y F YCL SP +
Sbjct: 220 VFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVF 279
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS- 353
G + +P + + ++YTPI E S +Y + L GISVG +RL L S F
Sbjct: 280 GEAA-VELKEPGSGKPQELQYTPI---GESSIYYTVRLDGISVGNQRLDLSPSTFLNGQD 335
Query: 354 --TEIDSGTIITRFPAPVYSALRSAFRKRMK--KYKMGKGIEDLFDTCYDLSAYKTVVVP 409
T DSGT +T P+ V +++ + + ++ KG+ D C+ + +P
Sbjct: 336 KPTIFDSGTTLTMLPSGVCDSIKQSLASMVSGAEFVAIKGL----DACFRVPPSSGQGLP 391
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
IT HF GG D + V++ CL F +P++ SI GN+QQ+ + V +D+
Sbjct: 392 DITFHFNGGADF-VTRPSNYVIDLGSLQCLIF--VPTNEVSI-FGNLQQQDFFVLHDMDN 447
Query: 470 RRLGFGPGNC 479
RR+GF +C
Sbjct: 448 RRIGFKETDC 457
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 165/385 (42%), Gaps = 42/385 (10%)
Query: 120 FPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFF 174
P+ TG+ YY V +G P + + +DTGS I W C C C ++ +
Sbjct: 65 LPSSTGL-----YYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLY 119
Query: 175 DPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMT 234
DP+ SKT + +PC C P +G + CPY I Y DGS +G + D +T
Sbjct: 120 DPNGSKTSNAVPCGDGFCTDTYSG-PISGCKQ--DMSCPYSITYGDGSTTSGSFVNDSLT 176
Query: 235 IQEVNGNGYFA--RYPFLLGCTDNNTGDQNGAS-----GIMGLDRGPVSIISKTNIS--- 284
EV+GN + + GC +G + S GI+G + S++S+ S
Sbjct: 177 FDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKV 236
Query: 285 --YFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL 342
F +CL S +G G + G+ + KF TP+V P + Y++ L + V GE +
Sbjct: 237 KRIFSHCLDSHHGG-GIFSIGQ--VMEPKF-NTTPLV--PRMAH-YNVILKDMDVDGEPI 289
Query: 343 PLKASYFTKLS---TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYD 399
L F S T IDSGT + P +Y+ L R K+ +ED F TC+
Sbjct: 290 LLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKL-MIVEDQF-TCFH 347
Query: 400 LSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNS----ILLGN 455
S P + HF G+ L + L + C+G+ + IL+G+
Sbjct: 348 YSDKLDEGFPVVKFHF-EGLSLTVHPHDYLFLYKEDIYCIGWQKSSTQTKEGRDLILIGD 406
Query: 456 VQQRGYEVHYDVAGRRLGFGPGNCN 480
+ V YD+ +G+ NC+
Sbjct: 407 LVLSNKLVVYDLENMVIGWTNFNCS 431
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 100/366 (27%), Positives = 151/366 (41%), Gaps = 39/366 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST- 190
Y + IG P Q +L++D+GS +T+ C C C +DP F P S T+S + C++
Sbjct: 85 YTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCSADC 144
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
TC DK +C Y+ Y + S +G D I +
Sbjct: 145 TCD----------SDK---SQCTYERQYAEMSSSSGVLGED---IVSFGTESELKPQRAV 188
Query: 251 LGCTDNNTGD--QNGASGIMGLDRGPVSI----ISKTNISYFFYCLHS--PYGSTGYITF 302
GC ++ TGD A GIMGL RG +SI + K I F + G +
Sbjct: 189 FGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLG 248
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-TKLSTEIDSGTI 361
P + F + P+ +S +Y+I L I V G+ L L F +K T +DSGT
Sbjct: 249 AMPAPPDMVFSRSDPV-----RSPYYNIELKEIHVAGKALRLDPRIFDSKHGTVLDSGTT 303
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYK----TVVVPKITIHFL 416
P + A + A +++ K +G + + D C+ + + P + + F
Sbjct: 304 YAYLPEQAFVAFKDAVTSKVRPLKKIRGPDPNYKDICFAGAGRNVSQLSQAFPDVDMVFG 363
Query: 417 GGVDLELDVRGTLVVESVRQ--VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
G L L L S + CLG DP + LLG + R V YD ++GF
Sbjct: 364 DGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTT-LLGGIVVRNTLVTYDRHNEKIGF 422
Query: 475 GPGNCN 480
NC+
Sbjct: 423 WKTNCS 428
>gi|125553836|gb|EAY99441.1| hypothetical protein OsI_21409 [Oryza sativa Indica Group]
Length = 376
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 147/357 (41%), Gaps = 40/357 (11%)
Query: 17 SSNNGAYANDNDLSHSYIVSVSSLIPPTVCNRTRTALPQGPGKVSLEVLGRY---GPCSK 73
SS A+ D + +++ SS+ P C+ + A P ++ + GPCS
Sbjct: 16 SSTLVAHGGDAEAGAYMLIATSSMKPKASCSGHKVA-PSNEASLNSTWAPLHLVSGPCSP 74
Query: 74 L------NQGKSRNTPSLEEILRRDQQRLHL--------KNSRRLQKAIPDNFKKTKAFT 119
N + S+ ++L DQ R+ S + A D
Sbjct: 75 AYSRGTDNSSTDDDVTSIAKMLDADQHRVAYIQKRLAGGDTSNGVAGASWDGQTTDVGTY 134
Query: 120 FPAKTGIVAADEYYIVVAI-GKPKQYVSLLLDTGSGITWTQCKPC--IHCSQQRDPFFDP 176
PA V A A G ++++D+GS + W QC+PC + C QRDP FDP
Sbjct: 135 LPASNVGVGAKMIGTTAAPDGTSAVRQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDP 194
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVDGSGETGFWATDRMTI 235
+ S T+S +PC+S C L + + CS+ +C + Y DG+ TG +++D +T+
Sbjct: 195 ATSTTYSAVPCSSAACARLGPY-----RRGCSANVQCQFGFTYTDGATATGTYSSDDLTL 249
Query: 236 QEVNGNGYFARYPFLLGCTDNNTGDQNG--ASGIMGLDRGPVSIISKTNISY---FFYCL 290
Y FL GC + G SG + L G S + +T Y F YC+
Sbjct: 250 GP-----YDVVRGFLFGCAHADRGSTFSFDVSGTLALGGGAQSFVQQTATQYGRVFSYCI 304
Query: 291 HSPYGSTGYITFGKP---DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPL 344
S G+IT G P + FV + ++ FY + L I V G LP+
Sbjct: 305 PPSPSSLGFITLGVPPQRAALVPTFVSTPLLSSSSMPPTFYRVLLRAIIVAGRPLPV 361
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 103/369 (27%), Positives = 158/369 (42%), Gaps = 39/369 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQ-----RDPFFDPSKSKTFSKIP 186
Y+ + +G P + + +DTGS I W CKPC C + R FD + S T K+
Sbjct: 74 YFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVG 133
Query: 187 CNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTIQEVNGN---G 242
C+ C + + D C + C Y I Y D S G + D +T+++V G+ G
Sbjct: 134 CDDDFCSFISQ------SDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTG 187
Query: 243 YFARYPFLLGCTDNNTGD-QNGAS---GIMGLDRGPVSIISKTNIS-----YFFYCLHSP 293
+ + GC + +G NG S G+MG + S++S+ + F +CL +
Sbjct: 188 PLGQ-EVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNV 246
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLS 353
G G G V+ VK TP+V P Q Y++ L G+ V G L L S
Sbjct: 247 KGG-GIFAVG---VVDSPKVKTTPMV--PNQMH-YNVMLMGMDVDGTSLDLPRSIVRNGG 299
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITI 413
T +DSGT + FP +Y +L R + K+ +E+ F C+ S P ++
Sbjct: 300 TIVDSGTTLAYFPKVLYDSLIETILAR-QPVKL-HIVEETF-QCFSFSTNVDEAFPPVSF 356
Query: 414 HFLGGVDLELDVRGTLVVESVRQVCLGFAL--LPSDPNS--ILLGNVQQRGYEVHYDVAG 469
F V L + L C G+ L +D S ILLG++ V YD+
Sbjct: 357 EFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDN 416
Query: 470 RRLGFGPGN 478
+G+ N
Sbjct: 417 EVIGWADHN 425
>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 96/398 (24%), Positives = 170/398 (42%), Gaps = 54/398 (13%)
Query: 114 KTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQC-KPCIHCSQQRDP 172
K A + ++ +YY + IG P + L +DTGS +TW QC PC +C++ P
Sbjct: 111 KAAAAEEGSTAAVLPERQYYTSINIGNPARPYFLDVDTGSALTWIQCDAPCTNCTKGPHP 170
Query: 173 FFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATD 231
+ P+K +P + C+ L Q+ C + K+C Y+IAY D S G A D
Sbjct: 171 LYKPAKENI---VPPRDSHCQELQ-----GNQNYCDTCKQCDYEIAYADRSSSAGVLARD 222
Query: 232 RMTIQEVNGNGYFARYPFLLGCTDNNTGDQNG----ASGIMGLDRGPVSI---ISKTNI- 283
M + + +G + GC + G G + GI+GL G +S+ ++K I
Sbjct: 223 NMEL--ITADGERENMDLVFGCAHDQQGKLLGSPASSDGILGLSNGAMSLPTQLAKQGII 280
Query: 284 -SYFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL 342
+ F +C+ + + Y+ G D V + + + P+ PE + Y + ++ G + L
Sbjct: 281 SNVFGHCIATDPSGSAYMFLGD-DYVPRWGMTWVPVRNGPE--DVYSTVVQKVNYGCQEL 337
Query: 343 PLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKR----------------MKKYKM 386
++ DSG+ T FP +Y++L ++ MK
Sbjct: 338 NVREQAGKLTQVIFDSGSSYTYFPHEIYTSLITSLEAVSPGFVRDESDQTLPFCMKPNFP 397
Query: 387 GKGIEDLFDTCYDLSAY--KT-VVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGF-- 441
+ ++D+ L + KT +V+P+ E+ L++ VCLG
Sbjct: 398 VRSVDDVKQLHKPLLLHFSKTWLVIPRT---------FEISPENYLIISGKGNVCLGVLD 448
Query: 442 ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++I++G+V RG V YD ++G+ +C
Sbjct: 449 GTEIGHSSTIVIGDVSLRGKLVAYDNDANQIGWAQSDC 486
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 151/364 (41%), Gaps = 26/364 (7%)
Query: 136 VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
+++G P Q ++ L SG +W C + F P S + +K+PC S +C
Sbjct: 3 LSLGTPPQPLNFTLAVDSGFSWVACSSSCAINCTTASLFQPGLSTSHTKLPCGSPSCSA- 61
Query: 196 LEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTD 255
F S C Y+ +Y G +D T+ V A LGC
Sbjct: 62 ---FSAVSTSCGPSSSCSYNTSYGTNFSSAGDLVSDIATMDSVRNRKVAANLS--LGCGR 116
Query: 256 NNTG--DQNGASGIMGLDRGPVSIISKTNI----SYFFYCLHSPYGSTGYITFGKPDTVN 309
++ G + SG +G D+G VS + + + S F YCL S G + G N
Sbjct: 117 DSGGLLELLDTSGFVGFDKGNVSFMGQLSALGYRSKFIYCLPSDT-FRGKLVIGNYKLRN 175
Query: 310 KKF---VKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE---IDSGTIIT 363
+ YTP++T P+ +E Y I L+ IS+ + + F T ID+ T ++
Sbjct: 176 ASISSSMAYTPMITNPQAAELYFINLSTISIDKNKFQVPIQGFLSNGTGGTVIDTTTFLS 235
Query: 364 RFPAPVYSALRSAFRKRMKKY-KMGKGIEDLF--DTCYDLSAYKTVVVPK-ITIHFLGGV 419
+ Y+ L A + ++ + D + CY++SA P +T HFLGG
Sbjct: 236 YLTSDFYTQLVQAIKNYTTNLVEVSSSVADALGVELCYNISANSDFPPPATLTYHFLGGA 295
Query: 420 DLELDVRGTL-VVESVRQ-VCLGFALLPS-DPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
+E+ L +SV +C+ S PN ++G QQ V YD+ R GFG
Sbjct: 296 GVEVSTWFLLDDSDSVNNTICMAIGRSESVGPNLNVIGTYQQLDLTVEYDLEQMRYGFGA 355
Query: 477 GNCN 480
CN
Sbjct: 356 QGCN 359
>gi|414871328|tpg|DAA49885.1| TPA: hypothetical protein ZEAMMB73_545054 [Zea mays]
Length = 565
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 80/243 (32%), Positives = 122/243 (50%), Gaps = 18/243 (7%)
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKP 305
+ GC TG + G++G +RGP+S S+ Y F YCL S S T
Sbjct: 327 YTFGCLCVVTGGSVPSQGLVGFNRGPLSFPSQNKNVYGSVFSYCLPSYKSSNFSGTLRLG 386
Query: 306 DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE--RLPLKASYFTKLS---TEIDSGT 360
K +K TP+++ P + Y++ + GI VGG +P A F S T +D+GT
Sbjct: 387 PAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVAVPASALAFDPASGHGTIVDAGT 446
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVD 420
+ TR APVY+A+ FR R++ G FDTCY++ T+ VP +T F G V
Sbjct: 447 MFTRLSAPVYAAVCDVFRSRVRAPVAGP--LGGFDTCYNV----TISVPTVTFLFDGRVS 500
Query: 421 LELDVRGTLVVESVRQV-CLGFALLPSDP-NSIL--LGNVQQRGYEVHYDVAGRRLGFGP 476
+ L ++ S+ + CL A PSD +++L + ++QQ+ + V +DVA R+GF
Sbjct: 501 VTLPEENVVIRSSLDGIACLAMAAGPSDSVDAVLNVMASMQQQNHRVLFDVANGRVGFSR 560
Query: 477 GNC 479
C
Sbjct: 561 ELC 563
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 108/388 (27%), Positives = 160/388 (41%), Gaps = 49/388 (12%)
Query: 117 AFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD----- 171
F P TG+ YY + IG P + LDTGS W C C + D
Sbjct: 73 GFNIPYGTGL-----YYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKL 127
Query: 172 PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWAT 230
F+DP S + ++ C+ T C PP C+ + CPY Y DG G T
Sbjct: 128 TFYDPRSSVSSKEVKCDDTICTSR----PP-----CNMTLRCPYITGYADGGLTMGILFT 178
Query: 231 DRMTIQEVNGNGYF--ARYPFLLGCTDNNTGDQNGAS----GIMGLDRGPVSIISKTNIS 284
D + ++ GNG GC +G N ++ GI+G + +S+ +
Sbjct: 179 DLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAA 238
Query: 285 -----YFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYH-ITLTGISVG 338
F +CL S G F + V K VK TPIV + +E YH + L I+V
Sbjct: 239 GKTKKIFSHCLDSTNGGG---IFAIGEVVEPK-VKTTPIV---KNNEVYHLVNLKSINVA 291
Query: 339 GERLPLKASYFTKLSTE---IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFD 395
G L L A+ F T+ IDSG+ + P +YS L A + MG +++
Sbjct: 292 GTTLQLPANIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAVFAKHPDITMGA----MYN 347
Query: 396 -TCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGF--ALLPSDPNSIL 452
C+ PKIT HF + L++ L+ Q C GF A + + I+
Sbjct: 348 FQCFHFLGSVDDKFPKITFHFENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMII 407
Query: 453 LGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
LG++ V YD+ + +G+ NC+
Sbjct: 408 LGDMVISNKVVVYDMEKQAIGWTEHNCS 435
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 101/425 (23%), Positives = 180/425 (42%), Gaps = 46/425 (10%)
Query: 81 NTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGK 140
N LE L + + R L+++R LQ + F+ + Y+ V +G
Sbjct: 21 NNHGLE--LHQLRARDRLRHARLLQGFV----GGVVDFSVQGSSDPYLVGLYFTKVKLGS 74
Query: 141 PKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKIL 195
P + ++ +DTGS + W C C +C + FFD S S T ++ C+ C
Sbjct: 75 PPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVRCSDPICTSA 134
Query: 196 LEWFPPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL-- 251
++ +CSS+ +C Y Y DGSG +G++ +D + + G L+
Sbjct: 135 VQ----TTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNSSALIVF 190
Query: 252 GCTDNNTGD----QNGASGIMGLDRGPVSIISKTNI-----SYFFYCLHSPYGSTGYITF 302
GC+ +GD GI G +G +S+IS+ + F +CL G +
Sbjct: 191 GCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCLKGDGSGGGILVL 250
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE---IDSG 359
G+ + + + Y+P+V P Q Y++ L I+V G+ LP+ + F +++ +DSG
Sbjct: 251 GE---ILEPGIVYSPLV--PSQPH-YNLNLLSIAVNGQLLPIDPAAFATSNSQGTIVDSG 304
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGV 419
T + A Y SA + I + CY +S + + P + +F GG
Sbjct: 305 TTLAYLVAEAYDPFVSAVNAIVSPSV--TPITSKGNQCYLVSTSVSQMFPLASFNFAGGA 362
Query: 420 DLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
+ L L+ C+GF + +LG++ + YD+ +R+G+
Sbjct: 363 SMVLKPEDYLIPFGSSGGSAMWCIGFQKV---QGVTILGDLVLKDKIFVYDLVRQRIGWA 419
Query: 476 PGNCN 480
+C+
Sbjct: 420 NYDCS 424
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 114/421 (27%), Positives = 181/421 (42%), Gaps = 49/421 (11%)
Query: 87 EILRRDQQRLHLKNSR-----RLQKAIPDNFKKTKAFTFPAKTGIVAAD--------EYY 133
E++ RD L N+ RL A+ + + F I AA+ ++
Sbjct: 40 ELIHRDSPNSPLFNASETTDIRLANAVERSADRVNRFNDLISNSITAAEFPSILDNGDFL 99
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQC---KPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
+ ++IG P + + + TGS + W C KPC H R FFDP +S T+ +PC+S
Sbjct: 100 MKISIGIPPTELLVNVATGSDLVWIPCLSFKPCTHNCDLR--FFDPMESSTYKNVPCDSY 157
Query: 191 TCKILLEWFPPNGQDKCSSKECPY--DIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
C+I C +C Y D + D S G A D +T+ G +
Sbjct: 158 RCQI-------TNAATCQFSDCFYSCDPRHQD-SCPDGDLAMDTLTLNSTTGKSFMLPNT 209
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY-----FFYCLHSPYGS--TGYIT 301
+ C + GD G GI+GL G +S++++ IS+ F +C+ PY S T ++
Sbjct: 210 GFI-CGNRIGGDYPGV-GILGLGHGSLSLLNR--ISHLIDGKFSHCI-VPYSSNQTSKLS 264
Query: 302 FGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLP---LKASYFTKLSTEIDS 358
FG V+ + T + T Y ++ GISVG + + + + Y+ +DS
Sbjct: 265 FGDKAVVSGSAMFSTRLDMTGGPYS-YTLSFYGISVGNKSISAGGIGSDYYMN-GLGMDS 322
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGG 418
GT+ T FP YS L R +++ + CY S P IT+HF GG
Sbjct: 323 GTMFTYFPEYFYSQLEYDVRYAIQQEPLYPDPTRRLRLCYRYS--PDFSPPTITMHFEGG 380
Query: 419 VDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGN 478
+EL + + + VCL FA S+ +++ G QQ + YD+ L F +
Sbjct: 381 -SVELSSSNSFIRMTEDIVCLAFATSSSEQDAV-FGYWQQTNLLIGYDLDAGFLSFLKTD 438
Query: 479 C 479
C
Sbjct: 439 C 439
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 97/370 (26%), Positives = 146/370 (39%), Gaps = 45/370 (12%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
+Y + +G P++ S+++DTGS IT+ CK C HC + +FDP KS T K+ C
Sbjct: 13 FYTTLKLGTPERTFSVIIDTGSTITYIPCKDCSHCGKHTAEWFDPDKSTTAKKLACGDPL 72
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
C C++ C Y Y + S G+ D + + + +
Sbjct: 73 CNC------GTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSD-----SPVRLVF 121
Query: 252 GCTDNNTGD--QNGASGIMGLDRGPVSIIS-----KTNISYFFYCLHSPYGSTGYITFGK 304
GC + TG+ + A GIMG+ + S K F C P G + G
Sbjct: 122 GCENGETGEIYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYP--KDGILLLGD 179
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTK-LSTEIDSGTIIT 363
YTP++T +Y++ + GI+V G+ L AS F + T +DSGT T
Sbjct: 180 VTLPEGANTVYTPLLTH-LHLHYYNVKMDGITVNGQTLAFDASVFDRGYGTVLDSGTTFT 238
Query: 364 RFPAPVYSALRSAFRKRMKKYKMG--------------KGIEDLFDTCYDLSAYKTVVVP 409
P + A+ A ++K + KG D F DL Y P
Sbjct: 239 YLPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAPDQFK---DLDKY----FP 291
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAG 469
F GG L L L + + CLG + + + L+G V R V YD
Sbjct: 292 PAEFVFGGGAKLTLPPLRYLFLSKPAEYCLG--IFDNGNSGALVGGVSVRDVVVTYDRRN 349
Query: 470 RRLGFGPGNC 479
++GF C
Sbjct: 350 SKVGFTTMAC 359
>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 102/374 (27%), Positives = 161/374 (43%), Gaps = 41/374 (10%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIP------- 186
+ + IG P Q L+LDTGS ++W QC ++ P P + +
Sbjct: 68 VSLPIGTPPQPTDLVLDTGSQLSWIQCHD-KKVKKRLPPLPKPKTASFDPSLSSSFSLLP 126
Query: 187 CNSTTCKILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA 245
CN CK + F P D+ ++ C Y Y DG+ G ++ T + +
Sbjct: 127 CNHPICKPRIPDFTLPTSCDQ--NRLCHYSYFYADGTLAEGNLVREKFTFSKS-----LS 179
Query: 246 RYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKP 305
P +LGC +T ++ GI+G++ G +S IS+ IS F YC+ S GS F
Sbjct: 180 TPPVILGCAQASTENR----GILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLG 235
Query: 306 DTVNKKFVKYTPIVTTPEQSE-------FYHITLTGISVGGERLPLKASYFTKLS----- 353
D N KY ++T PE Y + + I + G+RL + + F +
Sbjct: 236 DNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQ 295
Query: 354 TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKG--IEDLFDTCYDLSAYKTV--VVP 409
T IDSG+ +T Y ++ R+ M KG D+ D C+D V +
Sbjct: 296 TMIDSGSDLTYLVDEAYEKVKEEV-VRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIG 354
Query: 410 KITIHFLGGVDLELDVRGTLVVESVRQ--VCLGFALLPS-DPNSILLGNVQQRGYEVHYD 466
I+ F GV++ + RG V+ V + C+G S ++G V Q+ V YD
Sbjct: 355 GISFEFDNGVEIFVG-RGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYD 413
Query: 467 VAGRRLGFGPGNCN 480
+A +R+GFG C+
Sbjct: 414 LANKRVGFGGAECS 427
>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
Length = 538
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 107/423 (25%), Positives = 172/423 (40%), Gaps = 64/423 (15%)
Query: 108 IPDNFKKTKAFTFPAKTG--IVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCK---- 161
+P+ T F P ++ I Y + V G P +L+LDT + +TW C+
Sbjct: 101 LPEVMSATSMFELPMRSALNIAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRR 160
Query: 162 ----------------PCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG-Q 204
+R ++ P+KS ++ +I C+ C +L P N Q
Sbjct: 161 KGKHYGRTMSVGAGDDGAAAKEARRKNWYRPAKSSSWRRIRCSQKECALL----PYNTCQ 216
Query: 205 DKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTGDQ-N 262
++ C Y DG+ G + ++ T+ +G A+ P +LGC+ G +
Sbjct: 217 SPSKAESCSYYQQMQDGTLTMGIYGKEKATV--TVSDGRMAKLPGLILGCSVLEAGGSVD 274
Query: 263 GASGIMGLDRGPVSIISKTNISY---FFYCL---HSPYGSTGYITFGKPDTVNKKFVKYT 316
G++ L G +S + F +CL +S ++ Y+TFG V T
Sbjct: 275 AHDGVLSLGNGEMSFAVHAAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMET 334
Query: 317 PIVTTPEQSEFYHITLTGISVGGERLPL-----KASYFTKLSTEIDSGTIITRFPAPVYS 371
IV + Y +TGI VGGERL + A +D+ T +T Y+
Sbjct: 335 DIVYNVDVKPAYGPLVTGIFVGGERLDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAYA 394
Query: 372 ALRSAFRKRM----KKYKMGKGIEDLFDTCY---------DLSAYKTVVVPKITIHFLGG 418
A+ SA + + + Y++ D F+ CY DL+ V VP++T+ GG
Sbjct: 395 AVTSALDRHLSHLPRVYEL-----DGFEYCYRWTFAGDGVDLA--HNVTVPRLTVEMAGG 447
Query: 419 VDLELDVRGTLVVESVRQV-CLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
LE + + ++ E V V CL F LP I LGNV + Y D ++ F
Sbjct: 448 ARLEPEAKSVVMPEVVPGVACLAFRKLPRGGPGI-LGNVLMQEYIWEIDHGKGKMRFRKD 506
Query: 478 NCN 480
CN
Sbjct: 507 KCN 509
>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
Length = 671
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 83/304 (27%), Positives = 143/304 (47%), Gaps = 38/304 (12%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP--------FFDPSKSKTFS 183
+Y VVA+G P + LDTGS + W C C+ C+ + P + P++S T
Sbjct: 35 HYAVVALGTPNVTFLVALDTGSDLFWVPCD-CLKCAPFQSPNYGSLKFDVYSPAQSTTSR 93
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCSSK--ECPYDIAYV-DGSGETGFWATDRMTIQEVNG 240
K+PC+S C + Q+ C SK CPY I Y+ D + +G D + + +
Sbjct: 94 KVPCSSNLCDL---------QNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSA 144
Query: 241 NGYFARYPFLLGCTDNNTGDQNGAS---GIMGL---DRGPVSIISKTNISYFFYCLHSPY 294
P + GC TG G++ G++GL + S+++ ++ + +
Sbjct: 145 QSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGD 204
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLST 354
G I FG + ++ K TP+ +Q+ +Y+IT+TGI+VG + S T+ S
Sbjct: 205 DGHGRINFGDTGSSDQ---KETPL-NVYKQNPYYNITITGITVGSK------SISTEFSA 254
Query: 355 EIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIH 414
+DSGT T P+Y+ + S+F +++ + F+ CY +SA +V P +++
Sbjct: 255 IVDSGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSA-NGIVHPNVSLT 313
Query: 415 FLGG 418
GG
Sbjct: 314 AKGG 317
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 103/382 (26%), Positives = 169/382 (44%), Gaps = 50/382 (13%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQC-KPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
YY+ + +G P + L +DTGS +TW QC PC +C+ ++P K+K + C+
Sbjct: 40 YYMALLLGSPPKLYFLDMDTGSDLTWAQCDAPCRNCAIGPHGLYNPKKAKV---VDCHLP 96
Query: 191 TCKILLEWFPPNGQDKCSS--KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
C + + G +C+S K+C Y++ Y DGS G D +T++ NG +
Sbjct: 97 VCAQIQQ----GGSYECNSDVKQCDYEVEYADGSSTMGVLVEDTLTVRLTNGT--LIQTK 150
Query: 249 FLLGCTDNNTG----DQNGASGIMGLDRGPVSI---ISKTNI--SYFFYCLHSPYGSTGY 299
++GC + G G++GL V++ +++ I + +CL GY
Sbjct: 151 AIIGCGYDQQGTLAKSPASTDGVIGLSSSKVALPAQLAEKGIIKNVLGHCLADGSNGGGY 210
Query: 300 ITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE---I 356
+ FG + V + +TP++ PE Y L I GG+ L L ST
Sbjct: 211 LFFGD-ELVPSWGMTWTPMMGKPEMLG-YQARLQSIRYGGDSLVLNNDEDLTRSTSSVMF 268
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIE--------DLFDTCYDLSAYKTVVV 408
DSGT T Y+++ SA K+ ++ F + D+ Y
Sbjct: 269 DSGTSFTYLVPQAYASVLSAVTKQSGLLRVKSDTTLPYCWRGPSPFQSITDVHQY----F 324
Query: 409 PKITIHFLG----GVDLELDV--RGTLVVESVRQVCLGFALLPSDPNSI----LLGNVQQ 458
+T+ F G D LD+ +G L+V + VCLG +L + S+ ++G+V
Sbjct: 325 KTLTLDFGGRNWFATDSTLDLSPQGYLIVSTQGNVCLG--ILDASGASLEVTNIIGDVSM 382
Query: 459 RGYEVHYDVAGRRLGFGPGNCN 480
RGY V YD R+G+ NC+
Sbjct: 383 RGYLVVYDNVRDRIGWIRRNCH 404
>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 686
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 92/383 (24%), Positives = 159/383 (41%), Gaps = 33/383 (8%)
Query: 120 FPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQC-KPCIHCSQQRDPFFDPSK 178
FP + + Y+ + +G P + L +DTGS +TW QC PC C++ +P + P K
Sbjct: 302 FPVRGDVYPNGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKK 361
Query: 179 SKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEV 238
+P + C + + C ++C Y+I Y D S G A+D + +
Sbjct: 362 GNL---VPLKDSLCVEVQRNLKTGYCETC--EQCDYEIEYADHSSSMGVLASDDLHLMLA 416
Query: 239 NGNGYFARYPFLLGCTDNNTG----DQNGASGIMGLDRGPVSIISKTNISYFF-----YC 289
NG+ + + GC + G GI+GL + VS+ S+ +C
Sbjct: 417 NGS--LTKLGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHC 474
Query: 290 LHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF 349
L S GY+ G D V + + P++ + S YH + IS G +L L
Sbjct: 475 LTSDATGGGYMFLGD-DFVPYWGMAWVPMLNS--HSPNYHSQIMKISHGSRQLSLGRQDG 531
Query: 350 TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS-AYKTVVV 408
D+G+ T FP Y AL ++ + + + G + C+ ++V+
Sbjct: 532 RTERVVFDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVCWRAKFPIRSVID 591
Query: 409 PK-----ITIHF-----LGGVDLELDVRGTLVVESVRQVCLGF--ALLPSDPNSILLGNV 456
K +T+ F + + G L++ + VCLG D ++I+LG++
Sbjct: 592 VKQFFQPLTLQFRSKWWIVSTKFRIPPEGYLIISNKGNVCLGILDGSNVHDGSTIILGDI 651
Query: 457 QQRGYEVHYDVAGRRLGFGPGNC 479
RG V YD +++G+ C
Sbjct: 652 SLRGKLVVYDNVNQKIGWAQSTC 674
>gi|413937238|gb|AFW71789.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 598
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 81/244 (33%), Positives = 119/244 (48%), Gaps = 18/244 (7%)
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKP 305
+ GC TG G++G GP+S S+ Y F YCL S S T
Sbjct: 360 YTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVYGFVFSYCLPSYKSSNFSSTLRLG 419
Query: 306 DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL--PLKASYFTKLS---TEIDSGT 360
K +K TP+++ P + Y++ + GI VGG + P A F S T +D+GT
Sbjct: 420 PAGQPKRIKMTPLLSNPHRPSLYYVNMVGIHVGGRPMLVPASALAFDPASGRGTIVDAGT 479
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVD 420
+ TR APVY+A+R FR R++ G FDTCY++ T+ VP +T F G V
Sbjct: 480 MFTRLSAPVYAAVRDVFRSRVRAPVTGP--LGGFDTCYNV----TISVPTVTFSFDGRVS 533
Query: 421 LELDVRGTLVVESVRQV-CLGFALLPSD-PNSIL--LGNVQQRGYEVHYDVAGRRLGFGP 476
+ L ++ S + CL A PSD +++L L ++QQ+ + V +DVA R+GF
Sbjct: 534 VTLPEENVVIRSSSDGIACLAMAAGPSDGVDAVLNVLASMQQQNHRVLFDVANGRVGFSR 593
Query: 477 GNCN 480
C
Sbjct: 594 ELCT 597
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 101/385 (26%), Positives = 162/385 (42%), Gaps = 44/385 (11%)
Query: 121 PAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFD 175
PA+ G+ Y+ + +G P + + +DTGS I W C C C + D +D
Sbjct: 76 PAEAGL-----YFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYD 130
Query: 176 PSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRM 233
P S + ++I C+ C NG + +K+ C Y + Y DGS GF+ D +
Sbjct: 131 PQSSTSATRIYCDDDFCAATY-----NGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNL 185
Query: 234 TIQEVNGN--GYFARYPFLLGCTDNNTGDQNGAS----GIMGLDRGPVSIISKTNIS--- 284
V GN A + GC +G+ +S GI+G + S+IS+ +
Sbjct: 186 QFDRVTGNLQTSSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKV 245
Query: 285 --YFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL 342
F +CL + G F + V+ K V TP+V P Q Y++ + I VGG L
Sbjct: 246 KRVFAHCLDNVKGGG---IFAIGEVVSPK-VNTTPMV--PNQPH-YNVVMKEIEVGGNVL 298
Query: 343 PLKASYF---TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYD 399
L F + T IDSGT + P VY ++ + K+ +E+ F TC+
Sbjct: 299 ELPTDIFDTGDRRGTIIDSGTTLAYLPEVVYESMMTKIVSEQPGLKL-HTVEEQF-TCFQ 356
Query: 400 LSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGF---ALLPSDPNSI-LLGN 455
+ P + HF G + L ++ L C G+ + D + LLG+
Sbjct: 357 YTGNVNEGFPVVKFHFNGSLSLTVNPHDYLFQIHEEVWCFGWQNSGMQSKDGRDMTLLGD 416
Query: 456 VQQRGYEVHYDVAGRRLGFGPGNCN 480
+ V YD+ + +G+ NC+
Sbjct: 417 LVLSNKLVLYDLENQAIGWTDYNCS 441
>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
Length = 165
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 64/164 (39%), Positives = 90/164 (54%), Gaps = 9/164 (5%)
Query: 322 PEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITRFPAPVYSALRSA 376
P+ +Y++ L GISVGGE L + + F S +DSGT +TR + VY+ +R A
Sbjct: 5 PQLDTYYYVGLVGISVGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNVVRDA 64
Query: 377 FRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVR 435
F K K + LFDTCYDLS+ +V VP + HF G L L + LV V+SV
Sbjct: 65 FVKGTKDLLATNEVS-LFDTCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPVDSVG 123
Query: 436 QVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
C FA P+ + ++GN+QQ+G V +D+A +GF P C
Sbjct: 124 TFCFAFA--PTMSSLSIIGNIQQQGTRVSFDLANSLVGFSPNRC 165
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 98/377 (25%), Positives = 165/377 (43%), Gaps = 55/377 (14%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+ + IG P Q ++LDTGS ++W QC H FDPS S +F +PC CK
Sbjct: 90 VTLPIGTPPQPQQMVLDTGSQLSWIQC----HNKTPPTASFDPSLSSSFYVLPCTHPLCK 145
Query: 194 ILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLG 252
+ F P D+ ++ C Y Y DG+ G +++ P +LG
Sbjct: 146 PRVPDFTLPTTCDQ--NRLCHYSYFYADGTYAEGNLVREKLAFSPSQ-----TTPPLILG 198
Query: 253 CTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL--HSPYGSTGYIT--FGKPDTV 308
C+ + A GI+G++ G +S + ++ F YC+ P + + T F +
Sbjct: 199 CSS----ESRDARGILGMNLGRLSFPFQAKVTKFSYCVPTRQPANNNNFPTGSFYLGNNP 254
Query: 309 NKKFVKYTPIVTTPEQSEF-------YHITLTGISVGGERLPLKASYFTKLS-----TEI 356
N +Y ++T P+ Y + + GI +GG +L + S F + T +
Sbjct: 255 NSARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSGQTMV 314
Query: 357 DSGTIITRFPAPVYSALRSAFRK----RMKKYKMGKGIEDLFDTCYDLSAYKT-VVVPKI 411
DSG+ T Y +R + R+KK + G+ D+ C+D +A + ++ +
Sbjct: 315 DSGSEFTFLVDVAYDRVREEIIRVLGPRVKKGYVYGGVADM---CFDGNAMEIGRLLGDV 371
Query: 412 TIHFLGGVDLEL-------DVRGTLVVESV-RQVCLGFALLPSDPNSILLGNVQQRGYEV 463
F GV++ + DV G + + R LG A S ++GN Q+ V
Sbjct: 372 AFEFEKGVEIVVPKERVLADVGGGVHCVGIGRSERLGAA-------SNIIGNFHQQNLWV 424
Query: 464 HYDVAGRRLGFGPGNCN 480
+D+A RR+GFG +C+
Sbjct: 425 EFDLANRRIGFGVADCS 441
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 104/411 (25%), Positives = 167/411 (40%), Gaps = 50/411 (12%)
Query: 99 KNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWT 158
K R++ A + P K + +YY + IG P + L +DTGS +TW
Sbjct: 154 KARNRMEVAKAATARTNSTALLPIKGNVFPDGQYYTSIFIGNPPRPYFLDVDTGSDLTWI 213
Query: 159 QCK-PCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS-KECPYDI 216
QC PC + ++ P + P+K K +P C+ L Q+ C + K+C Y+I
Sbjct: 214 QCDAPCTNFAKGPHPLYKPAKEKI---VPPRDLLCQEL-----QGNQNYCETCKQCDYEI 265
Query: 217 AYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD----QNGASGIMGLDR 272
Y D S G A D M + + NG + F+ GC + G GI+GL
Sbjct: 266 EYADQSSSMGVLARDDMHM--IATNGGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSS 323
Query: 273 GPVSIISKTN-----ISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEF 327
+S S+ + F +C+ G GY+ G D V + V +T I + P+
Sbjct: 324 AAISFPSQLASHGIIANVFGHCITREQGGGGYMFLGD-DYVPRWGVTWTSIRSGPD--NL 380
Query: 328 YHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMG 387
YH + G ++L + + DSG+ T P +Y L +A KY
Sbjct: 381 YHTQAHHVKYGDQQLRRPEQAGSTVQVIFDSGSSYTYLPNEIYENLVAAI-----KYASP 435
Query: 388 KGIEDLFDT----CYD-------LSAYKTVVVPKITIHF-----LGGVDLELDVRGTLVV 431
++D D C+ L K P + +HF + L++
Sbjct: 436 GFVQDTSDRTLPLCWKADFPVRYLEDVKQFFEP-LNLHFGKKWLFMSKTFTISPEDYLII 494
Query: 432 ESVRQVCLGFALLPSDPN---SILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
VCLG L ++ N +I++G+V RG V YD +++G+ +C
Sbjct: 495 SDKGNVCLGL-LNGTEINHGSTIIVGDVSLRGKLVVYDNQRKQIGWADSDC 544
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 101/403 (25%), Positives = 162/403 (40%), Gaps = 45/403 (11%)
Query: 103 RLQKAIPDNF-KKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQC- 160
R D F + + FP + Y + + IG+P + L LDTGS +TW QC
Sbjct: 30 RKTAGFSDRFTRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD 89
Query: 161 KPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYV 219
PC+ C + P + PS IPCN CK L N +C + E C Y++ Y
Sbjct: 90 APCVRCLEAPHPLYQPSS----DLIPCNDPLCKAL----HLNSNQRCETPEQCDYEVEYA 141
Query: 220 DGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNN---TGDQNGASGIMGLDRGPVS 276
DG G D ++ G R LGC + + G++GL RG VS
Sbjct: 142 DGGSSLGVLVRDVFSMNYTKGLRLTPR--LALGCGYDQIPGASSHHPLDGVLGLGRGKVS 199
Query: 277 IISKTNISYFF-----YCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHIT 331
I+S+ + + +CL S G G + FG D + V +TP+ + E S+ Y
Sbjct: 200 ILSQLHSQGYVKNVIGHCLSSLGG--GILFFGD-DLYDSSRVSWTPM--SREYSKHYSPA 254
Query: 332 LTG-ISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGI 390
+ G + GG LK L T DSG+ T F + Y A+ ++ + + +
Sbjct: 255 MGGELLFGGRTTGLK-----NLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEAR 309
Query: 391 EDL-----------FDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCL 439
+D F + ++ Y + + E+ L++ VCL
Sbjct: 310 DDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCL 369
Query: 440 GF--ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G N L+G++ + + YD + +G+ P +C+
Sbjct: 370 GILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPADCD 412
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 108/433 (24%), Positives = 189/433 (43%), Gaps = 58/433 (13%)
Query: 79 SRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAI 138
S + L E+ RD L++ R LQ K P++ G+ YY V +
Sbjct: 33 SNDGVELSELRARDS----LRHRRMLQSTNYVVDFPVKGTFDPSQVGL-----YYTKVKL 83
Query: 139 GKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCK 193
G P + + +DTGS + W C C C Q +FDP S T S I C+ C+
Sbjct: 84 GTPPREFYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPRSSSTSSLISCSDRRCR 143
Query: 194 ILLEWFPPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF------- 244
++ CSS+ +C Y Y DGSG +G++ +D M G F
Sbjct: 144 SGVQ----TSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFA-----GIFEGTLTTN 194
Query: 245 ARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLHSPYG 295
+ + GC+ TGD + GI G + +S+IS+ ++ F +CL
Sbjct: 195 SSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCLKGDNS 254
Query: 296 STGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT---KL 352
G + G+ + + + Y+P+V + Y++ L ISV G+ +P+ + F
Sbjct: 255 GGGVLVLGE---IVEPNIVYSPLV---QSQPHYNLNLQSISVNGQIVPIAPAVFATSNNR 308
Query: 353 STEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTV-VVPKI 411
T +DSGT + Y+ +A + + + + + CY ++ V + P++
Sbjct: 309 GTIVDSGTTLAYLAEEAYNPFVNAITALVPQSV--RSVLSRGNQCYLITTSSNVDIFPQV 366
Query: 412 TIHFLGGVDLELDVRGTLVVESV----RQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
+++F GG L L + L+ ++ C+GF +P +I LG++ + YD+
Sbjct: 367 SLNFAGGASLVLRPQDYLMQQNYIGEGSVWCIGFQRIPGQSITI-LGDLVLKDKIFVYDL 425
Query: 468 AGRRLGFGPGNCN 480
AG+R+G+ +C+
Sbjct: 426 AGQRIGWANYDCS 438
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 165/379 (43%), Gaps = 47/379 (12%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y V +G P + ++ +DTGS I W C C +C + FFD S T + +P
Sbjct: 84 YTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALVP 143
Query: 187 CNSTTCKILLEWFPPNGQDKCSSK--ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF 244
C+ C ++ +CS + +C Y Y DGSG +G + +D M + G
Sbjct: 144 CSDPMCASAIQ----GAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTP 199
Query: 245 ARYP----FLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNI-----SYFFYCLH 291
A + GC+ +GD GI+G G +S++S+ + F +CL
Sbjct: 200 ANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCLK 259
Query: 292 SPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT- 350
G + G+ + + + Y+P+V P Q Y++ L I+V G+ L + + F
Sbjct: 260 GDGNGGGILVLGE---ILEPSIVYSPLV--PSQPH-YNLNLQSIAVNGQVLSINPAVFAT 313
Query: 351 --KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYK---MGKGIEDLFDTCYDLSAYKT 405
K T IDSGT ++ Y L +A + ++ + KG + CY +
Sbjct: 314 SDKRGTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATSFISKGSQ-----CYLVLTSID 368
Query: 406 VVVPKITIHFLGGVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGY 461
P ++ +F GG ++L L+ + + C+GF + +LG++ +
Sbjct: 369 DSFPTVSFNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKV--QEGVTILGDLVLKDK 426
Query: 462 EVHYDVAGRRLGFGPGNCN 480
V YD+A +++G+ +C+
Sbjct: 427 IVVYDLARQQIGWTNYDCS 445
>gi|413937239|gb|AFW71790.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 537
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 81/244 (33%), Positives = 119/244 (48%), Gaps = 18/244 (7%)
Query: 249 FLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISY---FFYCLHSPYGSTGYITFGKP 305
+ GC TG G++G GP+S S+ Y F YCL S S T
Sbjct: 299 YTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVYGFVFSYCLPSYKSSNFSSTLRLG 358
Query: 306 DTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL--PLKASYFTKLS---TEIDSGT 360
K +K TP+++ P + Y++ + GI VGG + P A F S T +D+GT
Sbjct: 359 PAGQPKRIKMTPLLSNPHRPSLYYVNMVGIHVGGRPMLVPASALAFDPASGRGTIVDAGT 418
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVD 420
+ TR APVY+A+R FR R++ G FDTCY++ T+ VP +T F G V
Sbjct: 419 MFTRLSAPVYAAVRDVFRSRVRAPVTGP--LGGFDTCYNV----TISVPTVTFSFDGRVS 472
Query: 421 LELDVRGTLVVESVRQV-CLGFALLPSD-PNSIL--LGNVQQRGYEVHYDVAGRRLGFGP 476
+ L ++ S + CL A PSD +++L L ++QQ+ + V +DVA R+GF
Sbjct: 473 VTLPEENVVIRSSSDGIACLAMAAGPSDGVDAVLNVLASMQQQNHRVLFDVANGRVGFSR 532
Query: 477 GNCN 480
C
Sbjct: 533 ELCT 536
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 148/365 (40%), Gaps = 37/365 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCN-ST 190
Y + IG P Q +L++D+GS +T+ C C C +DP F P S T+S + CN
Sbjct: 88 YTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCNVDC 147
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
TC DK +C Y+ Y + S +G D I +
Sbjct: 148 TCD----------SDK---NQCTYERQYAEMSSSSGVLGED---IVSFGTESELKPQRAV 191
Query: 251 LGCTDNNTGD--QNGASGIMGLDRGPVSI----ISKTNI-SYFFYCLHSPYGSTGYITFG 303
GC ++ TGD A GIMGL RG +SI + K I F C G + G
Sbjct: 192 FGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLG 251
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGTII 362
++ V +P +Y+I L + V G+ L + F K T +DSGT
Sbjct: 252 AMPAPPGMIYTHSNAVRSP----YYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTY 307
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYK----TVVVPKITIHFLG 417
P + A + A ++ K +G + + D C+ + + V PK+ + F
Sbjct: 308 AYLPEQAFVAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKVDMVFGN 367
Query: 418 GVDLELDVRGTLVVESVRQ--VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
G L L L S + CLG DP + LLG + R V YD ++GF
Sbjct: 368 GQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTT-LLGGIVVRNTLVTYDRHNEKIGFW 426
Query: 476 PGNCN 480
NC+
Sbjct: 427 KTNCS 431
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 92/383 (24%), Positives = 159/383 (41%), Gaps = 33/383 (8%)
Query: 120 FPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCK-PCIHCSQQRDPFFDPSK 178
FP + + Y+ + +G P + L +DTGS +TW QC PC C++ +P + P K
Sbjct: 89 FPVRGDVYPNGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKK 148
Query: 179 SKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEV 238
+P + C + + C ++C Y+I Y D S G A+D + +
Sbjct: 149 GNL---VPLKDSLCVEVQRNLKTGYCETC--EQCDYEIEYADHSSSMGVLASDDLHLMLA 203
Query: 239 NGNGYFARYPFLLGCTDNNTG----DQNGASGIMGLDRGPVSIISKTNISYFF-----YC 289
NG+ + + GC + G GI+GL + VS+ S+ +C
Sbjct: 204 NGS--LTKLGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHC 261
Query: 290 LHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF 349
L S GY+ G D V + + P++ + S YH + IS G +L L
Sbjct: 262 LTSDATGGGYMFLGD-DFVPYWGMAWVPMLNS--HSPNYHSQIMKISHGSRQLSLGRQDG 318
Query: 350 TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLS-AYKTVVV 408
D+G+ T FP Y AL ++ + + + G + C+ ++V+
Sbjct: 319 RTERVVFDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVCWRAKFPIRSVID 378
Query: 409 PK-----ITIHF-----LGGVDLELDVRGTLVVESVRQVCLGF--ALLPSDPNSILLGNV 456
K +T+ F + + G L++ + VCLG D ++I+LG++
Sbjct: 379 VKQFFQPLTLQFRSKWWIVSTKFRIPPEGYLIISNKGNVCLGILDGSNVHDGSTIILGDI 438
Query: 457 QQRGYEVHYDVAGRRLGFGPGNC 479
RG V YD +++G+ C
Sbjct: 439 SLRGKLVVYDNVNQKIGWAQSTC 461
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 96/361 (26%), Positives = 147/361 (40%), Gaps = 41/361 (11%)
Query: 138 IGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLE 197
IG P Q +L++DTGS +T+ C C C +DP F P S T+ + CN
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCN--------- 52
Query: 198 WFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNN 257
P+ + +C Y+ Y + S +G D ++ ++ + GC +
Sbjct: 53 ---PDCTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMS---ELKPQRAVFGCENAE 106
Query: 258 TGD--QNGASGIMGLDRGPVSIISK------TNISYFFYCLHSPYGSTGYITFGKPDTVN 309
TGD A GIMGL RG +SI+ + N S F C G + G+ +
Sbjct: 107 TGDLFSQHADGIMGLGRGDLSIVDQLVEKGVINDS-FSLCYGGMEVGGGAMVLGQISPPS 165
Query: 310 KKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGTIITRFPAP 368
+ + P++S +Y+I L G+ V G++L + F K T +DSGT P
Sbjct: 166 DMVFSH----SDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEA 221
Query: 369 VYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSA------YKTVVVPKITIHFLGGVDL 421
+ A + K +G + + D C+ + YKT P + + F G
Sbjct: 222 AFLPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKT--FPSVDMVFDNGEKY 279
Query: 422 ELDVRGTLVVESVRQ--VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L L S CLG DP + LLG + R V YD ++GF NC
Sbjct: 280 SLSPENYLFKHSKVHGAYCLGVFQNGKDPTT-LLGGIVVRNTLVTYDREHSKVGFWKTNC 338
Query: 480 N 480
+
Sbjct: 339 S 339
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 96/361 (26%), Positives = 147/361 (40%), Gaps = 41/361 (11%)
Query: 138 IGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLE 197
IG P Q +L++DTGS +T+ C C C +DP F P S T+ + CN
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCN--------- 52
Query: 198 WFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNN 257
P+ + +C Y+ Y + S +G D ++ ++ + GC +
Sbjct: 53 ---PDCTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMS---ELKPQRAVFGCENAE 106
Query: 258 TGD--QNGASGIMGLDRGPVSIISK------TNISYFFYCLHSPYGSTGYITFGKPDTVN 309
TGD A GIMGL RG +SI+ + N S F C G + G+ +
Sbjct: 107 TGDLFSQHADGIMGLGRGDLSIVDQLVEKGVINDS-FSLCYGGMEVGGGAMVLGQISPPS 165
Query: 310 KKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGTIITRFPAP 368
+ + P++S +Y+I L G+ V G++L + F K T +DSGT P
Sbjct: 166 DMVFSH----SDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEA 221
Query: 369 VYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSA------YKTVVVPKITIHFLGGVDL 421
+ A + K +G + + D C+ + YKT P + + F G
Sbjct: 222 AFLPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKT--FPSVDMVFDNGEKY 279
Query: 422 ELDVRGTLVVESVRQ--VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
L L S CLG DP + LLG + R V YD ++GF NC
Sbjct: 280 SLSPENYLFKHSKVHGAYCLGVFQNGKDPTT-LLGGIVVRNTLVTYDREHSKVGFWKTNC 338
Query: 480 N 480
+
Sbjct: 339 S 339
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 104/390 (26%), Positives = 163/390 (41%), Gaps = 54/390 (13%)
Query: 121 PAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSK 180
P+++G+ Y+ + +G P Q + +DTGS I W C C +C ++ D + S
Sbjct: 68 PSESGL-----YFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSLYS 122
Query: 181 TFSKIPCNSTTCKILLEWFPPNGQDKCSSKE------------CPYDIAYVDGSGETGFW 228
S N TC QD C+S C Y +AY DGS G++
Sbjct: 123 PSSSSTSNRVTCN----------QDFCTSTYDGPIPGCTPELLCEYRVAYGDGSSTAGYF 172
Query: 229 ATDRMTIQEVNGNGYFARY--PFLLGCTDNNTGDQNGAS----GIMGLDRGPVSIISKTN 282
D + + V GN + GC +G S GI+G + S+IS+
Sbjct: 173 VRDHVVLDRVTGNFQTTSTNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLA 232
Query: 283 IS-----YFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISV 337
S F +CL + G G G+ V + V+ TP+V P+Q+ Y++ + I V
Sbjct: 233 SSGKVKRVFAHCLDNINGG-GIFAIGE---VVQPKVRTTPLV--PQQAH-YNVFMKAIEV 285
Query: 338 GGERLPLKASYFT---KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF 394
E L L F + T IDSGT + FP +Y L S R K+ +E+ F
Sbjct: 286 DNEVLNLPTDVFDTDLRKGTIIDSGTTLAYFPDVIYEPLISKIFARQSTLKL-HTVEEQF 344
Query: 395 DTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGF----ALLPSDPNS 450
TC++ P +T HF + L + L + C+G+ A +
Sbjct: 345 -TCFEYDGNVDDGFPTVTFHFEDSLSLTVYPHEYLFDIDSNKWCVGWQNSGAQSRDGKDM 403
Query: 451 ILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
ILLG++ + V YD+ + +G+ NC+
Sbjct: 404 ILLGDLVLQNRLVMYDLENQTIGWTEYNCS 433
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 108/403 (26%), Positives = 168/403 (41%), Gaps = 68/403 (16%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWT------QCKPCIHCSQQRDPFFDPSKSKTFSKI 185
Y ++G P Q + +LLDTGS +TW +C+ C S P F P S + +
Sbjct: 99 YAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLV 158
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKEC----------------PYDIAYVDGSGET-GFW 228
C + +C+ + N KC C PY + Y GSG T G
Sbjct: 159 GCRNPSCQWVHSAA--NLATKCRRAPCSPGAANCPAAASNVCPPYAVVY--GSGSTAGLL 214
Query: 229 ATD--RMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYF 286
D R + V G F+LGC+ + SG+ G RG S+ ++ + F
Sbjct: 215 IADTLRAPGRAVPG--------FVLGCSLVSV--HQPPSGLAGFGRGAPSVPAQLGLPKF 264
Query: 287 FYCLHSPYGSTGYITFGK---PDTVNKKFVKYTPIVTTPEQSE-----FYHITLTGISVG 338
YCL S G T + ++Y P+V + + +Y++ L G++VG
Sbjct: 265 SYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVG 324
Query: 339 GERLPLKASYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRM-KKYKMGKGIED 392
G+ + L A F + T +DSGT T V+ + A + +YK K ED
Sbjct: 325 GKAVRLPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAED 384
Query: 393 --LFDTCYDL-SAYKTVVVPKITIHFLGGVDLELDVRGTLVVE---SVRQVCLGF----- 441
C+ L +++ +P+++ HF GG ++L V VV +V +CL
Sbjct: 385 GLGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFG 444
Query: 442 ----ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
A +I+LG+ QQ+ Y V YD+ RLGF +C
Sbjct: 445 GGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCT 487
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 148/365 (40%), Gaps = 37/365 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCN-ST 190
Y + IG P Q +L++D+GS +T+ C C C +DP F P S T+S + CN
Sbjct: 88 YTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCNVDC 147
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
TC DK +C Y+ Y + S +G D I +
Sbjct: 148 TCD----------SDK---NQCTYERQYAEMSSSSGVLGED---IVSFGTESELKPQRAV 191
Query: 251 LGCTDNNTGD--QNGASGIMGLDRGPVSI----ISKTNI-SYFFYCLHSPYGSTGYITFG 303
GC ++ TGD A GIMGL RG +SI + K I F C G + G
Sbjct: 192 FGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLG 251
Query: 304 KPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGTII 362
++ V +P +Y+I L + V G+ L + F K T +DSGT
Sbjct: 252 AMPAPPGMIYTHSNAVRSP----YYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTY 307
Query: 363 TRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYK----TVVVPKITIHFLG 417
P + A + A ++ K +G + + D C+ + + V PK+ + F
Sbjct: 308 AYLPEQAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKVDMVFGN 367
Query: 418 GVDLELDVRGTLVVESVRQ--VCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFG 475
G L L L S + CLG DP + LLG + R V YD ++GF
Sbjct: 368 GQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTT-LLGGIVVRNTLVTYDRHNEKIGFW 426
Query: 476 PGNCN 480
NC+
Sbjct: 427 KTNCS 431
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 93/330 (28%), Positives = 144/330 (43%), Gaps = 34/330 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
Y+ + IG P + + +DTGS I W C C C ++ + +DP S++ +
Sbjct: 90 YFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVT 149
Query: 187 CNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF- 244
C+ C P+ C+S C Y I+Y DGS GF+ TD + +V+G+G
Sbjct: 150 CDQQFCVANYGGVLPS----CTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTT 205
Query: 245 -ARYPFLLGCTDNNTGDQNGAS----GIMGLDRGPVSIISKTNIS-----YFFYCLHSPY 294
A GC GD ++ GI+G + S++S+ + F +CL +
Sbjct: 206 PANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVN 265
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF---TK 351
G F + V K VK TP+V P+ Y++ L GI VGG L L + F
Sbjct: 266 GGG---IFAIGNVVQPK-VKTTPLV--PDMPH-YNVILKGIDVGGTALGLPTNIFDSGNS 318
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKI 411
T IDSGT + P VY AL + + + + + ++D +C+ S P++
Sbjct: 319 KGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISV-QTLQDF--SCFQYSGSVDDGFPEV 375
Query: 412 TIHFLGGVDLELDVRGTLVVESVRQVCLGF 441
T HF G V L + L C+GF
Sbjct: 376 TFHFEGDVSLIVSPHDYLFQNGKNLYCMGF 405
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 98/382 (25%), Positives = 159/382 (41%), Gaps = 44/382 (11%)
Query: 120 FPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCK-PCIHCSQQRDPFFDPSK 178
P K + +YY + +G P + L +DTGS +TW QC PC +C++ P + P+K
Sbjct: 179 LPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAK 238
Query: 179 SKTFSKIPCNSTTCKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQE 237
K +P + C+ L Q+ C + K+C Y+I Y D S G A D M +
Sbjct: 239 EKI---VPPRDSLCQELQ-----GDQNYCETCKQCDYEIEYADRSSSMGVLAKDDMHL-- 288
Query: 238 VNGNGYFARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVS----IISKTNISYFF-Y 288
+ NG + F+ GC + G GI+GL +S + SK IS F +
Sbjct: 289 IATNGGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGH 348
Query: 289 CLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASY 348
C+ GY+ G D V + + + PI P+ YH ++ G + L S
Sbjct: 349 CITRETNGGGYMFLGD-DYVPRWGMTWAPIRGGPDN--LYHTEAQKVNYGDQELHAGNS- 404
Query: 349 FTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDT----CYDLSAYK 404
+ DSG+ T P +Y L A ++ + ++D DT C+
Sbjct: 405 ---VQVIFDSGSSYTYLPEEMYKNLIDAIKEDSPSF-----VQDSSDTTLPLCWKADFSV 456
Query: 405 TVVVPKITIH-----FLGGVDLELDVRGTLVVESVRQVCLGF--ALLPSDPNSILLGNVQ 457
+ +H F+ + L++ VCLG + ++I++G+V
Sbjct: 457 RSFFKPLNLHFGRRWFVVPKTFTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVS 516
Query: 458 QRGYEVHYDVAGRRLGFGPGNC 479
RG V YD R++G+ C
Sbjct: 517 LRGKLVVYDNERRQIGWANSEC 538
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 150/376 (39%), Gaps = 49/376 (13%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQR----------DPFFDPSKSKT 181
Y + IG P Q +L++D+GS +T+ C C C + DP F P S T
Sbjct: 91 YTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSST 150
Query: 182 FSKIPCN-STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNG 240
+S + CN TC +C Y+ Y + S +G D M+ +
Sbjct: 151 YSPVKCNVDCTC-------------DNERSQCTYERQYAEMSSSSGVLGEDIMSFGK--- 194
Query: 241 NGYFARYPFLLGCTDNNTGD--QNGASGIMGLDRGPVSI----ISKTNISYFFYCLHS-- 292
+ GC + TGD A GIMGL RG +SI + K IS F +
Sbjct: 195 ESELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGM 254
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-TK 351
G + G P + F P+ +S +Y+I L I V G+ L L F +K
Sbjct: 255 DVGGGTMVLGGMPAPPDMVFSHSNPV-----RSPYYNIELKEIHVAGKALRLDPKIFNSK 309
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYK----TV 406
T +DSGT P + A + A ++ K +G + + D C+ + +
Sbjct: 310 HGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSE 369
Query: 407 VVPKITIHFLGGVDLELDVRGTLVVESVRQ--VCLGFALLPSDPNSILLGNVQQRGYEVH 464
V P + + F G L L L S + CLG DP + LLG + R V
Sbjct: 370 VFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTT-LLGGIVVRNTLVT 428
Query: 465 YDVAGRRLGFGPGNCN 480
YD ++GF NC+
Sbjct: 429 YDRHNEKIGFWKTNCS 444
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 107/413 (25%), Positives = 163/413 (39%), Gaps = 46/413 (11%)
Query: 84 SLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQ 143
S +L RD + HL+N L K N + ++ Y + IG P Q
Sbjct: 50 SHRRVLDRDHRLRHLQN---LVKPHSSNAR------MRLHDDLLTNGYYTTRLWIGSPPQ 100
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST-TCKILLEWFPPN 202
+L++DTGS +T+ C C+ C +DP F P S T+ + CN+ C N
Sbjct: 101 EFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCNADCNCD-------EN 153
Query: 203 GQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-- 260
G +C Y+ Y + S +G A D M+ + + GC +GD
Sbjct: 154 G------VQCTYERRYAEMSTSSGVLAEDVMSFGK---ESELVPQRAVFGCETMESGDLY 204
Query: 261 QNGASGIMGLDRGPVSIISK-----TNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKY 315
A GIMGL RG +S++ + + F C G + G + +
Sbjct: 205 TQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVLGGISSPPGMVFSH 264
Query: 316 TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGTIITRFPAPVYSALR 374
+ P +S +Y+I L I V G+ L L F K +DSGT FP Y A +
Sbjct: 265 ----SDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFK 320
Query: 375 SAFRKRMKKYKMGKGIEDLF-DTCYDLSAYKTV----VVPKITIHFLGGVDLELDVRGTL 429
A K++ K G + F D C+ + V P++ + F G + L L
Sbjct: 321 DAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYL 380
Query: 430 V--VESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ CLG +D + LLG + R V Y+ +GF NC+
Sbjct: 381 FRHTKVSGAYCLGIFKNGND-QTTLLGGIIVRNTLVTYNRENSTIGFWKTNCS 432
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 109/413 (26%), Positives = 164/413 (39%), Gaps = 46/413 (11%)
Query: 84 SLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQ 143
S +L RD + HL+N L K N + ++ Y + IG P Q
Sbjct: 50 SHRRVLDRDHRLRHLQN---LVKPHSSNAR------MRLHDDLLTNGYYTTRLWIGSPPQ 100
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNST-TCKILLEWFPPN 202
+L++DTGS +T+ C C+ C +DP F P S T+ + CN+ C N
Sbjct: 101 EFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCNADCNCD-------EN 153
Query: 203 GQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD-- 260
G +C Y+ Y + S +G A D M+ + + GC +GD
Sbjct: 154 G------VQCTYERRYAEMSTSSGVLAEDVMSFGK---ESELVPQRAVFGCETMESGDLY 204
Query: 261 QNGASGIMGLDRGPVSI----ISKTNIS-YFFYCLHSPYGSTGYITFGKPDTVNKKFVKY 315
A GIMGL RG +S+ + K +S F C G + G + +
Sbjct: 205 TQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVLGGISSPPGMVFSH 264
Query: 316 TPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT-KLSTEIDSGTIITRFPAPVYSALR 374
+ P +S +Y+I L I V G+ L L F K +DSGT FP Y A +
Sbjct: 265 ----SDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFK 320
Query: 375 SAFRKRMKKYKMGKGIEDLF-DTCYDLSAYKTV----VVPKITIHFLGGVDLELDVRGTL 429
A K++ K G + F D C+ + V P++ + F G + L L
Sbjct: 321 DAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYL 380
Query: 430 VVES--VRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ CLG +D + LLG + R V Y+ +GF NC+
Sbjct: 381 FRHTKVSGAYCLGIFKNGND-QTTLLGGIIVRNTLVTYNRENSTIGFWKTNCS 432
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 150/376 (39%), Gaps = 49/376 (13%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQR----------DPFFDPSKSKT 181
Y + IG P Q +L++D+GS +T+ C C C + DP F P S T
Sbjct: 92 YTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSST 151
Query: 182 FSKIPCN-STTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNG 240
+S + CN TC +C Y+ Y + S +G D M+ +
Sbjct: 152 YSPVKCNVDCTC-------------DNERSQCTYERQYAEMSSSSGVLGEDIMSFGK--- 195
Query: 241 NGYFARYPFLLGCTDNNTGD--QNGASGIMGLDRGPVSI----ISKTNISYFFYCLHS-- 292
+ GC + TGD A GIMGL RG +SI + K IS F +
Sbjct: 196 ESELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGM 255
Query: 293 PYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-TK 351
G + G P + F P+ +S +Y+I L I V G+ L L F +K
Sbjct: 256 DVGGGTMVLGGMPAPPDMVFSHSNPV-----RSPYYNIELKEIHVAGKALRLDPKIFNSK 310
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYK----TV 406
T +DSGT P + A + A ++ K +G + + D C+ + +
Sbjct: 311 HGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSE 370
Query: 407 VVPKITIHFLGGVDLELDVRGTLVVESVRQ--VCLGFALLPSDPNSILLGNVQQRGYEVH 464
V P + + F G L L L S + CLG DP + LLG + R V
Sbjct: 371 VFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTT-LLGGIVVRNTLVT 429
Query: 465 YDVAGRRLGFGPGNCN 480
YD ++GF NC+
Sbjct: 430 YDRHNEKIGFWKTNCS 445
>gi|194690050|gb|ACF79109.1| unknown [Zea mays]
Length = 166
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/158 (35%), Positives = 92/158 (58%), Gaps = 6/158 (3%)
Query: 324 QSEFYHITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKK 383
Q FY + LTGI+VGG+ + +++ F+ + +DSGT+IT VY+A+R+ F ++ +
Sbjct: 10 QGPFYLVNLTGITVGGQEV--ESTGFSARAI-VDSGTVITSLVPSVYNAVRAEFMSQLAE 66
Query: 384 YKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTL--VVESVRQVCLGF 441
Y G + DTC++++ K V VP +T+ F GG ++E+D G L V QVCL
Sbjct: 67 YPQAPGF-SILDTCFNMTGLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAV 125
Query: 442 ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
A L S+ + ++GN QQ+ V +D + ++GF C
Sbjct: 126 ASLKSEDETSIIGNYQQKNLRVVFDTSASQVGFAQETC 163
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 101/403 (25%), Positives = 162/403 (40%), Gaps = 45/403 (11%)
Query: 103 RLQKAIPDNF-KKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQC- 160
R D F + + FP + Y + + IG+P + L LDTGS +TW QC
Sbjct: 30 RKTAGFSDRFTRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD 89
Query: 161 KPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYV 219
PC+ C + P + PS IPCN CK L N +C + E C Y++ Y
Sbjct: 90 APCVRCLEAPHPLYQPSS----DLIPCNDPLCKAL----HLNSNQRCETPEQCDYEVEYA 141
Query: 220 DGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNN---TGDQNGASGIMGLDRGPVS 276
DG G D ++ G R LGC + + G++GL RG VS
Sbjct: 142 DGGSSLGVLVRDVFSMNYTQGLRLTPR--LALGCGYDQIPGASSHHPLDGVLGLGRGKVS 199
Query: 277 IISKTNISYFF-----YCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHIT 331
I+S+ + + +CL S G G + FG D + V +TP+ + E S+ Y
Sbjct: 200 ILSQLHSQGYVKNVIGHCLSSLGG--GILFFGD-DLYDSSRVSWTPM--SREYSKHYSPA 254
Query: 332 LTG-ISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGI 390
+ G + GG LK L T DSG+ T F + Y A+ ++ + + +
Sbjct: 255 MGGELLFGGRTTGLK-----NLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEAR 309
Query: 391 EDL-----------FDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCL 439
+D F + ++ Y + + E+ L++ VCL
Sbjct: 310 DDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCL 369
Query: 440 GF--ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G N L+G++ + + YD + +G+ P +C+
Sbjct: 370 GILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPVDCD 412
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 99/388 (25%), Positives = 165/388 (42%), Gaps = 46/388 (11%)
Query: 120 FPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCK-PCIHCSQQRDPFFDPSK 178
P K + +YY + +G P + L +DTGS +TW QC PC +C++ P + P+K
Sbjct: 191 LPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAK 250
Query: 179 SKTFSKIPCNSTTCKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQE 237
K +P C+ L Q+ C + K+C Y+I Y D S G A D M I
Sbjct: 251 EKI---VPPKDLLCQEL-----QGNQNYCETCKQCDYEIEYADRSSSMGVLARDDMHI-- 300
Query: 238 VNGNGYFARYPFLLGCTDNNTGDQNGA----SGIMGLDRGPVSIISK-------TNISYF 286
+ NG + F+ GC + G + GI+GL +S+ S+ +N+ F
Sbjct: 301 ITTNGGREKLDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNV--F 358
Query: 287 FYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKA 346
+C+ GY+ G D V + + TPI + P+ +H + G ++L ++
Sbjct: 359 GHCITRDPNGGGYMFLGD-DYVPRWGMTSTPIRSAPD--NLFHTEAQKVYYGDQQLSMRG 415
Query: 347 SYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL-------FDTCYD 399
+ + DSG+ T P +Y L +A + + L F Y
Sbjct: 416 ASGNSVQVIFDSGSSYTYLPDEIYKNLIAAIKYAYPNFVQDSSDRTLPLCLATDFPVRY- 474
Query: 400 LSAYKTVVVPKITIH-----FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPN---SI 451
L K + P + +H F+ + L++ VCLGF L D + ++
Sbjct: 475 LEDVKQLFKP-LNLHFGKRWFVMPRTFTILPDNYLIISDKGNVCLGF-LNGKDIDHGSTV 532
Query: 452 LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++G+ RG V YD R++G+ +C
Sbjct: 533 IVGDNALRGKLVVYDNQQRQIGWTNSDC 560
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 99/388 (25%), Positives = 165/388 (42%), Gaps = 46/388 (11%)
Query: 120 FPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCK-PCIHCSQQRDPFFDPSK 178
P K + +YY + +G P + L +DTGS +TW QC PC +C++ P + P+K
Sbjct: 192 LPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAK 251
Query: 179 SKTFSKIPCNSTTCKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQE 237
K +P C+ L Q+ C + K+C Y+I Y D S G A D M I
Sbjct: 252 EKI---VPPKDLLCQEL-----QGNQNYCETCKQCDYEIEYADRSSSMGVLARDDMHI-- 301
Query: 238 VNGNGYFARYPFLLGCTDNNTGDQNGA----SGIMGLDRGPVSIISK-------TNISYF 286
+ NG + F+ GC + G + GI+GL +S+ S+ +N+ F
Sbjct: 302 ITTNGGREKLDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNV--F 359
Query: 287 FYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKA 346
+C+ GY+ G D V + + TPI + P+ +H + G ++L ++
Sbjct: 360 GHCITRDPNGGGYMFLGD-DYVPRWGMTSTPIRSAPD--NLFHTEAQKVYYGDQQLSMRG 416
Query: 347 SYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL-------FDTCYD 399
+ + DSG+ T P +Y L +A + + L F Y
Sbjct: 417 ASGNSVQVIFDSGSSYTYLPDEIYKNLIAAIKYAYPNFVQDSSDRTLPLCLATDFPVRY- 475
Query: 400 LSAYKTVVVPKITIH-----FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPN---SI 451
L K + P + +H F+ + L++ VCLGF L D + ++
Sbjct: 476 LEDVKQLFKP-LNLHFGKRWFVMPRTFTILPDNYLIISDKGNVCLGF-LNGKDIDHGSTV 533
Query: 452 LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
++G+ RG V YD R++G+ +C
Sbjct: 534 IVGDNALRGKLVVYDNQQRQIGWTNSDC 561
>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 556
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 113/457 (24%), Positives = 193/457 (42%), Gaps = 56/457 (12%)
Query: 40 LIPPTVCNRTRTALPQGPGKVSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLK 99
++ PT+ + T P G + + R C + + G +R++ ++ E+ L L
Sbjct: 139 ILAPTMASSTGCPSPTFDGALEFPLFHRDHSCVQQHLGNTRSSGNIVEM------DLPLP 192
Query: 100 NSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQ 159
+Q +NF F P K +G P + + +DTG+ +++ Q
Sbjct: 193 IDL-IQNGDINNF----LFLMPIK--------------LGTPPVWNLVAVDTGATLSFVQ 233
Query: 160 CKPC-IHCSQQRDP--FFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE--CPY 214
C+PC + C +Q D FDPSKS++FS++ C+ C+ + + C KE C Y
Sbjct: 234 CEPCTLRCHKQTDAGEIFDPSKSESFSRVGCSENKCRTVQRALHLQSK-ACMEKEDSCLY 292
Query: 215 DIAYVDGSG-ETGFWATDRMTIQEVNGNGYFARYP-FLLGCTDNNTGDQNGASGIMGLDR 272
+ + S G DR+ I + GY +P FL GC+ + Q A G++G
Sbjct: 293 SMTFGGTSSYSVGKLVRDRLAIGKY-AKGY--SFPDFLFGCSLDTEYHQYEA-GLVGFAD 348
Query: 273 GPVSIISK----TNISYFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFY 328
P S + N F YC S TGY++ G VN YTP+ +QS Y
Sbjct: 349 EPFSFFEQVAPLVNYKAFSYCFPSDRRKTGYLSIGDYTRVNS---TYTPLFLARQQSR-Y 404
Query: 329 HITLTGISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGK 388
+ L + V G L S +DSG+ T + ++ L +A + M+ +
Sbjct: 405 ALKLDEVLVNGMALVTTPSEMI-----VDSGSRWTILLSDTFTQLDAAITEAMRPLGYNR 459
Query: 389 GIEDLFD-TCYDLSAYKT----VVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFAL 443
D C++ + ++ +P + + F GV + L + + + +C F
Sbjct: 460 NYYRGSDYICFEDAHFQQFSDWAALPVVELKFDMGVKMVLQPQSSFHFNNDYGLCTYFMR 519
Query: 444 LPSDPNSI-LLGNVQQRGYEVHYDVAGRRLGFGPGNC 479
S + + LLGN R + +D+ G + GF G+C
Sbjct: 520 DASLGSGVQLLGNTMTRSVGITFDIQGGQFGFRKGDC 556
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 101/403 (25%), Positives = 162/403 (40%), Gaps = 45/403 (11%)
Query: 103 RLQKAIPDNF-KKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQC- 160
R D F + + FP + Y + + IG+P + L LDTGS +TW QC
Sbjct: 18 RKTAGFSDRFTRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD 77
Query: 161 KPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYV 219
PC+ C + P + PS IPCN CK L N +C + E C Y++ Y
Sbjct: 78 APCVRCLEAPHPLYQPSS----DLIPCNDPLCKAL----HLNSNQRCETPEQCDYEVEYA 129
Query: 220 DGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNN---TGDQNGASGIMGLDRGPVS 276
DG G D ++ G R LGC + + G++GL RG VS
Sbjct: 130 DGGSSLGVLVRDVFSMNYTQGLRLTPR--LALGCGYDQIPGASSHHPLDGVLGLGRGKVS 187
Query: 277 IISKTNISYFF-----YCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHIT 331
I+S+ + + +CL S G G + FG D + V +TP+ + E S+ Y
Sbjct: 188 ILSQLHSQGYVKNVIGHCLSSLGG--GILFFGD-DLYDSSRVSWTPM--SREYSKHYSPA 242
Query: 332 LTG-ISVGGERLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGI 390
+ G + GG LK L T DSG+ T F + Y A+ ++ + + +
Sbjct: 243 MGGELLFGGRTTGLK-----NLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEAR 297
Query: 391 EDL-----------FDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCL 439
+D F + ++ Y + + E+ L++ VCL
Sbjct: 298 DDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCL 357
Query: 440 GF--ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
G N L+G++ + + YD + +G+ P +C+
Sbjct: 358 GILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPVDCD 400
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 91/383 (23%), Positives = 161/383 (42%), Gaps = 44/383 (11%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIH-------------------CSQQRD 171
EY V +G P + DTGS + W +C + +
Sbjct: 81 EYLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPPPEAV 140
Query: 172 PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATD 231
+F+P S ++S++ C+ +C L NG S C + +Y DG+ TG A D
Sbjct: 141 VYFNPFDSSSYSRVGCDGPSCLALATNASCNGD----SHACDFRYSYRDGASATGLLAAD 196
Query: 232 RMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCL- 290
T N + GC G + A G++GL GP+S+ S+ F +CL
Sbjct: 197 TFTFGGNINNDTTSTASIDFGCATGTAGREFQADGMVGLGAGPLSLASQLG-RKFSFCLT 255
Query: 291 -HSPYGSTGYITFGKPDTVNKKFVKYTPIV-TTPEQSEFYHITLTGISVGGERLPLKASY 348
+ ++ + FG V+ TP++ ++ + +Y I++ + V G+ +P S
Sbjct: 256 AYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVAGQPVPGTTSV 315
Query: 349 FTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGI------EDLFDTCYDLSA 402
+ +D+GT++T +AL + + + + G G+ ++ + CYD+S
Sbjct: 316 SKVI---VDTGTVLTFLD---RAALLAPLTESLARVMDGAGLPRAPPPDETLELCYDVSR 369
Query: 403 YKTV--VVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPN---SILLGNVQ 457
K V V+P +T+ GG E+ + G V++ L A++ + P +LGNV
Sbjct: 370 VKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKEGVLCLAVVTTSPELQPLSVLGNVA 429
Query: 458 QRGYEVHYDVAGRRLGFGPGNCN 480
+ V D+ R F NC+
Sbjct: 430 LQDLHVGIDLDARTATFATANCD 452
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 116/423 (27%), Positives = 166/423 (39%), Gaps = 55/423 (13%)
Query: 85 LEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQY 144
L +LR D R + RL A+ P TG+ YY + IG P +
Sbjct: 51 LAALLRHDMGR-----NGRLLGAVD---LPLGGVGLPTATGL-----YYTRIEIGSPPKG 97
Query: 145 VSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTC--KILLE 197
+ +DTGS I W C C + +DP+ S T + C C
Sbjct: 98 YYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSGT--TVGCEQEFCVANSAAS 155
Query: 198 WFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF--ARYPFLLGC 253
PP C S C + I Y DGS TGF+ TD + +V+GNG + GC
Sbjct: 156 GVPP----ACPSAASPCQFRITYGDGSSTTGFYVTDFVQYNQVSGNGQTTPSNVSITFGC 211
Query: 254 TDNNTGDQNGAS----GIMGLDRGPVSIISKTNIS-----YFFYCLHSPYGSTGYITFGK 304
GD +S GI+G + S++S+ + F +CL + G F
Sbjct: 212 GAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCLDTVRGGG---IFAI 268
Query: 305 PDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF---TKLSTEIDSGTI 361
+ V VK TP+V + Y++ L GISVGG L L S F T IDSGT
Sbjct: 269 GNVVQPPIVKTTPLV---PNATHYNVNLQGISVGGATLQLPTSTFDSGDSKGTIIDSGTT 325
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
+ P VY L +A + + + ED C+ S P IT F G + L
Sbjct: 326 LAYLPREVYRTLLTAVFDKHPDLAV-RNYEDFI--CFQFSGSLDEEFPVITFSFEGDLTL 382
Query: 422 ELDVRGTLVVESVRQVCLGF----ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
+ L C+GF + +LLG++ V YD+ + +G+
Sbjct: 383 NVYPHDYLFQNGNDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVVYDLEKQVIGWTDY 442
Query: 478 NCN 480
NC+
Sbjct: 443 NCS 445
>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
Length = 499
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 158/362 (43%), Gaps = 34/362 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCK-----PCIHCSQQRDPFFDPSKSKTFSKIP 186
+Y +V +G P Q + LDTGS + W C+ P + F+ P S T +P
Sbjct: 108 HYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSATFYIPGMSSTSKAVP 167
Query: 187 CNSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVD-GSGETGFWATDRMTIQEVNGNGYF 244
CNS C + Q +CS+ +CPY + YV G+ +GF D + + N +
Sbjct: 168 CNSNFCDL---------QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQI 218
Query: 245 ARYPFLLGCTDNNTG---DQNGASGIMGLDRGPV---SIISKTNISYFFYCLHSPYGSTG 298
+ +LGC TG D +G+ GL V SI+++ ++ + + G
Sbjct: 219 LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIG 278
Query: 299 YITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDS 358
I+FG + ++ + TP+ +Q Y IT++GI++G + P + T D+
Sbjct: 279 RISFGDQGSSDQ---EETPL-NINQQHPTYAITISGITIGNK--PTDLDFITIF----DT 328
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT-VVVPKITIHFLG 417
GT T P Y+ + +F +++ + F+ CYDLS+ + +P I + +
Sbjct: 329 GTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDIILRTVS 388
Query: 418 GVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
G + G ++ + A++ S +I+ N G V +D + LG+
Sbjct: 389 GSLFPVIDPGQVISIQEHEYVYCLAIVKSRKLNIIGQNFMT-GLRVVFDRERKILGWKKF 447
Query: 478 NC 479
NC
Sbjct: 448 NC 449
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 102/385 (26%), Positives = 150/385 (38%), Gaps = 43/385 (11%)
Query: 120 FPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFF 174
P TG+ YY + +G P ++ + +DTGS I W C C C + +
Sbjct: 79 LPTDTGL-----YYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLY 133
Query: 175 DPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVDGSGETGFWATDRM 233
DP S T S + C+ C P KC + C Y + Y DGS G + TD +
Sbjct: 134 DPKASSTGSMVMCDQAFCAATFGGKLP----KCGANVPCEYSVTYGDGSSTIGSFVTDAL 189
Query: 234 TIQEVNGNGYF--ARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS--- 284
+V +G A + GC GD GI+G S++S+ +
Sbjct: 190 QFDQVTRDGQTQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKV 249
Query: 285 --YFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERL 342
F +CL + G G + G D V K VK TP+V Y++ L I VGG L
Sbjct: 250 KKIFAHCLDTIKGG-GIFSIG--DVVQPK-VKTTPLVA---DKPHYNVNLKTIDVGGTTL 302
Query: 343 PLKASYF---TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYD 399
L A F K T IDSGT +T P V+ + A + + ++ C+
Sbjct: 303 QLPAHIFEPGEKKGTIIDSGTTLTYLPELVFKEVMLAVFNKHQDITF-HDVQGFL--CFQ 359
Query: 400 LSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNS----ILLGN 455
P IT HF + L + C+GF S +L+G+
Sbjct: 360 YPGSVDDGFPTITFHFEDDLALHVYPHEYFFANGNDVYCVGFQNGASQSKDGKDIVLMGD 419
Query: 456 VQQRGYEVHYDVAGRRLGFGPGNCN 480
+ V YD+ R +G+ NC+
Sbjct: 420 LVLSNKLVIYDLENRVIGWTDYNCS 444
>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 98/444 (22%), Positives = 179/444 (40%), Gaps = 49/444 (11%)
Query: 54 PQGPGK--VSLEVLGRYGPCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDN 111
P P + L +L R PC+ ++ R +PS + +RL + RL D
Sbjct: 52 PNSPSTSTIRLTILHREHPCAPASKRPVRRSPSALQEYHTRVRRL----ANRLSSCPADE 107
Query: 112 FKKTKAFTFPAKTGIVAAD-------EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCI 164
+G++ A+ Y V +G P + ++L+DT S ++W C+PCI
Sbjct: 108 ---------ATASGLIFANGVPWDYYSYVTQVQLGTPAKTHNVLVDTASSLSWVGCEPCI 158
Query: 165 HCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGE 224
+ P F+P+ S T+ + C S C + ++ C Y +Y D S
Sbjct: 159 NACLI--PTFNPNASSTYKVVGCGSALCNAVPSATMARKSCMAPTEGCSYRQSYHDYSLS 216
Query: 225 TGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNIS 284
G ++D +T F+ GC + G SGI+G+ S+ S+ +
Sbjct: 217 VGVVSSDTLTYG-------LGSQKFIFGCCNLFRGVGGRYSGILGMSVNKFSLFSQMTVG 269
Query: 285 YFF----YCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGE 340
+ + YC P + G++ FG+ D +K +++TP+ Y + ++ + V
Sbjct: 270 HRYRAMSYCFPHPR-NQGFLQFGRYDE-HKSLLRFTPLYIDGNN---YFVHVSNVMVETM 324
Query: 341 RLPLKASYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKK-YKMGKGIEDLFDTCYD 399
L +++S + D+GT T P ++ +L ++ Y++G TC+
Sbjct: 325 SLDVQSSGNQTMRCFFDTGTPYTMLPQSLFVSLSDTVGNLVEGYYRVGASTG---QTCFQ 381
Query: 400 LSA---YKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNV 456
+ +P + I F G + L+ + +E CL F + +D I+LG+
Sbjct: 382 ADGNWIEGDLYMPTVKIEFQNGARITLNSEDLMFMEEPNVFCLAFKM--NDGGDIVLGSR 439
Query: 457 QQRGYEVHYDVAGRRLGFGPGNCN 480
G D+ +G CN
Sbjct: 440 HLMGVHTVVDLEMMTMGLRGQGCN 463
>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 498
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 91/361 (25%), Positives = 157/361 (43%), Gaps = 34/361 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCK-----PCIHCSQQRDPFFDPSKSKTFSKIP 186
+Y +V +G P Q + LDTGS + W C+ P + F+ P S T +P
Sbjct: 109 HYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSATFYIPGMSSTSKAVP 168
Query: 187 CNSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVD-GSGETGFWATDRMTIQEVNGNGYF 244
CNS C + Q +CS+ +CPY + YV G+ +GF D + + N +
Sbjct: 169 CNSNFCDL---------QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQI 219
Query: 245 ARYPFLLGCTDNNTG---DQNGASGIMGLDRGPV---SIISKTNISYFFYCLHSPYGSTG 298
+ +LGC TG D +G+ GL V SI+++ ++ + + G
Sbjct: 220 LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIG 279
Query: 299 YITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDS 358
I+FG ++ ++ + TP+ Q Y IT++GI+VG + P + T D+
Sbjct: 280 RISFGDQESSDQ---EETPLDIN-RQHPTYAITISGITVGNK--PTDMDFITIF----DT 329
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGG 418
GT T P Y+ + +F +++ + F+ CYDLS + +P I + + G
Sbjct: 330 GTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSEAR-FPIPDIILRTVTG 388
Query: 419 VDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGN 478
+ G ++ + A++ S +I+ N G V +D + LG+ N
Sbjct: 389 SMFPVIDPGQVISIQEHEYVYCLAIVKSMKLNIIGQNFMT-GLRVVFDRERKILGWKKFN 447
Query: 479 C 479
C
Sbjct: 448 C 448
>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
gi|219887047|gb|ACL53898.1| unknown [Zea mays]
gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 416
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 158/362 (43%), Gaps = 34/362 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCK-----PCIHCSQQRDPFFDPSKSKTFSKIP 186
+Y +V +G P Q + LDTGS + W C+ P + F+ P S T +P
Sbjct: 7 HYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSATFYIPGMSSTSKAVP 66
Query: 187 CNSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVD-GSGETGFWATDRMTIQEVNGNGYF 244
CNS C + Q +CS+ +CPY + YV G+ +GF D + + N +
Sbjct: 67 CNSNFCDL---------QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQI 117
Query: 245 ARYPFLLGCTDNNTG---DQNGASGIMGLDRGPV---SIISKTNISYFFYCLHSPYGSTG 298
+ +LGC TG D +G+ GL V SI+++ ++ + + G
Sbjct: 118 LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIG 177
Query: 299 YITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDS 358
I+FG ++ ++ + TP+ Q Y IT++GI+VG + P + T D+
Sbjct: 178 RISFGDQESSDQ---EETPLDIN-RQHPTYAITISGITVGNK--PTDMDFITIF----DT 227
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT-VVVPKITIHFLG 417
GT T P Y+ + +F +++ + F+ CYDLS+ + +P I + +
Sbjct: 228 GTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDIILRTVT 287
Query: 418 GVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
G + G ++ + A++ S +I+ N G V +D + LG+
Sbjct: 288 GSMFPVIDPGQVISIQEHEYVYCLAIVKSMKLNIIGQNF-MTGLRVVFDRERKILGWKKF 346
Query: 478 NC 479
NC
Sbjct: 347 NC 348
>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 500
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 158/362 (43%), Gaps = 34/362 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCK-----PCIHCSQQRDPFFDPSKSKTFSKIP 186
+Y +V +G P Q + LDTGS + W C+ P + F+ P S T +P
Sbjct: 109 HYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSATFYIPGMSSTSKAVP 168
Query: 187 CNSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVD-GSGETGFWATDRMTIQEVNGNGYF 244
CNS C + Q +CS+ +CPY + YV G+ +GF D + + N +
Sbjct: 169 CNSNFCDL---------QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQI 219
Query: 245 ARYPFLLGCTDNNTG---DQNGASGIMGLDRGPV---SIISKTNISYFFYCLHSPYGSTG 298
+ +LGC TG D +G+ GL V SI+++ ++ + + G
Sbjct: 220 LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIG 279
Query: 299 YITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEIDS 358
I+FG ++ ++ + TP+ Q Y IT++GI+VG + P + T D+
Sbjct: 280 RISFGDQESSDQ---EETPLDIN-RQHPTYAITISGITVGNK--PTDMDFITIF----DT 329
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT-VVVPKITIHFLG 417
GT T P Y+ + +F +++ + F+ CYDLS+ + +P I + +
Sbjct: 330 GTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDIILRTVT 389
Query: 418 GVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
G + G ++ + A++ S +I+ N G V +D + LG+
Sbjct: 390 GSMFPVIDPGQVISIQEHEYVYCLAIVKSMKLNIIGQNFMT-GLRVVFDRERKILGWKKF 448
Query: 478 NC 479
NC
Sbjct: 449 NC 450
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 169/383 (44%), Gaps = 54/383 (14%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+ + +G P Q VS+++DTGS ++W C + F+ ++S ++ IPC+S+TC
Sbjct: 33 VSLTVGTPPQNVSMVIDTGSELSWLYCNKTTTTTSYPT-TFNQTRSISYRPIPCSSSTCT 91
Query: 194 ILLEWFP-PNGQDKCSSKECPYDIAYVDGSGETGFWATD--RMTIQEVNGNGYFARYPFL 250
F P D S+ C ++Y D S G A+D M ++ G +
Sbjct: 92 NQTRDFSIPASCD--SNSLCHATLSYADASSSEGNLASDTFHMGASDIPG--------MV 141
Query: 251 LGCTD----NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPD 306
GC D +N+ + + +G+MG++RG +S +S+ F YC+ S +G + G+ +
Sbjct: 142 FGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCI-SGTDFSGMLLLGESN 200
Query: 307 TVNKKFVKYTPIVTTPEQSEF-----YHITLTGISVGGERLPLKASYFT-----KLSTEI 356
+ YTP+V + Y + L GI V LP+ S F T +
Sbjct: 201 FTWAVPLNYTPLVQISTPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGAGQTMV 260
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIED-------LFDTCYDLSAYKTVV-- 407
DSGT T P Y+ALRS F + + + +ED D CY + + V+
Sbjct: 261 DSGTQFTFLLGPAYTALRSEFLNQTTGFL--RVLEDPDFVFQGAMDLCYRVPISQRVLPR 318
Query: 408 VPKITIHFLGGVDLELDVRGTLVV-------ESVRQVCLGFA---LLPSDPNSILLGNVQ 457
+P +++ F G D R V +SV CL F LL + + ++G+
Sbjct: 319 LPTVSLVFNGAEMTVADERVLYRVPGEIRGNDSVH--CLSFGNSDLLGVE--AYVIGHHH 374
Query: 458 QRGYEVHYDVAGRRLGFGPGNCN 480
Q+ + +D+ R+G C+
Sbjct: 375 QQNVWMEFDLERSRIGLAQVRCD 397
>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 469
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 106/392 (27%), Positives = 162/392 (41%), Gaps = 56/392 (14%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCS--------QQRDPFFDPSKSKTFS 183
Y + ++ G P Q + + DTGS + C CS P F P S +
Sbjct: 90 YSVSLSFGTPSQTIPFVFDTGSSLVCLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSK 149
Query: 184 KIPCNSTTCKILLEWFPPNGQDK--------CSSKECPYDIAYVDGSGETGFWATDRMTI 235
I C S C+ L + PN Q + C+ PY + Y GS G T+++
Sbjct: 150 IIGCQSPKCQFL---YGPNVQCRGCDPNTRNCTVGCPPYILQYGLGS-TAGVLITEKLDF 205
Query: 236 QEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSP-Y 294
++ F++GC+ +T +GI G RGPVS+ S+ N+ F +CL S +
Sbjct: 206 PDLTVPD------FVVGCSIIST---RQPAGIAGFGRGPVSLPSQMNLKRFSHCLVSRRF 256
Query: 295 GSTGYITFGKPDT-------VNKKFVKYTPIVTTPEQS-----EFYHITLTGISVGGERL 342
T T DT + YTP P S E+Y++ L I VG + +
Sbjct: 257 DDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHV 316
Query: 343 PLKASYFTKLS-----TEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL--FD 395
+ Y + + +DSG+ T PV+ + F +M Y K +E
Sbjct: 317 KIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLG 376
Query: 396 TCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLV-VESVRQVCLGFA----LLPSDPN- 449
C+++S V VP++ F GG LEL + V + VCL + PS
Sbjct: 377 PCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGGTG 436
Query: 450 -SILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+I+LG+ QQ+ Y V YD+ R GF C+
Sbjct: 437 PAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/161 (37%), Positives = 85/161 (52%), Gaps = 17/161 (10%)
Query: 117 AFTFPAKTGIV-AADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFD 175
F+ +G+ + EY+ + +G P +YV ++LDTGS + W QC PC C Q DP FD
Sbjct: 158 GFSSSVTSGLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFD 217
Query: 176 PSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKE-CPYDIAYVDGSGETGFWATDRMT 234
P KS +FS I C S C L C+S++ C Y +AY DGS G ++T+ +T
Sbjct: 218 PKKSGSFSSISCRSPLCLRL-------DSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLT 270
Query: 235 IQEVNGNGYFARYP-FLLGCTDNNTGDQNGASGIMGLDRGP 274
+ R P LGC +N G GA+G++GL R P
Sbjct: 271 FRGT-------RVPKVALGCGHDNEGLFVGAAGLLGLGRQP 304
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 107/386 (27%), Positives = 158/386 (40%), Gaps = 49/386 (12%)
Query: 117 AFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD----- 171
F P TG+ YY + IG P + LDTGS W C C + D
Sbjct: 49 GFNIPYGTGL-----YYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKL 103
Query: 172 PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWAT 230
F+DP S + ++ C+ T C PP C+ + CPY Y DG G T
Sbjct: 104 TFYDPRSSVSSKEVKCDDTICTSR----PP-----CNMTLRCPYITGYADGGLTMGILFT 154
Query: 231 DRMTIQEVNGNGYF--ARYPFLLGCTDNNTGDQNGAS----GIMGLDRGPVSIISKTNIS 284
D + ++ GNG GC +G N ++ GI+G + +S+ +
Sbjct: 155 DLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAA 214
Query: 285 -----YFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYH-ITLTGISVG 338
F +CL S G F + V K VK TPIV + +E YH + L I+V
Sbjct: 215 GKTKKIFSHCLDSTNGGG---IFAIGEVVEPK-VKTTPIV---KNNEVYHLVNLKSINVA 267
Query: 339 GERLPLKASYFTKLSTE---IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFD 395
G L L A+ F T+ IDSG+ + P +YS L A + MG +++
Sbjct: 268 GTTLQLPANIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAVFAKHPDITMGA----MYN 323
Query: 396 -TCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGF--ALLPSDPNSIL 452
C+ PKIT HF + L++ L+ Q C GF A + + I+
Sbjct: 324 FQCFHFLGSVDDKFPKITFHFENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMII 383
Query: 453 LGNVQQRGYEVHYDVAGRRLGFGPGN 478
LG++ V YD+ + +G+ N
Sbjct: 384 LGDMVISNKVVVYDMEKQAIGWTEHN 409
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 109/437 (24%), Positives = 182/437 (41%), Gaps = 52/437 (11%)
Query: 70 PCSKLNQGKSRNTPSLEEILRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAA 129
P L + ++P E LR R L+++R LQ + F+ + +
Sbjct: 28 PLLSLYRALPSSSPVQLETLRA---RDRLRHARILQGVVD--------FSVEGSSDPLLV 76
Query: 130 DEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQ-----RDPFFDPSKSKTFSK 184
Y+ V +G P ++ +DTGS I W C C C + + FFD S S + S
Sbjct: 77 GLYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSL 136
Query: 185 IPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYF 244
+ C+ C + Q S +C Y Y DGSG +G++ ++ M V G
Sbjct: 137 VSCSDPICNSAFQ--TTATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVMGQSMI 194
Query: 245 AR--YPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNI-----SYFFYCLHSP 293
A + GC+ +GD + GI G G +S+IS+ + F +CL
Sbjct: 195 ANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKGE 254
Query: 294 YGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT--- 350
G + G+ V + + Y+P+V P Q Y++ L ISV G+ LP+ S F
Sbjct: 255 GNGGGILVLGE---VLEPGIVYSPLV--PSQPH-YNLYLQSISVNGQTLPIDPSVFATSI 308
Query: 351 KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKY---KMGKGIEDLFDTCYDLSAYKTVV 407
T IDSGT + Y+ SA + + + KG + CY +S +
Sbjct: 309 NRGTIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPTISKG-----NQCYLVSTSVGEI 363
Query: 408 VPKITIHFLGGVDLELDVRGTLV----VESVRQVCLGFALLPSDPNSILLGNVQQRGYEV 463
P ++++F G + L L+ + C+GF + +LG++ +
Sbjct: 364 FPLVSLNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQKV--QEGVTILGDLVMKDKIF 421
Query: 464 HYDVAGRRLGFGPGNCN 480
YD+A +R+G+ +C+
Sbjct: 422 VYDLARQRIGWASYDCS 438
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 107/386 (27%), Positives = 158/386 (40%), Gaps = 49/386 (12%)
Query: 117 AFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD----- 171
F P TG+ YY + IG P + LDTGS W C C + D
Sbjct: 73 GFNIPYGTGL-----YYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKL 127
Query: 172 PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWAT 230
F+DP S + ++ C+ T C PP C+ + CPY Y DG G T
Sbjct: 128 TFYDPRSSVSSKEVKCDDTICTSR----PP-----CNMTLRCPYITGYADGGLTMGILFT 178
Query: 231 DRMTIQEVNGNGYF--ARYPFLLGCTDNNTGDQNGAS----GIMGLDRGPVSIISKTNIS 284
D + ++ GNG GC +G N ++ GI+G + +S+ +
Sbjct: 179 DLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAA 238
Query: 285 -----YFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYH-ITLTGISVG 338
F +CL S G F + V K VK TPIV + +E YH + L I+V
Sbjct: 239 GKTKKIFSHCLDSTNGGG---IFAIGEVVEPK-VKTTPIV---KNNEVYHLVNLKSINVA 291
Query: 339 GERLPLKASYFTKLSTE---IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFD 395
G L L A+ F T+ IDSG+ + P +YS L A + MG +++
Sbjct: 292 GTTLQLPANIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAVFAKHPDITMGA----MYN 347
Query: 396 -TCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGF--ALLPSDPNSIL 452
C+ PKIT HF + L++ L+ Q C GF A + + I+
Sbjct: 348 FQCFHFLGSVDDKFPKITFHFENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMII 407
Query: 453 LGNVQQRGYEVHYDVAGRRLGFGPGN 478
LG++ V YD+ + +G+ N
Sbjct: 408 LGDMVISNKVVVYDMEKQAIGWTEHN 433
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 98/351 (27%), Positives = 173/351 (49%), Gaps = 39/351 (11%)
Query: 136 VAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKIL 195
++IG P V ++LDTGS + W QC+PC C +Q+DP ++ +KS +++++ CN C L
Sbjct: 97 LSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNEPPCVSL 156
Query: 196 LEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWATDRMTI-QEVNGNGYFARYPFLLGC 253
+ +CS S C Y AY DG+ +G + +++ + A+ F G
Sbjct: 157 ------GREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTAQVGFGCGL 210
Query: 254 TDNNTGDQNGASGIMGLDRGPVSIISKTNI-----SYFFYCLH--SPYGSTGYITFGKPD 306
+ N N G++GL G VS++S+ + F YC S + G++ FG
Sbjct: 211 QNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPNAGGFLVFGDAT 270
Query: 307 TVNKKFVKYTPIVTTPEQSEFYHITLTGISVG-GE-RLPLKASYFTKL-----STEIDSG 359
+N TP+V +EFY++ L GI +G GE RL + +S F + IDSG
Sbjct: 271 YLNGDM---TPMVI----AEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIIDSG 323
Query: 360 TIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDT--CYDLSAYKTVVVPKITIHFLG 417
+ ++ FP VY +R+A ++KK G I L + C++ + + + + +L
Sbjct: 324 STLSVFPPEVYEVVRNAVVDKLKK---GYNISPLTSSPDCFEGKIERDLPLFPTLVLYLE 380
Query: 418 GVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
+ L+ R ++ ++ ++ CLGF S ++G + Q+ Y+ Y++
Sbjct: 381 STGI-LNDRWSIFLQRYDELFCLGFT---SGEGLSIIGTLAQQSYKFGYNL 427
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 107/386 (27%), Positives = 158/386 (40%), Gaps = 49/386 (12%)
Query: 117 AFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD----- 171
F P TG+ YY + IG P + LDTGS W C C + D
Sbjct: 49 GFNIPYGTGL-----YYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKL 103
Query: 172 PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDGSGETGFWAT 230
F+DP S + ++ C+ T C PP C+ + CPY Y DG G T
Sbjct: 104 TFYDPRSSVSSKEVKCDDTICTSR----PP-----CNMTLRCPYITGYADGGLTMGILFT 154
Query: 231 DRMTIQEVNGNGYF--ARYPFLLGCTDNNTGDQNGAS----GIMGLDRGPVSIISKTNIS 284
D + ++ GNG GC +G N ++ GI+G + +S+ +
Sbjct: 155 DLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAA 214
Query: 285 -----YFFYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYH-ITLTGISVG 338
F +CL S G F + V K VK TPIV + +E YH + L I+V
Sbjct: 215 GKTKKIFSHCLDSTNGGG---IFAIGEVVEPK-VKTTPIV---KNNEVYHLVNLKSINVA 267
Query: 339 GERLPLKASYFTKLSTE---IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFD 395
G L L A+ F T+ IDSG+ + P +YS L A + MG +++
Sbjct: 268 GTTLQLPANIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAVFAKHPDITMGA----MYN 323
Query: 396 -TCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGF--ALLPSDPNSIL 452
C+ PKIT HF + L++ L+ Q C GF A + + I+
Sbjct: 324 FQCFHFLGSVDDKFPKITFHFENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMII 383
Query: 453 LGNVQQRGYEVHYDVAGRRLGFGPGN 478
LG++ V YD+ + +G+ N
Sbjct: 384 LGDMVISNKVVVYDMEKQAIGWTEHN 409
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 156/376 (41%), Gaps = 42/376 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIP 186
YY + IG P + + +DTGS I W C C C + +DP S + S +
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVS 146
Query: 187 CNSTTCKILL---EWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQEVNGNG 242
C++ C E P C++ K C Y Y DGS G + +D + +++GN
Sbjct: 147 CDNKFCAATYGSGEKLP-----GCTAGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNA 201
Query: 243 Y--FARYPFLLGCTDNNTGD----QNGASGIMGLDRGPVSIISKTNIS-----YFFYCLH 291
A+ + GC GD GI+G + S +S+ + F +CL
Sbjct: 202 QTRHAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLD 261
Query: 292 SPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-- 349
+ G G G+ V + VK TP++ P S Y++ L I V G L L F
Sbjct: 262 TIKGG-GIFAIGE---VVQPKVKSTPLL--PNMSH-YNVNLQSIDVAGNALQLPPHIFET 314
Query: 350 -TKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVV 408
K T IDSGT +T P VY + +A ++ + + I+ C++ S
Sbjct: 315 SEKRGTIIDSGTTLTYLPELVYKDILAAVFQKHQDITF-RTIQGFL--CFEYSESVDDGF 371
Query: 409 PKITIHFLGGVDLELDVRGTLVVESVRQVCLGF---ALLPSDP-NSILLGNVQQRGYEVH 464
PKIT HF + L + CLGF P D + +LLG++ V
Sbjct: 372 PKITFHFEDDLGLNVYPHDYFFQNGDNLYCLGFQNGGFQPKDAKDMVLLGDLVLSNKVVV 431
Query: 465 YDVAGRRLGFGPGNCN 480
YD+ + +G+ NC+
Sbjct: 432 YDLEKQVIGWTDYNCS 447
>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 520
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 92/365 (25%), Positives = 160/365 (43%), Gaps = 38/365 (10%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQ--------RDPFFDPSKSKTFS 183
+Y +V +G P Q + LDTGS + W C+ C C+ + F+ P S T
Sbjct: 109 HYALVTVGTPGQTFMVALDTGSDLFWLPCQ-CDGCTPPATAASGSFQATFYIPGMSSTSK 167
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVD-GSGETGFWATDRMTIQEVNGN 241
+PCNS C + Q +CS+ +CPY + YV G+ +GF D + + N +
Sbjct: 168 AVPCNSNFCDL---------QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAH 218
Query: 242 GYFARYPFLLGCTDNNTG---DQNGASGIMGLDRGPV---SIISKTNISYFFYCLHSPYG 295
+ +LGC TG D +G+ GL V SI+++ ++ + +
Sbjct: 219 PQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRD 278
Query: 296 STGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTE 355
G I+FG ++ ++ + TP+ Q Y IT++GI+VG + P + T
Sbjct: 279 GIGRISFGDQESSDQ---EETPLDIN-RQHPTYAITISGITVGNK--PTDMDFITIF--- 329
Query: 356 IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT-VVVPKITIH 414
D+GT T P Y+ + +F +++ + F+ CYDLS+ + +P I +
Sbjct: 330 -DTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDIILR 388
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
+ G + G ++ + A++ S +I+ N G V +D + LG+
Sbjct: 389 TVTGSMFPVIDPGQVISIQEHEYVYCLAIVKSMKLNIIGQNFMT-GLRVVFDRERKILGW 447
Query: 475 GPGNC 479
NC
Sbjct: 448 KKFNC 452
>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
gi|223942623|gb|ACN25395.1| unknown [Zea mays]
Length = 378
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 156/372 (41%), Gaps = 31/372 (8%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPF--FDPSKSKTFSKIPCN 188
+Y++ +G P Q L+ DTGS +TW +C+ P F S+S++++ + C+
Sbjct: 13 QYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACS 72
Query: 189 STTCKILLEWFPPNGQDKCSS--KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFAR 246
S TC + P CSS C YD Y DGS G TD TI
Sbjct: 73 SDTCTSYV----PFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGS 128
Query: 247 YP---------FLLGCTDNNTGDQ-NGASGIMGLDRGPVSIISKTNISY---FFYCL--- 290
+LGCT G + G++ L +S S+ + F YCL
Sbjct: 129 GGGGRRAKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDH 188
Query: 291 HSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFT 350
+P ++ Y+TFG TP+V S FY + + + V GE L + A +
Sbjct: 189 LAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPADVWD 248
Query: 351 ---KLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV 407
+DSGT +T P Y A+ +A R+ + + D F+ CY+ +A
Sbjct: 249 VGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLA--ALPRVAMDPFEYCYNWTA-GAPE 305
Query: 408 VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDV 467
+PK+ + F G LE + ++ + C+G + P ++GN+ Q+ + +D+
Sbjct: 306 IPKLEVSFAGSARLEPPAKSYVIDAAPGVKCIGVQ-EGAWPGVSVIGNILQQEHLWEFDL 364
Query: 468 AGRRLGFGPGNC 479
R L F C
Sbjct: 365 RDRWLRFKHTRC 376
>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 252
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 72/228 (31%), Positives = 113/228 (49%), Gaps = 22/228 (9%)
Query: 90 RRDQQRLHLKNSR------RLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQ 143
RR Q++L L + R R+++ + + P +GI YIV +G +
Sbjct: 16 RRLQKQLILDDLRVRSMQNRIRRVASTHNVEASQTQIPLSSGINLQTLNYIVT-MGLGSK 74
Query: 144 YVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNG 203
+++++DT S +TW QC+PC+ C Q+ P F PS S ++ + CNS+TC+ L F
Sbjct: 75 NMTVIIDTRSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSL--QFATGN 132
Query: 204 QDKCSSKE---CPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGD 260
C S C Y + Y DGS G + ++ G + F+ GC NN G
Sbjct: 133 TGACGSSNPSTCNYVVNYGDGSYTNGDLGVEALSF------GGVSVSDFVFGCGRNNKGL 186
Query: 261 QNGASGIMGLDRGPVSIISKTNISY---FFYCL-HSPYGSTGYITFGK 304
G SG+MGL R +S++S+TN ++ F YCL + GS+G + G
Sbjct: 187 FGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSSGSLVMGN 234
>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 409
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 80/262 (30%), Positives = 124/262 (47%), Gaps = 21/262 (8%)
Query: 225 TGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNIS 284
+G+ ATD T G A + GC+D + GD GASG++G+ RG +S+IS+
Sbjct: 130 SGYLATDTFTF------GATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFG 183
Query: 285 YFFYCLHSPYG-----STGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGG 339
F Y L +P + I FG K + TP++++ +FY++ LTG+ V G
Sbjct: 184 KFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDG 243
Query: 340 ERL-PLKASYFTKLSTE-----IDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL 393
RL + A F + + S T +T Y +R+A R+ +
Sbjct: 244 NRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAALE 303
Query: 394 FDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQV-CLGFALLPSDPNSIL 452
D CY+ S+ V VPK+T+ F GG D++L +++ + CL +LPS S+
Sbjct: 304 LDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECL--TMLPSQGGSV- 360
Query: 453 LGNVQQRGYEVHYDVAGRRLGF 474
LG + Q G + YDV RL F
Sbjct: 361 LGTLLQTGTNMIYDVDAGRLTF 382
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 108/397 (27%), Positives = 166/397 (41%), Gaps = 68/397 (17%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWT------QCKPCIHCSQQRDPFFDPSKSKTFSKI 185
Y ++G P Q + +LLDTGS +TW +C+ C S P F P S + +
Sbjct: 67 YAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLV 126
Query: 186 PCNSTTCKILLEWFPPNGQDKCSSKEC----------------PYDIAYVDGSGETGFWA 229
C + +C+ + N KC C PY + Y GS G
Sbjct: 127 GCRNPSCQWVHSAA--NLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGS-TAGLLI 183
Query: 230 TD--RMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFF 287
D R + V G F+LGC+ + SG+ G RG S+ ++ + F
Sbjct: 184 ADTLRAPGRAVPG--------FVLGCSLVSV--HQPPSGLAGFGRGAPSVPAQLGLPKFS 233
Query: 288 YCLHSPYGSTGYITFGK---PDTVNKKFVKYTPIVTTPEQSE-----FYHITLTGISVGG 339
YCL S G T + ++Y P+V + + +Y++ L G++VGG
Sbjct: 234 YCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGG 293
Query: 340 E--RLPLKASYFTKL---STEIDSGTIITRFPAPVYSALRSAFRKRM-KKYKMGKGIEDL 393
+ RLP +A T +DSGT T V+ + A + +YK K ED
Sbjct: 294 KAVRLPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDE 353
Query: 394 --FDTCYDL-SAYKTVVVPKITIHFLGGVDLELDVRGTLVVE---SVRQVCL-------- 439
C+ L +++ +P+++ HF GG ++L V VV +V +CL
Sbjct: 354 LGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSG 413
Query: 440 --GFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
G S P +I+LG+ QQ+ Y V YD+ RLGF
Sbjct: 414 GSGAGNEGSGP-AIILGSFQQQNYLVEYDLEKERLGF 449
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 103/435 (23%), Positives = 183/435 (42%), Gaps = 71/435 (16%)
Query: 92 DQQRLHLKNSRRLQKAIPDNF-------KKTKAFTFPAKTGIVAADE---YYIVVAIGKP 141
DQ S+R Q + + F K+ K+ A++ ++ + + + ++IG P
Sbjct: 54 DQNETVEDRSKREQTSSIERFDFLESKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSP 113
Query: 142 KQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPP 201
+++DTGS + W QC PCI+C QQ +FDP KS +F + C FP
Sbjct: 114 PVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCG----------FPG 163
Query: 202 ----NGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA------------ 245
NG + Y + Y+ G G A + + + ++ F
Sbjct: 164 YNYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAISTQISKIK 223
Query: 246 RYPFLLGCTDNN--TGDQNGASGIMGLDRGPVSIISKTNISYFFYC---LHSPYGSTGYI 300
+ GC N T + + +G+ GL P ++ + F YC +++P + ++
Sbjct: 224 KSNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLGNKFSYCIGDINNPLYTHNHL 283
Query: 301 TFGKPDTVNKKFVKYTPIVTTPEQSEF--YHITLTGISVGGERLPLKASYFTKLSTE--- 355
G+ + +TP Q F Y++TL ISVG + L + + F K+S++
Sbjct: 284 VLGQGSYIEGD--------STPLQIHFGHYYVTLQSISVGSKTLKIDPNAF-KISSDGSG 334
Query: 356 ---IDSGTIITRFP----APVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV- 407
IDSG T+ +Y + + +++ + E L C+ + +V
Sbjct: 335 GVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGL---CFKGVVSRDLVG 391
Query: 408 VPKITIHFLGGVDLELDVRGTLVVESVRQVCLGFALLPSDP---NSILLGNVQQRGYEVH 464
P +T HF GG DL L+ + CL A+LPS+ N ++G + Q+ Y V
Sbjct: 392 FPAVTFHFAGGADLVLESGSLFRQHGGDRFCL--AILPSNSELLNLSVIGILAQQNYNVG 449
Query: 465 YDVAGRRLGFGPGNC 479
+D+ ++ F +C
Sbjct: 450 FDLEQMKVFFRRIDC 464
>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1388
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 97/385 (25%), Positives = 163/385 (42%), Gaps = 34/385 (8%)
Query: 119 TFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCK-PCIHCSQQRDPFFDPS 177
FP + Y+ ++ +G P + L +DTGS +TW QC PCI C + + P+
Sbjct: 179 VFPVSGNVYPDGLYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCISCGKGAHVLYKPT 238
Query: 178 KSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQE 237
+S S + C + ++ NG S +C Y+I Y D S G D + +
Sbjct: 239 RSNVVSSV---DALC-LDVQKNQKNGHHDESLLQCDYEIQYADHSSSLGVLVRDELHL-- 292
Query: 238 VNGNGYFARYPFLLGCTDNNTG----DQNGASGIMGLDRGPVS----IISKTNISYFF-Y 288
V NG + + GC + G GIMGL R VS + SK I +
Sbjct: 293 VTTNGSKTKLNVVFGCGYDQAGLLLNTLGKTDGIMGLSRAKVSLPYQLASKGLIKNVVGH 352
Query: 289 CLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASY 348
CL + GY+ G D V + + P+ T ++ Y + GI+ G +L
Sbjct: 353 CLSNDGAGGGYMFLGD-DFVPYWGMNWVPMAYT-LTTDLYQTEILGINYGNRQLRFDGQ- 409
Query: 349 FTKLSTEI-DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCY--------- 398
+K+ + DSG+ T FP Y L ++ + + + C+
Sbjct: 410 -SKVGKMVFDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPICWQANFPIKSV 468
Query: 399 -DLSAY-KTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGF--ALLPSDPNSILLG 454
D+ Y KT+ + + ++ ++ G L++ + VCLG +D +SI+LG
Sbjct: 469 KDVKDYFKTLTLRFGSKWWILSTLFQISPEGYLIISNKGHVCLGILDGSNVNDGSSIILG 528
Query: 455 NVQQRGYEVHYDVAGRRLGFGPGNC 479
++ RGY V YD +++G+ +C
Sbjct: 529 DISLRGYSVVYDNVKQKIGWKRADC 553
>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
Length = 393
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 98/389 (25%), Positives = 161/389 (41%), Gaps = 49/389 (12%)
Query: 117 AFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLLDTGSGITWTQCK-PCIHCSQQRDPFFD 175
+ P + Y + + IG+P + L +DTGS +TW QC PC+ C++ P++
Sbjct: 19 SIVLPLHGNVYPNGYYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYYR 78
Query: 176 PSKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSSK-ECPYDIAYVDGSGETGFWATDRMT 234
P + +PC C+ L NG +C + +C Y++ Y DG G TD
Sbjct: 79 PRN----NLVPCMDPICQSLHS----NGDHRCENPGQCDYEVEYADGGSSFGVLVTDTFN 130
Query: 235 IQEVNGNGYFARYPFL-LGCTDNN--TGDQNGASGIMGLDRGPVSIISKTNI-----SYF 286
+ N P L LGC + G + G++GL +G SI+S+ + +
Sbjct: 131 L---NFTSEKRHSPLLALGCGYDQFPGGSHHPIDGVLGLGKGKSSIVSQLSSLGLVRNVI 187
Query: 287 FYCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKA 346
+CL G G F D + V +TP+ +P+ ++ Y L ++ G K
Sbjct: 188 GHCLS---GHGGGFLFFGDDLYDSSRVAWTPM--SPD-AKHYSPGLAELTFDG-----KT 236
Query: 347 SYFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDL-----------FD 395
+ F L T DSG T + Y L S +K + + + ++D F
Sbjct: 237 TGFKNLLTTFDSGASYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKGRKPFK 296
Query: 396 TCYDLSAYKTVVVPKITIHFLGGVDLELDVRGTLVVESVRQVCLGF----ALLPSDPNSI 451
+ D+ Y T +LE L++ S CLG + +D N
Sbjct: 297 SIRDVKKYFKTFALSFTNERKSKTELEFPPEAYLIISSKGNACLGILNGTEVGLNDLN-- 354
Query: 452 LLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
++G++ + V YD R+G+ PGNCN
Sbjct: 355 VIGDISMQDRVVIYDNEKERIGWAPGNCN 383
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 166/382 (43%), Gaps = 53/382 (13%)
Query: 134 IVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCK 193
+ + +G P Q V+++LDTGS ++W CK Q + F+P S +++ IPC S CK
Sbjct: 72 VSLTVGTPPQSVTMVLDTGSELSWLHCKK----QQNINSVFNPHLSSSYTPIPCMSPICK 127
Query: 194 I-LLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLG 252
++ P D S+ C ++Y D + G A+D I +G+G + + G
Sbjct: 128 TRTRDFLIPVSCD--SNNLCHVTVSYADFTSLEGNLASDTFAI---SGSG---QPGIIFG 179
Query: 253 CTD----NNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGSTGYITFGKPDTV 308
D +N + + +G+MG++RG +S +++ F YC+ S ++G + FG
Sbjct: 180 SMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQMGFPKFSYCI-SGKDASGVLLFGDATFK 238
Query: 309 NKKFVKYTPIVTTPEQSEF-----YHITLTGISVGGERLPLKASYFT-----KLSTEIDS 358
+KYTP+V + Y + L GI VG + L + F T +DS
Sbjct: 239 WLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQTMVDS 298
Query: 359 GTIITRFPAPVYSALRSAFRKRMKKYKM-----GKGIEDLFDTCYDLSAYKTV-VVPKIT 412
GT T VY+ALR+ F + + E D C+ + V VP +T
Sbjct: 299 GTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGVVPAVPAVT 358
Query: 413 IHFLGGVDLELDVRGTLVVESV-----------RQVCLGFALLPSDPNSI---LLGNVQQ 458
+ F G E+ V G ++ V CL F SD I ++G+ Q
Sbjct: 359 MVFEGA---EMSVSGERLLYRVGGDGDVAKGNGDVYCLTFG--NSDLLGIEAYVIGHHHQ 413
Query: 459 RGYEVHYDVAGRRLGFGPGNCN 480
+ + +D+ R+GF C
Sbjct: 414 QNVWMEFDLVNSRVGFADTKCE 435
>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
Length = 829
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 101/368 (27%), Positives = 159/368 (43%), Gaps = 44/368 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP--------FFDPSKSKTFS 183
++ V++G P + LDTGS + W C C C + + +D S T
Sbjct: 102 HFANVSVGTPPLSFLVALDTGSDLFWLPCN-CTKCVRGVESNGEKIAFNIYDLKGSSTSQ 160
Query: 184 KIPCNSTTCKILLEWFPPNGQDKCSSKE--CPYDIAYV-DGSGETGFWATDRMTIQEVNG 240
+ CNS C++ Q +C S + CPY++ Y+ +G+ TGF D + + +
Sbjct: 161 TVLCNSNLCEL---------QRQCPSSDSICPYEVNYLSNGTSTTGFLVEDVLHLITDDD 211
Query: 241 NGYFARYPFLLGCTDNNTG---DQNGASGIMGLDRGPVS---IISKTNISYFFYCLHSPY 294
A GC TG D +G+ GL G S I++K ++ + +
Sbjct: 212 ETKDADTRITFGCGQVQTGAFLDGAAPNGLFGLGMGNESVPSILAKEGLTSNSFSMCFGS 271
Query: 295 GSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLST 354
G ITFG ++ + + P Y+IT+T I VGG L+
Sbjct: 272 DGLGRITFGDNSSLVQGKTPFNLRALHPT----YNITVTQIIVGGNAADLE------FHA 321
Query: 355 EIDSGTIITRFPAPVYSALRSAFRK--RMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKIT 412
DSGT T P Y + ++F ++++Y E F+ CYDLS+ KTV +P I
Sbjct: 322 IFDSGTSFTHLNDPAYKQITNSFNSAIKLQRYSSSSSDELPFEYCYDLSSNKTVELP-IN 380
Query: 413 IHFLGGVD-LELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRR 471
+ GG + L D T+ E V +CLG +L S+ N ++G GY + +D
Sbjct: 381 LTMKGGDNYLVTDPIVTISGEGVNLLCLG--VLKSN-NVNIIGQNFMTGYRIVFDRENMI 437
Query: 472 LGFGPGNC 479
LG+ NC
Sbjct: 438 LGWRESNC 445
>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 107/380 (28%), Positives = 164/380 (43%), Gaps = 57/380 (15%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCK---PCI----------HCSQQRDPF--FDP 176
+Y V IG P Q+ + LDTGS + W C C+ H + QR ++P
Sbjct: 111 HYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGETHMNAQRIRLNIYNP 170
Query: 177 SKSKTFSKIPCNSTTCKILLEWFPPNGQDKCSS--KECPYDIAYVD-GSGETGFWATDRM 233
S S + SK+ CNST C + +++C S +CPY I Y+ GS TG D +
Sbjct: 171 SISTSSSKVTCNSTLCAL---------RNRCISPLSDCPYRIRYLSPGSKSTGVLVEDVI 221
Query: 234 TIQEVNGNGYFARYPFLLGCTDNNTG--DQNGASGIMGLDRGPVSI---ISKTNI-SYFF 287
+ G AR F GC++ G + +GIMGL +++ + K + S F
Sbjct: 222 HMSTEEGEARDARITF--GCSETQLGLFQEVAVNGIMGLAMADIAVPNMLVKAGVASDSF 279
Query: 288 YCLHSPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKAS 347
P G G I+FG + ++ TP+ T FY +++T VG K +
Sbjct: 280 SMCFGPNGK-GTISFGDKGSSDQH---ETPLGGTISP-LFYDVSITKFKVG------KVT 328
Query: 348 YFTKLSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL-SAYKTV 406
TK S DSGT +T P Y+AL + F + ++ ++ F+ CY + S
Sbjct: 329 VETKFSAIFDSGTAVTWLLDPYYTALTTNFHLSVPDRRLPANVDSTFEFCYIITSTSDEE 388
Query: 407 VVPKITIHFLGGVDLELDVRGTLVV-----ESVRQVCLGFALLPSDPNSI-LLGNVQQRG 460
+P I+ GG DV ++V S + CL A+L D ++G
Sbjct: 389 KLPSISFEMKGGA--AYDVFSPILVFDTSDGSFQVYCL--AVLKQDKADFNIIGQNFMTN 444
Query: 461 YEVHYDVAGRRLGFGPGNCN 480
Y + +D LG+ NCN
Sbjct: 445 YRIVHDRERMILGWKKSNCN 464
>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
Length = 530
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 90/363 (24%), Positives = 158/363 (43%), Gaps = 36/363 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCS------QQRDPFFDPSKSKTFSKI 185
+Y +V +G P Q + LDTGS + W C+ C C+ F+ PS S T +
Sbjct: 116 HYALVTVGTPGQTFMVALDTGSDLFWLPCQ-CDGCTPPASAASGSASFYIPSMSSTSQAV 174
Query: 186 PCNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDG-SGETGFWATDRMTIQEVNGNGY 243
PCNS C++ + +CS + +CPY + YV + +GF D + + +
Sbjct: 175 PCNSQFCEL---------RKECSTTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIPQ 225
Query: 244 FARYPFLLGCTDNNTG---DQNGASGIMGLDRGPV---SIISKTNISYFFYCLHSPYGST 297
+ L GC TG D +G+ GL + SI+++ ++ + +
Sbjct: 226 ILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRDGI 285
Query: 298 GYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEID 357
G I+FG + ++ + TP+ P Q Y I+++ I+VG L + ST D
Sbjct: 286 GRISFGDQGSSDQ---EETPLDVNP-QHPTYTISISEITVGNSLTDL------EFSTIFD 335
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT-VVVPKITIHFL 416
+GT T P Y+ + +F ++ + F+ CYDLS+ + + P I++ +
Sbjct: 336 TGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSISLRTV 395
Query: 417 GGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
GG + G ++ + A++ S +I+ N G V +D + LG+
Sbjct: 396 GGSVFPVIDEGQVISIQQHEYVYCLAIVKSAKLNIIGQNFMT-GLRVVFDRERKILGWKK 454
Query: 477 GNC 479
NC
Sbjct: 455 FNC 457
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 99/364 (27%), Positives = 143/364 (39%), Gaps = 50/364 (13%)
Query: 138 IGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLE 197
IG P Q S +D + WTQC CIHC +Q P F P+ S TF PC + CK +
Sbjct: 60 IGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSI-- 117
Query: 198 WFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNN 257
KC+S C YD G G ATD I G A GC +
Sbjct: 118 -----PTPKCASDVCAYDGVTGLGGHTVGIVATDTFAI------GTAAPASLGFGCVVAS 166
Query: 258 TGD-QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY--GSTGYITFGKPDTVNKKFVK 314
D G SG +GL R P S++++ ++ F YCL +P+ G + G +
Sbjct: 167 DIDTMGGPSGFIGLGRTPWSLVAQMKLTRFSYCL-APHDTGKNSRLFLGASAKLAGGG-A 224
Query: 315 YTPIV-TTPE--QSEFYHITLTGISVG--------GERLPLKASYFTKLSTEIDSGTIIT 363
+TP V T+P S++Y I L I G G L + ++S +DS
Sbjct: 225 WTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRNTVLVQTAVVRVSLLVDS----- 279
Query: 364 RFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLEL 423
VY + A + + F+ C+ + P + F G L +
Sbjct: 280 -----VYQEFKKAVMASVGAAPTATPVGAPFEVCFPKAGVSG--APDLVFTFQAGAALTV 332
Query: 424 -------DVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
DV V SV + L + D +I LG+ QQ + +D+ L F P
Sbjct: 333 PPANYLFDVGNDTVCLSVMSIAL-LNITALDGLNI-LGSFQQENVHLLFDLDKDMLSFEP 390
Query: 477 GNCN 480
+C+
Sbjct: 391 ADCS 394
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 96/359 (26%), Positives = 150/359 (41%), Gaps = 26/359 (7%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQC-KPCIHCSQQRDPFFDPSKSKTFSKIPCNST 190
Y + + IG P Q VS ++D G + WTQC + C C +Q P FD + S TF PC +
Sbjct: 51 YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAA 110
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
C E P C Y+ + G G TD + I G AR F
Sbjct: 111 VC----ESIPTRSCAGDGGGACGYEASTSFGR-TVGRIGTDAVAI----GTAATARLAF- 160
Query: 251 LGCTDNNTGDQN-GASGIMGLDRGPVSIISKTNISYFFYCLHSP-YGSTGYITFGKPDTV 308
GC + D G+SG +GL R +S+ ++ N + F YCL P G + + G +
Sbjct: 161 -GCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYCLAPPDTGKSSALFLGASAKL 219
Query: 309 --NKKFVKYTPIV--TTPEQSEF---YHITLTGISVGGERLPLKASYFTKLSTEIDSGTI 361
K TP V +TP S Y + L I G + + S T + + + T
Sbjct: 220 AGAGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAGNATIAMPQSGNTIM---VSTATP 276
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDL 421
+T VY LR A + + +++ +D C+ A + P + + F GG ++
Sbjct: 277 VTALVDSVYRDLRKAVADAVGAAPVPPPVQN-YDLCFP-KASASGGAPDLVLAFQGGAEM 334
Query: 422 ELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
+ V L C+ P+ +LG++QQ + +D+ L F P +C+
Sbjct: 335 TVPVSSYLFDAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADCS 393
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 101/371 (27%), Positives = 168/371 (45%), Gaps = 40/371 (10%)
Query: 131 EYYIVVAIGKPKQYVSLLLDTGSGITWTQC-KPCIHCSQQRDPFFDPSKSKTFSKIPCNS 189
YY+ + IG P + L +DTGS +TW QC PC C++ P + P+K+K +PC +
Sbjct: 56 HYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKNKL---VPCAN 112
Query: 190 TTCKILLEWFPPNGQDKCSS-KECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYP 248
+ C L PN KC++ ++C Y I Y D + G TD ++ N + R
Sbjct: 113 SICTALHSGSSPN--KKCTTQQQCDYQIKYTDKASSLGVLVTDSFSLPLRNKSN--VRPS 168
Query: 249 FLLGCTDNNTGDQNGAS-----GIMGLDRGPVSIISK-----TNISYFFYCLHSPYGSTG 298
GC + +NGA+ G++GL RG VS++S+ + +CL + G G
Sbjct: 169 LSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTSGG--G 226
Query: 299 YITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEI-- 356
++ FG D V V + P+V + + + S G L + E+
Sbjct: 227 FLFFGD-DMVPTSRVTWVPMVRSTSGNYY--------SPGSATLYFDRRSLSTKPMEVVF 277
Query: 357 DSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYD-LSAYKTVVVPK---IT 412
DSG+ T F A Y A SA + + K + + + C+ A+K+V K +
Sbjct: 278 DSGSTYTYFSAQPYQATISAIKGSLSK-SLKQVSDPSLPLCWKGQKAFKSVSDVKKDFKS 336
Query: 413 IHFLGGVD--LELDVRGTLVVESVRQVCLGFALLPSDPNSI-LLGNVQQRGYEVHYDVAG 469
+ F+ G + +E+ L+V VCLG + S ++G++ + V YD
Sbjct: 337 LQFIFGKNAVMEIPPENYLIVTKNGNVCLGILDGSAAKLSFSIIGDITMQDQMVIYDNEK 396
Query: 470 RRLGFGPGNCN 480
+LG+ G+C+
Sbjct: 397 AQLGWIRGSCS 407
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 94/341 (27%), Positives = 141/341 (41%), Gaps = 40/341 (11%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCN-ST 190
Y + IG P Q +L++D+GS +T+ C C C +DP F P S ++S + CN
Sbjct: 89 YTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCNVDC 148
Query: 191 TCKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFL 250
TC DK K+C Y+ Y + S +G D I +
Sbjct: 149 TCD----------SDK---KQCTYERQYAEMSSSSGVLGED---IVSFGRESELKAQRAV 192
Query: 251 LGCTDNNTGD--QNGASGIMGLDRGPVSIISK------TNISYFFYCLHSPYGSTGYITF 302
GC ++ TGD A GIMGL RG +SI+ + N S+ G +
Sbjct: 193 FGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGAMVLG 252
Query: 303 GKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYF-TKLSTEIDSGTI 361
G P + F + P+ +S +Y+I L I V G+ L + + F +K T +DSGT
Sbjct: 253 GVPTPSDMVFSRSDPL-----RSPYYNIELKEIHVAGKALRVDSRIFDSKHGTVLDSGTT 307
Query: 362 ITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLF-DTCYDLSAYKTV-----VVPKITIHF 415
P + A + A ++ K +G + + D C+ A + V V P + + F
Sbjct: 308 YAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSYKDICF-AGARRNVSKLHEVFPDVDMVF 366
Query: 416 LGGVDLELDVRGTLVVESVRQ--VCLGFALLPSDPNSILLG 454
G L L L S CLG DP ++L G
Sbjct: 367 GNGQKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGG 407
>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
Length = 530
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 90/363 (24%), Positives = 158/363 (43%), Gaps = 36/363 (9%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCS------QQRDPFFDPSKSKTFSKI 185
+Y +V +G P Q + LDTGS + W C+ C C+ F+ PS S T +
Sbjct: 116 HYALVTVGTPGQTFMVALDTGSDLFWLPCQ-CDGCTPPASAASGSASFYIPSMSSTSQAV 174
Query: 186 PCNSTTCKILLEWFPPNGQDKCS-SKECPYDIAYVDG-SGETGFWATDRMTIQEVNGNGY 243
PCNS C++ + +CS + +CPY + YV + +GF D + + +
Sbjct: 175 PCNSQFCEL---------RKECSTTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIPQ 225
Query: 244 FARYPFLLGCTDNNTG---DQNGASGIMGLDRGPV---SIISKTNISYFFYCLHSPYGST 297
+ L GC TG D +G+ GL + SI+++ ++ + +
Sbjct: 226 ILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRDGI 285
Query: 298 GYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKLSTEID 357
G I+FG + ++ + TP+ P Q Y I+++ I+VG L + ST D
Sbjct: 286 GRISFGDQGSSDQ---EETPLDVNP-QHPTYTISISEITVGNSLTDL------EFSTIFD 335
Query: 358 SGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKT-VVVPKITIHFL 416
+GT T P Y+ + +F ++ + F+ CYDLS+ + + P I++ +
Sbjct: 336 TGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSISLRTV 395
Query: 417 GGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
GG + G ++ + A++ S +I+ N G V +D + LG+
Sbjct: 396 GGSVFPVIDEGQVISIQQHEYVYCLAIVKSAKLNIIGQNFMT-GLRVVFDRERKILGWKK 454
Query: 477 GNC 479
NC
Sbjct: 455 FNC 457
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 98/364 (26%), Positives = 144/364 (39%), Gaps = 50/364 (13%)
Query: 138 IGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLE 197
IG P Q S +D + WTQC CIHC +Q P F P+ S TF PC + CK +
Sbjct: 30 IGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSI-- 87
Query: 198 WFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNN 257
KC+S C +D G G ATD I G A GC +
Sbjct: 88 -----PTPKCASDVCAFDGVTGLGGHTVGIVATDTFAI------GTAAPASLGFGCVVAS 136
Query: 258 TGD-QNGASGIMGLDRGPVSIISKTNISYFFYCLHSPY--GSTGYITFGKPDTVNKKFVK 314
D G SG +GL R P S++++ ++ F YCL +P+ G + G +
Sbjct: 137 DIDTMGGPSGFIGLGRTPWSLVAQMKLTRFSYCL-APHDTGKNSRLFLGASAKLAGGG-A 194
Query: 315 YTPIV-TTPE--QSEFYHITLTGISVG--------GERLPLKASYFTKLSTEIDSGTIIT 363
+TP V T+P S++Y I L I G G L + ++S +DS
Sbjct: 195 WTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRNTVLVQTAVVRVSLLVDS----- 249
Query: 364 RFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLEL 423
VY + A + + + F+ C+ + P + F G L +
Sbjct: 250 -----VYQEFKKAVMASVGAAPTATPVGEPFEVCFPKAGVSG--APDLVFTFQAGAALTV 302
Query: 424 -------DVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGP 476
DV V SV + L + D +I LG+ QQ + +D+ L F P
Sbjct: 303 PPANYLFDVGNDTVCLSVMSIAL-LNITALDGLNI-LGSFQQENVHLLFDLDKDMLSFEP 360
Query: 477 GNCN 480
+C+
Sbjct: 361 ADCS 364
>gi|300681439|emb|CBH32531.1| hypothetical protein TAA_ctg0091b.00060.1 [Triticum aestivum]
Length = 426
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 108/412 (26%), Positives = 172/412 (41%), Gaps = 60/412 (14%)
Query: 94 QRLHLKNSRRLQKAIPDNFKKTKAFTFPAKT--GIVAADEYYIVV---AIGKPKQYVSLL 148
Q L K ++ KA+ + + F P K G A D +VV ++G ++ S +
Sbjct: 38 QELWRKPAKSAPKAVIN-----RPFRAPDKDRLGSAATDNAGLVVYKISVGVAEEVFSGV 92
Query: 149 LDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQDKC- 207
+D + W QC S F+++ C S TC++ L+ +D C
Sbjct: 93 VDVATDFIWAQCP----------------VSSDFTEVFCFSQTCQLALDE-----EDACG 131
Query: 208 --SSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTDNNTGDQNGAS 265
+S CPY Y G TG+ ++ +EV G L GC+ +T +G S
Sbjct: 132 NSTSFTCPYAYQYGPGISTTGY-----ISAEEVTAVGTHITGRALFGCSLASTVPLDGES 186
Query: 266 GIMGLDRGPVSIISKTNISYFFYCL----HSPYGSTGYITFGKPDTVNKKFVKYTPIVTT 321
G++G RGP S++S+ IS F Y + S + G + TP++
Sbjct: 187 GVLGFSRGPYSLLSQLKISRFSYFMLPDDADKPDSESVLLLGDDAVPQTNSSRSTPLLRN 246
Query: 322 PEQSEFYHITLTGISVGGERLP-LKASYFTKLSTEIDSGTI------ITRFPAPVYSALR 374
+ Y++ LTGI V + L + A F + G + IT Y+AL
Sbjct: 247 EAYPDLYYVKLTGIKVDDKSLSGIPAGTFDLAANGCSGGVVMSTLSPITYLQPAAYNALT 306
Query: 375 SAFRKRMKKYKMGKGIEDLFD--TCYDLSAYKTVVVPKITIHFLGGVD-----LELDVRG 427
A ++K + +D+ D CY++ + + PKIT+ F GVD +EL
Sbjct: 307 RALASKIKSQPVRPKADDVADLRLCYNIQSVANLTFPKITLVF-HGVDGRPAPMELTTAH 365
Query: 428 TLVVE-SVRQVCLGFALLPS-DPNSILLGNVQQRGYEVHYDVAGRRLGFGPG 477
+ E S CL P+ P S +LG++ Q G + YD+ G L F G
Sbjct: 366 YFIRENSTGLQCLTMLPTPAGSPVSSVLGSLLQTGTHMIYDLRGGSLTFEKG 417
>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
Length = 458
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 107/427 (25%), Positives = 179/427 (41%), Gaps = 64/427 (14%)
Query: 89 LRRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLL 148
+ R H K S +Q ++ FP G + I ++ G P Q +S L
Sbjct: 60 MSRSHHLKHGKASPLIQTSL-----------FPHSYG-----AHTIPLSFGTPPQKLSFL 103
Query: 149 LDTGSGITWTQCK---PCIHCS---QQRDPFFDPSKSKTFSKIPCNSTTC------KILL 196
+DTGS + W C C +CS ++ P F+P S + + C C + L
Sbjct: 104 MDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGCRDPKCADTSSPBVHL 163
Query: 197 EWFPPNGQDKCSSKECP-YDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLLGCTD 255
NG K S CP Y + Y G+ +GF+ + + + G + FL+GCT
Sbjct: 164 GXPRCNGNSKKCSHACPQYTLQYGTGAA-SGFFLLENL---DFPGK---TIHKFLVGCT- 215
Query: 256 NNTGDQNGAS-GIMGLDRGPVSIISKTNISYFFYCLHS-PYGST---GYITFGKPDTVNK 310
+ D+ +S + G R S+ + + F YCL+S Y T G + D +
Sbjct: 216 -TSADREPSSDALAGFGRTMFSLPMQMGVKKFAYCLNSHDYDDTRNSGKLILDYSDGETQ 274
Query: 311 KFVKYTPIVTT-PEQSEFYHITLTGISVGGERLPLKASYFTKLSTE-----IDSGTIITR 364
+ Y P P+ +Y++ + + +G + L + Y T S IDSG +
Sbjct: 275 G-LSYAPFXKNPPDYPIYYYLGVKDMKIGNKVLRIPGKYLTPGSDSRGGVVIDSGFAYSY 333
Query: 365 FPAPVYSALRSAFRKRMKKYKMGKGIEDL--FDTCYDLSAYKTVVVPKITIHFLGGVDLE 422
PV+ + + +K+M KY+ +E CY+ + +K++ +P + F GG ++
Sbjct: 334 MTLPVFKIVTNELKKQMSKYRRSLELEAQTGVTPCYNFTGHKSIKIPDLIYQFTGGANMV 393
Query: 423 LDVRGTLVVESVRQVCLG-FALLPSDPN---------SILLGNVQQRGYEVHYDVAGRRL 472
+ ++ S + LG F + P SI+LGN QQ + V +D+ RL
Sbjct: 394 VPGMNYFLLFS--EASLGCFPVTTDSPTSNLEFTPGPSIILGNYQQVDHYVEFDLKNERL 451
Query: 473 GFGPGNC 479
GF C
Sbjct: 452 GFRQQTC 458
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 86/366 (23%), Positives = 154/366 (42%), Gaps = 52/366 (14%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDPFFDPSKSKTFSKIPCNSTT 191
+ + V I +P++ L++DTGS + WTQCK +S+T
Sbjct: 43 HSLTVGIVQPRK---LIVDTGSDLIWTQCK-------------------------LSSST 74
Query: 192 CKILLEWFPPNGQDKCSSKECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFARYPFLL 251
PP + ++ + + G A++ T G
Sbjct: 75 AAAARHGSPPLSR-TAPARTGAFTRTCTASAAAVGVLASETFTF----GARRAVSLRLGF 129
Query: 252 GCTDNNTGDQNGASGIMGLDRGPVSIISKTNISYFFYCLHSPYGS--TGYITFGKPDTVN 309
GC + G GA+GI+GL +S+I++ I F YCL +P+ T + FG ++
Sbjct: 130 GCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYCL-TPFADKKTSPLLFGAMADLS 188
Query: 310 K----KFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTKL-----STEIDSGT 360
+ + ++ T IV+ P ++ +Y++ L GIS+G +RL + A+ T +DSG+
Sbjct: 189 RHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGS 248
Query: 361 IITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDL------SAYKTVVVPKITIH 414
+ + A++ A ++ + +ED ++ C+ L +A + V VP + +H
Sbjct: 249 TVAYLVEAAFEAVKEAVMDVVRLPVANRTVED-YELCFVLPRRTAAAAMEAVQVPPLVLH 307
Query: 415 FLGGVDLELDVRGTLVVESVRQVCLGFALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGF 474
F GG + L +CL ++GNVQQ+ V +DV + F
Sbjct: 308 FDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSF 367
Query: 475 GPGNCN 480
P C+
Sbjct: 368 APTQCD 373
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 107/416 (25%), Positives = 171/416 (41%), Gaps = 43/416 (10%)
Query: 90 RRDQQRLHLKNSRRLQKAIPDNFKKTKAFTFPAKTGIVAADEYYIVVAIGKPKQYVSLLL 149
+R + ++RR + + P +TG+ Y+ + +G P + + +
Sbjct: 33 KRSLNAVKAHDARRRGRILSAVDLNLGGNGLPTETGL-----YFTKLGLGSPPKDYYVQV 87
Query: 150 DTGSGITWTQCKPCIHCSQQRD-----PFFDPSKSKTFSKIPCNSTTCKILLEWFPPNGQ 204
DTGS I W C C C ++ D +DP S+T I C+ C + P
Sbjct: 88 DTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPKGSETSELISCDQEFCSATYDGPIPG-- 145
Query: 205 DKCSSK-ECPYDIAYVDGSGETGFWATDRMTIQEVNGNGYFA--RYPFLLGCTDNNTGDQ 261
C S+ CPY I Y DGS TG++ D +T VN N A + GC +G
Sbjct: 146 --CKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVNDNLRTAPQNSSIIFGCGAVQSGTL 203
Query: 262 NGAS-----GIMGLDRGPVSIISKTNIS-----YFFYCLHSPYGSTGYITFGKPDTVNKK 311
+ +S GI+G + S++S+ S F +CL + G G G+ V +
Sbjct: 204 SSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCLDNIRGG-GIFAIGE---VVEP 259
Query: 312 FVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTK---LSTEIDSGTIITRFPAP 368
V TP+V P + Y++ L I V + L L + F T IDSGT + PA
Sbjct: 260 KVSTTPLV--PRMAH-YNVVLKSIEVDTDILQLPSDIFDSGNGKGTIIDSGTTLAYLPAI 316
Query: 369 VYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVVVPKITIHFLGGVDLELDVRGT 428
VY L R + K+ +E F +C+ + P + +HF + L +
Sbjct: 317 VYDELIPKVMARQPRLKL-YLVEQQF-SCFQYTGNVDRGFPVVKLHFEDSLSLTVYPHDY 374
Query: 429 LVVESVRQVCLGF----ALLPSDPNSILLGNVQQRGYEVHYDVAGRRLGFGPGNCN 480
L C+G+ A + + LLG++ V YD+ +G+ NC+
Sbjct: 375 LFQFKDGIWCIGWQKSVAQTKNGKDMTLLGDLVLSNKLVIYDLENMAIGWTDYNCS 430
>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 531
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 103/376 (27%), Positives = 158/376 (42%), Gaps = 49/376 (13%)
Query: 132 YYIVVAIGKPKQYVSLLLDTGSGITWTQCKPCIHCSQQRDP----------FFDPSKSKT 181
YY V++G P + LDTGS + W C C + + + P+ S T
Sbjct: 102 YYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNASTT 161
Query: 182 FSKIPCNSTTCKILLEWFPPNGQDKCSSKE--CPYDIAYVDGSGETGFWATDRMTIQEVN 239
S I C+ C G KCSS + CPY I+Y + +G TG D + + +
Sbjct: 162 SSSIRCSDKRCF---------GSKKCSSPKSICPYQISYSNSTGTTGTLLQDVLHLATED 212
Query: 240 GNGYFARYPFLLGCTDNNTG---DQNGASGIMGLDRGPVSI---ISKTNISY--FFYCLH 291
N + LGC TG N +G++GL S+ ++K NI+ F C
Sbjct: 213 ENLTPVKTNVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITADSFSMCFG 272
Query: 292 SPYGSTGYITFGKPDTVNKKFVKYTPIVTTPEQSEFYHITLTGISVGGERLPLKASYFTK 351
G+ G I+FG +++ TP ++ S Y + +TG+SVGG+ P+ F K
Sbjct: 273 RVIGNVGRISFGDKGYTDQE---ETPFISV-APSTAYGLNVTGVSVGGD--PVGTRLFAK 326
Query: 352 LSTEIDSGTIITRFPAPVYSALRSAFRKRMKKYKMGKGIEDLFDTCYDLSAYKTVV-VPK 410
D+G+ T P Y L +F ++ + E F+ CYDLS T + P
Sbjct: 327 F----DTGSSFTHLMEPAYGVLTKSFDDLVEDKRRPVDPELPFEFCYDLSPNATSIEFPF 382
Query: 411 ITIHFLGGVDLELDVRGTLVVESVRQ------VCLGFALLPSDPNSI-LLGNVQQRGYEV 463
+ + F+GG + L+ R CLG +L S I ++G GY +
Sbjct: 383 VEMTFVGGSKIILNNPFFTARTQARHGEGNVMYCLG--VLKSVGLKINVIGQNFVAGYRI 440
Query: 464 HYDVAGRRLGFGPGNC 479
+D LG+ P C
Sbjct: 441 VFDRERMILGWKPSLC 456
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.138 0.425
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,117,251,839
Number of Sequences: 23463169
Number of extensions: 361274069
Number of successful extensions: 677310
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1103
Number of HSP's successfully gapped in prelim test: 1872
Number of HSP's that attempted gapping in prelim test: 669135
Number of HSP's gapped (non-prelim): 3641
length of query: 480
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 334
effective length of database: 8,933,572,693
effective search space: 2983813279462
effective search space used: 2983813279462
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)